TitleGenColors Logo

Gene list

Applied filters:

Organism: Mannheimia succiniciproducens MBEL55E, MBEL55E
Gene type: CDS

Number of genes found: 2384

Free access
Sort by:

 



# Mannheimia succiniciproducens MBEL55E, MBEL55E

>MS2121 unknown
MVAKRRIIGFYLYLHKIYKKPTALCKINAVRRFFRYANVKISLHFLLKVA
>MS1285 unknown
MVKSKSALNSQFFDRTFIEIVYFMAGILLIFYV
>MS0576 unknown
MRLFRTLVKFEIKSAVGFERIFVIAQSVRFYIWGFGFVFGMYK
>MS0539 unknown
MKTLKQAKFLKQEGNRIDIQCERDYVLHLFVLEQDIIRVAFTRKNSFKLD
RTWAISPNCEDVPFAGRERFSTEGFSLPTYQLNFEHDVIEIVTEKLKVRV
HQPLTLEWQYNKDGNWLPLIQERKTGAYLFGINNNKISHFIERSLDENCY
GLGEKAGDLNRKGRRFEMRNLDAMGYNAEKTDPLYKHVPFYITRKNDVSY
GIYYDNLAQCWFDLGNELDNYHIAYKSYRAEDGDMDYYVILGPSTLEVTK
KYTALTGGTIFGPRWGLGYSGSTMSYTDAEDAQEQLKKFVDLCKEHDIPC
DSFQLSSGYTSINGKRYVFNWNYDKIPEPLKMSGYFRDAGMQLAANIKPC
MLQDHPRYKEAQELGLFIKDSESELPERSVFWDDEGSHLDFTNPATVQWW
KDNIKEQLLERGIGSTWNDNNEFEIWDDNAKCVGFGKETPIKLIRPLHPL
LMMKASYEAQKEFAPHQRPYLISRSGCAGMNRYVQTWSGDNRTNWTTLRY
NIRMGLGMSLSGLYNVGHDVGGFSGDKPEPELFVRWVQNGIMHPRFTIHS
WNDDKTVNEPWMYPAVTNIIRDTIKLRYKLMPYIYNAFWQSHQDLEPMLR
PTFLDHEHDMKTYEETDDYLFGKDLLVASVVEKGQRQREVYLPQNNAGWY
DFHAHTYYEAGQTVTVGAPLERLPLFVKAGAILPLSERTAYSCAKQDTSR
ELLVFPFIREGEATSTIFDDDGETYRYQHNGYLQLTLKLSCNKDSVNLNI
QKQGTWTPAYNALKITLPETETRPLTVNGKVFKSGTELSLGDIKES
>MS1771 unknown
MKKYHYFIKLKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKA
RFSQGIEQAGLALMAENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQ
KYKRNQELVSGLVRHYYYPSTYFKDNLKISDKYDYQCNNLQGPNGEQLKS
IACEISGKFERPSWLYLGKNNGLSFAETTTINANKIYIQKNLDEIIPIDL
MLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQNNTEEGKVI
SQYNRIGFTSFAFGAQQQNNTAQCYLPYEIKPSITIRNNNYYGGYYNTTM
QYSELLSYVGSNQQRYSYATLAQYFDAFVDYDKTIESINSFDGKDLSSLM
YFSKNSWCLGSANTRINSTYIWAGKNESADLVSRFNRVPALGATLSSSGL
LIGANLLMNTNPDENAQPSKLGANTQRIILVLSDGEDQINNASSSLNITS
TLINQGMCEKIKSKLNSLQDKTYLEQPTRIGFVAFGYGPSGTQKAAWEKC
VGKYYYVANNKEELLESFRKIIGLVEEVGHSTYKEPTYYSN
>MS1596 unknown
MASRSNKVFLGEIRLKVKNLPLKQSNKSTPTADKTNLYQTEV
>MS0129 unknown
MKTSIIFNLPADVINELHKRLRESNYSGFIELENWLKNLGFNVSKSGIHR
YAQKLKSLDGFIGRSGSFDLAVQLNNSIDDNTPLNLLYQELGKLEYQKQQ
ILQKISAMEAENQI
>MS0110 unknown
MTDFFKHNLRNKNMTDKQQDKTTKKKVERKFKGVERFTLSYDASDSEYAE
HKINAHNLVKVVNEMITLIERSDKLLNGKQKTVEIFLQAPESGVIVKKGS
LQIPFAVELYEYICTVKDIVTTIETKDIFTALGLGIPSASVGYGVFKDIF
RTKGEPVIDVKTQDGSNEVELLTENTKIKTTKETAILMQDDEIRRAIKDL
TVAPLANKVDAVFKIKREETTEEQGEITTEETVAVEIESGKEIETLTKLS
ERIAQEPEVELKPEQLITITQINFSSGESGWKMRLDGKERAVVLQDVAFM
ASINADQASFRKGDWLKVNLKRVKTFGTQTKTTYIITEVLEHLVGKDRKL
IEKQDE
>MS0684 unknown
MFIKWEIAMNESKQTNEQTQYNEINFRRRSVLRTLAGSVLLSVTGSTLAK
QCEITSPEILSPRYPDPLIEVSDPSFNKYRLYSSSVERLATGFRWAEGPV
WFGDGQYLLFSDIPNNRIMRYDNITGQTAVFRENANYSNGLARDKQGRLL
ACEHLTRRLTRTEYDGSVTVLADSFEGKPLNSPNDIAVQSNGAIWFTDPT
FGINGYYEGEKAKAEQPTAVYRIDPQTEKLERVLNDLLMPNGIAFSPDEK
HLYIVGRFSETPALREIFRYDVSTDGKLNNRTHFFDGGENGTLDGIAVDE
DGNIWAGWGSINNSKNGLSGDMDGVIVINPQGKQIAHIHLPERCANLCFG
GSKRNRVFMASGHSLYALYVETRGT
>MS1842 unknown
MSMSKYLAQNFQIFNRTFMFKKIVQWLLSS
>MS0618 unknown
MTIAIIIATHGVAAEQLLKTTEMLIGEQENVATIDFVPGENAETIMGKYQ
EKLATTLSHCDQVLFLVDTWGGSPFNAANRVAEGNENMDIVTGVNVPMLV
ETFMARDDNPSLQELVAIALETGRTGVRALRYEEPEEAPVEQAQPVPAAA
PTAQPNVVTNKEGHLEIGLARIDDRLIHGQVATRWTKESRVTRIVVVNDD
VAKDSVRSTMLKSVAPPGVTAHVVNVDKMIRVYNNPEYAGERMMLLFTNP
TDVVKLMQAGVEFKSINIGGMAYKDGKQMITSAVAVDSQDIDAFKILDAK
GIELDVRKVSNDARQYMMDLLKKNNLI
>MS1165 unknown
MVGTPQSSGINPGIPMYSNDSDSSTTKATLTEGRITLNKDSNPTQVTAQS
LGINTQLEGANRQVAAPKDIHRELKDQQILSRAAGDVAGAVSSYVDSRKE
ALKEEQQAALEEAKKAQARGDVTTAEAKLAEANALENEANR
>MS1399 unknown
MNSEQTQLVEHIISLEKQALDKWFKGDTSGYRELWSKQNFSYFDIVHPER
IDSYDNISAFLDSIEGKLFADSYEFKMPRVQLSQDMAILTYQIFAKTNLI
DMRYNCIEVFQKEGDEWKVVHSTWSAIRPMDWDFSTMKAAI
>MS1458 unknown
MERVDNYEQRFGGIGRLYTPQGLERLRQAHVCVIGIGGVGSWCVEALARS
GVGKLTLIDMDDICVTNINRQIHALTGNIGKLKTEVMKERVELINPECKV
EIIDDFISPENLAEYLHSDYDYVIDAIDSVKTKAALIAYCKRNKIKVIMV
GGAGGQTDPTQIQIADLSKTVQDPLASKVRSLLRKNYHFSQNPKRKFGVD
CVFSTQPLIFPQMSEGCGISASMNCENGFGAATMITATFGFFAVSRVVDK
LLTKQ
>MS0095 unknown
MAKVTATVKGSPLLHNGKRYDIGATIELDEAQAENLGIYLDIVKPVGDGN
KQTGNKQTDGAKKTQQKDAGKGDESKVE
>MS1159 unknown
MVVPLSKLTKGNVAYIDSIVANQAFGELDTLVGRRLADLGFSKGVPVEVV
AAGVFGKGPLAVRLSNLSQFSLRAAEASKILCHINK
>MS2120 unknown
MQTKPFGKHPEGQRLARIEQSVHYKAGKFVNHLPTEVQTSDKPLWKIWYD
FLFQQIDHLTPNRPLPVVKTDLQQLSREKNFIVWFGHSSYLIQLDGKRFL
VDPVLVSGSPLSFANKMFQGTNLYQPQDMPDFDYLVITHDHWDHLDYEAV
IQLKNKMKEKVITSLGVGAHLEYWGYPAERIIEMDWNEKTELENHFKITA
LPARHFSGRGVVRNKTLWSSFMLEVPGETIYLGGDSGYDPIYQEIGQRFN
ISLALMENGQYNKDWANIHIQPEQLTLAVKALRPKRLMTVHNAKFALARH
DWRAPLEQIYRNAQKENFNLFTPKIGDVFYFSEQGEADSPNFREPWWQSV
E
>MS1284 unknown
MRWENRHIYRVGGLYIKYKENASHKINNFDKSAVKKLRI
>MS1045 unknown
MLLFVYLGAKMKKLALAALILGSSLALTACDQAKEQASQTTETVTETAKD
VKDNAVEKAGEVKDNAVEKANEVKDAAAEKMDAAVDATKEKVAAAKEAVA
DKAEEVKNAVSDKAAEMKEAVSDKANEVKDAAEK
>MS1068 unknown
MADWILILSVLILRRFRKIYAWQDNKNIGDIMSTSHYVSPKGSMDQLSHM
EIDLLTKRAQSDLYKLFRNSSLAVLNSGAINDDSRALLNKYPNFEISIIC
KERGVTLKLDNSPESAFVDDKIIRNIQYNLFAVLRDILFVNALMQRFGLD
AERGNSFITNQVFSILRNAKALSLNEDPNLVVCWGGHSINQTEYAYCRAV
GLELGLRELNIVTGCGPGVMEAPMKGAAIGHANQRYKQSRFIGITEPSII
ASEPPNPIVNELIIMPDIEKRLEAFVRMGHGIVIFPGGPGTFEEFMFILG
IKLNPENRAQKLPLILTGPKESADYFATIDRFVLDTLGEEAQSLYTIIID
DAVAVARHMKAEMVEIRDFRCKISDSFSFNWSLKIEHQFQQPFLPTHENM
ANLNLHLNQSTVDLAANLRCVFSGIVAGNIKPATQDQIAEKGKFQLYGEP
RLMEKVDNLLQDFIVQHRMKLPTDEAYEPCYEICK
>MS0788 unknown
MPYIRRQTEVKVRSFFTKFLFNGALGSVYKR
>MS0253 unknown
MIYSMTAFARHEIKKDWGDAVWEIRSVNQRYLENFFRLPEQFRGLENNLR
EKLRQNLTRGKIECSLRIDSKKQTSAELNLNKDLAEQVIQSLKWIKQQAG
EGEINLNDVLRYPGVVEAPEQDLDAISQDLLNAFDELLKDFIAMRAREGE
KLHTVIRQRLDAISVEADKVRAQMPEVLQWQRDRILQRFEEIQLQPDPSR
LEQEMVLLAQRIDVAEELDRLQMHVKETASILKKGGAVGRKLDFMMQELN
RESNTLASKSINADITASAVELKVLIEQMREQIQNLE
>MS0990 unknown
MIWMSAIFLRQSNIESVRNIIDFIVRCKSILGKDEGENASWAFLNNFNIL
SEDEKEKIKMNLSEDVINFLRLSLEHHYLLFDDYPLAFLFKDYKCGMDRS
NAINLLKEDVSALFDRYSEHSTKVQTTAFYSMAITGKIVLNASMNIPDFN
SIFSDPESDEAKIVAAFVRSSLNVGNDIISSSNGKNDWSKSFWKQCFDME
ECS
>MS1644 unknown
MSYDANDALNEIEEALSELERVAEDLINNNPNKESELRGQGVHQATKHLR
FRIRNIRRGEAI
>MS1439 unknown
MSLQVNSVAIMLVVLILLGVLSNNNSITISALILLLMHQTFLGKYIPFLE
KNGLKVGIIILTVGVLAPLVSGKVQLPAFKEFLNWQMFLSIVIGIAVAWF
AGRGVNLMSSEPIVVTGLLIGTVLGVAFLGGIPVGPLIAAGILAVILGK
>MS2099 unknown
MLKNSQKFGKEFCGICDSSSVYSKIFAKTYRLPIQCKNAVCNSISNGISK
RLNRI
>MS0995 unknown
MNSQVKNMNRKLENIKFVITDVDGVLTDGQLHYDANGEAIKSFHVRDGLG
VKMLMESGIPVAVLSGRDSAILRKRIADLGIKLAFLGKLEKESACYELMK
EVGVTPEETAYIGDDSVDLPAFNVCGVAFAVADAPDYVKDCADYVLDLRG
GKGAFREMSDMILKAQGKTDVYSSAKGFLKIVTNMAQ
>MS0929 unknown
MPNGKSSAIYIAFQNTRTKFRGIVLGAVYLL
>MS1303 unknown
MRRWLKNVFFLSGKEFRSLFSDPILVILIIYMFTAALYTVATTISTEVKN
GSVAVINNDHSTLSYRLQSSLIPPYFRKVHEITANQADRLMDMGEYTFVI
DIPPNYEVDILAGRNPQIHLSIDATAMTQAAIGSNYISQIFSREINDFLR
LKNNKTFTPIKTAVNVLYNPNYTSKWFMGAMQIVGNLNLLTMLLVGAAII
RERERGTIEHLLVMPVTSSEIAIAKIIANGSVLLVVVGLSLRFVTGGLLG
VPLPAQAIPLFILGSLIFIFAIASLGIMLAIFAPTMPQFGLLCIPVYVVM
YLLSGTTSPIENMPELAQWITQLSPTTIFGSYAQDVIFRGASLDLVWDKL
VKMAAIGFVFLAVALGQFKTMLSRQG
>MS0512 unknown
MIIGPFINAGAIVFGGLIGAALGGRVPERLRTNLTMLFGLCSMCMGIVMI
AKVAQMPAMILALLLGTILGELILLEQGINKLASKTKTIVEKILPNNQKK
GVSHEEFLQKFVGIVILFSFSGTGIFGSMNEGLTGDSSILIVKAFLDFFT
AIIFGTTLGSTIATAAIPQTVLQIALAYSAVLIIPLITPEMRADFAAAGG
MLMVATGFRICGILHFQVANMLPALFIIMPISAIWLQMMG
>MS0877 unknown
MKDERNFYEKNNYCDFACHSNRTFNQCLGG
>MS1440 unknown
MEIAFLLAGKIIELTIIVLLGYALVKSKLLKSQDSYPLSIIGLYLISPSV
MINAFQIDYSPQILNGLLLSLTMAVFLHIILIITGVILKRLLNLDPIEHA
ASIYSNSGNLIIPLVVSMFGQQWVIYATCFIVVQTFLFWTHCRSIICGKG
SISILKMFKNINILSIFLGVFLFAFQIKLPPLISGTLSSLGQFIGPNAML
IAGMLIASIPLRNIITSKRIYLVTFLRLILIPIFLLIIIKLCGFDNWVEN
GETIAMISFLATMSPAAATVTQMALIYGKNANKASAIYGVTTMLCVFSMP
LIIALYQLI
>MS2191 unknown
MMLKNIFAGLTVLLLSACTLVTYQPVDTISHVNAKQGYRMRNAIQQPDGN
LIILMFSGGGSRAASLGYGVLEEFKNAAVRPTAKGTTLIDNVDLVYGVSG
GSVLATYYSLYGRDAVPKFEENFLKKNFQREIISQVFSLSNLPRITSPQF
GRGDLLQEQLDQTLYKGATFGDLERKRKGPFVVVSATDMNLGQKITFTQE
FFDGLCIDLSKMEISRAVAASSSVPLLFSPLTLNNNGGNCHFDIPELIQI
SQNISNDAQKSKNLEELKNTLSLYQNSKERPFIHLVDGGLTDNLGLSGLI
DIYDVAGQEGMYREAVKNQLKNIIVINVNAQNEVSSEIDKTANVPGTRDV
INTIINVPIDRNSQVSLRRFREFTDEWNKSMANKPPKQRINMHFVNLSLK
DLPESQLKKEVLNISTSFYLLHSDVNKLKRSAKILLQQSKEYQDVLRALQ
>MS0568 unknown
MNLRVFLLMMKKCIRFIFLLLLMFAAAGFWGYNYIQKLVNEPVNIKAEQL
LTLERGTTGKKLFALLEKENIIADNILFPLLLKLQPQFNNVKAGTYSLEG
VKTLGDLLTLLNSGKEAQFALRFTDGETWKQVKKSLENAPHLKHELKDKT
DVEVFHQFKEMLPEFEVQNAYKTLDGWIYPDTYNYTPNSTDVALVKRSVE
RMVKTLEKAWAERDEDLPLNNPYEMLILASIVEKESGISAERGKIASVFV
NRLKAKMKLQTDPTVIYGMGESYQGNIRKKDLESPTPYNTYVIDGLPPTP
IANPSEDALNAVAHPERTDFLYFVADGSGGHKFSRSLIEHNKAVQEYLLW
LRRNKNK
>MS0002 unknown
MCGLQEELQKRLGIEEKIVHYYVLIRLSMN
>MS0093 unknown
MSATQPILNDIAQYLKENLPEWDVELFPNNPGTYSLSHINGAVLISYLAS
KFEKPRTTEAVLQTRHVQVALTVLTRDLHDDEGALNLLDKLRLLMVGFRP
VNCTECWLVDEFFNGTDEETGIWQYQLILQTETQQVQQIQTQDLPKFVTA
HLRRADQSVRPD
>MS2313 unknown
MNKNKRNISQSIGFSDKFTKTVVYIFLQIFY
>MS1363 unknown
MKTDFLSSLIFSVGVTLPTILLLILGMLIRKKKMIDDRFCEQSTKVVFNI
TLPVLLFFSVYGKHVDYISQMAVLSVGIIGTISLFLLAELFAARFIAEKR
ERGTFVQAIYRGNSGILGLAFCISAFGDSAAVPASIYSAAVIFLYNILAV
ITLTRSLSTGSVSVVSIMKGVIKNPLIIAILFALIANSISLQLPAPLLST
GNYLANMTLPLALICTGATIDLSVFSNKTSNVVLMGSLGRLVVTPVFMIL
IGKVFGLDGMLLGVVALMNTTPVASAAYAMVRAMGGNSVTVANIIGITTV
GSMITSSLMLLILSQAGWI
>MS1976 unknown
MLSYRHSFHAGNHADVVKHIVEMLIIENLTQKEKGFYYLDTHSGVGRYRL
FSQESEKTAEFEEGIARLWQRDDLPEEVQRYVDLIKKLNYGGKELRYYAG
SPLIAAQMLRPQDRGLLVELHPTDFPLLRNNFKEFKNISVKRDDGFQQVK
ATLPPKERRGLVLMDPPYEMKEDYDLVVNTIVEGYKRFATGVYAVWYPVV
LRQQSKRIVKGLEASGIRKILQIELAVRPDSDQRGMTASGMIVINPPWQL
EAQMKKILPYLTNVLVPEGTGSWSVNWIAPE
>MS2096 unknown
MTIKSVEISKAYRLVQLGSTTMLSAKHDGDADVMAAAWVGLGGPNKIIAY
IGTQAYTRKLVEQNGYFVVHIPTVQQMETVLYVGEHSKHTMPNKLDNLPL
FYQEGVDIPMVEGSAGYLLCQVIPNPQQEQNYDSFMGEIVAAWADDRVFD
GRHWTFDTAPDELRTVHYVAGGQFYAMGKGTKFDHGPGQD
>MS1465 unknown
MITKQMSEVLQQNCGKNYRTLYLFFSVKYG
>MS2015 unknown
MQYEHIHEKFRHLVTADNQERIAFLDEPRWLGYGVAKDIMDNLVSLMNKP
KRPRMLNLLIVGDSNNGKTTLIRRFFDLYGQAYIDSDSNAIYPILLAEAP
PSANEKELYISLLERFYVPYKPTDTIAKLRYQTIHLFREFRVKMLIIDEF
HSLLVGTPRLQRQVMNAIKMLCNELQIPIVGVGTRDAIRVLHTDPQHASR
FDVAELPTWKLDKDFQKLLFQFQGILPLKKCSNLHSPELATKIHTISGGN
LGNVHRLLTVCAVEAITSGTEQITLDIIEKNSWVQPTQGFRKIIG
>MS1834 unknown
MNLVDFLVDKLDALKATEIECIDVRGKSSVTDNMIICTGSSSRHVASVAQ
KLIDESKQAGFESFGEEGKAVADWIVVDFGQAIVHIMQGDARQMYQLEKL
WA
>MS0289 unknown
MSDIAITISILSLAAVLGLWIGQWKIKGVGLGIGGVLFGGIIVSHFSEQN
GLQLDAHTLHFVQEFGLILFVYTIGIQVGPGFFASLRKSGLRLNALATLI
VALGSLIVVIINKAFDVPLDIILGIYSGGVTNTPSLGAGQQILTELGMQN
ITQSMGMAYAVAYPFGICGILASMWLVRLIFRVKVDDEAKKFTQESGQQT
ESLQKINIRVANPNLDGLCLRDIPGFDERGVVCTRLKREENISVPKADTT
IFLNDVLHLVGDSHSLQRMCLIVGEKIELEPSKLVGNIPFRTGCGYQ
>MS1328 unknown
MQKFNQIHSLFEHLPANYGEFSDFEQKIATLAQEMKVDLSLYEIDHLSIR
VNTEDKAKSWLTTLLNYGKILSNNLVNGRAIYLIELEQPLLFMGQRVFII
ELPFPKNKHYPVESWEHIEFVIPFLPNESSIEWVERVQQQFLWNQSGNLT
IKVDEPKVDGEQLPNPSIAVSFADKSQNHVCIKVHPFNIKNIIKVS
>MS0748 unknown
MNHIYKVVWSKTTNSLVVVSELASSQGKAASVVSKGYKLSSVFKKSFQLT
ALSALLISVMPAAQAAIAVGASTVTNWNGAVSVSLNGASATGASVPYNYH
TPNNENYPDQGNNSNSSNIYSGTLSAAQSIAIGINATSQSGSIALGDNSR
ATGGLSLALGAFSQTNQAGAIALGTSALASGFNSFATMRQAAATADFAIA
MGTAANANATNSIAMGSSALALGNQSIAIGSAAMEKKVGSAGGESYRTDY
VGTTNTKAQGDRTIAFGVNTSTTSNDSIAIGSNSKTNSGTGAIAIGWCSS
TSYQDSVAIGSNATANGGYSLALGYNATSTNLTSISIGWNAAASNTGGGH
SQGAVAIGPKTTALGNQSVVLGASASAVEQATAIGNDSKANGFGSIVIGG
DDTGYSRNPNSDPYTPTALGGERIGYLANTATGDNSNYRSSLSSGIGSVV
VGVHGQALSNGSTAIGVYSTAGDNGITFTNDTTSTTAIEATAIGALSRAK
SIRSSAIGYSAEALGNYSTVVGANSTANGTSSLALGHNSTAYSTYSLAAG
YNASANLSNSTAIGSSSNASGLNAIALGTGAQALNTNTISIGTGNIVSGE
NSGAIGDPNNITGSNSYALGNNNVIYANNSFAVGNSIYISDTAQNTLAFG
TNISVPTSTKTNNTLIGTSAKIQGGESSIAFGTNATVSNSVQSSAIAIGN
QSKVEAAVGGIAIGNGSTISSSANNGSIALGQKTNVTGVSSIALGNNASV
TGTQQGSVAIGNNTNVTNTGQGTVAVGSDTNVTVGNAVAIGDHVNVKGQR
SIAIGSSSNVAEGVVNATTIGTGSNVTQNDGTAVGYNAIVSNYNGLALGA
NATSTAQRAVALGADSVAGREGWDQAAYDPYIPANANTSQSAAITATKAT
NNYGAVSVGSDTVKRQIINVAAGSADSDAVNVAQLKAAIGSVNTSWNIQE
NGTQKDIVNAGDNVSFANGTGTTANVSVDSTGKTSTVKYSVNKSGLSVAT
DGTVTAAANGDNFATAEQVAKAINDSEKTTTVEKGSDKVSVTGTTTGTKT
NYVVDLSNAAKSSLDKADSALQSWTAQVNGANAKVVNQTNNTVNFVNGTN
TIVKADANGNISVSTADNVTFNTVNASSFNAGNISIGTNGINAGNTTITN
VANGINASDAVNVSQLNATNANVTNNTQNITKNAADIQSTKDGLNATNAT
VAGNTANITNNTNAIANNTAAINKGINFGNGTTDNNFALGDTINVTSDSN
IVVNTEDDGVKLSLADNVTVGNVTVNNTFKAGDVTINSTGIDAGNHAITN
VANGTQDSDAVNLSQLNATNANVTNNTQNITNNTAAIANNTANISNNTNA
IANNTQNITKNAADIQSTKDGLNATNATVAGNTANITNNTNAIANNTAAI
NKGINFGNGTTDNNFALGSTINVTSDSNIEVSTVADGVKLALASSIAVDN
VTVNDTFKAGDVTINSTGIDAGNHTITNVVKGVNATDAVNLSQLNAGKSS
VEAGDNVAVTSTSDANGTVYTVNANISTVSNGSDKVTVTSSSTGNHTTNY
AVDLSEAAKASLEKADSALQSLTTSADGTKAQTLDKDNSNANFISGSNIR
LTPSADGITIATAENVTFTNVNTTNFKAGDVTINSTGIDAGNHTITNVAN
GTQDSDAVNLSQLNATNANVTNNTQNITNNTAAIANNTANISNNTNAIAN
NTQNITKNAADIQSTKDGLNATNATVAGNTANITNNTNAIASNTATINKG
INFGNGTTANNFALGSTINVTSDSNIEVSTVADGVKLALASSIAVDNLTA
NNSVKVGNVALTQAGINAGNHAITNVTNGTNATDAVNLSQLNAGKSSVEA
GDNVAVTSTSGANGTVYTVNANTSTVSNGSDKITVTQTDAGNHTSNYAVD
LSEAAKSSLNKADSALQSWTAQVNGADAKVVNQTNNTVNFVNGTNTIVKA
DANGNISVSTADNVTFNTVNASTFNAGNVSISNSGINAGNTTITNVANGT
NASDAVNLSQLNATNANVTNNTNNIANNTKNITNVTNLVNQGFNIGADNG
TDDNVKLGEKVDFNGDGNIVTTVTNNAIAFALSNTLNLTDAGSVTMGDTV
VNGSGMIINNGSTNNQTVSLTKDGLNNGGNTITNVANGSNATDAVNLSQL
NAGKSSVEAGDNVAVTSTSDANGTVYTVNANTSTVSNGSDKITVTQTDAG
NHTSNYAVDLSDATKASLDKADNALQSWTAQVNGADAKVVNQTNNTVNFV
NGTNTIVKADANGNISVSTADNVTFNTVNASSFNAGNISIGTNGINAGNT
TITNVANGTNASDAVNVSQLNATNANVTNNTQNITKNAADIQSTKDGLNA
TNATVAGNTANITNNTNAIANNTAEINKGINFGNGTTDNNFALGDTINVT
SDSNIVVNTEDDGVKLSLADNVTVGNVTVNNTFKAGDVTINSTGIDAGNH
AITNVANGTQDSDAVNLSQLNATNANVTNNTQNITNNTAAIANNTANISN
NTNAIANNTQNITKNAADIQSTKDGLNATNATVAGNTANITNNTNAIANN
TAAINKGINFGNGTTANNFALGSTINVTSDSNIEVSTVADGVKLALASSI
AVNNVTVNDTFKAGDVTINSTGIDAGNHTITNVVKGVNATDAVNLSQLNA
GKSSVEAGDNVAVTSTSDANGTVYTVNANISTVSNGSDKVTVTSSSTGNH
TTNYAVDLSEAAKASLEKADSALQSLTTSADGTKAQTLDKDNSNANFISG
SNIRLTPSADGITIATAENVTFTNVNTTNFKAGDVTINSTGIDAGNHTIT
NVAAGTNKTDAVNLGQLEQFIGDNSYNWNLSDGTNNSAVADNSTVAIEGS
ANGDSANTSGIVTMLDGTNVSVDLSDKAKESLDKADSALQSWTAQVNGTD
AKVVNQTNNTVNFVNGTNTIVKADANGNISVSTADNVTFNTVNATTFNAG
NVSISNNGINAGNTTITNVANGTNASDAVNVSQLNATNANVTNNTKNITN
VTNLVNQGFNIGADNGADDNVKLGEKVDFNGDGNIVTTVTNNAIAFALSN
TLNLTDAGSVTMGDTVVNSTGMIINNGSTDNQTVSLTKDGLNNGGNTITG
VANGSNATDAVNLSQLNAGKSSVEAGDNVAVTSTSDANGTVYTVNANTST
VSNGSDKITVTQTDAGNHTSNYAVDLSDAAKASLDKADNALQSWTAQVNG
ADAKVVNQTNNTVNFVNGTNTIVKADANGNISVSTADNVTFNTVNASTFN
AGDVSFNTSGINAGNHTITNVANGSNASDAVNVAQLEANTTRYYSVNSTV
AGNRNNDGATGINAMAAGANAVASGDNATAIGQGTKANSAAAIAIGNNAN
ATSSRNDSVIAIGNNAQSTGSYSIAVGTNSVANHTWSMAMGISAKAIDDY
ATALGSSAQATSQWTTALGAGANATGSAATAVGSNTTATAGGATVVGYNS
SVTGANTTALGNNINVDTEGSVVLGNGSTAASATTETTATVNNLTYSGFA
GADNVATGDYVSVGSVGEERQIKNVAAGNVSATSTDAINGSQLYATQNVI
GNVANSVVNNFGGNATVDQNGNITFTDIGGTGANTIHDAIQNVSNVANMG
WNVQANGDTATKVAPGNTVQFINGQNIEIDRDGTNITVATADNVTFTNVN
TTALTAGPVTINSTGIDAGNHTITNVAAGTNATDAVNLAQLESYVGDNSY
NWNLSDGTNNNAVADNSTVTITGSANGDGANTSGIVTELNGTNVSVDLSN
KTKADIQQGVDANTTVNTKGITFAADSGTATERKLGETLAINGDGDLINT
TVSAGKVEVAASDKLKGAVNNATTALQSWTAQVNGTDAKVVDQTNNTVNF
VDGSNINITNNNGTIKVATTDNVTFNTVNASTFNAGGVSISNSGINAGNT
TITNVANGTQDSDAVNLSQLNATNANVTNNTNNIANNTANITNNTNAIAS
NTVAINKGINFGNGTTANNFALGSTINVTSDSNIVVNTTNAGVQLGLADN
IAVDNVTVNNTFKAGDVTINNNGIDAGNHAITNVTNGTNATDAVNVSQLN
ASKTSIVEGNNVNVTAKTDTNGTVYTVNANTSTVSNGSDKITVTQTDAGN
HTSNYAVDLSDAAKASLDKADSALQSLTTSADGAKAQTLDKDNSNANFIS
GSNIRLTPSADGITIATAENVTFTNVNTTNFKAGDVTINSTGIDAGNHTI
TNVAKGVNATDAVNLAQLESYVGDNSYNWNLSDGTNNNAVADNSTVTITG
SANGDSANTSGIVTELNGTNVSVDLSNKTKADIQQGVDANTTVNTKGITF
AADSGTATERKLGETLAINGDGDLINTTVSAGKVEVAASDKLKDAVNNAT
TALQSWTAQVNGADAKVVNQTNNTVNFVNGTNTIVKADANGNISVSTADN
VTFNTVNASTFNAGGVSISNSGINAGNTTITNVANGTNASDAVNLSQLNA
TNANVTNNTNNIANNTKNITNVTNLVNQGFNIGADNGTDDNVKLGEKVDF
NGDGNIVTTVTNNAIAFALSNTLNLTDAGSVTMGDTVVNGSGMIINNGST
NNQTVSLTKDGLNNGGNTITNVANGSNATDAVNLSQLNAGKSSVEAGDNV
AVTSTSDANGTVYTVNANTSTVSNGSDKITVTQTDAGNHTSNYAVDLSDA
TKASLDKADNALQSWTAQVNGADAKVVNQTNNTVNFVNGTNTIVKADANG
NISVSTADNVTFNTVNASTFNAGNVSISNSGINAGDTTITNVAAGNVSAT
STDAINGSQLYATQNVIGNVANSVVNNFGGNATVDQNGNITFTDIGGTGA
NTIHDAIQNVSNVANMGWNVQANGDTATKVVPGGTVQFINGQNIEISRNG
TNITVATADNVTFNNVNTTTLTAGPVTINNSGIDAGATQIKNVAAGTEDT
DAVNYKQLKDAVSNSSTTWNLTDNNDTANSTTVGNDSTVSFNNGTNTVAV
VNGTNVSYSLADNIALTNNGSVTVGNTMVDNTGISVGDNVTVTNTGFVAG
NVTVKQDGINAGGNKITGVADGDISANSTDAVNGGQLYNVIQNATAGVKT
EVEAGKNIVVTNSTGANGQTVYTVETAKEVDFDKVTVGNVTINKDTNKVS
GIANGDISATSSDAINGSQLYTANQNVADHLGGGSKVDENGNVTAPTYTV
VTNPSTNATTTANNVGDAINGLNTAISKPLTFAADSGSNSEMRLGSTVSI
KGGVSDSTKLSDNNIGVVSDGKGNLTVKLAKDISGLNSVTTGDTTMNSEG
ITIKNGAAGSSVSLTKNGLNNGGNRITNVAPGEVSQDSTDAVNGSQLHAT
NQQVVRNAQAINQVANHVNKVDRNLRAGIAGAMAAGGLYHATLPGKSMVA
AGVGTYRGESAIAVGYSRLSDNGKLGVKFSVNGNTRGDTGAAASVGYQW
>MS0087 unknown
MRDQSVATSAKLKTLGKMTALGVTGALTGIKASGNAVLGLAEPAMKFESA
MADVQKVVDFKTPEGFKNLSNDILDMTRTIPMAAEELAAITASGGQLGVA
EEDLKSFTTTIAKMSVAFDMSADASGDAMAKIANVYGIPITKLGNLGDAI
NELSNNSPARAADIVNAMSRVGGTAKQFGLSENAAAALTNSFISLGKAPQ
VAGTAINGMLTKLMTAEKGGKAFQGALNQVGISAKQLKRNIAKDGQAALV
DFLKRLEKLPKDKAMGVLVDLFGREYADDVAVLAGNVNVLDKSLRTLQET
DANGNLKYLGSMEKEFASRSATTENGLKLLSQSTDEFFKVVGARFLPIIN
TVSGGLAKLMHRVTDFAKEHEGLVDTFIYVGGAIAGVVTGFSALSAVIGV
SGMAWIGLSKPIGMFVSVLGTVFKWLKLGGLLFATLGVKVLDMALTFGKA
MFMMGRALLTNPIGLAITGIALGAYLIWDNWSWLSAKFGSLWQTVTGYFS
AAWDNIKGFFSSGIGNITATILNWSPLGLFYQIFRPVMSWFGVDLPNSFS
GFGKNIIDGLVNGIRRAWNGAKDWVIGLGQSIKGWFTGEMKIHSPSRVFM
EYGDNIAQGLAIGVAKNAVLAADAVLAMGDKMKNAAPKSIPSPVTQPMQV
KTPVQDMADFAVDMAKNAITVPIKPVAQSVTPVIKKSAKQVSKPALIQSA
VKSEQVLAPSVQTSEMPTPVTMKNARLQYIQRMQTLDKMQSTVKSEPLFT
PVKTIAEPILSKEKGFFGSLWDDVKFGANVVGNLLGLSQPSLKTPDFNPS
AGGRDSLIFSDYEPLNRNAVSQRTVNQDAGGIVVNFNPTINVNGNAPQGV
TEQITQALQMTAHDFERLLNRVLDQRQRRAY
>MS0005 unknown
MKINLMNLNVNMNRAILFFSGDLDYYKTKSAVFFTALCCSDE
>MS0412 unknown
MQKNLIILTALLATPGCGTIVQLANPSHKYEAYDGTKYDWQQAQKWGMPI
LDLPLSFLLDSALLPYVLSQE
>MS0012 unknown
MILNYLCSALSKNKDILSNIINKRGFMMKKLLTICVITTILAACSMPNGR
SKPQEGNMQIEEALKACQQAMTSNSTREDFDACMLKKGFERPANKNQPKA
N
>MS0857 unknown
MFTSIQREVNQFINRGLDRTLRIAVTGLSQSGKTAFITSLINQLINIDNV
TNGHLPLFEAARQQRIVGVKRIPQINLNIPRFDYEANLNSLMASPPQWPQ
STRGVSETRLAVRYHNSGLFSHIKEKSTLYLDIFDYPGEWLLDLPLLNLN
YQQWSLEQQNLRQGLRAELAQTWLEKTKKLDLTAMADEDILAQIAKDYTA
YLQACKEQGLHFIQPGRFVLPAELEGAPVLQFFPLLHLAEKDWKKLKEEA
KPNSYFAILNQRYDYYKNKVVKGFYENYFVHFDRQLILADCLTPINHSRQ
AFQDMQEGLQQLFKNFHYGKRRLINRLFYPRIDKLMFIATKADHITSDQI
PNLVSLMRQLVQDGGRHVAFEGIETGFTAIAAIRATKQVLVEQEGKTFKA
LQGIRSKDKRQVTVYPGSVPSRLPSIDFWQQQKFDFDQFEPQPLESGEII
PHLRMDSVLQFLLGDKLA
>MS0310 unknown
MRHIKWRRFPILVKIGRLILSKINIYCFKLNNI
>MS1383 unknown
MLSRNPLFPNKVTRYELTPETVDCVVFCSKNYRLILPDLHKITDRFNTYF
HYTITAYGKDIEPGVPTA
>MS2018 unknown
MTFISSFYRGTIPKPGEISLAHNGVLFLDELPEFERRVLDALRQPLESGE
IIISRATAKIQFPAKFQLIAAMNPSPTGNYQGTHNRTSPQQIMRYLNRLS
GPFLDRFDLSIEVPLLPKGALQSLDNRGETSAQIRQRILQVREIQLTRAG
KVNAHLSGKEIERDCKLSTQDSIFLENALTKLGLSVRAYHRILKVSRTIA
DLDNELHISQRHIAEALGYRAMDRLLQKLNNND
>MS0620 unknown
MYKNRRLTLIPHKNRKVRSKIRKFRLSGENHGNSTALW
>MS0615 unknown
MTNTILLICIILFFAYAFYDQFGMDKRKGETKLKVRLKKQAKIDAVIFVI
LIACLFAYQSKESLLNIDSFTIFLLATAVVLAVYTAFIRSPMLILKEKGF
FYSNLFIEYDKIRQLNLAEGNIFVVDLTNGKRLLLPIADERDKEKVVTFF
GGYKEHKQENK
>MS2359 unknown
MYNHNENQENSTALLELIDGFEQNPAELFQTEMEKVAENMKDLPFYREDI
PCFCPKFVQFENQWIGMALTPWMLSVLVLPGPNQQWKARTVGDKIALAFP
YKTLNFTVSSLDNVPQYLSCSLHSPLEANLSKEHAVQLTKDCLTMLLSLP
IKQKAPSDLNRRNMFGAMLK
>MS0983 unknown
MSKIKTSFNVVEEKSAHLAVLIDADNASAQTIKAILEETTKFGEATVKRI
YGNFVGDSGKWKAVINEYAIKPMQQFAYTKGKNATDGFMIIDAMDLLYTN
RFDGFCIVSSDSDFTALAIRLKEQGVTVYGFGKKQTPEAFLNACSQFIYV
ENLLPELNDDKQVDITLPNASNTQKKVQQTENSTPTVSIDNQKIQSTELP
IETIRKVFEQFDSEWVAMTAFGSTWKRLQVDIDPRSYGCKKFTDLVKKYP
DVFDYKMETDSDTTQEHMYVKLKI
>MS1853 unknown
MNFDRTFFNKKTTYSLPYRISTRTFLHKVGGGA
>MS0952 unknown
MLQSPDLINKQEIIMQALKQYLIEITEQNLNDTLQLSKEHPLVLVFYAPS
HQPSVEFTTLLERYAEQYQGQFALAKVNCETQQAIAMQFQIRNLPTAYLF
KEAQAVDAFQNVISEEELKQRLSQILPKEEEIKFNLALDFLQAEDYDKAL
PLLKEAWELSERKNSDIALLYAETYIAMKKTEPATEILNKIPLQDRDSRW
HGLQAQIELLIKAADTPEIQQLQADFSKKPTTEIALKLAVQLHQANRNEE
ALELLFNILKQDLSAQNGEVKQQFLSILSAIGNNDPLTNKYRRLLYSLLY
>MS1078 unknown
MRDQIKADAKIIGFFVLVFDYSNPYFYSYKSYN
>MS2110 unknown
MANQMVMAFLGRLWEFLTNLWWLPLALLFLAFLKSRWFKGRFGEKAIQSR
LSGLDKKVYRPFHDLIVPSHNTTTQIDHIYVSCFGIFVVETKNYSGWIFG
SEKQARWTQTIYRKKHSFQNPLRQNYAHIKALASLLELPESVFHSVVVFL
GGCEFKTQMPENVCYIGQVEHYIRNIRMVMLDNTEVDRICTILQNKKYAV
NNATRVVHKNNLRQRHQSYN
>MS0989 unknown
MKIENDLIRFIQSYETYLVSSFSKLWSAVKVEYDKLEAYSVIGGLLSRQV
TLAIQMTRSPNILNGHSAPLFLRSMTDLHITLAWIMLDLEERSKKYILYG
LGEEKLLVEHYKKRIDDSPNNPENELMEGMIETRLHWIDSQRRDFLVEVN
LGSWSQLDYRKMAQEANCESLYTFAYKPFSQGAHNMWPHVSRYNCKYCES
PLHKYHLIPDLFEAPADLDYLFRSCKYVHMAYEIFINKFGLDFSELMPLD
WWDNYFMEIDVEENAING
>MS1057 unknown
MRSIFGKIFMRQQINKWKYHFNWIKHYISPSLVVIITVFSAYLVYQQNYL
MKESKRPFLAFTPKIIVLPNNPQMIEAKIVVGNYGEGSAIIDEFKVQING
QTYQSNYSSKWREILSHNQISHSCPLSEGWLIKNAVLKAGDEEDNFLRLL
YPMAEVPNGKEPCITPFIDLIKAGKLTLSVKYHSIYNISYSQENIIEYDF
SALQEQFKSP
>MS0594 unknown
MVCKTVAVVTSTIGRESLERAIRSVHTQTYPCRHYVFVDGEQFHSSAKAI
LDKYPHVIALYLPMNTGANGWFNSYINAAAPFLVKEDILCFLDDDNTYRP
NHIQTIVDCFNAESNLDFAYSLRNFVRPGGAFVVRDDYQSLGRYVHKLVS
GCTYNIHVQGKKIPVVCRFSKQNLIDVNCMALSLVCARRVANIWCERGYG
NDKAVTDYLLANTKGEMTGRYTVDYTINYRDAFSTYDEFAQYLSENFAKE
IAEKFLTLFSQENIDAYFGERPWAKE
>MS2138 unknown
MNPIYKLDNLIKTLATNLSPVILLLTRVIIGYMFLLHGLQKLTGGVELTS
LMGVGGIIETLGGIFIILGLFTRFTAFILAGQMAVAYFMFHASAETLFNP
VENQGELAVLYSMTYLILMITGAGKISLDAKFNK
>MS0455 unknown
MDNTSAKAEAGSVFSFILTVMVEELGALVEDK
>MS0378 unknown
MNVSLINKRNHFKKCGQNFENFDRTLPSRLP
>MS1714 unknown
MNIRWNIILGTIALVLLAWFYTLNQDKPDLTRLIKAPESPEYTGHKMETT
IFSPAGKKQYQAYSDTARHYDQDGHTEFVNPVVFALEVETENQGKQSWKL
TAKSATLTKDNLLYLNGEVVAQSLDPISRLQRIETEAAVVNLKNQDITSD
NMVTIRGLNFTSSGLKLTGNLKQQAATLKEQVKTHYEISNQ
>MS2061 unknown
MWWIILLKSAVNFYEEKGLNNIQAFCFGIQLKISRLR
>MS0168 unknown
MILFAGDPHGYFKHLYPFVRGKEDIALIILGDLQLTTVEELDKLSQYCDL
WYIHGNHDSKTVAAFEALWGSKWKNRNLHGRVAEIQGKKIAGLGGVFRGQ
IWMPPNKPLFLDPIHYCQYCSQEKIWRGGIPLRHRSSIFPADIENLSKET
ADILICHEAPKPHPSGFTVLNELASQMQIKHLFHGHHHENFDYSELAPQT
PFAITNVGFRSLCDEKGNYLLKNIDDRKNKP
>MS0491 unknown
MRQVIMILAAYGDIAQLGERLNGIQEVVGSIPIISTKFKALNLFQGFFVF
ACYVQNSVKINRTFIVKKCG
>MS1712 unknown
MMVDLISALGRNVINSVKALGRAGFMLFGSLVGKPQIKKHFPLLIKQLYV
LGVQSLLIILLSGFFIGMVLGLQGYVILVDFAAEANVGQLVALSLLRELG
PVVTALLFAGRAGSALTAEIGLMKATEQLSSLEMMAVDPLRRVIAPRFWA
GVIAMPVLTVLFTAVGIIGGHLIGVEWKGIDSGSFWSVMQNAVRTLDLWD
GFIKSLVFAFTVTWIALFNGYDCIPTSEGISQATTRTVVNASLLVLGLDF
VLTAIMFGAG
>MS2197 unknown
MTDLQELRAETREIITDLLNDGSDPDALYIIEHHIAHYDFDKLEKIAVDA
FKAGYEVSEAEEFEDENGKVIYCFDIISEVELKPEIIDLQQKEILPILQK
HNGIYDGWGTYFEDPNASDDEYGDDGEFFDDEDDFDDENERPVH
>MS0270 unknown
MLVRLGLYRIFQVYKIKKSATNRGFSVISELIFI
>MS1382 unknown
MESRDIGAYDSCPSGCKYCYANKSSAKARACSNITIRIRPYCSGICVKRM
LSLKARKKAF
>MS0868 unknown
MAGLTDKGTFMEVTIEITVILFTVAVIAGFIDSIAGGGGLITIPALLMTG
MPPALALGTNKLQACGGSFSASWYFIRRRAVDLSAVWLILLMTFIGAVIG
TILIQLVDASLIKKVIPFLVLAIGLYFLFTPKLGEQDARQRLSYGVYAFT
AGVSIGFYDGFFGPGTGSILSLACVTLLGFNLAKATAHAKVFNFTSNFAS
LIFFLIGGHILWSVGLVMLVGQFIGAHFGAKMVLSGGKKIIRPMVVIMSF
IMTVKMAYDQGWFS
>MS1310 unknown
MRLSHHSCKNITDFALINAHCRYFSAEIVFQQKCGEIL
>MS1772 unknown
MWCKNMNVLHSDTTISPLRRFIRQQRGSVTIEFVFMLILLILILAFMTDL
AMLRSTTGRLDNISYSLANILRERTQLYDGKENLATENVNRDVNNFKLLA
KRMAFGDKNSNKEIYVVLEYLAPQNSVYRIIGDSAKCEPYDSLQGLENLS
PRSEINDTRKIPLYQVTVCVPNYSFFSALVPGVAKNMKETIRSSSITVSR
>MS0994 unknown
MKTPNMYGMTENMGKCSFKFKLLTENKDFLREKRKDLKSVKCGRKLGNFT
KKARIFLRLRASCI
>MS0384 unknown
MYQKIRRFLSTCYLRRTIMALRNEEGHIDLKLMERMMKIPGGLVIIPLLL
AVAIKTFFPQFFEIGGFTTGLFQKGQPAMMGIFLILCGASINIRQVGMPL
YKGVVLTSSKFFLGVALGLLVGHLFGPEGIWGLSPIVLIAAITNSNSSLY
ISLSSQFGNSTDTGAISILSLNDGPFFTLIALGASGLANIPFMAVVATLI
PLLIGFVWGNIDTKFRELCGKAQPIIIFFMTIAIGSGTDVSTILKAGASG
IILGILSTLTAAVFFYVFNIFLPKRERNAMGAAIGTTALNSAMTPAAVAD
ADPTFTPHVPLATAQCATASIITLFLCPFVVAFFDRQMRKNKLGIYSAEG
WAGKALAEREALAKSAS
>MS0880 unknown
MPILAYVNLNVVSKMQYSEVVKKKEGNLLQKSAVKIYRFL
>MS1742 unknown
MTNKNLASLFYLALFSASSAVAAVNPSDLIWKSAAFGQSTDLNFGSTILP
EKVGVNKTTVDGHPVQEGALATKFTIESRGGKLANSHEGLTYYYTELPIN
TNFVISADVRLEQLGPETGAKPNRQEGAGIMVRDIVGKPRAEPQPMGYEE
FPAASNMVMNLLRSNTKAHNGKVNINASYREGIKEPWGTAGNKLVREDYA
EGIDFENNPLRLTLEKNDQGFVVTYIQDGKEYKKVLDKVNPGILANQNAD
KQYLGFFASRNAKITVENVDLKLTDGKKVEPAKFTPKAMPLIVNIESSTK
ATGSDYVFQARANYEGSFVLQRGNKTLFKSPVVKAGEYVQHKLKLNQNKT
DLKVQFIPKAKLKENGFEQNISIEKHQLQNPKLLYVATNGSAAGNGSAEK
PLDLVTALELLPPGGTIQLQPGEYAAVTLDTTMSGLKDSPKTLKGIEGKV
KFIGEVLHKASFWNMENIEVSGASLIVHGSHNNFSHIVTHGAPDTGFQIT
SPEKIGRSLWASHNTVTDSISFNNMDDSQINADGFAAKMRIGDGNSFIRC
ISHHNVDDGWDLFNKVEDGPNGVVTIKDSIAFMNGQTLKLKSKSASIGNG
FKLGGEGLPVNHVIKNSISFRNNMDGFTDNFNPGTFTVENNVAIDNKRFN
YLFRKSPYENGPKQGVFKNNRSFRFYQESKYTDVVNGSLLNNNEFLTTTE
MTPAKTELLQKLQALSKVEFSEDNTGLEEVKKIQALLR
>MS0742 unknown
MLMFANRTFLCNIRANFIDSSIYKKTSMDMSQDSVKSLQSTFKTVGVVGR
PRNDSTLQMHKNIFHWLCEQGYQVLVENEIGKALNLSENHLASLDQIGQH
AQLAIVIGGDGNMLSHARILCKYNTPLIGINRGNLGFLTDIDPKNAYAQL
EACLNGEFFVEERFLLEAVVKRHGETVARGNAINELVIHPAKIAHMIDFH
VYIDDKFAFSQRSDGLIVATPTGSTAYSLSAGGPILTPQLNAIALVPMFP
HTLSSRPLVVDGNSKISVNFAEYNIPQLEISCDSQLALDICCNDVVHIQK
SPYKLRLLHLHNYNYYNVLSSKLGWLKKLF
>MS1960 unknown
MLSDVFYDPEEIMAKKTNQVPIAENSQTAYLHNRGTIQDNAVKALLRTPL
FRSRIEKKLKGKGSYQRKAKHAGRYFEKPDDKSFGYKSFIIGFLLGPAYL
L
>MS1449 unknown
MFLSRISFYFTLVCGLIIAMVIAPSAKANMFSVSETEINQYLSKKGEIAD
KIGFPGLFAMDYKVQNLTAKVGQNNDGRVELSGTIDGLLNLQKNDYVGKI
DLTVDTIPYYDAEKGAVYLRDLRITNWTGSPQQYMEKLEPMMPFLSRSLA
ALMATMPIYTLDESKPRDMLIKKFAKGIRVEKGQLSLDAGIL
>MS1425 unknown
MQMKTTANLTALFNLFWLKTKIKWDFTYLAVYS
>MS0355 unknown
MQGLLLDEPLERSNVLSTKWTPKAQKCGQF
>MS1375 unknown
MHCPFCSTEETKVIDSRLVSDGYQVRRRRECTKCHERFTTFETAELVVPK
IIKNNGMREPFNEDKLRRGIQHALEKRPVSADDVEKAISHITHQLRATGE
REVPSKLVGSLVMEELKKLDKVAYIRFASVYLSFENINEFSNEIEKLKD
>MS0944 unknown
MTEQTFIPGKDAALEDSIAKFQQKLTALGFNIEEASWLNPVPNVWSVHIR
DKDCPQCFANGKGGSQKAALASALGEYFERLSTNYFFSDYYLGQDLANGE
FVHYPTEKWFPIEDDSSLPEGILDEFLLNYFDPNRELTPELLVDLQSGNY
DRGIVAMPYVRQSDQQTVYIPQSIIANLYASNGMSAGNTKYEARVQGLSE
VFERYVKNRILKEAISLPPIPQEVIEQYPTIAASINKLEEEGFPILAYDA
SLGGKYPVICVILLNPNNGTCFASFGAHPNFQVAFERTVTELLQGRSLKD
LDVFSPPSFNNGDVADLANLETHFIDSSGLISWDLFKDEADYDFVHWDFS
GTSHEEYDNLMNIFNEDKKEVYIMDYNHLDVYACRSIVPGMSDIYPADDL
IYANNNMGMEWREILLDLPHFHHDKETYLELLEELDEQAIDDATRVREFI
GLVPPPKSGWVTLRTGELKSMLHLALGDLEMALEWANWTYNMNSSVFIPE
RANYYRCLISTLELFLDESRTPIQYRNVFEKMYGKTAADFAWNAVQGGNP
FYDLLADDEHLNKFQAHQKLLKAYEKLQTAKRENWK
>MS1469 unknown
MLTTLLQTHITFAFLSLMLLIVRGYMQLQGKDWRTVKLLKITPHLADTLL
VLSGVALVFVFGYGLQMWLIGKVLLLVLYAFFSAKFFQKNAVKSNILFLI
LALSAFLAAMYLGYFH
>MS2305 unknown
MNYKNVIGFGVAGNFAGHLEQAGEANDFLAVKTQEAVQPKAIFPFYVPSE
KAGFLSVYPLSSNKLRFPDNSGDNLQIEPEIAILCHVIYQNNQVVKLIPY
SFGAYNDCSIRRPNANKICEKKNWGAETKGLSDTLIPLTSFELGGEIDKY
RIACFHRREEKTEIYGLDSPALGYSYFHTKLLDWIVDRMNNQPDQGPMNN
IAELLAIADYPTEAIISIGATRYTEFGESHYLQKGDTSIVVVYNGEKYTK
QQILEMAQAQSFPDDISALIQQVIK
>MS0114 unknown
MENQLAELKNEIEALRTAQEELQLLLGAQKLLFNAVAATLDKEKKQAISQ
AIYEMLNSHAVFSAQEPVVLAARDHLLTFANLMAQQANEQSPE
>MS0482 unknown
MIRFYQLAISPMIGPRCRFTPTCSCYGIEAIKTHGALKGSWLTLKRILKC
HPLSKGGYDPVPPKINNNVEKK
>MS1046 unknown
MHSLTANCVSWGKTSKLNRILLNYLYKEKS
>MS1828 unknown
MSETKTVNLSELPQQKLKDLLEFPCSFTFKVVGAARPDLIDDVVMLVQQH
AKGDYNPRNAVSSKGTYHSVSIDIIAEEIEQIERLYEELAKIEGVRMVL
>MS0520 unknown
MKSAVKKRDFLSVDFANDFKKRSNRITGFNKISEIVFFEKTPRLYIVIIK
PRRLLRG
>MS1448 unknown
MTTKKQAVFSRLVNELVQKNQGKRIFSFDFENQTYWVKQPEKLTGVWKIL
KPHPKQSFREELHILKNLYERGAPVPQVILSGEDFFVLKDVGPTLNHWIE
NAGLNLTPAEKNQILVDAIKALTSLHKKGVTHGRPAIRDIAWRQGKVTFM
DFESHSRSLNLQWHKIRDVLVFIHSLCRSKHLSGEQIQYLINKYEEYCES
DLWQDVLNLVAKFRFLYYILLVFKPVARMDLIAIYRLFQYLLPLTEENK
>MS1369 unknown
MLVVLFSLVKEQINDNFTIITSSQNFDKIDRTFMRIILISSLRRTG
>MS0071 unknown
MFRVVGEINMKKYISDKNFLAGFIFFFVSAFYLISAFQIETKNLVSVEAD
FMPIIYGSLLLTTSIVLMITSFFKIRNTVVNKENKETDWKRIFSVIGLVF
VYVLLMQYIGFIVTSIPFLFCLSVLLTPLYIKKNYIVYSIFSIVLPILAY
FLFSYYLNLTMPSGFLF
>MS0444 unknown
MATMNIVLLTFGSRLENHYQASFAILSFLKDPAVKRVIMVTDRPEFYAFF
GNKIEFIQINEDTLTQWQGEYQFFWRVKIKALEKVQKRYPAEHLLYIDSD
TFLATDLAGIQDKLSQNQLFMHKLECALGDEIDNTTKKMHNSLKDKTFAG
IRLDSQSTMWNAGVIALPANKAKEIIALSLRLCDEICATDCTRRLVEQFS
FSIALNHYGGLNACDHIIGHYWGNKNEWNKLISAFFVNALLKNLSLQDCI
NEVAEFDWNRLPIHKKQRSTNSKLKALTDKYFPDKNISYFSK
>MS1803 unknown
MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ
GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK
SLRR
>MS1397 unknown
MMKKSLFLTALSLAILTGCQNVGSQALQIEKQGSFTVGGSYVTHKGTFKQ
ENFIAPEGQRAYGDFAYVKYQTPTNAKKYPLVFQHGGAQSSRTWESTVDG
REGFDTLFLRKGYSTYLVDQPRSGKSNLSTKAITPDTPWASNPMYADKTF
WILSRMGHYDSHNQPVANAQFPAGEAAYQAFQQAWTIGSGPLDNDLNADV
LTQLVDQTKGAILVTHSMGGTIGWRTALRTDNVKAIVAWEPGGTPFIFPE
NEMPKITKARFEALSGAAMGVPMNEFLKLTKIPIVLYYGDYIQVGSDNVG
EDKWGTELAMAKQFVATINKHGGDATLVHLPEIGIKGNSHFLMGEKNNQQ
LADLMADWLKQKELDK
>MS0299 unknown
MNISVKNRKKTTALFRNEKHQHSRLSAELAF
>MS0848 unknown
MLTAFLFRIGLNFFYRADYCNGNKIQQILTALLTLFCYLFY
>MS1309 unknown
MRIAGTFLRKLFFSKSAVKFCEILSDKIQIGS
>MS0358 unknown
MTTKTTDETFTVDCPICKKAVIWSPQSPYRPFCSKRCQLIDLGEWAAEEK
AIPCENADFAMDPENNEDWAKH
>MS0847 unknown
MADSRIVLDAREQSTSLLSTHKVLRNTYFLLGMTLAFSAFVAYISISLGL
PHPGIIVTLVGFYGLLFLTNSLANSGWGILSAFAFTGFLGYTLGPILNVY
IGAGLSETVVLALSGTAAVFFACSAYVLTTRKDMSFLSGMIFSLFIVLLL
GMVASIFFQTPALHLAISGLFVIFSSAAILFETSNIIHGGETNYIRATVS
LFVSIYNLFLSLLQLLGIFGGDD
>MS0828 unknown
MHWFYKKGGITPTPQAIKSLKIYAKLTALLINRVPL
>MS0867 unknown
MKKYLKTLVFSTALLTGLTTVNAAPEDWQRIKRPIPSSNGQAEPIGSYSN
GCIIGAQAMPARGDGFQVIRMNKNRFYGHPEMISYLQRLGKKVEQAGLPT
MLIGDIAMPGGGRFLTGHASHQMGLDADIWLRMGSMSDQDALNSDGKGLL
VVDRAEQRVDENIWNQNHFNLIKLAAQDSKVSRIFVNPAIKVKLCQTERQ
DRSWLQKIRPWFGHDSHIHVRLTCPYGANYCENQPAIPRGDGCGAELYSW
FEPQKPSSGATTKKTLPPEPFLCQQVLNSPNRSEWQD
>MS0331 unknown
MYMFVLFGLNSEHSQHINYFLYTGELSMKKLLKLSLVAGLAMTALAVQAE
ERFITIGTGGQTGVYYVVGQSICQLVNRDTAKTQIKCNAPSTGASIANLN
AIADKQMDMGIAQSDWQYHAYNGTSAFEGKKNEKLRAVFSLHAEPFTLMA
RDDSGIKTFDDLKGKRVNVGDPGSGTRATINVIMAEKGWTDKNFKVAAEL
KPAEMASAMCDNNLDAITYNVGHPNGALKEAAASCDSHLVPVTGPEIDKL
VSEHSYYAKAVIPGGLYKGTDNPVETFGSYATLVSSTDVDADKVYAVVKA
VFDNFDRFKRLHPAFANLKEEDMIKNALSAPLHEGAERYYKERGWLK
>MS0947 unknown
MTVQSGLLLEHCKAAIYLESEIHNLSLIPKACRQFNQALEQLREQYPDAM
LGAVVAFGDNAWKRLSQNSAPELKSFVALGKGNLAAPATQQDLLVHIQSL
RPDVNFSAAKAAMDAFGEAIRVMQEIHGFRWVEERDLTGFIDGTENPQGD
DRPVVGTIAEGEDADGSYVMTQRYEHELAKWEKLSQHKQEQVIGRTKPDS
EELDEVPETSHVGRTDLKEDGKGLKILRQSLPYGTASGTNGLFFIAYCAT
LHNIEQQLLSMFGEKDGKTDRLLGFTKPVTGSYYFAPSLEKLLSL
>MS0011 unknown
MTKQIAVLIGSGSTTSFSKLVVSHLQKMAPASIQLNIVEIADLPLYDRDL
DENSPAQYTRVREQIANADGVILVSPEHNGAISAMLKNAIDVVSRPMGQS
KWFGKPAGIVTVAAGMAGGVRVADQLRTIASGSFIGMPVYQQNACVGGLF
NGVFDQNGEITIDAVKQMLQQFIDGYAEFVAKF
>MS2009 unknown
MPVFNAHVAQGKLTKEQKQGLADAFVLAIHDALNAPMEDQFVIINEHPQD
NIFIHPTFPNMQRTDKRMVVTVDVSTTRTLEEKRKLTELVTKYAVEKAGI
GQDDISLLIYALPLENMSFGRGILMPDDAEAMVKRTRS
>MS1715 unknown
MKLAINKILLTSALVMTSLSAFALKDDTNQPINIVSDNQSLDMEKSIVTF
TDNVVITQGSILIKANKVIITRPPEGSKQKETVEAFGNPVTFHQMLDDGK
PADGKANRVHYDLGKEFLTLTGNAQLKQLDSTIDGDVITYDVNKQQLKAS
STAKSRVKTVLIPSQLNEKKK
>MS2379 unknown
MTHLIVATHGKFSQEIVNSAAMVFGEDENTHVVTFLPGEGGDDLVAKYKA
IIATLPENEPVLFLVDLFGGSPYNAAARVAAEYENSDIVTGISLPMLLEV
LDAKDGASLPELVETAKEVGLAAVKSFRQPKEEAKPAVKAEVAPAAAPAP
RDPNLKGNMNISLLRIDSRLIHGQVMTSWAKAVKCEAIFAISDEVANDAI
RRELLLQIVPEHLKGYVITVDKAIKVWHNPKYAGKNIIWLVTNPSDIVRL
IEGGVKIRNVNVGGMTFNEGDQLISQAVAINQTDLAAFYKLLELGVDMSL
QQVAANKKEPLDKKRLDEIKF
>MS2343 unknown
MKYLTFNFFLKKNLYMFSKIINTFIICFTLCGCSLINNALEEMQKNIDSN
QQQINSMKKVAIIYNAKEIEAIVVTPAKYANKSPLIESIKKNYEIKTQNY
TLQFNEKVKAEQEKSSINQYFAHELSKKIEQAGYQVTLVDKADMTNLNAY
LAKTPKIDGVIKLNIMLGYTSPDNSFLFEPSTLINYEVYNKTAQRLSHGK
VGAIGGWISDKYMTFNGLYEDADNARAKLKKYLMDNLDSTAQEILKVTVS
NLN
>MS1117 unknown
MKKSIKMSIWSYSLFSVNSSIKYDYILPFLGQNSQLILCK
>MS0177 unknown
MIEKEKKTTALFYYHLGLVIKGEKRGTSEGVERSALTVVPIV
>MS0540 unknown
MEWTTIITLAGSFFLLLAIGVPISFAIGVSSLITIMLAIPFDAAIAVISQ
KMASGLDSFSLLAIPFFILAGNIMNRGGIALRLIEFAKVLGGRLPGSLAH
VNVLANMMFGSISGSAVAAAAAVGGTMAPLQKKEGYDPAFSAAVNITSSP
TGLLIPPSNTFIVYSLISGGTSIGALFLAGYIPGILMGLGIMIIAYFIAK
KNKYPVSPKPTFKEVTHRTLDALPSLGLVVVIIGGIIAGIFTATEASAIA
VVYTLILSMVIYKEISLKELPQIILDAMTTTSIVLLLIGASMGMSWAMAN
ADIPYTISDALLSVSENPIVILLIINLTLLIVGTFMDMTPALLIFTPIFL
PIVTELGMDPVHFGILMAFNLSIGICTPPVGSTLFVGCSVAGVKIDKVIK
PLLPFYAILILTLFLVTFLPQLSLWLPQTLLGY
>MS1898 unknown
MVSVYFLSKRFLLIKTTTPRLTRGVENYGQFKNPVNYTALLTQI
>MS0302 unknown
MKLFLITFGIFILIIFGMSIGYIIKKKTIKGSCGGITALGMKKMCDCEEP
CDNLKDKVAKGEADASELDRFNKEPQFYEVK
>MS1716 unknown
MSILYAENLAKSYKGRQVVSDVSFTVKSNEIVGLLGPNGAGKTTSFYMVV
GLVRHDQGKIRIDDEDISLLPMHNRAQKGVGYLPQEASIFRRLSVYDNLM
AVLEIRKDLTKEQRHARAEELIDEFNIGHIRDNLGQSLSGGERRRVEIAR
ALAANPKFILLDEPFAGVDPISVIDIKKIIKDLRDRGLGVLITDHNVRET
LDVCERAYIVSAGKMIATGTPTDILNDEHVKRVYLGEEFKL
>MS0079 unknown
MASLILTPEWAEEIYQLETSDPVMGGPDGIDNRQAIQLGKRTEYLKQDVE
KRAPIASPTFTGTPKAPTAKTGTATEQLATTQFVSNAISALVGSAPETLD
TLAEIAQAMGEDESLKETLLAEIGKKATNEAFSTLKNLLIGIPFPYPLSA
VPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFIRGWDNGRGVDSS
RELLRSQGAELSAHTHYVTVTRYANSSGEFGAKISTFSAINNSGWLLSGA
DGLLLAANKSGEIVSEKNSVANLISNTGGNETRPRNVAFQYICLAK
>MS1133 unknown
MSDFYHVINWNKLMEHNLTDIINIFNQCFESEYNTKLEKGGEYPIYLPAF
LDENGVKSERPYNVIYFARGFYSSALHEISHWLVAGKERRKLEDFGYWYE
PDGRSTQRQREFERVEVKPQAIEWVLATAADFRYFASADNLNGNPGDTAP
SNKRFIIR
>MS1639 unknown
MRHFIFKNYKDSFMQKYDIHVCLVSAQAAPNLLPTLDKGFKPKKAVFVVS
TRNDIKEKANSLHLAFKQNGIDVDIMNLSDEFDFQRMEIELLDLLTKYKN
ENIALNVTGGTKLMAIAAQNAFSGVKPIFYINTDREEIIFISKEGDNYIP
VQKLNTETFISTYLSGYGITILKNKDNFNFHKLGMFTERFATRQKNYKDI
ISSLNSLACAADNSNLEASLAGYNNDLRLIIEDLADEDLVKLNGDIVDFK
NESTRSFLNGNWLEYFTYKQANSITDVIDVDWNVEVVDSKYEKNKIGVNN
ELDVVFMAKNKCHIIECKTMNFENEENSAKLQGYLDKLKSLKDYGGSLTK
VCLVSFYPIPAHVKRRAEKDNIEIIDDYRIKDLKEKLQNWIREK
>MS2376 unknown
MNFKDFIKAPPAEGYLKNSSKLVTALFIIAGLCYYFSKQYGGYAAVICLV
IAFMVMFGQKLMLSQITKDFNEMYFAKKQFEETQNLDYIRFIQARATQIL
VDNKVLSEKAKRELGFLLQYAEGKLKK
>MS0942 unknown
MKKVRSNFTDFYLKKSSIFDRTFYLKSGAGYLL
>MS1376 unknown
MKVTSSAIKNGAFEDKYGKRGSQFTPNGMPSYSIPFEITGAPEGTKSFAV
VLEDKDAVTASGFVWIHWLIANLERTSVLENESQTATDFIQGANSWSSVL
AKLDITEASAYGGMAPPNCLHRYELFVYALDTKLDLQPGFKFNELHFAMQ
GHILAKAEIMGTYDV
>MS0261 unknown
MLMGICALAFDFGTKSIGCAVGQSITGTAQALPAFKAQNGIPNWDSIEKC
LKEWKPDILVVGLPLNMDGTEQEFTSRARKFANRLHGRFGVKVELQDERL
TTTEARTEIFQRGGYKALNKSKVDGISAALILESWFERHS
>MS1945 unknown
MAGYFALTYLGKITPKLTSLFAPVTLGKLLGKLCILR
>MS0650 unknown
MATIRRHNESRRNLPTLDDVSVYLNKIPYGGIYAIEAEAYLIDEVVMKLI
DLIDPQL
>MS2157 unknown
MHSQRKTMIINTGGRTDTVHYYSKWLLKRFEEGYVLSRNPLFPNKVTRYE
LTPDKVDCVVFCSKNYRPILPDLHKITDRFNTYFHYTITAYGKDIEPGVQ
TIEKSVETLKRLADIVGKQRIAWRYDPVLLTEKYTIERHLETFDYLAREL
TPYVDRCIFSFVEMYKKLAVNMPEIILLTDEDKHRLAKAMGEIAQRYGLY
LQTCATEGDFSAYGIHGSGCMTLDIIGRANGVNFKSLKHKGNRQHCGCVE
SRDIGAYDSCPSGCKYCYANKSPAKARAMQQYHDPDSPLLLGHLRETDVV
TQSPQKSFLAPQQMDLIGLWG
>MS0498 unknown
MKTSLSIFNDQQKRRALILISLFHILIIAASNYLVQVPFEIHLPFTALGA
KENFSFHSTWGTITFPFVFLATDLTVRIFGAKEARWIVFAVMFPALIISY
VISTLFSDGQYQNMSALMTFNSFVFRIALASFCAYAFGQLLDVFVFNRLR
RLKTWWIAPSSSMTFGSLADTFLFFAVAFYQSSDPFMAEHWVELGFVDYL
FKLFVGIVLFVPAYGVALNFILRNVLGITPQHS
>MS2141 unknown
METKPAEQQAIVDIGCENLGISVKTEDGTLTMYHATQKMRRVLT
>MS2014 unknown
MGTTNAGLPKNYRIKLGVIKTPLYSNESISSWLIRAALDCGTEPITFTGF
YWNKWRLWTYDLDRGFEPIAQHIYADITELSLNQQVNLVNHSLYSVLRPI
NGKNTLIKGQAKWVLSRGSRNRSFRVGQSYCPCCLEETPYLRNEWRFAWH
FGCLKHKVLFGSKCSCCGGLYQPHLLSAEKRQLNYCHQCGEKLQVITTPL
NEVEIATMETLDKVFTTNSGECFGKRVDAQVYFAVLRYFINLVRRTAVAK
STHAFARFVEECGISQAEICQTRTALAFEQLPVEERKNLLVNAIKILNLS
SKDFIQATQQSAITQKAFAFENYPMELDTLFKYASKGKTVSRKTVTNKPK
TDSVLSMNRQWERLKRQLKIAA
>MS1405 unknown
MQNILILGATGSLAAQIIPTLLAETDDNLTLFARNPSRLAQFKSERVQIV
QGDMMNIEQLSEALKGKDMVYAGLAGNLEPMAKNLVTAMKSAQVKRLIWV
SSMGIYGETGEDHGAILDPYRRSAQIIEQSGLDYTILRPGWFTNGQEIDY
QLTHKGENFKGHSVSRKSIADFVLKLVQHPELEIKQSVGIAKE
>MS1243 unknown
MSFANQKLSDIALTVPGAIQLFREYDLDYCCGGAVELAVAVQEKNLDINE
INARLTELQNNPVNAEERDWTSASFDELIDYIVPRFHDGHRSQLPELITL
AEKVEQVHGDRPDCPTGVAAELRNMLTDLTQHMMKEEQILFPMIKAGNYM
MARMPIQVMEMEHAEMGDQLEVLKSLTDNLTPPADACTSWLALYSGIEHF
IDELMLHTHTENNILFPRVRNAA
>MS1051 unknown
MWKSVSQVLADQFGAYYNIKDKEKIHSGEVHEAWLINDGIQPVFVKLDEK
SYRSMFRAEADQLQLLARTNTIPVPQVYGVGCSQNHSFLLLEALPLEPIT
AETMGEFGVKLAQLHAQHGSEKFGLDFDTWLGPVYQPNEWKLNWATFFAE
QRIGWQLQICKEKGIELGDIQSIIDMAAGKLVKHKPKPSLLHGNLWIENC
GLVKGKVTTYDPACYWGDRECDLAFTELFEPFPQEFYDNYNRTYPLDKGY
QQRKPLYQLYYLINFSHRFKGHYITLTQKLLNYLMSEEE
>MS0051 unknown
MTVVIFLAVLLGAIILGIPVAFSLLVCGVALMVHLDLFDSQILAQQIVSG
ADSFSLMAIPFFILAGELMNEGGLSKRIIDLPMKLVGHKRGGLGFVAILA
AMIMASLSGSAVADTAAVAAMLLPMMKTTGYPEARSAGLIGTAGIIAPII
PPSIPFIVFGVASGVSITKLFMAGIFPGIMMGICLGMLWWWQAKRLNLMT
FSKATRQDLCISFKNSIWALLLPVIIIGGFRTGIFTPTEAGAVATFYALV
VSMFIYRELKFKDLYKVLLAAAKTTAVVMFLVAAANVTGWLITVAELPAM
LTELLEPLLGNPTVLLLVVMLAVFVIGMVMDLTPTVLILTPVLMPLIEEA
GIDPVYFGVLFILNTSIGLITPPVGNVLNVITGVSKLPFDQAAKGIFPYL
IMMILLLLLFVFVPSLILVPLSWML
>MS2309 unknown
MPIDRTDLLELVTQQDKLANYAKDIAGRMIGRQFKIPTEIQADFMAFVCR
SLDATVQAHRVIEEMDELLETGFKGRELNLVNTMITELDKIEDDTDQMQI
KLRLMLRQIEDRFNPIDVMSLYKTFEWIGVLADQAQRVGSRIELMLARS
>MS1859 unknown
MDLLITLIDDMFFASIPAVGFALIFNVPPKALGYCAILGAIGHASRTLLM
HFGVSIVFGTFFAAGLIGFIGVRIAQHYLAHPKVFTVAAIIPMIPGVYAY
KAMIAIVRINSLGSSPELFNQMVDNFVKGCFILGALVFGLALPRLLFYRG
TPVV
>MS2384 unknown
MKTNNNPVRRSFLLFNNLSDNLCGHLKDSFKQKI
>MS1161 unknown
MRFVWCSIHLRDTISLLLLSVLLINRNLFIMKKTTLAVAIGILAISSTAS
ANWYVQGDVGYSKIKASGMDDLDFKDNVFDQRISAGYDFGDIRLAVDYSH
IGKAKDHYTLFRGEQWETSGSTSVETNSFGISAIYDFNLNTSLMPYVGVR
LSENSLKFEDHWRDNSASESYSETKTKFGYGALAGVQYHLTDNLLLNVGV
EYNRLGKVEEVKIHQYSAKAGLRYNF
>MS2207 unknown
MLNQQISQVIAAELSVQPKQILAAVQLLDEGNTIPFIARYRKEVTGGLDD
TQLRHFETRLIYLRELEDRRQTILKSIDEQGKLSDELRAKINATLSKNEL
EDLYLPYKPKRRTKGQIAIEAGLEPLADLLWSEPEHEPESAALAYVDANK
GVPDTKAALDGARYILMERFAEDAQLLAKVRQYLQHNAVLVSKVIEGKET
DGAKFQDYFDHQELLKNVPSHRALAMFRGRNEGFLQLSLNADPDAEEGSR
SSYCEEIIREHLAVRLTGLPADKWRAQVIAWTWKIKVSLHLETDLMGSLR
EKAEDEAIDVFARNLTALLMAAPAGAKTTIGLDPGLRTGVKVAVVDSTGK
LLATDTIYPHTGRMNEAMVSLYQLGKKYHAELIAIGNGTASRETERFAKE
VIKQSTDWSAQTVVVSEAGASVYSASEFAAAEFPELDVSLRGAVSIARRL
QDPLAELVKIEPKAIGVGQYQHDVNQSQLARKLDAVVEDCVNAVGVDLNT
ASAPLLTRVAGMTKVLAQNIVAYRDENGCFESRQQLLSVPRLGPKAFEQC
AGFMRILNGKNPLDASGVHPETYAVVENILQVTEQSIRDLMGNSNALRRL
DATQFTNEKFGLPTVQDIFKELEKPGRDPRGEFKTATFMEGVEEITDLKA
GMILEGTITNVTNFGAFVDIGVHQDGLVHISSLSDKFVEDPHQVVKTGDV
VKVKVLEVDVARRRIALTMRLDESAVKNSEKSDRTLSTKSGQDRNRRDNR
QPQRNQFANNVFADALKGWKK
>MS1547 unknown
MAKSVDAADSKSAALKSVSVRVRPLAPNSRDRFLAVFFIARISYFS
>MS1767 unknown
MLINIHYSQVLRKQTALLTNAKVRSFFRKF
>MS1485 unknown
MIKMPFEKIQTAFKIKRIRSGNNRPCLRLEV
>MS0034 unknown
MLNYKPLTEKSGFFCILRTLMTQYIIAQTNKGVQLGITAKMANRHGLIAG
ATGTGKTVTLRKLAEAFSDDGVPVFLVDVKGDLSGLTVKGTLQGKIAERV
EQFNLGGENYLSGYPVSFWDVFGETGIPLRTTISEMGPMLLSRLLNLNAT
QEGLLNLVFRVADDKGLLLIDLKDLRAMLKFVAENAKEFQVEYGNVSAAS
VGAVQRALLTLENEGATNLFGEPALNLEDWLQTRDGRGVINILNSEKLIN
SPRMYSAFLLWLMAELFERLPEVGDPEKPKFVMFFDEAHLLFDGVPSALV
DKVEQVVRLIRSKGVGIYFVTQNPLDLPDTVLGQLGNRVQHALRAFTPRD
QKAVKSAAETFRANPQVDVVETISTLGVGQALVSFLDEKGMPTPVEIVGI
FPPKSQLTPLTNEQRTDWVKDDELYPHYRDLVDNESAYEILNDQSVQAQV
QQQVQDEENSDFFSGMISSIFGTKKKSRQTVAEQMVSSVAHQVGRNLRNQ
VTKQILRGILGAITKK
>MS1393 unknown
MGVYNKVPFAKFLPKPTACKNTSFFDRTFIRKENKAYFNKRIAPKEFGKE
ANF
>MS0023 unknown
MIVLDFLLCRFLKDKEQNMSKVSEITRESWILSTFPEWGTWLNEEIELEQ
VPANNFAMWWLGCVGLWVKTPQSANICIDLWCGRGKATKQVKDMVRGHQM
ANMAGVRKLQPNLRNSVGVLDPFAINEVDAIVATHYHNDHIDVNVAAAVV
NNPKLDHVKFIGPQYCVDMWTKWGVPAERCVVVKPGDTVKIKDLELVALD
SFDRTCLVTLPARGAEDNGGELNGICPSDEEMGLKAVNYLIKTPGGNIYH
SGDSHYSIYYAKHGKDYDIDVALGSYGENPLGIQDKMTSIDILRMAECLR
AKVVIPVHHDIWTNFMASTNEILELYRMRKDRLQYQFHPFIWEVGGKYVY
PRDKDLIEYHHPRGFDDCFEQEPNVPFKSIL
>MS0736 unknown
MTDKTNIIREPEGSLILRTLAMPSDTNANGDIFGGWIMSQMDMGGAILAK
EIGKGRVVTVCVDKMTFLRPISVGDVVCCYGKLVKIGRSSMQVKVEVWIK
KVYDGVRDRHCVTEALFTYVAIDKEGKPRAVPREDNPELEQALALLNNHT
TEEN
>MS2154 unknown
MQKLKIETQSGTLLDGVLFSQTPSKTVIIAITGIHGNFYSNPFYYNIGHT
LSQSGIDFIYAQTRNAFGKTDFVNPKTGQPESIGSWNEDFAKTIEDLTAY
VDFAEQKGYQHIVLAGHSLGANKVIHYLAETQDKRVAKFILLSPANVTHL
TNAISEQQRAYIRHQVEKGNSQRLLPFELFGWLPCIADTAFQWLYSPLLN
NVHVEPNSDFSQVAKIQHTGALLIGTLDRFTYGDPPGFLRNINNHFQSAD
KNTLIFIENTGHTYQQKEQEVADKLLDLVKDWGY
>MS0288 unknown
MSVKKSNLSRPNWLAISRSERVVGTNEKVLGKRIRTLGIHQRYGIMISRL
NRAGVELVPTADSILQFGDVLHMVGNVETMDAAISIIGNAKQKLQQVQML
PVFIGICLGVLLGSLPIHIPGFPVALKLGLAGGPLVVALILARIGSIGKL
YWFMPPSANLALREIGIVLFLTVVGLKSGGNFVNTLTQGDGVTWMGYGVL
ITFVPLMAVGIIARIYAKMNYLSICGLLAGSMTDPPALAFANAIKEENGA
AALSYATVYPLVMFLRIISPQLLAILLWVA
>MS1482 unknown
MENVVSQIIAQYANVCILIFVRIFSVNSQQIKVRSNFVYFYRVA
>MS0086 unknown
MYFMLGDIALEAIDLTEFSETFAAEFAEHAVLKGKPRLQAMGEKLNELSF
AIRLHHKIGGVESRYQALLTAKAEQNALALIWGRGKYKGNYVITQLSSTT
LFTDKYGNALCREMTISLKEFVGDSEDSLFGDALNFGSNSLLGSILPSGV
VSTLSTVKNAVSRGVELYNQGKRLVDEVQNTVAVIRKFADDPATALGYLP
LALRNLDGALGSFGEITGLSDTLSGLSDLLPSAVKFSHKIDDIYTDLQIL
KDSFTNASGNDWSNWFTPADNALSSVNESFDYLAKPVAEMTAWIVLRADD
EPDTEKDNDDTDLA
>MS0082 unknown
MGAMNTQIQSTHWQLAPETDGVSVVSGVDDIHLCIANILSTQKGTDILRP
EFGSDHFKFIDYPEDVAVPNFVREITQALQKWENRIVIDEVLVDGEAPHF
TFTVSWSLTDDVYREIYRTQVQQ
>MS0718 unknown
MIMINSVYLLVNKGSWRALSFILAILLTACFFFNIHQFTSELRTANPIWV
VLILWSTVILWIHGMGFDIRSDLGKYLFFPIFGYLISFVALFQHFLY
>MS0234 unknown
MCHNLSRIIDLKFLLMINYAIILHNYLKINNYLDYRHETTA
>MS1075 unknown
MRKNRPHFCFYRLERRLIFYFIRLNRQQITF
>MS1925 unknown
MSFLWSLLSFIIAISVLVSVHEYGHFWAARKCGIKVHRFSIGFGKVLWRK
VDKHGTEFVVSMLPLGGYVKMLDERNEEVPEALKSQAFNNKSVLQRAFVV
MAGPLANFLFAIIAYWAIYTIGIPSVKPVISAVQPQSIAAQAQLPVDSQI
VAVDGTATPDWETVNMVLASKLGNRQVQLTLTPFGENMEFRKTLDLSRWK
YDPEKESAFGSLGIEPVSGKVEMKISKIMEHSPAQKAGLQIGDMIRQSDG
EEINWQAFVKLVQQGKSIPLQIEREGVLFDVILTPEFTDKRWLVGISPTF
EPLNDKYRSELKYDMLEALQKGVEKTAQLSWLTIKVIGKLFSGDLSLNNL
SGPISIAKGAGMSSSIGLVYYLSFMALISVNLGIMNLFPLPVLDGGHLIF
LAAEGIMRKPVSERIQNIGYRIGAILLLMLTAFALFNDFLRL
>MS2107 unknown
MGFNEKINMQKMQQFKKKVNARLGVKLNITT
>MS1679 unknown
MLAERRLELYFAENPPHFFDEMAKSAVILSPENFHNAKNLLGREFDQILF
DGRTSLNLDALAIAAGTLRAGGRLLLWLDKNPHVDPDSLRWSGAEQAVET
PNFYAHFNRLLQVYGCDNGIQAQNNQSVSTQKTNIASTATAEQQQIIRQI
LQADSDIFILTAKRGRGKSALAGLLAKELRNSAQYHKKPFNVYLTAPNKS
AVETLQLFAGEKITFIAPDELCRRIGQNARQFSQDWLLIDEAAMIPLELL
FQLTSTFKHILCCTTIHSYEGTGRGFLLKFLPNLHRSFQQFELIRPLRWA
ENDKLEKFIEELLMLEAEDRLIQPPYSIKSAVKIRQISQNELVEHITDFY
GLLTLAHYRTSPLDLRRLFDAVKQHFLIAEWECYLLAGVWALEEGGFSDK
ALIRAICRGERRPKGNLVAQSLAFNCNLPEACALKSLRISRIAVQPDWQG
RGLGLQLVEKLAQTAQADFLSVSFGYNEELAHFWQKCGFILVNIGEYKEA
TSGCYSAIALRPLTAAGEDLVKRAQQYFRRNLAFTFHPLHDKLSVEKSSA
EKITQLNGQDFGILENFADYHRTFYSSQGAIYRLFIRLGADTSPHVG
>MS0705 unknown
MKVIIFSLGFLKIMKKMTALLSVGENYLMLCSRYRHEVKNNLQFI
>MS0238 unknown
MRIPRIYHPDSLTNIKTCRLTDEAANHVGRVLRMQAGERLELFDGSNHVY
TAVILQADKKSVTAEIRDCQLDDRESHLKIHLGQVISRGERMEFTVQKSV
ELGVNVITPLWSERCGVKLDAERMDKKIQQWQKIAVAACEQCGRNIVPEI
RPMMKLQDWCAEQDGMLKLNLHPRAKYSIQTLPDIPAEGVRLLIGSEGGL
SPQEIAQTERQGFTEVLLGKRVLRTETASLVAITALQLCFGDL
>MS0088 unknown
MLFTRLSPAIALPTLAPIIKAIDKLDDMLYSSFRFLRRQSDMLKFIDTLF
LVFAWLLMTVAGLAMLSVGLYYYPLMTAVLFGLLLLAPVLIAADKYLVNI
NMPASAPKWLQARHGALTNIQQTLWRTAETELKKAHH
>MS1647 unknown
MKFMQTHKIYLTPISPIHIGCGEDFEPTNYVIDNEVLFNFDPANLALNNR
QKTELLNRVNRLDLLSIQRFFLENKEKVLSSTYYFADVAEGLANDYKNKV
GKVAQRESDGNKVINNLSIERTAFLPVKHLPYIPASGFKGALATALLDQA
HQAKNNPRVNKNDHGKLFKEYIGEFAESKLRFVKFADFSPLVQAESKIYY
ALNFKKKVGKIGGEGRAMALRRECIKSGQYRAFLSELALMQGDANKMQIA
DYFTLLKNFYLPIFKQEAELLAERNLVNRHYLKQLEQLFNLPNVALIRLG
KNGADSKTYQADGIAQIKIMGAKGTPLNFKDSSTTVWLAGTNQQQQNDLL
PFGWAIIEADPTAENEPLKQWCDAQPKSKFNRSVILAKREEQKAKQAQLK
AEEEAKQQAKLAEEKAKAEMLNSLSDNQRLIMDFVEKLKNTSERQADNTG
SPLLKEAEALINQAIEWENAERQFACEQITVELLKSGIRITGLKQGYKRP
ASISRTSVDMSAP
>MS1727 unknown
MKLTSKGRYAVTAILDIAINAEDGPVTLSDISERQNISLSYLEQLFAKLR
RHGLVKSVRGPGGGYQLGQPSGQISIGMIIAAVNENISVTKCLGQGNCQG
GKVCLTHHLWAELSDRIENFLNEITLEELVSKQHSQKTHTDFDNLLVVDN
>MS2271 unknown
MKIFGAMYDKTMQWSKHRFAAFWLCLVSFIEAIFFPIPPDVMLIPMSMSK
PKSAVRLALYTAVSSVVGGMIGYAVGYYAFDFVQGYITQWGYQQHWDTAI
SWFQQWGILVVFVAGFSPIPYKVFTIAAGVMQMAFIPFVITAFVSRAARF
LLVAKLAAWGGEKFAAKLRKSIEVIGWAVVVLAVIAYLILK
>MS1514 unknown
MTELTHYNQYIADENAMIAFGQQLIQAINKLDNNKPVVIYLNGDLGAGKT
TLSRGMIQGLGHQGNVKSPTYTLVEEYHLQNKHIYHFDLYRLSDPEELEF
MGIRDYFGTDTICLIEWAEKGIGLLAEPDLIVNIRYADNARDIDLIAQNA
QGEQIITLLAAK
>MS2303 unknown
MMALSQEKRLIEAPVNGGRNYNGPKVAKFLVG
>MS0124 unknown
MFDVLEQLKLQIHQAIVQLEQAEKALHKQKMTHASIYVENAKGILMKLGG
RIK
>MS2276 unknown
MRLFILILSAILLLFQYDLWFGKNGYLDYKETAEEIAMHKAENTKLSQRN
QVVAAEIRDLKDGVEAIQERARLQYELVKPNETFYRIAKENKDNR
>MS0655 unknown
MFAQSYNRRVYIISFFILRLSLMNNTASMSPQNNNDEIDLIDLIKVLWQK
KLVVILTSFFFALIAAIYAFTAKEQWTSKATTIAPKVADMGYYLSLRSEY
ANILNIKEFTSKDVVDNLFNNFRVALFSNNMKREFFAQSKWFQNYANENA
KDEDAKQKLLSDILDKSLIVTIPDIKKNPNALGINISFAAETPKEAQEVL
TEYINFINATVLTEDKIDFLADIKIAIDNLELQKDKIQRDTESVRQVQLE
NLTTALDIAKSAGIKEYSKTSGNVSIPQFALGDAQIPFTDSKLSDGSYLF
MLGEKYLQAQVDTLTNNKVVYPVSFYTIEKQVSLLNSLEQKANTDSKVTS
YYYLTSPDYPTTRDWPKRVLLLLIGAVLGGVLGCLWVLGKQIFSQK
>MS0879 unknown
MCYFKCLFKIKELFNMQTFLKFTNFMSKTFALWVLVFAFLAFQFPAQFAI
FAPYIPYLLGLVMFGMGITLTFNDFGEVFKHPKSVFIGVAGQFVIMPAIA
FCLAKIFNLPADLAVGVILVGSCPGGTSSNVMTYLSRGNTALSVACTTIS
TLLAPFLTPAIFYILASQWLDINAGAMFMSVLKMVLFPIFLGLIVRAIFK
KSISEISRTMPLVSVISIVLILSAVVAVSKDKIVESGLLIFGVVVLHNCL
GYLVGFFGARLFKLNIADSKAVSIEVGMQNSGLGAALAAAHFNPIAAVPS
AVFSFWHNVSGPILANIFANIKNDDKK
>MS0758 unknown
MIIYLHGFSSSRPDDYENVMQLKMIDPDVRVISYSTVHPRHDMTYILNET
HKLVSETQDDKPMICGVGLGGYWAERVGFLCGVKQIILNPNLFPEENMEG
KIDRPEEYLDIKTKCIEDFREKNQSRCLVFLSKNDKVVDPKRSEALLSHY
YEVIWDDTDAHQFKHIAPYIQRLKEFKAA
>MS0318 unknown
MRNNMSEQKITFADQKRKTVETAEFTEDGRYKRKVRSFVLRTGRLSEFQR
NMMNDNWADFGLEHQNNYFDFAEIYGNTNPVILEIGFGMGKSLVEMAEQN
PERNYLGIEVHTPGVGACIAYAVEKQVKNLRVICHDATEILQDCIADDSL
GGLQLFFPDPWHKSKHHKRRIVQPNFVDNVMQKLQQSGFIHMATDWENYA
EQMLDVLSQSKALTNTSKTNDFIPRPDFRPLTKFEQRGHRLGHGVWDLYF
VKN
>MS0387 unknown
MIRFPRFNLRSSTLIAIVALYFTLVLNFAFYGKVLTQHPFTGKPEDYFLL
TVPFFVFFTLNAVFQILAVPLLHKIIMPLLLIISAAIAYSQVFLDVYFTT
DMLENVLQTTSAESTRMITWQYVLWIIGFGIIPAFLYLSVKINYHTWFKE
LGIRLGAILVSAVVIFSISKFFYQDYAAFVRNNKPTVNLILPSNFITAGV
NEIKRIHDANRPYEKIGLDAQQEKPDPYRHFTVIVVGETTRAQNWGLNGY
QRQTTPKLAARGDDVINFNHVTSCGTATAVSVPCMFSYLTKDQYNGSKAE
KMDNLLDVLQRAGVNIFWLDNNSDCKGVCLRVPNETVNMTLKDYCTEGEC
LDEVLLRDFDKILNETTKDTVLILHTIGNHGPTYYERYTPEYKKFVPTCD
TNQIQTCSNEQLVNTYDNSILYIDNFIDSVISKLENRDDLESAVYYVSDH
GESLGENGMYLHGAPYAIAPEQQTRVPMVFWFSKTWKKNEGVDLNCVREK
AKTREFSHDNLFSTVIGMMDMNLKTSVYQPEFDILASCKRH
>MS1692 unknown
MSDFQQYLNFNEEENKRRQLEVYADTHISALTERRIAAISAPQQWLVLGE
PFCPDCRVFVPFVQKFAELNPNIKIKYVARKNYHERSRFDSDEQQKLVVE
THNIPSLFRIENDTTRLVLKEFPEFFKRRAEQAPDQKDQLKADYRAGKFN
EELERELVKLFTV
>MS0107 unknown
MLKELLSTDGKLSTTSTVQLIGALCVFGLTVYAVVTGQPYAESLLNNVLI
YLFGATTAKGVVTSYQAKIKGAINGQITGVSESRKTA
>MS0354 unknown
MSDQIQPNYLLPLSFCLSVSRKSVKIDRTFAPSASILC
>MS1139 unknown
MRSFFFNFYKILVKLTALFGADSGFDGIGEAQGARRGAVGLVNKPQKNSR
KRRTIRFSSLITCI
>MS1108 unknown
MKGKLITFNVFYESKFLTEEGGYTNDNEIFKVLLERA
>MS1456 unknown
MENPFTAKWSTQGNTLCLGHWDINYQGLPLVLPEERRDQDMGTKGIYNFI
DPDDELYLEGLDEDDWLLENIDWLSDVFIEHNIPLEEENMRLFYKAVNKA
DWRCGSCGGCI
>MS0075 unknown
MFKQAPLPFIGQKRMFLKHFERLLEDIPNDGEGWTIIDAFGGSGLLSHVA
KHLKPEATVIYNDFDGYAERLAHIDDINRLRQAIYPLLANCAKSKKVPND
IKTQIIDVIKGFDGYINEHILCSWLCFSGQQVKTLDELFKEDFWNCIRKS
DYPSADGYLDGIEVVSESFHTLLPKYQTDPKALFVLDPPYLCTQQASYKQ
ENYFDLIDFLRLVHLTRPPYVFFSSSKSEFVRFIEAMIEDKWDNWQAFEN
YERVIVKTSSSYSGKYEDNMVFKF
>MS2200 unknown
MQSLKGLFRFGLATPTLKKCCFMTALCIANQSAVSFCKFFLLMQFCKTI
>MS2088 unknown
MSGLQVGEQVVQKAGALINEGDQVEVVLSKGAE
>MS0078 unknown
MTVQFDNQGFAIESGFMTVHVIDAQGVYVHSEEQYISEGGSLSANAVLSE
PKAARQGFAVQWTGKVWQYVEDHRGEVYYNTQTKAEVTISELGKIPENLT
ALQPSDPNCEWNGEVWVLRAEKQAELKAQKLQQFIDGVDNKASRIYSIWT
RFEIEYAQREAAAVAFKAANYQGEVSRFIADFATKAGIDNVTATNLILVQ
AEGLRKLLVELANQRMRKYELKKPNLTEDEMQTIYDDIIQQMDNLAEAYN
NG
>MS1431 unknown
MSEPIVATPALIKRMGKPKDVFDLAANFPVGYILNPQTGKVWDWMLGR
>MS1384 unknown
MNKVTEMFKSNPYFVQIKVFYDYQHRRAHRYRPLLQ
>MS0987 unknown
MILSSLARYYQRLAKETDSVGNPKVPSYGFSEEKIGWVLVINQDGQLVDV
VPNLSDGKKPQPKLLNVPRPEKRTSGVKANFLWDKTAYVLGVESNKDKAT
AKEQPFVISQKTFEAFKQSHLELFKDSQDLGLQAVCHFLEKWQPEHFSQP
PCLTEMLDANLVFKLDGCSGYIHQREAAQTLWADLLKDDNAEQGICLISG
ANAPIARLHPAIKGVFGGQSSGGSIISFNKESFASFGKEQGSNAPVSEVS
AFAYTTTLNYLLRRENNHCLSIGDTSTVFWAEADNSANAEAAEGFFASVF
SPPDDEQESQKVFNILEQIAKGRGIKEVSPDLAPDTRFYILGLAPNAARI
SIRFWLDTTFGQLAEHLAQHWQDLAIEPCPWKTFPSIWRLLLQTAVLSKT
ENISPVLAGEMTRAVITGSLYPMSLLSQLIARIRADGDINGLRIAMIKAV
LQRRFRKGFIQEEIPMSLNTESTNPAYLLGRLFAVLERIQTQALGDLNAG
IADRYYGSASSVPYSVFPRLLSGAKHHLSRLRKDKAGMAVNLDKDLAEVI
GALPDVFPRHLSIDEQGCFAIGYYQQKQRYFTKKETTESTEN
>MS1845 unknown
MKFQTGSFYMGIEGGTSSSNNKSAVKNTALFIK
>MS1823 unknown
MLSILGRINELFIAFHSCYERKVIFKVSPMSFFAILYMLATYFLGSISSA
ILICRLVGLPDPRQSGSGNPGATNVLRIGGRWAALAVLIFDILKGMIPVW
CGYYLGLTPFELGMVALSACLGHIFPIYFKFRGGKGVATAFGAIAPISWG
IAGAMLGTWGIIFLLSGYSSLSAVIAALVTPFYVWWIRPEFTFPVALVCC
LLVYRHHENIQRLWRGQEDKVWAKFKKKEDSQGE
>MS0912 unknown
MSERIAKKQGSVLKTLSCILLSACIGGAAGYGSSYLAKNLWTVQSSLEKP
ALTELGNYYSLYSTYQLLNNEKANQDPTGDIFNRFKQLAGSYEHAKVFWE
NTDYYKQKLTDDSQHDSQLLDQLSREIKLLDTNAATTQLSLELDNPKRAR
ELLTEYIDYTGLANRKNIYGELIVKWKTLFDQVNSAANLNVADTERQKWK
SMLSMMQSVKPLDDQLVSYHFIQKPGQAEISSPNRICWAGIGSTIGAFFG
LFIGLFIRRK
>MS2264 unknown
MAKKPGKADEKDEDVTRLDKWLWAARFYKTRTIAKEMIDGGKVHYNGQRT
KPNKTVEIGATIKLRQGNDEKEIKVTALSTQRRGAPQAQLLYAETEQSIE
NREKNALARKMNAMPHPDHRPNKKERRDLIKFKNQG
>MS1298 unknown
MDVNMSEKTFEKLTALLTENRASFRVIEHPRAGKSEEVAKMRGTELGQGA
KALLCVVKGNGIKQHVLAILPANKKADLQKIALALGGTRASLASPAEVHE
LTDCVFGAIPPFSFHEKLKLVADPGLFGIYEELAFNAGTLERSLLLNTQD
YQRIANPQLIEFATEN
>MS1231 unknown
MKSPEPCFLLDYLRLRKKLAVNDGEKKSVSFVGKETPEMLE
>MS1484 unknown
MQKLVKWVVALSIMSATFTLSAKDFAIYDMMDYVGKPQDLTADKISRAML
IYESELVKPDPTGKRKHGVLNLEKVIELARRSHREGYTVISTDIESWFGN
KGGQLLSPEELKRDFELMFNIFKNENPNAIISNYGLPTETLSVIRFYRGD
VPYQVSLDKWKEFNKRRNKSGVIADYANPVLYIVNPDIATWEKDVIHTVQ
EIKKRYPNKKIIGYIWPQYYSAKKSGYFKQFIDPKTWREMLEITYKYTDG
VMIWSDKRDENDKIVRWEDPRVQAIMAETKAFIRAHDKDIKVEGKKKK
>MS2004 unknown
MKAVKVEYQGKLRQQITHLSNGQTVITDAGKSVGRHGENISPADLLAASL
AGCAMTIMALRAEQLGADFSGCYAEVEKEADMQQFQVTKIVIHFYLKAGF
SDEVRQAVENATRDLCIVGRSLRADLVQEFHFVYQ
>MS0226 unknown
MIYENMLQNIKIYFSMTDMQIRHRHIRRNQR
>MS2300 unknown
MVLMDSLSKKVVYHKIVNAERVIYYRKAINELREKDYKIQSITCDGRRGL
LKDILNTPIQMCQFHQVAIVIRRITRKPKSEAGKELKILIKTLKTSSKNK
FYINLHHWYLKHKNFLNERSSIPDKAGKYPFKHRNLRSAYSSLKRHEEFL
FTFEKYPELKIEKTTNRLEGLFSELKRKLALHNGLSKKNKIMFIKDFLNE
KS
>MS1148 unknown
MNISVWHSALPFALKYLCTIKLVQEFFVNKKFNFI
>MS0388 unknown
MQSFYTEIFSGNKKARNFMNRAFLANRIKLFSATAFR
>MS0905 unknown
MRYKMKKFLVSLEKDIQRRELFFSQRNTQDFEVFNAINTMTQDLTSLGNL
FDIIKFAQYYGRNVTKGEIGCTLSHLAIYQKIADDETINERDYALVCEDD
ALFAENFQQVIQEIVKQPMGADIILTGQSKILEFNHIELEINYPSTFKFL
QKKIANSGYRYSYPYRNYFAGTVCYLITKAAAKRFLAELTNGRLPFWLAD
DFILFNEKFKLNTAIIRPLLAIENPVLTSNLENSRGSLNNNLFKKLLKYP
LKKLLAFKRNL
>MS2086 unknown
MCSHLYTFILCLNKENHMTAYVVFIRDEMKDQAAYDRYLQLGVPTLAPFG
GEILVANGAHEAFEGADFDGSVVLRFPDMASARAWYTSPEYEAVKSMRYC
NADWQPLFAFHTIDSNAGYSGAAIQR
>MS1688 unknown
MKHSMIKTVAFAFITAAFSMQAMAASPMEREIVRNISNQTHQPVKAETNR
YEALKTELDAKLAQLSQASDMTVFNALATDAKELAQQTKAAVLAQTFDFG
EASDNRRAELDADYSGWKLNKLIDSLDQAATKADLAAAKTEIRSHI
>MS2381 unknown
MRKQLYITHGYTANSQSHWFQWLKNQLIPHQIHTNIFDMPDSSKPNPQIW
LAHHQTYINQCDENTVFIGHSLGCIATLRYLQRQKKKIKGLILVAGFDEP
LDNLPELTSFTLQRIYYPELIANIPQRIVIGSSNDEVVAPKYTQKLAANL
QASYLTVENAGHFLARQGFTEFPLLLKECLNIFNG
>MS1410 unknown
MKSAVKIFEFFAYFIRIVYFSSLPHSDGKYMQ
>MS1432 unknown
MLICNLAVTENLEELQQILTEDCYCPNLANYEIREFEPNMIATDMAEFEG
K
>MS1138 unknown
MEGLLIVLVPMLLGYLIKTKNTGLLQSINKTVMVLLFIILFVMGVSLGQL
DDLATKLPVIGISALVFIVCILTCNIIGLLAYDKLSPRPLKHLGTDIPPR
GKLLLDSLKLCSMVIFGFLFGLCTKGGFDLPLHASTYVLVALIFFVGIQL
RNNGISLREVLFNKRGIYTAIIMIITALIGGIVASLWLGLPVTQGLAIAS
GLGWYSLSSVVINDAWGPVFGSIAFFNDLSREVFSLFIIPFFMFNYRSTA
VGLAGATALDCTLPVIQRSGGMEVVPLAISFGFVTNIVPPVLLVFFSSIP
L
>MS1308 unknown
MRLRLLLQHLVMDFQFTRYLGSVEAKCSMGHEAVANWFNSEVRSDSQKIY
TALSVLAQAKKQSYEQEIRLIGAEYSLFINADEVMVKANNLDMTDGSEQD
LEEDFHYYDEESIAFCGLEDFENFLTSYLNFIA
>MS2077 unknown
MRKIHIFLPHFFFKDLYLSLFDSIYLEVLCNTFNWLPI
>MS0914 unknown
MNNNGNNMTTKTERQTWSSKITYIMTVAGATVGFGATWRFPYLVGENGGG
AYVLLFCLAMILIGIPMILVENVIGRRLRVNSIDAFGDKLQDENISGGWK
IIGYMGLLGAFGIMAYYMVLGGWVMNYIISLISGILDISTPITKETAKEF
YDFSIGNSPLHIALYTFIFVIINYIILAKGIIGGIERAVKFLMPLLFVFL
IGMVIRNVTLPGAMDGIIYYLKPDFSKITPKLFIMVLGQVFFALSLGFGV
LITLSSYLSKEENLIQTAVITGFTNTIIAILAGFMIFPSLFSFGIEPNAG
PTLVFQSLPIVFSHLWSGTFFAIVFFSLLLIAALTTSITIYEVIITALQE
KLKMRRSKAILLTLGGIFLLGNIPSILGDNLWKDFRPFDKSIFDAFDFIS
GNILFLLTALGCAVFVGFVLKDKAKAELSPTPDSLFTTVWFNYVKFVVPL
IIIVIFVSNII
>MS1126 unknown
MKKLFLVMLAATFVTACANKDIYFNGSEGSNSGLKYDHNTDSLSINK
>MS1625 unknown
MRNSEPNKNSRFFAALWIGIVLALGIFAYGIYSYFDILAWEQSGQMPHIG
AFSALIYNLFGAVGILLGYGLLALIVFVQGWRAYKRNR
>MS2147 unknown
MQTSNALDNLKSIAKNNKKRLAGTFGLVAAENVLFLTYPVFGSFAVNAMM
SGDVWASLSYSLLVLIIWSIGAMRRAVDTRAFARIYAELAVPVVASQRAK
GLDTSSVTARVALSRQFVDFFEQHLPILIMSAFQIIGSALMLLILEFWAG
VTACAILAFFAFLMPKYAKTNDLLYLKLNNRLEKEVDVIERNNGYQLNKH
YGWLAKLRIRISNREAAGYLWIGVAMALLFGVTVVQIATTQGVKAGHIYA
VITYLWQFAMSLDDMPRLLEQFSNLKDIGKRVEV
>MS0816 unknown
MTALLIGRINQKSAVIFFQIFSRIISPLIEQFS
>MS0010 unknown
MRNLLQNSNLNEIIKRPDLSPVVFLWLILQLTLPSFKLSNKDNSRL
>MS0958 unknown
MGLKDIFNGFLSQKSGYFLLNNRLNISCKNSEKLTALLIIKLTG
>MS1390 unknown
MEFPLSITANINSHEGNFSRELELRSALTFIVGPNGSGKTHLLKGLKESF
SGFTEKKVRFLSAGRLGPLEQYRSNYDQFDRSNESDNARHGNKNEREYRH
KIENINGDLHTLSARPDILIKVRERLQKLFKRNIDVDWDAGSLKISFSRL
GATNTYYSSGREASGLLHLVGILSALYDDEVGVLLIDEPEVSLHPQLQAF
LLKEIQRAAGIPNDDDYKKLIIMATHSTEMLKISNSNSLLNFIFCNDLKE
NPIQIAQNAGELNNKKVKGLIARLGQEHKLALFSKTPLLVEGPSDVIICN
ALSDKLYLNLEAAGSQILPINGKEAMPETVKLLRLMGKNPTVLVDADAFA
DGLNLVNAYFNNTEIKEKANELASKQGNADILSWAKQVYDDFCNAVTNNW
NEISEQAQSHPYFSLSDDVDKKDDIDKKNKRSALCTLFVSENLAKEWTNI
KNRLDVLFSIFQECGLFILKKGAIESYYSTAQFESDDKVDKSVAESENID
SLPSDKIDSLREEYKDVIDCLMYASNSEKIDESRAIRDELLSFITPIHAR
YSEGETSFNKPSTIFSYGINNRDELEISMSSKVLDVKGFPIILRKNDNVT
TVVNSALGLK
>MS2324 unknown
MRLNGYIGLFIKESTMIRKGFVMQVNPDCHAEYKKRHDEIFPELVEELKS
HGAHHYSIFLDKQRNLLFGYVEIENEQRWNDVAKTAACRKWWAFMRDVMP
SNPDNSPVSQELEQVFYLD
>MS0802 unknown
MSLEKRFELIERGSTVRQEIIAGLTTFLAMVYSIIVVPGMLSKAGFPAES
VFIATCLVSGLGSILIGFWANAPMAIGCAISLTAFTAFSLVLGQQVSIPV
ALGAVFLMGAVFTLISATGIRAWILRNLPASIAQGAGIGIGLFLLLIAAN
GVGAVVSNQAGLPVKFGEFTSFPVMMSLIGLAFIIGLEKLQIKGAILWVI
IAITIVGLIFDPNVTFGGEVFKMPSFGEQSLFAALDIQGALQPAILPVVF
ALVMTAVFDATGTIRAVAGQANLLDKDGQIINGGKALTADSVSSLFSGLF
GTAPAAVYIESAAGTAAGGKTGITAIVVGVLFLLMLFFQPLATLVPGYAT
APALMYVGLLMLSNVSKLDFDDFVGAMSGLICAVFIVLTANIVTGIMLGF
AALVIGRIVSGEMKKLNVGTVLIALALVAFYAFGWAI
>MS2266 unknown
MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA
ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI
VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER
SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE
GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS
>MS1910 unknown
MNMAINSKYQDKQVDEILKDIIEVLEKHKAPVDLSLVVLGNMVTNLLTSS
VGANQRTVLAQAFSDALLNSVKTKHH
>MS0904 unknown
MSALINLFYIYDPWLFHIVRMSLVSGLFALLWFGYKWYKKELKRFVLPLD
SLAVCIALILLSVLPVLINGTTEFGVIGMYVKLLVLFSLGIVIYNLFYTS
SNGKDQLIRDLKLGIGAQSLLGFLALAGIPLFITISLATNSDMGGELSRF
IGSEQEYRLYNFTSSAFFPLSAFYLMLLHFLLAYDDNENNGTALKSVYVF
LLLFIGLISGRTFFIFSVISLLLYFKPRYIPAILAFTLLVLFFAYNYPAN
PYVAHALEIVINLIQGGSQISSSSDTLVNKHLFMPELKQLIMGDGQYYVI
GRTANSYYGGSDSGFIRQALYGGVGYILLCFLFTAYFVKRIADNWFNGSW
KFILSALFLLSILNIKADTFAYPGIMFVFLMFVSLFGDKGKIIIVERK
>MS1876 unknown
MINSPKELFLKTTQMIRELEMLSIDLIHLLQDSYKSFI
>MS2160 unknown
MAKPTALLKQTKQEKKMWDSNTLKQICQADDLKIAPFHPDMTSTGTPTWI
WEVAVDGRLFVRAYYGTNSRWYQSALAQKAGKIHAIDQVFEVKFEPIKDE
ALNQKIDDAYRTKYSSSRYMSHMISAGSRAATVEVIPA
>MS0296 unknown
MSHLIVKEQKTIIRNAFFTFLYFTLAIGLAIGILYTDIFYLQNMIEEESL
VEYTQSLSLTILTLMFSRHAYRSPQWRGGFVLITGFFLCMLIRESDALFD
NLIRHGSWAYFAIITALVCIIYAFTHRQSTIDGLAQFAKQKEFHSFIIGL
LTVLLASRLIGYGGLWRFILYNDYPHIVKNIIEETTELFGYLIMLFSCLS
LTRHFK
>MS1814 unknown
MPDRRNPIFRQFIHKVASNSQIAKLSSTNAEK
>MS1207 unknown
MPKKYYFQIGQDVVLTILLLSLTGYHLFEEVTHEWVGLGFFALVLLHIGL
NFWWIKKLNQGEFDAYRMVKTGLNFVLFFVFLTACISGILLSKHIFAEFP
FHLTDDFTRKIHMLSTHWIQIIIAIHLGLHWKTLADFLAQILRWDLNKPL
TKWILPTIWTMLSVYGLWAFIGRDLFPYLMNQVDFAFFDKAESKAIFYFD
YFAMLILFAYSTRVLVWLIFFKNEKK
>MS1070 unknown
MYLVEVFFKNINEDNLPQQIPLINQLIDQWRYNGQIIGREIPVFVANQEN
ERGLATRVICPEQQSLLPEYNNAEVNRCLANIENCGLILHSFQIVGEDLN
SDITYEDKKPDWQILYTTYLQVCSPLHSGDRLAPIPLYKQLKDVPHLSMD
VIKWQENWQACDQLQMNAVALESQALREISDINSRIFKHGYSLTKEIEEH
TGVPTYYYLYRVGGKNLASESARHCPICHGDWKLAQPLFDQFHFKCDHCR
LVSNISWNFL
>MS1809 unknown
MLIRRRKPSLKTKNKLSLSLSKRRNLRLKHLKRRKAKIAARHEYNFVVCQ
DC
>MS1380 unknown
MMTIQSSEILETIRMVADQNFDVRTITIGIDLHDCITNDIDQLNQNIYNK
ITTIGKDLVETAKILSAKYGVPIVNQRISVTPIAQIAAATKADSYVSIAQ
TLDRAAKAIGVSFIGGFSALVQKGMSPSDEVLIRSIPEAMKSTDIVCSSI
NIGSTRAGINMNAVKLAGETIKRTADITPEGFGCAKIVVFCNAVEDNPFM
AGAFHGSGEADAVINVGVSGPGVVKEALANSNATSLTEVAEVVKKTAFKI
TRVGELIGQEASKMLNIPFGILDLSLAPTPAVGDSVARILETMGLTVCGT
HGTTAALALLNDAVKKGGMMASSAVGGLSGAFIPVSEDEGMIAAAESGIL
TLDKLEAMTAVCSVGLDMIAVPGSTPAHTISGIIADEAAIGMINSKTTAV
RIIPVSGKNVGESVEFGGLLGYAPIMPVKEGSCEVFVNRGGRIPAPVQSM
KN
>MS0663 unknown
MMGRINMLQQFTRYFSVGIFNTLIHWLVFAFFYYVFSLDQANSNLIAFIV
AVTFSFFMNAKFTFKQQVSSVKFVSYTCFMGLLSYATGLCADYFNFPAII
TLIGFSVISLICGFLYSKFIVFKG
>MS0316 unknown
MSRTVFCEYLKQEAEGLDFQLYPGELGKRIFDNISKRAWGEWMKKQTMLV
NEKKLNMMNADHRKLLEQEMVNFLFEGKDVHIEGYIPQATN
>MS0314 unknown
MDKTNVLFYVIKKIAQKRPHFFLFLSLKIDNIANKSNYYFLC
>MS0206 unknown
MKIHRTFSKIRTLTMSEGTCSYFCVAHYAYVLNITGEPISALSPA
>MS1982 unknown
MLFYGSHYALLKNKKLRQRSQKLTNKVKLCEYLFFIGIKLTN
>MS1000 unknown
MANLASKGHVSEYFCPADIVLKNPIFKSLIYISSNFYY
>MS0582 unknown
MSPRLTANGLIMIDTNIVKKADVNIIIIAITAL
>MS0054 unknown
MRSSSQIKQFILNFISLETYHNEYGIQFQYMKRISGRAKMKVSYEELKSE
FKRVLLSRNVREDIAEECATVFADTTQAGAYSHGVNRFPRFIQQLENGDI
KPEAQPTKVLSLGAIEQWDAHQAIGNLTAKKMMDRAMELASQNGVGIVAL
RNANHWMRGGSYGWQAAEKGYIGICWTNALAVMPPWGAKECRIGTNPLIV
AVPTTPITMVDMSCSMYSYGMLEVHRLQGRQTFVDAGFDDNNNPTRDPAT
VEKNRRLMPMGFWKGSGLSIVLDMIATLLSNGESTAAVTEDKDDEYCVSQ
VFIAIEVDRLIDGKTKDEKLNRIMDYVKTAEPVDPNQPVRLPGHEFTTIL
ADNKANGIPVDDTVWAKLKSL
>MS1745 unknown
MFLFIFLIGIKVGLLFLLRKRLRILFGFKFMWRFE
>MS0825 unknown
MKTFKSMLALCLALGVSGSVLAVDNANPFATVKDGTKIMAKEKVGATKNT
VNEQLSGGQKAVKNSAGIKSTA
>MS2291 unknown
MDGAGEMFFHLNTSKVRLQKCYFLNVGVFQ
>MS1725 unknown
MKVYTGRTDIKLFEHLIDEPNQTVTHLVLKAGQAVPEHKVSQTVIVVPIK
GRIDFSNREESQEIYPGRIVQMIPDEWHALKALEDSELMVVKSTLAA
>MS0492 unknown
MRNFAIISLFSELVYLIGRLLSKIITNKRKTLV
>MS2182 unknown
MSANIATFYEMVNAPTRLSQYFLPHFYCFSDQNAKTIWHIFNSQYNERGY
FLN
>MS2142 unknown
MTSFKQRNRRKINMYIINITVNADLPEEKQKEMFPLHVEWFKKHFQEGKF
LMLGPFIDTDKHAGVIIASTESREELDAILKEDCYYPDFAKYEIREFEPK
MIAENMADFIEK
>MS0612 unknown
MNLIHSKEGEMNITKFPVILTLAVFSAGIALSPAYARGKNIFTENTERAE
KTYMSYGKSQQLDPNNSKDVNELANSVEFEVYEISENHSSHTIFESNAGI
CRGYQSSNGVELTDSTTYYVDDASDDYYASITGATIYAHASPKNVQYAPI
FNIHDPKILKEIHQDEEKYGKDLATKNVNARENILSKGICR
>MS0330 unknown
MSEKIQSVDYDDLRDLVASNDEGGRNPAGFPKKLIVGTAILWSVFQLYYT
SPFPFWLQEVLTQNNIDLNVVVDDTKARSVHLAFALFLAYLSFPALATSP
KHRIPIIDWICATAGAFLGAYYLFFYQSLVTRFGAPNLQDIIAGCIGIVL
LLEATRRSLGLPLAVIAVIFLLYNFFGQYLPTSWIISHRSGSLSQIINQQ
WITTEGVFGVALGVSTKYVFLFVLFGALLDKAGAGNYFIKTAFAYLGHLS
GGPAKAAVVSSALTGLVSGSSIANVVTTGTFTIPMMKRVGFTQEKAGAVE
VASSVNGQLMPPVMGAAAFLMIEYINMPYNELILHAFLPALISYIALVYI
VHLEACKMGLKGLPRTDPAKPFLVTLIRAIGTFLTLCIIYFVLELTLGWL
KTAVPNEAFLIVCLLLLIVYILLIRRVASFPDLEPDDPNAKIVVLPATKP
TVNAGLHYLLPVVVLMWCLMIERMSPGLSAFWGILALSAIIITQRPLLSL
FRKENTDKFIQLKEGVQELIKGLETGARNMIGIGIATATAGIIVGVVSLT
GFGVQLSGIIEILSMGNVLLMLILVAIFSLILGMGLPTTANYIVVSSLMA
LVIVEVGKQNGLIVPMIAVHLFVFYFGIMADVTPPVGLASFAAAAISGGS
PIKTGATAFYYSLRTAILPFLFIFNTDLLLLDVGWAKGILVFITATIGVM
AFTAATMGYFFTKNKKWEGFALILAAFMLFRPGFFMEYVSPTERHIEPAQ
LVQEIENAAAGQNLTIKVAGLNPYGKEIEFYSKLSIPAGENGEEKLKAMG
LTLLNTGEKIQINGNETDKILIDNVEIDSPAAKAGLNWDQTIIDVEVPKN
SLPKELMFIPALLLVSALAWNQRRRRNS
>MS1007 unknown
MNKSRFAFQLIKSVVIVGATISQIQKKSSN
>MS1142 unknown
MGYYSVFSYSAGNFASCPASALFILSEISVF
>MS0230 unknown
MSRQKKSRNIVDVMPQRKSDKSQISPASYARPSKKLTRYELDAKAREDKK
KKKHKGLTSGSRHSRSEQHNNQQMQEKRDPRLGSRKKVPLVVEFVNNPEK
GQFIQPVQVQPAEEKVKKLDPMLELEQLENNECLNQLLDALDEGKTISAE
DQKFVDECLDRIAQLMDELGIEDEEESEDDLLRTFEKIDINQFK
>MS0407 unknown
MLTFFIITLIVGSIVGFLAGIFGIGGGLVIVPTLLYLLPMVGVPDEKLMA
TALGTSFATIIITSLASAYRHNKLGNVVWEAVKYLAPTLVIATFISGLFI
GKLPKDISSKLFACLVVYLAAKMVLSIRNKKSKTPAKPLTPQSTILGGIL
IGIASSAAGIGGGSFIVPFLNSRGIEMRKSVGSSSFCGAFLGLAGMLSFM
IGGWSVEGMPDWSLGYIYLPAVLGITLTSFFTSKFGAEMANKLPVASLKR
YFAIFLILMAIKMLIG
>MS1905 unknown
MKYQWIFFDADETLFSFNAFAGLQKLFADNGLKFNEQDFTQYEKVNKPLW
VKYQNAEISAEQIQTIRFEPWEQKLGKSAVEINQDYMLALADLCKRNHSH
PGKTGKTGNYYKRLYRLATSSSAKNRFSTIFPVYYYFARTRHSQTGRPNL
RA
>MS2261 unknown
MMKSDKPIECVGCNTFDVGSILNNDELEAKIEKVFAGKEEAEQGLAALTA
KARDIESEPCKISSEITPVDGGYKLTASFEFSCQAEVVIFQLGTRSF
>MS2052 unknown
MNQNQPHFYRGRFSVAPMLDWTTRHCRYFHRQFSRHALLYTEMVTAPAII
HAKYDHLEFDPAENPVALQLGGSDPEQLQHCAKLAEQRGYTEINLNVGCP
SDRVQNGMFGACLMAKADLVAECVEKMQAEVEIPVTVKTRIGIDNLDSYQ
FLCDFIQKVHSKGCNEFIVHARKAWLSGLSPKENREIPPLDYNRVYQLKR
DFPQLSISINGGIKTIEEMTAHLQYVDGVMVGREAYQNPALLGQIDRALF
DLNAPIVTPREAVEKMFPYIERQLSRGVHLNHIVRHMLGAFQNCKGARQW
RRLLSENAHKTGAGIEILETALHFVEE
>MS0077 unknown
MANQVYLALYKNKRSWAKEPWKAFADAITRNFTKGDFSHCELVVERRQFT
SGSHYEHEVIYDCYSSSVQDKGVRCKQINVRDGKWVLIPLQNVTEEQIKH
YFEQTKGKHYDWWGALGVVLGIKQKRSKYFCSEWCFNAIFATEEGWRFSP
NQLAVMFKKGY
>MS2106 unknown
MHIDFLIKSQTTLLPIGHSELCQNNVYILG
>MS1841 unknown
MQKRDRTFKRCYNERPILSILITDFKENMSDTSLSLHPIAIINTPYKEKF
SVPRQPNLVPDGVGTVELLPPFNQPEAVRGLEAFSHLWLIFQFDKVPQGK
WQPTVRPPRLGGNRRIGVFASRSTHRPNPLGLSKVELRKIEISNGKVLLH
LGSVDLVDGTPIFDIKPYIAYVDSEPQAKSSFAQEKPQAKLKVEFTPSIQ
NIIQKIEQKRPHFGRFLTDVIAQDPRPAYQAGKPSEREYGITLYEFNIRW
RIRQNSADVAEVFDIEQTGNI
>MS0737 unknown
MKQLLEFIPLILFFVVYKLAGIREAAIALIIATIFQMLILKLKYGKIEKQ
QIIMGIAVVFFGTLTAYFNKVEYLQWKVTIVYALFALILLISQYGFKKPL
IEKLLGKEIQLPEKIWNKLNLAWAGFFILCMLINIYISQYCSEEVWVDFK
SFGIIAMTFIATLFTGIYVYRYLPKDDQNK
>MS1103 unknown
MLPRKYPLMSAVIKQNFQKIDRTFYKKKSRA
>MS1886 unknown
MNMTFGWLASLAGKIGLDMLNNSPDRLLKIGRIQNQLESERIIETEKAKV
YAEHEASRLKQKLSEVEPEKQSKIVGKIAILNNQIELLTQQQNTINSFIE
TIKDIGDIPKESLKEPDNDWLREWTKNAGRFSNEDANRLWGKVLAGEMKK
PGTFSYRVLDGLRNLSKDDANLILQIIPFITNGLVYRSNDLIFNMGTNWG
NWYQLEEIGIVRHVGSVSTSASTPVDQYTPMYIRGISYALVLSSETQKTI
SEPIIILTELGNAIMQLIEPTFKNDINLVHKQKEYMGNLGNYLKKEYQVT
YSIIKIPNN
>MS0521 unknown
MRALKKISQLLAKNTALVIILTALFTFIVPEAFTWVKGDAQVLVLGIIML
SMGMTLGAKDYQILAKRPLDILIGTVAQYTIMPFVAISIAQAFNLSPGLT
LGLVLVGTCPGGVASNIMSFLCKGDVAFSVGMTTVSTIIAPVMTPLLLNY
LVGETIDMDGWGMFKFMLLVTILPVGLGSLFNMGCHKQKWFNDVRSVMPG
VAVIAFACIVGGVVAFQGERFLESGLIMLMAIGCHNITGYILGFAAGRVF
GMNTAKKRTLSIEVGVQNAGLATGLSAKFFPTNAESVVACAVACVWHSVS
GSVLANIYQWWDKKHGEPVTEIHEIKKPVTESV
>MS0710 unknown
MAGHSKWANIKHRKAAQDAQRGKIFTKLIRELVTAAKIGGGDAGSNPRLR
AAVDKALASNMTRDTINRAIDRGVGGGDDTNMETRIYEGYGPGGTAVMVE
CLSDNANRTISQVRPSFTKCGGNLGTEGSVGYLFNKKGLIIIDAGADEDA
LTEAAIEAGADDIQPQDDGSFEIYTAWEELGDVRDGIEKAGFKIAEAEVS
MIPTTSVDLDAETAPKLLRLIEMLEDCDDVQNVYHNGEISDEVAALL
>MS0100 unknown
MSKFTFEEQAKYFEKKLNLKTDNYLDVLGEEHDYFFMVAGANRNEVLTAL
REAVDAAVLKGETLDGFRRRFDDIIANTGWEYNGGRNWRTRIIYDTNVYG
AYNRGRLQQHLDMAEDMPYWEYQHNDNAHPREQHMAWDGLVLRYDDPWWR
YHYPIKAYGCHCTVVAHDEADLRRYGKKVGTAPEIEFEQKTVGIRSGNPR
TVTVAKGTDVGFTPWNFDRIKQRRNASIDSVLMQKLITAAPKFASLLVEN
ILERPLAVTMLNAAMKDMVDTVAAEKVARGQLKYVGVLAPEIIEKLTALD
KAPQTAVIAVRDEDVLHALRDSKQAKGISLPVEFWETLPEKLRNPQAILL
QAKEQQRDKNAKDVLLFVYDTEQGKVAIKMDYEVKLKGQLSKKKLKHSLN
MVTTGSLFKDTTALHDFDVLWGSLD
>MS0089 unknown
MADLAFWFGFSHSELEEMTLNEIERWLKQAKRQIDANYTKAAV
>MS2253 unknown
MSLEILDQLEGKIKQAVETIQLLQLEVEELKEKNQQAQQANDELRSENEQ
LKGEHNNWQERLRSLLGQIDNV
>MS1050 unknown
MTIKAIIFDMDGVLIDSEPVWKQAGIDIFNAEGIPVTYDDMLALTGIPSL
GIVKAVYEKYQRSPVPVAEMAQRLNDHAISLILAQKPLIDGVQETLQKLT
ALGYKLAVASASPRILLEEITQSCGIDQYFSYLSSATELSHNKPHPAVWL
HAAEMLGVEATECIGIEDSVVGMVSVKAASMKCIVVPGVLGSDDPRWALA
DIKLATLREIDETVIGKLDSI
>MS2125 unknown
MGDLFVKFNFAWPWMGLAMAAVLSVLMISTDIFRSDENSHRWTDPVWLAW
LVVPLYMFHQFEEFALSYNVATDSYNVVTEVCRLHGYKSYEPCPIPAVHF
PFVNVLFAWIAAPLAAIMSKRNSLVGLSLYGFIFAEGVLHLTFGLLDHQP
FLNHGGLITGSLLFIPISLWVIFIGVKANIMSCNAMICTIFAGVIGQICL
FNAYSIFPDFGVTGMLIMDAIAVFIPLVLAALVSRRII
>MS2111 unknown
MGIPAHQSTSGEKQEIFYRLFSLLILLQILFIFLKSA
>MS1918 unknown
MDNQTISDQSPLSLGEQLRRAREKLNISIDEVAAKLNLRSAIIQAIENDE
FVQKSIPSTFMKGYVRNYAKFLKLPDGLLTSSMPNFAEEPKNDLNKNSRT
KHSVNPHAAHSRWVGYLTTLVVLFVAGMTALWWWENYQQSNNERDNLVQN
YVATEDRTAERSDNVVEIPAIQTIPEASTPVPEANTNESVEIAPVVANTP
VVTNETQPVQQTAEQTNTAQAMLQQHSTEPEQAQPTDNETTEPATVTAGD
LQIEVTGVNCWISVKDAKRKVLAQKEYKQGEILTFNEGSPYSVIIGAPGN
VKITYKGEAYPLKVDGRVAKFKLQ
>MS1867 unknown
MQAPTRRCGGRNGVKWRWCYQNTAKISLPTAVIFSVTFLIKRQLWKNSGF
FTTSQDFSAKK
>MS1132 unknown
MPKRAEILRKALVEFYGTEDSIDLAKFDVGKI
>MS0180 unknown
MLHNQQLLEEGLKKLKEIDGSQADKVMDALSDIAPDLGKYIISFAFGEIY
NRPRLDLQQRELITLAALASQGGCEKQLHVHIHASLNVGLSRKQIVETFI
QCIPYLGFPKVLNAVFVAKEVFSERDGTENFEKNDRTFK
>MS1488 unknown
MKSFIASLVIASRWIKLCGLMLLDFFVFPVLLWMCYALRLLDLSAEIVPN
FYLGEFWISLFAVACLFVCSVYHFVIRTFNETLIVRLLIASVMTVIGLLL
LGHFTDIFVPASVAIMFGFMMFLWIWLSRSAIRLTVRYILNPRVTSKRIA
IYGAGIGGQQVVQTLLRSDEHLPLFFIDDDKNLRNRRVGGLKIYSAKAAL
DALERYEIDEILIALPSISRARKNEIVEFLSQSHRRIMELPSLTKLVDGQ
INISDIKEVDIVDLLGREPVDPVPELFSKNIQGKVVMVTGAGGSIGSELC
RQIIRNQPKTLLLFEISEYALYAIEQDLRGIIRKESLVEMEILPLLGNVQ
NKQRLVEIMKAFNVETLYHAAAYKHVPMVEYNVVEGVQNNIFGTYNTAKA
AIEANVDSFVLISTDKAVRPTNVMGTTKRIAELCLQALAQEQGSAHHTLF
SMVRFGNVLGSSGSVIPLFKKQIAQGGPITVTDKRIIRYFMTIPEAAQLV
IQAGAMAKGGDVFILDMGEPVKIVDLARNLIKLSGLTIKDGDNPNGDIEI
RFTGLRPGEKLYEELLIGDDNVEQTYHERIMTAKEDYLPPDKLRELIRQL
EEACDNNDCEQVRRLLLNAPTGYHPVSELADVVWTKQHSDD
>MS0697 unknown
MDDLDYFLCRLPIPRFWSYAMRLEKLKKPVDVFISTFSIIVMVLLVICVT
WQVFSRYVLQIPSTITDEIARFSMIWVGLLGAAYTVGLQKHLSIDLFTHN
LTPRNKAFSNLFINFCIMGFSLGVMIFGGLTLVSNVYASGQLSPSMQIPM
AYIYLALPLSGLLMLFYSILFFIDNLHSLKEYD
>MS2164 unknown
MNIVHNPFPLCVIISLSIRPFFIRLYYEFPNSLLDKNE
>MS0246 unknown
MSMQPKFLLLKSLFYIIPLALTVSGCFDKTAEKLQETPQKSTALSTQESF
PPIKNNYDFAMKDDKIGQNLKANVDYYMLALSWSPSFCWTQYEKYGNHLP
DSAEYQCGIKKKYGWVIHGLWPQSATARTVAGHPRLCKGDLPQVEENVVR
QYMAESPSPNLLQAEWEKHGACAFDRAEQYFAKQQALYRTLTLPTVEMKG
KELFSWLRKNNPQLRHAYLGASRDELYICYDLNWQVINCPKQ
>MS0727 unknown
MTLMRKITKGMTSVSLLITMVLFSVIMLSILQWSGYQRKSAVEIYQYFQA
VQIAENQKQRLFLGLGCESQVVQNGIQFRLLCVGEKITVSYPMGKLTL
>MS2119 unknown
MRLKFDKFWGVDFTPLFITYHLSNCDKCLIRKANDFIKIYLDCTNQPYFT
IGGECGYFYAN
>MS1129 unknown
MNIIEIKELNRYFGEGENTVHVLKNISVNIEKGDFVAIIGQSGSGKSTLM
NIIGCLDTATSGSYKIDGKETNELTSDQLSDLRSQKFGFIFQRYNLLSAL
TAAENVALPAIYAGKSQSERLARAEELLKKLGLDGKEKNKPSELSGGQQQ
RVSIARALMNGGEIILADEPTGALDSHSGENVLEILRQLHSEGHTIIMVT
HDKNIAASANRIIEIKDGEIIDDTQKHPVQNTVNNQSKAKSRFGFSKDQL
MEAFQMSVSAIIAHKMRSLLTMLGIIIGITSVVSVVALGNGSQQKILSNI
SGLGTNTMTIFNGTGFGDRRAEQMQNLTVNDANALAKQSYVQNVTPNSSS
SGLLIYGNQSFTSTNLKGIGEQYFDVEGMTLKQGRSITAQEVRDNAQVAL
LDESSKKSIFPNDNPIDKIVMFAKRPFRIIGVVADRQMGAASSSLNIYAP
YTTVMNKVTGGTKIDSITVKIADNVNTAVAEKSLTEYLTVRHGKKDFFIM
NSDTIKQTIESTTGTMKLLISSIAFISLIVGGIGVMNIMLVSVTERTKEI
GVRMAIGARKSNILQQFLIEAILICMIGGISGIMLSLIIGGIFNVFMTDF
TMVFSTFSIVAAVLCSTLIGVIFGYMPAKNAAQLDPITALARE
>MS0505 unknown
MKTLLLLSTMLLMTACSNSVSVLPLPSTAKPAVKTAVMDKTTQKGTATLY
RCKDDKEVRVVRNINTGNKSKKRQKSGSVINLTFNNVTQKLTSTVSESGN
SYTNIHWHWFERGDANMLTTSVGKVLAEQCIIQKASPLEALEKDTNK
>MS0202 unknown
MEKLRFATRLNSFASKAHSYWPSIKGKPTIRQMIERASKVKGLTDVDLNY
PQHLNEAPKELGKFITDCGLNVNGMAMRYSTNPEFQLGAFTHPDEKVRRE
AIELTKRGIDCGRELGTNLMTIWLGQDGFDYSFQADYNKIWDDLIYAFRE
VAEYAPDCDISIEYKPNEPRSFSILPDVSTTLLAISDIGAKNIGVTLDFA
HVLYADEMPAFAAAMIARRTKIMGLHLNDGYHKRDDGLMVASANPKATLE
LIWQLRKAGFDGAYYFDTFPDASGLDPVHECEVNIQTVNQLVKIADQLNS
VDQLNVAIANQDAVSSQGIINQFLLGR
>MS0098 unknown
MIKITLDDTQAVKKLQSVAAQLKAPRRLYALLGEELKKIHDDRFKTEKDP
NGKPWTPLAAKTLARKRKRGKSLKILRQDGNLANKTAYNILDDGVEFGSP
EVYAALHQFGGKAGKGRQVTIPARPWLGVNKENEYYLLKKAVSHLQKSLG
KIK
>MS1256 unknown
MHRILGEQNEKMDCYFSYCAYLGWNLVENSRTTSAENRIA
>MS2251 unknown
MMKSAVKKTKFFNLACQHQLLNYIFGAIEWKPLSTAF
>MS1200 unknown
MKQNLFSLLIREENFSLKADGIQSVRKKCGGNLSRIY
>MS2220 unknown
MCLYRVILAHPRINHKKCGQKSLFFEGWLCQRLPMGKYPYNDYEYVLKIH
IFKFN
>MS1656 unknown
MNYKISPKMTALFIFLSIKKPFIAIKGLILFILD
>MS0029 unknown
MFEQIYEKNTALLCGILFDDIKLMKVGGDFNEEF
>MS0759 unknown
MKIKPLFLSLFCLAPAVLNAQWANVGKADYNWGPFLVYTVSFDTENGEYQ
DHQSPLMFSFDYAKPVEGKNFSIILIKEMTSLGATKEQTEKWLKELSAIP
MPDFLPNDRLSYIALENTGYFILNDQVLDHYFDAEFNQYFIQVWLSGKTG
FARLQNQLLGKEKGTVTESYPRAPAVVPLTEEDADPQLPPNYQLTDRTII
NC
>MS1470 unknown
MNLGYTNGSKEVSGASKKHRTLIRFNLTVKVRCFTEDYGKFSRRLI
>MS1377 unknown
MNLGYADIPEKKNDPLYGLQILFHDFYHKLSLLLTL
>MS2304 unknown
MIEKEKKTTALFYYHLGLVIKGEKRGTSEGVERSALTVVPIV
>MS2302 unknown
MLTLIKLFFDSQNYIKSAVNFPTVLRLRASVTVMTRRFS
>MS2127 unknown
MKKIFAFLIALFCTTSVLGAPMNIEIQIANSNEKITASLADNQTARDFYA
QLPLSMQLEDYANSEKIGHGIPKRLSIADSPKGYAGKKGDLTYYAPWGNL
AIFYQDSHVGYANGLVYFGKLTAGLETLSKLNGEVVTIKKAE
>MS0985 unknown
MRIISLSALQHYAFCPRQCALIHNEQLWAENFLTAQGNALHERVDSGEPE
TRKGVRFERSVHVSAEQLGISGILDMVECEIQTGKLKPVEYKRGKPKPKP
SDEIQLCAQALCLEEMTGKKVEEGALWYMQTRHRHPVIFSAELREKTLQV
INEVKTLLESGITPPPNYSKSCKACSLIDLCQPKLLERDKSGKYVVGLFW
E
>MS2126 unknown
MLAGYTYYEKQSKAQTFAEVFANSNLSGELTDDFE
>MS0700 unknown
MKNLTALIEQLQAKVQQLTLQFAAFSDKKIYAKFDRTLFSEDFESGQFYF
DQIQHTLAQIAGLKETEIPQIQFFSEKLLAQCTALSDAINQNNGRKTAPT
PKIPSQREKIKHELNQLPPRERLVRYYEALQALNEKINELEDKRDTAHNE
QQKAGYQHQIDITLPRRKRCLEAIEVLEEYLSFKEN
>MS0105 unknown
MVGLLLSFLGCCFGFAKMLVAQFQNAMEERHANQQRVNEKVEDLERMVNK
MNSSMPLVYVLRDDYIRGQTVLEAKMDAVHKTLSDLYKIESAK
>MS1921 unknown
MAYTSLEEQEINEIKNFWKENGKTIIVSVIIAIAGVFGWRYWQSYQLSQH
HQLSDQYQQVIYEFRQDPAAQKDNLAEFIAQNGKSGYAALALFEQAKTAV
EKQDFSQAETALKQALNNAPDEIFASIAALRLANVQFQQKDFDGALVSLN
LVKDTSWDSRKQILNGDILLAKGDKAGAKAAYQQAQKNASALEQQWLQVR
LNNL
>MS2294 unknown
MTDNNILIFMKNTNIVHFLNKNYFHIFSNNLI
>MS0803 unknown
MVGQFNQLEGKNMKLLAKLGAAALLAFTLAACSDPAADLKKLQAWDRDNA
AAQQQIQAELQQALSTVKEPSELEPVLASYKAKVQDLVKSLDQLDIKSNE
IKALKEKTKAVFLESQDVTADSLKVLVVSRTEETVNALKAKTEALNKNVE
ELMKLQNDLQAKFGDKTAETKPAEQAPAQPAEQAPAQPAQQPAEQAAPAQ
PAK
>MS0249 unknown
MKDFQKIYGIITKKTFFLSIYTLNIVNYKVAITHNSVEEEKLC
>MS1642 unknown
MEPISLPEYAGSTLRGAFGRALRKIACMTKQADCKGCPLYRSCPYTNIFE
TPAPTSHELQKFSQVPNGYIIEPPEWGEKIYLTGTELRFNLALFGRLIEQ
LPLIAFAFKRAFEYNVGRGKAHLVDIAKFSQNMTACQSILKEGNIIEHEK
QIILPESLPNYLTIQIETPLRIQENGKPLRENQINADRFFIGLAKRISLL
SEFHHQPLNLDFELIKNDLQAVKYEKNLTWLDWTRYSSRQDQKMKLGGVV
GSWQFENLSPELIQLLYFGQWLHCGKNATFGLGKYRITNL
>MS2002 unknown
MLEILQRHKRLRLNIGYAKGGEAVSAAGKNFLKTDRSFYAFVPNQAAYKS
KR
>MS0433 unknown
MFVLRYLVMWTNFIADFSKQLTPEVWALIGSSTLETVYMSFSATLFAVVL
GLPLGVYNYLTKPNQALANTKVNRFLEWVINIGRSIPFIILLFNLMPVTR
FLVGTTLGTTASIIPLGVCALPFFARLTSNALGDIPSGLTETAKAMGTTV
WQLVTKFYLPEALPILIKATTLTLVTLIGYSAMAGAVGGGGLGNTAISYG
LHRNMPYVLWISTIIIVVIVMLCEKYGNKLADHFDHR
>MS0295 unknown
MAIVSVPVEKSYRLLNIGATTLVSAKAEDIENVMSVAWSCALDYGPLSKV
TTVLDKQAFTRGLVEKSGLFAIQIPVANQAELVVKLGTTSRHNNPHKIDD
VEIFYPDGFDVPLVKGCAGWIICQLIRDENNQQNHDLFIGKVLAAYADDR
VFKDAHWIFEQAPNELRTLHYVAGGQFYLIGESLEVK
>MS1903 unknown
MEIMAFPLRPFPLLLAVFISLLAIWSAVEPVSRAVWYAEVVPVFAVFMLL
IVTYPWFQFSNPAYFCMSLWLILHLIGAHYTFELVPFKWGSDLLAGWLGE
GRNHFDRVAHYIIGFYSFPMAEFLLRRKLTGPIVAGFFSLFFIMSVAAGY
ELIEWQYAVIAGGQEGIAFLGSQGDVWDAQKDILADTLGALTALLLFYFI
RPDKKYS
>MS1049 unknown
MDQNWQTYRTLVNDHIAIFSANLAIFEQFSSEAAKLSKVVQFSIGYCADE
NGLPQAEEHQLLFRNILRALTNLSALSDTLYAGHIVSNGKAKLYFYTNDT
DAFIQVLNGLGYTDDLDIQDDPNWDIYFDFLLPSPLESKMNATEEILDLL
VRNGRDLADIFLVEHTFYFEDKENLLEFIESAELDDVSFNALKYTDEPVP
VNDEEMLYMAKIEQELTLNNNEIFTLVEKFEHLAHQYFGEYVGWECDELE
PNRGQLN
>MS0366 unknown
MKKSVLLFIAASMLVACSTGSISQRTLPADPLLPPSLVQPGFVRMPHNLH
YYADINSVWVDSDSKNMIHFDAVINLRKGPHVYSDRDKIAKSMRQAKVVN
CDTMKLTHLKTDYYSEFWGTGDPVTPEHQKMRTVDLRKGSSLYTLAQVLC
INLYRK
>MS1901 unknown
MFKSAVKISGFFTALFSCPRILQIFHSLSPINRVN
>MS2317 unknown
MADPHIHSPMDAWDYLTVCIYRSGFVLAAIFTALLPYYPDIAQTGLLVAA
VFCASSLHLYLKNFRLILQFATWIALLCRLFSQPELAFGGALLTLGGLCF
KEYFCFRILGLNLQPVFVALLWGSVVFEFSLAINILSAISAVLFLLLSIQ
KWRMPLHFDIGDKTKYQV
>MS1838 unknown
MRSFFGLNLFCLLHLYFKNKKCGKNFRIFYRTFANSI
>MS1391 unknown
MDFVLQRKQAVIPIEVKAEENLKAKSLKVYVEQFQSEKAIRFSMADYREQ
DWLVNVPLYACLNFNY
>MS0814 unknown
MLNAIDLTASELYKEKAEFTLEFKNLPYFFIFRRNYYAI
>MS1381 unknown
MQQYHDPDSPLLLGYLRETDVVTQSPQKSFLATRQMDLIILRKD
>MS1212 unknown
METLDKIKKQIAENPILIYMKGSPKLPACGFSARAVEALINCQVPFGYVD
ILQHADVRAELPKYANWPTFPQLWVEGELIGGCDILLEMYQAGELQTLLK
EVAERHKEQV
>MS0906 unknown
MNSNLIRLIFITLLSLGLTLISSFVLARLLSVQDRGLHQLFITAVSYVVT
FATGGSGFALALSMRKKQYAGWQNYFIAFLALSVLAAIIAIYCFDFTAFH
VLFVLNVVLTAILTMTLEKSKIDANLRVYRQLTLQQPVLLVAVYGICYLL
LGEQPLEIAIELFTLFSAMQALACLYYLKKINADFKRKNEIQPIQKRFFL
KTWFKQNLLQIFGATTASLDKFLIVYFLGNYTLGLYTVCIAFDSLITKFI
NMLADYFYSGLLNNINRIKSVLILILLMAVGAVILVPLLAEPIIIFFFSA
KYAEVAPVLILFIINAIIGGLSWVLSQNMLLLGKQVLLFTRQIIAIAVFV
LLFYLFKDYQLYGVAYAFIGASLTRLIISVIYYLKYPITDVKPEKSAV
>MS0326 unknown
MMKISTKGKHNFWSQLLVSMIAIFALPCAQGLNYSDAVTNENYQAQRTSI
KQPAAKFSALIQQQVAVQQRQAQQCNVDCPKFAKIEPHFCLSPSYFHAPI
RGSPLV
>MS1548 unknown
MMNTLDRYIGKSILGAIFATLLTLVGLSGIIKFVEQFRSVGKGSYDSMQA
FLYTVLTMPKDIETFFPMAALLGALIALGNLASRSELVVMQSAGFSRMKI
GFAVMKTALPLVLLTMVIGEWGIPQTEQFARDMRSKAISGGSMLSVKNGI
WAKDGNDFIYIKRATEDANLNNIYIYSFNDNRQLQRVSHANKASYENGSW
VLKQVNESQISADEIKTKNYLNRPWKTSLTPDKLGIFTVKPTSLSISGLS
SYISFLKETGQDSKKFELTYWRKLFQPISVGVMMMLALSFIFGPLRSVTA
GARIVTGICFGFVFYVINEIFGPLSLVYNVAPIIGALMPSLLFLVITWWL
LSRKRD
>MS1700 unknown
MLIYKPSLLKCGRFFENFFSSHKISKNFTALLILSLWKIFLN
>MS2076 unknown
MIQFIWRFYATHSTGSRFNIQSLNSRILAVKKLAEIAAGIAYIR
>MS1502 unknown
MIKIMQKNAFFIAQYKKYSSDFMGTDHFYSGGLHKTDLLLRQGLK
>MS1718 unknown
MRQFMELIIISGRSGAGKSVALRALEDMGYYCVDNLPINLLPELADILST
SQQSAAVSLDIRNLPHSPETLDTLLQQLADAQHQVRIIFLEADRSTLIRR
YSDSRRLHPLSMQDLSLEAAIEAEAGYLEPLLQNAELVINTSEISTHELA
QRLREFLKGKPDKELKIVVESFGFKYGLPLDADYVFDVRFLPNPHWNPDL
RPMTGLDQPVIDFLGKYSEVNNFIYSTRNYLETWLPMLEQNNRSYLTIAI
GCTGGKHRSVYIAQQLGEYFQAKGKKVKIQHKSLEKHHKKNSA
>MS0139 unknown
MKKNTNSTRSNQSNSKPNQSKGEVRIIAGKWRGRKLPVLNAQGLRPTGDR
VKETLFNWLMPYIADAVCLDCFAGAGSLGFEALSRRAQGVTFLELDKQAA
TQLKKNLQTLNVPVEQGQVLNQNSLDYLKFGQNLPQFDLVFLDPPFHLGL
ADKAIELLGQNNWLKPDALIYVETERDKPLLTPPHWQLLKEKTTGQVSYR
LYQA
>MS0693 unknown
MERDRMKIVIAPDSYKESLSAMNVANIIEKGFKQIFPDATYVKVPVADGG
EGTVDTMVEATNGKRIELDVVGALGSQQKAFWGISHDNSVAFIEIAAACG
IEQVPMEKRNPLITTTYGVGELILSALDSGVRHFIVGLGGSATNDGGAGM
LQALGVKLLDEQGKSLGYGGAELARLSKIDFSTMDCRLAECKFDVACDVT
NPLVGENGASATFGPQKGATPQMVKQLDEALSHYADIIKQDLNIDVKDLP
GSGAAGGLGAAFAGVLKGELKSGIGIITQLLDLESKIKDADLVITGEGRI
DHQSINGKVPVGVAAIAKRYDLPVIGIAGSLGKDIHVVYDYGLDAVFSVL
NKVCSLPEALDPTNAAENLEITARNIAATLKMKIS
>MS1348 hypothetical protein
MERHPMVERVDYPGLASSKDYELKQKYTPNGLCGVLSFELKGDKQTAMKW
LDSLQIISREVHVADIRSCALHPATSTHRQLSDEEMRAANITPGFIRLSI
GIENPEDLLADLQNAFDQIK
>MS0352 unknown
MTLNITSKQMDITPAIRTHVEERLAKLEKWHTQLINPHFILSKIPNGFQV
DASIGTPIGNLQATAQSDDMYKAINEVEEKLEKQLNKLQHKDESRRASER
LKDSFE
>MS0043 unknown
MPDKFHNFSSRIVKFVFIYNKNLVKNDRTL
>MS1678 unknown
MKRRVLMKNENIKEKNGWFKRVIEKYNRFCKDFGYDQATCRSCGVPEIKA
DENGNLLKKEPKNKK
>MS2166 unknown
MNNDLTTSAIARNNVLNNKYALAELETNLQLGGLSFEGETVFTKQQAAQI
LDVTERTIDNYIASSGDELEKNGYRILRGKSLKNIRLAYVDEMNFVDISP
KAPSLGIFTFRALLNLAMLVTESERAKFIRSRMLDIVIDVIAQKSGGKTT
FINQRDVDYLPAAYQEESYRKQFTNALRDYLEMSNVKYGIYTDKIYQIIF
CENTKEYRQILKLAEKDKTRETMYAEVLKAIGSFETGLAAGMKQKSEMLG
RKLTPTELNELLAEAASNPFLQPFILDARTKMSSRDLGFREVLHEKLEKY
IQAIPENDFERFLGERSRSLKEQLEDAETLAVLQRLKDR
>MS0640 unknown
MESVSRHIITRQKLVKKDRTFKVRSIYRRILLFFLSIF
>MS0823 unknown
MKMNEINHNRRKWLALGGIILGATILPNSVLAAASTPSPRILRLRNINTG
ERFSSEIVNGKLLSSSALNQLNWLLRDRRNNHTYRMDPNLFSKLYQIQGN
LGLRNTEIQIICGYRSAATNSAMHRRSRGVASNSFHVKGQAIDFRIDGVS
LANVKRSAESLSNGGVGYYPRSNFVHVDTGPVRTWSGS
>MS1008 unknown
MSILLCMLVCLLEFMKFIGDFCNILCNDDIFLQAYSSL
>MS1578 unknown
MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ
GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK
SLRR
>MS2003 unknown
MFINKPINKRNKCSAFLLRKTDVQRICTICLRPTCRAFSVSSETNVRRRM
PVFRRQNRGDYQRRHRIFVKTNAETLPVFQAKGAVQYEYFSRGKRQKMHY
WSIPDEDVEEREKLQQWFDLGIKALAGA
>MS1435 unknown
MNPLRQNVDKTEKCGGNFENFYRTLGFVISIAICDN
>MS2016 unknown
MNLPRYGGIMMSNSGFTEKRYHHRLDRGRIILQKGNIYLNREDGEQYELV
DYMDEPSQLLVRNLNTRTTKVVSIHQLENFKMNERTDLSVDLTAISNEYW
EKAQQKYEAIKPLLGMDQHRPYAVKARAEDVGVNPRTLYRWLQAYNSIGS
IAGLVDQKRGWQQGNSRLTPEQDKLIVQVINEFYLHKQRPTTEQTIREIR
RRCKIEKVESPSKETIRIRILHISEEERLRKRGQREKARNKFKPKPNSFP
DADYPLSVVQIDHTPVDLIIVDSKYRKPIGRPFLTVAIDIYSRMIVGYYL
SLDAPSVTSVAMCIARGILPKERLLLDLGLQGSEWNAFGYPVKVHVDNGP
DFQALDLSKSCSAHGIHLEFRPMGRPEYGGHIERVIGTFMKEVHSLAGTT
FSNIKERDSYDSEKEAIMTLDEFEKWLVHYIVNVYHKRVHSALGISPEQK
WKIGIFGDENEVGCGYPQLPVDEQTLLLDFLPSITRTIQHNGVTIDGLRY
YDVALNMYISDSDESGKSKEFLFRRDPRNISKIWFYDPKLKRYFPIPFAN
QAMPEMSIWEYREVRSRIANKGDKYINEQQVLDGLTEMREMVAESAQRTK
KARRQAERQKMHKASKPIIETKVETKAVVPVVVTSNLLALDDESLSFGEV
D
>MS0104 unknown
MMEKIRREGMRWNLLNALHKARPYTTHEQFLREVMASIYPNVTPLEIRQQ
LEYLADRKLIELNKQPSGAWYADINRLGVDIVEYTIDCQAGIARPEKYWE
>MS2133 unknown
MWNFCSGWFEFEILPKLTALLENRAKIIEHYAIVF
>MS1155 unknown
MSVRAYCMYYVSLYLSPKRLKTAYVVVVTVAAVKKVVINKSGLS
>MS1025 unknown
MSIVFYFKTNFIHFNMLVAKSAVKNLKILTALFCYHFSAAC
>MS1904 unknown
MRVILAPMQGVLDAFVRQLLTEVNDYDLCISEFVRVVDQLLPEKVFYRLC
PELKNAGKTTTGTPIRVQLLGQFPEWLAENAVRAVELGSFGVDLNCGCPS
KTVNGSHGGASLLKQPELIYQATRAIRTAVPKHLPVSVKVRLGWDSADFA
FDIADAVQQGGANEITIHGRTKADGYKAERINWEKIGELRRKLAIPVIAN
GEIWNWQDGQNCLAVTGCEDLMIGRGALNIPNLSRVVKRNEEKLPWHKVI
RILQKYAHLENIHDTGFYHVARIKQWLQYLKKEYPQATMLFDYIKTCHNA
DELRIKMEHLQ
>MS0986 unknown
MMSKVALQSAITNKNNAISPKKKPPNLPKIKELIMSIQNRYEFVYFFDVT
NGNPNGDPDAGNMPRLDPESSKGLVTDVCLKRKIRNFVELANENQAGYEI
YVKEKSVLNLQNKRAYEALEIEPEAKKLPKDEAKARDITAWMCKNFFDIR
SFGAVMTTEVNSGQVRGPVQLAFAQSIDPIIPLEVSITRMAVTNEKDLEK
ERTMGRKYIVPYALYRVHGFISANLAAKTGFSEEDLQKLWQALQLMFEHD
RSAARGEMAARKLIVFKHDSALGSVPAHKLFDSVKVERINGESGTPATGF
ADYQISIEKDKFNGVSVEELL
>MS0478 unknown
MIEWRYLIQDNNSELNMISYSSLNQQLKSADIGATASELHGLLSGLICGG
INDDSWQPLLYQFTNDNHAYPIALLNEIKEIYQDIGQKLADMDNFSFELW
LPEDNEVFARADALSEWTNNFLLGLGLAQPKLDKETDEIGEALDDLHDIC
QLGYDEEDNEEDLSDALEEIIEYVRTIASLFYTHFHRPQAQEKPVLH
>MS1289 unknown
MFSLKRQQGASFEQQARLFLESQGLQFIAANQNFKCGELDLVMLDGETIV
FVEVRQRKNDHFGSAVESVDWQKQQKWINAASLWLATQNHSLEDTDCRFD
LVAFGATASNVQWLKNFIE
>MS0984 unknown
MSAKLNKFNNEIIPLWDEVLKCVNEIDGVKIPDVYAENFAYLKKIITFLS
EAMSCIDADYLPNDSLNNIKAYLVNIKSYLTNSQNYSNSHVQNVENRLDE
LLKIIFPFILHKGKAIKGLRLGLNEYSKAITDYVENKFSEIKVTQENIDA
IENKLNDELGKFSALREELEEYGESIFSENGVKDKIEELLNNSESKLSEI
EELHVSIYGEDGLKQEIDNFYSNISNQNEAINELKEDSSVTLQSLEDFYN
KIFGKEDENGKKVGGLKQEIEQRKIELDNFKQKQQERYEELNKQIENLLP
GATSAGLSNAYNEMRNKFSGSAKWYGWGFYGSLIVLSVVIYCVRDLLIIK
EIPLDKGLGISLLALLGNFAVKLPFILPALWLVIFVSKRRSEAERLTQEY
AHKESLAKSYDSYKQQIEKLSEEDQNELLPVLMDNMIKAIALNPAETLDK
KHQSDSPISEVLKDKNFLTSIADRVKDMSSNSK
>MS0404 unknown
MDQQISFDEKMMNRALFLADKAEALGEIPVGAVLVDERGNIIGEGWNLSI
VNSDPTAHAEIIALRNAAQKIQNYRLLNTTLYVTLEPCTMCAGAILHSRI
KRLVFGASDYKTGAVGSRFHFFEDYKMNHGVEITSGVLQDQCSQKLSRFF
QKRREQKKQQKATALLQHPRLNSSEK
>MS1648 unknown
MKTYRFTLSPKSAFGTPLVGDSLFGQLCWAIVNRFGEAHLTELLAGYTEQ
RPFMVVSDCFPQGYLPLPTLPSRFWQTDESHKADRKKLKKVQWVRVEDTQ
QQAVKFWQEFAISADFKFEKESQDQYHNTIDRSTGTTGEDIFAPYATELT
WYLQTQQLDLYIVLDEDRFDLDDLKQVLKDVGDFGFGRDVSIGLGKFSLA
DEVQAVEFSPQNANCYFTLANSTPQDLGLNKENSYYQITTRFGRHGDIQA
LSSSPFKKPIILAKVGAIFTPNEYKVRSFLGNGLGNISNTQPNAVHQGYA
PVLNLFVDFENKEKQ
>MS1576 unknown
MMKKWLLTAISGVFLTACGSSSKSGNDLKYIAYQDLDGKTQQVAFLKTLS
TENNADPKTSISGEALQKAKFGNLDTHQKIGDIYTIYDAQANPMNVIFVI
PSRGKSFSPHKADDMAQLAKEKSFDFYEFGKARIAHSQFSAKSAICRDYK
AKSGVDVKIATTYYLDSGGENYLATLVGAQASRKNGEIRKFTYSPSFNID
NKKLQEQIQREVSSHGEKVAKSNVIEKLSVLENIVCR
>MS1106 unknown
MRICKNISLRQKTSQKSAVKFQEFSPHFYQTQ
>MS2252 unknown
METFIHGFLVCGGLIIAIGAQNAFVLKQGLLKNHILAVILTCFICDIVLI
SLGVLGLGSLISESREATVALGIVGALFLTVYGARAFRSAYLGNSSLEIQ
SQRQDNTSSAWKAVLATLAITLLNPHVYLDCFAIIGGIAGTLTPDQKILF
LCGALCTSFLWFFSLGYGARLLIPLFKRPITWRILDFVIGSVMWLIAFGL
AKYAYQLA
>MS1162 unknown
MFFQAGSSKVEYQGIAGAFSQGNWGVAKALSTAVLGQVSDKGRDSGITTS
SVNTKNILIRDGENQQRLFGENVEETVRKLNRENLHQTVNKTDVEKVKSD
LERDLDVATALVKNISDSGDELYYNAEKNEDSSFTVSKKTPDCEHISCLD
IENDNSQQLKALIYSDNILTEEQAKLLSKISIAGMLNFTREEKVASAILY
GDDLASLDDLGVILNRGSAGYWNEFLYAGFERFRAWVNMPTVFGASNATK
DHAQIAKKLDEYNAYAAANGKPQYKLQDMAHSLGVSENKNMLNWSNYLNQ
DYKNTELDYLHAAGSYPSEEIDRQAKSIFAGVTTRYIGVDGDRVYSGILG
GYLIGNNKNAMPNNGISGLEAHSEANQNINNLKYIYDENNSEQSKVMERT
KKLLKLTYPKDSIREFNTIKKDGDGL
>MS2098 unknown
MEKSFAEFAIVQAFIAKFLQKPTVCPFNAKMPSVTQFQTAFQSG
>MS1179 unknown
MNISLFSNIMHRVLFEISSQGVTWNLFKTHQSMQNSF
>MS1636 unknown
MRDRYLIVYDISSSKRRYYTHKYLSAYAVGGQKSFYECWLTNRELVEFKQ
KLINCIDKQEDKLFIFQLNKDTQPQLFGCASLPKFNQPYLII
>MS0027 unknown
MDFKTNLDALPAIDRLSGLDVIKDNEVIHHIPAVAGKLGSLRVYNALAAQ
FNGKLDRTSAQKGVEIFAEHSADAKQNPNKHPNIDLLFDVINNDLTYKLQ
PIEK
>MS2380 unknown
MRLKNENFYNEEDCLIFLIKIPDFRPHFVKISR
>MS1389 unknown
MQSDSELSDGISTFTTPPLSPKFLQKPLGKR
>MS0452 unknown
MTHTIEYIKDLMKKRTFTVKNLTNFTVNERSFILIVFKYLARKSAVNFRQ
IF
>MS0382 unknown
MPVIGVVADDLTGATTTGVLLARSGSKTTVCFNTEAAIKSNAEVPSDSLL
ISTSSRPLLKHEAYKHVKEATQVLKNMGVQYFTKRIDTTMRGGVGVEIDA
MLDTLPENTIAVVVPAMPQSRRILVGGYSVIDGVALTNTPVARDVRTPVR
EGYIPALLASQTRRKVGLVSLTDVMHGVYEIRIALMDQITQGKQVIVVDA
ITLEDVSNIAQACLMLDNPILAVDPGPFTAALGYQRGLIHREEPNIPQTD
ATCAEDKTVLVVAGSATPVTKLQMDTLCQDPRNISISVDPVLLIEGGDIA
DTEAKRVVGLVKDYMANDVRPRAILLETALHGPLLNLDAEDAKRKFVRGE
SAERINAGLGMIVSDIFKQIGHQRFAGIFATGGDTMVNVCNQLNVSAIEM
IDYVIPQSDIGRLVGIFDNKMPIVGKGGLTGDEYTACKIVDRLFLEACRD
K
>MS1027 unknown
MCARRVGILAHQKIITIVNGGQECPPYRKIMSGFNYEKNLPHQEQGIQAV
LGVFDHASRRFHQPDENPQIVFGQNQYTANLQKVQNENGIDRTLSLNSDG
INVLDISMETGTGKTYTYTKTMFDLHRMLGVFKFIVVVPTLSIKAGTQQF
LQSQSLAEHFEQDFGSDYQGVRLKTYVVESQKATKGKKTHIQTAIDAFVK
AENRQEIHVLLINAGMINSPSMSNAGDVALKDLFDNPVEAIAAVRPFVIV
DEPHKFPTRESAKTWKNIKQLNPQYILRYGATFNEQYYNLIYRLTAVDAF
NDGLVKGVRVFQEEMQGGMEASIKLLSLDGKEAVFELTENGKSKKFALSK
GDDLAQIHSAIFELKIDALNKTTLVLSNGLELKRGALLNPYSYAQTMQDA
MMQRAIAEHFKLERELLVERTTKIKPLTLFFIDDIKGYRSGNEISGSLKE
KFESWVKAEAERRLKNETNEFYRAYLQQTLADLSLVHGGYFSKDNSESDD
KIEQEINEILHDKQALLSLDNPRRFIFSKWTLREGWDNPNVFQICKLRSS
GSQTSKLQEVGRGLRLPVNELMERVREPQYKLNYFVDSSEKDFVAELIGE
VNQHSFSETIPQKFDEALEQKILQKYPEIEPLDLMFELVEKGIIDRKKVF
TENGYTRLKVAYPQAFEQTLKKDKIGKAGEGKDTIKMRVGKYEELKALWE
LIHHKAILQYKIGSENEFLALFTAYLRENLTKFKQAGIRTAINETYINNG
IMLNRRKENLENDDFIRFNTMSYREFLSELAVSAKIQMNTLHQAFYALRD
ELNISEFMNQQTINQIRGGFNQFLLNHSFSKFELGYQLVNNRIHPTKFTD
EKGCAKEVNRADLGIFGDTEKRPSENYLFDEIFFDSEIEHQNIADNEIEN
VTVFTKIPKNSIKIPVAGGGTYSPDFAYIVKTKTGETLNFVIEAKGVESS
DILRKSEERKIKHAEKLFTKIAEKVQVKFLTQFEGDMVAELIRRNI
>MS1997 unknown
MTEFKLNYHKTHFMTSAANIHQLPKDEGMEIAFAGRSNAGKSTALNALTN
QKNLARTSKTPGRTQLINLFEVEPQYKLVDLPGYGYAAVPEQMKLQWQKS
LGEYLQHRECLKGVVILMDIRHPLKDLDQQMIEWAVSSDLPVLLLLTKAD
KLSQSARSKQVKTVREAILPFQGDVQVEAFSAQNKIGIDKLAAKLDSWFS
SLLTE
>MS0102 unknown
MSKQQTVELDLNPIMQALSRTPMVLLGYQKRWCEDTNPVKVVEKSRRIGL
TWGEAADCALLAASNSGMDVWYVGYNKDMALEFIRDCANWAKFYGLAAGE
IEETEEVFKEGDEKESILAFTIRFASGWRITALSSSPSNLRGKQGLVIID
EAAFHPCLSELLKAAMALLMWGGRVHIISTHDGVDNPFNELIQEIREGKK
PYSLHTITFEDAMKDGLYERICLRTNRAYSKEGEQQWEAEIRASYGEDAA
EELDCIPKNSGGKWLSRALIESQMHSHTPLVRKEMARDFELIDEPVRAKE
IAQWLQEEIQPLLDDLDKNRPHFLGEDFARKGDLTSLVIAAQQPNLTNEI
QFIVELGNMPYAQQEQIVLYILKALPLFSGAAFDGGGNGGSLAEKARDAF
GESLIHIIQLSEKWYKENTAPFKAALEDGTLTKLPKNADVLADLRAFEIV
RGVPRIPDKRAKSVDGGKNKRHGDTAIALLLLHFATRQDVRLPVVAVTRR
ARRSQTISEGY
>MS0386 unknown
MKCITFNSKIKFITKSAVKFHKNLPHFSLCPFFCYTTKSSV
>MS1675 unknown
MSEQNTFSSPEHITVLLHEAVDGLALKDKGIYIDGTFGRGGHSRLILSKL
TENGRLIAIDRDPRAIAAAEEIQDSRFHIEHNSFSAIPYICEKLGLVGKI
DGILLDLGVSSPQLDDAERGFSFMKDGPLDMRMDTSKGLSAAQWLQQVTE
EDLAWVLKTFGEERFAKRIAHAIVNYNKSAVQNGTEPLTRTLPLAELIAQ
AVPFKDKHKHPATRSFQAIRIFINSELDELESVLHSALTVLAPEGRLSVI
SFHSLEDRMVKHFMRKQSKGESIPKGLPLREDQINRNRTLKVIGKAIQPK
ESEVFANPRSRSAVLRVAERIG
>MS1757 unknown
MHIIKEKLAKSLMFVVIIALCITVMSIILFGINQFKIGSQLASVNQVSNL
SHLLVRQQANLFSMLLVNNAGNEQLTDNLENLTKDKFVLDASIYGKNGEL
LAQTRNTLDLREQLGLNEESSKHHVVNRQQIVEPIYSPNGIEGFLRVTFD
SKYGQTTQNKINQIFHRLYGELIIVFLAGVILASSVHYFLSHYRRARRSQ
ITEQINTVKEIKNSSALVFHRRRRRYR
>MS0786 unknown
MNSDLKEKLMTTPFKPELLSPAGTLKNMRYAFAYGADAIYAGQPRYSLRV
RNNEFNHETLKQAIDEAHSLGKKFYVVVNIAPHNSKLKTFIKDIQAIVDM
NPDALIMSDPGLIMMVRENFPDMDIHLSVQANAVNWATVKFWKQMGLTRV
ILSRELSIEEIAEIRRQVPDIELEIFVHGALCMAYSGRCLLSGYINKRDP
NQGTCTNACRWEYKIEEGTTDDVGNIVPKDNVQKYEPEIVVKNVSPTLGE
GATTDKVFLYTEPNRPDEQMTAFEDEHGTYFMNSKDLRAVQHVEKLTQLG
VHSLKIEGRTKSFYYCARTAQVYRKAIDDAAAGKPFDTSLLDTLESLAHR
GYTEGFLRRHTHDEYQNYEYGYSISERQQFVGEFTGKRNAQGMAEVAVKN
KFLLGDEVEMMTPKGNIVFKINRMLNRKNEEVEAGLGDGHFVFLDVPADI
ELDYALLMRNLTGGNTRNPHQK
>MS2174 unknown
MIMFKKLLIATALCASFSAMADDSFTLKVKGVENGKFQNKHLLSAEYGFG
CAGENISPEIEWKNAPKGTKSFVLTVYDKDAPTGLGWVHWEVVNIPANVS
KLPAGIDAKDNNLPKGALQTRTDFGVPGYGGACPPENEKHRYEFTLTALK
VEQLPNVTADSTPALVGFFTNANAIAKAQVTVETAR
>MS0831 unknown
MMNYVAHAKDQALTAHHDLFSYHPMPFYEDTEQTRSRFHKKLDLNLYCIK
RPQQTCFIRVQNPDLMAWGIEQGDMLVVEKNDSLSIGDLIVIEVNQKLEI
FEFIAYDKNEFVFLSLSSKLNNIRTANWSTLPIIGTVTNTIHQMKPKNTI
SFAA
>MS1860 unknown
MQNQSEVSSQKQREITRLCVHTALLLLQHGAESALVVALTTRLGLALGVD
SVECALTPNAVIVTTLTDGHCLTTTRKNIDKGINMKVVTDVHHIVIAAEH
RIYSLEQVKSKLENMKPIKYNRYFVVLMIGLSCASFAHLSGGDNLICLIT
FIASSIAMYIRQELSVRHFNPLIVFCCTAFVASMISGLALKFQLGNDAQI
ALASSVLLLVPGFPLINSLADILKGHVNMGLARWSIATVLTFGACMGIVF
ALNVLNIANWGY
>MS0312 unknown
MIMKLSNKFSLAALTVASVLLAACQAPSSVLTFAPHAPNTTLNVSNQNAV
VAVVTKDERSQKQVSSYVRDGALFPLTASPEVDTIFQQVMQQNLNSKGFR
LGSANAANTHMLVSVKDFYAKVEEGNLRHKINSKIQLQIHVQGVKGNFTK
SIGTTRTDEGAFTVSNEDIQKALDAALKDVVNGIYADQDIGNAIRQYSN
>MS0769 unknown
MSHFTSIFIGVQFSNIFYAFLRKKSVKLLNFLSG
>MS1387 unknown
MKQLENVRIFGGEQQVWQHQSATLNCTMNFAIFLPKQAKTEKLPVLYWLS
GLTCTEQNFITKAGAQRYAAQHKVIIVAPDTSPRGDDVADNESYDLGKGA
GFYLNATQQPWAKHYQMYDYIVNELPALIAEHFPVNGKQAISGHSMGGHG
ALTIALKNPQRYSSVSAFAPIVAPTQVPWGQKAFQHYLGDNQTQWTQYDA
TALVNAETRLPIRIDQGDKDSFLTEQLRPELFLDACRAHHVACEYYLRQG
YDHSYYFIATFIGEHIAFHAKALYQDSEALPL
>MS1866 unknown
MRGKEWSEMALVLSKYGKNFAPHSGDFFRYISY
>MS0701 unknown
MLTNEVVISILVLLILSLLRINVVIALVISALTAGLVGGLGITKTIETFT
GGLGGGAEVAMNYAILGAFAVAISKSGITDLLAYKVIKRLGNRPTGSSIA
GFKYFILAVLVAFSISSQNLLPVHIAFIPIVVPPLLSIFNKLKLDRRAVA
CVLTFGLTATYMLLPVGFGKIFIESILVKNINEVGAALGLQTSVAQVSMA
MSIPVLGMILGLCTAIFISYRKPREYIVKIAEPTTAEIEQHIANIKPFHV
MASIVAVLVTFGLQLFTSSTIIGGLAGLIIFAVCGIFKLKESNDIFQQGL
RLMAMIGFVMIAASGFANVINSTGGVTELVNSFSQSVGADNKGIAAFLML
VIGLFITMGIGSSFSTVPIITSIYVPLCLTLGFSPLATVAIVGVAAALGD
AGSPASDSTLGPTSGLNMDGRHDHIWDSVVPTFLHFNIPLLVFGWFAAMT
L
>MS1454 unknown
MIRKLTQGFTPKHYLVEILFGLTALLGFYLIIAWSSYSPLDTTWSVSSFQ
PEIINKAGKFGAWVIDLFFVLFGYVGNLLPFLLLIAPIYFIRTKRVDSLT
WTRFSLRMFGFILLVCGLTTLAALTLSNSNYHLAGGVLGGSIVKLVYPSF
GKFGLLMSAVVFSIIGFIFCSGASLIRLLMRFYNWLTEKNEESSLVQAQN
DEEILQQEDEDIQDWIDGDIDRQQDLIQSAEDLQSHRDMITPAHRGINIM
GLSTPSQFTENTEDDETPNPENFGGYAVDEIDNLPEVTISSQNANIDLPN
ENNFTPMWQKQKSLAENMPEFLDGENTGVVLSEEEITRDLLTQVHIPEVK
LTPAKLQHPLTENSAVTKQAAYGMGESESFEDDNMADLAAQFARQEAERE
RIRLEKAQAMGLADLPEPQVSLQPTQPNLFESDEEAETEETNGLRTISID
QAIQLFGDHKPLIKPTTELPSLDLLDKRTSHVQEITPEEIHETSQRIEQQ
LRNFNVKATVKDVLVGPVVTRYELELQPGVKASKVTNIDTDLARALMFKS
IRVAETIPGKPYIGIETPNAYRQIVSLREVLDSDEFRHSKALLPMALGKD
ISGKPIIIDLAKTPHLLVAGSTGSGKSVGINTMILSLLYKVKPEEVKFIM
IDPKVVELSVYNDIPHLLTEVVTDMKKAANALRWCVDEMERRYQLLAKLR
VRNIEGFNERIDEYRAENIAIPDPLWKPGDTLDSVPPILEKLSYIVVIVD
EFADLMMVAGKQVEELIARLTQKARAVGIHVILATQRPSVDVITGLIKSN
IPSRIAFTVVQRNDSRTILDQNGAEALLGRGDMLYLGNGTTDLVRVHGAF
MSDDEVVRVADDWRARGKPNYISEILESTGDDDDDNGLSGEGSEDLDDLF
DEVMEFVIRTGTTSASSIQRRFRVGFNRAARIMDQLEEQGIVSEMRNGKR
EILARNPDY
>MS1125 unknown
MEILKILTALLSFFIFHLVKIMQLDREFWKHKSLLEMNEKEWEALCDGCG
KCCYRKFIEGGGRRERLYFTRVACNLLDCETGKCRDYANRFKLERDCTKL
TKKNLPDFGWLPKTCAYRLLYENKPLFDWHPLISGRAESVIEADILIKNG
IHEKEVIDWFEFVIDEE
>MS2078 unknown
MQRAKIALKVNAAYIFNSVPNRVRFFYAENPYFSTALFL
>MS1010 unknown
MSHYIYLMQNGGINPTLRRNMPNYRRDFTTGGLYFFTVVLKDRSQDYLIK
YINEFRQAYKITQERYPFETVAICVLPDHFHLLMQLPENDSNYSVRIGFL
KSQFSKLLPLQCRKVSESDQKQGDAGIWLRRFWEHLIRNDEDLANHWDYI
YYNPVKHGYVQYVKEWQFSSFHRDVDKGIYPKDWSGCPDLIIKGEM
>MS0555 unknown
MGFKCGIVGLPNVGKSTLFNALTKAGIEAANYPFCTIEPNTGVVPMPDPR
LDALAEIVKPERTLPTTMEFVDIAGLVAGASKGEGLGNKFLANIRETDAI
GHVVRCFENDDIVHVSGQINPADDIDTINTELALADLDSCERAIQRLQKR
AKGGDKEAKFELSVMEKLLPVLENAGMIRSVDLDKEELQAIKGYNFLTLK
PTMYIANVNEDGFENNPYLDRVREIAEKEGAVVVPVCAAIESEIAELDDE
EKVEFLQDLGIEEPGLNRVIRAGYKLLNLQTYFTAGVKEVRAWTVAVGAT
APKAAAVIHTDFEKGFIRAEVIGYDDFIQYKGEQGAKDAGKWRLEGKDYI
VQDGDVMHFRFNV
>MS1343 unknown
MILDMINSYVFSLFIQLYNEQKNKSYIKISLSIKIPKKVQSNGIKYDG
>MS0844 unknown
MKFRLTALAVAALLTSTASFAGVVTTSSNVDFLAIDGQKASKSLIKQARS
FNITDTNQHQVVVRVSEIIRGGSESNLFESDPIVVTFQGTTEDIQISAPT
LRSERDVEKFKQSPVISVTTASGAAVQTKQEYLTQEGFLPSVNLVENLSN
YNASGAKAAVASFATTTMPTAMGTTGAGKVAKGKVTVQGENAAEQMLQYW
FQQADKETQTRFLNWAKKQ
>MS1014 unknown
MKMYKTLKKLTALLLVTQSAWAQEQFEEKFVSLTLCSDRLLMEIARPDQI
AAMSPYSKNPLMMPDKTNRDKPTIEPRLTALLPYLDKTVLINEHFYPQLT
ADLKKLDVKIIPINDSPQTPEQLFELIIRLGKLTQNEEYAERLVTELKTQ
HFNLNQPLPETLILSETGIVDAFLPQYQTLLQLLGLTPLKTAISTQNFAL
EKLLLSQPNLLITLTDKQGYNEQAKLLSHPLLEKLFKNRPHFTLPMKYTY
CFDHGVWQGAKVIYNQPHNSPL
>MS0678 unknown
MRDNHYFLLYRLKNNLVLMDIQKTMLLNSKGVKRISGEWQNSGVTMSY
>MS0907 unknown
MNKMNQTLLSLKQELKKILTLIEDKNDVLYFDYPMHLNVGDLLIYAGTER
FFKDYGINIRLRRSLQAFEINEVRRYVNKNTTILCHGGGNFGDLYPLIQK
LREDLVINFPENRIIVLPQTAHFSSQEALEKSAAVFSKHKNCYLFARDTA
TEKLMRAFSANVQLCPDMAHQLYGTLPFRTKEQQKSAENPQNILYFLRKD
IEASHIEKAVQSRLSAAAVVKDWEDILLPKDMRFEKFCSKLGKLANILNL
GFMKDLLNHIWYKYSLNVIERSRKEFSKYDLVVTSRLHGHIFSCLLGIPN
RVCDNSYGKNSGYYNQWTKNVDYAEKYE
>MS1861 unknown
MISQLFNANLKFTYKIAQIRKAEKWNKNKTRKKPL
>MS1416 unknown
MYIINIAVNDHVSAEQHDKLFAEHAEWFKKYFQAGTFLMLGPFKDQANAG
VIFAVTESRAELDRILAEDCYYPNLASYEIREFEPKLIASNIAEFTGK
>MS1996 unknown
MIFHLVNETFCIDVQKSEKNTALWVTKVAKCGRFLRIFIGLLT
>MS2137 unknown
MEHEVCHRHLTSQDKCGKAGEQAQDDEDSAQGFDDAAYAH
>MS1781 unknown
MAHASSVCLLILTTKKYSFIFMLTNQLTIGLYLIAVLQLIYLSWTDIKSR
IIGNRVIISLFFTMVALSWLKYEQVFVLQGAIGLAVCFILFMLKVMGGGD
AKLIAVLMLSIPPAQLISFFFLTAVFGLLLIIIGWLFFRQSIKQKGLPYG
VAISSGYLATLWLFAS
>MS1327 unknown
MNMKKVTLTIAMIVGLGLTACSGSQKQYDDGYAGEILFSQYEGSNLKLTV
RYNNCDGKEGKVENLVITQPYDSDLPVGACVRVSTAEDGTKNIRNISRSV
SRSWLSRTGIIR
>MS2184 unknown
MGKNTMDNQQDSMPEGFSKFSWAIAAFCLPVFLWPLALLVSTNLEKNPAL
SQQQSMSMSMFLWLYPLLLAVMARICYKLHQGSPKSAKRLLMTSAVIFYG
ILFYVARVGFSG
>MS1947 unknown
MNLQKLLLTRRFISTMAYFAMQSVFFIYLPQSFASDWAAVCSEPMQEPAY
>MS1258 unknown
MIHIQTKFFKKITALFLQCKEQKIISYNFYEYLND
>MS0653 unknown
MSKLKALRFNFLGLFADCLLPPKTDGLINGYGK
>MS0080 unknown
MGNGKMAKLQYPAIIETDKKFTALADLGKRLNSLDKSQIMTSFTYLVPTA
FLELLAEKWSVTGYDGWLLAESEDAKRKLIKRAVELHRYKGTPWAIREII
RQLGFGEVEFLEGLFDKRRDGSFVRDGAYFHGDRSKWAHYRVILKTAITN
EQAALLRKTLRVFAPARCVLASLDYRTVALQHNGKATRNGQYNRGTA
>MS0340 unknown
MLMIAKKEYYYGLDQLRALLMLIGVLTHAASVISPFYRWDYHSDRYQDAL
IHNIVHVAHFFRVEAFFLIAGFFSAMVLLKKGKHYFLKGRYLRVFMPLIS
SILLINTFEVWFVVRHDITPWENIGIGNFIVHAWFLLTLMIISLVCLLPV
DKFLDYLSGFNRFIKLGLFIFYMYLPFGIKFVLNMFVPMADHPLFYSFYG
YLIEKTLYYSIYFFIGYIIYRSEAVRIFFNKKTVKALLWGITVTGLTYQT
LTIGEAKESLPFTMRAINVFIQHASAISVSLLLFNFFFSASFPPSRSVAF
LVRSAIIVYLFHHPVLIVLGYYFDVPGMTPFVYFMILVTCGYLLSFLSYL
IINGNKLTRFLFGLK
>MS1226 unknown
MNLSGCNKHQHYLNFIGTGLYLSLIFYISMLHGTRLFAVFLNIPQCLQYI
LFILLEKI
>MS1093 unknown
MIMHFACPDTKRFFNGERFVRFISCERLAIRKLQQLNAATSLEFLTKLPN
NKLETTLYNHVSYYNLKINEQWSLLFLWDHNSPTDVKLVDMKEV
>MS0259 unknown
MLHLVLEDRHIIGQEKRLGVHSTEQDHLAVVFEDDGETGYFYAINTQEAQ
PVVDSLSVYNVNGIESLQEPRQVQICWSEDGNRAFLLVNGYPHAAFDFTR
LIGYNHSKYPLPELGSMWSHENITDKLVEEWLTP
>MS0375 unknown
MKKILILAAVSFLAGCVGSSTLPQKNAEKLPHIEVPKEIVHNGKTYYLRA
QQDLGSVARYVYLENKENLKNWKSEIEILNDRNTEQRSIADRIALREKVY
KNTGVEHFQLMEKDDSLYAFVIYAPSAQHDDWQVDVAKGENVMGCGFAQY
QYSLKIPKTKKLMNMGKVKLIGYLKKYAVDKEMERLSTTKWNWVCRNNE
>MS2366 unknown
MVNYENNTLNKKKKFENYQKNNKLFTNIYL
>MS0915 unknown
MLKGGDFSLFQCFFIDKTKGFRSLLEKIQKNTRFFE
>MS0965 unknown
MKSAVKKPEILNIYEILKREILTLVMRRNYMPNITNNKKAFLVFH
>MS2124 unknown
MGFNKYYLKMEEHFLYVPYYNHQRRIRVLLPKDYYKEDWQSYPVLYMHDG
QNIFYSKESYSGYSWKIIPTIKYHKEFPKIIIVGIDNATVDRLDEYAPWR
TDVGNTAEARNTGGKGAEYGQWVVETVKPFIDGHYRTKPQRENTLLAGSS
MGAIITAYMGAAYPHIFGHLGVFSLASWFSENEFLRFMHEHPIDRASRVF
IQVGTKEGDDADAQYISNMNQAYIDSTLYYYQALIRTGHPLDNIRLKIMA
NEIHHEKYWASHFVDFLRFSLMGK
>MS1064 unknown
MIRRVAMSDFGYAMMMVVLSLVIVVGLAVAIF
>MS1115 unknown
MFRKYGFIFKFTNRFKQYVKVRSKFLKFLKFTENQPHF
>MS0109 unknown
MNNSEFLTYAILTLGLVMAIPMFVRMGEILSQKVRLMLFPVKKVKIRRWH
NDIFMGYGELDLTSSEPIIAQLDRIDAELKIRKENER
>MS2298 unknown
MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ
GYVLAGSKPISPEQQRIRELENRIKELEEDKDILKKATAILMSLENKNTK
SLRR
>MS1156 unknown
MEQHIIVGLIVGACILYVLRKFVFKSKTAKNSLCGGCDRCGGKKGCH
>MS1214 unknown
MVPEVLAVSAEVDLPVAVLVAVASVEVAPVEVGKVDKFSRF
>MS0001 unknown
MKGGDMSAIKDRLKDIDCALSDLERERKEILLDAGAPEIIGLKDDINALT
VSLEYIDDEILPLLQQLSIDPDAYKYLSEDIKLSLLRDLPESVSAIKAII
NKLTPVKHCIESFNHQNDIGF
>MS0415 unknown
MRNMKLAQHLALLKNRQLINDELSKIMLEIEQRLISHWHVDVTTKQVEMG
LLHLAMALGRIKRGYAAQALHKDIFAEIQSAVCFPKVLKIHSDILALIPF
PIPESEQTHLIANWYSLVIAQPWVLNIT
>MS2224 unknown
MLFEMAWQTTKINFFDRTLIKKSMTKASLTGKSGYN
>MS0321 unknown
MKTINLCLSFRLASVYLRKRKSMSSISFLISTALGLYIFVLMLRMWLQYC
KVDFYHPVSQGIVKLTNPVLTPLRKAIPTVKNIDLAALFFVFVLGMVKVP
LLYIANGQWAAEIIRQEWLQYVLIGALTVVAAFGKMIFYVIFFGAILSWF
NRGNDQFSYLLYQLGEPVLSPIRKILPRTGMIDFSPMVLAFGLFFADKVL
YDIFGILWQLAS
>MS0103 unknown
MAPRSSIEKLPEDVRRWLERALTENGFSGYVELEELLKEKGYQISKSAIH
RYGQKIERRFKAIKDSTEAARIIAEGAEDKEDKRSEALMGLLQSSLFEAL
VDIEDAKEDEKMSPMEKFQALSFAGKNVASLIQASTKLKAYQAEVKQRAE
AAAKEVERVVKKGGLSDEVADEIRRKILGIATK
>MS0184 unknown
MRNPIHKRLENFETWQTLTFMACLCERMYPNYQLFCKVTEQSENGKVYQN
ILNLVWESLTVKGAKINFDNQLEKLENIIPDVNDYEFFGVVPALDACEAL
SELLHAIIAGSVLEQAIKVSQLSLQTIVTMLETQEDHELTETELKASEDI
QQELDVQWQIYRTLKEQEERSVNLILDLKNELREAGISNIGIEIEH
>MS1623 unknown
MWLIRIVYLQNSIFKIQFCFIRANDCRHIKACD
>MS0111 unknown
MQQFLSAIHGGQFGQVINNYYAAPDCWQALSTNELHNAIKITKQKRQLAH
RNKWKNPAVIGGMSCTLFAMVVWVGNLWYLFSDYSRLSTPNSIFSYIAGG
LLLASLLCLYFAAPQIRREKIFIDRCNHVIDICEQLIHERKYD
>MS0660 unknown
MASLRFLKFLLIIIFLGLCIVTIDNIFIVSDFVLFGIFLFLLFLEVIINP
KRNFINSLVLLAVFLNILGVFAIEHSEGSYYLYEVEQWITEEGSIPLLLI
YQFFFLQGVFLFVQERKIENWNITNIHFKSFFILSLTFLLLLTFGLIAKY
SPAPVLRVDRFVYDKEILGKFGQITNILFYYSLGLGILYFKEKKKIYLFL
LFFIELAFLLKGHKFWNLIEILFLFLIPYTSNIIRAKVISLILAVGSVMT
LFIVAAISINVYYFPSFDPVDYARQRLSQEGQLWWSTYKGYEPKLRTDEL
YAELKNYTNLDEYNQYDIGMYKVMRLNTTPERFEWKLDKLSRYVYSTPPL
LYYYLGAGLGILGMFLLGMAFSFWGNLLIYTINRGDLFLSLIVSRFFYIF
RKAAKDGDIYKFFSIEFMLLILISLSFYIFYKYKDDLYKRKSLIAVNGV
>MS0666 unknown
MGESKLAKIEHLIAEINKLHCYFTNDYFKLGKYQKIDFNNGLTKVPLEHI
LSYRLNLHESNNDYLYCADLYDIAYFYRVKTSESILDKIERFKQRSEG
>MS0547 unknown
MNWFVKPIAKTAKKRPHFWGHRWNLSGLRFKIPGKS
>MS1822 unknown
MLKDKMMDALQADWIAPGNIHGFTTYRQGGVSQEPYTSFNLGNHVGDDPN
AVKINRNLLVENFNLPQLPVFLTQTHSIRVITLPFEGTDLDADAVYTAQP
NQVCVVMTADCLPVLFTNKAGTEVAAAHAGWRGLCDGILEETIKCFQCPR
DEIIAWFGPAIGPNAFQVGEDVMKQFVAQDNKAKQAFIADPNTEGKFLGN
LYQIASQRLHNMGVTNISGGEHCTYEEQDKFFSFRRDINTGRMATVIWFE
>MS1166 unknown
MLLKKMKILHKSFLHLHKMDILDIASILAFADNN
>MS0822 unknown
MYGKSTFKLAQLAILISGLCSSCAISDYVSPHSSNKESIDVDLANQQIEQ
EKMAEDARISAEKQRQAEAKLTEIIGERDLQFKSAVAKIYADNEYALLWQ
DKDAEKKFLREYAAMVASGISVRSARSLEAISATNAGDNPVYDILLTDAF
LDYMYYAKNVFNSAQNWLYTINGYKPAKPGETDVEEWLSAVKNGQNFAYV
NSLTTNNSIYQQTIDKIGSSDFDDDKSVNSAILYKLALNAQRLRVIPNFS
NGIFVNIPSYQLNYYRDNQLILNSRVIVGKKERRTPVMYSKLSNVVVNPP
WNAPTRLINEDIVPKIKKNPGYLSAHGYSILDSKGNKVNPNSINWAAIGS
KFPYRIRQDAGDNSALGRFKFNMPSSDAIYLHDTPNHNLFNKQDRALSSG
CVRVEKSNQLASILLKEAGWSEDKKQRVLNSKKTTSAPIYSDNPVYLYYV
TAWVENGQVNTLPDIYGYDIVQQPSYVNWHTVKKYL
>MS1896 unknown
MSKKHQILPQTRWTATSFWSLEFRSLSVLLLSFVIVGIGDGLLLLSNLGS
APWTILSQGVALQGGFGVGWASLLISIMVMLAWFPLKLKLGLGTLLNILV
IALFLGITTAYVPAPTSLLGRLVFVFIGVFCFGVGTAFYLTCHQGAGPRD
GLMVGLCQRFHWRIGIVRTSIEVTVCLLGFLLGGTVGIGTVVFALSIGWV
VQLSLMVINRSPCLLDNT
>MS1854 unknown
MRWQGRRESTNVEDRRSERSGISMGGKKTGVLGFIILLVGAYYGVDLSGL
VGTSSNIGEVGSSLSQNEEETLEKLSRVVLADTESTWQDYFARSGQKYSA
PTMVLYNGATPSACGTGQSAMGPFYCPNDHKVYLDLSFYNDMKNQLGAGG
EAAFAYVIAHEVGHHVQNLTGILPRISRLQQSNPAQANQLSVNLELQADC
FAGVFGYQAVKNNMFEASDLEVAFAAAEAVGDDRLQKRSQGYAVPDSFTH
GTSQQRLTWFRKGLQTGDPTQCNTFTN
>MS1016 unknown
MTVQINKLDPDAAIDIAYDIFLEMAPENLDPADIMLFNLQFEERGAVEFV
ETADNWDEEIGVLIDPDEYAEVWVGLVNENDEMDDIFAKFLISHREDDRE
FHVVWKE
>MS1362 unknown
MLGTGAEILNIRAELPPLGKSRFNLIYISHCVIRLSFPIKFALYCIKPNE
QKGNFMESKLIKQITPAISLHQYNEIPVIKLNHAVGQAEIALQGAHLFSW
KPAYCPQDVLWLSEIEPFKLGTAIRGGIPICYPWFNNAGTPSHGFARISL
WQLSDYEVSAEKVRLEFSLFSEQRLIIAKIQFVFTGECEITFTNYAEENA
QAALHTYFRVGDIRQLELYNLPTRVFNSLTQTEENVPSPRTIGELVDCVY
SAELGATLIQDNQLNRKINVEHINASDIVVWNPWHKPTGGMSETGYQTMV
CVESARINKRLNSGERLGVKISLR
>MS1164 unknown
MCGDKKRQATVIASALSLAVAGKRAEAIAAGAVSPYVKEVIKKATDSPEM
QALNIPLHVLWGEVEAELAGGKAQTGAIAAGVGEVGAAVLAKSVYGKEAS
ELTVEEKQTLLNASKALAGVASAATSANGNAASTLAETSIGMTVAENAVE
NNYLSQLSDNRRIWLREQLNRDDLSSVQREKYEQEFIQLEQDNHTSDILV
AKAKYNPESMTQSDWELYQNYATRYYFESIRTEKPENVIADLDNILSNQY
IKGYSYPYATAEKYRHELPSRWSLFGTNKSADEQFYTDIYSKYQNRKTYQ
ESFDGRVAQSTAEALSYAGTMLSAGTVASVASKVGKFTSNGINKASSAIG
TFATKYPKAAEGIVVGSISTGFDLYNGDASPEKTAMNYILGRGLAGKSWD
KQLSVNAIYKGVISVNENRSDKDIVLGQVSNAIALGSGESVEGLLNLVGQ
KGISKQIISNIVSGYVENKIDNRSKDSKEIRKEGDK
>MS1190 unknown
MKYYQKALYDEMTSFYLITCDDEGDKLGRIRSLIGGLVRKARQVIGQEGD
DKVRIHQLLQLFYGDWGFHCDPEHYFEAENLYLAYVLETHSGMPVSLGAL
LLYLADSLKLPLYPVNFPTQLILRAEVDNEVAFIDPWNGHYLSQAHLQKL
YEGAFGFGAEISSEELERADVNTLLNRFRQLAKNALIRENRNDAAYRYIA
SLLRYHPEDPYEIRDRGLVLAQMGCYQAAAEDLQYFVDQCPQDPTSFLLT
AQLAELKDHFSELH
>MS1040 unknown
MRLFFMNFQEGKMNKIIKFSVVLIILLFLGFWFYTIYMTKLTGCSMKSGD
GFFQDRLICDNQEIVPTGYLSSTLLEPKLIARGVTIYQENGKACYTDEQK
FYIYNIEDKTTQVLNLEEFIKINAVSFKLPSEFYTLPADYLKDYANNCAK
>MS1225 unknown
MIFSVELIEKSFSFFIELVLHLRRFDFLSFRVICST
>MS1643 unknown
MELVASDGEGIYHCSATNLLRHFFTFFSVSFFLKSIFDFKGIRI
>MS1473 unknown
MLELLFLLLPIAAAYGWYMGHRSAKKDQEDVSNKLSRDYVTGVNFLLSNQ
TEKAVDLFLHMLQKQEEENEIDSNSQFEAELTLGNLFRSRGEVDRALRIH
QNLDRSSYYTFEQKLLAKQQLAKDFMSVGFFDRAETLYIMLVDEPEFAEG
ALQQLAVIYQKTKDWKKAINVAEKLAKISPQEDNIELAQYYCEYARTLGE
ESKEQPKEILQQALTVSPSCVRASMLLGDFLIQEEQYAKAVPVLENVLTQ
NASYVGEVLPQLKECYQHLNQLDNFELFLIRANQEYKHNSSVALALADLI
AEQDGRAAAQNKVYQQLTQNPSLFLFHRFVQYQVDDAEEGRGKDSLVLLH
RIVGERIKQSFGYRCTNCGYQSHKLLWCCPSCRQWEKVKPIRGIESQI
>MS2058 unknown
MKCHRLNEVLELLQPYWSKDSDLNLIQILQKIADEAGFEKPLAELSDEVI
IYHLKMHGTDKLEPIPGIKKDYEEDFKTAILKARGIIK
>MS1339 unknown
MYINSIFFYIDYSKLKIGVTMELKEFALKLRKNLTEEESILWYHLRKKQL
AGFHFRKQAVIAPYVVDFICYKAKLIIEIDGEQHFLPSALVYDEKRTFYL
KSKGFRVIRFTNYEIKRELDSVLDKIWYELTGEF
>MS0726 unknown
MRTCKRGFATLMIVFIIAGLAVSTMLFTDDQLHYHRGIMAQRSAYVSQMA
QLQNLAIEQMPVICQQIPDGLPDNTTSYTLPISLSTASSNKSAVEISHFL
RCRRYSLLATKPTKKFESYSTAVNEENIELFRHRFNQSYINEDTGQKVFL
YWLDETTESLILSGDTNAVVIAKEPLKIEGKGRLRGVVISDYPVELEGVQ
LSYNKYVMDFIYREFSLWKLAERSWSDFDAENN
>MS1074 unknown
MLVILPVSDHLLKVKSMFDNALLSLSHEQQQQAVEKIQVLMQRGMSSGEA
IALVAKELREAHDNEKINSEKTKSAEK
>MS1691 unknown
MKQYQYRITLEYLEDNQGNPKDEKIQFTAANHDDIFKIIELSKQREGFTS
DMAEQFTVGLKLMGEVMMAHRDFPLFREIKPHFLEIMKLVKGKGKAE
>MS1634 unknown
MRYLIGYDITDSKRLQRIYRRMIKFATPLQYSVFLFNGTKEQLDKYMQTV
LRLYNKKEDDLRIYPLPVQAKYWQIGKNPMPEGIVLSTFVF
>MS2095 unknown
MKDLTAREFGYGHPTPLFMIGTYDEDGRVNFMNSHWGALNHGGYINLNIN
TNKKTHLNIEKMKAFTVTLATEKLMPYADFFGTYSGFQYPDKFEKSGLTA
HKAKYVNAPIIDGSTLVIECELVEILYQEHIHTIIGRVKNVSVDESVLDA
QGKVDASKLGMIFFDSFSRGYFTLGERVGDAWSIGQSILNS
>MS1507 unknown
MFGKGGLGNLMKQAQQMQERMQKMQEEIAQLEVTGESGAGLVKVTINGAH
NCRRVEIDPSLMEDDKDMLEDLIAAAFNDAARRADELQKEKMASVTAGMP
IPPGFKMPF
>MS2148 unknown
MASGWRFKLPTSGKKHQKFAKPHQQTAKIAKCLTENKENHMTAYVVFIRD
EMKDQAAYDRYLQLGVPTLAPFGGEILVANGAHEAFEGADFDGSVVLRFP
DMASARAWYTSPEYEAVKSMRLEATLGRAVLLEGVA
>MS2240 unknown
MEMTSTQRLILANQYKLMGLLDPANAQKYARLETIVKGGFSLELKELDNE
FLAISEAECQTVLETLEMYHALQVSYENLADKSDLTAHRLQFIGYDAIRE
RKYLNYLRFITGIEGKYQEFMRCAPGCDSQTPMWDKYNKMLDMWKACPHQ
YHLSLVEIQNILNA
>MS2227 unknown
MVTTMAEYDKLRLEWDCRRGMLELDKIIMPFYLEQFDNLTETQKATFVRL
LACTDLQLFSWLFKRARASDTELQQMVDLILEKQGVVINN
>MS1272 unknown
MKNIRTFISIFLILLPLWAQAQREVKCRVVRVSDGDSLTCLARNNKQIKV
RLLDIDAPERRQPFGNKARQQLAQLIFKREITLRISGYDRYNRTLATVFN
EKNENINLKMVQLGLAWAYNQYSENPEYGKAEALAKKRKIGLWRETNPIE
PSRYRRELYKRNIQNKKQRTEKN
>MS1274 unknown
MNLQEQLKNAKNWEERYRLIIQAGKNITKPTEQELAEMQPLSGCEAQVWF
KISQNSDRTLHFQAYSDARIINGLLWILSLAVNGKPTEQCRRFDLTSYYA
ELGIAQRLTSTRLNGLKQIEGCIHQAGN
>MS0116 unknown
MPYFTTTQQGEEMNHTDFQPLPYPQTPESARAYFNLHGINRSEWARYFGI
DQQAISDHLRGRLKGTWGKSHKVAVLLGLKPNPETKVTA
>MS2017 unknown
MRKTHSFQQVRKIKPTWMSVSGHIPFKNGVSIPYESTLERDFLMYFTYLP
SVDKIVSQPTTLPFVKNGITYTYTPDFFLSFTDGRKPMLIEVKPKAKWQK
HWKEWKEKWKAAICFCQENGYVFHVYDEDRIRHLALFNLNYVQRYKRIQH
EQEDINVILAQVKLMGNTTIDYLLSRFFAGSLYRMKGLQIIYHLLATKQL
HCNWFLPLNEFTEVWGNNDE
>MS0881 unknown
MKFADFSLCKFKCCIQNAIQRSRKEKRREFITKKCGQNLSFFMTALNLFY
MRTTGRYFP
>MS0374 unknown
MLAIISPAKTLDYQSAVPKFEISQPQLTQYSQQLIDICKQLSPAQIASLM
SISDKLAGLNAARFADWQADHNEQNARPAIYAFKGDVYTGLDVESLTSDD
VLFAQQHLRMLSGLYGLLKPLDLMQPYRLEMGTKLANKKGKDLYAFWGNV
ITQTLQQALDEQGDNILVNLASDEYYKAVQASQLKARIIKPVFLDNKGGK
YKVISFYAKKARGLMCRYIIQNRLTEAEQLKEFNLAGYWFDEAASTKDEF
VFKRDLGE
>MS1660 unknown
MFNHINYFSIYCISKKSNLQLCYLSPRKNAGLNKSLLLPKLVGLAGCSGC
VVGC
>MS1759 unknown
MPSFDIVSEITMHEVNNAVENANRILSTRYDFRGVEAVIELNEKNETIKL
TTESDFQLEQLIEILIGACIKRNIDSTSLDIPTESEHHGKLYSKEVKLKQ
GIETETAKKITKLIKDSKLKVQTQIQGEQVRVTGKSRDDLQAAIQLVKGA
ELGQPFQFNNFRD
>MS2241 unknown
MKNLTKSALFISFVCTSPLALSAPDDSKTEALQKLEQQCNALKDSNIMNT
SIKSVKWFAGGNLPPDEQASFTGASNSNIEAAPHCVVNGEIEKRIGADGK
EYAIGFQLRLPSNWNNKFLFQGGGGLDGFIAPAIGSIPTHGSTATPALMR
GYAVVSMDSGHTGARDPSFAKDQQARLNFAYASTGKVTTVAKQLIEQMYK
EQPKHSYFMGCSNGGREAMHAAMRYPLEFDGVVAGNPGFRLSYAAVGEAW
DNQQFMKYAPTNEQGEKIVANSLTQEDLDIVSKAVLKRCDAKDGLADGVI
NAWEACDFKPEMVEKEIGKDKVALLNAVFGGAKNSRGENVYASWPYDAGI
NSKGWRAWKIGDSQTAVPNGRNFTMGVESLTNYFMMPISPDFDPMQFDFD
KDTQKVAQIAGMNDADETELTTFQARGGKMIIFEGVSDPVFSAHDLRDWY
NKLNQDMKDANQFARVFMVPGMTHCGGGPALENFDPLTALEQWTDENKAP
DFILAKAGEEFPNKEKEMPLCPYPQVATYKGGDKNKASSFECR
>MS1511 unknown
MTKRKLTQNQKRRIHSNNVKALDRHHRRAKKEIDWQEEMLGDTQDGVVVT
RYSMHADVENSQGEIFRCNLRRTLANVVVGDHVVWRRGHEKLQGISGVIE
AIKPRENEIARPDYYDGLKVMASNIDRIIIVSSVLPALSLNIIDRYLVIC
ENANIPAVILLNKVDLLTDEQWREAEEQLEIYRKIGYETLMISAISGKNM
EKLTALLADGTSIFVGQSGVGKSSLINYILPEVNAQTGEISETSGLGQHT
TTSSRLYHLPQGGNLIDSPGIREFGLWHLEPAQITNGYREFQYFLGTCKF
RDCKHIDDPGCALREAVELGKIHPVRFDNYHRLISSREENKSQRHFMEQD
IR
>MS0268 unknown
MIKNSPESSKCGQKRENFCQKIPLRITPDGVLSDYDAAETI
>MS1635 unknown
MPTLYIDRRTTELKVNGDVLICYEKGERIATIPLASVDRLYMKGDINLQI
SLLSKLGEKGIGVVFLQGRKNKPMQFLPQPHNDAYRRVTQTYLADNKLFC
LTLAKNIVLNKCIKQCQFLAKFIEHNPKIITFIAELQKLFNLIVKQENID
SLRGIEGRMGAIYFAAFADILPRSLGFNGRNRRPPKDPVNAVLSLTYTLL
YSEATLAVYGAGLDPYIGFFHTLHFGRKSLSCDLMEPIRPSVDEWIAECF
TAEVLKIDQFSQTNEGCILGKEGRVIFYTAFEKVVSEWRKIFEKQAYELV
HLICGYQTEYHQDQFDDYTINMAHILGNEKCDI
>MS2021 unknown
MRAIDTECKINFTGYWPYLENTGLSVLYGLVI
>MS0981 unknown
MRKLQNTLYITTQGSYLHKERETLVVEQDRKKVAQLPVHSIGHIFCFGNV
LVSPFLMGFCGENNVNLAFFTETGRYLGRLQGRQSGNVLLRRAQYRISEQ
NPIPIARNIIAAKIQSAKRVLQRRLRNHGEHEEVQAAVMALNFSLQQLKQ
AENLDLIRGIEGDAAARYFGVFQHLLAEKNGFGFDGRNRRPPRDGVNALL
SFLYSILGKDISGALQGVGLDPQVGFLHADRPGRDSLAQDLLEEFRAWWV
DRMVLSLINRGQIKPQDFVTEDGGAVNMKPEARKLLFQSLQAKKQEKIVH
PFLQEEVEIGLLPYIQAMLLARHLRGDLAEYPPFLMR
>MS0142 unknown
MMTTKTIAITAATGQFGTIALDLLVQRKANVIALVRSPEKISNAQARKFD
YANIEGQVEALNGVDTLILVSGNEIGQRFPQHNNVIQSAKKAGVKHIIYT
SLLGASNENTVKSLAGEHVATEQALKESGVPFTILRNTWYTENYTGSIGA
ALANNAFYGSAKDGKIASATRADLAEAAVNVALSEGHEGKTYELAGSTSW
TLADLAAEISKQTGKQIPYIDIPAQDYAAALVKAGLPEGFAGLIAEWDVD
VSKGALYSEDKTLEQILGRPTTSLADAVKAAL
>MS0559 unknown
MSRKFCLPKHISAEDFLRDYWQKKPLIIRNGLPEIVGMFEPEDILELAQN
EDVTARLLKQFSEDSWTFTPSPLTERDFTELPEKWSVLVQNMEQWSAELG
RLWNLFGFIPQWQRDDIMVSYAPAGGSVGKHYDEYDVFLVQGYGQRRWQL
GKWCDPSTEFKPNQPIRIFDDMGELVVDEVMNPGDILYIPSRMAHYGVAQ
SDCLTFSFGLRYPNLSDLMERIQHGFCYQNPEIDLNEFSIPLRLNQSAQP
TGKLSETEIQAMKRQLLEKLTSSPQFDRLFRQAVASAVSSRRYEMLVSDE
ISEPEEVLTALENGAKLLQDNNCKLVYTSNPLCIYANGEWLDELNSVEAE
ILKRLADGEALALTDLMQLIQQTDERDLAMDLLLDGICNWLDDGWILLN
>MS2378 unknown
MTTMEIILVTLVAAICGMGSVLDERQTHRPLVACTLIGLVLGDLQTGIIV
GGTLEMLALGWMNVGAAMAPDAALASVIAAILVIKGGQDKGTAIAIAIPV
AAAGQVLTIFVRTLTIFLQHKADDYAAQANFRGIEFCHFAGLSLQALRVA
VPTLAVALVAGTDTVTAALNAIPEVVTRGLQIAGGFIVVVGYAMVINMMR
AGALMPFFFIGFVIASFSNYNLVGLGMLGACLALIYIQLNPRFNQAQLPA
SSTSQKQLADDELEGL
>MS0528 unknown
MDVIIPISILLILFVIGTPVAFCIFCSTLTYFLMSHQPMVILIQRLAGGL
ESVTLLAIPFFIMAGVFMNHTGISERLLKFCEVLTGHMNGGLAQVNVALS
TLMGGLSGSNIADAAMNSKLLVPQMVARGYSASFSAAVTAAGSLITPIIP
PGIAMIIYGYVNNVSIGRLFLAGVVPGTMLCILMMILVSIISKKRGYLPI
REKRASCKEVIVSAKDAVLALLLPIIIIGGIRMGVFTPTEAGAVAVIYAL
ILGMFIYRNMDAKKLWLATRESALGAANVLLIICVAVAFSKFLTWERVPQ
ALASWMTTVVDSPIAFLMLVNVALLVLGMFLEGNAIMIVLAPLLAPIAHS
YGIDPIHFGIVFIFNGAIGTITPPLGTVMFTTCSITEVPIEKFIKDVLPF
WGLLLLELVLLTYIPTITTWLPNLVYGVAQ
>MS0799 unknown
MSYKLKSNRGNEYELVSFGIEKICGFFGQSKT
>MS0616 unknown
MTTEIKKVTKSDLNSVVLRSNLFQGSWNFERMQALGFMYSISPVIKRLYP
DPNSQERKDAIKRHLEFFNTQPFVAAPVLGVTIAMEEERANGKPIDDAAI
NGIKVGLMGPLAGVGDPIYWGTARPVFAALGAGLALSGSILGPLLFFVLF
NLVRLATRYYGVTYGYKKGLDVVQDMSGGLLQKLTEGASILGLFIMGALV
QKWTSINVPLVVSTIQKQDGTTEITTVQSILDSLMPGLLPLLFTFACMWL
LRNRVNALWIIVGFFVIGIFGAWTGILA
>MS2108 unknown
MGYRVNSVLGTKFRIWATARLKDYLTKGYAINQQHLSQNAHELEQALALI
QKTAKSSGLTLESVWWTLSAVIRKHFYCLQAAEKR
>MS0600 unknown
MRDKDDIENIKIKSFNSTYFSFIENDVALMRKRFHLTV
>MS1856 unknown
MFKVSKEFSFDMAHILDGHDGKCQNLHGHTYKLQVEVMSAQLHQSGAKKG
MVVDFSDLKTVVKKFILDPMDHAFIYDNTSERECKIARLLVELDSKTFGI
PVRTTAEEMSRFIFNRLKHDAGLPVSAIRLWETPTSFCEYRE
>MS0557 unknown
MNWLTLAFGSAFFAGLTAILGKLGVEGINSNLATFIRTIVVLFVSAGVIS
MRNEWQLPQHIAVRPLMFLILSGVATGLSWLCYYRALQLAPASWVAPIDK
LSVVIAIVLGIVILGEPISIKLITGSILILAGVLVLAL
>MS2233 unknown
MGFSARIKICIKGNLSVSFPTMLVSYQTFFSYILGLFYATNSNF
>MS2062 unknown
MIAVYAIAKVKADKITAFEDVVKELVAKSRGDQGCISYACGSVQGKENTY
TFIEQWQSMEDLKLHTQQPHFIEAGAKFADILSAELEINVVDYLA
>MS0286 unknown
MTDKIEKAKNSTREATPQSAVKNSEKTRKWCRRIFCIFCIVVLVPLIGLL
GALSFESGQQGLLKLTDKMTDSLSFEQISGNLQDGLELHNIRYQSSGIDT
LVEKARFQLDFNCLWRREICVEDISLQKTDIHINTALLPPSESERKTDSG
EMSRIYLPFGLTVKNVAVSELALSIDNNYLNLGVFKTAATLNNRRGLTLL
PTIINDFSFVSKTSAEQQAEAEKKAEDEAEQAQPVDWAKIDEILTPALLG
NLNQITLPFDIHVEDIQGQNWQYESFVDERSQQQVIVSRFQLQADATNYD
VELKTFDIVSNLADLQAQGQIRLNEDFPLNLVLHGDIHQDKASVLPMKRL
DLELSGNLKNQTALLLTTQGDVDATLKGTVELGKEKMPLDLQLTSKKAQY
DFAVANLKPLKLQDVNAKITGNLLDYQAEISGQVEGMGAPKTEVDLLGSG
KLYQAEVKQLKLHGLEGRIDLQGDVDWQDGAKWNAELDLNKINIGAYVKD
FPAVLTGKVSTSGLANSKTWQVSVPTLDLTGSVSQRPLVLKGGINLGQEA
LLDIPNLLMTYGENKLIAKGLLSDKSDFNLDINAPNLKGLLPDFSASLVG
KAVLTGDMAEPNLDIDLKGDQIQFQDFYLAKFNVQGKVNSVPQIEGNLAL
DVSGFNYGDINIHSVKLTAKGNEKAHELQLRSEGDPIAAQLNLSGGFDRA
LQQWKGTISQTDIKTPIGDVTNNQFAVNYEHKSAKATISAHCWHNPDVEL
CFPQSFTVGQNGEIPFEMKKLDLNLVNKLTEQENMLAGILTGKGKFAWFA
DKPVKLDASVTSNAIYFSQKVDGKNFKLDMAKLNVNANLENNNLAVTSAI
HLQNQGNVAADIKLLDIDQVGKLSGSLKMSGVNLDLINQILSNKERISGD
VGAALTFAGDLNKPLINGSLDIKNMNAVVQNMPFDITDGNLALRFYGTRS
SLQGYIQTPDSRLNIDGNADWQDINHWHTAVRAKANEFKLDIPSMAKLKV
SPNVEMKASPTLLELTGNVDIPWGRIAIESLPDSAVSVSSDEVILDEPPR
TRIVKLATETDGMVIRSDLKINIGNDVNLEAYGLKTNLNGRLLVKQEKGQ
LGLYGMINLRRGRYASFGQDLLIRKGQISFNGLPSHPMLNIEAIRNPEAM
EDAKVIAGVKVTGLADSPSVDVFSEPAMPQDQALSYLLTGRSLENSGEAG
SGGSVGAALLGMGLAKSGKAVGSIGETFGIQDLNLGTSGIGDSSKVVVSG
NLTPRLQVKYGVGLFNGLAEFTLRYRLLPRLYLQSVTGVNQAVDLLYRFE
F
>MS1424 unknown
MSHKSALSPEQIYSVSPTLATYTKTLISDDMWNRPILSKHDRAMITVAAL
IARQQTMGMKHYFNLAMDLGVSAKEMSEVVLHMAFYAGWSNAFAAVDILK
DIFAERGISPDQLPTLEPEMLPMSQALPDNDFFMGLIDQNIRSFVPKLAD
NSTDVLYHQVWLRPDLNPRDRNLISVTALIAQGLYDFVTVYSLRAKAVGI
SKEEMQELLAHLAFYAGTPYIVPAIPHVAKAYE
>MS1807 unknown
MYLNELNKIPLIIKEIPAKRKRLLMYFYRLERLSAFRENRHNFPLFIYWV
NL
>MS1286 unknown
MQSEMLLPAIGGFIAGVILTYLVLRLTKGSIKNQAKTENALQQAKAELAE
QKKQLERHFAESASLLKTLSEDYQKLYRHLAASSTTLLPEFKELFNGSTV
NQDKPRIEPTISDLKTITEENEDQPRDYSEGASGILRVER
>MS2383 unknown
MSTTAIIMMIISLIVIWGGLALAVLRLPKE
>MS1818 unknown
MNSVLKKFAKLSALFGVMLFATTVQAEQAKVDAKQESPVAAFDQTLDNVR
DPNKYCAQCHNLDTSKDQAVGTNHAGKFHGTHLTKNNPATGKPITCVSCH
GNISEDHRKGAKDVMRFESDIFSTEQPMYSVQEQNQVCFSCHKPDDLRKK
LWAHDVHAMKAPCASCHTLHPAKDAMKDIEPKERVKICVDCHSEQRLRKE
AADAAQSATEQKDKQ
>MS2112 unknown
MVGRNAHPTFQMAFSISNSQREMLYTSPQHGKTTMLEQFIAQLYQQKNAQ
QAV
>MS0835 unknown
MDWLFAWSVWHWLILGFLLLIGEILIPGIFLLWWGISAIIMAAIMALFTT
LTLTVLGISYAVLALLLSLVWWKYQHNKDKSDEARSVLNQRNHAMIGAIG
AVQEIALNGVGRGYFGDTTWRIQGKELKVGDRIEVLKVRGITLIVRKLGN
>MS1067 unknown
MRLAWVAFNTILHKEIRRFTRIWVQTLVPPVITMTLYFVIFGQLIGGRIG
AMNGFSYMQFIVPGLIMMSCITNSFGNVASSFYSTKFARNVEELLISPTS
THVIILGYMAGGMARGLFVGTLVTVVSLFFVEFNIHSWTIIFITVLLTTA
TFSLGGLINAVFARSFDDISIIPTFVLTPLTYLGGVFYSISLLPAFWQGV
SKLNPIVYMINGFRYGFLGISDVGLGYTFAVLITFITALYGFVYYLISHG
VGLRS
>MS0070 unknown
MFILIAMVFIAIFLLVETLKSKMIYFNQILRI
>MS0874 unknown
MKLKYKLCIALFAWVSAFHVAAAPQTHAEVSNVTTELNDIQIRLKAQQSA
DKGDWKTVYTLLLPLAQRGDSQAQVNLGILFSSGRGVEKNLEKAYWWFNE
SAEQGNAKAVTYIGLMYLEGVGVKQDTKHAIRILEKAGRVDYPRAMLALG
NAYYMEKNLQKSFLWFERAAMKGVSEAQFKLGMMYEKGEGTHKDEEQAVY
WYQTSLKANDDIAEFAKERLSALGRLR
>MS1082 unknown
MIAFGYITALSLSYFLLAPDFKGLSFTEYFIQSEAKPIFLTLGLLLPIGF
IVMSKAVEYGGIVRTDAAQRLALFLQIIAAVILFGETLNNMRVGGVIVAF
FALFCLLTKPTKSIENALKAVFALAAVWLIWGVTGILFKKIALMGGAFPT
TLFVTFSIAAVLMFTYLLIKRTFWNASSLVGGIILGCLNFGNILFYIYAH
QYFKENPTIVFATMDIGVICLGMIVGALVFKEKISKINMLGIVLGITAIL
LLRV
>MS1931 unknown
MMLSPSEILKKTTALFAATICLYFACKLILMGTGFYPQPKLTDILLFAIL
IVIFNSSKNLFYFLLLPFIIAHALYAPVGITFGAPSYQYIASVFATDLME
SREFLSQLSIKNYLMPVGIIGLTLAFRWITQKYDLKLHKNKMFLASITAF
MLLANSPFKFIDEISTSGTQVISELQRLNNMTIESEWGDSQLINSNYDDY
VLIVGESARKDYHHAYGYPVKNTPFMSKANGVLIDGMTAGGTNTIASLKL
MFTQPNTQTKEGNYSLNFVDLIKSAGIKTYWISNQGYLGEFDTPISAIAN
KSDEKIFLKSGDSLNSNTSDFELLPKFTQVLERPSTGKRFIVVHLYGSHP
ITCDRLNDYPKLFDDDKIAKKYFNVNCYISSIKKTDEVIKRIYDALAENK
AKTDRTFSMIYFSDHGLAHQITEDNIVIHNSSGKSKRHYDIPLFKISSDD
TKRHEYRVFKSGLNFTAGLAYWVGISNAKLAVREDLFSNEPDKDDYGLKA
EIDKIDVPEDKAVVIPGTH
>MS0477 unknown
MTKIFLYGIIAKNFCITGSHNMSSKTIELNFLGQVLRLNCPEEQHDSLRE
AAKLLDSRVTEMKDRTGILQVEKALAIVALNLSFELLQEQHKTHKTENVL
QNQIEQLTRSLESISASTPTQQASYSID
>MS0679 unknown
MKTKLFIRIVSLVTKNDIRPSVLKCGHFSAFFKNRKKTTALLSDD
>MS0143 unknown
MIFKEIKMNNFFERGNVLAAACPSRQILQHLTSRWGGLVLIALRSGTKRF
SELRKTIDGVSERMLTQTLQQLEEDGMLVRKSYNTVPPHVDYTLTEFGAQ
ASEKMFELVDWLESNLNDILTHKVSKQ
>MS1011 unknown
MHKLKLILLTSTLFGLFACSNTQKTQIKTGYLKDNISQEELSNPTQYKRY
YYSCQNFETGTESYLSTYFPLSRESRMKDNFGIYFQLDNGKVQPFDHIAN
KPLNARASRFEVIYRSYHPIQGAYIDLIASENSSVYYKDYRGMRSPWLDC
KES
>MS0747 unknown
MKELFATTARGFEELLKLELSSLGATECQVAQGGVHFMADDETQYRALLW
SRLSSRILLPIVKTKIYSDLDLYSAVVRQNWLAYFDERVRFLVDFNGTNR
EIRHTQFGAMRVKDGIVDYFERNGKARPNVDKDYPDIRIHAYLNRDDLVL
SLDLSGEALHLRGYREDSGAAPLRETLAAAIVLRSGWKQGTPLVDPMCGS
GTLLIEAAQMEAKIAPQLHRMHWGFDFWRGHNQAAWEKVKREAVAMAEAE
FNKNPNPHFYGFDLDHRVLQKAQRNAQNAGVAHLIKWKQGDVAALKNPTP
EDKGTVICNPPYGERLGTTPALIALYSVFGQRLKEQFPGWNASIFSSEQG
LLDCLRMRSHRQFKAKNGPLDCIQKNYQISDRTLSPENKSAVENAGEFKP
NANVATDFANRLQKNIKKIEKWAKQEGIEAYRLYDADLPDYNLAVDHYGD
HIVVQEYAAPKNIDENKARQRLLDAVTATLAVTGVETNKLILKVRQKQKG
ANQYEKLANKGEYFYVNEYGAKLWVNLTDYLDTGLFLDHRLTRRMVGQMA
KGKDFLNLFAYTGSATVHAALGGAKSTTTVDMSNTYLNWAEQNLILNEAD
GKQHKLIQADCLQWLANCAQQFDLIFVDPPTFSNSKRMEDSWDVQRDHIK
LMGNLKRILRPNGTIVFSNNKRGFKMDFEGLTRLGLKAEEISAKTLPLDF
ERNKQIHNCWIVEFV
>MS2065 unknown
MLFCPVSQIVRQVRPFILTQRALAQIQWQDVPFSQTVKTTLTEQQKQAFT
AQFAGIASPVAAYRIPANQGTLEIEIESPVIDQTLFVPTAVVLDGNFNVA
ATYPSSSFKLQEEGGLKGNRLSAELNLTPAMNQDYIYLLIYTTQQDLAKT
TMMPHPAKVYAKATGRQPPAINDIEVAHSLNGQVQINVSSANGTKFIGLP
TEIFSSNKASTPVGKPAASPATAAQNPNAVVTVVDKDTEAYFNQAVTKAL
KAKDVNKALNLVNEAEKLGLKSPRQIFLKNVNSN
>MS1428 unknown
MNMNKKIVMILKILLAVIVLLTGAVWAFMTYHPVWGGTPDEGSMARIRAS
KAYNATLGKFENQEPTQLLTTDEKPSITTWITRLMAADEGKNPSEPLPSA
AFDKNVLKDGEMVWFGHSTVLFKLGGLNVITDPVFHNASPIPYIGISPFK
TEHSYSVESLPELDIVLLSHDHYDHLDYRAIQELDSKTKHFIVPLGVKAH
LQRWGVADDKITEMDWDEQTKIGTLAITLVPARHFSGRTLNIKDPTLWGG
YIIQSPELKYYYSADSGYGKHYRETIAKHAPFDFVMIENGAYDKKWALIH
ETPEEALQALKDIGATKVLPIHWGKFDMANHVWTDPINRLMKDVASQPEI
SVATPKIGQIFHTQGDLPAEQWWQGVR
>MS1640 unknown
MQIDRFERHLDPSSIQSGDVVIGTLPIHLAADICQKGAKFYFLSVNVRAE
QRGTELTCEQLVEQGCSIEAFYIQKL
>MS1987 unknown
MTEQNLLSSLAHMISEQRNPNSMNLDSLSPLELVTLINNEDKQVPLAIEK
VLPQIAQAVEKIVRTFQQGGRLVYIGAGTSGRLGVLDASECPPTYGVKPE
MVVGLIAGGERALRHPIEGAEDNAEQGKADLQQINFSKKDILVGIAASGR
TPYVIGALNYAKSLGAITISIASNPDSAMASIADIAIDTLVGAEVLTGSS
RMKSGTAQKLVLNMLTTASMVLMGKCYQNLMVDVQASNEKLRARAIRIVM
QATDCEKEVAERFLKAADNNAKLAIMMVLTNLDKQQASVLLQRHQGKLSR
ALSQ
>MS0291 unknown
MYCVQCEQTMVTPKGNGCSFSQGMCGKTAETSDLQDLLIATLHSLSAWAL
KAREHNIIIHEADAFAPRAFFATLTNVNFDSARIAGYAQQALIYRNQLIK
AVNEVEPNPNIDHPLANIELNGISVEQLALQAKQFALDTDRQQIGEEAHG
VRLLCLYGLKGAAAYMEHAYVLDKFDNDIYAEYHGFMSWLGTQPGDLNEL
LEKALAIGSMNFKVMAMLDAGETEHFGNPVPAMVNVRPVKGKCILISGHD
LKDLKELLEQTEGKGINVYTHGEMLPAHGYPELKKYKHLVGNFGSGWQNQ
QKEFARFPGAIVMTSNCLIDPNVGDYADRIFTRNIVGWPGVTHLEDHDFS
PVIEKALQCDGFPYTELEHLITVGFGRKTLIDASDAVIDLVKAGKLSHVF
VIGGCDGDKEERHYYTDLAYALPKDTAVLTLGCGKYRFNKLDFGTIDGGL
PRLLDAGQCNDTYSAIMLAVTLSQKLGIGLNELPLSIVLSWFEQKAIIVL
LTLLALGVKNVYSGPSKPAFLNDNVMALLHEKFGLSGLTTPEQDFGHIIN
KNL
>MS0534 unknown
MYPSHLKFLSLKSAVNFSIVLPASSYSLFADA
>MS2090 unknown
MQTSNALDNLKSIAKNNKKRLAGTFGLVAAENVLFLTYPVFGSFAVNAMM
SGDVWASLSYSLLVLIIWSIGAMRRAVDTRAFARIYAELAVPVVASQRAK
GLDTSSVTARVALSRQFVDFFEQHLPILIMSAFQIIGSALMLLILEFWAG
VTACAILAFFAFLMPKYAKTNDLLYLKLNNRLEKEVDVIERNNGYQLNKH
YGWLAKLRIRISNREAAGYLWIGVAMALLFGVTVVQIATTQGVKAGHIYA
VITYLWQFAMSLDDMPRLLEQFSNLKDIGKRVEV
>MS0451 unknown
MRLILDKFSENFINCAHKKAHLILLNTLSEKFIKI
>MS2109 unknown
MQTLITPEKYIFRRYFLQNSSKIRPLARNFLRDKMNVV
>MS2262 unknown
MNYMQDNDKLYRYLFQDRAVRGEWVRLNQTFIDTLNTHHYPNVVRNLLGE
MMVATNLLTATLKFNGDITVQIQGDGPLRLALVNGNHRQQIRALARIDGE
IRDDMSLHQLIGKGVLVITIAPQEGERYQGIIALDKPTVTECLEEYFQRS
EQLQTQLLIRVGEYEGKPVAAGMLLQIMPDGSGSPDDFDHLATLTATVKD
EEIFGLPAEELLYRLYHEETVELYEPQAIQFHCGCSQERSGSALLLINDD
EIDEILEEHNGSIDMQCECCGTHYFFNKEAIEKLKKSGEEPVTTH
>MS2019 unknown
MLSLFGQLKLFLSSLTGLRLTDIYTMNGLLRKISSKWLTESIIKTLPIFI
SSNLVAIVIWKLQISHLAMPLILGVIAGGLVDLDSSIGGRIKNLIFSLIA
FAISSLGAQISLGYGWIFIPAVMVSAFILVMLGALGQRYSTIAFGTLVVA
IYTCLSYNPEMPWYGNMSMILMGATIYGLVSITVYLCFPNRVTQENLANS
YDALGEYLQAKSEYFDPDDDNLATKQINLAEANRKVMPAFDQTRVSLFYR
LQGHNHQVRTRRLLRYYFSAQDILERASSSHYQYHELFQELNNTDLMFRF
QRVMELQAAACQKIATALRRRETYTHSPRGKKALQGLLDSLNYYNKQGLP
NTYRWQMIAENLRNIENQLSQIEQDNISVEASDNELVKSIRLTGENVSGI
QNMFRVIRGQCTFSSQLFRHAVRLSILMVICSALVQIFNLDSKGYWIALT
AIFVCQPNYVATKKRLIQRIIGTVLGVIVGYSFQYLSPSLEALLGLTVLT
GSLYYFFRLSNYGSSTFFITLLVFVSLNVIGVGANEGILPRLFDTLLGTA
LAWIAVSFLWPDWKYLNLHNNLKSTLSACTEYMRHIIAQLQFGYNDQLAY
RVVRLEVHNNISSLSAVISNMYSEPGKYQKALEFAPKLLGITYTLLSYIS
TMGAYRAASRELDHNTEFSALFFKYGKQTTEILDCLTDKKCSTDNINERI
KIIDENLARFNKYDKQSNGIEQVLVQQLRMILQLLPQLGVLVKTENSYFQ
LES
>MS0554 unknown
MLKKYRTFIKVRYFYIKFFETPVSPDFYTK
>MS1618 unknown
MRSPFCQNMRTFEGFSQLQKPADKSKQNLKLSLIKLGKSVKF
>MS1633 unknown
MFEQEAWQPSAPIKTLFTRAKIIREIRKFFTERGLLEVETPVLSEFGVTD
VHLSTFNTEFIAPIGENSKTLWLMTSPEYHMKRLLAAGSGAIFQICRVFR
NEEAGSRHNPEFTMLEWYRPHFDMYRLINEVDDLLQQILDCEPAESFSYQ
FVFQQYVGLDPLSAPRAELVAKAREHHFMCDENEERDTLLEFLFSTVVEP
QIGQTRPAVIYHFPASQAALAQISSEDHRVAERFEFYFKGLELANGFNEL
TDANEQLIRFERDNRQREKMGLPQRAIDKRLLAALEAGMPNCAGVALGVD
RLLMAALNANRIEEVMAFGVNNA
>MS1271 unknown
MRQKIFLFVRSLIILYLILFIGEGIAKLIPIGIPGSIFGLLILFIGLTTQ
IIKVDWVFFGASLLIRYMAVLFVPVSVGVMKYSDLLVSHASSLLIPNIVS
TCVTLLVIGFLGDYLFSLNSFTRLRKKAIKKRDINNVNNKGEAS
>MS1329 unknown
MYLVKFLHFFSLYLFQLAKARYHSGLFFEIK
>MS1483 unknown
MSYNNRNDCFFMQHDKNTQNSTALLSAVNLLKKCGQKSICKRLHIEL
>MS0509 unknown
MKKTYFISDLHLSENRPKLTALFEHFMHNIAPQAQAVYILGDLFDFWVGD
DEKSPLISRVQAQIKQLTEKNIPCYFIHGNRDFLLGEKFAESCGLQLLPD
YKIVNLYGTDTLICHGDTLCTDDVNYQTFRRKVHQKWRQRLFLLLPLKVR
IKIADKIRRQSRHDKKMKSAEIMDVNGEFVCQIFERFNVRQMIHGHTHRQ
NIHQIPPHFKRIVLGDWHDDYASILEVSEQETYFLPQTAR
>MS1490 unknown
MTRKYWLIIMKNKALVLDLDDTLYAEIDFLYSAYKHIASRLAPERSETLF
NRLVELYHRGENAFQYLVEQYDVDLSTLLDWYRFHVPQIRLFPHVADQLN
RLKEDFRFALITDGRSVTQRNKVKALGIEPLLDFIVISEEVGSEKPSLNN
YRLVQDALHCRDYIYIGDNPKKDFVTPNKLGWKTICLKDRGTNIHRQDFE
ILEEFRPHFYMSDWSELPTFLDF
>MS1398 unknown
MTALLRWKSMKQTSLFSQNNTQNQPLASRLRPTSLDEFVGQKHLLEPGKV
LQQMIVQDELSSMIFWGPSGVGKTTLAQIIAHQTNAKFITFSAVVSGIKD
IKKIMEEAETDREMGEKTIVFIDEIHRFNKAQQDAFLPYVEKGSIILIGA
TTENPSFEINSALLSRCKVFVLEALSNNDIVLLLKQALNHPQAFIPLEVN
ADEKLLQAIAEFANGDARIALNTLELAVKNVEKQGNSVHLSENLLADILN
NRQIVYDKTGEEHYNIISALHKAMRNSDPDAAIYWLSRMLEGGEDPVYIA
RRLIRFAGEDIGLADTNALTLTTNVFQACRFIGMPECDVHLTEAVVYLSL
APKSNAIYQARCKVREDVKNTRNDPVPLHLRNAPTKLMKNLGYGKGYKLA
HHYEDKLTTMQTMPDNLLGKQYYFPTEEGNEQRFKARLAQIKQWKAEHK
>MS0418 unknown
MFEWMFEADFWQQHSLWFMFVSSFLSATVLPGNSEIIFLALVSANLFTAQ
DYFSPPVFNLLSLATLGNTLGGLTTYWLGRVFPKPELRDQSNKKVRWVFA
KFQRYGIFVLLLSWLPLIGDLLCAVAGVMRLNWFASLCCIFIGKALRYVF
LLYLAVGYTFW
>MS2308 unknown
MHENFDQQLIFSELIYGNKQYLRIICTFAFKTATTTFRQSHGSL
>MS0108 unknown
MSAPLTFQQVFDRVVGHEGGYVNDPHDPGGETNWGITKYTARENGYTGSM
KAMTREQAYKIYEKAFWQRYHCEKLPEAVAFQFFDAAVNHGVGNASRMLQ
RAVNVADDGIIGKVTLSAVEKMPISDLLLRFNAERIRFYTKLKNFPRYGK
GWMNRIAGNLAYAAIDNEV
>MS0094 unknown
MYVTIDELTTAFARKTLVQLSNDEPTATEPNLTVLDTAIKVAEERIDAAL
RSRYTLPLTQVPTLISQHALTLARYWLYARRPETKMPETVKETYTQAVKE
LEQIANGKLHLGIAESAVEKSNDLLPDNSEYEVRATQRINTDGY
>MS1649 unknown
MKLTNIIEIKAKLVLKTGLHIGAGDSEMHIGGIDNSVIKHSITQSPYIPG
SSLKGKIRTLLEWYSGEVKSEPLSINNVASANNSENVKNILRLFGFAGHS
ENNKELCQELKSSRLAFWDCALNEDWEKMIREDNQLLTEAKSENTIDRIT
ATAGNPRQTERVPAGAEFDFKLALRQFEGDSEELVKLVLKGLRLLELDSL
GGSGSRGYGKVEFQGLTVGGKEEKLPENPFA
>MS1427 unknown
MQVISGHWGEMLPFYLQRLDDSIPQAATGLKRSITQTFKEQVFVTPSGML
TLPHFNFIYELVGADRILYSIDYPYQTLDGARAFIENLPISQAEKELIAY
KNAEKLFGLG
>MS1629 unknown
MIIFKGLYIGTSSLYYFCRCNPSPAAFFVYFQPHFILAF
>MS1110 unknown
MFLIYRNSLNRNRGFPSLPDKICAKSTALLPAP
>MS1137 unknown
MYLRQLDISGFRGIKRLSIHLRPDMVLIGENSWGKSSLLSALSLILNVDN
GLYHFVPTDFHRADNMKDITLLFTFSESSINEEHEKFNPVYRHIFVPHED
GFERIYLRVSGDINEQNQVQTYYSFLDQQGQPIDVENVDFLVKELTHDHP
VYRFRDARLNRHKANSQPLKYAENIDAVSRELYAVTELVKYYFVETQEYA
QMSSDPGVLWDLAQSLCYRLEQRKNPELQQRLVNAITSLFEHNGKLNPGS
HRFMRPILLLEDVATRLHPRMVAIVWKLANYLPIQRITTTNSVELISQVN
LRSICRLVRYDDRTRAYQLNRRDLGKEDLRRLSFHVHHNRSRALFARTWI
LVEGETEVWILSELAELLGIDLDIEGIRIVEFAQSGIRPLIKYARAMGIE
WYALTDGDEAGKKYTETVKTMLLEHELLSNRVTTLPRQDIEHFFYSSGFE
NVFIRLARWEPQGGHYPIHKIIQKAIQRTSKPDLAITLSNEMANRGRDSI
PLLFKRLFSKVVSLTRTQES
>MS1292 unknown
MEIKMNNEITSEKSSWRSKLGALGPGILMASAAVGGSHIIMSTQAGAIYS
WQLVPIIILANLFKYPFFRFGAQYTLDSGNTLLEGYLQKGKFYLWFFFLL
NIFATIINTAAVGLLCAAILTFVLPFPVPVPILSLIVITVSSGILLLGKY
RMLDGLSKLIMIALTVTTVMAVLTALFRNRIQGVAQADYVAPSPWNLGPL
GLGFIVALMGWMPAPIEISALNSMWVVAKRRLTKVTYRDGIFDFNVGYIG
TAVLALVFLALGALVQFGSGEQVQMVGGKYIAQLINMYASTIGDWARGLI
AFIAFMCMFGTTITVIDGYSRTNVESLRLLLKRKESSPKYLNLAVILAAL
SGLAIIFYFNNAVGPMLSFAMITSFVFAPLFAWLNLSLVLKGEHKVRGGL
FWLSIAGLIFLISFAGLFIANQAGWLA
>MS0147 unknown
MYPVDKHIKGTGKRVNKKSSKIHRTFLALPDKIYYNRPNFRK
>MS1665 unknown
MALLSTLSDRKSAVKKTEILNYAFVQSSTS
>MS2123 unknown
MHFERRSQWSSELGREMYFNVYGHTGKPVIVFPSSGGNQEEYANFGMIDA
CRSFIDRGLIKIYTPDSYDKESWLATWKSGHDMALAHNAYDRYIVHELVP
LIRHESQWNGTMIATGCSMGAFHSVNFALRHPDLFDTTIALSGVYDARFF
TGEFYGDPTVYFNSPIDSLWGQNDDWFLNQYRRNHFIVAVGQGAWENEHV
SDTVRLQEAFNAKGIPAWFDYWGEDVDHDWPWWRKQMPFFLSKLEEQGII
>MS1474 unknown
MIKYIFAFVIIIAIILVAITVGANNDQVITFNYIIAQSELQLSSLVAILF
GFGLILGWLITGFFYLKLKLKNITLTRQVKRQTQQINELTTSLDKAAQ
>MS2363 unknown
MPKNKRLTIKAKQITARGAKNKIDAEINRPPTGRGLLIIILLLVMFWFFT
VHIAVSYKG
>MS1584 unknown
MGPRSETRQINNYKIFYFLTALLSKVRFNFPLF
>MS1710 unknown
MLKATFKTWVSKILLLATALFAVQNALADESPYGLTRQAAEKLFADIKAN
QPKIKQDPNYLKTIVRQDLMPYVHVNYAGSLVLGQYFKSTTPAQREQFFA
AFDQFIVQAYAQALTMYSNQDIQVQPQQTVSDSQASVRVKLLQKGQEPLN
LNFQWRKNSKTGKWQVYDMTAEGVSMVDTKKQEWSSILRKNGIDALTAQV
QRAAAVPVSLGKK
>MS0527 unknown
MNMPISKKKFWLSDIDEILASFFLALIVLLSGYGVVMRYFLNTPSAWVEE
ICVVFFIWFTFLASSALCKNNELIRIDYLLTKIPAKVANFIDGVIQPLIM
IFCLGFMIYLGFKLLPMSKMRFTPALQISYVYIYAIIPISALFMLYYELR
KIVYYFKINKRN
>MS0714 unknown
MISKIKVSLVALCAGLFFVSVNTSAAETQTQVPQQCQKLFSATERLIEEA
EKQPGTHTQVSKIKNKLNQSKKQILEMELATQIKSCDHGLARLNRLNQQD
QITN
>MS2246 unknown
MSENKTQVNPSVERFEQAVADKSYESACTELLSILGKLDSNFGNINDIEF
QMPKQLAEANLQQDKIVYFCTRMATAITTLFSDKELNISESGAQRFFLFQ
RWMSLIFASSPYINADHVLQVYNQNPDRISSEVHLEANRSALLKFCILYF
PESNLNINLDTLWNVDANICVSLCFALQSPRFIGTATAFSKRALILQWLP
EKLAQLPNLNNVPSSITHDVYMHCSYDVAENKHWVKNALNQVIRRHVLEA
GLQDRDVKKLGYRNGKPVMVVLLEHFHAAHSIYRTHSTSMIAAREHFYLI
GLGNESVDQKGREVFDEFHEVAGNNLIEKLAFLRNLCEENGAAVFYMPSI
GMDLLPIFASNIRYAPIQVIALGHPATTHSPFIDYVIVEDDYVGSEQCFS
EKLLRLPKDALPYVPSALAPEKVDYNLRENPDVVNIGIASTTMKLNPYFL
EALKAIRDRAKVKVHFHFALGQSQGITHPYVERFVKTYLGDSATAHPHSP
YHQYLEILRGCDMMVNPFPFGNTNGIIDMVTLGLVGVCKTGPEVHEHIDE
GLFKRLGLPEWLIANTVDEYVERAIRLAENHRERLALRRHIIENNGLKTL
FTGDPRPMGTVLLAKLKEWASENQVQLEIAE
>MS1580 unknown
MSKKLSVAVLVGIFALLAFLYGQKNKAETQLLTLLQKQGIKVNSLDFSFL
PHPTLTANKVRYLVPESSRLVAFEQVAAEFSGASLLLGDFKISNMRFNDG
EIRSEPQSPPVLYSLNFSLKPAALYLNRLENLLHFFKTKEVLDGGNNQWL
YELNLTAKNPSNDNLHFATTFKLLTRGIALKDTNASVDLNELTYSDNKQF
TLTADKIYLTTQQSAVENYEFSAENLKLNNENLGRVQGEWLASGINPQGY
LVNLTSSICNYCNSMIDVRSVNPQNSIIRFKTEFFPLETLLGILKLPVLL
SGKSDVTAELYLSEEQPTIGDFNLNVLNGKLKGVNLLSLIGQYLPINYDE
GKLKNLETGFIQYNAQFRWRGRNLHIDNMLLQTEDLILKGRGYADLQTMK
CDAMVNIGVNDAQYKQLTLPIRFFDDCTSPQYKIEINKDFRHQLRNFIKD
KFN
>MS1429 unknown
MKKLLIALLLGVTTMAQAEYRMSLFELTVKPENQQAIEAIGKHNLGTSIQ
TEAGTLAMFHTVKKDEPSKNVILEVYQDDQAYQVHSQAEHFKQFVEVAKT
AVIERKAEALNSQFLAEKRPLADFENGNYLINLATVRVKSAQNDAFKAIV
VDEMKQAMAKESGVLLMYAATLKEQPNEWRFFEIYADQAAYAQHRQTPHF
QAYLKGTNGMIESKGVVELQGKTLVTKGVFQSK
>MS0830 unknown
MKREVLLNKKVSKKPANHTDFSSKEKHFFVLYFN
>MS1196 unknown
MENLSMKYQKLENQEAHWKWLYLIKKNREGENITRYEERSLQQSKVHDLL
ESQNYPEKIEEWIANHMAESLIIKLDQAIRARRKRFFNAEKLSTKKKSID
LEYGVWLRLSKYSKKMKMTLSETITYMIDERESKALYESQMSAMKAGLKD
LLNK
>MS1773 unknown
MNKIRKLLSCRKGVSSIEFTLTVGLFFMVVFMILELARLTLFTSYWDYLL
TESVRITKNQRAENNDYASLFRTVLEQQHQQQNNAVLAFFDVRDEKIDVK
VEYAESVDDLVNEVFRQPTIVNGVAVSPTGADASIARYSLSYSYRFLVPL
PFISEQWINPMFNREIFVVQEYERPSFRYNN
>MS0673 unknown
MDDIMSNYRRDFSPGATYFFTVVINQRSDGLLIKYINEFKQAYQDVVSYY
PFETIALTVLPDHFHLIMQLPENDSDYSKRISSLKYNFSSLLPTYYRNMN
LSRQFKREAGIWQRRFWEHLIRDDRDLDNHIDYVYYNPVKHGYVSQVMDW
KYSTFHRDVKNGIFELDWGSYISESVRNLYLD
>MS0735 unknown
MMYYVIFAQDKPNTLEKRLEVRPQHLARLEQLKTEGRLLTAGPNPSEDGK
SVTGSTVIAQFDSLAEAQAWAQQDPYVDAGVYGEVIIKPFNKVF
>MS1957 unknown
MNEQQFKNELEKLTEDKNRTYMYIIYGLFILAVVFKPLAIIGAVFAFMKR
EELSVLAQTHCNYLIKTFIVAFIGSFLIFVPVIFWFIFAWYVYRVASGFQ
NFYGNREVNGESWFK
>MS1719 unknown
MKWTDAQEIAENLYDLYPDVDPKTVRFTDMHQWICQLEEFDDNPEASNEK
ILENILLRWLDEYE
>MS0980 unknown
MMILITYDVSLENEGGERRLRHIAKHCLDYGIRVQYSVFECEVTPAQWVE
LKDKLLNTYDKETDSLRFYQLGSKWKHRVEHYGAKRAIDMFRDILII
>MS0656 unknown
MNNKDIKIIIATHKKHFMPSDEIYLPLHVGKLGKTDLGYQGDDTGDNISA
KNPNFCELTGLYWAWKNLANDYLGLIHYRRFFSVKSRSERKNNPLETLYL
TSEEASQLLEQYDVIVPSKRNYYIETLYSHYANTLHAEHLDVTRKIIADT
CGEYLDSFDSVMKQRGGYMFNMFIMSKELVNDYCSWLFPILFELEKRIPA
EQYSAFHARFYGRVSELLFNVWLKQYSQSKPLKVKAIPFVYGEKINWLKK
GFAFLMAKFFGKKYEKSF
>MS0402 unknown
MLNKFRLNRRNFFVLIPLIKADKSAVVFCEDLTIL
>MS0281 unknown
MEIFTNAISYIDLNSIIAIFAAGLFGLFVGAIPGLTATMAVALMVPFTFF
MEPIPALALMISVGASSIYAGDIPGALLRIPGTPASAAYVDDSYLLVKQG
KVNRVLGLGLMSSVIGGVIGTIILALAAPSLAQFALKFSSFEYMWLSLLG
LSCATLIAGKFITKSLLTLLFGILISTIGFDEFTGQARFTFGFVSLYEGV
SFIPAMIGLFAISGAIEYYATRYKGQNPINTDITELEQTKNSLNLFKGIA
KPLIKRKGSILRSSVTGTLIGALPGAGADIAAWISYALSKKTSKTPEQYG
KGSEDAIIDASSSNNASLAGSWIPSLVFGIPGDSAAAIIIGVLYMKDMQP
GPSLFLFQPDKLYAVFILFLIANLALIPLALIVVNFLKKIIQINKDILYP
IVIIFSMVGAFAINNSPEAIIVMLIMGVIGYFLQKNHYPISPIILGMILG
PMLERNLLASLTKSDGNLIAFVERPVSAVLGCCFLLVVILQVWGIVKNFN
EKN
>MS1130 unknown
MIFNINYFLFSGKVRSKKFKNFDRTLILKFL
>MS2319 unknown
MIRHQLKSAVKNTALLCLLFLISLPAFSAKLAIVIDDLGYHPREDAAILA
MPKEISVAIIPSAPYARQRNQLAYEQGRDILIHMPMQPISQMNIEAGGLS
IGMDAQQVAHNVQQAKNIVSHAIGMNNHMGSAATADRPLMTELMAELRKQ
HLFFLDSRTIGRSVAEKIAKESGVRALQRHIFLDDSDVYGDVQRQFQQAI
HYARKHGTAIVIGHPRKNTVAVLRQGLANLPPDIQLVSMGSLWRDEKVVP
PAPFILIFSDKPAPTSVAPFEPVPLLRGVPR
>MS0613 unknown
MKKTIIAAFVVSAGVIACSSPVENRPQAPLDMQTVRHYQNKVYGGNTVPA
AQRVKEQPVVDTPMNVSDTRRQDRLDTRQTVRPGNVVIVPSIGYGYHHHR
YRW
>MS0991 unknown
MTNTKKVYYAHSEKDLPHEQWQTFSSHAENVAKLAAQFAEIFDAYQLAYN
TGLLHDLGKYTPAFDKRLHGGPSVDHATAGAKIAIERWGFPLGKILAFCI
ASHHTGLVNGDGEGDNRSTLKQRLSVPFGKGNLPELDPIWQSELPLPEKL
TFPALKPDPYYQPFALAFFIRMLYSCLVDADFLDTEAFYANLKQQDIDRG
NAPSLDQLHQQFNRFISDFRERKKALQPQTEEEQRNAKLNRLRSQILDHA
IAQAQQEPGLFSLTVPTGGGKTFTSMAFALEHAKKYGMLRIIYVIPFTSI
IEQNAQEFRKAFGEFGEAAVLEHHSTFDDEKLLDKDTKDKLKLASENWDM
PIVVTTAVQFFESLFADKSSRCRKLHNIANSVIILDEAQMLPLNLLLPIM
QSIKELARNYHSSIVMCTATQPAIQTQHGFYRGFENVREIAPNPTALFAD
LRRTSVQHIGMQSDKDLIDKLTENQQILIIVNNRRHARSLYEQAKQLDGT
FHLTTLMCAKHRSQMLEQIRQHLQAGRSCRVIATSLIEAGVDVDFPLVMR
AEAGLDSVAQAAGRCNREGKKLAEQSFVWVFQPEQQWKAPTELGLLSAAM
RSTVRCYGDNLLSVEAISHYFSAVYEQKGKDLDNKQILAKCHAAGKTLDF
PFQTIAKEFCMIESHMLPLIIPFDKEAEKRIEELRHAEKVGGLLRKLQPY
TVQIPQKSLEALFKAGRIEAINEQQFGNQFYSLIGLDLYDEVAGLDWGDL
GFITIENSVF
>MS0685 unknown
MSDCTMCCERDHRKGELKAKSAVENLKVLYKHLFLGIGSAKSRETEQNF
>MS1605 unknown
MLLEVLFYIGLIVEAMTGALSAGREKMDIFGVIVIAFMPALGGGLMRDII
LGNYPVNFIANPHWVLIVAVTALATIFIAPLITHFNRSFRTVFLVLDGLG
LILFSIFGTQIALEMGFGLTVASISAILTGAFGGVLRDILCNRIPLIFQK
ELYAFIAFFTACLYIGLQHLGLSINLTVMITLTVGFIFRLLAIYFSWGFP
VFDYHEEEMSPKEIMPRLPKRRKYKEK
>MS0696 unknown
MDALLSWEWFVPAVMFFSFFVLVFVGVPISFSIGIATLAAAAFMLPFETT
LIVSGQKIATGLDSFSLLAIPFFILAGSLMNSGGIATRLINFSQILVGRI
PGSLGHTNVMANMMFGSISGSAVAAAAAVGGTMAPLQAKAGYDPAYSAAI
NVSSCISGLLIPPSNVMIVYALTAGGISVATLFMAGYVPGILMGFGIMAM
NYIIARKRRYPVSDKPTFAEVVKYSLDAVPSLLMVVVVMGGILGGVFTAT
EASAIAVVYTFILSVIIYREIKLTQLPKIILDAIVTTSIVLFLIGVSVAM
SWAMTNADIPYMVNELLISVSDNLIVILLIINMLLLVIGIFMDMTPAVLI
FTPIFLPIVKELGMDPVHFGIMMIFNLCIGLCTPPVGSALFVGCSVSGVK
LQDLIKPMLPFFAVLVITLLMVTYIPQLSLFLPGLFEL
>MS1713 unknown
MGENTLVEVNNLTFKRGERTIYNDLNLKVQKGKITAIMGPSGIGKTTLLK
LIGGQLHPEQGEILFEGKDICQMSNSELYKVRQRMGMLFQSGALFTDIST
FENVAFPIREHTNLPESLIRQIVLMKLEAVGLRGAAELMPSELSGGMARR
TALARTIALDPELIMYDEPFAGQDPISMGVIVSLIKRLNEALNLTSIVVS
HDVQEVLSIADYAYIIANKRVIAEGTAEQLLQSTDPQVVQFINGQEDGPV
HFHYPSQDYEEELFGRGINK
>MS1420 unknown
MLKQMFITAMLFVATMAYAETVRPAYYVAEFQPTDREGIKAYSAQVESTF
KPYSGRFIVRGGEADVKEGFGVQGRLVIIKFDSLKQAQEWYNSSAYQKII
PIRQRSGNSRTYIVEGLPDNNSK
>MS0751 unknown
MSLKTSNLAFKERVNHEVNNEIMRKAVVKAQETIGANRQKMVDELGHWEE
WRDLAKQIRNHVLQNLDAYLYQLSENVIKNGGHVFFAETAEEATNYIRRI
AREKNAKKIVKSKSMVTEEIGLNAVLEQDNIQVIETDLGEYLLQISGDKP
SHIVVPAIHKDRHQIRKDLHEKLGYEGAETPEDMTLFVRKKIRQDFLEAD
IGISGCNFAVAETGSVCLVTNEGNLRLATTLPKTHIAVMGMERLAPTFQE
VDVLITMLARSAVGAKLTGYNTWLTGPRLEGETDGPEDFHLVIVDNGRSD
ILASEFKEVLRCIRCGACLNTCPAYRQIGGHGYGSIYPGPIGAVISPLLG
GYEEFKDLPYACSLCTACNSVCPVRIPLAQLINKHREKMVAQNLRPPLEK
LSILGFNFANSHPAVWKVGVNMGAKLMNKLIKDGKAPISVGALGEWTKAR
NLPQSDGESFRDWFKKRGSN
>MS1047 unknown
MHSVLFSYHINSKYHKGNIMLCAIYKSKKKEGMYLYVAKRDYFDEVPETL
KMAFGTPNFVMLFNLLGEKKLVRAENQEVLKHIQEQGFYLQMPPKQESLF
EQFKAEQKAKQTKNKTALKVR
>MS2060 unknown
MAESFSVERRFFDDKNYPRGFARHGDYTIKESQALEQYGQAFKALDSGER
APVTDEEELFVSFCRGERPAATFFEKTWNKYRSRISATKRVYTLSGVVGD
GLDEFNAND
>MS1447 unknown
MKKRLILLCTLVLLGGCTVGGGFGVGNNGAGVGISTGIGF
>MS1999 unknown
MFYINNDNFKREPYRGYCPKHLEKHFTKPRLDIRGNGYYSPNNEYVVAWY
ESDFIFKSDKAQAHAKVKFVNLTCPKYTGKNDAEYLSVLYISVPLNYLHY
FTIGSIWKGGVAKEQFAFEEFYITVEAENQDGKRENLSIVSFGESEENGL
EKPFDYTIYTIPTEYHNYGDDLNRLLSISYQNQKFIIHPLHIFMMHYGYS
TDIKRILATYPLDEVRERLFIDKVVENFDVERYVVLPKYFVKKDAIFLYH
LKYDEETTGIRVKNLVSQFRLNVRNQKHPIEIGFWHTQKVELKIRGIRLG
NAVLCAEIIGLNQPEGEDITLVLSQSKREKSGNKTEEQNNEVNNVPITRV
YTREPELDELPLTDNSPDNRTVEYNKRQFELLGRQRQIRALKKAREESNQ
KMKALTPDEIDSLGVGESDGRNGKVGLAFCFLDDTPVGKSSKLYKLWQHA
KAVAEWNHGEAHWYTPNLGFRNDGELLPVSLETNKCFDYPEIAIIIRLQI
YGETFFLIDFSMKQRDISIRGLGYKPDPNEDFLYESDMSKSELSELLSAV
HHHEHLPKEYIEKKNKEKTKLTTFNHSEAESSNWAYNAVQKLTSRAITKP
TYF
>MS0681 unknown
MKKRGKKPELDWETEEQEEIIWVSKSEIKRDAEELKKLGAKLVDLTKTNL
DKIPLDGNLLEAVELARRSVKEAKRRQLQYIGKLLRNTDVEPIRDALDKI
ENKHNQQQAMLHKLELMRDELVSKGDEGLVALLIDYPQMDRRHLRNLIRS
AQKEKEQNKPPKAYREIYQYLKDFIIEE
>MS1780 unknown
MNYRVLFFISFLILAMGLSGLFFMLPEESPNTPQQQTTSEKTAPKSQISI
LIAQTTRQIPQGTLLQAEDYALSELTVDSDDARANFDLKAWLANNENSSL
QGYLAKQTLQVGSFLSPDLLLSPQHPDYLLSSLDPMQEVAYRIYIKAENG
YIFDTLRAGSHVSVGSQQIAGGKNNKERTELIKLVGDTVVLQSKIYGQDE
KLMDRNIVGYISVKLNAQQLQKFYSLPKGANLILFPNNIWKQGEPNHRGI
FIRELRGQ
>MS1392 unknown
MRSKNEVFLQAVGFGRNFANGTLLYTPNNILEKM
>MS0119 unknown
MTETSEKSTALSNESYLRHSMVIMDKWGNGEAYDEKLIVDRGKHCQRTMV
ESMLEFGRVLIILKEHMVHGKFQETLEHEFDVTPRAAQKFMQATLKFCGE
GLQDTTPKLVQLGKSKLLELVTQDDDDLKELAEGGTVAGLKLDEVDRMSV
QELRKALRNAKAEKEAMGKVLANKDNKINELDVELAKKKKDIETRSPDRK
GGDLRKETSQIAYGAEAILRGQVRPAFDALLEHTEESGMDHTQFMSGVVA
EIELILIELKETYGLNDVPSVEADDWENQSDKSLGSVLDEIIADQQAM
>MS0073 unknown
MKKVISLICASILSLGIVGCDKKIDSTNVAQSDDSWPQKTITLIVPFGAG
GDTDFHARNLASHLEKELGARIIVNNVSGANGNAGMKQVVSAKPDGYTAL
FFHESMLTNKVVGLAQQAHEALSPVAATIVDDSYVIAANAKSGLKNLTDL
INKAKSEPGKLIYASSVGGYSYYLGRVLEQKTGIDFNIVDAGGGSDRNAA
LLAGKIDVNVNPYGVMKSYFDSGDFIALATINNERNKLFPNVPTAKEQGF
DWNAERYYFLSFPKGTDEKIISKMEQAIKTVVDNPDFRKKTEDAYSVSPT
FVGTKDLLNHLDSALKEFETNKDLVNN
>MS1651 unknown
MSMQIKSIQLAFSVLYHYFDECVKKALSNLPIQRVDIPDSLLAQAESLVL
GKLPSEKKKDLPLTSIFEGISQQNNSQQYLYDFKPLSPDSIFPDLQRNEG
HQPFELWQHLAKAVEEIPTSHRENINLWLDHFDTALQCYTSQITCPYDQS
ISFYDFTKAVAAFVVASMDKSADKNRPFLLIQGDFFGVQDFIFSGGSQSN
KQAAKLLRGRSFQVSLFTELAALKVLNACDLPATSQMMNAAGKFLIIAPN
TPEIHKKLDDVQKELNEWCIKNTYGLIGLGIAKMSAGKVDFEQKNYEKLI
KLLFENLETQKLKRLDLTDTTQSVQEESYPNGVCEMNSFFPALPNSNRSI
MTEDQVKIGELLAKKQRIIVCDVGTEINNSYRTQTLKLDMFGYNVIFTDS
RKDTKDFGHPVKLYQIHRFWDFSLAKNTKDELWNGYARRYINAYVPFDEQ
EQIKTFDEIAQADEGINALMTLKGDVDNLGTIFQKGIQPANIAKMAALSR
QMNQFFSLWLPAYCAEYSPNMYTVFAGGDDFFLIGPWHSTQKVAFEMQQA
FKRYVAENPEIHFSVGMVMSKVGLPVPRLGDLAEMALEKAKSIDSGKNAV
TIFNRTVKWTDWQQLCDLEDEIHRLAKDYNISTSYLYSLIRLCEQANDKN
NIESTMWRSHFYYRTARYVIDKLNQEKRDKALNEITISLGENGISQYKIN
FAIPLTNYFYQKR
>MS1630 unknown
MRYFIGSFTFLTANGIIFLTIFRIQPLFRLSALILILLLFSAFISVGLAL
TYKLLKSFINSSILNRTLRAVYPIGMLILVGLSIYNAYTPKVIHYQIELD
KPLKAMRIAVASDFHLGKLFGSEQIDKLARIIEREKADLVLLPGDIMDDN
LNAYLAEQMSSHLAKLKAPLGVYATLGNHDFFGQQQAIADEINKTGIKVL
WDEAVTINNEFVIVGRNDDLNKARPTTKRLLQNVDTNLPVFLMDHRPTEV
TEHSALPIDVQVSGHTHNGQIFPANLIIKAMYRLGYGYEKIADGHFFVTS
GYGFWGIPMRLGSQSEIFIIDVKGKN
>MS0536 unknown
MKNNPLKTIDRTIKVRSVFSKFFKVPYATKISR
>MS0190 unknown
MKKDRTLNKSAVFFCEILLPNQAELPKLRLNFPAGKVSFFFLFFHFKGCL
RATKGLHNVK
>MS1044 unknown
MHKLIIIRGHSGSGKTTFALKKIAEFKRQYPVGHVFHIENDHYLIENDKY
IWTEQRFRQARLQAQKTIYRAFRFCRKHNAPDCLIVISNVGVNKQEIQCF
VHQAEKQNMQVEIYRLRHFYPNTHHVPEDTVMSMYRHLCANPIEGEIIID
>MS2198 unknown
MKLKSLFCLCLALPLMAAANNEAPQNPQNIVSFSAETEKEVPRDLMQVSL
YLHEEGNNLKNLNKVIAEKLNKGLTLIKQQPAIEIQSNNRQTQVRYNNKN
QKDGWIATAELVLQSKNFSQLSQLIEDLSPLFAIGNIEAALSKEAIVAME
DEMTDSVLAKFQAKATLIQHSLQAKGYRLLDINIDSLNEHYASPMVNHVA
MKMAVAEQAAPVQLESGKTRLKAIARGRIELIKE
>MS0123 unknown
MNECEQIKQVWKQEYDEAAEAAAKTERSGNYYQAAELWKKAKEKALNLSQ
KEWCKRRYQYCISWASRREK
>MS1247 unknown
MKKVSLATILALSTMGMAFFANAADNVQTNAPAANAPAYEVMPCGMVREY
NPMCNGAMCNRGYPDGRANMRRGFKNMPGNAPYMGMQMGQRGGFVANQSV
TRVADAGKWEDDQMIVLEGNIIKRVGRKDYVFKDGSGELEIEISRRAWHG
DIFSADDRVRLVANVEKSWGKTEVLAVHIEQIRPDVAAPSQGNKTGNNQ
>MS0192 unknown
MSLFSLWVMAFGLSMDAFAVSICKGLAMEKFQWCGALKAGLYFGLFQAVM
PLIGFLLGVQFSEYITDYDHWVAFFLLALIGVNMLRESLSDEDDEDSCSN
DFNFKTMMTLGFATSIDALAVGVTFAFLSVDIYSSVVTIGLITAALSIIG
VKSGHFLGKKIKTKAEILGGLILIGLGVKILMEHTLFG
>MS0018 unknown
MNNQKALRELTLRGMILGALITVIFTASNVYLGLKVGMTFASSIPAAVIS
MAVLKMFKGSNILENNMVQTQASSAGTLSSVIFVLPALLMMGYWQDFPFW
QTLLICVSGGILGVIFTVPLRNVMVVKSDLPYPEGVAAAEILKAGDEAGK
ESGVKEIMAGGIIAAVVSFLTNGLRIITDGASLWFKGGAAIFQIPMGFSF
ALLGAGYLVGMMGGIAMLVGTLFTWGAAVPYFTATTPMPADMGIADFAMS
LWKSKVRFIGVGVIGIAAIWTLLVLMKPMIQGMSQSFRALKDKNNINLDR
TSQDLSPKAMIYTILGSTVLIIIALVSFLQPVGLPTSTTFLFVVLCTLLA
VLIGFLVAAASGYMAGLVGSSSSPISGIGIISIVLISLVLIVVGHSLGLM
DSKDGQRFLTALTIFTSAIVFCVATISNDNLQDLKTGYLVHATPWRQQFA
LIIGCIVGALVITSVLEILYHAYGFAGAMPREGMDVSQALSAPQATLMMT
ISNGIFSDNLEWTYIFVGIGFGLSLIIIDTLLKKSSQGRLALPTLAVGIG
IYLPPVVNVPLIIGALLSWLIQRHLRHYAKRSGKDISELNKKAERFGTLF
AAGLIVGESLIGVIMAFIIAASVTSGGSDAPLALELADWDSMAEILGLVA
FIIGIAIFTRRVLKAKKA
>MS0106 unknown
MAKRIRKTQMKYLKKLRKRWQGWRFAKQNPVVAESRSYITLALKLGVENP
VMRKRRRNP
>MS0851 unknown
MSDYRVEYVIAKFIAIANYVNMLFGRCLYAKKIIFRFLIFARHRMGSILC
GL
>MS1989 unknown
MFKTYFELKRTNHGTKLINIAQIIKRKKSNI
>MS1360 unknown
MSLTLLALDTSTEACSVALLHHGEKTHLDEVAQRSHTKRILPMVDEILAQ
SGLRLNQLDALVFGRGPGSFTGVRVGTGITQGLALGADLPVIPVSDLAAM
AQAAYELHQAEQVITAIDARMNEVYFAQLIGEKVRSEFGEFLQWNEVIAE
QVCSPEQAIAQLRANRTQGDWLNVGTGWAAYEALTKTPFGKISAIQLPSA
LYMLSLAVPAWYNRQYVKAVDVEPVYLRNEVTWKKLPGRE
>MS1782 unknown
MLTNLTTKTYIATTEAIRRFKQDHKGVTAIEYGLIAVVMAAFIVYVFADD
TSFVQSLKEKFSDVSKSVGNATFKE
>MS2340 unknown
MAGLFTQPGRRRYGVAIFAGIIGGLISAFVKWGAEHPFPPRSPIDLFTAA
CPQPVLDALNSGAIVMDQALQQCSRAFLNPPHVFLRDVFGIDPTAPAFMF
ADQAFNWIGVTHITFSLVFAIGYCLVAEVFPKIKFWQGIGAGLIACVVVH
YIVFPAMGLTPPVAEWPWFEHVSEIVGHVFWFWSIEVIRRDLRNRITHEP
DAEVPLDQPYR
>MS0733 unknown
MACIIADFPQDEKYKGRNIKVRSKNGNFQQKT
>MS1404 unknown
MTTYSQPAIIWPEKYTPGETDNYASNEVIIKDLSVLDVWEYLIDTKAWPT
YYNNAENIVVGDGSQTKLAANATFVFDTFGFHVSSKVEEFELSNDGNLAR
LAWSGTFGEGDEFSDVYHAWLIENLPNNRVRILTEESQIGKLPQQLAQTL
PNPMINGHQAWLVGLANSAKNKTSY
>MS0982 unknown
MIIRRNMMSLPQVYSLDLGRKVNAKEADIAYQKGLIRSQKNFRCPHQLCG
IAITCANLERPKQERKVDPYFKSVEYHKPSCPFAEEERRIKLHEADKNSL
YENVASGEILVNLTEPAPKKQDSSDISEVEKGSFSRATQSSDSEKEKASI
NHTKTLSVLVSSFLNNENFQITLPKPYQEKIFLKDAFIKIDGQNLSNLEQ
NCWRIYYGKAWINKLSNGDYRIVFDNKMKDPDLRKNAVCPSFFIPKDWID
NSPYEKFSKSQMDKLADNKWHREVFIFSDVPSLSHTKEYINFMLEGLPFL
EMIYLKK
>MS1541 unknown
MREAKSAILFFTGLISNIFLFLSCQNKPQL
>MS2314 unknown
MPFTFAHPVTVLYFPRNSRYFHFPALVLGTMSPDFMYMLHWKTDVGGHTL
FGSEWVNLPLCLLFYAVYRLILATPIKQHLPAFCGSNVPQVTFKNPLAWL
IVFLYSAWIGMATHIALDELTHDGGYFVQLFPILQTKIIFHIYDWLQYGI
GAVGLISIILYQRRMAGKYPYRSSRSAKQKWFFWLSVVSLTVIIFYLSNP
LYPLVWNEVASIVLRIINSFFISLTIHGVIFTVMKKRSLKIG
>MS1216 unknown
MSKMARTTRSCLHLWYVVYHSFFLKWIVMSLFSRIPFDKKLIESAIARFE
QESSAELRVYIERNLPESENLSCVDRALQIFMQLEMDKTQAHNGVLIYIA
HKSHKCAVIGDLGIHQFVGDNFWQQQCQLMISYFKDDEYTQAVIAAIESI
GKELAIHFPVKPDDKNELPNEVIING
>MS1234 unknown
MKMNKKFAQIVKNPAFRNMVLKTIFNVTNVMSATKYLR
>MS1401 unknown
MSVLHFIGIDVAKKKFDVAYLKDKERQMVKTKVLDNKPAGFNQLLDWIKK
NVSNDFSTIHITLEPTGVYHEALAYFLHDNGFVVNLINPARLPKFAEYKG
FVHKNDRGDCKLLALLGAENPHEYWQPEPLSIRQLKAKLSRLEALKSDLL
RENNRLEQAESGNLPDEVLQSIHHIRKALQDSIKALSQDIDDHINGNPEL
KKDKALLKSIPGVGDVITKQMLVVYHSKHFQKAADMAAFLGLIPKERTSG
TMKGKIMLSKRGSPQIRALLFLPAVAAKSYNPDIKAHYERLLAKGKTKMQ
AIGAAMRRLVHICFGVLKNKSVYQPQTILA
>MS0229 unknown
MTWTYILSILAIIIIVAMAGYSIYIFRELHKQKRRFAQARQARIARLHES
ITIIAKAMQSGECNHSEGVIRLKMLLEPLGQKEIDAYHSMHRLYQTVKDM
PTHDSRRALKRNERMKLDLARETAEAELEEKIKSELEQLLADIASYQKIK
Q
>MS0690 unknown
MVSFPQIYLFSAIALFQSACNQHYNLVYHFTNEFQ
>MS0214 unknown
MLKINYERRLVMKEVSSKSLNDDELALLDNLLLEYANEESDEGIFTLSEL
DGYLTAIISSPMLIQPSTWIPAIWDNDLPEWENEQEMAMFFDLLFRHYNS
IIMMLQTGLEYYSPCFEYSNFTDGDYPIVDDWCFGYMRGVKLADWQNLPT
KLQPYLKLIEDQTHLHSSLDDYVSPSLQEQNELADRLIEAAVKIYRYFR
>MS1833 unknown
MKIQLIAVGTKMPDWVKVGFEEYQRRFPKDMPFELIEIPAGKRGKNADIK
RILEQEGKAMLSACGRGKVVTLDIPGKPWTTDQLARQLESWKNDGRDICL
LIGGPEGLSPECKAAAEQSWSLSPLTLPHPLVRVVVAESVYRAWSLTTNH
PYHRE
>MS2217 unknown
MVCYSPICSYNYINKCKAVLLAPLFFTPESKCTQKSAVKNFRNFYRTF
>MS0130 unknown
MKQNTVAKNKEISMLDLYPLPQDPFQIATVVLAVICLIFLFIMARRKRDV
QELQQDLNKNILDFNQLLEKFDTLTAAKNQLDQDVIKAQTTAEGLQIRLQ
ERNELIQGLQTELNEEQLRHETLTGSMNTLKERFGVASALVTNLQQQLVE
SQNAVARKEQDLNKIQEKTTALSQELTELKTTLSEKEKNFAEQQQAFAQS
KQQLSAEFQNLANRILEEKSQSFSQSNQIALDALLKPFREQIDGFQKRVN
EIHSESLKGNANLESEIKRVLNIGNQMSQEANNLTSALKGEKKTLGNWGE
MQLERALQLAGLVKGEHYEAQAHFKDAQGKNNYPDFVVHLPDNKHLVIDS
KMSLVAYENAVSTDDENKRQHFLREHVKSVRNHMDDLWRKDYTNLIGMRS
PNFVLMFVAVEPAYIEAMKADLNLFNYGYEKNVILVSHTTLMPILRTVAN
LWRIERGNAEAREISERAGDIYNQICLVAERLAKLGNTLSTVNGHYNSAV
TALVGNQGLVGKVERFKDLSAKANKAMPAVEMLHSDLDTEKLLVVKAED
>MS0009 unknown
MNNNYKKISPILTAVLLSACTAQVPLPKTCEDFINEYAKLSVDTKKIIPE
TLLGEDMRDYILADRYTLREKYQDSVNSSYQSIKTNLGRNAAEMSLKAIE
QSCYIGTEQIKALDFMQ
>MS0930 unknown
MCSSSHKENKFVKIYISSGKFICQMAKVLLFISPFKIPEQNLEVLC
>MS1731 unknown
MWVKTPNKTGYNDLNALSVQIMKTALLILLKGLNN
>MS0517 unknown
MDSIPLSTLFITLFILLILSAYFSSSETGLLSLNRYRMRYLAEKGHKGAK
KTETLLKKTDKLLSLILICNNLVNISASAIATIIGMRLYGDMGVAIATGV
LTFVMLVFSEIYPKTIAAIYPEKVAFTSSHLLILLMKLFSPLVFFMNIII
QGLIKITGLKTETKAHSISPEELRAIVNESGKFIPSAHQKMLLSILDLEE
VTVDDIMVPRNDISGIDIDDDWKAIMRQLNHAPHGRVVLYKGDMEQNVLG
MLRVREAYRLMMDKNEFTKEMLIRAVDEIYYIPEGTPLTAQLLNFRHRKE
RIGLVVDEYGDIKGLVTLEDILEEIVGEFTTSTTPSINEEITKQSDGSMI
IDGSANIRDINKMLNWHLNTDEARTFNGLILEYLEEIPQEGTVCEIEGLQ
ITILEVSENMVKQAKVVKL
>MS2066 unknown
MKKTLLAVAIGGAMFATSAAAVDFHGYARSGIGWTSGGGEQTALKVNGGG
SKYRLGNETETYAEFKLGQELFKDGNKSIYLDSNIAYSIDQQVDWEATDP
AFREINVQFKNFAEDLLPGATLWAGKRFYQRHDVHMNDFYYWDISGPGAG
VENIDLGFGKLSLAVTRNTEGGGTATYGQDKVYYIDNNGQIQYRYEDRKA
DVYNDVFDIRLAELNVNPNGKLEIGFDYGNAHTKNGYHLEPGASKNGYMI
TLEHTQGEFFGGFNKFVAQYATDSMTSWNTGHSQGGSVNNNGDMLRLIDH
GVVQFSPKVEMMYALIYEKTDLDNNQGKTWYSAGIRPMYKWNKTMSTLLE
VGYDRIKEQSSGKKNDLAKVTLAQQWQAGDSIWARPAIRVFGTYGHWNDK
FNITDRTNAGYKAKDAEFVAGVQFEAWW
>MS1650 unknown
MNIKFGKDKSPEIFSSIAEQTAEQIKSNKDKNKTTQLRKFYDELAMWNER
VQLAREDKEAKFQELVPFIKMLKAKVAYAEGRKHIDKNFSDVFNRCIDQA
NNAETLRDAKLFMEAVMGFCKLEELKR
>MS0717 unknown
MFYLAWVVGVLLAILASVMITIRIEKSGKFDE
>MS0161 unknown
MLYTFSKADYAPRELADLLARLTTQDAVLLWQDGVLLALKYGDYFVKHSS
QVYLFEPDIRARGLSALIQQKNKSFNRIQMPQLVQLTTRYFPQLAL
>MS1147 unknown
MLFSLIKYGGLFYYKFYKNLAKFTALLCLINTA
>MS2362 unknown
MGTGFACGGWALAWTVYIFNKGKYHPLVRPALLASLFGYSLGGLSITIDM
GRYWHLPYFFLPGQFNTNSVLFETAVCMTIYICVVTLEFAPVWLGFFGLK
KLFKKLNKIMFFVIALGALLPMMHQSSMGSLMIVAGHKVHPAWQSYEMLP
VFSLLTAFIMGFSIVIFEGSVVKASLAGQAPDERHLFSQLTKTAAVLIAL
FLMFRFGELIYHNKLHYVLGLFKFEAWMFWAEVWLMTLPLLALFLGERRN
DGRWLFVSALSMLLGAALWRLDYSMIMYNPGNGYKYFPSGQELLISIGFV
SIEVCAYILIIRLFPVLPVLKEANKETSEYIIAEKAALSENLAQKNS
>MS1001 unknown
MIFPENRIIKATKFAQKFMPFIAVFSVVWQQFYAKSDLVALAIAVLCAIV
ALCIPLQGLYWLGKRAQTGLPAQSAVKFFEISKLLEKKNVTTSQIERPTY
QHLADLLAKAQKHCTKEFWEEL
>MS0174 unknown
MEYLIPKSAVVFEEEIKKSRFITYLRHTEGLVEAKAFWQDVKLRHPGARH
HCWASVAGAPNNSQKLGFSDDGEPAGTAGKPMLSALQGSQIGEISAVVVR
YYGGILLGTGGLVRAYGNGVQQALKLLETTVKIERQVYGLYCDYGQVNWL
QLLCERYNVLIENQLFQENVWFQLAISDDKLEPFKQELTERSAGQLTIEP
AE
>MS0572 unknown
MKSMRKFIKYFLLTVVFVFHVVLFAGINYVFPHYETTKITGVEVKRVDKD
GPITKANPADGPTRDVYYIYTQQPDKQKPMVYRNEDTRWGFPFYFKFGSA
DLQAKASTFAQDQRLVEIKYYGWRIVMFDEFRNAVSMREVTEDSGSHPIL
SYIFYFLGIITLFFSIQLIRGWFDSEA
>MS2005 unknown
MDTTYFGRAFGIMVLYDSISKQALFVEAVKYETNALYAAALAELKAKNIE
IQSIVCDGRKGLMQLYPDIPTQLCHFHQVQILNRYLTRNPKTDAGKALRQ
LALSLKYSTQSSFQAAFEAWYRQHKAFLNERSLNEKTGKSSYTHRRLRSA
YFSLKRNLPYLFVFEDYPDLDICNTTNLLDGKFADLKQKLRCHQGMKRDA
KIKFIKDYFSYK
>MS1865 unknown
MGAKLLPYFDNANAISLHSFPRNVGEVQQGGGKILTISHCMTAPLFTRVN
KKSDCFVIKVRSILEKFCW
>MS0781 unknown
MNKNSHKKDRTFMYGLRYCLICTANGDSFDF
>MS0589 unknown
MIDWDFSFNYSALFEISVKFTVKKTDFLIHLAQFFANSI
>MS1294 unknown
MQTLTGLRRLMVINVLVIYLAAVNIAAYFLMKIDKKRAKNKEWRIEEILF
FSFCFMGGFIGIHLGMVHFRHKTKN
>MS0052 unknown
MKALAQFVGKAIETICVIILATMSVLVFLNVVLRYGFNSSINITEEVSRY
MFVWLAFLGAILAFNENQHVSVTVFVEKLSPSAKKLLHLITDVIMLFCCY
LIVDGSWIQFNLNLNNLAPISGLPQGITYLASTVAGFSIGILILARIATN
IAALVKGETK
>MS2158 unknown
MKAMKKLSKILFIASALSLPTTMFAADTQSAAQTTQGVKQMTLSARQLSL
AQIGAFTATGDMESLKTAVNQALDSGLSVNEIKDAMVQLYAYTGFPRSLN
ALNALAETVKEREAKGLKSEQGKTATPLPANTDILALGSQTQTELTGQKV
DISALSPEIDRYLKTHLFGDIFASDLLNWQEREIVTVGALSHLQGVESQL
NAHIGISKKNGVNDEQIAAIKAIQPSGLPQLSQFPIGEPNDAYAQYFTGK
SYLYPVSTEQVKMFNVTFEPSCRNDWHIHHATKGGGQMLIVTAGRGYYQE
WGKPAQELKPGDVVHIPANVKHWHGAAKDSWFQHLAVEIEGENTSNEWAE
RVSDEEYAKLK
>MS0813 unknown
MIKVMVDKLKFFYLKKRNFVFIFLFVGKMKLEPNYFLLKNCFNNA
>MS0385 hypothetical protein
MCKNTEYLFSRHFIHMHIILFILLIILIKFLPHFGVLFAFPLTAVFVAIL
AMIVAALGPWEFYKFKTTVDPRHLNKTSMLVTSGIYRYSRNPMYLSLVLF
LFSEILWLGNWLGIVGIVIFVTYLNLGQIKREEAALAEKFGKTYLAYKQR
VRRWI
>MS2307 unknown
MLIKIFVHITLMFRKNTRFKLKKIYNARVIDFNPIQIRNSKGTDMSDEIE
LKLAVSPRAADILVQEIARYPILAQKKTFLANCYYDSADGYFAHQKMGLR
VRRENDRFTMTLKTNGNVLGGLHIRPEYNVELESDAPDLSKLSIFNETLP
KLPADLQVQPVFNTDFERHIWLLEGENREQIEVALDRGEIKSGEKTEIIS
ELEFELKKGNVADLLSFVAGLNLTDGVRLSALSKAKRGYQLAYNQSRKPV
DWLDKWRDILKSEENHGNLTAQLKALFHHEQQLVEETVALKADYFARNFL
TSVERIGAFFNLYHHYIEQPNLLGRIVNEKLAQGKNVDDSVISELTESNN
YLFNQIRDLIRLHSETKDNLLALTKLIALLHEAGYVRRMLNLIRLTME
>MS1900 unknown
MKYLQNAGTRKKCGKKSGNFYRTFEQLSLIA
>MS0113 unknown
MYSTVKHIVPQGEKRNMTNTVKTANQIKLEFHQQGKTISSWAKENGYSRT
DVSRVINGLAKGQRGKTLEIAVKLGMVIL
>MS2168 unknown
MLDIKPILAIKGTSMLIWNDIERNELMYHLVGFDLENSNVIQNKSNSDYI
ATLADGQAEKNLYNNELDSGSTIPLVSKEQENDPVTDLVTPLAETLVRLM
EVLGNEEKGIVQLGIELSVADKKNIRKTYIEPALKLGLIERTIPEKPTSP
NQKYRKIKH
>MS1446 unknown
MATLEQNLQQMLQGSVEDLGCELWGIECQRAGRFMTVRLYIDKEGGVTVD
DCADVSRQVSAILDVEDPIADKYNLEVSSPGLDRPLFTLEQFQRYVGQEI
SVHLRIPMLDRRKWQGKLERIEGDMLTLIVDDQEQSFALSNIQKANVIPK
F
>MS0883 unknown
MYRNKLLFKITDSFVTGVMTMQNFEYYTPTKIVFGKQTEQQVGELIKEQG
CQKVLIHYGGNSAKASGLLDRVKASLDNAGIAYTELGGVVANPLLSLVYQ
GIELCKKEQVDFILAVGGGSVIDSAKAIAYGVAEPDKDVWELYDRKRQAT
ACLPVATILTLAAAGSEMSESSVITKEEGDIKRGYSNNLSRPVFSILNPE
LTMTLPKYQTASGNVDILMHTMERYFTPHDTMEITDGIAESLLKTVMKNA
QILAKDPQNYEARAEIMWSGSLSHNGLTNCGGGNGDWATHMLEHELSGMF
GVTHGAGLAAVWGHWARYVYQALLPRFERFALRVMGVAPAENAEQTALKG
IEAMENFFRSIDMPTNLSELGVNATAEQIAEMAKKCAIASKGCIGAAKPL
YEQDMAAIYTAAQNA
>MS1347 hypothetical protein
MTAPPASWGSEILPIPIALTDLFHIQQRKITMKFETQCLHAGYSPKNGEP
RVQPIVQSTTYTYDSAESIGKLFDLQEAGFFYTRLANPTTNAAEEKLAAL
EGGVAALCTASGQAATFYALMNLVESGDHFISTTNIYGGTYNLFAHTFRK
MGVEVTFVNQDDNLDELRKAIRPNTKAVFGETISNPTLRVLDIEKFAALA
QAANAPLIIDNTFATPYFCRPFKYGANIVVHSTSKYLDGHAVALGGAIID
GGNFNWEQEKFRQFSQPDITYHGLVYTRTFGKAAYAVKARVQLMRDLGAT
PAPQNSFLLNLGMETLPLRMKQHYANAQAVAE
>MS2140 unknown
MIVEIYQDEAAYQRYRETAHFKAYIVQTKDMLLDKKLHELTGMTLMNKGR
F
>MS0522 unknown
MKFYRTLEDFKVISFDLDDTLYDNSQVILDAERHSVDFLREISQIPQLDG
GYWRYWKNKTALDFPLLAEDVTQWRIKTIVELLRAHQKSAVEIERISHAA
MEDFFEWRHKMQVPQQSFEVLNKLKRQYKLAALTNGNVTPSRAGFDQFEL
VLTGGVQGRAKPHQDLFRQTAGYFNVRPHEILHVGDNLVTDVQGAIQAGC
QAVWINLSDKKIQHFSEATLVPTFEITDLNELLFFRNL
>MS0065 unknown
MSRLDMQKLILADDFTGANDTGIQFVKNNIKVDILLDISKGYSGKSDVLV
FNTDSRAVSIQEAKERVTRVLSLYEGMSVYKKIDSTLRGNIGAEIEACMD
ATNTLIAFICSALPDAGRIIKNGICYVNDVPLLETEFATDPKTPIISSSV
KEIITSQTDIPVIEVMHDELCRPMVVNAKIKQAIAHNQKVIFSFDATTNQ
DLVRIINLSNSLDESVLLIGSSGLAGCMTMRKAILPMLFVVASMSEKTTQ
QVNYIRHDETNFVIDLDTELLLSSNQYNDSVIKQALAQFELGKNVIIKTD
SSIEARNNVDNLSEKLALTRAELGDHICMKLSALTKEILIKNFYQLSAIF
LTGGDIAIAVAKALNADSYHIAGEVENGVPFGYFLNSPLSRIPVITKAGG
FGSDAVLKNTIEVIKNLS
>MS1601 unknown
MTDDINEIVKSAVNFQRISQKKQKARKDPNAPYVRPKLELPEGHNKLLLH
TCCAPCSGEIIAAVKASDVQFTIFFYNPNIHPHREYLIRKDENKRFADKN
NIPFIDADYDRDEWFKRTKGLEHEPERGARCTKCFDMRLERTALYAHENR
FPVIATSLGISRWKNQEQVYDCGRRAAARYEDVIFWDFNWRKDGGSARSD
KLRKEERFYKQEYCGCVYSLRDTNKWRESRGLGKIEIGTVYYSVDE
>MS0091 unknown
MSIAINQIVNANVYIDGNSQIGKAQQIKIPDIEFEMVDHKGLGLFGTIKL
PSGAKAIEGGVNWDSYYPEVRAKLYNPFKNFQLQCRSNLQVFNAQGLAAE
EPMVTIMNVSSVKIGGTDVESKENAKFDDTFAVHSIKQTVAGKEILFIDV
FANIFRVNGEDVLSKYRTNVGQ
>MS1036 unknown
MTDKIYDLHCHSTASDGILSPSEIVQRAHEQGVQSLALTDHDTISGLTEA
RRQAELLGVEFINGVEISTSWENKVIHIVGLNFDENSPEMTALLAKQAQL
RLNRALTIGEKLAKAGVANAFEGASALAKGEVTRAHYARYLVQIGKVANE
NQAFKRYLSQGKSCYVKAEWCDIPAAISIIKQAGGIPIIAHPLRYTMTAR
WIKRLIADFKNWGGEGIEVSGCGQTADQRQLIARWANEFELLASVGSDFH
FPCGWVELGKSLWLPENVTPVWSQFGDKPKYLQNTCKS
>MS0033 unknown
MEKPTALLTQPCLCQSGKQYTDCCAPLHTRQTLPANAEQLMRSRYCAYVL
QLIDYIVETTVPSQQQLLDRTILQQWAKTTNWIGLEIVSHREKLSKIHSA
VEFNAFFATDEGKQVHNERSLFVQINGRWYFVDPTVPLPNNKQPCVCGSG
KKFKACCGGLL
>MS0232 unknown
MVLMPAGLSIKSAVKIDKVLYNPRFLYFPPYKRISWH
>MS0224 unknown
MMKFNRIRNIFMKNKLIFWAALSGFFSIAFGAFAAHGLSKILEPQALNWI
DTGLKYQFFHTLALLCLGCFQLLYMPQANVPACRYRLLNLIGFSWFAGIL
FFSGSLYALALGGAHFLVWLTPVGGIAFLVGWAGLIWLSLRH
>MS2238 unknown
MRPLINPTKSLIVFSFSELNSAYSKGQYNAIQAKFANV
>MS0617 unknown
MEISTLQVILVFLVSCVCGAGSILDEFQTHRPLIACTLIGLVLGDMTTGI
IVGGSLELLALGWMNIGAALAPDAALASVISTILVIVGGQDISTGIAVAI
PLAAAGQVLTYVVRAITVGFQHAADKSVEDGNLARLDWIHFGALMLQAMR
IAIPALIVALTAGTDVVQTMLNAIPPVVTTGLKIAGGFIAVVGYAMVINM
MRAGHLMPFFYAGFVIAAFTDFNLVALGVLGTIMAVIYIQIHPKYNKSQQ
VVVAAASNNDLDNRLD
>MS1372 hypothetical protein
MYLPSLAIIGGLILLIWSADRFVDGATATARAFGMPQLLIGIVIIGFGTS
APEMIVSALSALNGNPGIALGNAYGSNITNIALILGLTALISPLAVNSQA
LKQELPMLIFITAISALLIYDNEVSRLDAFVLLFIFFIYMSWSIINGLKN
KNDSLAREITEELAEQEEMSLKQALMWLLVGLVLLMTSSQLLVWGAVEFA
HYFGVSDLVIGLTIVAVGTSLPELASSLAAAKKDQVDLAVGNIIGSNLFN
TLAVVGIAGVISPMQIGPEVFNRDMLVMSALTVALLIFGLGFGRSKKAGK
INRFEGLLFFVCYIVYNLYLFQTAV
>MS0125 unknown
MTIHSAKLQLVVTADKDDLNIKTGVDCYDLPHQLTEIMSDLLVKIPVLIR
SAWFYITDNYADAENGFDVTLTFHFEKEQGDDWSASAKSTHPGTVEDLLL
GMAKMIFQEDPIIDELIEKELEELDLPEYVQHFDPTC
>MS0128 unknown
MTIAKFDNEDFRNKAPDLLADLAKHSVNIIKQHADIEDDLAENIGMLIAM
KIGESWGGLNIYMPKAQTLFFCEREKQIYNDFTGNNHAYLARKYKLSLQC
IYQIVKRVQKDEINKRQYQMFRED
>MS0134 unknown
MIFLLHLGTFMYKTFKKLTALLGALAALSACHSAVQPAKTVFLAGATGVI
GEPLGKALVAKGYHVYGTTRSAEKAKQLEADGITPVVLDIYDAAAVEKAV
VNAKPDVVISQLSSLPKGLKEEEMAEGLKRDNRIRIAGTRNLIAATEKAG
TPKFITQSFVFYAESATPPIEESALLSTKDPVYGESTAAMMNLEKQTLAG
KFTPVVLRYGWIYGGKSGFNAPIEGYSTIHIDAVVDATVRAVEADLKGIY
NVSEASPFINIDKFRKAVPGWKDK
>MS1955 unknown
MKYAFIDYENLHSLDGLELQNYKRIFLFIGANQTNIRLTEKFDDEINVTF
VTIKDVSSNNVDFHIAYYLGKLDATVDKNIEFHILSKDQGYNGICSFIRH
QRENRHCSRIAPAVSEPLALPKPDESSKQKIEIIFKEYKSFMVKREKKHL
PTKTQSLRNNIHNQTSLKGLEKQDVNNVIIKVINKLSQEKLLKITDSKVS
YP
>MS1980 unknown
MLGVIADDFTGASDIASFLVENGLSCVQMNGVPKAPLADKVDAVVISLKS
RSNPVNEAIEQSLNAFNWLKANGCSQYYFKYCSTFDSTEKGNIGPVTDAL
LDALNDDFTVITPALPVNGRTIFNGYLFVGDTLLSESGMRNHPITPMKDA
NLMRLMDAQSKGKTGLVAYSDVIQGAARVKERFAELKAQGYRYAVVDAVD
NAQLAVLAEAVADLKLVTGGSGLGAYMAARLSGGQKGANAFVPAKGKTVV
LSGSCSVMTNKQVNAYKAKAASIYLDVESALTNANYADELYREVVKHLDE
PLAPMVYATVPPEQLHEIQAKFGGDKASHAIENTFAKLAQRLKNEAGVVN
FITAGGETSSIVVQQLGFTGFHIGKQIAPGVPWLKALDENISLALKSGNF
GKEDFFEYAQGMLL
>MS1026 unknown
MITETLFNAENITANSPQLEQLKQLFPNCFDTSGHFLLEKFQAEIAQHTD
ISHEFYSMNWLGKSYAKLLRNLPPETLLAEDVEHNSKEENAHSQNVLIQG
DNLEVLKHLKNAYRNSVKMIYIDPPYNTGSDGFVYQDDRKFTPEQLATLA
NITPDEAERILNFTDKGSNSHSAWLTFMYPRLYVARELLKEDGVIFISID
DNEVAQLKLLCDEVFGEGNFVAKLPTIMNLKGNNDEFGFAGTHEFTLVYI
KNKNSVEDLNGIPLENEDLAEYSKEDEIGKYKQGATLMRTGEAGSRNARP
KGYYPIYVNTELTRMSLERQKEDDFEVYPKTTKGKDMSWRRSPETLSKTF
SEFIIKKTSSGISFYKKQRLEEDLEKGKKPKSLFYKPQYSSGNGTTLLES
LFGKRIFNNPKPIELLKDFISIGMGKNDLILDFFAGSGSTAHAVMQLNAE
DGGNRQFILVQLPEQTDTKSEAYKAGYKTIFDITKARIEKSAVKIREDFP
DASGAKSIDSGFKIYQTTDNFNAVAEDEFNPNQAQLPNLTSLTESQIQTL
LTTWRVYDGAKLTEIVQAVDLGGYIAYLCDKRLYLLHEHFNSQHLLTFIQ
KLDNDTAFNPNRVIVFGNHIESAMQQELNQALASYSNRKNISLSLIVRA
>MS0707 unknown
MEKIVSLLIIPAIWVKKCGYFRRSFDFFQL
>MS2263 unknown
MHIIHNFPCLLKRTELSAVRILLLYWDNYRNFNILQNSEKNDRTF
>MS1018 unknown
MEKIMENQSFLQNFFKLNQHKTSTKTEIIAGITTFFTMVYIVFVNPSVLG
DAGMDKQVVFVTTCLIAGFGTMAMGLFSNLPIALAPAMGLNAFFAYVVVG
KLGYSWEVGMGAIFWGSVGLLILTLLQVRYWLMASIPLALRVGIGAGIGF
FIALIGFKNMGLVVANPATLVALGELHDPKVLMGILGFFIIVVLAARNIF
SGVLVSIVVVTALALQFDENVIYRGLVSMPPSLDAVVGKVDIAGALDIAL
LGIIFSFLLVNLFDSSGTLLGVTDKAGICDERGRFPKMRQALYVDSVSAV
VGSSIGTSAISTYVESGAGVSVGGRTGLTAVVVGVLFLLTIFFSPLAGLV
PAYATAGALVYVGILMASSLIKVQWEDLTEAAPAFITAAMMPFTYSITEG
IAFGFISYCVMKVGTGRWKEVNAPVWVVSVLFLIKFIWIG
>MS0117 unknown
MKRMLEKLAMWFLHRNGYIVRSKAECNLVPDFVLRMQEKQVKPIQPVDWA
EEGETK
>MS0285 unknown
MKIFSDKITHFVSVWLLKAVMFLALLISSPAIAESAVELKVEGIANEKLR
ENVQLYLATLDKEDADGSERYQNKVKENIDKALRVYGYYGSTVAFNQQPR
SNAPDLLIARVDIGKPTLIEDTDIVITGDALHDEYFKRLEKKVPAKGTVL
DHETYEDYKTELQKLAVQRGYFDADFPVHQLQVMPSTRQAWWRMDFNSGS
RYRYGEISFEHSQIREDYLRNMLEIKSGDEYLINDVSNMTNNFSSSGWFQ
SVLVRPELHEDSKTIDLHLLMYPKKKNAMEVGLGYSSDVGARAQIGWTRP
WINNRGHSLHSDLYVSSPKQTFEITYKMPLLKNPMRYYYEFSTGIENEDD
TKTDTKSLAATFAALRYWNNATGWQYSLGTKIRYDEFTQADQEHKTFLLY
PTTSVSRSRISGGLFPIRADTVSATVDLGRKLWLSDVDFFRVRANAGWIK
TFAPNHRFLTRGEIGYLHTNELERIPPALRFFAGGDRSVRGYGYKKISPR
NSKGKLIGASRLATGTVEYQYQFVPNWWLATFADAGLAANSYSTSELRYG
AGMGVRWASPVGAIKFDIATPIRDKDDSKNIQFYIGLGTEL
>MS0591 unknown
MLSYRYYLYQLLNQKCGENYRTFFYLIIMRLLIKINLH
>MS0489 unknown
MNKPIKYKPLIFLSNGVLRLLGNIIKILSYPFHAIFPKKRFTIPEFSPAF
RPSNKQSKINKTIWQTNYSNKVTLPVYCNYLVNRALSWSYEYRYVSTEAR
EEYIKANADTRVYEAYSKLTDGAAQADFWRIFTLYNEGGVYMDIDGHLVW
CLADIIDENDTEVVITRRDKYTNFFLASAKGNRFLKDTLDIIVNNIEQRK
IDGGVFTLTGPTTLNMALKGKNVNSRRDKFTCAQGTFTNEYFQYMDKKKG
KWNHTKNEDLLKK
>MS0416 unknown
MLDVGLISYFSLVDLSIAFQHSKRNKIKKCIDHKFIGKYKKIVRGRKMYD
LITMNQYDALIFDMDGTIIDTMPSHAKAWEKVGEVLGYPIKGDVMYEFGG
ATTKIIAQETMRRYGVPAELLEQVVTMKRQFGQEMVLQNATLLPTMQVLE
HFLGKKPMALGTGSHKAMVDMLLQRFDLNDYFSAVVMAEDVQKHKPDPET
FLRCAELMKVDPVRCLVFEDADFGVTAAHAGGMDVFDVRINQIMKVS
>MS0779 unknown
MAEVKNLLIGPAIHRILRLTCGYFRQQRNRL
>MS2245 unknown
MMTVKERLFHAVLFEAGAIILSVLFIWLTTGKSGMVESASMILISFIAMV
WNMIFNWIFDKFFTFPKQYRTAKLRLFHTVAFETGLLIFTIPVIAYFLAV
DWFTAFLMDVGISITIMLYGYFFNWGYDHMRATLINKR
>MS0397 unknown
MIMKKFFLFATALLLASCSAQKPNLVSTQKPILNIAANLAQSIEANAGAH
SAWVKNKSQQPIAFNYNLYWYDENGITQLFSTQQEKYQGALLLQPQQKAE
INLTKPTAESVNYRLYLFSGNN
>MS0344 unknown
MLNPASCDLFAIPYFQFAQLKKYCPELIPQIKADYKREWNEWKTCILQVS
EGLGSPFAEPHIEKWCNGWQVRAHFFAYFKYEFNKNSAAILSVLLNRRRL
QVSLDWHCYRADRSQINLSQYNQWTEDFDFRQFADFDIWRGDESEYADFR
RVKQLTSQDLSLRSDEDFWCIGKNVEKADLADIDAVDFISRTIRELLPLY
EKCHQ
>MS0115 unknown
MLILTCTGLTVANGQDISALTSKQFQTTCVGD
>MS1961 unknown
MQTTIEVNMSVKTELLFSNTWNVRISDPGEEGAHSHFFETIYITLEAYID
GDNVSYEFTRKVEDEVKIKRNFTQLDELFKFLADYLDAVSLGNLGVKIGQ
LGLVK
>MS2172 unknown
MKLKALTSALILATTLSGGIAMAKTQSATVAEMPAQTIQLTQEWDKVFPK
SDKVEHRKVTFKNRYGITLVGDLYLPKNAQGKLQAIAVSGPFGAVKEQVS
GLYAQTLAERGFVTIAFDGSYTGESAGLPRDLASPEINTEDFSAAADFLG
SLENVDREKIGVLGVCGWGGFALNAAVGDPRIKVVATSTMYDMTRVMANG
YNDSVDNDARYQMKQDLNNARWEAMSHDYANTGAPVLPSEKELNADTPKF
VADYVNFYKTKRGFHPRSVGSNGSWTTTTPIAFINMPILQRAGELRAPAL
IVHGENAHSRYFSEDAFKTLGSKDKELHIVKGASHTDLYDNQANKIPYDK
FEQFFKANLK
>MS1235 unknown
MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA
ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI
VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER
SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE
GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS
>MS2155 unknown
MLKITTFTDPMMGLSYESEPFFRKLETHFAGHIEFHTVMAGLVRNVYEFV
NPADLAISEAMAIERYLPHLAAIYNAEQSISGMPISMENLDLFSTDRTSS
IPLNLAYKTVQQLAPEKADEFLYRLRFATIVEVRPTTKLNELARVAGQVG
INEQTFLNAYHLDDVKASLTEDFQRFQQLGIRGLPAYLLEYQGKRVVVNG
VLDDRQFFTLIAQLTQNNISPQKPEISQSAVKNLIEKHKLISPIEIQYAF
GLANVNNIMPYLNPLLMNGEIKRIEVQDRGKLSSLNYSFFSLYEPYRIL
>MS1855 unknown
MQQHKQIGRICTLLIRGSKTSHAKKLRKTMNITNPNGDRKAVVIFSGGQD
STTCLLKAIADYGVENVEAVTFQYGQRHAIELEKAKWIAQDLGIKQTLID
TSVIKTITANAMMDNIKITKDEAGMPNTFVDGRNALFLLYTAIYAKGQGI
RDIITGVCETDFSGYPDCRDVFIKSMNVTLNLAMDYQFNIHTPLMYLTKA
QTWQLADELGALNYVREHTHTCYLGVEGGCGSCPSCILRENGLQQYLASK
Q
>MS2255 unknown
MEKLFDINEQGLSVRCKLFYEKDVHSIENIVLILHGFGSSKEVKSNAKFG
ERLITKYKNYGAIAFDLPCHGADARKKLSVAECLTYIQLVVNYAKEKLNA
QNLYAYATSFGGYLTLKYIAERENPFRKIALRAPAIQMFHTLTANMTDDE
RHKVAKGKEIMLGFERKMKIGKEFLDELEQGDIQQYDYLDYADDMLILHG
TADEIVDIATSQTFAENNVIELIAVEGADHPFSNPQLMDLAIGRIVEFFH
>MS1020 unknown
MAVTPMFNHQYLTESNHIVAIGGGHGLGRVMSALNFLKENLSGIVTTTDN
GGSTGRIRLHQGGIAWGDLRNCLNQIIDVPTTASAVFEYRFAGTGDLAGH
NLGNLILTALANMQIRPTEAIDLIRNFLRVRSAIIPMSDIPVDLAATLKN
GEQVIGEVEIDKLPEPPASLYLHPQVEATPEAIAALRQADIILLGPGSFL
TSIMPVLLMDEVKAELRQSQAKKIYIDNLGLELSPAANLSLAERIRWINQ
AVGKDIIDGIITKPEFAQNCGQIRAKIMARRLNAGDVSYRHDRALLCQAI
DDLVAELNK
>MS1705 unknown
MEIANNLKQIHKNIVSICQNAGLPSNSVKLLAVSKTKPVEDLEQAYQAGQ
RAFGENYVQEGVEKIEFFQAKHPDMEWHFIGPLQSNKTRLVAEYFDWMQT
VDREKIAIRLNEQRPANKSPLNVLIQINISDEESKSGIKPADMMALAEII
ENLPHLRLRGLMAIPAATHDVAIQAQSFSAMHKLFVELQQSLPNQRIDTL
SMGMTDDMTAAIKCGSTMVRIGTAIFGSRN
>MS0928 unknown
MAILGTTRRDELIFRRLCVENRLFIFYKQTLYSIFLNYLPNVDFS
>MS0037 unknown
MLLKPMFKNNHFMTALLIRQRKRPQTRTFLL
>MS2256 unknown
MKNTFRLAGETVVLEQLVQYHSYWLLFKHIFVKNYPRGIL
>MS2114 unknown
MLFISPCVITISKISNCLKSVIQFSGSFYLLKGKTMSLPYILIALIAGTA
LASQAAINSKLAQAMLGQPLVSAFISFASGTIALLLLCLWKADLSASLRE
LPNVEPWKLIGGVLGAGLVLTTILLAPKLGITNMLFFIIVGQLCAAAVID
HFGLLGMAQRSFQLSQFIGLLIIACGLGFYFFGNKIVN
>MS0099 unknown
MEQEKINAALARVQEAGYKSSLMLALAEWAEQKLRQGETLDVASLSAWAA
DPTRKKAYSFAVNRFLAEFSDSASKDK
>MS1396 unknown
MPSLKANQQYKEQLMTNIEQLIERQALKDLVDTFSNLADEKNVAAQMPLF
TEDAIVNTYIGGELVFEMAGRAQIEQVFSDYLAPFHAVYHLNGQHTVTFQ
DETNATAINYCQVALVSKQDGKEMLLSHYVRYNDTYTKIDGKWLIAKRIA
NFMISENRELGVTA
>MS2038 unknown
MRLKNRIFLTALLRLFLVKRKHNNPENSVDFSLLNC
>MS1711 unknown
MRQTIKYEFGVGLFLLIGIAALIFMGLKVANVQGFSETKSYQVFATFDNI
GGLKVRAPLKVGGVVIGRVTNISLDEQNYLPQVTIAINEEYNQIPENSSL
SIKTSGLLGEQYIALSVGFDDGETAMLKEGDKIVDTKSAVVLEDLIGQFI
YGDKDKKDDSAEPQAAE
>MS0097 unknown
MPRNGGNMTQIEIFKAGKRLDAHGTEVDITVEDLQDTVKFYNPEFHEAPL
VIGHPKLNNPAWGWVKGLSLDGDVLKADVDEVDAEFAEMVKSGKFKKVSA
AFYLPNSPNNPHKGVLSLRHVGFLGAMPPAVKGLKQVEFAEDDDFLEFSD
WGQASLFSRLREWIIGKFGIEDADKALPHFEVEWLKEDAMRDQIQKQVQS
EQVTPEPIFNEPQKPEGETGMTPEEIEALKAENEKLKAEKAKAEAAQAEA
ALAAEKAGNAEFAEGLVKQGKLAPVVKDALVRALDNLADLKAGKDPEFGE
GEEQDVLSQFKTALSQSPKIIEFGEVATSDKTKDTPPDEVEYAETDDPTR
IELDRRIRAYMKEHNVDYVTALGAVK
>MS2080 unknown
MSTILLSYGSQSRKDPATAEQMETKKLKRRGNQRVKNSRKPTALFIGIC
>MS0734 unknown
MTTPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGHANISG
YDFIVIDTGGIDGTEEGVEEKMAEQSLLAIEEADVVLFLVDARAGLTPAD
IGIAQYLRQRQNKITVVVANKTDGIDADSHCAEFYQLGLGEIAQIAASQG
RGVTQLMEDVLAPLAEKMKTDESAVENDENSEQEKDEWEHEFDFNSEEDA
ELLDEALAEENEEPENKNIKIAIVGRPNVGKSTLTNRILGEDRVVVYDLP
GTTRDSVYIPMERDGQQYTIIDTAGVRKRGKVHLSVEKFSVIKTLQAIQD
ANVVLLTVDAREGISDQDLSLLGFILNAGRSLVIVVNKWDGLSQYTKDQV
KSELDRRLDFIDFARVHFISALHGSGVGNLFDSVQEAYACATKKMTTSML
TRLLQMATDEHQPPMINGRRIKLKFAHPGGYNPPIIVIHGNKIDKLPDSY
KRYLSNYYRRSLKIVGSPIRLQFQEGSNPFAGKRNKLTPNQLRKRKRLMK
FIKKSKR
>MS0636 unknown
MNLSYANGSRGASKMSKNRPDFYDQHTSIATRTATTGKIHRRFNLWRIGG
>MS2209 unknown
MKSILAKGLILALLTTSVSAYALNRQQHDTVVGAALGGVAGAVLGNDVTS
TVAGAALGGVVGSQWNANKQRDDHYRVGDRRHHRDFDRLRYEDRRHHPKE
RYFAHHKPKHSDYREMRRHRH
>MS1264 unknown
MLKDEDINLFRESIKGAKKLAQNTFVAPKKVNVKKKSEQREIREKSDTLF
YFSDEYEPLLNEEDAVKYLREGEDTYLLKQLRRGDFSPELFLDLHGLTKE
QAKLELASLIQACLDEHVYCASIMTGYGTYTLKRQIPRWLVQHPNVRALH
RAPKEWGGDAAILVLIDS
>MS0567 unknown
MFVYNREIPLDDLGGGVQRKILAYSENIMSVEVHFEKGAIGSLHSHPHEQ
LTYVLSGSFEFTIGDETKIVNAGDVLYKQPNVMHGCVCLEKGVLLDTFTP
MRKDFIK
>MS0532 unknown
MPYTGRGRVLSTLRGIEKVRIGSYQLKNRILLAPMAGITDQPFRKLCAAY
GAGLTFSEMMSTNPQVWHTEKSRLRLAHHQAAGINAVQIAGSDPKEIAKA
AQINVDYGAEIIDINMGCPAKKVNRKMAGSALLQYPDLVRQILEHVVNAV
SVPVTLKIRTGWNKEHRNCVEIAKIAEQSGIQALTIHGRTRECLFEGNAE
YDNIKAVKRQVSIPVIANGDITSAEKAKSVLEYTGADAVMIGRGALGNPW
LFKSVESLVETGSIVFEPSLDEKCGVILQHIQSLHQFYGEEKGYRIARKH
VAWYLQGIQPSSNFKQTFNAITEPQEQLIALEEFFNSIRNG
>MS1239 unknown
MTSCGYDIFSFLWLKSGRILGYIGFLYKMAMGFEELFI
>MS1998 unknown
MSLAIVYTRASMGIQAPLVTIEVHISNGKPGFTLVGLPEKTVKEAQDRVR
SALINTQFKYPAKRITVNLAPADLPKEGGRFDLPIAIGMLAASGQIDADK
LRRFEFIGELALTGNLRGVHGVIPAILAARQAKRYAVIAAQNANEAALIS
DQESFFATSLLEVVQFLNEQNKLPSTSDLTPQSAKNGSSTITKDLTDIIG
QQHAKRALIIAAAGQHNLLFLGPPGTGKTMLASRLTDLLPEMTNQEAIET
ASVTSLVHNELNFTNWKQRPFRAPHHSASPAALVGGGCEN
>MS1112 unknown
MMSWYYIFNAMLFMYPALMAVYCIISASYYYFFIEGKLKKPKYSKMKLED
VPLVSIMVPCYNEADNLDDAIPYLLKLKYPKFELIFINDGSKDDTGKIID
RWAQKDSRIVALHQENAGKASALNHGLTVAKGKYVGCIDGDAVLDYKAVD
YMVQALESNPQFGAVTGNPRVRNRSTILGCLQTSEFSSIIGLIKRAQSVM
GTIFTVSGVCCLFRVEAMQKIGGWSTNMITEDIDVSWKLQTSGYNIVYEP
RALCWTLMPETIRGLFKQRLRWAQGGAETIIKYFPQVWRLKNRRLWPMYI
EYFLTAFWAYSLIIVLCINTYLQITEETFEISIFRPLMTVLFLTFFLQYM
FSLFLDSRYEKGLLRYSLYCIWYPYVYWLLNMVTLVFGIPKAIFRNKSKL
AVWTSPDRGV
>MS1973 unknown
MKNWINLLPWRQQLIAQKNRKFIYKISLFFTALLILEMGISLFHGNLTRQ
LSEKQQQFYRQQDEFAKLTRQVSQLRRSYEQTEEQNLISSDSVSLFLSWL
ARLPLNEGELTEFLLQQNSIHLHGYAENQQEFDSIHQYIVQTEWIYESKL
THFSTSANGLLAFSFAIEWGRNGKASMD
>MS1676 unknown
MFRGAQAINLDTKGRIAIPTRYRPELLAENQGQLICTVDIRQPCLLLYPL
KEWEIIEQKLCQLANFDPAQRSVQRVMSGYATECELDSAGRILLSAPLRQ
RAKLEKTIMLVGQLNKFEIWSETEWQAQIERDLELGLSGELATSDALKML
SL
>MS1069 unknown
MDYKDNSLKTLKLGQKTDYIANYDRTLLQPVPRALNRDGLGITKQQPFSV
GADIWTAYEISWLNIKGLPQVAIADVEIDYRSTNLIESKSFKLYLNSFNQ
TKFSDMSEVQRTISEDLSICAEGNVRVQLHSLSNYSHERIADFAGECLDE
LDIEISDYGFNAEILQNCTALSTEIVEETLVSHLLKSNCLITSQPDWGSV
QIHYQGKRIDHEKLLRYLVSFRQHNEFHEQCVERIYCDIMKYARPEKLTV
YARYTRRGGLDINPFRSNFEAIPQNLRLARQ
>MS0800 unknown
MFMLFTTIIGIALGVLDVLFGFYDHQTGQGFLSGIYSLAVLIPTIAVSAR
RLHDTDRSAWWLLLGFIPVIGILILIVFWCFDGSFTTNRFGVNPKQDFLY
EKNKRTQSDIISKS
>MS1777 hypothetical protein
MLTKEQQVFFRNEVLSNLDIEKLDEIQSEYGKLVDELVQIVYQVSNQHGH
YLTALDASGMAEIIADEITGYGPLRELMEDDTINDILVNGPDDVWIERAG
ILEKTNKQFINNEQLTDIAKRLVARVGRRIDDGSPLVDSRLPDGSRLNVV
VPPIALDGTSISIRKFSKNKKSLQELVNFGSMTLEMANFLIIAARSRVNI
IVSGGTGSGKTTLLNALSNYISHTERVITLEDTAELRLEQPHVVRLETRI
AGVERTGAITMQDLVINALRMRPERIIVGECRGAEAFQMLQAMNTGHDGS
MSTLHANSPRDATSRLESMVMMANASLPLEAIRRNIAAAVNIIVQASRLN
DGSRKITNITEIMGMESGHIVLQDIFTYQPSKYRDENGKIIGEFISHGLL
SNSVVYQNAQIFNLSNELQSIFEGLQ
>MS2258 unknown
MIALAEIGTNFNRSTNFGGDNTGFLLKNDIKKLVILVSQHDKKTEKPTAL
LSK
>MS0342 unknown
MNSPQKSSNAKFWAICTTALTAAVASTLCCIGPLIYLAFGLSSAWLMDLS
EYSYLQIPMLIISLVTFSYGFWLLNFSDKIICTKYLSRRTLQILYWIMAP
VILFFLSYPYVLPYILELLE
>MS2020 unknown
MAIFRKYGAFGIVWFSHLERIFIILIKEEDIFELIIKFLLNEIFKILTAL
LMKTAFN
>MS0092 unknown
MAFHHGSETKRVNGGSVAVSTVDGAIIGIVGTAPMGAVNELTVCLTKKDF
SQFGTILDQGFTLPDAFDILARYASGQVYVVNVLDPAKHRTTVTDEVLTQ
DSDTLVATTAKKGLISVTNVKLGGSLLTEGETYSVNLESGEITLTVAAGE
QDLTASYVYADPEKVTEDDIKGGVDSLTGKRQGFELLRDGFNLYGADAKI
LICPEYDKTASCAAALATLADQMHAKAYVQLPKGTSLSKAIQGRGSLGTI
NASASNENVRHFFPYALGSSNNLESLATHAAGLRMKVDVDEGYWFSTSNH
ELSGVIGMEIPLTARVDDIQSETNRLNAVGITTIFNSFGTGFRLWGNRSS
NYPTETHISCFEVASRTGDIIDESIRQAELQFIDKPIDDALIDSFIETID
TFLRSQKSLVGYSVGLDYDYDLVDAFSQGQIPLIYDYTPKIPGERISNKS
VMTRTYLANLVSQR
>MS1466 unknown
MERLLARPVTKIRLPVKKPILKSYLVVEKSCVV
>MS0339 unknown
MIGINRNFARIGGALGYQYGFMHKNPEHFVIDV
>MS1443 unknown
MYSSKQVKSAVKNFQILTALFSICLYKRYNRH
>MS0558 unknown
MAQINIEITYAFPEHYYLKKFTLDEGTTVQSAILQSGILQQFTDIDLREN
KIGIFSRPVKLTDSLNDGDRIEIYRPLLADPKEIRRKRAAQQAKDQEEKK
KAEKSANKEN
>MS1071 unknown
MTIHLQTKKHLQQLQFAMQSLDLWQTVPPAEEAFLSTEPFAIDRMTATEW
LQWIFIPRMYALLESGTELPAQIAISPYIEEALKETDNLSLLLSPIIEIE
QLLQKS
>MS1971 unknown
MRIKILFKLFFIIVSVIRLLTFQSWAESDPFDKTKRNFSQNTDMLVEKTN
QCHQSAAVWAENTEFKQLKIVGVLQYEQERKVFLMDAERHIFTAGQGDFL
AKERMQLQAINTREVDFMVWNNPQDCGQGELMKIKF
>MS1092 unknown
MALGWYELKLAKDGQFMFNLKAANSQVILTSELYRSRAAAENGIASVQKN
GGDEKNFEFRENKNGEPYFILKAQNHQEIGRSEYYSSKAAAQNGVNSVMN
NAATTVIKDITKS
>MS2267 unknown
MKMNKKFAQIVKNPAFRNMVLKTIFNVTNVMSATKYLR
>MS0838 unknown
MALILNILNFVLGGFATTLGWLIATIISAVLVVTLPYSRGCWEITKMSLM
PFGNDIIHVKYLEPRSSLSNSLGSVLNVLWFVLFGWWLCLLHILTGITQC
LTLIGIPTGLAHFKLARISLFPVGQRVVPKEMAQLVYRHQAEREFENQRN
A
>MS0897 unknown
MNKSKLALIIIVALGAVWTGGAWFTGKTAEAEYKTQVENLNKQLTSAKSA
VAFSVKIENVRFDRGLFSSEMSYDILIESTQDKTQTWRLPFAGTLYHGPL
PLNLVRQFDFSPAVFSSHSQLIKNELTTPWFDYTKEQNPITADISLNYMQ
EFDAALNLAAGDIKLEDIAVNWSNAFIKYAATPKKQGEFNYRYDAAKLTL
TDKALADIKLADKAETQSDLSAIDIELQNLDGLMQIIPATNDITTGLYKG
KIENLTYTYHFADNKKTANTIYFNNFNYDYAANENEGMLNYDIHNRADSL
KINDKNLGGVQLDLQANHLPTNLVAQLIHGIKQQADEKEINEILLKILEN
QPHFRISPLALQNTAGKFTGNFNIELAHADFANALKKGNVLSLFKQFSLN
IDTDKPALVEFLSTLQQLSGVAKEKADGYAQQQVDRIANTLKKQNVIVEE
GGSAKFNLAIVNDKLMLNGNEIPEEYIGLMLFGLMMQSK
>MS1430 unknown
MNPTVVVSNIKTLLLAVLSGRIFAPVMQHDCQPYLDSGELEIVFPNLESQ
MWGIYLYRPYQTITPKRVLVVFEILERLLKMHSEQ
>MS1732 unknown
MMALSQEKRLIEAPVNGGRNYNGPKVAKFLVG
>MS2169 unknown
MEIEHKMDKCHQVHYLAYPYYLFEYYKENASE
>MS1215 unknown
MDRFSRMWRKLQKSAVCFLLIFVTVNVWAADFPGSPNPFRYVNDYTNTLS
ENDKNYLENKLINFSRETSSQIAVVMVKTTGEYAISDYAFTLGDNWGIGR
KQLNNGVLLLVAKEDRKVFIATGQGLEGALPDAFLSQIIRRVILPNFRQE
QYASGINGALDYIIAASKGEYDAAAEQNDEGFEQYIPFLMVLVFVLFVLF
GELNGRRKPYISPTTNHQLEQVILQSARRRRGNSGGFGSGGFGGFGGGGS
SGGGFGGGGFGGGGAGGSW
>MS0118 unknown
MGGRRRNKMSEKINSTQRALRILKALKGRTLTGLSNKELADRLNESPVNI
TRSLQALIAEGLVVKLEETGRFALSIQMLQIAVTHQRDTEKMQARMAEMD
QRVNAGAF
>MS2181 unknown
MNEAIFLTKRALMLKKLSFIAMLAMLSACSLSSYVPFMGNDKPVINLDKD
KIDQKSYAVAYASTVQSYNGRITEDYDVNSFASGANDWYLGRILVPTEQI
RARLGSGLDSKLHAYYSGVIFAADLQTNFSRLSATCWSKVDTQSMTQGIY
DAVIDLRKGKVRGENDEYITKGSEELLNLCK
>MS2129 unknown
MKFKLKALTATLFLGSSLLGANAMAQLPQNATAIEVPAQSIQLTQEWDKI
FPKSDKVEHRKVTFKNRYGITLVGDLYVPKGATGKLPAIAVSGPFGAVKE
QSSGLYAQHLAERGFVTVAFDGSFTGESSGLPRNTASPEINTDDFVSAVD
FLGSLDNVDREKIGVLGICGWGGFALNSAISDPRIKAVATSTMYDMTQVM
ADGYEIKMEPNPKVPYERTSPMTTEARYKMKQDLANARWEAAANGYSLNG
KAEDHLTPQDKITAETPRFVREYSNFYKTKRGFHPRSVNSTTGWNTAMTP
SFINMPILQRAGELKAPALVVHGEFAHSRYFGEDAYKALGSKNKELYIVP
GANHTDLYDDVNGKIPYDKFEQFFKANLK
>MS1041 unknown
MFMKSSYPFSNTWEKLLIGFFCTPIILGILLFINEVTGFQLVCISLIGTI
CLWGVFITVKILQINTQQSHQCRFKEF
>MS0427 unknown
MKKSLSLLAVLAFSFGLIACDGDNVRSEMQMMGRYNSELISAASAEEFHK
ASENLQKFSLEAMNKRPSTVKSDEEFKAYQQGMQHFIDVVQQADQLAQQG
KFEEAKDLTKQLLEMKNQYHAEFKNK
>MS0217 unknown
MSNTSAASFNSALICSKYIQFSFYEQVKNAENTAPDKCFAIL
>MS0076 unknown
MKKNLVALAVVAMAAQAHNPKSKQASKQASKQASKQASKQAR
>MS0090 unknown
MSVSKPLNLFKISPTAVGVNSIVKLNNNPQWSFFMSQKLDDLLVFRTIQL
DYPIKDGEGNTVTELKMRRAKAKDMRRMSAQKTEAEQEIFMFAQLVGLVP
EDIDELDIADYGKLQKAFTEMVQGKSA
>MS0621 unknown
MCQMLAMNCNTPTDIVFSFEGFRRRAGMTDSHSDGFGIAFFEGKGVRVFR
DDQPGAVSPIADCVKQYHIKSLNVIAHIRKATQGVVNIENTHPFIREIWG
ENWVFAHNGNLNALPDLSSCYCTPIGDTDSEAAFCYIAAKLKERFCRKPT
ENEIFDTIKELAAELAQHGTFNFILSNGQWMIAHCSTNLHYLTRQAPFGV
AQRIDDDGIIDFSNYAKDTDKVTIITTFPLTKDEIWAKMEHGGMVMFKDG
VKIREAIGTPKEAVDDGTLGCTKIAA
>MS1868 unknown
MQKVKLPLTVDPVKDAQRRLDYVGYYAADQLVRLNESVVKVLSDAQVTLS
FFIDPQKLVVMKGQAQVEVELECQRCGQTFNQTLECTFCYSPVANLSKID
ELPEIYEPIEFNEFGEIDLIGTIEDEFILNLPIVPMHSSEHCEVSAQEQV
FGELPEELAKKPNPFAVLANLKQK
>MS0126 unknown
MKVKCSACGAVYSLDALIANQSASQALNAALMVSGELGEALIRYLGLFRP
AKTSLTFDRVATLLNELTPMIQAGKITRDGREFPAPTEAWIYAK
>MS1438 unknown
MKNLKLSIATIAVASLLSACTSQYATEKHEQLKLQNQAALGIVWMQQSGE
YQALAHQAFNTAKTAFDQAKKTKGKKKAVVVDLDETMMDNSAYAGWQVKN
GEDFTQETWTKWVNARQTAAIPGAVEFANYVNNHGGTMFYVSNRLENGER
QGTIDDMARLGFPGVSEKTLILKDGKSAKSARYKTITDQGYDIVVYVGDN
LNDFGDATYRKPNAERRDFVAQNAKQFGTKYIVLPNPNYGDWEGGLDSNY
YKGDVKNKVDIRLNSIKAWDGK
>MS1451 unknown
MSNLSFDFVENDFKPLAARMRPTTLEQYCGQQHLLGNGKPLRKAIEAGHA
HSMIFWGPPGTGKTTLAEIIAHKINAEVERISAVTSGIKEIREAIERAKQ
NRLADRRTILFVDEVHRFNKSQQDAFLPHIEDGTIIFIGATTENPSFELN
SALLSRARVYILKSLTNQDILHVLEQALADKERGLGNENLDLEEGILELL
ADYVHGDARLALNCLELMVDMADESEKGKKIDRTLLTEVLGERQARFDKQ
GDRFYDLISAVHKSIRGSAPDAALYWYARIITAGGDPLYVARRLLAIASE
DVGNADPRAMQVAIAAWDCFTRVGAAEGERAIAQAIVYLAVAPKSNAVYN
AFNQAKQLAKESADFDVPVHLRNAPTKLMKNLGYGAEYRYAHHEPNAYAA
GENYFPEELKDTVLYEPTNRGMEIKIQEKLAWLRELDKQSSVKRYK
>MS0936 unknown
MLDKIIGAVISNALGGNSTNNSSNSSLISNVLGSLLQSQGGMEGIFNKLQ
QGGLDNLLESWIGTGRNQPMQANQVSEVFGEDTISSVARQAGVPASQAQD
ILSQALPQIIDMLTPNGREGGVRTDSLTQATQQVQQDNGFGLDDLIGGVL
GSVLGGGQQQSAQPEQQRSQGGLEDLLSQMLNTQTRSSRTPTASTNDELA
QDIGSILDGFFKQR
>MS2177 unknown
MDFNAILNQVLSAAQETVKKTASGNSTTDKVAKIGGGAAAIGVLSMIFGR
TGGAGLAKLGSLAALGSLAYQAYQDYQHKQSQVVPVTEMEFTQSVQQSAE
LSKVILQAMIAAAAADGAISDREQQAILSQAGDDAEVQQWIRQEMYQPAT
VREIAQQVGDNQALASQVYLAARMVCADLARKEIVFLANLAQALGLDEAL
VEQLEKQAGF
>MS0940 unknown
MMHKMMPILFGEIKNEKTDNDSVIVGFTCTSQRTRFDVIRKL
>MS1426 unknown
MMTRRTFLTASGLMASGLFLPKICKSETLLQRRRPMKIIAVEEHVLDADL
GKASMPAALAQAPYLPDWGKTVQDGYNLDRSRPQIEQNALINPKGFDMGE
GRLKEMDLAGIDMQVLSYGGFPQFALKEQSAALNRAANDRLAEAVAKHPD
RFAGFATLPWGQPQEAVKELKRAVNELGLKGALLNGRPSEHFIDHSDYEP
LLAAFHELNVPLYLHPGVPVQAVQQAYYGGFSPEI
>MS0083 unknown
MQTHNFGATYQEGIVTEVDAAKHKVRCKIPALEDLETAWLPFLTPNAGGN
QFYCLPDKDELVALLLDARGEGGCVLGAIYNDQDPTPVANAEIWCHKFKN
GTEISHNRKTGDVVVNTKGHVTVTAGAGATINADTVVNGKLHATGKITSG
EEVSAPKVKQGTVELGTHTHGSSPQPNK
>MS0665 unknown
MNKYLKSDFIFSLFLSIAIMFICLYFEKSFFFVDDAQNEFLPFTRQIGNV
WLNGEIPFILKNTFIGSNTMIDIHRAIFLPQNIFLSILSVKITSLKIISI
IAAFINLFVMSFSALKLSEAFSLTKAAGIVLAFLFCINPIFLYFYLESWW
NAAAGQAWFVASLASVAWLMRAFSIKRLLLNVITVLSIFASGWPHSVLVY
GFLALIFSIFLYLNKRHNDLILFVLISFSIILIAIPLYSEYVISGDLINR
QSFKFNNVGNFLSTTLNQLLLTFNVTYYHFMHRYGGYSITHIPMGYSSIY
ILLLICFGSLKNIARNPNSLFLLVLCTVFFILTQTPTEIGPFRYPFRFTP
YFSEVLTMLSIFSLEKLGIVKTRARVFLVVLLLSISLLLSIFSLEENFGK
YAILQFLFFAVTTWYVVRYNSISLKSGLPYTAFIFLLMLLAKDSVIGYLS
FPDLKNSINMENNYSQGGYILSLTNGKRPKNNLEDLNSTHFMLYGLKSIN
GASPVGNKYISKTISTRSSQAFFNAKETILGLSKTYKDKCYFDLFGIDTV
ILNKKDNSSLISQKLSDCGFSERKVKSHDVIYFLRNDFNAKGSVSYHSDT
LSINQQISLKNNSEFYQLSGLKGDELIFNRVYWYGYRAYINDKEIPLLNY
DGLLRIILDHDYQNGVLRLEYFPKSWKYALLIALSGFLLLLFSVGYMQRM
RKWVSLN
>MS0773 unknown
MKKAGLTLALLLTGCGILGPSYSGETTAGALLKSDTERNINIFFRAIHQC
SPEKIHTQINSAKPATQNSVEQAQETWTVTGCGKTEVFNIQYVGDGVGGT
YIRMSKKN
>MS2270 unknown
MKKIVLLLAVITTLTACSSADTPTPRDENQLADGIMIPVEGTGAIAGGSF
MPEIEQQSMPDSMK
>MS1464 unknown
MNNLALEKLISQKLNSATIADYAPNGLQVEGRPEIRKIVTGVTASQALIE
AAIARHADAVIVHHGYFWKNEDPCIRGMKGKRIKTLLINDINLYGYHLPL
DVHAELGNNAQLAKLLKIENLQPLESDSVSIPVFGELAQPISLEDFARRI
EKSLQRKPIVCNAEELTQNPPHLIRKIGICTGGGQGYIDLAAARGCDTFI
SGEISEQSTHSARELGIHYFACGHHATERYGIRALGEWLAQEYQFEVEFI
DIDNPA
>MS0269 unknown
MKTNLKLGLLIGALGLFSTGAMAAHLPDEIYQPRGAKVIKADRQGKGEFE
VEFRLDAREHRIPVLAEKAISHARYHGFRLVESEIEHDDADLKFKRGDQE
MDIEIELKDHHRIEYKAELDLDKN
>MS1268 unknown
MKILITGATGLVGKALTRQLLKQSHQITALTRAVNTAQKLFPEVDWVSSL
STYKNLDQFDAVVNLAGEPIFDKKWTDEQKLRLKNSRILLTQQLTQLINR
GKRPPVFISGSASGFYGNAGSQLLTESALPATSFTAELCQAWEAAAQQAD
TRVCVIRTGMVMSPRGGALARMLPLYRFGLAGKLGSGQQFMPWIALKDMV
RGIIFLINNPNAVGAFNFSSPNPVTNKEFNRLLGSRLKRPHFFSVPACIL
RLFLGERACLLLDSQNVYPKKLLDLGYTFQFEHLETYFSKTLKQKRKK
>MS1884 unknown
MSHILAVKVAQVENLSLSDGSTIETAIRKKAVDKVRVHQLGAEGNDVGDK
KHHGGVDKALFFMAQKSLEKLTALLKLDYDYLQDSRFGENFVVSESDENS
VCIGDQYRIGSALVEVCQPRKPCNTLSKNTEVPETRKTVVETGLVGWYVR
VLEDGVIAQGDKLELVKRPYPEMTVALVHGLLSQPAKNLDKTVLDKAIAC
APLAEGYKKTLYKQAEKLAQQSSESAFFNTPEF
>MS1331 unknown
MNKALLPVLVSSIFMLSACNEEKNIELAAQLQHYQQQVDQLKTELENANN
KLTQTQNELTAQQQAFPALKTTEEKIFTRNEEISFTENRPTGSGIINYYI
DTVKTSIPWLDKLLISQAIDILNQDAEPKDKLTINDSDSDQQKAVLTEKL
ENNYQRDLDILTANKLPGIDYIIETSYLGQRENLVSFSLFRHAYYGGERS
SFYTRYLNIDSETQSIIRLSDVIPPVKQKELKELLWNSYANALGNNKPYI
KKQNFYIAKDFYFTPDGMNFVYSPSSIAPFSAGEITLQLYWNEINTLING
QYIWHDIK
>MS1638 unknown
MKTYNKRILLSVTGMSPAVVTETLYALVTEKNFIPTEIQVITTIQGKNKL
LSALLGIEGGRKERKGALAEFIEDYGSQYGFSAIHFDESCIHIIEDTSGE
KLPDIRTPQENEFAADNIVKLVGSLCQGEESQLHVSIAGGRKTMGFFMGY
ALSLYGRKQDSLSHVLVDEQFETLPNFYYRKPYSHIIINRDGVELDASKA
NVMLAEIPWVRLGLGVPEGLKHQAISYSESVKNAQALLSQQSITFLAPLE
DRLVKFGSKVIKLAPRGYALLLGLVVAKDAGWQFGIREEKHTIDTYLKIY
SQIKEDEEMQKRLAGMDNDLKDVLSESRTDIRKKITENFSLGKGAESDYI
PSSSRKTGNYELNIDLDNIDISAIQNELARLKIL
>MS0496 unknown
MCLTANYLSGRVKNFRIFDRTFNRLKFSASIKALN
>MS1916 unknown
MTEKINLMNLTRQQMREFFKELGEKPFRADQLVKWIYHFGEDNFDNMTNI
NKKLRDKLKAVAEIKAPEIAVEQRSADGTIKWAMQVGDQQIETVYIPEAD
RATLCVSSQVGCALACTFCSTAQQGFNRNLTVSEIIGQVWRASKVIGEFG
VTGIRPITNVVMMGMGEPLLNVANVVPAMELMLDDFAYGLSKRRVTLSTS
GVVPALDNLSGMIDVALAISLHAPNDELRDEIVPINKKYNIKMLIDSVNR
YLSVSNANHGKVTIEYVMLDHVNDSIEHAHQLAEVLKNTPCKINLIPWNP
FPQAPYGKSSNTRVDKFQKTLMEYGFTVIVRKTRGDDIDAACGQLAGDVI
DRTKRTAAKRQFGQNIDVQLQ
>MS0260 unknown
MLGVLNMTRQNIFIILAFSNEINKMELQDKLLIAMPNLQDSYFSQSVIYI
CEHNEQGAMGLVLNQVTDLSIAELVAKLNFMMADGRHYPETYVFAGGPVS
MDRGFILHTATERTFEHSYRVTDNLQLTTSEDVIETFGTPEAPEKYLVAL
GCATWTSGQLEKEIADNDWLVVPANNHILFDVPWAECWTAAQQLLGFQPA
NLVAEAGYC
>MS1100 unknown
MATLGATRRDELIFPIFNDLKKCKNLPHLIRFQRP
>MS0979 unknown
MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ
GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK
SLRR
>MS2159 unknown
MKKILVLTGSPHPNGASSRLADEFVKGAKEAGNDVFRFDAGLQPLGELHF
LQLDASERTIADNDIVSREVLPKLIEADVVVFVSSLYYFGMNAQLKAVID
RFYSINHELKDDKQSAVIMAGYGEGDDLKPMKDHFNIIQKYMRWQNIGTI
VAEDSWNAAKLAKHLQEAYALGKSISA
>MS0127 unknown
MRNKLLQLVHIGKTQLGMDDETYRSLLSQQFYQNSAKNISYSGLIKLVKL
LQSKGAKIQLPKSKSTLSPLQRKVWAVWKSTADNPTSQALNAYVARIGVD
EPWNMMNNSQASFVLETLKKWQERKGN
>MS1637 unknown
MEARRRLGYDGNRVIHEYANKNLYRWRINARSLSYCL
>MS0938 unknown
MAKTRGSARDSIKIYTEKCGENSLIFYRTLDEKI
>MS1911 unknown
MSITVNQIVLHQIIKPASANIPANNNNETENGETATQNTQLETVLRQELL
PITAEAEQFMLELHQAYQNKTKGYGVFQEQSRFAQSLNRLLERETDFLPF
SYEAAKLLSSELAKYAFAESGTFVLCRYNFLATDYLFIALLDSKASVLVD
EKLEIHRTQYLNINQFDIAARINLTDLRVNANSNRYLTFIKGRVGRKIGD
FFMDFLGADEGLNPQVQNQCLLQAVSDYCQKGELSAEQSQAVKKQVFDYC
KGQINAGDEIELTELSETIPTLNQQPFADFAAEQDYGLENNIPPVRSALK
SLTKFSGSGKGVTISFDAELLDKRIYWDDMQDTLTIHGLPANLKDQLQRL
LKNHN
>MS0167 unknown
MMKKVITYLKTLMIEKTSLDQLKAFYLRFKFTAPI
>MS1641 unknown
MGSGYIVAKMQPLDWENIGLQICNLSEKFDRLLKSNFYCLREQNDHLVYF
KTRRRN
>MS0943 unknown
MAFSSLFTLLDDIASVLDDVSVMTKVAAKKTVGLISDDLALNANQVSGKD
IGAERELPVVFSVARGSLINKVILIPLALLLSVYFPSAINILLMAGGAYL
CFEGVEKLLHKFIRQEQHEEIKDSGDEKDKIKGAVRTDFILSAEIIIIAL
GSIQQADISTKILTLSAVGLGITVFVYGLVGLIVKLDDIGLWLLRKNGKF
SQKSGEFLLFIMPWFMRSLSVIGTIAMFLVGGGIFVHYLPEIHHFVEQFR
IYHQLAWLVEGLTGMIIGAIACAVILPLLKLFSRKAH
>MS1187 unknown
MNFPVSLYIALRYWRAKSADRFARLVTNLASSGIVLGVMALIIVLSVMNG
LEKHQKQQVLSGIPHAVLMPQEGYLDLQAAQPSMPDFVRQAVPINSTNVI
LQTAQGVSAGHIIGVQKPSDDLILDYLTQQQLSELLPAGEFKILIGNRLA
DKLRLNIGDKVRLMITENSQYTPFGRVPTQRLFTVSDIYFSDNSEVSGYE
IFANLSDIGRLMRIRPEQVQGYRLFLDDPFQITALPQFFSADKWKLEDWR
SQKGEFFQAVRMEKNMMGLLISLIIIVAISNIITSLSLMVVDKQGEIAIL
QTQGLNKRQVRRIFILQGFLVGLVGTIIGTILGVLITLNLADIIELFGQR
GIFLPTSLELGQIIVIVAFSLLLSLLSTIYPAYRAAKVEPAEALRYE
>MS0872 unknown
MGRLKRFLLIFVLAFFAAGAYFFYTVQIFEQPKISFNSAHPSSLTPQNQY
CFAVNSPLQIIRQNRFKFVVWNIHKGLDEGWQQSLQQFAQEADFLLLQEV
ASTQQLAQEIPQFSTALYVTSFSYLGRESGVSILAKTMPQRICGGAEKEP
WILIPKVGNAMTFPLQNGQSLLVVNLHLVNFEFHPTSYRNQLENMMRLVA
KHQGPIILAGDFNSWNQPRLNLVRRFAKQYQLNEVNYHPDERLRFLTNPL
DHVFVRGLNVITSTTVKTSSSDHNPIFVEVALDKPNSK
>MS2232 unknown
MQQIQISDAAQGHFRKLLDQQEEGTNIRIFVVNPGTPNAECGVSYCPPNA
VEATDTEMKYATFSAFVDEVSLPFLEDAEIDYVTEELGTQLTLKAPNAKM
RKVADDAPLIERVDYVIQTQINPQLASHGGRITLVEITDEGYAILQFGGG
CNGCSMVDVTLKDGVEKQLVELFAGELKGAKDITEHQRGEHSYY
>MS0356 unknown
MLKNGGGTTKNPEFTYILHRKSAVEILKNYAKIYRTFIFTIKVKFG
>MS1589 unknown
MNRRDHLLQELGITQWQLRRPDVLKGAINIAVEEHIRLLVIAECTLSARD
FFIQDVLRSAEIKLQDCLFLTFSQAAHLTVQHPVNYWLLSDEQGIIEQTL
TFCTLQNSLWQTPDLPRLKLDRRAKQALWKQIQTSL
>MS0085 unknown
MTTQTLLKHIVKQGERWDNLSYQYYGDALEYGRIIDANPHISFCEVLPTG
VTIYIPVLNVKPTSNENMPPWLRGTNE
>MS2210 unknown
MTSKIIKAVVIGALATSVSACGLHGQQRDTATGAVIGGVAGNIIGGNTVS
TVAGAALGGVVGSQWNKHR
>MS2135 unknown
MLQLRKSNERGHANHGWLDSYHTFSFADYFDRNHMHFSDLRVINEDFIQP
TMGFGTHPHKDMEILTYVLQGAIAHKDSMGNVKTFTAGEFQIMSAGTGIY
HSEFNPSESELLHLLQIWIMPNELGVSPRYDQKQFADKEGATLILSPDAE
GESFKVYQDMKLWRHQYKAHQKVELGLNSRRNYWLQVVKGNLTVNDIALA
TSDALGISAEELATIETSDEVEFLLFDLR
>MS1645 unknown
MKIKLYFPDESIATIKRMGLRQMSPETLRWTADHPSSSYGMGALLRGKSG
EILDGKSFAAMVHAFGAWIETDSEDTSRRVHNALVTAATGTEESVKVAKE
>MS2122 unknown
MSRALNFVMISPHFPTNFETFAVRMREKGINTLGIADTPYEQLSETLRNN
LTEYYRVDNMEDYEQVYRAVGYFAHKYGRIDRVESHNEYWLELDAKLRTD
FNVFGYKNDDMLAIKTKAQMKEVFRKSGLKVAKGRVFKDDEDARKLAKQL
KFPVIVKPNSGVGASDTYKIKSAVELEDFFGYKNPNVEYIMEEFIDGDIV
TFDGLTDHDGKIVFYSSLEYSEAVLDTVEKDGDMFYYVPREISPKLVKLG
EQCVEAFNVRERFFHFEFFRVKKSGELLPLEINCRPPGGLTIDMWNYAND
FDVFREYANVVTENKFYSDITHPWNVVYISRKANQNYVNSIDDVCQKFGD
NIISVQTVPGVFAKVMGEHGILVRTKTIEQMREIVQFAQAKQ
>MS0873 unknown
MKLLISNQHGAIVMALMPFFYGMLLSQPVWAHIFLLLAWFSLYLLSYPFL
NLFKGRNLAQYKTWVWIYACAVIIFVIPALIYNWKILYFALTIALLSSVS
VYFVKQKNERAFLNNLNGIVIFAVAGMGAYYFADSVWDYKIWQVACYPSL
FFIGTTLYVKSMMRERKNPLYLKLSIIFHIGCILVFLFVQQYILTLAFII
PLVRAIYLPAKKLSVKQIGLIEMAVSLLFFVILLWATI
>MS1709 unknown
MQAEKHLKWAAEQNDDRIAFRLDGELSRDTLLPLWNEFQKREQRSSFLSE
RQIADKNISWDLSQVSRIDSAGFALLCDLLHYCQAKKNADKTLLLENVPP
QLLTLADLVGLADWIKPYLK
>MS1353 unknown
MAILGTARKDELTFRQLCVENRLFRMTKRHVFRLDLTKNKLRRSRL
>MS0476 unknown
MEKNAFGILQPKLDVRNVLPLNQLDIIFTPLVAFDKSANRLGMGGGFYDR
TLQNWQNKSFLPVGLAHQCQQVEKLPVESWDIPLYDILSA
>MS0350 unknown
MIIILSLVMQKKFALDHVLQHFLWVGELYGLCYDRQKLMKKLLNKKKDDM
SRNIQELKNIVAKLRDPDGGCPLGSETIL
>MS0504 unknown
MTALFVCYAHKVKQTNFKEKSISNSRCFLND
>MS0122 unknown
MAKKAVRIKAETHEINLQTQDDVALAIKEIGDLEREQVRLSTLQADEKAA
IDEKYTAELTALKDKVKPLQKAVQAYCESRRNELTNGGKQKTAYFTTGEV
QWRAKPPAVIARGIDVILESLRNSGLFRFIRTKEELNKEAMLAEPDIARS
IDGVTIREGVEEFVIKPNDEEVRT
>MS1697 unknown
MKKTLLAIIAALAMVSAAQANVYVEGNAGYSKIKSGEVSDHRFSPNVALG
YDTGDMRYAIDYTHYGKSTDGNSEVKAHGFGVSAIYDIEVGSPVKPYIGA
RLSANDIDAKEEKRSGGSRIIKETDSYKLGYGALAGVQYQVAKDVSLNGG
VEYNRLGKANGHNINQYGAKVGVRYDF
>MS0279 unknown
MKKSFVKTLLATSMLFSETAMAAYPEKPITIIVPWGAGGNTDTIARLVAK
GLQEELKTNVNVINRTGGSGVVGHNAIKTAKADGYTLGVVTVEIALMKHQ
KMADLSYKDYTPIARLGVVPGGVQVAKDAPYKDINELLAAVKANPGKLKA
SGSGLNSIWHLNLLGILKSAGLPEDSVKFVPSQGASAALQELVSGGIDFT
TSSPGEAQSMTDAGMVRHLAITTPTKSELYANIPVFQEATSYKWTLNGWN
VLTAPQGLPDDIKLVLEKAMEKVYATGELQKFANKQGFEASALYGSELEK
FMADEDQKFGDLLSTK
>MS0761 unknown
MKKSKMNDKIIFNQSENPKDSKEPTDFVAKQEFIDITDADVRLDPEDLTG
EFSLGQEGELLTENLTESLTPKPRWWKKLLILTAVLFFGATVAQSVQWLI
DTWQQQQWIYFVFALVSLFVVILGFSALFREWRRLAILRRHIDLQRQSET
LLQKSAVNFEQDLPAQDSESGKQLCLKIAESMNLEPQYPALNQWQKQINE
SYSAQEVAYLFSQTLLKPIDAKAIKLVTKSAVEAGTIVAISPLALVDMFF
IAWRNIRLVNRIARLYGIELGYASRLRLIRLVLLNVAFAGATELAQEIGM
DWLSQDIAAKLSARAAQGIGVGLLTARLGIKAMEFCRPLVFSKQEKPKLT
AIHRELLSTLKSTVFTSSKIKDKEKM
>MS0096 unknown
MAGQKVATRLTDPVLTQYALGYHNNEFVGELLLPIADVPKEGARLPKFGK
EAFVTENDERELHAASNKITPAKVTTEDIALGEKDLAYPIDYREGKEADF
DYEQFAVDLVMEKMALNRELRIKALVTNEAAYGAKNKIVLSGTSQFSHAD
SQLFKVFDDAFEAVRMASGKSVNRIVISSNVWTAIRNHKEVLDILKQRGL
KSLSPSLFAELIKGEGQDDLQIAIGRASYTTQLDQDTQPVWENDIVMAYV
PQKAADGKHKMYKPSFGYTFRRQGAFVVDKYDEVGGKVYNARATDINKEY
LLMTDAGYLIKSAV
>MS0541 unknown
MTKFISLLERILAVFCVVLCIALVISVVWQVFSRYVLNAPSTVTDELARF
LFIWVGLVGAAYGLGKKKHLAIDLLLMKLEASPKKYAFLQLIINLISIFF
ITVIMCYGGMKLVLDTIAAGQISPVLGIQMGLVYLALPVSGFFMLIFSAR
DLFAELRQLSAQN
>MS0254 unknown
MTKLIHLTQYKLIELTGVDSEKFLQGQLTCDVTKLKTGDSTLTAHCDPKG
KVSSVFRLIRVAQEQFYLLFRTDLLPAGLDQLKKYAVFSKVAFAEPEVQL
AGVIGENCGQFSASFVVNSGNAAILINPAERLEFNASAEAWDCVEIQRGY
PILSAKTQNEFIPQALNLQCIEQAVSFQKGCYIGQETVARAKYRGTNKRA
MFIFKARSQIIPEIGGEIEMRLENGWRKTGVILSAVNFGEVLWLQVVLNN
RLEDGQQFRLPADETALELYPLPYELV
>MS0579 unknown
MRSKIRKFLPHFYCNQALPDITTGAISAPIYFAFSFSSPTKISQRQSKPN
YIQSIG
>MS0921 unknown
MTTIYYILIAIAVLALIFGIILGFASVKLKVEADPIVDKIDAILPQSQCG
QCGYPGCRPYAEAIANGDIITKCVPGGQPTVIKIAELLGVDAPDAEFTED
NTPKVAFIHEDMCIGCTKCIQACPVDAIIGTNKSLHTVIPDLCTGCELCV
APCPTDCIKMIKVEKNIDNWDWKVNPDLVIPVMNTTDGEKKLVVGK
>MS0988 unknown
MANRIRLHIWGDYACFTRPEMKVERVSYDVITPSAARGILSAIHWKPAIN
WVIDKIYVLKPIRFESVRRNELGAKISESKVSGAMKRKSVADLYTVIEDD
RQQRAATVLKDVAYVIEAHAVMTSKAGVDENTTKHIEMFKRRALKGQCFQ
QPCMGVREFPAHFALIDDNDPLPLSQLSESEFNRDLGWMLHDIDFEHGNT
PHFFRAELKNGVIDVPPFYAEEVKR
>MS1608 unknown
MVNLVIVSHSKKLADGVAELAGQMVTGGCKIAVAAGIDDEENPIGTDAVK
IMSAIEEVFSADGVVILVDLGSAILSAETALDLLDPEIAEKVAISYAPLV
EGALAAAVSASTGDDLQTVLAEAKAAGDLKLQQENK
>MS1114 unknown
MNKVSLLTLLIGGALAVQYANGSPIDERRENIIKYSRLGDGQLVEGTKQL
IDLYNKTKDKKVRDDLITLLVRQNRDAEALSISETYKLTDFSSNELEYLA
RAARNERQFSKSLAFYNQLNNLDTKNPNGLLGLALVSTDMAKFEQSKLYL
SRYKHRFGTDEQYNQANAYFLDSSEPLITRFHRWNSELDTNPNDIELVKK
LYRLAAQLNISPVQEQLIAKYPEVFTDNDKSWLLHDQAVRISKNSPNKQQ
LNTAYSMLDKVYIKVPEDNSLKQQSLQDMVVVGSKLKNDDSNRAKNSYEL
LTESNQPIPNYVKEAYADYLVASGSPFAALSLYKEVEQSHLAEGGEVPFT
LGIKIVQALNDAAKYPEARDYLENNIGEPSLMVLDFTRSRKIENPDYGNY
FSTKVSSLVAQGDLSSAMQLIDERLSVTPGDGWIMLTKAELEAARARTDD
AADWVHKAQAFLPEDTAWAEVAQANLALSVNDWRTASRLVNTWTTEEKDN
ANWFMEQYDQAKSARLVASGGISHRTSPAGENESNQEYYLYSPKTDDGHD
VYIHYLTTKSPDDGLPFEQQRVGAGVEANFYPFMVNAEAGKGIKLNDKAY
FAATIQYQLNQHWQFSLNGGLNSANTPIKAIYQDTYAKDLGFSVNYKYSD
RFEAGAGITAMKFDDENLRKNLSFWSNFNLFKHNRWNLNGSLYGSYERNK
AIPGAYYYNPLKSRSLEDNFDLSYYQPFDHSITLTHHFKAGGGYYWQDSF
ASSKTWSVAYGQEWRLGKKLNISYDVGRKRSIYDGSPEFNNFINLTLSVS
F
>MS1974 unknown
MKFVKNRAKEVNVGIYGIVRDGKQQLDVVWLDERDKVYQEKQLLPAAYSQ
YEMINLIHKSLGYERLNAKFISVIPPHHIWSRSLFLPTILTHQECDQQCA
YTLQNELPVPLDSVFYDYSATEVAEGTYLKIYAVMQKVAQEQVAGCAPYH
INILDNAAFAVKRAFNFVMPEDFPEDTLFLYRDESISLAIQTKTEIEQRI
LQLNQTGLSDLYTVFCRRYNEQPAHCYAYSNIERRDSPHWRLVETPYPFM
ALGAALWSAEERKKEESEKTAESLH
>MS1787 unknown
MNIQKRYLQAEKEARWSLGLTILYVIGWCVCAYLPKGSAGPLGFPLWFEL
SCIYLPILFVVIGYWMIKIVYQDIDLDHSGSSGKDKSAGENS
>MS0194 unknown
MTNGKRHNSPTYVSNKNVNFCKKGGYFSDLVYKNKSASWKNHQKRPHFFP
IALIIDK
>MS0770 unknown
MQNLIKKAIEKIRNQVNKQFRRSINRKNQRLLTNHEMSVIASNCNGAFIL
HDLAEQFRSPFVNLYLEPADFVKYLQNIHHYMQADLQFIKTDKAYPVGKL
EDLTVYFMHYHSEQEARNKWIERTKRINLDNLFILMTDRDGCRYEDLSAF
DKLPFANKIVFTHKKYTEFSSALYIPGFEAQSQVGDLFEFSGWNGKKFYD
QLDYVNWFNTGKY
>MS0357 unknown
MKIQHTEDQQQGEFFILSETGEKVAKLTYFYQSPRVINANHTYVSDSLRG
QGIADKLYQALIQLIKEKRLELIPSCSYIAKKWRRDHQKS
>MS2328 unknown
MSTVQQAYELAKKQFADIGVDTEQALALLDQLPISMHCWQGDDVSGFEQG
AGALSGGIQTTGNYPGKARTPQELRADLDKAVSLIPGKKRLNLHASYLEA
DHRVDRNEVKPEHFANWVAWAKANNMGLDFNPTYFSHPLSAEATLSHQNK
EIRDFWIEHGKACRKISEYFGKELGTASVMNIWIPDGSKDFVVDKFAPRQ
RLVEALDEIIAEKIDAKYHLDAVESKLFGIGVESYTVGSNEFYAAYAVSR
GTALCLDAGHFHPTEVISDKISAVMPFVQHLLLHVSRPVRWDSDHVVLLD
DETQAIAGEIIRNQLFDRVHIGLDFFDASINRIAAWVIGTRNMQKALLRA
LLEPTDELRALENARDFGSRLALLEEQKSLPWQAVWDMYCERHNVPVGRR
WLDEVRAYEKTVLSQRV
>MS1775 unknown
MVVNEESNMILGTLLFYLALTLSGFLVFFLAVSNKKKLSRNQDIISGMYP
KEKNNKETQKNKNQQQVELEQLIITNNKFLNILSTIDKNIKVKLFITLIL
TGIYALFNLDAERKSLAIAGAVIFVLVILIPGSLANMILKRKIKNMMTDL
PGFVDLVAICVQTGMTINAALLRVAEDFKILNPDLSYVMLRIIRKAEIIG
LPSALDTLAVSLPTREIRMFTTVLQQSLNFGSSIYSHLLQLSSDMRELQL
LTIEEQLGTLSAKMSVPLILFIMFPIIILIVAPGVMRVFPNV
>MS2113 unknown
MKKDLIYRKRYLERVRPFIGKSLIKVFTGQRRVGKSYLLFQIMQEVQASD
SQAHIIYINKEDLAFSHIKTAQDLAEFVLIEKKSGKKNYVFIDEIQEISE
FETALRSLLLDDELDLYCTGSNAHLLSRDIAGSLSGRAIEINVHSLSYFE
FLEFMRLEDSDKTMSQFLKYGGLPYLKDLPLQDNIVFEYLRNIYSTIAVR
DIINRYALRNVQFLEQLTQFFASNIGNLFSAKKISDFLKSQRISANTVQV
QNYAEYLANAFLIHKVPRYDIEGKRIFEIGEKYYFEDLGLRNALIGYRVQ
DRGKLLENTIFNHLQIAGYDVKIGGLGTQEIDFVAEKDGERIYVQATLTI
NEEKTLEREFGNLLKIQDNYPKYVVTMDEFDGNTFEGVECLSLREFLMLL
MDSND
>MS1946 unknown
MMKQYYIGVMSGTSLDGVDLALMDFTLNPPKLMATDFTPMPEKIREKLTA
LLRSGETSLRNLGEIDHQLGLLYAESINRFLQKVRLKSEDICAVGCHGQT
VWHSPNCEFPFTMQIGDMNLVAAKTGITTVGDFRRKDMALGGQGAPLVPA
FHQDLFFAAERLTVVLNIGGISNISVLEENCPTVGYDVSVGNALLDSWIE
LHQGKRYDKDALWAKNGKISTALLTDLLAEPFFQQAPPKSTGRELFNLAW
LNKKLEKFTALSQPMPSPQDVQRTLVEFTALSIANELKKLQKSDRTNLLL
VCGGGARNPLIMQRLTALLAEWQVSTTSEFGLDIDYVEAAAFAWLAYRRI
HNLPSNLPSVTGAKSEVSLGVIFPK
>MS0913 unknown
MSLKNCKWWDKFHLPVKKPEFSGFIHTLFGYYFLRINNPINKPKNAPIVE
PIPAQHMRLGEDISACPGF
>MS1778 unknown
MLLLDKNQSNQNGARKIVVLSDSEEMQNNVSQLLRTRGFENVEQRKRHFL
SADIAFSPEDIIGMIIDIKDETDVSLIAEHITAIVPQNLWICAVGNSDSI
TLAQNLADTGILYFHADTQLHLMMEKITSSKISIPHTRHTVNVCVLGCKG
GIGSSLIATHIANQIISKKKVPVLLAQGPNGSQDLDLAFDKKLQGDIAKY
DEYLDIFNGVPQGLNDKVTEKYNFVIYDQPIFNIDKDLYPEFFKYSNSFV
LVVERRIGALRVAKQFLEQCDRLRSLTNQAVRVFVCISDHKPKSEKLMAK
SDIETLLGATVDAVIPFIKNTEAKTILALNLSKAHKKSFYTLAMKIIGVL
SRNNLNNENKSLFKGLYRLLFNR
>MS2382 unknown
MRMTKSNSTRETFSGRRAFIFAAIGSAVGLGNIWRFPYTTYENGGGAFII
PYLIALLTAGIPLLFLDYAIGHRHRGGAPLSYRRFSKHFEAFGWWQVMVN
VIIGLYYAVVLGWAATYTYFSFTMAWGDKPIDFFIGEFLKMGDITQGVSL
EFVGMVVGPLIAVWLVALGVLALGVQKGIARTSSILMPVLVIMFLILVIS
SLFLPGAAKGLDALFTPDWSKLSNPSVWIAAYGQIFFSLSICFGIMITYA
SYLKKEFDLTGSGLVVGFANSSFELLAGIGVFAALGFMAAASGHEVSEVA
KGGIGLAFFAFPTIINEAPFGQILGVLFFGSLTFAALTSFISVIEVIISA
VQDKLRIRRAKVTFIVGVPMMIVSTLLFGTTTGLPVLDVMDKFVNYFGIV
AVAFVSLIAIVANEKLGLLGDHLNETSSFKVGFIWRLCIVITTGILAFML
FSEGAKVFAEGYEGYPSWFVNSFGWGMAVMLVIVAVLLSRLKWKNEVQVS
GE
>MS0766 unknown
MHSIEIKGRILIIFCSITSKIDILIKKYYIKLNNI
>MS1287 unknown
MNNSYGTLYIVATPIGNLQDITQRALDIFTQVDLIAAEDTRHSGLLLSHY
GIKKPFFALHDHNEQQKADALVEKLRQGTNIALISDAGTPLISDPGFHLV
RKCRQTGLKVVPLPGACAAITALCASGIASDRFCFEGFLPAKSKARKDKL
QNIAEEDRTLIFYESTHRILDTLEDIEAILGAERYIVLAREITKTWETIT
GDTVANLRKWLAEDPNRTKGEMVLVIEGKAKSDDAEEISPQAIKALALLA
KELPLKKAAAIVAELYGYKKNALYQYGLEYLD
>MS0997 unknown
MLDHKMKNYYLQANLLLAILERNKNKCYFNSLIFCFPSSAFSKGTFFMSV
FYDLFQ
>MS0861 unknown
MRMIFYFDKFSDLKMAKQDADYITLDLFANVPKIGRPKTNPLSREQQIRI
NKRNQLKRDKSSGLKRVELKLHTDLVRQLEDLASQQQISRAEVIEKILQN
YFNIQENR
>MS0414 unknown
MLHLSGSDAPITANELLAIEKRLNIVLPQEMKNLYLKFNGGQPTEYVHDD
NYLYPIWAFSCLSEIEDDLQLIDENWCPNGFAPQELLPFAYNAVGGFFAL
SLRKQDFGFVYFILIEEKIEIIGKWKNFAIFLNSFIEKTQIDEN
>MS0393 unknown
MALHKCPECRHKISQNAMICPHCGFSFETASLEKYKQTLEQRRLHNQQIN
KKSAKLQFIWLIIFALFIALAGYFTS
>MS1083 unknown
MRSKISKFFTALCLYFKLKKIYKDMRYAFSHFCNFLQCFRFSFIQTK
>MS0186 unknown
MQGFFVTKFNQIKYLHLDLSSIKFRIETYFGVFLL
>MS0362 unknown
MKNDIKTLSLESSQSAKTNPIRLYSGGIGHSGKTKPAGAKFI
>MS0573 unknown
MGLFEAIFILFLLIVISAIISSSEISLAGARKIKLQSLANEGDTRAEKVL
KLQEHPGRFITVVQIGLNMVAIFGGMIGESALRPYIQQTIHQYTNAPWVD
GAASCASFVVVTAAFILLADLMPKRIAITYPEQVALRTVGVMSFCIVIFK
PLVLLFDSVANGLFRLLKISTVRHDSMTSEDIVAVVDAGAEAGVLKAQEH
YLIENIFDMQERTVTSTMTTRENIVFLNRTFDRQKVMETLTKDSHSKVLI
CDNGLDRILGYVESHTLLTLYLREEQVSLTDQRILRKPLFIPDTLSLYEV
LELFKSSGEDFAVIVNEYALVVGICTLNDVMSIVMGELVSSEEEQIVRRD
EDSWLIDGATPLEDVMRALNIESFPDWENYETISGFMMYMLRKIPKKTDF
VLYDKYKFEIIDTENFKIDQLMVSIRKDLNEQN
>MS0894 unknown
MNKLALYCRIGFEKETAAEITEKAAEKGVFGFARVNNDSGYVIFECYQEG
EADRLAREIPFNQLIFARQMIVISDLLENLPPTDRITPIIEEYNRIGSLV
NLHRTTELFVETADTNEAKELSVFCRKFTVPLRQALKKQGYLAFKEVKKS
GLTLHIFFVKPNCCYVGYSYNNNHSPNFMGILRLKFPPQAPSRSTLKLHE
AILTFLSPEEERKCMNESMYGVDLGACPGGWTYQLVKRGLFVYAVDHGKM
AASLHDTGRIDHCPEDGFKFQPPKRSKIDWLVCDMVEQPIRIAALIAKWL
VNEWCRESIFNLKLPMKKRYAEVQNCLQLITNELDKAGFKYHIQAKHLYH
DREEITVHISVKK
>MS1906 unknown
MRSKLIKIICLRSRICVSETIHTLEKQAKLAIITNGFTALQHLRLQRTGL
AQYFQFITISQELGIAKPDARIFEHSLQQADIEDKSQVLMVGDNLHSDIL
GGKNAGLDTCWLSYDKANDSDIAPTYSIKKFNELLDVVAA
>MS2297 unknown
MKMNKKFAQIVKNPAFRNMVLKTIFNVTNVMSATKYLR
>MS0865 unknown
MNMIRIIRSFGLALSLVFAFVGSTFAADLPTEKSLQAQIEQLQKDEQTEV
NKALVQNLQDAQELLAQIAKQKADNEKLNKDIDRSTRTLAESKANIERFK
KQEKTVEQLKEDFRKLSLTTLQDRSESATENLQNLQAELLTLNANLSGQK
TAPERAQAALTENLKQSQALNSQLSNVNIEKTLQTKLTAQLALLELKNAY
NQILLYGNNALTNLYTSQVNEKTLEQTQLQKQLTALQDIINEKNLEKTQE
QVEKATESQQKSAATNTNPVIVRELDLNTAVTKDLLEQTTKLNALSQDNL
RVKGILDNLQQTQHNIEEQISALQGTLVLSRIINKQQQSLPQDSMIKGLP
KQIADLRVKIFDITEFKNNVNNAPAYIASLEKSDKVTFTDAEKNQLTDIL
AARDKVLTDLLKQLNGQLNVSINLELNQQQVQTISDALQSKLKQQSFWVK
SNSNIDLKWLQDAPMLIRYQLRGIGNTFDFSNWRDNLVPAVFWILLLIAL
TAIIHRKKEKIKQQLTRINNKIKSLGTDNQWNTPLAIFWTIILCLPSTFM
FLAVFILVTYICFQDPTQVWPWGLKMSVYWLSFAFLLAMLRPNGIGYTHF
GMPKQSNETFRKILKQSVWVIALLLNTSIFTNLEMGVTYDVLGQTMTIIV
LIVTIFIVAPGFRKAISTYQEATNNDKQGTHTYVLYLIRAVLLLAPIILI
VLIAVGYYYTALVLIEHLVATYFAVITWVIFRNIIQRAFSVTSRRLAAKR
LQEKREQARAKAEASEHPEVDSGEVILEVKEETLAVSEVKQQISKITDFL
LWLCLFGLLYWVWSDLVTVAYYLDGVTLWKQSVTTESGTVMESVTLFNLL
IAILVLFATYVLVRNIGGVLEVLIFSNLKLSQGTPYTITTLLTYAVIALG
ASFAFGTLGMSWSKLQWLFAALSVGLGFGMQEIFANFVSGIIILFERPVR
IGDVITLGEFNGTVSKIRIRATTLVDFDGKEVIVPNKAFVTERLVNWALS
DTVTRVIIRVGVAYGSDLELTKKLLLQAADDCDKVLKTPSPVVYFLTFGA
STLDHELRVYVGNISDRNPTIDFLNNRINTLLAQHNIEIAFNQLDVFIKN
QNADEEVKLGNEQLKLQK
>MS1193 unknown
MFKWQKGILIALMMVLSGCSAKPSQSLDIEKSNKPKLIVGTTSKDSREFI
SCIDSKLANNENVRSHKKKSYQIKNGKTDKYSIISQKGYSYLLSVNCSKI
QQTVMNFFYFPQQKENEILEPILACLSAVNSVNLKTYPIESVIRDMPNKF
TALR
>MS1924 unknown
MKKLLIASLLLGSTSAIAAPFVVQDIRVDGVQAGSEGKVLAGLPVRVGQR
ATDGDIANVVKTLFARGYDNVKAARDGNTLVISVEQQPVIADVTIDGNSS
IPTDALKQNLDANGFKAGEVLNREKLEAFRQGIQEHYESTGRYNAKVETI
VNNLPNNTAEVKLQIKENDVALLKGISFEGNQAFDSDTLQEQMELQPDAW
WKFFGNKFENNQFGKDLETISDYYHNHGYAKFRVTDTDVQLNDEKTEARV
KVGVNEGDLYTIKDARIVGDVAGMQDELQPILKTIHVGEMYRRGELQSVE
EQIKAKLGERGYANATVNVHPDFDEENKTIAVTFIVEAGRRYSVRQIRFE
GNTVTADSTLRQEMRQQEGSWLSSQLVELGKVRLDRTGFYESVEHRTEEV
PGSDDELDVIYKIKERNTGSINFGIGYGTESGFSYQASVKQDNFLGMGSS
VSLSGSRNDYGTSVSLGYTEPYFTKDGVSLGGNIFYEKYDNSDSDTEASY
ARTTYGVNTTLSFPVNENNSYYMGLGYAYNKLKNITPEYNREKYMKSMGY
DETGDWRFKAHDFTFSTGWTFNNLNRGYFPTKGVKATLGGTVTVPGSDNK
YYKLNADVVGYYPLERSQTWVLSGKATVAYADGMGGKKLPFYQNYNIGGI
GSLRGFSYGGVGPNAIYIDSNGNYTQLDSDVVGGNAMATASAELIVPTPF
VAEKNQNSVRTSFFVDAGSLWNTHWKAEDKARFPTLPDYSDPSRIRVSAG
VGFQWQSPIGPLVFSYAKPIKKYDRDDVEQFQFSIGGTF
>MS0934 unknown
MDSRLLEIIACPRCQGRLQLDKENERLICRFEHIAFPIVQGIPVLLVEEA
VSLAEDPKDIT
>MS2001 unknown
MYLDSRYWQHNPRVADGADAFVQAFTQLAQSKPQARGTIKRVIAEGDYVV
LHVHRQDTPDDLGRAVVDIFRLDKDGKIIEHWDVGQAVPEKTASGRSMF
>MS1810 unknown
MKNYSETIIIGAGAAGLFCAGQIGKAGKSVTVFDNGKKAGRKILMSGGGF
CNFTNLEVLPSHYLSHNPHFVKSALARFTQWDFIAMVAAQGIAYHEKESG
QLFCDNGAEDIVKMLEARCTENRVSIQLRQRIDLVEAVHNDENARFKIQS
GGQTWYCKNLVIATGGLSMPALGASPFGYQIAEQFGLNVLSPRASLVPFT
YRENDKFLTALSGISLPVRVTAQNGKSFSNNLLFTHRGVSGPAILQISNY
WQPNESVEIDLLPTDSIEEYLSQLKASSPKLQLKTALSRILPKKLVELWF
ERQLLQDETLANLSKVRLKNLENLIHHWQFQPNGTEGYRTAEVTMGGIDT
KEISSKTMESQKVKGLYFIGEVLDVTGWLGGYNFQWAWSSAYACAVGITQ
TE
>MS0171 unknown
MYAILLQIKTFKTIFSPETLRYTMRAFSGQGEIPYWWYMFT
>MS1213 unknown
MCQNRLNLSTLPTSTGATSTEATATKTATGRSTSAETAKTSGTKTA
>MS2150 unknown
MLTQSPAFNPQSQLSRCGAKNQAIGCNGNFMARCTTGREQSVSLEVFFIK
LPEKPYS
>MS0081 unknown
MTEAIKIINDDVKIVLAETIADYEKRTGKTLRPAHIERSIIQSYAYREQL
VRQGINHAFLQTFPQFATGLALDLCGEPMGCYRLSDLPAEVTLRFSVEGD
HDAVVIPEGTLVAATDNVVFATDTEVRISSTESYVDVVGICQITGAVGNG
WQLGQVKTLKSTLDAKVTVSNIDVSDNGIDTESDDDYRKRILLAPEAFTT
CGSVAAYEYHTRSVSQYIADVDIATPVGGTVQVTILTKQGLPSSILLNKV
KDHISGEKLRPLCDTVVVSSPERVAYSVVANLDLLETVAESDVKVQAEAA
LRAFISSRTQLLGADIVPLDIQAALKVAGVYNVTLASPTLTKLTKQQWAE
CESITININGERQDG
>MS1544 unknown
MFFHNFHNVLGKGHFVHKISLKKNRTLHKKVRSIFR
>MS1402 unknown
MRYNSSFQITLSRQMNKPNITIQPIQASHYADYVALIGKQLGEGYFKQAD
FEALANNPQAICFEAVDEQNQVVGVITSVTLDRESALALLKIQAQNTPDY
VLQSDRIGIFKTIAIDENRKGCGIGSALVRKLLESFKQAGLNSIACVAWQ
YGETENIRGIMQAFDFTCYEKIANYWLDDPEPFICPACGEPPCRCQANIY
FRQI
>MS2377 unknown
MSENKKQLTARDIRATYWRSTFLLGSFNFERMQAMGFCVSMIPTIKRLYS
QKEDQAAALKRHLEFFNTQPWVGSAIMGVTAAMEQERANGATDIDDAAIS
GVKVGLMGPLAGVGDPIFWGTLRPVLAALGAGLAISGSLLGPLLFFIGIN
ICRALTRWYGFKYGYAKGTEIVSDMGGGRLQKLTQGASILGLFVMGSLVS
KWTSINIPLELSRYHNAMGEEVVTTVQSVLNDLLPGLAALLLTFFCMYLL
RKKVNAMYIIFALFGVGILGYWLGILA
>MS0084 unknown
MSNVPKPDFSLCYEKTNITADIEPRLVQFTYTDHLEGQSDELTVEFEDIS
GKWVRQWFPTQGDKLRAAIGYKDSLLVDIGEFEIDEVEYRYKPSTINLKA
LSTGISKANRTLKPKAYENTTLAQVVAKVADSLKLKLVGKIKAIPIKRIT
QYQERDVEFLARLAREYHHSFKIVGSQLIFTDKTELGKSEPVLILEERDT
ISLSLRDRIKDTAKAVDISGFDASGKKVVKKRKKATALRPNLKQVKASSE
DTLKVVTRGETQEQIDARGEAALAEQNDNQTAGNITLIGNPELVAGATIL
LKNLGVFSGKYLIKSSRHSFGRNSGYTTEIEVRMLEFIADDLITLGMEKT
NANA
>MS1963 unknown
MIKKCHKVLTLLIVFWSRRYFKVEYGYSLYQ
>MS1848 unknown
MQNFDRTFLLFRSYIGTCRRFKLEKFYFSGKISKNLPIS
>MS0141 unknown
MKQPAIFVGHGSPMNVIEENNPFNQKFAEITRTFAKPKAILCISAHWYSK
ELEVQSGANPKMIYDFYGFPPQLSRVQYPASGNPRLAAQIQQLLAPEEVR
LNPDRGYDHGAWAVLKHLYPEADIPVIQLSLDRTKPASWHFALAQKLKSL
REQGVLILASGNIVHNLSALSYEHINRLDAGYDWAYEFRDQINRAIAGNN
IELLTHIERLGRPAMLSVPTPEHYLPLLYVVAMREEQDNVELFNDHLVGG
SLSMTSVFIG
>MS1160 unknown
MILISFLNFVYFIKRLKFPMIQASNLSGLKKCGRISPFFYDLD
>MS0101 unknown
MKKNLVSEIATRARSIDFWAFGYYLPNPDPILKKMGKDIAVYRELLSDGQ
VRSGVRRRKAAVKKLEWRITTTNNAKVDEQLERIFSRLKMNHIITEMLNA
ALYGYQVSEVMWGERDGLFVPLEIIGKKPEWFVFDEDNQLRFRTKENWVT
GELLPEDKFLLTTQEATQDNPYGLGDLSLCFWAATFKKGGLKFWLEFTEK
YGSPWLVGKHSRQAQQPDKDRLADSLEAMIGSAIAVIPDDSSVEIIESSG
KAASADTYEKFLKHCKAEINIALLGQNQTTEQESNRASAQAGLEVAEDIC
ADDRAMIEETFNTLLQWIVKYNFNVEQLPQFEFFEQAEINTTQVERDTKL
HGMGVRFSKTYFQREYGFEDGDIEIQQAQSAVKNPQVSEFAEHNQQGLHP
IADGIIEQLEIEGESQVDDWLQTVKDRLAKADSLEDFRNQLDSLIPELTF
AEYGKLLAMASTVSELAGRQSVNDERKVKGDE
>MS0298 unknown
MATNYYDITLALAGVCQSAKLVQQFALEGKADEEAFNTSLYTLLQTTPKD
ILSVYGGHERNLKLGLETLLEQLNGSTEDITRYWLSLLALSGKLEKNAQA
KSELARRIQYLPTQLEHYDLLDEQMLANLASMYVDIISPLGNKIQVKGSI
EVLQQTSMHHRIRACLLAGIRSALLWRQVGGSKWQLLFSRRKIFNMAKQI
YSSL
>MS0343 unknown
MKKLILATALSSVAAFTQAQIVPNANSATHTYEFTQSYDLQVPKGSSGET
KLWVPLPFSNDYQDVKSVEFDGNYQQAYITENNQYGAKTLFALWDKDAQK
RDLKVKLVVTTKDREPMKQGLLENYQAPENIEYSVDVQQYLKPTQHIKTD
GIVKQFADKIVGKESNPLKKAEMIHQWIVNNMERDNSVLGCGDGDVEKIL
TTGVLKGKCTDINSVFVALARASGIPAREIFGIRLGQAVKMGEYSKGAFG
SAKDKVANENGGQHCRAEFYLAGFGWVPVDSADVAKYRLTENKSVEDKDT
QAVSQYLFGNWEANWMGFNHARDFNLYPMPELAPLNNFGYPYAEVGGDPL
NSYDAKKFGYEFTSKEL
>MS1206 unknown
MGYFQVPKVRSKIFKFFTALLSSLHYATQILKYKYLKNIFVQKDYFFSFL
KNINQTNTLVLYANKINIAK
>MS2289 unknown
MIIRIAKKQDYPQIIDIYNQAIPSRRITADLEPVTMESRKDWFEFHLHSE
RHPIWVLENSIIKNNQEEKQILGWCTFSPFYPRAAFDNTVEISIYLDNKA
KGNGYGSKILQFMKEQMMCRDINTLMAYVIEENNISRKAFEKQGFKLWGR
YPNIANMGDCYQTFLMYGYQSGIKNS
>MS1909 unknown
MQFIKNGRQYREATSQKISWGHWFALFNIIWAILFGSRYAFIIDWPSTLW
GKLYFFISILGHFSFVVFAGYLLIIFPLSFIIKNERTFRGLSVIVTTICL
TLLLIDTEVFSRFNLHLSSVVWNLLVNPEDGELSRDWQIFFAPMPLILLV
QMLYSRWSWNKLRSLERQKWMRKVGIFFVTMFVATHLIYAWADAYIYRPI
TMQKSNFPLSYPMTARTFLEKNGLLDKTEYAQTLEQEGRPEAFNIDYPKH
KLAYMPIERKPNILLINISGMRYDSVIESKMPNLTEFAKQSAQFMNHYST
GNNSNLGLTGLFYGLNASYTDSILHNKTESELFKKLQAEHYQMGLFSANN
FKDSLFRQALFQKVNLPRIKAGNQSAVKNWLIWLNKAHLDQAWFSYLDLD
VLTAVQNADPKSKEEETEIYDNQLGNVDVQLQIVFEQLQERGLLDKTIVI
ITADHGHAFQLSDKEHIDYFGLDEIQVPMIIRWNALLNEQQSKLTSHVDL
VPTLMQNVFKVENPITDYAQGESLINISRKADWILVGNYRWNVIISPNGN
QYHIDRKGQYQKYNVDYEKESSLRPPLGLFLEVFTQSRSFMAK
>MS0170 unknown
MRYSLFLHIMKYKYDKNAKKLTALCIRDIFVL
>MS1935 unknown
MIIPWQELEPDTLINIVESFILREGTDYGMEELSLAEKRDNLLKQIHSGK
AVIFWSELHETIDIKTTT
>MS1603 unknown
MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ
GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK
SLRR
>MS1513 unknown
MMGSYYTNIYLPCNPKMKKTDFLPKKYQILLFWQTNMPPIIFGHHPI
>MS2273 unknown
MLELAYLQTLPQQRALLKADYADFIVKEDLGYAMTGEGEFVALYVRKTDA
NTLFVGEQLAKFVGLSPRNMGYAGLKDRKAVTEQWFCLQMPGKAMPDFSR
FNMAGVEILQVTRHSRKIRTGSLNGNHFEILLRNAVETDELKVRLENIKN
FGFPNYFTEQRFGKDGHNLTQAMRWANGEIKVKDRKKRSFYLSAARSEVF
NLVVSERIRQGLANQVLAHDILQLAGTHSWFTADGKEDLALLQTRLENHD
LQLTAPLIGETQQLACELENKLVERHQSLISLMKRERMKPARRPLLMQAR
DFHWEFVENGLKLKFYLPAGSYATALVRELVNIDENE
>MS1560 unknown
MRSFSIFLGVMIRPTGNVMINIQPSALMQSATWAQPIEMLFACHGRVKNF
CRQLGMLPDYLAENGVNQAVKNDVKQIITYFNVAAPLHHKDEESDFFPAL
LHYVPEAKTDILKLEAEHIGIHGIWEQLGVQLQELIDEKRTTIEQSLLDD
YRAAYERHIALEEPLFELGQKHIPAEQLTAMGKIMAERRKVKNS
>MS2083 unknown
MQIFIYQELKMKKFISLLILTALCASVTACGVKGPLYFPEQEQPKQEQAE
>MS1453 unknown
MKFWLEIPIIKNLYRLIRKVDEKNCFKNDRTFRIGIKFDRLGGCGR
>MS2000 unknown
MNAEQRVIVTPTFWGKFFGKIKSVELQQNKVIVTDKKNNITEHDLGKTFD
FPAIQKSFFGTKLFFKDDSTEVVLSKLAKKQTDSLLLEIEKVVASNIKVK
VKEGFQHFANLAENQYLRDSDIPTLNDRVRLSVLSYGDNKEHFQKYFDES
LVKKIQYISSLLGFVQLLHNTVSISFRHNKLPICI
>MS0801 unknown
MKSIVKVLGILVIAGSVGACSNMSKTQKNTAIGAVAGGVVGHAIGESTGA
TLGGAALGGLIGSQVK
>MS1257 unknown
MFFALKEKCGYFFEKFCLDMNHGSKNETGIAGL
>MS1972 unknown
MLLNGGAMVKQVWIKQLWHRCIQASVLKQNTGLLILALFGLFLPLNRLYS
SWEQLIRLENNINEQQRQTIYQQRLLQSLEKKAKNDLLTPQSAALLSQIN
QYVQSSSVNVKIQNAQWHFSSSAVLQLRMEGDFLSLNQFITDILQKFETL
RLSSLKLFKPDENLAAYLTLRLQLTKE
>MS1320 unknown
MRLYVAFYKIYKIKFLFKFMTKITFLTFFSDENQQK
>MS1765 unknown
MQIMKKIMALLVAGAFIFTLSACETTKGVGKDIQNAGQKMEQVFN
>MS0135 unknown
MIKLLKPVDVKSNEKQALLWSWLYVFALFLAYYTLRPIRDELGAAGGVTQ
LTWLFTGTLVAMLMLTPLYGYLVKHWKREKFITISYRFFMLNLVVFAMLM
AMATGDVLVWTGRIFFIWVSVFNLFVVSVFWSLMADIFNTDQGKRLFGFL
ATGSTIGGIAGSAFVSFFADVFSNYILLLMAILLLEMSVLAAKKLSKLGE
IELRASNSAGRFNQEIGGGVLDGLKRTFQSPYLLGISGFILLYSITSTVL
YFQQAEIVNSTFSDRAERTAFFANIDLWVNSLTLFFQFGLTGRMMKYIGI
LPVLSLLPLFSVISFAALAMNPTIVVFVLVQVSRRVANFAFARPSREVLF
TRLSREDRYKAKNVIDTLIYRSGDQIGSWGYAGLGALGLSLTGISWLTVP
VCVLWFGLSIWLAGKEDVD
>MS0446 unknown
MKINSFIKLFSLESPMKIELPKIFIISLKNSPRRDVIAQRFNALGIKFEF
FDAIYGKDLSQEELSKIDREFAVKRFSTKKPLTLGEIGCALSHIAVYEHI
LKNNIEQAIIFEDDAIIHHEFKKIVEETLSKVPSRREIIFFEHGKAKSWF
CKRSIHEGYKLVRYRSPSKNSKRCIFRTTSYLITLSGAKKLLNHAYPVRM
PSDYLTGGLQITQINAYGIEPPCVFCGVDSEINAIEDRYN
>MS1394 unknown
MKLSILNLVPVREGQNYQQAMASMVTLAQYAEQIGIERYWIAEHHNTKNL
ASSATALLIQHTLAHTETLRVGSGGVMLPNHSPYIVAEQYGTLETLYPNR
VELGLGRAPGTDMRTAYALRKGREHSDFPTEIAELRGYFENTNPVSAYPA
AGLKVPFYILGSSTESAYLAAELGLPYAFASHFAPRMMEMAVEIYRKQFK
PSPHLAAPYVILGVNAIVAQTDEQARQLATTQTQFFLNVVTNAQQNLQPP
LASADDVWKRHLSAQFPPHFGPVDFQEIPLYNQERAVVEQMTACSLIGSS
ASVTHQLNTLRDQVHFDEIMAVSYIFDEQLQRLSYKMLKEIVDKI
>MS0313 unknown
MKKISLFLTALLAASSALAANNQAAPQQENAKTEFMFGAKAANDPVGIWQ
KDGRHFSKKDLSKQFCWTLTNFRSDSGNVNITITLTSPKNTNFNLGEHIS
KNTTTHIFNFTYPITQTYYNCWAFEESDPEGKYTLTVKANNTTFPTQVFT
LTK
>MS0750 unknown
MDFQHNREQFLNRLAAKMGKARSFSPQAMEEPVNRYPTERLTELSQAQLC
EEFVNFAKVMMVDVKVCPESDVVSSALSLCEKYGGNSVILNDDERLTRLG
ITQALQEKYPCHIWSPETGQQNIDKAEKANIGVVYAEYGLAESGGIVLYS
QPERGRAVSLLPEKSIVVLRKSQVLPRVAQLAKVLHDKAQKGERMPSCVN
IISGPSSTADIELIKVVGVHGPVAKIYLLIDDL
>MS0325 unknown
MRKQKIVNIKDLVESSDLSKIMQKGLFLNRLNQQLQQWFPSQFKGMYRLA
NFTENGLHIEVANAVVRQGFLFRRQELLQLVQKEYPEITRLNFKINPELN
R
>MS0158 unknown
MVIRRKCGSFFAGNIMLSIKRPFTDEDRAILNSYKAVVEGVSALLGSHCE
ILLHSLEDLDNTAVYIANGHNTNRQAGTTLSEADLQSLQAMENGMVLKPY
FTRHKGNNGLMKSTSIAIRNGKRQIIGLLCINLNLEVPVSQFIQAFIPTQ
DYPVTTAGNFASSVEELVLQTVETTIEEITADRLVANNNKNRQIVTTLFE
KGIFDIKDAINLVAERLNISRHTVYLYIRQIKQDDQK
>MS2055 unknown
MMFPPSHGLFNVGMKKSAVKKTKILLNKVRK
>MS1887 unknown
MKKLSLALSILLLAGCMGTELSTKDKTYNASTDARIRIFGQNGRPSTLTI
EHNGNKEKITIGGGVGQAFSSLVGAKGNESIGMPESVYSKDPSQFSNIGS
TPFFKEFIIPANAKVNVKNEIMSAPHIFKDVTTGKTTTTYYKCSGGKEIS
FVAEAGKNYEVIPSSSTNECGVTLNELN
>MS2013 unknown
MGTLETTVKNRSMKITKIRPQKRNSCSICGKAQVTRKFQEEYYCANCYAQ
WFKKKTCKQCGQLKRIHREGELCLECEKLTDCVRCGKTSGTFEIGMISRY
GAVCSSCTRYFREEIECSECGKMTRDRYRSLVTNESVCLQCYRRYTFATC
KNCRRYRKVHNQEKQLCKKCDEKLLSTCSKCKGEMPSGYGNVCPDCARRS
LLFNMIRLNGHILRNKAVKTAYKKFIFWYMRKCGISVVLHKGSDFMRFFI
DCDDIWQKIPDYAELVTHFKPNGLRANLTVLRWLLDTNQVVVDEALKDDL
AEMQRIQSLFNKLKESIPCIASYYKLLQRRYDDGKTSLKSVRLALQPAID
LISSQAVTDYPTQEQLNNYLSEKTGQIAAITGFINHLKSAYRRELKIDRK
LIQQMKAKQLKKHYSKRLVELYKQTELTTAEQMDLLSVVLYSLHGIEIKK
PKFDAIVLIDGVAYYRDNTKDYFLPQDIYLRIKPQF
>MS1143 unknown
MLIGLFIGLLFGFFLQRGQFCFVSGFRIIYTQRNFRFLTALLIAVSIQSI
GFFSLSGLDLITIPNTPMPLLATLIGGLLFGIGMVLANCCASGGWFRTRE
GAVGSWIALICFALTMAATQTGALKQWINPLLLETTTLDNIYNTFNLSPW
ILVTVLVLITVVMIVYHIKHPRYQFPQEPTTALIPHRIFTKHWHPFTAAV
WIGLLGVLAWLVSEQYGRSYGYGVAVPTANVVQYIVIGQQRYLNWGSYFV
LGILLGSFIAAKLSGEFEIRLPEPKAILQRMLGGVIMGIGASLAGGCTIT
NALVSTAYFSWQGWLATLMIMIGCWLTSVLVKPTQCRI
>MS1747 unknown
MKQKIVLATGNKGKVREMSDVLADFGFEVVAQTDLDIESPEETGLTFVEN
ALLKARYAAKVSGLPAIADDSGLVVEALNGAPGLYSARYAGIDGETADAE
NRRKLLRDLADVPVGKRQAKFVSCIVMLRHETDPSPIIAEGECIGEIIFA
EKGENGFGYDSLFFTPEKGCTFAELETVEKKKISHRARALAVLKSKLGA
>MS1907 unknown
MSKSTCRAYFFIGGFAMDKMVIWLLALIIAAPVLVLAVSPTLNKMGNQVG
NMGNNNSAAFSQQGNSVHGQDGSIYNRVGNTTYSNKGTVYYNTGEHTYAS
DGSYCTKIGAVTQCNKPTK
>MS1646 unknown
MSNILNWPDYKVLQVSELEHDYQVHAEVSEPPTQCPHCNHPEIVGFGRRD
EVIMDTPVHGRRTGIMLNRRRYRCQSCRKTFLEPVPHKDEKRQMTNRLIQ
YIERESLRRTFSSVAEDVGVDEKTVRNIFNDYCERLEKTLNFEMPQWLGI
DEIHIIKPRCVITNIQQQTIVDMLDNRNKTTVTRYLSKRTDRDLVRYVAM
DMWRPYRQAVETMIPDATVIIDKFHVVRMANESLERARKAIRSALTPQQR
RGLMRDRFVLLKRRHELTDAEYMRFSGWTLNYPEIGQAYELKEAFFEIWD
CQTRHQAQEAYYSWLRQITPEMKAHYDPLIKAMGNWHDDIFAYFDHPITN
AYTESLNNLIRVVNRVGRGYSFEALRAKILFTEGFQKIKKPRYQRQRIPE
GAMGRMPFYGVAEAGPSTNYGADISTLVREIEAGRL
>MS1136 unknown
MHNSIKKVRSVFQKFYLTSHHNHSADSKTSGNLTALYHNKLYK
>MS1740 unknown
MEVVIMAKGKKIQLTFESFIDSDTNVKVTRLTPKDVTCHRNYFYQKCFTQ
DGKKLLFAGDFDGNRNYYLLDLQSQEAIQLTEGKGDNTFGGFISHDDKFF
FYVKNESSLRKVDLATLEEKVIYTVDENWKGYGTWVANSDCTKLVGIEIL
KSCWQPLTDWDKFKAFYHTNPTCRLIKVDILTGDLEVVLQDNVWLGHPTY
RPFDDSIVGFCHEGPHDLVDARMWFVNEDGTNVRKAKEHQEGESCTHEFW
VPDGSKMIYVSYFKGQTERVIYSVDPNTLENTRLITMPPCSHLMSNFNGN
LLIGDGCDSPVDVADSDSYNIENDPFLYLFNIEKQRTVKLAKHSSSWQVL
DGDRQITHPHPSFNPNDSAVLFGSDFEGRPAIYLADISQLKD
>MS2311 unknown
MIMQKLIKLFFSGILLTLSIQAAQAETQYVTENLNTYLRKGAGDNFKIAG
AIQAGEAVSVLDRKEKYSLIRDSKNREAWILTAELTDTPSSKEENPRLKA
QVQELTSKLNRLDADWQQRTTEMQRRTKDADQKSSQLLEENSQLKRELEI
TKNKNRDLEAMLDAGKREIAIQWFIYGGSVLGVGLLIGLIIPLILPKRRR
RDGWA
>MS0738 unknown
MNQFITLLLSTWGILSIHQISRRQSVDYMQTAKSTLGLIFGVIILNILIA
LPLMGGLINIIPAAINPAAASAGIIGFALMIFGVYVYVRLCLAPIHYTVS
KTNIFASLQQTWQLGNKRTSTLFLYCLLVYFIVPFIAQQVAFLANNTFLN
IITTLIISFLSVFTLVVTYRFYILFTQKA
>MS0725 unknown
MMIYRGETLVGLLISMTLSAFLILIAVQFYVYVQHTNLQVMQRLELQAEL
QSILQIIAKDLRRTGFDLPYSEPEKIKFDHFSKESPNSCVIFTYGLGESD
KTKLKKQNTEEDTKVVLGYRLYNQRLEAIPQAKKTNTERNEKTLVEGCSL
RLGWEQLIDSDKFAVSQLQFKWLVEKKGIEIYLKGYLKQQKSLFYETSII
LPIMNEVMWDENL
>MS1572 unknown
MENVNKQSFQDVLEYVRLYRLRNKLLRDIGDNDRKIRDNQKRVLLLDNLS
QYITNDMSVEDIRAIIENMRDDYEGRVDDYMIRNADLSKERREIKEKMKA
QKKAHAELLKKADD
>MS0120 unknown
MAILPEVLMNIALDVKRAKARGDKLEPIYQRGCELTKLSRATLIRQLKPY
LPPSGRKVRSDKGTNQLELAELKTISAAWLENRRNQYKKRMLPLDELLAM
LRANGEIKAEFVDKATGEIRPYSESAVSRALINARLHPDQLLKPKPAIRM
RSLHPNHCWQIDPSLCVLYYLKRDHKQTENGLQVMEAKRFYKNKPANVAS
VESDRVWRYVITDHTSGVIYVEYVYGGETSENLCNTFINAMQRKPHGDEP
FCGVPKMVMLDPGSANTSKMFDNLCYQLGVKLQINEPGNPRAKGQVEKGN
DIVERQFESRLRFKSVANLDELNERAHEWMRAFNATKKHSRHGMPRYKAW
LHITKEQLVLAPSLDICRELMVSKLVERQVDGQLQVKFEGLTYDVSGVPN
LNVGDKLRLGKNPYRPDCIQVECFEQVFDENNEMSLKPYWFVVEPIETDK
FGLDVNAAVIGESYKSHAKTTLETNRETVERLAYGATDDDGVKAAKKANK
PLFDGRIDPFKTIDERPDVMFIPKRGQEHELTTNARRVEQKPVGLVECAK
QLKARFPQWNGKHYKQLATHFADGVPAELLETWLQDEKLPEILNPETKIL
KLSAA
>MS0319 unknown
MAIKRNQRQRKKMHLAEFQELGFLVNWQFAENTAIEQVDEVVDRFIRDVI
QPNGLAYEGSGYLQWEGLVCLEKLGKCDESHRELVKNWLESNGLQQVEVS
ALFDIWWDYPVKEA
>MS1279 unknown
MVTGVMFEPEPIYDELDKKPAELTHDQPLGFTDVVDNAKKEAKTTKTANK
KTKDKKSASHLRIVK
>MS1785 unknown
MVINMNNAQVDELVVKHLKANPQFFVQHIDLLDQLIIPHAQKGTLSLVEM
QLERQRERIKELEAELALFADLAHQQQDIFLALMPLQKRLAQCKNFPEGV
EEINKWARNFELQQAKILLFNDCWQKNPSVGEEFWIDRKGFELIRLERIG
LRHFYLGELTNKEKSLLFLPEELPVGSIACCLLGMKKNQHKSTALLLFSA
RDTAHFHNGQDTAFLKHLVSIVELHLHRWLMIYQQAE
>MS1332 unknown
MVNIYGMTLNNRPRSLIMSKTTLSLLISATLLLSACNDEEVRSLKEQLQT
SRQQIAQLQAELQQTNATSTIKADSAQPEAAISPTEDTAIQGKIIQDEIP
TLYVKPVTVFDKTEKFNFNVSKKPKNNEPLYEESHIHYAMHTVETGIEWL
DSLLYQNLMADITIEDADKQKEFESIPNAKDRYAAFIEYFYNNALPEIKN
GTTLGSDYYINLDYVGQRENILTFKVANYMYDGGAHGMYSTDYINIDSRK
KAVIDLNTLVNKDKQNQLKELLWKSYLAYTNNDNSDIPFTEKQDFDISQQ
FYFSSEGVNFVYPPYALGSFAEGEITLSISWSDAKNIINKDYLREGFTIQ
E
>MS0409 unknown
MTSEDPIYEKLNETTSIRGFITACVAIFDESVDQLINRVFRKTDFAVKSV
VDSLFINSGPLFDLSIRLKVLLGLGIISHETFMDINAFIQLKEALNNDGK
EYEFFDPIIISFIQGLNVRQDKSFLNLDTKIDGTKDSLLYQVKVLRREKL
IRSYLILSVTDLYDQLQVESPL
>MS1628 unknown
MTRFTMKKNIKIILLILYILSPIDLVPEAFVGLLGLSDDLIALILLIKQI
LKK
>MS0739 unknown
MTINFQQILQDSWNFIRNQRKFTLMLTLTFCLVTLILNIFGSSLFQSVTE
TAINEPIDKNELSTMMQRVQKAVTFYYYM
>MS0042 unknown
MTALYNFIQAIRLKSAVILWIILLKPYYVYTKYRNWALRQFSNYWHNILQ
NN
>MS0962 unknown
MVFPEYKVRIPQFKCFNQKIQFFDRTFFIILGLTMSITLSTKQKQFLKGL
AHHLNPVVMLGNNGLTEGVLAEIDNALNHHELIKVKIAGSDRETKQLIID
AIIRETGSGAVQTIGHILVLYRPSEDIKIQLPKK
>MS1211 unknown
MKTLCKSSTRFDKQALSLGFSELILQENAARGVAELVRQKLQTGEKILFL
CGGGNNGSDAIACARMLSGDYECELYFITENLNANAQAQLNIALKVGVNR
VSQPDLANIGCVIDGMFGSGLSRDLDAEIIRLLDRINAHNALKIAIDFPS
GLDGNGNIRGACFKTDFTLAMGALKIGLFSDVAKDMTGEVSLVNLGLSDN
RFITNQEDFLLERTDLNLPSRTLQNVHKGSFGHAFVALGQMPGAGIMAAT
AALIMGAGKVSVVGKAENLTPQIMQKNNFDGAGAVAIGIGLGNADIDISA
IKHLPCVLDADLCYRAEIREFLPNPSAVFTPHPKEFSGLLRNLGLADISI
EEAIKNRFELAREFSRQIKGVLLLKGANPIIAYNGTLYLCNLGSNKLSVG
GSGDVLAGIILGYLAQGFTALEAAQNAVLAHAKSAENYQGNDYSMTPLDL
IDGLRYL
>MS1616 unknown
MNKFMSEEQNSKFLTALFSIMPVVLLLGIDIYTMFLQTEAKAISHFNLGV
LAAQFICSLVFLKGRICNGQRGRLTQAVMYFAVYWLVWLLLSLFSSYHFI
LTDMLSVAGLLMLFSMWRQPLEPGSRRLMLNMGALAGLLGVICFFVQLAE
IPVLHWVQYNFFGQALAGVILANLLLVISRNRLQSFMALLPLVMSVLLVL
NSIFTLAVLAYGQLGSAVVFANNFAFVLYFLLHLVMIAILAFHIFRKAKL
SYNTLMLLLVISLSLPLWASFAYLE
>MS0072 unknown
MFELLLQGISSLFTITGLSCLLGGVFIGIIFGSVPGLSATMALALFLPIT
YALDPNMAVILLIALYIGGISGGLITAILTGIPGTPSSIATCFDGYPMTK
RNQAFKALGVGVTFSFLGTIFSTIVLIFLSPILAKIAIKFGAYEYFAVAL
FSLSMLVGLSGENIWKGLISGLMGCMFATVGMDSIASVNRFTFGSEEIAY
GFDVLPVLIGLFAINEIIAKADTVKTEHQNMQVITAIRMEKGLGFSLKEF
FGQIKNFFVCSSLGTGIGILPGIGGGAANVMAYTVSKSISKHPEKYGTGI
IDGVVASESSNNAAVGGALVPLLALGIPGDTVTAILLGGLTLHGIIPGPL
MFTENVGTVYAIFTAMLFGSVVMFIMEFYGLRLFTKILSIPKHFLLPAIF
LFCIIGAFGVRNNFFDVWATVLFGIIGFSFYKLSIPAAPFILGFIIGPLA
EINLRRGLMFSQGDFTAFFKAPIALTFFILTGLVIVFAVKSRIKKHN
>MS1522 unknown
MFKVQRIYDFEPMENDCAVFVDRLYPRGVNKEKFAHCLWLKDVCPSHELR
RFYHENPQENYGEFVLRYQLELGNELPQKGLIMLKRLEKEHPQVILLTAV
KDVRHSHIPVLLKALAAIVEFL
>MS1954 unknown
MQPYSKSLKELSQKLRSDQTDAERKLWQRINREQLLGFRFNRQKPLLNYI
VDFYCPKAKLIIELDGSQHYEPDYQGKDALRDAELNSLGFTVMRFSNDEV
YYEIEAVVDQIYLFLESIDHDRAD
>MS1796 unknown
MGLCLKLPQASGKVFSIYAYFKRNKRFTKKFF
>MS0908 unknown
MLMFELTKLITNMLLPPFNILILLVLSFLFLAFKFKKLAALCALSGLTIL
YVFSIPYTAQLLNDSLTTEDNLTVEDYRSAQAIVVLGAGLRDSKELYNKI
TVPGIALERMRYAAYLHKETELPILISGAGPNGNSEAKIMGQEFFTFFGV
QPKWLEERSTNTKQNALYTREMLEREGIKRIILVTNQWHMQRAKLLFEAQ
GFEVLPASVGSGVTPESYELNAMHFIPQAGAMAANMQLLKEWLGFIKEKL
>MS1549 unknown
MILTRYLTKEVFKSQVAILFILLLIFFSQQLVRVLGSAANGNVPADLVLS
LLGLGMPAMAQLMLPLCLFIAILLTFGRLYAESEISVMRACGVGQRILVK
VALGLSVLTAALAAYNVLWVSPWAIQKQGQIVEDARANPNMSALSAGQFM
TSNDSDFVLFIDNIKDNKISNIYLFQTKEKGNSKPSVIVAENGELQSLPN
GDQILSLQNSQRVEGSAALPDFRITNFTEYQAYLGHRNVDSDENETTELP
LAELLALKTPAAKAELNWRISLILAVPLMALLAVPLSKVNPRQGRFAKIL
PALLLYLIYFLLQSSLKSAGGAGKLDAGLLMPLVNLFFLLLGIMLNSWNS
AFMYKIRHLFSKKSAI
>MS1733 unknown
MFLPELTGLCLFSHDDSRKALSFCSLKNWKQAEKLRDFLKRKSEYKKSWP
GKSGQVFSNNERHRYKNT
>MS0211 unknown
MKFLTALFYSFFLFIAKKYSKRYNNQPFLRNVAGGKVSSAFCVIGIT
>MS1350 unknown
MLNKVFLSGVALLLAGCAAEQAPIPAQFAGADYQLSDKDAKQWVALGKRA
ESCIYPNLTRIQQEHFAKEDSYIYSQYVFFYPLEDVIGSDAVKIIEADQQ
SMDYATYQFKKFKQSDELPKLDELTTAQCNTLRIKAREDLAVVKGQRISA
MVEDTNTTGGTSNANKVGTEDNKFFFDIIKWGSALLL
>MS0215 unknown
MFIYAGHKTKSAVQNFQIFDRTFIPPFVRLYPLH
>MS0651 unknown
MSLRLQFPGGIIFSALAIPGKIWYFLSVNPTETYLPSMNEEHSTKTTQDT
IQKKNFFQSLFDRFFQGELKNRDELVEVIRDSEQNELIDQDTREMIEGVM
EIAELRVRDIMIPRAQIVFIQTDQDLESCLDTIISSAHSRFPVVGNEKDN
VAGILHAKDLLKFLRTDAEEFQLESLLRPVVIVPESKRVDRMLKEFRSER
FHMAIVVDEFGAVSGLVTIEDILEQIVGDIEDEFDEEDIADIRQLSRHTY
AVRALTDIDDFNQQFGTHFEDEEVDTIGGVIMQAFGYLPKRGEEITLENI
HFKVTSADSRRIIQLRVTVTDEQLAEIEKSAEEKEE
>MS0372 unknown
MIKGIQITKAANDNLLNSFWLLDSDKGEARCLAAKAEFAEDQIVAINELG
QIEYRELAVDVAPTIKVEGGQHLNVNVLRRETLEDAVNNPDKYPQLTIRV
SGYAVRFNSLTPEQQRDVITRTFTESL
>MS1185 unknown
MNTPFFISWRYQRTKQKNRLVSLIALFSSIGIALGVAVLILGLSAMNGFE
RELNNRILAVVPHSDITAYQDGRINDRQDLERRLMANSDIKAVSPYVSFT
ALVENGAKLKVVQVHGVDHKMLDNVSSLGKFVLNNGWQQFAERGGLVLGS
GIARDLDVSEGDWVTLLISQNTDGDQLSQPARERVQVTGILRLDGQLDHS
YALIPLATAQEFLGFAKNEISGIEMKVADPFKVQQLNFANLNDYPQMLNL
QTWINKFGYMYRDIQLIRTVMYIAMVLVIGVACFNIVSTLIMAVKDKAGD
IAIMRTLGANNGFIKRIFIWYGLQAGMKGCLIGIILGVILSLNLTSIIKA
VESLLGHKLLSDGIYFVDFLPSELHWQDVLLVLVAALMLSLLASLYPANR
AAKLQPAQVLSGH
>MS1986 unknown
MFIEPNKKPKIYTALFTQAYYHIFIRFILNNL
>MS1493 unknown
MNINVISIFKLLLLFALGLVILSPALSTQIGVPRLDSALCFLFFFLAVIT
PFLRDMETDFFKLQFPVYVLFFFGFLSVLNAFSTEKLVDLFFFGIVMFLF
HYSFLTFNRGDGEAGIRHLLLGISLIVLAGFFIEALLGFQLVSGNEELTV
TDKAFKGFFFNTNDQSVIMISLAVAVGFFYIIRENNWKIKLIGYALIFIM
GLAIVISASRSVLLSYLIMLMLILFLNASAYFKAVYLFFACVIALFIFNL
SWLQEVFILLAKIDWLERPIERFSLVIFSMGDDKSVGYRTEIYTTFLDNF
KILWLGYGPRDYIQYFDQIKLSFPLGYTNPHSFFIELYLAFGIFAFLAFI
YFLLNSIIYVMNTRLLAWKERIFILFVFINFCWIVWVPSSILRLPLVWYP
LFLVLVYTVLVKNGTFVSPKLVGRRRSS
>MS1176 unknown
MQAGNPVLCNTTLFVKDPKPNSVLLRNLMMSFIFSHNK
>MS0178 unknown
MAVKKCGKFPYIFAPTPVNGGDDAPVQLVGMV
>MS1706 unknown
MKIKSKSAVKIFDFLTALFLTTKNGTLGVPFLLYKYYKLVLIGKTFDSCA
>MS2167 unknown
MNVPILYFDVVHSIQIHDWIIEKSGGLAGLYPDGTGKLESVLEHIQNDLY
YPNFEDKLVHLIYSINKLHAFLDGNKRSSIVLGSYFLELNGYDYCVKEFT
IKMENIVVWLAESKISKELLLKLVCSILNNEEQYSDELKYELICATSDDF
GN
>MS0436 unknown
MKANNNPVRRSFLLFNNLSDNLCGHLKDSFKQKI
>MS0732 unknown
MINLKIDGFDVRVDEGTTILEAAKSVGINIPTLCYLKDVSDIGSCRVCVV
EVEGFEKLPTSCNTLAQEGMVIRTQTDKVVKSRRMALDLILSHHNLICFS
CPSNGACELQNVAHQCGISESSFPNFRLPGIEVPHVEDNPFLGYRPDLCI
HCQRCINTCANVSGCSSIKLASRGIFRAIETPFGKDWKETTCESCGNCAE
ACPTGAIYKKEAKSYRSWEIQRVRTTCPHCAVGCQYDLLVKDNKLVGAEG
VDGPSNGGRLCVKGRFGSYKFVMSGDRLTDPLIKDRATGKFRKASWDEAL
DLVASKFMTLKRQYGGDSLAGFACSRSPNEDIYMVQKMVRTCFGTNNTDN
CARVCHSASVEGLARTLGSGAMTNPIYDITHDVDAILLVGSNPEEAHPVI
GMQIREAVRNGTKLIVVDPRDIGLTKQADIHLKLRPGTNIAFANGMCHIF
IKEGLIDEKFIAEHTEGFKELKKIVKDYTPEYVAEICGIDADDLRAAARI
YATAKKAPIIYCLGVTEHSTGTEGVMSMSNMAMMVGKIGREGCGVNPLRG
QNNVQGACDMGASPNQYPGYQSVKDPEIRAKFEKAWGVKLPAHIGLHATD
VFPAAIKGKIKGLYICGEDPVVTDPDTNHVINALKSLDFLVVQELFMTET
ALLADVVLPGRSYAEKDGTFSNTERRVQRVRKAITLPGNSRLDTDIICEL
MRRMGYNQPNLTASEIFDEMASVTPSFRGISYERLEKEPTQSLQWPCTDQ
YHPGTPIMHVGKFARGLGLFYPTVYTPAKELPDAQYPMMLTTGRILYHYN
TRAMTGRTEGLMEIAGHSFIEINSADAKRLNIENGERVRVTSRRGTITTE
ARVSDKTNEGETWMPFHFADGNCNWLTNAALDQFARIPEYKVCACRIEKL
PEDEAFNMKGKYITQKMVAAQWRKKMDKSIAKLVR
>MS1948 unknown
MGDIYEFTKTITNPAIYFHYGVFCHAVGIFYLFAAIICFGLGSCLFGTNA
RACLLAPSDFYSEKPIVMANILVKFDNGENRGFIFGSSRLVLSFLTMLVM
NLIPHLFLI
>MS1356 unknown
MPSDGMYKIIRRYFFNKRNKCYFFVKKARQMQGKFLFS
>MS0841 unknown
MFDRKREETTALFYYHLGLVIKGEKRGTSEGVERSALTVVPIV
>MS1680 unknown
MMKTVTIDLQIASEDQSNLPTLEQFTLWATNAVRAEHFEPEITIRIVDEA
ESHELNFTYRGKDRPTNVLSFPFECPEEVELPLLGDLVICRQVVEREAQE
QGKPLTAHWAHMVVHGSLHLLGYDHIEDDEAVEMESLETEIMTGLGFEDP
YSYDEE
>MS0280 unknown
MFRLITPIALFFFGLFVSIYSYQSYGDFAEYGAAFYPTAVGVLVSFFSLV
DFIMELRIKDKYVFQHFDFFQDGKIILLIIAIISFYIFVADYLGFIITTS
LILIFLTLPFLEKYKLLTALLLIILSIGIYLLFARVLLVGLPSGIIFE
>MS1460 unknown
MVARRAHNPKVVGSNPAPATKFKRKALILLMPFCYLPYIEKGDKKNRP
>MS0322 unknown
MSAIEQTAEGLRLRIFLQPKASRDKIIGIHDDELKIAITAPPVDGAANAH
LLKYLSKAFKVPKSAIILEKGELNRHKQLFIPEPKLIPEELQPLL
>MS0398 unknown
MSAGISLRIILLKIVSSAILCFLIFGKNLPHFLSEISLNSNLSRKPKE
>MS0421 unknown
MRNSSMMNLLNRRNSVKNNRTLAYFYSKCGQKFMIFYKLISSLS
>MS0475 unknown
MNRRVQGPTTHNGISGSSNHLLLCKPFLSFANKFELKFDKLAKN
>MS0434 abc, Abc protein
MIKLKNISKIFDVAGKKLNALDNVSLDIPKGDICGVIGASGAGKSTLIRC
VNLLERPTSGSVFVDGQDLTQLSEAQLIAERRNIGMIFQHFNLLSSRTVY
ENVALPLTLEHMAKEKIHEKVTALLALVGLTDKKDVYPANLSGGQKQRVA
IARALASDPKVLLCDEATSALDPATTQSILKLLKEINRTLGITILLITHE
MDVVKNICDQVAVIDKGQLIEQGSVSEIFSNPKTELAQEFIRSTFQANLP
EEYLAKLTDTPKRSDSYPIIRFEFTGRSVDAPLLSQTSRKFNVSFNILVS
QIDYAGGTKFGFTIAEVEGDEDSITQAKIYLMESNVRVEVLGYVD
>MS1552 abgB, AbgB protein
MELTQQQLVQWRREFHRFPETGWAEFWTTSRIADYLEQMGFEILLGNQII
NRDFVRGRQQAVVEKGLANAVAYGAKQKWLEKMDGYTGCVAVLDSGKPGK
TLALRFDIDCVNVMETKAPEHIPNKEDFASLNDGFMHACGHDGHITIGLG
TALWLSQNKDKLSGKVKIVFQPAEEGVRGAAAIAASGVIDDADYFSASHI
GFCADSGTVISNPKNFLSTTKIDIRYQGKPAHAGAAPHLGRNALLAAAHA
VTQLHGISRHGEGMTRINVGVLKAGEGRNVIPSKAEIQLEVRGENKAVNQ
YMVDQVMRIANGIAVSFDVEYETEIMGEAVDMINDTELVGLVEEIVLAHP
KVHSANANYAFNASEDATVLGRRVQEQGGKAIYFVLGADRTAGHHEAEFD
FDEDQLMNGVNIYTALVQRLLG
>MS0768 accA, AccA protein
MRKKMSQQNQTEYLDFELPIAELEAKIESLRSVTDQDSKIDLDDEIKRLQ
KKTAELTKKTFADLDAWQVSRMARHPNRPYTLDYISRIFTEFEELAGDRA
FADDKAIVGGLARLDGRPVMVIGHQKGRSVKEKVLRNFGMPAPEGYRKAL
RLMQMAERFRLPIITFIDTPGAYPGVGAEERGQSEAIARNLREMSTLTVP
VICTVIGEGGSGGALAIGVGDKVNMLQYSTYSVISPEGCASILWKSAEKA
STAAEVMGLTASRLKELELIDNIVTEPLGGAHRQYDEMAQALKQRILSDL
EDLDILDKETLLDRRYQRLMNYGYV
>MS1789 accB, AccB protein
MFQYCKVRLLFFIFSNKTDQNIMAMDIRKIKKLIELVEESGIMELEISEG
EESVRISRGAAAPSAVQYTLPAAAPAPVAAPHAPVAAPVAAPDAVAELSG
HIIRSPMVGTFYRSPSPEAKAFVEVGQTVKMGDALCIVEAMKMMNRIEAD
KAGVVTAILVNDGDAVEFDEPLIVIE
>MS1788 accC, AccC protein
MGLSHLFTFVDQISNWNTLMLEKVVIANRGEIALRILRACKELGIKTVAV
HSTADRDLKHVLLADETVCIGPAPSVKSYLNVPAIIAAAEVTGADAIHPG
YGFLSENADFAEQVEVSGFTFIGPTADVIRLMGDKVSAINAMKKAGVPCV
PGSDGPLGTDMVKNKQIANRIGYPVIIKASGGGGGRGMRVVRNDESLEES
IAMTKAEAKAAFNNDMVYMEKYLENPRHVEIQVIADTHGNAVYLAERDCS
MQRRHQKVLEEAPAPGITEEIRRDIGQRCANACIEIGYRGAGTFEFLYED
GKFYFIEMNTRVQVEHPVTEMITGVDIVKEQLRVASGLPLSVKQEDIKVH
GHAIECRINAEDPKTFLPSPGKIAHLHSPGGLGVRWDSHVYAGYTVPPHY
DSMIAKLIVHADTREGAIRRMQNALAETIIDGIKTNIPLQNLILEDENFQ
KGGTNIHYLEKKLGMGE
>MS1174 accD, AccD protein
MWLNKQNFYDLIKRLKMSWIDKIFSKSPISSSRKANVPEGVWTKCTSCEQ
VLYRDELKRHLEVCPKCGHHMRIDARERLLALLDKDGVTELAADLEPKDI
LKFRDLKKYKDRLTAAQKDTGEKDALVVLSGTLYGLPIVAAASNFGFMGG
SMGSVVGAKFVAAAEEAMEKNCPFVCFSASGGARMQEALFSLMQMAKTSA
VLAKMKEKGVPFISVLTDPTLGGVSASFAMLGDINIAEPKALIGFAGPRV
IEQTVREKLPEGFQRAEFLLEHGAIDMIVKRSDMRDTLASLLTKLMNKPS
PFNAEELSDTE
>MS1336 aceE, AceE protein
MSQMINDVDPIETSDWLLAIDSIIREEGVERAQFIIEELMQHARSKSVAL
PTGATTEYVNTIPPSEQPPYPGNLSIERRVRSAIRWNALMMVLRAQKKDL
ELGGHISTYQSAASIYEVCFNHFFKAATEKNGGDLVFFQGHAAPGIYARA
FVEGRISQEQMDNFRQEAKANGLSSYPHPKLMPDFWQFSTVSMGLGPVNA
IYNARFLKYLNNRGLKDTTDQTVYAFLGDGEMDEIESKGALTLAAREGLD
NLIFVISCNLQRLDGPVNGNGKIVQELEGLFFGAGWEVIKVMWATGWDKL
FAKDTSGKLTKLMMEVVDGDYLTFKSKNGAYIREHFFGRYPETAALVADM
TDDEIWALRRGGHDTEKMFAALARAKKSDKPVVILAQMVKGYKIPEAESK
NTAHQTKKMSHASLKSFRNHFDLPLTDEQIDNYEYITFAPDSEESKYLHE
RRAALNGYVPARLPKFTTEFKVPALEDFSQLLEEQPRAISTTMAFVRVLN
TLLKNKDIGKQIVPIIADEARTFGMEGLFRQVGIYNPHGQNYVPSDKELV
AYYREAKDGQVLQEGINELGATASWLAAATSYSVNNLPMIPFFIYYSMFG
FQRVGDMMWAAGDQLARGFMIGGTSGRTTLNGEGLQHEDGHSHIQAGVIP
NCVSYDPAFAFEVAVIMQDGINRMYGEKQEDVFYYITTLNETYDQPAMPA
GVEDGIRKGIYKFETVGKGEAAIQLMGSGAILRHVRQAAQILADDYGIAS
DVFSVPSFTEVAREGADVARWNLLHPTETQRVPYIAQVMSDKPAVAATDY
MKLYAEQVRAFIPAQSYHVLGTDGFGRSDSRENLREHFEVDAHYVVVAAL
NELAKQGKLEKQVVADAIAKFGLDVDRINPLYA
>MS1354 aceF, AceF protein
MSNFDIITPDLPESVADATVVKWHKAVGDKVRRDEVLVEIETDKVVLEVP
ALNDGIIESIIEPEGATVVSKQLLGKAALLPVGEVTVRAETPTVAPQIED
SAVASSADTLGPAARRLIAEHDLNVNEIKGSGVSGRITREDVEAVIAQKA
ASVAAKSAVENTVISSPAAVRTEKRVPMTRLRKRVAERLLEVKNSTAMLT
TFNEVDMQPIMQLRKKYAEKFEKQHDTRLGFMSFYVKAVVEALKRYPVIN
ASIDGDDIVYHNYFDISIAVSTPRGLVTPVIRNCDKLSMAEIERQIKALA
EKGRDGKLTVDDLTGGNFTITNGGVFGSLMSTPIINPPQAAILGMHAIKD
RPVAIDGQVAIRPMMYLALSYDHRLIDGKDSVGFLVTVKELLEDPTRLLL
EI
>MS1335 aceF, AceF protein
MPTLRRMINMSKQIQIPDIGADEVTVTEVMVKVGDTVTEEQSIINVEGDK
ASMEVPSPEAGVVKEILVKVGDKVTTGSPMFVLESADSAPASAPQAAAVA
PAAAPTTSAVIEIHVPDIGSDEVNVTEIMVKVGDSVAEEQSIINVEGDKA
SMEVPAPQAGVVKEILIKEGDKVSTGSLIMKFEVAGGAPAAETPATTVQA
APAVSAVQDVNVPDIGGDEVNVTEIMVKAGDSVAEEQSLITVEGDKASME
VPAPFAGVVKEILVKSGDKVSTGSLIMRFEVAGSAPAVQAAAPAQAAPAP
VAPAPQAAPAQSLAPVNQDSIATSASYAHATPVVRRLAREFGVNLDKVKG
TGRKGRILKEDVQEYVKNALKALESGATASTGAASGAGLGLLPWPKVDFS
KFGEVEEIELTRINKISGANLHRNWVMIPHVTHFDRADITDLEAFRKEQN
VLAEKQKLGVKITPVVFIMKAAAKALEAYPRFNSSISEDGQRLTLKKYVN
IGVAVDTPNGLVVPVFKDVNKKGIIELSRELMEVSKKARDGKLTASDMQG
GCFTISSIGGLGTTHFAPIVNAPEVAILGVSKSEMAPVWNGKEFMPRLML
PLSLSFDHRVIDGADGARFISYINGVLSDLRRLVM
>MS0999 ackA, ackA protein
MCERMSFNVNPKDRLFMSQKLVLILNCGSSSLKFSILDPKTGEEKLSGLA
EAFYLDDARIKWKLHGEKGNAELGKGAAHSEALNFIVNNIFPLDPTLKDG
IVAIGHRIVHGGEKFTSSVIVTDEVVKGIEDAIQFAPLHNPAHLIGIKEA
FKIFPHLKDKNVVVFDTAFHQTMPEEAYLYALPYSLYKEHGVRRYGAHGT
SHYFVSREAAKRLGVAEDKVNVITCHLGNGGSVSAVRHGQCIDTSMGLTP
LEGLVMGTRCGDIDPAIMFYMHDTLGMSVEEINTTLTKKSGLLGLTEVTS
DCRFAEDNYDNEDESLRVPAKRAMDVYCYRLAKYIGSYMAVIGERLDAIV
FTGGIGENSAHVREITLNHLKLFGYQLDQEKNLAARFGNEGIITADNTPI
AMVIPTNEELVIAQDTARLCIKD
>MS2369 acnB, AcnB protein
MANFLQEYQQQVDERAKEGVVAKPLNADQTAQLIELLKNPPQDKAEFLLD
LFKNRIPAGVDEAAYVKASFLSAVTKGDVACPLISAKSAVEILGKMQGGY
NIEPLLSALDNPELAPAAAKELSGILLMFDNFHDVRERAEQGNPYAKQVL
QSWANAEWFTNRPKLAEKITVTVFKVSGETNTDDLSPAQDAWSRPDIPLH
ANAMLKMPREGIIPDQPTLVGPIKQLESLKQKGFPLAYVGDVVGTGSSRK
SATNSVLWFMGEDIPYIPNKRAGGIVLGGKIAPIFFNTLEDAGALPIEVD
VSALNMGDVIDIYPYAGKICVHNTDQVLAEFSLKTDVLLDEVQAGGRIPL
IIGRGLTHKARLALGLNESEIFKKPQAVQASEKGYTLAQKMVGRACGVEG
IRPGQYCEPRMTSVGSQDTTGPMTRDELKDLACLGFSADLVMQSFCHTAA
YPKPIDVVTHHTLPDFIMNRGGVSLRPGDGVIHSWLNRMLLPDTVGTGGD
SHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESVLVRFTGNMQPGITLR
DLVHAIPYYAIQQGLLTVEKKGKKNIFSGRILEIEGLENLKIEQAFELSD
ASAERSAAACSIKLNKEPIIEYLNSNIALLKWMIAEGYGDARTLERRIKA
MQTWLDDPQLLEADKDAEYAAVIEINLDEIKEPIVCAPNDPDDARLLSDV
QGDKIDEVFIGSCMTNIGHFRAAGKLLNKFKGMIPTRLWVAPPTKMDAAQ
LTEEGYYSIYGKSGARIEVPGCSLCMGNQARVADNATVVSTSTRNFPNRL
GQGANVYLASAELAAVAALLGKLPTPEEYLSYTVDLQQDKDDTYRYMNFD
KIENYMKKADKVIFRQAV
>MS1875 acpP, AcpP protein
MSIEERVKKIIVDQLGVKEEEVKSEASFIEDLGADSLDTVELVMALEEEF
DIEIPDEEAEKITTVQSAIDYVQNNQ
>MS2089 acrA, AcrA protein
MKKKYLILLLALALTACGEAQTEVAETTSRMKVNVVQVQPTQLNYRLLLS
GSIQAKDDVSVGTSLQGLQVLDVKAEVGDWVEQGQVLATLEQSQVQSQFR
QNDALLQRAKANLVSQQSTLKEAEATLKRYQQLIKTDAVSHQELDQQRAK
AESARAAIQAAKAEIAQVQAQLDDSRHQRKKAEVLAPTSGIVTQRLAQAG
NLTDSNALFHIARDGVLEAVVRASADEISVLETGLVANVQMLDKVTSGLI
RLISSQIDSATHTAKIHIALQEKLQVPFGTPINAVVQLPEMTAQIAVPFS
AVNFGADGNHFVMVVNADGTVVRRKITLGEVS
>MS1128 acrA, AcrA protein
MKKKVLAIAVLAALIAGGAYYFMNGSKKAPTYLTEDVQRSNVEKTVVASG
SIESSNEVDVGAQVSGKVVKLYVTLGQEVKKGDKIADIDSTTQINSLNTA
KAALASYQAQLKAKQTGYNVALSSYNRLSKLYTQQSTSLDNLNSAKNTLD
AAKAEVDALKESIKQAEIQVNTAETNVDYTKITSPIDGTVISTPVSEGQT
VNANQTTPTIVTVANLDKMLIKPEISEGDITKVKAGQQVTFTILSDSTTT
YDAVIDSVDPATTTTTDASATSSASSSSSSSSSTSAVYYYANMAVDNPNR
VLRIGMTTENTIKIARAENVLTVSNMALKKQDNKYYVNVLNAQNQPERRE
VQVGVQDDFHTEIKSGLTESDKVILSQIEDGEKVGNLGRGPRMF
>MS1301 acrA, AcrA protein
MKAKYVVAILAAVAVAGGTIWYNAQLKAEKLAGIAAVNGRLELKRLDIAT
LYAGRVEEMYVQEGDEVQPGQNLARLSSSISQTQVDAANAQKQRAQEAVT
RAVAQIDSQQQQLKVAKLELDNAQKLRRDNLVSASELERRQANYRAAVAA
VNTAKAAKAEADAAVNQAQAQLEQALSQNSDMLIKAPKAGWVEYQIAEVG
NVLGIGGRVVSLLDPTDTYINVFLTSAQSNQVKVGEEARIVVDGMNAVFP
AKITYVAADAQFTPKSVETTEERAKLMFKVKLQIPAEIALQYNKLLKGGM
TALGYVKYGQEALWPENLTVKLPQGE
>MS0454 acrA, AcrA protein
MTDSQLAKPKRSHAFLLKVGLAVAVLVFALVIGLNKFKEIMIGKAIANMP
ETANPVTALTVGSSEWTPVIETTGLVRPNQGAMLSSQASGTIKRIYVKSG
QAVKKGDVLVELDNAVEEATLKASEAQLPSVRLTYQRYANLIKSQSVSQT
ELDSAKAAYDQLVANINSLKASIERRKILAPFDGITGIVQVNEGQYISAA
TEIVRVEDISSMKVDFSVSQNQLEDLHIGQKVTATSDARTGETFAAKVTA
IEPAVNKSTGLIDVQATFAPEDGKKLLSGMFTRLRLALPTERNQIVVPQV
AITYNMYGELAYVLMPLSDEDKEKLKDNENLSKMYRAQQITVFTKDRQGI
YAQLKGNEVKVGDILVTGGQQRLSNGSLVVISDKDGVGTVQPAEKTNL
>MS2087 acrB, AcrB protein
MNFRISAWAIRNPIPIIVLFLLLTIMGIRSFQALPINADPNISFPAVNIT
ISQTGASPDELENSVTRRVEDAVAGMAGVRHITSSITEGTSTTSVEFRLE
TDTDRAVNDVRNAITQIRGDLPQNIDNPIVERMDTEGAALGYYAVQSPNM
NQTELAWFIDDAVSCELLAVNGVQQVKRLGGEKREIRVALQSTKLNALEI
TAEQVSQQLAQTNANVPAGRVEWFNQEQSVRVIGSQINLDDLANLPIALS
DNRKVKLSELATITDSHAEMRSRTRLNGREVLGFQVFRSKGSSDTVVESG
IQQALKKLIETYPDIHLTEVHNSVDTTRENYDVAISTLLEGAALTVLVVW
LFLRNWRATLVAAIALPLSILPAFWIMKLLGYTLNSISLLAITLVIGILV
DDAIVEIENIETHMQQGKRPFQAALDASDAIGLAVVAITASIVAVFLPVS
FIDGMTGQYFGQFGTTVAAAVLSSLMVARLAIPLLAAYLLKPHISKHHTA
QHVGRLKKSYLSLLAKALQFRKTTLLMGGGLLLMSAMIIPQLPTGFVPKG
DTGMSQIDITLPPSSPLAQTDDMLQQLDRIIREFNEVDLVFTTAGSSEIN
KGEVLIKLKPYKERSVSQKEFEDKLRDELVKFADIRANFRNEMAGRDVSI
LLTGNDPVKLDQTAAELKKQMQEIKSIENVQINAPLVKPELQVKLRKNEA
AQAGISSQAVGNLLQIATLGTTDGNAARFNLPDRQIPIRVTLSENERNQP
EVLQHLRVASSNGGTVILNTIADIQFGAGSASLERFDRERRIAVEADLAV
GQTIGTALSQINELPIMQKLPDGVRVPSAGDAEYMDEMFSQFGFAMATGV
AMVLLVLILLFKDFLQPFTILTALPLSIGGAALGLLLYGAALDMSSVIGI
LMLMGIVTKNSILLVDFVIEKRQQGVERTTALIQSGAERVRPIIMTTIAM
VAGMIPAVFAGGASAAFRAPMAIAVICGLTASTLLSLVFVPVVYSLMDDM
RNYLAPKLAKLTSVTEEDRVV
>MS0456 acrB, AcrB protein
MKFTDIFIKRPVMAIAISMLIVILGLQAISKLAVREYPKMTTTVITVSTT
YAGADAGLIQAFVTSKLEEAIAQADNIDYLSSTSAPSSSTITVKMKLNTD
PASALADVLSKVQSVRSELPSGIEDPTLTSSTGGSGIMYISFRSDKLHPS
QVTDYIERVVKPQLFTVEGVAKVQVYGAAEYAMRIWLDPQKLAGQNLSAT
QVTTALSNNNVQTAAGSDKGYFNIYRNKVETTTNTVEDLGNLIVYSDGDK
LVRLRDVADVELNKESDDTRAAANGSDAVVLSIEPTSSANPLTVADNIKP
LYETIKKNLPDSIESNILYDRTVAINSSINDVIHTIIEAVVIVLVVIMMF
IGSLRAIFIPIVTIPISLIGVIFMLQMFDFSINLMTLLALILAIGLVVDD
AIVVLENVDRHIKEGETPFRAAIIGTREIAVPVISMTIALVAVYSPMALM
GGITGTLFKEFALTLAGAVFISGIIALTLSPMMSAKILKHESSKFEEKVN
RTLSKLTTGYTYILGLVMQARKAILLFAVIIFATLPILFSSLSSELTPAE
DKGGFLGMVTAPSNVNVDYVQQATKPYEEILNNTPEKQYSQVIAGAPNTN
QALVITTLKDWAERSRSQAEVMAELTKKAAAIPEVSISAFAFPEIETGEQ
GPPVVFVLSSPGSTKELAQTAETFLDKIRKSGKFVYSNLTLKYDVAQMRI
QVDKEKAGTYGITMQQIASTLGSYLSEATITRVDIDGRAYKVISQVKREN
RLSPESLKNYYISASNGQSVPLSSLLTVELEPQPYSLPRFSQLNSAEIQL
VPSPTTTTGDAIAWLKDAAQDLPQGYSYDWKGEARQLVQEGNSLATTFIL
AVLIIFLVLAIQFESIRDPFVILVSVPLAISGALLTLNLLSFLGVTGVTL
NIYSEVGLITLVGLITKHGILMCEVAKEEQLNHGKTKMEAIMTAAQLRLR
PILMTTAAMIAGLIPLLYASGAGAVMRFSMGVVVVAGLAIGTLFTLFVLP
VIYTYIGSNHKPLPEFDENAPRIGSSH
>MS2295 acrR, AcrR protein
MEQKLSPKQKGRPRTFDREKALESALFVFWNQGYTNTSIADLCNAININP
PSLYAAFGNKSQFFIEILDYYRRVYWDVIYAKMDVEKDIHRAIHIFFRDS
VNVVTVANTPGGCLSAVATLNLSAEETKIQQNMRQLKSDILKRFENRLKR
AIVDKQLPSQTDIPALALALQTYLYGIAIQAQAGTSKDDLLKVASKAGLL
LPKLI
>MS1936 acrR, AcrR protein
MAEQLTLDSIEPEPEKQSAKIEKRSIKERRQQVLTVLTHLLHSEKGMERM
TTARLAKEVGVSEAALYRYFPSKTKMFEALIENIESSLFSRISYSIKMET
NTLNRVHDILQMIFDFARKNPGLTRVLTGHALMFEEAKLQARVALFFDRL
ELQFVNILQMRKLREGKTFPIDERTIATYLVTFCEGQFMRLVRTNFRHMP
NQGFEQQWRFIEPLFE
>MS0153 acrR, AcrR protein
MINMAGVRAIQKEKTRRALIDAAFNQLNAEKSFSNLSLREVAREAGIAPT
SFYRHFKDMDELGLTMVDEAGLTLRQLMRQARKRIEKGGSVIVISVETFF
EFIAHSPNVFRLLLRESSGTSQAFRTAAAREIKHFVDELAEYLANKNNYS
EYVAYVQSEGMVTIVFTAGANALDMNNKERELLKERLILQLRMLAKGAHH
HMMERERHNTHLPATGKS
>MS0453 acrR, AcrR protein
MRQSETDMAEQIFAATERLMAKDGLHHLSMHKIAKEARISAGTIYIYFKS
KEELLEQFAWRVFSLFQTALEKDYDETLSYFEQYKKMWLNVWYFLQDNPN
IVMNMQQYQSLPGFFDICKEMDYNSRWATFCQKAQQAGAVCELSVSILFS
LSMESAMNLAFKKLYINEFLADEELMTIIERTWRSIQK
>MS2211 acrR, AcrR protein
MKKNLNFVVKESITEALLRLMAKKNFDEINITAITELAGVSRISFYRNFD
SKEDVLIKYMYVRAKELYKPFESQDVSVRDKLIGMFKSIEGMEDIINLLY
AQNLSHIFLQYFNFVRGAKPEQENLDAYQNSIVVGVCFGALDEWIKRGRQ
ETPEQMVDLLQNVIWGFVKE
>MS1300 acrR, AcrR protein
MKQDIRITKTLGLIRHVFLELLEEKGFEHIVVQDILDRAQINRSTFYKHF
QNKHAVALMLVDEIKQLLTENFENRFSIPTTEFAQKMVPIFWQHRDLIHL
IGKIENPRIHLYKDLALVIKEEYIKQAVREQPQSSEELDFQGYLFAIVSL
GTIRYFVEKGELPDPSVIVGDIESVFNLLIIK
>MS0845 acyP, AcyP protein
METYMLKKQFVVYGIVQGVGFRYFTWKKATEIGLNGIVKNQRDGSVYILA
EGSASQIDSFRDWLSHGPPSARVDRVEENDYSGTHSFGLFSVEH
>MS0637 ada, Ada protein
MDSIYYSYYSSPVGNLLMIAQQGKLTNLDCELEQTAPNPKWILNNELPLF
RQVKSALDRYFSGEKEDFSDIPLNPQGTTFQQSIWQALRRIQLGKTTSYG
ELARLINNPKAVRAVGGAVGSNPISIIIPCHRVLGKNGQLTGFGGGLPMK
RFLLNLEKIRYVDKGVEYVKQKLLKKYTA
>MS1386 adhC, AdhC protein
MSNTIKSRAAVAFAPNEPLKMVEIDVERPKKGEVLVKITHTGVCHTDAFT
LSGADPEGVFPVVLGHEGAGVVVEVGEGVTSVAVGDHVIPLYTAECRECE
FCKSGKSNLCVSVRETQGKGLMPDGTTRFSYNGQPIFHYMGCSTFSEYTV
VADVSLAKINPQANPEEVCLLGCGVTTGIGAVHNTAKVQEGDSVAVFGLG
GIGLAVIQGAKQAKAGRIIAIDTNPAKFELAREFGATECLNPNDFDKPIQ
QVIIEMTKWGVDHTFECIGNVNVMRAALESAHRGWGQSIIIGVAGAGQEI
STRPFQLVTGRTWKGSAFGGVKGRTQLPGMVEDAMKGIIRLRPFVTHTMP
IERINEAFDLMHEGKSIRTVVHY
>MS0796 adk, Adk protein
MEISMKIILLGAPGAGKGTQAQFIMNKFGIPQISTGDMFRAAIKAGTELG
KQAKALMDEGKLVPDELTVALVKDRIAQPDCANGFLLDGFPRTIPQADAL
KDSGVNIDYVLEFDVPDEVIVERMSGRRVHQASGRSYHIVYNPPKVEGKD
DVTGEDLIIRADDKPETVLDRLAVYHKQTQPLVDYYQAEANAGNTKYFRL
DGTKKVEEVSAELNSILG
>MS2161 aes, Aes protein
MKILLKLTALLLSLGVALNVAADGGKNHPSYPLLASEFRDLSLLESVRFT
PEQLQDKAKLTELNAAFLQAAEQSEVQPNEKITAPAQGAQPAVDLYIYRP
ATAKNEKLPVIYFMHGGGYLFGNARQNNAALAELADLNKAVVISVEYRLA
SQTPYPADIDDAYHGLAYLFKNGQKLNADTNKVVIMGESAGGGLAARLAL
KVRDKGEFKLAGQVLIYPMLDYRTGTSQSLYNTPYTGGYVWTAEYNRIGW
ETLRGGQTIAQAEMPYYSAATATDLAGLPPTYMMVGSLDLFANEDMDYAN
RLVQAGVPTDLQLVSGVYHAFEIFNPNASQTLAYKLARTNAIQQMLAK
>MS1419 aes, Aes protein
MNFAKNLQKSTALFNVCTGNANMKKLLLAPLLLAFCLPSLAESYRTLDQV
SPAYQEAAKMLKMDFADPNVRENAQKQNIQRANESYQPTAHWTVPAQGSQ
PAVELYVYKPKSVAGKLPVIYYIHGGGYILGNAKAAGDNLQAIAEANKAA
VISVEYRLATVAPFPADLNDAYHGLSYVYKNAGKLGLDKEKVVLMGESAG
GGLAARLALFTRDKGEFTPEGQVLIYPMLDYRTGTPESPYDTKNLGEFLW
TESANRLGWATLRGNQTISDEQLPYFSPAFAKKLSGLPRTYMMVGDLDLF
VAEDLNYASRLIQAAVPTELQVFPGLFHAFEAFNKDGKQTKEYEQSRNQA
IQEMFSHPVK
>MS0155 ahp1, AHP1 protein
MSNMEGKKVPQVTFHTRQGDAWVDVTSAQLFDNKTVVVFSLPGAYTPTCS
SSHLPRYNELTPEFKKLGVDDVICVSVNDTFVMNAWKCDEDADNITVLPD
GNGEFTEGMGMLVDKEELGFGKRSWRYSMLVKNGVIEKMFIEPNEPGDPF
KVSDADTMIKFIKPDWEPKPSVALFTKPGCPFCAKAKALLTEKGYPFEEI
VLGKDATVTSVRAMSGRATFPQVFIGGKHIGGSDDLEAYFANK
>MS1521 ahpC, AhpC protein
MVLVTRQAPDFTSAAVLGNGEIVDNFNFKQHIAGKPAVIFFYPLDFTFVC
PSELIAFDHRYEEFKKRGVEVVGVSIDSQFTHNAWRNTAVDQGGIGQVQY
ALAADTKHEIAKAYGIEHPEAGVALRASFLIDANGVVRHQVVNDLPLGRN
IDEMLRMVDALQFHEQHGEVCPAQWEKGKEGMKDSPEGVAKYLKQNADKL
>MS0348 alaS, AlaS protein
MKTTAQIRQSYLDFFHSKGHQVVESSSLVPHNDPTLLFTNAGMNQFKDVF
LGMDKRPYTRATTAQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGD
YFKHDAIAYGWEFLTSPQWLGLPKEKLYVTVYETDDEAYDIWNKEVGVPA
DHIIRIGDNKGAPYASDNFWAMGDTGPCGPCTEIFYDHGEHIWGGLPGTP
EEDGDRYIEIWNIVFMQFNRHADGTMEKLPKPSVDTGMGLERIAAVLQHV
NSNYDIDIFQTLIKKVAQLTGEKDLTNKSLRVIADHIRSCAYLIADGVMP
SNEGRGYVLRRIIRRAVRHGHLLGAKETFFYKLVPTLADVMEHAGEIVNQ
KRALIEKTLKAEEEQFARTLERGLLLLDDALSQVKDNVLSGDVAFKLYDT
YGFPLDLTADVCRERNITIDEKGFEREMQAQRARAQASSNFGVDYNNVIK
VDGQTEFKGYETTSLSSAKVVALFTDGKSVERVQSGENAVVILDRTPFYG
ESGGQIGDTGYIATDLAAFRINDTQKYGQVTGHIGQLESGSLSVGDTVSA
QVDTERRLAVAANHSATHLLHAALRKVLGDHVAQKGSLVSESALRFDFIQ
PEAISKEQIIEIEAIVNRQIRENISVTTEVMDIEAAKQKGAMALFGEKYG
DLVRVVGMTGFSIELCGGTHVKRTGDIGLFKVVSESAIAAGIRRIEAVTA
ENAINWLNNQQNILNQSADLLKSDTASLVEKIQQLQDKAKKAEKELQQLK
EKAAMQAGSDLAKSAVKINDISVIVQQLDGIETKSLRVMVDDLKNQLGSG
VIVFASVVEDKVNLIVGVTADLTGKVKAGELVNLMAQQVGGKGGGRPDMA
MAGGSQPENVGAALSACSDWLESNL
>MS1182 alr, Alr protein
MKPATVKISSVALKHNIQIIKQKAPHSKIIAVVKANAYGHGVEFVSSTLE
NLVDGFGVARLAEALSVRSNGVTKPILLLEGFFSPKDLPILSVNNIQTVV
HNQDQLDAIKRANLENPIKVWLKIDTGMHRLGVSLEEVDYYYNELMNCPN
VDEVGFVSHFSRADETDSDYTNIQLNRFLDATKNKKGNRTIAASGGILFW
EDSHLEYIRPGIIMYGVSPINIPSSEYGLIPVMTLTSSLIAVRDHKKGEP
VGYGGIWVSERDTKIGVVAIGYGDGYPRNVPAGTPIYINGRRVPIVGRVS
MDMVTVDLGPDCKDKVGDEAVLWGKELPIEEVAEITGLLSYELMTKLTPR
VLTEYVD
>MS0767 alsT, AlsT protein
MNAKRYFGVLNDFVIMVEQGIHWLVDNVEGPLWDATIVILLGVGLFFTIT
TGFVQIRLFPHSLREMWFGREVQGDSLTPFQAFATGLASRVGVGNISGVA
TAIALGGPGAVFWMWLTALIGMSSAFAESSLAQLFKIKEADGSFRGGPAY
YITQGIGSRWLAAAFAIALIFTFGFAFNAVQSNSIVEATRNAWLWDEHYV
GMGLVLLTALIIFGGIKRIGKFSARIVPVMALVYLLIAVSILLIHYDRIP
SVISLIIRSAFDFSAMAGGVFGAMLSKAMLLGIKRGLFSNEAGMGSAPNV
AATADVKHPASQGLIQMLGVFVDTMVVCTCTAIIILLSDNYGGEQLQSIS
LTQNALKYHMGEFGLHFLAFILLLFAFSSIIGNYAYAESNIRFIKNNPVV
VNLFRAMVLFFVYFGAVNSGGIVWAFADTVMAVMAMINLVSLIILSPIVW
LLLKDYHRQAKQGIVPVLDIMLHPRLLKLRLDQRLWNRR
>MS0353 alsT, AlsT protein
MSLETILSSIDSFIWGPPLLILLSGTGLYLTLRLGFLQIRHLPRAFAYMF
KKEEGNHQRGDVSAFQALCTALSATIGTGNIVGVATAIQAGGPGAMFWMW
LVALLGMSTKYAECLLAVKYRVRDKNGFMAGGPMYYIERGLGIKWLAKLF
AVFGVLVAFFGIGTFPQINAITHAMNDTFSVPVTISAAIITILVAAIILG
GVKRIAAVSSYIVPFMAVLYVTTSLIILLINADKVPSALALIIESAFNPE
AALGGALGFTVMKAIQSGVARGIFSNESGLGSAPIAAAAAHTKEPVRQGL
ISMTGTFLDTIIVCSMTGLVLVITGAWQSSDMAGAAVTNYAFSQGLGTNI
GATIVTVGLLFFAFTTILGWCYYGERCFVYLVGIKGIKLYRTAFIILVAC
GAFIKLDLIWILADIVNGLMAFPNLIALIGLRKVIVSETKDYFMRLKTNN
YSLDDNEEQIVNS
>MS1515 amiC, AmiC protein
MKRFFLLFLTALFFAINPAWAAVWTIAIDPGHGGKDPGAISRNLGIYEKN
VTLSIAQELKGILDRDPNFRAVLTRRADYYISVPQRSEIARKNKANYLVS
IHADSSENPALKGASVWVLSNRRANDEMGQWLEDHEKQSELLGGAGSVLS
NHGSEKYLNQTVLDLQFGHSQRTGYELGRSILRNFAKIADLSRTSPQHAS
LGVLRSPDIPSVLVETGYLSNATEEAKLSSPSYRKRIAYIIYQGIVDFRK
RHLGGEINTSSKIAGQMPTEKTQSANTAKAKDNFKDDKETVQDSGVRHKV
KSGETIAKIAGKYNVTSEEIITLNKLKRKDLYIDELVKIPAEKTQSAKTA
KAKDNFKDDNETVQDSGVRHKVKSGETIAKLARKYDVKSEDIVTLNKFKR
KDLYIDELVKIPASAKNRQKNEPVTNTKNTEKAENSAKTAKSELVNGSYT
VKNGDTLFSIANRFGVKQEDIIELNKLKNANIFVGKKLKIPTGAKLKEDT
KQQTKTNKTTKKETQAEPKTIEKATVSYTVKSGDSISRLANKFDVKAAEI
IELNNLKNKELHIGDKIKLPANAKNVVTESKKTSVNKNTAVKGTKSSTKN
TTNKKTAKQDTKKK
>MS0365 ampD, AmpD protein
MMTVKHPIKIKNGWLSGVRKIISPHFDSRPSQADISLLVIHYISLPPEQF
GGGYIEDFFQGKLNPETHPYFQTIYQIRVSAHCLIGRDGRVTQFVSFNDR
AWHAGESCFQGREKCNDYAIGIELEGSNEQPFTEVQYQRLAELTNIIRHY
YPKITEDRIVGHCDVAPGRKIDPGQYFEWTKYFDLLKESQ
>MS2071 amyA, AmyA protein
MKKSLLYLCLFTSSAAFAQGWQHTHFQHFNDNAESNLFQSQTPLGKGNYP
LTFTLDNQCYQPQSAVKLNQTVSLIPCSGEAPQLRLFRQGNYIAQIDMRS
GTPTLRISVEQRAENDNNTVKSCPVWNKQPIEIDVSSTFTEGESVRDFYS
GQTAKVKNGKVTMMPAENAGGLILLEKSADKKTEVFDWKNATVYFVLTDR
FHNGNPANDNSYGRHKDGMQEIGTFHGGDLQGLTAKLDYLQQLGVNVLWI
SSPLEQMHGWVGGGNKGDFPHYGYHGYYHLDWTKLDANMGTEADLKNLMR
QAHRRGIRVLFDVVMNHTGYATLADMQEFNFGDFYLKPEEMAATLGKKWT
TWQPQKGQNWHSFNDFIKFGDSKAWQNWWGKDWVRADIGDYDSPKFDDLK
MSLSALPDLKTESESAVKLPRFFQHKNTNAKELANAKVRDYLITWLTDWV
RRYGVDGFRVDTAKHVEKPTWLALKQASQQALKEWQQKNPQESFGDDFWM
TGEAWGHGVFKSDYYQNGFDAMINFDFQDQAKNALDCFARIDPVYQDMSN
KLKDFNVLSYLSSHDTRLFFHSDSERNAVKQKTAANLLLLSSGAVQIYYG
DESGREFGATGSDPVQGTRSDMNWKELQNDQSKQALHQHWQKLAQFRQRH
RAVGAGVHQTLKSEGYFAFSRTLGEDKVMVVWAGN
>MS1236 amyA, AmyA protein
MNDNWWKNGVIYQIYPKSFQDTTGSGTGDIQGIIKRLDYLQTLGIDGIWI
TPMYVSPQIDNGYDIADYRNIDPSYGTMADFEQLIAEAHKRDIRIVMDMV
FNHSSTFHQWFKQGEDPNSEYHDYYIWREQPTNWQSKFGGNAWKWSDKAQ
KYYLHLFAPEQADLNWENPKLRAELYDICRFWAEKGVDGLRLDVVNLISK
PEKYEDDFEGDGRRFYTDGPKIHQYLQELNQNALKPFGLMTVGEMSSTKL
EHCQRYANLDGSELSMTFNFHHLKVDYPNGEKWTYAKPDYVELKSIFNYW
QKGMHGKAWNALFWCNHDQPRIVSRFGDEGELRTLSAKMLAMLLHGMQGT
PYIYQGEEIGMTNPNFSSIEEYRDVESLNAYQILQNQGKSAVEILQILAQ
KSRDNSRTPVQWDASPNAGFTSATPWIGVAKNYPQINVEQALADRDSVFY
TYQKLIALRKQLAVLTDGDYSDLLPNHESVWLYQRSTAGERLTVAANLSN
QPQFIEIKPQGQVLINNYADITQEDSGICLKPYQALYFLA
>MS2050 ansB, AnsB protein
MKLTKLALTMSLGLGVSFANAAELPNITILATGGTIAGSGATSVSSSYKA
GQLTVQTLIEAVPEMKDLANITGEQVVNIGSQDMSDEVWLKLAKTINAKC
NETDGFVITHGTDTMEETAYFLDMTVKCEKPVVLVGAMRPATEKSADGPL
NLYNAVVVATDKKSAGRGVLVAMNDKVLGARDVTKTSTTAVETFNSPNFG
SLGYIHNSKVDYERSPESKHTTATPFNVDNLTALPKVGIVYAYSNMPTEP
LKALLDAGYEGIVTAGVGNGNVNQANSAILEKAAKDGVAVVRSSRVPTGY
TTRNGEVDDNALGFAASGTLNPQKARVLLQLALTQTKDINKSNNILMISK
SGRST
>MS0548 apaH, ApaH protein
MTRRDYEKIDGSAYANIYAVGDLHGCYELFMRELESVKFDTTRDLVISVG
DLIDRGPHSLSCLRLIRNSWFKAVKGNHECMAIEGLLGQDEHYQRLWLYN
AGDWVLSLNPTERAEVLDLLKFCAGLPLVIELNDEGFKTVIAHADYPYDQ
YRFGRPLTQEQAVWERRRIEMRDETEIKGADAFIFGHTPLKRVMQLGNRL
YIDTGAVFFGNLTLLRLK
>MS0631 apaH, ApaH protein
MATYFVGDLQGCYDELQRLLEKVRFDPTQDLLYLVGDLVARGDKSLECLR
LVKSLGKSAQTVLGNHDLHLLATAFGIKKVKSRDRVDAIFHAEDFEELIH
WLRHQPLLVYNAKQNWVMTHAGISPDWDINTAQACAKEVENVLQQGDYCH
LLSQMYDSRPDLWSADLTGIERLRYIINVFTRMRFCYRDHRLDFDCKSPV
DKAPEELTPWFNLSNPLYKQVDIIFGHWASLVDTPTPHHIYALDTGCVWN
NRMTMLRWEDKQYFCQPALKDYAFNG
>MS0303 apbE, ApbE protein
MKLKQTFTWLSAVIMAISLAACKKDPEIITLSGKTMGTTYHIKYIDDGGL
TQNAEQAQEQIESILKDVNDKMSTYIPNSELSRFNQYKEINQPVEISADL
AKVIKEAVRLNKITEGALDVTVGPLVNLWGFGPEKRIDKQPSATQLEERR
AWVGIEKLALTEQAGKFTLAKAVPELYIDLSSIAKGFGVDQVADYVESIG
AKNYMSEIGGEIRAKGKNIEGKDWQIAIEKPNFDGSRSVQDILGLKDLAM
ATSGDYRNYFEENGMRFSHEINPQTGKPIQHKLASITVLSPSTMTADGLS
TGLFVLGEEKALEVAERENIPVYLIVKTESGFDVKMSSAFKNLLNSSKEG
K
>MS0716 appB, AppB protein
MFDYESLRFIWWILIGVLLLGFVVTDGFDMGVLTLLPFAGKKEVEKRIMI
NSVAPHWDGNQVWLLTAGGAMFAAWPIVYAASFSGFYIAMILVLAALFFR
PVGFEYRAKIDNPAWRKAWDWGLFLGGFVPSLVFGVAFGNLLQGVPFEFN
DLLQVKYTGTFFELLNPFAILCGLISLSMLITHGAAWLQMKTTSDLRDRA
RAITQVGAFATLITFILAGVWLLYKDGFVLNSTVDHFAPSSPLGKTVSLE
TGAWFNNYYEMPVLWIFPALVVVGALLNIASSKADRSGFAFFFSALTMLG
VIFTSGIAMFPFVMPSITHPDMSLLMWDSTSSELTLSLMFGLALVFVVIM
LIYTIWAYAKMFGRLDGNFIEENKNSLY
>MS2117 apt, Apt protein
MSEKYVVTWDMFHMHARKLAERLLPASQWKGIIAVSRGGLFPAAVLAREL
GLRHVETVCIASYDHDQQGDLKVIHKAETDGEGFIVVDDLVDTGNTAREI
RNMYPKAKFVTVFAKPAGAPLVDDYVIDIPQNTWIEQPWDLGIGFVPPLA
RK
>MS0870 apt, Apt protein
MDRTLGNKMNEQLQLIKSSIKSIPNHPKEGIIFRDITSLIEVPEAFQATV
DLIVGNYKNQGITKVVGTESRGFIFGAPVALALGLPFVLVRKPRKLPRET
ISQSYQLEYGEDTLEMHVDSVKAGDNVLIIDDLLATGGTVDATIKLIKRL
GGDVKHAAFVINLPELGGEERLRSLGVEPFTLVNFEGH
>MS1209 ara1, ARA1 protein
MQWLKYKHRCKYSLGGNKMQTFKLNNGVEIPVLGFGVFQIPPEETEQAVI
SAIHAGYRHIDTAQAYMNETETGAGIRNSGVVREEIFVTSKVWIENYGYE
AAKASLDRTLARLDIGYIDLMLLHQPFNDVYGAWRALEEYLAAGKIRAIG
LSNFTADRVLDVGLYNKVMPAVNQIEINPFHQQQAQVEGLLSEGIVPEAW
GPFAEGKFGIFENPVLAKIGQKYGKSIAQVVTRWLVQRGVVVLAKSTRPE
RMAENLNVFDFELDADDFAQIAALDVGKSQIISHTDLAMVRQFKEWVFNV
>MS2075 ara1, ARA1 protein
MLTFVKQGLELGVDTLDHAACYGAFTSEAEFGRALALDKSLRAQLTLVTK
CGILYPNEELPDIKSHHYDNSYRHIMWSAQRSIEKLQCDYLDVLLIHRLS
PCADPEQIARAFDELYQTGKVRYFGVSNYTPAKFAMLQSYVNQPLITNQI
EISPLHRQAFDDGTLDFLLEKRIQPMAWSPLAGGRLFNQDENSRAVQKTL
LEIGETKGETRLDTLAYAWLLAHPAKIMPVMGSGKIERVKSAADALRISF
TEEEWIKVYVAAQGRDIP
>MS0687 ara1, ARA1 protein
MKKITLKNGDKLTLLGMGTWFIGDNAHYRQEEIAALRYGIEHGINLIDTA
EMYGNGRAERLIGEAIAPYDRNSLYLISKVLPNNANKRKMEQACNNSLKA
LNTDYLDMYLYHWRGTTPLAETVECLEALKNKGKIKAWGVSNFDLEDMQE
LLALPNGNQCQLNEVLFHLGSRGIEYALKPYQDKLAIPTVAYCPLAQAGS
LQRNLLRHPEVTTIAEELNCTPYQLLLLFVLAQPNMIAIPKAGQVRHMKE
NIACLDMQLTQQQLARLNNAFPSPTHRIHLDIV
>MS0058 araA, AraA protein
MEFLKKLEVWFVVGSQDLYGDEALKQVNANAEQITRYLNDQNPFIQIKLK
PLATTPEDILSLCQAANYEENCVGVIAWMHTFSPAKMWIGGLTRLNKPLL
QFHTQLNKNIPWNEIDMDYMNLHQTAHGDREFGFMVSRFRKPRTIVVGHW
QSESVKQKLDRWMRVLAAIYDQQHLKVARFGDNMREVAVTEGDKVEAQIK
FGYSVNGYGLYQLVNSINTVNDEDITALVKEYEASYQLADSLKDGGEKRQ
SLIDSARIELGLKAFLDKGGFKAFTDTFQNLAGIKQLPGLPVQRLMAQGY
GFGAEGDWKTAALVRAIKVMSYGLPNGCSFMEDYTYNLDDNNEIVLGAHM
LEVCPSIANNKPILDIKPLGIGGKEDPARLIFTSKSGKATASTIVDLGNR
FRMITADMQAVDKPQDMPNLPVGHAFWKLEPNFDIGTQAWILSGGAHHNV
FSLDIDADMLRTFAEYFDIEFIHINVKTELPNLKNELRWNEAAYK
>MS2173 araC, AraC protein
MPKPLILSRKNLANLGSVIQQRKLLYTRMAVDEPTLLYIQVGQKTLRWRG
QELTIQAGEMVLLAAGQTFDVLNNPDAKLGFYQAGWIALEQRVVDEFADL
FGVETYVQELAKIQPLAPLKAHFDVVRQALENDEAPELVLKLKLFELLAW
LKAEHLSFVPHEKHNLLRQIRKMIASNTAFEWTAETIARQLHLSETSLRR
ALQKSDTTFREVLTDVRMSRALTLLQITKWQVARIANEVGYDSPSRFTVR
FKQRFGFLPSDIRENLSQPVQNEQQKLVRIGVKK
>MS2323 araC, AraC protein
MTDILQLSHHSYFISEESPITVERRHYQPPFPLHRHDFNEIVIISAGNGI
HFWNDEIHPITTGNVLYIESGDKHKYGEVDKLKLDNILYRPEKLSLFPIM
KDYIPHNNEKKSLRINQETLVQLQSLISQLEIESKKTNKSSMHLSEAIFL
QILILICRTQQQENKAYSDISKLESLFSALNQSISQEFYLADFCRQHQLA
VSSVRRIFKQQTNMTIAQYLQKLRLCRAATLLRNTSESVANIAIRCGYSD
SNYFSSVFGKTFSCTPTEYRSRFIKK
>MS0060 araC, AraC protein
MKYQREVQQETNPLLPGYQFGSYLVAGCTPIEKGNEVDFAIRRPNGMKGY
IINLTTKGEGTVFEGDRAFTCCKGDLLLFPPNAEHLYYRSQSSESWHHQW
IYFRPRSFWANWLQWSHISDHVGRLTITDPTTYEEILALFKKIEREYNAK
DIFSEAMSMCLLEQLLIKCIKLDPVNSQRMLDPRILETCHFISANLHINH
KITEIAEHIHMSPSRLTHLFAQQTGSSIIKWREEQRMIKAQHLLHTSGAP
IYAIARQLGYDDQLYFSRLFKRYSGLSPSDYRNSR
>MS1400 araC, AraC protein
MANIRQNQSISELHYQPHKHHPYGIELFTVASLRARSAEVVMEKNYLYQC
DMIIVVTQGSGTLWQDFEPVACMQGSVLWIKQGQACSFGNDKHWDGWVLM
IKNKPLLSEFDYQINTLWLSENELENVEQSLKQLKQDSEKPYSIVHKQLI
HHQFYAFLWRLISLTPNQTILYSPRLRSRFDSFQSLLESYFHEWHHVHQY
ATALACSEKTLSRACLEITQQPAKTVINNRLLLEAKRLLVQSNQSIASIS
LQLGFNEATHFVKFFKREAGITPQKFRELG
>MS2105 araC, AraC protein
MSGILFLLITCLFIQIIMFSEQSFARLLDVIPHNQTYHSPIKGLIIHHSD
HPFSYDNVIQEPSICIVIRGEREVQLGNQCYLFDNRHFMFCPVNVPMCGK
VLQATAEEPFVVMSMKIDLQAVNKILLEQTALLAKNSENPTAFGQWHLDA
ELENAFERLLLLHENTKDITFLAPLIQQEIYYRLLTGEQGDKLKQMVSFG
SNTQKIAKATEYLKAHYIETITVESLAELCGMSLSGFHNHFKKHTTLSPL
QYQKSLRLMEANRLISQENLPISTAAFQVGYESPSQFSREYKRYFGKAPS
VR
>MS2131 araC, AraC protein
MLNWLIRQTLKLKSGEKGTMGIETPVPELFVFHSETDLRDVSQLQESGIC
LILQGRKDVRVGDQHYRYQAGEFVCYTVDLPIMTEYLTDDGGYLDLRLFF
DLPLMREIIDELNRQNFSFAPASQQKIVSTASPELIRAFEMLICLTENSQ
DLPIMLPLIKKAIYFYLLTGEQGGTLRQIALQNSNSQRIVETVGWLKEHY
NESFDIEQLAAASSMSISGFYAQFRRLTGMSPLQYQKNLRLTKANALLKL
GQKNISEIAFEIGYDSLPQFSREYKRYFGHSPRSDLSRAG
>MS1229 araC, AraC protein
MRFCWYTSDNNKSVVNQINMKTSHLAKQTSTELADKSGSEIISPLSLSLD
ARPFNVEIQQPPGNMPAYHWHGHIEINIPFDDDVEYSFNEHSTLINAGHI
SIFWASIPHRLTDKHNCRTMAVFNIPVYQFLSWQLSQNLINHITHGIIIQ
SKNPRLVSLFEVQRWEQELKLEDPNRHKLVYDEIQLMIKRVSLDGWLLLL
EPPKKNNHQLSGSKHAQNYVRTMLDYIANHYNAPLTVQSVANAVGLNTNY
AMGLFQSAMQLTIKQYIIMMRINHAKALLSDTNRSVLDISLTTGFSSMSR
FYDNFLKYTGVSPNKYRKQIRADDNWSAQGLIPTTQAIKGASTGEKLIMT
GEHFNQSEEF
>MS2322 araC, AraC protein
MIQKLLARDFFNNKEQPIILEPRAPQEIFPEHTHDFDELVIVKHGSGRHI
LNGYPHDLYPGVVLYIQAQDHHSYENLQDLCLTNILIQSNNNFKYLNNID
ILLNGLKPENSSYQLINKKTAEYIDSLLEKINAIDESYNLQNECLFFQVL
SSIQAHQFNDSGYGNTEEKGRQMIRWLENNFEKEIDWEELAEKFALPIRT
LHRYIKSQTGHTPQNYVTKLRLAQAYYQLKYTEKNIINIAYDCGFNDSSY
FSTCFKNEYSIAPRELRI
>MS2327 araD, AraD protein
MQNIINSWFVQGMIKATYDMWLKGWDERNGGNVSLRLLDDDVVSYKDEFY
QNPRHVEITQNITALANQYFIVTGSGKFFRNVIIDPADTLAVIKVDEQGK
GYYIMWGLVNGGVPTSELPAHLQSHIVRMKVSGGKDRVIMHCHATNLIAL
TYVLELDPKVITRELWEMSTECLVVFPDGVGVLPWMTPGKDEIGYATAQE
MAQHPLVLWAFHGVFGTGPTLDDAFGLIDTAEKSAEILVKVLSMGGKRQT
IQTDEFKLLAERFGVTPMDGVL
>MS0046 araD, AraD protein
MLKELRERVLQANLELPKHKLITFTWGNVSEIDREKGLVAIKPSGVDYDV
MTVDDIVIVDLDGNHVWGDKKPSSDTATHLELYRQFPEIGGIVHTHSRHA
TAWAQAGEDLIALGTTHGDYFYGAIPCTRKMTAEEIAGEYELETGKVIVE
TFRKRGINPTDIPAVLVNSHGPFVWGKDGFNAVHNSVVLEEIAYMNAFSK
LIRPNVQSMQQELLDKHYLRKHGKNAYYGQ
>MS1979 araD, AraD protein
MTDLEQKELMVQLGRSFYERGYSVGGAGNLSVRLDENRVLVTPTGSSLGR
LKVERLSVLDMDGNVLEGDKPSKESVFHLEMYRKNPKCNAIVHLHCTYLT
ALSCLQGLDPTNAMKAFTPYYVMRVGKMQVIPYYRPGSPEIARELSERAL
SGKAFLLANHGVVVTGADLLDASDNTEELEETAKLFFTLQGQKIRYLTDD
EVKDLENRGK
>MS0061 araH, AraH protein
MIMTSTTQEKSAGSFSKIWNAYGMLLIFAVIFVCSCVFIPNFATVVNMKG
LGLAISMSGIVACGMLFCLAAGEIDLSVASVIACAGVVTAVVINMTQSVT
IGILAGLGLGIAVGLINGFVIAKLKINSLITTLATMQIARGFGYIISDGK
AVGITKEEFFELGYQDIFGVPLPIIFTVICMVVFGFLLSKTTYGRNTLAI
GGNQEASRLAGINVDRTKLIIFVVSGFVSALAGVILAARMTSGQPMTSVG
FELVAISACVLGGVSLNGGVAKISFIIAGVLILGTIENAMNLLNISPFAQ
YVVRGLILLIAVIFDKYKQKFIKS
>MS0199 araH, AraH protein
MSAIKLNVRDAGTLVGLVIIFVVFSFLSPVFFTVPNLLNILQQSSLNAAI
ALGMTLVIISAGIDLSVGPTAALSAVLGASLMVSGVPVPIAVLGALCIGS
LGGLFNGVLIAYAGLQPFIVTLGSLSLYRALSLIYTGGNPIFGIPAEFRA
FMNGSLFGIPSSILIVASIALILWVVLNKTPLGEYIFAVGGNEEAARVCS
VPVAKTKVAVYMISGFLASVAGLVLVGRLGAADPTLGNLWELDAIAAAAI
GGASLMGGKGSIIGTILGAVILGALRNGLTLLNIQAFYQLLATGLIIIVA
MLIDRATRGK
>MS0641 araH, AraH protein
MAGQKNKTWDFFKQNAIYFVLLILLGVIIAQDPSFLNLMNFSNILTQSSV
RLIIALGIAGLLVVQGTDLSAGRQVGLAAVVAATMLQAIDNLNRVFPNLP
EMPIFVVILIVCSIGAVIGLINGFVVAILNVTPFIATMGTMIIVYGFNSL
YYDAVGGSPIAGFSENFSSFAQGFFRFGSFKLSYITIYAIIATILMWILW
NKTRFGKNIFAIGGNPEAARVSGVNVTRNLLVIYMLAGVFYAFGGMLEAG
RIGSATNNLGFMYELDAIAACVVGGVSFAGGVGTIIGVVTGVLIFTVINY
GLTYIGVNPYWQYIIKGSIIILAVAIDSLKYAKKK
>MS1610 araH, AraH protein
MFSFKKLISKLGIGLVLLFMIIGMSLTSQAFLSTNNIFNILLQVSVICVI
SVGMTYVILTGGIDLSVGSIVALSAVCLGLFTHWGVAWLGENPSQGALLA
VVLLSIVGAVLVGALCGYVNGVVIVYGKVTSFITTLGMMGIARGLALTLS
DGKTIYNFPEQLRFFGNGRLAVTENFSIPIPVIIALIVVLISFYVLTQTV
FGRQIYALGGNREAVRLSGINVNKLEIKTYVINGALAAVGAVILVGRLNA
AQPIAGTGYELDAIAATVIGGTSLMGGVGSVVSTSIGALIMGVLQNGLTL
LNVTSYLQRLIIGMVIILAVFLDQLRRGEVSTGGLRRIFFRE
>MS1579 araJ, AraJ protein
MLNRKLVNRVEYFRVIVMAFAAFVFNTTEFVPVALLSDIADSFQMPVSNT
GLMITIYAWIVSLCSLPCMLMTARLERRRLLISLFILFIASHILSAFAWN
YEVLLIARAGVALTHSIFWSITAALTIRIAPKNKKTQALGLLALGSSLAM
VLGLPLGRIIGQAFGWRTTFTLIGVFAALILILIVRLLPKIPSQNAGSLK
SLPVLARRPMLITLYIFTILVISAHFTAYSYIEPFMIQIGRVSANKATAV
LLIFGVSGVVASVLFSRLYRIAPIKFLLSSVAILTLALICLYGVSGISGA
IFALVFIWGVAISALSLAMQMKVLQLAPDATDVATAIYSGIYNIGIGGGA
LIGNQVMQHLGLANIGYVGAVLGAVSIIWFILMFLKFSRVPLNIVNQ
>MS0754 argB, ArgB protein
MRSTELVQWFRQSTPYVNMHRGKTFVIMLDGNTIASSNFINIINDISLLH
SLGIKLIIVYGARVQINSLLAQNNVTSVYHKNIRVTDPRTLELVKQAVGQ
LSYDITARLSVRLPHSPVLNVVSSNFILAQPIGVDDGVDYMLSGKIRRIE
IDNIKHHLDNNAIVLLGPIAPSVTGETFNLPFEEIATQVAIKLKAEKLIG
FSSTQGILDPQGISIPDLLPQDAAKYLNQYIQQGEYHCSQARFLQAAIEV
CKAGVKRSHLLSYEEDGSLLQELFTRDGVGTQLSVDNSEDIRIATVQDIP
GLIELIHPLEQQGILVKRSREQLEMDIANYTIIDRDGVIIACAALNQYPE
ENMAEMACVAVHPDYRSSSRGDILLEAIQKRARQLGIEKLFVLTTRTVHW
FQERGFRLANVEDLPKEKRDHYNYQRRSKILIQPLNEEE
>MS0236 argB, ArgB protein
MKPLVIKLGGVLLDTPAAMENLFTALADYQQNFARPLLIVHGGGCLVDDL
MKRLNLPVQKKNGLRVTPADQIDIIVGALAGIANKTLVAQAAKFKLNPVG
LCLADGNLTQATQFDPELGHVAMVVAKNPALLNNLLGDAFLPIISSIAVD
DNGLLMNVNADQAATAIAALINADLVMLSDVDGVLDANKQRLTELNSAQI
EQLIEDKVITDGMIVKVNAALDAAKILNCGVDIANWKYPEKLTALFAGEI
IGTRINP
>MS0235 argC, ArgC protein
MAQKAIVIGASGYTGAELARILTHHPEFELAGLYVSTNSADANKSISTLY
PQLKTICDLPLQPLPEDLTEIAQNADLAFFGTAHEVSANLAPVFLQNNCK
VFDLSGAYRVNSESFYQEFYGFEHKHPELLKQAVYGLAEWNADKIKTTDL
VAVAGCYPTVSQLSLKPLIEEGLLDVNQLPVINAVSGVSGAGRKASLTSS
FCEVSLNAYGVFNHRHQPEIATHLGTDVIFTPHLGNFKRGILATITAKLK
AGVSDEQIKRAYAKYYANKPLVRVYEQGLPSIKAVEFSPYCDIGFATKNN
HIIIVGAEDNLLKGAAAQAVQCANIRYGYNEVLGLI
>MS0783 argD, ArgD protein
MILVAGPNVLRFAPALNISQQEVAEGFKRLDQALQKFA
>MS0782 argD, ArgD protein
MSQYTRKTFDEVMIQNYVPADFIPVKGKGCKVWDQQGRDYIDFTSGIAVN
ALGHCPDEIVDVLKKQGETLWHSSNWFTSEPTLELASKLVEHTFAERVMF
ANSGGEANEAALKLARRYAVDNYGYQKDTIISFKKSFHGRTLFTVSVGGQ
AKYSDGFGPKPAGIVHLPFNDLDAVKAMIDDHTCAVIVEPIQGESGIIPA
TKEFLQGLRRLCDENNALLIFDEVQTGVGRTGYLYAYESYDVVPDILTSS
KALANGFPISAMLTTTKIAASFKPGVHGTTFGGNPLACAVGAKVIETIAN
PAFLENVQKTSALFISELNKLNEKYHLFNEVRGQGLLIGGGIN
>MS0829 argD, ArgD protein
MTITTPVKAVLASNQYFLDRQNAMESNVRSYPRKLPFAYAKAQGCWVTDV
EGNEYLDFLAGAGTLALGHNHPVLIQSIKDVLDSGLPLHTLDLTTPLKDA
FTEELLSFFPKDQYILQFTGPSGADANEAAIKLAKTYTGRGNVIAFSGGF
HGMTHGALSLTGNLGAKNAVQNLMPGVQFMPYPHEYRCPFGIGGEAGAKA
VERYFENFIEDVESGVVKPAAVILEAIQGEGGVVPAPVSFLQKVREVTQK
HGILMIVDEVQAGFCRSGKMFAFEHAGIEPDIVVMSKAVGGSLPLAVLAI
KKEFDAWQPAGHTGTFRGNQLAMATGYASLKIMREENLAQNAQQRGEYLT
QALRELSKEFPCIGNVRGRGLMMGIDIVDERKPQDAAGAYPQDGELAATI
QKFCFKNKLLLERGGRNGNVVRVLCAININQAECEEFIKRFKQSVTDAIK
AVRG
>MS0674 argE, ArgE protein
MKNTIINLAQDLIRRPSISPDDQGCQQVIAERLTKLGFNIEWMSFNDTIN
LWAKHGTTSPVVAFAGHTDVVPTGDENQWNYPPFSAQIVDDMLYGRGAAD
MKGSLAAMIVAAEEYVKANPNHAGTIALLITSDEEAAAKDGTVKVVESLM
ARGENIDYCLVGEPSSAKQLGDVVKNGRRGSITGDLYIQGIQGHVAYPHL
AENPVHKATKFLTELTTYEWDNGNEFFPPTSLQIANIHAGTGSNNVIPGE
LYVQFNLRYCTEVTDEFIKNKVAEMLQKHDLTYRIDWNLSGKPFLTKPGK
LLNAVVESLESVAGIKPKLDTGGGTSDGRFIALMGAEVVELGPLNATIHK
VNECVSCRDLATLGEVYRQMLVNLLGK
>MS0233 argE, ArgE protein
MKRLPKFLDMYSQLIALPTISALEPEFDQSNKALIELLADWLATLGFKTE
IIPVENSRAKYNLLATYGEGEGGLLLAGHTDTVPCNEELWTTNPFKLTER
DGKFFGLGTADMKGFFAFVIDAVRQIDLTKLTKPLRILATADEETTMLGT
RTFIRHTHIRPDCALIGEPTSLRAVRAHKGHVGKAVRIIGKSGHSSDPAK
GINAIELMHEATGYLMQMRNELRDKYHHDAFEIPYPTMNFGAIHGGDAVN
RICGCCELHFDIRPLPKMRLEDLDEMLQQKLAPMFEKWGDRISIEALHEP
TPGYECEHSAQVVQVVEKLLGEKCEVVNYCTEAPFIQELCPTLVLGPGSI
EQAHQPDEFLSAEFIEPTRDLLTKMIMHFC
>MS1555 argE, ArgE protein
MSVNMKRIQTIIEKLASISSVPGELTRLAFSAEDEAAHNYLIELCKPYDL
SIRRDQVGNLFIRKSGIEDHLPAVTFGSHIDTVVNAGKFDGPLGSVGGLE
ILFQLCEQGVQTRYPLELIIFTCEESSRFNYATLGSKLMCGIANRESLSR
LRDKQGNSLEEAMATIGLDFTEVDQVKRNAEEFKCFFELHIEQGPRLANE
RKTIGVVTGIAAPIRCIVKIQGQADHSGATAMHYRRDALLGGAELALAIE
RAAIDAGHSTVATVGNLNAKPGVMNVVPGYCELLVDIRGIHSEARESVFT
VLQQQIEQVTAKRGLSIELQLISKDQPILLPDQMVQQISRAAQDLGYAYE
IMPSGAGHDAMHMATFCPTGMIFVPSKNGISHNPLEFTSWEEIEAGIKVL
QLVVLEQAEKV
>MS1073 argF, ArgF protein
MPFNLKNRHLLSLVNHSPREIKYLLDLARDLKRAKYAGTEQPRLKGKNIA
LIFEKTSTRTRCSFEIAAYDQGANVTYIDPTSSQIGHKESMKDTARVLGR
LYDAIEYRGYKQETVEELAKFSGVPVFNGLTDEFHPTQMLADVLTMIEHS
TKPLNEIKYVYIGDARNNMGNSLLLIGAKLGMDVRICGPKSLLPEENFVS
ICEEISKETGARLTVTDDIDLAVKDADFVHTDVWVSMGEPIEAWGERINL
LMPYQVNTDLMKRTGNPNVKFMHCLPAFHNCETKVGREIAAAYPNLANGI
EVTEDVFESPMNIAFEQAENRMHTIKAVMVASLA
>MS1479 argG, ArgG protein
MSNTILQNLPLGQKVGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPD
EDDYNAIPKKAMAYGAENARLIDCRKQLAQEGIAAIQCGAFHISTGGVTY
FNTTPLGRAVTGTMLVAAMKEDDVNIWGDGSTFKGNDIERFYRYGLLTNP
NLKIYKPWLDDQFIDELGGRFEMSQFLIANGFDYKMSVEKAYSTDSNMLG
ATHEAKDLEDLSTGIKIVKPIMGVAFWDESVEIKPEVVTVRFEEGVPVEL
NGKRFDDVVELFMEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMAL
FHIAYERLVTGIHNEDTIEQYRINGLRLGRLLYQGRWFDPQALMLRESSQ
RWVAKAITGEVKLELRRGNDYSILDTVSPNLTYEAERLSMEKVEDAPFDP
IDRIGQLTMRNLDVTDTRNKLGIYSEAGLLTAGKDAVVPQLGSK
>MS0237 argH, ArgH protein
MALWGGRFTQAADQRFKDFNDSLRFDYRLAEQDIEGSVGWSKALVSVGVL
TTDEQQQLERALNELLIEVRSNPQAILQDDAEDIHSWVESKLIDKVGNLG
KKLHTGRSRNDQVALDIKMWCKAQVTELQYAVRDLQAKLVETAENNQHAV
MPGYTHLQRAQPISFAHWCMAYVEMLERDYSRLADAYNRMDSCPLGSGAL
AGTAYPVDREQLAKDLGFAFATRNSLDSVSDRDHIIELLSTASLSMVHLS
RFAEDMIIFNSGEADFVELSDRVTSGSSLMPQKKNPDACELIRGKAGRVI
GSLTGMMVTVKGLPLAYNKDMQEDKEGIFDALDTWHDCLTMAAFVLEDIR
VNVERTREAALKGYSNATELADYLVAKGVPFRDSHHIVGETVVYAIKVHK
GLEDLSIEEFRQFSDVVGEDVYPILSLQSCLDKRSAKGGVSPLRVAEAIA
DAKARIAAKK
>MS1267 argR, ArgR protein
MAIEKTDNLLTVFKDLLSQERFGSQSEIVSALQDLGFSNINQSKVSRMLT
KFGAIRTRNTRMEMVYCLPNELSVPNTSSPLKNLVLDIDHNDFLIVIKTS
PGAAQLIARLLDSVGKTEGILGTIAGDDTIFITPTKGTGIKELINTIQQL
FENSL
>MS1330 argS, ArgS protein
MNIQWILSDKIKRAMIAAGAEQNAEPLVRQSGKPQFGDYQANGIMGAAKK
LGLNPREFAQKVLEQVDLSDIAEKTEIAGPGFINIFLNKNWVAQQADTAL
NTPNFGIKTAHPQTVVIDYSSPNVAKEMHVGHLRSTIIGDAVARALEFMG
NHVIRANHVGDWGTQFGMLIAYLEKMENEHADAMQLSDLEAFYRAAKETY
DNDEEFAVKARSYVVKLQSGDEYCRTMWKKLVDMTMQQNQRNYERLNVTL
TEKDVMGESLYNPMLPAIVEDLKKQGLAVEDDGALVVYLDEFKNKDGDPM
GVIVQKKDGGFLYTTTDIAAAKYRYHTLHADRALVFSDTRQSQHMQQAWL
ITRKAGYVPDSFSLEHHNFGMMLGKDGKPFKTRSGGTVKLADLLDEAVER
ATLLINEKNTALSEQEKAAVIEAVAIGSVKYADLSKNRTTDYVFDWDNML
SFEGNTAPYMQYAYTRIRSIFNKTDVNPTALSAAHIEIRNDKERALAIKL
LQFEEAVQTVAKDGTPHILCNYLYELAGVFSSFYEHCPILNAEEPVKLSR
LKLAKLTEKTLKQGLDLLGIKTVEKM
>MS1575 aroA, AroA protein
MEKLTLTPISHVEGTVNLPGSKSLSNRALLLAALAKGTTRVTNLLDSDDV
RHMLNALKQLGVNYSLSEDKSVCEVQGLGKAFAWQNGLALFLGNAGTAMR
PLTAALCLANADSVPAEIILTGEPRMKERPIKHLVDALLQAGADVQYLEQ
EGYPPLAIRNTGLKGGKVKIDGSVSSQFLTALLMAAPMAERDTEIEIIGE
LVSKPYIDITLNMMKIFAVDVDNQNYQRFVVKGNQQYQSPNIFLVEGDAS
SASYFLAAGAIKGKVRVTGVGKNSIQGDRLFAEVLEKMGAKITWGEDYIE
AERGELNGIDMDMNHIPDAAMTIATTALFAQGETVIRNIYNWRVKETDRL
SAMATELRKVGAEVEEGEDFIRIQPPASDQFKHAEIETYNDHRMAMCFAL
VALSNTAVTICDPKCTAKTFPTFFDEFSAIATV
>MS1968 aroB, AroB protein
MVCVNVELKERRYPIYIGENLLTDTGVYPVKMGDKVMIVSNPTVAQYYLT
PVTETLEKLGCQVSHVLLPDGEKYKTLDSLNMIFTALLKENHGRDTTLIA
LGGGVIGDVTGYAAASYQRGIRFIQIPTTLLAQVDSSVGGKTAVNHELGK
NMIGAFYQPCTVIIDTRTLVTLPKREVNAGLAEVIKYGAILDLPFFEWLE
AHIDNLVALNQQDLQYCIARCCQIKADVVARDETEKGDRALLNLGHTFGH
AIETHLGYGNWLHGEAVAAGSMMAAVLSEKLGDLSYSEVARLEKLLARAN
LPTVSPDTMQAEDYLPHMMRDKKVLAGKLRLVLLKTLGQAYVASDTDKSL
VLDAIRVCSQNN
>MS0866 aroC, AroC protein
MAGNSIGQLFRVTTFGESHGIALGCIVDGVPPNMALSEADIQPDLDRRKP
GTSRYTTPRREDDEVQILSGVFEGKTTGTSIGMIIKNGDQRSKDYGDIMD
KFRPGHADYTYQQKYGIRDYRGGGRSSARETAMRVAAGAIAKKYLREQFG
VEVRGFLSQIGDVKIAPQNISEIDWAQVNDNPFFCPDQSAVEKFDELIRQ
LKKDGDSIGAKLTVVAENVPVGLGEPVFDRLDADLAHALMGINAVKAVEI
GDGFAVVEQRGTQHRDEMTPQGFLSNHAGGILGGISTGQPIIATIALKPT
SSITVPGRTVNLNNEPVELITKGRHDPCVGIRAVPIAEAMTAIVLLDHLL
RHRAQCGLK
>MS0133 aroE, AroE protein
MQLKSSLIVNQHLRLEQITEDDAEPVFRLICRQRDYLSRWLPGVGLTSNV
SSTLKFIRSLKPLEQVFTIRRDDEIIGLVSFNKADYSNLKLEIGYWLSQS
EQKQGIMTQCVQTMIDYAFNQLYFNRIQIKCAIGNTASKGIPQRLGFQLE
GIERQGLLLLSGEFADFEIYSMLAQDWKNKQDKQIMDTYAVWGNPIAQSK
SPAIHKIFAEQTGQNMKYIAMLGDEQHFERQLQEFFAQGAKGCNITAPFK
ERAYRLADEYSERALTAGACNTLKKLENGKLYADNTDGAGLVSDLQRLGW
LKPNQQILILGAGGATKGVLLPLLQAQQKILIANRTLAKAEELAEKFSPY
GEIRAVELKTIPPYRYDVVINATSLGLTGKTADIQPEILQQAGAVYDMQY
AKETDTPFIALAKSLGVNNVSDGFGMLVGQAAHSFRLWRGIMPDIEVLLN
RGI
>MS2315 aroE, AroE protein
MINKDTQLCISLSGRPSNFGTRFHNYMYEKLGLNFVYKAFTTNDIEHAVK
GVRALGIRGCAVSMPFKESCMPFLDEISPSAKAIESVNTIVNTDGYLKAY
NTDYIAISKLIAKYQLKPTACVIIQGSGGMAKAVAAAFKNAGFDNLKIYA
RNATTGGYLAKLYGYQYIDSLYGQNADILVNATPIGMKGGGKEESIISFP
EAMIDQASVAFDVVAMPAETPLIKYARQQGKTVISGAEVAVLQAVEQFEL
YTGQRPGDELIAEAASFARANS
>MS1104 aroG, AroG protein
MKDSIHNVHIIDEKVLITPAELKQKLPLPIALRTQIETHRREIADIVHKK
DDRLLVVIGPCSVHDTKAAIDYAKRLKALSDELKDQLYIVMRVYFEKPRT
TVGWKGLINDPRIDGTFNVEEGLHIGRKLLLDLAEMGLPLATEALDPMTP
QYLADLFSWSAIGARTTESQTHRELASGLSMAVGFKNGTDGSLATAINAM
KAASMGHSFIGINQQGQVNLLHTEGNPDGHVILRGGKKPNYQQEFVNQCE
EELAKAGLETAIMIDCSHGNSNKDYKRQPSVAKDAVNQIVAGNKSIIGLM
IESNINAGNQSSEQKVSEMKYGVSITDACIDWETTDNLLRKIAAALKNRA
E
>MS1184 aroG, AroG protein
MISFKVRLNFSIFRMIYRELIMPTKNKNNIRVANDDTRIANIEQLLPPVA
LLEKYPASNVAVKTVRNARNKAHQIIHGEDDRLLVIIGPCSIHDPKAALE
YANRMAKMREKYKDTLEIIMRVYFEKPRTTVGWKGLINDPYLNDTYALND
GLRIARKLLSDINDLGLPTAGEFLDMITPQYVADFMSWGAIGARTTESQV
HRELASGLSCAVGFKNGTNGGVKIALDAIGAAEASHHFLSVTKFGHSAIV
STKGNLDCHIILRGGDKGTNYDAENIAKVCANIEKSGRIGHVMIDFSHAN
SSKQFKKQVEVCHDVAKQIAQGSNQIFGVMVESHLVEGRQDLVNGKAETY
GQSITDACIGWDDTEIVLQELSDAVAARRKVNGK
>MS1969 aroK, AroK protein
MRLRILLLFIENFKKNNTMAEKRNIFLVGPMGAGKSTIGRQLAQLLNMEF
IDSDNEIEQRAGADISWIFDIEGEDGFRKREERIINELTQKQGIVLSTGG
GAILSKETRNHLSARGIVIYLQTTVDKQFERTQRDKKRPLLQGVEDVRKV
LEDLAQVRNPLYEEVADITLPTDEQSAKLMASHIVELIDNFNS
>MS1790 aroQ, AroQ protein
MSQLSRILLLNGPNLNMLGAREPKHYGTLSLAAIEANVQALAAKNNIELE
CFQANSEEKLIDKIHQSFKKVDFILINPAAFTHTSVALRDALLAVAIPFV
EIHLSNIHKREPFRHHSYFSDVAEGVICGLGAKGYECAFEFAVEFLAKKA
>MS1959 arsC, ArsC protein
MSVIIYHNPRCSKSRETLKLLQDQNINAEIVLYLEKRFSVSELQSLMKKL
NIHSPKEMMRIKDALYQELQLNNEHISEQELLEAIGNHPALLERPIVING
DKAKIGRPPEAVLSIL
>MS0672 arsC, ArsC protein
MITVYGIKNCDTVKKALKWLTDNNIEHKLHDYRTDGLDPEFLINAEAQFG
WQTLVNKRSTTWRNLDSQIKENMEKHTALSVLAEQPTLIKRPIILQDGIA
LIGFNIKEYKKAFG
>MS0220 artI, ArtI protein
MQVVLKRRKQNNSDNIYLTINQGSYMKKLLLAAALAGTTFAAQARDITFA
MEPSYPPFELTNAQGEIIGFDVDVAKAICKEIEANCNFKSQSFDALIPSL
KAKRFDAAISAIDITETRAKQVLFSDAYYDSSASFIAVKGKADLNSAKNI
GVQNGTTFQQYTVAEAKQYSPKAYTSLQDAILDLKNGRIDIIFGDTAVLA
DMLAKEPELTFVGDKVTNKKYFGNGLGIAVNKSDKALVENLNKGLAAIKA
NGEYQKIYDKWMTAK
>MS0704 artI, ArtI protein
MKKLLLSTLLITTAFAVSAKDISFAMEPTYPPFEFTNEKGEIIGFDVDIA
NALCKEMQANCTFKSQAFDALIQGLKQKRFDASISGMGITEARKKQVLFT
EPYFSSSAAFIAKKGTDFTKVKTIGVQNGTTYQNYIIKEKPEYEVKAYAS
FQDALLDIQNGRIDAIFGDIPVLVDMIKKTPELAFAGEKIDNKTYFGNGL
GIAANKANQELIDEFNQALIKIRQNGEYQKIYDKWMTAK
>MS1277 artI, ArtI protein
MFKKLVLLATGMFAVATTTQAVAADSLLDRINNKGTITVGTEGTYAPFTY
HDASGKLTGYDVEVTRAVADKLGVKVEFKETAWDSMMAGLKAGRFDIVAN
QVALTTPERQATFDKSEPYSWSGAMMAVRADDDSIKTLDDIKDRKAAQSL
TSNYGELAREKQAKIVPVDGLAQSLLVVQQKRADFTLNDSLAILDYLKKN
PNSGLKSAWEAPAEEKLGSGLIVNKGNDEALAKISAAVIELQKDGTLKKL
GEQFFGKDISVK
>MS0900 artI, ArtI protein
MKKATLATLIAAMFVTATAQAQTSPDTLTKVLETKELVVCSPGDYKPFSF
DNNGKFEGVDNDLMDKLAQSMGAKVTIVKTTWKTLMDDFTANKCDIAVGG
ISITLERQQKALFTEPYFINGKTPIVRCENVDKYQTVEQINRPEVRIIAN
PGGSNEKYARNELSNANLTMNAENLTIFQQVIDKKVDVFVSEAAEAIVKA
HEHKGVLCAVNPDKPLKPAQNGWLIHNGDYRFKSYVDQFLHLEKMSGNLD
KTINKWLPRD
>MS1808 artI, ArtI protein
MIERLQCHHFPNTESSHLKGLLLRIIAAFALVLWAIDMVFPWQQMMRSEE
NRYNAIQQRGKLVVGTVNNPVSYFIGNEGQAGLEYELSRAFADYLGVELE
MKAMDNGEQLFDALEDNEIDIAAANLLYQAKKAETFQLGPAYYSASWQLV
YRKGESRPQSLSQIKDKLIIARGSELPLILQGYQTKYPNLKWQLENNQTQ
EELLLQVAQGKIKYTVANSIDVSAVQQVRPEIAVAFDVTDEASVHWYLPN
NSYNELQAALLDFMNTALEGGLIARIEEKYFNHFSQFDYVDMRQYVQAIN
DILPKYAPLFDRYKGDLDWRLLAAIAYQESHWNENATSPTGVRGMMMLTK
DTAERMKIADRTDAEQSIKAGSEYLHWLISQVPDSIPKEDRIWFALTGYN
MGLGHMLDARRLTKNLGGDPDNWLDVKKNLPLLAEKRYYPNLKYGYARGY
EAFQYVENIRRYMNSIINYYRVQENADNKDKPSETDENLPLPLTDNQEKQ
E
>MS1684 artI, ArtI protein
MKIFKKTTALLAAALLATGLTACDNKDSGAASADNNAVSAIERIKKADKV
RIGVFSDKPPFGYVDKDGKVQGFDVEIAKAVTKDLLGDENKAEFVLVEAA
NRAEYLLSNKVDITMANFTVTPERKEVVNFAKPYMKVALGVVSKQDAPIT
DVAQLADKTLLLNKGTTADAYFTKNFPKNKSLKFEQNTETFQALLDGRGD
ALSHDNTLLFAWAKENPGYVVAIKNLGDLDYIAPAVKKEDTDLLQWLDGE
IEKLAKDGTLNKAYQKTLQPIYGDEIKEADVLVEYQ
>MS1687 artM, ArtM protein
MSIIMNWQYIWNALPRFVDATILTLELSFWAILFSVIIGVICAVVMSYRV
RGLQTIVKAYIELSRNTPLLIQIFFLYFGLSKIGVKLEGFTCAVIGLAFL
GGSYMAEAVRAGIESVSKGQVESALSIGLTPMQTFRYVVFPQAFAVATPA
IGANCLFLMKETSVVSAIAIAELMFMAKEIIGMDYKTNEALFLLVVFYLI
ILLPVSVFIGYLERRLRRAKYGA
>MS0222 artM, ArtM protein
MFREYFMEIARGIPTSLLLTAVALAVAFVLALFLTFLLSMENKPVKRVIN
IFLTLFTGTPLLVQFFLIYSGPGQFQWIVNSALWPLLSNAWFCAMFALAL
NSAAYSTQLFHGAVKAIPKGQWESCAALGLSRLQTLKILIPYALKRALPS
YSNEIILVFKGTSLASTITIMDIMGYARQLYGTEYDAITIYGIAGVIYLV
ITGLMTLLLRKLEHKVLAFERLEVEKA
>MS1686 artM, ArtM protein
MGLTLLFEGNNLQRLLAGLGITAEIAFVSVFFACILGIVMGVVMTSRNIF
VRGFCRLYLEIVRIIPLLAILFIVYFGVAKWFNVHLSGVTVCILVFIFWG
TAEMGDLVRGALTSIEKHQTEAAYALGLSKIQTFIYILLPQSLKRVTPGA
INLFTRMIKTSSLAMLIGVLEVIKVGQQIIETSLFRDPTSALWIYGVIFA
LYFAICYPLSLFSKYLEKRWEN
>MS1276 artM, ArtM protein
MLNNLLLSIPFMTESRVDLVISAFWPMVEAAVLVSIPLAVSSFIIGMIIA
VAVALVRVTPVNGVIHRLFLVIVKVYISIIRGTPMLVQISVVFYGLPALG
IFIDPIPAAIIGFSLNIGAYASETVRAAISSVPKGQWEAGYTIGMSYMQT
FRRIIAPQAFRVAVPPLSNTFIGLFKDTSLASVVTVTEMFRVAQQMANMS
YDFLPIYIEAGLIYWCFCWVLFVIQAKVEKRMERYVAR
>MS0221 artM, ArtM protein
MFFEYLPLMSTATLMTLGLAVCSLIAGLVLAIFFVVLETNKFVCVRKPTA
IFVTLLRGLPEILVVLLIYFGSTELVEKLTGEYIEFSPFLCGVIALAIIF
AAYASQTLRGAIQAIPLGQWESGAALGLSRGYTFVNIILPQVWRHALPGL
SNQWLVLLKDTALVSLIGVDDLMRQASLVNTNTHQPFTWYSFAALLYLII
TLVSQFFMRKLEMRFTRFERGVK
>MS1101 asd, Asd protein
MAILFLLFHDFLPPYFLLQLKICRIERLFIAERIMSTSLNIAIAANFDLC
EKIASYLEESLLEVEKLSIVEIYPFSEEQGIRFNGKAVAQLPVDEVEWSD
FNYLFFAGDLAHIPLLAKASEAGCLTIEMNGVCSALADVPVVIPGVNEEQ
LRDLRQRNIVSLPDAQVTQFALSVRSLLNNASNAQIVVSSLLPASYYDAD
GVHKLVGQTAKLLNGIPPDEEEMRFAFDVFPAKSLNLNAQLQRVFPQLEN
VVFHQIHVPVFYGLAQMVTVKAEFEPEQDSILAEWSTNDLIRYHQDKVMT
PVLNGEAENNEDEVHLQISALESVEGGIQYWLVADNQRFSQAFLAVKLLE
SIYRQGY
>MS0006 asd, Asd protein
MKNVGFIGWRGMVGSVLMDRMQQEQDFANLNPVFFTTSQAGQKAPVFGGK
EAGNLKDAFDIEELKKLDIIVTCQGGDYTNEVYPKLKATGWDGYWVDAAS
ALRMEKDAIIVLDPVNQHVIADGLKNGIKTFVGGNCTVSLMLMALGGLFE
RDLVEWISVATYQAASGAGAKNMRELVSQMGLLEKSVSEELANPASSILD
IERKVTAEMRADSFPTDNFGAALAGSLIPWIDKLLPSGQTKEEWKGYAET
NKILGLSDNPIPVDGLCVRIGALRCHSQAFTIKLKKDVPLEEIEQILASH
NEWVKVIPNDKETTLRELTPAKVTGTLSVPVGRLRKLAMGPEYLAAFTVG
DQLLWGAAEPVRRILKQLVA
>MS0036 asnA, AsnA protein
MKKSFILQQQEISFTKNTFTEKLAEHLGLVEVQGPILSQVGNGIQDNLSG
TEKAVQVNVKMITDAAFEVVHSLAKWKRHTLARFGFAEGEGLFVHMKALR
PDEDSLDQTHSVYVDQWDWEKVIPEGRRNLDYLKETVREIYAAILETEAA
VDKKYGLKSFLPKEITFIHSEDLVKDYPGMTDKERENELCKKYGAVFLIG
IGGVLPDGKPHDGRAPDYDDWTTTSEGEYKGLNGDILVWNPILNRAFEVS
SMGIRVDETALRKQLSITGDEDRLKFDWHQDLINGRMPLSIGGGIGQSRL
AMLLLQKRHIGEVQSSVWPKAVMEQYENIL
>MS1042 asnS, AsnS protein
MTKIVSVAEVLQGRTAIGEKVTVRGWVRTRRDSKAGLSFLAVYDGSCFDP
IQAIINNDIVNYESEVLRLTTGCSVIVTGTVSKSPAEGQAVELQAETVEV
VGWVEDPDTYPMAAKRHSIEYLREVAHLRPRTNIIGAVARVRHCLAQAIH
RFFNEQGFYWVATPLITASDTEGAGEMFRVSTLDLENLPRDDKGAVDFSQ
DFFGKESFLTVSGQLNGETYACALSKVYTFGPTFRAENSNTTRHLAEFWM
VEPEVAFATLADNAKLAEDMLKYVFNAVLKERMDDLKFFEKHIDKDVINR
LERFVASDFAQIDYTDAIDVLLKSGKKFEFPVSWGIDLSSEHERYLAEEH
FKSPVVVKNYPKDIKAFYMRLNEDGKTVAAMDVLAPGIGEIIGGSQREER
LDVLDARMAEMGLNPEDYWWYRDLRKYGTVPHSGFGLGFERLIVYVTGLQ
NIREVIPFPRTPRNANF
>MS1984 aspA, AspA protein
MAATRKEVDLLGEREVPADAYWGIHTLRAVENFNISKVTISDVPEFVKGM
VMVKKATALANGELGAIPADIAKAIVAACDEILTTGKCLDQFPSDVYQGG
AGTSVNMNTNEVVANLALEKIGHQKGEYNVINPMDHVNASQSTNDAYPTG
FRIAVYNSILKLMDKIQYLHDGFDNKAKEFANILKMGRTQLQDAVPMTVG
QEFKAFAVLLEEEVRNLKHAADLLLEVNLGATAIGTGLNTPAGYSELAVK
RLAEVTGLPCVKASNLIEATSDCGSYVMVHGALKRTAVKLSKICNDLRLL
SSGPRAGLNEINLPELQAGSSIMPAKVNPVVPEVVNQVCFKVMGNDTTVT
FAAEAGQLQLNVMEPVIGQAMFESIDILANACVNLRDKCIDGITVNKEIC
ENYVLNSIGIVTYLNPFIGHHNGDIVGKICAQTGRSVRDVVLEKGLLTEA
ELDDILSVENLMNPTYKAKLSK
>MS0708 aspS, AspS protein
MMRSHYCGGLNRENIGQEVTLSGWVHRRRDLGGLIFIDMRDREGIVQVCF
DPKYQQALTRAASLRNEFCIQIKGEVIARPDNQINKNMATGEVEVLAKEL
SVYNAADVLPLDFNQNNTEEQRLKYRYLDLRRPEMAQRLKTRAKITSFVR
RFMDDNGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQ
LLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFLTAPEVREIME
KMIHGLWLNTINVDLGKFPVMTWTEAMQRFGSDKPDLRNPLEITDVADIV
KDVDFKVFSGPANDPNGRVAVIRVPNGASVTRKQIDEYTQFVGIYGAKGL
AWLKVNDVNAGLEGVQSPIAKFLTEEKIKAIFDRTSAQTGDILFFGADKW
QTATDALGALRLKLGRDLALTQLDQWAPLWVIDFPMFERDEEGNLAAMHH
PFTSPKDFSPEQLEADPTGAVANAYDMVINGYEVGGGSVRIFDPKMQQTV
FRILGIDEQQQREKFGFLLDALKFGTPPHAGLAFGLDRLTMLLTGTDNIR
DVIAFPKTTAAACLMTEAPSFANPQALEELSISVVKTDKE
>MS2348 atpA, AtpA protein
MQLNSTEISELIKKRIAQFDVVSEARNTGTIVSVSDGIIRIHGLSEVMQG
EMIALPTGRFAMALNLERDSVGAVVMGPYTDLAEGMEVQCTGRILEVPVG
RGLLGRVVNTLGQPIDGKGEIKNDGFSPVEVIAPGVIDRKSVDQPVQTGY
KAVDSMVPIGRGQRELIIGDRQTGKTALAIDAIINQRDSGVKCIYVAVGQ
KASTIANVVRKLEENGALANTIVVAASASESAALQYLAPYAGCAMGEYFR
DRGEDALIVYDDLSKQAVAYRQISLLLRRPPGREAYPGDVFYLHSRLLER
AARVNEEYVENFTKGEVKGKTGSLTALPIIETQAGDVSAFVPTNVISITD
GQIFLESNLFNAGVRPAVNPGISVSRVGGAAQTKAVKKLAGGIRTALAQY
RELAAFAQFASDLDDATRKQLSHGEKVTELLKQKQYAPLSVAEQAVILFA
VEFGYLDDVELNKIADFETALLDYANRTNTEFMQELTKSGDYNDEIKNTL
KGILDNFKANNTW
>MS2352 atpB, AtpB protein
MSGQTTSEYIGHHLQFLKTGDSFWNVHIDTLFFSVLAAIIFLAVFRSVAK
KATSGVPGKLQCMVEILVEWINGIVKENFHGPRNVVAPLALTIFCWVFIM
NAIDLIPVDFLPQLAGLFGIHYLRAVPTADISATLGMSLCVFALILFYTV
KSKGFGGLVKEYTLHPFNHWSLIPVNFVLESVTLLAKPISLAFRLFGNMY
AGELIFILIAVMYSANAAIAALGIPLHLAWAIFHILIVTLQAFIFMMLTV
VYLSIAYNKAEH
>MS2345 atpC, AtpC protein
MTTFNLTIVSAENKIFEGAVKSVQATGIEGELGILAGHTPLLTAIKPGIV
KFTYNDGIEEVIYVSGGFLEIQPNIVTVLADVAIRGSDLDQDRILAAKKK
AEDNIVAKSGDLNHEMLTAKLSKELAKLRAYELTEKLVKNKR
>MS2346 atpD, AtpD protein
MSAGKIVQIIGAVIDVEFPENAVPKVYDALKVAEGGLTLEVQQQLGGGIV
RCIAMGSSDGLKRGLSVSNTGKPISVPVGTKTLGRIMNVLGEPVDEQGPI
GAEEEWAIHREAPSYEEQSNSTELLETGIKVIDLICPFAKGGKVGLFGGA
GVGKTVNMMELIRNIAIEHSGFSVFAGVGERTREGNDFYHEMTDSNVLDK
VSLVYGQMNEPPGNRLRVALTGLTMAEKFRDEGRDVLFFVDNIYRYTLAG
TEVSALLGRMPSAVGYQPTLAEEMGVLQERITSTKTGSITSVQAVYVPAD
DLTDPSPATTFAHLDSTVVLSRNIASLGIYPAVDPLDSTSRQLDPQVVGQ
EHYDVARGVQGILQRYKELKDIIAILGMDELSEDDKLVVARARKIERFLS
QPFFVAEVFTGSPGKYVSLKDTIRGFKGILEGEYDHIPEQAFYMVGSIEE
VVEKAKNM
>MS2351 atpE, AtpE protein
MENIMESVITATIIGASILLAFAALGTAIGFAILGGKFLESSARQPELAS
SLQTKMFIVAGLLDAIAMIAVGISLLFIFANPFIGLLQ
>MS2350 atpF, AtpF protein
MNLNATLIGQLIAFALFTWFCVKFVWPPIIKAIEERQSSIANALASAEKA
KQDQADSQAAVEQEILAAKEEAQKIIDLANKRRNDILEEVKTEAENLKAT
IIAQGHAEVEAERKRVQEELRVKVASLAIAGAEKIVGRTVDEAANNDIID
KLVAEL
>MS2347 atpG, AtpG protein
MASGKEIKTKIASVQSTQKITKAMEMVATSKMRKTQDRMAASRPYSETIR
NVISHVSKASIGYKHPFLVEREVKKVGMLVISTDRGMCGGLNINLFKTVL
NEIKKWKEQGITVEVGVIGSKGIAFFRSLGLKIRAQHSGMGDNPSVEELL
GIANDMFDAYKDGKIDALYLAHNQFINTMSQKPSFAQLVPLPELDTDNLG
ERQQAWDYIYEPDPKMLLDSLLTRYLESQVYQSVVDNLASEQAARMVAMK
AATDNAGNLINDLQLVYNKARQASITNELNEIVAGAAAI
>MS2349 atpH, AtpH protein
MQNYKKVELMSELTTIARPYAKAAFDFAVEQSATDKSAVEKWTEMLGFAA
QVADNEQIRDFFANTFSVQKAADAMVSICGEQLDQYGQNLIRLMAENKRL
TVLPAVFDEFQRYVEEHNATAEVQVISAQPLNATQEQKIAAAMEKRLARK
VKLNCSVDNSLLAGVIIRTDDFVIDGSSRGQLNRLANELQ
>MS1797 avtA, AvtA protein
MELFPKSNKLEHVCYDIRGPVHKAALRLEEEGHKILKLNIGNPAPFGFEA
PDEILIDVIRNLPTAQGYCDSKGLYSARKAIVQYYQSKGIHGATVNDVYI
GNGASELITMAMQALLNDGDEVLVPMPDYPLWTAAVTLAGGKAVHYLCDE
EQDWFPAIDDIKSKITSRTKAIVIINPNNPTGAVYSKELLLEIAEIARQN
GLLIFSDEIYDKILYDGAVHHHIAGLAPDLLTITMNGLSKAYRICGFRQG
WMILNGPKDKARGYIEGLDMIASMRLCANVPMQHAIQTALGGYQSINELI
VPGGRLYEQRNRAYELLNQIPGVSCVKPMGALYMFPKIDIKKFNIYDDEK
LVLDLLAQEKVLLVHGRGFNWHAPDHFRIVTLPYVHQIEEALNKFARFME
NYHQ
>MS1248 avtA, AvtA protein
MRYDKMSPFIVMDIVREAAKYPNAIHFEIGQPDLAPSEKVKKALQSAVEN
NKFSYTESLGLLALREKICQYYDRTYHVKITPNRVLLTPGTSGAFLIAYA
LTLAQDDKLGLTDPSYPCYKNFAYMMDIQPEFMPVDKHNCYQLEVGQLKG
RNIKALQISSPANPTGNIYTAESLKSLNDYCMENHIDFISDELYHGLVYD
QNAATALQFNPRAYVINGFSKYYCMPGMRLGWIIVPEDKVREAEIIAQNI
FISAPTLSQYAALEAFEEEFLTATKQVFQQRRDFLYDALKDLFTIEFKPQ
GAFYLWADVSKYTDDSYQFAKKMLHEIQVAATPGIDFGENGTKHYLRFAY
TRDIEHLREGVERMKQWLKNK
>MS0764 azlC, AzlC protein
MSEIVSKTPVRDAAKAAFPYSAPMIAGFIFLGIAYGLYMKQLGFGVLFPV
FMALLIYAGSVEFIVAAALVAPFSPLNVFLICLMVSGRQIFYGISMLEKY
GGHLGKKRWYLITSLVDEAFSLNYMAKIPSYIDKGWYMFFVSLYLQIYWV
MGAGIGNLFGAMLPFDLKGIEFAMTALFIIIFAENWLKEKSHESSLLGLG
ITLTSLIIVGKEQFLIPSLLGIWIMLTLSRPKLSSKLKRIE
>MS0765 azlD, AzlD protein
MTLTEQIITVGMGILGVHICRVLPFLIFPPNRPIPEYIRYLGKVLPAAMF
GMLVIYCYKNVDIFSGFHGFPEFLAGLITLALHLWKKNMFLSMAVGTGLY
MFLVQAVFVN
>MS2286 baeS, BaeS protein
MNAIITLIYSWFIMLCAYFSVWAISDYLLGNSLLALLFLPFALRLGINLH
TPKIFWLVSYCAEFCILSLLMYASPNEYYLPAIILSIASLPVTFIGQKYY
QGNEARKLAVQGVIAVFVSLLNGIVSFSLSISFFYTFLTSLTGMLLIMPA
CFLGYDYLFKRKWIPLTASLVHKPISLRAKHILIYILLFLLNIFIQVDIP
EEFHRFALFFLAIPIILLAYHYGWQGALLGTLLNSIALIASTGSFSRGEL
TDLFLSISAQTVTGIFLGLAIQRQRDLNNSLMVELNRNRTLTRQLINTEE
SIRKEISRELHDEIGQNITAIRTQASIMKRLETSPKIEKVGSMIEQLSLN
IYDTTKGLLNRIRPKMLDDLELQQALQNLFLELDLENHGISTALFWENKQ
NEPLDHIQEITLYRLCQEGLNNIVKYSHASQVIISVLIRKDIELIIQDNG
DGFNPETVKSGFGLQGMKERVDILCGKFQLISKERSVSPQQHGTTIKITL
PRL
>MS1244 baeS, BaeS protein
MHEQQRERNMRGRFARHLAPLRAPMSYDDDALAFAIFTRSGEMLVSDDAN
GAKFQFAPDRGFTETKLSEGNESWRIFWLPSKDKNLIIAVGQEMDYRNNL
INNFVLGQMWIWIASLPLLIGLIIFVIHHELRLLNRVSAEVRERSPEDNH
LIDTADMPTEVLPLLQSLNGYFARTAETFRRERRFTSDAAHELRSPLAAL
RIQTEVAQMLTDEPELQTQALDNLTKGIDRASQLIEQLLILSRLDNLSQL
NELEPIYWEQLIPAVISEQYSHAQQRNIEIKFDRKALPQVKKGQPLLVSL
MLRNLIDNCIKYCPEGSVIQVNLNEDTVVIEDNGYGVSDEDINKLGQRFY
RPAGQNEKGSGLGLSIVHRIAELHHYEFIVENIKDQSGKCIGFRSIIKLN
>MS2288 baeS, BaeS protein
MKSSSKKGLLMKSLLSFKRLTKHSVTTFIAHYLSLIIILAGIITTFSFAI
MGSNKSDAELINVAGSLRMQSYRLIYTMEYEPEKVDIGLRQYRISLHSNP
LVTIHHHLLTPGDVKQSYSDLVRRWQEMENLARTNQQEQYRNQITSYVNQ
LDQFVYSLQRFAEKKVIIAVLVIILSMLLIVGIASYLIWHTHQEIVKPLN
QLMRASTQIEMRQFQHILLDTKRDDELGRLAKSFTHMSNELHKLYANLEE
KVTEKTQKINQVNRSLAMLYYCSQELSASDLNRNKLLHVLKHVMATEHLR
AFELDLIELRQWNITLGEPSVALSWQEQTIGSEDNKLGSLRWQAGLPCPD
TRTMENIAQLLGRTLYFNQTQRQQQQLLLMEERSIIARELHDSLAQVLAF
LQIQLTLLKHNLNKDDDKAKRQSLLIIKDFEQALSDGYIQLRELLATFRL
TVQEANLKLALEQVVDSLRNQTDIQMTVDCSLPSQTFNPQQLIHALQIVR
ESTLNAIKHSKADLIEVIAHTNEEGEQELIVRDNGVGIASLNEPDGHYGL
NIMQERSQQLNAKLTISNRATGGTEVKITLPNTLA
>MS1245 baeS, BaeS protein
MNLLKMNSIRLRLIVILSFIALVIWGVTSALNWHYVRQEVNNMFDIQQYL
LAKRLSSSYCNRFCMNSSASVI
>MS1914 baeS, BaeS protein
MIIEKLSNHLQSITTQIFAIFWFTFTLLLLLAFFIPSLDNRVYSALTSEQ
LETYQKQIVTSIRTNQISRLLVAPAKFAIDSTAPIRPILMDSNKRIIGAL
PDELPTVQQFVLQSANVSAPMVRNFNNIQLAGPFIVHLNTPENEPFLLYF
VKTINPQKEGVNFIFDHPALIVFLIMLFSSPILWWLAWRIAHPLRRLQHF
AGLVSKGDFTLHKELEESGVYELRQLSKSLNQMTESLDNLLSTRQALLYS
ISHELRTPLTRLQLAVALIRRRQGESRELTRIETEAERLDQMINDLLQLS
RNQLKSELERERFPITEIWQDVLEDTKFEAKQRNIHFKATCKIPDVAKYA
INGNRSALASALENILRNALKYTNTSIEATFTLENNYLRIDIDDDGIGVP
ESEYSKIFTPFYRLDTARTRSTGGTGLGLAIVLNIIKQHHGEIGANKSRL
KGLCITMKLPLDK
>MS1730 baeS, BaeS protein
MKNVKFFAQRYIDWVTKLGRLKFSLLGFILIAILALCTHIFLSLMITGQI
HWESLLYSVVFGVISAPFVIYFFALLVERLELSRQNLTNLVGELQQEIRE
RTTAEQRLAQAIRDKTTLMATISHELRTPLNGIVGLSRILLDSKLTEEQY
NYLKTINVSAVSLGHIFNDIIDLEKLDGSRIELYKKETDFHALITDVYNV
AQLMAEQKHLKFILQVDKDLPNWLLLDYTRLSQVLWNLISNAVKFTDKGT
VTLKISRLSENRYAFAISDTGPGIPENELNKIFTMYYQVKANFNKHKAAG
SGIGLAISKSIARLMNGDLVVESEIGKGSTFILTIQADEVSKPVSDGTAD
LDLSLSILLVEDIELNIIVAKSLLEKLGHQVDTAMTGQEALTKFERNNYD
LVLLDIQLPDMTGFDIAKILRTKYEDGVYDYLPPLIALTANVMQNKSDYQ
KQGMDDVLRKPLSLDSLNQCLSEYFGDEIGVSSAQNSVMTKAAELPDDFD
YPLLDDLVEMLGASFVLKNLALFKQTMPEYIDELLTIYQNYQKDKEKKKD
VAACMHKIKGAAASVGLKHIQLLAEKGQHDEADIWRENIKRWIDEIEQSW
FEDVTKLEHWLAKK
>MS0264 bcp, Bcp protein
MNTLKIGDFAPHFSLSNQHNETVSLTDFQGKKVLVYFYPKALTPGCTTQA
CGLRDAKAELDKSGVVILGISPDSPKKLAQFAEKKELNFTLLSDENHQVA
EQFGVWGEKKFMGKIYDGIHRISFLIDEKGVIEQVFTKFKTGEHHQVVLD
YLTQNS
>MS0227 betT, BetT protein
MFIEDNRPNNMEHLMSLYQQLRATSTLKAPIFMPTVIFVLLVTVFCSVFP
EQAQITLNTVKQSIFTHFSWFYILAGSIFFLFLIFLCGSRLGDIRLGADN
DEPEFSFTSWIAMLFAAGMGIGLMYFGVAEPVLHYVKPVQENLTEAERMK
EAMMTTFYHWGIHAWAIYAVIALALAYFGFRYKLPLTVRSGFYPLLKNNL
SGFWGHLIDIVALCSTIFGLTTTLGYGAMQVNAGFNNLGLIDSNSFVVLA
VIMIVSMMLAVISAISGVGKGVKILSETNLVLGGLLLIFVIIAGPTLWLF
SGLTENLGYYFSSLLELSFRTFAYEPEHQSWLSGWTILYWSWWASWAPFV
GMFIAKISKGRTIREFILGVLFVPSLFNILWMTSFGGSAIWIDQQTHGAL
AAISDNTEALLFGFFDQLPFGQIASVIALLVISIFFITSADSGIFVLNNM
ASQGSGKAPKWQSVFWGALLAVLGLSLLYSGGLASLQSMTLIIALPFMAI
MLVLCFGLWKGLMVDTQYSSKKFTQGSVLWTGENWKERLEKIVNPTDRKD
IRRFLNQIARPAFNDLVKEFLEHGLNAQMNFIDGKNPKIEFEVVNENLRN
FLYGIRLQSRQLSDLVVDDDNLPNLEESKIYEPITYFFDGREGYDVQYMT
QEELIVDVLKQYERFMNLAMDKSHNLMTADVENMAE
>MS2104 bfr, Bfr protein
MNKMATEKQIEILKGMAKAWFGNSQQHSIHAEIIRQKGFSKLADKIQAEA
EGEWKEAQRVNARLLELGVTPTLAINNYPIITDIREQLEYDYNEGLKGMA
ELNAMIADFADDYITRRMIEEFIVDEQEHTNWLAEHIGLIEEIGYQNYLI
QQL
>MS0396 bglX, BglX protein
MSTLLIDLKGQELLAEEAELLAHPLVAGLILFTRNFYDRSQIQALIKDIR
RRVKKPLLITVDQEGGRVQRFREGFTQLPAMQSFAAMISDPALQLTTAKE
AGWLMAAEMTALDIDLSFAPVLDLGHECKAIGDRSFCEEVEPAVRLASAF
IDGMHQAGMATTGKHFPGHGHVLADSHLETPYDERPSAVIFERDIQPFQQ
LIAQNKLDAVMPAHVIYRHCDSQPASGSKYWLQDILRQKLGFDGTVFSDD
LGMKGAGFMGDFVARSEKALSAGCDLLLLCNEPEGVVQVLDNLKLEENPP
HFAARQRRLQSLFKKKAFSWNELTKTRRWLENSKKLTALQQSWLDSK
>MS1523 bioB, BioB protein
MKTTLQLYSNTPHPQVEYWSVCKVEALFETPFLELVHRAALVHRENFNPQ
AIQLSTLMSIKTGGCPEDCGYCPQSARYQTGVQKQELLNVEDIVEKAKIA
KSRGASRFCMGAAWRGPKPKDIAKMTEIIKAVKALGLETCGTFGLLDDGM
AEDLKEAGLDYYNHNLDTSPEHYNKVIGTRGFDDRLNTLGKVRKAGLKVC
CGGIIGMNESRKERAGFIASLANLDPQPESVPINQLVKVEGTPLSDAQEL
DWTEFVRTVAVARITMPKSYVRLSAGRQGMSEEMQAMCFMAGANSIFYGD
KLLVTGNAEEDCDRLLMEKLDLEPETTENRYLSQNN
>MS1006 bioD, BioD protein
MSVFFVTGTDTSVGKTIVSRAIIQAMQNAGIQIVGYKPLACGQDDPVYTD
VQESGQTDYDNMDNRDVLVLQDSTNEEVSYQDINSYTFAHTMPMLSQEGK
HIDINKINTDLKRLSSRYQSVLVEGSFGWLTPINQDYTFASWAAEHKMPV
VLVVGIKEGCMNHALLTVQSIEQMGLPLLGWVANRINPMLGHYAEIIEDL
SKRIKAPLLGKIPYMHKPETQELGHYITDIDRLSYMKTEILK
>MS0775 birA, BirA protein
MFYVDVKKYCGRIIQGWQSFGNGFFKKNFYFFELYDKKTSYIFSGKFMSS
LLEILADGQPKTFKKLTALLSLSQAQLLDETERLQTLGIQIKASPQTLQL
IPQLDLLDGARLSKALFPHRVVIQPVIDSTNQYILNHLAELKKGDLCLSE
HQTAGRGRRGRQWLSPFAGQLILSIYWTLNARKPLDGLSLVIGMAIADAI
KSAGGKEINLKWPNDLLLNGRKLAGILIEIANRQQDQLNLVIGIGINLSL
PKLKAQIDQPWAELCEILPQLDRNELLIRVVKHLYLYLAAFEREGINAVF
REKWAETDYYFNKEVNIITEKQTITGINQGIDENGYILIKTKNGELLKFN
GGEVSLRKPA
>MS0892 bisC, BisC protein
MQVTRRKFFKICAGGMAGTSVAALGLMPTAALAAPREYKLLRAKETRQSC
TYCAVGCGMLMYSIGDGAMNSRGKLTHVEGDPDHPVSRGALCPKGAGVLD
FVNSPNRIQYPEYRAPGSDKWERISWHDAIHKIAKLLKDDRDANWESANE
EGTPVNRWLTTGFLTASAASNETALISQKWARAFGLLVLDNQAST
>MS1030 bisC, BisC protein
MFATHLLGVIMQVSRRKFFKICAGGMASSSAAMLGFMPTQALAAPREYKL
IHAKVARNNCTYCAVGCGMLMYSLGDGAKNARGKLFHVEGDPDHPVSRGS
LCPKGAGVLDFVNSPNRLKYPEYRAPGSDKWVRLSWEDAIHRIAKLMKED
RDAYFEEKNAQDTTVNRWLTTTMFCSSATSNETGILTHRWARSLGMVTIN
NQAATCHGPTVPALAATFGRGAMTNHWVDIKNANLVIVMGANTAEAHPVG
FKWVIEAKKNGAKLMVVDPRFNRTAAVSDFFAQIRAGSDIAFLLGVIRYL
LEHDAIQHEYVKHYTNAALIVADDFEFNDGLFSGFDESTAQYDRTSWAYA
TDESGQPLRDLTMQHPHCVLNLLKKHVERYTAETVENITGVKQATFNQFC
ETLAETASPNKTATFLYALGWTQHTVGAQNIRAMAMIQLLLGNIGMAGGG
VNALRGHSNVQGASDMGLTPVGLPGYLQLPNEKDVSLEKYLERVTPKTLV
QGQTNFLQNTPKFVVSLLKSFYGDNATAENEWGFHYLPKYDQVYDQLKMI
EMMNEGQINGFLCQGFNPVSSLPNKNKVVSALSKLKYCVVFDPTETTTSN
FWQNHGEYNDVNPAEIQTEVFRLPTVCFAEEDGSIANSGRWLQWHYKAAE
PPAEAKPDVDILAEIREAILEMYEKEGGRGLEPLKATAWDYVNPLEPKAE
ELAKQNNGYALADLYDTAGNLIAKKGELLSNFGQLRDDGTTACSAWIYTG
QWTEKGNQMDNRDNSDPSGLGNTLGWAFAWPANRRIVYNRASADLTGKPW
DPKRQLVKWNGKNWNYIDIADFGTAPPNSEIMPFIMQNDGLGGLFCLNRL
ADGPFPEHYEPMETPIGTNPLHPNVISSPVARVMENDKPNFGTSNEFPYV
GTTYSLTEHFHAWTAQVQLSMITQPEAFAEISEELAQEKGIKQGDVVKVH
SKRGYIKMKAVVTKRIKPLTVNGQTVHTIGFPIHWGFSGVGKKTFVTNTL
TPPVGEVNSLTPEYKAFLVNIEKTTEAL
>MS2281 bisC, BisC protein
MGNIMELNRRDFMKANAAVAAAAAAGITIPVKNVHAADDDMGIRWDKAPC
RYCGTGCSVLVGTKDGRVVATQGDPDAEVNRGLNCIKGYFLSKIMYGADR
VQTPLLRMKDGKFHKEGDFTPVSWDQAFTVMADKIKAILKEKKDPNAIGM
FSSGQTTIFEGYAKVKLWKAGLRSNTIDPNARHCMASAAVAFLRTFGMDE
PMGCYNDIEKTDAFVLWGSNMAEMHPILWSRISDRRLSSDKVKVVVMSTF
EHRSFELADTPIIFKPHSDLAILNYIANYIIQNDKVNWDFVNKHTKFKRG
ETDIGYGLRPEHPLEVAAKNRKTAGKMYDSDFEEFKKIVAPYTLEEAHRI
SGVPKDQLETLAKMYADPQQNLVSFWTMGFNQHTRGVWVNHMVYNVHLLT
GKISKPGCGPFSLTGQPSACGTAREVGTFVHRLPADMVVTNPKHVEIAEN
IWKLPKGTISNKPGFPAVQQSRALKDGKLNFLWQLCTNNMQGGPNINEEI
FPGWRNPDNLIVVSDPYPSASAVAADLILPTCMWVEKEGAYGNAERRTQF
WRQQVKGPGQSRSDLWQIVEFSKYFKTEEVWSEELLAQMPEYRGKTLYEV
LYLNGEVNKFQTPTNVPGYINDEAEDFGFYLQKGLFEEYASFGRGHGHDL
ADFDTYHQVRGLRWPVVDGKETLWRYREGYDPYVKAGEEVSFYGYPDKKA
IILGVPYEAPAESPDEEYDLWLCTGRVLEHWHTGTMTRRVPELHKAFPNN
LCWMHPTDAKKRGLRHGDKVKLITRRGEMISHLDTRGRNKCPEGLIYTTF
FDAGQLANKLTLDATDPISGETDYKKCAVKVVKA
>MS0588 bisC, BisC protein
MKKVNNSRRNFLKSSSLGFAGASMATATTGGITGLLSVTANAAETNSKTV
VTAAHWGPLGVVVENGKVVKSGPAIAAPIENELQSVVADQLYSEARVKYP
MVRKGYLDGNQDRSLRGHDTWVRISWEQAFDLVAKEMKRVRETYGASGIF
AGSYGWYSSGALHAARTLLHRYMNITGGFVGTKGDYSTGAAQVIMPHVLG
TIEVYEQQTSWEVILESSDTIVLWGANPLATMRIAWTSTDQKGLEYFKKF
KETGKRIICIDPVRSESCEYLGAEWIPINTGTDVPLMLGIAHTLVNENKH
DKEFLKNYTTGYDKFEEYLLGKIDNQPKTAEWAEKICGVPAQTIKQLAAD
FSAKRTMLMGGWGMQRQRHGEQSHWMMVTLASMLGQIGLPGGGFGLSYHY
SNGGVPTARGGILGSITANPSTQAGAKTWLDDVSKFSFPLARISDALLNP
GKTIQYNGTEVTYPDIKLIYWAGGNPFVHHQDTNTMVKAWQKPETIIVNE
VNWTPTARMADIVLPATTSYERNDLTMSGDYSMMNIFPMKQVVEPQFEAK
SDYDIFAELAKRAGVEEQFTEGKTEMQWLKGFYETAFNAARANRVLMPKF
DDFWNENKPITFNPTDSAKKWVRYAEFREDPLLNPLGTPSGKIEIYSNTI
AKMNYDDCKGYPSWMEPEEFAGNVTAEEPLALVTPHPYYRLHSQLAHTSL
REKYAVKDREPVLIHKDDAAARGIANGDIVRVFNKRGQVLTGAVVTDGVI
KGTVAIHEGAWYDPLDLGQTERPLCKNGCVNVLTRDEGTSKLAQGNSPNT
CIVQVEKYTGEVPEVTVFKQPKIA
>MS2337 bisC, BisC protein
MLYGKFQRISWEEALDTIADNLKRIVKDYGNEAVYNNYATGIVGY
>MS0891 bisC, BisC protein
MTNHWVDIKNANVVVVMGGNAAEAHPVGFRWAIEAKKQNGAKLMVVDPRF
NRTAAVADFYSPIRSGTDITLLSGVIKYLLDNNAIQHEYVKHYTNASFLI
NEGYGFEEGLFTGYDEAERSYDKSTWSYQLDENGQPKRDLTMQDPRCVIN
LLKKHVERYTPEMVERVCGTKQKAFLEFAETIASTAVPNRTMTILYALGW
THHSVGAQNIRAMAMIQLLLGNIGMAGGGINALRGHSNVQGTTDMGLFPS
MLPGYIPLPTETDTSLESFLNRITPKTAAEGQTNYWQNTPKFVVSMLKTF
YGENATKDNEWGFHNLPKQYKKKMDHLQYIDLMDQGKITGYLCQGYNPLA
SYPDKNKISSALRKLKFLVVMDPLKTDTSEFWQNHGEYNDVNPAEIQTEV
FRLPTVCFAEEDGSIANSGRWLQWHYKAAEPPKEAKPDVDILSEIREVML
EMYEKEGGPSIDTIKAMTWNYQNPLEPKAHEIAKESNGYALEDLYDANGN
LIAKKGELLSSFAQLRDDGTTSAANWIYSGQWTPKGNQMDNRDNSDPSGL
GNTLGWAFAWPANRRVLYNRASADLAGKPWDPKRPLIKWNGKNWNYIDIA
DFGTAPPNSNVMPFIMNNEGISRLFALDKMVDGPFPEHYEPIESPIGTNP
LHPNVISNPVARILDNDKASFGNASEFPYVGTTYRLTEHFHWWTKNADLN
MIAQPQPFVEISEDLANEKGIAQGDVVKVTSKRGYIKAKAVVTKRIKSLD
VDGKKVHTIGIPLHGGFIADGRKSFLPNALTGRVGDANTQTPEFKTFLVN
IEKTTEAL
>MS1708 bolA, BolA protein
MEITMETQEIERILKQALNLDEVYVQGENAHYGVIVVSEEIAKLSRLKQQ
QTIYAPLMDHFSSGEIHALTIKTFSPEKWKLERMLNVVN
>MS0311 bolA, BolA protein
MSKQQELTERLTRQFSPLFLQIENESHMHSSDRGGESHFKVVIVTDEFEG
KPKVVRHRMIYQFLAQDLENGIHALALHTYTPKEWQSLGKIIPKSTNCLG
AG
>MS0488 brnQ, BrnQ protein
MNKNTFIVGFTLFAIFFGAGNLIFPPKLGLESGSEFWSAITGFILSGVGL
PLLGIIVSAFYEGGYKTATTKISPWFSVIFLMAVYLSIGPFFAIPRTAAT
SYEMAILPFIGKSSSLSMLIFTLFYFAISLWFALNPSKTVSRIGAILTPI
LLFAILALVVKAFFILIDNDPSEVIFTLRESNNSFLFTGIIDGYLTMDTL
ASIAYSVIVIAAIQSKGIKHGKELTKQTLLAGIVAAIALAAIYLAIGWIG
NRVHISAETISLLQERNQDIGTYILNKITAQAFGNFGRSLLGVIVSLACL
TTAIGLIVSVSEYFNEIYHKISYKTYVIIFTLIGFIIANQGLSAVISKSV
PILLVLYPISMTIILLLSVNIFVKVPLVAQRLSIALTTLVSIGSVAGLEQ
ANNLPLKDYSMEWIPFAVTGALLGCLIHVFYKSES
>MS0793 btuC, BtuC protein
MYSAVLFSYICYLSGIGMFSYKNKIHSKLIRQLGILGMLSLVAVVAYLFY
RLPNRWEYALYHRSLSLVAIVVTGAAIALATMIFQTIVNNRILTPSILGL
DSLYLLIQTLIIFLFGSKTLLGINQTLLFFLSTGAMVMFALGLYHFLFKR
ERQNIFFLLLVGIIFGTFFQSLTTFMEVLIDPNEFQIAQDIGFASFNRIN
LDILWIALGILLVVIFYTCRYLRYFDVLALGRDQAINLGVDYQAVTRRLL
IIVAILTSVSTALVGPLTFLGLLVMNVTFEFIRGYQHKILIPAAMLISVM
TLVMGQVLVSQVFTFNTTLSIIINFTGGVYFIYLLLRANKKWQ
>MS1013 btuC, BtuC protein
MDTQRIDIKSVPASIRLFNNKMNKLNMLLGILLTLLITLAATHRLGDFSA
LLNPDGVLTDMRSLVLWEIRLPRILLALLTGAGLALAGNAMQGIFQNPLA
SPGLLGSANGATTASVFILYYFSAPFTILLCGGVLGALLSFLLVYLMAKN
RGSTMMILSGVAINMLLGSLIALLLSNAESPWALAELYRWLQGSLVWAKT
DTLLMCLPIVLAGLFCLYSQRRYLDLLTFGEETAATMGVDPKRSFFITTL
GVALLVGATIPQTGTIGFIGLIAPHLARMMLKKRPSQLYLTSALFGALLL
LIADLCILYIPLFSHIYIGTLTAIIGAPFLIWILLAQQKMLAK
>MS1203 btuC, BtuC protein
MVKKLNIALFALAVLFFTWLSVVQLNNNDALAYLLFANYTLPRVFMAILA
GCALGIASSLLQQVINNPLASDNTLAVSSGAQFSLFLVAIFVPNWLGAGS
MFIALIGALVALALVFLLAWRRTISPLLMILAGLVVNLYLASFSAVLMLF
YPEESRGLLLWGAGSLVQESWYDSLQLLWQFTIALILIFIFAKPLQILTL
NDNNAKSLGVPVNLIRFLGLVISAFLVAIVVSRVGMLGFVGLAASSIVRQ
FSTTNLLKRLILSAYMAAMLLLLTDLTLQLFAYYRQIELPTGAVTALLGT
PLLLWLMFNISNNGRLVSQDESLSLGKQPVKAAGVIISLLLLLSILCALF
IGKNASGWYWDSVMLTLRYPRLLVAMAAGIMLAVAGTLLQRLSHNPMASP
ELLGITSGTAFGILTVIFFVATPTRGQFWFAGILGGFLVLLFIMLINQRN
QLLPEKILLTGISIAALFDALQRIVLAGGDYKWQQLLAWTSGSTYHATPQ
LATGFLSIAVLLFLLALPLDRWLALLALQTPVAQALGLDITKVRWILIIF
SAFLTALSTLLVGPLSFIGLLAPHLAHFCGWHKPKAQLIGAVLLGTLVMT
IADWLGRQLLFPYEIPAGLVATLIGGAYFLFMMRKI
>MS0792 btuC, BtuC protein
MIHRRYLILLLLLLSIISLFLGVSSVNLKGLLYFNSEQWQILLISRVPRL
ISILIAGSALSICGLVMQQLSRNRFVSPTTAGTMDSARLGILISMLVFPT
ASMLFKTVIAIVVSFLGTLLFMTILSRLKFKDSIFVPLVGIMFGNIISSV
TAFIAYQQDILQNLSGWLQGDFSLIMSGRYEILYFSIPALITAYLFANRF
SIVGMGQDFAVNLGLNYQQVLYLGLAIVATVSSIIIVSVGVIPFLGLIIP
NLVTLYLGDNLKKILSHTALLGAVFVLFCDIFGRIVIYPYEIAINAVVGV
FGSAIFLYLLFKRYRHV
>MS1113 cDA1, CDA1 protein
MGLFMFKHCMRLVTYLSTSLLFAVQALAANNHFGILCYHNIIDESVQSEK
YYPQTISAQKLISQFNWLRTNGYIPVSMQQILDARNGGKALPEKSVLLTF
DDGYQSFYTVIYPLLKAYNYPAVYAIVTDWIETPANKKVTYGDEKLDRKE
FVTWQQLREMKDSGLVEIASHTHDLHHGVKANPAGSNVPAVITPAYINGK
YETESQYEARLRKDFQRSFSLLKQHLGAAPAAMIWPYGRFNEKAAAIAEE
AGFKVHMSLVDTINNTPDQFHLGRLLLDNETSINTIENYLKNKNKDVLVQ
RSLRIKLDDVYDPNPAQQSKNLDALIERIYRQDIERVYIQAFSDTDNDGV
ADALYFYNQQLPVRADLFSRVVWIIKTRLGKAVYAWMPISAFKGKNNTQQ
IKSIYRDLALYSKINGILFDDNLSSDNKFTDLKPLDAASLRLTDELKDIV
YPYPLGGREDFATMRMISAPVNMSDESEKQFNQNLAELNRHYDAVIVSAA
PYVKGSELTQSGARNWLGNIIKKTVPQVAKDRLAFELQTVDWRTQQAITD
DELIDWMRDIQTKYHFYNFGYYPDNFQENQPKLNEIRPHFSINTNLGLK
>MS0939 cDC9, CDC9 protein
MMLLENYKNQDITGWVMSEKLDGVRGYWDGKQLISRQGGVLAAPDYFLEN
FPPFPIDGELFSQRDQFAEISSITRSQQDKGWHKLKLYVFDVPEAPGDLF
TRLATLKNYLKTNRTSYIEIIEQIPIRDKNHVRQFLQQVETQKGEGVVLR
NPNAPYENKRSTQILKLKSHLDEECTVIAHHKGKGQFANALGALTCKNQR
GKFRIGSGFTLEDRVNPPAVGSVITYKYRGLTKTGKPRFATYWRKREDLQ
ETP
>MS0778 cafA, CafA protein
MNSVELLVNVTPNETRIALVDTGILKEVHIERQAKRGIVGNIYKGRVTRV
LPGMQSAFVDIGLEKAAFLHASDIVSHTECVDESEQKQFIVKDIAELVRE
GQDIVVQVVKDPLGTKGARLTTDITLPSRYLVFMPENSHVGVSQRIESED
ERARLKALVEPYCDELGGFIIRTAAEGATEDELKQDADFLKRLWRKVMER
KAKYPTRSMLYGELALALRVLRDFVGAGIEKIRIDSKLCFTEVNEFCEEF
MPELVDKLVLYSGNQPLFDVYGVETAIQIALDKRVNLKSGGYLIIEQTEA
MTTIDINTGAFVGHRNLEETIFNTNIEATQAIAQQLQLRNLGGIIIIDFI
DMQTDEHRNRVIQSLEEALSKDRVKTNVNGFTQLGLVEMTRKRTRESLEH
VLCGECPACQGRGHVKTVETVCYEIMREIIRVHHLFSSEQFVVYASRAVS
EYLINEESHGLIAELEVFIGKQVQVKTEVYYNQDQFDVVVM
>MS1622 cafA, CafA protein
MKRMLINATQQEELRVALVDGQRLFDLDIESPGHEQKKANIYKGKITRVE
PSLEAAFVDYGAERHGFLPLKEIAREYFPDDYVYQGRPNIKDIIKEGQEV
IVQVNKEERGNKGAALTTFVSLAGSYLVIMPNNPRAGGISRRIEGDERLE
LKDALSSLDVPEGVGLIVRTAGVGKSSEELQWDLKVLLHHWEAIKQASQS
RPAPFLIHQESDVIVRAIRDYLRRDIGEILIDSPKVYEKAKAHIKLVRPD
FISRIKLYQGEVPLFSHYQIESQIESAFQREVRLPSGGSIVIDVTEALTA
IDINSARSTRGGDIEETALNTNLEAADEIARQLRLRDLGGLIVIDFIDMT
PVRHQREVENRMREAVRQDRARIQISRISRFGLLEMSRQRISPSLGESSH
HICPRCQGTGKVRDNESLSLSILRLLEEEALKENTKQVHTVVPVNIASYL
LNEKRKAIHDIEKRHNVEILVVPNKEMETPHFSVFRVRDGEEINTLSYNL
VKVYEEQETTFIADEPFTTRVTETAAVTTENVLESAALSMTISEPAPTIE
VKKEEQPSLFVRIVAAIKGLFASEPKVEEKVEEAPQPNTRNRRNNQEHRN
SRRNRNERNNRGNNEEPADKVQSEKSAEKAERPARSERTRRNNRNRNAAA
DDSLNNESVIEAVTNETTDDEAKAPQQRRQRRDLRKRVRVTEEQVTVAAE
PVDKPSLAESVPVEPVVEENVYQERDRQDNRRRLPRHLRVNNQRRRRNAE
QVSAMPLFAAVASPELASGKVWIESPTAPAKPKESAFLSVDELLEQQSEV
KQPGVTTPAPATQVIFDKADNDIAPLASFVTQPANESVQKKVQESLDRLE
QTNGQQTEVKEDDNASNMTLSDVTATESAVEKTEVLNLSNYRFSGRLGTI
SAVKHTKAEMTLAKAADEVLPPFEIVQWQDSRYYFHGKGSAGHNSAVSHV
FTAATQAKAE
>MS1805 cah, Cah protein
MMKKLLSTAAALLVSGFILSGCSTTEKHWGYTGDVSPEYWGGLSDKFKTC
AVGQKQSPVNIQVQKATDKDLPALNINYLASKATVVNNGHSIQTDLTDEN
STLTINGKVYTLKQFHFHSPSENTIDGQYLPLEGHFVHVAKDGGIVVVAV
LYEIGGENAQLADIWAGMPEKAGEKVKLKAKFNPATLISSKQSYYSFEGS
LTTPPCTEGVDWIVLKAYCSGQLI
>MS0864 caiC, CaiC protein
MISMLMFPWLNYANSAQYQNKTALRDDLQGEVFTWPQLAQRIEQTRLSLQ
RQGLSMGQGIALCGKNSLDLLCFYLAGLQLGLRVLGINPAFPVEKINRLC
ELNDISLRIDFSSSQYHCRRLKNSAKDDRTFTLTEGYTMTLTSGSTGLPK
AAVHSVNAHLANAVGVSELMRFGANDSWLLSLPLYHVSGQGILWRWLQQG
GELVLPQADFYASVIGVSHVSLVPTQLQRLLSYLAKHPNKFVCTKHILLG
GSQIPLELTRQANRLGIQCYSGYGMTEMGSTVFAKESDETAGVGLPLKGR
EYRLVDDEIWLKGAGLAEGYWIDQKIRTLTNKQGWFQTKDKGQWLNNELV
LLGRLDNMFISGGENIQPEEIENIIQGYELVNQVFILPRDDAEFGQRPVA
MIQFNLDADTENNFKSAVEKLKIWLSDKIERFKQPVAYFPLDVEKARQEG
TIKISRNLLKTELMTLLGK
>MS1358 caiC, CaiC protein
MEKGWFKNYPEGSPREIDTSEYHSILDMFDKAVREHPDRPAYINMGKVLT
FRKLEERSRAFAAYLQNELKLTRGERVALMMPNLLQYPIALFGVLRAGLV
VVNVNPLYTPRELEHQLQDSGAKAIVVVSNFASTVEQVVFNTDVKHVILT
RMGDQLSFGKRTLVNFIVKYVKKLVPKYKLPHAVTFREVLSVGKHRQFVR
PDLARDALAFLQYTGGTTGIAKGAMLSHGNIITNVFQAKWIAESFIGDRR
RERIAIIPLPLYHVFALSVNALLFVELGITAVLITNPRDVDGMVKELRKY
PFTAITGVNTLFNALLNNENFKEVDFSSLKLSVGGGMAVQQSVAQRWHDL
TGNNIIEGYGMTECSPLIAASTILTDKHDGSIGVPVPNTDIRIMRDDGDE
AELGEPGELWVKGEQVMQGYWQRPEATAEVLKDGWMATGDIVVMDKNYIM
RIVDRKKDMILVSGFNVYPNEIEDVVMLNPKVLEVVAIGVPHEVSGETIK
IFVVKKDESLTRDELRAHCRNLLTGYKVPKEIEFRDELPKTNVGKILRRV
LRDEELAKRNAQ
>MS2237 carA, CarA protein
MSEPAILVLADGSIFRGTSIGAAGHTIGEVVFNTSMTGYQEILTDPSYFK
QIVTLTYPHIGNTGTNSEDLESNGVYAAGLIIRDLPMIHSNFRANQSLSD
YLKDNNVVAIADIDTRRLTRLLRDKGAMAGCIMSGEVDEQKALELALSFG
SMAGKDLAQEVTAQQSYRWTQGEWVLGKGYAEQQNASFNVVAYDFGVKHN
ILRMLAERGCKLTVVPAKTSAEEVLALNPDGIFLSNGPGDPEPCDYAISA
IQTLLATKKPIFGICLGHQLLGLASGGKTKKMAFGHHGANHPVQDLDTQK
VMITSQNHGFEVDEHSLPANVRVTHRSLFDNSVQGIELTDQPAFSFQGHP
EASPGPHDVAYLFDKFIDAMKQAKA
>MS2236 carB, CarB protein
MAMSTKPSGASKFVYKTANNFLKVLSRENNMPKRNDINTILIIGAGPIVI
GQACEFDYSGAQACKALREEGYKVVLVNSNPATIMTDPNMADVTYIEPIH
WQTVEKIIEKERPDAILPTMGGQTALNCALDLSKNGVLKKYGVELIGATE
DAIDKAEDRGRFKEAMAKIGLNTPKSFVCHSFDEAWKAQEEVGFPTLIRP
SFTMGGSGGGIAYNRDEFQAICERGFEASPTHELLIEQSVLGWKEYEMEV
VRDKADNCIIVCSIENFDPMGVHTGDSITVAPAQTLTDKEYQIMRNASLA
VLREIGVDTGGSNVQFAINPENGEMIVIEMNPRVSRSSALASKATGFPIA
KVAAKLAVGYTLNELRNDITGGLIPASFEPSIDYVVTKVPRFAFEKFPKA
DDRLTTQMKSVGEVMAMGRTFQESIQKALRGLETGICGFNLKTEDMEKLR
HEISNPGPERLLYVADAFGIGWSIEDVHHYSKIDPWFLIQIQDLVLEELA
LEKKTLADLNKDEIYRLKRKGFSDKRIAQLVKSDETSVRSLRNAFNIHPV
YKRVDTCAGEFKSDTAYLYSTYEEECEAAPSDRKKVMILGGGPNRIGQGI
EFDYCCVHAALALRESGFETIMVNCNPETVSTDFDTSDRLYFEPLTLEDV
LEIIHVEKPWGVIVHYGGQTPLKLANALHANGVNIIGTSADSIDAAEDRE
RFQKILHDLNLKQPANRTARNTQEAVGLANEVGYPLVVRPSYVLGGRAMQ
IVYNDEELNRYMREAVSVSNDSPILLDHFLNNAIEVDVDCICDGEQVIIG
GIMQHIEQAGIHSGDSACSLPPYSLSMEIQDEIRRQTAAMARALNVVGLM
NVQFAVQNDVIYVLEVNPRASRTVPFVSKATGQPLAKIAARVMAGISLKE
QGIQGEVVPQDFYAVKEAVFPFIKFPGVDTILGPEMRSTGEVMGVGATFA
EAFLKAQIGAGERIPRTGKVFVSVDNNDKPRLLPIVKRLQEQGYGLCATF
GTAKFLRENGIAVQTVNKVREGRPHIVDAIKNDEIALIINTAGGMAESVA
DSASIRASALKQRVPLYTTIAGADAISLSVANLDIHDVYSVQGLHAGLTK
>MS1491 carB, CarB protein
MNILVTSAGQRVSLVQAFKKELSQLVSDGKVLTVDLNPELAPACYVADGH
FQVPRVTDAGYIPTLLKICEENNVKLIIPTIDTELLILSEHLQRFKEKGI
FISVSDTEFVRKCRDKRLTNQLFIEHNIAVPKQFEKGQFEYPVFVKPYNG
SLSKGIFVAEKPEDISPEQLENPELMFMQYISPAEYDEYTVDCYFDKNSE
LKSAVPRKRIFVRAGEINKGVTRKNAIVTQLSEKLSRLPGARGCLTIQVF
YKESTAEILGIEINPRFGGGYPLSYLAGANYPRWLIQEYLFNQPIPAFDD
WEADLLMLRYDAEVLAHHYEK
>MS1302 ccmA, CcmA protein
MADTLLAVQVDKLKHSYGKTTALCDLSLQIPRGKIIGLIGPDGVGKSTLL
SLIAGVKIIQSGSVTVFGLNVAEKKARDLLSHKIAFMPQGLGKNLYLTLS
IYENIDFHARLFGLPKAHRKARIERLLNATGLAPFADRAAGKLSGGMKQK
LSLCCALVHSPDLLILDEPTTGVDPLSRRQFWQLVEDLRRETPGMTVIVA
TAYIDEAEGFEQVIAMDDGKLIAYKPTKQLIAETESENLEQAYVKLLPAD
KRGSGKGLTIPPFEVDANEPPVIVAKGLTKRFGDFTSVDNVSFTIPKGEI
FGFLGSNGCGKSTTMKMLTGLLDPSEGTATLLGQPIDASNIDTRKRVGYM
SQAFSLYEELTVRENLELHAKLFQIPPAQWNTYVHSAMEQFDLADLADEK
PSSLPLGIRQRLQLAAACLHKPELLILDEPTSGVDPAARDMFWEYLIKLS
REDRITIFVTTHFMNEAARCDRISFMHRGRVLAVGTPEELRTGKNAATLE
EAFIIYLEEQADDITAPSNETGQSAVKNDEVLPPAEGLWAWWSLIWTFAV
REGKELLRDNIRLFFALLGPIIMLIAMASSISFDINPMKFAVLDHDNSSA
SRHLVEYFSGSRYFIRQADLHSVDEINSNIQSAKVKMVLEIPTDYGKKLL
NWQQPEIGVFIDGAFPSTAENLNGSVIGVLTQYQREISKHIDMSVSSTVL
LEPRFVYNQDFKSIFAMTPGIIMLAMILVPSMMTALGVVREKEMGSIMNL
YGSPASPLQFLLGKQIPYIILAFVSYLAAVCVAIIVFKVPIKGSVLAMFF
GVILALLATTAFGLFVSAFVKTQIAAIFATAIISMIPALNFSGMIYPVTT
LPDTIYTAARTFPGYWLQLVSLGGFTKGLNFTDFFDCYLALSTIFAVYIT
LATLLLKKQEV
>MS0601 ccmA, CcmA protein
MQSVNQLKIDRLACQRGDKILFTDLSFNLQSGDFVQIEGHNGIGKTSLLR
ILAGLAQPLSGKVRWNSEEISKCREEYYYDLLYLGHHAGIKPELTAWENL
KFYQQAGHCRQGDEILWNVLEKVGLLGREDIVASQLSAGQQKRIALARLW
ISQAPLWILDEPFNAIDKNGVKVLTGLFEQQAEKGGIVILTSHQEVPSSA
LTVLNLAQYKFTDNE
>MS1066 ccmA, CcmA protein
MQYSRIGYIFKLFMRIYMYALEIKGLTKQYKNGFKALHGIDLCVKEGDFY
ALLGHNGAGKSTTIGIISSLINKTSGQVKVFGYDLDSQLGLLKQQIGLVP
QEFNFNQFEKVLDILANQAGYYGIERSEAEKRAEVWLKKLDLWDKRNQQA
MRLSGGMKRRLMIARALMHKPRLLILDEPTAGVDIELRRTMWTFLRELNE
QGTTIILTTHYLEEAEMLCRHIGIIQQGRLVVDMPMKDLLAKLETETFIF
DFAPNSPKPIIRDYRLKQIDVDSIEVEMPREKGLNHLFEQLSNQGIQVLS
MRNKANRLEELFVSMSLNKPTDEVK
>MS0602 ccmB, CcmB protein
MIFFEIIKRELRIAMRKQAEILNPLWFFLIVITLFPLVIGPDPVLLSKIA
PGIAWVAALLSALLSFERLFRDDFIDGSLEQLMLTAQPLALTALAKVIAH
WILTGLPLILLSPVAALLLSLDVRIWWALVLTLLIGTPILSCIGAIGVAL
TVGLRKGGVLLSLLVVPLFIPVLIFSASVLDAATLNLSYAGPLAILGAIL
AASATLAPFAIAAALRISLDQ
>MS0518 ccmC, CcmC protein
MVTGLRIMSFALFSALFYIISILFIAPMLAKAQSGEQIQRPNKNWFILTA
LFAVICHFISLFPFFSNLFSGENFTLMEIGSLISVLIAILATVAIALKIK
TFWFLLPIIYCFATINVTLAAFAPSHVIQNLAQDLGLLLHILLAMFAYAV
CFIAMLQSIQLAWLDRKLKTKQMVISPLLPPLMMVERHFFRVMLSGEILL
TLTLLTGAVYLADFFGNENIQKAIFSFLAWIVYAVLLIGHWKYRWRGKKM
IIYTISGMILLTIAYFGSRAMLGMN
>MS0603 ccmC, CcmC protein
MWKWLHPYAKPETQYKLCGKFIPFFAVIALLLLSVACIWGLAFAPADYQQ
GNSFRIMYVHVPSAIWSMGVYGSMAVAALIGLVWQIKQAHLSVIAMAPIG
AALNFLALVTGAVWGKPMWGAWWVWDARLTASLILFFLYLGVMALYSAFQ
DRNTGMKAAAILCVVGVINLPVIHFSVEWWNTLHQGASITKFEKPSIATP
MLIPLILSIFGFMALSIWLTLVRYRVELLKEDRKRPWVKALIK
>MS0604 ccmD, CcmD protein
MFFESWSDFFYMGGYGFYVWLSYGITFITLLILAIQSYRGKKIVFREIQR
EQQREQRLQATKSRGTL
>MS0605 ccmE, CcmE protein
MNPRRKSRLTIILFVLLGVTIASSLVLYALRQNIDLFYTPSEVISGKNDD
PDTIPEVGQRIRVGGMVVEGTVKRDPNSLKVSFNVNDIGPEITVEYEGIL
PDLFREGQGIVAQGVLKEPKLLEATEVLAKHDENYVPPDLSEKMEQVHKP
MGISNQDMQGESDRDRLDKAVNSVEEGKK
>MS1815 ccmF, CcmF protein
MIPELGFIALLIALLSSFLLTLIPLVGMIKRNTNLLSYAWNFSYLFAIFS
TISIACLAYSFSVNDFSVEYVAAHSNSQLPLFFKIAATWGGHEGSMLFWL
FSLSLWTAAFAFFSRKIDPVFSARTLSILGFICLSFAIFILFFSNPFIRQ
FPLPPEGRDLNPMLQDIGLIIHPPLLYLGYVGFAVNFAMTLSVMLSGHVD
AAIARWTRIWVLLSWFFLTLGIMLGAWWAYYELGWGGWWFWDPVENASLM
PWLIGLALLHSLIVTEQRGIFSYWTILFSLFAFAFSLLGTFIVRSGVLTS
VHAFAVDGERGTALLLIFFLLTALALTVFALKVNLRQSAVRFSVFSKESF
LLLANVVLTIATVSVFLGTFYPMVFSAMGWGSISVGAPYFNSIFVPLLLI
MLIAMVFVLATKWQKMNRTLLRQKSILLIPALLIAYLIIHFTVRQDESLR
FHFSAFVLLSLAIWLLLATLWINWRKIGLRRSGMILAHCGVAFAVIGAVM
SGYFGSEIGVRLAPQQSQMLNGYEFRYIGFTNELGPNFTSEKAHFEIYKN
NQKLTALYPERRYYEVRTMNMSEVGIQWGVLGDIYIVMGDKLAPNEFSFR
LHYKPFVRWLWLGGILMALGALVAAVSLVQRKNAMAFSSAIKKE
>MS0606 ccmF, CcmF protein
MIAELGNFALALGLAISVLLAVFPLWGAEKGNKQLMSLARPMTYGLFICL
TFAFGALFYAFAVNDFSIQYVVNNSNSRLPLQYRLSAVWGAHEGSLLLWI
WLLSVWSVAVSLFSRQLPQEAVARVLGIMGLVTIGFLIFILFTSNPFART
FPNLPIDGKELNPMLQDVGLIFHPPLLYMGYVGFSVAFAFSIASLMTGKL
DTAWARWSRPWTMAAWVFLTVGIVLGSWWAYYELGWGGWWFWDPVENSSL
MPWLAGTALLHSLAVTEKRGAFKAWTVLLAILAFSLCLLGTFLVRSGVLV
SVHAFASDPTRGLYILAYLIVVIGGSLTLYAYKGGQIRSRDNAERYSRES
MLLLNNILLMAALVVVLLGTLLPLVHKQLGLGSISIGAPFFNQMFLVLMT
PFALLLGIGPLVKWRRDKFSAIRKPVIISLFLMIILGFALPYFIGNKLSL
SAVLGTMMVVIITLLSLYELKQRAAHRDNFLIGITKLSRSHWGMFLAHLG
VAMTVWGVTFSQNFSVERDVRMSVGDSVNIAGYEYKFQGIRDANGPNYLG
GTAQVDILKEGKLEGSLFAEKRFYTVSRMTMTEAAIDWGFTRDLYVALGE
QLEDKSWAMRLYYKPFIRWIWFGGVFMALGGVLCMFDRRYRFSKILNK
>MS1812 ccmH, CcmH protein
MKKILFSILVFVSLSLQAEMVDTYQFKNVEDRTRAVALAKSLRCPQCQNQ
NLVESNSPIAYDLRIEVYKMVDEGKSNQQIVEVMTSRFGNFVLYKPPFEL
TTALLWCLPIGLLLLAVLLMVRYLRRRSENREICTALSERQRRELAELLA
KNKDKK
>MS0608 ccmH, CcmH protein
MKKLTALLIMLVAVVASPCFAAIDAFNFSSAQQENDYHALTNELRCPQCQ
NNNIADSNATIAIDMRAKVFELLQEGKSKQDVVNYMVERYGHFVTYNPPI
TVATILLWILPALLICFGLAFVFRQKGKTLIKNSSQDISTENSTVENLSD
EQQKRLKALLKNKE
>MS1835 cdd, Cdd protein
MKNTIIKGLTDLVEQKRDNLIRQVVVQLEAQGYKAVLEQATVQQFCRQFA
LSPVEFALRCLPVAACYALTPISQFNVGAIAIGQSGSFYFGANQEFVAAS
MQQTVHAEQSAISHAWLAGEKAIAHMVVNYTPCGHCRQFMNELNSAERLK
IHLPHSQNNLLHNYLPDAFGPKDLNIQNVFFDGQSHPFNYQGHDPLIRAA
VEAASQSYAPYSQAFSGVALQLGELIICGRYAENAAFNPTFLPLQSALNY
QRLQGLIDVKVSRVVMAEAKADLTSLPMTQSLAGAHLGLDIEYISL
>MS1926 cdsA, CdsA protein
MLKERILSAIALIAVVFAALFLFSPFYFALCLGAVVTLGVWEWTQFAKIK
TEVWRYVISAIAGTFLFLWIYSHHSYLNAGRVFDGLAEPLLLAAVIWWIA
AFFLVINYPKSASIWSKSLILQIIFAFFTLLPFFIGVLKLRLDGYIIDAH
HGVVLLLYVFILVWAADSGAYFVGRKFGKHKLAPKVSPGKTWQGVIGGLI
TACVLAFIFQTIAGESLFNRGSTFSLTLLSVATVAISVLGDLTESMFKRE
SGIKDSSQLIPGHGGILDRIDSLTAAVPFFAYFYFFVL
>MS1111 chb, Chb protein
MMIFSSCKTPNHFLCAIFIGFGLNIPQTYAETVTQQFQKAFSSAEVSEKK
ESGLALDIARHFYSAETIKNFIDTIHKNGGTFLHLHFSDHENYAIESTIL
DQRAENARRDENGFYVNPKTGKPFLTYEQLKDIMDYAKQKNVELIPELDS
PNHMTAIFNLLAEKNGKDYVQKLRSKWTNEEIDITNPDSIAFMTSLIEEV
VWIFGNSTKHFHIGGDEFGYSEENNHEFINYANKLSAFLKEKGLKTRIWN
DGLIKSTIDQLDPNIQVTYWSYDGNTQNKQAARQRRTMRISMPELIERGF
SVLNYNSYYLYFNPKESPNISKDSDFAMRDVIKNWDLSIWDEKNTQNKVA
EPNKISGSALAIWGEYAGSLKGDSIHKATENLLKAIIYKTNAAGDSTGTI
SRKLQQLDFAQINANSYIDLMQVRNNESVTLENYPQTVHLLQTNALSGKK
RVLWVSGSHVHKIRLEPQWQKTGLNEKRNGKSYTAYKYQDNILWLDDNIT
EQ
>MS1015 cirA, CirA protein
MCCFHLIIMKKIDVYHENFPLENSIRLDRIFFILAKESAMKLNLITSALL
FSTIHSSFAVEPIKTELTPIEVYSAYAIPVNQDQTASSLTVLTEKDFAGR
NAAYVSDVLKTVPGVAFGINGGRGATTSLFLRGADSNHTTVIIDGVRMNP
INGNGFDFGGLALSNIERIEVLRGEQSALWGSNAMGGVIYITTKSGLYKE
KPFNVDFDLGTGSRNTRDASATISGYNKGFYYALHGDSHRTKGISALSSN
RFNYTALDGTKVTTGGAGEQDGFHRDNASIRLGYDDANKGLELLAQHSSQ
SAHYDNSLAEERLFNDYMRTRETLFKLAGYWGSEHELFKHSLSASHLKTD
NDTFSLWASAYDAKRLNTNYQLDINFDRDGATTQSFSILGEYQKSKYDST
SYTDEKALNEKSIAAEYRLFHENGHSLSLSGRYTDNSKYDDTFTGRIAGA
YRLSPNLKTHASFASAVQNPTFIEYFGYYGSYAANEGLKAERSRGGDIGL
LIESTDKHHSLDITYFARNVDNFISSELVDPVYYIYRSINLEGTTKIRGV
EIAYNGQLTDNLTAYANYTFTRSKDSQGDSLVRRPKHQANAGLNYQITEK
FGSNVNIAYVGKRIDNYYEETYPYAVHAVNMPSYTLLNLGVNYQLTSNIN
IYANLNNLFDKKYENILGYGQDGRNVYVGLKGSF
>MS1315 cirA, CirA protein
MNFKFNLIYTALFSGLAFSSYATETNQEVNTELEQINVATELEKAKAAGN
KQKDIVNLSLLGRQPAFTSPISVVNYDEKAFEDKQPRNVVDAIAKTDASV
MNFGGETNTLSGIYVRGLQLDARQISVNGLAGLYSTYNSPTAAVSSAQLI
KGASTATTGMDPEGSAGSAMNIETKHATDNPINKIGFGWFSNNRLQESFD
FGRRFGENNAWGVRVNGKYRDGDTARHGYDEINKEFAVAADYRGDKFRAA
IDYMYAKRATNGGRARVQDIQNLDFAMPKAPDGKINLIPSWSGQTTEDQT
VMGTFEYDLPYNMMLSGGLGHMESKYYGAFGQIRMTNTEGDYSIRQMRAI
DYRIRTTSANLKLQGELETGSLYHMWNTSFDFVQRQRDFDQSPVLSNFST
NIYNPVFPSVTAYSALQQSTDEKSRSYSWALADTVGFFDNSLRLTLGGRF
QWIKQHNYKNDSKGDKNRFSPMVTLAYVPNPDLVFYGNYLEDLEPGYVDE
DGNMAKPVVSRQIELGVRKNWGDLFTTTASIYQIRRPGIVTTNLAKNNAD
FTVGEEQGEERNRGIEFNIYASLFNNTVRPSLGITYNKGELIDYSTYAGA
IKTGTQVASPRIISKANIDWDTPFIENLTLNAALQYYGKSYQDIDKKYKL
PAYTTVDLGAKYLIKLNETQTLTLRAAVENVFDKNYWQVQRGKYDRSFAV
VGMPRTYWLTAEYTF
>MS1205 cirA, CirA protein
MKKTFIYSTVAQTVLLTIAGTAIAAEDGTEQLDTIDVVTEGSMFRMGEVP
FHQAKSAVAITREQLDSQNVDKLDEVAKYQAGFANQVFGNDTNTNWFRVR
GAEVSQAVNGLPTFSYGFFTPYVNSFGLEAIEVTKGADAMTFGAAKSGGL
INYVTKRAHKDQIGHGEFKTTFGSHNQYGLAADYTGTLTDDERLRYRVVA
SYLGRDGDWEGTDNQTLYIAPTLTWDISDKTRFTLLTGYQRDHGTPSSNF
LPQEGTLVPSPRGYIHRRTNLGDPVKDTETNRQYNIGYEFSHDFNNGLSF
NSSYSYSHIDNYHRGAYAYPSAYNADWSPLAPSAAGYSLSRAVVFNDGKA
ISHTTNNYLTWNYDNAWLKNTLVVGTDYRHNKVDALYSLYGTTSNTNLFT
PSVGWNQAQDVSAAPHVQIKSRQLGFYLQNNARFADKYVLGLGIRHDRAE
QREYTSTQKVKDNHTSYSASLMYEAPFGLNPYFSYSESFNLPTGLSGDET
LYDPNITRQYELGLKYIPTWLDGTISVAGFRAKDTGALVGNGLGATISSA
DPIYRKGFEVQADVNFTSNWTGTLAYTYTKSESKDSAGKKTRQPLIPTNM
VAAKTAYSFTEGLLKDLTVGVGLRYLGHSVTSKGSLYSHARLPSATVVDL
MARYAITPNWIAQVNIDNVGNRRYVSGCDYYCYYGAERRATANLSYKF
>MS0516 cirA, CirA protein
MKKLKISLLPLTAFVAATVHAETLDTIDVVSDNFSPQAENIAAKGVTKVR
QATKMSDVIRGVPGVNVNGARSTVERYNIRGVSEEYLNVTVDGARQNGYS
FHHNGNYGIDPEILKRVDIDVGSNSVSTGAGSLGGSMKFETVDAADMLEE
GENFGGKVKYGYGSNGNSNQGTAMLYGRRGNLDLLGYFNYRHQRDGEDGN
GLKNKNKGHLSNYLFKTKYNISNEQWIKASAERYTNTALSCYRANMGMCL
GDVPQPGEPGYVETNHGKAYTELTRKTYTLSYGFNPEHNNWVNIKANAYN
TETEVASMGSPKSKVRTVGGTLSNTSEFELGVTSHQFLVGGEYYNSKAQA
LGSVNNAYVADMDSTSVYVEDKIALGNLMIIPGVRFDHYKADLASDFDKS
YHRFSKALGLKYSLTDNLIVFANYTELFKGPDAGEIYLRGTRAYDGNLEA
ARGDNKEVGFSYAKDGLFSDIDGFSFTAKYFKTDYDNINQTVSASRCVNT
SAISSGSIYCNLGKVDIKGVEAQAKYRYEDTSFSVSYARARSEQKSTGLA
AFADTGDRYNFTLSQYISSAQVELGWNTMYVRAIDVDDSTLKESYAVSNM
YVSWSPAQAKGLELTFGIDNIFDKAYKDHSTQYYGSVDLDPGRNYKLSVS
YKF
>MS0278 citB, CitB protein
MHGIAWGYCFIIVKVRNMSEKTKVIVIDDHPLMRRGIKQLIELEEQFEVV
GDAGSGNEGVELAIKTSPDLIILDLNMKGLSGLDTLKVLRQEGVDARIVI
LTVSDSKADIYALIDAGADGYLLKDTEPDTLLAQIKQIAQGEIILSDSIK
NLLVERHPAHEPIHALTDREMDVLQLIATGLSNKQIAAQLFISEETVKVH
IRNLLRKLNVHSRVAATVLYLEYKGS
>MS2285 citB, CitB protein
MRKFYRTFFQNATIRRSKAYPGEITMIKVALIDDHVIVRSGFAQLLSLEE
DIEVVGEFGSAKETRQNLPRIKADVCIIDISMPDESGLDLLKSIPSGIHC
IMLSVNDSEMIVKKALELGAKGYLSKRCSPEELVQAVRTVYTGGVYLMPE
LTVKLVTNKNNNPIQQLTKRELEICELLIRGLGAKEIGEQLGLSFKTVHA
HRANAMSKLDVKNNVELANLFHQYS
>MS0424 citT, CitT protein
MNEQLLIWFQSPLLWVVALLLGAVFLFMQNKLHMDVIALLVMLLFCLSGI
LSLDEVFAGFSDPNVILIALLFIVGEGLVRTGVAYQVSEWLMKVANNSEI
KVLILLMLAVAGLGSFMSSTGVVAIFIPVVLMICQQMNISPKRLMMPLSV
AGLISGMMTLIATAPNLVVNAELARIDNLRFSFFSFTPIGLTILVIGIFY
MLLVRRWLSSSTEDLKQKQKRDSITDLIEEYQLHQRTKRFVVKSNSQFIG
HAVEDLHLRSNYGLNILAIERWKHFRPLFIAASLGKTEIKEKDILLIDVA
NPDLDLSAFCHLYHLEPTEIRNTHFNEQLKSVGMVEITPVPDSVAIGRSA
AELRFRSNYGLNVIGIKRNGELLQGHLVEEPYKLGDQLLVIGDWKLIRKL
PDRTKDFFVLDYPSEIERAVPARSQSMHAILSVVTMVVLMVSGVVPNVVA
ALIACLMLGKFRCIDAKSAYDSIHWASLILIVGMMPFSIALQKTGGVDLV
VNFMINTVGNMGKHWILISLFILCAVVGLFISNTATAILMAPIAINMAHQ
LNLSPVPFAMTVAIAASAAFMTPISSPVNTMVLGPGGYKFGDFIKIGVPF
TILVMLVTVFLVPVLFPF
>MS1378 citT, CitT protein
MSTDTQENESSKNRRNMIILIADIGLFFILLNVLPFDEAPRKGLSLLAFV
AVLWLTEALHVTITALFVPILTIGLGLFSTKEALVAFADPTIFLFFGGFA
LAAALHIQKIDRLIANKIMTMAKGNLCVAVLFLFFATAFLSMWMSNTATA
AMMIPLAMGIMSNMDREKEHNTYVFVLLGIAYSASIGGMGTLVGSPPNAI
AASQVNLTFADWVKYGVPVMLLLFPIVIGLLYFNFRPKFNQTFDYQFEQI
QLTTPRIITLSIFVLVALLWVLGSEINPYIASLLGLGGKIASFDSIVGVS
AALLLCICRVVNWEQIQHHTEWGVLYLFGGGLTLSAVLTHTGASKIMADG
IVAIIEGKHYYIICLIIASFIVLLTEFTSNTASAALLVPIFISIAESLNM
SPLGFALIIGLGASCAFMMPVGTPPNAIVFSTGMVKQRDMLRSYKINLSC
IIIVSAIGYLFWL
>MS0970 citT, CitT protein
MNIQTSAPKVEVRLGFKLQGLLIAVLVGIAILLIPTPEGLSTKAWGMFAL
FVATIVAIIAKAMPMGAATLVALVISGLSGLTPLSPAKGEVGMLSGFSNG
TIWLIAIAMFLSRAVIKTGLGKRIALYFVARFGKRMMGVAYGIALADVVI
GPGIPSASARGGGIMYPIMQSIADAYNSKPGPTARRAGAFLAIAVSQIDT
IVCTMFLTAMAGNPLIAELAKSQGVEITWMTWFLGAIVPGIVSLILLPYF
VYLIYPPELKDTPKMAEMAREELNSMGKMSQAEWILALDFILLLLLWTVG
DLVFHIPATVSAFVGLVILLLTNIMSWKNIISETAAWDTMFWFAVLVMMA
NALNKYGTISWISTHIADSVGSFSWPVAFTILVLVYFYTRYFFASAMAHI
SAMYLAFVAAAIAVGTPPIIAAIGLGYTSTLSMSLTQYAGGPGPALYGSG
YNSTGQWWGVSFAVSILSLAIWFSVGGVWMKLLGWW
>MS1915 citT, CitT protein
MRVNFMNTQTSVSPPSIFSRNSLIFMADVIIFALLLAFLPFEQNVNKGLA
LLVFVGVLWLTEALNVTVTAVLVPLLAIGLGLVTTKNALVAFADPTIFLF
FGGFALATALHIQKLDRLIANRIMALAKGNLFIAVLYLFSVTAFLSMWIS
NTATAAMMLPLAMGILSQLDREREHNTYTFVLLGIAYSASIGGMGTLVGS
PPNAIVASQLHLTFSDWLWYGMPVMIILMPLMIGCMYVIFKPRLNIRFTQ
DFEKIEMTTPRIITLLIFILTAVLWVFSSSVNPMLSGLLGLPKDIASFDS
VVALLAAALICISGVASWKQIQDNTEWGVLLLFGGGLTLSAVLKDSGASK
VMADGIVFLIQGGHFYVIGLIVTAFIVFLTEFTSNTASAALLVPIFISIA
QALGMPEMGLALLIGLGASCAFMLPVATPPNAIVFGTGEVKQSDMIRAGV
VLNILCIFVIGTIGYLFWFG
>MS1783 clpA, ClpA protein
MNIEKFTTKFQQALAEAQSLALGKDNQYIEPVHVLSALINQQDGSVAPIL
TSAGVNVGALKAELNSEINKLPQVSGNGGDVQISRQLLNLLNLCDKIAQR
NNDKFISSELFLLAALEEKGSLGDLLKKCGAKKESLEQAIKTIRGGQSVN
DQNAEESRQALEKYTIDLTERAESGKLDPVIGRDEEIRRAIQVLQRRTKN
NPVLIGEPGVGKTAIVEGLAQRIVNGEVPEGLKGKRVLSLDMGALIAGAK
YRGEFEERLKAVLKELAQEEGKVILFIDEIHTMVGAGKTDGAMDAGNLLK
PSLARGELHCVGATTLDEYRQYIEKDAALERRFQKVFVDEPTVEDTIAIL
RGLKERYELHHHVQITDPAIVAAATLSHRYISDRQLPDKAIDLIDEAASS
IRMEIDSKPEPLDRLDRRIIQLKLEQQALKKEEDEASRKRLDMLEKELSE
KEREYAELEEVWKSEKAALSGTQHIKAELESARTQMEQARRAGDLNKMSE
LQYGTIPALEKQLAAADSAEGKEMSLLRNRVTDEEIAQVLSRATGIPVSR
MMEGEKEKLLRMEEELHKRVIGQGEAVEAVANAIRRSRAGLSDPNRPIGS
FLFLGPTGVGKTELCKTLANFMFDDENAMVRIDMSEFMEKHSVSRLVGAP
PGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHHDVFNILLQVLDDGRLT
DGQGRTVDFRNTVVIMTSNLGSDLIQENKDLGYEGMKEIVMSVVGQHFRP
EFINRIDETVVFHPLAKENIKAIAQIQLARLTKRMEQHGYAINFSETLLD
FISEVGYDPVYGARPLKRAIQQEIENPLAQQILSGKLLPAKPVTVDYEDG
KVVAKQ
>MS1847 clpP, ClpP protein
MALIPMVVEQTSRGERSYDIYSRLLKERVIFLSGEVEDNMANLIVAQLLF
LESENPEKDINLYINSPGGSVTAGMAIYDTMQFIKPDVRTLCVGQACSMG
AFLLAGGAAGKRAALPHARVMIHQPLGGFRGQASDIQIHAQEILKIKQTL
NERLAFHTGQPFEVIERDTDRDNFMSAEDAKNYGLIDSVLVKR
>MS1846 clpX, ClpX protein
MTKETETTCSFCGKSQDEVGKLIAGVDGYICGECIDLCHDLLHDEETREQ
QSAEEAVETEEKLPTPHEIRAHLDDYVIGQDYAKKVLAVAVYNHYKRLRS
NHGIADVELGKSNILLIGPTGSGKTLLAETMARMLNVPFAMADATTLTEA
GYVGEDVENVIQKLLQNCDYDTEKAQRGIIYIDEIDKITRKSENPSITRD
VSGEGVQQALLKLIEGTVASIPPQGGRKHPQQEMLRVDTSKILFICGGAF
AGLDRVVQKRIHKGSGIGFDAEVKGKEDEVSLTDLLKQIETEDLIKYGLI
PEFIGRLPVVAPLSELDEKALVQILTEPKNALTKQYQALFGLENVELEFT
PEALNAMAKKALERKTGARGLRSIVEGALLDTMYDLPSLEGLVKVVVDEA
VINEHSAPKLEY
>MS0250 cls, Cls protein
MLVNKIKRAKQRLERLPYLAQSVADFEVIYNPAQFKQTIIQLIRSAKNRI
YITALYWQFDEAGQEILNELYAAKQKNPALDVKVLVDWHRAQRNLLGAAK
GTTNADWYCEQRAKHQAQSMFFGVPINTREVFGVLHIKGFVFDDTLLYSG
ASINNVYLHHKDKYRYDRYQKITNPALADVLVAFVNQYLLDPNAVHALDD
AARPATKEIRMHIKAFRKKLALEAGYWINNAVAFSNDQLTVSPLFGLGTS
GNVLNNCIEDLFMLVKEKLVICTPYFNFPRTLKGKISHLLKQGKKVEIIV
GDKVANDFYIPPTEEFKIAGALPYLYEKNLRAFCKKFAAQINEGLLVVRT
WKDGDNTYHLKGVWVDEDYILLTGNNLNPRAWRLDAENGLLVHDPKHELR
VQMEEELRQIRQHTTVLRHYSELQKMNQYPEPVQRLLKRFERVKVDKLVK
MIL
>MS1171 cls, Cls protein
MMIKSLCLCIILNSCIKFSKSAVRMNINLDSIIAYIVPVIMWTLIVTITL
RQIIKRQSSSAMLSWLMIIYIVPVVGILAYLVLGEINLGKRRANASKQLL
PKYMKWFAGLKNQQHLLINDQQPSLASPLFALAQRRLSIPCINGNELHIL
DTPESIIQNIIDDIHQAQYSINMVFYIWSNGGLVEQVQQALIQAKQRGVK
IHILLDSVGSRAFFKSENYQKMTALGIEIEEALHVNLLRVFLRRIDLRQH
RKIIVIDNQISYTGSMNMVDPNFFKQDSHVGKWIDIMVRIDGPVSAVLNG
LHSWDWEMERGQGLYVPLPSPQHPMDNYNIHAVQILATGPGLPADLMEQS
LATAIFAAKESITITTPYFVPSQNIVDALQIAALRGVNVSLILPVHNDSL
MVRWASRTYFDDLLTAGVKIYNFTEGLLHTKSILVDNKMALVGTVNMDIR
SFSLNFEVTMVVEDQTFANEISLLHENYMNGATLLDEQRWLNRPVFTRII
EKLFFLFSPLL
>MS1477 cmk, Cmk protein
MTKNIVITVDGPSGAGKGTLCYALANRLGFALLDSGAIYRVTALAALQCK
ADLTNEAELAELAAHLDIEFLPEAGEVKVMLGGEDVSGLIRTQEVADAAS
KVAVFPQVRSALLQLQKDFATPKGLIADGRDMGTVVFPTAQVKLFLDASA
EERAKRRFKQLQNKGISGNFDQILAEIKDRDFRDRNRPVAPLKPADDALL
LDSTTLSIEEVIAQALSYIHQKVKI
>MS2188 coaA, CoaA protein
MNIESQSSVSEKFSPFLTFTRKQWAELRKSVPLKLTEQDLKPLLGFNEEL
SLEEVSTIYLPLARLINYYIEENLRRQTVMNRFLGNTNANVPYIISIAGS
VSVGKSTSARILQSLLSNWPENRKVDLITTDGFLYPLEKLKKENLLHKKG
FPVSYDTPKLIKFLADVKSGKPNVSAPIYSHLTYDIIPDKFNKVDRPDIL
ILEGLNVLQTGSRKAEQTFVSDFVDFSVYVDADEALLKEWYIRRFLKFRE
SAFTDPNSYFKDYAKLSKEEAVETAANIWNTINGLNLRQNILPTRERANL
ILRKGADHAVQEVKLRK
>MS1951 coaD, CoaD protein
MTKTVIYPGTFDPITYGHLDIIERSAVLFPQVLVAVASNPTKKPLFELAE
RVRLAEESVAHLPNVQVIGFSDLLANVVKERHITAIIRGMRTTMDFEYEL
QLAHLNRALTDGVESLFLPSTEKWSYVSSTIVREIYLHRGDVSQFVPPPV
LTALMEKNR
>MS0359 coaE, CoaE protein
MAYIVGLTGGIGSGKSTIADLFMELGVPVVDADEVSRRLVEKGSPLLSKI
ATHFGADILTNGGELNRSKLREIIFNRPEQKNWLNALLHPAINEEMQRQL
QAQQAPYVLFVVPLLIENNLMSLCDRILIIDVSPQTQLERATKRDKNQRE
LIQQIMNSQVSREKRLTFADDIINNDEDFAQNGDRIKQKVLELHQRYLQL
AQQKSSTYDNKNDR
>MS2235 cof, Cof protein
MTIPNLRDKIKIVFFDIDETLIMKFEDILPDSVLPVIRKLKQNGIIPAIA
TGRSRCSLPTKIKALIAEEPIELFVTMNGQFSVFQNKVIEKHPIPTEKVQ
HLVDFFDAQQIDYAFVSDNNVAVSKITAKQKSALDPILTDYIVDKDYFKH
NEVFQLLPFYDQSQDELVKNANILDGLRVVRWDKDSVDLFDAEGSKARGI
ASAIKRLGFEMENVMAFGDGLNDLEMLSTVGVGVAMGNARDELKKVADFV
TDRIEDHGIYNFLVKAGLIED
>MS2344 cof, Cof protein
MQYKAIFSDIDGTLLNSRHQISSKTESVIKLAVSKGIPFIPVSARPPYAI
TPYTEQLQTNQGIICYSGALILDKNLRELYSVQIDQADLAALNQILADYP
YLSINHYAALDWFSNDLDNYWTKQEADITGLFPKQTPSNLTKVHKILVMG
EADKIKPLEQKLKQKLPHLSIHLSKPEYIEIMNKAATKAKAIGFMERHLH
VSADEVIAFGDNFNDLDMLEYAGLSVAMGNAPDEIKQVAKKVTASNDEDG
IALVLNEIFNL
>MS2225 cof, Cof protein
MKQLPFRAIVSDMDGTLLNANHVVGDFTINTLEKLAQKGVDIVMATGRGY
TDVASTLSKMKIKNAAMITSNGAQIHDLQGNRLYSNYLPEDVAFEVMQLP
FDADRVCMNTYQNNDWFINIDLPQLRKYHQTSGFMYEVVDFKKHHGRDTE
KVFFIGKKPADLMEIEQELTTRFGNYATITYSTPVCLEVMNKNVSKATAL
AHLIEQREYSLSDCIAFGDGMNDIEMLTEVGKGCIMQNADPRLLQLLPDN
ERIGLNKDESVASYVRAVFGIY
>MS0842 cof, Cof protein
MAYQVLAFDLDGTLLNSQGIILPSSKKAIEAARAKGMQVILVTGRHHTAV
KPYYYELNLETPIVCCNGTYLYQPQTDEVLRSNPFSKTQALQLIDIAERQ
KIHILMYSRNAMNYMELNPHMEKFQKWVQSCPQNVRPDVRQVSSFRDIVN
NEDIIWKFVMSAPNRELMQQTVNMLPQDQFSCEWSWIDRVDISNKGNTKG
SRLLEYLRSVNMNPEQVVAFGDNQNDLSMLTSVGLGVAMGNADEIVKQQA
KCIIGTNNENSIADFIEGLK
>MS1748 comEA, ComEA protein
MTTLFLILCIGSKKYTVFLCMNRLFPESNVVATEQRYFNFKRESLMKLSV
RKFLLSCLAAGSLLSAGTAFAADKVPASAETQAIKTSETAKPADNIGNTV
NINTATAEEIKQTLIGIGAKKAEAIIQYREKHGNFTNVEQLLEIQGIGEA
TLDKNKDRIKL
>MS0826 comEA, ComEA protein
MKEKTKSTKNQLSESAKERMETAKNSVTSTKDKAASMKPTVKNALNSSSK
VNINTADAKTLQSLTGIGEVKAKAIVDYRKKVGKIKNASELSNIDGIGDA
TIEKITPYLNF
>MS0931 comEC, ComEC protein
MMKLDLFLFCFIVNTLCLLVLPESFLLDFPLFLHFLFPLVIAAFIYWFKY
RRLWRGFYYLFCGLIAVFYIHFQALSLFRAADGVKYLPAKVQTDFVIDEI
LYQRDYRNIIVKAQLAPEFKPQRIYVNWQADQAVKTGEKWRGELHLRAVS
SRLNYGGFDKQKWYYAQGITAWAKVKSAVKISEDLSLRQQLFNHYLAQTE
RLRQQGLLMALAFGERAWLQEDVWQIYRKTNTAHLIAISGLHIGLAMLLG
MGVARLIQFCLPTRYISPYFPMLSGLVFAAVYAGLAGFAIPTLRALIALV
IVSLLKLLRGYCNVWQLFLRVIGVLFIFDPLMVLSNSFWLSVCAVFSLIL
WYQIFPLNLLEWKGKSVTDGKFAWLFGLIHLQLGLFCLFSPMQLMTFQGI
SLAGFWANLIIVPLFSFLLVPVILFALFSNGAWESWRIADWLAQWFTHLL
SYFQDYWIGVSNQTSWLICCLLCLLLLTVVHFIYPLKKQIPEKNELLTQF
KTKKISLKSDRTLSPVLRKYLVSVATLFLASGAMLWLYQQWRQPDWRFET
LDVGQGLANLIVKDGRAVLYDTGAGWKNGSMAQSEIIPYLQRQGLILEKV
ILSHDDNDHSGGIADILQAYPSINILQPSMVNYEKTEQNSFNFDRTFCKQ
GLNWQWHGLNFQVLAPAKIAERANNTDSCVLLIDDGQYKLLLTGDADLAA
EQQFVAHLGKVNVLQVGHHGSRTSTGEALIKQIKPDFALISAGRWNQWGF
PHPVVTQRLKRHKSAVYNTAFSGQISFEFYPNKIEVKTARSNYQPWFRQI
VGGERD
>MS2234 comFC, ComFC protein
MNWFAFRCIYCQRKLAIGSHGLCCSCNKQIRRFNYCGVCGSELAENTLGC
GNCLQNRPAWHRMVIIGAYKMPLSSLIHRFKFQNSFYFDRTLARLLYLAI
RDARRTHGLMLPEVIIPVPLHHFRHWRRGYNQADLLAGQLAKWLNIPCNN
RLIKRVKHTRTQRGLSAAARRVNLQKAFRFADKKQACPYKSVALVDDVIT
TGSTLNALAGLFVQQGVEQIQVWGLARA
>MS0341 copZ, CopZ protein
MKKTLLFLTALLFSGSGFAAERNVTLHIEEMNCQLCVYLVNKELRNIDGV
ISTKANFNTRLVKVVADEKVTDEMMINAIDKLHYHAVVKK
>MS0320 corA, CorA protein
MINAFALENARLTRLDEDNLSTLNKAIWIDLVEPTSEEREILQDGLEQSL
ASFLELEDIEASARFFEDEDGLHLHSFFYCEDEEDYADLASVAFTIRDGR
LFTLRDRDLPAFRLYRMRSRYQRLDECNAYEVLLDLFETKIEQLADVIET
VYSDLERLSRVILDGKQGEAFDDALGTLTEQEDMSSKVRLCLMDTQRALS
FLVRKTRLPANQLEQAREILRDIESLQPHNESLFQKVNFLMQAAMGYINI
EQNRVMKFFSVVSVMFLPATLVASTYGMNFEFMPELGFKYGYPMAIGLMI
AAGVTPYMYFKRKGWL
>MS0346 cpsG, CpsG protein
MTKLTCFKAYDIRGRLGDELNADIVYRIGRAFGQFLKPTTIVVGGDVRLT
SKELKSAVTNGLLDSGVNVIDLGEVGTEEIYFATSFLKADGGIEVTASHN
PMDYNGLKLVREGSRPISADTGLADIQRLAEENNFPAVTQRGVYKQQSVL
GEYVEHLLSYINLDNLKPMKLVINSGNGAAGHVIDAIEAQFKARRVPVEF
IKVHNNPDGTFPHGIPNPLLHENRQDTIDAVLANKADMGIAFDGDFDRCF
LFDETAQFIEGYYIVGLLGQAFLQKHKGAKIIYDPRLIWNTIKLVEENGG
EAVMSKSGHSFIKEKMRAIDAIYGGEMSAHHYFRDFNYCDSGMIPWLLVM
ELVCTTGKTLGQLVNDSIDTYPSPGEINSKLADAKTAIARVRAAYEKDAV
SVDETDGISIEYPTWRFNLRSSNTEPVVRLNLETRGDKKLMTEKTEEILA
LLRQ
>MS0771 cpsG, CpsG protein
MTTIFTIAQNWLEQDPDLETKAELAQLIDNAKAGDENALKELTARFDGRL
QFGTAGLRGRLQAGSMGMNRVLVAQAAGGLADYIKQYDHNAPSIVIGYDG
RKNSDIFARDTAEIMSGAGIKAYLLPRKLPTPVLAYAINYFDATAGVMVT
ASHNPPEDNGYKVYLGKENGGGQIVSPADKDIAALIDKVAAGNIKNLPRS
QDYVVLDDQVVDAYIEKTASIAQEPRTDINYVYTAMHGVGYEVLSKTLQK
AGLSQPHLVKEQVYPDGTFPTVSFPNPEEKGALDLAVKLAKKQNAEFIIA
NDPDADRLAVAIPDAKGNWKSLHGNVLGCLLGWHLAKKYHAAGKQGVLAC
SLVSSPALAEIAKKYGLQSEETLTGFKYIGKVKGLLFGFEEAIGYLVDPD
KVRDKDGISASVAFLDLVLYLKKQGKTILDYMNEFNREFGAYVSGQISIR
VSDLTEISKLMTALRNNPPSEIGGFKVTQFIDHLKTERNNDILVFTLENG
SRLITRPSGTEPKIKFYLDAKGKDALDADNVLAQFDESVRALLRREQYGK
QDC
>MS0967 cpsG, CpsG protein
MAERKYFGTDGVRGKVGTFPITPDFALKLGWAAGKVLASQGSRQVLIGKD
TRISGYMLESALEAGLAAAGLSAAFIGPMPTPAVAYLTRTFRAEAGIVIS
ASHNPYYDNGIKFFSAQGTKLPDEIEEAIEAMLEQPIDCVESAELGRASR
IKDAAGRYIEFCKGTFPTELSLSGYKIVVDCANGATYHIAPNVMRELGAE
VIEIGTSPNGMNINEKCGATDIKALKAKVLETKADVGLAYDGDGDRIMMV
DHLGNVVDGDQILFIIAREDLRAGKLKGGVVGTLMSNMSLEISLKTLGIP
FIRANVGDRYVLEKMVENDWKLGGENSGHIIIADKNTTGDGIIASLAVLT
AMAQHKLSLNELASAVKLFPQVLINVRFSGGTNPLESDAVKAVAAEVEKR
LAGKGRILLRKSGTEPLIRVMVECSDAELARKSAEEIVEAVKAN
>MS2226 crcB, CrcB protein
MIMWQSLILISSGAALGASLRWGMGLILNPLFAAFSFGTLIANYLGCFII
GLIMAMIWQHPQFSGEWRLFMITGFLGSLTTFSSFSAEVMENFIQQKWLI
GLGIMSAHLFGCLIFTGIGVLITRWLN
>MS1934 crp, Crp protein
MLEQVNAHQTNVLPQTQPVQPASPMDPTLDWFLSHCHIHKYPAKTTLIHA
GERADTLYYIVKGSAAVMVKDEEGKEMILSYLSQGEFFGEVGLFEEGQVR
SAWVKAKNACEIAEVSYKKFRQLLQVNPEILMYLSAQLSRRLQNTSKQVS
NLAFLDVTGRIAQTLLNLAKMPDAMTHPDGMQIKITRQEIGQMVGCSRET
VGRILKMLEDQNLIAAHGKTIVVFGTR
>MS1077 crp, Crp protein
MPKFIEKQMKVLTPDAAKTGRRIQSGGCAIHCQDCSISQLCIPFTLNEHE
LDQLDNIIERKKPIQKSQILFKAGDELTSLYAIRSGTIKSYTISETGEEQ
ITSFHLPGDLVGFDAITNMSHPSFAQALETAMVCEIPFDILDDLTGKMPK
LRQQMLRLMSSEIKSDQEMILLLSKMNAEERLAAFIYNLSKRYAARGFSA
REFRLTMTRGDIGNYLGLTVETISRLLGRFQKLGILSVQGKYITINDMVQ
LVELSGTNRTKIKLVD
>MS1273 csdB, CsdB protein
MFDTTGFRSHFPYFQHPDRVIYLDNAATTLKPQSLIDATVKFYQSAGSVH
RSQYDEEQTALYEQARSQVRQLINAESDKAIIWTSGTTQAINTVANGLIP
YIQSDDEIIISEADHHANFVTWSMIAQKCGAKLRILPIQDNWLIDENALL
EALNKRTKVVVLNFVSNVTGTEQPVEHLIRLIRKHSSALVSVDAAQAISH
VKIDLRKLDADFLSFSAHKIYGPNGLGVLSGKLTALELLQPLIYGGKMVD
RVSKQQISFAELPYRLEAGTPNIAGVIGFNAVLSWLNQWDFEQAEHHAVQ
LAEQTKVRLKNYEFCQLFNSPKPSSVISFVFKNIAGSDLATLLAEQNIAL
RTGVHCAQPYLSRLGQHSTLRLSFAPYNTQQEVDAFFTALDKSLALLEE
>MS1095 cspC, CspC protein
MEVGVVKWFNNAKGFGFINAEGSDADIFAHYSVIEMDGYRSLKAGQKVNF
EVVHGEKGSHATKIIPILE
>MS1144 cspC, CspC protein
MSKLNGLVKWFNSDKGFGFITPADGSKDLFVHFSSILGNNYRSLNEGDRV
EYNVENTQRGPAAVEVAVIK
>MS0166 cspR, CspR protein
MLDIVLYEPEIPQNTGNIIRLCANTGFRLHLIEPLGFTWDDKRLRRSGLD
YHEFAHIKKHKTFEVFLESEKPKRLFALTTKGGPAHSEVKFELGDYLMFG
PETRGIPMAILDSMPMEQKIRIPMTENSRSMNLSNSVAVTVYEAWRQLDY
IGAVNLNRK
>MS0347 csrA, CsrA protein
MLILTRKVGESLLIGDDISITILNVRGNQVKIGVNAPKDVSVHREEIYQR
IKQAEEKESTS
>MS1677 cstA, CstA protein
MLWFLLCVAILLLGYFIYGKIVEKIFVINPDRKTPAYSLRDGVDYVPMTK
KKIWLIQLLNIAGTGPIFGPILGALYGPVAMLWIVFGCVFAGAVHDYFSG
MLSIRNGGANVPYLAGKYLGRPAKHFMNIIAILLLLLVGVVFVASPASLL
TNITSDLMNSGSVGAAAVNDEAGAAKGNILIMWTAVIFIYYIIATLVPID
KIIGRIYPFFGALLLFMTCGMLFGLFFEGIPFFRTLGGDISLADFFTNMH
PKNAPIWPLLFITIACGAISGFHATQSPLMARCTENEREGRFIFYGAMIG
EGIIALIWCMVGLSFYNDQAGLAEAIQIGTPSKVVYDAAIGMLGVFGGIL
AVLGVVVLPITSGDTAFRAARLLIADFFKYDQRNLTKRLTIALPLFAIGF
WVSTIDFSVLWRYFGWANQTTAMVMLWTAAAYLFRHQKFHWVCTIPAVFM
TLVCSTFLLNAPIGFGLDYQLSVWLGGAVTAVAVIAFFMLLKPISADEQD
>MS1002 cvpA, CvpA protein
MIDYIIIGIIVFSIVVSLLRGFVREVMSLASWVVAFVIASQFYPYLANFL
TQIESEYLRNGTAIGILFILTLIVGAIVNYVIGQLVDKTGLSGTDRVLGA
CFGFLRGVLIVSALLFFVDTFTNFDQNDMWKESKLIPHFGFVVEWFFEQL
QANSSFLNSTLNK
>MS0277 cyaA, CyaA protein
MKYDLQFAKKQVDDLHRLRVERVLQGSTADFQHVFQLIALLLHLNHPALP
GYVTDAPAGVAHFKLSDYQKNFLAQQFPTGFDFVRLEQESNAHQQEKTPI
YGVYVMGSIASISQTAKSDLDTWVCHSPDLTPYALNKLQQKTQLLKIWAK
KFNTDITLFLMDEFYFNHYRYSNTLSVENCGSAQHMLLLDEFYRSAIRLA
GKPLLWLHLNVENEADYGKEVQRLQQTKQINRADWIDFGGLGAFSANEYF
GASLWQLYKGIDSPYKSVLKIVLLESYSQEYPNAKLISMQFKQQLFNLKP
VKEQCFDAYLAMLERVTEYLTKLKDEKRLDFIRRCFYIKVTETVRERPLA
PWRAKILKNLTAQWGWSEETIKHLNRIHTWKIRSVRETHNKLIRVLMLSY
RNLVNFARKHNVNASIAPQDISILTRKLYTAFEVLPGKVTLMNPQLALDL
SEKNLTFIEVTEEHGVKPGWYVVNQMPSVVYPSQNRYIEYNPILIKLIAW
TYFNGLLTSKTKVHISSTHVDIEKINQCITDLRVSFPVKASPPTDEELTH
PCEIRSLAVMINLTKDPTPYSDINRTEIQQSDLFSLDGENESLIGSVDLL
YRNKWNEIKTLHYEGDKAMLSALKVLSNKIHRGSGVPESVNVFCYNQYYQ
EEISELVVGLLNKCISIQLGTTQLPMSSVPRMTGKNWKLFFEEHDATLHQ
PQTEPVFISQVIAEQKQVKVKRNQPYKHLLNYPRQIDSFASEGFLQFFFE
DNEDETFNVYILDENNRLEIYRQCDGSKEQKIREISQIYNLSGSDQNDNH
YKIIKRDFNYPQFYQLKHQQKGILILPFSGSCMV
>MS2082 cyaY, CyaY protein
MNIAEFHQNIDQIWDSIEEQLENQDIDADCERQGAVFTITFENRTQIVIN
KQEPLLELWLASKLGGFHFSYKNGDWLNYEGKRFWDCLAQACAAHGEEVS
FA
>MS0715 cydA, CydA protein
MLDVVELSRLQFALTALYHFLFVPLTLGLSFILVIMETLYVATNKQVYKD
MTKFWGKLFGINFALGVTTGITMEFQFGTNWSYYSHYVGDIFGAPLAIEA
LLAFFLESTFVGLFFFGWDRLTKAKHLLATYCVAFGSNLSAMWILVANGW
MQSPVASEFNFETMRMEMTSFMDLWLNPIAQSKLVHTLAGGYVSGAMFVL
AISAYYLLKGRDIGFAKRSFSVASVFGFISIIATIIMGDQSAYEVGNVQK
TKLATMESEWHTQEAPASWNAFAIPNDAEMKNDFEFQIPYLGGIMATRSL
DQTYPGIHDILIENEGRVRNGMVAYGLLEELRAQKKAGQVNEETKAQFLA
TRDDLGYGLLLKRYTDKVVDATEEQIKQATRDTVPNVAPVFWSFRVMAAL
AGVIMVLLSGAFIQNLRNATTKIPLLLHALLWCLPLPWIAIECGWFLAEY
GRQPWAIYEVLPVGVANSSLSTGDLWFSIGLLCGLYTLFIVVEMYLMYKF
GRLGPSSLKTGRYYFEQSSKAGA
>MS1065 cynT, CynT protein
MKKIEQLFANNHSWALRMKEENSSYFKELADHQTPHYLWIGCSDSRVPAE
KLTNLEPGELFVHRNVANQVIHTDLNCLSVVQYAIDVLNIEHIIICGHTN
CGGIKAAMANQDLGLINNWLLHIRDIWYKHSHLLGNLSPEKRADMLTKIN
VAEQVYNLGRASIVQDAWKRGKKLSLHGWVYDVSDGFLIDQGVLATSRES
LEISYRNSIARLKTLDEEDIFRKGNKENNDEIIG
>MS1340 cysA, CysA protein
MMLEINVKKRLGQLVLNARLTIPGQGITGIFGISGSGKSSLINLVSGLIH
PDEGNIRLNDRTLIDTANNICLAPNQRNIGYVFQDARLFPHYSVKGNLCY
GIKRFNQQEFNRIVRLLGIEHLLARYPLTLSGGEKQRVAIGRALLSNPEM
LLMDEPLSALDLPRKRELLAYLEKLSQEINIPILYVTHSLDELFRLADFV
VLLDEGKVAAFDSLENLWQSPLFEPWQEQGQKSAVLSLPILNHNFSYKMT
ALLLGEQQLWVKLLNGDEGKTVRICIRSTDVSITLTVPEKTSIRNILSGK
IITLLPKGNQVDVKIALGKDEIWASVSTWAAEELQLQIGQSVYAQIKAVS
VM
>MS1261 cysA, CysA protein
MSIKIENLEKHFGSFHALKNINLQFKQNQLTALLGPSGCGKTTLLRIIAG
LEFADSGKILFEHRDVTDLSAKDRGVGFVFQHYALFQNMTVYDNVAFGLR
VKPRKERPSKEEIQQKVTALLKLVKLDWLANAYPNQLSGGQRQRIALARS
LAVQPKVLLLDEPFGALDAQVRKELRRWLRDLHQELNVTSIFVTHDQDEA
LDVSDRIVVMNQGQIEQIDEPNQIYHAPQTPFVTQFVGDVNVFHGHIDEG
NLVIGEFSHKIDPATNTTQPVNNQSATAYIRPYELTISRHADNALATGKI
THINAIGFIVRIEIESAQSDQPIEVILTKAAYSQSQYKVNEQIYLVPDKL
NLFQQMNI
>MS2212 cysE, CysE protein
MLREVWNNIRNEAKELVEHEPVLASFFHSTILKHKNLGGALSYILANKLA
TSTMPAITLREIIEETYQDDPRIIDSAACDIHAVRQRDPAVGLWATPLLY
LKGFHAIQSYRITHHLWQQNRKSLAIYLQNQISVAFDVDIHPAARVGCGI
MFDHATGIVVGETAVIENDVSILQGVTLGGTGKESGDRHPKIREGVMIGA
GAKILGNIEVGKYAKIGANSVVLQPVPEYATAAGVPAKIISKDRSAKPAF
DMNQYFIDDAEALNI
>MS1254 cysG, CysG protein
MMNYFPVFADLNNRPVLVVGGGTIAARKVNLLLKANAEVRITAQKLNAEL
TALVEQDRIIWIAKEFHGEQIRNVFLVVAATDDEQLNEQVFQVAESRQKL
VNVVDDQARCSFIFPSIIDRSPIQVAVSSGGAAPVLARLLREKLEALLPQ
HLGVMADISGKWRHKVKQQLKTITERRRFWESLFNGRFSRLLKNRQIEAA
KKELELQLTKDYQGGSVSLVGAGPGDAGLLTLKGLQEIQQADVVLYDALV
SAEILDLVRRDAELIFVGKRAQGRQVAQQETNQLLADLALQGKRVVRLKG
GDPFVFGRGGEELEVLAQQGIPFSVVPGITAAIGATAYAGIPLTHRDYAQ
SAVFVTGHRKADASDIEWQTLARSNQTLVIYMGTLKAATIAQSLQQYGRA
ASTPVAVISQGTQETQHTQIGTLKNLAELAEKAPTPALIVVGEVVSLHEK
LAWFGEDKFAQKRPHFTLDSLRIERVA
>MS1253 cysH, CysH protein
MIIKPNFWQIPQPTATDFAALAEKEQLLAQRIHEIANRHQHAKFASSLAV
EDMVITDVIAKSKAKITVFTLETGRLNPETLALADTVKKTYPDLDFRLFR
PNPIAAEKYDREKGRFAFYESVELRRECCFIRKIEPLNRALADADAWLTG
QRREQSVTRTELEFHEWDQSRGIDKYNPIFDWHEMDVWAYILKYDIPYNE
LYKQGYPSIGCEPCTKQVKAGEDIRAGRWWWENKDSKECGLHK
>MS1252 cysH, CysH protein
MTTQNQIENGHLDWLEAESIYIIREVVAECSHPALLFSGGKDSVVLLALA
RKAFQLEGRDLVLPFPLVHIDTGHNYPEVIQFRDEQVKKLNARLVVGHVE
DSIAKGTVVLRKETDSRNAAQAVTLLETIEANGFDALMGGARRDEEKARA
KERIFSFRDEFGQWDPKAQRPELWSLYNGKLHKGENMRVFPISNWTELDI
WQYIEREKLELPPIYYAHQREVVERNGLLVPVTPITPKQPGDESKVVSVR
FRTVGDISCTCPVASTAATPAEIIKETAVTEISERSATRMDDRTSEAAME
QRKKQGYF
>MS1249 cysI, CysI protein
MSDKKQKGLEWQDNPLSDNERLKEESNHLRGTILDDLEDGLTGGFKGDNF
QLIRFHGMYEQDDRDIRAERQEEKLEPRKFMLLRCRLPGGIIKPEQWIEI
DKFARDNNYYQSIRLTNRQTFQYHGVPKTKLQDMHRLLHKLGLDSIATAS
DMNRNVLCSSNPVESELHQEAYEWAKKISEHLLPRTNGYLDVWISGKKVQ
SSDSFLGQEDEPILGNRYLPRKYKTAVVLPPLNDVDLYSNDMNFVGIKDE
KTGKLAGFNVLVGGGLSFEHGNTKTYPNIALELGYVPVEDTLKAAESIVT
TQRDFGNRADRKNARLRYTIQNMTLEGFREEVERRMGRRFEAIRPFEFTE
RGDRIGWVKGIDKKWHLTCFIESGRLVDKPDLPLMTGMLELAKVHKGDFR
ITANQNIIIANVAEEDKRQIEDIARQYGLIRKITKLRENAMSCVSFPTCP
LAMAESERVLPEFIDELDKIMAKHHVEQDYIVTRITGCPNGCGRAMLAEI
GLVGKAVGRYNLHLGGNIAGTRIPRLYKENITLDEILSELDGLIARWATE
RDQGEGFGDFVLRVGIIKPVVNPVVDFWDENLIPTVAV
>MS1250 cysJ, CysJ protein
MSNTTNPLPPETEQLLAKLNPIQLAWLSGYAWAKAQGEDAGTNVTNKNAA
STLVTEDKPLNVTVLSASQTGNANGVANQLAERLKAEGVNVTRKALKEYK
AKTIGDEQFVLLVTSTQGEGEAPEEGVPLYKLLHGKKAPNLANLEFAVLG
LGDTSYPNFCQAGKDFDKRFEELGAKRLLARADADLDFKSTADKWIQDVV
EAVKAKSAVSASVVASVVSASSAQSAVNYSKENPYTAKLITNQKITARDS
AKDVRHFEFDLSGSGLQYKAGDALGVWAENDPDLINEVLGLLKIQPDESV
QLNGKSLDIHGALLSRLELTQNTPAFVKGYAQLANNKKLTALVSSDKKLA
DYVNDTPIVDVLHDFPAKISAQQFADLLRPLTPRLYSISSSPEEVGEEVH
LSVGVVRFEHEGRARTGVASGFLADRVEEDGEVKIFVEPNDNFRLPQDKS
KPIIMIGSGTGIAPFRAFLQQRQAEEAEGKNWLIFGNQHFATDFLYQAEW
QQFVKDGYLHKYDFAWSRDQAEKIYVQDKIREKSTALWQWLQEGAHVYVC
GDASKMAKDVENALLEVIAREGKLTPEDAEEYLNDLREDKRYQRDVY
>MS1770 cysK, CysK protein
MTIFADNSYSIGNTPLVRLHNFGHNGNLVVKIESRNPSFSVKCRIGANMV
WQAEKDGVLTKDKEIVDATSGNTGIALAYVAAARGYKITLTMPETMSLER
KRLLRGLGVNLVLTEGAKGMKGAIAKAEEIVASDPNRYIMLKQFENPANP
AIHQQTTGVEIWQATEGKVDVVVAGVGTGGTITGISRAIKLDQGKQITSV
AVEPAESPVITQILAGEEIKPGPHKIQGIGAGFIPKNLDLSLIDRVETVD
SDTAIKTARRLMAEEGILAGISSGAAVAAADRLAKLPEFQDKLIVAILPS
ASERYLSTALFEGIEG
>MS1251 cysN, CysN protein
MKDYIMSNLNQYAPLRFITAGSVDDGKSTLIGRLLYDSKALLSDQLLSLD
KSKSNGEVIDFSILTDGLEAEREQGITIDVAYRYFSTAKRKFIIADTPGH
EQYTRNMVTGASTANAAVVLIDASQLDFSKEEVELLAQTKRHSAILKHLN
TPHIIVAVNKMDLLNFEQNKFNAITAAYTKLAKQLGLKEVVFVPVSALQG
DNIVHKSDATPWYEGEALLTVLENLPTDDHQSEKAEDFHFPVQLVSRLDQ
DKQDDFRGYQGRIESGSIRKGDKVRIEPTGYETRITEIYSPNGLVQSAKV
GEQVTLRLADDIDISRGDTFLAENSATVATKALKATVCWFDQRALNPARK
YLLKHTTLTVFAKVSSVDRVLDVQTLSHSAQADSLKMNDIGEVQISLQKP
ITATTYAQNIATGSFILIDEATYHTVAAGMILEI
>MS0017 cysQ, CysQ protein
MQITQQLLDDVLKIASLAGEHLKTFYAKSVNVEIKTDNTPVTEADLFLSQ
FLIEKLTVLTPDIPVLSEENCNIPLAERQKWQSYWLIDPVDGTQQFINHT
GQFSIMICLVQDNQPQLGIIHAPIIGKTYYARRGLGAFLIENGCCRKLPP
LQPHNHQHIKITIGSSNPEKIRQSVQPPYKADLLLYGSSSLKSGLVAEGV
ADCYVRLGNTGEWDTAAAEVLLNEVGGKIFNLQYRPLTYNQRETLVNPHF
VMANAQLDWKKIFRFDL
>MS0622 cysS, CysS protein
MSGRDPRRLIKTQILEPSMLKIFNTLTREKEEFKPINPNKVGMYVCGVTV
YDLCHFGHGRTFVSFDVITRYLRYLGYDLRYVRNITDVDDKIIKRALENN
ETCDQLVERMIAEMHKDFDALNILRPDVEPRATKHIPEIIAMVETLIRRG
HAYVAEDGDVMFDVESFQKYGALSRQNLEQLQAGARVEIKSVKKNPMDFV
LWKMSKPNEPSWDSPWGKGRPGWHIECSAMNDKELGNHFDIHGGGSDLMF
PHHENEIAQSCCAHDGEYVNYWLHTGMLTINEEKMSKSLNNFFTIRDILT
KYDAESVRYFFLTAQYRSLLDYSEENIGLARKALERLYTALRGCETVEIP
AEDQYVIDFKTAMDDDFNTPGALAVLFELAREINKLKTEDQTKANQLASR
LKQLAGVLGLLEQAPETFLQGDAADAEVSKIEALIKRRNEARAAKDWAAA
DAARNELTAMGVVLEDGAKGTTWRKL
>MS1259 cysU, CysU protein
MPGFRRGLTVTILWLTSMIVLPLILLVITALQLKGTEIWQIITSTRVISS
ILLSFKMALAATVVNIIFGFLLAWILVRYNFRGKSLVNAFIDLPFALPTA
VAGIALASLYAPTGLIGGILAKAGVQIAYTPSGIAIALIFVSLPFVVRAI
QPVLANFDPSFEEAAHILGASKWTTLTKVIIPALLPAIIGGAGMGFARSL
GEYGSVIFIAGNVPLVSEIAPLIIMSKLDLYDVQGASVVALLMILISFIL
IFLVNWLQWAINKRITQVK
>MS1260 cysU, CysU protein
MEKTDWQKWGLISVGLLFFTIILLFPLLTVFYYALEQGIDLFIKSIQEEE
AQAAIWLTVKVALIVLPINIFIGVVMAWTIAYFNFKGKSFLTALLDLPFS
VSPVVVGLMFLLMFGIDSFFGQWLATHQIRVIFALPGIVLATLFVTFPLI
VKSLIPTMNAQGNSEEQAALILGANSWTLFRKITFPKIKWALIYGVILSN
ARAMGEFGAVSVVSGHIRGLTNTIPLYVEISYNEYQFVAAFACASLLALL
AIFTLGLQNTLTWLQKRKFNRH
>MS1341 cysU, CysU protein
MRRKFNAAYFIKISFMLSSLINYFQFSPNEINAIRLSIKVAVVAICCSLP
FAIFVAWLLARKNFWGKSLVNGIIHLPLVLPPVVIGYLLLISMGRNGIIG
RYLLQWFDFSFGFSWYGAALASAIVAFPLVVRSIRLALESVDFKLEQAAR
TLGASSWRVFFTVTLPLALAGVLAGVILGFARSLGEFGSTITFVSNIPNV
TQTIPLAMYSFIETPGAESSAARLCIVAIFISLISLLLSEWLAKRTQTKL
GQIDVRN
>MS1769 cysZ, CysZ protein
MLFPTALCMLALIRFLIYIFLGSFMKKEKEIKSGFHYFVMGWHLIGQQGL
RRFVVMPVLLNIILLSGLFWLFVSKISDMIEGVISFIPDWLSWLSGILLA
LSILMILLVFYFIFNTLSGFIAAPFNGLLAEKAEAMLTGESGENMTTMEF
IKDTPRMLAREWQKLLYSLPKYIGLFLLSFIPLIGQSLIPVLTFLFTAWM
MAIQYCDYPFDNHKISFPTMKFKLNENRIQNVTFGTFVTLCTFVPFINFV
IIPVAVCGATAMWVDTYRKQLYLDKNLQKSTAVSTASTEKPGSDIARHSN
NIRNR
>MS1434 czcD, CzcD protein
MNAQAKKDKQAHHEHHAHSQVHTEHEHSQVPKNKMILGISLAIISCYMVV
EFIGGYLFNSLTLMADAGHMANDSLSLFLALVALFLSAKAQKWFALLNGT
SLVFVAVMILIEAFKRWQAPTEMAALPMMTVAIIGLLVNILVAWIMLKSD
QENLNIKAAYLHVLADLFGSVVAIIAGLSAWLLDWQWVDVVASVILSALV
LRSGLSVIKQAITALRSDGEEFSMDTHSH
>MS1607 dAK1, DAK1 protein
MALTKQQILQWLENCNRTFNERRDYLTELDTAIGDGDHGLNMQRGFSKVM
DKLPTIKDKDIGTILKNVGMTLLSQVGGASGPLYGTFFIKGAQSAVGKEE
ISFEELVQVLKDGVAGIVSRGHAELGDKTMCDVWLPVVNQLEQEDGNQPL
DALLKSAVEKANISLQATIPLIAKKGRASYLAERSAGHQDPGATSTTYML
EALYNAVK
>MS1606 dAK1, DAK1 protein
MKKLINSVETVLDEQLQGLAKSHPQLVLNTEPVYVRRADAPVAGKVAIIS
GGGSGHEPMHAGFVGEGMLDGACPGAIFTSPTPDQMMECAMAVDSGEGVL
LLIKNYTGDVLNFETATELLADMGNQVATVLVDDDVAVKDSLYTAGRRGV
ANTVIMEKLLGAAADKGYNLNQLAELGYKLNNQGHSLGIALGACTVPAAG
KPSFTLAENEMEFGVGIHGEPGIERRPFENLDKTVQQMFDTIIENGNYER
KMRRWDCQANQWNEVTEQKQALQPGDRVIALVNNLGSVPLSELYGVYNKL
TECCEKFGLTIERNLIGSYCTSLDMQGMSVTLLKVDDEILSLWDAPVNTP
ALRWGK
>MS0960 dacB, DacB protein
MSKFTKNLFASSLLFSNIAFAQIDVQPLTQILPQGASIGFIAENINQNKI
IADHNGQTFMLPASTQKVFTALAAKLALGDEFRFETSLQTQGKVQNNQLD
GDLIVKFTGDPDLTTGQLYGLFATLKKQGVNQINGNLILDTSVFASHDRG
SGWIWNDLTMCFNSPPAAVNLDNNCFYVNLDANKSVGEFVQFNVPTQYPI
QVFGQVRVVGAEEAPYCQLDAVVHDNNRYQIKGCIARQTKPFGLSFAVQD
TDAYAAAIVQRQLRQAGIQFSGQVQQPHQPQQGTVLAQHLSKPLPELIKK
MMKKSDNQIADSLFRTIAYHTFKRPASFQLGSLALKRILSTQAKIKFGHS
IIADGSGLSRHNLVDPNTMLQALNYIARNEDKLHLMDSFPVAGVDGTISG
RGSLINPPLIKNVLAKTGSLKGVYNLAGFMTNARGERIAFVQFINGYSTG
ELENKTKRAPLVQFESKLYNALYAD
>MS1829 dacC, DacC protein
MLKNALKKTSLAIFAGLLALPLTVSAEDVNFGIVPPQVNAQTYVVMDYNS
GAVLASLNPDQRQYPASLTKMMTSYVIGDALKQGKIHNTDTITVGESSWG
KNFPDSSKMFLNLNQQVTVEQLNRGIIIVSGNDACVAMAEHVSGTTDNFI
NSMNKYAEQFGLKNTHFTTVHGLDDANQYSSARDMAIIGAHIIRDLPEEY
KIYAEKDFTFNKIKQPNRNGLLWDKTINVDGMKTGHTSQAGYNLVASATN
ADNMRLITVVMGVPTYKGREVESKKLLQWAFNSFDTFKTLEAGKAVTNQD
IYYGNQGKVQIGVLQDRFITVPKGRNADLKARFELDKKYLEAPLAKGQVV
GKVIYQLDGKDVAKVDLQAMQDVEEGGIFGKAWDWLVLTIKSLF
>MS0849 dacC, DacC protein
MELEQLGFRLFSKNTLAVISFVTEAIGRMLLGERLSSTLL
>MS0850 dacC, DacC protein
MVYDFTHNKVLESRSPNSILPIASVTKLMTANVFLENNRNPNCSSSITDE
DYDHIKGTRTKLPKYTPISCNELLKAMLVHSDNYAAHALSRSAGMSRAQF
IKKMNQKAQQLGMNSTRFSDSSGLSSSNVSSPMDLVKLAKYSLDKQLIKT
LSNTRATYIRAGRHNVFMQNTNKLVREEMFDAAINKTGYIRESGYNLVFV
NKHQCNRSTIGVISLNNASSAYRSNFTKHKLEEYGCLAANDVEINEFNDQ
DFENYDEQGLAQLIEQVAK
>MS1592 dadA, DadA protein
MLKVTTAHIHFNRDNTPVSEQFDDIYFSTADGLEESRYVFQEGNNLWRRW
LQFGENHFVIAETGFGTGLNFLAVTALFREFRTQYPDSPLKRLFFISFEK
YPMSCADLRSAHQAYPQFNSLAEQLRQNWLQPIVGCYRFHFEETVLDLWF
GDIADNLPQLGDYMVNKIDAWFLDGFAPSKNPEMWNENLYKQMFRYTKPA
GTFATFTAASAVKKGLESAGFSLQKRKGFGKKRECLQGFKPLNAEQNPAV
HTPWLLSRSATLSENTDIAIIGGGISSLFSAISLLQRGANVTLYCEDEQP
ALNASGNKQGAFYPQLSDDDIHNIRFYIHAFAYGQQQLRWAIQQGIEFEH
EFCGVALCAYDEKSAVKLAKISDYDWDTSLYQPLNQQELSEKAGLPLPCG
GGFIPQGAWLAPRQFVQNGFAFAQKCGLKLKTFEKITALSQSEKGWILHN
DKNEQFHHETVIIANGHKLKQFTQTARIPVYSVRGQVSQIPTSSQLLKLK
SVLCYDGYLTPADQAKQFHCIGASHVRDCEDRDFSLQEQQENQAKIQLNI
AEDWTKEVNTADNLARTGIRCAVRDRIPLVGNVPDFERQADEYRNIFNLR
RRKQFIPQAAVFENLYLVGALGSRGLTSAPLLGEILASMIYGEPIPLSED
ILHCLNPNRSWMRKLLKGTPVK
>MS2290 dadA, DadA protein
MLKFSYQEHIKTYYYDTRNQDFTQPTLTGGQSADVCVVGAGFGGLSAALE
LAERGKSVIVLEGARIGFGASGRNGGQAINGFEDGMDAYIDDMGLEKARK
LWEMSLEAIDIIEQRIAKYNIQCDWRKGYATLALNHRRMDDLVTIEQTSR
EIFAYDYMQLWNKAELKQYLGSDIYVGGLYDGNSGHLHPLNYCLGLAKAC
LDLGVRIFEQSPVIDLDVGKSKVIAETAEGSVTAENVVLATNAYVTSLPK
RIQRGTARKILPIDSFIIATEPLDQETANAVINNGMSVCDNNLLLDYYRL
SADNRLLFGSDSSSNKDMVQVMRNNMLHVFPQLENVKIDYGWAGPIDMTI
NAKPCLGRIASNIFYAHGYSGHGVALTGLAGRLIAEAIEGDDERFAIFES
LKSPSVYGGRIVKNLATKIGVKYYKWLDKYR
>MS1967 dam, Dam protein
MSHSGKTKHGLKHRSFLKWAGGKYRLTDNINNLFPKRRKCLVEPFVGAGS
VFLNSQFERYILADINADLINLFNTVKTDVDAYIEALKPVFFHAEANSAG
YYYARRDDFNNSTDPFFRSVLFLYLNRFGFNGLCRYNSLNEFNVPFGAYK
SHYFPEKELRYFAEKAKSAVFICADFNETFKLADDESVIYCDPPYAPLLQ
DSNFTKYAGNDFSVTHQQALAELAKQTVNERNIPVLISNHDTAFTREIYH
GAKFKRIKVQRTISQAAERRVKVNELIAVFK
>MS0265 dapA, DapA protein
MSSTRPLFYGSIVALITPMDGHGEVNYDELKKLVEYHIASGTHAIVSVGT
TGESATLSIDENVKTIQKTVEFAAGRIPVIAGTGANATSEAITMTKLLNN
SGVAGCLSVVPYYNKPTQEGMYQHFKAIAECTDLPQILYNVPGRTGSDMK
PETVGRLSKIENIVAIKEATGDVSRVKQIKELAGEDFIFLSGDDATGLES
IKLGGQGVISVTNNLAAADMAKMCELALAGNFDEAEAINQRLMGLHHDLF
IEGNPIPVKWAAYKLGLIKEPVLRLPLTTLSEAAQPKVLEALKQAGLI
>MS0067 dapA, DapA protein
MFKPQGIIAPVLTALDDNEKFNPEVYKNYINYLIKAGIHGIFPLGTNGEF
YGFNEAEKLEIIKTAIEAADGCVPVYAGTGCVTTKETVEFSKKVVDLGVD
VLSIVSPYYIAVTQDDLYRHYATIAENVTAPILMYNIPARTGNNIDYKTI
KKLAQYENIIGVKDSSGNFDNTLKYIENTDSRLSIMAGSDSLILWTLLAG
GTGAISGCSNVFPELMVSIYEYWKQGDFEKANEAQKKIRDFRNVMQMGNP
NSVVKRAAQLRGLGTGPAKEPSNCANNPVIDKALQDVFKLYD
>MS0282 dapA, DapA protein
MKKINLEKTMSIQGIIPVMLTPFMENNEIDYDGLRKLTDWYIDNGSDALF
AACQSSEILFLSLEERVKITKTVMDQVQGRIPVVASGHISDSFEQQVEEL
TAIYNTGVDAVILITNRLDPNNEGTTVLKSNFEKLLAALPKDIVLGLYEC
PVPYRRLLTDGEISYFAGFENMVVLKDVSCNLETVKRRIQLTKNSNLKIV
NANAAIAFEAMKAGSEGFSGVFNNIHPDLYAYLYKNKNSSDPMVQELANF
LAICGAAESFGYPNFAKLMHTKIGTFKHYNSRVIKDDIKVKYWAVEELLD
HIMQGSERYRNKLNLR
>MS0971 dapB, DapB protein
MTLKLAIVGAGGRMGRQLIQAVQAAEGVELGAAFERKGSSLIGADAGELA
GLGELGIKVAEDLAAEKDKFDIIIDFTRPEGSLEHIKFCVANNKKLILGT
TGFDDAGKQAIGKAAEKTAIVFASNYSVGVNLVFKLLEKAAKVMGDYSDI
EIIEAHHRHKVDAPSGTALSMGEHIAKTLGRDLKVNGVFSREGITGERKR
TDIGFSTIRAADVVGEHTVWFADIGERVEISHKASSRMTFANGAVRAAKW
LANKQIGLFDMTDVLDLNNL
>MS1177 dapD, DapD protein
MSNLQSIIEAAFERRAEITPKTVDAQTKAAIEEVIAGLDCGKYRVAEKID
GDWVTHQWLKKAVLLSFRINDNQLIDGAETKYYDKVALKFADYTEERFQQ
EGFRVVPSATVRKGAYIAKNTVLMPSYVNIGAFVDEGTMVDTWVTVGSCA
QIGKNVHLSGGVGIGGVLEPLQANPTIIGDNCFIGARSEIVEGVIVEDGC
VISMGVFIGQSTKIYDRETGEVHYGRVPAGSVVVSGSLPSKDGSHSLYCA
VIVKKVDAKTLGKVGLNELLRTIEE
>MS1784 dapF, DapF protein
MQFSKMHGLGNDFVVVDAVTQNVYFPEEVIKKLADRHRGIGFDQMLIVEP
PYDPELDFHYRIFNADGSEVAQCGNGARCFARFVTLKGLTDKKDIAVSTT
NGKMILTVQDDGMIRVNMGEPVWEPAKIPFIANKFEKNYILRTDIQTVLC
GAVSMGNPHCTLVVDDVETANVTELGPLLENHERFPERVNVGFMQVINPN
HIKLRVYERGAGETQACGSGACAAAAIGIMQGLLENKVQVDLPGGSLWIE
WQGEGHPLYMTGDATHVYDGVIKL
>MS1581 dcd, Dcd protein
MRLCDTDIERYLDEGIIEITPRPGNEKINGATIDLRLGNSFRVFRDYSAP
YIDVSGPREEVSAQLDRVMSDEIIIRDDEPFFLHPGVLALATTLESVRLP
DNIIGWLDGRSSLARLGLMVHVTAHRIDPGWEGRIVLEFYNSGKLPLALR
PNMIIGALSFEILSNHAARPYNRRRDAKYKNQQSAVASRINQDE
>MS1199 dcp, Dcp protein
MSNPLLENTPLPQFSKIKPEHIQPAIEQLIQDCRITTENLLKQPQLSWDN
FCQPLSEVNDRLSKAWSPVSHLNSVKNSNELRDAYQACLPMLSEYGTWVG
QHQGLYNAYVQLKNSPEFAGYSPAQKKAVENSLRDFKLSGISLAPEQQKR
YGEIVSRLSELSSQFSNNVLDATMGWDKVITDEEQLKGLPESALQAAKQS
AQNKGVEGYRFTLEFPSYIPVMTYCENRELREEMYRAFVTRASDQGPNAG
KWDNSAIMEEILTLRVELAKLLGFNSYTELSLATKMAETPAQVLSFLDDL
AMRSKPQGEKELADLYAFCEKEFAITELEPWDISYYSEKEKQALYAINDE
ELRPYFPEQRVISGLFELIKRIFNIRAVERQGVDCWHKDVRFFDLIDETD
EVRGSFYLDLYAREHKRGGAWMDDCIGRKIKADGALQKPVAYLTCNFNAP
VGDKPALFTHDEVTTLFHEFGHGIHHMLTKVDIGDVSGINGVPWDAVELP
SQFMENWCWEEEALAFISGHYQTGEPLPKEKLTQLLKAKNFHAAMFVLRQ
LEFGIFDFRLHDNYKPGKANQILDTLNAVKDQVSVVKAVDWARTPHSFGH
IFSGGYAAGYYSYLWAEVLSADAFSRFEEEGIFNAVTGKSYLDEILTKGG
SEEPMVLFERFRGRKPTLDALLRHKGIAN
>MS0542 dctP, DctP protein
MILFTVLNKRYRANYYFGIPFAANDRRKTIDIPPILCMENLHMFMKKKVL
TLAISGLLAATVSFSVSAKTTLKLSHNNDKTHPVHISMQAMADEVKKLTD
GEVVIRIYPNSQLGNQRESMELIQSGSLDMAKSNASELEAFEPIYGAFNV
PYLFKDSEHYYKVLRDPEIGGKILDSTKGKGFIGLTYYDAGSRSFYAKKP
IKTPADLKGLKVRVQPSPSALEMMKLMGASATPLAFGELYTALQQGVVDA
AENNPTALTLMRHGEVAKFYSEDEHTIIPDVLLISEKSWGKLTPEQQKIV
KEAADNSMMSHKDLWAKATEEEIQKAKDTMGVEFVKVDKQPFVDAVKPMH
DKALADPVIGPIVQKIDAAR
>MS2257 dctP, DctP protein
MKLKSVFSNSVLAKAEMTLRLGLEPSIESPQGVGAKEMAKVADELSKGKI
AIEFFPDQQLGTGPQMIEMVKKGELDIFQGGAGLYSSIEPRLNVFDIPYL
FDSVEQAYKVLDSDFGKEILATLEPANLKGLSFWENGIRSVTSNVKPINT
PEDLVGLKIRVMPANQVHVDLWQGVGAKPEPLPYGEIYGKLKSGELDAQE
HPIAPIYTGKFYEVQKYLSLTQHMYGPLIQVMNLEKFNALPKETQDILLK
ASYAGAVKMRQFSNENAAKFIDDMKNKGMLVNEVDTTPFKAKMRPLVEKP
YVEKNGDDWLKKINASIEADRKK
>MS0698 dctP, DctP protein
MKLKTFILSSLSIVLPLCAVSTNAIAAKVTLKLAHNLEQSHVIHKALDYM
AKEVKEKSNGELILRIYPNRQMGDARETIELLQNDALDMTKANSSELEPF
VKEMAVFTCPYLFNNDEHFKKVLYGSAGKSITDKTKNSGFTVLSSYVGGS
RNFYTKKPIYSPADLKGMKIRVISTPTTNRIIELLGGSPVPVPLGEVYTA
LQQGVIDGAENNIPSYTSTRHVEVAKYFTEDQHTSMPDYLVIANKVWNKL
DENQQKILLDAAKESEIYQQKLWDEETIHSRREAEKIGTTFIQVDKQPFR
DALIPLYNDFKQNPVFSQIIADIEAEAK
>MS0526 dctP, DctP protein
MKLFSKSIKTILSVGLLGFTINAQAETEIMVAYGNQPGEPIDKAMHFWAD
KVKEKSNGDIVFKLFPSSQLGSETEVMEQAKFGSNIITISDYGALMDIVP
DLGVINAPYISQSFEKKSKLLQSDWFKDLSAKLDQNDIHIIVPDVVYGTR
HLLTKKRVTKPADLKGVKVRVQHSRLFLETIKAMGGVPTPMSLSDVYPGL
SEGIIDGLENPAVVLFGGKFYEVAKNLSLTAHTKHMSPFVAGTAFWNTLT
PEQQQIIVDTSREMVVYGAGLINEAEKDALDKLKAAGVTINEVDLPVFEQ
SVGGVISNGFPEWSPNLYKNVQEKLEQF
>MS0049 dctP, DctP protein
MKLFSLNKLSALIAGVALLSAVTAQAETSLRFGYEAPRSDTQHIAAKKFN
DLLKEKTNGEIKLNLFPDSTLGNAQTMISAVRGGTVDLEMSSSSNFTGLV
SELNVIDIPFIFKDRTHAYQVLDGEIGQKLLSQLDAHGLKGIAFWEVGFR
GFTNSKHPVTKPEDIKGLKVRTNQNPMYIKAFSILGANPVPMPLSELYTA
LETKAVDAQEHPIGIVWSSKLYEVQKYFSFTNHGYTPLIVVMNKAKFDGL
SPELQKAIVDAAQEAGKYQRQLNLDNEQGIVEKMKKAGIEFVDNLDTAPF
KAAVEQETRKAFIDANGDSLIKQIDALGK
>MS0050 dctP, DctP protein
MKLFNLKTLATLVAGVALMSATAQAEISLRFGYEAPRSDTQHIAAKKFAE
LLKDKTKDEIKLKLFPDSTLGNAQTMISGVRGGTIDLEMSGSPNFTGLEP
KLNVIDIPFIFKDREHAFKVLDGEIGQGLLKDLESQGLKGLAYWDVGFRA
FSNSKHTVTKPEDIKGLKVRTNQNPMYIEAFSLLGGNPVPMPLSELYTAL
ETRAVDAQEHPIGIFWSSKLYEVQKFLSLTNHGYTPLIVVMNKAKFDGLS
PELQQAVLDSAKEAGAFQRQLNIDNEKEIIGKVRKEGVEVTEQIDQAPFK
AVIEEKVRKTFIDKYGKDLVEKIDALAQ
>MS1877 dcuB, DcuB protein
MLYLEFLFLLLMLYTGSRFGGIGLGVISGIGLVIEVFILRMPLGKAPIDV
MLVILAVVTCASILEAAGGLKYMLQIAERVLRSNPKRVTILAPMVTYVMT
FMLGTGHSVYSVMPIIGDIALKNKIRPERPMAVSSVASQLAITSSPLSAA
IAYYLTQITKMPGYEHITLLNIISVTVPATFVGTMAMALYSLRRGKELED
DPEYQRRLKDPTWRDRILNTTATSLDAELPRSAKMAVWLFVLSLVTVVVI
AMLPEIRTVGVPVDGKPVKAISMSFIIQMMMLCFGGIILIATKTNPQSVP
NGVVFKSGMVACIAIYGIAWMSDTYFSYAMPEFKAAVTTMVESYPWTFAF
ALFAVSVVINSQAATAVMMLPVGISLGLPAPVLVGLIPATYAYFFIPNYP
SDIATVNFDVTGTTKIGKYYFNHSFMIPGLIGVTTACLVGYALAHMIIV
>MS2216 dcuB, DcuB protein
MSAMFLIQFAIVLLCILMGARAGGIGLGVFGGLGLAILSFGFGLKPAGLP
IDVMFMIMAVVSAAAAMQAAGGLDYMIKIATNILRRNPKYITFMAPAVTW
LFTFLAGTGHVAYSVLPVIAEVARHNGVRPERPLSMAVIASQFAIVASPI
AAAVVAVVAYLEPQGITLANVLSVTIPATLLGIFLACVFVNKIGVELKDD
PEYQRRLQDPEYVKANHADVNMDEIQLKPTAKLSVGLFLLGALLVVVMGA
LPELRPSFDGKPMGMAHTIEIVMLTIGALIIFTCKPDGTEITRGSVFHAG
MRAVIAIFGIAWLGDTLMQAHMDEVKGMVSGLVETAPWAFALALFILSIL
VNSQGATVATLFPLGIALGIPAPILIGVFVAVNGYFFIPNYGPIIASIDF
DTTGTTRIGKFIFNHSFMLPGLLSMAFSLGFGLLFANMFL
>MS1553 dcuC, DcuC protein
MDELKPVIAVAGIIATIYLLIKKYETRTVLIGVGLLMSLLTLNPMGALDA
FAKSMTSGGLIMAICSSMGFAYVMKYTQCDTHLVHLLTKPLGGLKFFLIP
VATVITFFINIAIPSAAGCAAAVGATLIPILKSAGVRPATAGAAILAGTF
GSMMSPGSSHSAMISEMSKLTITEVNLTHAPYTMVAGAIGAVMLTLLALF
FKDYGDEHRQAYLREQKEAEDKFVKVNVLFALAPLVPLVILVIGGTSLQQ
VSWLGWTQMGVPQAMLIGAIYGILVTRISPVKITEEFFNGMGNSYANVLG
IIIAAGVFVAGLKSTGAIDSAIGFLKHSNEFVRWGATIGPFLMGIITGSG
DAAAIAFNSAVTPHAVELGYTHVNLGMAAAISGAIGRTASPIAGVTIVCA
GLAMVSPMEMIKRTALGMVLAILFLALFML
>MS1664 ddlA, DdlA protein
MKPLKEQKIAVLLGGTSAEREVSLTSGDAVLTALRNQGYDAHPIDPKEYP
VAQLKEQGFERAFNILHGRGGEDGVIQGVLEQIGLPYTGCGVMTSALTMD
KMRTKMLWKGFGLPIADMEIVTRDTVDELNPLEVVERLGLPLMVKPSREG
SSVGLTKVNAVEELKNAVDLALTHDDTVLIEEWLSGIEMTVPVLDDQVLP
AVQIIPEGEFYDYDAKYISDNTRYICPAPMSEESLQELQKLVKRAYDVVG
CRGWSRIDVMTDANGNFRLVEVNTTPGMTSHSLFPKSAATVGYSFEQLVV
KILELSA
>MS1118 dedA, DedA protein
MDILIDFFINYGYLAVLLVLIICGFGVPIPEDITLVSGGIIAGLGYANPH
IMVFVSMFGVLAGDSVMYWLGRIYGVRILRFRPIRKIMTLQRLRMVRDKF
EQYGNRVLFVARFLPGLRAPIYTVSGITRRVSYPRFIFLDFLAAIISVPI
WVYLGYHGGNNHEWIEAQIRKGQMGIYAVLAIVVIFVGWKVYKSKKAKAD
KTN
>MS2201 def, Def protein
MSVLNVLIYPDERLKTIAEPVTEFNDELQTFIDDMFETMYQEEGIGLAAT
QVDVHKRVITIDITGEKTEQLVLINPELLDGEGETGIEEGCLSLPGLRGF
VPRKEKVTVKALNRQGEEFTLHADGLLAICIQHEIDHLNGIVFADYLSPL
KRNRMKEKLVKLQKQISRHQA
>MS1373 degQ, DegQ protein
MLKKIIQSAVIGLACAGFILAVLPRFSSTGQPFYSGEDVVLSFKNAVRAA
SPAVVNVYNQSLSSSSVDEKFQVNNLGSGVIMSKDGYILTNMHVIQNADQ
IVVALQNGALFEATLIGTDKLTDLAVLKIRADNLTTIPQNPDRSIHVGDV
VLAIGNPYNLGQSVSQGIISATGRNAIGDSVGRQNFIQTDASINRGNSGG
ALINSVGELVGISTLSFGKDPSDVAEGLNFAIPMNIANDVLQKIIRDGRV
IRGFFGVQSDIIFNDGSDSEPGVKVKSVVSNGPAAKAGIQPNDVILEFNG
EKANSPAQMMQVISNVRPGSVVKVLIERAGNQLELPVKIEEFPDTLPQ
>MS0993 degQ, DegQ protein
MKKTSFTLTAIALGLSVLAAPTVSVADFSSFFGGDKSESAEQTSANKAAS
NVAQSAVNSPFITNSLAPMLEKVLPAVVSIAVEGNQKIAKRSFDIPEEFK
FFFGEDFFGDNSSKSSSRKFRGLGSGVIINAEKGYVLTNNHVIDNADKIT
VLLQDGREFKGKILGKDSQSDIALVQLENPKNLTEIKFADSDKLRVGDFT
VAIGNPFGLGQTVTSGIISALGRSTGSDSGAYENYIQTDAAVNQGNSGGP
LINLNGELIGINTAILSPSGANAGIAFAIPSNMANNLAQQIGEFGEVRRG
MLGIKGGELNADLAKAFHVDAQQGAFVSEVLPGSAADKAGIKAGDVIIAM
NGQKVSSFAEMRAKIATSGAGKEIELTYLRDNKKENVKVTLQADDQAQST
ANAEAVIPALEGAELTNFNENGKKGVKLSKVAENSPAAQRGLKTGDLIIG
VNRIAIEDLTQLRKAMENKDSVIALNIERGNNNFYLLIQ
>MS0152 deoC, DeoC protein
MHPQELAKFIDHTALTAEKTAQDIIKLCDEAIENQFWSVCINPCYIPLAK
EKLAATNVKICTVIGFPLGANLTSVKAFEAQESIKAGAQEIDMVINVGWI
KSGEWDKVRSDIQAVLQACNGTLLKVILETCLLTPDEIVKACEICRDLKV
GFVKTSTGFNKDGATVEDVALMRQTVGDKLGVKASGGIRDTETAMAMINA
GATRIGASAGIAIIKGLQDNSGGY
>MS1899 deoD, DeoD protein
MTPHINAPEGAFADVVLMPGDPLRAKYIAETFLENAKEVTNVRNMLGYTG
TYKGRPVSVMGHGMGIPSCSIYTKELITEYGVKKIIRVGSCGAVRNDVKV
RDVIIGLGACTDSKVNRIRFKDNDFAAIADFDMTQAAVQAAKAKGINYRV
GNLFSADLFYTPDVEMFDVMEKYGILGVEMEAAGIYGVAAEFGAKALSIC
TVSDHIRTGEQTSSEERQLTFNDMIEIALESVLLGDQA
>MS1938 dfp, Dfp protein
MFPKLWFLGAGGVMRNLNGKRIVVGITGGIAAYKTIEFIRLLRKSNAEVR
VVLTAAAAEFVTPLTLQAISGNPVAQSLLDPQAELAMGHIELAKWADAII
IAPASADFIARFTVGMANDLLSTVCLASAAPIFLAPAMNQQMFSQAVTRQ
NLKSLAERGVKLIGPNSGFQACGDVGAGRMSEPAEIYAALCEALFLRQDL
LGIKVAITAGPTREAIDPVRYISNHSSGKMGFAIAQAFADRGAEVTLISG
PVNLAAPDKVNRINVVSARQMWQQSMKSAVENHIFIGCAAVADYRVAEVS
EQKIKKTDDNDELTLNLIKNPDIIADVAHLTENRPFVVGFAAETQNVEQY
AKDKLQRKNLDLICANDVVGGSVFGAEQNTLHLFWQNGEKVLPTDTKKAL
AKSLVQEIIELYRK
>MS0242 dgkA, DgkA protein
MEKTTGLTHFIKSAGYSIQGLKSAIKYEAAFRHELAAGLILIPAALYLAN
DKFEMALMIGSYLIVLVTELLNSALEAVVDRIGSERHELSGRAKDQGSAA
VFVAIANCVMIWLILLIF
>MS1791 dgoA, DgoA protein
MTTRCYHLYRYSIPVDSQLILRNRFLKRREGLLVQIKCKENEGWGEIAPL
PEYSRETLEQAQEQAIQWLKDWDAARSRNEKLSFDGLYPSVAFGLSCALA
EMKGSLQTEGNYQVAPLCYGDPDELYEPLDQMQGEKVAKIKVGMYEANRD
GMIADMLLEAIPDLYLRLDANRSWTPAKAAMFAKYVKPEHRPRIQFLEEP
CKTREESRRFAEQTGINIAWDESVREPDFEVVAEPHVTAIVIKPTLVGSL
EKCVSLIEKAQNLGMKAVISSSIESSFGLTQLARIARQYTPDVTPGLDTL
DLMEYQVVRRWPGSGLPVADFNSGFITEINF
>MS0689 dgoA, DgoA protein
MSTPVITEMQVIPVAGHDSMLLNLSGAHSPYFTRNIVILKDNSGNTGIGE
VPGGEKIRQTLEDAKPLVIGKTLGEYKNVMNTVRQTFNDRDAGGRGLQTF
DLRTTIHVVTAVEAAMLDLLGQHLGVTVASLLGDGQQRDAVEMLGYLFFV
GDRKKTNLAYQSQENDLCDWYRVRHEEAMTPESVVRLAEAAYEKYGFNDF
KLKGGVLDGFEEAEAVTALAKRFPQARITLDPNGAWSLDEAIKIGKQLKG
VLAYAEDPCGAEQGYSGREIMAEFRRATGLPTATNMIATDWRQMGHTISL
QSVDIPLADPHFWTMQGSVRVAQMCNEWGLTWGSHSNNHFDISLAMFTHV
AAAAPGDITAIDTHWIWQEGNQRLTKEPLQIKGGLVEVPKKPGLGIEIDM
DQVMKANELYKSMGLGARDDAMAMQFLIPGWTFDNKRPCLVR
>MS1269 dgt, Dgt protein
MQKIQLNNIWQQRFITDKPREKDHRPPYRRDRGRILHSAAFRCLQAKTQI
HAVGENDFYRTRLTHSLEVAQIGNSLVAQLKFGDSFEHLAAQLNADKTAL
QQSLKPLLPSNDLIESLCFAHDIGHPPFGHGGEVALNYMMREHHGFEGNA
QTFRIVTKLEPYTLSAGMNLTRRTILGLVKYPTLLDLSSPDYAKSDLQSN
GDPRYVKINDWRPGKGLFYDDLPMFEWLLQPLSDADRTLFGQFQQPRQSP
SDMLKTKFKSLDCSIMEIADDIAYAVHDLEDAVVVGLVNRQQWQEAEVEL
KNCRSNWIQSNIEQITEKLFSDQHYQRKNAIGALVNYFITHIRWKMTADF
AEPLLRYNAELPKGVLDALNVFKRFVFKYVIRDVETQRIEYKGQRILTEM
FQIFESDPERLLPRNTVKRWQNATDEGKHRIICDYIAGMSDAYALRLYQQ
L
>MS1361 dinG, DinG protein
MANIDQIKAAFSERGQLSSNIKDFRPRSEQLEMAEAVGKAIENKGVLVVE
AGTGTGKTFAYLTPALLSKKKTIVSTGSKNLQDQLFKRDLPTIQKALNYS
GKIALLKGRANYLCLERLDQVIAQGVLGDKSVLVDLSKVRKWNNATKTGD
LSECVELAEDSPILPQLTSTTESCLGSDCPNYGDCYVAAARKRALAADLV
VVNHHLFCADMAVKENGFGELIPNAEVIIFDEAHQLPDIASQYFGQSITS
RQLFDLCKDINIVYRTEIKDMPQLGVASDHLLKMVQDFRLLLGEGNNRGN
WREWLVKPDVQKGFKVLQEKLDFIADVVKLALGRSQTLDSIFERISALKA
QLVRLSDTSVTGYCYWFETFNRQFGLHITPLTVSDKFGEQMNNHESAWIF
TSATLEVGGSFNHFRQRLGIRATDEKVLQSPFNYPEQALLCVPRYLPGSN
QNHTMTKLAEMLLPVIEANKGRCFVLCTSYFMMKGFAEYFREHSGLSILL
QGEISKTKLLEQFVSEEHSVLVATSSFWEGIDVRGDALSLVIIDKLPFTS
PDEPLLKARVEDCQLQGGNPFNDIQIPEAVIALKQGGGRLIRDVTDSGAV
IICDSRLVTRPYGETFLKSLPNAKRTRDLNKVVEFLKSIQQNRT
>MS1135 dinP, DinP protein
MHKLRKIIHIDMDCFYAAVEMRENPALRDKPIAVGGSVQQRGVLTTCNYP
ARKFGLHSAMPTGQALKLCPDLILLPVNITLYKQVSHQIKQIFHRYTDNI
EPLSLDEAYLDVTDCVQCSGSATWIAEEIRRAIFNELHLTASAGVAPLKF
LAKIASDQNKPNGIFVITPGEVDNFVKTLPLSKIPGVGKVTGQKLLQMGL
KTCGDVQKLDLTVLLNRFGKFGQRIWQYSHGIDEREVQSHWQRKSVGVED
TLLRNITDIEQGIVELERLYPILEQRIKRACPDIPFERFRKLGVKLKFED
FQVTTLEKSAVEFKRENFIVLLRQIWQRRQGRAIRLVGLQVTIPEQKAEQ
QMSLW
>MS1722 djlA, DjlA protein
MNNPFALFDLPIEFQLDQNRLSERYLALQKALHPDNFANSSAQEQRLAMQ
KSAEVNDALQILKDPILRADCIIALNTGEQQNTEEKSTQDMAFLMQQMQW
REQLEEIENTQDIDGLMTFSAEIEQSNKEKISEISTALSMKDWQQAKLIN
DRLRFIKKLMTEIERIEDKLADF
>MS1902 djlA, DjlA protein
MERTMNFIGKILGFIIGYRFGGLFGGIAGLILGHIADKKLYELGSVNSSF
FSKKITRQSLFMQTTFAVLGHLSKAKGRVTEEDIQLANNLMSQMQLDVAN
RQLAQNAFNRGKEADFPVREVIREFRIGCGQRADLLRMFLHIQVQAAFAD
SNLHNNEKELLFVIAEELGLSRFQFDQMLAMEMAARQFTQGGFYRQQQYQ
QQSHQQYNQENYQNSYRTSSGPTVEDAYKVLGVNAGDNQQTVKRAYRRLM
NEHHPDKLVAKGLPKEMMEMAKEKAQQIQAAYDLICKVKGWK
>MS0927 dksA, DksA protein
MTTNTNKASLSLLALAGVEPYKEKAGEEYMNEAQLLHFKKILEAWRNQII
QETTRTVSHMQDEAANFPDPADRATQEEEFSLELRTRDRERKLMKKIEST
LKKLETEDFGYCDSCGVEIGIRRLEARPTADLCIDCKTLAEVREKQMGY
>MS1568 dltE, DltE protein
MAILITGASAGFGKAACITLVKAGYKVIGAARRLEKLTELKQQLGENFYP
LQMDVSQTAEIDSALASLPADWAEIELLVNNAGLALGLEPAYKVNFDDWL
TMINTNIIGLTYLTRQILPQMVERNKGHIINLGSIAGTYPYPGGNVYGAT
KAFVKQFSLNLRADLAGTAVRVSNIEPGLCGGTEFSNVRFKGDDEKAANV
YKNTLSIQPEDIANTILWIYQQPAHVNINRIEIMPISQSSGALNVVRE
>MS2336 dmsC, DmsC protein
MATIIVIYQGFGLSQIHSSAQQAVALVPDFAVNQVIRLCLLAAAGMVLLK
SKQPLLLSIAVILALFAEMIGRELFYSLHMTVGMA
>MS1878 dnaA, DnaA protein
MSEHQLPLPIHQIDDETLDNFFVGHNDLLVDSLSKNIACLKQQFFYVWGA
EGSGKSHLLKAVSNQFLLQNRPAIYVPLSKSQYFSPAVLENLEYQDAVCL
DDLQLVVGNEEWEIAIFDLFNRIKEKENTLLLISANQSPNALPIKLPDLA
SRLTWGEIYHLNVFTDEEKILVLQRNAHERGIELPDETANFLLKRLDRDM
HTLFDALLKLDKASLQAQRKLTIPFVKETLGL
>MS0485 dnaA, DnaA protein
MERDLSQLWQNCLLQLQDQISSSDFGLWLRPLQADTSMPNTIVLYASNMF
VKSWVENNYLAQITKIAQDLSNNTDLVIKVQEGSKPAARKVVAQQEIANT
PVQHSAPMPENEPQAAFRSNLNQHHLFENFVEGKSNQLARAVGQKVANRP
GDKSANPLFLYGGTGLGKTHLLHAVGNGIIAGNSNARVVYIHAERFVQEY
VKALKAERIENFKKFYRSLDALLIDDIQFFAGKDGTQEEFFNTFNSLFEG
EKQIILTSDRYPREIEKIDDRLKSRFSWGLSIAIEPPDLETRVAILMKKA
EEKNIYLPEEVAFFIGQKLRTNVRELEGALNRVHANADFTGKAITIDFVR
ETLKDMLALQDKLVTVENIQKMVAEYYRIKVSDLKSKNRSRSIARPRQLA
MALAKELTNRSLPEIGKAFGDRDHTTVLHACRTIAALRDDDNNIQEDWSN
LIRTLSA
>MS1183 dnaB, DnaB protein
MVYDIAVFSVLIESFFMARQPSQSPDKQTAQINIPPHSIEAEQAVLGGIM
LNNSHWENVVEHVITEDFYTAAHRLIFREMEELARQNHPIDLITLDQALK
NKGVVEDVGGFAYLAELSKNTPSAANIIAYADIVREKAVLRELIGVGNTI
AQSAYSPKGREVKEILDEAEREVFKIAEKRSAENEGPENILNVLERTIDK
IEFLSKNQHANGGVTGVTTGFKDLDKKTAGLQPSELIIVAARPSMGKTTF
AMNLCENAALSSEKPVLIFSLEMPADQIMMRSLASLSRVDQTKIRTGQIT
EDDEWARISSTMGMLTNKPNMYIDDSAGLTPTELRSRARRVYRENGGLSL
IMIDYLQLMRAPGFDNRTLEIAEISRSLKALAKELEVPVVALSQLNRTLE
NRTDKRPVNSDLRESGSIEQDADLIMFIYRDEVYHETTEENHNVAEIIIG
KQRNGPIGRVRLTFQGQYSRFDNYAGGHQFNDDDY
>MS0574 dnaE, DnaE protein
MPEPRFVHLRVHSDFSMIDGIAKVKPLVKTCVQENMVAMALTDFTNFCGL
VKFYGEALGSGIKPIMGADVSVKSDLCGDEHFELTLLAKNNAGYKNITLL
LSKAYQRGYEDVPYIDQDWLAEYNEGIIVLSGGRKGDVGKKLLKTGAADE
VESAVGFYQKYFPDHYYLSLSRTGHNEEETYIKTALKLAEKHNLPVVATN
DVVFLKSEDFEAHEIRVAIHDGFTLDDPKRPKLYSDRQYFRSEQEMCELF
ADIPSALENTLLIAQRCNVTIRLGEYFLPQFPTGELSTEDYLIKRAKDGL
EERLKVLFPDEKEREEKRPAYDERLDTELGVINQMGFPGYFLIVMEFIQW
SKDNNIPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMDNRDKVIEHVADMYGRGAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFDAEPQLQQIYDSDEEVKALIDMARK
LEGVTRNAGKHAGGVVISPTLITDFSPLYCDSEGKHPVTHFDKNDVEYAG
LVKFDFLGLRTLTIIKWALDMINARMDRDGKPHIDINHIPLDDPESFNLL
LKSETTAVFQLESRGMKDLIKRLQPDCFEDIIALVALFRPGPLESGMVQN
FIDRKHGREEVAYPDAQYQHECLKPILEPTYGVIVYQEQVMQIAQELAGY
TLGGADLLRRAMGKKKPEEMAKQRSVFEKGAIEKGIDGELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKTHYPAEFMAAVMTSEMDNTDKIVGL
YDECLRMGLTVTPPDINTGKHHFSVNDHGEIVYGIGAIKGVGEGPIEALV
SAREKGGIFKDLFDLCARVDLKKINRRTFESLIMSGAFDKLGPHRAALSK
NLEDALKASDQHAKDEAAGQADMFGVLTESPEEVEIAYANTPRWSEKQIL
DGERETLGLYLSSHPISRYLKELSHYSPNRLKDLVPNIRGQVSTASGLVV
ASRFAVTKKGNRLGIATLDDRSGRLDITLFAEALEKFGEKLQKDSVVVVS
GQVSFDDFTQGLRMSVRDLMTLDEARSRYAKSLAISLSQQQITPQFLKRF
KSVIEPYSGGTMPINVYYQSPQGRALLKLGIQWYIKPTDELLSELVNMLG
ESAVELEFE
>MS1761 dnaG, DnaG protein
MGVPIPRSFINDILAKADIVDVVNSRVKLKKAGTNNYQACCPFHHEKTPS
FTVSKNKQFYHCFGCGAHGNAIGFLMEYDKLEFLEAVEELANFLGLEVPR
EAGSDKKFEKSQPHYQNKRNLYELMHDIAEFYRQQLPHSIPAQAYLQKRG
LSEEVIERFAIGFVPDSFNAVLRRFGTTKAEQQKLFDLGMLSRNDRGDIY
DRFRNRIMFPIRDRRGRTIAFGGRVLTDERPKYLNSPETLTYHKGNEIYG
LYEALQINDSPEMLLVVEGYMDVVALAQFGVNYAVASLGTATTAEQIQLI
FRASEQIVCCYDGDRAGREAAWRALENALPYLQDGRQLKFVFLPDGEDPD
TYIRQYGKDAFEDYIQKALSLSDFMFTHLIEQVDLSSKEGKSKLAALAVP
LIKRIPGQMLRLYLRNILAQKLGIIDQTQLESLIPSKIEQPEAAIEKSPA
VKRTPMRLLIGLLLQNPQLAQLDYDLEPLKSLNEPGFELFYALTKLCRDN
MGITMGQILEYWRDSQYSKPLEILAIWDHLVTDDKIQETFLETLLYLYVR
FTDQNIERLIAKDRSTGLSPEEKQELAQLLARPQQNNS
>MS0899 dnaJ, DnaJ protein
MGERAHPTLVTKIMAKQDYYETLGVQKGADEKEIKRAYKRLAMKYHPDRT
NGDKAAEEKFKEVNEAYEILMDKEKRAAYDQYGHAAFEQGGFGGGAGGFG
GGFGGFGGFEDIFSEMFGGGASRQRVVRGEDLRYDIEITLEEAVRGTTKD
IKINTLAACDHCDGSGAEKGSKVETCPTCHGHGRVRRQQGFFMTETTCPT
CQGSGKKIEKPCKHCHGDGRVHKKKNLSVKIPAGVDTGNQLRLSGEGAAG
ENGAPAGDLYVVIHVKDHHIFERDGSNLYCEVPISFTMAALGGEIEVPTL
DGRVKLKIPAETQTGKLFRMRGKGVTSTRAGYAGDLICKIIVETPVKLNE
EQKELLRKFEESLEGQSKQRPKSSSFLDGVKKFFDNLGK
>MS0898 dnaK, DnaK protein
MNLTRRIKMGKIIGIDLGTTNSCVAVMDGDKPRVIENAEGERTTPSIIAY
TQDNEVLVGQPAKRQAVTNPKNTLFAIKRLIGRRFEDQEVQRDVNIMPFQ
IIKADNGDAWVDVKGDKLAPPQISAEVLKKMKKTAEDFLGETVTEAVITV
PAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALAYGLDKGKGNQTIAV
YDLGGGTFDLSIIEIDEVGGEKTFEVLATNGDTHLGGEDFDNRVINYLVD
EFKKEQGVDLRNDPLAMQRLKEAGEKAKIELSSAQQTDVNLPYITADATG
PKHLNIKLTRAKLEALVEDLVARSMEPVKVALSDAGLSVSQIDDVILVGG
QTRMPLVQQKVAEFFGKEPRKDVNPDEAVAVGAAVQGGVLAGNVTDVLLL
DVTPLSLGIETMGGVMTTLIEKNTTIPTKKSQVFSTAEDNQSAVTIHVLQ
GERKQASANKSLGQFNLEGINPAPRGMPQIEVTFDIDADGIIHVSAKDKG
TGKEQQITIKASSGLSDEEIQQMVRDAEANAEADRKFEELVQARNQADAL
VHSTRKQLTEAGDKLSADDKAPIEKAVNELEAAAKGEDKAEIEAKIQALI
QVSEKLMQAAQQQAQADAGAQQAQGNNGGDDVVDAEFEEVKDNK
>MS1721 dnaK, DnaK protein
MALLQIAEPGLMAAPHQHKLAVGIDLGTTNSLVATVRSAHTEILLDEKDR
PLVPSIVHFGDNNEITVGYEAGELASIDPQNTVISVKRLIGRSLEDVQAR
YPNLPYRFEASENGLPLISTRKSAVSPVEVSSEILKKLTALAKRRLGGEL
QGAVITVPAYFDDAQRQSTKDAAKLAGLNVLRLLNEPTAAAIAYGLDSGK
EGVIAVYDLGGGTFDISILRLSKGVFEVLATGGDTALGGDDFDHLVADWI
TEQSGISPQDDKQKRQLVELATRLKIQLTDNETVAIQYQNWHGKISRNQF
NQLIQPLVKRSLISCRRALKDANVTADEVNEVVMVGGSTRVPFVREQVGE
FFKRQPLTSIDPDKVVALGAAVQADILVGNKPDSEMLLLDVIPLSLGIET
MGGLVEKIIPRNTTIPVARAQEFTTFKDGQTAMTVHIVQGEREMVADCRS
LARFTLRGIPPMAAGAAQVRVTYQVDADGLLNVTAMEKSTGVQSSIQVKP
SYGLTDDEITQMLKASMDNAKQDIDARLLAEQRVEAKRVIESVLSALSHD
RDLLNDEELSAIKKALVELDKLQQQNDTLAIKQGIKDLDAATQEFAARRM
DKSIRSALTGHSVEDI
>MS0486 dnaN, DnaN protein
MRERAMQFIVSRDNLLKPLQQVCGVLSSRPNIPVLNNVLLQIADDCLTIT
GTDLEVELSTQAKLISGTEGKFTIPAKKFLDICRSLPDEAEIHVTFEEER
AIVRSARTKFNLATLPAEEYPNLADWQSEVDFTTEQATLRRLIEATQFSM
ANQDARYFLNGMKFETEGNLLRTVATDGHRLAVCTIALEQDLQNHSVIVP
RKGVLELARLLEATDAPARLQIGTNNLRVQLANVVFTSKLIDGRFPDYRR
VLPRNADHILEADWDVLKQAFVRAAILSTERFRSVRLQLDQNQMKITATN
PEQEEAEEIIDVSYSGNEMEVGFNVSYILDVLNALKCQRVRMRLTDASSS
CLIENCEDASAEYVIMPMRL
>MS1570 dnaQ, DnaQ protein
MKVEIDLNRQILLDTETTGMNQFGAHYEGHCIIEIGAVEMINRRYTGRKL
HLYIKPDRLVDPEAIKVHGITDEMLEDKPVFTEVAQEFIDFIKGAELLIH
NAPFDVGFMDYEFRKHHIDVKTADICSVTDTLQLARQMYPGKRNSLDALC
DRLGIDNTKRVLHGALLDAEILGDVYLVMTGGQTSLFDDNEPELADIHSA
KAHILAQNADKVAHHLSLLQPTDEELQAHLEYIKLINKKSKDNCLWEKRL
GSDSNEETQH
>MS0702 dnaQ, DnaQ protein
MVTETQNETETTEKIDYNLLKNRFRGYYPVIIDVETAGFNAKTDALLELA
AITLKMDENGLLVPDQKCHFHIKPFEGANINPESIKFNGIDIDNPLRGAV
PESEAITGLFQMVRKGQKNAGCQRSIIVAHNATFDQSFVMAAADRTKIKR
NPFHPFSSFDTASLSGLMFGQTVLVKACQAAHIAFDGKQAHSALYDTERT
AELFCYMVNHLKALGGFPHIAEN
>MS0871 dnaX, DnaX protein
MSYQVLARKWRPKNFAEVVGQEHILAALSNGLRENRLHHAYLFSGTRGVG
KTSIARLFAKGLNCMDGVTAEPCGKCAHCKAIEEGNFIDLIEIDAASRTK
VEDTRELLDNVQYKPVQGRYKVYLIDEVHMLSRHSFNALLKTLEEPPEYV
KFLLATTDPQKLPVTILSRCMQFNLKALDQKQISHHLQHILKEEEIPYEM
TALDKLAKAARGSIRDSLSLTDQAIAMSNGNISRDVVRVMLGLLDDNQPI
EILYALQQGNGENLMKVIQAVADKGGNWDELLIEVGETLHQIAMQQLLPS
TSNDETQIGFLAKHIAPEDVQLFYQIVVNGRKELAFAPNPRIGVEMTLLR
ALAFHPKLVQSQPSQQEQLSNVQTYVQSAVKKTENLVDMPVVSQSIKAKY
ESPAHSAAANAEQPSSAALSALEQIQKLRSQASGNGEKKNVNVTSSPLTE
TDSSSLSDLSETSPKVTALPVVTMQNKSKKQADLLDRLVNLSNSKNTETE
NAEDSAENTENDSEDETNLAETYRWEWTNPELAQEETAVRPSDIKKAILQ
EKTPEVITKVIAMADERDEWTKTVSQLHLDELKLVKQIALNSVVLIQHEN
EMKLGLRSAQKHLVRDKSVEILQDALTKFYGKTINLTIDFNDDESLFTPL
DHRRQIYQELSEQAKEDLLKDKKVRLLQDMFDAKLDMDSIRPV
>MS0465 dppB, DppB protein
MQHYFIRRLIMMIPLMLLISFVAFSLMNLVPSDPAETMLRINNITVTDEA
VKEARQALGLDKPFLLRYALWLYALLQGDLGKSFLSNQNVWDEITQAFPA
TFYLAVTAFAVIFLLSLTLSLLCMLMLNSLWDKIIRGILFFFTALPNYWL
ALLFIWLFSVRLNWLPSNGLEQKSGIILPALTLSLGYIGVYVRLLRGAML
NQLQQPYVFYARTRGLSEKQILFKHILQNSLHTSYIAMGMSIPKLLAGSV
IIENIFALPGLGRLCIQAIFGRDYPVIQAYILLMAMLFLVGNFVIDWLQH
RRDPRIKRGY
>MS0855 dppB, DppB protein
MLYSFLRRLFLLLIILVILSAVSYTIFMRDPINQVFAEPYFYSGYFTYVD
SLLKGDLGITYNGGDSLLMLILTVLPPTLELCFAAMIVAFLFGVPFGLLG
AFFNKNIFGKAINAVSSLGLSVPVFWIAPILLYVAAIQHWQISAVGQINL
LYEIKPITGFATIDVWFVDEPYRTKVIQNVFQHLILPTLVLAITPTMEIT
KLIQQRTEYILAQNYVKLSITRGWPIWRILTKYVLRNSLPLVIPQIPRLI
TFVLAQGMLIEGVFGWPGIGRWLIDAVSQQDYNSISAGVIVIGLFIIVIN
ALTEILTFILDPFNKKGWYAR
>MS1367 dppB, DppB protein
MFKFILKRILMVIPTFLAITLVTFALVHFIPGDPVEIRMGERGVDPIVHA
QMMEQMGLNDPLPEQYLNYIKGVVQGDFGRSFRNNEPVLKEFFTLFPATV
ELAFFALLWSLIAGIFLGVIAAVKKDSWISHTVTALSLTGYSMPIFWWGL
ILILYVSNFLGLPAGGRLPDEYWIDFDTGFMLIDTWNSGEPGAFVAAIKS
LILPAVVLGTIPLAVVTRMTRSSMLEVLGEDYIRTAKAKGLSTTRIVIVH
ALRNALIPVITVVGLIVGQLLSGAVLTENIFSWPGIGKWIIDAINARDYP
VLQGSVLIISTIIIVVNLLVDVIYGVVNPRIRHN
>MS1366 dppC, DppC protein
MTTEITSSTPQTPLQEFWYYFRQNKGAVIGLTFIAAVFFICICAPFVSPY
DPIVQHRDALLLPPAWMENGSLSYFLGTDDIGRDILSRIIYGARLSVFIG
LLIVILSCIFGVILGLLAGYYGGLLDVIVMRLMDIMMAIPSLLLTIALVT
ILGPSLFNAAIAIAIVSVPSYVRLTRASVLNEKNRDYVVASRVAGAGVLR
LMFIVILPNCLAPLIVQMTMGISNAILELAALGFLGIGAQPPTPELGTML
AEARSFMQAASWLVTIPGVAILLLVLAFNLMGDGLRDALDPKLKQ
>MS0464 dppC, DppC protein
MSGFIKQLRSDIFAQCCLFILTMIGLAGIFAPWICTFDPATIDMQAKLLP
VSAQHWLGTDHLGRDIFSRLIWGVRSTVFYGLFAMLLTMMLGILIGMTAA
IGGKKTDEFIMRLCDVLLSFPGEIMILALVGMLGPGIEHILVAVILVKWA
WYARMIRGTVMQYTHKNYVHYSQAIGVSPWRIIRRHLLPVATAELIILAS
ADMGAVILLISGLSFLGLGVQPPTPEWGAMLSDAKNIMLLYPQQMLPAGL
AITLTVTAFNGFGDFLRDVLDPDNPLKGTNNE
>MS0854 dppC, DppC protein
MQDKEPYEFRQTETLKAIWHDFRKDRIALFGLYIFILLILTAVFAPWIAP
YASDDQFVGRELMPPSWFPNGEVTYFFGTDDIGRDIFSRLINGVSYTFGS
AAIIIIFTAVVGGMLGILAGMSSGMKSRILGHFLDAFLSVPILLIAMIIA
TLMEASLLNAMLAILLALLPHFIHEIYQAIQQELKKEYVLMLRLDGASNM
ELLRETILPNISVRYIKELIRSFVVAILDISALSFISLGAQRPTPEWGAM
IRDSLELIYLAPWTVILPGLAIIMTALVVIIFGNGLCKAISKHYE
>MS0463 dppD, DppD protein
MNKPIIRFDNFSIENPDSDRPLIAPLNLTLPPYRTLALVGESGSGKTLLG
RSILGLLPEQLNTTGNIYFQDKKIISVTGTPTVDDKQKTNEIATLEIRGK
AVSFIMQNAINAFDPLFSLQDQFCETLQKHTALSYRQALIKAQQSVSKVK
LSSALLKRLPSQLSGGQLQRMMLALTFALEPELVIADEPTSALDSLTQFE
LLPLFKQMAKERSMIFITHDLALVQELADDIAVLKRGEIVEFRAKSILFS
HPQHPYTQYLLAMRAKLNQPFARLVRKKQ
>MS0853 dppD, DppD protein
MALLDIRNLTIKVNTPNGYVSVVDNVNLTLNEGEICGLVGESGSGKSLIA
KVICNTSKDNWIITADRFRFNDVELLKLSPYKRRKLVGKEISMIYQEPLS
YLDPSKKIGQQIMQNIPSWTFKGKIWHWFGWRKRRAIELLHRVGIKDHKD
IMNSYPNEITEGEGQKVMIAIAIANQPRLLIADEPTNSLESTTQLQIFRL
LSSMNQNNGTSVLLASNDMAGISEWCHSFIVLYCGQNAESGPKENILETP
HHPYTSALLHSMPDFSQPMPLKSKLNTLRGTVPLLEQMPMGCRLGPRCPF
AQKKCIKKPPLRRIKQHEFACHYPLNLLETNRKEKDTITPLTLSPESKIS
>MS1365 dppD, DppD protein
MSLLNVNQLSVHFGDGKAPFKAVDRISYSVNKGEVLGIVGESGSGKSVSS
LAIMGLIDYPGRVSAEALSFDGVDLLSLNEKQKRKIVGADVSMIFQDPMT
SLNPCYTVGYQIMEALKAHQGGSKKERRERTVELLKLVGIPAPESRLDVY
PHQLSGGMSQRVMIAMAIACKPRLLIADEPTTALDVTIQAQIVDLLLTLQ
KQENMALILITHDLALVAEAAHRIIVMYAGQVVEEGRAEEIFKRPKHPYT
QALLRSLPEFAEGKSRLQSLQGVVPGKYDRPQGCLLNPRCPYATEHCRRV
EPDLIQLGEGKVKCHTPLNAQGEPSNV
>MS0670 dps, Dps protein
MNTKTISFPSLTLTEKSQALTADINKNATHSVPGIDVNTGHSIAEALQAR
LQGLNELALILKHAHWNVVGPQFIAVHEMLDSQVDEVRDFVDEIAERMAA
LGVAPNGLSGNLVANRQTPEYPLGRASAQDHLRIIDKFYSFNIESHRVVL
AHYGELDPISEDLLVAQTRALEKLQWFIRAHLDNGNGSI
>MS0429 dsbB, DsbB protein
MLSFFKTLSMGRSGWLLLAFSALVLELVALYFQYGMQLQPCVMCVYERVA
LGGILFAGIIGAIAPSSWFFRFLGIIIGLGASVKGFLLALKHVDYQLNPA
PWNQCAYLPEFPQTLPLDQWFPYLFKPIGSCSDIQWSFLGFSMAQWILVM
FAFYSILLAIILISQVKAGKPKHREIFR
>MS1540 dsbG, DsbG protein
MKKFVTALSLMAISMAATADNAQITTQLKKLGATNIEVKDSPISGIKTVV
TNEGVLYTTEDGKYVLQGKLFELTDKGPVDVTGKALLATLESYKNEMIVY
PAKNEKHVVTVFMDITCGYCQKLHSEIKEYNDLGITIRYLAFPRGGLGTK
TAKEMEAIFTAKDPAFALDEAEKGNPPKELKAVNITKKHYELGVQFGVRG
TPSIVTRSGELIGGYLPPKELLSALESVK
>MS0846 dsrC, DsrC protein
MLNINNTQIETDPAGYLLNLNEWNEDVAKAIAEKEGVVLTEAHWEVIYFV
REFYQEYKTSPAIRMLVKAMAQKLGEDKGNSRYLQRLFPDGPAKQATKLA
GLPKPAKCL
>MS1019 dsrE, DsrE protein
MQKLLFILNESPYGTEKTFNGLRHAVNLLEEHGKEVEVKVFCFSDAVLAG
LSGQNPNDGPNVQQTLEVLAGLGAEVKLCTSCTKARGITQLPLVKGVSLG
TLDDVSDWTLWADKVINF
>MS0159 dsrE, DsrE protein
MRYVLSVRQPVYGSQGAYLAYQFAQELICQGHLISQIFFSQEGVSNGNGL
VYPANDEFNLVKAWQTFSKKHNVPLHLCIAASQRRGVVDKLTALDPAQTN
LAEGFVLAGLGEFSKAMLEADRVITL
>MS0160 dsrF, DsrF protein
MKLAFVFRQSPHGTAISREGLDALLAATAFCDEEDIAVFFMADGVLNLLA
NQQSDLILQKDIASAFKLLDLYDIGQRYICAESMDDFALSYDDLVINCEK
IDRTLMLQKLQQAEKIITF
>MS0031 dtd, Dtd protein
MIALIQRVSQAKVDVNGQTVGQIGGGLLVLLGVEKEDSKEKADKLAEKVL
NYRIFGDKNDKMNLNVQQTDGELLVVSQFTLAADTGRGLRPSFSKGAPPQ
LANELYQYFVQKCGEKVRVETGKFAENMQVSLTNDGPVTFWLKV
>MS1937 dut, Dut protein
MKKIDVKILDSRIGNEFPLPAYATSGSAGLDLRALIEEGFDLQPGETKLI
PTGLSIYIADPNLAAVILPRSGLGHKHGIVLGNLVGLIDSDYQGPLMVSM
WNRGEQPFRIEVGDRIAQLVFVPVVQAEFNIVTDFTQTERGEGGFGHSGK
Q
>MS1928 dxr, Dxr protein
MRMPAGFLINAMKKQNLVILGSTGSIGKSTLSVIEHNPEKYHAFALVGGR
NVDLMVEQCVKFQPEFAALDDENAAKQLAEKLKSAGKKTKVLAGQKAICE
LAAHPEADQVMAAIVGAAGLLPTLSAVQASKTVLLANKETLVTCGQIFID
EVKRTKARLLPVDSEHNAIFQSLPPEAQQQIGFCPLKELGINKIVLTGSG
GPFRYTDLTEFDNITPEQAVAHPNWSMGKKISVDSATMMNKGLEYIEARW
LFNAGAEEMEVIIHPQSIIHSMVRYIDGSVIAQMGNPDMRTPIAETMAYP
GRIVSGVTPLDFYQLSGLTFLEPDYERYPCLKLAIEAFAAGQYATTAMNA
ANEIAVEAFLNRMIKFTDIARVNAKVVELIQPQQINCIDDVLAVDKQSRL
VAKEVIVSLKA
>MS1059 dxs, Dxs protein
MQNYPLLSLINSPEDLRLLNKDQLPQVCNELREYLLESVSRTSGHLASGL
GTVELTVALHYIFKTPFDQLIWDVGHQAYPHKILTGRRERMTTIRQKDGI
HPFPWREESEFDVLSVGHSSTSISAGLGIAIAAEKENAGRKIICVIGDGA
ITAGMAFEAMNHAGALHTDMLVILNDNEMSISENVGGLNNHLARIFSGSI
YSSLRDSSKKILDTMPPIKNFMKKTEEHMKGVISPISTLFEELGFNYIGP
IDGHNIEELIATLSNMKTLKGPQFLHIRTKKGKGYTPAEQDPIGFHGVPK
FDYHTGQLPKSTATPTYSQIFGEWLCETAEQDEKLIGITPAMREGSGMVE
FSNRFPDQYFDVAIAEQHAVTLAAGLAIGGYKPVVAIYSTFLQRAYDQVI
HDVAIQNLPVLFAIDRAGIVGADGPTHQGAFDLSFLRCIPNLIIMAPSNE
NECRLMLHTGYCCGKPAAVRYPRGNAIGVELEPLRKLEIGKSNLVRQGQD
IAILNFGTLLPNALDVAEKLNATVVDMRFVKPLDHERINELAKTHRTLVT
LEENTIQGGAGSAVSEVVNIQQHHVNILHLGLPDEFVAQGTQQEVLKELK
LDATGIEEQIKNFLRIA
>MS2206 ebgC, EbgC protein
MYIGDLNRNDYQRDLPKVLADVCDYLKTLDLSALENGRHEINENIFMNVM
TPTSDAAENKKSELHHRYIDIQLVISGLDGMEYSVTEPALEKYEEYHQEE
DYQLTAAEIADKNWIVVRPNQFVVFYPYEQHKPCCNVNGQAELKKLVVKV
PVALL
>MS0053 ebgC, EbgC protein
MIFGHIAKVNPKQYPQAIRFALDYLAKTDFDSMEAGRYPLKDDKIYVQVL
DLETKPKAEYLPEVHRNYLDVQYLHSGTEIMGVSTDLGNNAVAVEYNPER
DILYYAEAENEQELHCQPGNFAVFFPEDAHRAAIYNGSEKIRKIVVKIAM
SEI
>MS0566 eda, Eda protein
MAYTTAEIIEKLGALKVVPVIALDDAEDILPLAATLAENGLPVVEITFRS
AAAEEAIRLLRQTNPDILIAAGTVLTPDQVVRAKNAGVDCIVTPGFNPNI
VRLCQELNIPITPGVNNPMAIEGALELGVSAVKFFPAEASGGVKMIKALL
GPYQQLQIMPTGGINVNNIRDYLAIPNVVACGGSWFVEKSLIANKHWDEI
GRLVREVLALVR
>MS0546 eda, Eda protein
MAYTTEQIIEKFSALKVIPVIAVEEAQDIIPLVKTLSENGLPVAEITFRS
AAGEEAIRLTRQHFPDVLIAAGTVLTPAQVVAAKNAGADCIVTPGFNPNI
VKLCQQLELPITPGVNNPTAIEAALELSINAVKFFPAEASGGVKMIKALL
GPYANLQIMPTGGISTANIKDYLTIPNVVACGGSWFVDKALIKAKNWAEI
GRLVREAVELVR
>MS0508 efp, Efp protein
MRIIMATYTTSDFKPGLKFMQDGEPCVIVENEFVKPGKGQAFTRTRIRKL
ISGKVLDVNFKSGTSVEAADVMDLNLNYSYKDEDFWYFMHPETFEQYSAD
SKAVGDAEKWLLDQAECIITLWNGSPISVTPPNFVELEVVDTDPGLKGDT
AGTGGKPATLSTGAVVKVPLFIQIGEVIKVDTRSGEYVSRVK
>MS0266 elaA, ElaA protein
MLFMRIHPMNWQCKTFNQLSNIELYQILQLRSDVFVIEQQCIYRDMDNKD
LLASHLFLSKDNQIVAYCRLLPKGVSVADAAIGRVIIHEKYRGRHLAHKM
MGKAIDIIIHEWHENKIYVQAQEYLQGFYQSLGFKATSDVYLEDEIPHLD
MYWES
>MS0144 emrE, EmrE protein
MNPWILLAISILLEIIATSLLKLSDGFTKLIPTVGSMLLYGLSFYCVSIV
YRTLPVGIVYAVWSGVGIVLTAIIAYFAFGQKIDGSGLVGMLLIVGGVLI
INVFSRSV
>MS0256 eno, Eno protein
MAKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGLAAAPSGASTGSREALE
LRDGDKSRFLGKGVLKAVGAVNNEIAGAIVGKDASNQAEIDQIMIDLDGT
ENKSKFGANAILAVSLATAKAAAASKGLPLYAYIAELNGTPGVYSMPLPM
MNIINGGEHADNNVDIQEFMIQPVGAKTLREALRIGAEVFHNLAKVLKAK
GLNTAVGDEGGFAPNLGSNAEALACIKEAVEKAGYVLGKDVTLAMDCASS
EFYNKENGKYEMKGEGRSFTSQEFTHYLEELCKQYPIVSIEDGQDESDWE
GFAYQTKVLGDKVQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSL
TETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSR
SDRIAKYNQLIRIEEALERAGTPAPFLGLKAVKGQA
>MS0367 era, Era protein
MTETKPENIVQHNETTAAEQETYCGFVAIVGRPNVGKSTLLNKILGQKIS
ITSRKAQTTRHRIVGIHTEGPYQAIYVDTPGLHIEEKRAINRLMNRAASS
AISDVDLIIFVVDGIHWNADDEMVLNKLRASKAPVVLAINKIDNIKNKDE
LLPFITELSGKFNFKEIIPISAQRGNNVHNLQKVVRQSLRKGVHHFPEDY
VTDRSQRFMASEIIREKLMRFMGEELPYSVTVEIEQFKMNERGTYEINGL
ILVEREGQKKMVIGQGGQKIKTVGIEARADMERLFDNKVHLELWVKVKSG
WADDERALRSLGYMEEY
>MS2051 eriC, EriC protein
MFNLKHFKTRLGFIIHKKLRQTHRISHKSIEFICLLSGAALVALFSLAFA
KLSDLGLQWNARWSAHYPLAVWFILPLGLAALSWFTAKFTPYVGGSGIPQ
VIAAISLPHGKNKNRLVEFWQTLWKIPLTFLAMLIGASVGREGPSVQVGA
AVMLAWGNFCRKHNFAFRGLSANELIAAGAGGGLAAAFNAPLAGVIFAIE
ELGRGFVLRWERRILLGVLAAGFILVAIEGNNPYFPQYQGAYSSQYIYLW
VILCGVICGFFGGVFARLLAKGAAGVSPAKIRGWVRRHPIYTALLLGLML
AALGSYTKGQTYGTGYHVVTQALSGELLQPESVGIAKLAATVATYWTGIA
GGIFTPSLTIGAGIGSQIAAYAGDLIDPRLLVLLCMSGFLAGATQSPVTA
SVVVMEMTGSQPVLIWALIGCIVASFISRQINPKPFYHLGAARFRQRVQE
ESKLKNNDVNP
>MS2190 eutG, EutG protein
MIMSNAVENTVSPAQAEVNSLVEKGLVALEQFRQLNQEQVDYIVAKASVA
ALDQHGALALHALEETGRGVFEDKATKNLFACEHVVNKMRHWKTAGIISD
DDVTGITEIADPVGVVCGITPTTNPTSTAIFKSLIALKTRNPIVFAFHPS
AQQSSAHAAQIVRDAAVAAGAPENCIQWIAQPSMEGTNALMNHPGIATIL
ATGGNAMVQAAYSCGKPALGVGAGNVPAYVEKSADIKQATHDIVMSKSFD
NGMVCASEQAAIADAEIYEEFVNELKSYGVYFVNKKEKTLLEEFMFGVKA
NGANCAGAKLNADVVGKSAYWIAQQAGFEVPKKTNILAAECKEVSPKEPL
TREKLSPVLAVLKSRSTEEGLTLAEAMVEFNGLGHSAAIHTKDAALAKRF
GERVKAIRVIWNSPSTFGGIGDVYNAFLPSLTLGCGSYGKNSVSNNVSAM
NLVNIKRVGRRRNNMQWFKVPSKIYFERDSIQYLQSVPDMRRVVIVTDRT
MVDLGFVQKIAHQLESRRDPVSYQLFADVEPDPSIQTVRRGVDLIRNFKP
DTIIALGGGSAMDAAKVMWLFYEQPEIDFRDLVQKFMDIRKRAFKFPSLG
KKARYIGIPTTSGTGSEVTPFAVITEGNKKYPIADYSLTPTIALVDPALV
MTVPAHVAADTGLDVLTHATEAYVSVLANDYTDGLALQAIKLVFRYLEKS
VKENDPEAREKMHNASTIAGMAFANAFLGMNHSLAHKLGGHFHTPHGRTN
AILMPHVIRYNGTKPTKTATWPKYNYYKADEKYQDIARLLGLPAATPEEG
VKSYAKAVYDLAVRCGIKMSFKEQGLEEQAWMDARHEIALLAYEDQCSPA
NPRLPIVADMEEILTNAYYGYDESKY
>MS1802 eutG, EutG protein
MGVVLMSTYYFLPTRNVFGENAVEEVGELMRSLGGNRPLIVTDGFLAQSG
MAEQLATILRGAGLEPIIFGGAEPNPTDKNVESGIAFYHDHNCDCIISLG
GGSSHDCAKGIGLIASNGGRIQDYEGVDRSTNPMVPLMAVNTTAGTASEI
TRFCIITDTARKVKMAIVDWRVTPQIAVNDPLLMKGMPAGLTAATGMDAL
THAIEAYVSTAANPLTDAAALMAISMIQQYLPKAVANGDYMKARDKMAYA
QYLAGIAFNNASLGYVHAMAHQLGGFYNLPHGVCNAILLPYVEEFNLIGN
LNRFRDIANAMGENIQGLSTDDAALKAIAAIRRLSKQVGIPANLKELGVK
PEDFDVMAENAMKDVCMLTNPRKATKQQVIEIFQRAYDGN
>MS0069 eutG, EutG protein
MTYSLLHTNKVIAGAGCVAQITDVVNSFDATNVVIITDQGVFNAGLINEP
KMLLEQAGVNVHVISDTPPEPPVDKVNDIYKVAMQFNVEMVIGIGGGSAM
DTAKLVAILLNNHVALRDVVDGKVKFKNRGIPTLMIPTTSGTGSEATQNS
IVLVPERELKVGIVDEKMLPNCVILDPKMTTGLPKHITANTGIDALCHAI
ECYISKKCSPFTEMFALKSIELIAKSIRIAYEDGHNLQARENMLLGSYLG
GVSIATSSTVAVHALSYPLGGKYHIPHGLSNAILLPDVMKFNLDACVEKF
ARIAKAMDLNIAGLTEQEAAEAMIEELYALIRDLNIKCDLKTVGITEDIL
DELVDAGYSVRRLLDNNPKEMTKQDIRGIYKKIL
>MS2325 eutG, EutG protein
MSNRFILNETAYFGAGSIQHIVTEVQKRGFTKGLIVTDKSLIQFKVVEKV
TALLEGANLAYEIFDEVLPNPTMNVVKAGLAKFKASGADYMIAIGGGSPQ
DTAKAIGIIVNNPEFADIRSLEGTAPTKKPAVPTIAVPTTAGTAAEVTIN
YVITDEENKRKFVCVDPHSIPVVAVIDSEMMASMPPTLKAATGVDALTHA
IEGFITLGAWELSDMFHLKAIEIIARALRSSVKGEQQGVEDMALGQYVAG
MGFSNVGLGLVHGMAHPLGAFYSTPHGVANAVLLPHIMAYNADFTGDKFR
SIAKAMGVRNTEILSIEQARVAAVEAVKTLNKDVGIPATLREVGMKEEDI
PELAKAAFADVCTGGNPRPTSVNEIERLYRAIY
>MS0514 exbD, ExbD protein
MKKFDEINIIPFIDIMLVLLAIVLITASFISQGKIQVNVPKASSTVAFKA
DDLAKLLTVTEKGEIYFNDKPIQLTELEQEINGWDKEQKVTLKVDAKSSF
QDFVSITDLMAKNDIKNVAIVTVKEKGK
>MS0721 exbD, ExbD protein
MRLIFEEFYEVPMAYRRKERNIKSEINIVPFLDVLLVLVLIFMATAPIIS
QSVEVDLPDAVESQNVSNEDKVPVIIEVAGVGQYAISIAGQRTENLTEEM
VTEQTRAEFEKDPNTMFLIGGARDVPYEEVIKALNLLHLAGIKSVGLMTD
PI
>MS0121 exeA, ExeA protein
MMNGDAMLKLKQVLIDKGVSLRQLAQKMNVSPATVAQLVNHNQRVKQQWA
EFETNLARNLTALGITAPLKDLLKDEATGESLATEPAASAPKTKQDIEDD
IMLLAKQALFPATKKHFSLFRDPFAEDIRSADDVFSSADVRYVREALFQT
AKHGGFMAVVGESGAGKSTLRRDLIDRINQENAPITVIEPYIIAMEDNDV
KGKTLKAAHIAEAIINTLAPLESVKRSPEARFRQLHKVLKESVKSGYSNV
LIIEEAHSLPIPTLKHLKRFFELEDGFKKLLSIVLIGQPELKVKLSERNT
EVREVVQRCEVVELAPLDSELENYVAFKLAKVGKKVDDIFDEDAFAAVRQ
RLVAVSRNKTSASLLYPLAVGNLLTAAMNLAESLGVPKVSGEVVMGV
>MS2265 fAA1, FAA1 protein
MNRSELDFHFVNRVRQQAKMLNQATALRHKVNGGWVDISWEEFQFQIDRV
SLALLAHGIDVQDKIGIFAHNMPQWTIADLGALQIRAVTVPIYATNTAKQ
AEFIINNAEIKILFVGEQEQLDTILEIKNNCPTLEKIILMKSTAEFSPNE
SLLSWHSFMGKSADTDPNRLLERLNDARLTDLFTLIYTSGTTGDPKGVML
DFSNLAHQLKSHDLALPDVVGREDVSLSFLPLSHIFERAWVAYVLHRGAV
VCYLESTNEVRNALTELKPSLMCAVPRLYEKMYSAIQDKVIHAPLHRRML
FQWAINQGQKFAHTQKSTWRHKIADKLVLSKLRNLLGGNIKMMPCGGAKL
EGKIGEFFHAIGINVKLGYGMTETTATVSCWADKHFNAASIGRLMPNAEV
KIGENNEILVRGGMVMKGYYNNSAETAKAFTEDGFFKTGDAGEFDENGNL
YITDRIKELMKTSNGKYIAPQYIEGKLGKDKFIEQIAVIADAKKYVSALI
VPSFEALEDYAKQLNIKYQDRLELIKHSEIIKLFEKRLEELQQELAHFEQ
VKKFTLLPQAFSIKMEEITPTLKLRRKVILERYRRQIEAMYS
>MS1194 fabA, FabA protein
MTDSCTLNKKSSYTYEDLLASSRGELFGPKGPQLPAPTMLMMDRVIEMNE
TGGNYGKGYVEAELDIKPDLFFFGCHFIGDPVMPGCLGLDAMWQLVGFYL
GWIGGEGKGRALGVGEVKFTGQILPTAKKVIYRIHLKRVINRKLVMGLAD
GEVEVDGRVIYTATDLKVGLFQDMSTF
>MS0460 fabA, FabA protein
MTTENRPAKIIEAHEIMTLLPHRYPFLLVDRVVDFEEGQWLKAYKNISVN
EPCFTGHFPGQPILPGVLILEALAQSMGLLAFKTHEIKGGELFYFAGIDD
ARFKRPVLPGDRLELFVEVIKERRGITSFTGVASVDGEVACEAKLMCARR
>MS1591 fabB, FabB protein
MKRAVITGFGVISSIGNNKEEVLASLKAGKSGIEIVPSFVEMGMRSHVAG
TVKLNPAELIDRKIYRFMGDAAAYAYLSMKEAIEDAGLSEDQVSNERTGL
VIGAGIGSAHYQVKAADAARGSRGVKAIGPYAVTKTMSSSVSACLATPFK
IKGVNYSISSACATSAHCIGNAFELIQLGKQDIVFAGGAEELSWEGAAQF
DAMGAVSTKYNETPEKASRAYDADRDGFVIAGGGAVVVVEELEHALARGA
KIYAEIVGYGATSDGYDMVAPSGEGAERCMKQAMANIDTPIDYINVHGTS
TPVGDVKELGAIRNVFGEAKPAISSTKSMTGHSLGAAGAHEAIYTLLMLH
NDFIAPSINIETLDEQAEGLNIVTETKENAGLQTVMSNSFGFGGTNATLV
FKRYAK
>MS1873 fabD, FabD protein
MKKFAMVFPGQGSQSVGMLAELAEQFPVVQETFKQASEVLGYDLWQLVQQ
GPAEELNKTWQTQPALLAASVAIYRIWQQQYPELKPEVMAGHSLGEYSAL
VCAGVIDFQDAIKLVELRGKLMQQAVPEGTGAMYAIIGLDNESIINACKA
AEQGEVVSAVNFNSPGQVVIAGSKAAVERAAAACKEAGAKRALPLAVSVP
SHCALMKPAADQLAVSLESISFKAPEIAVINNVDVKAENDAEAIRTALVR
QLYSPVRWTEIVERMAKNNIEVLLEMGPGKVLTGLTGRIVKELSAQQVND
AKSLETVKEILA
>MS0543 fabG, FabG protein
MITMPFNFWYMEKDKMTLAKKHNFKDKVVVITGAGGVLCAYFAKEIAKTG
AKVALLDINLESAQKFADEINAQGYIAKAYKTNVLELDSIKQTRDAIAAD
FGTCDILINGAGGNNPKATTDNEFHELDLPPTTKSFFDLDKSGIEFVFNL
NYLGTLLPTQVFAKDMVGKKGANIINISSMNAYTPLTKIPAYSGAKAAIS
NFTQWLAVHFSHVGIRCNAIAPGFLVSNQNRALLFDEQDNPTARAHKILT
NTPMGRFGEAKELMGGILFLMDEEYASFINGVVLPIDGGFSAYSGV
>MS0563 fabG, FabG protein
MVNTLLIHFLHRRIYMNLFDLTGKVALVTGCNTGLGQGMALGLAQAGCDI
VGVNLVEPLDTKEKIEALGRKFVNIEANLMKQEGLTDVVEKAVSVFGKID
ILVNNAGIIRREDAIDFSEQNWDDVININLKTVFFLSQLVAKQFIAQGHG
GKIINVASMLSFQGGIRVPSYTASKSAIMGITRAMANEWAKYNINVNAVA
PGYMATDNTAALRADEARSKEILDRIPAGRWGTPNDLVGPCVFLASAAGD
YVNGYTVAVDGGWLAR
>MS2145 fabG, FabG protein
MQRFEQKTALVTGAGTGIGQAIAVRLAQEGAKVLVVGRTEKTLQETTALH
PNIAYAVADIEKDDDVQKIVQQLNQKYGGLDILINNAGWAPVTPISQVKI
EEYDKVFGINVRALVNLTLQCLPMLKARKGNIINMSSAICRNHLPNMSMY
AGTKAAVEIFTKIWAKELGADGVRVNSISVGPIETPIYDKTDLSNDGIQD
HIDRIRKTIPLGAFGKSEDVANVTAFLASDEARFITGSDYSVDGGFGA
>MS1412 fabG, FabG protein
MSILEKMKLTGKTAFVTGGARGIGKSVAIAFAQAGANVVIADFDIAEAEK
TAAEIAKEEGVKSIAVQTDVTDQASVNHLMDVIKQQFGKLDIAFCNAGIC
INVPAEEMSYEQWLKVINVNLNGVFLTAQAAGKLMIEQGTGGSIINTASM
SAHIVNVPQPQCAYNASKAGVIQLTKSLAIEWAKHNIRVNSLSPGYIGTE
LTLNSKDLQPLIKEWNAMAPLHRLGKPEELQSICVYLAGDTSSFTTGADF
IVDGAFTCF
>MS0955 fabG, FabG protein
MIKIIFLKCNFHLNEEQKMSELFSLKNKRILITGSTRGIGNLLANGLAEH
GAEIIIHGTRLETAEKIAADFNTKGFKAYAVAFDVTDSKAAQDTIDYIEK
EIGPIDVLINNAGIQRRYPFCEFPEKDYDDVISVNQKAVFIISQAVARYM
VKRQRGKIINIGSMQSELGRDTITPYAASKGAVKMLTRGMCVELARYNIQ
VNGIAPGYFATELTKPLVENQEFTSWLCKRTPAGRWGDPKELIGAAVFLS
SKASDFVNGHLLFVDGGMLAAV
>MS2175 fabG, FabG protein
MFKKIILTLFSGLIFTEVTMAQTKYGVGSYNTEEVAAEMEYIEKHIRPLN
PKPTKRIFITGSSAGIGELTAKMLLAKGYEVVAHARDAKRAADVKRDLPE
IKHVVIGDLAKPDEVDKIADQVNALGRFDVIIHNAGVYRGENIFQINLLA
PYVLTAKITQPQTLIYVSSNMHNGGELRLDAFNAGNVGYSDSKLQLLTLA
KSLAVRWSKVRVNAMHPGWVGTKMSGGSAPDPLRQAYETLVWLAEGTDPA
AQTSGGYFFNKQPDSHYRRDSEDSAQQAVLWQALEKITGVKLPE
>MS1406 fabG, FabG protein
MWELQRSKKMKQKEVIVAIGSGSIAQAIARRVSIGKQVLLADIKLENAEA
AAKTLREAGFEVSTTVVDVSSRASVQALVQTAVDLGAVKGVIHTAGLSPS
QASPEAILKVDLYGTAVVFEEFGKVIAAGGSAVVIGSQSSHRLAIDEISQ
AQADELATLEPEKLLELPLVQEINDSLRAYQISKRGNALRVQAEAVKWGK
RGARINCISAGIIYTPLAYDELTSSERGEFYRNMLAKSPAGRGGTPDEIG
ALAEFLFNSSYISGSDILIDGGVTASYKYGELKPA
>MS2144 fabG, FabG protein
MNNMKKLLILVGAGKGLGNAIAKEFASHDFRVALIARNAENLTAYRQEFQ
ALGYEVMTQVADALYPETLTKAINAIQAEWGTCDALVYNVGITELDNDRP
ITNELLMQRYQIDAASAYHCAMLVATPEFAAKQGAIIFTGGGFAKTFQPI
LALKPLCIDKAALNAMNIVLHHLLAPQGIFVGSVLVSNVIQPNDPKYAPD
VIAKAYWKMYCERDEFELLY
>MS1421 fabG, FabG protein
MKLQNKVALVTGGGTGIGRAIAKQMAEAGATVIIIGRREAQLQESARQHA
NIHYIVADVLNSDDITRTLNEIQQRFGKLDVVVNNAGIAPVTPIENVNLA
DFDRTFALNVRAVIDVTSQAIPYLKSTQGNIINITSGLVNNPMPMNSIYT
ASKAAVLSMTRTWAKELAPYGIRVNSVAAGATKTPLYDGLGLSETEAKDY
EATVEHIVPLGRFAEPDEIAPAVVFLASDDARYATGAHYGVDGGFGI
>MS2163 fabG, FabG protein
MNNIQGKVVIITGASSGIGEATAYKLAEQGAKIVLAARREAQLKAIADNI
KAKGGEAVYRVTDVVKPEDNQALVELAKSAFGKVDAIFLNAGLMPSAPLS
ALETDNWNRMIDVNIKGVLNGIAAVLPTFEAQKSGHVLATSSVAGLKVYP
GGTVYCGTKWAVKAIMEGLRMESAQAGTNIRTATIYPAAVQSELVAGITD
ETTSQGYRQLYDTYEIPAERVANVVAFALSQPDDTNVSEFTIGPTTQPW
>MS1874 fabG, FabG protein
MQGKIALVTGATRGIGRAIAEELATKGAFVIGTATLEKGAESISAYLGEK
GKGFVLNVADQESIESVLEQIKKEFGDIDILVNNAGITRDNLLMRMKDDE
WFDIIQTNLTSVYRLSKAMLRTMMKKRFGRIITIGSVVGSSGNPGQSNYC
AAKAGLIGFSKGLAKEVASRGITVNVVAPGFIATDMTEVLTEEQKAGILA
NVPAGHLGEPKDIAKAVAFLASEDAGYITGTTLHVNGGLYMA
>MS1871 fabH, FabH protein
MYSKILATGSYLPAQVRTNADLEKMVDTSDEWIYTRSGMKERRIAAADET
SATMGANAAAKALEMANLDPQEIELIIVGTTTNSHSYPSAACQIQGILGI
KDAISFDVAAACTGFVYALSVADQFIKSGQVKKALVIGSDLNSRHLDETD
RSTVVLFGDGAGAVILEASEQQGIVSTHLHASADKEDMLSLPHIERGEDK
SGYITMQGNATFKLAVGQLSSVVEETLEKNNLQKSDLDWLIPHQANIRII
SATAKKIRYGYVTSCINHRKIRQ
>MS1872 fabH, FabH protein
MDMSQVVLTIEKYGNNSAATVPVALDEAIRDGRIKRGQLLLLEAFGGGWT
WGSALVRF
>MS1467 fabI, FabI protein
MKAQGAELAFTYLNDKLQPRVEEFAKEFGSDIVLPLDVATDESIQKCFAD
LNKVWDKFDGFVHAIAFAPGDQLDGDYVNAATREGYRIAHDISAYSFVAM
AQAARPFLNKDASLVTLTYLGAERAIPNYNVMCLAKASLEAATRVMAADL
GKDGIRVNAISAGPIRTLAASGIKNFKKMLSAFEKTAALRRTVTIDDVGN
SAAFLCSDLSSGVTGEVLHVDAGFSITAMGELGEE
>MS0638 fadL, FadL protein
MKKAINKTFLASCILCAAGQASAAAFQLAEISTSGLGRSYAGEAAIADNA
AVIATNPALMSQFKTNQFSAGGIYVDSQIRMNGTVSANLAGQTVAQAPAS
KTSVVPGSLIPNMYFVSPLNDKFAVGAGMNVNFGLKSEYEDDYAAGVFGG
KTELTALNLNLSGSYRVTEKLSAGVGLNALYAKAEVSRNAGILADAVGNV
ASNPQALSAIVAQRPDLASKMGALSGLASGLQRDTLLTHLQDKTAWAFGY
NLGLAYDLNERNRFGIAYHSKIDIDFKDRNAVSYLPYGTTPYIGEGGLVL
HLPSYWEFSGYHKLTDKFAMHYSYKYTEWSRLKNLHATYADRSLRNDGLA
FHKDEEYKDNSRIALGATYEVDEKLTLRAGVAYDESAAPRTHASASIPDT
NRTWYSLGATYKFTPALSVDFGFAHLRGRKLDFSEEQSLAGGLVTVKADY
KSKATANLYGLNINYSF
>MS0538 fadR, FadR protein
MTDNAELRSYKKIGSILKQELIDGLYQIGERLPPERDLAEKMNVSRTVVR
EAIIMLELENLVEVRKGSGVYVINMPLTSEENQDDTYEDVGPFELLQARQ
LLESGIAEFAAIQATRSDILRLKEILNKERMTLAEDDKDYTADEEFHSAI
AEITQNEILIKLQKELWKYRTKSSMWQGLHAHITDQEYRKSWLQDHQNIL
NGIQRKNPALAKKAMWQHLENVKQKLFELSDIEDPDFDGFLFSVNPVVVG
L
>MS0431 fadR, FadR protein
MFTQKSANSSPSVLKARSPAALAEEYIVKSIWSNFYPPGTDLPAERELAE
KIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNDVWQTSGLNILDVLVRL
DSTMSPTLIANMLSARTNIAIIYIPRAFKVSYEKALASFDGLENLPETAE
SYTAFDYEILHKLAFISLNPIYGMVLNSLKGLYTRVGSYYFAIPEARALA
KKFYIELRELGKAHRLDEIPSLFRQYGRESSLIFEAAQDGLAQYLIEN
>MS0244 fba, Fba protein
MAKLLDIVKPGVLTGDDVSKVFAYAKEQGFAIPAVNCVGSDSVNTVLETA
ARVKAPVIVQFSNGGASFYAGKGIKPASGARADVLGAIAGAKHVHTLAQE
YGVPVILHTDHAAKKLLPWIDGLLDASEKEFAKTGRPLFSSHMLDLSEEP
MEENMAICREYLARMDKMGMTLEIEIGITGGEEDGVDNSGVDESKLYTQP
EDVLYVYDQLNPVSSRFTVAAAFGNVHGVYKPGNVKLKPSILGASQEFVA
KERGLQAKPINFVFHGGSGSSTEEIREAISYGAIKMNIDTDTQWAAWDGI
LQFYKANEAYLQGQLGNPEGPDSPNKKYYDPRVWLRKMEESMSKRLEKSF
QDLNCVDVL
>MS1615 fbp, Fbp protein
MKTLDEFIVDRQAEYPNAKGALTGILSSIRLVAKVIHRDINRAGLTNNIL
GFSGIDNVQGEHQMKLDLFAHNMMKQALMAREEVAGFASEEEENFVAFDT
ERGRNARYVILTDPLDGSSNIDVNVSVGTIFSIYRRVSPIGSPVTLEDFM
QPGNRQVAAGYVVYGSSTMLVYTTGNGVNGFTYDPSLGTFCLSHENMQIP
ATGKIYSINEGQYLKFPMGVKKYIKYCQEEDAATNRPYTSRYIGSLVADF
HRNLLKGGIYIYPHATNYPQGKLRLLYEGNPIAFLAEQAGGIASDGYNRV
LDIQPSQLHQRVPLFVGSKQMVEKAQDFMHQFKED
>MS0719 fcbC, FcbC protein
MNNTFQFPVRVYYEDTDAGGVVYHARYLHFFERARTEFLRTLNFSQNQLL
HEQNIAFVVKSMTIDYRFPACLDDALIVESEVVEVKGATILFSQILKRDE
LVLTTATVKVACVDLGKMKPAALPAEVKAAISK
>MS0893 fdhD, FdhD protein
MIEITKRTISFFKNLTFIKQIDKDTVSDSNNSRFEFIQKEETLAVEMPVA
LVYNGISHTVMMATPSNLEDFALGFSLAEGVIDRVSDIYGIDVEETCNGV
EVQVELATRCFVRLKDLRRTLTGRTGCGICGSEQLEQVTKKLAKLDRTFC
FELKKLDGCLALLQQAQTLGKQTGSTHAVGFFSPQGELLAIREDVGRHVA
LDKLLGWHAKQGKPQGFVLTTSRASYEMVQKTASCGIEMLIAISAATDLA
VRMAEECNLTLIGFAREGRATVYTEKVRLKI
>MS0843 fdhE, FdhE protein
MQSSQKCGKIYRTFLLGIITMSIRILPEHEIKKTATSFEQPALLFANPQN
LYERRAKRLRKLSESHPFAEYLNFAAEVSEVQLKILKMHPLPQDERLTKE
NFSLDNSIQPLHTKNWKRDVIWREYLAEILAQIKLKATNQITTTIDWLEK
ASDTEIESLADKLLAEDFSSVSSDKAVFIWAALSLYWLQLAQQIPHTTNM
ESGENLHVCPVCNAAPVASVVHFGAAQGLRYLHCSLCESEWNMVRAKCSN
CNQAEHLEYWSIDEEMAAVRSESCGDCHSYLKILFQEKDPHVEPVADDLA
TIYLDIEMEEKGFARSGLNPFMFPSEEA
>MS0889 fdnI, FdnI protein
MSKVEFTNDTKIVRHKYPARVSHWFLVIAFFMTMFTGVAFFFPDFAWLHE
ILGTPQLARAIHPITGIIMFIAFIILAFIYASHNIPERNDIRWLKGIVEV
LKGNEHGVAYNGKYNLGQKMLFWTLNLAMITLLVTGIIMWRRYFSGYFSV
TTLRIAILLHSASAFALFTGILVHIYMAFWVKGSIRAMVEGWVTVRWAKK
HHPKWFNHEIKPEIERYMLEKDKASK
>MS1028 fdnI, FdnI protein
MSKVEFTNDTKIVRHKYPARVSHWFLVIAFFMTMFTGVAFFFPDFAWLHE
ILGTPQLARAIHPITGIIMFIAFIILAFIYASHNIPERNDIRWLKGIVEV
LKGNEHGVAYNGKYNLGQKMLFWTLNLAMITLLVTGIIMWRRYFSGYFSV
TTLRIAILLHSASAFALFTGILVHIYMAFWVKGSIRAMIEGWVTVRWAKK
HHPKWFNHEIKPEIERYMLEKDKASK
>MS0969 fdx, Fdx protein
MKIYQVRIENYQLTFSHNNKASLLSELEALGLKPEYQCRSGYCGSCRVKL
KKGRVSYKELPLAFVNPDEILLCCCQAEEDLEIELLS
>MS1720 fdx, Fdx protein
MPKVIFLPHETLCPEGMVVDAAAGDNLLEVALEAGIEIEHACDGSCACTT
CHCIIREGGDSLNETTDQEDDMLDKAWGLEVDSRLSCQCQIADEDLVVEI
PKYTINHAREENH
>MS1202 fecB, FecB protein
MRILTALLLFFSLSVNAQIKIATLDWTVAETLIALNNAPVAVGDKASYKI
WVGKPALAENTQDLGLRLQPNKESLARLSVDRFINSDFFASIEPSLTAKA
PVSTVNFYQPGDTWQNIENATRQIGELIEKSEQAEQLITQTNTQLAKIGQ
TLTHFRDRPVAIVQFIDTRHLRFYDSHSLFGTILNKLGLTNAWNHSGGVW
GSENLSITALATLPKNTRLVVVKPHPANVANALKYNSLWRNLALAEDPLL
LPAIWSFGALPSAVNFAQNLQSALLNQRSETW
>MS0795 fecB, FecB protein
MIDKENSMKKTLITLAAGLVAAFGVVSAQAADIGLETFGGKQIVPENPKR
VVVLDFAALDTIREIGAKETVVGISKGRIPQYLAEFDTDKYANAGTMPEP
AFEKINEMSPDLIIASARQKKVLARLKEIAPVFYMENDYENYYPSFEQNL
LALGKIFNKESAVKEKLAQLDNRMTALAKLTAGKSALVTIVNESRISAFG
DKSRYALVYQKFGFTPIDKNLSSSTHGNSVGFEYIAEKNPDYLLVVDRTA
AITDKANNAQTVLDNALIKPTKAAKNNHIVYLNAENWYLAFGGLQSMDTM
ISEIESAVK
>MS1158 feoB, FeoB protein
MNKDKLTCFALVGAPNCGKTVLFNGLTGSNAKVANYPGVTVERREGLFVD
DPSVSIIDLPGTYSLRTTTLDEAVARDEILGKYGRKIDGIIAVADATNLR
MTLRMVLELKMLGLPMVVSLNLIDVARSRGLTIDEKKLSELLGVPVLETV
ATRKEGIRGVKNAIANLPQNSAGIDSNQVAQLLESLDSNALYAQVEDILA
QTVKTEMVMPKWHQRLDRLTLHPVWGFVLLLLVLLLVFQAVYSWSEPVMD
FIEDFFANLGEWVATLMPEGILQDLILNGIIAGVGSVLVFVPQITILFAF
ILLLEDSGYLPRAAFLLDNLLSTSGLSGRAFIPLLSSFACAVPSVMSART
IQDPRERLVTIAIAPILTCSARLPVYALIIGAVIPDQTVWGIFNLQGLVL
FGLYFIGVFSAGLVAYIMKRLARRKGSIRQFPLLMELPTFRLPNFRHIFS
SLWDKVKAFLKRAGTVIFALSVILWALVTFPGAPEGAAGAAIDYSFAGML
GSLIQPIFAPLGFTWQMCIAMIPGIAAREVVVAALGTVYAVGASGSEEAI
QNALVPIVHEQWGIPTALAFLAWYVYAPMCMATLAVIKREVNSMKKTLMI
AGYLFVLAYIFSFVVYQISIRIF
>MS1157 feoB, FeoB protein
MVLADITEGTAQAKLDNSGMNARPDNPLVDNKLSNRKAARGK
>MS1012 fepC, FepC protein
MIEIKNLSLPYGLHNINTRIPAGKLIGIMGANGAGKSTLLKAMAGILPLT
NGEIWFGGQKLSAMSAAQKNQQFAYLAQDSRVHWDLSVYDVIALGLPYQL
QATAEQTKVRSVSEKFSISHLLEKPYRQLSGGEKARVQLARCRIKDAPLL
LADEPIASLDPYYQIDIMQQLKALTPERTCVVVIHHLDLAYRFCDEVILL
HQGNLIASGETQAVLNAENLAKAFSIRAEINLATKGISGIEKIGG
>MS1201 fepC, FepC protein
MFQLEQASFAIPNRALLAPTTLTFRRGKVYGLIGHNGSGKSTLIKLMAKQ
NPLSSGEIFVRGKALRHWGSREFAREVAYLPQHLPTATQLTARELIQMGR
YAWNGLLKSNKEKDKSAVENALILTHTEKFAEQQIDVLSGGERQRIWLAM
LLAQQSNFLLLDEPLAALDIAHQVEVMKLIKKLSRELNLGVVIVIHDVNL
AAAFCDELVALHSGKLLVKGTPGQIMTTETLQRIYGLELNVIPHPQTQVP
VVFY
>MS0794 fepC, FepC protein
MAIEIKHVNKSYGSKKVVDSVSLVIPKGKITSFIGPNGAGKSTVLAIISR
LLNADSGDVLLNGKLLNEQKSADIAKQLSILKQSNHINLRLTVEELVAFG
RFPYSKGNLKKNDRTFIDNAIGYMDLEEFRHQYIDELSGGQRQRAYIAMT
LAQDTDYILLDEPLNNLDMKHSVQIMQVLRKLVTELNKTVVIVIHDINFA
SCYSDYIVAMKNGKLVRQGSIAEIMQTSVLEEIYGMEIPIQEINGNKIAV
YFKN
>MS0519 ffh, Ffh protein
MFENLSDRLSKTLRNITGKGRLTEDNIKDTLREVRMALLEADVALPVVRE
FINKVKESALGEEVNKSLTPGQEFLKIVQKELESAMGESNESLNLASQPP
AVILMAGLQGAGKTTSVGKLAKFLKERHKKKVLVVSADVYRPAAIKQLET
LAQSVDADFFPSDVKKNPVDIAKAALADAKLKFYDVLIVDTAGRLHVDGE
MMDEIKRIHEVLNPIETLFTVDAMTGQDAANTAKAFNEALPLTGVILTKV
DGDARGGAALSIRQITGKPIKFLGVGEKTDALEPFHPDRVASRILGMGDV
LSLIEDLQRSVDHEKAEKMAQKFKKGDQFTLEDFREQLIEMKKMGGMMSM
LDKLPGAKNLPEHVKNQVDDKMFVKMEAIINSMTLKERANPDIIKGSRRR
RIAMGSGTQVQDVNKLLKQFDEMQRMMKKMRSGGMAKMMRGMKGMMGGGL
GALGGLGGMFGKR
>MS1167 fhaB, FhaB protein
MNKRCYRIIFSKTLNCLVVVSELAKTVGKAVAEFSNKLLPMRFFRQKTPD
FSLHFAAFICFIGLGIIYVPQAMAKPLEIHADRSAPSGNQPTVLRTANGI
PQVDIQTPSAGGVSRNVYSQFDVAEKGAVLNNARKSTNSQLAGWVTANPN
LVRGEAKVILNEVNSKDPSQLKGYVEVAGKKADVIIANPSGLHCDGCGVI
NAGRTTLTTGQVELENGNVKGFNVRGGKVEVAGKGMDTSRVDYTDIVAGK
VKVDGGIWAKELKVTTGKNKVDRTNSKVVYVGNDSTAPSSSENMDKQPIA
YAVDVSELGGMYANQIHLVATEQGVGVNNAGKIGASAGNVHIDSNGKITN
SGYLGAQQDIAVTANNNIENKGSIYTQQGDIKLKGRDISQQGNIIAEGAA
QKKGRVQITANRDIQQSGDTLAENYIDYQAKNIKVTNNATIVAGLDFTQN
TLADKSEAGKNARFNAEKSAVINGKILSSNRTEIKAADINLTNSQLHSNH
LSATASVGSIIASDSNIYTEKSAVFSTPVSLVTQNAQLNAGHISINATQA
DNTQGTWINRDEQDLNLNLQRGLTNTRGQIATNGQLLFNGEQIDNQAGLI
SADSYQISAATFDNAKGKLIQQGVNPFNLTVNGTLTNDKGVIGYQMQNLN
TANNSMENPVHSDNIPTNTAENITANTSTSEKPIHSGTDAIDVSDKIINI
VSNVNVTEKLNNTDGYILSGSQTVLWGEGKLSNDSGTLNLSEFTWESNKN
INNQLGLISALNALSLHAAELNNNQGTIRSGKNILLSTHALSNNGGLIQS
GGNVTINTHGYNLDNSNTLSSNGNKGIVTSGELNLSNINQLNNEQGYIVS
TQAQRIQTEKLNNTRGTLATNNTQSLKVTGTLQNQDGNLYSGQLTLDSNL
LINRKGNITAASALDITVKGALDNTQGIIAAKENSVIQVATLDNQNGLIG
IEQGKLNLTAATRLTNQRGQIISQGDLRLMGGDLQNNQSGTIKSLAKLTV
NTGNQQINNQQGTLSSAGELTIQSGYLDNRQGLIYAQKSLFVDTHNQNLD
NRNSGTKGILSLSDIILQNIGQLNNSRGQIQAQQDFSVNADTINNTGNGL
LYSNTDLSLKAKSLDNRQGTVQALGNVTFDRFSTVNNSVEKNKSGSLIQA
GRALTVSALNIDNQNTKTAETVPTQGLVGQAVVLSSDAVNNRQGGIYAAQ
ILSANIKNLINNQKGEMLSGGTLNAVGSALKIQNSEGVIASVGKLAIDAA
QISDIGKIQSKNDADIALKQDLTLNGGIEVEGSLKLKTNNLTNDGSLLTG
KGLHIQTGKLVNNEKAILSSGTTLLNANSITNYGLINGSNTSLKTIDLNN
LGTGRIYGDQLSIQARELNNFELNGKSATIAARKRLDLGVGTLTNRNDSS
LISLGEIHIGGTLDSRGYATGKATAVYNPNGLIEAQDNIYINSGLVSNTH
NYFRTALKLISQETVTEYQGSGDPTIYEEGTPGLYVFNHESDHLHVPTGA
YYESWYKYHYLKSVQRTEVLPAEYEPGRIYSGKNITIRGDRVDNINSRII
AGGKLDIPANILNNKEETGVEIVTKAGCAEGSRSEACRNLPAELRGYGDD
VSLRKHSNNSPYGLHSYWRHHEKGRDSTGHSRQDYTPPQEIKEGIPLEVA
AYKEYSLPSFGKANISAINVPQSIDVQVKSAVQNEKEFKPSSLSSDIAVT
EQDVTVNNQTTGSIVKDNAVVRTQNRAIALPRSSLFMVNPQSGSGYLVET
DPAFTQYRNWLSSAYMMNALNLDSDSMHKRIGDGFYEQRLVQEQIAELTG
RVYLSGYSNQEEQYKALMTNGITAASQFQLTPGMALTGEQIARLTSDIVW
LVNKTVTLADGSTATVLYPQVYVVVQKGDINGYGALLSGDITAIQSSEMT
NSGTLAGKNLLAISAENITNRFGKMTADNTLVSAEQDLVNIGGTIEAAQI
LNVNAGRDIVVNTTTHRTQNANGETVTLGIRGGFYLTGKDGRQMWVNAGR
DINLTAGEIINNAQNALTAIQAGRDILLNSGQQSDRYENIRDAENYLKTA
NRRDIGTMISSQGSGKTVLQAGRNIEAKAAAVSAGGDVLLNAGNNVTLSS
GEEYHYVDSAFKDTGRGFLNKTTTKTRDISEQTFAIGSQLDGNNVDIFTQ
QGDVNIIGSDVVAENELNVAAKNINIAAATNQVYSENLKQVKKSGLMGSG
GIGFTIGSRSQKHVYGENTITQSDARSTVGSVNGNVSMTAENHVNIEGSD
VIAQTDKSIDIIGKSLTVEAGRDVIDSTETHEYKKSGLTVSLSTPVTDMA
LNARNSLRCSKEVKNERLSNLYQVKAAQEAVMAAQAADSTIDSINALIGD
GQMVEGDVSNPSLKISIGVGSSQSKQTSRSQQISYSGSELSAGNINLKSS
AGDINLFGSTINATKAVLDSANNINLFSLQDSYRNRSDNENSGWNAGVFV
GMNGNSFGIGIEGSAQSGKGRENTDTITQKNSYINVRQAVIRSGRDTNLK
GAVINAERLTADIGGNLLIESRQDSNVYNSEQSQSGANFAVAVYGTGTNV
NVNASMDKAKLNYAQVEEQSGFKVGRDGMDINVRGNTHLKGGLIESEAAA
NKNRFSTNTLTTEDIENHSEVSVQSVSGGLSTNMMANAVNAMRAAISVLG
TANKDDHSTTQSAVSGNIDLNIKNGEKPTALSQDTMNANKQVNRYDIEEY
KEKAELAQVIGEIGQNGITIVLQPKLDKAQQEKDEAEAILKNPNSTAGER
REAQIQFNQAQTTLNQYGKGGDIQMAIRAVTGVLQGIAGGDVNAAIVNGL
SPYANLAVKEATTDSLTGEVNLVANLMAHAVLGAIEAQITGNNAIAGAAG
AVTAEATATLLAKSLYDVGKVDSSGRIKTVNDLTEYEKDSLLVLSQVAAG
ITGGVIGDSTQSAVVSGDIGKRAAENNLFGTVLNNPQINWQAVAEGEKIK
RERDEEIRAYIKKEHPVIYQTAEGTYYFMSATGKAIYVAREMVIELAPMV
IAPEIAAGTKVYAAVSRIALSGGANVVAQKVSGQEFNWAEFGGAVVSGAI
TPSLKTTKEAIRFNAGVGMAVGLANGGDGLESATYSGIGTYMGGKISNPT
WSAIVSEVIQKIPTINESLSKEHEGK
>MS1163 fhaB, FhaB protein
MLSAQAKTFAPLGDVHVNNHTELTGGLVTSTDKAEVEGKNRFSTGTLNAT
DIQNQAETSGSAYKVSGSADINGGWTGDKKEALSAAIGYGEVDENQTATT
KSGINTANIDIRDKQEQVAKTGQTAEDMLTQVKTEISTDMATQNSGVLEN
HFDKDTVQKELDYQVKVTSEFQEITLPEIDRQMANKAAEYREEEKIFRQA
GNEQAANEMAANAEKWEMGGEYKQRVDAIANAVGLALGGVGVEGTLTGAA
SPYINEAIKAQLPEDKNRAANVIAHVIWGAVEAKLQGASATTGALSTAVG
ELSAPIVSEVLYGTSDPNLLTEEQKQFVSNLSRIAATATGAISSRAEGNR
SVQVAKDAVTSGKVAENAVENNYLSQLSDNRRIWLREQLNRDDLSSVQRE
KYEQEFIQLEQDDHTSDILVAKAKYNPESMTQSDWELYQNYATRYYFESI
RTEKPENVIADLDNILSNQYIKGYSYPYATAEKYRHELPSRWSLFGTNKS
ADEQFYTDIYSKYQNRKTYQESFDGRVAQSTAEALGYASTMMSAGTVASV
VSKVGKFTSNGINKASSAIGAFAVKYPNISGAISDGMISSGVHVGYKLST
GQDVNEYEVLGAFAGGALTRNHTLGNQIRINIGVATVSSLSKDPSGNSLG
KDYFGAIVAPIINKPFSTKDSTLGNIVGGSLGEYGGDLDNRVKDYKEVKK
ILSDGEYSK
>MS1169 fhaC, FhaC protein
MRVFTGVILSLCSACVLAVDSPNLNQLNVQSDAALQQRQEEQNKALQRQQ
VADPNIRLENRLEPSEGFPEKENPCYQISHIILTDFSPEISDFSVIPPSS
IPSSRFYWALNAIYSTRDFSLPHCLGSEGINILLKRIQNRLIEQGYITTR
VVVQPQNLQNGILVITVIPGKIGQIQLQDESSFPYATSATLWFAMPTNNG
EILNLRHLEQGLENLKRNTSADANMQLSAVEDEVGASDVIIRYKQGFPIH
LTLGLDDSGTKATGRLQGTATLSWDNMFSLNDLFYASFTKSIKRHSDNVD
EPHGSKNVSLYYSVPWKNWLLTLSGYQYRYHQSIAGAFENYQYSGKSTQL
RMNLSYLLYRNSSRKSYISFGGWARKSFNYINDVEVEVQRRRMAGWDIGL
KHIEYLGDATLQISANYKRGTGAYKALPAPEEYFDEGTSRPQIITVGIDL
NYPFNIGEQPWKFNTSWNAQWNQTPLIQQDKFSIGGRYTVRGFDGELYLS
GERGWLWRNELAWNVFNKGQELYLGIDKGNVYSRFDDLPGNSLVGGAIGL
RGKIWGLDYDYFVGVPIDKPAGFKTSHVTTGFNLNYRF
>MS0531 fis, Fis protein
MLEQQRSPSDALTVSVLNSQSQVTNKPLRDSVKQALRNYLSQLDGQDVND
LYELVLAEVEHPMLDMIMQYTRGNQTRAATMLGINRGTLRKKLKKYGMG
>MS1863 fkpA, FkpA protein
MSKQFFDSVALDSVSAKGGYGVGLQIGQQLLDSRLNVEAEAVAKGIYDVL
NNNAPALDLNEVSKALQELQQKAQDAAQAQFKQIEEDGRAFLVENAKKDG
VQVTESGLQYEILVEGNGNKPSREDTVRVHYTGTLPDGTVFDSSVSRGQP
AEFPVGGVIAGWVEALQLMPVGSKWRLAIPHNLAYGERGAGASIPPFSPL
VFEVELLDIL
>MS0157 fkpA, FkpA protein
MLFVIFVKPTEPRLSIKEIVMLKIQKFSAVALLVGAVLATSACKDDKKAQ
AAAEPAKQEAPAAAQAENSRVKDPSYAVGVLIGNDLKGLVEAQKDVIAYD
NDKILAGVAEALQGKIDLTNQDVVNTLKDIDEKLKVAAQTKAEEQAKQAK
AESEKFIAEFKQKDGVKETKSGLLYRIEKEGEGAAIKPTDSVKVHYTGKL
TNGTVFDSSVERGQPVEFLLDQVIPGWTEGLQLVKKGGKIELVIPAELAY
GEQDLGTIPPNSTLHFEVEVLDVTPAKK
>MS2250 fldA, FldA protein
MEKSIAIITGSTLGGAEYVADHLAELLENRGFSVQVENNAAFTDVAEQSL
WLIVTSTHGAGDLPDNLKPFIRQINTEDLTQVRFAVVGLGNSDYDTFCHA
VDKVENALTAQGAARLCDSLRIDVLTTDDHEQCAENWLPNFVAAL
>MS0176 fldA, FldA protein
MKTIILYSTHDGQTKKIAEYLAQNLDKGAKVVNLTELTQNLADFDRIIIG
ASIRYGRFDKNLYKFIEKHTALLQTKLGYFYGVNLTARKAGKDTPETNVY
VRKFLAKIHWKPTDSAVFAGALFYPRYKWIDRIMIQFIMKITGGETDPTK
EIEFTNWESVKNFAKKIQNMN
>MS0860 fldA, FldA protein
MAIVGLFYGSDTGNTENVAKMIQKQLGNELVDIRDIAKSTKEDIEAYDFL
MFGIPTWYYGEAQCDWDDFFPTLEQIDFTDKLVAIFGCGDQEDYADYFCD
AMGTVREIVEQRGAIIVGNWPTEGYSFESSRALINNDTFVGLCIDEDRQP
ELTAERVNTWVKQVYDEMCLAELA
>MS2202 fmt, Fmt protein
MMKPLKIIFAGTPDFAAQHLQALLNSHHQVIAVYTQPDKPAGRGKKLQAS
PVKQLAEQYNIPVYQPKSLRKEEAQAQFAQLQADVMVVVAYGLILPKAVL
EMPRLGCLNVHGSILPRWRGAAPIQRAIWAGDKQTGVTIMQMDEGLDTGD
MLHKVYCDITAEETSASLYHKLATLAPPALIDVLDELESGKFIAEKQEDS
KSNYAEKLSKEEAKLDWSLSAAQLERNIRAFNPAPVAFLTVPVNEAEERI
KVYRAEVLPHQNSAAGTVLAFDKKGLRIATAEGVLNIQQLQPSGKKPMSV
QDFLNGRADWFVLGQVLN
>MS0400 focA, FocA protein
MKSEDFKLAWMASPTEMAQTGLDVGVYKATKKQAYSFLSAISAGMFIALA
FVFYTTTQTASAGAPWGLTKLVGGLVFSLGVIMVVVCGCELFTSSTLSTI
ARFESKITTIQMLRNWIVVYFGNFVGGLFIVALIWFSGQIMAANGQWGLT
ILNTAQHKIEHTWVEAFCLGILCNIMVCIAVWMAYAGKTLTDKAFIMILP
IGLFVASGFEHCVANMFMIPMGMVIANFASPEFWQATGLNAEQFANLDMY
HLVIKNLIPVTLGNIVGGGVCIGLMQWFTSRPH
>MS1858 folA, FolA protein
MTLSLIVAATKNHVIGKDNQMPWHLPADLKWFKENTLGKPVIMGRKTFES
IGRPLPKRVNIVLSRHPFEHEGVIWKESLESAVDFLKDSAEIMLIGGGQL
FEQYLSQADKLYFTEIQTELEGDTFFPAINTDEWEISYEEYRPADENNAY
DLRFLILERKS
>MS1824 folB, FolB protein
MIDRIFIEELTVFAQIGVYDWEQQIKQRLIFDIEMAWDSSKAAETDNVSY
CLNYAEVSQFIIQYVQSKPFLLIERVANEVAEQLQKEFGIKWIKLKLSKP
KAVAEARNVGIIIERGQC
>MS1173 folC, FolC protein
MQEKHNLQATSSLSEWLSYLENSHFKAIDLGLERIKAVANELDVLNPAPF
VITVGGTNGKGTTCRLLETMLLKAGLRVGVYSSPHLLRYNERVRIQNQEL
PDEAHTQSFAYIEARKTQSLTYFEFTTLSALYLFKQAKLDVVILEVGLGG
RLDATNIVDNNLAVITSIDIDHVDFLGSSREQIAFEKAGIFRAGKPVVIG
EPDVPAAMLAHAGLLGCELACRDKDWSFAQKADSWTWQNQKVRLENLPIC
RIPLQNAATALAAVQFMPVQISEEIIRQSLQEVELAGRFQRINAERLVPL
ATLVRRSVESLPQIIIDVGHNPHAARYLAGKLIELKQKTSQKITAVCGIL
KDKDSEGVLSPLLPIIDKWHCVTLEGARGQSGSNLFVTLKNLANKQQIPF
HGESENSVESGIISAISQMDNNEILLVFGSFHTVTGFLELL
>MS1852 folD, FolD protein
MTAQVISGSALAKKVKTEVGQKIEQYVAQGKRAPGLAVILVGADPASQVY
VGSKRKSCAEIGINSKSYDLEESTSEAALLTLIDELNNDADIDGILVQLP
LPKHIDSTKVIERIAPHKDVDGFHPYNVGRLCQRIPTLRACTPYGIMKLL
>MS1851 folD, FolD protein
MIVGASNIVGRPMAMELLLAGCTVTVTHRFTTNLEGYVRQADILVVAVGK
AEFIPGNWVKEGAVVIDVGINRCEDGKLRGDVEFAAAAEKAGFITPVPGG
VGPMTVAMLMFNTLTAYENNG
>MS1043 folE, FolE protein
MSRISAEAEKVRHALIEKGIETPMIALTKSKNERRIGIENRMREVMQLIG
LDLTDDSLEETPVRLAKMFIDEIFSGLDYTNFPKITNIENRMKVSEMVLV
NDVTLTSTCEHHFVTIDGLVSVAYYPKKWVIGLSKINRVVQFFAQRPQVQ
ERLTEQILLAFQTILETEDVAVYMKATHFCVKCRGIKDTNSYTVTSAFGG
VFLEDRETRKEFLSLINK
>MS0925 folK, FolK protein
MKTVYIALGSNLNTPIEQLNSALTALNKLPQTSLSAVSSFYQSKPLGPQD
QPDYVNAVACIHTELAPLELLDYLQQIENEQGRVRLRRWGERTLDLDILL
YDDLVIKSERLILPHYDMTNREFVIIPLYEIAPNLILPQGIAIAELAKNF
ANHDMKICYKP
>MS0966 folP, FolP protein
MKLYANNKVLDLSMPKVMGILNFTPDSFSDSGRFFQLDKALAQVEKMVKA
GASIIDIGGESTRPMAEEVTLEQELERVVPLVEAVRQRFDCWISVDTSKA
QVMCESAKVGMDIINDIRALQEPDALETAVKSGLPVCLMHMQGQPRTMQT
NPHYDNVVSEVLEFLQNRTALCLQAGMNPQNIIWDMGFGFGKTVQHNYKL
LQQLSVFAAQGYPVLAGLSRKSMIGAVLDKTVEQRVTGSVTAALIAAMNG
ATILRVHDVEETMDALKIWQATLQA
>MS1653 frdB, FrdB protein
MYTVRKLMQPKQQLKLRRTQMANQAMMNVEVLRYNPEVDKEPYLRTYQVP
YDNQTSLLDALGYIKDRLDPELAYRWSCRMAICGSCGMMVNNIPKLACKT
FLRDYSGHMRIEPLANFPIERDLIVDLSHFIESLEAIKPYIIGNEMPALD
GQPHPSAELAKSRTKQTPAQLEKYRQFSMCINCGLCYAACPQFGLNPEFV
GPAALTMAHRYNLDNRDHGKAERMPIINGENGVWTCTFVGACSEVCPKHV
NPAAAINQGKLESAKDYLISMLKPKA
>MS1654 frdC, FrdC protein
MTTESKRNKYVREVTPTWWKSWSFYKFYMLRESSAIPTVWFCLVLLYGVF
CLTTANGFVEKFIPFLQNPVVVILNLISLALLLLHAFTLFQMTGEVMSGS
LGLKSEVIQKALKVLFAIVTVVALVLVCI
>MS1655 frdD, FrdD protein
MVDQNPKRSNEPPVWLMFSAGGMVSGLAFPVLILILGILLPFGIISPDNI
IAFSHHWFGKLVILALTIFPMWAGLHRLHHGMHDIKVHVPNGGLIFYGLA
AVYSFIVLFAVIAI
>MS2007 frnE, FrnE protein
MKKIKIEMYSDYACPFCYIGKSHLEQALAQFEHADKVEIVHKAYELYPQT
GETVTSTTQGRIEWKYHKTPEQALEMIRHIENLAKRAGIAMNYENVQNTN
TFKAHRLTKFAASKGKENEMYNRLMKAYFTDNLPLADRKTLLQCAEDVGL
DLAETEAFLNSNDFADSVTADETQARHIGVRSVPFFVINGVEVAGSQPPA
RFLALLQQVYAANNM
>MS1929 frr, Frr protein
MINEIKKDTQDRMEKSLEALKGHIAKIRTGRAQPSLLDAIQVDYYGSATP
LRQLANVVAEDARTLAVTVFDRSLIQAVEKAILTSDLGLNPSSAGTTIRV
PLPPLTEERRRDLIKIVKGEGEQGKVAIRNVRRDANDKIKALLKDKEISE
NDQRKAEEEIQKITDSYIKKVDEVLAEKEKELMDF
>MS2178 fruA, FruA protein
MKDKPMNIFLTQSPNLGRAKAFLLHQVLAAAVKQQNHQVVENAEQADLAI
VFGKTLPNLTALLGKKVYLVDEEQALNAPENTVAQALTEAVDYVQPAQQD
VQPATASGMKNIVAVTACPTGVAHTFMSAEAITTYCQQQGWNVKVETRGQ
VGANNIISAEDVAAADLVFIATDINVDLSKFKGKPMYRTSTGLALKKTAQ
EFDKAFKEATIYQGEETTTATETQTSGEKKGVYKHLMTGVSHMLPLVVAG
GLLIAISFMFGIEAFKDENIAGGLPKALMDIGGGAAFHLMIAVFAGYVAF
SIADRPGLAVGLIGGMLATSAGAGILGGIIAGFLAGYVVKFLNDAIQLPA
SLTSLKPILILPLLGSAIVGLAMIYLLNPPVAAAMNALTEWLKGLGSANA
LVLGAILGGMMCIDMGGPVNKAAYVFGTGMIGSQVYTPMAAVMAAGMVPP
LGMAIATWIARAKFNASQRDAGKASFVLGLCFISEGALPFVAADPVRVIV
SSVIGGAIAGAISMSLAITLQAPHGGLFVIPFVSQPLMYLGAIAVGALTT
GVLYAIIKPKQAAE
>MS1510 fruB, FruB protein
MYSKDVEITAPNGLHTRPAAQFVKEAKAFASDVTVTSAGKSASAKSLFKL
QTLGLTQGTVITISAEGEDEQNAVDHLVALIPTLE
>MS2179 fruK, FruK protein
MAKVATITLNAAYDLVGRLKRIELGEVNTVETLGLFPAGKGINVAKVLND
LDVEVAVGGFLGEDNVGDFEHLFQQQGLQDKFQRVAGKTRINVKITETDA
DVTDLNFLGYQISEQDWRKFTADSLAYCKEFDIVAVCGSLPRGVTADMFQ
SWLSQLHQAGVKVVLDSSNAALTAGLKANPWLVKPNHRELEAWVGHELPT
LKDIIDAAKQLKAQGIANVIISMGANGSLWLSDNGVILAQPPKCENVVST
VGAGDSMVAGLIYGFVNNLSQQETLAFASAVSAFAVSQSNVGVSDRKLLD
PILANVKITTIEG
>MS1080 ftn, Ftn protein
MLKKAIIDKLNEQINLEFYSSNIYLQMSAWCSNHGYEGAAAFLLRHADEE
MEHMHKLFTYVSETGGLPLLGKIDAPQNEFKSLRDVFEITLKHENLVTAK
INELVEVTFANKDYSTFNFLQWYVAEQHEEEKLFNSIIDKFNLLGEDGRS
LYFIDKELATLDLA
>MS1079 ftn, Ftn protein
MLSANVVKLLNEQMNLEFYSSNLYLQMSAWCDQKGYTGAAAFLSAHAAEE
MEHMRKLFTYLNETGSTAVIEEIEAPTHEFKSLKEVMELTYQHELHITSK
INELVGKTFEEKDYSAFNFLQWYVAEQHEEEKLFNGILDKFNLVGNEGKS
LFFIDQELAKLAADH
>MS1662 ftsA, FtsA protein
MAKIVESKTIVGLEVGTSKVVAVVGEVLPDGVVNVLGVGSCPSKGIDKGS
ITDLAAVVNSIQRAIEAAESVADCQIMSVTLAITGEHIQSLNESGFVPIA
DGEVTQDEIDQAMHTASSVKLPEGLSLLHVIPQEYAVDKQQNIKNPLGLQ
GVRLKAQAHLIAGHQAWVNNLQKAVETCGLKVDQVVFSGLASTYSVLTED
EKDLGVCLIDFGGGSMDIMVYTNGALRYSKVVPYGGNTITDFVAQSLTTS
RNEAESIKINYGSAFMPSAELLEQFAKKKIEVAGLGGGAPRTFTKAQVVE
VTSRCYHDLLQVVENELTQLRNELAMRGIKQELIAGFVLTGGSSQMTDIA
KCATDIFESHVRVGYPLNITGLTDYVNKPQYATVLGLLQYSHHNEEESTQ
MFGGSASESSFLGSIFEKCKKIANKVKSEF
>MS0008 ftsE, FtsE protein
MIRFANVSKAYLGGKPALQGLTFHLPVGSMTYLTGHSGAGKSTLLKLIMG
MERANGGQIWFNGHDITRLSPYEIPFLRRQIGMVHQDYRLLPERSVVDNV
ALPLIITGFHPKDAEKYALAALDRVGLRDRANYLPVHLSGGEQQRVDIAR
AVVHKPQLLLADEPTGNLDDKLSMDIFNLFEEFNKLGMTVLIATHNLGLI
QQKPKPCLVLEQGHLR
>MS1832 ftsI, FtsI protein
MKNMNLLTKLFATPRHDPIRDNKAERNLFARRALVAFIGTLLLTVVLFTN
LYNLQVTEYDKYQTRSNGNRIKLLPVPPTRGLIYDRYGKLLAENLTFFGL
YIVPEKVENLDRTFDELRDVVGLTDSDIEQFKKERRRSSRYTPIMLKSDL
TEEQIARFAVNQYNYPSLDIRPYFKRHYLYGEPLTHILGYVARINDKDVE
RLKKEEKDANYAGTTDIGKLGIERFYEDQLHGTAGYEQVEINNRGKVIRK
LSEQPPVAGKSIYLTIDLELQRFITDLLAGQKGAVVVMDPRDSSILAMVS
SPSYDNNLFVGGISGSAYKRLLEDPTRPLYSRATQGAYPPASTVKPFIAV
AALTEGVITPNMTIFDPGYWILPNSTKRFRDWKKTGHGSLNLYKSITESA
DTYFYQVAYKMGIDKMSEWMTRFGFGVPTGVDIQEETSGIMPTREWKQKR
YKKPWVIGDTIPVGIGQGYWTATPLQLAKATSVLVNDGKVNTPHLMKETV
GSEKEPYKDPLLYEDISEPTKAAWNEAKRGMYGVVNAPNGTGRKAFTGAA
YRVAGKSGTAQVFSLKENQRYDASQLKRELHDHAWFTAYAPYENPHIVVS
IILENAGGGSGNAAPVVRQIMDYYLLHRLPQVAKLEGITDEQSVNSAAEQ
SKAAENNSEEAATSGDMPIEEPISLPHEGATE
>MS1673 ftsI, FtsI protein
MVKFNTSRKASAKPKKSVKKNIMPNTAVKLNKPKIIYETSFLSGRFQVAV
CLIIVCLLALVARAAYIQIINVDTLTNEADKRSLRTQEIQSVRGSILDRN
GQLLSVSVPMHSVVADPKFVLDENSLADKDRWKALADTIGVPYKDLVKRI
EKNPRSRFEYFARQVPPSVADYVKKLRITGVVLKSDSRRFYPRAEETAHL
LGFTDIDNNGIEGIEKSFNSLLIGKSGSRTYRKDKYGNIVEDISDVKKYD
AHDVTLSIDEKLQSMVYREIKKAVAENNAESGTAVLVDIRTGEVLAMVNA
PSYNPNKRNGVSEDLMRNRAVTDTFEPGSTIKPFVVLTALQRGAVRRNEI
INTGPLVLNGHEIKDVAPRNQQTLDEILENSSNRGVSRLALRMPPSALME
TYQNAGLGKATDLGLGGEQAGFLNANRKRWSDIERANVAYGYGINATPLQ
IARAYITLGSFGIYRPLSITKVDPPVIGNRVFSEKITRDVVNMMEKVAIK
NKRALVDGYRVAIKTGTAKKLENGRYVDKYMAYTAGIAPVSDPRFALIIL
INDPKAGQYYGGAVSAPVFSSIMGYTLRANNITPDGVPATDKTAARTIRL
NNKLSQKMNGETMRKQAN
>MS0963 ftsJ, FtsJ protein
MGKKRSASSSRWLNEHFKDPFVQKAHKQKLRSRAYFKLDEIQQSDRLFKP
GMTVVDLGAAPGGWSQYVVTQIGDKGRIIACDILDMDPIVGVDFLQGDFR
DENVLAALLDRVGEDQVDVVMSDMAPNFSGMPSVDIPRAMYLVELALDMC
KQVLAKKGSFVVKVFQGEGFDEYLREIRSLFTTVKVRKPEASRDRSREVY
IVATGYRG
>MS1674 ftsL, FtsL protein
MLENSERYPLQNIVVDDLFSANKLVVALLIAIVMTAVTTVWVTHKTRTLT
SEKGELVFEKQALENEYLNLKLEETTQSDNTRIEAIATVKLGMKHIDSEH
EVVILE
>MS0450 ftsN, FtsN protein
MVQRDYAARGGRRKKTTGLNKKLLIAVAAMVVLAFAAGLYFIKNSSQPVV
EQNLQVETKPQPKSQLPSRPEEVWSYIKELESRTVPTDAVQTEKVIQLSE
KQKEELKKLAEQERQAELERTKKSEQETIADKTVDEQTSSAVVSAVNDEA
ALKAEQQALEKRKKEEERKKAEAVKVAETKKADTAKSGGGSYGLQCGAFK
NRAQAESLQARLAMTGLNARVNTSADWNRVVIGPVGDRAAAAAAQKQASS
ITNCVIIGM
>MS1663 ftsQ, FtsQ protein
MSVVKRKTTQKKIKLAEPKTRVFLQVKPLLVLCCVGLLYFAYINWQTLLD
KLDSKPISSFALVGTPQYTTNADVRDMILKMGELKGFFGQDVDVIREQIE
SMPWIKGAVVRKIWPDRLSIWVAEYAPVAFWNSEDFVSLDGVVFKLPKDR
LKNDNLPRLYGPDYQSLAVLDAWKQIFNELKSKGITLKAVSIDERGSWEI
VVENDITLKLGRGEWKSKIDRFMTIYPQVEIPENKKIAYIDLRYKVGAAV
SFADIN
>MS1668 ftsW, FtsW protein
MYIFSKIKAGYQRWTTLTPTNLLYDRSLLWLFIILLFIGFVMVTSASIPV
GTRLFDDPFYFAKRDAMYVILSMGICYYFIKVPMANWESWHKRVFILALI
LLILVLIPGIGKSVNGARRWIPMVLFNFQPAEFAKLALICFLSGYFTRRY
DEVRSRKLSAAKPLIVMGFLGTFLILQPDLGSTVVLFVITFGLLFVVGAH
IMQFLVLAATGGFLFVVLVLSSAYRMKRITGFMDPFKDPYGTGFQLSNSL
MAFGRGEFTGEGLGNSIQKLEYLPEAHTDFVMAVVGEEFGFAGITVMIIL
LALLVFRAMKIGRESLQLEQRFKGFFAFGISFWIFFQGFVNLGMSLGLLP
TKGLTFPLVSYGGSSLVIMAISIAILLRIDHENRLMRGGHARLKDD
>MS1831 ftsW, FtsW protein
MTEKKSIFSNIWTRLHLDFLLLVGLLVVSGYGLIVLYSASGGSETMFRSR
IIQVVLGFAVMIVMAQFPPRFYQRIAPYLFFVGLIMLILVDLIGTTSKGA
QRWLDLGLFRFQPSEIVKLSVPLMVAVYLGNKKLPPKLSETVIALAIIVV
PTLLVAIQPDLGTSILVSASGLFVVFLAGMSWWLILAAVVGLAAFIPIMW
LYLMHDYQRTRVLTLLDPEKDPLGAGYHILQSKIAIGSGGLWGKGWMLGT
QSQLDFLPEPHTDFIFAVLSEEQGMFGITLLMLIYFFIIIRGLIIGVNAE
TAFGRILTGALTLIFFVYIFVNIGMVSGILPVVGVPLPLISYGGTSFVSL
MAGFGVIMSIHTHKRTLYHKGN
>MS0007 ftsX, FtsX protein
MSRVRARAFTLRTIIMSSKNSAPFWVQMQYVLRHVWADLVKRKYGTILTI
LVIAVSLTIPTVSFLLWKNTHIASTQFYPESDITVYLHKNLSEEDANAVV
EKIRQVEGIDSLNYVSRQQSLNDFRNWSGFSEELDILDDNPLPAVVMIQP
APEYQDSKKREDLRANLNKIKGVQEVRLDNDWMEKFTALTWLIGHISVFC
AVLMTLAVFLVIGNSIRSDVYSSQANISVMKLLGATDQFILRPYLYTGII
YGFFGGFFACFFSSLLIGYFASAVQYVTDVFAVKFSLNGLEIGEVLFLLI
ICAIVGYISAWISATRHIKMLDHKAG
>MS0140 ftsY, FtsY protein
MADEKKKSGFWSWLGLGKKEAGDSAEKQADQAPSAYEKAEQTVEETKRKI
DELANQAQGIAEQVKDQVDEIKEDLADKLEQTKQDIVHQVEQVQVEAEQK
FERTIEKFLNSEPQSDENQSEQEKIEAVSATEKEQQTAEATHSTDLDVNE
VTVETQEKPGKGGFFSRLVKGLLKTKQNIGAGFLSLFTGKKIDDELFEEL
EEQLLIADIGVPTTAKIIKNLTEHASRAQLKDTQALYQQLKVEMAEILKP
VEQPLIVDTGRKPYVILMVGVNGVGKTTTIGKLARQFQQQGKSVMLAAGD
TFRAAAVEQLQVWGERNNIPVVAQSTGSDSASVIFDAMQSAAARNADILI
ADTAGRLQNKNNLMDELKKIVRVMKKYDESAPHEIMLTLDAGTGQNAISQ
AKLFHEAVGLTGISLTKLDGTAKGGVIFAIADQFNLPIRYIGVGEKIEDL
RPFHAEEFIDALFTHEEN
>MS1661 ftsZ, FtsZ protein
MFEPVEYGFDDEIGRTLIKVVGVGGGGGNAVNHMVNNMIHNGGTLVGENS
MTSDEHGEIIFYAVNTDAQALRKSIVQQTVQIGAATTKGLGAGANPNVGR
KAAEDDQEAIRAMLEGADMVFIAAGMGGGTGTGAAPIVAQVAKELGILTV
AVVTKPFSFEGKKRMAFAELGIKELSKHVDSLIIIPNEKLLKVLGKTTTL
VQAFSAVNDILRNAVTGISDMITSPGLINVDFADVRTVMSEMGRAMMGAG
IAQGAASDGRAEKAAQDAVASPLLEDVDLSGARGVLVNITAGMDLGLDEF
YAVGDTIRAFASDEATVVVGTTLIPEMSDEIRVTIVATGIGDIDEPAATL
APVAQRPVTGAAPNPNQPGQAPQQPTTQPEQPARPTSFGNNNDLFKPAFL
RGDK
>MS0760 fumC, FumC protein
MTAFRIEKDTMGEVQVPADKYWAAQTERSRNNFKIGPAASMPHEIIEAFG
YLKKAAAFANTDLGVLPAEKRDLIGQACDEILARKLDDQFPLVIWQTGSG
TQSNMNLNEVIANRAHVINGGKLGEKSIIHPNDDVNKSQSSNDTYPTAMH
IAAYKKVVEATIPAVERLQKTLAAKAAEFKDVVKIGRTHLMDATPLTLGQ
EFSGYAAQLSFGLTAIKNTLPHLRQLALGGTAVGTGLNTPKGYDVKVAEY
IAKFTGLPFITAENKFEALATHDAIVETHGALKQVAMSLFKIANDIRLLA
SGPRSGIGEILIPENEPGSSIMPGKVNPTQCEAMTMVAAQVLGNDTTISF
AGSQGHFELNVFKPVMAANFLQSAQLIADVCISFDEHCATGIQPNTPRIQ
HLLDSSLMLVTALNTHIGYENAAKIAKTAHKNGTTLREEAINLGLVSAED
FDKWVVPADMVGSLK
>MS0859 fur, Fur protein
MSEENIKLLKKAGLKITEPRLTILALMQEHKMQHFSAEDVYKLLLEKGEE
IGLATVYRVLNQFDEAHILIRHNFEGNKSVFELCPTEHHDHIICVDCGKV
FEFNDDIIEKRQQEISREHGIQLQTHSLYLYGKCADIDKCDK
>MS1503 fusA, FusA protein
MSLNDYPQQVNKRRTFAIISHPDAGKTTITEKVLLYGNAIQTAGSVKGKG
SQAHAKSDWMEMEKQRGISITTSVMQFPYNDCLVNLLDTPGHEDFSEDTY
RTLTAVDSCLMVIDAAKGVEERTIKLMEVTRLRDTPILTFMNKLDRDIRD
PMELLDEVESVLKIHCAPITWPIGCGKLFKGVYHLYKDETYLYQTGQGST
IQEVKIVKGLNNPELDNAVGDDLAQQLRDELELVKGASNEFDHELFIGGE
LTPVFFGTALGNFGVDHFLDGLTEWAPKPQPRQADTRIVESSEEKLTGFV
FKIQANMDPKHRDRVAFMRVVSGKYEKGMKLKHVRIGKDVVISDALTFMA
GDRAHAEEAYAGDIIGLHNHGTIQIGDTFTQGEDLKFTGIPNFAPELFRR
IRLKDPLKQKQLLKGLVQLSEEGAVQVFRPMMNNDLIVGAVGVLQFDVVV
SRLKTEYNVEAIYENVNVATARWVECADSKKFEEFKRKNEQNLALDGGDN
LTYIAPTMVNLNLAQERYPDVTFFKTREH
>MS0164 fusA, FusA protein
MARITPIERYRNIGISAHIDAGKTTTSERILFYTGVSHKIGEVHDGAATM
DWMEQEQERGITITSAATTAFWSGMSQQFQQHRINVIDTPGHVDFTIEVE
RSMRVLDGAVMVYCAVGGVQPQSETVWRQANKYQVPRIAFVNKMDRTGAN
FLRVVEQLKTRLGANAVPLQLPVGAEDNFKGVVDLIKMKAINWNEEDQGM
TFTYDDIPADMLEACEEWRNNLVEAAAESSEELMEKYLGGEELTEEEIKG
ALRARVLANEIILVTCGSAFKNKGVQAMLDAVVEYLPSPVDIPAIKGINE
DETEGERHASDDEPFAALAFKIATDPFVGNLTFFRVYSGVVNSGDTVVNS
VRQKRERFGRIVQMHANKREEIKEVRAGDIAAAIGLKDVTTGDTLCDPNA
PIILERMEFPDPVISVAVEPKTKADQEKMGLALGRLAQEDPSFRVHTDEE
SGETIISGMGELHLDIIVDRMKREFKVEANIGKPQVSYRETIRTRVNDVE
GKHAKQSGGRGQYGHVVIDLYPLDPEGPGYEFVNEIKGGVIPGEYIPAVD
KGIQEQLKSGPLAGYPVVDIGVRLHFGSYHDVDSSELAFKLAASIAFKAA
FNKANPVLLEPIMKVEVETPPEYVGDVIGDLSRRRAMVNGQEANDFVVKI
NAEVPLSEMFGYATDLRSQTQGRASYSMEPLKYAEAPTSVAAAVIEARKK
>MS0457 fxsA, FxsA protein
MPIIFIITLIAFLFIYGELSLLIAIGSAIGAFGVIMLLLLSVFIGGVILK
SKGLFGLNFRRQIAQGEIPADSVVKSLLWMIAGILFIIPGFITDLLACLL
LLLPSGLFEKWISQKFTVINSGFTAQGFGRHSHRYRYYKDQNTEVFEAEY
EKEVDEKKRIK
>MS0827 gadB, GadB protein
MGRRPPYGTNMADISKHRQSLFCSDPQSIADYETAMSNAVKAVSNWLKNE
KMYTGGSIRELRKTIGSFNPSKQGVGVNQSLDHLVDIFLNPSLKVHHPHS
LAHLHCPTMVASQIAEVLINATNQSMDSWDQSPAGSIMEEQLIDWLRQKA
GYGQGTSGVFTSGGTQSNLMGILLARDWAVANHWKNEDGSEWSVQENGLP
AEALKKLKVVCSENAHFSVQKNMAMMGMGFQSVVTVPTNANAQMDVAELE
KTLATLKAEGKIVACIVATAGTTDAGAIDDLKAIRKLADAYQAWLHVDAA
WGGALLLSKDFRHLLDGIELTDSITLDFHKHFFQSISCGAFLLRDERNYR
FIDYKADYLNSEYDEEHGVPNLVSKSLQTTRRFDALKLWFTLEALGEDLY
ASMIDHGVKLTKQVEEYIRTTEGLEMLVPTQFAAVLFRVAPEGYPAEFID
ALNQNVADELFARGEANIGVTKVGNKQSLKMTTLSPIATLENVKALLALV
LAEAERIKDAIANGTYVPPID
>MS1227 galA, GalA protein
MKNNFIRLTGGNTDLIIRPEPAEILYWGKRLEIDEITEADLLSLERGVSN
GGLDVDTPVTLAAENGRGYWGSSGVDGHRDGYDWAPVFKTKSAVKNGEVL
LIEAVDEIAQLAFKTEIEADGNGVFKFRNSLTNLGEGKFTVNRLAVTLPV
PEYADEVLSFYGRWCRELQENRTCLKHGAFMQENRHGRTSHEYAPNLVLG
QPHFSQQQGEVWGFHLAWSGNHRIRADVLIDGRRFAQLENLYLPGEIVLE
QGESLSTPWVYTAYSDCGLNGMSQQFHRHIRSRILHFAQPIRPVHLNIWE
GVMFDHDPAHIIAMAEKAAEMGVERFIIDDGWFIGRNDDFGGLGDWFLDE
KKYPNGLKPVIDAVKKLGMQFGIWVELEMISKRSKLYQQHPDWMLRLDGY
DQPEERHQYVLDLVNPDAFNYILERMDWLLGENQVDYIKWDHNRRLVQPG
HLGKAAVTAQTQAAYRLFDILQKRYPHVEIESCSSGGARIDFEILKRSQR
FWTSDNNDALTRQKIQRGMSYFYPPEVIGAHIGGAPCQTTMRNFSFDFRG
LTALFGHMGVELDPVKESAEERQGFAKYIALHKQLRPLLHSGESFRLDHH
DDTTLINGVVAQDKKQAVVLISQLDMPDYKQMGKLRVPYLEANATYQVKL
LSIPDYIKQGKGGHLMKVFPQWVLDNFAGKPVSIRGEWLAKAGLTIPVLD
PQSAMLVEFKRL
>MS0798 galE, GalE protein
MTILVTGGAGYIGSHTIVELLNAGEDVVVLDNLCNSSPKSLERVKQITGK
SVKFYEGDVLDRTLLQRIFAENQIKSVIHFAGLKAVGESVQKPAEYYMNN
VTGSLVLVQEMKKAGVWNLVFSSTATVYGEPETIPVTENCKVGGTNSPYA
TSKLMVEQILTDVVKAEPRFSMIILRYFNPVGAHESGLIGEDPNGIPNNL
MPYISQVAIGKLPELSIFGNDYDTHDGTGVRDYIHVVDLAIGHLKALTRH
EDDAGLHIYNLGTGIGYSVLDMVKAFEKANNMTLPHKFVARRPGDIAAYY
SDPSLAAKELSWTAQRGLEQMMKDTWNWQKNNPKGYRD
>MS0648 galK, GalK protein
MQPKDLAKKLFSEKFNRTSELNVYAPGRVNIIGEHTDYNDGFVMPCAINY
GTAVSGAKRDDHTFCVYAADLDQFDRFRLDRPIEQNPSEKWTGYVRGVVK
FIQERCPEFTQGADLVISGNVPLSSGLSSSASLEVAVGKFCQQLGELPLS
NTDIALIGQKAENKFVGANCGNMDQLISALGQQDHLLMIDCRSLETKATP
VPHNIAVMIVNSHVKHDLVTGEYNTRRQQCEAAAKFFGVKALRDVSIQQF
KEKEAELTALDGEAAKRARHVVTENQRVLDAVDALNQGDISRLGELMGQS
HDSMRDDFEITTPEIDYLVELAQQVIGKSGGARMTGGGFGGCIVAVAPVE
KVEEVRKIIADNYQKRTGIKEDFYVCTASQGVHLC
>MS0649 galM, GalM protein
MHRLSRSAFMLKTLQKTTALAPDNQPFQIVTLSNKNGMKVQFMDWGATWI
SCQVPVNGELREVLLGCRAEDYPRQSAFLGATVGRYANRIANARFELNGQ
TFPLTANQGVHQLHGGDGFDKRRWKIEKCGENFVTFCLNSVDGDQGFPGN
VEVVLDYELSEDNALTVRFHATPDKDTPLNLTNHAYFNLNNAIRGCDVRG
HSLQLNADYFLPVDTDGIPNAPLKSVEGTSFDFLEEKPIGLDFLQEEQQL
TKGYDHAFLLNNNAEKTCAILTALDRSLSLQVFTSQPALQVYTGNYLAGV
PTRLGGSYADYAGIALETQALPDTPNHPEWQQYGGITKAGETYRHWTTFR
FI
>MS0647 galT, GalT protein
MICFSPDHSKTLPLLTVEEITEVVKVWREQLRELGQKYQWVQIFENKGAA
MGCSNPHPHGQIWANSFLPNEVARADLNQKKYFEKQGSVLLLDYAKRELE
RKERIVVETEHWLAVVPYWAVWPFETLLMPKKAHIKRLTDLTEEQSRDLA
LALKKLTTKYDNLFEISFPYSMGFHAAPFNGEENEHWQLHAHFYPPLLRS
ATVRKFMVGYEMLGESQRDLTAEQAAARLRDLSEVHYKMRK
>MS0646 galT, GalT protein
MGLSFPAPGKTPLARSAGKKLPRKKNQAMTRIVIFVRVMRVLRASLIPII
KNLMYSEMIFRLYCLKRRRRNKQRIRYFKAAGLRVKAV
>MS0645 galT, GalT protein
MSISFDPTEHPHRRYNPLTDQWVLVSPHRAKRPWQGQQEKSCRGRKTKP
>MS0345 galU, GalU protein
MKGKTMKVIIPVAGLGTRMLPATKAIPKEMLTLVDKPLIQYVVNECIAAG
VKEIVLVTHSSKNAIENHFDTSFELETMLEKRVKRQLLEEVRSICPKDVT
IMHVRQGNAKGLGHAVLCGRPAVGNEPFAVVLPDVLLAEFTANQKTENLS
AMIKRFNETGSSQIMVAPVDPKDVSSYGIADCNGAEFSGGESAVISRMVE
KPSPEKAPSNLAVVGRYVFSATIWDLLERTPVGVGDEIQLTDAIDMLIAK
ETVEAFHMTGESFDCGDKIGYMKAFVEYGIQHEKLGNEFKNYLKAFAKTL
>MS1739 gapA, GapA protein
MIILFNLNIIGENLMAIKIGINGFGRIGRIVFRAAQTRDDIEVVGINDLI
DVEYMAYMLKYDSTHGRFDGSVEVKDGNLVVNGKTIRVTAERDPANLNWG
AIGVDIAVEATGLFLTDETARKHITAGAKKVVMTGPSKDATPMFVRGVNF
SAYAGQDIVSNASCTTNCLAPLARVVHETFGIKDGLMTTVHATTATQKTV
DGPSAKDWRGGRGASQNIIPSSTGAAKAVGKVLPALNGKLTGMAFRVPTP
NVSVVDLTVNLEKPASYETIKQAIKDAAEGKTFNGELKGVLGYTEDAVVS
TDFNGCALTSVFDADAGIALTDSFVKLVSWYDNETGYSNKVLDLVAHIYN
YKG
>MS1919 gcpE, GcpE protein
MSAFKPTIKRRESTKIYVGNVPVGGDAPIAVQSMTNTRTTDVEATVAQIK
ALERVGADIIRVSVPTMEAAEAFKLIKRQSSVPLVADIHFDYRIALKVAE
YGVDCLRINPGNIGREDRIRAVVDCAKDKNIPIRIGINAGSLEKDIQEKY
GEPTPEALLESALRHVEILDRLNFDQFKVSVKASDVFLAVEAYRLLAKAI
KQPLHLGITEAGGARAGAVKSAVGLGMLLAEGIGDTLRVSLAADPVEEIK
VGFDILKSLRIRSRGINFIACPTCSRQEFDVIGTVNALEQRLEDIITPMD
VSIIGCVVNGPGEALVSDLGVTGGNKKSGFYLNGERQKERFDNEYIVDQL
EAKIRAKIAAQDPKNRIL
>MS1379 gcvR, GcvR protein
MANSVITVIGKDRVGIVYDVSKILAENQINIVNITQQLMDDYFTMIILVD
TSKCSKSFPELAEFFTQESKNLALDIRLQNEEIFKAMHRI
>MS0196 gdhA, GdhA protein
MQTLTILLIRGKLMSSTVSSLEDFLSLVAQRDGNQPEFLQAVREVFTSIW
PFLEANPQYRSQALLERLVEPERAFQFRVAWTDDKGQVQVNRAFRVQFSS
AIGPYKGGMRFHPSVNLSILKFLGFEQIFKNALTTLPMGGGKGGSDFDPK
GKSDAEVMRFCQALVAELYRHIGPDTDVPAGDIGVGGREVGYLAGYMKKL
SNQAACVFTGRGLSFGGSLIRPEATGYGLVYFAQAMLAEKGDSFQGKTVS
VSGSGNVAQYAIEKALQLGAKVVTCSDSAGYVYDEAGFTTEKLAALLDIK
NVKRGRVKDYAEQFGLQYFPGERPWGVKVDIALPCATQNELELTDAQKLI
ANGVQLVAEGANMPTTIEATEALQAAGVLFAPGKAANAGGVATSGLEMAQ
SSQRLFWSAEEVDQKLHNIMLDIHANCKKYGTDANGNINYVAGANIAGFV
KVADAMLAQGVY
>MS2354 gidA, GidA protein
MFYSENYDVIVIGGGHAGTEAALAPARMGLKTLLLTHNIDTLGQMSCNPA
IGGIGKGHLVKEIDAMGGLMATAADQAGIQFRTLNSSKGPAVRATRAQAD
RVLYRQAVKVALENQPNLDIFQQEATDILIEQDRVTGVATRMGLKFKTKS
VILTAGTFLGGKIHIGLDNYTGGRAGDPASIALADRLRDLNLRVARLKTG
TPPRLDARTINFDILAKQHGDAQLPVFSFMGSVDQHPRQIPCFITHTNEQ
THEVIRNNLDRSPMYTGVIEGIGPRYCPSIEDKVMRFADRNSHQIYLEPE
GLTSQEIYPNGISTSLPFDVQMKIVNSMVGLEKTRIVKPGYAIEYDFFDP
RDLKPTLETKAIKGLFFAGQINGTTGYEEAASQGLLAGINAGLFVQEKES
WFPRRDQAYMGVLVDDLCTLGTKEPYRVFTSRAEYRLLLREDNADSRLTP
IAHELGLIDENRWARFNQKMENIERERQRLRNIWIHPRSEHLDVINEVLS
SPLVREASGEDLLRRPEINYQILTALDLFKPAMDDKEAVEQVEIAVKYQG
YIEHQQEEIEKQKRHENTAIPDNFDYTLVAGLSNEVRAKLEQHRPVSIGQ
ASRISGVTPAAISILLVNLKKQGMLKRGE
>MS2353 gidB, GidB protein
MVNKLEQELTQKLEILLKQTALSISDQQKNKLVQLVLLLNKWNKAYNLTS
VRDPMEMLIKHILDSVVVSPYLQGDLFIDVGTGPGLPGLPLAIINPDKNF
VLLDSLGKRISFIRNAVRELELSNVVPVLSRVEEYIPDHKFDGILSRAFA
ILKDMTDWCHHLPNEKGLFYALKGVYQQEEVMDMSNNFQVIDVIKLHVPE
LIGERHLVKVKKM
>MS1221 glcD, GlcD protein
MLPRLKEVPQLTPLVSDFLDDLKAQYFEGDIASNYADRLSLATDNSVYQL
LPQAILFPKSVSDVVRITKLAQQHKYLSLTFTPRGGGTGTNGQAINNNII
VDLSRHMTGILELNVEERWVRVQAGVVKDQLNQFLKPHGLFFAPELSTSN
RATLGGMINTDASGQGSLQYGKTSDHVLGLRAVLVNGDIIDTSAVKTERF
LDNLAAKKVTFTSKRLHEEVFHRCKEKREQIVRDLPQLNRFLTGYDLKNV
LNEDESEFNLTRLLTGSEGTLAFICEATLDLTPIPQIRTLINVKYSSFDA
ALRSAPFMVQANALSVETVDSKVLNLAKEDIIWHSVKELLTEEANSPILG
LNIVEFAGNNKNLIERQVAALCAQLDEKIANRESNIIGYQVCSDLPSIER
IYAMRKKAVGLLGNAKGAAKPIPFVEDTCVPPENLADYITEFRALLDGYN
LQYGMFGHVDAGVLHVRPALDLCDKEQVQLFKHISDSVAELTRKYGGLIW
GEHGKGIRSYYGEKFFTPELWQELRYIKFLFDPHNRLNPGKICSALNSEQ
QLYPILSPMRADNDRQIPIKMREEYAGAMNCNGNGLCFNFDVHSTMCPSM
KVTGNRLFSPKGRAGMVREWLRLMANENVTPEQLNFHHSQVKLSELVEKV
KNSVKKWRGEYDFSHEVKAAMDTCLACKACASQCPIKIDVASFRSKFFYF
YHQRYVRPSKDYIVANLETVAPYMAKQPKLFNFVMKSKFMKIAAEKALGM
TDIPLLSEPNLRHQLVEIGYQGKTLEQLERLSPTEKSNMLLIVQDPYTSY
YDATVVADFVELCRKLGFKPVLLPFKPNGKAQHIKGFLGQFARTAKNQAD
FLNRMTKLGLPLVGVDPAIVLSYRDEYNEILQQNRGDFHVITAHEWLKNQ
LDSNLLKTAVKNLQKNHRTLNTHEWYLFPHCTEQTFMPNSPQEWQQIFTA
FGQHLEVEKVGCCGMAGVFGHDMKNQEMSKAIYAGSWATKLTGKNIEYCL
ATGYSCRSQVARLENEELKHPVQALLSLFH
>MS0661 glf, Glf protein
MKKYDYLIVGAGLFGSIFAYEATKRGKKCLVIEKRDHIGGNCYTQNVEGI
NVHKYGAHIFHTSNKVVWDYIQQFAEFNRFTNSPIARYKDELYSLPFNML
TFNKMWGVITPQEAEAKIKEQIAQESITEPKNLEEQAISLVGRDIYEKLI
KGYTEKQWGRKCTELPAFIIKRLPVRYTYDNNYFYDTYQGIPIGGYTGIF
ERMLDGIEVKLGVDFFTEREYYENLADKIVFTGMIDEYFGYQFGKLEYRS
LRFDNEVLDMPNYQGNAVVNYTEAEVPYTRIIEHKHFEYGTQPKTVITRE
HSKEYEEGDEPYYPINDARNNELYAKYKELADEKSNVIFGGRLAQYKYFD
MHNIIAEALECVNAHFR
>MS1120 glgA, GlgA protein
MKVLHVCSELYPLLKTGGLADVLGALPAAQKEIGLDARILIPAYPAISAG
IPDTGVVAEFHNSAAGHVVLRYGEFNGVGVYLIDAPNLYAREGNPYHDQW
YNDYADNYKRFALLGWVGAELATGLDPWWMAEVVHAHDWHAGLTSAYLAY
KGRPAKSVFTIHNLAYQGLFAYHHLFEIGLPTSMFNVNGLEFYGQISYLK
AGLYYSDAVTAVSPTYAREITTPEFAYGFEGLLSTLHSQGKLVGILNGVD
DNIWNPNTDGYIQDHYKLKSMTGKKKNKAALQAHFNLPEKPDALLFVMIT
RLTEQKGVDLLIQSAENIIKQGGQLALLGSGAPSLESALLGLAHKHPKNI
AVKIGYDEPLSHLMVAGGDVILVPSRFEPCGLTQLYGLKYGTLPLVRQTG
GLADTVVDSTAENIKERRATGFVFNEANSQALSHAISRAFSLWKKQRTWF
TVRTVAMEQDFSWQISARRYEELYRRI
>MS1123 glgB, GlgB protein
MKKLVAQSVIDAFFDGTHSDPFAVLGMHETHNGIEIRVLLPEAHRVIVID
KETHKAVVELELVDERGFFNAIVPKANQFFAYELQVYWGKESQILEDPYR
FHPMINELDNWLLAEGSHLRPYEVLGAHFVEYDNVAGVNFRVWAPNAKRV
SVVGDFNYWDGRRHPMRFHPASGIWELFLPKVALGQLYKFELIDSNNQLR
LKADPYAFAAQLRPDTASQVSALPEIVEMTEKRRAANQSDKPISIYEVHL
GSWRRNLENNFWLDYDEIADELIPYVKEMGFTHIELLPISEYPFDGSWGY
QPLGLYAPTSRFGTPDGFKRLIEKAHESGINVILDWVPGHFPSDTHGLAA
FDGTSLYEYADPKEGYHQDWNTLIYNYGRHEVKNYLSGNALYWVERFGLD
GLRVDAVASMIYRDYSRRDGEWVPNQYGGRENLEAIEFLKHTNYVLGTEL
PGVAAIAEESTSFPGVTLPPEHGGLGFHYKWNMGWMNDTLEYMKLDPVYR
QYHHGKMTFAMLYQYSENFVLPLSHDEVVHGKGSLITKMSGDTWQKFANL
RAYYGYMWAFPGKKLLFMGNEFAQGREWNYQESLDWFLLDDGQGGGWHSG
VQRLVKDLNKTYQNQTALFELDTNPQGFEWLVVDDNQNSVFAFERRSKSG
EVIIVVSNFTPVPRDNYRIGVNEPGKYEEILNTDSAYYKGSNLGNYGEVI
AEEIENHGKAQSISVMVPPLATVYLRLKK
>MS1121 glgC, GlgC protein
MNNAVLNQPNKYDLVKDTLVLILAGGRGSRLHELTDKRAKPALYFGGNRR
IIDFALSNCINSGLNRIGVITQYAAHSLLRHLQTGWSFLPQERGEFVDML
PARQQIDDNTWYRGTADSVYQNLAIIRGHYKPKYVLILAGDHIYKMDYSQ
MLLDHVSSGAKCTVGCIEVPREEAKEFGVMAVNETLKVKAFVEKPQDPPA
MIGKPNSSLASMGIYVFNADYLYEALDRIKTPNTSHDFGKDVMPLALNDG
VLYAHPFDRSCKGRNTEGAIYWKDVGTLDSFWQANIDLVSEEPQLDIYDQ
TWPIRGNPVQAYPSKFFYDEPNCKQVDNSLIAGGCMVKNASISYSVLFDN
VSVNAGSSIEQSVILPQVKIGKNCMLRRCIIDRHVQIPDGMQIGVDLELD
SKRFRISKNGIVLVTESMLHKLNGKSVASEAHLD
>MS1119 glgP, GlgP protein
MLDKDFIYESPKLTVEALKQAIVSKLVFDIGRSAQEATTRDWLNATVYAV
RDFVAEGWIQTVNQFREEKTRRVYYLSMEFLMGRVLSNAMLSEGVYDTAK
QALSELGLVLEDILEKEADPGLGNGGLGRLAACFMDSIATCNLPGMGYGI
RYEYGMFKQTIEDGSQVEKPDAWIAKGAPWEFTRASKRYRVRFGGNLHFE
GEKCIWTPSEEITALAYDNIVPGYETKSAATLRLWTANAGDIFNLANFNK
GDYFGAIEERSSIENVSRVLYPDDSTWAGRELRLRQEYFLVSASLQDIIK
RHKKFHGGKIANLADKVAIHLNDTHPALAIPELMHILVDQEGISWKKAWD
MTRRIFSYTCHTLMSEALETWPIELMAKVLPRHLQMIYEINAEFLEYVRT
YVSADVDFIRRVSLIEEGNQRKVRMGWLSVVGSHKVNGVAEIHSDLMVSS
TFADFAKIYPERFTNVTNGVTPRRWIGVANPKLAALFDKYIGTEWRKDLS
QLSLLKPYIGKPEIIGELAKIKFANKKRLARYVKNTLDIEINPNAIFDVQ
VKRIHEYKRQILNVLQIISRYNQMIANPEKNWQSRVFILAGKAASAYYTA
KQTIRLINDIAEVINNDERLKGRLKVVFIPNYSVSIAEIIIPAADISEQI
SLAGTEASGTSNMKFALNGALTLGTLDGANVEILENVGEDNIFIFGNTVE
QVEELRRNGYSPVTFYQQDEELRQAVDQIALGHFSPKEPTRYQGLIDSLR
NYDYYQSFADFRSYADMQAKVDEKYQDQAAWFNSTLENIANMGFFSSDRT
ILEYAERIWKIKPLKLEN
>MS2073 glgP, GlgP protein
MTFQSIVEKYCRYFDVADPKNLTLQQWYQIVAEGSLELACSQPFAKPAES
RHVNYLSMEFLIGRLTGNNLMNLGYYEQIRDYLKQYQVELVDVLEQERDP
ALGNGGLGRLAACFLDSMAALGQNATGYGLHYQYGLFKQSFAEGMQKETP
DTWDRNNYPWHSFNPSKTRYVGFGGKIKHIQGDNYEWSPKLTIQGKAFDL
PVVGYRNNLIQPLRLWQADSDQSFDFDAFNEGKFLKADKTIVNAAALTQV
LYPNDNHKAGQKLRLMQQYFHCACSVADILERHFAEGYQLADFAKRQVIQ
LNDTHPTLAIPELMRLLLDDYHLTWDQAWDICTNTFAYTNHTLLPEALEQ
WDQRLFKQLLPRHYQIVEKINDIFHQKVRSEFGENSQVWEKLAILFDYRV
RMANLCVVTCFRVNGVAQIHSDLLVTDLFPEYHKLFPGKFCNVTNGITPR
RWIRQANPKLSDLLDRTLKQDWAKDLELLSGVEKYVDDAGFREEYQAIKR
HNKIVLADEINRTLALKVNPDAIFDVQIKRFHEYKRQHLNLLNIIADYQS
LKANPNQDYTPRVFVFAGKAAPGYYLAKNIIHAINNVAEIINNDKQVNDR
LQVAFLPDYRVSLAEKIIPAADVSEQISMAGKEASGTGNMKLALNGALTL
GTLDGANVEIAEMVGEENVFIFGHTVESVRELLAKGYHPKDYYKKDSVLK
NAVDFLAHGKASNGDKETFRLMLDSLLERDPFLVFADFDSYRLAQQKIGS
AYLNREAWLRSAILNTARLGTFSSDRSIRDYQQHIWLKK
>MS1122 glgX, GlgX protein
MLKNQTGKPYPLGATLVEVNGTKGVNFSIFSASARAIELCLFDNSGREVR
FPILDKTDDIFHIWVPNVPLGTKYGFRIHGDERHNPKKLMLDPYAKMVVG
KPDLTSKENQAWYLLSDERDNSKIAPKSIIIDGEFDWEQDKPLNIPWTET
IIYELHVKGFSKLRADLPEEIRGTYSALAHPSVIAYLKELGITAVELLPI
NFSISESHLQERGISNYWGYNPMAMFAVEPQYAATEDPVHEFKTMVKTLH
QAGIEVILDMVFNHSAESERDFPTFSYRGIDEQTYYWSDAQGNYLNWSGC
GNLLHLAHPYMRRWAIDCLRYWVEEYHIDGFRFDLATNLGRETPAYKAHS
ELFKAMRLISGFKNTKFIAEPWDMGEDGYQMGNFPPFFAEWNDRFRDDIN
RFWLWQSGELGAFAERFAGSADIFKQEGKYPHNSVNFITAHDGFTLRDLV
SYNHKHNNANGEDNRDGRNENYSHNHGIEGSTDGLDEPQKTAVENARILS
SQSLLCSLLLSNGTPMLLAGDEFGNTQFGNNNGYCQDSGLTWLKWSNFNL
DLFEIVKKLIIVRKGIQSLVNDKWWTEGNVRWFNEFGSLMNVSDWQERGA
KALQVLLDEQWLCVVNAKTELQVFSLPEGDWNMEISMTGCKNQNNQLIVD
NLSFCLLRRIL
>MS0189 glmS, GlmS protein
MHNGIIENYEELRTLLQERGYVFQSQTDTEVIAHLVEWEFRTAGSLLEAV
QKTVKQLRGAYGTVVLNEEEPEHLIVARSGSPLVIGYGVGENFLASDPLA
LLSVTRRFTYLEEGDVAEITRKSVQIYTRDGQKVEREIHEGNFEADAADK
GPYRHYMQKEIFEQPVAIMNTLDGRIKEGKVNIEAIAPNAAEILSKVEHV
QIVACGTSYNAGMVARYWFEAIAGVSCDVEIASEFRYRKFVTRPNSLLIT
LSQSGETADTLAALRLAKESGYMSAMTICNVASSSLVRESDFAFLTRAGV
EIGVASTKAFTTQLTCMLLLNAAIGRLKGNLSEEQEHHIIQSLQRLPAQI
ESALVFDKQIETLSEDFAEKHHTLFLGRGEYYPIAMESALKLKEISYIHA
EAYAAGELKHGPLALIDSEMPVVVVAPENDLLEKVKSNIEEVRARGGQLY
VFADSDAGFEDSDNFKTIVLPKVDEVTAPIFYTVPLQLLSYHIALIKGTD
VDQPRNLAKAVTVE
>MS0188 glmS, GlmS protein
MCGIVGAVAQRDVAEILVDGLHRLEYRGYDSAGVAVLNNAHEMQIVRRVG
KVKALDDAIAKNALLGGNRYCAHPLGNSRRTDRS
>MS1949 glmU, GlmU protein
MIIMKKLSVVILAAGKGTRMYSDLPKVLHKIAGKPMVKHVIDTAKQLSAD
QIHLIYGHGADLLKSHLADEPVNWVFQAEQLGTGHAMQQAAPFFADDENI
LMLYGDSPLISKETLEKLIAAKPENGIALLTVNLDNPTGYGRIIREKGSV
VAIVEQKDADAEQLKITEVNTGVMVSDGASFKKWLGRLNNNNAQGEYYMT
DVIGLANQDGFQVAAVSATDKMEVEGANNRLQLAALERYYQHKQAERLLL
EGVMLIDPARFDLRGTLEHGKDCEIDVNVIIEGSVKLGDRVKIGAGCVIK
NCEIGDDVEIKPYSVFEDSTIGARASIGPFSRLRPGAELAEETHIGNFVE
IKKATVGKGSKVNHLTYVGDAQVGTDCNLGAGVITCNYDGANKFKTVIGD
NVFVGSDVQLVAPVNVANGATIGAGTTVTKDIGENELVISRVPQRHIAGW
QRPTKKK
>MS0262 glnA, GlnA protein
MANPNAIQRVAKLIEDNDVKFVLLRFTDIKGKEHGVSLPVNLVADELEDF
FEEGKMFDGSSVEGWKAINKADMLLMPMPETAVIDPFAQITTLSIRCSVY
EPNTMQSYDRDPRSIATRAENYLKSTGIADQALFGPEPEFFLFDDVRFST
EMNNVSYKIDDIEAAWNTNRKFEDGNNAYRPLKKGGYCAVAPIDNAHDIR
SEMCLILEEMGLVIEAHHHEVATAGQNEIASKFNTLTLKADETQIYKYVV
QNVALEYGKTACFMAKPFAGDNGSGMHCNMSLSKDGKNVFQGDKYAGLSE
TALYYIGGIIKHAKALNAFTNPTTNSYKRLVPGFEAPVLLAYSASNRSAS
IRIPAVTSPKAIRVEARFPDPLANPYLAFAALLMAGIDGIINKIHPGDAM
DKNLYDLPPEELKEIPAVCSSLEEALDSLQADHEFLIQGGVFSKEFIDAF
VAIKRKEVERVNMTPHPVEFEMYYA
>MS1305 glnD, GlnD protein
MIGHNNFIIQSVILRFFMFQSVEGLLTPGLIKQQKEQLKQTELENFAQAD
VNSLISHRTLFCDNFLIRLWRQFSLHEVTDLALIAVGGYGRREIFPLSDL
DFLILTEQPMPADLAKKVEEFIQFVWDCGFDVGASVRTLEDCDSQGRADI
TIATNLLESRLLTGNETLFDKLSSIVGREDFWPRKTFFEAKIQEKKQRYQ
RYNNTSYNLEPDIKYNPGGLRDLHLIYWIALRHSNALSLEEILQSGFIYP
EEYAELERNQQFLFKVRFALHLILKRYDNRLLFDRQVKVSELLGYQGEGN
QGVETMMKAFFQSLQAISLASDILAKHYKEHFVDENGEEECQVLDDNFQM
INNAIFLVREDCFVQQPDTILDLFSYLIIRPQAELHSSTLRLLHLALGQL
NGYLSELPAAREKFLRLLTQPRGIERALIPMHKYGVLTAYIPEWKGIEGL
MQFDLFHIYTVDEHTMRVLAKLETFLSEETAEAHPLCVKLFPSLPDRALI
YIAALFHDIAKGRGGNHADLGAVDVGRFAAQHGFDCREIETMKWLVKQHL
FMSVTAQRRDIHDPEVVMNFAAEVQNQVRLNYLVCLTVADICATNTTLWN
SWKRSLFASLYQYTNQQFNQGMDNLLDNQEQEEQNKALALEILQSQGFTE
DVQSLWKRCPGDYFLRNTPKELAWHAVLLAGVETELLVKISNRFSAGGTE
VFIYCKDRPNLFLKVVAAIGNKKLSIHDAQIITSLDGYAFDSFIVTELDG
SLLKFDRRRVLEKAIINSLNSNELTKLQGSENHKLQHFNVKTEVRFLNTE
KTTHTEMELFTLDKAGLLADVSLVFSELNLSIQNAKITTIGEKAQDFFIL
TNAKGEALSERERQSLSEKLQARLD
>MS1278 glnE, GlnE protein
MTMPLPSIEQTLIQLADNLITHFPEQFNSQIYQQIQKDISNIKTPVGALM
RAVSMSDFVTEILQKQPHFLAECWHKTPQLADCDSYAARLSVQLADIREE
TGLYKTLRDFRNQEMAKLSICQSLNSATVEEIFIRLSQLAEALIIGARDW
LYQRACLDWGTPTDNQGNVQQLYILGMGKLGGFELNFSSDIDLIFTYPAN
GETVGSRKPIDNQKFFTRLGQRLISALDEFTEDGFVYRTDMRLRPFGDSG
ALALSFNAMESYYQEQGRDWERYAMIKGRILGADEQDPNVKTLRQLLRPF
IYRRYIDFSVIQSLRDMKSKIEREVLRRGLVDNIKLGAGGIREIEFIVQV
FQLIRGGREISLQQHELLKLLPEIEKLNLITADQHQDLLQAYLFLRRVEN
VLQAINDKQTQLLPADELNRCRLISATCEFTQWDNNHRPQKIQYPIHDWE
SFYQVLQQHQQKVRSVFNNLIGFNNENEADDSDNAWSDFLDADLEQGEIA
DILAQQGVSEEERDEIIGRLEAFRHSVSHRSIGIRGREVLTQLMPLLLLQ
IFSNKKYRTLLPRMLNIVEKILTRTTYLELLLENPQALTQLIELCAKSQL
IAEQVAQHPILLDELLDREALLNPPSFEQYPAELQQYLLRLPEDDDEQFI
TALRQFKQATLLRIAAADILGALPVMKVSDHLTFLAETILHTVVNLAWQQ
ITARFGKPEHLQNNEKGFLVAGYGKLGGIELGYRSDLDLVFLCDEIHSGQ
TVGGKKVIDSHQFYLRLAQKIISIFSMTTSAGILYEVDLRLRPSGEAGPL
CCSFKAFEDYQMNEAWTWEKQSLVRSRAVYGEPALREKFELIRTGILASP
RDLTQLKIDVREMREKMYRHFAGADDNKFNIKKDQGGITDIEFIAQYLVL
AHAPENPNLAYWSDNVRIFDIMAEHGIITLNEAEKLKNCYTGLRNQIHHL
NLLGEPPIVSKEEFADERRFIHQIWQKLFFE
>MS0426 glnK, GlnK protein
MKKIEAIIKPFKLDDVRESLSDIGITGMTVTEVRGFGRQKGHTELYRGAE
YMVDFLPKVKLEIIIPDELLDQCIEAIMETAQTGKIGDGKIFVYNVERVI
RIRTGEENEDAL
>MS1685 glnQ, GlnQ protein
MALLEIKELVKNYGEVTALNGVNLSVEKGEVVVILGPSGCGKSTFLRCIN
GLEEIKSGSLKLADVGELGKDISWVKARQHIGMVFQSYELFAHMTVIDNI
LLGPLKVQKRARAEVEKQADALLKRVGLYERKNAYPRELSGGQKQRIAIV
RSLCMNPDIMLFDEVTAALDPEMVREVLDVVLGLAKDGMTMIIVTHEMQF
ARQVADRIVFMDNGNIIEESEPEQFFTSPKTERAKTFLNILDYYI
>MS1275 glnQ, GlnQ protein
MIKVKNIHKAFGENVILRGIDLDITKGEVVVILGPSGSGKTTFLRCLNAL
EMPEQGTIEFDNAAPLKIDFAAKPSKKDILALRRKAGMVFQNYNLFPHKT
ALENVMEGPVRVQSKKVAQAREEALALLTKVGLADKADLYPFQLSGGQQQ
RVGIARALALQPELMLFDEPTSALDPELVQDVLDTMKSLAKEGWTMVVVT
HEIKFALDVADLVIVMDDGVIVEQGSPKQLFDNPQHERTKAFLQRLRSH
>MS0219 glnQ, GlnQ protein
MTISVKNLNFFYGSSQALFDINLTAEDGDTVVLLGPSGAGKSTLIRTFNL
LEVPKSGDLTVADNHFDLSQNTDAKKMRQLRQDVGMVFQQYNLWPHFTVM
ENLIEAPMKILGLTESEAQKEAMELLTRLRLEEHAHRFPLQLSGGQQQRV
AIARALMMKPKVLLFDEPTAALDPEITAQIVSIIQELQETGITQVIVTHE
VGVARKVATKVVYMEKGRIVETGDASCFEAPQTEQFRQYLSHD
>MS0490 glnS, GlnS protein
MISKFKVIEMELKALFNLDPNVKVRTRFAPSPTGYLHVGGARTALYSWLY
AKHNDGEFVLRIEDTDLERSTPEATAAILDAMEWLNLTWEHGPYFQTERF
DRYNEVIDQMIEQGLAYRCYCSKERLEELRHQQEANKEKPRYDRHCLHDH
EHSPYEPHVVRFKNPQEGSVVFEDAVRGRIEISNHELDDLIIRRSDGSPT
YNFCVVVDDWDMGITHVVRGEDHINNTPRQINILKALGAPIPVYAHVSMI
NGDDGQKLSKRHGAVSVMQYRDEGYLPEALLNYLVRLGWGHGDQEIFTLE
EMIKLFELEHVSKSASAFNTEKLLWLNQHYIRELPAEYVAQHLAWQYQEQ
GIDTSKGPALTEIVSMLGERCKTLKEMAASSRYFFEEFDGFDEAAAKKHL
KAAAVEPLEKVKEKLTALSGWDAHSAHEAIEQTAAELEVGMGKVGMPLRV
AVTGAGQSPSMDVTLAGIGRERVLARIQKAIDFIKAKNA
>MS1127 glnS, GlnS protein
MNNNEILIEETRPTNFIRQIIDEDLASGKHNNVYTRFPPEPNGYLHIGHA
KSICLNFGIAQDYQGKCNLRFDDTNPVKEDVEYVDSIKQDVEWLGFKWEG
EPHYASDYFDQLYGYAIELIEKGMAYVDELSPEQMREYRGTLTEPGKNSP
YRDRSIEENLNLFEKMKNGEFAEGAACLRAKIDMASPFMVMRDPVLYRVK
FASHHQTGDKWCIYPMYDFTHCISDAIERITHSLCTLEFQDNRRLYDWVL
EHISIERPLPHQYEFSRLNLEGTLTSKRKLLKLVAEGAVDGWNDPRMPTI
SGLRRRGYTPAALREFCRRIGVTKQDNVVEFSALESCIRDDLNRNAPRAM
AVLNPLRIVIENFTEKEVLTAPNHPNYPELGTHEMSFTKEIYIDQADFRE
EANKQYKRLVLGKEVRLRHAYVIKAERVEKDEQGGITTVYCSYDPETLGK
NPADGRKVKGVIHWVSATENLPAEFRVYGRLFNVPNPGAEEDILAAMNPE
SLVVKHGVVEMSLANAEPEKAYQFEREGYYCADNKDSKAGNLVFNLTVSL
KEGF
>MS0597 gloA, GloA protein
MKLEHVAIYVQDLEKAKAFFMKYFNAQPNEKYHNPRTNLMTYFLTFSGGA
RLEIMTRPEIIELDKNIFRTGLIHLSMQVGGEEKVRELTERLRTDGYQVI
SEPRKTGDGYYESCVLDGEGNQIEIVA
>MS0610 gloA, GloA protein
MLNDVIRTLPAWLNDWGKKEKKTTLFDFRQQI
>MS0611 gloA, GloA protein
MISLFTGFHHIAIIVSDYEKSKYFYTQILGAEVIEETYRASRHSYKLDLK
FADGSQIELFSFPSSPSRLTMPEACGLRHLAFKVKDIEEAVQYLKTQQIE
CEDIRIDELTGKKFTFFKDPDNLPLELYEFNSFKGG
>MS0703 gloA, GloA protein
MMRILHTMLRVGDLDRSVKFYQDVLGMRLLRTSENPEYKYSLAFLGYDDE
DKTAVIELTYNWGVTEYELGSAFGHIAIGVDDIHATCEAVKAHGGKVTRE
PGPVKGGSTVIAFVEDPDGYKIEFIENKNAKAALGN
>MS0946 gloB, GloB protein
MLVPIPALNDNYIWLYGRENLPVIAIDVAECKNLSAYLTQHHLQLEAVLL
THYHDDHTGGVEELKRYYPDIPVYGPAETADKGATHIVNEGNIQTAHYRI
EVVPSGGHTANHVSYLIDNHLFCGDTLFSAGCGRVFTGDYGQMFESITRL
KQLPDKTVICPAHEYTLSNLVFAEAFAPNEKVKSAVKNQRISVESLRAQN
KPSLPTTLALEKNINPFLQAENLADFIYLRKAKDNF
>MS0824 gloB, GloB protein
MNIDIIPVTSFQQNCSLIWDDRKNAAIIDPGGEPKKLIEKIEENGLDLKM
ILLTHGHLDHIGAAPALKAHFGVDIIGPHEDDVFWFENLPQQSAQFGLFE
ANAFLPDMWLNRENEVLEVGSLKLEVLHLPGHTPGHVGFFEHQNIVAFTG
DVLFRNSIGRTDFPGGSYDDLISSIKEKLFPLGDDWIIIPGHGPYTTIGA
EKKTNPYLK
>MS2011 gloB, GloB protein
MKKLVLTTLISATLGLSAIAAHAHPTYAPAKNAVKMQKTQVPGYFRQMVG
DYEVTALYDGVGNLDMSLMAPFTQFSKAELDAMLDDEFAQRSELGGLEGT
IIGFLVNTGDNLILIDAGKGEAEAPIFLDKQGRLIDSLKAAGYQPEQVDI
ILPTHMHADHINGITEKGKRVFKNATVYLPLQEKAFWLDTPMDKLPSEIH
PFIEAARYAVAPYLKADKVKFYNAGDEVFAGVKTVPLFGHTPGHSGFEFT
SKGEKILFWGDVMHNGAVQMAHPEVAIEFDADAEAARTNRQTILTKIAAD
KTLIAAAHLPFPGLGHIKTEKDGKGYRWYPVQYRPFDKH
>MS1993 glpA, GlpA protein
MLGCVFYLTTNQFTFTRGGIMGMSSQLYKNVGDFSPINTDVIIIGGGATG
AGVARDCSLRGLKCVLLERHDIATGATGRNHGLLHSGGRYAVNDRESAEE
CIKENLILKRIARHCVDDTKGLFITLPEDDLDYQKKFIEACQASGIEAEA
IDPALAKFMEPSVNPDLVGAVVVPDGSIDPFRLTAANMIDAVENGAQVFT
YCEVKGLIREGGRVIGVNVYDHKNKINRQFFAPMVVNAGGIWGQGIAEYA
DLKIRMFPAKGALLVMGHRINGMVINRCRKPADADILVPGDTICVIGTTS
DRIPYDQIDNMEVTPEEVDILIREGEKLAPSLRHTRVLRAYAGVRPLVAT
DDDPSGRNVSRGIILLDHAQRDGLDGFITITGGKLMTYRLMAEWATDLVC
QKLNNSKKCETSDRTLPGSNESREETSQKVVSLPTTIRNSAVYRHGSRAT
RLLENERLDRSLVCECEAVTAGEVRYAVDELNVNNLIDLRRRTRVGMGTC
QAELCACRAAGLMARFDVATPRQSTEQLASFMEERWKGIRPIAWGDAVRE
AEFTSWIYYSLLGLNDVLPEDAQGVNNNEF
>MS1994 glpB, GlpB protein
MNFDVVIIGAGIAGLTCGLTLQEKGVRCAIINNGQAALDFSSGSMDLLSR
LPNGSTVDSFAQSYAALAQQSPNHPYVILGKDVVLDKIQQFETLAKSLNL
SLVGSSDKNHKRVTALGGLRGTWLSPNSVPTVSLEGKFPHDNIVLLGIEG
YHDFQPQLLADNLKQNPQFAHCEITTNFLHIPELDHLRQNSREFRSVNIA
QVLEYKLSFNNLVDEIKQAVGNAKAAFLPACFGLDDQSFFESLKQATGIE
LYELPTLPPSLLGIRQHRQLRHRFEKLGGVMFNGDRALRSEFEGNKVARI
FTQLHLENAVTAKYFVLASGGFFSNGLVSEFEEIYEPLFRSDIVKTERFN
ATDRFSWISKRFADPQPYQSAGVVINAECQVQKDGNNVENLFAIGAVIGG
YNGIELGCGSGVAVTTALKVADNIIAKESSN
>MS1995 glpC, GlpC protein
MNIQELIKQAKQDMQSPIAAEIFHDKSFESCIKCTACTAVCPVSRNNPLY
PGPKQAGPDGERLRLKSPSFYDEALKYCLNCKRCEVACPSDVKIGDIIVR
ARNKHLAQQNKPFVQKLRDAILSNTDIMGTLATPFAPIVNTVTGLKATKF
VLEKTIQVSKHRTLPKYSFGTFRSWYMKNAAKEQAKFDQKVAYYHGCYVN
YNNPQLGKEFIQVFNAMDIGVVLLEKEKCCGLPLSVNAFPERAKKLAQFN
TDYIEKMLDENGLDVISEASSCTLNLRDEYHHILGIDNAKVRPHIHMVTP
FLYKLFQQGKTLPLKPLKLRVAYHTACHVEKAGWAPYTLEILKQIPGLEV
VVLPSQCCGIAGTYGFKAENYETSQAIGKTLFDNINEGGFDYVISECQTC
KWQIDMSSNVTCIHPITLLAMSINQ
>MS0752 glpC, GlpC protein
MNVNFYVTCLADVVKAGVAKNTVLLLEKLGCKVIFLEKQGCCGQPALNSG
YTKQALPGMKNLVETFEVNDYPIVAPAGSCVYAIKNYPEYFTRFNEPQWA
ERAQKIADRFYDLTDFIVNVLGVTNVGATLTGKAVYHPSCSLSRKLGIVK
EPVSLLQQVKGLTLLPIANQQTCCGFGGTFSVKMAEISGEMVKEKVAHIS
EADPDYLIGADVSCLMNIAGRLEREGKKVKVMHIAEVLMQEEK
>MS1990 glpF, GlpF protein
MPLFFYFILYIMTHILCENREFKKIPNGLFLIKTTNHNFFKHNSQSSKEK
NMNPYLAEFLGTALLVLMGNGVVANVCLNKTKGNGSGWIVITTAWAFAVY
VAVVATGPYSGAHLNPAVTLGLAANGGFSWTMVPGYIIAQILGGIFGGLV
VYLFYRDHFSATEDEGAKRASFCTEPAIRNYGSNLFSEIIGTVVLVSVIF
YISAGSITLPGAEGATPVGLGSIGGLPVAILVWAIGLSLGGTTGYAINPA
RDLGPRIALTLLSKKLKTSPDWGYAWVPVLGPCIGGLLAAIGYQIVM
>MS1965 glpF, GlpF protein
MKKLFAEFFGTFWLVFGGCGSAVLAAAYPELGIGFAGVALAFGLTVLTMA
YAVGHISGGHFNPAVTLGLVAGGRFQAKEAFSYILAQVVGGVMGATVLYA
IASGKVGFDAVNGGFASNGFGEHSPNGYSLAAVFIAEVVLTAFFLIIIHG
ATDKRAPAGFAPIAIGLALTLIHLISIPVSNTSVNPARSTAVAVFQGGWA
LEQLWVFWVAPIIGGIIGGIIYRVLLESKD
>MS2185 glpG, GlpG protein
MQLLFRSEIPSFAWQFRDYIRKKYQIELILQQEKTDMRQNVIAVYLSGNS
EQTAAILQDLAEFHRNPFDERYERASWETGDVSSGSHSLKELAENSSQGI
KQQLLKTGPVTLLITLICIIVYGFEISGMAEQIMQFAHFPYEFGENQQIW
RYFTHSLVHLSSMHITFNLVWWWIFGGAIERYFGSTKLIIIYVLAAFATG
VTQNFASGPHFFGLSGVVYAVLGYVFVADKFSPNNRFNLPSGFFNVLIIG
IALGFVTPLIGIKMGNTAHITGLLVGLILAFLQEKIGKKSK
>MS1988 glpK, GlpK protein
MNTYFINYSSRRLTMSEKKYIIALDQGTTSSRAVLLDHDANIVEIAQREF
TQIYPKAGWVEHNPMEIWATQSSTLNEVVAKAGITSDEIAAIGITNQRET
TIVWEKETGNPVYNAIVWQCRRTSDITDKLKADGYEDYIRQTTGLVVDPY
FSGTKVKWILDNVEGARAKAERGELLFGTVDTWLVWKLTQGRVHVTDYTN
ASRTMLFNIHTKQWDDKMLEILDIPRSMLPEVKNSSEVYGQTNIGGKGGV
RIPVAGIAGDQQAALYGHLCVTAGQAKNTYGTGCFMLLHTGDKAITSKNG
LLTTIACNAKGEPEYALEGSVFIAGASIQWLRDELKIVHDSYDSEYFATK
VPSTNGVYVVPAFTGLGAPYWDPYARGAILGLSRGANRNHIVRATLESIA
YQTRDVLEAMQSDSGEKLKYLRVDGGATANNFLMQFQADILDVNVERPVV
KEVTALGAAYLAGLAVGFWKDLSELQDKARVERTFTPDNDNEKRERRYKG
WKKAVRRALEWAKEDVE
>MS0380 glpR, GlpR protein
MKLNEKEQLIIDSLKRKDVITNIELSEILQCSTVTIRSLIRSLEKKGLII
RTHGGAKLCNDYLDIHIPAGNIFKEREAKLRIAEKAYQYIAERDTIILDD
SSNSYYLAQVIKKYSDKYLIIITNSLPVIAELSTCSAVEIISIGGVLRGN
KNAFVGDFAIEMLKNFKATKAFIGVHGIDPEFGITSIGNEQMMIKKQIFK
IAQYVYVLTCSEKFGTGYLLVSAPLSQVHKIITDKNIDKNILNVIKSSVD
IDLV
>MS2186 glpR, GlpR protein
MKQSIRHQKIVELVKLQGYISTDELVTLLNVSPQTIRRDLNELAENNLIR
RHHGGAASPSSAENSDYSERKLFFSLEKNHIAQAVSRLIPNGSSLFIDIG
TTSEAVANALLGHQNLRIVTNNLNAAHILMKNDTFKITVAGGSLRQDGGI
IGEATVNFISQFRLDYGILGISSIDLDGSLLDYDYHEVQVKRAIMESSRE
TVLVTDHSKFSRQAIVKLASVTDVDYLFTDQEPPKSIMELIHNSSVELRV
CK
>MS0024 glpR, GlpR protein
MVRSNIMNEQIRHNKLLTLLGENGFLSVQEIMTALNISPATARRDITKLN
EQGRLKKLRNGAEAVIQSTFQPQKKQNEIKNLDEKQRIAALAASLCQNDS
SAILTCGSTMLLLGNALCNRNVQIITNYLPLANQLIENDHERVVIMGGQY
NKSQAITLSLSEHNEAFAADIMFTSGKGLTAQGLYKTDMVIASSEQRLLK
RAQKLIVLVDSSKLDKTVGMLFTELKNIDLIITGQEADPDFIRTLREKGV
DVMLA
>MS0074 glpR, GlpR protein
MSVDRQNAIKLFLRSHNMATVEQLVKITNSSPATIRRDLIKLDDAGIINR
THGGVSLRDSFPYQPTTNEKQYQHVTEKENIADYVVSLISPGDSVLLDAG
TTTLCIAKKLVNIPLRVITSDLHIALLLSEYKQIDIVMTGGAIDKSSQSC
IGQHGLDLLQNINPDFAFVSCNSWSIERGITAPTEDKANLKKCLLQNSRR
KVLVADSSKYGKCSLFKVIELNRLTDIITDHNLPQSAQKALNELDLSVAF
A
>MS0187 glpR, GlpR protein
MKRNFQQRNTQQRRHGIMQLLQQKGEVSVEQLVQLFETSEVTIRKDLTAL
ESNGFLLRRYGGAILMPQDLMDESQDENLSKQKLSIAKAAAERIRDHHRI
IIDSGSTTAALIKQLNSKQGLVVMTNSLSVASELRSLENEPTLLMTGGTW
DTRSESFQGKVAEQVLRSYDFDQLFIGADGIDLARGTTTFNELVELSRVM
AEVSREVIVMVESQKIGRKMPNLELNWQQIDVLVTDDLLSEKDKAVIERH
NIEVIIAK
>MS1983 glpR, GlpR protein
MIPAERQKMLLNLISQQDIVSISQLVETLGVSHMTVRRDIQKLEEEGKVV
SVSGGVKMLEHLSIEPTHNDKSLLSPSQKSQIGIKASEIIPEKTTIYLDA
GTTTLEIAHHIVDREDLLVITNDFVIANFLMKAGKCELIHTGGSVNKSNY
SSVGELAAQFLRQISIDIAFISTSSWNLKGLTTPDENKLPVKRAILQSSN
KRILVSDSSKYGKVATFQICPLSEFDVIICDSDLLENAKDAINEMRIELL
LV
>MS2316 glpR, GlpR protein
MREKKVKPRERQSAIVEFLQINGKTAVEQLAQIFKTTGTTIRKDLTALEA
EKKVLRAYGSVVLVNKDEIDLPEANKTNTNLEVKRRIGQKATEFIGDGDS
LLMDSGTTVLQMVPYLAKYRDLTIMTNSLHIMNALTGLERDYELLITGGT
YRQKSASFHGILAESTVEKFTFDKLFIGTNSFDLDYGLTTFNEVHGVSKS
MCKAAREIIVLADSSKFQRRSPNVVCPLEKINTIVTDKNLDPAIHQALIE
KNINVILV
>MS2254 glpX, GlpX protein
MNRSLSIEFSRVTEAAAISAHSWIGRGDKNAADEAAVKAMRYMLNRIHMD
GEIVIGEGEIDDAPMLYIGEKVGSGMGEQVSIAVDPIDGTRMTAMGQSGA
LAVLAAGGKNTFLKAPDMYMEKLVVSAEAKGMIDLNLPIEQNLRRVASRK
GKLMSELVVMVLAKPRHNEIIKQIQSLGAKVLAIPDGDVAASVQVCLPDA
EADVLYGIGGAPEGVITAAAVRALGGDMQARLLPRNEVKDDTPENQQIAQ
EEMRRCQEMGVAVNQVLSLNELAHDDNLVFVATGITNGDLLKGIQIKGNF
ATTETLMIRGQSHTIRRIQSMHYLDGKDPDLYKSIAL
>MS2371 gltA, GltA protein
MADKKATLTVDGKNYEFDIVKGSLGYESIDIHGLSQNKLFMYDPGLVSTA
VCESAITYVDGDEGMLLYRGYPIDQLASNADYLEVSYLLLFGERPTKQQY
QDFSKLVKRHTLVHEQLTKFFQGFRRDSHPMAVMCGVSGALAAFYHDSID
VKKEEHRELTAIRLLAKIPTLAAMCYKYSIGQPFMFPQNNLSYAGNFLYM
MFATPCEPYVVNPVLERALDKIFILHADHEQNASTSTVRIAASSGANPFA
CIAAGIASLWGPSHGGANEACINMLEEIGTVDRIPEFIARAKDKNDPFRL
MGFGHRVYKNYDPRAKVMRETCHEVLKELNIKNPLFDVASELERIALSDP
YFIDHKLYPNVDFYSGIVLKAIGIPTSMFTVMFALARTVGWIAHWKEMYK
QGNFKIARPRQIYTGYTERDFPAIDKD
>MS0731 gltD, GltD protein
MAKFFLAPADNYDVKIGELVDKFVNKVRSFPPGTCPLVVQYASLRSSMSQ
TCGKCVPCRDGIPHLSFLLRDILAGEGDDSTMRQIRELAEMIRDGSDCAI
GYQPAIEILDSIEEFKEEYESHIHNKSCQKVIGQRIPCINMCPAHVDIPG
YIAHIGDGNYAEAINLIRKDNPLPTACGLVCEHPCEERCRRRLIDDAINI
RGLKKYAVDQVAADVVKVPQALPDTGKKVAVIGGGPAGLTCAYFLAQMGH
RVTIYERQKALGGMLRYGIPNYRFPKDRLDQDLNAILSAGRIEVKYGVMV
GDDIAIEDIYNSHDAMFVGIGAQKGKTLRIKGSEANNVFSAVEMLDDIGN
GKIPDYTDKVVVVIGGGNVAMDAARSAVRCKAKDVRIVYRRRQDDMTALH
AEIEAAIMEGIELITLAAPVAIEKDEQGNCTGLTVQPQMTGPYDHGGRPS
PVAVKKPPFTIGCDVILIAVGQDIISLPFEEFGMPANRGIFQADLTTAVP
DMDGVFVGGDCATGPATAIKAIAAGKVAAHNIDEYLGYHHEFPCETKAPP
PKENVRIQVGRANTTERPAYIRKCDFEHVENPYTYEEAMQEAERCLRCDH
FGCGVLQGGRDL
>MS0030 gltS, GltS protein
MTFDTYETLALACLVLLLGYFLVKRVKLLSNFNIPEPVVGGFIVAIVLTV
VHEIWGLSFSFDSNLQRTMMLVFFSSIGLSANFARLIKGGKPLVMFLVVA
AMLIAIQDTVGIFGSMALGLDPAYGLIAGSVTLTGGHGTGAAWAETLTND
FGISGAMELAMACATFGLVFGGIIGGPVARFLLTRLHKEEVPEDENVDDV
QEVFEKPVYRRKVNSRAIIETISMMAVCLLVGQFLDELAKGTAFQLPTFV
WCLFTGVILRNTLTLVFKFTAPDQTIDVLGTVGLSIFLAIALMSLKLWEL
AGLALPVFVILTLQVVVMATFAILVTYRVMGSDYDAVVLSAGHCGFGLGA
TPTAVANMQAVTAHFGHSHKAFLIVPMVGAFFIDLLNASLLKFFVEVAAY
FH
>MS1295 glyA, GlyA protein
MLQNHSIAEFDPVLWDAIQNENRRQEEHIELIASENYVTKAVMEAQGSQL
TNKYAEGYPGKRYYGGCEYVDIVEQLAIDRAKELFGADYANVQPHSGSQA
NAAVYGALLNAGDTILGMDLAHGGHLTHGAKVSFSGKIYNSVLYGITAEG
LIDYEDVRVKALESKPKMIVAGFSAYSQVVDWAKMREIADEVGAYLFVDM
AHVAGLIAAGLYPNPLPHAHVVTTTTHKTLAGPRGGLILSACGDEEIYKK
LNSSVFPANQGGPLMHVIAAKAVCFKEALQPEFKAYQAQVLKNAKAMVEV
FKQRGFEVVSKGTENHLFLVSFVKQGLTGKAADAALGEANITVNKNSVPN
DPQKPFITSGIRVGSPSITRRGFNEADASTLAGWMCDVLESIGKDNYDQV
IAETRAKVLEICKRLPVYGD
>MS1953 glyQ, GlyQ protein
MSAKFNVKTFQGMILALQDYWAQQGCTIVQPFDMEVGAGTSHPMTALRAL
GPEPMAFAYVQPSRRPTDGRYGENPNRLQHYYQFQVVIKPSPDNIQELYL
GSLKMLGFDPTQHDIRFVEDNWENPTLGAWGLGWEVWLNGMEVTQFTYFQ
QVGGLECKPVTGEVTYGLERLAMYIQGVDSVYDLVYSDGPLGKTTYGDVF
HQNEVEQSTYNFEYADVDFLFECFNKYEQEAKFLLKQEPRMENDKEIWVE
TALPLPAYERILKAAHSFNLLDARKAISVTERQRYILRIRALTKGVAEAY
YASREALGFPGCK
>MS1956 glyS, GlyS protein
MTTQNFLAEIGTEELPPKALKKLATAFAENVENELNQAGLTFEKVQWFAA
PRRLAVKVLNLATSQPTKEIEKRGPAVSAAFDAEGKPTKAAEGWARGCGI
TVEQAERLATDKGEWLVHRATIEGQPTKNLMLDIVTRSLANLPIPKMMRW
GDKTEQFVRPVHTVSLLLGGELIEGEILGIASGRTIRGHRFLGEAEFQIA
HADEYPQILKDKGSVIADFNERRAIILADSQAKASALGGVADIEDDLLDE
VTSLVEFPNVLTATFEERFLAVPAEALVYTMKGDQKYFPIYDKNGKLLPH
FIFVSNINPTDPTPIIEGNEKVVRPRLSDAEFFFNTDKKQRLEDLLPRLE
TVLFQQQLGTLLDKTKRIQALAGEIATQIGADKAKAERAGLLSKCDLMTN
MVFEFTDTQGVMGMHYARHDGEDEEVAVALNEQYMPRFAGDNLPNSLVAS
SVALADKFDTLTGIFGIGQAPKGSADPFALRRAALGALRIIVEKNLPLDL
AEIVKKSTALFADRLTNQNVVDDVVDFMLGRFRAWYQDEGIAVDVIQAVL
ARRPTKPADFDARVRAVSHFRTLDSAEALAAANKRVSNILAKIEGEISSK
IDRTLLLEPEEKALAEQVLALQSELAPLFAKGEYQPALDRLAGLREVIDN
FFDKVMVNAEDEKLRQNRQAILNTLRNLFLQVADISLLQ
>MS0218 gmhA, GmhA protein
MILADSFKQGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISD
VSHLSCVSNDFGYDYVFSRYVEAVGKEGDVLFGLSTSGNSKNVLNAIEAA
KAKGMKVIAMTGKDGGKMAGLADVEIRVPHFRYADRIQEVHIKVIHILMM
LIEFEMAKAA
>MS1290 gmhA, GmhA protein
MFNGLKILLNNMLEKIKDLYTENIQTQISASRLLPETIVEATTKLVSCLL
RGNKIIVCGHGRSYANAQFLVANLLNRYELERPSFPSVLLTIDSAVGSAI
VSDNHITTLYQRQFNAIAQQGDILVALVPNSGDESIINVINCATNKDVEI
IALTGANDDHLQGLISENDLEVQTPAIKESRILEGHLFIINALCELIDHT
LFTQSG
>MS1738 gmk, Gmk protein
MSQGNLYILSAPSGAGKSSLISALLEQDQANTMMVSVSHTTRQPRPGEEN
GVHYHFVSVEEFELLINEGAFLEYAKVFGGNYYGTSLPTIEKNLAQGIDV
FLDIDWQGAQQIRKKVPSVKSIFILPPSLAELEKRLIGRGQDSAEVIADR
MSKAMDEISHYNEYDYVIINDDFTRALADLVHILRAEKLTLAYQTEQNQA
LINQLLAK
>MS0013 gnd, Gnd protein
MSTKGDIGVIGLAVMGQNLILNMNDNGFKVVAFNRTTTKVDEFLQGAAKG
TNIIGAYSLEDLAAKLEKPRKVMLMVRAGDVVDQFIDALLPHLEQGDIII
DGGNSNYPDTNRRTKALAEKGIRFIGTGVSGGEEGARHGPSIMPGGNPEA
WPYVKPILQAISAKTDKGEPCCDWVGAEGAGHFVKMVHNGIEYGDMQLIC
EAYQFLKEGLGLSYEEMHEIFQQWKQTELDSYLVDITTDILAYKDTDGQP
LVEKILDTAGQKGTGKWTGINALDFGIPLTLITESVFARCVSSFKEQRVA
AAKLFNKTVSPVEGDKKVWIEAVRKALLASKIISYAQGFMLIREASEQFG
WNINYGATALLWREGCIIRSAFLGNIRDAYETNPDLVFLGSDPYFKGILQ
NALADWRKVVAKSIEAGIPMPCMASAITFLDGYTSERVPANLLQAQRDYF
GAHTYERTDKPRGEFFHTNWTGRGGNTASTTYDV
>MS0957 gntK, GntK protein
MTQGKSFILMGVSSTGKTSVGTEVAHRLGLKLIDGDDLHPRANIIKMGEG
KPLNDEDRAPWLERIRDAAFSLEQKSEVGVIICSALKKKYRDLIRQGNER
VKFLFLYGSYELILERMRQRKGHYMKEEMLKSQFDTLEVPQADEADVIHI
DIDGSFEEVVQRCITALKPYL
>MS0524 gntR, GntR protein
MFFIKNKDNMSRDLNLRQDIINQMIDDISSDLLTSPLPSLSALATLYNVS
RTTIRHAITYLTEQKIINRIDAQLIITKKPSADDKITYIKIKKPGNNQIK
KLEKYFSSAVQQKIIKPGDDFTELELAKNANVDIFTVREYLIQFSRFNLI
SHISAGKWRLTKLTQHYADKLFELREMLECHALNCFMNLPKNDIRWKQMK
LLLQEHRILRNNIVEKYVDFSLLDQQLHSLILSAADNPFINDFINLISVI
FHFHYQWDNSNLRTRNILAVEEHLAILVKIVSQDDLGAITELKRHLQTAK
NGLMNSIRLMNN
>MS0688 gntT, GntT protein
METAASMSQMLIGLAIGIALLLILAMKTRIHVFVALILASLTTGLIGGLP
FAEVISSVTKGFGSTLGSTGIIIGLGVMMGAILEKSGAAEQMAFSIIKLI
GKAKEEWALALTGYVVAIPVFADSGLIILTPLARSLSRMTGKSVIGLGLA
MATGLQLAHVFIPPTPGPLAVAGILDIDMGMMIIWGMILTVPTLVMSTLY
AKWLGKKIYQIPNEDGTDFERKEFKEEYIKSIENVEQIYKDKNLPGAGLS
FSPIVIPLILILGNTTVNFLKIENGFADLLKIVGHPIIALIIGLLIALYG
LGRRLSKAETNKAIEDGVKSTGMILFITGAGGALGYVVRDAGIGNALGEA
VLTVGIPGILIPFVIAALMRIALGSATVALITAATLAAPLVPQLGLNPTL
VAMSTCAGAVSFSYFNDSGFWVFNGLYGLKEVKDQFMAKTMVSFIGAFSC
LALVLIFNIFM
>MS0954 gntT, GntT protein
MLIFIMIASVALLLLLIMKFKVHAFVALTIVSLLTALATGIPINKILPTL
LNGFGNTLASVALLVGLGAMIGRLLEITGGAKVLADTLINKFGEQKAPLA
LGIASLLFGFPIFFDAGLVVMLPIIFSVAKQFGGSLIRYAFPAAGAFAVM
HAFSVPHPGPVAAGDLLGANIGLLTIIGLICAIPTWYIATYLFGLHLGKK
YHLDLPKAFLNAMPINETAVLTPPSFKKVILILLLPLGINYAGYGVKYFS
RCKSN
>MS1977 gntT, GntT protein
MSLKIAAILLALLYQEYCMSNEMLILIGIVSVIALLLIMIKGKVHPFVAL
SLVSIAVALSSGIPMGKVVPTLISGMGGTLGSVALIVGLGAMLGKIIEKS
NGADVLASWLLDKFGEKRAPFALAMTGFIFGIPVFVDVGFIVLIPIIFSV
ARRIGGNMLVYALPIGLSMLTVHVLMPPHPGVVAGAQVLNADIGLVLGLG
FIAALPAVLIGQTFIPLFTKNNFVAIPASSDLLEYQKQVSKNVDGLPKFA
TVLAMIVFPLLLIMSGTVSATVLPKESIVREFFSMVGASPFALLLAVCVS
SYILGIRRGWRKEQLEEILNSALAPIAGIILITGAGGMFGKVLNESGVGN
ALADVLSSTGLPILALSFILAAMLRAAQGSATVAVITTATILAPAVTSAG
YSDIQTALVTAAIGAGSMTLSHVNDSLFWVWTKFFGITITQGLRTWSILS
TIYGSLAFLIVTLMWMFA
>MS0953 gntT, GntT protein
MLDTVLNTLAVAKVIDGSQLWVETLRLLGKTPIALLITLIVSIVLLKNQR
SYEQIEKICDSSLGPICAIVLVTGAGGMFGGVLRASGIGEVLASTLGHTG
MPVIVAAFIISSALRVAQGSATVALTTTAALISPMVAADPSLSQMDLCFI
VISIASGATVLAHVNDSGFWLVSRFLEIDTKTMLKTWTVQETLIGIVGFI
IAYVGSIIF
>MS0335 gntT, GntT protein
MIMSITVAFIIGVAVLLFLALKLKVSAFLSLLATALTIGILSGMGTTEII
KDIVAGFSKSVGSIGLVIIFGTMLGNYLEQSRAAHKMALDAVRLVGTKNS
SIAMSISGYLISIPVFSDVGFLILSPLIKAISKKSKIPLAALAVALSAGL
LATHVYVPPTPGPLAAAGLLGIDIGRAIIWGAFAAVVMTLFGWMYAHFYL
MKKSPDYYTFVETVVEEKEVDETNLPGSLASLMPLLLPIVLILLNTTCAA
IFPKDSPVLSVTKFIGDSNIALVIGALTAIALLGKRIGKEKVLKIMDSSL
KDAGSIIFITAAGGALGQILKTSGAGDSLAQAVVSSGLPFILIPFVISAI
LKIVQGSGVVAVITSATLAAPIATQLGIDPILIFLASGAGARAYCHVNDS
YFWVYTNCCGFDMKTGLKTLSNASIFMSLGGLLATFIASLII
>MS0686 gntT, GntT protein
MSGISLIISFIIAIIIMIWMISKLKVHPFLSLMTISLALALVAGIELNKI
PGMIGDGFSSTFKSIGIVIIFGAIIGTILEKTGAALKLADMVVKLVGQKH
PELAMLIMGAIVGIPVFCDSGFVVLNPIREALYKKIAANPVATAVALSGG
LYASHVFIPPTPGPIAAAGALGLESNLLLVIIMGVVVSIPVLTAVYFFAG
YIGKRVTLDEEAQADAAIVKNYEQLLKQYGILPGKFLSLAPILMPIVFMA
LGSIAKIAEIGGNTGIIIQFLGTPIIALAIGVIFSVFLLLQTKKITEFND
LTNETLKIVGPILFITAAGGVLGKVITEAGFVDYIKQNAHIISTTGIFFP
FIISAVLKTAQGSSTVAIITTASIMGMYSAGDSLMSVLGLTSEIAAALCV
MAIAAGAMCVSHANDSYFWVVTNFGKMTAQQGYKTQTLMTFIMGIVGIIT
VYILSLLLL
>MS2331 gph, Gph protein
MNSQFKLIGFDLDGTLVNSLPDLALSVNSALAEFELPQAPEELVLTWIGN
GADILIGRALDWAKEQSGKSLTDEQTAQLKERFSFYYAENLCNVSRLYPN
VKETLETLKEQGFILAVVTNKPTRHVQPVLKAFAIDHLFSETLGGQSLPA
IKPHPAPLYYLCGKFGLYPHQILFVGDSRNDILAAHSAGCTAVGLTYGYN
YNMPIADSHPDWIFEDFADLLKIV
>MS2321 gpmA, GpmA protein
MELVFIRHGLSEWNALNLFTGWRDVNLSEKGVEEAKEAGRKLKAAGFEFD
IAFTSVLTRAIKTCNLVLEESNQLWVPQIKTWRLNERHYGGLQGLNKAEA
AAEHGDEQVRIWRRSYDVLPPVLDPKDPNSAHNDRRYAHLPADVVPDCEN
LKVTLERVLPFWEDQIAPAIKAGKRVLVAAHGNSLRALAKHIEGISDADI
MDLEIPTGQPLVYTLDDNLKVVSKRYL
>MS1172 gpmB, GpmB protein
MKKDLRLYLIRHGRTVWNEQGLMQGWGNSALTEQGVKGAQLTGQALAEVP
FIAAYSSCLQRTIDTANYILGERSVPLFQHIGLNEQFFGSWEGTNVETIR
QTAEFQQMVNDPKNYQASSNGGETWQQVAERAMKAMQDIIDVHHRGDILI
VSHGHTLRLLLALFAGATWQNHREQGKSVAMLNTAINMVRYVQHDEDQAG
KFIIERLNDAAHLG
>MS0287 gppA, GppA protein
MNNENLRAKATALNNVAKHEMREVREIAAIDLGSNSFHMIVARIVNGSIQ
VLSRLKRKVRLAAGLDENGVLDQAAISRGVDCLALFAERLQGFKAENVNV
VGTYTLRSAVNNQEFLRQAQAVFPYPIRIISGEAEAEMIYAGVSHTQPEQ
ARKLVIDIGGGSTEMIIGEGFTPLLVNSRNMGCVSFAKQFFVNGEISEQN
FNRARQTALERVRDLSEQYRQLGWKHVLGSSGTIKTVHQVIMANIDNDGI
ITAGRLDHLIERTLKATHFDNLKLSGLIEERADVFVPGLAILSAVFDAFD
IQQMRYSDGALREGVMYSLETNFQVTNIRERTAEGLAEQFNIDREQAHRV
TQTAVLLAQQFTGWQSPEQAEELQEILLWAALLHEVGIVINHKNLQKHSA
YILQNIELPGFDKEQQRLLATLVRYHINNFRLEDISAGRYEIQDVLSLIR
LLRLAIALNKSRQATESTEEISLKTDRISSLWTLTFEPNYLRDNPLVEND
LAAEQLQLKDIGINFKFA
>MS2213 gpsA, GpsA protein
MSIQASPVTILGAGSYGTALAIALSRNGYPTYLWGHNPTACAQMAQERQN
ARFLPDISFPEALRVESDLKSAVEKSKDLLIVVPSHVFGEVIQQIKPFLH
NRHRIIWATKGLERGTGRLLQNLVEQELGSQYPLAVLSGPTFAKELAAGL
PTAITLAAENEQFAKEFQARIHCSKHFRVYINNDMVGVQLGGAIKNVIAI
SAGMSDGMGFGANARTALITRGIAEISRLGVSLGANVNTFMGMSGLGDLV
LTCTDNQSRNRRFGMMLGQGVDARTAMDEIGQVVEGYYNTKEAYMLAQKQ
GIEMPITEQIYQVLFCGKDAKEAATALLGRKSKVE
>MS0961 greA, GreA protein
MKQIPMTVRGAELLKQELDFLKTTRRPEIIKAIAEAREHGDLKENAEYHA
AREQQGFCEGRIQEIESKLSNCQIIDVTKLPNNGKVIFGATVVLVNTEND
DEVTYQIVGDDEADIKSGLISVNSPIARGLIGKEVDETVSIVVPGGKVEF
DIIEVNYI
>MS2208 greA, GreA protein
MAKSNYITRAGWNVLDQELKYLWKDERPKVTQAVSDAAAMGDRSENAEYI
YGKRRLREIDRRVRFLSKRLEVLQIVDYNPKQEGKVFFGAWIELENESGE
IKQYRIVGCDEFDPAKNWISIDSPVARALIGKQIDDEVRVETPAGKVLLY
VNNIWYEK
>MS0459 groL, GroL protein
MAAKDVKFGNDARVKMLAGVNVLADAVKVTLGPKGRNVVLDKSFGAPTIT
KDGVSVAREIELEDKFENMGAQMVKEVASKANDAAGDGTTTATVLAQAIV
NEGLKAVAAGMNPMDLKRGIDKAVAAVVTELKALSKPCETSKEIEQVGTI
SANSDSIVGQLIAQAMEKVGKEGVITVEDGTGLEDELDVVEGMQFDRGYL
SPYFINKPETATVELDSPFILLVDKKISNIRELLPVLEAVAKAGKPLLII
AEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIAILTAGTV
ISEEIGMELEKATLEDLGQAKRVVINKDNTTIIDGIGDEAQIKGRVAQIR
QQIEESTSDYDKEKLQERVAKLAGGVAVIKVGAATEVEMKEKKDRVEDAL
HATRAAVEEGIVAGGGVALIRAASKAAASLQGDNEEQNVGIKLALRAMES
PLRQIVANAGEEASVVASAVKNGEGNFGYNAGTEQYGDMIAMGILDPTKV
TRSALQFAASIAGLMITTEAMVTELPKDDKLDAAAAMGGMGGMGGMM
>MS0458 groS, GroS protein
MAVGKGRVLENGTVQPLDVKVGDTVIFNEGYGVKAEKIDGEEVLIISESD
ILAIVE
>MS0743 grpE, GrpE protein
MEKIMSEQEKNQENLENAEELTQKANDTENSAEQAEPADETASDALEEAI
ARVQELEEQLAETAKKEQDLLLRSRAELDNMRRRAEQDVEKAHKFALEKF
SKDILNTIDNLERALATPANKEDEAVKSLFDGVELTLKELLATVARFGVE
PVGAVGETFNPELHQAISMQSAEGFETNQITVVLQKGYLLNGRVIRPAMV
MVAA
>MS1052 grxB, GrxB protein
MKLYVYEHCPFCVRARMIFGLKNLPFEQEVLSNDDEATPTSLVGKKVVPI
LVKDDGTAMPESLDIVKYVDENFGDKLLTEQIRPELEVQLKQIGSYYNHL
LLPRFVKLGLAEYNTQSALNYFIQKKTKSIGDFAENLANTPQYLDKLNRD
LTLLDNLILAQDKVNGEQLSVEDIILFPMLRNLTCVKGVIFPTRVKNYVD
CMAKMSKIDLFYGNAV
>MS0755 grxC, GrxC protein
MFVVIFGRPGCPYCVRAKNLAEKLKNSLDDFDYRYVDIIAEGISKADLSK
SVGKEVETVPQIFIDEKPIGGCTDFEALMKEQFNIVA
>MS1683 gshA, GshA protein
MRFDQGNLMNIQQIVKEKGLGLLFRQGTVGIEKESQRVHADGSIVTSEHP
KAFGNRSYHPYIQTDFAESQLELITPPNKKIEDTLRWLSALHEVTLRTID
ENEYIFPMSMPAGLPPEQEIRVAQLDNAADVAYREHLVASYGKAKQMVSG
IHYNFQLDPKLVETLFNAQTDYKSAVDFQNNLYLKMAKNFLRYQWIPLYL
LSATPTVEANYFKDGSPLKPNQYVRSLRSSKYGYVNAPDIIVSFDSIEKY
VETLEHWVNSGRLIAEKEFYSNVRLRGAKKAREFLHTGIQYLEFRLFDLN
PFEAYGINLKDAKFIHHFILLMIWLEETADQDAVELGRARLGEVAFEDPH
SETAYRDEGEQIINQLIDMLKAIGAEQSAVEFAEEKLAQFANPGQTLCAR
LVDAIEQAGGYQQLGGEIAKRNKVQAFERFYALSAFDNMELSTQALMFDA
IQKGLNMEILDENDQFLRLQFGDHFEYVKNGNMTSHDSYISPLIMENKVV
TKKVLAKAGFNVPQSLEFTSVEQAVASYPLFEGKAVVIKPKSTNFGLGIS
IFQQGVHDKADFAKAVEIAFREDKEVMVEDYLVGTEYRFFVLGNETLAVL
LRVPANVMGDGVHTVAELVAAKNDHPLRGDGSRTPLKKIALGEIEQLQLK
EQGLTVDSVPAKDQLVQLRANSNISTGGDSIDMTDEMHPSYKDLAVGITK
AMGAAVCGVDLIIPDLKKPAEPNLSSWGVIEANFNPMMMMHIFPYSGKSR
RLTLNVLGMLFPELV
>MS0671 gsp, Gsp protein
MSEISPNIPTHDAFGSLLGYAPGGIAIYSSDYETADKNEYPDDAAFRSYL
GREYMGYKWQCVEFARRYLYLNHGMVFTDVGMAYEIFSLRFLRQVVNDAL
VPLQAYANGSKKSPEPGALLIWQEGGEFQETGHVAIITEVFNDKIRIAEQ
NVIHYRLPSGQQWTRELPMSVTEQGYILHDTFDDTEILGWMIQTDDSTYS
LPQPTAAPESLEIHAEHIENKGQFDGKWLNESDPFEKLYVTAMNGHQVSR
TDQYRYFTISETAKHELIRATNELHLMYLHATNKVLNDDNLLKYFNIPKL
LWPRLRLSWENRRYQTVSGRLDFCLDERGLKVYEYNADSASCHAEAGAIL
GRWAKVAGLDNGEDPGAHLRNALADCWKHRDNTPLVHIMQDNDSEEDYHS
MFMQSALLQAGCRTKIIHGTEGLHWDKRGRLLDDEDNQILSVWKTWAWET
MLEQLREDATGREVAPPIRTGYPEDKVRLIDVLLRPEVLVYEPLWTAIPS
NKAILPVLWSLFPNHRYLLESGFELTQNLIKNGYAKKPIAGRRGDNVTLF
ADQHSRLDVTHGRFGKQEHIYQQLWCLPKVEEQYVQICTFTVGGHYGGSC
LRSDPSRIIVGDSDMQPLRVLNDKDFLAK
>MS1970 gspD, GspD protein
MRAGRVDENKILKEYLPMESIISFGKKCGLFFGIFISSAFAGESGTFAER
QFSIHLKKAPLVATLQQLALEQNANLVIDDELEGTLSLKLEKVNLERLFH
SVAKLKNLSLHKDKDIYYFTKNNLIEPSSIAGELKNTENFTALSEPNLVS
TTVKLHFAKASEVMKSLTSGTGSLLSPVGSVSFDERSNQLLIQDERRSLQ
NIKNIIAQLDKPIEQIAIEARIVTMNDESLKELGVRWGLLEGVNSAHRIA
GSLEANGFADIGQNLNVNFPTSATPAGSVALQVAKIHGRLLDLELTALEQ
ENNVKIIASPRLLTTNKKSASIKQGTEIPYVAVNRKNDTEHVEFREAVLG
LEVTPHISKDNSILLDLIVSQNSPGANIVYGNGNLISIDKQEINTQVFAK
DGETIVLGGVFNDTITKSEDKVPILGDIPLIKHLFSKESEKHQKRELVIF
VTPHILKQGESLEQIQKRFKYAPKSPEK
>MS1779 gspD, GspD protein
MTAALTAFMLVCAAPIFAKPMYLEQGTSKYIELDKKIDTIFVSSSEVADY
EIVDDYSFMVYGKQEGTTDVTAFDANGNILYTDTLNVNALINNIVDTNKQ
IKARFPNSNLQVKKLGKAYVIEGKANTQHESEEVNRIVGEAIGVAPKVTE
TTLKRGNGMSDEKIPFLDKYEYNGVINNTNIDKTKQINVKLTVAEVNKNF
SDSLGIKWEHLSGSVLENWTSGANGYSGGFDGTTGSIALINANRLSAFIT
AVNNANNGKILAEPNISMLSGETADILVGGEVPFAQRDSDGNTSIIYKDF
GVKLMVGAKVQKNDQVRIVLAQEVSTLAGNYTYTSVGDIPYFQTRRAKST
FEVGNGESFIIGGLLSSSDLEGVSKVPLFGDIPILGAFFRSVTTSRETKE
LVVVATVNLVTPNDAEKVIYPSFEQTGTLERFFNLTPFKNVYHKTLTTNF
MKNGGFIQ
>MS0363 gspE, GspE protein
MIIKNSDVRLMQSIKIIAGNGEQYMIDHELWQRNQQQQHVLLRYLAVPIK
EEEQKLWLAIDNVENIAACETFSFLTGKIVEPVLVSNETLKSLLQPDEPQ
SLSIEETSLIYTESLSQNKENKNTDEPIIRLLNNIFESALSKNASDIHFE
PQKNQLRIRFRIDGVLQQQTPVNLSLAGRIISRLKLLAKLDISETRLPQD
GRFDFKTTFAETLDFRISTLATSNGEKIVLRLQQNKPVDFSFEQLGMEPA
QQQKLEQALNQPQGLILVTGPTGSGKSITLYTALQWLNSASKHIMTAEDP
IEIELEGIIQTQIQPQIGLSFSRLLRTFLRQDPDVIMLGEIRDDESAQMA
LRAAQTGHLVLSTLHTNNAYSAISRLLQLGIKQHEIDNSLLLIIAQRLVR
KRCQKCGQFSENFINCDCHQGYRGRIGIYQFLHPRWQAQKWQYVTDFPSL
YQAAKNKVQQQITDKQELLRVLGSEK
>MS1617 gst, Gst protein
MTALFYSFIMQPKFLINGNFMIILYALTQSRAYRIAWLLEILNLPYKLEI
IERDGETNLAPDALRSIHPLGKSPIIKDGDLVLTESGAIVEYLINRYGGG
KLKPEMNSTDYWQYLHWMHYAEGSLMPLLVIKLIFRKIDEADMPFIAKPI
ANKITEKVKQGFIQPQLKLHLDYIESQLAEKFWLVGDELTGADIMMSFPL
QAAVSYFETNQYPHISAYVSRLNHTESFKRAEQKLGPLTFF
>MS2100 gst, Gst protein
MVTLHYLKQSCSHRIVWLLEALSLDYELKIYDRDPQTLMAPAELKAQHPL
GKAPVLQDGDLVLAEGNAIIQHLLDRYDDENRFTPAHKTGAYSNYVYWLA
VSASMFSANFLALLSTRSDLGDFAQYATAQTPLFFNHVEQTLEGKQWIVG
EQLTGADFALSFPLQWGMKYVDEADYPNIVRYLAQIENHPAYVKANEKTA
GELDLSKF
>MS1281 gst, Gst protein
MTSAANKRSIMTLFSDKSDIYCHQVRIVLAEKGVAYETEIVDPQALSEDL
MELNPYGTLPTLVDRDLVLFNSRIIMEYLDERFPHPPLMPVYPVARGKTR
LLMLRIEQDWYPALEKAEKGTEEERATALKQLKEEMLAIAPIFTQTPYFM
SEEFSLVDCYIAPLLWRMQELGVEFGGAGAKAIKGYMAKVFERESFVQSL
GNNAPKNLMDEK
>MS2085 gst, Gst protein
MKLYYLPGSCATVPYVALEWIGEPYEAQAVTHDYIKSAEYLVLNPQGQVP
LLVDNDLVLTQNIAILTYLDNLFPEKKIFGSKTARDKAKAMKWLAFFNGD
LHKAFVPLFRVPAYAEGNEELTNEIRKDAAANVIRMLSIADEYLTRHIHF
GEQISVADVYLFVELRWCKMLGLDLSQFANLQAFYQRIAADVGVKTVLIK
QGISE
>MS0257 gst, Gst protein
MKLWYSTTSPFARKVLVTLKHQQLEDKTDLLRITSSFDPDSPHNQVNPLG
RIPALQRNCGNWLFGSLLICEYLDQKGACPKLIPESGKPRWAVLALHNLV
DGIMENTMPMVAEKMLRPENEWWTSRHQQLMDRNVRSFTQLEQALLPFGT
ELNIGTITAVCLIDWWIFRADKIGYDLAAHFPHLVTWAEDMNNKYAILAA
TKPGI
>MS0772 guaA, GuaA protein
MNNIHNHKILILDFGSQYTQLIARRVREIGVYCELWAWDVTEQQIREFNP
TGIILSGGPESTTEDNSPRAPEYVFNAGVPVLGICYGMQTMAMQLGGLTE
PSSHREFGYASVSLENSTALFAQLNDDLNSSLPKLDVWMSHGDKVTRLPE
GFQLTGTTSTCPIAAMSDESRHFYGVQFHPEVTHTKSGLALLTNFVVNIC
GCTTNWTPENIIEDAVARIKAQVGDDEVILGLSGGVDSSVTALLLHRAIG
KNLHCVFVDNGLLRLNEGDQVMEMFGDKFGLNIIRVNAEDRFLDALKGID
EPESKRKMIGKVFVDVFDEESHKQTSVKWLAQGTIYPDVIESAASKTGKA
HVIKSHHNVGGLPDYMKLGLVEPLRELFKDEVRKIGLALGLPAEMLNRHP
FPGPGLGVRVLGEIKKEYCDLLRKADAIFIEELYNSGWYYKVSQAFTVFL
PVKSVGVMGDGRKYDWVVSLRAVETIDFMTAHWAHLPYDLLGKISNRIIN
EVDGISRVVYDVSGKPPATIEWE
>MS0774 guaB, GuaB protein
MLRIKQEALTFDDVLLVPAHSTVLPNTANLSTQLTKEIRLNIPMLSAAMD
TVTETKLAISLAQEGGIGFIHKNMSIERQADRVRKVKKFESGVVSEPVTV
FPELSLGELAQLVKKNGFAGYPVIDQNDNLVGIITARDTRFVKDLNKTVA
EVMTPKEKLVTVKEGAKREDIIALMHSHRVEKVLVVDDNFKLKGMITVKD
FQKAEQKPNACKDELGRLRVGAAVGAGPGNEERIDALVKAGVDVLLIDSS
HGHSEGVLQRVRETRAKYPNLPIVAGNIATAEGAIALADAGASAVKVGIG
PGSICTTRIVTGVGVPQITAISDAAAALEGRGIPVIADGGIRFSGDIAKA
IAAGASCVMVGSMFAGTEEAPGEIELYQGRSYKSYRGMGSLSAMSQGSSD
RYFQSDNAADKLVPEGIEGRIAYKGLLKDIIHQQMGGLRSCMGLTGSATI
EDLRTKSQFVRISGAGIKESHVHDVTITKEAPNYRLG
>MS1486 gumC, GumC protein
MSKKQNDVIDLTKLLGLFWDQKRIILLSTLLCAGLGLVYSLLAPSIYMAT
SSVQVEEKYTGGALQGLSSIFEQESTAGTEIAVIKSRAIVSKAVEDLNLT
TEVSPVYPIPFFSKAVEKLMGDKPEITVARFVPKREDAQEYTLVIGSNEN
EYSVLDEQKQLVLNGVVGEKYDNQDIEILVSQLKGSSGKRFSLKKMEKSD
VLELVEALQKAVTADEKGKQTGVIELTFKGEDPEYIQKVLHSITQSYLEH
STARNSAEASNSLSFLQKRLPEVRDRLTKSENELNEYRQKKASVDLELEA
KSVLDTLVQLDSNLNALTIRESEISQRFTKRHPNYVALLEQRQVLLDEKA
RLTKQLESLPETQKDTVRLTRNFEVDQQIYTQLSNKIQELDVVKAGAVGN
VRILDEAQTLPKPVAPRKLIILVLTAIVGFLLGSGGVILKSILQNGILTV
SEVSETGLVTYASVPFSKKQSALSRSKGGNRIGEGLLSDRYADDFSLESL
RSLRTGLNFMLAESNKRVVLLSGVSTGVGRHFITANLADLLAKADKKVLL
IDADLRNSHLHHILGVENNMGLSELLAQNIPFEQGVRHLDSRFDLITCGS
RSDAPSELLSVSRCKQLLDWAAQHYDTVLVTAPPILSVTDAAIVGQHADI
TLLIGRFEQTSVSEIEASRERFDNAGVEIKGFVLNGVKPRAVNKGDYFRN
EYA
>MS0996 gutQ, GutQ protein
MDYLQNARETLATEKDALTLLSRNLDQSFNNVIDLILNCGGRLVIGGIGK
SGLIGRKMVATFASTGTPSFFLHPTEAFHGDLGMLKPIDIVMLISYSGES
DDVNKLIPSLKNFGNTIIALTGNKHSTLAKHADYVLDISVEREACPNNLA
PTTSALVTLALGDALAVALINARHFQPMDFAKFHPGGSLGRRLLCRVKDQ
MQTNLPVTALNTSFTDCLTIMNEGRMGVALVMENDDLKGIITDGDIRRAL
AANGADTLNKVARELMTSNPKVINQDTYIGQAEDYMKEHRIHSLIVVDND
NKVVGLVEFSS
>MS0858 gyrA, GyrA protein
MTELVQDITPVSIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLF
SMNQSGNTYNKSYVKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDAPAAMRYTEVRMQRITQELLTDLDKETVDFSPNYD
GKEMIPDVLPTKIPSLLVNGSSGIAVGMATNIPPHNLGEVMDGCLAYMDN
EDISIDELMQFIPGPDFPTAALINGRRGIEEAYKTGRGKVYVRAKASVEI
NDKGREQIIITEIPYQVNKAKLVEKIGELVRDKKIEGIAGVLDLSNKEGI
RLEIDIKRDAVGEVVLNHLYALTQMQVTFGINMVALDHGQPRLFNLKQII
EAFVKHRREVVTRRTVYELRKARERAHILEGLAIALANIDPVIELIRASK
TADEARENLLSRAWSLGNVAPMLEAAGVDASRPDGLAAELGAHDGQYFLS
ETQARAILELRLHRLTGLEHEKIVEEYHEILLQIGELIRILTSSVRLNEV
IREELELVKSTYNDERRTEITAASGDINLEDLIAQEDVVVTLSHEGYVKY
QPLTDYEAQRRGGKGKSATKMKEDDFIERLLVANTHDTILCFSSRGRLYW
LKVYQLPEASRGARGRPIVNILPLEDNERITAILPVASYDEDKFVVMATA
CGIVKKTALTEFSRPRANGIIAVNLRDEDELIGVDITDGSNEIMLFSSQG
RVVRFAEAAVRAMGRTATGVRGIKLALTNDISDDESAVEIEEISDDNAED
TLDLNIDKVVSLVIPKNEGAILTATQNGYGKRTALNEYPTKSRNTKGVIS
IKVSERNGKVVAATQVEETDQIMLITDAGTLVRTRVSEVSIVGRNTQGVR
LIRTAEDEHVVSLERVAEPEEDEFDAESPETAVENSEE
>MS0875 gyrA, GyrA protein
MSEINYEGIEQMPLRTFTESAYLNYSMYVIMDRALPFIGDGLKPVQRRIV
YAMSELGLNATAKYKKSARTVGDVLGKFHPHGDTACYEAMVLMAQPFSYR
YPLVDGQGNWGAPDDPKSFAAMRYTESRLSKFAELLLGELGQGTVDYQPN
FDGTILEPQYLPARLPHILLNGTTGIAVGMATDIPPHNLNEIADAAVMLL
DNPKATLDDILTLVQGPDFPTEAEIISPKEEIRKIYENGRGSVKMRAVWK
KEDGEIIITALPHQASPSKVIAQIAEQMTAKKLPMVEDIRDEADHENPVR
IVLVPRSNRVDSEALMAHLFATTDLEKNYRVNMNMIGLDNKPAVKNLLQI
LTEWLSFRRSTVTRRLQYRLDKVLSRLHILQGLMIAYLNIDEVIHIIRNE
DEPKPVLMARFELSDEQAEAILNLRLRHLAKLEEHELQAEKDQLEQERAQ
LEQILSSERRLNTLIKKEIQQDAKTYASPRRSPIVERAEAKAISESEMIP
AEPVTVILSEMGWVRCAKGHDIDPQGLNYKAGDKYLAHACGKSSQPAVFI
DSSGRSYALDPLSLPSARSQGEPLTGKLTLPAGASVDYLLIENENQQLLM
ASDAGYGFICKFEDLIARNKAGKAVISLPENAKVLPPKNIENSTALLVAL
TAAGRMLIFPVKDLPSLSKGKGNKIVTIPAASAKERTDLLVKLLLISENS
SLVFHSGKRKITLKPEDLQKYRAERGRKGTQLPRGLTSQAEITVVEPN
>MS0878 gyrB, GyrB protein
MANNYSAEDITVLKDLEPVQLRPGMYTDTSRPNHLGQEVIDNSVDEALAG
FANKIEVILHKDQSLEVIDNGRGMPVDIHPTEKVSGIELILSKLHAGGKF
SNKNYEYSGGLHGVGISVVNALSELVEVIVKRDGQIYKIVFSNGQKIEEL
QVIGTCGRRNTGTTVRFKPNPKYFDSDKFSVTRLRHLLRAKAVLCSGLEI
KFTDKVNDTEESWCYQDGLSDYLIEAVQGYNALPQTPFIGDFSADSEAVS
WALLWLPEGGELIAESYVNLIPTIQGGTHVNGLRQGLLDAMREFCEFRNL
LPRGVKLVADDIWDRCAYVLSLKMHDPQFAGQTKERLSSRQSAVFIGGVV
KDAFSLWLNQNVEIGQQLAELAINSAQRRLRASKKVVRKKLVSGPALPGK
LADCSQQDLEKTELFLVEGDSAGGSAKQARDREYQAILPLRGKILNTWEV
SSDQVLGSEEVHNIAIALGIDPDSDDLSQLRYGKVCILADADSDGLHIAT
LLCALFLRHFPKLVQQGHVFVAMPPLYRIDLGKEVFYALDESEKEGILDR
LKSKRGKPNVQRFKGLGEMNPSQLRETTMEPNTRRLVQLTYEQDETNMTE
TFELMDMLLAKKRAEDRKNWLQTKGDQVDLTV
>MS2249 gyrB, GyrB protein
MSENTQENYGASSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDN
AIDEALAGYCKDIIVTIHEDNSVSVQDDGRGIPVDIHPEEGVSAAQVIMT
VLHAGGKFDDNSYKVSGGLHGVGVSVVNALSDKLQLTIRRQGHVYEQFYS
LGEPNEQLKNIGETDKTGTTVRFWPSPTIFSNTVFEYEILKKRLRELSFL
NSGVSIKLFDERDGANDHFHYEGGIQAFVEYLNQNKTTIHPKPFYFSIEK
EGIGVEVALQWNDGYNENIYCFTNNIPQRDGGTHLAGFRGALTRTLNSYM
DKAGLNKKGKNDKDKVETSGDDAREGLVAVISVKVPDPKFSSQTKDKLVS
SEVKGAVESAMNERLQEYLEENPNDAKIIATKIVDAARAREAARKAREMT
RRKGALDIAGLPGKLADCQERDPAFSELYLVEGDSAGGSAKQGRNRKNQA
ILPLKGKILNVEKARFDKMLSSQEVGTLITALGCGIGRDEYNPDKMRYHK
VIIMTDADVDGSHIRTLLLTFFYRQMPEIIERGYVYIAQPPLYKVKKGKQ
EQYIKDEPAMTQYELAIALEDAALYVNANAPAMTGLPLEKLVADYNNTHQ
MIERLHRRYPEALLKELIYYPQLTTELMKDTGATEEWTKNLIAVLTEKDT
QGNSYSFRLQYDTERQVNDIILTVRTHGVDTNYTLNYQFATGNEYARIVK
LGNQLKGLLEDGAYVTRGNGKLEISSFEQAIEWFVKESRKGLTVQRYKGL
GEMNPEQLWETTMDPDARHMLKVTIKDAVAADQLFTTLMGDEVEPRRDFI
ESNALRANLDI
>MS1324 hemA, HemA protein
MTILVLGINHKTASVALREKVAFSPEKRDLAFQQIAQSELAQSEVILSTC
NRTEIYLHNKHISPEADQENQRWLEQCIQWFADVHQLDVDELRNCLYIKQ
NQSAVNHLMRVSCGLDSLVLGEPQILGQVKQAYQYSEDYCQAQHMPMSSE
FSRLFQKTFSVAKRVRTETNIGNSAVSVAYAACSLARQIFEGLKDLNILL
VGAGETIELVARHLLRHGVKKLMISNRTLARAELLVEKLEHNKYIQVLSL
QQLQDGLNQADIVISSTGSPIVLITAEMVKQAQQKRRNAPMLIVDIAVPR
DVDERVEKLDGVYHYTVDDLHSIIQRNLSEREKASKEAETIIDAEASDFF
EWMKVHQFSNLIRTYRESAEQIRQDLLEKAVQAIGQNEDPETVLQELSYK
LMNKLIHSPTQAMQAMMKQGSIQGLRSFSSALGIADKKERNPQK
>MS0503 hemB, HemB protein
MTQQIFTGFPARRLRRLRKHDFSRRLVAENALTADDLIYPVFVIEGENRR
EPVPSMPGVERLTIDQLLVEAGLLVKYGVPVIALFPVVGAERKSLMAEEA
YNTDGLAQRAVRALKAAYPQLGVMTDVALDPFTTHGQDGIIDETGYVVNE
ITTEVLVKQALSHAQAGADIVAPSDMMDGRIGKIREALEAEGFINTQIMA
YSAKYASNFYGPFRDAVASAGNLKGGDKKTYQVDPANSDEGLQEVALDLQ
EGADMVMVKPGMPYLDMVYRVKDYFGVPTFAYQVSGEYAMMMAAIQNGWL
KEKECIMESLLCFKRAGADGILTYFAKQVAEWLYLEGKNK
>MS0276 hemC, HemC protein
MNNKNHLKIATRQSPLALWQANYVKDRLTALYPDLQVELVTMVTKGDVIL
DTPLAKIGGKGLFVKELEHALLNHEADIAVHSMKDVPMEFPQGLGLSVIC
KREDPRDAFVSNKYRSLAELPQGAIVGTSSLRRQCQLKSLRPDLDIRSLR
GNVGTRLSKLDNGDYDAIILASAGLIRLGMAERIASFIETDISLPAAGQG
AVGIECRVDDELVQSLLAPLAHQETTICVLAERAMNNRLQGGCQVPIGGF
AQVKNGEVFLRALVGATDGSQIIRAEGKSAVENAEVLGVQIAEDLLQQGA
DKILKSVYQD
>MS0275 hemD, HemD protein
MAVLVTRPAEKGIQLVDMLNKSGVAALHLPFFNITAGRELNDLPNKFNQL
KPNDYVVAVSQSAVDFAAETLQNTGFHWRTDLQYFTVGQQTALHFASLSE
QPVHYPFLSENSEGMLALAQMQNLKGKNVLLLRGNSGRELFPQQVLARGG
DIDILECYQRQPIDYDNVEQTSICKRAGIQTIVVTSGELLNTLIQFVPEN
EYDWLRSCQLVVVSTRIENMARKFGWTDIVVSPKADNNTLLQTILTLTS
>MS0183 hemE, HemE protein
MTTLKNDRYLKALLREPVDMTPVWMMRQAGRYLPEYKATRAEAGDFMSLC
RNADLACEVTLQPLRRYELDAAILFSDILTIPDAMGLGLTFGAGEGPKFD
RPIETKSAVENLPIPDPEQELQYVMNAVRTIRRELNGEVPLIGFSGSPWT
LATYMVEGGSTKAFTKIKKMMYAEPKLLHKLLDKVADSVVLYLNAQIKAG
AQAVMIFDTWGGVLGHREYLDFSLQYMHKIVNGLIRENDGRKVPVTLFTK
GGGLWLDAIADTGCDAIGLDWTVNLAQAKAQVGHKVALQGNMDPSVLYAA
PERIEQEVRSILADFGEGSGHVFNLGHGIHQDVPVESPKVFVDAIHQYSK
PYHK
>MS1346 hemH, HemH protein
MKSSKTGILLANLGTPDTPSPKAISRYLKEFLSDPRVVDLPRWKWLPLLN
GIILPIRSRRIAKNYGAIWTEQGSPLFAITQKQKALLTEFFQQRQQNVII
EIGMTYGNPSMQYAIDNLIEQKVDKIIVLPLYPQYSSTTTAPVFDVFAQA
LKRHRHIVPFEFIHSYHLDENYIEALVKSIKVRLKNDEFLLFSFHGIPKR
YEQEGDFYRPQCEQTAQAVVQKLGLKKEQWRLCFQSRFGSEPWLQPYTDK
FLETAAQQGITKLAVICPGFSADCLETLEEIKEENKRIFLAYGGESYHYI
PALNDSPEHIACLGNLLLKRMTI
>MS1265 hemK, HemK protein
MIAYNIVKLILHRYVLYSHISFHIGIIMTSVTFNQELIDFIQADDITNNL
HSIKDYLRWTYSNFNRSDIYYGHGQDNSWDESTQLVLSGLDLPLDLPQEL
YNANLTQAEKETVINLVIKRLAKRLPVAYLTNSAWFCGLEFYVDERVIIP
RSPISALIENRFQGIIVKEPKRILDMCTGSGCIAIACAEQFKEAEVDAVD
LSIDALNVAEINIDRYNLSERVFPIQSDLFDNVPADKYDLIVSNPPYVDR
EDLADMPEEFHYEPEMALGSGVDGLTITKQILANAANYLNDDGVLVCEVG
NSMIHLIEQYPDVPFNWVELRNGGVGVFVLTKAQLIAYQTKFTD
>MS1191 hemK, HemK protein
MSKIQEKITMKSGIRFKQLTEDFMVIGEEITITKISPQTYQQWLAFAEDS
LQDMTKQDPYANPKVDANRLLQFVTQKSKGTIIAFSDTLLTENESALLSQ
YLVRRCEGEPIAYILGEQDFWSLNLEVSPDTLIPRPDTEILVEKALEFAK
FRLNSPHFSGELAILDLGTGTGAIALALAAELAPISQKCGAKLRILGVDL
TNGAVELAKRNALRNQLPQVEFLQSNWFEQLENRQFDIIVGNPPYIDRQD
EHLALGDVRFEPLTALVAEDSGYADLRHIIERAPFHLKHQGWLILEHGWQ
QGQKVRSIFNEFSQNYWQQVATMKDYGDNERITLGCWNKE
>MS0910 hemL, HemL protein
MTDSNTLFSRAQQVIPGGVNSPVRAFKGVGGTPVFIQKAKGAYIWDTDDK
QYVDYVGSWGPMILGHNHPAILSAVIKTAENGLSFGAPTPIEIDLAELVC
KLIPSMEMVRMVSSGTEATMSAIRLARGYTNRDKIIKFEGCYHGHADSLL
VKAGSGALTLGQPNSPGVPADFAKHTLTCTYNDLESVKQAFEQYPQDIAC
IIVEPVAGNMNCVPPQNNFLQGLRELCNQYGAVFIIDEVMTGFRVALGGA
QSYYNVEPDLTCLGKVIGGGMPVGAFGGKKEIMQFIAPTGPVYQAGTLSG
NPIAMAAGLACLTELQKAGNQERLAQLTEKLALGLKALADKHHVPFTVNY
VGGMFGLFFTDKAQVTCYQDVMACDTEKFKVFFHKMLDEGVYLAPSAFEA
GFMSLAHTDADIDRTLTAADKAFAALA
>MS0228 hemN, HemN protein
MGKHFMSDIIWDLALIQKYNQSGPRYTSYPTALEFNENYTDEDFKAAAAR
YPGRPLSLYVHIPFCHKLCYFCGCNKVITRHQHKADIYLDYLEQEIKHRA
ELFKTRSVTQIHWGGGTPTYLSEAQSARLMTMLRNHFSIADSAEISIEMD
PREIELSMLDHLRKIGFNRISMGVQDFNKDVQKAVNREQDEEFIKALLVH
ARELGFRSTNLDLIYGLPLQNVESFMFTLQKVIELNPDRLSVFNYAHLPS
RFAGQAKIKDHMLPAPETKLTILQKTIETLGAAGYKFIGMDHFAKPDDEL
AIAQQNGILHRNFQGYTTQEECDLLGLGVSAISLLGDTYAQNQKELKRYY
ADVDATGIALHKGLVLSKEDCLRRDVIKQLICNFKLNYELIEKQYNIDFK
THFTEDLALLAPLAEDGLVEITNTAIQVSARGRLLIRNICMCFDTYSRQL
AKRQQFSRII
>MS1746 hemN, HemN protein
MTTLVLPPLSLYVHIPWCVQKCPYCDFNSHAQKGAIPEIDYIRHLVTDLQ
ADLLRFKDSVQNRPLHSIFIGGGTPSLFSAESITCLLTEIKKLIDFSPHI
EITMEANPGTVEAERFQGYVKAGVTRISMGIQSFNDEKLKALGRIHNARE
AKSAVEIAKISGLKSFNLDLMHGLPNQTPEQALDDLRQAIALNPPHLSWY
QLTIEPNTMFAYRPPVLPDDDELWDIFERGHRLLTEAGYRQYETSAYAKP
DFQCQHNLNYWRFGDYLAVGCGAHGKMSFVNGDIIRYSKTKHPKGYLRGE
YLYEERLINEADRPFEFFMNRFRLLEPVPKTEFVQFTGLPESAVKKQIDW
ALEKNYIIETESTWQITERGKLFLNELLEVFLPDD
>MS0274 hemX, HemX protein
MTDKQTENVETVEIVEDKATDAIKADSSEKNSKNNRTPEPVVVKKGGSAL
GLLAILIALGVGGAGYYFGQQQVQQIQQKLTALQSQQPGATAESPDFEQT
KEHILKLVNENNQTNADKIAVLQREITAKDQALLSLQSQINAVSNSVKAE
QPNDWLLSEADFLLTNALRKLVVDHDVDTSISLIKLADEALEKVATPQAS
AIRSALNNDLKQLLALNNVDQNAIMQRLSQLANSLDELTVLNIDFDDQEN
SAAVTDSVGDWQNNLKKSAVSFLNHFVRVTPTNSSKKELLAPNQDIYLRE
NIRLRLQIAILAVPRQQNELYKQSLEAVSSWIRSYFDTNSDLAKNFLKNI
DELSEQSIYVDVPERLESLNALDQLLNKQVHEVKKVELSVDKGLTENNET
GATPETEQNVPATSEPVPVEPQQ
>MS0273 hemY, HemY protein
MFKVLFLMLALLAGLVAGPYLSGKQGYVLIATGNYNIEMSITTLIILFIA
AMAVVYILEWAISHFFKWSNATYTFFSKRKRNKAQRQTLEGLMRMTEGDY
SKAEKLIGKNAKHSAEPILNFIKAAEAAQQRGDEFSANRYLIEATELAGS
DNLLVELARTKILLQQNKLPAARSSVDSVLEMAPRNPEVLKLATEIYLRS
KAYSTLDNILSTVESSGLYTQKEFIDLQRKTEDGLLDEIMNEEGADGLLA
WWEDQSRKRRNDFYVKLALISRLIDCNDHDSAYEISLDAFKKVDENKDLA
GKFFNLMTKLQATDNSKLVKLLEKRMSSTSAEHQCCLNRALGYLYVRNND
FNKASECFKQVVSGQDKLDPSDATMAAYVFEQVKDTESAKRVREEGLKAA
MAVNRLSTEEVIEKPALALEQKADNKESNKIANWANR
>MS0195 hepA, HepA protein
MSFAVGQRWISESENDLGLGVIVGMDNRTVTILFPASDEQRVYALAAAPL
TRVEFQKGDTVVHHEGWKAQIIDVTENNGVLIYLTIRLDTQEEAVLREMD
LAHKISFSKPQERLFGAQIDRSDRFTLRYHALQQQQAQFQSPLRGLRGIR
AGLIPHQLHIANEVGRRVNPRVLLADEVGLGKTIEAGMILQQQLFAGKVE
RVLIIVPETLQHQWLVEMLRRFNLHFSLFDEERAADFAANEYDEERNPFE
SENLIICSLDWIVAQPKRAQQILQAEFDMLIVDEAHHLVWSERQPSMAYQ
VVEQLSRRIPAILLLTATPEQLGQESHFARLALLDPDRFYNYDAFVAEQK
NYQPVAEAVQTLLNEKPLNTAEQNAIADLLEEQDVEPLFKVINSMAEESE
RLQARQELIDNLVDRHGTSRILFRNTRQGVKGFPHRIYNQVTVEMPKQYV
NAVKVMNLLGEEIGDGLFYPEQIFQKMNPEAKWWEFDPRLEWLITFLKNH
REEKVLVICRHANTAIQLEQALREKEAIRSAVFHENMSIVERDRASAYFA
LQEEGAQVLLSSSIGSEGRNFQFACHLVLFNLPDNPDLLEQCIGRLDRIG
QTRDIRIHTPCFADTPQVVLARWYHEGLNAFEETCPMGMTIFTECGEKLK
NFVKNPTQLDGFEEFVAQTRKRQQVLKQELENGRDRLLELNSNGGERAQK
LAEHIADEDNSTALVNFVLNLFDVIGIEQEDLGEKSIAIIPASTMLVPDF
PGLKEEGVTVTFDRRLSLAREELEFLTWDHPIVTNGIDLITSGDIGKTAV
SLLINKSLPPGTLLLELIYVVESQSPKGLQLTRFLPPTPVRLLLDAKGNN
LAAQVSFQALEKQLRPVKRNMANKMAKMIRPNIERLIAGGDKHIAEQARE
IIQSAKQKADQTLSAELDRLNALKAVNKNIRQDEIDILAQIREQSLTQLD
QANWRLDSLRVIVSNKE
>MS1978 hfi, Hfi protein
MKTEENKMPKFAANLTMMFNEVPFLDRFEAAAKAGFKYIEFLWPYDYPAE
QIKAKLDQYGLKQILFNSLPGDIAAGEWGVSAIPGREAESHQHIDLALEY
ALVLQCPTVLIMGGVVPPGQSRAKYKQTFIDNLRYASEKFKPHNINIVLE
ALSPQVKENYLMKTQDDALEIIELVNRDNVFLQLDYYHAQNVEGNLARLT
DRVAPVMKHIQIASVPDRHEPDEGEINYQYIFDKLDAIGYDGYVGCEYKP
RTETTAGLAWFEKYK
>MS0964 hflB, HflB protein
MNDMVKNIILWVVVAVVMMTAYQGFSSSANGSAVDYTTFVTDVGNNQIAQ
ARFEDTEILVTKTDGSKYSTVMPIYDDKILNDLLNKKVKVEGTMPEKRGL
LSQILISWFPMLFLIGVWLFFMRQMQGGGGKAMSFGKSRAKMLTKEQIKT
TFADVAGCDEAKEEVGEIVDFLRDPGKFQKLGGKIPKGILMVGPPGTGKT
LIAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKNAPCLIF
IDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFEGNEGVIVIAATNR
PDVLDPALTRPGRFDRQVVVGLPDVRGREQILKVHMRKVPIGADVDAMTL
ARGTPGYSGADLANLVNEAALFAARTNKRVVTMLEFEKAKDKINMGPERR
TMIMTDKQKESTAYHEAGHAIVGYLVPEHDPVHKVTIIPRGRALGVTFFL
PEGDQISVSQKQLESKLSTLYAGRLAEDLIYGEENISTGASNDIKVATNI
ARNMVTQWGFSDKLGPILYTEDDGEVFLGRSMAKAKHMSDETAHIIDEEV
REIVARNYDRARQLLIDNMDILHAMKDALVKYETIEEIQIKQLMNREAVT
PPAGWEDHSSADNSSSNTAETEPSENETEENSVN
>MS0834 hflC, HflC protein
MDIMEGFPITVIVFIVLILFVVSSALKTVPQGYNWTIERFGRYIKTLSPG
LNFIVPFIDRVGRKINMMEQVLDIPSQEVISKDNANVSIDAVCFVQVIDA
RSAAYEVNHLEQAIVNLVMTNIRTVLGSMELDEMLSQRDNINGRLLSIVD
EATNPWGVKVTRIEIRDVRPPRELSEAMNAQMKAERNKRAEILEAEGVRQ
AQILRAEGEKQSRILRAEGEKQEAILQAEARERAAQAEAKATQMVSDAIV
NGDTKAINYFIAQKYTEALKDIGGSNNSKVVLMPLEAGNLIGSVAGIAEL
LKDVKK
>MS1619 hflC, HflC protein
MSLNDQDPWAKPGQNDPKQPENPSNKPDNKSGWSDRQDNKEQSPPDIEEI
FGNLLKKLGGNGGQSNNGNNTNLPKNLNKLAPAAIALAVVLWGLSGLYTV
KEAERGVVTRFGQLHSIVQPGLNWKPNFIDEVIPVNVEQVKELRTQGAML
TQDENMVKVEMTVQYRVQDPAKYLFSVTNADDSLNQATDSALRYVIGHMT
MDDILTTGRAVVREQTWKTLNNVIKPYDMGVEVIDVNFQSARPPEEVKDA
FDDAIKAQEDEQRYIREAEAYAREQEPIARGDAQRIVEGATAYKDKVVLN
AKGEVERLQRLLPEFKASPDLLRERLYIQSMEQIMSKTPKIMLDGNGNNL
NVLPVDQILRNKNTQPAAEPSANQGTLSSNELKSAVKNGQNSHGQTQTDD
RSISRQGRFN
>MS1620 hflC, HflC protein
MRKFLLPVLVILAAILYSSIVIVNEGTRGIMLRFGKVQRDSDNKVVVYTP
GLHFKIPFIDNLKPLDARIRTLDGQADRFVTVEKKDLLVDSYVKWKISDF
GRFYTSTGGGDYNQASNLLRRKVNDRLRSEIGTRTIKDIVSGTRGELMDG
ARKALNTGQDSTAELGIEVVDVRVKQINLPDEVSSSIYQRMRAERDAVAR
QHRSQGKEKAAFIQADVDRKVTLILANANKTAEELRGEGDATAAKLYTEA
FSGEPQFYSFVRSLKAYENSFAGSDNMMILKPDSDFFRFMQPPKK
>MS1519 hflX, HflX protein
MNNDVNISKSAVNFTALSSISAPRSDQSDNAIVVHVFFSQDKNPEDLDEF
QQLAQSANVNILQVITAARSTPQAKYFVGQGKAEEIAQAVETHNADVVLV
NHSLTPAQARNLESLCQCRVVDRTGLILDIFAQRARSHEGKLQVELAQLK
HLATRLVRRKTGLDQQKGAVGLRGPGETQLETDRRLIKVRIAQLQNRLAK
VEKQRNQNRQTRQKADIPTISLVGYTNAGKSTLFNRITQANVYAADQLFA
TLDPTLRRLQIQDVGTTILADTVGFIRDLPHDLVSAFKSTLQETTEAGLL
LHIIDAADPRKLENIEAVNAVLEEIKAADLPTLLVYNKIDTLENLEPHIE
YDDQHIPVAVYLSAISAEGIDLLFAAIREKLKNEILHLQLNLSPNEGKIR
HQLYLLDCIRREEISDQGEFLLEIQIDKIQWLKLAKKFPQLEKCGKNL
>MS1518 hfq, Hfq protein
MAKGQSLQDPYLNALRRERIPVSIYLVNGIKLQGQIESFDQFVILLKNTV
NQMVYKHAISTVVPARSVSHHNNPQQQQQHSQQTESAAPAAEPQAE
>MS1475 himA, HimA protein
MTKSELIESLVEKNHSISVKSVENAVKEILEHMSQALESGDRIEIRGFGS
FSLHFRQPRVGRNPKTGAQVKLDAKCVPHFKAGKELRERVDFNA
>MS1089 himA, HimA protein
MTLTKVELADNLIEKHGLNKSEAKALVEDFFEEIRVALEKGNDVKLSGFG
NFELREKASRPGRNPKTGESVPVSARRVVVFKPGQKLRARVEKTKPKS
>MS0185 himA, HimA protein
MNKTDLIDAIASAAELNKKQAKAALEATLDAITASLKAGDSVQLIGFGTF
KVSERKARTGRNPQTGAEIQIAASKVPAFVSGKALKDAVNG
>MS0112 hipB, HipB protein
MNLSSLFSVRLKNERNRLGLTQAEIAKKCGVSREMWGKYERGVALAGSEV
LFSLAAIGVDMDYILLGTRKEVFEEITTEALKDMPKADFSDKTGLLVQLF
MQCDDNGRAAILSVAQTMAGMANKTGHQNSDSTGGQSFAGDVHGGQFSTG
TINNYGEKK
>MS1883 hisA, HisA protein
MKKSIIIPALDLIDGNVVRLHQGDYAKQTTYSDNPIEQFASYLAQGAEQL
HLVDLTGAKDPAKRQTALIGKIIAATHCKIQVGGGIRTEKDVADLLAVGA
NRVVIGSTAVKERAMVKEWFNKYGAEKFVLALDVNIDASGQKIIAISGWQ
EASGVSLEELIEDFQSVGLQHVLCTDISRDGTLAGSNVDLYKEICAKYPA
VNFQSSGGIGSLEDIKALKGTGVAGVIVGRALLEGKFNVAEAIECWQNG
>MS1890 hisB, HisB protein
MTQQPTLFIDRDGTLIDEPKTDFQIDSLEKLKFERNVIPALLKLKNRYRF
VMVSNQDGLGTDSFPQEDFDKPHNAMLAVFRSQGIEFDDILICPHKPEDN
CDCRKPKIKLLKKYIDKKLFDPADSFVIGDRPTDVQLAENLGIRALQYHP
ENLDWDMIAEKLLREPVADPKGLGQPRHAVVARKTKETDIKVEVWLDEAG
VNQINTGIGFFDHMLDQIATHGGFRMNVSCKGDLHIDDHHTIEDVALALG
AALKEAIGNKRGIQRFGFVLPMDECKAECALDLSGRPYFKFKAKFNRDKV
GDFSTEMTEHFFQSIAYTLLATLHLSVKGDNAHHQIEALFKAFGRTLRQA
IKIEGNEMPSSKGVL
>MS0435 hisB, HisB protein
MNKERMVKKAIFLDRDGTINIDHGYVHKIDDFHFIEGSIEALEELKNMGY
LLVLVTNQSGIARGYFSEDEFLQLTEWMDWSLADRNVDLDGIYYCPHHPE
GLGEYRQDCDCRKPKPGMLLQAIEELNIDPAQSFMVGDKVEDLKAAVSAN
VKYKVLVKTGKTVTQAGEQLADYVLDSIADLPRIIKRLKK
>MS1891 hisC, HisC protein
MSISQLSRKNVQALTPYQSARRLGGNGDVWLNANEYPTSPDFNLSERIFN
RYPEPQPEAVIKGYAAYADVKPENVIVTRGGDESIELLIKGFCEPEDKVL
YCPPTYGMYAVSAETLGIATKTVPLTEDFQLDLPEIEKNLAGVKVIFVCS
PNNPTGNVLNQADLIRLLDITAGSAIVVVDEAYIEFSPETSMIKQLGNYP
HLAIIRTLSKAFALAGLRCGFTLANPELIGVLQKVIAPYPLPVPVSDIAA
QALQPQGVAQMKMRVADVLANRAWLIGELKQIPSVVKIFATEANYVLVKF
QDGEKVFNALWEKGIILRDQHKAFGLKNCIRISIGTRAELEKTVVALKLA
>MS1574 hisC, HisC protein
MVVRPNFKTTIPNNHKSAVENMTFLQQANTGVQALSPYQAGKPIEELERE
LGISNIIKLASNENPFGFPESAKKAIQNQLDNLTRYPDSNGFSLKAAIAE
KFNLQPEQITLGNGSNDLIELIAHTFATEGDEIIFSQYAFIVYPLITKAI
NAKAREIPAKNWGHDLEAFLAAINEKTKLIFIANPNNPTGNFLTEAEIDS
FLAKVPPHIVVALDEAYTEFTAKEERVNSLALLKKYPNLVVSRSLSKAYG
LAGLRIGFAVSNPEIAGLFNRVRQPFNVNSLALAAAEAVLNDDDFVEKAA
ENNRRELKRYEEFCQKYGLQYIPSKGNFITIDFQQPAAPVYDALLHEGVI
VRPIAGYGMPNHLRISIGLPEENQRLFDALIKILNLK
>MS1892 hisD, HisD protein
MQTLIWKDLTEQEKKQALTRPAISAAGNIKDAVDAIRENVVANGDKALFE
LSEKFDRVKLNSLEVSEQQIEEAAQRLPEELKQAIQNAKKNIEAFHLAQV
PVEADVETQSGVRCQVLTRPINRVGLYIPGGSAPLFSTVLMLAIPAKIAG
CKKIVLCSPPPIADAILYAANLCGVETIYQVGGAQAVVAMAFGTETVAKV
DKIFGPGNAFVTEAKRQVSQAVNGAAIDMQAGPSEVLVLADENADPDFVA
SDLLSQAEHGADSQVILVTPSERLALETELAVERQLTTLPRSEIAQKALA
HSRIFIAENLQQCVEISNEYAPEHLVVQVQNARDLLSNIDNAGSIFLGAY
SPESMGDYASGTNHVLPTYGYTRTSSSLGLADFSKRMTVQELSPQGFKDL
AKTVEVMAAAERLDAHKQAVSIRLAKIK
>MS1882 hisF, HisF protein
MLAKRIIPCLDVRNGQVVKGVQFRNHEIIGDIVPLAARYAEEGADELVFY
DITASSDGRTVDKSWVERVAEVIDIPFCVAGGIKTIADAEQIFTFGADKI
SINSPALADPDLISRLADRFGVQAIVVGIDSWFEQETGKYWVNQYTGDES
RTRQTNWQLLDWVKEVQKRGAGEIVLNMMNQDGVRNGYDLTQLKLVRDVC
KVPLIASGGAGEMVHFRDAFIEANVDGALAASVFHKQIINIGELKEYLAR
EGVEVRR
>MS1893 hisG, HisG protein
MSTNKRLRIAMQKKGRLSDESQELLKQCGVKINLQGQKLIAYAENLPIDI
LRVRDDDIPGLVFDGVVDLGIIGENVLEEEELTRTAAGDKVEYKMLRRLE
FGGCRLSLAVDSDVEFDGPESLSDCRIATSYPQLLKRYMAEQGVPFKSIL
LNGSVEVAPRAGLADAICDLVSSGATLEANGLKEVEVIYRSKACLIQRKE
PLSEEKQALVDKILTRIQGVQQADESKYIMLHAPKDKLEEITALLPGVEN
PTILPLAHDDTKVAVHVVSQENLFWETMEQLKEKGASSVLVLPIEKMLA
>MS1885 hisH, HisH protein
MIIIDTGCANLSSVKFAFDRLNIKAEISRDIATIKSADKLLLPGVGTAMA
AMKILQDRNLIETIQNATQPMLGICLGMQLMTEYSSEGNVPTLSLMSGHT
DLIPNTGLPLPHMGWNKVRYEQDHPLFAGIEQDSHFYFVHSYAVLPNEHT
IATSDYGVPFSAALGCKNFYGVQFHPERSGKNGAQLLKNFVENL
>MS1881 hisI, HisI protein
MQNKINWQKVDNLLPVIIQHFQTCEVLMLGYMNQEALAKTCDEKVVTFFS
RTKQRLWTKGETSGNFLNAVDMSLDCDNDTLLILADPIGPTCHTGEESCF
HQFATQSEGDWTWFAKLERVLAERKFADPESSYTATLYAKGTKKIAQKVG
EEGVETALAALSKDKGEIVSETADLIYHLTVMLHEQNLEWGDVIDKLKER
HQGIGLHPEGSNK
>MS1920 hisS, HisS protein
MAKTIQAIRGMNDCLPTETPVWQWVESKVRSTLASYGYSEIRMPIVENTP
LFARAIGEVTDVVSKEMFTFNDRDNESLTLRPEGTAGCVRAGIEHGLLYN
QEQRLWYMGPMFRYERPQKGRYRQFHQAGVEVFGIANAEIDAELIILTAR
LWKELGIEQHVTLQLNSIGSLEARANYRSALVEFLQQYVGLMNEEEKERL
LKNPLRILDTKNEALQQALNNAPKLLDYLDDDSREHFARLCAILDNVGIS
YEINPKLVRGLDYYNKTVFEWVTTALGAQGTICGGGRYDGLVEQLGGHAT
CGVGFAMGLERLILLVQEVNKAIPLPQSAVDIYLIFAGENTASAAFRLAE
KVRSELPHLRTMMHCSGGNFKKQFKRADKSGAKIALVLGESEVQNQQVVV
KDLLGGAEQQTVALEAVIDHLKTSFKE
>MS0399 hit, Hit protein
MWIYSFGLRDKLLFKLISDKKCGRFFPKIRKHKMAEETIFSKIIRKEIPA
DIIYQDDLVTAFRDIAPQAKTHILIIPNKLIPTVNDVTAEDEAVLGRLFI
TAAKIAKLEGIAEDGYRLIVNCNKHGGQEVFHIHMHLLGGEKLGPLNAK
>MS1923 hlpA, HlpA protein
MKKVVKATALSLALALTSSMAMAAENIAFINAGYLFQHHPDREAVAKKLD
SEFKTQADKLAANKKSIDAKIASLQKDAQNPKNRPSELKKREDEINKLMK
DHDEEVRKFQVENDKRQNEERAKLLEGIQVATNNIAKDKGYTYVLDANSV
VFAADGKDITEDVLKAIGGKTETKPAEATK
>MS1168 hlyC, HlyC protein
MKIETFDVIAPSIFSDEPLDESQLFGAFATIWLRSEYHSKAPLYRFAERI
LPVLRNKQFALFIKDNEPVAYFSWAYFTQEAEEAYLHNDDVLLEAGNWCA
GNNLWIIDWFAPANLTKEIKPLIERHLFPNEILTALYHKSAVNHVQKRYF
KGCAVSRERFRDFIRNHS
>MS0292 hmp, Hmp protein
MSYRAYVIRPNRKGIKQMAKTNKNPLCINELQVYSIVQEAPKVKTINFIA
QDFYPYEAGQYALVSIRNTPHITRAYSLSSTPGESRFVSITVREIDGGVG
SGWLNNEVKVGDQVWFSNPMGDFSCQKVIADNYLLVGAGSGVTPIMSMTR
WLLKNRPQANVSVIHSVHSPQDVIFKSEWAQLKADNPRLNLVFNASVNAT
AGFESGRISKEILTKAVPNLTDYTVMTCGPQAYMDALKTMVLELGGSEQR
FFTEAFFNTALAGDISSDKKTTLTVSGAKPMKTEVPVGMTLLAALEAQEQ
PVVSGCRTGLCGLCKTKVTGGEYEIVSNGDLTQEEIAQGYVLACSCRVKA
DMTVEI
>MS0680 hmp, Hmp protein
MKNIKFLLWGVLIGISALWFLADDLIPEPFTYFSFRFVVNQYTGILSISL
MSIAMLLATRPRWLENYLNGLDKGYRLHKWLGISALITALTHFWFTHGTK
WMVGWGWLERPLRQRQRLGQNAGAGLEQWLGGMRGIAESIGEWAFYLALI
LMIVSLVKKIPYRWFVKFHKWLAAAYLALVFHSVVLIKFEYWHQPIGWVT
AVLLTVGAVSALLILFNLAGKKIRYQGTIRSARPLQKIDGLDLTINVPTW
QGHKAGQFAFVHALNDTEKPHPFSFASAWDPASRDIRFCIKALGDYTDTL
AQRWKANDKLLIEGPYGRFTFADDAQQQIWIATGIGITPFMARLEELAQS
THKQTVDLFYSYRESDPVLIAELQQKSAEAGINLHLRCSAEQSRLTSADI
INTVKDLTKTSFWYCGISAFGDTLCKDLCRQGLPASRFHQELFEMR
>MS1322 hns, Hns protein
MNEVIKTLNNLRRLRSMAKELSIEQLENIIEKFQLVIEEKKAEELEIKRL
EEERKNRLEKYRELLKEDGITADELAQILAGKNNTAKAKRAPLSAKYKYI
NENGEQKTWTGQGRMPKAIQLQLNAGKSLSDFAI
>MS0361 hofF, HofF protein
MLNQLATLLDAQIPLKESLHILIQNCSSIPLNQWLRNLLSQLERGFAFSK
SIELQGLYLSAQELQLIKVGEMSGKLSYVCSQIAHFRQQQLALQRKIQKI
LLYPLVVLVISATLTLLLLIFIVPQFAEMYQDNNQDLPFITKFLLVLSHS
LTHYIWYIIGVATLTFIFIKKQWRHSIWLYKCAQQLMALMPLISTIKQQA
RLINFCRSLQLMLNAGIPLQQGLQAFLPQIKTWQNTGALPGDLILVEEVQ
AILHWIKQGYGFSNSVGSRLFPQQAQQILQVGESSGQLSNILQKIADDYQ
QQLDHKIDLLSQLLEPFLMLLIGIIIGVIMLGMYLPIFNMGNIM
>MS0364 hofG, HofG protein
MKNYRTLSQYPAIRKGFTLIELMIVIAIIAILATIAIPSYQNYTKRAAIS
ELLQAGAPYKADVELCIYEKGGEKDCSSGANGIAEPAKTKGKYVDAVTVS
SGTISVTGKGSLSGVSYSTKATGNASEGISWTTTCTPKDIFPAGFCDNKE
>MS0724 hofG, HofG protein
MRKGFSLIEFLTVLLLISISGSLTLSGWQSLGESQMLQQEQQRLLLFIKN
IQARVENSNQVWHLVANRSFDQKNWCFTAQIKHDLFICDCFYPVLCPKEL
LPHFYYPLFPDTVKFVGKKYYPAITAKFGGVRRTTENNCFSLISSNKQSV
LSFSKMGNVSIKKPGSSSSCFNTAEE
>MS0332 holA, HolA protein
MNRIFAEQLSPSLAGRLAKVYLLVGQDPLLLSESQDNIIQAATKSGFDEK
LEIQIDNGTNWNDLFERCQSMGLFFSKQVITLHFPENPTALLSKNLAELI
SLLNSDLLLILHFGKLTKLMEKQDWFIQSEQYDRNAVLVNCQTPTAEQLP
RWVANRCKAMGLIAEQDAVQLLCYSYENNLLALKQTLQLLDLLHADHKLT
FVRVKNIVEQSSVFTPFQWIDALLEGKEARARRILTGLQAEDIQPIILLR
SLQRELTILLQLAKPQHKTASVDSALPVAQLREGFDRLKIWQNRRPLFTQ
AFQRLTYRKLYLAVQQLAELERLAKQEFSADIWDQLANIIPKICR
>MS0570 holB, HolB protein
MGKFNFVMTAIIYPWLQSYYERITAAFQQGYGHHALLFRAEQGIGADQLI
HAVANWLMCQHSSPRPCGECHSCRLFAAGNHPDVYQLAPVENKDIGVDQV
REINEKVSQRAQQNGNKAVYVQSAERLTESAANALLKTLEEPRPNTYFLL
NADLSSPLMTTIYSRCQVWLINTPSEQQALNWLQLHNYSEISEIQTALRI
SYGRPLLALHCLEQGWLEKRREFFRAFWLFYTRRSPLELLPLFDKELILQ
QVDWLLAFLSDALKDKLNITSGWICRDLIRGIQQFNERQTVAGLLTATKI
MQKVRSDLVQINAVNQELILLDGLTRLITEVFEH
>MS1557 holC, HolC protein
MPKQAQFYLIEKTQADNALSATEALACNLAADAWRLGKKVLIACETEEQA
LNLDEALWQRDAEQFVPHNLSGEITNYATPIEISWQGKRNAQRRDLLISL
QNNVPDYAQSFNHVIDFVPAEEERKAVARERYKLYRQLGFEMVMEKA
>MS0683 hpt, Hpt protein
MKKHHVDILISEQEVKARIQQLGAEITAYYRQQQVEKLIVVGLLRGSFMF
MADLVREIKLPVEIEFMTTASYGSGMTTNHDVKITKDLDGDIKNQHVLIV
EDIIDTGYTLEKVREILNLRTPASLKICTLLDKPSRREVEVPVDWIGFRI
PDEFVVGYGIDYAQHHRNLGYIGKVVLEE
>MS1437 hrpA, HrpA protein
MMKMKTPKREFNALQKSLAEQIEDVMIVEQSRLLARIRGLGQIKKEQSQQ
AAALDIEQQIQQAKLRLELRKSAVKNPIVFPENLPVSQRKTEIQKLIAQN
QVVIVAGETGSGKTTQLPKMCLELGFGQKGLIGHTQPRRIAARSVAARIA
EEMQTELGGIVGYKVRFNDQIGEDTQIKLMTDGILLAEIQTDRFLNRYDC
LIIDEAHERSLNNDFILGYLKQLLPRRPDLKVIITSATIDVERFSKHFNN
APIIEVSGRTYPVEVRYRPVAETEEQDQLQGILNAVDELQAEGRGDILIF
LSGEREIRDTAEALEKQNLRHTEILPLYARLSAQEQNKIFHPGGLNRIVL
ATNVAETSLTVPGIKYVIDPGTARISRYSYRTKVQRLPIEPISQASANQR
KGRCGRVSEGVCIRLYSEQDFNNRPEFTDPEILRTNLASVILQMTALGLD
DIEAFPFVDAPDKRHIQDGIKLLEELGAFEWQKSPPSAFGTSPRKRGEGN
LASNSSLPPFTGGAGHSPEGGKRVLTQTGRQLAQLPVDPRLAKMLLSAVD
LGCVLEVMIIVAALSIQDPRERPQEKQQSADDKHRRFADKKSDFLAFLNL
WNYIQEQQKVLSKNQFRRLCQKDYLNYLRVREWQDIYHQIRLTVREMGLP
INSEPAQYPQIHSALLSGLLSHIGMKEAEKQQYLGARNAHFAIFPNSVLF
KKQPKWVMAAELVETSKLWGRMVAEIDPEWVEPLAKHLIKSSYSEPRWSK
SRGQVIANEKVSLYGVPIVASRPVNYGAIDPQTSREIFIQSALVEGDWHT
RHKFFFENQKLIREVEDLEHKSRRRDILVDDRTLFEFYDSRIGADVVSQK
HFDSWWKKAAQQDPELLNFEKSFLMKEDAQKVSQLDFPNFWHQGNLKLKL
TYQFEPGTDADGVTVHIPLPLLNQIEMQGFDWQIPGLRHELIVSLIKALP
KSLRRNFVPAPNYAEAFLARVANFDKPLTETLSYELRRMTGVNVEVEEWK
LEQIPPHLRMTFRVIDEKGKKIAESMNLDELKFGLKDQVQQSISAVADDG
IEQSGIHIWNFDSLPQCYEQKKQGFTVKAFPAITDEKEAVGIKLFETEYE
QSVAMQQGLRRLILLNVPSPIKYLHEKLPNKSKLGLYFTPFGKVLDLIDD
CIACAVDKLIADFGGFVWNERDFERLRDFVRENLNEITVDIAQQVERLLT
LTFEINKRLKGKMDFTMAFALSDIKSQLAGLIYPGFVEKTGYARLPDIQR
YLQAIDKRMDKLAQDINRDRAAMLRVEQCQQAYQQLLAKLPKSKPLSTEV
LEIRYMIEELRVSLFAQQLGTKYPVSEKRVLGVITEI
>MS2165 hsdM, HsdM protein
MTTDNLHTKQSTISSVIWSMANMLRGTYRPPQYRRVMLPLIVLARFDAIL
APYTDAMKAKADELQAMGGKAPEGALYEMALTKAADPNRKQPLYNTSGYN
LQRLLADQDHIAANLVKYLQGFSAKAKDIFDKFEFENEIEKLDSSNRLYA
VVSQFQKDLKENGIDLSPQSISNLQMGYIFEELVRKFNEQANEEAGDHFT
PREVINLMVNLIFEEDQQRLSQPHAIASIYDPTAGTGGMLSESEKHLKSY
NDSIKLQLFGQEYNAESYAICCADLLIKDEPISNLVFGDTLGVKNSKNTG
TGFVPHDGHQTKKFDYMFSNPPFGVEWKNEQDFINDEAKSGFAGRFGAGL
PRINDGSLLFLQHMISKMKPVEEGGSRIAVVFNGSPLFTGDAGSGESNIR
RWIIENDWLEAIIALPDQLFYNTGIYTYVWIVSNKKSDRRKGKVQLIDGT
QHYQKMAKSLGDKRNELSPAQIADLTRLYADFKDGASGRISTKFCSKIFN
NQDFGYLKLTVERPLRLNFQAGQERIEKVKTQTAFINLAVSKKRKDEAQI
KAEEAEGQRQQQAILAALSTIGDGLYQNRTAFLKLLDKALKGLDFKLGAP
LKKAIIEALSERDQSADICLDSKGNPEADSQLRDTELVPLPKEITLPLPV
DYGEGKTDELVKQVKAHCEAYLQAEVLPHVDHAWIDYSKTKVGYEIPINR
HFYQYQPPRALDEIKAEISELEAEIMAMLGNV
>MS2171 hsdR, HsdR protein
MITEKDFENEIERFLLAEGGYVQGKNSEYNKETALFEEDVLSFIQTTQPK
RWERLAQGQKANVKAVLIKALCQELEAKGALDVLRHGFRCYGKTFQTAYF
APNTSINEETQQRYDANILKITRQVVTEDGDRPDIVLSLNGIPVATAELK
NVLSATHWTVEDAIYQYRKERNPKGKLFTFKKRTLVHFAVDTEEVYMTTK
LDGEQTYFLPFNRGYNKGRGNPPIAGNVKTAYLWEQILTRHSFLEIIARF
LHLSVEEKKVRTDSGLRLLQKETMIFPRFHQLDAVRQLIAHSREHGAGRN
YLIQHSAGSGKSNTIAWLAHQLSSLHNRDDQKIFNSVIVVTDRVVLDRQL
QATISQFEHKDGVVQKIEHNSQQLAVAIASDTPIIITTIQKFPFVMAALA
RKQESGINVAISTEGKQFAVIVDEAHSSQSGEAAMELRKVLNKDGIEAAV
MAEFLDDDDDETGLSDEAKKQLFIEAAKRQRQPNLSFFAFTATPKWKTKA
LFDEPGADGNTPFHHYTMKQAIEEGFILDVLENYATWKQYFKLLKISEND
KELSKSKAKKEMMRFVNLHPSVIAQKVEIIVEHFRTTTMHKIGGRAKAMV
VTNGREHAVRYKLAFDEYIKEKGYTGIKSLVAFSGGITLKEAPEKEYTEA
LMNGIREVDLPEQFASEHYQVLLVAEKYQTGFDQPLLHTMFVDKKLSGIQ
AVQTLSRLNRCAKGKTDTFVLDFVNQPEDIYKAFKPFYEVTELGDIPSNE
KLDELAATLDQWKIYFQPEIRQFAEIWFSAKQQPTGSEHKQLNSILDKAV
ARFLAVGDEIQQGVDDLSELKNEQQNLFKSQLKSYLSLYQFVSQIMDYSD
DLHEQRYVYLRALQSKLPNNSDRNKVDLSKDVVLHFYKLQKRSEGKIHLD
EGGADPLKGATDVGSGRADATDELSNMVQEINGMYGTQFTIADQLFFEQI
IEDALADNEIVGAAKNNSLESFTAYFADKLLDLLFQRMQGNEEISNQVMS
DESLRNRVVKRLAKQIYQRK
>MS2170 hsdS, HsdS protein
MQKYDKYKPSGVEWLGDVPEGWEVTKIKYIAELTPKKSELTELDKECSFV
PMEKLKLGNLVLDETRTISDVYNGYTYFEDNDLLIAKVTPCFENKNFVIA
EKLVNGIGFGSSEIYVLRVKNCLNRYLFYRLQENTFMDLAIGSMTGAGGL
KRIPSEFLNNYSIALPPLEEQTAIAHYLDQKTAYIDRLIDRQQTLLEKLS
EKRTALITEAVCGRLPIAPYSASLKRGTGFDEENGSPNTAQTAPLFSKEG
LGEICLKDSGIQWLGKVPEGWEVIRLRFLCNIQTGNMDTQDNEPDGIYPF
YVRSPIIERSNNYTFEDDEAVLMAGDGVGAGKVFHYVQGKYGCHQRVYSL
NQFQNITGRFLFYYLREFFSRKIEEGGAKSTVDSVRLPMLKDFPTCVPPL
SEQTTITHYLDQETAKIDRLRTQIETVIERLKEYRMALITQVVTGKVKV
>MS0272 hslU, HslU protein
MSMTPREIVSELDAHIIGQNEAKRAVAIALRNRWRRMQLPEDLRQEITPK
NILMIGPTGVGKTEIARRLAKLANAPFIKVEATKFTEVGYVGKEVDSIIR
DLADISMKLVRQQAVEKNRMKAQDAAEDRILDVLLPPAKDQWGNVQETGN
ASTRQVFRKKLREGQLDEREIEIDISTPVNVEIMTPPGMEDMTSQLQSLF
EGMSPNKTKKRKMKIKDALKVMLDEEAAKLVNPEELKQKAIEAVEQHGIV
FIDEIDKICKKGEHSGGDVSREGVQRDLLPIIEGSTVNTKHGMVKTDHIL
FICSGAFQVARPSDLLPELQGRLPIRVELKSLTKEDFERILTEPNASLTL
QYRELMKTEGVDIEFTQDGISKIAESAFRVNEKTENIGARRLHTVLERLM
DGISFDASERAGEKVVIDEKYVSDALNDVVENEDLSRFIL
>MS0271 hslV, HslV protein
MTTIVCVRKNGKVAIGGDGQATLGNCVEKGTVRKVRRLYKDKVVTGFAGS
TADAFILRDLFEKKLELHQGHLVKAAVELAKEWRTERSLRRLEAMMIVAN
ESEFLLVSGSGDVIEPEFDVLAIGSGGNFAKSAALALLRTNNELSAAEIV
KQALIIAGDIDIYTNHNHVIEEV
>MS1696 htpG, HtpG protein
MSNKETCGFQTEVKQLLQLMIHSLYSNKEIFLRELISNASDAADKLRFKA
LSAPELYEGDGDLKVRISFDDKKGTLTVSDNGIGMTREQAVDHLGTIAKS
GTKEFLSALGNDQAKDSQLIGQFGVGFYSAFIVADKVEVRSRAAGVAADK
GVLWASAGEGEYSVENIEKKDRGTEITLFLREDEKEFLNEWRLREIIGKY
SDHIGLPVEILTKEYDEEGKESGVKWEKINKAQALWTRSKAEISDDEYKE
FYKHISHDFADPLSWMHNKVEGNQEYTSLLYVPGKAPWDLFNREQKHGLK
LYVQRVFIMDDAEVFMPNYLRFMRGLLDSNDLPLNVSREILQDNKTTAAL
RKALTKRSLQMLEKLAKDEPEKYAVFWKEFGLVLKEGVAEDFANKEQIAK
LYRFASTHTDSSEQNVSFEDYISRMKEGQKAVYYITADSYVAAKNSPHLE
LFNKKGIEVLLLSDRIDEWMLSYLTEFDGKPLQSVTKADLDLGDLADKEE
ENQKEQDEKFDSFIQRVKSLLGERVKDVRITHRLTDTPAVVSTDNDQMTT
QMAKLFAMSGQPVPEVKYTFEINPQHELVKKAAKVTDETEFGDWIELLLN
QAMLAERGSLENPVAFIKLVNALLAK
>MS0437 htpX, HtpX protein
MFKKSKKIFAVSFIVATLAACADTAQINQEAASSYTQTINQARAKGVVDT
SSATSKRIQSVFNQMVPYADKENTTGVKFNWQLTVVKSNELNAWAMPGGK
MMFYTGLVDKLNLTNDEIAVVMGHEMAHALQEHGKQSRNVGIMTGILGAA
ADIAAAATLGVDTGGLGGTVADLGVNKPFSRSNETEADEIGLFLMAKAGF
NPQAAPQLWVKMQKAGGSNGPSLLSTHPSDASRQENLQRLMPEALKIYKA
RNSK
>MS1134 htpX, HtpX protein
MMRILLFLATNAAVLIVFNIILSLTGIRGQDAMGLLIMAALFGFTGSIIS
LLMSKRSALAATGAEVIEQPRNDTERWLLQTVHSQAEKAGLPKPDVAIYH
SNDVNAFATGASKNNSLVAVSTALLNNMTRDEAEGVLAHEISHIKNGDMV
TMTLLQGVLNTFVIFAARMIARMVANNRSSEESNSGIYFLVAMVLEVVFG
FLASMIAMWFSRFREFRADAGSAELAGKQKMIAALKRLQAIHEPQEMDGK
LAAFAINGKRGGFTSLFLSHPPLEKRIEALETSK
>MS0869 htrB, HtrB protein
MIRDGLVKRINMSEKNKRLTARVGYEPHFSWSYLLPKYWGIWLGIFVLLI
FAFIPFRLRDNLAAKLGLVIAKYAKKPRHKARVNLQYCFPQWSAEQREKV
IDDMFITVTQVMLGIGEIAVRSKAHLQRRSVFFGIEHIQKAKEQGYNIIL
MVPHGWAIDASGIILHTHGMPMTSMYNPHRNPLVDWLWTITRERFGGKMH
ARQNGIKPFLNMVRKGDMGYYLPDEDYGAQASEFVDFFATYKATLPGLNK
MAKLAKAVVIPMFPRYNAKAGRYEMEIHPAMELSEEPKQSARSMNAEIES
FVSPAPEQYVWILRLLKTRKDGKDIYQ
>MS1263 htrB, HtrB protein
MAKNNTPIFQKSFLAPKYWPFWLAVGIFRLILLLPYPLLCKIGLGLGKLF
SKLSVGKRRSQIVRRNLQLCFPNWNEEKIESTLQANLESVGMAIIETGMA
WFWSDKRIAKWSKIEGIEYLKNNAKDGIILVGVHFLTLELGARIIGLQHP
GIGVYRPNDNPIMDWLQYRGRIRSNKDLLDRKDLRSMIKALRTGNTIWYA
PDHDYGRKNAVFVPLFSVPDAATTTGSYYLLKSSPLSKVVPFAPLRNTDG
SGYTVTVEPPVDFTDILHDKEAIAKRMNKVVEREIMLGVEQYMWLHRRFK
TRPNESDKSLYD
>MS2365 hyaA, HyaA protein
MEQIMQRTDGLLSALTHSVTDVSRRDFMKLCTALAATMGLTSKASAEEIT
NALTNPQRPPVIWIGAQECTGCTESLLRATHPSIENLVLDMISLEYHEVL
SAAFGDQAEENKHRALEKYKGKYVLVVDGSIPVKDGGVYCMVAGNPIIEH
IKEAAKGAAAIIAIGSCSAWGGVPSSGGNPTGAKSLSEVLPGIPVINIPG
CPPNPHNFLATVAYILTYKKLPATDKLNRPLFAYDRLIHENCYRRPHFDA
GRFAKEYGDYGHRHGWCLYHLGCKGPETYGNCSTLDFCDVGGNNWPVGIG
HPCYGCNEKGVGFTKGIFQLANVENPTPRVEKPDVANQEGETASMTAIAL
LGAATAVLAGVAVETLKELSVQRKNQLEKEKTKQENSNNTGK
>MS2361 hyaB, HyaB protein
MTDKKRITIDPITRIEGHLRIDCEIENGVVTNAWSTGTMWRGMENIVKGA
DPRDAWMIMQRICGVCTTVHAILSVRAVEDAVGAKVPLNAQYIRNMILAA
HSIHDHIVHFYQLSAMDWVDITAVLKADPEKAANMLKGVSSWGLNSANEF
RNVQTKVKKLADSGQLGIFANGYFGHPAMKLSPEVNLIAVAHYLQALECQ
RDANRVVALLGGKTPHIQNLAIGGVANPINLDSQAVLNLERLMYVKSCID
RLNDFINQVYKVDTAIFAAYYPEWLNLGKTSGNYLAVPEYPVNAENSEFA
LTGGYLQNFDLNTFRPITQQKDNFVVQGIKESGKHAWYEDDEALAPWAGL
TRPKYTQWDENGKYSWVKAPSFYDDVVEVGPLAYLLTNLAAKNEVTTKHF
NELKSIYDQLAGRNLEINDLHSTLGRIIGRTVHCCALNEILTQQWQLLVN
NIGKGDTIAYLKANIPENGEFRGVGFGEVPRGMLSHWVVIKDGKIENYQA
VVPSTWNSGPRNQHDALGPYEQSLIGTPVADPAKPLEVVRTIHSFDPCMS
CAVHVVNTETGETTKVKVL
>MS2360 hyaD, HyaD protein
MKPLILGVGNILLSDEGIGVRAVQHLEKNANFTPHFDLVDGGTCGMELLD
VMANRDYLIIIDAVIAGKRPGEIVVLKDEQVPALFSRKISPHQLGICDVL
SALKLTDEYPKHLCLIGIQPESLESHIGLTKTVENAMPAVFQCLAQQLTD
LGLPSPVIN
>MS1029 hybA, HybA protein
MSAVQEQNIIKRSATSGVTPPPQVRKDVVEVAKLIDVTTCIGCKACQVAC
SEWNDIRAPQEQCVGVYDNPRDMNAQQWTVMKFSEVEENDRLEWLIRKDG
CMHCAEPGCLKACPAPGAIIQYANGIVDFQSEKCIGCGYCIAGCPFNVPK
MSNEDNRVYKCTLCVDRVNVGQEPACVKTCPTGAIHFGSKEEMLHYAETR
VADLKSRGYDNAGIYNPEGVGGTHVMYVLHHADRPELYAGLPKDPEIDVT
VKLWKDILKPVAAVAMGGLALAEIAHYVGVGPNNEEDVEDHSAHFEREDA
EEEQSHHNKGGK
>MS2364 hybA, HybA protein
MDRRKFIKAGMLGGVASALPLNAAHAEVKNQEPIPGALGMLYDSTLCVGC
QACVAECQKINGTPVNPKGEQTWSNNDKLSPFTRNVIQVWSEGEGTNKDQ
PQNGYAYIKKQCMHCVDPNCVSVCPVQALTKNPKTGIVGYDPDICTGCRY
CMVACPFDVPKYDYDNPLGQISKCELCNQKGVERIVQGKLPGCCHVCPTG
AIIFGSREELMAEAKRRLSMTQGTDYEFPRQHVNSKDKYQAKIPAYEQHI
YGEIEGGGTQVLVLSGVPFENLGLPQLDEIATGARAAHLQHTLYRGMILP
LVGLAGLTFITYRNMHGKKPEHHQEEDNNE
>MS1817 hybA, HybA protein
MTACSRRNFVSGMGALILTTGTSVKLSAQGEKPNETAPKRYAMVHDETSC
IGCTACMDACRETNQVPEGVSRLEILRSEPHGEFPNQEYEFFRQSCQHCT
NAPCVAVCPTGASFIDPETGIVDVNKDLCVGCQYCIAVCPYRVRFIHPVH
KTADKCNFCRDTNLAAGKQPACVEACPTKALTFGDMNDPNSAVARKVREN
PVYRTKLTLGTEPNLYHIPFAKGEHR
>MS0890 hybA, HybA protein
MSAVQEQNIIKRSATSGVTPPPQVRKDVVEVAKLIDVTTCIGCKACQVAC
SEWNDIRAPQEQCVGVYDNPRDMNAQQWTVMKFSEVEENDRLEWLIRKDG
CMHCAEPGCLKACPAPGAIIQYANGIVDFQSEKCIGCGYCIAGCPFNVPK
MSNEDNRVYKCTLCVDRVNVGQEPACVKTCPTGAIHFGSKEEMLHYAETR
VADLKSRGYDNAGIYNPEGVGGTHVMYVLHHADRPELYAGLPKDPEIDVT
VKLWKDILKPVAAVAMGGLALAEIAHYVGVGPNNEEDVEDHSAHFEREDA
EEEQSHHNKGGK
>MS1545 hybF, HybF protein
MEIVEEQCHRNNVNKVTDIWLEIGPLSCVEPDAIEFCFEVCRKNTVMENC
KLHFVPVPALAYCWHCEKTVEIKSHHDACPQCGGIHLQKQGGDDLRIKEI
AVE
>MS1463 hypB, HypB protein
MCTTCGCGHPEQVRIGELQHTHSHSEHQSAVKMPDFSQSVFHSMKPSIHE
HAGEQDNTQKRLLKIEQDVLGKNNRIADSNRNLFNYLNLTVFNLVSSPGS
GKTSLLTATLNSLKNDRNCYVIEGDQQTENDADRIRATGVPAIQVNTGKG
CHLDAQMISDAMMKLRPQENGLLFIENVGNLVCPSEFDLGEKAKVVILSV
TEGEDKPLKYPHMFAASKLMILNKVDLLPYLKFDVEKCIENAKRVNPQIE
VIQLSAATGEGLQDWLNWLQQ
>MS2358 hypC, HypC protein
MCLGVPGQIIDVGEDGFQPAVVDVCGVQREVNISLICENNTTDLLGKWVL
VHVGFAMSVIDEEEAKQTLSALMTMSQLDHEVGDFAGLNKN
>MS1462 hypD, HypD protein
MQFVDEFRDPKLAKHLVERLTKLMKNLPQFSAKNPLYLMEVCGGHTHSIF
KFGLDRLLPESIEFIHGPGCPVCVLPMGRIDLCIEIAQNPNVIFCTFGDA
MRVKGRKGSLLEAKAQGCDVRIVYSPLDALNIALSNPDKKVVFFSLGFET
TMPAAAVTLQQAKRRNIANFWIVSQNITIIPTLRSLLSQDQIKIDGFIAP
GHVSMVIGSAPYRELCKKFRKPFVIAGFEPLDILQSIVMLVEQFADGRCE
VENQYKRIVHEQGNMLAQKAMAEVFQLKARSEWRGLGEIEESGVELTVDY
RRFDAEIYFNSKAQQVADDPNSRCGDVLTGKCKPADCPLFGSDCNPDNAY
GALMVSSEGACAAYYQYRRE
>MS1461 hypE, HypE protein
MTDFITMAHGNGGAAMQQLIRDYFVEAFDNPTLAQGEDQARIPLAELIKC
GEKLAFSTDSFVIDPIFFPGGNIGKLAVCGTVNDIAVGGAIPKYLSCGFI
LEEGLPLSELKEIIRAMAETCRRAGVQIVTGDTKVVQKGAVDKVFINTSG
IGVIPAEIDWGAHQIEAGDKIIVSGTIGDHGATILNLRENLGIKTDLHSD
CAVLSSLIDLLRPIQGVKAIRDATRGGVNAVLHEYAQTQNLGMQVHEEDL
PMRNEVRGICELLGLEPLNFANEGKLVIITKAEKTQEILTALHSHELGKN
AAIIGEVTDDKKVRVVGIFGQTRLLDLPANEPLPRIC
>MS1546 hypF, HypF protein
MNTEKQSVIELRIKGKVQGVGFRPFVWLLANQYGLKGDVNNDGQGVLIRF
IEPDCASLQQFLRDLQNKLPPLAQITEIQETTKIGENLPHFSDFTIRESE
NNAVDTQIVPDAATCPACLKELFAPRNRRFHYPFTNCTHCGPRFTIIKSI
PYDRPNTSMANFPFCPECEREYKNPADRRFHAQPNACPVCGPHIWLQNQH
TKIADHEAALIQTLHLLNEGKIIAIKGIGGFHLACDATNRQTVQLLRSRK
RRPTKPLAIMVPDLQFLTALSRAETKLLTSSAAPIVLLSKHKVPAVDELI
APHLNEIGVMLPSNPLQHLLLKAINKPLVMTSANPSGQPPVLDNESAVKF
LQNLADFYLCHNRDILQRADDSLVRIAFDGLETLRRARGYVPDEISLNIS
NNKNILALGSDLKNTFCLLKRNKAVVSQHIGDTADEKVRSQLEENLALFQ
HIYQFKADLIAVDSHTGYFSSATGRQIAQCQQIPVMEILHHHAHIRAVMA
EHNCNEKVIGIALDGIGMGENQQLWGGECLLINRTEVKHLGGLPAVALPG
GDLAATQPWRNWLAHIHQFVENWQELAAKSCEKYNWQSLSRAIEQKINCP
TISSAGRLFDAVAYSLNIAPENLSWEGEAACRLEALAGQSQFTEKSAVKI
RQILPEFADELIWTNDKNNKTFLNLAKFWQSWRNYKAQKADKAFAFHLAL
AAGFAELARQQANKYQCRTIVLSGGVMHNRLLRRLLKENLQEFNVLSAHQ
FPMGDGGLSLGQAAIAADFT
>MS1693 icc, Icc protein
MISNTYIYEADSDVIRFVQITDPHLFKDEQGELLGVNTQQSLTQVLTELK
ENQFNYDFVLATGDIVQDSSEEAYLRFCKSVQQLDKMVFWIPGNHDFQPK
MFDILVQEHGNLSPKKHLLLGDKWQILMLDSQVFGVPHGQLGQYQLEWLD
SKLKDNPDRYSLVVLHHHILPTHSSWLDQHNLRNAHELAQVLAQYDNVRG
ILYGHIHQAMDGTWKDYQIMATPSTCIQFKPDSNVFALDTLQPGWREVEL
HSDGSIITRVNRIQKASFLPNMQEDGY
>MS2370 icd, Icd protein
MQSKVKIPQGDKIQLADNGALIVPHNPIIPFIEGDGIGVDVTPAMKAVID
AAVEKAYGGTRKISWMEIYAGGKANQVYGENTWLPAETLELIRQYHVAIK
GPLMTPVGGGIRSLNVAMRQGLDLYNCLRPIRYYEGTPSPVKHPEFVNMV
IFRENSEDIYAGVEWVAGSPGADKLINFLQREMGVTKIRFTEDCGIGIKP
VSKQGSQRLVRAALQYVIDNDRSSLTLVHKGNIMKFTEGAFKEWGYQVAK
EFGAELIDQGPWMKLKNPNTGTEIIIKDCIADAFLQEVLLHPKDYDVIAT
LNLNGDYISDALAAQVGGIGISPGANIGDDAAIFEATHGTAPKIAGQNKG
NPGSLILSGEMMLRHLGWLEAADLVVNAVAKTIADKTVTFDFAEMLEGAT
LRSTSEFAEDIIANM
>MS0562 iclR, IclR protein
MEKENQPEAVSSVLKVFGIIEALAEQKEIGITELAQRLMMSKSTTYRFLQ
TMKTLGFVSQEGETEKYTLTLKLFEVGAKALEYADIIGLANHEMSYISRQ
TNETLHLGTLDGTEIIYLHKIDSGYNLRMYSRIGRRNPIYSTAIGKVLLS
GLTNKEIRELLADLTFVKHTSKTLENIDQLIEEIEKVRKQHYAEDNEEQE
PGLRCVAAPIYNRFGRIIAGLSISIPTIRFEEEKLPQLVNLLQVAGKNIS
EQIGYHDYPEILAP
>MS0055 iclR, IclR protein
MFFIVRRLKEMEKNSGNQSLIRGLRLIEILSRFPNGCPLVQLANISELNK
STVHRLLQGLQQEGFVQPAITVGSYRLTSKCLSIGHKIFSSLNIINIISP
HLENLNLDLGETINFSMRENDHAIMIYKLEPTTGMMRTRAYIGQHLQLYC
SAMGKLYLAYDRPAYLKEYWQTNNDNIQTLTCNTITELPVMEKELDEIKK
QGFAVDKEENEIGISCIACPIFNFQNKVEYAMSVSISTSKLNQYGIEHLL
EKIKLTAEAISLELGWLPESVQN
>MS1751 ileS, IleS protein
MVRKMSEQKDYKNTLNLPETGFPMRGDLAKREPGMLKNWYDNDLYQKIRQ
SSKGKKSFILHDGPPYANGSIHIGHAVNKILKDIIIKSKTALGFDSPYIP
GWDCHGLPIELKVEGLVGKPNQKISAAQFREECRKYAREQVEGQKKDFIR
LGVLGDWDNPYLTMNFDTEANIIRAFGKAVANGHLYKGSKPVHWCLDCAS
SLAEAEVEYEDRTSPSIYVRFAAADESAVENKFVLTEQGKGKLSAVIWTT
TPWTLPSNKAISINPELEYQIVQFGDERFILAAELVESVAQAVGVESWKA
LGSAKGSDLELLQFKHPFYDYNVPFILGDHVTLDGGTGLVHTAPDHGQDD
YVVARKYNIGMAGLIGNDGKFNSNAKFFAGLGVFEANGKVLEKLDEVGAL
LKLEKIRHSYPHCWRHKTPIIFRATPQWFIGMETQGLRQQALSEIKKVRW
IPDWGQARIEKMVENRPDWCISRQRTWGVPVALFIHKETEQLHPRTLELI
EEVAKLVERKGIQAWWDLDAKDLLGDDAAHYSKVPDTLDVWFDSGSTYYS
VVKNRPEFNGKEADMYLEGSDQHRGWFMSSLMLSTATDNKAPYKQVLTHG
FTVDGQGRKMSKSIGNIVTPQEVMDKFGGDILRLWVASTDYTGEMTVSDE
ILKRAADSYRRIRNTARFLLANLNGFDPKRDLVQAHEMISLDRWAVDCAF
RAQAEIKEAYDNYQFHTVVQRLMKFCSVEMGSFYLDIIKDRQYTTKADSL
ARRSCQTALWHIAEALVRWMAPILSFTADEIWGYLPGERGEFVFTEEFYD
GLFALDVSESLDDAYWQQVITVRNEVNRVLEQARNDKVIGGGLEAEVTIF
ANDEYSALLNKLGNELRFVTITSKAEVKTLADADVAEGEVAGLAIKAIRS
ANHKCPRCWHYSDSKDANSLCSRCEENVNGNGEERRFA
>MS2218 ilvA, IlvA protein
MVNNLSNAPTGAEYLRAILISKVYEAAKVTPLQLMPKLSERLGNRIYVKR
EDHQPVHSFKLRGAYAMISGLTQAQKEAGVITASAGNHAQGVALSAKNAG
IRALIVMPQNTPSIKVDAVRGHGGEVLLHGANFDEAKAKAIELSQTEQMT
FIPPFDHPAVIAGQGSIGMELLQQNGHINRIFVPVGGGGLLAGVAVLIKQ
LMPEIKVIGVEAKDSACLYYALKAGRPVDLERVGLFADGVAVKRIGDETF
RICQQYVDDVILVDGDEICAAMKDMFENVRAVPEPSGALSLAGLKKYAKQ
HNLQGETLVNLLSGANLNFHTLRYVSERCEIGEKHEALFAVTIPEQRGSF
LKFCQILGQNAVTEFNYRYADEKQACIFVGVRITGEQEKQVIIQQLKQGG
YDVQDLSDDDIAKTHIRYMVGGRSSSDLNERLYSFEFPEQKGALLKFLET
LGTTDANISLFHYRGHGADYGDVLAGFQINDADLPAFKQHLEKLGYAYQD
VTDSPSYRYFLG
>MS1319 ilvB, IlvB protein
MKMKKLSGAEMVVQSLRDQGVKYLFGYPGGSVLDIYDAIHTLGGIEHVLV
RHEQAAVHMADGYARSTGEVGCVLVTSGPGSTNAVTGILTAYTDSVPLVI
ITGQVRSNLIGTDAFQECDTIGLTRPVVKHSFMVKHAEDIPETIKKAFYI
ASSGRPGPVVIDIPKDVVNPANKYTYEYPKEVSLRSYNPNVQGHKGQIKK
ALKALLVAKKPVLFIGGGVIIGNSSEKLTQFAQLLNLPVTSSLMGLGGYP
GTDKQFLGMLGMHGTYQANMAMHNADLILGIGVRFDDRTTNNVEKYCPHA
KVIHVDIDPTSISKNIAADIPIVGSVDNVLTEFLSLLEDDNLSKSQSDLT
EWWKQIDEWKAKKCLEFDRTSQAIKPQAVVEAIYRLTKGEAYIASDVGQH
QMFAALHYPFDKPRHWINSGGAGTMGFGLPAAIGTKFAHPDSRVVCITGD
GSIQMNIQELSTAKQYGTPIVIVSLNNRFLGMVKQWQDLIYSGRHSQVYM
NSLPDFAKLAEAYGHVGIQINTADELEEKLTQAFAVKDKLVFVDVLVDAT
ENVYPMQITGGGMNEMLLGKPAEK
>MS2223 ilvB, IlvB protein
MNGANLVTECLKAHNVDTVFGYPGGAIMPVYDALYDCGINHLLCRNEQGA
AMAAIGYARSTGKTGVCIATSGPGATNLITGLGDALMDSIPLVAITGQVA
APLIGTDAFQEADVLGLSLACTKHSFIVQNIEELPEIFAKAFKIAQSGRP
GPVLIDIPKDVQFAETLLQPIVYSVEKPTALSAKSLEKAVELLKNAKRPV
AYIGGGVGMAKAVPALHEFLTATRIPTICTLKGLGAVPADNPYYMGMIGM
HGTKAANYATQEADLLLVLGARFDDRVTGKLSSFATEAKVIHADIDVAEI
NKLRRADVALCGDLEQALKALSFALDIEPWRADVQRLKRDFDWDYGENEG
EGDINPLFLLNRVSRLKAENAIVVTDVGQHQMWAAQHMSFGKPENFITSA
GFGTMGFGLPVAIGAQKARPRDQVILVTGDGSIMMNIQELGSIKRAKTPI
KILLLDNQRLGMVRQWQSLFFHGRHSSTILDDNPDFVTLASAFGIRGERI
EKAGEVNEALDRFFASQEAYLLHVCVHEDENVWPLVPPGACNVEMIEEMS
>MS0045 ilvC, IlvC protein
MSNYFNTLNLRQKLDQLGRCRFMERSEFADGCNFLKGKKIVIVGCGAQGL
NQGLNMRDSGLDISYALRPEAITEKRASFQRATENGFKVGTYQELIPTAD
LVVNLTPDKQHSKVVADVMPLMKQGASFGYSHGFNIVEVGEQIREDITVV
MVAPKCPGTEVREEYKRGFGVPTLIAVHPANDPKGEGMAIAKAWASATGG
DRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSIVCYDKLVADGKDPAYA
GKLIQYGWETITEALKQGGITLMMDRLSNSAKIRAFELAEEIKEHLNFLY
LKHMDDIISGEFSATMMADWANGDKDLFAWREATGKTAFENAPKADGIKI
SEQEYFDNGVVMVAMVKAGVEMAFDAMVASGIYEESAYYESLHELPLIAN
TIARKRLYEMNVVISDTAEYGNYLFSNVATPILAKEIVSQLKRGDLGEPT
PAAEIDNVYLRDINDTIRNHPVELIGQELRGYMTDMKRISSQG
>MS2219 ilvD, IlvD protein
MEIFMPKLRSATSTQGRNMAGARSLWRATGMKEGDFGKPIIAVVNSFTQF
VPGHVHLHDIGQMVVKQIEAAGGVAKEFNTIAVDDGIAMGHGGMLYSLPS
RDLIADSVEYMVNAHCADAMVCISNCDKITPGMLMAAMRLNIPTIFVSGG
PMEAGKTKLSDQLIKLDLIDAMIQSADKNVSDSDVDAIERSACPTCGSCS
GMFTANSMNCLTEALGLSLPGNGSCLATHADRKQLFLDAATQIVELCKRH
YEQDDYSVLPRSIATKAAFENAMSLDIAMGGSTNTVLHLLAVAQEAEVDF
TMADIDRLSRIVPCLSKVAPNTNKYHMEDVHRAGGVMAILGELDRANLLH
HDTKTVLGLTFAEQLAKYDIKLTRDEAVKTFYRSGPAGIRTTEAFSQDCR
WETLDDDRENGCIRDKAHAYSQDGGLAMLSGNIALDGCIVKTAGVDESIL
KFTGEAIVFESQEDAVDGILGGKVKAGHVVVIRYEGPKGGPGMQEMLYPT
SYLKSMGLGKACALLTDGRFSGGTSGLSIGHCSPEAASGGTIGLVRNGDI
IAIDIPNRSIQLQVSDEELATRRAEQDVKGWKPANRAREVSFALKVFGHF
ATSADKGAVRDKTKL
>MS2192 ilvE, IlvE protein
MCRIGIFMDYPLFETVAVERGEILNLDYHQTRYEQALHQYYGRKVLPFNL
QEILQKSTALLTLKRSEPLIRCRIDYNDQDYRLQCFAYQRKVFRSFQPVI
CDHIDYGLKFSDRRIFAELLRQKGKHDEIIIIKQGLVTDCTIGNLLFRKN
QQWFTPEAPLLNGTQRAKLLAEKRIQTLNIKRQDIAQFDEIRLINAMNPF
SESL
>MS0896 ilvE, IlvE protein
MKDLDWKNLGFGYTKTDYRYIAYWKNGEWQKGELTKDNTLHISEGSPALH
YGQQCFEGLKAYRTKDGSIQLFRPDQNALRMQQSADRLLMPRVPVDMFID
ACKQVVKANEEWVGPYGSGATLYLRPFLIGVGDNVGVHPAKEYIFSIFVC
PVGAYFKGGLAPSKFLISTHFDRAAPHGTGAAKVGGNYAASLYPGKYAKE
HGFADCIYLDPATHTKIEEVGSANFFGITKDNKFITPISPSILPSITKYS
LLYLAKERLGLEVEEGDVYVKDLDQFAEAGACGTAAVITPISGVQIDDKY
HVFYSETEIGPITQKLYDELTGIQFGDKPAPEGWIVKVE
>MS2222 ilvH, IlvH protein
MDSSWRKSMTNELTIVAHHRPEILERILRVVRHRGFTVIKLKMNLENGKI
WLDFVVEGERDICLLVHQLVKLEDIIDITTDEECECDE
>MS1318 ilvH, IlvH protein
MRRILSVLLENESGALSRVVALFSQRAFNIESLTVAPTDDPTLSRMTIEA
SGDEAILEQIEKQLHKLVDVFKVINLSDCEHVEREVMLLKLRATGSTRDE
IKRLTDIFRGQIVDVTTKSYTIQLAGTKDKLNAFVSAVKEETTIIEIVRS
GLISLSRGEKNCL
>MS2357 imp, Imp protein
MKKNYYSLISFSIFTALYSTAGFADLQQQCLAGVPQFSGEVVKGNANEMP
VYIEADKAELNHPTKGVYQGNVDIKQGNRHLITETAEIIQSGQDENVQRY
AYAKGGFDYKDNIINLTGDDAKVHLNTKDTDVKNADYQFVGRQGRGSAQS
AEVREDYRLLNNATFTSCLPNDNSWQIEAKEMKQYIKEEYAEMWHARFKV
AGVPVFYTPYLQLPIGDRRRSGLLIPSAGSSSRDGYWYSQPIYWNIAPNY
DATFTPKYMTHRGWQMNGEFRYLNEIGEGKIAGEYLGDDRYKDYIGDNKS
RHLFYWAHNAKLFDNWRLNVNYTKVSDKRYFSDFDSDYGSSTDGYATQTA
RLAYFQPNYNFAISAKQYQVFDEVSVGPYKALPQIDFNYYQNDLAQGLLD
FKLFAQAVRFENDSTLMPTAWRYHAEPSLNLPMSNQYGSLNVETKLYATH
YEQRKGSSARAEDIDRSVNRMIPQIKVDLQTVLASDKTFVDGFTQTLEPH
LQYLYRPYRDQSNIGSKRNTEYLGYGYDSALLQQDYFSLFRDRRYSGLDR
IASANQFTLGGTTRFYDEQANERFNLSLGQILYLNDSRIDNNSDHSTSGR
ASSWALESNWKLSDQWNWRGSYQYDTRLNETSLANTTLEYNPEKNNLIQL
NYRYASQAYIDQNLTSGANRYNQDIKQIGTTIAWEVSDNWVLVGRYYHDI
ALNKLVEEYAGIKYNTCCWSVGVGARRHLVSKSNYTYSANKDTIYDNSFG
ITFELRGLGNEQHSGIVDMLDKGMLPYVKPFNL
>MS0654 infA, InfA protein
MSKQSKQEKLPPPHKNGYNSHHSKKLPQKCGKNFIIFLEDTMAKEDCIEM
QGTILETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTIEMTP
YDLNKGRIIFRSR
>MS1444 infB, InfB protein
MTDKETQNENAPKKLSLQRRVKTTVAGGKVQVEVRKSRKIDTEAAKKAAE
EAKLKAQEAAEKAAAEKAEKEAAEKAKKNAEKARVAAAVKKPEPVKVVDA
EKDRIKAEEAELRRKADELARQKAEEQARKAAEEAKRLAELAADRETTEV
SDDFSDYHLTSTYAREAEDEEERRKEGRGRGKNKVGKAKKGGRDDNGSKD
ERNADRRNQKDVKGKGKQGKKGSSAIQQAFTKPAQAVNRDVVIGETITVA
ELANKMAVKATEIIKTMMKMGEMVTINQVIDQETAQLVAEEMGHKVILRK
ENELEESVLEDRDVNAEKVTRAPVVTIMGHVDHGKTSLLDYIRKAKVAAG
EAGGITQHIGAYHVETNGKMITFLDTPGHAAFTSMRARGAKATDIVVLVV
AADDGVMPQTIEAIQHARAASVPLVVAVNKIDKPEANPDRVEQELLQYDV
VSEKFGGDTQFVYVSAKKGTGVDELLDAILLQSEVLELTAVKEGMATGVV
IESYLDKGRGPVATILVQSGTLNRGDILLCGFEYGRVRAMRDELGKDVES
AGPSIPVEVLGLSGVPAAGDEATVVRDEKKAREVALYRQGKFREVKLARQ
QKAKLENMFSNMAEGDVAELNVIVKADVQGSVEAIVQSLQELSTEEVKVK
VVGSGVGGITETDATLAAASNAIIVGFNVRADASARRIIETENIDLRYYS
IIYELLNEIKAAMSGMLQPEFKQEIIGLAEVRDVFRSPKFGAIAGCMVTE
GVIKRNNPIRVLRDNVVIFEGELESLRRFKDDVNEVRNGMECGIGVKNYN
DVKVGDQIEVFEVVEIKRSI
>MS1054 infC, InfC protein
MSRAEEVELDLVEISPNAEPPVCRIMNYGKFIYEKEKAAKEQKKKQKVVQ
VKEIKFRPGTDEGDYQVKLRNIVRFLEDGDKVKITVRFRGREMAHQDIGL
DVLDRVKQDTAEIAMVESAPGKLEGRQAVMVIAPKKK
>MS1208 insB, InsB protein
MKLLRQLSVAALGLFALQAAQSKNLIVYFTVPESVPTEKLDGVSGASVII
KDNERLGSAEYLAKEVQKTAGGDLFRLETVQAYPTVHQQLLDFAQEEQRK
NIRPALKAKPNLNGYETIFVAYPIWWYKLPMPLYSLFEQVDFSGKNIVPL
VTHGGSRLSGTDRDIAQLQPKATVKDGFEYYLYKTTGADSTMEQKLADWL
VRQGYAK
>MS1529 iolE, IolE protein
MKTIKGPGLFLAQFIDNKAPFNRLDSLAQWAAGLGFKALQIPCNHKHIFD
VELAAQSQTYCDEVKGILAQYGLIVSELSTHLEGQLVAVHPAYDTAFDGF
APEQVRGNSEARQLWAVNIIKQAAVASKRLGLNAHASFSGSLAWPYFYPW
PPRKEQLIQTAFDELAKRWKPILDYFDEQGVDLCYELHPGEDLHDGVTFE
RFLDKLNRHPRCNILYDPSHMLLQNMDYLQFIDFYHERIKAFHVKDAEFV
KSAKSGCYGGYQNWLERAGHFRSLGDGQIDFKAIFSKLTQYDYTGWAVLE
WECCMKDSAVGAKEGAEFIQKHIIPVAEKSFDNFADTGDDSEQAKIMLGL
K
>MS1307 iscA, IscA protein
MTDMTIPLTFTDAAAKKVKNLIIEEENQDLKLRVYITGGGCSGFQYGFTF
DEKVNDGDLTIENDGVKLVIDPMSLQYLIGGTVDYTEGLEGSRFVVHNPN
ATTTCGCGSSFSI
>MS1723 iscA, IscA protein
MSVEQFSVEDAEQSSQSASIGMTESAAKHVKKCLESRGKGIGLRLGIKTS
GCSGLAYVLEFVDELNSDDNVFEQHGVKVIVDTKSLVYLNGTQLDFVKEG
LNEGFKFTNPNVKDQCGCGESFNV
>MS1724 iscU, IscU protein
MAYSEKVIDHYENPRNVGSFDKKSSDVGTGMVGAPACGDVMQLQIKVNEE
GIIEDAKFKTYGCGSAIASSSLITEWVKGKSLDEAGAIKNSQIAEELELP
PVKVHCSILAEDAIKAAIADYKSKKGA
>MS1600 ispA, IspA protein
MHCLLSELKENRVKSTALFLFNYIKNCIMTTTNNLMDIKTIQALVSDDMQ
KVNEEILAQLNSDVPLINQLGYYIIHSGGKRIRPMIANLAAKALNYQDNK
HITCAAFIEFLHTATLLHDDVVDESDMRRGNPTANAEFGNAASVLVGDYM
HTRSFQMMTELGSLRILQVMSAATNVIAEGEVQQLMNVRDPDTTEQNYMK
VIYSKTARLFEVSTQTVAILADAGAAIEEGLQNYGRYLGTAFQLVDDILD
YSANAATLGKNIGDDLAEGKPTLPLLHAMRHGNPQQAELIRNIILQGGNR
DALDEVLTIMHEHKSLDYAMEHAKQEAQKAVDALAPLPDNIYKRAMISLA
YLSVDRAY
>MS1060 ispA, IspA protein
MMRLMYQFSNDLQQAQQRINRFLEAQFDEINTRPSPLADAMKYGLLLGGK
RIRPFLVYATGRMLGADTAQLDYAAAAIEAIHAYSLIHDDLPAMDNDSLR
RGQPTCHIAFDHATAILAGDALQAFAFEILTKSTALSAEQKLRLIQVLSH
NSGVFGMCLGQSLDLISEHKQISLSELEQIHRNKTGALLSAALKMGFICS
SHFADNALEAKLDRYAAAIGLAFQVQDDILDIEGDLAAIGKNVGSDLESD
KSTYPKLLGLAGAKQKARELYVAAVGELEHLPFDTTALRALAEFIINRKN
>MS2275 ispD, IspD protein
MTRHSRPIIAVVPAAGVGSRMQADKPKQYLTLLGKTLLEHTLEVLLSYTP
IQQIILAVAENDPYLDQLDVIRQPKIKIVQGGRDRAGSVFNGLKAITQPH
AWVMVHDAARPCLTHEDLDKLLQIEDDNGGILAIPAVDTIKRASAEKQII
QTEDRSQLWQAQTPQFFRADLLYRALQQAFEHGLAVTDEASAMEFAGFRP
HLVAGRSDNLKVTRPEDLKLAEFYLSRK
>MS1535 ispE, IspE protein
MKTHQFSTALFSDYKQGDSFNFPCPAKLNLFLYINGRRTDGYHELQTLFQ
FLDYGDWLSIKVRNDGKIRLTPEIPDLKTEDNLIYRAAKLLQQKTACRLG
ADLHLDKVLPMGGGVGGGSSNAATALVALNYLWKTGLSVNELAELGLKLG
ADVPIFVHGKAAFAEGVGEKITYCEPPEKWYAVIKPNVSISTAKVFSEPD
LTRDTKKKPLEQLLQQEYTNDCEKVVRKLYPEVEELLRWLVKYAPSRLTG
SGACVFAEFADEQSAQTVFNLKSKQFSGFVAQGLNVSPLHKMLEQLNRQN
HG
>MS2274 ispF, IspF protein
MIRIGHGFDVHAFGEARPLIIGGVEVPYHTGFIAHSDGDVALHALTDALL
GALALGDIGKLFPDTDMQFKNIDSRILLREAFRRVQEKGYKIGNVDVTII
AQAPKMRPHIDAMRAVIAEDLQCSVEQVNVKATTTEKLGFTGRSEGITTE
AVALLVKSC
>MS0507 kamA, KamA protein
MRILTQNNPVREENWLEILANSISDPEVLLKTLSLPIDKFEKDIHARKLF
AMRVPLPFVRKMELGNAQDPLFLQAMSSADEFLTADGFSKDPLEEQQVVA
PNILHKYKNRLLLMVKGGCAINCRYCFRRHFPYADNQGNKANWQKALDYI
SANPQIEEVIFSGGDPLMAKDHELDWLIKKLEKIPHLQRLRIHTRLPVVI
PQRITGAFCKILTESRLNTVLVTHINHGNEIDEQLTRALNKLKNAGVVLL
NQSVLLKNINDNAQTLKNLSDKLFRAGILPYYLHLLDKVEGASHFYVPDQ
RAVEIYRELQSLTSGYLVPKLAREIAHEPNKTLYGG
>MS1189 kdsA, KdsA protein
MQNKIVRIGDINVANDNPFVLFGGMNVLESRDMAMQVCEKYVEVTNKLGV
PYVFKASFDKANRSSIHSYRGPGMEEGLKIFQELKQTFGVKIITDVHEIY
QCKPVAEVADVIQLPAFLARQTDLVEAMARTGAVINVKKPQFLSPGQMGN
IVEKIEECGNDKVILCDRGSNFGYDNLVVDMLGFGVMKKVSKGAPVIFDV
THSLQCRDPFGAASGGRRDQVTELARAGLAVGIAGLFLEAHPDPNNAKCD
GPSALPLSVLEGFVSQMKALDDLVKSFPQLDTSK
>MS0935 kdsB, KdsB protein
MTNFTVIIPARFASSRLPGKPLADIAGKPMIIHVLEKARLSGATRVVVAT
DNEEVKQAVEQFGGEVCMTSAKHNSGTERLAEVVETLNIPDDEIIVNIQG
DEPLIPPVIVSQVAENLCKFKVNMASLAVKIHESAELFNPNAVKVLTDKD
GYVLYFSRAPIPWNRDAFARLNSGELKQEELDLADHYLRHIGIYAYRAGF
IKQYVQWEPSALEQIESLEQLRVLWYGEKIHVELAKEIPAVGVDTAEDLE
KVRAILSNF
>MS1952 kdtA, KdtA protein
MLRFVYSFAMYILQPFVLLFILLRSIKSPNYRKRLNERYGIYANLTPPKP
QGIIVHAASVGEVIAATPLVRRIQQDYPDLPITMTTVTPTGSDRVKAAFG
DSVSHFYLPYDLPDAMDRFIRFVRPKACIVIETEIWPNLIRQLHNKNIPF
IIANARLSARSAKRYGWVKNILNRMFNEISLIAPQDDISGNRYLDLGYRG
DLQLTGNIKYDLVISDALSQQIKRLHQEWAGERPVWIAASTHEGEEGIVL
QAHRSLLQKFPDLLLILVPRHPERFKAVEDLIVKGGFSYCRRSENVAPGS
DTQVVLGDTMGEMMLLYGISDIAFVGGSLVKHGGHNPLEPLAFKLPVISG
YHTFNFPEVFTKLRDVNGVLEIKENSTALSSAVEKFLLSPALRERYGNAG
YEVLIENRGALQRLLQLLTPYLENKK
>MS2136 kefB, KefB protein
MVTEGANYLVSIVTFLGAAIIVVPLFKKIGLGPVLGYLAAGLAIGPFGLA
LFTDSTTIIHIAELGVVMFLFLVGLEIQPKQLWGLRKYIFGMGSLQVLGA
TGALTAIGLLYNFSLQFSFIAASGFVLTSTAIVMQTLSYRNDMTSDPSRR
IIAVLLFEDLLIVPLLALVAILSPAQTADSLQSHSLWQHIIVSFAGLALL
IVAGIWLLDPLFRLVAKTKIRELMTAVALFVVLGSALLMEATGLSMAMGA
FLAGVLLSNSSFRHQLEVDIDPFKGLLLGLFFLGVGMSLDLTHVLNHWKM
IVSALFLMMITKGIIIYAVARATGSTKLQSLDRAVLMAQGGEFAFVLFSS
AALQGVISAEVHANMTAIVVLSMALTPLFIVIYQKWIAPKFAVREVLEND
VIEEQNDIILIGLGRFGQIVNHLLRASGFQPTIIDKDAKLVSGMKKRGIR
SYFGDACHPDLLHRAGIETVKLVIVAIDNTKQATKIVQHIRQINPKAKII
ARAYDRHHVFELAQAGANVQIRETFDSALRTGKQALTTLGIEQEKVHRIG
NMFFGKDRHSVKLMADVYDPKKPMFTNADMLKIAFEQDEELKLEIQKILD
EEI
>MS0630 ksgA, KsgA protein
MNSKRHLGHTARKRFGQNFLHDDNVIQGIVAAIYPQKGQFLVEIGPGLGA
LTEPVADQTDRLTVVELDRDLAQRLRHHPFLHQKLNVIETDAMQFDFGKL
YEDEHLAEQGQKLRVFGNLPYNISTPLIFHLLKFYDKIQDMHFMLQKEVV
KRLCAAPNSKAYGRLTIMTQYFCQVMPVLEVPPTAFKPAPKVDSAVVRLI
PHKELPHPVKDLYWLNRVTSQAFNQRRKTLRNALSTLFTPEQLTALNIDL
TARAENLSIADYARLANWLADNPPADVRRDEIIEENEE
>MS0806 lacZ, LacZ protein
MFIHRYFEDPQALHINTTPHHAYFIPQKCGQKWENFEPEQSLFYLSLNGY
WDFRYYLSPQELPESPNEVNFEAKIPVPSNWQTQGYDRHHYTNINYPFPF
DPPYVPQNNPCGIYRRTFELNKKENKHYLLNFEGVDSCLYVYINQTFVGY
GQISHSTNEFDITDFVQAGNNEIFVVVLKWCDGSYLEDQDKFRMSGIFRD
VYILEREANYLQDFFIRTDLSPNLNNAQIKVETKFLENNQNIDYALYDPS
GKLLIQQQTDKFEISFDNPETWNAENPRLYTLIMSYGQEQIVQRLGFRRI
QIENGILLFNGQPIKFRGVNRHDSDPVTGYHISRTQAVRDLQLMKAHNIN
AIRTAHYPNAPWFSELCDRYGFYLIGESDIESHGSSMLAVRQTEPSIFLN
VKNSYEHERIRQDNIDNLCYFARDPQFKEALLDRTYANVERDKNRTSVII
WSLGNESGYGENFEACAAWVKSRDPGRLVHYESSIYQHSAHRNDLSNLDF
YSEMYAATEDLDAYFANPANLKPFMLCEYSHAMGNSNGDAEDYFQAFHRH
PGSCGGFVWEWCDHAPYRDDKPEHFGYGGDFGESPHDGNFCVDGLVSPDR
IPHTNLLELKNVNRPVRAWLEQGKVYIKNYLDFTNLKEILTIRYSFSENG
KVINQSELQIDCAPHQIQVLDIALPADNGNLCWLNLDYVLTQPTDLLSKN
HLLGFDQLIIFKQGALPAQIFKNKTGHFKIQDTAQTLKICQGDFSYELDK
NKGIFSRITYREQNLIEQPLDFNIWRAPLDNDRLIRQSWQQAGYDKTYSR
AYQINWIEKNEGILIQANLALLAVSQGRILNLAVNYLLGADGQMKISLRA
TRPEHLPYLPRFGLRFFLPKGQTQGQYFGYGPQESYVDKHHLAKLGIYPL
NATDNYVDYLKPQENGSHYGSRYITLNSLHVSADQPFSFNLLPYSQEELT
TKAHNYELKESPWDILCLDYKMSGIGSNSCGPNLKEQYRLSETDFNWGVF
LQWSNPAR
>MS0749 lacZ, LacZ protein
MFLPNYFQNPQILHVNATPHHAYFIPHDSVESAVKNPRESSAFFTLLNGE
WNFQYFASYYDLTEDFLTRHLPDKIPVPANWQNHGYDHHQYTNVNYPIPF
DPPYVPQDNPCGLYQRKFCINLNKAKRYLLNFEGVDSCLFVYINQQFVGY
SQISHCTSEFDVTDFLRQGENEIHVLVLKWCDGSYLEDQDKFRMSGIFRD
VYLLERESHYLQDFFIRTELAEDLKSAVLKVEPFFVQAGEPAALREMAWQ
LSDPQGNILLSAVTERGFEYVVNELQLWNAEQPKLYTLLFRYGSEVICQK
IGFRKIEVKEGVLHFNHQPIKFKGVNRHDSDPKTGYVISREQALTDLRLI
KTHNFNAIRTAHYPNAPWFAELCDELGFYLIAESDIESHGSNAVYVEMPE
TSILLNVKTDPKTDEIQQKTVDEYCYFARDPNFKQAILDRTYANVQRDKN
RASVVIWSLGNESGFGENFEAAAKWVKGFDPSRLVHYENSIYQHSEHTND
LTHIDLYSEMYASTESMQAHFADPNNRKPYLLCEYSHAMGNSCGDAEDYW
QIFNQYPQACGGFVWEWCNHSPYLTDGKMGYGGDFGDEPNDGNFCADGLV
TADRQVQSSLLEMKNVNRPLRANLTAQGVELTNYLDFTDTEDFIAVHYQF
SENGIVVGEGYIDDVKISPKQTALLPLNLPEDNGNLWLLDLTYYQKKETP
LVAKNHQLGFDQIALFGQRIVPVARIGRVKSAVKIWQDSAVIKIQTEKAQ
FVMDKRKGIVQQIMTEQGGLLREPLDFNIWRALADNDNLIKRQWQAAGYD
RAITRAYEIQAEDFSYKAVVKAKCGLVALSKARILTLDVVYHIYANGELK
IEIDAEKAPQLPFLPRFGFRFVLDEAFQQGEYFGYGETESYADKHHGAKL
GLYRTTAQQNHRDYLKPQENGSHWGCSFVKLRSENEEICVTSDKPFSFNL
SPYKQESLQKAKHNYDLEDSNSTVLCIDYKMSGIGSNSCGPVLKAIYRLK
ENKWHCGFRLQIL
>MS1728 lasT, LasT protein
MLNNIRLVLVETSHSGNIGSAARAMKTMGLANLYLVSPKQGIDEQAVALS
AGAEDVLRKAVIVKSFDEAVADCSLVIGTSARLRHLQSTLIEPRECGKIS
IQEAHCGQIAIVFGREKFGLTNDELLKCRYHLNIPANPDYSSLNLAMAVQ
LVSYELRMAWLNGRQSESMSDSVEIAQKPTALELEYFFAHTEKLYQSLGF
IQNQGVMQKLRHLYNRVNLKKNELNILRGMLSAVEKRLDLLRDK
>MS1188 ldhA, LdhA protein
MMKSAVVFTALFLYAISHIKDELYLPKQGAFMKIVFLDSTALPPHLPIPR
PDFDHEWIDYPYTGAEQTVERAKDADIVVTSKVIFSREVMEQLPKLKLIA
LTATGTNNIDLIAAKELGIRVKNVAGYSSVTVPEHVLGLIFSLKHSLAGW
YRDQLEGKWGESKQFCYFDYPITDIRGSVLGVVGKGCLGTEVGRLATALG
MKVLYAEHRDAQSCREGYTPFDEVLKQADIVTLHCPLTEHTTNLINKETL
SLFKKGAFLINTGRGPLVDEQALLDALKSGHLAGAAIDVMIKEPPEKDNP
LIVAAKTMPNLLITPHIAWASDSAVTTLVNKVRDNIEEFVATGK
>MS2079 ldhA, LdhA protein
MTKSVCLNKELTMKVAVYSTKNYDRKHLDLANKKFNFELHFFDFLLDEQT
AKMAEGADAVCIFVNDDASRPVLTKLAQIGVKIIALRCAGFNNVDLEAAK
ELGLKVVRVPAYSPEAVAEHAIGLMLTLNRRIHKAYQRTRDANFSLEGLV
GFNMFGKTAGVIGTGKIGLAAIRILKGFGMDVLAFDPFKNPTAEALGAKY
VGLDELYAKSHVITLHCPATADNYHLLNEAAFNKMRDGVMIINTSRGVLI
DSRAAIEALKRQKIGALGMDVYENERDLFFEDKSNDVITDDVFRRLSSCH
NVLFTGHQAFLTEEALNNIADVTLSNIQAVSKNATCENSVEG
>MS1217 lemA, LemA protein
MKKWLLIIIVAVIAGFTLMSSYNGLVKAEEEIDSVWANVESQYQRRSDLI
PNLVNTVKGQANFEQETLTGVIEARAKATQTKIDPANMTEEQLAQFQQNQ
DSVGSALSRLLVSVERYPELKAHEGFMNLQAQLEGTENRINVARNKFNEA
ARVYNQKVRQFPTKLAAMILGFKEKPYFKSTAGAENAPTVSFDK
>MS0371 lepA, LepA protein
MFFYVFLASFLLFISSDIFFSLTKKCGKNFANFIALSRTFGYNQPIKLFI
KNNYFEHITIKMKNIRNFSIIAHIDHGKSTLSDRLIQTCGGLSDREMEAQ
VLDSMDLERERGITIKAQSVTLNYKAKNGETYQLNFIDTPGHVDFSYEVS
RSLAACEGALLVVDAGQGVEAQTLANCYTAIEMNLEVVPILNKIDLPAAD
PERVAEEIEDIVGIDAMEAVRCSAKTGVGIEDVLEEIVAKIPAPKGDPNA
PLQALIIDSWFDNYLGVVSLVRVKNGVLRKGDKIKVMSTGQTYNVDRLGI
FTPKQVDKNELECGEVGWVVCAIKDILGAPVGDTLTSQHNPASSVLPGFK
KVKPQVYAGLFPVSSDDYEAFRDALGKLSLNDASLFYEPETSTALGFGFR
CGFLGLLHMEIIQERLEREYDLDLITTAPTVVYEVELTNGDVIYVDSPSK
LPPLNNISEIREPIAECNMLVPQEYLGNVITLCVEKRGVQTNMVYHGNQI
ALTYEIPMGEVVLDFFDRLKSTSRGYASLDYSFKRFQAADMVRVDIMING
ERVDALALIVHKDNAPYRGRELVEKMKELIPRQQFDIAIQAAIGNHIIAR
STVKQLRKNVLAKCYGGDVSRKKKLLQKQKEGKKRMKQLGNVEVPQEAFL
AILHVGKDSK
>MS0370 lepB, LepB protein
MVKTVNKERLMANFFLPILLVVGFAIWKVLDHFTLPNTFSILLIILTALS
GILWCYHRFVVNPKRSRQITRIEQRTGKTLSAEEKQKVEPVSEGSEFVAS
IFPVLAFVLILRSFVFEPFQIPSGSMEPTLRIGDFLVVEKYAYGIKDPVF
QNTLIETGKPQRGDVIVFKAPPQPNVDYIKRIVAIGGDRIRYNELDRKIT
LVYGENGKPCSENCEVKEFSYSEPVENKEFQFIIGQNPDGSLMYGPSPLE
TTESGDVEHKIHWYPEPISEGYRYKDYSTQDNYITEWTVPENQYFVMGDN
RNNSEDSRFWGFVPEKNIVGKATYIWLSLDKKQNEWPTGIRSERIFQKIQ
>MS0369 lepB, LepB protein
MLMLKRLAKLVVLLSVLRGIIAYLINIVPTASMAPTFMPQDFILVNRVAY
NLKIPVLGDTITPLNTAKRGDIVIFRQDGGTDEYIKRIIAVEHDHVRYDQ
KSGIISVTPNYRQNNCQINHCETLLYKQQNERNYLNPETVLFSQQGEKLA
LIERQEFTDETNHAILLTKVRYDQSAHYFKQDNLPLGEWIVPAGHYFVMG
DFRENSIDSRFFGFIPHDNLTGKAVSVIFNPKQETRFFKTIQ
>MS0599 leuA, LeuA protein
MNVHNKRIRTMANNRVIIFDTTLRDGEQALKASLTVKEKLQIALALERLG
VDVMEVGFPVSSAGDFESVQTIAVHVKNSVVCGLSRAVNKDIDAAAEALK
VAERFRIHTFIATSALHVEAKLKRSFDDVVEMAVAAVKRARRYTDDVEFS
CEDAGRTGIDNICRVVEAAINAGATTVNIPDTVGFCLPTEYGNIIHQVMN
RVPNIDKAVVSVHCHNDLGMATANSLTAVLNGARQIECTINGIGERAGNT
ALEEVVMSIKTRQDLFGVDTRINTQEIHRVSQMVSQICNMPIQPNKAIVG
ENAFSHSSGIHQDGMLKNKNTYEIMSPETIGLKKEKLNLTARSGRAAVKG
HMADMGYTEQDYDLDKLYEAFLKLADKKGQVFDYDLEALAFIDMQQGDED
RLKLDVITSQTISTLPASAFVQVELDGKRINKTSNGGNGPVDAVYNAIMQ
IVGMDLKMSHYNLTAKGEGAEALGQVDIVVEYQGRKFHGVGLATDIVESS
ALALVHAINAIYRSQKVADLKKDLKHIHTV
>MS0598 leuB, LeuB protein
MSTYNVAVLPGDGIGPEVMAEAIKVLDKVQAKFGFKLNFTQYLVGGAAID
AKGEPLPAETLQGCDNADAILFGSVGGPKWTHLPPDQQPERGALLPLRKH
FKLFCNLRPATLYKGLEKFCPLRADIAAKGFDMVVVRELTGGIYFGQPKG
RDGEGSDTRAFDTEVYYKYEIERIARAAFDAAMKRRKQVTSVDKANVLQS
SILWRETVAEIAKEYPEVQVENMYIDNATMQLIKAPESFDVLLCSNIFGD
IISDEAAMITGSMGMLPSASLNEEGFGLYEPAGGSAPDIAGKGIANPIAQ
ILSAAMMLRYSFNLNEAATAIENAVQKVLADGHRTGDLADNSTPVSTAEM
GTLIANAI
>MS1105 leuB, LeuB protein
MVIMTHKIAVIPGDGIGIEVINEGVKVLNCVSQLDPKIQFEFTHFPWGCE
FYSKTGRMMDDDGIERLSKFDGIFLGAVGYPGVPDHISLWGLLLRIRKSF
DQYVNVRPVKLLKGAPCPLKEKSPKDINMIFIRENSEGEYAGSGSWLYRD
KPNEVVIQDGVFSRVGCERIIRYAFELARTEKKSLTSISKGNALNYSMVF
WDQIFQQLSQEYPDVETHSYLVDAAAMLMITKPERFEIVVTSNLFGDILT
DLGAAIAGGMGLAAGANLNPEGNFPSMFEPIHGSAPDIAGKQLANPLATV
WSASQLLEFFGYKEWAARLIDAIEYLLVEQKTLTPDLGGTAKTADVGDAV
VAYLQKHFA
>MS0333 leuC, LeuC protein
MENAMSKTLYDKHIDSHTIKELDNEGNVLLYIDRTILNEYTSPQAFSGLR
EENRDVWNKKSILLNVDHVNPTRPVRDANMTDPGGTLQVNYFRENSKLFD
IELFDVTDPRQGIEHVVAHEQGLALPGMVIAAGDSHTTTYGAFGAFGFGI
GTSEIEHLLATQTLVYKKLKNMRVTLTGKLPFGTTAKDVIMALVAKIGAD
GATNYAIEFCGEVIDELSVEGRMTICNMAVECGARGAFMAPDEKVYEYIK
GTPRAPKGEMWDLAIAEWRKLKSDNDAVFDKEIHMDCSDLEPFVTWGISP
DQADVISGEVPDPNLLPEGQKRKDYQAALEYMGLEPGMKFEEIKISHAFI
GSCTNGRIEDLREVAKVLKGRKIAQGVRGMIIPGSTQVRARAEAEGLAKI
FIDAGFEWRQSGCSMCLAMNEDVLSPGDRCASGTNRNFAGRQGAGSRTHL
MSPAMVAAAAVAGHLVDVRKFVEGD
>MS0596 leuC, LeuC protein
MAKTLYQKLFDAHVVYEAEGETPILYINRHLIHEVTSPQAFDGLRVAGRQ
VRQVSKTFGTMDHSISTQVRDVNKLEGQAKIQVLELDKNCKATGISLFDM
NTKEQGIVHVMGPEQGLTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVE
HVLATQTLKQARAKSMKVEVRGKVNPGITAKDIVLAIIGKTTMAGGTGHV
VEFCGEAIRDLSMEGRMTVCNMAIEFGAKAGLVAPDETTFEYLKGRPHAP
KGKDWDDAVAYWKTLKSDEDAQFDTVVVLEAKDIAPQVTWGTNPGQVIGI
DQLVPNPAEMTDPVTKASAEKALAYIGLEPNTDLKNVPVDQVFIGSCTNS
RIEDLRAAAAVMKGRKKADNVKRVLVVPGSGLVKEQAEKEGLDKIFLAAG
AEWRNPGCSMCLGMNDDRLGEWERCASTSNRNFEGRQGRNGRTHLVSPAM
AAAAAVFGKFVDIRNVSLN
>MS0334 leuD, LeuD protein
MDKFTLITAKAAPMMAANTDTDVIMPKQFLKGIDRKGLDRGVFFDLRFNL
DGTPNEKFILNQADWQGSQFLVVGPNFGCGSSREHAVWGLKQLGIRALIG
TSFAGIFNDNCLRNGVLTICVSDQEIEQIATTVSNPATNTISVDLEGQKV
LTENGEIAFDVDPLKKEMLIKGLDAVGFTLSMKDDILAFEQSYFKANPWL
KL
>MS0595 leuD, LeuD protein
MTRIDKMAGLKQHSGLVVPLDAANVDTDAIIPKQFLQAITRVGFGKHLFH
EWRYLDAEETQPNPEFVLNFPQYQGASILLARKNLGCGSSREHAPWALAD
YGFKVMIAPSFADIFYNNSLNNHMLPIKLSEQEVEEIFQWVWANPGKKID
VDLEAKTVTVGEKVYHFDLDEFRRHCLLEGLDNIGLTLQHEDAIAAYESK
IPAFLR
>MS0338 leuS, LeuS protein
MQEQYRPDLLEQEVQKYWQNNQTFKAVKDSSKEKYYCLSMFPYPSGRLHM
GHVRNYTIADVVSRYQRMNGKNVLQPVGWDAFGLPAEGAAVKNKTAPAKW
TYENIDYMKNQLKMLGFSYDWDREIATCKPEYYKWEQWFFTELYKKGLVY
KKTSVVNWCPNDETVLANEQVHEGCCWRCDTPVEQKEIPQWFIKITDYAE
QLLSGLDTLPEWPDMVKTMQRNWIGRSEGVEITFKIENSDETVAVYTTRP
DTFYGVSYMAVAAGHPLAEKAAQNNAELARFIQECKNTKVAEAELATMEK
KGMATGINAIHPITGKPVPVWVANFVLMHYGTGAVMAVPAHDQRDFEFAT
KYGLPIKQVIAPMNGEEIDLTKAAFTEHGKLVNSAEFDGLDFEAAFNGIA
DKLEKMGVGKRQVNYRLRDWGVSRQRYWGAPIPMLTLENGDVVPAPLQDL
PIVLPEDVVMDGVKSPIKADPDWAKTSYNGQPALKETDTFDTFMESSWYY
ARYTSPQYHEGMLDSDEANYWLPVDQYIGGIEHATMHLLYFRFFHKLLRD
AGLVSTDEPTKKLLCQGMVLADAFYYTSPTNERIWVSPTKVMLERDEKGR
ILKATDDEGHELVHAGMTKMSKSKNNGIDPQEMVEKYGADTVRLFMMFAS
PAEMTLEWQESGVEGAKRFLGRLWNLVFEYNKNPVKTAPNPTALSSAQKA
LRRDVHKTIAKVSDDIGRRQTFNTAIAAIMELMNKLTRAPLTDEQDRAVM
GEALSAVVRMLYPITPHICFQLWKDLGNEDIIDFAPWVQADEAAMIDDEK
LVVVQVNGKVRGKITVPADMAEEEIKRVALAEENVQKFLDGLNIVKVIYV
PGKLLSFVAK
>MS0744 lexA, LexA protein
MSAFCTKKQGIYMKPIKALTARQQEVFNFLKHHIETTGMPPTRAEISREL
GFRSPNAAEEYLKALARKGVVEILSGTSRGIRLLVDTEESANDEDAGLPL
IGRVAAGEPILAEQHIEGTYKVDADMFKPQADFLLKVYGQSMKDIGILDG
DLLAVHSTKDVRNGQVIVARIEDEVTVKRLERKGDVVYLHAENEEFKPIV
VNLKEQPNFEIEGIAVGIIRNNAWM
>MS0406 lgt, Lgt protein
MENQFLAFPQFDPIIFSLGPISLRWYGLMYLIGFIFARWLAVKRANRPDS
GWTVEQVDNLLFNGFAGVFLGGRIGYVLFYQWDLFVQEPSYLFRVWEGGM
SFHGGLIGVIVAMLVTAKLQKRNFWVVADFVAPLIPFGLGMGRIGNFIND
ELWGRVTDVPWAVLFPSGGYLPRHPSQLYEFVLEGIVLFCILNWFIRKPR
PAGSVAGLFLLFYGLFRFIVEFFREPDAQLGLYFGQQISMGQILSTPMIL
LGALFIVLAYRRRSAVKN
>MS1766 lig, Lig protein
MTIMDINQQIKQLRDTLRYHEYQYHVLDDPKIPDAEYDRLFHQLKALEQQ
HPELITADSPTQRVGAKPLAGFAQITHELPMLSLDNAFSDEEFNAFVKRI
QDRLIVLPQPLTFCCEPKLDGLAVSIFYVNGVLTQAATRGDGTTGEDITL
NIRTIRNIPLQLLTDNPPARLEVRGEVFMPHEGFNRLNERALEHGEKTFA
NPRNAAAGSLRQLDPKITSRRPLVFNAYSVGIAEGVELPATHYERLQWLK
SVGIPVNSEVQLCDGSEKVLEFYRSMQQKRPTLGYDIDGTVLKINDIGLQ
RELGFISKAPRWAIAYKFPAQEELTRLNDVEFQVGRTGAITPVAKLAPVF
VAGVTVSNATLHNGDEIARLDIAIGDTVVIRRAGDVIPQIIGVLHERRPA
NAQAIVFPTQCPVCGSKIVRIEGEAVARCTGGLFCDAQRKEALKHFVSRR
AMDIDGVGAKLIEQLVDKELIRTPADLFKLDLITLMRLERMGEKSAQNAL
DSLEKAKNTTLARFIFALGIREVGEATALNLANHFKNLDALQAASPEQLQ
EVADVGEVVANRIYVFWREQHNIDAVNDLIAQGIHWETVETKEAGENPFK
GKTVVLTGTLTQMGRNETKDLLQQLGAKAAGSVSAKTHFVIAGDNAGSKL
TKAQELGVAVMSEAEFLAIVNAYKR
>MS1826 lipA, LipA protein
MSTAFKMERGVKYRDAAKTSIIQVKNIDPDQELLQKPSWMKIKLPANSAK
IQSIKNGMRRHGLNSVCEEASCPNLHECFNHGTATFMILGAICTRRCPFC
DVAHGKPLPPDPEEPKKLAETIQDMKLKYVVITSVDRDDLPDRGAGHFAE
CIKEIRKINPNTQIEILVPDFRGRIEQALDKLKDNPPDVFNHNLENVPRL
YRDIRPGADYQWSLKLLREFKALFPHIPTKSGLMVGLGETNEEILNVMQD
LRNNGVTMLTLGQYLQPSRFHLPVARYVPPEEFDEFRTKAEVMGFEHAAC
GPFVRSSYHADLQASGGLVK
>MS1827 lipB, LipB protein
MHFVYSLVLATYLEFYMEQKLIIRQLGIRDYQKTWHEMQEFTDNRTDKSA
DEIWLVQHPSVFTQGQAGKAEHLLRSTAIPVVQSDRGGQITYHGIGQQIM
YVLIDIKRLKTQGRDISVRQLVSALEQSVINTLADYGIESYAKADAPGVY
IDGKKICSLGLRIRRGCSFHGLALNINMDLEPFHSINPCGYAGLEMAQLA
DFVSPQEADCGKVSPKLVEHFVTILGYNKQQIFNIKE
>MS0753 lldP, LldP protein
MAFFLSILPIILLIYLMVKRNAWPSYVALPWIAVCVLIIHLAFFGTNIAI
VSANVTASIIAVQTPITVIFGAILFNRFSEVSGVTNTLRKWLGNINPNPV
AQLMIIGWAFAFMIEGASGFGTPAAIAAPILVGLGFNPIKVAVFALIMNS
VPVSFGAVGTPTWFGFGPLNLNDEQILEIGSMTALIHCFAAFVIPLLGLR
IMVGWQEIRKNILFIYLSVFACVVPYFLIAQFNYEFPALVGGAIGLLISV
LAANRNIGLARVENNLDNNAVSFKEISKALLPTGMLIAILIITRIQQLPF
KAWMNDATTWFAVRIGSLGDFEISRGLIFSLKNIFDTSVSASYKLLYVPA
FIPFIVTVLIAIPLFKVSFRNATDIFGDSLKRSKNPFIALIGALIMVNLM
LVGGEGSMVKTIGKTFAETTGEHWTLFSSYLGAVGAFFSGSNTVSNLTFG
SVQLSTAELTGLSTTLILALQSVGGAMGNMVCINNIVAVSSVLGTNNQEG
NIIKQTVLPMIIYGIIAALVALFVIPLFYNI
>MS0652 lnt, Lnt protein
MNNIFTYIIAILTGAAGVLAFSPFDLWGFAYVSLIGLLFVIKNPQKKTAL
LSAFLWGLSFFSIGVSWLHVSIHQFGGSPLWLSYILVVVLAAYLSLYPLL
FAYIVRRFNVTSLAIFPVIWTFTEFLRGWIFTGFPWLQFGYTQIDSPFYG
IAPLFGVTGLTFFVMWASAVIFSGISTLIQTPKKLPVALVNALLLLSVGG
LAALSSQKIFVKEVPEKALTVTLAQGNIEQNLKWDPQYLYATLDIYRKLI
LEHLATSDLIVLPESALPALENQLQPFYQALQTATQEKGTEVLIGSVYHD
EKSDKLLNSIVSVGNSAQPYQAGNAASAMRYSKHHLVPFGEYVPLENLLR
PLGSVFNLPMSAFQSGDFIQKPLLAKGRALTPAICYEIILGSQLQQNLQP
NTDFLLTVSNDAWFGDSIGPWQHLQMARMRALELGKPLIRATNTGISVFI
NAQGKVISQAPQFEQTALTEKIAPTEGKTPYAALGDKPLYLLAFIFVMLR
VLAIFIKRKVLKSAV
>MS1452 lolA, LolA protein
MQQRLAQVNYFSADFNQNVSSANGKNVQTGSGKLQIKRPNLFRMDNKAPQ
ETQIISDGKTLWFYDPFVEQVTANWVENAVNNTPFVLLTSNDSANWNQYT
VVQNGDTFVLKPKAKNSNIKQFDIRIDNQGVLKGFSTIEKDGQTNLYILR
NITNQPLADSLFKFTVPKGAEFDDQRNKNKK
>MS1534 lolB, LolB protein
MNHLKSFFTALVAGFILTACSSLDISDTRPADVKTIDKSDIQWQQHLKQI
KQIQHYSSQGQIGYISSKERFSSRFEWNYAAPTDYTLKLYSTISSTSLVM
QMHNTGMTISDNKGNRRSEADAKALVREIIGMDVPLEQFAYWLKGQPDEK
ADYQVGENHYLASFTYPLDGTVWSADYLNYHEEKQPALPKDILLKNANQT
LKIRVDNWTF
>MS1844 lon, Lon protein
MSKRIQQKELPVLPLRDVVVFPFMVMPLFVGRAKSIHSLDKAMESGKQLL
LVSQKQAELEDPTIDDIYNVGTIVNIIQLLKLPDGTVKVLVEGQQRANIL
KLTDQDYFSATVTPIETTLGDEKELEVLRNTVLEEFDNYAKQNKKIQPEL
AKALADVGDFDRFADTLAAHLPISVANKQEVLERENVTERLEYLLGTMES
EADLLQVEKRIRNRVKKQMEKSQRDYYLNEQIKAIQKELGDNDNALENDE
IGQLRQKIEDAKMPLEAREKVEAELQKLKMMSPMSAEATVVRSYIDWMIQ
VPWAKRTKVKKDIVKAQEILDADHYGLERVKERILEYLAVQSRLNQIRGP
ILCLVGPPGVGKTSLGQSIANATGRKYVRMALGGVRDEAEIRGHRKTYIG
SLPGKLIQKMAKIGVKNPLFLLDEIDKMASDMRGDPASALLEVLDPEQNS
HFNDHYLEVDYDLSDVMFVATSNSMNIPTPLLDRMEVIRLSGYTEDEKLN
IAQRHLVQKQMERNGLKAGELAIEESAIVDIIRYYTREAGVRGLEREISK
ICRKAVKNLLLNKELKSITVNADNLHDYLGVRRFDFGLADTQNRIGEVTG
LAWTEVGGDLLTIETASVVGKGKLTFTGSLGDVMKESIQAAMTVVRTRAE
KLGIAADFHEKRDIHIHVPDGATPKDGPSAGIAMCTALVSCLTGNPVKSE
VAMTGEISLRGKVLPIGGLKEKLLAAHRGGIKTVIIPKDNVKDLEEIPEN
AKNALTIIPAETIDEVLTVALENPPEGVEFIKAAPLAKVKAPKARKPVSK
RSTSTVN
>MS1195 lonB, LonB protein
MNLSLSSEQLLSWQHLMPTLELADIPEQSISFFDLQPRANSAIQQFLQNS
HRSLLVLKADDQAEYAPLLADYIQSLLPQNSQVKGVNYFIEQADSFSFAR
ISVEPAQSKEDNFAAIKQVGTALYFDENQLFGSLLVHPISKDIQLNAGLV
HQLNGGVLILSVAGLLARFDIWCRLKHILTTQTFDWYSMHPFKPLPCHIP
SYPLELKVVLLGSREELAAFSELETELFGLGDYSELESYFSLEEPVQKVK
WVQYVRTLAKQYDFPSISDKGVERLYQLYVRESEDRAVINISPLMLKNLL
SKAVLVCTGTELSAVDFEKVFQITAMQHSFLRDRTYDDILHEQVFIPTEG
EAIGQINGLSVIEYQGTPTSFGEPSRLSCIVQFGEGEITDVDRKSELAGN
IHSKGMIIAQNCLANILELPSQLPFSASLVFEQSYAEIDGDSASLAAFCA
LTSALADLPVSQSVAITGAIDQFGLVHSVGGVNEKIEGFFAICERRGLTG
EQGVIIPASVIHQLSLSIEVITAVKNRRFFVWAVEDVYQASKILFQRDLT
EEDKSYNGDNEPISRLIARRIEQRTDHLSRGFWGLLFGRK
>MS1985 lpd, Lpd protein
MTKHYDYISIGGGSGGIASINRAAGYGKKCAIIEAKHLGGTCVNVGCVPK
KVMFYGAHIADAINHYAEDYGFDVSVNKFDFAKLVESRQAYIGRIHTSYG
NGLSKNKVDVFNGFARFVDAKTVEVSYEDGSSEQITADHILIATGGRPSI
PNVKGAEFGISSDGVFALNELPKRVAVVGAGYIAVELAGVFNSFGVETHL
FVRQHAPLRNQDPLIVDTLVEVLVQDGIQLHQKAIPQEVVKNADGSLTLK
LEDGRETVVDSLVWAIGREPATDVINLQAAGVETNDRGFIKVDKYQNTNI
PGIYAVGDIIEGGIELTPVAVAAGRRLSERLFNNKPDEHLDYNLVPTVVF
SHPPIGTVGLTEPKAIEKYGAENVKVYTSSFTAMYTAVTQHRQPCRMKLV
CAGADEKIVGLHGIGFGVDEMIQGFAVAIKMGATKADFDNTVAIHPTGSE
EFVTMR
>MS1334 lpd, Lpd protein
MEIFKILTALADFRKEGIPSCLPVTLAGKSACSYIGKNMSKEIKTQVVVL
GAGPAGYSAAFRCADLGLETVLVERYSTLGGVCLNVGCIPSKALLHVAKV
IEDAKHVEHHGIVFGEPTIDLDKVREGKNAVVGRLTGGLAGMAKMRKVTV
VEGLAEFADSHTLVAKDREGNPTTIKFDNAIIAAGSRPVQLPFIPHEDPR
VWDSTDALALREVPKNLLVMGGGIIGLEMGTVYSALGSQIDVVEMFDQVI
PAADKDIVKIFTKRIEQKFNLLLETKVTTVEAKEDGIHVSMEAKDGKVET
RVYDAVLVAIGRTPNGKLIGAEKAGIEVTDRGFINVDKQMRTNVPHIFAI
GDIVGQPMLAHKGVHEGHVAAEVIAGKKHYFDPKVIPSIAYTEPEVAWVG
KTEKECKAENLNYEVATFPWAASGRAIASDCADGMTKLIFDKDSHRILGG
AIVGTNAGELLGEIGLAIEMGCDAEDIALTIHAHPTLHESVGLAAEVFEG
SVTDLPNPKAKKK
>MS1058 lplA, LplA protein
MRKIIMYFIDNKEITDAGINIALETYLVENRLVNEPILLFYINSPSIIIG
RNQNTIAEVNQPYLDEKNIRVVRRMSGGGAVYHDLGNLSFCFIKDDDGSI
GDFAGFTRPVIEALHQLGAKNAKLEGRNDLLIDGKKFSGNAMYAKGGRMT
AHGTILFDSDLEEVSKALKSRKEKIESKGIRSIRKRVTNIKPYLSLEYQH
LTTRQFRDILLLKIFNVTSREQVPEYRLTEEDWQKVYALREQRFANWDWN
YGRSPQFTLEYYHKFPAGLVEYKLNVEQGKIQNIRIFGDFFGLAEIAELE
KALIGIKYEKQAISQIFNHFNIKQYFGNIEPEALTELLVNGIYEE
>MS1288 lppC, LppC protein
MTILLQRAKFKKRLMPILFPLMLAGCTNLFGSNFQDVLRNDANASSEFYM
NKIEQTREVEDQQTYKLLAARVLVTENKTAQAEALLAELTKLTPEQQLDK
SILDALIAAVKRDNDSASALLKTIPLAQLSQSQTSRYYEVQARIAENKTD
IIEAVKARIQMDMALTDVQRKQDNIDKIWALLRSGNKTLINTTQPEGNVA
LAGWLDLTKAYNDNLSQPSQLAQALQNWKTTYPNHSAAYLFPTELKSLSN
FTQTQVNKIALLLPLSGNASILGSTIKSGFDDSRGADKSVQVDVIDTMAM
PVTDAIALAKQNGDGMIVGPLLKDNVDVILSNPTAVQGMNVLALNSTPNA
RAIDKMCYYGLAPEDEAEAAANRMWNDGVRQPIVAVPQSDLGQRTASAFN
VRWQQLAASDADVRYYNQPDDAAYNLTADPAQNQAIYIVVTDSEQLMSIK
GALDNSGVKAKIYTNSRNNSSNNAVEYRLAMEGVTFSDIPFFKDLDGEQY
KKIEAATGGDYSLMRLYAMGADSWLLAHSFNELRQVPGFSLSGLTGKLTA
GPNCNVERDLTWYSYQGGNIVPLN
>MS0461 lpxA, LpxA protein
MIHPSAKIHPTAIVEEGAKIGENVIIGPFCLIGADVDIGKGTVLHSHIVV
KGITRIGEDNQIYQFASIGEANQDLKYNGEPTKTIIGDRNRIRESVTIHR
GTVQGGGVTRIGDDNLFMINSHIAHDCIIKNRCILANNATLAGHVQLDDF
VIVGGMSAIHQFVVVGAHVMLGGGSMVSQDVPPYVMAQGNHARPFGVNIE
GLKRRGFDKPTLHAIRNVYKLIYRSDKTLDEVLPEIEQVAQKDSSISFFV
EFFKRSTRGIIR
>MS0422 lpxB, LpxB protein
MRSIMENLIKNNPTIAIVAGEVSGDILGGGLIKALKVKYPQARFVGIAGK
NMLAESCESLVDIEEIAVMGLVEILKHLPRLLKIRSDIVQKLSALKPDIF
IGIDSPEFNLYVEDRLKAQGIKTIHYVSPSVWAWRQNRIYKIAKATNLVL
AFLPFEKAFYDRFNVPCRFIGHTMADAIPLNPNRTEACKMLNIDENQRYV
AILAGSRGSEVEFLAEPFLQTAQLLKRKYPDLKFLVPLVNEKRRRQFEQV
KAKVAPELDLILLDGHGRQAMIAAQATLLASGTAALECMLCKSPMVVGYR
MKAATYWLAKRLVKTAYISLPNLLADEMLVPEMIQDECTPEKLVEKLSVY
LDETESAVQNRQVLIQRFTELHQLIQCDADSQAAQAVADLLEGKVNG
>MS1659 lpxC, LpxC protein
MIKQRTLKQSIKVTGVGLHSGNKVTLKLRPAPINTGIVYCRTDLTPPVYF
PADATAVRDTMLCTALVNDQGVRISTVEHLNSALAGLGLDNVIIEVDAPE
VPIMDGSASPFVYLLLDAGIEEQDAAKKFIRVKQKIRVEDGDKWAEISPY
NGFRLNFTIDFNHPAISKNLSNYTLEFSAQKFVQQISRARTFAFMKDIEY
LQSQGLALGGSLDNAIVLDNYRVLNEDGLRFKDELVRHKMLDAIGDLFMA
GYNILGDFKAYKSGHGLNNKLLRALLANQEAWEFVTFEDKEKVPQGYAIP
SQVLI
>MS1922 lpxD, LpxD protein
MSVYSLKELAEHIGATSRGNTDVVVDSIAPLDKAQANQLTFISNAKFRPF
LAQSQAGILVVSEADIEFCSANSNLLITKNPYVAYALLAQYMDTTPKAAS
DIASTAVIASSAKLGTNVSIGANAVIEDGVELGDNVVIGAGCFIGKNTKI
GANTQLWANVSIYHEVQIGSDCLIQSGAVIGGDGFGYANERGQWIKIPQT
GSVIIGNHVEIGACTCIDRGALDSTVIEDNVIIDNLCQIAHNVHIGTGTA
VAGGVIMAGSLTVGRYCQIGGASVINGHMEICDQAIVTGMSMILRPITEP
GIYSSGIPAQTNKEWRKTAALTLDIDKMNKRLKALEKKLAD
>MS0933 lpxK, LpxK protein
MLKMKFWYTKSWIAYLLLPFSFLFWLVSQCRRWLFQAGIIKSYRAPVPIV
IVGNLSVGGNGKTPVVIWLVKALQQNGLRVGVISRGYGSQSAVYPLLVTE
KTDPLEGGDEPVLIAQRTQVPVCISANRQQAIELLLQTQPCDVIVSDDGL
QHYKLQRDFEIVVVDAQRGFGNGFVMPAGPLRELPSRLDSVDLVIANGKA
NRYSQTVMTLAADYAVNLVTKEKRLLTEFESGSAFAGIGNPQRFFTMLQG
FGIQLKQTYEFQDHQKFSAELFAKFSKNEPHFMTEKDAVKCFPFARENWW
YVPVEAKITGQSAVNFIENIVERVKNGQ
>MS1270 lrgB, LrgB protein
MIYFYTLLTIAAFMIALLITKRIKSVLLNSFVLTVIILVAVLLAADIPYD
QYMAGNAPLNNLLGVSVVALALPLYEQLHQIAVRWKAILFIVTSASLLSM
FSGALLALALGASADVVATVLPKSVTTPIAMAIAQNIGGVPAVAAVGVVV
AGLQGSVFGYLVLKKLQLKNSEAIGLAVGSVSHALGTVSCLEVDAKAGNY
SSISLVLCGIISSLLAPLVFKLVSFCM
>MS1455 lrp, Lrp protein
MVNFMEKKLPKALDSIDIKILNELQRNGKISNIDLSKKVGLSPTPCLERV
KRLEKQGVIMGYKALLNPELLNSPLLVIVEITLIRGKPDVFEEFNAAVQE
LDEIQECHLVSGDFDYLLKTRVADMAAYRKLLGTTLLRLPGVNDTRTYVV
MEEVKQTNFLQLK
>MS0035 lrp, Lrp protein
MYAIDSLDQQILRVLTKDARTPYAEMAKNFGVSPGTIHVRVEKMRQSGII
EGTKVRIDERKLGYDVCCFIGIILKSAKDYDKVIKQLEGFDEVVEAYYTT
GNYSIFIKVMTHTIAELHSVLATKIQLIEEIQSTETLISMQNPILRDIKP
>MS1750 lspA, LspA protein
MMTKSKTGLSFLWLSAVVFFIDLLTKYIVTQNFELYESVNILPIFNLTYA
RNTGAAFSFLAEHGGWQKYFFIVLALAVSAVLVHLLRKNSARQKLQNSAY
ALIIGGALANMADRAYNGFVVDFFDFFWREWHYPVFNVADIAICVGVGLL
ILDSFKNGEKKADKQ
>MS0419 luxS, LuxS protein
MPLLDSFKVDHTVMKAPAVRVAKIMRTPKGDDITVFDLRFCVPNKEILSP
KGIHTLEHLFAGFMREHLNGDSVEIIDISPMGCRTGFYMSLIGTPNEQQV
ADAWLASMRDVLTVQDQSTIPELNIYQCGTYTEHSLADAHETARHVIEKG
IAINKNEDLLLDEKLLNL
>MS2084 lysA, LysA protein
MDFFQYKNNKLYAEDLLVSELAEQFGTPLYIYSRATLERHWKAFDSALGD
HPHLVCFAVKSNPNIAILQVMAKLGAGFDIVSQGELERVIAAGGDPHKVV
FSGVAKNEKEIARALELDIRCFNVESLAELQRINEVAGKSGKIAPISLRV
NPDVDAHTHPYISTGLKENKFGVSVDEAREVYRLASRLPNIKVTGMDCHI
GSQLTEIQPFLDATDRLILLLEQLREDGIELEHLDLGGGLGVTYSDETPP
HPSEYATALLNKLKQYTNLEIIMEPGRAISANSGILVTKVEYLKSNETHN
FAIVDAGMNDMIRPALYQAYMNIIEADRTLNRESKIYDVVGPICETSDFL
GKQRRLAIAPGDYLVQRSAGAYGASMSSTYNSRPLTAEVMVDGSQAHLIR
RRAELTELWALESLLP
>MS1613 lysC, LysC protein
MANLSVAKFGGTSVANYAAMTACAKIVIADPNTRVVVLSASAGVTNLLVA
LANGCEATQRAKLLAEVRQIQENILNELKDAGTVRLEIEELLTNIEYLAE
AASLATSSALTDELISHGEMMSTKIFVQVLRELNAQATWVDVRTVVATNS
NFGKAAPDDEQTQKNSDNVLKPLIDRGELVITQGFIGRDPNGKTTTLGRG
GSDYSAALIAEVLNAKDVLIWTDVAGIYSTDPRIVPNAQRIDTMSFAEAA
EMATFGAKVLHPATLLPAVRSNIPVYVGSSKAPEQGGTWVTRDPQPRPTF
RAIALRRDQTLLTLSSLNMLHAQGFLANVFNILAKHKISVDTITTSEVSV
ALTLDKTGSASSGAELLSSDLLNELSEVCTVKVDTGLALVALIGNDLHLS
AGIAKRIFGTIEEYNIRMISYGASTNNICTLVHSAHADDVVRALHKELFE
>MS1703 lysC, LysC protein
MRVLKFGGTSLANPERFLQAARLIEKAHLEEQAAAVLSAPAKITNHLVAL
SEKASLNQPTETNFNEALDIFYNIINGLHEKNNNFDLKGTSQLIESEFNQ
LAELLEQIRQAGKVEDAVKATIDCRGEKLSIAMMKAWFEACGYEVTVINP
VEKLLAYGNYLESSVDIEESAKRVDVASIPKNNVVLMAGFTAGNEKGELV
LLGRNGSDYSAACLAACLNASACEIWTDVDGVFTCDPRLVPDARLLPSLS
YREAMELSYFGAKVIHPRTIGPLVRSNIPCLIKNTGNPTAPGSIIDGNEP
QSGELQVKGITNLDNVAMFNVSGPGMQGMVGMAARVFSTMSKAGVSVILI
TQSSSEYSISFCVPSKLAAKAKDALNTEFAKELLDKDLEPVEVIEDLSII
SVVGDGMKQAKGIAARFFSALSQANISIVAIAQGSSERSISAVVAQNKAI
EAVKSTHQALFNNKKSVDMFLVGVGGVGGELIEQIKQQKEYLAKKDIEIR
VCALANSNKMLLNENGLSLDNWKEDLSNATQPSDFDVLLSFIKLHHVVNP
VFVDCTTAESVSGLYARALSEGFHVVTPNKKANTREMAYYNLVRENARKN
QRKFLYDTNVGAGLPVIENLQNLLAAGDEVERFNGILSGSLSFIFGKLEE
GLTLSQATALAREKGFTEPDPRDDLSGQDVARKLLILARESGLELELSDV
EVESVLPKGFSEGKSAVEFMEILPQLDAEFAARVEKAGAQNKVLRYVGQI
NDGKCKVSIVEVDADDPLYKVKNGENALAFYTRYYQPIPLLLRGYGAGNA
VTAAGIFADILRTLRN
>MS0763 lysR, LysR protein
MKPIFLELRHLKTLLALKETGSVSLAAKRVYLTQSALSHQIKLLEEQYGL
PLFERKSNPLRFTAAGDRLLQLANDILPKVVAAERDLSRVKQGEAGELRI
AVECHTCFDWLMPAMDSFRQHWPLVELDIVSGFHTDTVGLLLTHRADWAV
VSEVEETDGIVHKPLFSYEMVGLCAKDHPLAHKEIWEAEDFADQTWITYP
VPDDMLDLLRQVLKPAGINPVRRTSELTIAIIQLVASKRGVAALPFWAAK
PYLDRGYVVARKITQNGLYSNLYAAYREEDANSAYLEDFYETVKSQSFST
LPGLSVLE
>MS1097 lysR, LysR protein
MDKLNAISVFCRIIESQSFTQAAALENISVAMASKLVAQLEEHLKTRLLQ
RTTRKIVPTEAGLVYYQRCQPILLELKEADSSISDLSTSLQGNLVVSVPM
DFGLKFITPTLPAFISANPNLHVEMEFSDRRVDLMAEGYDLALRIGSLQD
STLVAKKLATTSMHFAASAEYLRRYGTPRKPEDLQYHQCLLYKAIGNQIY
WEFANKGKIQRVKMRSKMVCNNGLTLVQLAKADLGIINSPRFLVEEELAS
GELIEVLPEFKQQLLDIHAVYPHRRHLAAKVKAFVEFLSGLNLGSET
>MS0154 lysR, LysR protein
MNIRDLEYLAALAEYKHFRRAADACHVSQPTLSGQIRKLEDELGITLLER
TSRKVLFTQSGLILVEQAKKVLREVKLLKEMASNQGKEMTGPLHLGVIPT
VGPYLLPYIMPALKEAFPDLELYLYEAQTSHLLDQLESGRLDCAILATVP
ETEPFIEVPIFNERMLLAVSEQHPWAKEKSIKMHALQGHEVLMLDDGHCL
RDQALGYCFTAGARENSHFQATSLETLRNMIAANAGMTLMPELAMLNEGT
RAGVKYIPCTDPEPKRTIALVYRPGSPLRSRYERVANAVGDAVKAILHTE
GD
>MS2152 lysR, LysR protein
MNDKFSGIEEFLMTVEMGSFSAAAERLNLTGSAVGKSISRLEQRLNTQLF
HRSTRKITLTREGEVWLASCRRMMEELEQAKLLLSSQSQQIIGEIRIDLP
TTYGRSHILPKLLAIQADYPKLYLNISFQDRKVDMIAEHIDIAVRFGELA
DLTDIIAKQIDCFQNQLCATPAFVSKWGKLNHPDDLTHFPCIVGNQISWR
LMNEQGKSTGFPLNVQHQINDGDARLQAVLADCGIAFLPDWLIQPAVEAG
KLVQLLPEFTPPPEPIYVLWQKKLHLQPKVKAIVNSLV
>MS1403 lysR, LysR protein
MRELRNLDLNLLKAFDVLMDEKSVSKAAQRLSVTQPAMSGILQRLRDSFN
DPLFVRVQRGIVPTNRALELRQPIKQLLQSAEQLLQPKIFDPQTAELTLT
IACTDYALRAVISPFLAVLKQRAPKIKVAILAINEQNLQSQLEQGVVDFG
LVTPDFSAPDIHSKDLYQEQYVCALRKDHPVAQQGSISLEQFCRLEQALV
SYQGGSFSGATDKALAKLGLTRNVTVSVQNFIVMPEFLANSDLLAVVPKR
LVENLANIHYFEPPLQIDGFTKTLVWHERTHRDPAYRWLRELMAEVC
>MS0336 lysR, LysR protein
MLKDKKTWPLIEDLNVFLTIIRKNSFSGAAKELGQSNSYITKRINILEDH
LHTSLFYRNTRNIKLTAAGEYVQNQAIAIIDKMDSLMTNIVEDKKSMFGH
LHICSSFGFGRTHLAKPISLFAKQHPNLSLDLTLTDHKLDLIKENIDLEI
AVGNDLNDRYFAKKLANNRRILCASPDYLQSYGLPKKVEQLSKHNCLFLK
EKNSSFGVWKLFNGKILKSITVNGGLTTNNGEVILQWALEGHGIIYRSLW
DAEKYLISGELVHILPEYYEDAPIWVVYPNKLSESLKTEIFVNFLTEYFA
KKELTKSHDE
>MS0044 lysR, LysR protein
MRKKPMEFNELKLFLHLAESQNFSRSAAQNHMSTSTLSRQIQRMEDELGE
PLFLRDNRRVQLTECGEKFKIFAQQSWNQWQHFKQQIHHNENELNGELKV
FCSVTAAYSHLPQVLEKFRLRYPKVEIKLMTGDPALALHQVQSQQVDLSL
SGRPLHLPNSIKFHYIDDISLSLIAPRIACPATQLLQHSPIDWQRIPFIL
PVEGPARQRIDQWFRQQKIKHPKIYATVAGHEGIVSMVALGCGLALLPDV
VIKNSPMNSQVSSLTLDIPVYPFELGVCVQKKSLELPLIKAFWDSLQTEN
AG
>MS2008 lysR, LysR protein
MHITHCLRTLKALKLQNNVHLCTIKPKEQRRETMNLDWSDIHYFVLMVEK
QTLKATAEALQVEHSTVSRRIERLEKQLNVHLFDRINKRYLLTADGQRLY
TEAKKLQFNVRQFVQAAQDSLQEMTNVLVSMPPMIAHALVSPHLAAFQQR
FPAIRLVLSSNTAISSLHQRQADIALRLVVPQQNDLVVRRLRDMQYGWFA
HADYVKNTPESQWQYIDFGVTGPHTPWLNKQLADKSIGFVCNDFAVMQSA
VMQKLGIGWLPFEYGNSSEFIQVHTSEIFIGQLHLVMHEDVRHAQKVRDV
ADFLIEILRE
>MS0884 lysR, LysR protein
MMLDKVEAVRYFCIAAETLHFRETANRLAISPQVVTRMIAELERELGEPL
FKRNTRNISLTDFGQAFLADAQQWLKATETLFQTDFKESMSGTVRITLPR
LPNNDVILTELLTALSPYPDLHIDWRPDTALYNSITRQIDIGIRISLEME
PHFIAKKITHIKERIVASTALLNRLGQPRDLDDLQNRFPLCAEINPQTGK
AWHWFNTAEQSFVAKKPYFMSSESYSNLAVILKGLAIGVLPDYYYLPHVQ
TGKLKILFPDLPIPEWKMFLYRPYQENTPLRVIHVFGLLEKILVKHYHTT
G
>MS2176 lysR, LysR protein
MQFYSITVPKIAFSYLFHKMKTAKSFIPENEIMNRLDALKYFIVAAETLS
FKSTASRFSVSPQVITRVISELEGELGEQLFKRNTRAIRITDFGSRFLAD
AIAFLQQEERLFGGVKTAEESLSGLVRITLPPSDYADKILLRLLTALAPY
PDIQIDWRTDFDTLKAVDDQIDIGIRISRTPEDHWVAKKITDLQEPIVAA
PSLIAKTGLPKDVFDLAANFPVGYILNPKTGKVWDWMMGEQPIILTKPTV
ITSDIKSLLPAVLSGRIFAPIMYHDCKSYLDSGELQVVFSNEETLIWGIY
LYRPYQTITPKRVLLVFELLEKILEEGF
>MS1210 lysR, LysR protein
MMNYAAMLHNLPNLNELYFFVQIANAGSFTKAAERLGVTTSALSQNMRSL
EKHLDVRLFNRTTRSISTTEAGEKLLAEIAPHFLAIADAVRHLDEIRDEP
QGTIRINTSEIAANLIIYPKLQPFLLANPHIKVELVIDNRWVDIVAQGFD
MGVRLGYAVFNDMIAVQISEPMKMVLVASPGYLKDKPLPKKINDLTNYHL
IGSRFSSEHSQLEWEFMDKGQKVGFQPMPQFSINNDLRTQAALDGFGIAW
LPEIRVHEELKNGNLVEILPQYAYTYDPFYIYYPNRKGNSKAFQMVVELL
KFKK
>MS1415 lysR, LysR protein
MHSSIYGYLTVFHTIAAEGSIAGAARKLQMASPSISQSLKLLEQHIGLPL
FNRTTRKMELTEAGHHLLASTQDAIAQLSVAVESVQDLSGVPKGVVRMTV
PHVGYWLIIEPHLAEFCERYPDIQLEISINDGTVDILKEGFDLGIRFGDK
VDEQMVAKKLTAPFRLGLYASSAYQQQFGLPKKIAELKNHRLVGFRFATS
NRIFPLSLNDKGEEVSVEMPTPIVANSLIVAKDVIKSGIALGRFFEPLMS
KQADRAAFIPVLEKHWKTFGALYLYYMQHSQKAGRVRAVIEFFTEKAQVE
KK
>MS2130 lysR, LysR protein
MLNKFDALRYFCVAAETLNFRETANRLSVSPSVITRVVNELEAELGEQLF
KRHTRSIKLTSFGEQFLLRAQHLLAESETLFKMGKNQADDLAGIVRITVP
SWRNNDEIIRQLLITLESYPEIIIDWREDMGKLDMVEDRIDMGLRIGLEP
DQDFVVRKITEIGDVLVASPALVKKLGQPTDLTDFERRYPMAIPINSNTG
KPWTLFLNEDITLNPKNPAFYSVDNYSALQAVLLGKCAGLINDFMVKPYL
EFGELIQLFPEIQIDKWQLFLYRPYQTVTPARVLKVFDLLTEILRKTYY
>MS1395 lysR, LysR protein
MKENLNDLRAFLVVARTGSFTKAGAQMGVSQSALSHSIRGIEERLNIKLF
HRTTRSISTTEAGEQLYQRLSPLFDDIDNELNELSEFRNAVTGTLRINGN
EHAFYYALGDKFVRFSQKYPEVNLELVAENRFIDIVAERFDAGIRLGSDV
AKDMIAVRLTDKLPMCCVASPEYLANYGTPKTPYDLTEHQCLLHRLSNGG
VMNWEFIDPKSKGRILKVQPQGTISANGGRVLENYARSGLGILWCPLDMV
EEDIRSGKLIRILQQWDMDYDGYHLYYPNRRQNSPLFKALVEELRLVK
>MS2143 lysR, LysR protein
MPEMKKTDRFNHLISFTHAARFGSFSAAAEALDLTPAAVSKNVALLEQAL
NVRLFNRTTRSLSLTEEGQVFYAESKKALALLEEAVNQITLAESQEIAGN
VRISMPNVVGRNLVFPLLKSFNEDYPKIHLELDFDNKAIDFVKAGFDFVL
RVGESSEGSLVARHIGMIQTCLVASPAYLKSQGVPKNMADLPQHQLLMTR
LPNGKLQPWTFNEQGDNVHFLHAQPHLVLTDAEMQTQAAVQGFGITQLPV
YLALPYLQNGELVTILNDSYQPLKLSLNILFPHRTLLAQRVRTTMDYLLE
QLKQHEGLRMTQEELKAFSFK
>MS1039 lysR, LysR protein
MKIQQLRYIVEIVNQNLNVTEAANALYTSQPGISKQVRLLEDELGLEIFE
RNGKHIKTVTPAGKKIVAIARELLVKTQAIKAVANEFTQPNHGVLRIATS
NTQARYMLPAVIERFSKQYPNVSLHVHQGSPNQLYDALLSSEVDLAITTE
AQYLFDDVVLLPCYMWNRSIIVKADHPLAKLSHVTIEDLGKYPLITYTFG
FTGVSDLDQAFNSAGILPNIVFTATDADVIKTYVRLGLGVGIIASMAHTD
ADTDLIRIDASHLFKSSMTQIAFKHSTFLRNYMYDFINYFSPHLTRAKVE
KAERARDNTAVQKLFEGIDLEVR
>MS0895 lysR, LysR protein
MERKMFKRLPPLNSLKAFESAARFLSFTKAADELCVTQAAVSHQIKLLED
FLNIRLFIRKNRSLELTELGKNYFQEISPILQKLADVTEKLKSTDNPHLT
ISVLQSFGINWLVPRLNRFNQLYPNIEVRIKSAEQDEGILGNDIDVAIYY
GYGNWDNLKTEKLSEDNLLILASPKLLANNPVNSKDDLKHHTLIHVHTRD
NWQNMATELGISDLNIHIGPLFSHTFMALQAAVHGQGIVLANSILAQQEI
DNGNLQVVLPYELKDPKSFYVVSDTNRTNDQNISAFRQWIMQEMKYN
>MS2151 lysR, LysR protein
MNSTEYGQLLIFQAIAKEGSISACARALRISVPAVSKALRQLENRLGVPL
FQRSTRKIQLTETGVQLLEQTVQAVDTLSQAFENAKTLAKTPTGTVRITV
SQVAFSLILQPVYAEFRERYPHIVLDISINNATVNLIDEQFDLGIRFGNH
LEEGIVARRLTGEIREGLFISPQYAQKFGTPKTLADLAHHQLIGYRFITA
NRFHPLTLMENGQPHTIEMPMSLILNDSEMAIDAIRQGFGIGRIFEAQYE
RLESKIDLLPVLKKHWQTLQPMYLYYQPKSQKVKRVQVLIEFLQEKMEVL
GW
>MS2134 lysR, LysR protein
MQKWKDNMKEISLDDMRLFVSVVQSGSLSHAGELTGIPVSRLSRRLTQLE
QALGTQLLNRGKKGVSLNELGERFFEHSQQMLQQAELAIESVQKSLENPS
GLLRISVAADIFYLFIQPYLATYLNENPQVNLEINLSNQKINMIQDGVDL
AIRTGVIDNENVVARLWKKMEFGVFASQAYLAKYSEPQSPNDLYQHHIIS
QMYTLPWRFQQGNQEVAVFPHSRLTCNDFAIVEQQLKQHSGIGILPITKN
HNRSDLIRILADWQLQSVPVSLIYYRNRGAIATVRSFVEFLQRLV
>MS2092 lysR, LysR protein
MNKLDALKFFITAAETLNFREAAVKLAISPSVVTRTIAELENQLGEPLFK
RSTRSITLTSFGELFLPKAKRLLEDSDTLFQTAKDDNEMKGVVRITLFRL
PNHEQILFELLTALRPYPELFIDWRLDMMRLDTVEHRIDIGIRVGREPNP
NFIIKPIAKVQHIFVAAPDLLERLGAPKDFEDLRQRYPFSGLINPETGKV
WEFMLDGVNTFLPRHLEFFSTDPDTQIQAALAGRAVVQASDLACKEYLAN
GRLVKVLPQIQQEKWQLYLYRPYQTITPKRVMKVFEVLEGVLRKYLG
>MS1689 lysR, LysR protein
MQSSIYGYLTYFHEIVIEGSIAGAARKLEVAPPAVSNALKLLERHLGLPL
FTRTTRKMELTEAGQRLFESTKDMLRGLDSVMESVRDLTEKPSGLVRITT
SIISYLLVIRPHFAEFCERYPDIRLEISVNDGIVDIVKEGFDVGMRFGDR
LEQNVVAKKLLDPVRLGLYASESYLRKYGKPETLEDLSQHKLLGYRFVTA
NRTYPLTFNQDGREISIDMPYSVLTNNLTVELDTVRQGVALGQLFEPVVN
ALHDRKNFIPVLDAHWTQYPALYLFYMQHSQKAGKVRALIDFLEEKIKG
>MS2116 lysR, LysR protein
MNTKNTSVYALKLFLQVLELGSLSEVARRENLSASMLSRLIKQLEDDWGA
ALFYRNTRAITPTETGLLLAEYARQIVSQFQAAEQAITAQTAEIAGTVRI
NAPVFFGQLHIIPHLAELQARYPNLIVNLVQTDDYIDPFTDSTDIIFRLA
PLNDSSLKVRILAQQHFCLAASPSYLQKYGTPKIPADLAKHHALLYKGKT
GTLRWLLQEGENWQACSPKIALTSNNGNAIATACVQGMGIALLANWAASD
LLKEGKVVRLLPEYNFSTQTVPVYVAMLYPQTAFISPSVRAVLDYFREIF
QDKSW
>MS2006 lysR, LysR protein
MNLDWNDLHYFVLLVEKETLTAAANALDVEHGTVSRRIERLEKQLGLHLF
NRINKRYLLTDDGRDLYAEAKKLQLNIKQFAQTAQDKCQSMGEVTVSAPP
FVANSLITPLLAHFYRRFRHIRLILNSDSGLSNLHRSQADIALRIAQPKQ
DDLVAHRLMNVEYRWFAHRDYLACTPESERQFLSLNLTGTHQQWLQTQLT
GKSVRFACNDFNIMKSAVLQQLGVGLLPVCYIDSPDLAAVKNMEYFRAPL
YLVMHEDVRQSQKVRMAADFLIENLRD
>MS1543 lysU, LysU protein
MSEQQNAELDFHGEMAVRREKLAALRAKGNAFPNTFRRDALAQDLHNQYD
ETDGEQLKEKDLHVAVAGRIMTRRTMGKATFITIQDMSGKIQLYVARDNL
PEGVYGEDVKSWDLGDIVGIKGTLFKTKTNELTVKAHEVQLLTKALRPLP
DKFHGLSDQETRYRQRYLDLISNEESRRTFVIRSKVIAGIREYFIGKGFI
EVETPMLQVIPGGAAARPFVTHHNALDIDMYLRIAPELYLKRLVVGGFER
VFELNRNFRNEGVSVRHNPEFTMIEYYQAYADYHDLMDNTEELLRKLALD
ILGTTIVPYGEYEFDFGKPFERITMHDAVIKYGAEKGIVKEDLYDLDRAK
AAAAKLGIEIQKSWGLGSIVNAIFEEVAEHHLIQPTFLMAHPAEISPLAR
RNDENPEVTDRFELFIGGREIGNGFSELNDAEDQAERFDAQVAAKDAGDD
EAMFKDDDFVTALEHGLPPTAGEGLGIDRLAMLFANAPSIRDVILFPAMK
HKG
>MS1749 lytB, LytB protein
MKIILANPRGFCAGVDRAISIVELALEIHGAPIYVRHEVVHNRFVVNGLR
ERGAVFVEELNEVPDGAIVIFSAHGVSQAVRQEAKNRNLKVFDATCPLVT
KVHMQVARASRKGTKAILIGHEGHPEVQGTMGQYDNPEGGIFLVENVEDI
AKLGLKDNEELTFMTQTTLSIDDTSDVIVALKAKYPAIQGPRKNDICYAT
TNRQQAVRELAEQSDLVIVVGSKNSSNSNRLAELASRMGVPAKLIDDSND
IEPDWLKGINTIGVTAGASAPEVLVQSVIARLKELGVDSVEELEGCEENT
VFEVPKELRIKEVG
>MS0924 mET2, MET2 protein
MDSSMSAQQVTLFTEQPLDLIFGGRLGQIDVAYQTYGTLNEDKSNAVLIC
HALTGDAEPYLSPVENQAGGWWQSFMGEGLALDTSRYFFICSNVLGGCKG
TTGPASINPKTNKPYGSQFPKVTVQDIVRLQKALISHLNIPHLHAVIGGS
FGGMQATQWAIYYPDFVDKVVNLCSSLTFSAEAIGFNHVMRQAIINDPNF
NNGDYYEGEPPENGLSIARMLGMLTYRTDLQLAKAFGRATKNEGHYWGDY
FQVESYLSYQGQKFLGRFDANSYLHLLRALDIYDPSIGFDNIKEALSRIK
AHYTLVAVTNDQLFKLTDLHKSKTLLEQAGVPLDYYEFPSDYGHDAFLVD
YDTFEPKIRSGLE
>MS0553 mHT1, MHT1 protein
MPITILDGGMSRELMRRNAPFRQPEWSACALYEEPSAVQAVHEDFIAHGA
EVITTDSYAVVPFHIGEQRFHTDGKTLADLAGRLAKSAVKNSGVLTTKIA
GSLPPMFGSYRADLIQPERFAEIAQPLIDGLSPYVDIWLCETQSAIIEPV
SIKALLPKDDRPFWVSFTLTDDELTCEPQLRSGETVKSAVEKMVDLGVDA
ILFNCCQPEVIGEALAVTTATLTALNATHIQTGAYANAFAPQPKDATAND
GLDEVRKDLDPPAYLAWAKKWTAQGASIIGGCCGIGVEYIETLAKNLK
>MS1293 mMT1, MMT1 protein
MGFNVVSRSRQIINVSFISIFTNIILVAVKVTIGFFTNSLAVMLDALNNL
SDSLSSLVTIVGTKLATRAPDKKHPYGYGRIEYITAIVIGAIIFLAGATA
LKESIAKIITPEETNYNITAIVIISIGIVVKYGLGRFVKNSGEELGSQAL
IASGTEALFDSVLAIGTLFCAILSFFWNITIDAFLGAVIALGIMKSGFDR
LKETLDNIIGVRADEALTSKIKCHIRRYQGVVGAYDLILHNYGPTEIIGS
VHIEVPDDMTAREIHRLTRSIKADIMRDFSIEMTIGVYAANDSEPYVAKL
KADLLRILHSYQEILEFHAFYVDRELKQVTFDLIFDFETKNVESLKEEII
QRIKKDYSEFEFSIVVDPDFSSTGLVTACA
>MS0376 mMT1, MMT1 protein
MSKQYSTLVKRASLLAVFTAVTLIVVKAFAWWQTGSVSMLASITDSTLDL
LASFMSLLILRFALMPADHNHSFGHGKAESLASLAQGAFIIGSALLLLLH
AFQRLGEPKVIQQTGLGITVTMFSILLTFILVAYQNKVIKLTDSPAIKAD
QLHYQTDLLMNAAIMLSLLLGSLDFIWADAVFAILIAVYILVNGGKMCFD
AVQLLLDLALPEQEIEQIERLIREDPNIIGFHDLRTRRAGEVRFIQMHLE
LSDDLSFVQAHAITDSLETRLKQAFPRVEIVIHHEPTSVVLAEQKAK
>MS2068 malE, MalE protein
MKNKCVKLTLTAIAGLVLSTSVMAKMTEGKLVIWINGDKGYNGLAEVGKK
FEKDTGVQVLVEHPDRLEEKFAQVASTGDGPDIMFWAHDRFGGYAQAGLL
SEVSVSKEFKDKFVDFAWDAETYNGKIIGYPVAIEAISLIYNKDLVKEAP
KSWEEILELDKKLKKEGKNAIMWNLSEPYFTWPVAASNGAYAFKYKDGKY
DVKDIGVNNEGAVKALQFVVDMVKNKNISADMDYAVAEASFNKGQTALTI
NGPWSWGNIDKSGVKYGVAVLPTLNGQASKPFVGVLSAGVNSASPNKDLA
KEFLENYLLTDEGLDTVNKDKPLGAVALKSYQEKLAADPRIAATMENAKN
GEIMPNIPQMTSFWYAEKSAINNAVTGRQTVKAALDDAHARIQKQQ
>MS2069 malF, MalF protein
MSTLTQPKSTHWFKYLIAGIVLLFDFYLVGLMYLQGEYLFAILTLVILTS
GVYVFTNKNAYAWRYVYPGIMGMTIFILFPLVATIAIAFTNYSGSNQLSF
ERALSVLTEQRYFAGDKYNFKLYPQADNQYKIVLTNPATAQTFVSESIAL
KAADVPVSEQAEPTGEIAPLRIITQNRSALQAMKVILPNDNELTMSSLRQ
FSEQKARYQFDKENNILRNNENGKLYKANDETGFFQAVNESGDWLSETLE
PGYTVGSGFHNFVKIFTDKGIQKPFVQIFIWTVMFSLLTVVFTVILGMVL
ACLVQWEALKGKAIYRLLLILPYAVPSFISILIFKGLFNQSFGEINMILN
QLFGISPEWFNDPFLAKAMILIVNTWLGYPYMMILCMGLLKAIPSDLYEA
SAMDGASTWQNFSKITFPLLLKPLTPLMIASFAFNFNNFVLIQLLTNGRP
DMIGTTTPAGYTDLLVSYTYRIAFEGSGTQDFGLAAAIATIIFLLVGGLA
LLNIKATKMEL
>MS2070 malG, MalG protein
MAIVQSKSVRYRVWATHLILISFLALIIFPLLMVIGISLRPGNLAIGDII
PSQISWEHWQAALGFEVTHADGTVTPPPFPVLRWLWNSIKVATITSIGIV
TLSTTCAYAFARMKFKGKKTILQGMLIFQMFPAVLSLVALYALFDRLGQY
VPFLGLNTHGGVIFAYLGGIALHVWTIKGYFETIDGSLEEAAALDGATPW
QAFRLILLPLSVPILAVVFILSFIAAITEVPVASLLLRDVNSYTLAVGMQ
QYLYPQNYLWGDFAAAAVLSAIPITLVFLLAQRWLIGGLTAGGVKG
>MS1587 malK, MalK protein
MLSHHKNKNGGAYPTLYRQYNIMTNQNDNFLVLKNINKTFGKSVVIDDLD
LVIKRGTMVTLLGPSGCGKTTILRLVAGLENPTSGQIFIDGEDVTKSSIQ
NRDICIVFQSYALFPHMSIGDNVGYGLRMQNIAKEERKQRIREALELVDL
AGFEDRFVDQISGGQQQRVALARALVLKPKVLLFDEPLSNLDANLRRSMR
EKIRELQQSLSITSLYVTHDQTEAFAVSDEVIVMNKGKIVQKAPAKELYQ
QPNSLFLANFMGESSIFNGQLQGNQVTLNGYQFTLPNAQQFNLPNGDCLV
GIRPEAVTLKETGEPSQQCSIKTAVYMGNHWEIVADWAGQDLLINANPEV
FNPEQKQAYVHLSSHGVFLLKKE
>MS0812 malK, MalK protein
MENIVQSKPIIELRSLKKSYNENTIIDNFNLTINNGEFLTILGPSGCGKT
TVLRLIAGFEEANGGQIILDGEDVTDLPAEHRPVNTVFQSYALFPHMTIF
ENVAFGLRMQKVPNEEIKPRVLEALRMVQLEEMADRKPTQLSGGQQQRIA
IARAVVNKPKVLLLDESLSALDYKLRKQMQNELKALQRKLGITFIFVTHD
QEEALTMSDRIIVLRKGNIEQDGSPREIYEEPSNLFVAKFIGEINIFDAQ
VLNRVDEKRVRANVEGRVCDIYTDLAVKEGQKLKVLLRPEDVQLEELDEN
EQSSAIIGHIRERNYKGMTLESTVELEHNNKLVLVSEFFNEDDPNIDHSL
DQRVGVTWIEKWEVVLNDENDNA
>MS0584 malK, MalK protein
MEQTDMAKLEIKNITKKFGDFYAANNISFTAEEGEFVTLLGPSGCGKTSL
LKLIAGFHIADEGEILIGGKNVNEIPPEKRNTAMCFQSYALFPHLNVSHN
ICYGLKQRKIDINEQKQRLDLAIKQMDLEIHRLKLPNELSGGQQQRVALA
RAMVTRPDVILFDEPLSNLDAKLRESVRFEIKQLSKQYNLTSIYVTHDQA
EALSMSDKIIVLNKGKIEQIGSPQEIYHHPINRFVADFIGIANITEAHVK
EMENNLYEVNSIYGNFTVYSEIKPQSDHIYICFRPEDIEIVPASENKENM
LTVDVTHTAFMGNITEIQALIRKDDKEQKLRLQLTKFPQLTENYQLSFCV
PRDAIKFLESVK
>MS1524 malK, MalK protein
MIKLERVYFNYKTMPMNFNIHIKPQERVAIIGASGAGKSTLLNLIAGFER
ADDGEIWLNGVNHTYTEPYERPVSMLFQENNLFTHLTVEQNIALGLKPDL
KLSAAEQSLVRQTASAVGLSRFLDRKPTALSGGQKQRVALARCLLRDKPI
LLLDEPFSALDPALRAEMLDLLSQLCNEKKLTLLIVTHQPSELQGRIDRI
LTVENGHFAKNNDLK
>MS2067 malK, MalK protein
MANVSLRNVGKSYGDVHISKDINLEINEGEFVVFVGPSGCGKSTLLRMIA
GLEDITTGELYIGEKLMNDVEPSKRGIGMVFQSYALYPHLDVADNMSFGL
KLAGVKKNERDQRVNQVAEILQLAHLLDRKPKALSGGQRQRVAIGRTLVS
QPEVFLLDEPLSNLDAALRVQMRVEISKLHKKLNRTMIYVTHDQVEAMTL
ADKIVVLNAGGVAQVGKPLELYHYPANRFVAGFIGSPKMNFLPVKVTAVE
ENQVKIELPDANHHNFWIPVSGEGVNIGENLSLGIRPEHLVPAEQAQVSL
RGIVQVVELLGNETQIHLEIPEIKQPSLIYRQNDVILVNEGDEMNIGIVP
ERCHLFKEDGTACQRLFAEKGV
>MS2074 malQ, MalQ protein
MTITTKQFRRAGIMPYFFDERGVKKWAPHNIKKALFNTFEGNIQASSTPI
PAVKIFYQNRPHFLPINSADKKHPLKGRWQLQLENTQTVISGPIKTRGIN
LPKDLPLGYHQLQLQSAGKVFNCTVIVAPQSCYQPQALREHKKLWGTFLQ
LYTLKSEQNWGIGDFGDLKKFLQNLAPFQADFLGLNPIHALFPANPDSAS
PYSPSSRQWLNIAYIDVNQLAEFQQSDEAQAWFNSAEVQQKLTELRQAEW
LNYGEIIPLKLKGLRFAFKKFQQNPTALSSRQFAQFVQQGGESLQVQATF
DALHQYLADRFDNQWGWDFWAKEYQDYHSLAVQQFRAEHQAEIEFYAWLQ
FIADQQLAECDEVCRQQNMRIGMYRDLAVGVTGNGAETWNDKRLYCLNAS
VGAPPDVLGPQGQNWGLTPMNPHVLQQQAYAPFIELLRANMKHCGALRID
HIMSLLRLWWIPKGDSAVNGAYVRYPVDDLIAILALESQRHQCLIIGEDL
GTVPKEIVGKLKNAGILSYKIFYFEFDEHGQSRDLQTYPYQAMTTLSTHD
LPTINGYWRGYDFELGEKFGVYPNPKILDILQRDRVRAKTQILQRLRQHH
VPVEAKISAELGSSVSNKFVHQLQTYVAQVSSGLFGFQPEDWLGMTEPVN
IPGTSTQYANWRRRLTANVEDIFADTDIQHLLKEVNAIRKE
>MS1124 malQ, MalQ protein
MSELQQAAQLGIALSYYDIEGRLIQAKEETLQYFTALFQPSPDGKNKSTK
QFHDVFVMNAQERAIYEFQRLALSPSACEYQLFDEQNKLCGVQTLSDPKS
LSLPPLEAGYYLLKLKCNDAEYRIRLLVQPSTAYQPPLLERKKAWGLNVQ
LYSLRSTRNWGIGDFADLRNLIKSAVKFGADFIGINPLHLVYPAVPEWAS
PYSSSSRRWLNCIYLDIESLPEYTLSKLAKKWRVEHNEQIEQLRKAELVD
YATVNQLKQSALALLFDFFNRSKSAQIAARRTEFNVFVENQGEALLYQGL
FNVLDSIEHVDLPENENQIGWLGWRKEWQHLTSKKRKALLKEHQKQVYFY
AWQQWLAEQQLAEAESLCLTEGMQLGIYGDLAVNSSRGSSDVWSDQKLYC
INASVGAPPDPLGPVGQNWNLPPYNPNQLKRRGFQPVIDMLRANMRHFGV
LRIDHVMGLFRLWLIPEGKTAADGVYVHYPFNELMAILAIESQRNQCLII
GEDLGTVPDEVRSKLKEFQILSYFVVYFSNQNGEFPQGKDFPVNAFATIG
THDVPSLAGFWHCRDLALFAQLDVLKDDLLKAKYDQRLTDKQALLDRLRL
DGYLPADYQGDALNMAMHDNLNLVIHRYLAESASQLVGVQLENLLNQEVS
FNLPGTSTEYPNWRKKLAVNLDDIFNDERIIALLKTINYARSQPKS
>MS2072 malT, MalT protein
MLIPSKLVCSFRLQNSVPRTRLIQELDKSAFYPVVLINAPAGYGKTTLVS
QWIEDKKNVGWYGLDEGDNNSDRFAVYFSAALHSAINEEVDVLLEENRKA
NLLALFNQLLIKASGFPQHFYLVIDDYHLIENDEIHEALKYWIRHQPANM
TLILISRSVPPLSVASLRVQEQLLEIDINQLMFDHQESVAFFQARLGSEL
KQQDIIELCNEVEGWPTALQLISLFAKNKSQTLQVPLQDIAKRLAKSNNF
HINEYLADEVLNKVDKSTRLFILRCSVLHSMNETLVEAVTGEPNSRKKLE
SLEKQGLFLQQMANSKWQTVDDSWWKFHPLFASFLNFCCQHELYDELSQL
HRRAAQAWLKLGYVTEALHHAMQLSDTCLLLEILDEHAWTVFHQGELQLL
EESLNSLDYAHLTEHTNLVLLKAWLVQSQHRHVEVSGILAEFSRALNENK
VELSKTAQAEFNVLRAQVAINSGDENTALQLASDALKDLSENAYYAHIVA
TSIIGEAHHCHGNLAEALSMLQKAERMARQHHTYHNILWSLLQQSEILLA
QGFSQAAYDMLDKASEFVKENHLQKVPMYEFLLRLKGKILWEWYNLDKAE
SMAVAGMNALQKFEDKLQCLALLTKISLVRGNLDNTSRLLNEVEQLERSH
AYHHDWTASADQVRMFYWQMTNDVAAARNWLIQNPAPISDKNHFTQIQWR
NIARARILLGQYDKAQEILDNLIETAEKFSLTSDLNRALIVRNRLYFLQG
AKELAQQDLIAALKLTRQTNFISAFVVEGDVMAQQIRNLLQLNVLDELVL
HKAQFILRNINQFYRHKFAHFDETFVSQLLKNPKVPELLKISPLTQREWQ
VLGLIYSGYSNEQISDELQVAATTIKTHIRNLYQKIGVTNRNEAISYTKE
LLALMGYN
>MS0614 manA, ManA protein
MTGIYKLTGSLQHYVWGGHDYLPELLHIKKEPNQYYAEWWLGAHSSSPST
IEVEEKQLSLIDFIRQHPEVLGSQSRALFGDELPYLLKILDVKKPLSIQL
HPTKKQAEIGFAEENSKGIDLKDAKRTYKDNNHKPEMMIALSDFWLLHGF
KTKEKIIETLKNRPTLTALLSKLETQDMHGFYADIMQASQTELANWLLPI
IESNKIAYEKGELSLENPDYWVLYAMEAMEISPEKLDSGLICFYLFNIVH
TKTGEGIYQDAGIPHAYLRGQNIELMACSDNVIRGGLTPKHVDIPALLEV
IDCREVVPEIIPPAPQENGAFIYSTPAKDFALENVRYDAGIKVESKAENA
TIIFVMQGTLKISQKNTALFLKQGESAFICADTQYRIEGIEQGYCVLSKL
P
>MS1776 maoC, MaoC protein
MILYYLIIIFSLLLFTYTALNLNSAKKKINQNDDSFKTINEFFKNIKFKT
ALWKYYFTAGNERNLLRNVLLTLIIFFFFHTLNYLYIKVDKFIFLGVFLV
LFFIIVWKLGQRRNRKEFEEMFPEVIQILNSATSSGAGLLQALERCGKDI
SGQIGEEFTAIHKRLAIGEDANSVFEDSYSRYPYKEYYFFITIIRVNLDK
GGQMREVILRLGRVIADSKKMEKKKSAMTSEARMSAMIVASFPVAFFIFM
KFMMPENFEFLLNDPGGRMILYYVLGSESLGMGIIWWLMRKAT
>MS1306 map, Map protein
MAIPLRTEAELEKIRIACKLASDILVMIEPHIKEGVSTGELDRICHEYIE
RVQGATPANVGYHGFPKATCISLNDVVCHGIPSEDKILKNGDILNLDVTV
IKDGYYGDNSKMYIVGGETSVRSKLLCEVTQEALYVGIRAVRAGVRLNQI
GKAIQQYVEKQGFSVVREYCGHGIGDQYHTEPQVLHYFGDDGGVVLKPGM
VFTIEPMVNAGKKEVRLMGDGWTVKTKDRSHSAQFEHELVVTETGCEVMT
IREEEEKEGRISRIMVNAEA
>MS1180 marC, MarC protein
MVAVINPFGVLPVFVNMTNHQTKAERNHTNLITSFSVGVILLVSLFFGKI
ILSLFSISINSFRIAGGILIISIAMTMISGKLGEDKQNKDEKNADFANMN
SIAVVPLAMPILAGPGAISSTIVWASQYSSWFDWIGFSFAIILFSLLCYG
LFRSGPTIVNALGKTGSNVVTRIMGLILMSLGIESIVVGITKLFPGLTH
>MS0138 marC, MarC protein
MFDSLFVQFVVLWAVIDPIGSVPVYLSKTVGLSVEERHKVARKSVIIATI
VLMFFLVIGQGLFETMQIPLSAFQIAGGLVLLLFALTMIFGEGKPETEMK
MRTSLSELAVYPLAVPSIASPGAMMAIVLLTDNHRFDFFEQCLTTVVMLL
ILFITYLLFLIANKIQRVIGNTGAAVISRVMGLILAAVAVNNVLVGIRDF
FGIAL
>MS2091 marR, MarR protein
MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ
KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV
IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE
>MS2146 marR, MarR protein
MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ
KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV
IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE
>MS0223 mauG, MauG protein
MKKYFLSAIAIAGLGYLSMVGYAHFFDKSQSEKLYAATEVPQQFKPVAKV
MFDNGCQYCHSPSADIPGYANFPIAKQLMEQDIAQGLRSFRLDRMFEGMK
DPSKLSEADLAKLEQVIRNDQMPAAKFLHIHWGTRPDADEKQVLLDWIKQ
QREAHFLPQNTQGADAARLVQPIPDAIATDPHKVALGEKLYFDGRLSADG
SIQCHTCHQLEQAGVDNLPVSEGIEGKKGGINAPTVFNAAFNKWQFWDGR
AKTLADQAGGPPINPVEMGSKDWDEIIARLDQDEAFKKEFLSVFPELSQA
TVTEAIGEFEKTLITPNSAFDRYLKGEQSALNDQQKRGYELFKNAKCDTC
HTGTAMGGQSFEYMGIYDDYFKARGTDLTDADKGRFAETQDPYDMHRFKV
PTLRNVALTAPYMHDATAKDLKEAVRIMGHYQSNKDFSDAELDDIVSFLN
SLTGEFKGKLLTNEKMK
>MS0351 mazG, MazG protein
MIPCLIEESYEVVEAIQQKNTADLREELGDLLMQVVFLSQLAAEENKFTF
DDVVNDIAEKLIYRHPHVFGDKEAADEHAALRNWNEMKAREAKNQAHTSI
LDNVPFSFPALLRAEKLQKKCAKAGFDWQQVAPVIAKVEEELEEVTQEIN
CPAPQQAKLEEEIGDLLFAVVNLSRHLKCQAEESLRKANHKFERRFRAVE
DKLRQQNKTATESSLMEMDMLWDEVKHEEKVSSD
>MS0836 mdaB, MdaB protein
MKHLVIFAHPNTKNSFNKAILERVLQASQKMNVDTTVRDLYGMNFNPVVS
WEELTGSFKEIIPAAIRHEQQLISEADLITLIYPLWWMGFPAILKGYFDR
VFTHGFAYKTDETGTVGLIQGKKMQQFITMGNNEERYQQMGFARSLNDTL
VNGLFNYVGIIDIDHRLLGDIHIISSEERQALLNEVEQKTKENLTALLEG
KA
>MS2139 mdaB, MdaB protein
MSNILIISGHPNLANSVVNTIILDEFAKTLPQAEIRKLDQLHTNYEFDVA
AEQAAIEKADVILWQFPFYWYAMPALMKKWLDDVFVHGFAHGSTAKIAGK
KLLISLTTGAPLEAYQREGFFKHKMDDFFAAFETTAILCGLDFQGVQFLN
GVSYVGRNEEKIAQQQAEAKVYAQTVIEKVKRL
>MS2094 mdaB, MdaB protein
MKTTVLVVHPNIKQSRVNAALAKGAADVAGVKVRYLYDLYPDGKIDATAE
QAVLEKADRIVLQFPMYWYSSPALLKQWLDDVLAYGWAYGDKQALKGKEL
MLAVTTGGGEEFYQKDGLAGHTVAEFLVAYETIASYLGMNYGKMFVTGNC
LNISDDEIAAQVPRYQAVLSA
>MS1417 mdaB, MdaB protein
MKNVLIVSGHPNLKTSIANQVILDETAKALPNAEIRKLDELFHNGTFDIA
AEQAAVLKADVLVFQFPFSWFSLPGVMKIWLDEVFEHGFAHGSTAQLAGK
KIIFSTTTGAPAEVYQKDGFFKYTMEEFAAQFEIMAQLCNLDYQGLIYTN
GIGYTSRENEEKINAQKAEAKKHAQRLVALIEKA
>MS2162 mdaB, MdaB protein
MVIFLTVCYHTSGRFLSIFCKCRYSTSGEEYMNRRNLLKAGVALAAVAAM
PFGRAQAKTPSKKTLVIVSHPYPESSTFIKGLQQAAETVEGVTVRNLETI
YGFDTRAVKGDEERRIMRAHDRVVFIFPTHWFNITPMMKAYLNETWGSVG
PGLWQGKEMLVVSTAAGGSETYGKNGRVGVELADVFLPMKASALHCGMTY
LPPLVFQGVRSSELANYQQQLIERLMQ
>MS1266 mdh, Mdh protein
MKVAVLGAAGGIGQALALLLKLQLPAGSSLSLYDVAPVTPGVAKDLSHIP
TDVVVEGFAGTDPSEALKGADIVLISAGVARKPGMTRADLFGVNAGIIRS
LTEKVAEQCPKACVGIITNPVNAMVAIAAEVLKKAGVYDKRKLFGITTLD
ILRAETFIAELKGLDPTRVTIPVIGGHSGVTILPLLSQVQNVEWSSEEEI
IALTHRIQNAGTEVVEAKAGGGSATLSMAQAAARFALALVKASQGAKVVE
CAYVEGDGKYARFFAQPVRLGTEGVEEYLTLGKLSAFEEKALNAMLETLQ
GDIKSGEDFING
>MS0950 mdlB, MdlB protein
MEKTRQRYLFKWLRAQQQPVKKLLNLNILLASTSSVILVLQTWLLATLLN
DLVMLDTKPRELLPHFFGLAIGFTLRALILWLRERIGFKCGQQLRNIIRR
RILEKIHQVGPAVINNKPAGSWATLMLEQVENLHNFYSRYLPQQMLSVIA
PVVILIAVFPINWAAGFILMATAPLVPVFMIIVGLAAADSSQKNMTTLAR
LSAQFLDRLKGLETLRLFDRAEQQTNHIEIGTEAYRKTTMDVLKMAFLSS
AVLEFFTSISIAIMAVYFGFSYLGQINFGTYDTSLTLFAGFFCLILAPEF
YQPLRDLGTYYHDRAAAIGAADSIVEFLEQKELHSATQTRQLNETTALEI
KAENLVINSPQGQALTKPLNFSLSPLSHTALVGQSGAGKTSLMNALLGFL
PYEGSLTVNGIELNRLEPTQWRKHIAWVGQNPLLLQGSIKENLLLGEIQA
SEEEIAQALKQAKATEFTDKLGLDYEIKDGGTGISVGQAQRLAIARALLR
QGSLLLLDEPTASLDAQSENQVLQALNQISQTQTTLMITHRIEDLKQCDQ
ILVMQSGEIVQQGIFEQLQNEGYFAELLAQRNTDVA
>MS0932 mdlB, MdlB protein
MVSADCRRRAGLIFGSYYNIGYNAPIIFLRRYTFMQKLQENDLSTSQTFK
RLWPTIAPFKIGLIAAAAALVLNALTDSGLIYLLKPLLDDGFGKADTSFL
KLMAVLVIVFIFIRGITSFISSYCLAWVSGKVVMTMRRRLFKHLMYMPVS
FFDQNSTGRLLSRITYDSEQVANSSSNALVTIVREGAYIISLLAVMIATS
WQLSVVLFIIGPVIAVLIRLVSKIFRRLSKNMQNSMGELTATAEQMLKGH
KVVLSFGGQQIEEQRFNEVSNDMRRKGMKMVVADAISDPIVQIIASLALS
AVLYLATIPSIMSQNLSAGSFTVVFSSMLAMLRPLKSLTNVNSQFQRGMA
ACQTLFDILDLDTEKDKGKYEAERVKGDVSFKDVSFTYQGKDQPALKHLS
FDIPHGKTFALVGRSGSGKSTIANLVTRFYDINQGEILLDGVNVQDYTLS
NLRTHCSVVSQQVHLFNDTIANNIAYAAKDKYSREQIIAAAKAAHAMEFI
EPLENGLDTVIGENGASLSGGQRQRLAIARALLRDSPVLILDEATSALDT
ESERAIQAALEELQKDRTVLVIAHRLSTIEKADEILVIDHGEICERGSHE
ELLALNGAYKQLHKMQFNG
>MS0394 mdlB, MdlB protein
MLNKIFSWFERRVEAYPDQTPNTPENGLFKFIWSSLDGMKKWILLLAVLT
VGTGVMEALLFQFMGVLVDWLGNYTPVTLWQEKGTLLWGMGFLLVFSILW
SFLASAVRLQTLQGVFPMRLRWNFHRLMLGQSLSFYQDEFAGRVSAKVMQ
TALAVRDTVLTIADMMVYVVVYFISSGVVLVALDGWFLVPFVVWVVLFVM
ILRVLIPKLAKTAERQADARSLMTGRVTDAYSNITTVKLFSHGAREASYA
KKSMEEFMVTVHAQMRLATSLDTLTYAANVFLTLSTAILGILLWQKGAVG
VGAIATAVAMALRVNGLSRWIMWESARLFENIGTVNDGMTTLSKPHTIID
KPNAPQLEVKKGEIRFDNVDFCYDPAKPLLNHFNLTIRPGEKVGLIGRSG
AGKSTIVNLLLRFYEAQNGTISIDGQNILDVQQESLRRQIGLVTQDTSLL
HRSVRDNIIYGRPEATDEDMINAAKRAEAADFIPFLSDAKGRRGYDAHVG
ERGVKLSGGQRQRIAIARVMLKDAPILLLDEATSALDSEVEAAIQESLDK
MMENKTVIAIAHRLSTIAAMDRLIVLDKGQIVEQGTHAELLAQNGLYAKL
WRHQSGGFLSEHAD
>MS0949 mdlB, MdlB protein
MRSLIPFLTLFKYAKFPLILGVILMILGLAASIGLLTLSGWFLAATAIAG
AGTLFNFFYPSSAVRGLAIGRTVARYFEKIVTHDATFRILSKLRVQVFGK
IIPLSPAVLNRYRNSDLLNRLVADVDTLDSLYLRLIAPFVSAIFVIVLIT
VGLSFINLPVALFIGITLLVLLLVIPTVFYKLGTKFGKKLTLSRATYRSQ
FVEFIQAQAELLLFNAEDKIKQNLANTESEWQAYQQRETNLAGLSSAILL
FANGLILAVTLWAAAHIDLGTGEYKAALLALFAFSALAAFEILMPIGAAF
LHIGQVIASADRVTEIISQPPLVKFSGKQTALSATADLIRLKDVSFSYPE
RTSCALNGLSLTIRKGQKVAVLGKTGSGKSTLLQLLVRNYNPSQGEILLA
EQPIHAYSEQCLRENICFLTQRVHTFSDTLRNNLQIANKTKIDDLKMREV
LVQVGLAKLLEQKSGLDLWLGEGGRPLSGGEQRRLGLARILLNESPILLL
DEPTEGLDRETERRILRLLMQHSANKTLIMVTHRLTAIEQFDQICVIDDT
NLVEKGSYQELNSKNNGFFKKLVERI
>MS1204 mdlB, MdlB protein
MNIFISTIKGYRWHLVAVLILTFIFSGFGIGVLAFINNKLMKATEQKELL
IWTFIGLLILFLITSVIAQISLTALGHKFVYLMRKQLVKQLLDTGTEQLN
QIGKARLLASLSGDIRNITFAFVRLPELVQGSILVLCAGAYMFYLSESLF
FVTALWLSVTVWVSNIAVRRVYHHLRIVRETEDKLYKNYQSAIEGHKELS
LNRERAKFYFERELEESASTQRDNSVRADSYHAFANNWTNVMVLGAVGLV
FYLSLAEGWSNLETATTIALTILFLRTPLISAVGALPMLLNAKVALDKLS
KLNLAPYTEDFAISNPLPRDWKEIRFENVTYSYPTAEGTMSFALKPVNLT
IKRGELIFLIGKNGSGKSTFSMLLAGLNHATDGKIFVDNIEITAANQRAF
RAQISAVFSDFYLFTQLLGQAGFASLKEAGQWLETLQLENKVTVENHRLS
TTNLSQGQRKRLGLLIALLEHRPLLILDEWAADQDPTFRRTFYQVLLPLL
REKGHTVFAISHDDSYFHLADRLLLISQGELRELFGEERETASHDAVEKL
NNIIKETKI
>MS1569 mdoB, MdoB protein
MHSFKNYKEIMKNLKRLLGASYPIIIFLPVNLVVLSLSRLGLSLWQSERV
DATQGWTELFLQGMRVDLASLCWLFFPLILLGTLFSGNNKFGKIIQWIIK
LGLTVFSTFFVFMELATPAFIKTYDYRPNRLFVEYLNTPKEVFTMLAHGH
LAALISTVILTALFAIIFWKFAQKLSRDLYFPKWQYRLVSFLILGLFAFI
GARSSFEHRSLNPSMVAFSNDTLVNSLVLNSGYSVLFAVQQMKDESNSSE
IYGKMPLEEVVNTLKGLNPRPETAYISAELPTLTHNQASYQGKPKNLVIL
LQESLGAQFIGTLGGKPLSPNIDELAKEGWLFDNLYATGTRSVRGIEAII
TGFTPTPARAVVKLQGSQDNFFTIADLLKQQGYDTSFIYGGEKHFDNMAS
FFYGNGFTRIIDQADYSNPTFSGTWGVSDEDLLNKANETFESLHQQGKPF
FSLVFSSSNHDPFEFPEGKIQLYEQPQATRNNAAKYADYALGEFFKKAKQ
ANYWKDTVFIIIADHDSRVNGQQLVPIKHFHIPALVLGADIEPKRDHRLI
SQIDIPPTLLSLIGISGDYPMIGFDLTKAENPNRALMQFDKNLAYMRDNR
VAILQPNKPATGFIYDEKTGDLTAASIPENMAKEALAYALWGSYAYKNKL
YKSDYLLKK
>MS1409 melB, MelB protein
MKQQNVWLKRIGYGFGDFGCNLVFSTMASYLMFFYTDVFGIEAAVVGTMM
LSTRLLDAVTDVLMGLVVDRTNTRWGSGRPYFVIGAIPFAIFTTLTFYVP
DFGTAGKIIWAYCTYIMLSLAYTVVNIPLNTIVPRLTSDINERNILVASR
MICALLGTTVVMGITQPLVDFFGQGDYKQGYFITMTLYGILAMLIFFFTF
TQTEEVVPPTVVRTENSSVLDDFKGLTSQTWILVLVNFFYFGLFVVRNTS
VIYYFTYNLNSTSWLTFVGFFGILSGLPILLLLPRLQKIFPQRTLIIACC
LLYIIGDAIAYIGKDSLTLQLVSLAVTGLGMYGIFGVTFAIQPDVIDYSE
YEKNRSIPGMIASMQGFFVKFGMGVAGLSIGWILEGGGYQPNVVQTESAL
FSIEVCYIWIPVIICLSIIALMYFYKLDGLRSEMTRVLDLRRKQMEYAHQ
H
>MS1228 melB, MelB protein
MSLSMKTKLSFGLGAYGKDFAIHIVYMYLMYYYTDVLGVSGAIVGTIFMI
ARVWDAVNDPIMGWIVNNTRSRWGKFKPWILIGTVLNSIVLFSLFCADYF
SGTALIIYIAVTYILWGMTYTLMDIPFWSLTPTLTLDQREREELVPYPRF
FSSLANFITAGTCIAFVDYVGGDDKGFGFRMFTLVIIVCFLISTVITLMN
LKEKYSSDNLETGEAQQRIPLKTLVSLIPRNDQLSSLLVMALSYNIASNI
ISGFAIYYFTYVVGDKEMFPYYMSYAGIANLIIIIFFARLVRLFSRRTLW
ITISVSSILSCLILAYTGMAETPSVFLIILAGIFMQIGSALFWTLQVIMV
ADTVDYGEYKLGIRSESIAYSVQTMVVKAGSAISAFLIGVLLTAINYVPN
EVQNENTIFWMKVIMIGLPILFYSIKLFVYFRYYKLHGDLLAKVNIALLD
KYRNVKED
>MS1840 menA, MenA protein
MTQSALKTWFETARPKTLPLALAIIFTGSAVAYWFGSFDWQITLLCLLTA
TLLQILSNFANDYGDFQKGSDTVERIGPLRGIQKGNMTEGQLRNGLIVTI
ILILISGFALLATAYQSLQDLIVFITLGIASIVAAIAYTVGKKPYGYLGL
GDIFVFLFFGLLAVAGTYYLQAHSMNWTVFLPAGACGFLSTAVLNINNMR
DIEQDKKAGKHTLVVRLGAEKSRVYHCLLLTSGVLCYALFSAINVDSRWG
FLFILAVPLLVKHAGFVYKTKEPILLRPMLAQMSLLALLTNVLFSLGLVL
AK
>MS1792 menB, MenB protein
MLYPSEEFLYAPVEWADHSEGYTDIRYHKSKDGIAKITINRPEVRNAFRP
QTVKEMIHAFSDARFDEKIGVIVLTGEGEYAFCAGGDQKIRGDYGGYKDD
SGVHHLNVLDFQRDIRTCPKPVVAMVAGYAVGGGHVLHMMCDLTIAADNA
KFGQTGPKVGSFDGGWGASYMARIVGQKKAREIWFLCRMYDAKEALDMGL
VNTVVPYADLEKETVRWCREMLQNSPIALRCLKAALNADCDGQAGLQELA
GNATMLFYMTEEGQEGRNAFNQKREPDFSKFKRNP
>MS1794 menD, MenD protein
MSVSTFNRCWSKVILETLTRHGVKHFCIAPGSRSTPLTLEANRLQEQRRA
LCHTHFDERGLGFFALGLAKSSQTPVAIIVTSGTAAANLYPAIIEARQTG
DNLIVLTADRPDELIECGANQAILQQNMFAGYPVASVNLPRPSQDYIVSW
LISTLDQACHQQAQQAGVIHINVPFAEPLYDADEDEIDVHPWLAPVQRWL
NHNKPWADHQALQEEVVMHEHWDNWRTKRGVIVAGRLTQEQSMGITAWAN
TMGWVVLTDIQSGVEPSLPYADIWLANKTVREKLLQADLVIQLGYAFVSK
RINQFLADFKGEYWIVDESAHRVDPYHHIHTRFTAKVHHWLRAHPPLRQK
PWLLEPLALSKFCASFIEQQVGGNLNEASLAHHIERILPNNGILFLGNSL
FVRLVDALGKLPEGYPVITNRGASGIDGLLATAAGVGMGSNQPVVAMIGD
VSALYDLNSLALFKNVNQPTIIFLINNNGGAIFDMLPVESSVKSEFYRMP
HHTEFSQAASMFDLKYARPYTWADLSSVLKQAYSRKEATVIEIKVGPMDG
SNTYKRLIEQISYAVIGA
>MS1795 menF, MenF protein
MFLTMDSLQSLQQQLIQQMDAYQPCKNQPEITALTAKIQLEQNLLAWLKA
QQDYPQFYLHCRADSPNEQENHIAAIGQVRTFTTVNHAQTFVRRADFTLV
GGMTFNGECDFYLPRLLLRQVDGELTATLFIDSQKDLSVEKQLAGKCLKN
FTKSVALEPVVQSVRLVEKKATQAQWCEWVEQALLEIKKGSFTKVVLANE
SIFSSRQPINAIDFLAESEKKNTGCYHFFFAQKADYAFIGSSPERLYLRN
GQYLQTEALAGTAVMSDDEEQNQRQGEWLLKDEKNEYENMLVVEDICGNI
ESFTQNIEVQSVELKRLRLVQHLRRKIFAKLTALTADEACLNAIHPTAAV
AGLPKQNALRFLAKTETFERSWYAGTLGFMNRARAEFCVTLRSAFVEQNR
IRVFAGAGIVAGSVPLLEWQEIERKASGLLSLLQNSGEHICQ
>MS1839 menG, MenG protein
MSYTGNRFILEKSNLYKGNFMRIDTSELCDVYLDQVDVVEPIFSSFGGVN
EFFGKVTTIKCFENNGLIAEILEEQGEGRVLLVDGGGAVRRALIDAELAQ
LAADNGWEGIIVYGAVRQLSRLENINIGIHALAPIPVGADEDTQGESDIP
VNFGGVTFFPEDYVYADLTGIILSQEPLELEELGEE
>MS0804 mesJ, MesJ protein
MSDIFNQFQQQIYQQKILIAFSGGLDSTALLALCKKLQENRPHFQFRAIH
IHHGLSPNADKWALHCENICRQFSIPLIVEKVRVDKSNGIEAGAREARYH
AIANHLNIDEVLATAHHLNDQTETFLLALKRGSGVQGLSAMQKESVVFNL
PIFRPLLQFTRAQLEDYVKSQKINWIEDESNEDNSYDRNFLRNIILPKMK
TRWAHFDQAVYRAAQHCFEQTQLVNELLEDEFQKIFEKNDRTLSVKLFDR
YSYIKQKALLRLWLARLQLAMPSQKQLEQLIRDVIFAKPDAIPQFKLANQ
VIRRYQQKLYLTADFADLTDVAVPLKISQTVPLPDGLGHISLREQSGCFV
FSWRHYQVQLPPCKQPIEIRFAYSGKVKLHKNGVNQDIKKVWQNLNVPPW
LRNRTPLIFYGDQLKSAVGFFKVFDC
>MS1087 mesJ, MesJ protein
MTELETKENKKQIYNFNKLQKRLRRNVGNAIADFNMIEDGDKVMVCLSGG
KDSYTLLDILLNLRHNAPVHFDIVAVNLDQKQPGFPEHILPEYLSSIGVE
YKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGH
HRDDMLETLFLNMFYGGKLKSMPPKLVSDDGKQIVIRPLAYCKEKDIEKY
AVAKQFPIIPCNLCGSQPNLQRQVIKEMLQTWDRRYPGRIETMFSAIQNI
TPSHLCDPNLFDFKNIKRGQLPKGVEGDIAFDKEELPQTPIIDEDTEDFV
NNGQLIRFKEVN
>MS1627 metC, MetC protein
MTQNYSIETILAQAGNKSDARTGAVSTPIFLSTAYGHRGIGESTGFDYTR
TKNPTRLVLEETIAKLENGDQGFAFSSGMAAIQVLMTLFTAPDEWIVSSD
VYGGTYRLLDFAYKNNNSVKPVYVNTASVEAIETAITPNTKAIFVETPSN
PLMEECNVTEIAKIAKKYNLLLIVDNTFLTPVFSRPLDLGADIVIHSATK
YLAGHNDTLAGLVVAKGQALCERIFYIQNGAGAVLSPFDSWLTIRGLKTL
ALRMERHQANAAAIAEFLKAQPQVKDVLYPNKGGMLSFRLQDENWVNPFL
KAINLITFAESLGGTESFITYPTTQTHMDIPAEERIARGVTNDLLRFSVG
LENVEDIKADLAQAFAQFK
>MS1520 metC, MetC protein
MSNKYSLATTLVHAGRSKRVSQGSVNPVVQRASSLVFDSIADKRQATVNR
AKQALFYGRRGTLTHFALQDLMCEMEGGAGCYLYPCGAAAVTNAILAFVQ
SGDNILMTGAAYEPTQDFCNKILSKMNVSTTYYDPMDGEKIAELVQPNTK
VLFLESPSSLTFEVPDVPNIVKAVRKINPEIVIMIDNTWAGGILFKALEH
DIDISIQAGTKYLVGHSDVMIGTAVSNARCWDQLRENSYLMGQMVDADTA
YTTARGIRSLAVRFKQHTESSIKVAQWLAEQPEVKAVFHPALPSCPGHEF
FKRDFTGSAGLFSFELKEQLSREKLERFMDNFKLFSMAYSWGGFESLILY
NQPADIAAIRPNIKRKLTGTLIRIHIGFEDVNELIEDLKAGFERLK
>MS0941 metE, MetE protein
MTKLFPNATVRTSAPYRFDIVGSFLRSDAIKSARAACACGDISCADLTRA
EDAEIAKLVERQKSVGLHAVTDGEFRRTFWHLDFLAGLDGVEEVDAEKFS
VQFKHHNVRPKTLKIVAKVDFSENHPFVEHFRSVNELAKGTEVKFTIPSP
SMLHLITNVRATNYQPIPRYENNNQQLLDDIADAYIKAMNIFYKLGCRNL
QLDDTSWGEFCAEDKRAAYQERGFDLDQIAKDYVYMLNKIVDAKPAQDIA
ITMHICRGNFRSTWFSAGGYEPVAEILFGSCRVDGFFLEYDSDRAGDFKP
LRFIKNQQVVLGLVTSKDGTLENREDIINRIKEAAQYVDINQLCLSPQCG
FASTEEGNILTEEQQWAKLNFIREIAEEVWGK
>MS0787 metF, MetF protein
MSYAKDIDTLNQHVADLNGQINVSFEFFPPKNEKMEETLWSSIHRLKTLN
PKFVSVTYGANSGERERTHSVVKNIKQKTGLEAAPHLTGIDATPEQLKEI
AQDYWNNGIRRIVALRGDIPAGYTKTPFYASDLVALLRSVADFDISVAAY
PEVHPEAKSAQADLINLKRKIDAGANHVITQFFFDIDNYLRFRDRCASIG
IDAEIVPGILPVTNFKQLQRMAALTNVKIPNWLAVNYEGLDEDQTTRNLV
AASVALDMVRVLSREGVKDFHFYTLNRSELTYAICHILGVRPK
>MS1631 metG, MetG protein
MSNQHRQILVTCALPYANGPIHLGHMLEHIQADIWVRFQRMRGNEIHFVC
ADDAHGTPIMLKADQMGITPEQLIADVKEKHYADFCGFNISFDNYHSTHS
EENRELSELIYSRLKENGFIKSRTISQLFDPEKSMFLPDRFVKGTCPKCK
AEDQYGDNCEVCSATYSPTELINPRSAVSGATPVIKESEHFFFDLPSFES
MLKEWNRSGALQSEVANKMQEWFDAGLQQWDISRDAPYFGFKIPGTENKY
FYVWLDAPIGYMASFKNLCKRENLDFDRFWNKDSNTELYHFIGKDIMYFH
SLFWPAMLDGANYRKPTNIFVHGYVTVNGEKMSKSRGTFIQAATYLKHLD
PECLRYYYAAKLSNRIDDLDLNLDDFVQRVNTDLVNKLVNLASRNAGFIQ
KRFDGKLADKLEDESLFAEFIAQSEQIAAYYENREFGKAIREIMALTDKA
NKYVDDKAPWVIAKEEGREAELQAVCSMGIQLFRVLMGYLKPVLPKLAER
SEAFLQAELTWDNLAQPLLNHGIAPFKALFSRLDVKQIDAMIEASKAENA
AVNATVKKEEKNSKKSTALLTDFEPIEPEISIDDFAKIDLRVAKVIKCEE
VPESKKLLKFQLDLGFEQRQVLSGIKGAYNNPEELEGRFVIVVANLAPRK
MKFGVSEGMILSAGTGGEDLYLLDVDAGVKAGSRVM
>MS1009 metH, MetH protein
MHNKIDILKASLAQRILILDGAMGTMIQQYKLSEQQFRGERFKQSSVDLR
GNNDLLSLTQPLLIQAIHEKYLQAGADIIETNTFSSTSIAQADYDLQAIA
YELNFAGAKLARIAADKYSSADKPRFVAGVLGPTNRTASISPDVNDPGFR
NITFMQLAEAYGEATRGLIAGGADIIMLETIFDTLNAKAAVFAIEQVFEE
LGVRLPVMISGTITDASGRTLSGQTTEAFYNSLRHAKPLSFGLNCALGPK
ELRQYVEQLSKISECYVSAHPNAGLPNAFGGYDLGAEEMAAQLKEWAESG
FLNIVGGCCGTTPEHIKAFAEAMQGVKPRPLPQIKTAMRLSGLEPLSIDD
DSLFVNVGERNNVTGSAKFKRLIKEEKFGEAIEIAIDQVENGAQVIDVNM
DEALLDSQKCMTRFLNIMATEPDAAKVPVMIDSSKWEVIEAGLQSIQGKG
IVNSISLKEGEEKFIRQAKLIRRYGAAAVVMAFDEKGQADTEARKVEICT
RAYDILVNQAGFPPEDIIFDPNIFAIGTGIEEHNNYGVDFINATGRIKQT
LPYAKVSGGVSNVSFSFRGNNPMREAIHAVFLYHAIKQGMDMGIVNAGQL
AIYDDLDPELREVVEDAVLNRRPDATDRLLEIAEKYRNQDSTGEDNGVAE
WRSWSVEERLKHALVKGITHFIIEDTEEARQKFSLPLEVIEGPLMAGMDV
VGDLFGDGKMFLPQVVKSARVMKQSVAYLEPFINATKQKGSSNGKVVIAT
VKGDVHDIGKNIVSVVLQCNNFEVIDLGVMVPADKIIETAIAEKADIIGL
SGLITPSLDEMEYFLGEMNRLNLNIPVLIGGATTSKEHTAIKLYPKYKYE
VIYTTNASRAVTVCAALMNPESKAELWARTRKEYEKIQQSFAERKPLRSS
LSLEQARANGFNPFAGEWANYQVPQPKQPGISEFKDVPIAMLRKFIDWSP
FFRVWGLMGGYPDAFDYPEGGEEARKVWHDAQIMLDEFENNGKLTPSGVL
GIFPAERAGDDIKIYQNSDRTLLAGVARHLRQQSERGKNSKIPYNLCLSD
FIAEGSNGQQDWLGMFAVCAGTQEHALVDSFKAKGDDYNAILLQAVGDRL
AEAMAEYLHFELRTRLWGYSDETFDNQALIDEKYIGIRPAPGYPSCPEHT
EKQLIWDLLEVEQRIGMKLTESYAMWPAASVCGWYFSHPASSYFTLGRID
EDQAADYAKRKGWDEREMRKWLGVSMK
>MS1966 metJ, MetJ protein
MGFFSLKYRQILRLLIGNFMADWDGKYISPYAEHGKKSEQVKKITVSIPI
KVLEILTNERTRRQLKNLRHATNSELLCEAFLHAFTGQPLPTDEDLLKER
HDEIPEQAKLIMRELGINPDEWEY
>MS0669 metK, MetK protein
MSSYLFTSESVSEGHPDKIADQISDAVLDEILKQDPKARVACETYVKTGM
ALVGGEITTSAWVDIEYIARQVICDIGYTSSEMGFDGHSCAVLNGIGKQS
SDINQGVDRDDPLNQGAGDQGIMFGYATNETEVLMPAAITYAHRLMERQA
WVRKNGTLPWLRPDAKSQVTLKYENNKIVGVDAVVLSTQHSDSVTQEDLH
EAVMEEIIKPVLPAKWLSKDTKYFINPTGRFVIGGPMGDCGLTGRKIIVD
TYGGAARHGGGAFSGKDPSKVDRSAAYAARYVAKNIVAAGLADRCEIQLS
YAIGVAEPTSIMVETFGTGKVADELLVALVREHFDLRPYGLIKMLDLIKP
IYRETAAYGHFGREQFPWEKTDRAAELKAAAGL
>MS0216 mfd, Mfd protein
MTTHYFNLDIPTQAGDHKIVANVLTGSDGLAICEMAEQFQGLTVVVANDT
KSAVRLEKILQESGKLEVRYFPDWETLPYDSFSPHQDIISSRLSALFYLQ
NTRKGILILSVSTLMQRICPPQYLQHNVLLIKKGDRLVIEKLRLQLENAG
YRAVEQVMEHGEFAVRGALLDLFPMGSPLPFRLDFFDDEIDSIRTFDADT
QRTLEEIRQINLLPAHEFPTDDKSIEFFRAQFRETFGEIRRDPEHIYQQV
SKGTLVSGIEYWQPLFFENMATLFDYLPANTLFVDMEQYQIQAERFYQDA
VQRFESRKIDPMRPLLAPERLWLRIDEVNRALRNYPRISLKAEKVRTSVR
QKNLPLKALPELQIQPQQKEPLQNLRHFIEKFKGHIVFSVETEGRRETLL
DLLSPIKLRPKQVNSLFEAQSQTYSLQISSLDNGFIIEQENGEPIAIICE
TELLGERVQQRGRDKRKSVNPDTLIRNLAELKIGQPVVHLDHGVGRYGGL
VTLENAGIKAEYLLLTYANDAKLYVPVANLHLISRYVGGSEETAPLHKLG
SDSWAKARRKAAEKIRDVAAELLDVYAQREAQKGFAFHYNREEFMQFSAT
FPFEETHDQEAAINAVISDMCQPKAMDRLVCGDVGFGKTEVAMRAAFLAV
MNHKQVAVLVPTTLLAQQHYENFRDRFANLPVNVEMVSRFRTAKEQKKIL
EDLSAGKVDILIGTHKLIQSDVKFNDLGLLIIDEEHRFGVRQKEKIKQLR
ANVDILTLTATPIPRTLNMAMNGIRDLSIISTPPARRLTIKTFVRQADDL
LIREAILREILRGGQVYYLHNDVASIENCAEKLTALVPEARIIIGHGQMH
ERELERVMTDFYHQRFNVLVCSTIIETGIDIPTANTIIIERADHFGLAQL
HQLRGRVGRSHHQAYAYLLTPPPKLMTKDAVKRLEALESLDNLGAGFILA
THDLEIRGAGELLGSEQSGQIESIGFSLYMELLEAAVQAMKQGREPSLDE
LTQQQVEIDLRIPALLPEDYLGDVNMRLSFYKRIAGAENKPALDELKVEL
IDRFGLLPEATKNLMQITELRLMAKQLDIIRIDGSQNGGFIEFSPTADID
PMKFINLIKQQPAVFKFDGPTKFRFSCALEQAQKRLDFIFNLLQSLMD
>MS0642 mglA, MglA protein
MTAQSAQSDNQVLLTMTNVSKSFPGVKALDKANLTVKSHSVHALMGENGA
GKSTLLKCLFGIYAKDEGEILFLGKPVNFKTSKEALENGISMVHQELNLV
RQRNVMDNLWLGRYPLKGVFVDHTKMYNDTKAIFDELDIDIDPREKVANL
SVSQMQMIEIAKAFSYNAKIVIMDEPTSSLSEKEVEHLFKIIEKLKDRGC
GIVYISHKMDEIFKICDEITILRDGKWINTVPVKGSTMEQIVAMMVGREL
TQRFPPKINEPKEVILEVEHLTALNQPSIQDINFELRKGEILGIAGLVGA
KRTDIVETIFGVRERKSGTVKLHGKIMKNRTALEAINNGFALVTEERRST
GIYANLSIEFNSLISNMKSYMNKWGLLSDKKMKSDTQWVIDSMNVKTPSH
KTTIGSLSGGNQQKVVIGRWLLTQPEILMLDEPTRGIDVGAKYEIYQLIM
QLAQKDKGIIMISSEMPELLGITDRILVMSNGKVAGIVETAKTSQEEILQ
LAAKYL
>MS1611 mglA, MglA protein
MKMTDTNILTLKNISKSFFDVTVLEDINLDIRCGEVLCLIGENGAGKSTL
CKIIAGIYSRDTGEMLYQGQPYSPTTVKEAQEAGIGFIHQELMLVPKLTV
MENIFLGAEKTLSFGRMNWSEMREKTQHIIDELELDIKPDDLIADLSIAQ
QQMVEIAKAVFSEYKIIIFDEPTSSISRKNTEVLFKIIHQLKTKNVAMIY
ISHRLEEFKYIADRVTVLRDGRITGTMRYEETSPEDIVRLMVGRKVDFSR
YRRDTVFTQEKLRVENIQSKHISPISFQVNKGEILGFAGLVGAGRTEVLR
AVYGADEATGKIYIDQKEISIHSPEDAVKHKIGFITEDRKSQGLVLGMSI
RENITLPILKRFWNGWQLDKKKEREVVEANRSKLHIVSKDQEQQTKTLSG
GNQQKVILARWLESGVDILFFDEPTRGIDIGAKSEIYDLMRQFTENGGTI
VMVSSDLPELITISDRVIVMRNGEKVKEIANRDDITEENLMHAMIGI
>MS0200 mglA, MglA protein
MATERLSMRNMTKKYGTVTVLEDVSFNVKAGEVHALIGENGAGKSTLLNL
LSGVRDATAGEIYIDGQKVNINSPKAAKDCGIAMIHQELQNVPELSVFQN
IFLGRSLKKNLGLFVDKSKESQLALEVLKSLDPGIDPSVPIKTLKVAQQQ
IVEIARALLDNAKIIAMDEPTSSLTPSEFERLAELIKGLANSGVSIIYVS
HKMDEIFKVCDRATILRDGRFIDCVNMSEQTEESIVTKMVGRKIEKLTHM
SYATDEKILEVRNLGRDKAVKDINFIAHKGEVVGISGLVGAGRTELFRLI
AGLDKPTSGEILVEGKRLKLNSVRDSIKMGIGLVPEDRKKEGILRDRSVL
INIAMPSMDRFSQNGFIRKDYLGAVSHRLMMDLNLKPFDLEKTVGTFSGG
NQQKVIIGRWLAAGTKIYLFDEPTRGIDIGTKSEIYNLIENLAKAGNVIL
VVSSEMPEIIRVSDRVLVMKEGAITAELCGGDINEENIAQYAIGQNKIKN
EEGLNYVSN
>MS0062 mglA, MglA protein
MQVPYLEFDNVSKSFPGVKALQNISFKCYEGKVHALMGENGAGKSTLLKI
LSGNYLPSEGKLSIGGRQLVFRNTKEALLAGVAIIYQELNIVPEMTVAEN
LCLGQLPHSFGIVDKAELIERTQQYLDKLDLNISPNTPLKELSIGQWQMI
EIAKALSRGAKIIAFDEPTSSLSAPEIEKLFSVINELRDEGKVILYVSHR
MEEIFRISDEITVLKDGQFVETFSDLSKITNDDLVRSMVGRNLGDIYHYR
PREVGDVRLKIAHLSGEKLQGDFSLTVRAGEVLGLFGLVGAGRSELLKVI
FGADPCVSGSIELDGKTLSIRSPKDAIEQGIVLCPEDRKKEGIVPTASVG
ENINISARRLHNFFKFIINDKWEKKNAEKQRQQMNVKTPSIEQLIVNLSG
GNQQKAILGRWLSEDIKVLLLDEPTRGIDVGAKSEIYDLIFKLADQKLAI
IVVSSDLPEVIGVSDRIMVMRAHQITGVVERADATEEKVLKLAMVESLNV
GD
>MS0839 mgsA, MgsA protein
MPLSAKKPRILIQGGYMETTFRHVAAQKHIALVAHDHCKEDLINWCQKNV
HHLQNHQLYATGTTGHLIEKATELKINSLLSGPMGGDQQLGALIAENKID
VMIFFWDPMNAVPHDPDVKALLRIAAVWNIPHAMNIASADLLINSPLINR
EIELRIPDYQTYLQKRLK
>MS1793 mhpC, MhpC protein
MNTMTLVFLHGLLGTKSDWRKIIENLPHFRCVSLDLPFHGEHKFTEANNF
EQCADFISHQIKSAVGNQPYFLVGYSLGGRIALYYALQSQCEKGNLQGLI
LEGANLGLTCDEARKVRWKNDEFWAQRFITESAESVLNDWYQQPVFAHLN
AQQRADLIEKRVTNCGKNIGKMLEATSLAKQPYLGDKVRESTLPVYYLAG
EKDQKFRQMAVQEKLNLQLIANAGHNAHLENPVEFSQKLTALLRNHKIKK
TDNL
>MS0862 mhpC, MhpC protein
MKLLNYQFHQLKQPSNQATMVFIHGLFGDMNNLGIIARAFSDAYNILRLD
LRNHGQSFHADEMNYSLMAQDIIHLLETLQLTKVILIGHSMGGKAAMKTA
ALRPDLVEKLICIDIGPIAYAHRWHDDVFAGLFAVKNAQASSRQEAKPIL
ASYIKDEGVIQFMLKSFDGNAAEKFRFNLSALFNNYGQIMGWEEVFFDKP
TLFIKGGNSDYLQSGYGTRILAQFPQASSFTINGSGHWVHAEKPEFVVRA
IQRFLESN
>MS0882 mhpC, MhpC protein
MMLYETKGNGEPIIFLPGLFAGGWIWNSVVRNIQDKGFKTFTFTDPIPVA
FEGSQQKALTELDTITENCSTPVYLVGNSLGALIALHYAFQRKDRVKGVI
MSGAPGQLEMEAGVSLDELKTGKDKYTTLLGSRIFYDQSKIPPHGIEEVK
YLFGTEKIFRNIVRWLYFSRKYDVPDVLQKISIPIDFIWGQYDLITPIEP
WIDIAKNFPQTSMTIIKDSGHSPMVEQPELFTEALLRKISSGRTHIK
>MS2156 mhpC, MhpC protein
MIMTISALDFFKRDVTLPNQLDGLPHKLSDVTGLQIGSFKTNDGVSLNYW
KAGSGEPLVFVPGWSSNGAEYINLIHLLKDKFTVYVLDQRNHGLSDKVKF
GNRISRFAMDLHEFFNAENIEKAHLCGWSMGCSVIWGYVDLLGTSRVEKF
VFIDEAPSIYCHSNWTEEERINAGAFTTSAEMMIDMYYGRGTCNMLQVNT
DLFNFYNTIDALAFENSMALCDQVCPHDKDALEQVLFDHILNDWRDVLIN
KIDKPTLVVSGEHSNWVESQRWIAQTVPNSEDLIYGKHEHGDHFLHLKMP
QKFAGELTEFLNRMS
>MS1517 miaA, MiaA protein
MNQKPTAIFLMGPTASGKTDLAIQLRQELPVEVISVDSALIYKGMDIGTA
KPSKEELALAPHRLIDIIDPAESYSAANFRSDALREMADITEQGRIPLLV
GGTMLYYKALLEGLSPLPQADEKVRSKIEEKAQKFGWATLHKELSLIDPV
SAARINPNDSQRINRALEVFYISGKSMTELTEQKGEQLPYHILQFAIAPE
DRAILHRRIEMRFHKMIESGFKQEVERLYHRGDLHIDLPSIRCVGYRQMW
EHLRGDYDLDEAVFRGICATRQLAKRQITWLRGWKYPIQWLDSLKNSENK
EIIKRAFDLTMQNG
>MS1690 miaB, MiaB protein
MTQKLHIKTWGCQMNEYDSSKMADLLQSTHGLELTEEAEQADVLLLNTCS
IREKAQEKVFHQLGRWKELKKKNPNLVIGVGGCVASQEGEHIRERAPYVD
IIFGPQTLHRLPEMINQIRAGEKAVLDISFPEIEKFDRLPEPKAEGPTAF
VSIMEGCNKYCTYCVVPYTRGEEVSRPLDDVLFEIAQLAEQGVREVNLLG
QNVNAYRGPTHDGGICSFAELLRLVAAIDGIDRLRFTTSNPIEFTDDIID
VYRDTPELVSFLHLPVQAGSDRILTMMKRGHTAIEYKSIIRKLRAVRPNI
QISSDFIVGFPGETNEEFEQTMNLIQQVNFDMSFSFVYSARPGTPAADMP
DDVTEEEKKQRLYILQQRINNQAAQFSRAMLGTEQRVLVEGPSKKDIMEL
TGRTENNRIVNFAGTPDMIGKFVDIKITDVFTNSLRGDVVRTEDQMGLRV
VQSPQAVINRTRKEDELGVGRFGG
>MS2247 miaB, MiaB protein
MRYFMSYSAPNIGFVSLGCPKNLVDSERILTELRTDGYNIIPTYENADLV
IVNTCGFIDSAVQESLEAIGEALEENGKVIVTGCLGAKENQIREVHPKVL
EITGPHSYEAVMEHVHKYVPRPERNPYTSLVPAQGVKLTPKHYAYLKISE
GCDHKCTFCIIPSLRGDLDSRPITQVLDEAKRLVDSGVKELLVVSQDTSA
YALDQSKENQNKTVFWNGAPIKNNLITLCRQLGTLGAWIRLHYVYPYPHV
DDLIPLMAEGKILPYLDIPLQHASPKVLKAMKRPGSVERVLERIQKWREI
CPELTLRSTFIVGFPGETEEDFQMLLDFLQEAQLDRVGCFKFSPVEGAVA
TDMADQVPEEVKEQRFQRFMELQQQISAQRLQQKIGKTLPVIIDDIDEDG
IIGRSMADAPEIDGVVYVDNRSESAVKIGDIIQVAITNADEYDLWGTC
>MS2355 mipB, MipB protein
MTTQLDALRNMTVVVADTGDIEAIKKYQPQDATTNPSLILSASALPQYAS
LIDDAINYAKAKSTDKAQQLIDAEDKLAVNIGLEILKIVPGRISTEVDAR
LSYDTAATVEKARKLIKLYNEAGINNDRILIKVASTWQGIRAAEILEKEG
INCNLTLLFSQAQARACAEAGVYLISPFVGRILDWYKANTDKKEYVPNED
PGVISVTSIYNYYKQYGYQTVVMGASFRNIGEITELAGCDRLTIAPALLK
ELQESNADLPRKLDYKGEVKPKPAPLTESQFYWEHNNDPMAVDKLAEGIR
KFAADIEKLEAMLSTKL
>MS1459 mltA, MltA protein
MNFIRTFIMFFTKNFVLKAATVLAATVLAACSSNTNAVKKTTESSVDPAQ
FGAKYKGRSYSTSLFSSANVDNYSGVVNQGDFLTQLSNVRAYSTGISSTY
YDNYNKISQWVLAGADVNQLANYGIRPQVMSGEDGYQNVLLTGYYSPVIH
ARYSAQGKYQHPIYAMPSQKRFTRSQIYAGALEGKGLELAYSDSMLDNFL
LGVQGSGYVDFGNGNLNYFAYAGQNGYKYQAVGRLLVEDGEIPKEKMSIQ
AIRDWAERNPSRLQSLLERNPSYVFFKNDPAGKVKGSAGVPLVPLASVAS
DRSIVPSGSVLLVEIPQIDNEGNWTGEHRLHLMVALDVGGAVKGNHFDLY
QGIGDKAGHISGLLKHYGRVWVLR
>MS0901 mltB, MltB protein
MKIKYKFLALTACLMLAGCSSNNNKSAVNSAEDLSALPATAVYSNARTLN
NFDDYVQFLKRKAAGQGVSSATLTTQNNIRYIDSAVRLDQKQAGNAARRQ
GLPPLPPNPNGVTNYLTKHLTQAKVDKAEDNYYDVQVPLQKASSAFGVQK
EFILALWGMESSFGYYQGDYDVLSVLATLAFDGRRETLFSKEFINAMKML
DAGHLNRSKMLGSWAGAMGQTQFMPSSYLNYAADGDKDGTKDIWSNEYDV
FASIANYLHTVGWDDTLPWGIEVSLTTPLPLSLAGTEKEKARSLNDWQAQ
GVLPKNMFDADKLKALSNADLWLVRPDKEVGRTFLVSNNYRTILDWNRSN
YYALSVGMFADRIKQTLGF
>MS0315 mltE, MltE protein
MIKVMKLKKFLVLLLIPFLYACSSDRSGNYDDAFAKDTNGLDLLTGQFSQ
NIDQIWGVNELLVASRKDYVKYTDSYYTRSHISFEEGQITIETLADANRL
HSAIVHTLLMGSDAKGIDLFASGDVPISSRPFLVGQVVDNFGRQINNIDV
ANSFASYLLQNRLQSRRLSNGRTVQFVSIQMIANHVNVRARKYLSLVRQA
SRRYGIDESLILGIMQTESSFNPYAISYANAMGLMQVVPHTAGRDIFKLK
GRSGQPSKSYLFDPANNIDAGVSYLWILKNEYLAGITNPTSMRYAMISAY
NSGAGAVLRVFDSDQEYAINIINRMQPEQVYRILTTVHPSSQARNYLLKV
DKAQRSYRRAR
>MS1565 mltE, MltE protein
MNFSRFALALFSFGAVMPVIAAEQSLSQQREIYQKINQLLSISQSENTQN
IAKALLDEMKDYPLYPYAEYKLISSNLANTDFAQIEAYLQRRKDFPLAKN
LTKQWVIQHQNNQDWQGILANQDKLPKDIVSQCALLQAKSPISPVIDTNN
QNALKSAVNSAQILTEKAEHPLPQAELEKLWLTGNSLPKACDPILDQWNQ
TGGLTADLIRRRAVLALEQGNSGLLTHLSAQTQDTGLQNWLKTLAAIQKT
PQKLKDPANPFNPDKLEPNTQNKRIAKALFPSFVRTIKDNEVGDPQRLLA
QFDGWAKRFNLTAEETTDWEIAVISQLFDSPNTLLQQWRDTELKNLKADK
LTERRIRMAIRNKEDIKPWLTLLSDKAKNADEWKYWTAKTLQRSTDKAEQ
NQANALLSSLLNQRGFYPMLATQELGRAYQINLRNEENLAKPTASSKPEN
PAPAKPSAQELTAQKYAAELSRIEELRILADTNNMNTEWRSLFARANFDE
QIALTEYARDKQWFDLQVEGTILAKAWNHISLRLPNAYPQWFDLLLKNKK
IDRTFAMAIARQESAWKPYVTSSADARGLMQLLPSTAKLTAQKAGLPYSN
ANQLYDPFNNIMLGTAHLQELQDKYGNNRILISAAYNAGGSRVDQWLAKS
AGKLTMAEFVASIPFYETRGYVQNVLAYDAYYQLLQNKKAQIFANEEYNR
LY
>MS0577 mmcQ, MmcQ protein
MLRKISFRCSIKRQINKIGKNNMAKQDLKRHIFDYVLTQYGSEAEYLWKS
YPDFAVFRHQDNRKWYAIVMNVEKEKLGLAGSGKINVMNVKCSPEMLSLF
LAQEGFLPAYHMNKSHWLTIRLDGSVDKETVCFLLNGSFDLTATKQVKKK
LGIMRYSEWIVPANPKYYDVENELHEGKEIFWKQSNNVDVDDIVYIYVTE
PTAAIRYKCLVLEVNIPYKYRHEELQIKRVMKIRCLKEYDRKLFTRDKMA
QFGVSAVRGPRHMPYSLKREIDNLTNDEVSV
>MS0692 mmsB, MmsB protein
MKIGFIGLGIMGKPMSKNLIKAGHSLVVLDFNKAAVDEIVALGATSAATP
KEVAEQVEVVITMLPNSPHVKTVVSGENGLIEAQNTNYVFIDMSSIAPLA
SREIYAELEKKGIDMLDAPVSGGEPKAIDGTLSVMVGGKKDVFDKYYDVM
KAMAGSVVYTGDIGAGNVTKLANQVIVALNIAAMSEAFMLATKAGVDPEL
VYQAIRGGLAGSTVLDAKAPMVLDRNFKPGFRIDLHIKDLANALDTSHGV
GANLPLTSAVMEMMQSLRSAGDDKLDHSALARYYERLTGTEIKRF
>MS1981 mmsB, MmsB protein
MRTVKQHSSTIRRNIMSYSVAVIGLGSMGMGAAVSCVNAGLETYGIDLNP
AALEKLKAAGAKDVASNGDAFAKDLDAVVVLVVNAAQANAALFSETGIAK
KLKPGTAVMISSTMAAADAQAISQKLTELGLIMLDAPVSGGAAKAMKGEM
TVMASGSKEAFDKLQPVLDATASKVYNIGEAIGLGATVKIVHQLLAGVHI
AAGAEAMALASKAGIPLDVMYDVVTNAAGNSWMFENRMKHVVDGDYTPLS
MVDIFVKDLGLVNDTAKSLHFPLHLASTAYSMFTEASNAGFGKEDDSAVI
KIFSGIELPKKGGN
>MS1021 moaA, MoaA protein
MQSIPIKNVGTNRLVDTFQREYYYLRLSVTDVCNFKCTYCLPSGYQPPVQ
KESFLSLDEIRRIVGAFAAMGTEKVRLTGGEPTLRKDFLAIVETISALEG
IKKVALTTNGYRMEKDVERWKKAGVSSINVSVDSLDPRQFYSITGENKFH
QVMKGIERAFEIGYEKIKVNSVLMKNLNDQEFDRFKNWVKDKPIQMRFIE
LMQTGEMDQFFNRYHLSGQILAEKLLKEGWILRQKDRTDGPAKVFSHPDY
LGEIGLIMPYEKNFCASCNRLRVSAKGKLHLCLFGEEGVDLRDLLVSDEQ
QVILQSRLYAALQGKREHHLLAQGNSGIRTNLASIGG
>MS0425 moaB, MoaB protein
MTALSQTKLKIGLVSVSDRASQGVYQDQGIPELQAWLEAALTEEFDVETR
LIPDEQAEIEKTLIDLVDNRQCHLVLTTGGTGPAKRDVTPDATLAVADRE
MPGFGEQMRQVSLQFVPTAILSRQAGVIRKDSLILNLPGQPKAIKETLEG
VKDKEGNVLVKGVFAAVPYCLQLISGIYVDTKPEVIESFRPKSARRC
>MS1022 moaC, MoaC protein
MTEFTHINSNGEANMVDVSNKRETVREARAEAFVSMSAETLAMIVSGEHH
KGDVFATARIAGIQAAKRTWELIPLCHPLLLSKVEVKLTALLDTNQVRIE
SLCKLTGKTGVEMEALTAASVAALTIYDMCKAVQKDMVISNVRLLEKTGG
KSGHFKVE
>MS1023 moaD, MoaD protein
MLKVLFFAQTRELVGVDQIDVEAAFSTAEALRAHLAEKGGKWALALEAGK
LLVAINQTLSSLDSSIRDGDEIAFFPPVTGG
>MS1024 moaE, MoaE protein
MPFFRLLPGDKMSDIKIAVQEAEFDQNSEYRWLSQSDSVGASVIFVGKVR
DLNLGDEVSSLYLEHYPAMTEKALNEIVDEAKSRWDIQRVVVIHRVGLLH
TGDEIVLVGVSSAHRGDAYHANEFIMDYLKTKAPFWKKEKTDKGERWIES
RDSDQQAAEKW
>MS2057 mobA, MobA protein
MTITISAVILAGGLGRRMGGVDKGLQFWRGKPLIETVYQRLHRQIERISI
NANRNREIYARFGVPVFSDRLAGFQGPLSGILTALERATTDYVLFVPCDC
PNFPLNLLEKLKSAVEFSQISLAYAHDGERDHPTFCLVSTQLKNALADYL
AGGERRMLYFMQMHGAVAVDFSTEKQGFININNLADLNSP
>MS2056 mobB, MobB protein
MIFDEMNKMTTQNSLPMLGITGYSGSGKTTLLEKLVPQLTGLGIRVAVIK
HTHHDVNIDKPGKDSWRMKEAGASQVIMTCDRRWAIMTETRQPVSLSYLA
GQFDSALTDLVLVEGFKQEPIAKILLHRKGMEKTLPELDEYVIATATDYP
LVQNLPDLNINDVPAVARFIQRWYEEKCGQKNENFAK
>MS1342 modA, ModA protein
MSKIKNIFIGLTLSCAVSLFAEAKVTVFAAASMTNALEEVASDYKKVNPN
EDIVFSFASSSVLARQITEGAPADIFISADQKWMDFLAEKDEIVKDSRVD
LVGNKLVMIAPRTSKIEKVDLTNDKWQTALDKTYLSVGDPDHVPAGIYAK
TAFTYLNQWAALENKLARAKNVRDALRLVEQGESPLGVVYATDAAISQKV
RVVAIFPAESHPPVEYPAAIVKNKDNKESKAFFNYLKSDKAKTVFEKFGF
SAK
>MS1344 modE, ModE protein
MDNTEILLTIKLHQRLFVDPKRIRLLKEIAHCGSINQAAKNAKVSYKSAW
DHLEAMNAISPKPLLERNIGGKNGGGTQLTNYARRLLQLYDLLEKTQEKA
FQILQDESIPLNNPLSATARFSLQSSARNQFFGKVTKLELKNGHCMVSIQ
IEGLNRPLVASITEKSAVRLGLVPGKEVMLMIKAPWIKTQLEEPVDKENQ
FLAEVRSVSDKGGEKEIILSIGENPEFCATIEKTVDVAVNQKRWLYIDPE
QIVLASL
>MS0833 modF, ModF protein
MSKLAMRNAQFELHQNNLLSIPHFEIHSCDFWVVMGYNGSGKTAFSLALE
KKLSLYNGEYQNQFDSISLLSFEKQQKILEQTFKDLNNDEAPDDFGKTAR
EIILNGTDKNNLCEFYAQYLHIEKLLDRPFTKLSTGESRKVLLAQALVSE
PDLLILDDPFEGLDRQSVQDWLKLLESLKGKLALVLIVNRFSDIPSIADF
VAILDNKQMILAGKRQDIEGQSVYQQLKYAEDAEDVPLPGSASPLIRLPE
GQNPFELKNVTIQYGDKVILNNLNWTVKAKQNWWIKGPNGAGKSTLLSII
TGDHPQAFANHVVLFGKRRGSGETLWDIKQKIGYVSSQLHMDYRVNSTAI
DVIISGFFDSIGVYRQVPDALRIKAMQWLSRLNMDSLAKKPFRSLSWGQQ
RLLLITRAMVKHPPVLILDEPLQGLDGINRKLVKSFIEQLVSNSETQLLF
VSHQDQDAPNCITHLFEFIPQEEGYCYQQKTLESADIME
>MS1005 moeA, MoeA protein
MNSLLSLEQALEKMLATLPSPSLDNLETLPISQAQNRICAQDVMSPINVP
SFDNSAMDGYAVRLADLEQSPTLSVAGKSFAGIPFTDEWKPLSAVRIMTG
AMIPQGADAVVMQEEVSVNEDGSVTFEKLPKPGQNIRRIGEDVKQGDVVL
SVGAELNTVSLPLIASLGIPEIKVFPKLKIAVLSTGDELVPVGQPLNEGQ
IYDTNRFAVKLMLEKLHCEVLDFGILPDNEAEFEKAFMHAQEQADLIITS
GGVSVGEADFTKTVLEKLGEINFWKLAIKPGKPFAFGKLPNAWFCGLPGN
PVSALVTFYQLVQPVIAKLSGAANYKRPQQFPAIAATNLKKSPGRLDFQR
GLYRLNAQGQLEVEPVGLQGSHVFGSFVKSNCFIVLERERGNVTAGETVT
IEPFNHLLG
>MS1312 mrcA, MrcA protein
MSTEQDKSAQSNTSNKPAKTLQNKKRRFTLFMAKLAFTGACLIGAYGIYL
DGQIRSKMDGQIWRLPAEVYSRIESIRLEDNWSLDKIKQTLLENDYRQTT
LVAAPGDFKIEDNSIVLIRRAFPFPEQAEAQRVFRLRFTDNKLSVIEDLI
NLKAINEFKLSPKLIAMLQSEKEERLAIPLQNYPRLLIDALLLTEDRNFY
QHEGISPLGMARAMITNIRAGHTVQGGSTLTQQLVKNLFLTNERSISRKI
HEALMAILLDFRYDKNTILETYLNEIYLGQSGDIQIHGFELASHFYFGLP
IREISLDQIALLVGMVKGPSLYNPWRNPGYALERRNVVLRLLLEHKIIGQ
ELYDMLSKRPLGVQEKGKITRNYPSFIQTLQAELRDNLGENRENKLLGAR
IFTTLDPKQQRAAEQAVVKATGELQLKTKNPDLQAAMIIADYKSGQILAI
VGGTQIQYAGFNRAFMAKRQIGSLVKPSVYLAALSEPDKFRLNTPLNNRP
ITITIKGSPPWSPRNYDNRFSGSVMLIDALARSLNVPTVNLGMKTGLKKV
IETQQAMGWDKVNIPKVPSMLLGSLQVSPYDVTKLYQTIANNGGKVRLTT
IQSITDRQGNLLYRHDNNAEQVVPEEAAYQTLYAMQQVVDRGTARSLMEN
FGQYHLAGKTGTTNDARDTWYVGIDGQNLATVWVGRDDNGETKLTGASGA
LYLYKDYLTRVPVKTLKLTKPKGIKFVGINSYGGWNCDNPIRTIPVWAGK
DQVFCAPTPKPAPVEQAQPAVVEEAAPAQ
>MS1567 mrcA, MrcA protein
MRLKTLKFPFVNKKNTRTFKKKCGRFLSYFIGLTVALTFLFRFVPIPFSA
YMAEQKLAHIIQLDFDYKVNYDWISLEDISPYMQLAVIAAEDQNFPNHGG
FDWNAIKSAIKYNEKSSRIRGASTISQQTAKNMFLWHGQSWIRKGIEVPV
TFMLETLWSKKRILEVYLNIAEFGNGIFGVEAASRYYFKKPAKRLTQSEA
ALLAAVLPNPIIYKANRPSLLVRKKQAWIIRQMNSLGLNYLKKL
>MS1975 mrcA, MrcA protein
MKIVKLIFSTLLTIVILGCVAGGLLYFHIKSQLPDVQSLKTVELQQPMQI
YTADEKLIGEVGEQRRIPVKLENVPKMLINAILATEDSRFYEHHGLDPVG
IARAVSVAIANKGASQGASTITQQLARNFFLTPEKTIIRKTKEAILAIEI
ENTLTKNEILELYLNKIYLGYRSYGVAAAAKTYFGKNLADLTLSEMAIIA
GLPKAPSTMNPLYSLKRSEERRNVVLGRMLEMQFINKEQYDEAVQEPIKA
SYHGAQIEFRADYVTEMVRQEMVKRYGEESAYNSGFKVYTTILSQDQAQA
QKAVRNNLIDYDMRHSRYRGATPLWQSSETPWENNKIIDTLRKLPNSEPF
LPAVILSVAKEGTELLLASGEKMTLNAAAMRWGGRNVSLKTGEQIWIRQR
DNNEWVLGQIPEANSALVSLNSDNGAIEAIVGGFSFEQSKFNRATQSMVQ
VGSSIKPFIYAAALEKGLTLSSVLQDTPISIRKPGQAEWRPKNSPDRYDG
PMRLRVGLGQSKNMIAIRAMQTAGIPYVAEFLQRFGFKREQYFASEALAL
GAASFTPLEMARGYAVFDNGGFLVDPFIINRIVDNSGKDIFIANPKIACT
TCDEMPTIYGQTTDKVDGFKENDSVNADGNLAQTDENTNGEETDQNGENN
DVPELQNQGGTINEDALNLMVEGKTDSSQVQYAPRVITGELAFLIRSALN
TAIYGEQGLGWKGTSWRMANEIKRKDIGGKTGTTNNAKVAWYAGFGANLT
TAVYVGFDDNKRNLGKGEAGAKTAMPAWINYMKFVLEDVPERVLPTPANI
IEKSIDLGSGLLSKGGGRTEYFIKGTEPKRAFVQERGYYVPEGLPFQTPS
ASEYVPIGQPAPAAPAPSRKELF
>MS0590 mreB, MreB protein
MLFKKIRGLFSNDLSIDLGTANTLIYVKGQGIVLDEPSVVAIRKDRVGSL
KSIIAVGKDAKMMLGRTSNNIDAIRPMKDGVIADFFVTEKMLQHFIKQVH
SGNFLRPSPRVLICVPAGATQVERRAIKESAIGAGAREVYLIEEPMAAAI
GAKLPVSTPTGSMVIDIGGGTTEIAVIALNGVAYSSSVRIGGDRFDEAII
AYVRRTFGSIIGEATAEHIKQEIGTAYIQDESEVKELEVYGSNLAEGAPR
AFRLTSHDVLEAIQQPLDGIVTAMRTALEECKPEHAADIYERGMVLTGGG
ALLRNIDVLLSKESGVPVVVAEDPLTCVARGGGEALEMIDKHGGDIFSED
>MS0592 mreC, MreC protein
MKAIFTKAPSLGLRLVLAVMLSVGMILFDGQTNIMIQTRNFIDTAVGGLY
YLANTPRTVLDNVSDNLVDTNKLQIENKVLKQQLREKNADLLLLDQLKVE
NQRLRLLLNSPLRTDEYKKIAEILTAETDVYRQQVVINQGRNDGAYVGQP
VIDEKGVIGQIISVGEAASRVLLLSDVTHSIPVQVLRNDVRVIASGTGRT
DELTLDNVPRSVDIVKGDLLVTSGLGGRFPEGYPVAVVENVSRDGSNYFA
TVSAKPLASLERLRYVLLVWPAGDDIHKARAASPEDVRNAVKQRLANTAS
EQKKIPVTEDDATKAPVQLNNSEENIPSPENLPEMNRNDTQVDPELKEHR
EED
>MS0593 mreD, MreD protein
MKGNFFVQLFALLAIFIVALVLEISPWPAGFHSFKPAWLVLALTYWVLAL
PTRINIGTAFIFGVVWDVLLGTVLGVHALVLSCFAYLIARYHQILRNLSL
WQQSLLIVLLVFFVRLGVFLLELFIHSAEFDWKEIFGALISGLLWPWVFL
LLRKIRRQLGLH
>MS1632 mrp, Mrp protein
MSIIYTDNLSAGQQAQIQTLFQQYRHPSLKKDLIALSAVKKAEKGGDTLR
IELSMPFPWNSAFEQLKADLSDKLLSATESKNIKWQLTYQIATLKRANNQ
PAVKGVKNIIAVTSGKGGVGKSTVSVNLALALQAQGARVGILDADIYGPS
IPHMLGAPDQRPTSPDNQHITPIQAHGLFANSIGFLMDEENATVWRGPMA
SSALSQLLNETLWPDLDYLVIDMPPGTGDIQLTLSQQIPVTGAVVVTTPQ
DIALLDAVKGISMFNRVSVPVLGIVENMSMHICSNCGHHEAIFGTGGAER
IAQKYHVEMLGQLPLHICLREDLDKGTPTVVSNSNQEIRDAFMQLAEKIG
YELYFQGAVIPSEIMFREVK
>MS2196 mscL, MscL protein
MIAINFRRIFMSFMKEFREFAMRGNVVDMAVGVIIGGAFGKIVSSLVGDV
VMPVLGILTGGVDFKDLKFVLAEAVGETPAVTLNYGLFIQNVFDFIIIAF
AIFMMVKGINKLKKPVEEAPKGPTSEELLSEIRDLLKK
>MS2333 mscS, MscS protein
MAAEEQAKEVAQQVDVVAETSKVIDKVSNMDLNAVLHDWVIPYGTKILLA
IAIFVIGKMLARGISKLLGKAALASTKDEMLQSFVTSISYFLFLLIVVIA
SLSQLGINTSSLVALIGAAGLAIGLSLQNSLQNFASGVMLLIFKPFRKGD
LIETGGMTGVVEEMGLLVLELRTGDNKTVLIPNGKVFSDSIVNYSDNKTR
RIDFTFDVSYESNLKEAKDVVARILADNELVLKHPAPIVAVGALAANCVQ
LVVRPWVKTADYWTAYWGITESVKLEFDKAGIVIPYNQMDIHISGNTASE
LNNELNK
>MS0411 mtlA, MtlA protein
MLSANAKVKIQSFGRFLSNMVMPNIGAFIAWGFITALFIPTGWFPNEMLA
KLVGPMITFLLPLLIGYTGGKLVGGDRGAVVGAITTAGVIVGTDIPMFLG
AMIAGPTGGWAIKSFDKWADGKIKSGFEMLVNNFSSGIIGMILAILFFWV
VGPAVKIISDWLAAGVDVLVNAGLLPLTSIFVEPAKILFLNNAINHGIFS
PLGIQQSQEFGQSVFFLIEANPGPGLGVLLAYMIFGKGSAKQTSGGAAII
HFFGGIHEIYFPYVLMNPRLILAVIAGGATGVFTLVLFNAGLQAPASPGS
IIAVLAMTPKTSFLGVITSVIAACAVSFVVASFFVKLQKEDESGKLEEAQ
AASKAMKSNTSQQVTNYDGLKKIFVSCDAGMGSSAMGASMLRKKINDAGL
PIEVANCAINDLPEDARLVITHQDLTLRAKKQVPNAMHFSLTNFLDNKFY
DSLVNDLKANFDEKAPVAQAKEGEIEVNGTTFSLQPEQIFLGLKANDKFA
AIRFAGEQLVKAGFVQPSYVDAMFEREKLVSTYLGEGVAVPHGTIEAKDA
VLKTGVVVCQYPEGVRFTDEEDGVAKLVIGIAARNNEHIQVVSAITNALD
SDEAIELLTSTNDVNKVLELLKA
>MS0529 mtlD, MtlD protein
MKLLNRTNFPGRQHPTKIIQFGEGNFLRAFIDWQIDILNEKTDLNAGVTI
IRPINTDFPPSLNTQDGLYTTIIRGLDENGNKVKESRIIRSVNNEINIYQ
SYDEYLQLAHNLEIKFIFSNTTEAGISYHADDKFDDRPQVSYPAKLTRLL
YERFSVVNGDKDKGFILLPCELIDYNGEQLKELVFKYAKEWNLSAEFIQW
LETANTFCSTLVDRIVTGYPRAEAAELEAELGYKDTFLDTAEHFHLFVIQ
GPKSLAQLLRLDQVDLNVLIVDDIRPYKERKVAILNGAHTALVPVAYMAG
VNTVGEAMNDAELCRFVKSTMDKEIIPVLSLPQDELQQFADAVIKRFQNP
FIQHQLLSISLNSMTKYRTRNLPQLISYVEKFGKLPPHLTFALAALIAFY
RGERDGQTIPLQDDEHWLVNFKTWWQEQAAGEISLFQLVHHVLKQEAHWE
QDLTTIPQLVETVTQQLEAILEKGMRQALREYCVD
>MS0410 mtlD, MtlD protein
MKALHFGAGNIGRGFIGKLLADSGMQVIFADVNDSVIDLLKSRRSYGVKI
VGDSINTVERVTQVTGVNSKDETAIITLFNEVDLVTTAVGPNVLKIVAST
FAKALEARIAGGNTKPLNIIACENMVRGTSFLKEQVFTHLNPDYKDKVEQ
LIGFVDSAVDRIVPPVKPDAEDPLLVTVEEFSEWIVDQTQFKGAIPDIKG
MELTDNLMAFVERKLFTLNTGHAVTSYYGKFKGYKFVKESIEDESVKAFV
KSVMQESGAVLIKRYGFDPQAHAAYIEKILKRFANPYLVDDVDRVGREPL
RKLSYNDRLIKPLRGTIEYGLPNDNLIRAIATALSYRNENDPQALELAKS
LAEAGVTQTIKKYTELQDENVIARIAKAYETL
>MS1222 mug, Mug protein
MSVQIIETHPFPPVLPARATVMMMGTFPPKSEKRCMEFHYPNFQNDMWRI
YGLIFFEDKEYFQVPGEKRFDAERIKAFLHERGIASCPTVIKAVREQGNA
SDKFLKIVEPVNLTQVLQKVPNVRWLFTTGGKATEALFSLVPELKLKEPK
TNEYIDFPFQGHELKLYRVPSTSRAYPLSLEKKAEAYRKFFELSGILK
>MS1084 mukB, MukB protein
MSEELELESEFLPEEKSETIVPATVLTQSAGIERGKFRSLTLINWNGFFA
RTFDLDELVTTLSGGNGAGKSTTMAGFVTALIPDLTLLHFRNTTEAGASG
GSRDKGLHGKLRPGVCYAVLDAVNSRQQRILAGVRLQQVAGRDKKVDIKT
FSIQGLEISQNPTAVLTETVSQRQARVLSLTELKDRIEEQGAQFKQYHSV
ADYHAMMFELGMIPRRLRSSSDRSKFYKLIEASLYGGISSAITKSLRDYL
LPENLGVRKAFQDMESALRENRMTLEAIKMTQSDRDLFKHLITETTNYVA
SDYMRNANERQGNIETALSFRKEWYAAKSEQDLSQHRLIDLSREAAELTE
NEKALEIDHQSASDHLNLVLNALRHQERIERYQEDVNELTEKLEEQKIVV
ENANEQLEESQLQFETLETEVDQIRGQLADYQQALDAQQTRALQYQQAIQ
ALEKAKALCGLADLSVKNAEVYHEEFEAQVETLTDRVLQLEQKMSISEAA
KTQFDKAYQLVCKIAGEIPRSAAWDSAKELLREYPTQKLQAQQTPQLRAK
LHELEQRYQQQQSAVKILKDFNQRAGLSLETADELEDYHAEQEALIENLT
AEFSEQVETRSTLRQKLEQLTALFEEKARSAPAWITAKSALERLTEQSGE
QFEDNQDVMNFMQAQLEKEREFTMQRDQLEHKRQQLDEQISRLSQPDGSE
DARLNVLAERFGGVLLSELYDDVAIEDAPYFSALYGPARHAIVVRDLNAV
KEQLAHLDDCPDDLYLIEGDPAAFDDSVHSAQELAQGVVVQVSERELRYS
KFPEIPLFGRAAREKYLTELEAERDKIVEQYAQRAFDVQKCQRLHQQFSQ
FVGLHLALAFQPDPEQQMREINQQRNEINRELTALSTDEQQLRIKLDNAK
EQMQLLNKLIPQLNVIADESLSDKVEECREQLDIAEQDEIFIRQHGMTLS
QLEPIANTLQSDPENYERLKDDLYQAIDMQKQAQQKAFALADVIHRQAHF
SYEDTVKTETNDLNEKLRVRLEQVQAKREQQRDQVRQKQQQFAQYNQVYI
QLQSSFETKNQMLKELMDEVGELGLTVDENSEQRARVRKDELHHQLSTSR
QRRSFVEKQLTLIESEAENLTRRIRKAERDYKQQRELVVAAKVSWCVVLR
LSRNSDVEKRLTRREFAYLSADELRSMSDKALGSLRTAVADNEYLRDALR
ISEDSRKPENKVRFFIAVYQHLRERIRQDIIKTDDPIDAIEQMEIELSRL
TDELTGREQKLAISSESVANIMRKTIQREQNRIRMLNQGLQNISFGQVKS
VRLVVNIRDTHAMLLDALSGNQEEYQDLFNDNRMTFSEAIAKLYQRLNPH
IDMGQRTAQTIGEELLDYRNYLDLQVEVYRGADGWLQAESGALSTGEAIG
TGMSILLMVVQSWEEESRRIRGKDIIPCRLLFLDEAARLDGKSISTLFEL
CERLDMQLLIAAPENISPEKGTTYKLVRKISGNQEHVHVVGLRGFGAKE
>MS1085 mukE, MukE protein
MNENLQELIPTKLAAAIANPLFPAVDSQLRSGRHIGQEYLDNFAFLADFQ
NELDMFYRRYNVELIRAPEGFFYLRPKATTLIARSVLSELEMLVGKVLCY
LYLSPERLAQQGIFSVQEVYDELLNLADESKLLKAVNQRSSGSDLDKQKL
AEKVRAAVNRLRRLGMIHTVGEQNSGKFTISESVFRFGAEVRSGDDPREA
QLRLIRDGEAATPDSLSQEKSAVKNDEEIEDELDEGLGEEE
>MS1086 mukF, MukF protein
MLETSQTIPELVSWAKEREFSLNLPTERLAFLLAIAIYNAERFDGEMVES
DLVDIFRHVSNEFEQSKETIATRANNAINELVKQRFLNRFSSEFTESLSI
YRLTPLGVGVSDYYIRQREFSALRLSVQLAIVANEIQRASELAEEGTAKQ
EDEYYWRRNVFAPLKYSVAEIFDSIDLSQRIMDENQQSIKEEIAELLTKD
WQAAIASCERLLDETSGNLRELQDTLNAAGDKLQEQLLRIQDCVIGRDDL
YFIDQLITDLQAKLDRIISWGQQAIDLWIGYDRHVHKFIRTAIDMDKNRV
FSQRLRQSIHNYFDMPWYLWTAQAERLIDLRDEELALRDEDALGELPEEL
EYEQLSDLHDQIVDYMQNLLIAQRERNQPIDLSLVLKEQLEGYPLARHFD
VARIIVDQAVRLGMASADLSGTYPQWQEINNRGAEVQAHVIDEYK
>MS1707 murA, MurA protein
MEKFRVYGQSRLTGTVDISGAKNAALPILFASILAEEPVILTNVPDLKDV
ETTFKILRKLGVNVECAEEPGKVLIDAGNINQFVAPYELVKTMRASIWAL
APLLSRFHEGQVSLPGGCTIGARPVDMHISGLEKMGAAIELDEGYVKATV
NGRLKGARIYMDKVSVGATLSIIMAATLAEGKTVIENAAREPEVVDTAIF
LNAMGAKISGAGTDTISIEGVERLAGCRHRIVPDRIETGTFLVAAAISGG
RITCRGTKADTLEAVIEKLREAGMQIDITEDSITLDSLGQRPKAVNIRTM
PHPGFPTDMQAQFTLLNVVAEGTSKITETIFENRFMHIPELIRMGAKAEI
EGNTAICHGVEHLSGAQVMATDLRASISLVLAGCIASGETIVDRIYHIDR
GYERIEEKLRGLGARIERFSD
>MS0028 murB, MurB protein
MQSLKPFHTFAVPAQAKNIVEITALEQLQQVWDGCRQENQPVLFLGQGSN
VLFLKDFAGTVLINRLMGIEHNEDEQFHYLHVNSGENWHNLVEWSLSQSI
GGLENLALIPGCAGSAPVQNIGAYGVEFKDVCDYVDVLDLNQGKQFRLTN
AECEFGYRESVFKHKYAQGFIVTAVGLKLAKAWQPVLKYGTLANFDKSAV
GFQQIFDEVCAVRRAKLPDPKEFGNAGSFFKNPVISAGHFALLQQEYPNI
PNFPQDDGSVKLAAGWLIDQCQLKGYQIGGAAVHQNQALVLVNKGDATAS
DIVELAHHVRQSVAAKFDVYLSPEVRFIGELGEVNAEQAIS
>MS1614 murC, MurC protein
MKHIHILGVCGTFMGGLAIIAKQMGYRVTGSDTNVYPPMSTFLQEHNIEI
IPHFEVSQLQPAPDMVIIGNAMKRGNPCVEYVLENRLPYMSGPQWLHDNL
LCNRWVLAVSGTHGKTTTTGMLTWILEQNGLNPGFLIGGIAGNFGMSSRF
TDSPYFVIEADEYDTAFFDKRSKFVHYNPKTLIINNIGFDHADIFDDLKA
IQRQFHHMIRTIPASGRILSVATEQSVKETLDMGCWSEKQFLGKEQEWNA
ERITNDCSRFAVFHLGEKVAEVHWDIVGQHNMHNALMAIAAAYHAGVKIE
DACRALATFVNAKRRLEVKGEVGGVTVYDDFAHHPAEIQATLTALRDKVG
GGVRILAVLEPRSNTMKMGVHKDEIAPALVRSDYVFLLQPDNIPWEVVEI
ANKCVQPTKWTADLDKLVDFVVQEAQPTDHILVMSNGSFGGIHQKILDKL
ANK
>MS1666 murC, MurC protein
MINAKKEFQQRVRNMIPGMRRVHQIHFVGIGGAGMGGIAEVLLNEGYAVT
GSDIAESAVTNRLISLGAKIHFSHAASNVDNASVVVVSSAIKADNVEVVA
AHEKRIPVIQRAQMLAEIMRFRHGIAVAGTHGKTTTTAMISMIYAQAGLD
PTFVNGGLVKSAGTNAHLGCSRYLIAEADESDASFLHLQPMVSVVTNIEP
DHMDTYHGDFDEMKRTYVNFLHNLPFYGLSVMCADDPVLLELIPQVGRPV
ITYGFSEEADYRIENYEQTGFQGHYSVITPAGERIDVLLNVPGKHNALNA
TAALAVAKEEGIENEAILAALADFQGAGRRFDQLGSFIRPNGKVMLVDDY
GHHPTEVNVTIQAARKGWENKRIVMIFQPHRYSRTRDLFDDFVRVLSQVD
LLIMLDVYPAGESPIAGADSRSLCRSIRNLGQVDPILVTDTAELPEIMDR
VLQDGDLVLAQGAGNVSKLSRQLVELWTKA
>MS1669 murD, MurD protein
MTDYQGKNITVIGLGKTGLSCVDFLTAKKANVRVIDTRKIPAGAEQLDKS
IPLHTGSLNQQWLLESDMIVISPGLSVKTAEIQTALSAGVEVVGDIELFC
REAAKPVIAITGSNGKSTVTALVTEMGKAAGLSVGMGGNIGIPALSLLNE
NHDLYVLELSSFQLETTYSLKATAATVLNVTEDHMNRYADLEEYRQAKLN
IYHHCQTAVINGEDPLTKEDDKQSAQQQVSFAENNADYWLKTENGKKYLM
AKDKLILACDEIKLTGRHNHMNALAAIALAQAAGIKNSGILTALRTFPGL
AHRFQLAHMANGVRWVNDSKATNVGSTVAALTGLHIEGKLHLLLGGDGKG
ADFSELEKLINKPEIFCYCFGQDGAHLAKLSSQSQLFNTMEQAIETLRPT
LKPGDMVLLSPACASLDQFASFEKRGEEFTRLAKLSVAQ
>MS1672 murE, MurE protein
MRKLTALFGRDDRFDAIKLNRMTLDSRSVRTGCLFVAIKGHSVDGRQFIP
QAISAGASAVLKECDNADEHLQVSEQNQIPVISYYHLSEHLSDIADQFYK
APSQHLTLVGVTGTNGKTTVSQLLAQWAQLLGRKPAVMGTIGNGLFGALK
PAANTTGSAIEVQSSLADFVQQGADFAAIEVSSHGLVQHRIEALHFAAGI
FTNLSRDHLDYHHSMENYASAKKRLFSELSCQQKIINADDEIGVQWLREL
PDAVAVSCNPAYQPTQENWLKVTAVSFNSQGATITFNSSWGGAILTSRLI
GAFNVSNLMLVLATLLSLGYSLDELLKTVSQLTGVCGRMEMLHAAHKPTV
IVDYAHTPDALEKALQAARAHCTGRLWCVFGCGGDRDRGKRPLMAQVAER
FADYVIVTDDNPRTEDRHQIVQDIVAGFKRPESVNIVYDREQAIRTAVQS
AVENDVILIAGKGHEDYQIIGHTKHHFSDQEAVKKYLG
>MS1671 murF, MurF protein
MIKLNVKKIAQILKAKLIGEETLTIESVSTDTRRKMPNGLFFALKGEKFD
GHNYLAEAVAQGCTAVVVDHPCEIDVPQLVVKDTRLALGRLAAWLRRELE
PLTVAITGSCGKTTVKEMTAAILQRTAGDDEAVLFTEGNFNNDIGVPLTL
LRLTEKHEYAVIELGANHAGEIAYTAHLTMPDVALVNNVSAAHLEGFGSV
EGVAQAKGEIYSGLTPDGVAILNLDSNYAHYWGGDINDREFESFAYDHVG
ADYYAEKIMLSEYGSRFTLNTPKGAIKIELPYLGKHNVANAVAASALAMN
VGASLEDIKRGLENPSHVKGRLFPIQLSTNLLLLDDTYNANVASVKSAIS
VLSDYREAFRIFAFGDMAELGDETISCHQEVADFAKAANLDLVVTYGSES
AVVSKACGGVHFSNKEALIASLKEIISHQLKENEDIVLLAKGSRSMKMED
VINSLKDRFLC
>MS1667 murG, MurG protein
MAQRKKLLVMAGGTGGHVFPAIAVAQYLQKQGWDICWLGTKDRMEAQLVP
KHGIPIEFIQISGLRGKGIKALLGAPFAICRAIMQARKIILRQKPDAVLG
MGGYVSGPGGVAAKLCGVPVILHEQNAVAGLTNVWLSKIAKRVLQAFPTA
FPNAEVVGNPVRQDLFSMPDPEQRFAERTGKLRVLVVGGSQGARVLNLTV
PEMAARLTDKLEIRHQVGAGSVEKITALYEEKGALSADVKITEFIDNMAE
AYAWADIVICRSGALTVCELAAVGTPAIFVPFRHKDQQQYLNAKYLADVG
AAKIVQQAELNADVLVDLLTNLDREQLLAMAIKAKQMSAPFAAQRVAEVI
IENAN
>MS1734 murI, MurI protein
MIMNTEIKPTILFFDSGVGGFSVYKEVKQLLPNAHYLYCFDNAFFPYSEK
SEEVIIERTLTVCKKINEQYPLDAIVIACNTASTVVLPTLRQHFAIPIIG
TVPAIKPAAEKSQTKHIGLLATKGTVKRTYVTSLIERYAQDCIVEKIGST
KLAEIAERKLHGESVDLIALRNELTPWIQLSDLDSVILGCTHFPLIKEEI
QLCLPQVKFYFEPGTAIAKRVFDLLAGITPKDKTETDNCIFYTKHFELED
KFIQALRFWGFKNLKLLSILE
>MS0635 mutH, MutH protein
MAAELHIPVPPDLKRDKGWVGQLIETALGAKAGSKPEQDFANLGIELKTI
PINSAGFPLETTFVSLAPLIQTAGVNWHNSHLRYKLSKVLWIPIQGERQI
PLAERRIGSPILWQPDPQQEARLQQDWEELMDYIVLGKVHEITAKIGEVL
QLRPKGANSRAKTKGIGQNGEIIETLPLGFYLRKEFTAQILQNFLRNK
>MS1516 mutL, MutL protein
MPIHILPPQLANQIAAGEVVERPASVVKELVENSLDAGASRIQIDIENGG
ATLIRIRDNGLGIAKEDLSLALARHATSKISCLDDLEAILSLGFRGEALA
SISSVSRLTLTSRTAEQKEAWQVYAQGRDMETTIKPASHPVGTTVEVANL
FFNTPARRKFLRTEKTEFAHIDEVVRRIALAKPQIAFTLTHNGKILRQYK
SAVEIEQKLKRVSAICGEDFVQNALQIDWKHDNLHLSGWVAVPNFHRPQN
DLSYSYVNGRMIRDKVINHAIRQAYGDYLTNEQYPAFVLYLDLDPNEVDV
NVHPTKHEVRFHQARLIHDFICQGVGNALQSEQADFARYDTPASADEIQE
PAANWHSSLIKPNRSAAGHNIFESASDKNISGANTYSHGSAKINRFSTKF
AENIPHFSTKSVSKTEQKLYGNLLTTPAEAKKNTAINAESENSFEKNVST
PQQSTQLSGQFLHSLALVKNQALLLQQGQDFYLLPLAKLQKLKFELTLQQ
PDIAQQPLLIPILFRLNERQLAQWQKQKNFFLQSGFEFDENPAQHRITLN
KVPSCLRQQNLQGCVIRLLEENHEKISDFLTALCNQLQLNEIHVLADALT
LLTEVELLLKTQNKIQLAQLLISVDFTQYLQ
>MS2244 mutS, MutS protein
MNVMENLEQHTPMMRQYLALKAENPDILLFYRMGDFYELFYDDAKKAAAL
LDISLTKRGQSAGQPIPMAGVPYHAVEGYLAKLVQLGESVAICEQIGDPA
LSKGPVERKIVRIVTPGTVSDENLLPERQDNLIVAVYQEKDKFGLATLDM
TSGRFQISEPENAESLKAELQRLAPAELLYCEDFADMQLIEHYKGLRRRP
IWEFELSTAVQLLNRQFGTKDLRGFGVEKAILGLCAAGCLLQYAKETQRT
ALPHIQSITLIQNNENIQLDAATRRNLELTQNLAGGTENTLASVLDKCVT
PMGSRLLKRWIHQPIRHIQKLRQRQQIISEIIQLDLIGELQPYLQQVGDM
ERILARVALRTARPRDLTRLRTALEQIPTIKDILKNSPKFTALFQQIGDF
DELFALLQQAIIDNPPLLIRDGGVIAEGYNAELDEWRALSDGATKYLEDL
EIRERESTGIDTLKVGFNAVHGYYIQISQGQAHKAPIHYVRRQTLKNAER
FIIPELKTYEDKVLKAKGASLALEKQLYDALFDRLLPHLGALQLASLTLS
ALDVLTNLAERAETLNYVAPDFSDEIGVKIENGRHPVVEQVLKEPFIANP
VDLNQQRHLLIITGPNMGGKSTYMRQTALITLMAYIGSFVPAESALIGPI
DRIFTRIGASDDLASGRSTFMVEMTEMANILHQAGANSLVLIDEIGRGTS
TYDGLSLAWACAEWLAKKLRSLTLFATHYFELTVLPEQLAGTANVHLDAL
EHGDSIAFMHAVQDGAASKSYGLAVAALAGVPKNVVKLAKQKLANLEKLS
QQSADQKLQDLRTINQNQGELNLMEEEDGKNAALEMLAQLDPDDLSPKQA
LAYLYQLKKLL
>MS1694 mutT, MutT protein
MLIFCEQVQKNYKKNLKIFNFELSLPIVFAGGSVMSELQQFSQQDIEVLN
EETLYSGFFKMKKVRFRHKLFAGGMSEVVTRELLYKGAASVVIAYDPVRD
EVVLVEQVRIGAYDPNLSSSPWLMELIAGMIEEGESPEEVAMRESEEEAG
VTIDNLEYALSVWDSPGGTVERLYLFAGRVDSSKAKGLHGLACEHEDIKV
HVVSRETAYQWVNQGKIDNSSAVIGIQWLQLNYRRLQKNWC
>MS0709 mutT, MutT protein
MNYKNPNSVLVVIYAKNSGRVLMLQRQDDPEFWQSVTGSLAEKEMPFLTA
LREVKEETGIDIKRENLTLVDCHQSVEFEIFPHFRYKYAPNVTHCKEHWF
LLELPDERVPVLTEHLAYQWLEPAKAAELTKSPNNAQVIRKYLINKSA
>MS2341 mutT, MutT protein
MLKPHVTMACIVHCKGKFLFVEEIEYGKRTLNQPAGHLEENETILEGASR
ELYEETGIRAKMQHLVKIYQWHAPRSQKDYLRFVFALELDDWAEITPHDS
DITQGFWLTLEEFNYYIRQENQCARNPLVTEALEDYLAGSRYPLDILTLF
NN
>MS0328 mutT, MutT protein
MDKKTVQVAAGIIRNEFGQIYLTQRLEGQDFAQSLEFPGGKVDVNETPEQ
ALKRELEEEVGIVALNPVMFEQFVFEYPNKIIHFYFYLISEWIGEPFGRE
GQEGFWIEQLDLDESQFPPANSKLIQRLLAEMNC
>MS0019 mutT, MutT protein
MNLLQKPEILGISVAAKSRIFEIQAVELKFSNGELRTYERFKPSSRCAVM
VLPIDGEDLLMVREYAVGTERYELGFTKGLMEAGETPEQSANREMQEEIG
LGAKQFMLLRTVNSSPSFMNNPMHILIAQDFYPSKLPGDEPEPLQLVRVP
LANINELIEDPGFSEARNLVALYTLRDYLRKLK
>MS0408 mutT, MutT protein
MIDFDGYRPNVGIVICNRKGQVLWAKRYGQNSWQYPQGGINDGETPEQAM
YRELYEEVGLTRRDVRIVYASKQWLRYKLPKRLLRYDSKPMCIGQKQRWF
LVQLMSDEKNINMNCSKSPEFDGWRWVSFWYPVRQVVSFKRDVYRKAMKE
FACFLFDANKTVNPLSTNNNDEKKANYSAKKPYSPYRNQDKKRKTRV
>MS0258 mutT, MutT protein
MNLVYFCKMYRFQMTDKLLNEPWLTWAIQIQAIAQNGLAYCQNVYDIERY
EQLRDIAVEMLSYKTAIPQDKVKNLFCNEQGYQTPKVDTRAAIFKDDKIL
LVQESDGLWSLPGGWCDVLESIDSNTVKETREEAGLDINTKFIIAIHDQH
KRNYPPFAYAVLKTFVMCELIDGEFQPNSETIASDWFALDELPPMAEEKN
TPSQVELCFQAHHSKHWVTQFD
>MS0317 mutY, MutY protein
MLAQSSIQAPFARSVLRWYDKYGRKNLPWQKNKTFYQVWLSEVMLQQTQV
STVIPYFERFIDAFPTINVLADAPLDEVLHLWTGLGYYARARNLHKAAQT
VRDQYGGEFPTDFQQVWDLTGVGRSTAGAILSSVLNAPYPILDGNVKRVL
SRYFTVEGWAGEKKTENRLWRLSAEVTPTERAADFNQAMMDLGAMVCTRT
KPKCGLCPLSKKCGATLTNSWEKYPAKKPKKQLPERESYFLILAQNGKVA
LEQREQSGIWGGLYCFPQFEDKSTLLQYLQQLGIREYQEWSAFRHTFSHF
HLDIFPIYAQYRQTERDENRSDWKKIEENGADYKSTISSTINYWYDPENP
DQIGLATPVKNLLTEFQKGQHYVKNRIL
>MS1388 mviM, MviM protein
MGMKLGIVGTGMIVRDLMQTLHKVRLEQLAIWGRDQAKTAQFAAEQGILQ
VFSDYAAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTD
EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ
YSSRYDRFKAGETLPVFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCTYA
ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP
ANAMNRFTYTPNQGEAQHFEFGDVHRMLPEFERFVDIVDRKDFAQAEKML
DISAAVSEVIEQARKGAGIRFAGE
>MS1414 mviM, MviM protein
MDMKLGIVGTGMIVADLMQTLHKVTLEKLAIWGRDQVKTTQFASENGISQ
VFADYEAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTG
EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ
YSSRYDRFKAGETLPAFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCAYA
ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP
ANAMNRFTYTPNQGEAQHFEFGDAHRMLPEFERFVEIIDRKDFAQAEKML
DISAAVSEVLEQARKGAGIKFAGE
>MS1528 mviM, MviM protein
MKKINVGIIGTGFIGAAHIEAIRRLGFVDVIALAENNQQLAEQKAKELNI
PLAYDCVDKLLANPDIQVVHNCTPNHLHFAINKKVILAGKHVFSEKPLCL
TSQEADELTSLAEQQGVTTAVGFVYRNFAMVQQAADMVRDQQIGRVFAVN
GHYLQDWMLLETDYNWRVDPKVGGKSRTVADIGSHWCDTVQFVTGKKIKE
VFADMSIVYSTRKASKQVESFVTVNADSSYELKPVETEDYASVLVRFEDG
SKGSFTVSQVSAGHKNDLTFDISGSEKSLHWEQETPQYLKIGYRQQANQI
LCDDPSLVNPAVRAYNHFPGGHIEGWPDAFKNMMLAFYAFIAEGKDPQQD
TAKFAMFKDGAQIVHIVDTIIESAQQGKWISVK
>MS1500 mviM, MviM protein
MKKFALIGAGGYIAPRHLRAIKDTGNTLVVAMDVNDSVGIMDSHFPDAEF
FTEFEQFEAFVEDQKLKGEKLDYVAICSPNYLHAPHMKFALKNGINVICE
KPLVLNSTDLNMLSEYEQKYGAKVNSILQLRLHPSIIALRDKVEAAPADK
VFDVDLTYLTSRGKWYLKSWKGVDQKSGGVATNIGVHFYDMLHFIFGDVV
KNEVHYRDEKTVSGYLEYKRARVRWFLSIDANNLPENAVQGEKLTYRSIT
IENEELEFSGGFTDLHTQSYQRILEGKGYGLEENRTAIETVEVIRHAPII
ENPANPHPFLAKVLNK
>MS1755 mviN, MviN protein
MSKRLLKSGIIVSTMTLLSRVLGLVRDVVIANIIGAGATADVFLFANRIP
NFLRRLFAEGAFSQAFVPVLAEYQRSGELSKTQEFIGKVSGTLGGLVSIV
TLLAMVGSPVVAAIFGTGWFIDWINDGPNAEKFTSASLLLKITFPYLWFI
TFVALSGAILNSLGKFGVMSFSPVLLNIAMITTALLLAPQMESPDVALAI
GIFIGGLLQFLFQLPFLKKAGLLVRPRWAWNDEGVKKIRTLMIPALFGVS
VSQINLLLDTFIASFLMTGSISWLYYSDRLLEFPLGLFGIAISTVILPTL
SRQHVNRADDVQKSAADFRATMDWGVRMILLLGVPATIGIAVLAQPMLLV
LFMRGQFSLTDVQATSYALWSINVGLLSFMLIKILANGYYARQDTKTPVK
IGIIAMISNMVFNLLAIPFSYVGLAMASAMSATLNAYLLYRGLAKADVYC
FTKQSAVFFLKVLAAALVMGTVVWYFSPQLVIWNEMAFLTKVIRLAELIL
IAASSYLLMLVILGIRKRHLLAR
>MS0182 nPY1, NPY1 protein
MQLIRSSDYGFWLLSQGSHIHLVNNYLPEGRAEDFHLQGKKGMVIGELDR
QPLWLVEEQPNDTRAYFDLRDQLYLPERTFNLLNRGVELNHFFKTHQFCG
KCGDKTMQTEDEWAVQCTNEECNYRTYPVICPSIIVAIRRGKEILLANHR
RHAPKYGKGGMYTTLAGFVEVGESFEQTIHREVFEETGIKVKNIRYFGSQ
PWAFPNSQMVGFLADYESGEIRLQEEEIADAKWFRYDEPYPEFPEKGTIA
RALIEATLKLCAEHQDK
>MS1888 nadE, NadE protein
MKTAAYADYLIQWLENQRTELYGMDGYTLGVSGGIDSAVCAHLAARTGAP
VQALILPAEVTSPSDVADAQATLESAGIDGQIISIAPWYDLIMQQLSPVL
NSEPERVNVLKGNLMARLRMIALFTTAQSHRSIVLGTDNAAEWLTGYFTK
FGDGAADVLPLAGLRKEQVFELGRYLGVPQSVLDKKPSAGLWAGQTDEAE
MGVTYAEIDAYLRGETVSPQALQQIRFWHNRSHHKRMLPPKPKSPDEAEC
>MS0169 nadR, NadR protein
MSNFSYLQQKRKQLNLKVNDICEQANVTRAYFNQLVSGKIKNPSAAKLTA
LHKALQITEQDNKKVGVIFGKFYPVHTGHINMIYEAFSKVDELHVIVCSD
TERDLQLFYDSKMKRMPTVQDRLRWMQQIFKYQKNQIFIHNLVEDGIPSY
PNGWRAWSNAAKALFKEKEINPTVVFSSEPQDKAPYEKYLNLEVHLVDPA
RESFNVSATKIRTQPFKYWKYIPKEVRPFFAKTIAILGGESSGKSVLVNK
LATVFNTTSAWEYGRDFVFDKLGGDEQAMQYSDYPQMALGHQHYIDYAVR
HAHKVAFIDTDFITTQAFCIQYEGKPHPFLDSMIKEYPFDVTILLNNNTK
WVDDGLRSLGDYKQRQRFQQLLKKLLDKYKVPYIEIESPSYLERYDQAKA
IVEKVLNDEEVSELTHEND
>MS2204 nagA, NagA protein
MKYALTNCVIYTAKNVLYEHAVIVEKDKIQAVLPERELIPEIQRINLKGS
NLTAGFIDLQLNGCGGVMFNEDISVKTLEIMQETNLKSGTTSYLPTFITS
PDEDMKSAVKIMRDYLAKHKNQALGLHLEGPYLSVEKKGVHREEYIREIS
PEMKAFLLDNADVISKITIAAENPAMQYAGEFVEKGIIVSVGHSNGTYEQ
AKRAFAQGASFATHLHNAMSPVSSGRAMGVVGAVLDSDEVYSGIIADGLH
VAFGNILIAKRAKGDKLCLVTDATAAAGADIEQFTFVGKTVYVRNGKCYE
ANGTLGGSAVTMIESIRNAVEQVGIPLEETLRMCNYYPAKAMKVDDRLGS
IEAGKIANLTAFTHNFDIVGTSVNGEWKFA
>MS2205 nagB, NagB protein
MRLIPLKNDEQVAKWSAQHIVDRINAFNPTEDHPFVLGLPTGGTPLKTYR
ELIKLYQAGKVSFKHVVTFNMDEYVGLPKEHPQSYHSFMYNNFFNHVDIP
EKNINILDGNTPDHDAECRRYEEKIKSYGKINLFMGGVGVDGHIAFNEPA
SSLSSRTRIKTLTPDTLIANSRFFNNDVSQVPKYALTIGVATLLDAEEVM
LLITGHQKALALQACVEGAVNHLWTVSALQLHRHSIVVCDEPATQELKVK
TVKYFTELEAYAIHSVI
>MS0015 nagB, NagB protein
MNYITFPTAQAAVEKIAQEFVLYSQLDRPVHISLSGGSTPKLLFKTLATP
PFNTKVRWENLHFWWGDDRMVVPSDPESNYGEVQKLLFDHIRIPVENIHR
IRGEENPDQELARFSAELTACVPNLEFDWIILGMGSDGHTASLFPHQTNF
ADENVALIAKHPESGQIRISKSAKLIEQAKRITYLVTGSAKAEILKQIKT
TPAEQLPYPAARIRAKNGITEWYLDADAAKLL
>MS1413 nagC, NagC protein
MTRNEEALDIKHTNYRNIYRLFFQYNGLSKPQIVKLLNLSLPTVSNNIGE
LEAEGKIREGGFFQPQGGRPAIAYQLVENAFISIGVEIQKKNVRCLALNL
QGNILAQKDTALYFENEPQYIESLCNIIHTFIRSLGCLYTQILGIGFSIQ
GIVSKDGQSMLYSRVLPGEHFDVKELQPYFDVPVKLFHDVKCAALTELWF
SEQIDNAVYISISEHLGGAIIINNQIDLGKKGYSGALEHLQIHSEGNLCY
CGQRGCLETYCSLSALLSPNETIEAFFKALRNKDELVLMRWDAFLEHLAK
GLNTVYLLLERDIILGGEIAFYLIPEDLKILQEKILKLSTFPLEGDFIRI
ATQQKYTSAIGAALPFLIEYLP
>MS1527 nagC, NagC protein
MKNGITWKNSLFLRMIMLYGFDIGGTKIELAVFNDKLERQYTERVETPKD
SYEQWLDVIVNLVEKADQKFACKGSVGLGLPGFVNHETGIAEITNIRVAD
NKPIIKDLSERLGREVRAENDANCFALSEAWDEENQQYPFVLGLILGTGF
GGGLIFNGKVHSGQIGMAGELGHLQLNYHALKLLGWDKAPIYDCGCGNRA
CLDTYLSGRGFEMLYRDLKGEALSAKEIIERFYAADKTAVDFVGLFIELC
AISLGNIITALDPHVIVLGGGLSNFDYLYEALPKALPKHLMRSAKVPVIK
KAKYGDSGGVRGAAALFLTK
>MS1508 nagE, NagE protein
MGLFDKLFGSKNSKTVEVDIYAPLSGEIVNIEDVPDVVFSEKIVGDGIAI
RPNGDKIVAPVDGVIGKIFETNHAFSMESKEGIELFVHFGIDTVELKGEG
FTRVAQEGQSVKRGDTIIEFDLPLLEQKAKSILTPVVISNMDEISALDKK
VGQVVAGDSVVLSLKK
>MS1408 nagE, NagE protein
MSDKNFIMPMSGELLSLEQVPDSNFSQKLLGDGFAVRLSGEVVVSPFSGV
VIAAFPTGHAFIIRREDGLEVLIHIGLNSAGKADAFRMQINKYDEVKQGD
VLVYVDTNKLSDQQEDLISPIVFANPDIKISLHKLNQAVMVGDDSAVTLD
>MS2278 napB, NapB protein
MRKYLTLILAAFAGFAVAEEPSSSLTMEQIPENIAPAYTNPQKDAGNIPT
TFPFQPPLVPHSVRGLQVTKNANQCLSCHSPEVSPTTGAPRVPESHFLDR
DGKPTEGTSPRRYFCLQCHVQQTDVNPIIQNKFESIRAKQGK
>MS2282 napD, NapD protein
MSNFNLNENWYVCSLVVQARPEKLSQVKADILAIPTAEIHGEKLEEGKLV
VTLESSRQLALADLIDEVKDISGVIVVSLISNYLDEK
>MS0247 napF, NapF protein
MALLITDKCTNCDMCLPECPNEAISVGDEIYLIDPALCTECVGHYDTPTC
QKVCPINKCIITDPDHIETQDQLWERFVLIHHADQV
>MS2283 napF, NapF protein
MMDRELPRRQFLRGGFLKSLQSETVKQQGFLGVRPPWTVAEAQFVADCTR
CGDCIAVCETQILVKGVGDFPEVRFSRGECTFCMKCVEVCRQPVFRPTEE
AAWQHKVEIQTGCLANNQVECRSCEDNCERRAIRFKREIGSVAKPQIDLE
LCNGCGACLSVCPVLAIKVLTTSAGE
>MS2334 napF, NapF protein
MTAMQGKNEQYYKAYLTYNRISRRALLRGVFHPAEQATQIREFRLAPRPP
FAAAEDLFLAACNGCGACVAACPYNLIRISGQKAMLELEYAACDLCGKCA
ESCSTHALHPAFKKDTQLRPHFSEHCLLKQNQFCAVCQEICPQQAISADL
QLNHEVCNGCGECKLACFVSAIQLI
>MS2280 napF, NapF protein
MKLDPNRRQFLKNATRTAAGVCGIGVILGLQQHQANAKEGVALRPPGALA
EKDFLAACTRCGQCVQACPYDMLHLASLLSPMEAGTPYFVARDKPCEMCP
DIPCMNACPSGALSEELTDINDARMGLAVLLDHETCLNWQGLRCDVCYRV
CPLIDKAITLDRIHNDRTGIHAKLIPTVHSDACTGCGKCEQACVLEEAAI
KVLPMDLAKGLLGRHYRLGWQEKQNAGKALLEEQHPDGLRPAFDARMPEG
QVEPVYQHMKVQPDVKVATPNRATYDYVPNPTTVDAPEHYPNLDLNTKGV
K
>MS2279 napH, NapH protein
MATVKTTPNKPKDAGLEARQKLGWWHAYRFLILRRLSQLSIILMFLSGPL
WNVWILKGNYSSSMLFDVVPLTDPLITAESLATGYLPEWTTIVGALIIVA
FYAVFASKAFCSWVCPMNIVTDAAAWLRRKLGIRQSAKLPRNLRYVILVM
ILLGSAVSGTLLWEWINPVAALGRVFVFGLGATLWLVAVVFLFDLLVVEH
GWCGHLCPIGAAYGLIGAKSLIKINVVDRERCDRCMDCYNVCPEPQVLRL
PLHGSESDSPIVLDKDCITCGRCIDVCPENVFAFGSRFEKQVQVKNI
>MS0181 ndh, Ndh protein
MKNIVIVGGGAGGLELATYLGNNLGKKQRANVVLVDRNQTHLWKPLLHEV
ATGVLDSETDAVSYRAHAHNHYFNFEQGSITRIDRTNKYVELAPVTGQEG
DVLVVARRIPYDYLVIAIGSKSNDFNTKGVAENCIFLDSPNQALRFQHKM
LELFLKFSENNALEEIGEDDSKQRLVQDGKVNIAIVGGGATGVELSAELF
NAAQHLSSYGYGKIQSGHLQVTLIEAGDRILPALPERISSSVQQELENLG
VTVKTGTMITEATEKCLITKEGEEINADLMVWAAGIRVSAITQQFDGLEV
NRINQLNVKNTLQTTVDDSIFAIGDCAFLLQKDGKPVPPRGQAANQMATI
CGQNIVALFNNKPLKDFHYFDKGSLVSLSKFTALGNITTGKRSSLTIEGR
LARLAYISLYRLHQQKLHGCFKTGLIILIGRLNRFIRPSLKLH
>MS0668 ndk, Ndk protein
MSLERTFSIIKPDAVERNLIGKILARFEQSGFEIVAAKMVRLTKAQAEGF
YAEHQGKPFFEDLVEYMVSAPILVSVLQKENAVKDYRTLIGATDPAKAKE
GTVRKEFAESLRRNSVHGSDSLESAAREIAYFFIDSEICSR
>MS1944 nei, Nei protein
MRNPNFCSGFLLCRQNPDRNTVMPELPEVETAKNGITPYLEGYLIEKIIV
RQPKLRWEVSPQLAQISQQKITALSRRAKYLIIHTEQGYIIGHLGMSGSV
RIVSARDPVDKHDHLDIVMNNGKIMRYNDPRRFGTWLWSANLDEFHLFLK
LGPEPLSDEFNAEYLFKKSRKKQTPVKNFLMDNSVVVGVGNIYANETLFM
CGLHPEKITAKLTKAQCALLVEKIKQELKRAIEQGGTTLKDFLQPDGRPG
YFAQELQIYGKKGAPCPNCGTKIESLVVAQRNSYFCPKCQKK
>MS2153 nemA, NemA protein
MMNPKYQPLFEPYTLNNGVEIKNRLTVAPLTIYDSGKDGEMTETGRRFWQ
NRFEGFGLYIMPFTNVHPSGIGFESPNAFDERHLPTLREYAEMAHSQGAK
AVVQIAHSGLRADPAMTQGAELVAATGDYYGCFRTMSEQEVWDMVTNYAY
AAELVLRAGFDGVEIHGANGWQIQQFFSASTNLRNDYWGGTLEKRMRFPL
AIIDGIDEMRQKHNRPDFIIGYRFSPEEPGEDGITMKETLALVDALLEKP
LQYLHISLWDFYKKVRRGADTHLTRMQVVHDRIAGRLPFFGSGNLYTADD
MLKAYQTGWVESVSIGKSIMLNPNLVELIETGRESEIESAFDWDKADYYR
YTPAMLDGTRAGTDFFPPSKQNGVRYKTNHF
>MS2010 nemA, NemA protein
MNKKFERLFETVTFPNGATISSRFAMGPMVIVGSESNGEIGADDLAYWQR
RNDAGSLLITGATAVSDYSDAYGNGLKLHKDELLDGWKQLAAVMKAKGNR
AVVQLFHAGYRAAFTYKDKGVAYSASSKEYGFLDYPVTGMTEAQIEDTLN
EFAAAAKRAIDAGFDGIEIHGANRYLIHQFFSAVSNVRDDQWGGSLENRA
RFALEVVKRIQEVIKQYAKADFILGYRISPEEIHREGNGFTFDEALYLID
EVAKLGVDYFNVSQSGVRGFAAEPKAGAYMGQAISKVIKTRLVGRALLLA
SGDLTSPDKILEAVTEYADITSNATMVLLDPDTKNKIQSGREDEVSLAVD
ETTIDDLKLPKAFYKIAPMIVTSQFVPQHTKDLIYKEPK
>MS2097 nemA, NemA protein
MNAKFSPLFQSYTLNNGVEIKNRLVVAPMTHFGSNPDGTLGENEREFISN
RANDMGMFILAATLVQDGGKAFHGQPEAIHASQLASLKETADIIKAQGAK
AILQLHHGGKQAVTELLNGKDKITASDDEATGTRAATVEEIHSLINAFAN
AADLAIQAGFDGVEIHGANNYLIQQFYSGHSNRRTDEWGGSRENRMRFPL
AIVDEVLAVKAKHQANDFIVGYRFSPEEPEELGLTMEDTLALIDTLKGKA
LQYLHISLHEFFKKARRGADTNAFRMQLVHDRIGGKLPLIGVGSLFTAEQ
ILEAYNTGWAEFIALGKTVMINPTIATLIKEGKENEIVTTLDPEKADQYG
IKGILWDLCKNGGAWLPPLKGKDDWHPVDV
>MS2012 nemA, NemA protein
MSRTYQQITQSRSFQMAKFRYLTEPFQIKNLQLKNRVVMPPMCMYVAKED
GIANNWHFVHYVSRAVGGVGLIIVEMTNVADNARISPDCLGLWNDEQAQA
LKKIVDECHAQGAKIAVQIGHAGRKALGWDDVVAPSAIICDESVTSDKSR
WSYKMPRALTTEEAEQVVLQFQSAVRRAVAIGFDAVEIHAAHGYLIHQFY
SPKMNIRTDKYGQDKCLFGIEVIQAAKAVMPAEMPLLVRISAQEYSDNGF
PAEYGVSVAKRFAEAGADVLHVSGGGDGNFI
>MS2296 nemA, NemA protein
MNPIFSPLFQPYTLNNGVEIKNRLVVAPMTHFGSNTDGTLGKQEHRFISN
RAGDMGMFILAATLVQDGGKAFHGQPEAIHTSQLPSLKATADIIKAQGAK
AILQIHHGGKQAITELLNGKDKISASADEESGTRAATIEEIHTLIDAFGN
AADLAIQAGFDGVEIHGANNYLIQQFYSGHSNRRTDEWGGSRENRMRFPL
AVIDAVVAAKIKHQRDDFIIGYRFSPEEPEELGLTMEDTLALVDVLKEKP
LQYLHISLWDFYKKARRGADSNTARLQLVHERIGGKLPLIGVGNLFTAQQ
ILEAYQTGWAEFIALGKTVMVNPKIATMILNGQENQLITEVDENQADHYG
FPDFLWNATMSATQAWLPPVKGKPWSPLDI
>MS1349 nfnB, NfnB protein
MHLIKLNNREDIMTISKQDVLEAFKFRSACRYYDPAKKISKADMDYILEL
ARLSPSSVGSEPWKFVVLQNPVIREKIKPVTWGIKHPMDEMSHLVVILAK
KNARYDSDFFRTSLEKRGLTPEQMEATLARYKSFQTDDIKVLESDRALFD
WCSKQTYIALANMMTGAAMIGIDSCPIEGFNYAEVNRILAEEGLFDADEY
GVSCMVTFGYRAREITKKYRKPAEDVIEWIE
>MS2115 nfnB, NfnB protein
MTILSTEQILSAFKNRKSCRHYDETRKISEQDFNFILELGRLSPSSVGSE
PWKFIVLQDPKLREAIKPFSWGMASTLDSASHIVVILAKKNARFDTPFML
EGIKRRGVTEPEAIEKTLAKYKDFQENDMQTLNDERALFDWCSKQTYIAL
GNMMTGAAMAGIDSCPIEGFNYAEMNRVLSEAGLFDANEWGVSVAVTFGY
RTQEIAQKARQPQEDVVIWAK
>MS2149 nfnB, NfnB protein
MRNKPMSTLDFATTVRERHSVRQFLPTPMTNAQIREVAEDARRSPSSTNT
QPWSVHIVSGETLARLKKRIMEKFEQGELCPDFAYDQSKFDGIYEPRWRE
FYKEMFAANGVTRDDSEGRKKITRRNAEFYDAPHAAFLFMPDVGDGNVNA
ASDMGMYSQTFLLSLTARGFGGIPMLFLAFFADVVREELGISPDFKLLHA
IAFGYPDQDAAINQFRSKRASVDETVTFYE
>MS0756 nfnB, NfnB protein
MFFSIIAIGVIYQTAYQEVIMSEKNFIETLLSHRSIRQFKSQQIAPDIIE
QLVDVARFASSSNHLQCISIVRVMQPQLRHELMLCASGQAYVESAAEFWV
FCADFNKHKQICPEAQLDYTEVMLIGAVDAGIMAQNVLAAAENLGLGGVY
IGSIRNQIEKVGELLNLPEYVVPLFGMCLGYPDQNPPLKPRLPQELMFFE
NQYRPLDKEMLNDYDKEVAEYYKKRSQADMDWSRNVVKTLGKPVRPQVLG
YLQKQGFVKK
>MS1109 nfnB, NfnB protein
MDALTLLTTRRSEKKLSAPVPNNEQLELIFQAATHVPDHGKLQPYHFIVI
ENDGLKKLETLLKSAVTELKLDEKRLQKAEKIASTAPMMIAVVAKINTDI
AKVPAWEQMLSAGCSAYAMQLAANAQGFDNVWVTGPWVDGSDLREALGCA
PKDKVIGFIILGTSQEKITREPKTVKTENFVSYL
>MS1801 nhaA, NhaA protein
MSPSINLTFKGGCNMFMAQIQRFFKMGSASGILLFFFALLAIIFANTSLN
NFYFNFLDIPVSVQFGEFMINKTLLHWINDGFMAVFFVLVGLEVKREMLE
GSLSRYQLAIFPAVAAIGGMIVPALIYYLITNQHPELSNGWAIPMATDIA
FALGIVALLGTRVPLPLKVFLLALAIIDDLGAIVVIAVFFSEELSIQALS
VAIVAIAGLITLNRMKVGHLCAYLIFGLILWAAVLKSGVHATLAGVIIGF
CIPQKDSEGKSPLHTFEHILTPWCSFFVLPLFAFANAGVSLGTINTDMIF
STLPLGIALGLIVGKPLGVFSFSYFSVKLGIAKLPEGIKWKQVFAIAILC
GIGFTMSMFLAGLAFTDGQSDSLINTLSRLGILLGSSVSAILGYLLLKST
TK
>MS0430 nhaB, NhaB protein
MSGYTAFFNNFLGKSPNWYKLSIIVFLILNPILYFLISPFIAGWCLVIEF
IFTLAMALKCYPLQPGGLLAFEAIAIGMTSPAHVKAEIMASFEVILLLMF
MVAGIYFMKQLLLFAFTRLLITVRSKIVLSLSFCLSAAFLSAFLDALTVV
AVIISVVMGFYGVYHKVASGNNFDDSTDITNDEKIKKDQQVLEQFRSFLR
SLMMHAGVGTALGGVMTLVGEPQNLIIAEQASWGFGEFFIRMAPVTVPVL
ICGLITCVMIEKMNIFGYGDKLPRKVWGILAKFNRAQQQKMNRQERQKLI
IQGIIGIWLVCGLAFHLAAVGLIGLSVIVLTTAFCGITSESTIGKSFQES
LPFCALLVVFFSVVAVIIDQHLFGPIINYVLSASESTQLLLFYGFNGLLS
AISDNVFVATVYINEAKNALHAGIISLEQFELLAVAINTGTNLPSVATPN
GQAAFLFLLTSSLAPLIRLSYGKMVYMALPYTIVLTVVGLLAVEYILPGA
TKYFSSLGWITALPV
>MS1321 nhaC, NhaC protein
MQLADFSTSLWSILPPILALTLAIFTRKVLFSLSVGIIVGSLMLSNGSLA
QGVSYLFDSVTSLIFNFDEENHFVLNDNNVNILVFLLLLGILTALLSVSG
SNQAFAEWAQKRIKGRRGAKIMAACLVFITFIDDYFHSLAVGAIARPVTD
KFHVSRPKLAYILDSTAAPMCVLMPVSSWGAYIITLVAGLLAEHSITGYT
PIGAFMTMSAMNFYAIFSIVMVFIVAYFSFDIGPMAHHEKLAMEQANNVQ
ETNSAVQGQVRNLVLPIVGLIIGTVTMMMHTGNQALLADGKEFSVLGAFE
NTTVGISLVVGGMTAVLISTILIVLAKKLSSGNYAKAVIAGMKSMVGAIL
ILCFAWTINKVVGDMQTGKYLSSLISGNLTPALLPALLFVLGAAMAFSTG
TSWGTFGIMLPIAAAIAVNAAPELLLPCLSAVMAGAVCGDHCSPVSDTTI
LSSTGAKCNHMDHVTTQLPYALLVATATILGYLVVGFTETPLFGFITTGM
ALFLLIFITKKR
>MS0136 nhaP, NhaP protein
MSVYAYICFLFSISIFLAFFTRKISARIQSTIAITASAMIGSLVLILFGY
FGWFKVEDIAIHIMERVDFKNFLLNGMLGFLLFAGSLGIKLPLMKEQRRE
IAVFALFSTLASTFFIGILIYYAAMLVGLRIDFVYCLVFGSLISPTDPIA
VLAIIKNLKAPKRLSMQVEGESLFNDGVGLVIFTTLFAVAFNGQEPTFSG
VFGLFLKEAVGGILFGFVMGFAIHLLITFTKEVSLEILLTLTIPTAGFML
ANLLHISGALAMVTSGIIIGNWTRRQGFSERNRYFLDHFWEMIDHSLNSL
LFFLIGLALLLVEFTFESSMLMLLAIPVCLIGRYVSLWIPYQIMSRFRRY
NPYTLRILTWGGLRGGLALAMALSIPSNVVNISGNGMNLDLRDVIILMTY
AVVMFSILVQGTTIEKMIETSKVIDPKRDAYVKLGGSRDYPPHI
>MS1726 nifS, NifS protein
MKFPIYLDYAATCPADDRVAEKMMQYLTRDGIFGNPASRSHKFGWQAEEA
VDIARNHIADLIGADSREIVFTSGATESDNLAIKGAAHFYQTKGKHIITC
KTEHKAVLDTCRQLEREGFEVTYLAPKSDGLVDLDEFRAAIRPDTILASI
MHVNNEIGVIQDIEAIGKICREHKVIFHVDATQSVGKLPINLAELPVDLM
SMSGHKLYGPKGIGALYVRRKPRVRLEAIIHGGGHERGMRSGTLAVHQIV
GMGEAYRICKEEMAEEMAHVTKLRDRLYNGLKDIEETYVNGSMEHRVGSN
LNISFNFVEGESLMMALRDIAVSSGSACTSASLEPSYVLRALGLNDELAH
SSIRFSLGRYTTEEEIDYTIDLVKSAVKKLRDLSPLWDMFKEGIDMSKIE
WSAH
>MS0432 nlpA, NlpA protein
MNFKKLLTVAAVTSVFALTACNDEKKADTAAPSAQNTPAQTITVGVMSGP
EHQVAEIAAKVAKEKYNLNVKFVEFNDYALPNPAVSKGDLDINAMQHKPY
LDEDVKKNNITNLTIVGNTFVYPLAGYSKTIKNVSELKEGAKVAVPNDPS
NQGRALILLEKQGLIKLKDNTNLAATPLDIVENPKNLKITPVDTAVAARA
LDDVDLAVVNNTYAGQVGLNTADNGVFVESKDSPYVNIIVARTDNKDSEA
VQNFVKAYQTEEVYQEAVKFFKDGVVKGW
>MS0267 nlpB, NlpB protein
MKKWLLSVAVLATVTACSSSNESRQVANDSYEKNAESKINFSPLATGGVT
IVGQDNKYQLPTTNISKGPAVDIRPPTTPMSIIGNSVAQFDGERASIVYP
AAKSAVYNLDQVARLLKEENIEFTRQENKLLTDWAPTGRVDEVGDVKLRY
LIEQLGNKEANALSVTVLEAKRNEIIFTPSVTDKQRYTSDRLNNFVGNLN
HAYRTQMAQTAPVATSNGAIQAEIVTDGNNRTALGLTSSFAQSWEKLGQV
LPELGFEIDEETAGRGYRVLKYKPVDDSQWARLGVNKPELEKGEYSMQLS
AYGNQSAVVLMDEDKAALEGDKAQAVYKALQVLMTK
>MS2320 nlpD, NlpD protein
MIMWFLVSNKIPRKLTALLGLGLFFAFPLQAADLSKIQQQIKQQEQKIAE
QKRTQNQLQSTLKEQETKMSGMIGELRQTETDLKETRKIISETNKQIRTL
EQQERAQKEKLAKQLDAVYRSGNPSSVVEHLLSDDAKKADRMKVYYEHMN
QARMDAIAEIRNTRAQLDEQKNVLNTQLQEQQTQLSTQKKQQQELQKMKN
ERQSTLNKLSKSLKQDQNRLQTLKENEIALRNEIQRAAQAAQQQEKRERE
AYTAKKESEEKRSNKPYQPTSQEQQLIRSNSGLSGRYAYPVVGRILHAFG
SQQAGEVKWKGIVISARAGTAVKSIANGRVILANWLQGYGLVVVVDHGKG
DMSLYGYNQSVSVKVGSLVRAGQQIAEVGNSGGQGSSGLYFEIRRQGNAV
NPMGWLR
>MS2269 nlpD, NlpD protein
MVTVLQTVLVCGAAGVWTAGCVFTAGVVAVFPTAVAGAVAAGIVGWLATG
LSIPAGMELCWISGCHEPLLPVLSTGCIIPGLNVPSAFSTGAGEVLELQA
DNTATAIGNNKNDFFILNSLKKLFHTIRHTLLFDFRHKTAACNRSGTFHR
NHNTVR
>MS0791 nlpD, NlpD protein
MAQHVKLARDRRKRKSRIKAAIFFMAIISIFTGTFLSLKDSVEDKNIDGD
TALAQAEQFEKLTPDAGTSDRLTQHLLDQAKVLAEDNNATSYDDDLSGQD
DEVDEIKIDPDDFDISSLPPEAQSALSDLLDVADQAKRISDQFSHTIVRG
DELKDVLELSGLEPMTAEGLIASYPELKKLKAGQQMYWILDKNGELEYLN
WLVSEKEERIYERLESGKFERQILEKKSVWKKEVLKGKITSSFRASLLKL
GLDQRQVSQLTNALQWQFSMKKLMKDDNFAILIFREYLGDKLTGQGNVEA
IHIISQGKSYYAIQAENGRYYSRQGETLGKGFARYPLQRQARISSQFNPR
RRHPVTGHVRPHKGVDFGVPTGTPVISPADGVVEKVAYQKGGAGRYIMIR
HGREYQTVYMHLSKPLVKAGQSVKRGERIALSGNTGISTGAHLHYEFHIN
GRPVNPLTVKLPGTSNEMRDSERRQFLTKAKYVERQLKM
>MS2268 nlpD, NlpD protein
MFLIAYISGMDVKELAALNGMTSEPYNLKVGQTLKVANRVGGGAETETIT
EQQCTEVPVEQPAVTYTAGANGTQYGSDGTITGPVKAGAGTAGAASAVAG
PVVASAATSAPSPAYNSGVTAGVGTVAPATTGVVAGTASSAVQTPASNIS
WKWPTSGRVVQGFSNSDGGNKGIDISGSKGQPVYAAAAGRVVYAGNALRG
YGNLIIIKHNDDFLSAYAHNDSISVNDQQEVKAGQQIAKMGSSGTNSTKL
HFEIRYKGKSVDPTSYLPRR
>MS0294 norB, NorB protein
MREEYRHQSRVNEDGNVVLSETRLQAIRQTADYYIKLYGDDPSMISSRES
FAMKNNTLPDPEARQKLSDFFFWTAWVASTNRPDAEATYTNNWPHEPLIQ
NVPTTENIMWSLISIICLIAGIGFLIWAYSFLRDHNETAPQAPAADPLSK
LNLTPSQKALGKYVFLTLALFVVQVGLGGVLAHYTVEGQKFYGVDISQLF
PYSLIRTWHIQSALFWIATGFLTAGLFLAPIINGGKDPKYQKFGVNFLFL
ALLIVVVGSYSGNFFALSHQIPAEFNFWFGHQGYEYLDLGRFWQLLLFVG
FLLWLWLMLRCTSHSFKQGGDKNLLAIFIASIIGVGLFYGPGLFYGEHSS
ITVMEYWRWWVVHLWVEGFFEVFATCALAFIFYNLGLVGYHSATVASLMA
GSLFLVGGIPGTAHHFYFSGTTTPALAAGAVFSALEVVPLVLLGSEAYEH
WSYQHRTSWMQKLRWPLMCFVAVAFWNMLGAGVFGFLINPPISLFYLQGL
NTTAVHAHAALFGVYGFLTLGFVLLVARYLKPDFQFNEKLMKTGFWSMNI
GLVLMIAISLLPIGLYQVSASISEGLWYARSEGFLQQDFLQTLRWLRTVG
DLILIFGAVLFAYEVTRLTFSRRT
>MS0293 norB, NorB protein
MGKYKKLWAALVVVLTVTFTILGYIGVEVYRQAPPVPQAYVSQTGETVMT
KDDILAGQTAWQTTGGMEVGSLLGHGAYQAPDWTADWLHRELTAWLDIRA
QATFNKSYTELDPASQAALQADIARGIPPSKQGE
>MS1314 norM, NorM protein
MQKITHWQEYKIEAKSLILLSLPILLAQIAQNSMGLVDTIMAGRVSAADM
AAISVGASIWMPLVLFGQGLLLALPPTISYLNGSAQRHRIAHQVRQGIWI
ILFSIVPLALLIYHSDTVINRMGMEEHLAQITIKYLHAMLFGLPAYLLLV
NFRCLNDGLAKTKPAMIITLIGLLLNIPLNYIFIYGKLGVPAFGAVGCGI
ATTIVNWIMCILMISYTKSARNQRDLKVFENIIELPNPATLKKLFKLGLP
IAIAICSEVALFALTSLLLSPLGTNAVASHQIALNTSSFIFMLPMSLGMA
TTILVGQSLGERSPLKAKDISYVALFIGLATATLTAFLTVVLRYQIAGIF
VKDTEVISLAASLLLLNALYQFSDAVQVVVGGALRGYKDTKAILYITLFC
YWVLGMPIGYILSRTDLITAHMGPTGFWIAFVVSLTVAAVLLFYRMYKIQ
KQSDEQLLTKLEKLK
>MS0920 nqrA, NqrA protein
MTDVLARFNSGKLWDFDGGIHPPEMKSQSNQTPITKAPLTEDFYIPVKQH
AGDAGNLLVKEGDYVLKGQPLTQGDGLRMLPVHAPTSGTVIAIAPHIAAH
PSGLSELAVHIHADGKDQWREQNPIDDFLSQSAEQLIEKIYQAGIAGLGG
AVFPTAAKIDSAQKKVKLLIINGAECEPYITCDDRLMRDNPDEIIEGIRI
LRYILRPEKVVIAVEDNKPEAVQSIKNALQGANDIEIRVIPTKYPSGAAK
QLIQILTGMEVPAGQRSSSIGVLMQNVGTAFAIKRAIINDEPLIERVVTL
TGDKIPNKGNQWVRFGTPISFLLKNVGYQYDERLPVFLGGPMMGLTLPNL
DAPITKLGNCILAPDHFEYDPQAREQSCIRCSACSDACPVHLMPQQLYWY
ARSEDHEKSEEYSLKDCIECGLCAYVCPSHIPLIQYFRQEKAKIWEIKDK
AKKAEEAKLRFEAKQRRLEREEQARKLRSQRAAEARREELANQKGVDPVK
AALERLKQKQAAITEKPKISALKTVVNEKGEVLPDNSEVMALRKARRLAR
QQAISTENSVLTQVDSGTQSDNSDVQKNPENSTALDGKKAAIAAALARAK
AKKLAQNNTESVSDNVTAVKSAVQNTEISAPSDSAEKTETDPKKVAIAAA
IARAKAKKAAQNNTESASDNVSAVKSAVQNTEISAPSDSAEKTKTDPKKA
AIAAAIARAKAKKLAQQQSNKTE
>MS0309 nqrA, NqrA protein
MITIKKGLDLPINGKPEQVIRDGNAVTEVALLGEEYVGMRPSMKIHEGDT
VKKGQILFEDKKNPGVVFTAPVSGTVTAINRGAKRVLQSVVIRVEGNDQE
TFAKYSPAELVSLSSEQVRQNLQTSGLWTALRTRPLSKIPAVDAVPSSIF
VNAMDTNPLCADPAVIINEYQADFTNGLTVLTRLHNKVNLCKAAGSNIAS
VDNVDSHEFAGVHPAGLVGTHIHFIDPVGINKSVWHINYQDVIAIGKLFT
TGELFTDRVVALAGPQVKNPRLVRTNIGANLSQLTANELADGNNRVISGS
VLYGAKAEGAHDYLGRYALQVSVIAEDTEKEFFGWISPQANKYSITRTVL
GHFGRKLFNFTTAENGGHRAMVPIGSYERVMPLDILPTLLLRDLEVGDTD
SAQALGALELDEEDLALCTFVCPGKADYGSFLRQALDKIEKEG
>MS0308 nqrB, NqrB protein
MGLKNLFEKMEPAFLPGGKYAKLYPLFESVYTLLYTPGTATQSTTHVRDA
IDSKRMMIIVWLALFPALFYGMYNVGHQSINAVLSLGTSVDSLAANDWHY
ALAQALGVDFTAAAGWGSKMLLGATFFLPIYIVAFAVGMFWELLFAIVRD
HEVNEGFFVTTILFALIVPPTLPLWQAALGISFGLVVAKEIFGGVGKNFM
NPALAGRAFLFFAYPGQISGDLVWTATDGFSGATALSQWAQGGEAALQHV
ASGQPITWMDAFLGNIPGSMGEVSTLALIIGAAIIVFARIASWRIIAGVM
VGMIITSSLFNLIGSESNPLFAMPWYWHLVLGGFAIGMFFMATDPVSASF
TNKGKWWYGALIGVMAVLIRVVNPAYPEGMMLAILFANLFAPIFDYLVVQ
GNIKRRKARTA
>MS0919 nqrB, NqrB protein
MFKIASSPHSHSGKLTARIMLWVILAMLPAIFAQLYYFGFGVLFQITIAV
VFALCLEFLVTILRKKPKLFYISDFSVTLTALILAVAIPPYAPYWIILIG
IFCAVILGKHVYGGLGQNPFNPAMVGYVVLLVSFPMQMTTWLAPVQLLHE
PPTFIDAYHLIFSGGTTDGFSLHQLTASIDGMSSATPLDAVKTGLKANRG
LAEINRSPLFTQSSLAGLGWFQVNLAFLLGGLFLVWKRIIHWQIPTALLI
TVCLFSLCSWLFSDNMPSPLWQLFSGATMFCAFFIATDPVTASITPKGKL
VFGVLVGLLLCLIRFYGGYPDGAAFAILLANICVPLIDQYTRPRVTGYDL
RGKN
>MS0918 nqrC, NqrC protein
MGVGQTSVKYGAILGIVALICTIISTALYFLTKDKIEAEILKQQQELLAQ
VIPANYYDNDVTATCKTTESREIEKICTALLTGKVSAYAVEATAPDGYSG
AIRLLMGITPEGEVLGVRVLAHKETPGLGDKIETRVSHWILSFNHQKISE
DNLQDWAVKKDGGKFDQFAGATITPRAVVNQVKRSALAVLKNEQNNR
>MS0307 nqrC, NqrC protein
MAKFNKDSVGGTLTVVVLLSLVCSLIVAGAAVLLKPTQEIQKQLDKQKNI
LMAAGLMQQGTNVQQTYAKFIEPKIVDLATGDYVDGITNFDAKASAKDPA
TRVAIAPADDKAGIKVRSKFAEVYLVKDEAGNTTQVVLPMYGNGLWSIMY
GFVAVQPDANTINGITYYEQGETAGLGGEIANPNWQKNFVGKKLFDAQNK
VALVVGKNASSNKEHGIDALSGATLTSNGVDGSFKYWFGPQGFGPYLAKF
KAEGAN
>MS0917 nqrD, NqrD protein
MEKEQSIWHDLLSQGLWRNNPAIVQLLGLCPLLAVSNSATNALGLGFATL
LVLTCTNTMVSLFRKQIPHEIRIPIYVMIIATTVTAVQLLMNAYTYSLYQ
SLGIFIPLIVTNCIVIGRAEAFASKNSVLHSAFDGFAMGLGMTLSLFLLG
ALREVLGNGTLFDGIHLLLGDWAKPLRIEFFHNDSNLLLAILPPGAFLGL
AVILALKNVIESRTK
>MS0306 nqrD, NqrD protein
MAGSNLKKLLLSPIADNNPIALQILGICSALAVTTQLQTAVVMAIAVSFV
TGFSSFFISCIRNYVPNSIRIIVQMAIIASLVILVDQILRAYAYDLSKQL
SVFVGLIITNCIVMGRAEAFAMKSGPVESFVDGIGNGLGYGAILLIVAFL
RELIGSGKLFGVTVLETVQNGGWYQANGLFLLAPSAFFIIGFVIWGLRTW
KPEQVEK
>MS0922 nqrE, NqrE protein
MQKQDNNALTQTNYTNMTDYILLIISTALINNFVLVKFLGLCPFMGVSKK
VETAIGMGLATTFVLTVASLCTYLADSYILAPLNASFLRTLVFILVIAVV
VQFTEMVINKTSPTLYRLLGIFLPLITTNCAVLGVALLNINLAHNLTESV
IYGFGASLGFALVLVLFASLRERLAAADVPAPFKGASIALVTAGLMSLVF
MGFTGLIRV
>MS0305 nqrE, NqrE protein
MEHYISLFVKSVFIENMALSFFLGMCTFLAVSKKVSTAFGLGIAVIVVLG
ISVPVNQLVYTHILKDGALIEGVDLSFLNFITFIGVIAALVQILEMFLDK
FVPSLYEALGIFLPLITVNCAIFGGVSFMVQREYNFPESVVYGIGAGTGW
MLAIVALAGLTEKMKYADVPAGLRGLGITFITVGLMALGFMSFSGIQL
>MS0304 nqrF, NqrF protein
MDSNFIFGIGAFTAIVLVLAVVILIAKSKLVDSGDITISINNDPEKAITL
PAGGKLLGALASKGIFVSSACGGGGSCGQCKVKVKSGGGEILPTELSHIS
KKEAKEGWRLSCQVNVKSSMDVELPEEVFGVKKWECTVISNDNKATFIKE
LKLAIPEGEEVPFRAGGYIQIEAEPHTVNYKDFDIPEEYHEDWDKFNLWR
YVSKVDEHIIRAYSMASYPEEKGIIMLNVRIATPPPRNPDVPPGQMSSYI
WSLKPGDKVTISGPFGEFFAKDTDAEMVFIGGGAGMAPMRSHIFDQLKRL
HSKRKISFWYGARSKREMFYVEDFDQLQAENDNFTWHVALSDPLPEDNWD
GYTGFIHNVLYENYLKNHEAPEDCEYYMCGPPVMNAAVINMLESLGVEHE
NILLDDFGG
>MS0992 nrdA, NrdA protein
MCFSKVKTFMNKALMVTKRDGQVEPLDLDKIHRVITWAAEGLENVSVSQV
ELRSHIQFYEGIRTSDIHETIIKAAADLISKDAPDYQYLAARLAIFHLRK
KAYGHFDPPRLYEHVKKLVRLGKYDESLLSDFSREEWDEMDNFLDHSRDM
TFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAASLFSKYPAETRLD
YIHRFYDAISTFKISLPTPIMSGVRTPTRQFSSCVLIECDDSLDSINATS
SAIIKYVSQRAGIGINAGAIRALGSPIRDGEAFHTGCIPFYKHFQTAVKS
CSQGGVRGGAATVYFPMWHLEVESLVVLKNNRGVEENRARHMDYGVQINR
TMYQRLIKGGDITLFSPSDVPGLYEAFFADQAKFEELYVKYEQDPTIRKR
TVKAVDLFSLLMQERASTGRIYVQNVDHCNTHSPFDPAVAPVRQSNLCLE
IALPTKPLNNINDEDGEIALCTLSAFNLGKIDDLDELENLADLAVRSLDA
LLDYQDYPVPAAKRSSLGRRALGIGVINYAYYLAKNGVRYSDGSANNLTH
RTFEAIQYYLLKASMNLAKELGACEYFNETTYAKGILPIDTYKKDVDNLT
SEPLHYDWEQLRTEILEFGLRNSTLTALMPSETSSQISNATNGIEPPRGH
ISVKASKDGILKQVVPDYENLSDKYELLWDMPNMDGYLHLVGIMQKFVDQ
SISANTNYDPKRFEDDKVPMKILLKDLLTAYKYGLKTLYYQNTRDGADDA
QEDLDDGCAGGACKI
>MS0633 nrdD, NrdD protein
MLAMGSFFIIKRDGSRASFEIQRIINAIKKAAKAVGIDDERFCHLVSQQV
FDEIFQHNQNEIDISRIQQFVENKLMASAYPQVARAYIEYRHDRDLAREK
RSQLTKDIEGLIEQSNVEILNENANKDAKIIPTQRDLLAGIVAKHYAKSH
ILPRDVVEAHEKGEIHYHDLDYSPFFPMFNCMLVDLKGMLTQGFKMGNAE
IEPPKSIGTATAVTAQIIAQVASHIYGGTTINRIDEVLAPYVQLSYEKHL
KHAQEWNVPDQKAYADALIEKECFDAFQSLEYEINTLHTSNGQTPFVTLG
FGLGTSWQERLIQKSILKNRIRGLGKNHKTPVFPKLVFTIKHGINQSPKD
PNYDIKQLALECASKRMYPDILNYEQVVKVTGSFKAPMGCRSFLGAYEEN
GELVHDGRNNLGVVSINLPRIAIEAKGDEQRFYEILDQRLAVTKKALMTR
IARLENTKARVAPILYMEGACGVRLKADDNIAQIFKNGRASVSLGYIGIY
ETINALYNQGHIYDNEMLREKGVQIVEYLSKATKEWQKETGYAFSLYSTP
SENLCDRFCRLDTKQFGVIEGVTDKGYYTNSYHLDVEKKVNPYDKLDFEM
PYPSLASGGFICYGEYPNIQHNLKALEDVWDYSYDRVPYYGTNTPIDECY
ECGFTGEFECTSKGFTCPRCGNHDSEKVSVTRRVCGYLGSPDARPFNAGK
QEEVKRRVKHM
>MS0968 nrdF, NrdF protein
MAYTTFSQNKNDQLKEPMFFGQNVNVSRYDQQKYETFEKLIEKQLSFFWR
PEEVDVAQDRIDYAALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLV
SIPELETWIETWTFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEEIIKR
AKDISAYYDDLIRDSQLYNLFGEGTYKVEGKECKVTLRNLKKQLYLCLMS
VNALEAIRFYVSFACSFAFAERQLMEGNAKIIKFIARDEALHLTGTQHIL
NIMAAGQDDPEMAEIAEECKQEAYDLFLAAAEQEKAWADYLFKDGSMIGL
NKDILVQYVEYITNIRMQAVGLPLPFQARSNPIPWINAWLVSDNVQVAPQ
EVEISSYLIGQIDSKVDTSDFGDFDL
>MS1857 nrdG, NrdG protein
MGNADLFLRISGITAVENLLISNPRYPIVEIFESLQGEGFNTGMPCIFVR
FGKCNLACPWCDTDYERFEYRTLQQIVEKVRSFSAKNIIITGGEPTIQPN
ISLLLAQFKRDGYFLAIETNGLRAVPPQIDYISASPKAMYAEKYRRRCID
FAHEVRIVMDADAENFCQQIEQKIRAERYYLSPCEIEGKMNLLETIALLG
KLNQRPNKPKWQLSIQTHKLAGIE
>MS0632 nrdG, NrdG protein
MNYLQYYPVDIVNGEGTRCTLFVSGCTHACKGCYNQKSWSFSAGVPFDNA
MEEQILKDLKDTRIKRQGLSLSGGDPLHPLNVETLLPLVQRIKRECPDKD
IWCWTGYKLEELDDYQRKMLPYIDVLIDGKFVQELADPALVWRGSSNQIV
HRFRSNEF
>MS1819 nrfA, NrfA protein
MNGVIIVNILRKTLSSLAIVGLGFAMANSAVAEEKAMATHQQAQQLQQPA
PETAAKRAPTKEELTPVNPNLKIEAANEKFAADFPRQYNSWAKTAEQTEF
HKEVEDDPRMIVMWGGYAFAKEFNSPRGHIYAVTDVRNILRTGSPKDANG
GPQPMACWTCKGPDVPRLIAEWGEEGYFSGKWAKGGAEVVNSIGCADCHD
TQSQDFKDGKPALRVARPHVLRALDTVGKTFATSDRTDQRAGVCANCHVE
YYFDKSTGANNVVFPWYKGRDVDSIEKYYDEIGFKDWEHSISKAPMLKAQ
HPDFETWSMGTHGKNGVTCVDCHMAKTQDKDGKVYTDHQVVGNPVKDNFQ
NTCARCHDQSQDTLIKTVEQHKADVREVMLKLEDQLVKSHFEAKTAWDNG
ATQEEMKDALQAIRHAQWRWDFAAASHGMHMHAPDVALKIIASGLDRVAD
ARAKLAVILAKHGVQQPIQYPDISTAEKAWKVMGIDIEKERKEKEEFIKT
VIPEWNKEAISKGLILTAPPTTPAK
>MS1816 nrfD, NrfD protein
MSTLTYPVPFHTPDLVWDSSIAIYLFLLGISSGAVQLAIAYRRSHKLEKP
SENWIVRSAAVLGTIPTLIGLTLLIFHLARPWTFWKLMFNYQFNSVMSMG
VMLFQVYMLFMVIWIAVLFKAEIDNLIKKFVPKLRFVTNIIGACERIFSA
AEVILFILAAVLGAYTGFLLSALISYPMLNNPVLPALFLASGTSSGIAAT
FLCILIAGKLKGDSHEVHYIHKFEVPIMVTELGLIVCFFVGLYFGGGQKV
VALQNALSGFWGAVFWIGVMLIGIMIPLIANLFASDKWKYNAKFIILVSI
FDLIGVLCLRYFILYAGQLTIAM
>MS1811 nrfG, NrfG protein
MTQFIIGLIIFAAIGLLLFVFFTKKVSWQQNYRQQQNIALYEQQLQSNPG
EELANEFAQRLLMDEQQSEAALTLKTAVGFSRKLSALLWLVLIVMPLLYY
FSLNRFDYVRQGEKAFAQQQSRLITASAEDKNIDYVLSIQNKLRKDPNNA
DDWVELGQAYMLSNDYDNALLAYGNAEKLEGGKPHILGLAATTLYYQAGQ
KITPQVQHIIDIALAADPKEVSSLSLMASDAFLKNDFPSALQYWQRLLDS
GHTGLDRREIIRNMNLATMLQNNRMQQKAN
>MS1774 nrfG, NrfG protein
MFNNSMKKALFLSLIFALGGCSGLPMSDSESFVAKEKLYHSTNNYNGLIS
LYREQLKTTEDNSVRYKLALTYYQKGDSQSSLDYLQPLLNEQNLYFQSAT
ILQIRNLIQLQNYNEAISSASMLISKYPHNSEAYNLRGIANAQLGKYKNA
EQDINSARNRFINDVIAINNLAMLKIINGDYKNAVNLLLPQYLNGAKEQR
LVHNLVFALVKSGDTNYALDIIKKERLNTSPEDLVNALKKTEKVPNKVTT
ARYKK
>MS1820 nrfG, NrfG protein
MRKFKSLTLIALSVLVIASCSSSEKPVEQASEQELFSTGANYLQEGNYTQ
ATRYLEAVDSRFPGSSYSEQAELNLIFSTYKSQDYTKTLTTADRFLQQFP
QSQHLDYVLYMAALTNSALGDNLFQDFFGVDRSTRETTSMKTAFNNFQTL
VQNFPNSPYTPDALARMAYIKDRLARHELEIAKFYAKRSAWVATSNRITG
MLRSYPDTQATLEALPLLQESYEKMGLTQLASQAATLVKANEGRVIKEAE
KPKEPFLSLPSWLSFGSSDSSDKEKVATKSDDSFFSWPSWLSFGSKD
>MS0494 nrfG, NrfG protein
MIQLKKLFNFVVFLPGLFFAFALSGCVNGADDVFVSKNKIILGEQYPNVH
FDQEVMIVRISQMLIIGQLSKNERADLYFERGVLYDSLGLWGLARYDFTQ
ALALQPRSPAIYNYLGLYLLLDEDYDSALEAFNAVLELDPNYDYTYLNRG
LDFYYMERYNLAQQDLLKFYEAKKDDPYRALWLYINELKFKPNEATQNLA
RRAKDLSTEYWGTYIVQYYLNEISVKDLLDKAKVFVDPQSSQYAEILTET
YFYLAKQKLNAGHAEEAETLFKLAMANQVYNFVEYRFALFELAKLKTNSE
QTEQAVVQRVKTTQAPNSKELDAE
>MS0609 nrfG, NrfG protein
MTFWISALVFTLIMTFICFYPLLRGQTDREQETNRDSLNKAFYFDRLKEI
EEDEKQGLLDNAAQLKTELQQSLLEDIPEGVTEKTDKKAYSKLWFVSAFL
FLGIIAGVSYFKVGGWQSQEMMAKSYEKLPYFYERLKEEDTKPLDDTELQ
QFATALRIKLQKEPNDADGWWLLGQLGTAMGNGELAHNGYSKAAELKPDN
TDYKLAYARTLMFSDDKADRAKGNELLKEVIRSDHSNLQALSLLAFNYFE
EEDYKMAAVTWAMMLRLLPEDDPKRDLIEKSIRSARDALAEQEQEKHKRM
IPQNK
>MS0916 nth, Nth protein
MNKQTRIEILTRLRDNNPQPTTELTYNSPFELLIAVILSAQATDKGVNKA
TERLFPIANTPEAILALGVEGLKEYIKTIGLYNAKAENIIKTCRDLIEKH
QSQVPEDRAALEALAGVGRKTANVVLNTAFGHPTIAVDTHIFRVSNRTGF
APGKDVVKVEEKLNKVVPNEFKVDVHHWLILLGRYTCIARKPRCGSCIIE
DLCEYKDKTDL
>MS1897 nupC, NupC protein
MSMLTSLLGIFVLLAIAYSLSSNRKAINFRTVGGALLIQILIGAFILYVP
AGRDILLSMANGVAKVISYGNEGIKFVFGGLAGDKIFEVFGGDGFIFAVR
VLPSIVFFSALISLLYYIGVMQWVIKIIGGALQKLLGTSKSESMSAAANI
FVGQTEAPLIVKPYISRMTESELFAVMCGGLASIAGSVMAGYAGMGVPLT
YLIAASFMAAPAGLLFAKILVPQTEKFDDAIEHVELEKPANILDAAAGGA
SSGLQLALNVGAMLIAFVALIALINGILGGVGAWFGMPELSLGEIFGWIF
RPLAWLIGVPWEEAGVAGQMIGTKLAINEFVGYLEFTKYLTPETPMVLGD
KTKAVITFALCGFANFSSIAILIGGLGAMAPNRRGDIARLGIKAVIAGSL
ANLMSATLAGLFIELSGVALG
>MS1445 nusA, NusA protein
MSKEILLAAEAVSNEKLLPREKIFEALESAIALSTKKKYEQEIDVRVAIN
QKTGEFDTFRRWLVVDEVVNPTKEITLEAAQFEDPDIQLGDYVEDQIDSV
AFDRITMQTARQVISTKIREAERNKVVEQFRSEEGKIVTGTVKKVTRDSI
ILDLTGNKEDPAKAEAVITREDMLPRENFRPGDRVRGVLYKVNPESKGAQ
LFVTRAKPVMLEELFRLEVPEIGEELIEIKGASRDAGLRAKIAVKSNDKR
IDPVGACVGMRGSRVQAITNELGGERVDIVLWDDNPAQFVINAMAPADVN
SIVVDEDNHSMDIAVEQENLAQAIGRNGQNVRLATQLTGWTLNVMTTEEL
QQKHQAEDNKVLNLFMTSLELDEDFAQLLIDEGFSSLEELAYVPVSELTA
IDGLEDEDLVEELQNRAKDALTAKAVAEEEALKQAEVEDRLLNLEGMERH
IAFRLAEKNIKTLEELAEQGVDDLADIEELSAEKAADLIMAARNICWFGD
E
>MS0975 nusB, NusB protein
MTEQVKKRPSPRRRARECAVQALYSFQISQNPVETVELSFVTDQDMKGVD
MPYFRKLFRQTVENIPSVDSTMAPYLDRSANELDPIEKAILRLAVYELKY
ELDVPYKVVINEAIEVAKTFGAEDSHKYINGVLDKIAPALARK
>MS0205 nusG, NusG protein
MTETAVKKRWYVLQAFSGFEGRVATTLREYIKLNHMEDQFGEVLVPTEEV
VENVAGKRRKSERKFFPGYVLVEMEMNDDTWHLVRSVPRVMGFIGGTPDR
PLPISKREADLILNRVEENADKPRPKNTFQPGEEVRVTEGPFADFNGTVE
EVDYEKGRLKVSVSIFGRATPVELEFSQVEKANG
>MS0039 oadA, OadA protein
MTKKIKFTDVVLRDAHQSLFATRLRLDDMLPIAAELDKIGYWSLEAWGGA
TFDSCIRFLGEDPWVRLRELKKAMPKTPLQMLLRGQNLLGYRHYADDVVD
KFVERCVANGMSVFRVFDALNDPRNMQQALTAVKKQGGHAQGTLSYTTSP
VHTLDTWLNVTEQLLEIGIDSLVIKDMSGILNPMAAGELVGAIKGKFGDD
VELHLHCHSTTGMAEMALLKAIEAGADGIDTSISSMSGTYGHPATESLVA
TLQGTEYDSGLDIPSLEKIAAYFRDVRKKYAKFEGQLRGIDSRILVAQVP
GGMLTNLESQLKQQNAADKLDAVLQEIPRVREDLGYIPLVTPTSQIVGTQ
SVINVLMGERYKTIAKETAGILKGEYGRTPAPVNAELQARVLEGNQPITD
RPANHIAPEMDKLAAEVKQQAAEKGIKLAENEIDDVLIVALFPQIGLKFL
ENRGNPAAFEPVPTAEQAPAKAAAPVAPKAQSGAAVYTVELEGKAFVVKV
SEGGDITNIAPTQTSNAVPAPQAAPVAAPASGGTPVTAPMAGNIWKVVAT
EGQKVAEGDVLLILEAMKMETEIKAAQAGTVQGIAVKAGDAVAVGDTLMT
LA
>MS0038 oadB, OadB protein
MVSSMESIISLLKGTGVMHMEWGQAVMILISLLLLWLAIARKFEPLLLLP
IGFGGLLSNIPEAGLAMTALDNLLHLGSPDQIAAIAAKVGAIADPAAIKA
AVSGISASEHAQLEAMAVDMGYSAGILALFYNVAIGYGVAPLIIFMGVGA
MTDFGPLIANPKTLLLGAAAQFGIFSTVLGALTLNYFGLISFNLAQAASI
GIIGGADGPTAIYLTSRLAPELLGAIAVAAYSYMALVPLIQPPIMKALTT
EQERKIRMVQLRTVSNREKIIFPIVLLLLVALLLPDAAPLLGMFCFGNLM
KVSGVVDRLSDTTQNALINIVTIILGLSVGSKLIADKFLQPQTLGILILG
VIAFCIGTGSGVLMAKLMNKFSKNKINPLIGSAGVSAVPMAARVSNKVGL
EADNQNFLLMHAMGPNVAGVIGSAIAAGVMLKYVSAMIN
>MS0040 oadG, OadG protein
MTETELFKEGLNLMFSGMGFVIIFLLILIWAIGIVSKLINTFFPEPIPVA
QAKKTVTPTQSAVVDDIERLRPVIVAAIAHHRRTQGLN
>MS0506 oapA, OapA protein
MPFIIPIDFGNKTSKDIILKVLSDNRHTKILIFVTALSCATVTISLALRY
IVERTPENNPSSEKPSQNELDLGFNQVEPITPKKVVKPEPSLFNKAKSIF
AKKEAVEPNHFAVRKEPTFGESADNAKIENTTESTIAEKVKTVSATATSA
TSASIENMNSDINTSAESFNAQDVQAEPQVETSSKDTTDTEVKSKMKKPE
DWAVMQKLPRKHRRLAIALICVVILLLALLWLKPSSDTVEDFQTDNNKNL
PIEFQPLDQSQAIENVDVNNTAPTTEAVEQANATAENALSAPAPTTPNLQ
SESVATEAAKAETTDTTKVAEQAKPIEKVKAVDTPKATEKPRAVEQVKAK
PAEKTKATEKVSVAENKPVKPAPKKPSVTDAKPATVSAAGSASKTLVIPQ
GTSLMQVFRDNNLNISDVNAMTKANGAGGALSNFKPGDKVQVSLNAQGRV
KTMRLANGATFTRQADGTYQYSK
>MS1594 obg, Obg protein
MKFIDEALIRIEAGDGGNGCVSFRREKYIPKGGPDGGDGGDGGDVYLIAD
ENLNTLIDYRFEKRFAAERGENGRSSNCTGHRGKDITLRVPVGTRAIDND
TKEIIGDLTKNGAKLLVAKGGYHGLGNTRFKSSVNRAPRQKTMGTPGEKR
DLQLELMLLADVGMLGLPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGV
ARVDANRSFVVADIPGLIEGASEGAGLGIRFLKHLERCRVLIHLVDIAPI
DESDPADNIGIIESELFQYSEKLADKPRWLVFNKIDTISDEEAAKRAKDI
TERLGWEEDYYLISAATGKNIPQLIRDIMDFIEANPREVEEEEKAAEEVK
FKWDDYHNEQLSERGFDDEEDWDDDWSEEDDEGVEFIYKP
>MS1218 ompA, OmpA protein
MTKIFKRIIKMKKTAIALAIAGLAAATAAQAAPQENTFYAGAKAGWASFH
DGYTQYAEDGVGSHTKSVTYGVFGGYQIFNRDNLGLAVELGYDDFGRAAL
RTNGATSAKHTNHGAHLSLKPSYDLGALAPVLSGLDVYGKVGAALVRSDY
KVNDGYSYGFNKSDFADHSLKTSLLLGAGLEYALPSLPELAFRLEYQWLN
KVGKLENANGTRFDYTPEIHSVTAGVSYRFGQGVAAPVAEEVVSKTFTLN
SDVTFAFGKATLKPEASASLDNIYGEIAQVQSPAVSVAGYADRIGKEAAN
LKLSQRRAETVANYLVSKGVAQNAITATGYGEANPVTGNTCDAVKGRKAL
IACLAPDRRVEVSVQGTK
>MS1219 ompA, OmpA protein
MRTGIGKKAAILNFSQRRVEPVANSWVSKVVAQIAITATGTGEAIPVTGN
TCDAVKGRKALIACLAPDRRVKVAVKGEKQATM
>MS1220 ompA, OmpA protein
MKKTAIALAIAGLAAATVAQAAPQENTFYAGARAGWASFHDGVDAFHNSD
GLSAKKNSVTYGVFGGYQILNQNNFGLAVELGYDDYGRIRLIETDAKRGK
FTNHGVNLSIKPSYEVLDGLDVYARVGAALIRTDYKDYSIDAAGHSLKVS
PTFAAGLEYALPILPELAMRLEYQWIENVGRDQKWSGYNDADFTPDIGAV
TFGLSYRFGQGAAPVAAPEVVNKTFTLNSDVTFPFAKATLKPEATSTLDG
IYGEIAQVNNVSVNVNGYADRHR
>MS0561 ompC, OmpC protein
MKKTIIALITSALFFSGGASAVTVYSAEGTKVNLDGRASFELINRTDKRS
DLIDRGSRVRIHAYQDIGSGFTALANVEIRFTKDGDIGNQIYTKRLYGGF
QHKLGSLTFGKQALLADSIGYSNFTYELGKITMMPKDADKAVRLLTDWFY
GFRFGADYVFGTSEKYDDSDRELANKNRAYELAMFYNNKFGEFNVKGAVA
YSQQKAGTLAKDEYDKKAMSTSVQLGYGKGAVGFDWTKGKSIEGKKDFKF
RVGNNKFEEINLFEVGAKYAVTDKNNVYAEYLWGTGEIQGQDDGKFKGWF
LGADHQFNKRVVTYLEGGSFKTKRSGDTLEKEKRIALGLRVYF
>MS1337 ompC, OmpC protein
MKKTLVALAVAATTAAMAVPATAATVYEQDGAKVELSGSFRAFLGRVGDD
NRGDLKNDGSRVYVKASQDLGNGLSAFAGYQIRFEEEAYKTAQRGSDSDF
GDPTTRELYAGIKHVDIGALSFGRQNTNSDDFLDDAAYYTSASLSPLTTR
SDKSVKFKSAEWNGFSFGLDYLFGDSDKLDVTNDGNYKNGYAAVLFYHNA
IGEHAYNLKALYSQDRYEGFGSETGVKKTQWGLHAGYNYGPFDAALSYVN
YRTKFETGYFGTVGQVSLAREAGIIGDAKGNYILLDAGYRIIPESRLYVE
WERLDAKADDADYTAAIRNQYTAGIDYRLHKNVVPYIEYAHTRTKFANAE
TEKDNTFGVGLRVFF
>MS1913 ompR, OmpR protein
MAKILLVDDDTELTELLSELLSLEGFEVQIACNGEEALAKIDESYDIVLL
DIMMPVLNGIETLKRLRQNFTTPVLMLTARGDEIDRVLGLELGADDYLPK
PFNDRELVARIKAILRRSVLNKSASSEEETPFEERKAIEFAGLTLYPGRQ
QVMYQGQDLELTGTEFALLCVLIKHPGEVLSRELLSLEALGKNLTSFDRS
IDMHMSNLRKKLPTRPDDFPWFKTLRGRGYILLTD
>MS1504 ompR, OmpR protein
MLSPQILIVEDETVTRNTLKSIFEAEGYEVFEATDGNQMHQIIETQEINL
VVMDINLPGKNGLMLARELREKTNTALMFLTGRDNEVDKILGLEIGADDY
ITKPFNPRELAIRARNLLHRTMAENEKNSNTHVDAYRFNGWTLDINKRAL
IDPESVEYKLPRSEFRAMLHFCENPGKIQTREDLLKKMTGRELKPQDRTV
DVTIRRIRKHFEDHPDTPEIIATIHGEGYRFCGEIE
>MS1246 ompR, OmpR protein
MSSMMMRILLIEDDALIGNGIKVGLTKSGFSVDWFTDGKTGLQAIKSAPY
DAVVLDLTLPGMDGMDILQQWRNEKIDTPVLILTARDTLNDRVTGLQRGA
DDYLCKPFALAEVIARLQALIRRRYGQANPIVEHSLVKFDPNSRKVSLQG
KDIPLTTREYNLLELFMMNKERVLSRSFIEEKLYNWDDEVSSNALEVHIH
NLRQKLGKQFIRTVHGVGYALGKNEE
>MS0740 ompW, OmpW protein
MKKLALVLGISVALISGTVMAHSAGDVLIRAGGALVVPDVENSNPAWSGL
DVNSNAQLGLTATYMVTDNIGVELLAATPFSHEIKLGNTLVGKTKHLPPS
LYAQYYFLDKNSPVRPYVGAGVNYTTFFDEKEVLNGVTDLKLKDSWGLIT
NIGLDINVTDNFYVNAAMYYAKIKSKATFKVGGVAQENKVTLDPTIFFLG
VGYRF
>MS0466 oppA, OppA protein
MKKICTILTALFTATCVYADSTNNRLDYASTKDIRDINPHLYAGEMAAQN
MVFEPLVINTNQGIRPFLAKSWRISEDGKSYLFHLRKDVKFTDGEPFNAF
VAKMNIEAVLANFNRHAWLELVRQIDSVRAPDEFTLELTLKNPYYPTLTE
LALTRPFRFLSPKCFNQGKTSQGVMCYAGTGPWILKKHKKNALADFSRNE
NYWGELPKLNGVTWHVIPERQTMLLALLKGDIQLIFGADGDMLDMDSFKQ
ISESGQFISAMSEANASRAIVLNSARTITSDQKVRQALQYAVDKAAIAKG
VFNDTESIAETLMAKNVPYADVDVQTYPFNLLKAAQLLEEAGWNLSVGKN
IREKAGKPLSLLLSYNINNAAEKEIAQLLQADFRKIGVDLQILGEEKQAY
LDRQKNGDFDLQYSLSWGSPYDPASFVSSFRIPAHADYQGQKGLPNKTEI
DEMIGELLITPNEQTRIKLYQKLFKTLAEQAVYVPLTYSKTKAIYSAQLE
GVGFNPSQYEIPFEKMSFKK
>MS2053 oppA, OppA protein
MKLTTKFTLAALVLSAIGFVQAAETTFINCTSRAPTGFSPALVMDGISYN
ASSQQVYNRLVEFKRGSVDIEPGLAESWDISDDGLTYTFHLRKGVKFHAN
KEFTPTREFNADDVIFSFQRQLDSNHPYHKVSNGTYPYFNSMKFPSLLKS
VEKLNDHQVRITLTRKDATFLASLGMDFLSIYSAEYADKMMRAGKPETID
NQPIGTGPFVFAGYQVDKAVRFVANKDYWKGKAAIDRLVFSITPDAGTRY
AKLQQGACDLAEFPNTADIERMKADKRIQMPSQESLNVAYIAFNTEKAPF
DNVKVRQALNYAVDKNTILNAVYQGAGIAAKNPLPPTIWGYNDQVQPYEY
NPEKAKQLLAEAGFPNGFETELWVQPVVRNSNPNPRRMSELVQSDWEKVG
VKAKLVSYEWGDYIKRAKAGELTAGTFGWSGDNGDPDNFLSPLLAGVNAG
NSNYARWKNAEFDALLDKAIGLTDKAQRAALYKQAQVIAHDQAPWIPMAH
AVTYAPLSARVRDFKQSPFGYTSFYGVRVEDKK
>MS1325 oppA, OppA protein
MSNKMRSSLFSGKFSLVAKSAVIFCCFLSSVGCDRIKNLFSDTKQSVSEQ
PAESMTSTKQIQTETVPEQHILSRGVYSDLVLNIRDVKSSEQADFMRDLF
EGLVIFDIHGNIQPAVAESWETKDNKTWIFTLRQDAKWSNGEAVTAEDFA
QAWKLLALSSSPLRQYLAFIHIDQAQEILEGKSDISQLGIKAQDEYHLQI
SLDKPISYLPEMLAHIALLPAYSGGNSNKGELISNGAYKLAGQKADTISL
VKNEFYWNAEKVSFPQVHYQKLADNTDVKKVDLVTDFRQIKMENVVNFPK
LCTYFYEFNLKDQNLAKTAVRNALNSMISSHNIVRDSGLSGFAVSYFVPR
NMEFESDESWQATVVEQILQQADFSEKNPLQFKLTYEQEGIHPNIANRLV
RSWSQSDLISVKMEPVNWSQLQEKRAKGDFQIIRSGWCADYNDPSAFLNL
LYSKNPDNKTGFSQERVDKLLEKAQQTISEPERNELYRQVLLISRQEHLF
LPIFQYAKAVYLNPTLQGFDIHNPTEVIYSKDLSRKPMRQKN
>MS0856 oppA, OppA protein
MFIRKVTFIGFLLFSAMLPFFSWAAPRVPEILTQNGLIYCTHSSGFSFNP
QTADAGTSMNVITEQIYNKLFEIKNNSSRLEPSLAQSYKISEDGKTITVY
LRKGVEFHHTPWFTPSRNFNADDVVYSLNRVLGHNTSLPEFNASEQQKGM
KRQYNIFHELAKKTRFPYFDSIKLNQKIESVTALDPYTVQINLFAPDASI
LSHLASQYAIIFSHEYALQLNADDNLAQLDLLPVGTGPYQVKNYFRNQYV
RLIRHENYWKKEAEIKNIIIDLSPDRTGRLAKFFNNECQIAAFPDVSQLG
LLQENGERFQTTLSDGMNLAFLAFNFKRPLMQDAEIRRGIAQAINRHRII
KDIYYNTASVANKIIPSVSWAGSDSNNHSFAYDYDPAQAKKVLQDRQLSL
DMWVLKEEQLYNPSPIKMAELIKHDLTKAGIEVKVRLISRNFLMEQLRNN
SENYDLILGGWLAVSLDPDSFMRPILSCGTTSEITNLSNWCSQSFEEILD
RALISNSTNERAVNYHLAEQEVLSELPILPIASVKRILISNSNVQGVEMS
PFGSISFEKLSFKKGEK
>MS0462 oppF, OppF protein
MSLLKVENLTKSYRTFNSLFSHLSHPALQNVSFQLEKGESVGLIGENGSG
KSTLARIISGIEKADSGHVWLNGTDIYQRKNRRQQISVVFQDYFSSVNPT
MTVLQAICEPLLEQKQAAAKSLEPLVVQFLKKVNLSTDCLHKYIYQLSGG
QAQRVCLCRALINNPSLIILDEALSSLDIVTQVQLLELLIELKNEFQLSY
FFISHNIQMICYLCERVLFFKQGQIITQSDIENLAEIKSDYAQKLIRSVI
>MS1364 oppF, OppF protein
MSESIKQATPLLEAVNLKKYYPVKKGLFAKPQLVKALDGVSFCLEKGQTL
AVVGESGCGKSTLGRLLTMIETPTDGELYYNGQNFLENDKTTQKLRRQKI
QIVFQNPYGSLNPRKKIGSILEEPLVINTDLTAAQRKARVLEIMAKVGLR
AEFYHRYPHMFSGGQRQRIAIARGLMLQPDIVVADEPVSALDVSVRAQVL
NLMMDLQKEMGLSYVFISHDLSVVEHIADQVMVMYLGRCVEQGRVEAIFK
NPRHPYTQALLSATPRLSPKLSSERIKLEGELPSPLNPPKGCAFHTRCRL
ATERCKQEQPLLKDYSDGTRIACFMVE
>MS0852 oppF, OppF protein
MSALLQIDNLSKSFTDNLGFFAEHKLQAVKHISFTLEKKQMLAIIGKNGA
GKSTLAKMIVGIIPPTSGRILFKSTPLEFGDYKRRASHIRMVFQDVNNAF
NPRLNVGQTLDDPLRLLTTLNERERNERIFETLRLVGLYPEHANVGINTL
SISQKQRVALARALILEPEIIIIDDALGSLDASVKTQLTNLMLDLQERLS
LSYIYVGQHLGIIKHCADKILVMDEGEMIEYGETRHVLTRPQNDITKRLI
ESYFGKELDNSAWEKPEQNEM
>MS2242 oraA, OraA protein
MTTLAFSYAVNLLSRREYSEFEIRCKMQEKAFSEQEIEDTLAQLQQKNWQ
SDKRFTENYLRARAQRGYGVNRIKQELRQLKGILPETVDEALMECDIDWS
EIALNVLAKKFPDYRARQDAKNKQKIWRYMLSHGFFAEDFADFIGNGTED
EFY
>MS1512 orn, Orn protein
MQLDNQNLIWIDLEMTGLDPENERIIEIATIVTDKDLNILAEGPVLAVHQ
SDELLAKMSDWCIKTHSANGLVDRVKASKLTERAAELQTIDFLKKWVPKG
ASPICGNSVAQDKRFLFKYMPELADYFHYRHLDVSTLKELARRWKPELLN
GFEKKNTHLALDDIRESIAELAYYRDHFIKLDGDQK
>MS2101 osmC, OsmC protein
MKIFYHTSATATGGRDGHTRVDDGSIGFDLVGFQNESGKVGTNPEQLFAM
GYAACFDSAMNHVAPTLNLKPTKSSTTVGVGIGQKPDGAFGLDLDITITV
EGLSLEDAKTLINKAHEVCPYSNATRGNVDVRLHVNVIQSFDL
>MS1291 osmY, OsmY protein
MNMHKLKKLTFIIGSALLLQGCVAALVGGGAVATKVGTDPRTTGTQLDDE
TLKFQVYNAVNKDEQIKQEGRIVVSSYSGRVLLLGQVPTESLKSVATSLA
KGVDGVGDVYNEIRVGSPITVTQKTKDSWITSKIKSDMLLNSSVKTTDIK
VITENGEVFLMGNVTQEQANAAAEVARNIAVLKKS
>MS1345 paaI, PaaI protein
MGIWTKDYTLSELNQIGEHCSVAHLAIRISAIEENWIEATMPVDQRTKQP
FGLLNGGLSVALAETLGSIAGNLCLQEGQAAVGAEINASHLRPATSGLVT
ARATPVKLGKTLQVWQIDIRNEQNKVCCTSRLTLSVINKNEK
>MS2194 pabA, PabA protein
MSKRLLIVNNHDSFTYNLVDLIRRLSVPMRVIEVEKLDLDEVEQFSHILL
SPGPDVPEAYPEMFALLTRYYRHKAILGVCLGHQTLCRFFGGRLYNLRQV
RHGVCGRLKVRSKSAIFSGLPEEFDIGLYHSWAVDSQNFPAELTITAECH
EEVVMAFEHKTLPIYGVQFHPESYISEYGEQMLINWLNS
>MS1150 pabA, PabA protein
MATILFLDNFDSFTYNLVDQFRGLGHQVKIYRNDCDLALLESIALQPDTI
LALSPGPGTPAEAGNMLALIQRVKSAVPIIGICLGHQALIEAFGGKVVHA
GEVLHGKVSKINHDEQAMFLNLQNPMPVARYHSLKGSNLPEELVVNATYN
DIIMAFRHKNLPICGFQFHPESILTVQGAKLLENSVNWLLNK
>MS2293 pckA, PckA protein
MTDLNQLTQELGALGIHDVQEVVYNPSYELLFAEETKPGLEGYEKGTVTN
QGAVAVNTGIFTGRSPKDKYIVLDDKTKDTVWWTSEKVKNDNKPMSQDTW
NSLKGLVADQLSGKRLFVVDAFCGANKDTRLAVRVVTEVAWQAHFVTNMF
IRPSAEELKGFKPDFVVMNGAKCTNPNWKEQGLNSENFVAFNITEGVQLI
GGTWYGGEMKKGMFSMMNYFLPLRGIASMHCSANVGKDGDTAIFFGLSGT
GKTTLSTDPKRQLIGDDEHGWDDEGVFNFEGGCYAKTINLSAENEPDIYG
AIKRDALLENVVVLDNGDVDYADGSKTENTRVSYPIYHIQNIVKPVSKAG
PATKVIFLSADAFGVLPPVSKLTPEQTKYYFLSGFTAKLAGTERGITEPT
PTFSACFGAAFLSLHPTQYAEVLVKRMQESGAEAYLVNTGWNGTGKRISI
KDTRGIIDAILDGSIDKAEMGSLPIFDFSIPKALPGVNPAILDPRDTYAD
KAQWEEKAQDLAGRFVKNFEKYTGTAEGQALVAAGPKA
>MS0926 pcnB, PcnB protein
MISKNALSVVEKLNRNGYEAYVVGGCLRDLLLDKKPKDFDVATNARPDQI
QAIFQRQCRLVGRRFRLAHIMFGRDIIEVATFRANHSDIENENASKQSEE
GMLLRDNVYGTLEQDAERRDFTVNALYYSPKDNLVYDYFNGIEDLKAGKL
RLIGDPVTRYQEDPVRMLRSIRFMAKLDMFLDKSSAPHIRKLAHLLKNIP
PARLFDESLKLLQSGQGVKTYNLLREYHLFEQLFPSLMPYFTERGDSFAE
RMILTALTSTDERIADKLRVNPAFLFAAFFWYPLREKVEILKNEGGLNNH
DAYALASNEILDLFCKNLAAPRRHTATIRDIWFLQLQLLKRNGKAPERTM
EHNKFRAAFDLLAMRAEIEGGEAIELSAWWHEYQLSTDEQRSALVKEQDK
QHPQGKKKFYRPRRRKPKVKTATNP
>MS2312 pcnB, PcnB protein
MQTYLVGGAVRDQLLNLPVKDRDWVVVGATPEQLLSLGYQQVGRDFPVFL
HPKTKEEYALARTERKSGAGYTGFICDFSPHISLEQDLIRRDLTINAIAQ
DNQGKFIDPYEGISDLKNRTLRHISPAFAEDPLRVLRVARFAARYHQLDF
SIAPETIALMAEITEKGELQQLTIERVWQETEKALKEKNPEIYFQVLLQV
GALKILFPELYALYGVPNPAQYHPEIDSFLHTMLVLQQAVRLTENTEFNK
SAVRFAAICHDLGKALTPKDILPHHYGHEKAGIQPIRTLSNRLKVPTYYK
ELAEFTCEYHSYIHKAFELKPETVIKLFNKLDVWRKPQRFEELMLVCVAD
TRGRTGFEQTAYPQKDYLRQLYQTALQVNVQQVIEDGFEKQGIRDELTRR
RTIAVKTKKAEILPRFVGQ
>MS0066 pdxA, PdxA protein
MKPILGITMGDAAGIGPEVIIKALEDKRIYDLAHPVVVGDFKIMQRALPI
VKSNLKLRKVDDVDHYQSEFGYIDVIDLDNLPADLPFAKVDARAGKAAYE
FIERAVDLTLKGKIHAIVTAPLNKEALHAGGKMFPGHTEILAKLSNTEDF
SMMLTSEKLNVIHVTTHVSMRQACDLIKKERVLTVIELAQEYTKMLGFKE
PRIAVAGFNAHAGEHGLFGTEDEEEILPAVKEAQAKGINVIGPIPPDTVF
HRAANLDEFDMVVVMYHDQGHIPLKLIGFDSGVNVTTGLPFIRTSVDHGT
AFQIAGQGIADSRSMTEALYLGAKMANIKYSNQ
>MS0381 pdxA, PdxA protein
MKKPVIALTMGDPAGIGPEITVDTMLSEEIHNVCKPFVIGSIPILERAAK
VRGVSIKFNKIQDPSEAKYELGTFDVLETGNYDTDSIKWGEVQKLAGQMS
YDWVLKSIELGMAKKIDAVSTAPIHKIAIKLAGVKEPGHTEIYQNETHSE
YGLTMFSCHKLRVFFVSRHMSVIDACKYATKERVLQDVRNIDKELRNIGI
ENPFIAVAALNPHGGDNGLFGREEIDELIPAVEAAKAEGINATGPVPADS
VFHIGKSGKYDAILSLYHDQGHIACKTLDFEKSVTITFGLPFMRSSVDHG
TAFDIAGKNIANGVSMIESTKVLAEYTAKYMNKKA
>MS2064 pdxH, PdxH protein
MIDLHNIRNEYSQQALSEKQCDDDPLKQLEKWLNEAIQAKVNEPTAMNVA
TVGENGKPSSRVVLLKEVNERGLVFFTNYHSHKGRDLAVNPFAAVNLFWA
ELQRQVRVEGRVERISPQASDEYFASRPYTSRIGAWASEQSAVISGKNSL
LTKAALIAAKHPLQVPRPPHWGGYIVIPELIEFWQGRPSRLHDRIRYRLE
KGEWVRERLSP
>MS0805 pdxK, PdxK protein
MKNVLSIQSHVVFGYAGNKSATFPMQLMGVDVWALNTVQFSNHTQYGKWT
GMVIPKEQIGEIIRGIDEIGELKNCNAVVSGYLGSAEQVDEIIKAVEKVK
SLNPQALYLCDPVMGHPDKGCIVADGVKEGLINLAVSHADILTPNLVELR
EISGLPVENFEQAIEAVKVIRAKGPKTVLIKHLSKVGKYADKFEMLLAND
EGIWHLTRPLYTFAKEPVGVGDLTAGLFLANKVNGKSDLEAFEHMANAVN
EVMKTTFELNSYELQLIAARKLIVNPVSSVKAVKIA
>MS1550 pepB, PepB protein
MKYSVKQTALEQENKSLFIAIFENQELSPAALKLDLKLKGEITEAVKNGE
VSGKIGRILVLRHGAQRIILVGCGKQNEVTERQYKQIIQKAVKTAKETIA
TTIINALTEVKIKDRDLYWNVRFAVETIEEDNYIFEQFKSKKSENNSKLA
EIIFYTEENHEQAELAIRHATAISSGVKAAKDIANCPPNICNPAYLAEQA
NQLAGRSSLIETTVIGEKEMRKLGMNAYLAVSCGSKNEAKLSVMEYRNHE
NPNAKPIVLAGKGLTFDAGGISLKPAADMDEMKYDMCGAASVYGVMNAIA
ELQLPLNVIGVMAGCENLPDGNAYRPGDILTTMSGLTVEVLNTDAEGRLV
LCDTLTYVERFEPELVIDVATLTGACVVALGQHNSGLVSTDDNLAQDLER
AAKLANDKAWRLPLSEEYQEQLKSKFADLANLGGRWGGAITAGAFLSNFT
KNYPWAHLDIAGTAWLQGQNKGATGRPVSLLVQFLLNQVK
>MS0667 pepB, PepB protein
MQIQLSNLPAPKSWGKNPLLSFSDNQATIHLENSEKSDRTLIQKAARKLR
GQGIDDVELVGNDWSLENCWAFYQGFYTAKQDWAVEFPELGDDHEELLAR
IQCGDFVREIINLPSSVITPLELAQRSARFIAGLAEEYAGKSAVDFHIIS
GEELKAQNYLGIWNVGKGAENPPAMLQLDFNPTGNPESPVLACLVGKGIT
FDSGGYSIKPSNFMDSMRTDMGGAALVTGALGLAIARGLNRRVKLFLCCA
ENLVSGNAFKLGDIITYRNGVKAEILNTDAEGRLVLADGLIDASSENAQF
ILDAATLTGAAKVALGNDYHCVLSMDEELTTDLFNAAKEEQEPFWRLPFE
ELHRSQISSSFADISNTSSAALAAGASTATAFLSHFVKDYQQNWLHLDCS
ATYRKTPSDLWATGATGLGVQTIANLLLTKATQL
>MS0815 pepD, PepD protein
MQYNEQLLERFFNYVSLDTQSKPGAKTSPSTQGQLKLAKILEQELYSLGL
DEIEVSKHGIVTALLPGNIENSPTIGLIAHLDTSPQCSGKNVKPEVIENY
RGGDIALGLGDEFISPVTFTFLHKLVGKTLIVTDGTTLLGADNKAGIAEI
MTALSQLKESSVPRCHIRVAFTPDEEIGLGMKFFPIEKFSCDWAYTIDGG
AVGELEYENFNAAGATVTIFGRAIHPGSAKDKMVNALTLACEFQQGFPTD
EVPEKTEEKQGFFHLNSFHGDIEKVELHYLIRDFDKQAFTQRKAFLEKWV
DEFNCRKQLKEPVKVTITDNYYNMYDTVSKVPQSIELADSAMKACGIVPI
HQPIRGGTDGAWLAEKGLACPNIFTGGYNFHSKHELITLEGMCSAVDVIM
KIAQLAVK
>MS2118 pepD, PepD protein
MSEIQSLQPQLLWKWFDQICSIPHPSHHEEQLAEFIVNWAKGKGFYAERD
EAGNLLIRKPATKGMEHCQSVALQAHLDMVPQANEETDHDFTSDPIQPYI
DGEWVKAKGTTLGADNGIGMASALAVLDSENLAHPALEVLLTMTEEVGMD
GALGLRKNWLQSEIMINTDTEDNGEIYIGCAGGENADLTVPVQWQENNYE
HCYQISLKGLRGGHSGCDIHTGRASAIKTLARFLANLQQNQPHFEFSLSE
IRGGSVRNAIPREAFATLCFNGEPANFTQGVKSFESLLKTELAIAEPDLQ
LTAQPAEKATKVFAPNTKNNVVNLLNALPNGVIRNSDVVENVVESSLSIG
VLKTTEDAVKGTILVRSLIESGTNYINGLLISLTELCGASVQFSGRYPGW
EPHAETPILTLTKEIYGELLGYEPAIKVIHAGLECGLLKKIYPALDVVSI
GPTIVNAHSPDEKVHIPAVRTYWELLTKVLAGIPAKK
>MS1554 pepE, PepE protein
MQAVLSPLNMEIISGKMLRHNGESREEHLAEFLIVNPTALVYAHPESTAL
HIEGRQATILE
>MS1034 pepN, PepN protein
MHAKAKYRKDYKKPDFTVTDIHLDFQLDPQKTVVTAHSQYQRLNPAATVL
RLDGHSFQFASIKVNGKDFATYQQDGESLTLDLSDIDAERFELEVITRLV
PAKNTSLQGLYQSGEGICTQCEAEGFRQITYMLDRPDVLARYTTKITADK
SKYPYLLSNGNRIAGGDLEDGRHWVEWNDPFPKPSYLFALVAGDFDLLED
SFTTKSGREVKLELFVDRGNLNRASWAMESLKKAMKWDEERFDLEYDLDI
YMIVAVDFFNMGAMENKGLNVFNSKYVLANPETATDEDYLNIESVIGHEY
FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDVGSRAVNRIKNVKFLR
TAQFAEDASPMSHPIRPEKVLEMNNFYTMTVYEKGAEVIRMMHTLLGEKK
FQQGMKLYIAENDGKAATCEDFVAAMEQASGVDLTQFRRWYSQSGTPELT
VTDSYDEKKRSYKLYVSQMTAPTADQMDKVNLHIPLKIALYDMNGMPFSL
IKDDEAVNDVLDILLEDQVFEFHNITSKPVPALLCDFSAPVKLDYDYSTA
QLIALLKFAHNEFVRWDAMQMLFAQELRRNLSAYQQGEQLTFSAEILSAL
QQVLENYQSNVELTTLILTLPKETEFAELFKTIDPEGIAVVCDFMQHAIA
EGLQDLWLKTYHQINLEEYCIDMRDIALRGLRNLCLQYLAFTDYGNALVN
KHYLYADNMTDKLAALAAATKAQLTCRDKVMKDFEEKWQHDGLVMDKWFN
LQATRPDGNVLTLVKQLMDHPSFNFNNPNRLRALVGSFESQNLRAFHAVD
GSGYRFLTDVLLRLNESNPQVAARLVEPLIRFSRYDSQRQTLMKRALERL
REVENLSNDLFEKIEKALQ
>MS0479 pepP, PepP protein
MDLAYMAELPADEFVLRRQKLAAQLTDNSVFIVFSEVEKRRNNDCTYPFR
QDSYFWYLTGFNEPNSALVIQKKGKLVETTIFVRPSNPLMEIWNGRRLGV
ERAAEKLHLDQAFSIDDFARIFGKICQNSTALYHYQGLQPWADQLLAETF
ISPPDYINWAPMLDEMRLFKSANEVRLMQQAGQITALGHMKAMRQTRPNR
FEYEIESEILHEFNRFGARYPAYTTIVAGGENACILHYTENDQPLKDGDL
VLIDAGCEFAMYAGDITRTFPVNGKFTQAQREIYQIVLNAQKRAIELLVA
GNSIQRANDEVVRIKVKGLLDLGIMRGDIDELIANNAHREFYMHGLGHWL
GLDVHDVGSYSKEGQNGDRNSKVRDRPLEIGMVLTVEPGLYISPKSDVPE
QYKGIGVRIEDNILITEYGNKVLTAAAPKEIGDIEALMATER
>MS1958 perM, PerM protein
MIEMLKNWYLRRFSDPQAMGLAAILFFGFVAIYFFSDLIAPLLIALVLAY
LLEMPISFLSDKLKLPRFLSILLILGGFIAVTILMIFGLIPTLINQTVNL
FSDLPNMLNLSHQWVMSLPESYPELVDYQMIDSLFITIREKTLAFGESAV
KFSLSSLMNLVTIGIYAFLVPLMVFFMVKDQDELIAGFSRFLPKNRTLAS
KVWQEMQLQIANYIRGKLFEILIVAVVSYIIFLFFGLRYPLLLAVAVGLS
VLIPYIGAVLVTIPVALVAIFQFGATPTFGYLMTAYIVSQLLDGNLLVPY
LFSEAVNLHPLTIIIAVLIFGGLWGFWGVFFAIPLATLVKAVVNAWPSNE
DEAIS
>MS0428 perM, PerM protein
MNKSVSVNQFLIGFAALVIILAGIKMAGEIVVPFLMSLFIAIICSPIIKF
MTNRKIPHWLAISILFLFIVLVFFFLLGLVNSSIREFSQSIPQYRVLMSE
RLNEITALIQKWNLPLNLEKETILEHFDPSSIMNFVSRLLLSFSNVLSNA
FVLILVVIFMLLEAPTAKRKVALALSGNEKDASKEEKHLERILQGVISYL
GVKTAVSLLTGLCAWVLLETCGVQYAVLWATLTFLFNYIPNIGSIIAAIP
IVLQALLLNGFSTGFAVMTGIIAINMLIGNFLEPKLMGRTLGLSTLVVFL
SLLFWGWLLGTVGMLLSVPLTMALKIMLEASPNTTKYAALLGDVEESN
>MS0377 pfkA, PfkA protein
MIKKIAVLTSGGDAPGMNAAIRGVVRSALFEGLEVFGVYDGYYGLYHNKI
KQLNRYSVSDVITRGGTFLGSARFPEFKNPEVRAKCAEILRSHGIDALVV
IGGDGSYTGAKLLTEEHGIQCIGLPGTIDNDAPGTDYTIGYQTALETAVD
AIDRLRDTSSSHQRISIVEIMGRHCSDLTINAALAGGCEYIVASEVEFDQ
EELIQQIERSIANGKRHAIIAITELITDVHELAKRIEERVHHETRATVLG
HVQRGGSPCAFDRILASRMGVYAVDLLLQGKGGYCVGIQNEQLVHHDIID
AINNMRRSFKAELLDMNERLF
>MS0403 pflA, PflA protein
MSVLGRIHSFETCGTVDGPGIRFILFLQGCLMRCKYCHNRDTWDLHGGKE
ISVEELMKEVVTYRHFMNASGGGVTASGGEAILQAEFVRDWFRACHKEGI
NTCLDTNGFVRHHDHIIDELIDDTDLVLLDLKEMNERVHESLIGVPNKRV
LEFAKYLADRNQRTWIRHVVVPGYTDSDEDLHMLGNFIKDMKNIEKVELL
PYHRLGAHKWEVLGDKYELEDVKPPTKELMEHVKGLLAGYGLNVTY
>MS0401 pflD, PflD protein
MAELTEAQKKAWEGFVPGEWQNGVNLRDFIQKNYTPYEGDESFLADATPA
TSELWNSVMEGIKIENKTHAPLDFDEHTPSTITSHKPGYINKDLEKIVGL
QTDAPLKRAIMPYGGIKMIKGSCEVYGRKLDPQVEFIFTEYRKTHNQGVF
DVYTPDILRCRKSGVLTGLPDAYGRGRIIGDYRRLAVYGIDYLMKDKKAQ
FDSLQPRLEAGEDIQATIQLREEIAEQHRALGKIKEMAASYGYDISGPAT
NAQEAIQWTYFAYLAAVKSQNGAAMSFGRTSTFLDIYIERDLKRGLITEQ
QAQELMDHLVMKLRMVRFLRTPEYDQLFSGDPMWATETIAGMGLDGRPLV
TKNSFRVLHTLYTMGTSPEPNLTILWSEQLPEAFKRFCAKVSIDTSSVQY
ENDDLMRPDFNNDDYAIACCVSPMVVGKQMQFFGARANLAKTMLYAINGG
IDEKNGMQVGPKTAPITDEVLNFDTVIERMDSFMDWLATQYVTALNIIHF
MHDKYAYEAALMAFHDRDVFRTMACGIAGLSVAADSLSAIKYAKVKPIRG
DIKDKDGNVVASNVAIDFEIEGEYPQFGNNDPRVDDLAVDLVERFMKKVQ
KHKTYRNATPTQSILTITSNVVYGKKTGNTPDGRRAGAPFGPGANPMHGR
DQKGAVASLTSVAKLPFAYAKDGISYTFSIVPNALGKDDEAQKRNLAGLM
DGYFHHEATVEGGQHLNVNVLNREMLLDAMENPEKYPQLTIRVSGYAVRF
NSLTKEQQQDVITRTFTQSM
>MS1478 pfoR, PfoR protein
MKNRLKNFLIRQNIKFSLRRYAIDAMNFMALGLFGSLIIGLILKNTGDWL
DILWLNELGALAQSSMGAAIGVGVAYALKAPPLVLLSSTTTGIAGATLGG
PIGCFIAAAIGAEFGKLVNKTTPIDILITPAVTLLSGIATAQFMGPFLAS
LMRETGAMIMWAVELHPIPMSILVSVLMGMILTLPISSAAIAVTLSLSGL
AAGAATIGCCAQMIGFAVIGFKENRWGGLLSLGLGTSMLQIPNIVKNPKI
WVPPTLSGAIIAPFATVIFQMQNIPSGAGMGTSGLVGQIGTINAMGNSPY
IWLVILVLHFILPAILSLLITYLMRRKGWIKPGDLKLAV
>MS1537 pfs, Pfs protein
MKIGIVGAMKQEVEILANLMRNQTVTQVAGCTIYEGLINGKQVALLQSGI
GKVAAAIGTTALLQLAKPDVVLNTGSAGGVADGLKVGDIVISTETAYHDA
DVTAFGYAKGQLPACPATFISDEKLTALAKQVAQAQGHNVKRGLICSGDS
FIAGGERLAQIKADFPNVTAVEMEAAAIAQVCHVFRVPFVVVRAISDAGD
GQAGMSFEEFLPIAAKQSSAMIIGMLEQL
>MS1181 pgi, Pgi protein
MQNINPTSTAAWKALEAHKGTLENTTINDLFQQEKNRFADYSLTFNNEIL
VDFSKNKITRETLNLLRRLAKECALDEAKEAMFSGEKINRTENRAVLHTA
LRNRSNTAVLVDGKDVMPEVNEVLAKMKAFSERVISGEWKGYTGKAITDV
VNIGIGGSDLGPYMVTEALRPYKNHLNMHFVSNVDGTHIAEVFKKTNPET
TLFLVASKTFTTQETMTNAKSARDWFLATAKDEKHVAKHFAALSTNAAEV
EKFGIDTDNMFGFWDWVGGRYSLWSAIGLSIILSVGFENFEALLSGAHEM
DKHFRNTPIEQNIPATLALVGLWNTNFQGAQTEAILPYDQYMHRFAAYFQ
QGNMESNGKYVGRDGKVISNYQTGPIIWGEPGTNGQHAFYQLIHQGTTLI
PCDFIAPAKTHNPLADHHNKLLSNFFAQTEALAFGKTKETVEAEFLKAGK
SLDEVKDVVPFKVFAGNKPTNSILLQEITPFSLGALIAMYEHKIFVQGVI
FNIFSFDQWGVELGKQLANRILPELSGDEQVTGHDSSTNGLINQFKAWR
>MS0245 pgk, Pgk protein
MSVIKMTDLDLAGKRLFIRADLNVPVKDGKVTSDARIRATIPTLKLALEK
GAKVMVTSHLGRPTEGEFKPEDSLQPVVDYLKDAGFNVRLAQNYLDGVEV
NEGEIVVLENVRINKGEKKNDPELGKKYAALCDVFVMDAFGTAHRAQAST
YGVAEYAPVACAGPLLAAELDALGKALKEPQRPMLAIVGGSKVSTKLTVL
DSLSKIADQLIVGGGIANTFIAAEGHNVGKSLYEEDLIPEAKRLAAATNI
PVPVDVRVGTEFSENAPATEKSVTEVQADESIFDIGDKSAEELAKIIKSA
KTILWNGPVGVFEFPNFRKGTEVISNAIAEATANGAFSIAGGGDTLAAID
LFGIKDKISYISTGGGAFLEFVEGKVLPAVEILEKRANG
>MS0973 pgpA, PgpA protein
MMAHSSSTTNPLDKLSLRNPVHLLALGFGSGLIRPAPGTWGSLAAIIIGA
PILHWIGTVPFLVLILLGFALGVYLCQKTADDMGVHDHGAIVWDEFIGIF
ITLLAIPQISLFWCIAAFVLFRIFDIIKPYPISYFDKRLESGFGIMVDDV
LAAVYAAISLFLLHYII
>MS1532 pgpB, PgpB protein
MYLMLKCTLSIFRENFIMLKRLSLYTLLLCLVPIFVWISGWHWQGDAALT
QFDYFLYWLTETGSSPYAIITCGVFALLFFPFAKTKKQWVAVVAIMAVSM
VVTQGLKTGLKHVFAEARPYVVELAANSDISTEYFYDQTKEQRQSIVTDY
YSSRAETPGWLVEHRENEVSYSFPSGHTIFAVSWLLLAAGFFRLLGQTSS
GAKILLILTALWAFLMLVSRLRLGMHYPLDLLISTLIAWVLHCALFVFLE
KKRLFPRD
>MS0959 pgsA, PgsA protein
MTMKLNFPTFLTLFRVALIPFLIIAFYLPFGWSAFLSTAIFFVASITDWF
DGYLARKWKQTTRFGAFLDPVADKVIVATALVLIVEYYHVFWITIPAIVM
ISREIIISALREWMAEIGSRTTVAVSWIGKVKTTAQMFALGCLLWRYQYW
MEALGIILLYIAAILTVWSMIQYLKAAKDYLLEEINS
>MS1657 pheA, PheA protein
MALDLSEIRQQITQIDRSLLKLLSERHRLAFDVVRSKEITQKPLRDEKRE
QQLLQELINFSENENYQLEPQYITQIFQKIIEDSVLTQQVYLQKKLNEQR
EQSIHIAFLGKRGSYSHLAARSYATRYQEQLIELSCSSFEQIFEKVSSGE
ADYGVLPLENTTSGSINEVYDLLQHTDLSLVGELTYPIKHCVLVNGQDDL
SKIDTLYSHPQVIQQCSQFIRSLNKVHIEFCESSSHAMQLVSSLNKPNIA
ALGNEDGGHLYGLTVLRSNIANQENNITRFIVIARKAITVSPQIHTKTLL
LMTTGQEAGSLVDALTVFKKYQIKMTKLESRPIYGKPWEEMFYLEIEANT
NHPDTQAALEELRQYSTYLKVLGCYPSEIVKPVDVR
>MS1091 pheS, PheS protein
MQNLKEITEQARAALDELHDKGLDALEAFRVEYFGKKGHFTQLMQSLRNV
AAEERPAVGAKINEAKQAVLDILNAKKEAFEKAALNAQLEKERIDVSLPG
RKVELGGLHPVSLTIERVTKFFSELGFSVESGPEIESDYYNFDALNIPKH
HPARADHDTFWFNPELLLRTQTSGVQIRTMEKKQPPIRIMVPGKVYRNDY
DQTHTPMFHQIELLYEDKKVNFTELKGVLYDFLRAFFEEDLQVRFRPSYF
PFTEPSAEVDIMGKNGKWLEVLGCGMVHPNVLRSVGIDPNEYSGFAAGMG
VERLTMLRYNVTDLRSFFENDLRFLKQFK
>MS1090 pheT, PheT protein
MKFSEQWVREWVNPAVNTEQLCDQITMLGLEVDGVEAVAGEFNGVVVGEV
VECAQHPDADKLRVTKVNVGGERLLDIVCGAPNCRQGLKVACAIEGAVLP
GDFKIKKTKLRGQPSEGMLCSYRELGMSEDHSGIIELPADAPVGKDFREY
LILDDKEIEISLTPNRADCLSIAGVAREIGVVNQLAVTEPAINPVPVTSD
EKVAINVLAPEACPRYLLRSVKNVNVNAETPVWMKEKLRRCGIRSIDPIV
DITNFVLLELGQPMHAFDAAKLAQPVQVRFAADGEELVLLDGTTAKLQSN
TLVIADQTGPLAMAGIFGGQASGVNAQTKDVILEAAFFAPLAITGRARQY
GLHTDSSHRFERGVDFELQHKAMERATSLLVEICGGEVGEICEVVSETHL
PKLNKVQLRRSKLDALLGHHIETETVTEIFHRLGLPVSYENEVWTVTSAS
WRFDIEIEEDLIEEIARIYGYNSIPNNAPLAHLSMREHHESDLELSRIKL
ALVGNDFHEAITYSFVDPKLQSILHPEQAVWILPNPISSEMSAMRVSLLT
GLLGAVVYNQNRQQNRVRLFETGLRFIPDESAEFGIRQELVFAAVMTGSR
LSEHWASKAEPADFFDLKGYIENLLSLTKAGPYIKFVAKEFPAFHPGQSA
AIVLDGEEIGYIGQLHPMAAQKLGINGKAFACELIVDKVAERNVANAKEI
SKFPANKRDLALVVAENIAASDILDACREVAGSKLTQVNLFDVYQGQGVP
EGHKSLAISLTIQDTEKTLEEDDINAVISVVLSELKDRFNAYLRD
>MS0619 phnA, PhnA protein
MDQMPKCPKCNGEYVYHDSVNFVCPDCSYEWSGEDVAEEEETKVWKDSNG
NVLQDGDDVILVKDLKVKGSSIVLKKGTKAKNIRLVDGDHDVDCKIDGQP
FSLKSEFLKKA
>MS1186 phnL, PhnL protein
MNNGILLNCQNLTKDYIEGSVTTRVLKDVTFSMNDKELVAIVGSSGSGKS
TLLHTLGGLDQPTSGEVFIKGKSLQKASQDELAKLRNTYLGFVYQFHHLM
ADFTALENVLMPMLIGNQNKTEAKDRAEKMLNAVGLSHRITHKPSALSGG
ERQRVAIARALVNNPALVLADEPTGNLDHKTTESIFELIQQLNEDQGIAF
LLVTHDLNLAEKLNRRLIMQDGVLRPEM
>MS1682 phoH, PhoH protein
MTTQYKQEFTLTPQDNARLQSLCGAYDDNIKLIESEFNLDIARRNFTFII
QSKDKQSKPHHEALIKSAVKLIQDLYVETAPVRGKIKELDLSDVHMAIQE
SRMLLQPAQTDEKADTDESKVYTTTIKTKRGLIKPRGKNQIEYLHNILIH
DISFGIGPAGTGKTFLAVAAAVEALERQEVRRILLTRPAVEAGEKLGFLP
GDLGSKIEPYLRPLYDALYEMLGFERVEKLMERNVIEIAPLAYMRGRTLN
DSFIILDESQNTTVEQMKMFLTRIGFNSKAVITGDVTQVDLPRSQKSGLK
HAIEVLEKVEELSFNYFDSKDIVRHPVVAKVVQAYESWEAEDEIRKRKLA
EQRRAERAEIAENLKVD
>MS1917 pilF, PilF protein
MMTFKFQQKLTALFSLFISLLLSACSSSPQVNAENLAKQQAAKARIELGL
AYLHQQNINQAKQNFDKALEHAPQYYLTHSALAYFYQQLGDVKQARIHYQ
KAIDLDNNQGDVHNNYGTFLCSQGQFEQAYDEFEQALQSPNYYRQTDSYE
NLALCALSAKNNERFQRYLTKLEKLDPKRAAKLENISN
>MS2310 pitA, PitA protein
MELLHNYGTVIIFITAAFAFFMAFGVGANDVSNAMGTSVGSGTITARQAI
YIALIFEAAGAYLAGGEVTETIKSGIIDPMDFVTHPDTLVLGMMSALFAS
GSWLLIASRWGWPVSTTHSIVGAIVGFGCITAGAGAVKWSALTGIVGSWF
ITPFIAGVLAYGIFFCIQKLIFDTEHPLRNAQKYGPHLMGATVFIICIVT
VAKGLKHVGLNLTGLETLLISIALSLVSVVISYFYFRSKKFIKKVHKGVF
GGVEHVFSILMLMTACAMAFAHGSNDVANAIGPLASVVTIVESGGDIAAN
APIAWLVLPLGAAGIAVGLIVMGYKVMATIGTGITDLTPSRGFSAQFATA
ATVVVASGTGLPISTTQTLVGAVLGIGFARGIAALNLTVIRNIIASWIVT
LPAGAFFAIIIYYILDAIFR
>MS0004 pldB, PldB protein
MLNREPKFSNFALTELLPFAERCPLNYIKGKNNIKIAYRHFKHEHDGNDR
LLVLVNGRAENLLKWTEVAYDFYRQGYDVLSFDHRGQGYSDRLLKNRDKG
YIDEFRYYTDDMAAVIAEAYSYRQYSSCHLVAHSLGALISTYYLANYDHH
IKSAVLSAPFYGLQLRRPFIDQIIINLMILFGQGKRYVPGKEGYKPANLN
NNDLSFCKTRMKWMNRINRNHPAIHLGGPTFRWVHLCLNAIKGLPKIIPR
IEIPILILHSDKEKIVNNKNLQKLTALFQHVQVEEIQNAKHEILFERDAL
RARAIQRISKFFTDFK
>MS0745 plsB, PlsB protein
MQDVNLGVGKCIMTSLLNLYRKVLEAPLSFLVKNNPIPSNPIEELKLNVS
QPIVYVLPYTSQTDFVIFRKNCLSVGLPDPIETNDIHGKQLPRYVFLDEG
RQIFKSKGPKKETEKVFYNYLELHRAFGDLDVQVIPVSVLWGRAPGREDK
GKLPQLRLLNGMQKTIAAMWFGRDTFVRFSQAVSLRYMVTEHGADQSIAQ
KLARVAKMHFAKQRFSATGPRLPNRQAMFNKLLQSPVILSAIADEAKSKN
MSRERAHQEAEKILKEIAADVSYENLRVLDRLLRWLWNKLYQGIDIENAD
RVRQLALEGHEIVYVPCHRSHIDYLLLSYVLYHQGLVPPHIAAGINLNFW
PAGPIFRRSGAFFIRRTFKGNRLYSTIFREYLGELFHRGYSVEYFIEGGR
SRTGRLLTPKTGMMSMTLQALQQRQTRPITVVPVYIGYEHVLEVDTYAKE
LRGAAKEKENAGLVLRVIKKLRNLGKGYVNFGEPITLSNYLNQHYPEWKN
TDDEKPTWFNKAVDSISNQVMVNINNAAAVNAMNLTGTALLSSRQRALSR
EQLLEQLQSYQEFLQNVPYSDDVIVPAESPEEMLKHVLGLERVGVLVEKD
SFGELVRLERNSAVLMTYYRNNIQHLFALPSLVASIILHYEKIHNGELLH
AVQRIYPFLKNELFIHIEKEELTLVVEKIIAEFHRQKLIDVDGDVFGIND
RGIRTLQLWASAVREILQRYRITIAILQYKPDIARNALEKESQSVAQRLS
VLHGINAPEFFDKAVFAEFTASLKDNGYFDEAGNAVTEKLDELADILNHI
ISAEVNLTIRSAIEKAEEMPAQE
>MS0510 plsC, PlsC protein
MLKLIRIILVAICCVLICVLGTIFSLIRFRHPSNVGVMARWFGRLYPLFG
LRVEHRFPDNVDQNVPAIYIGNHQNNYDMVTISYMVRPRTVSVGKKSLIW
VPFFGILYWATGNIFLDRDNKNKAHNTMTELARRIQQDNISIWMFPEGTR
SRGRGLLPFKTGAFHAAISAGVPIIPVVCSTTHKKIDLNRWNNGKVICEI
MQPIDTQSYSKENVRELASHCYDLMKKRIAELDAELAQVGK
>MS1870 plsX, PlsX protein
MSRLTLALDVMGGDIGPRITIPASIKALEKDPMLSLLLFGDSQQINPLLE
QVPSALKERLRVCHCSRVVENNQGLSYALRHSKGTSMRLAIEAVQKGEAQ
GCVSAGNTAALMGLSKVLLQPLKGIDRPALISLLPTMDGGRTVMLDLGAN
IDCNANNLYQFALMGAIFAENQLDLVFPRIALLNIGIEEIKGYKSIREAA
DLLTGNSSLNYIGFIEGNLLLNGKADVIVSDGFVGNIALKTLEGAAKNVI
SLIKGKSRNHLLKPLFNWLIKLLFKDSYQRLQKINPDQYNGASLIGLTSI
VVKSHGAANIEAFNNAIHDAALQARQQIPEKILAGLQK
>MS1889 pncB, PncB protein
MQNSALLMDFYALTMANSYFEQGRQDEIAYFDYFFRRVPDEGGYAVFAGL
EQLLDYLENLQFSEQEIAFLQQKQIFSEGFLDYLRHFRFRGDLWAVKEGT
AVFPAEPLVVIRAPIIDCTLIEAFLLLTLNHQTLIATKAARIVSVAQGRN
VLEFGARRAHGVDAAHFGARAAYIAGVDGTSNVYSDFACGIPALGTMAHA
YVQSFDNEYEAFLQYAKTYPDNTVLLVDTYDTLHQGIPNAIRVHREYLAP
RGYKLKGIRIDSGDLAYLSIRAREMLDAAGLTETKITVSNSLDEYLIKDL
LLQGAKIDSFGVGERLITAKSEPVFGGVYKIVALENAGQIIPKIKLSETL
QKTTTPGFKNLWRLYDKNRKAIADVITLHDEIIDSTKPYMLFDPEYTWKT
KWVTDFVAEQKLQQWISQGGKQQPIPTLEESRAHCRQELQSLWNEVRRLE
KPHGYYVDLSQDLWTLKRHLIERHSGKKE
>MS0493 pnp, Pnp protein
MNPIVKQFKYGQHTVTLETGAIARQATAAVMASMDDTTVFVSVVAKKDVK
EGQDFFPLTVDYQERTYAAGKIPGGFFKREGRPSEGETLIARLIDRPIRP
LFPEGFLNEIQIIATVVSVNPQISPDLVAMIGASAALSLSGVPFNGPIGA
ARVGFINDQFVLNPTMAEQKQSRLDLVVAGTDKAVLMVESEADILTEEQM
LAAVVFGHQQQQVVVEAIKEFVAEAGKPRWDWVAPEPDSALISKVKAIAE
NRLGDAYRITEKQVRYEQIDAIKADVIAQITAEDEEVSEGKIVDIFTALE
SQIVRGRIIAGEPRIDGRTVDTVRALDICTGVLPRTHGSAIFTRGETQSL
AVVTLGTERDAQILDELTGERQDTFLFHYNFPPYSVGETGRVGSPKRREI
GHGRLAKRGVAAVMPSISEFPYVVRVVSEITESNGSSSMASVCGASLALM
DAGVPVKSAVAGIAMGLVKEDDKFVVLSDILGDEDHLGDMDFKVAGTRTG
VTALQMDIKIEGITPEIMQIALNQAKSARMHILGVMEQAISAPRAEISEF
APRIYTMKIDPKKIKDVIGKGGATIRALTEETGTSIDIDDDGTVKIAAVD
GNAVKTVMARIEDITAEVEAGAVYTGKVTRLADFGAFVAIVGNKEGLVHI
SQIAEERVEKVSDYLQVGQEVQVKVVEIDRQGRIRLTMRDLGSKEESQEL
SVEQ
>MS1224 pntA, PntA protein
MLIGVPRELLDNESRVAATPKTVQQILKLGFDVIIEHDAGFKASFEDNAF
EQAGAKIGDQQAVWNADVIFKVNPPTDDEIALMKEGSTLVSFIWPAQRPD
LMEKLSTKNVNVLAMDAVPRISRAQALDALSSMANISGYRAVIEAANAFG
SFFTGQITAAGKVPPAKVLVIGAGVAGLAAIGAANSLGAIVRAFDSRPEV
KEQVQSMGASFLEIDFKEEGGSGDGYAKVMSEEFNRRAMELYAEQAKDVD
IIITTAAIPGKPAPRLITKEMVDSMKPGSVIVDLAALTGGNCEYTKAGEI
FVTDNQVKVIGYTDFPSRLPTQSSQLYGTNLVNLLKLLAPNKDGQIDLNF
EDVVIRGVTVIRNGEVTWPAPPIKVSAQPQQKAAAQKVEKKEEKPKDPRI
KYGVMALAAILFLWLASVAPSAFLSHFTVFVLACVVGYYVVWNVSHALHT
PLMAVTNAISGIIIVGAVLQIAQGSFFISVLAFIAILVASINIFGGFKVT
QRMLAMFRKG
>MS1223 pntB, PntB protein
MSVGFVQAAYIVAAILFIMSLAGLSKHETAKAGCWYGIVGMTIALIATIF
GPATEGQLWILVAMAIGAVLGIRKALKVEMTEMPELVAILHSFVGLAAVL
VGFNSFGLHSAVAMPVNLDEAAQAAFLAEQAALDNIHNVEVFLGIFIGAV
TFTGSVVAFGKLSGKINSKALMLPHRHKLNLAALVVSALLMISFLNSPEN
IFPVLLMTAIALAFGWHLVASIGGADMPVVVSMLNSYSGWAAAAAGFMLN
NDLLIVTGALVGSSGAILSYIMCKAMNRSFVSVIAGGFGNDPVVSSDEEQ
GEHRETTAEETAELLKNASSVIITPGYGMAVAQAQYPVAELTQKLRDRGV
NVRFGIHPVAGRLPGHMNVLLAEAKVPYDVVLEMDEINDDFEDTDVVLVI
GANDTVNPAAMEDPNSPIAGMPVLEVWKAQNVIVFKRSMAVGYAGVQNPL
FFKENTQMLFGDAKDRVNDIISALN
>MS0193 pnuC, PnuC protein
MSLSKSLKDEFFGGWTKFEAFWLILFLAIQIGLFIYQPDSWIATIAAITG
IICVVFVGKGKISNYLFGFISVSLYAYTSYTFKLYGEMMLNLLVYVPVQF
IGFFMWRKHMTNKNTLNTAGVEEVIAKALTAKQWVLVILAAGLVTYAYIE
WLRHLGSALPALDGVTVGVSIVAQVLMILRYREQWSLWIIVNILTISLWV
GMYLENGETSLPLLTMYIMYLCNSIYGYYNWTQLVKKHQAG
>MS0225 polA, PolA protein
MFLLYFEIVMAQIAQNPLVLVDGSSYLYRAFHAFPPLTNSLGEPTGAMFG
VLNMLKSLITQVQPSHIAVVFDAKGKTFRDELFEQYKSHRPPMPDDLRKQ
IQPLHDIIRALGIPLLSIEGVEADDVIGTLALQASSAGKKVLISTGDKDM
AQLVDDNIMLINTMNNTLLDREGVIEKYGIPPELIIDYLALMGDSSDNIP
GISGVGEKTALGLLQGIGSMAEIYANLDKVADLPIRGAKKLGEKLAAAKA
DADLSYVLATIKTDVELDLNPEQLIIGTANKDELIEYFARYEFKRWLNEA
LNDESSVTKPQEQAVKINNYQATPALAKQESAVKNSVKIDRTLYETVDNQ
AKLQQWIEKIRQVKLVAVDTETNALDPMLAELVGISFALENGEACYIPLA
HVHQVAAQAENAQGDLFAESEQSSESRWEPVVGQLNKAECLSQLKPLLEN
PEIKKIGQNIKYDLTIFANNGINMQGVTFDTMLESYVLNSTGRHNMDDLA
ERYLGHHTIAFEDIAGKGKNQLTFDQIELKKAAEYAAEDADVTMKLHQTL
WREVAQSPELVKLYQEMELPLVSVLSRIERNGVLIDSRALLAQSKEFSQK
LTALENKAHELAGQHFNLASTKQLQEILFDKLGLPVLKKTPKGAPSTNEE
VLEELAYEHELPKLLVEHRGLSKLKSTYTDKLPQMVNRKTGRVHTSYHQA
VTATGRLSSSDPNLQNIPVRNEEGRRIRQAFIARKGFKVIAADYSQIELR
IMAHLSADKGLTAAFSEGKDIHRSTAAEIFGLALENVTAEQRRSAKAINF
GLIYGMSSFGLSRQLGIARGDAQRYMDLYFQRYPGVQTFMTNIREKAKSQ
GYVETLFGRRLYLPDIQSANAMRRKAAERVAINAPMQGTAADIIKRAMID
IDKAITDDPDILMIMQVHDELVFEVKEDKIEHYSALIKSLMENAAQLHVP
LIVDVGVGDNWDEAH
>MS0583 potB, PotB protein
MKGIKKDLKAWLLLCSGLGTILFLMGSTFYIVVTQSLGLYNISGEDSRFT
LQYWHDVLTNSVFQSSYIYSVKVSLLGAILSIIVSYPIAMWLRNELPAKV
TIITILRAPMLVPGLVAAFLFVNMISYHGILNETMVFLGIWHEPKTLQND
EFGWGVVILQMWKNIPFALILIGGAVNSLKTDLLDAAANLGSTSWQRFRY
VIFPLTLTAVQVSFILIFIGALGDFAFYSIAGPRSTYSLARLMQMSAYEF
EEWNQSAVMAMMIMLTSAFFTILVSIIIKPLAVKRGDIK
>MS0811 potB, PotB protein
MKMTTRKFQNSTVAVIFAWLIFFMFVPNFLVLIVSFLSKDSSNFYALPFT
FENYARLFEPLYGTVVWNSLYMSGIATVICLLIGYPFAFFMAKLNPKYRP
ILLFLLVLPFWTNSLIRIYGMKVFLGVKGILNEFLLFTGIIDEPIRILNT
EVAVIIGLVYLLLPFMILPLYSAIEKLDLRLLEAAKDLGANGIQRFIKII
IPLTMPGIVSGCLLVLLPAMGMFYVADLLGGAKVLLVGNVIKSEFLISRN
WPFGSAISIGLTILMALLIFVYYKANKLLNKKVELE
>MS0581 potC, PotC protein
MSSAKITTKNSKIIARISLTFFVLVNFIWLVLPFLMAGLWSLVDPKQPWS
YPDILPPSLSLERWQMVWENTSLPEAMFNSYTIAPTVSLITISLSIPTAY
AFGRMEFRGKKIAELLTLIPLVIPGMIIALFFSRMLLDLNISNPFVGIVI
GHVVLTLPYAIRILSAGFSSVPQDLIEASRDLGASKFTVFKDVYMPMLKP
SFLASIIFCLVKSIEEFAISFVIGSPDFITVPTILYSFLGYSFIRPNAAV
VSIILLVPNIILMMIIEKLLKGNYLSQSTGKA
>MS0810 potC, PotC protein
MSRVLRNIFMLVVYAYLYIPIIILVGNSFNADRYGLSWKGFSFAWYERLA
NNDTLIQAAVHSVTIAFFAATFATIIGSMTAIALYRYRFRGKQAVSGMLF
VTMMSPDIVMAVSLLALFMIIGISLGFWSLLLAHITFCLPYVVVSVFSRL
KGFDLRMLEAARDLGASEVTILRKIIFPLALPAIISGWLLSFTISMDDVV
VSSFVSGVSYEILPLKIFSLVKTGVTPEVNALATIMIVLSLLLVLLGQII
GKKDKS
>MS2292 potD, PotD protein
MLVTATAFFSTASFAAPKQLYIYNWTDYIPSDLISKFTRETGIKVNYSTF
ESNEEMFSKLKLTINKPGYDLVFPSSYYISKMVKENMLTPINHSKLTNLK
QIPSNLLNKDFDPANKFSLPYVYGLTGIGINTSFVNPDEVTGWGDLWKEK
FKGKVLLTADSREVFHIALLLDGKSPNTQNEEEIRNAYQRLTKILPNVAA
FNSDTPELPYIQGEVELGMIWNGSAYMAEKENPAIKFIYPKEGAIFWMDN
YAIPKNARNIEGAHKFIDFMLRPEHAKIIIERMGFSMPNEGVKVLLKPED
RVNPLLFPPEDEVKKGVFQADVGDATDIYEKYWNKLKTN
>MS0809 potD, PotD protein
MSWNYQGQIFYSLSTGANSMKKLAGLFAAGLIAVAVTGCNDKESKSADAN
APETAKDNGTVYLYTWSEYVPDGLLDDFTKETGIKVIVSSLESNETLYAK
LKTQGADGGYDIIAPSNYFVSKMAREGMLKELDHSKLPVIQELDPDWLNK
SYDPNNKYSLPQLLGAPGIAYNTQTYKGSDFTSWGDLWKPEFAGKVQLLD
DAREVFNIALLKLGKNPNTQDPAEIKAAYEELLKLRPNVLSFNSDNPANA
FISGEVEVGQLWNGSVRIAKKEQPGSIDMIFPKEGPVLWVDNLAIPATSK
NPDGAHKLINYLLGAKAAEKLTLAIGYPTANVEAKKVLPKEITEDPAIYP
TAELLRTANWQEDVGEAVELYEKYYQELKAAK
>MS0580 potD, PotD protein
MRNILRKALSLTITALAVANFAQAENLTDKSWPDIEAQAKKEGKLTVSVW
YLQPQFRVFVKEFEKQYGIQVKVPEGTLDGNINKLIAEKNLEKGKMDVVV
LSADRVSNVTNNGVLANIKQLPNFGKLNHFLQGVDLGETAVGYWGNQTGF
AYDPLRITEDQLPQSWQDVENYIQQNPKKFGYSDPNGGSSGNAFIQRALV
YVNGEYDYMTPTVDAAQVANWKKTWEWFNARKNVMIRTASNADSLTRLND
GELVLVSAWQDHLFSLQKQGAITTRLKFYVPQFGMPGGGNVATIAKNAPN
PAASLVFIHWLTSPEVQQKLSQEFGVRPLDSESGKRDTLFFSTPWRKAEM
EAFTKEVVSR
>MS0552 potE, PotE protein
MSNKKIGLLSLTALVLSSMIGSGIFSLPQNMAAVAGAEAISIGWLITGIG
IIFLGLSFFFISRLRPELDGGIYTYAREGFGDLMGFMSAWGYWLCATIGI
VGYLTVAFEGLGVFTDSENTVIFGQGNTVASFIGSSIIVWLVHALIAGGI
KEAASVNLVATFVKVAPLVLFILLGFWFFDTDIFNSDVKASALNNNIGDQ
VKDTMLITLWVFTGVEGASVLSAHAKKRTDVGLATVLGILIALALYVAIT
ILALGILPRETIAEMPNPSMGPLLDAMMGPTGKVIITACLIVSVLASYIS
WTMYSAEIPYRGAQKGAFPKILDKLNENSTPINSLWFTGFIVQFCLILVF
VFEQSYNTLLLISTSMILIPYFLIGAYLFKLAIQTNSAWYIKLTGFMASI
YGLWIVYAAGLQYLLLSVVLYVPGILLFLYSHRKFHGKFKLKGFEQTILA
MIFILFCYAVYRLPELLAA
>MS2221 ppa, Ppa protein
MADFNQILTPGDVDAGIINVVNEIPEGSCHKIEWNRKVAAFQLDRVEPAI
FAKPTNYGFIPQTLDEDGDELDVLLITRQPLATGVFLEAKVIGVMKFVDD
GEVDDKIVCVPADDRDTGNAYNTLSDLPAQLIKQIEFHFNNYKALKKPGS
TKVTHWGDVEEAKEVIRESIKRWNEQ
>MS1017 ppc, Ppc protein
MTEEYLMMRNNINMLGRFLGETIQEAQGDDILELIENIRVLSRNSRSGDD
KARAALLDTLSTISADNIIPVARAFSQFLNLTNVAEQYQTMSRSHEDKVS
AERSTAALFARLKEQHVSQEEIIKTVQKLLIEIVLTAHPTEVTRRSLMHK
QVEINKCLAQLDHTDLTAEEQKNIEYKLLRLIAEAWHTNEIRTNRPTPLE
EAKWGFAVIENSLWEGLPAFIRKLNDAAVEHLNYALPVDLTPVRFSSWMG
GDRDGNPFVTAKITREALQLARWKAADLFLTDIQELCDELSMTQCTAEFR
EKYGDHLEPYRVVVKDLRSKLKNTLDYYNDILAGRIPPFKQDEIISEDQQ
LWQPLYDCYQSLTACGMRIIANGLLLDTLRRVRCFGVTLLRLDIRQESTR
HSDAIGEITRYIGLGDYSQWTEDDKQAFLIRELSSRRPLIPHNWTPSEHT
REILDTCKVIAKQPEGVISCYIISMARTASDVLAVHLLLKEAGISYHLPV
VPLFETLDDLDASKEVMTQLFNVGWYRGVIKNRQMIMIGYSDSAKDAGMM
AASWAQYRAQDALVKLCEQTGIELTLFHGRGGTVGRGGAPAHAALLSQPP
RSLKNGLRVTEQGEMIRFKLGLPAIAAESLDLYASAILEANLLPPPEPKA
SWCRVMDELAVASCEIYRNVVRGDKDFVPYFRSATPEQELAKLPLGSRPA
KRNPNGGVESLRAIPWIFAWMQNRLMLPAWLGAGASIRQAMESGKAAVIE
EMCNHWPFFNTRIGMLEMVFSKTDSWLSEYYDQRLVKKELWYLGESLRKQ
LSEDIATVLRLSGKGDQLMSDLPWVAESIALRNVYTDPLNLLQVELLRRL
RADPEHPNPDIEQALMITITGIAAGMRNTG
>MS0624 ppiB, PpiB protein
MKKFKFLTALFALFFVFNANAKNVTLHTNYGDIKIALNEKKAPVSSKNFL
DYAQTGFYDNTIFHRVIDGFMIQGGGFEPGMNQKKTNEAIRNEANNGLKN
LRGTIAMARTSAPHSATAQFFINLQDNDFLNFTEESQQGWGYAVFGKVTE
GMNVVDKIAKVATGRVGMHRDVPKEDVVIKSVTVE
>MS0623 ppiB, PpiB protein
MRNQIPIHFYKENTMVTLHTNFGDIKIALNHEKAPETAANFEAYCKEGFY
NNTIFHRVIDGFMIQGGGMEPGMREKNTKAPIKNEANNRLSNKRGTIAMA
RTSDPHSATAQFFINVADNAFLDYRAKEMFGREVVQEWGYAVFGEVVEGM
DVVDKIKGVKTGNAGFHQDVPKEDVVITSVTVE
>MS0360 pppA, PppA protein
MSNLYLVFFACLLGYLTYYYLSNFRTKLHKDIYYAFYQIFPQKQPHFTIE
QADQAGQLSPLSLANQWIYIGVSIVLCSIIQTLTKDLTLTCFYMSYLVLL
FIIGKLDWHYQLIEPALCQLLLLILSGASYFRLISNSLEDVVESAVISFV
IFYLVYHISKLCYKKEVFGQGDYWLISALSAGLSWRDIPLMISLACLLAL
FYALIYNRLLAHTKISLVPFAPFLCTANLVTLFIKMLI
>MS0818 pqiA, PqiA protein
MIFDQMPNAQRYIRKYGKSAVDFPAKFLLDVDFADKNRSIILSD
>MS0819 pqiA, PqiA protein
MPKIDRTFPFFLLLMSKYGIKYPPLKQKRDNMAKASSLSASQNASIVRCN
DCNALVALSELKKSQQAECPRCHNVLKSQDRWRLRRCAIIAISILILMPF
ALTYPLLSVDLLGITVDASVWGGVWKMATEGYPYTAFLVFICAVFLPVSF
ALLVILLYLAKLTHQKPRNLLFALGYIKPWVMFDVYLVALGVSAFKVRDY
ATIHIDIYLIAFVLTSLLTTLLFIKINPKELWNDFYPQNQHLIKISPENP
PHFCRTCEYTFEHSAFDRKSHTICPRCHSRLDTPSYVNLQNTWATLIAGI
IMLFPANIFPISYTIMNNVATGDTLMSGVITFIGMGSYFVAFVVFFASIF
VPVSKVFIMIYLLLSIHFQWKHSIKWQMSLFHIVHFVGRWSMLDLFVLSL
MMSLVTRGQIINFSVGPAALYFGIAVFLTMLSTTFFDTRLLWNIYDKQPS
K
>MS0817 pqiB, PqiB protein
MASPFSLRCFQPHSLIPVYFGIFMTNNQANNRVKINAENNVQAAKIKQDK
RISPFWLLPIIALCIGALLFFQIIKEQGETIRITFTTGDGLVANKTQVRY
QGLQIGIVKKVNFTDDLKKVEVQASIYPEAKNVLRENTKFWLVQPSASLA
GISGLDTLISGNYISLQPGDGNYKDDFIAEETGPIAQVSDGDLLIHLLAD
DLGSISEGASVYYKKMPVGKIYDYRFTPDQKKVEIQVVIDKAYANLIKQD
TRFWNISGINANVGPSGITVNMDSLNAIVQGAITFDSPDNSPKAKQDQQF
TLYPTLQAAQRGIEVKITLQNQAGLKAGKTEVFYNNLQVGTLAKLDNEDI
THAKISGTLLLDPNISNELRTNTNIILRTPKMNLATLEKLPDMLRGQFFE
IIPGSGEPQREFQVYKESDLLLKQADTLVFTLTAPETYGIAEGQQIFYNN
LPIGEIVKQTLNEQGVEYQAAIAGKYRHLIYGDSQFVAASNLDISLGIDG
LRVEAASPDKWLQGGIRLIANKNKGSALSSYPIYKDLSSAEAGITSSTLT
PTITLNAQNLPNIGKGSLVLYRQYEVGKVLDIRPLKNSFDVDVAIYPKYR
HLLTKNSLFWVESASQVDITARGISIQTSPLGRVLKGAISFDNSGGNNNK
TLYANELRAKSAGQVITLTADNATNLTKGMALRYMGLEVGQLESINLDQN
KNQVVVKALMNPNYMNLVAKEGSEFRIISPQISAGGIENLDSLLQPYIDI
DAGKGKYKTTFAIKNNNNTDNKYNNGFPIILEASDALNITTGSPIYYRGV
EVGKINRMELNELGDRVLIHLLIANKYRHLVRKNSEFWISSGYSAGVGWS
GIEVNTGTVQQLLKGGISFSTPSGTVIQPQAAANQRFLLQIKKPVEAKTW
NSAVLPEQN
>MS0821 prc, Prc protein
MKLNKSKTYLATLVVSAVIGMSGNAFAVHPTLKASDIVIPQPTEENGLAT
KRATTRLTQSHYRKFQLDDEFSHKIFARYLDFLDYSHNLFIKSDIDELQA
KYAALLDDELNEGKLDIAFAMYDLMAKRRYERYEYALSLLDKEPSLKDDD
QIEIDRKKAAWPASVEEANKLWAARVKNDIIDLKLKDKKWSEIKKTLTKR
YNLAIRRLTQTNADDITQLFLNAFAREIDPHTSYLAPRTAKSFNESMNLS
LEGIGATLQQEDDVTTIKSLVPGAPAERSKRIKAGDKIIGVGQAKGEIED
VVGWRLEDVVDKIKGKKGSKVRLEIEPAKGGKSKIITLVRDKVRIEDSAA
KLTVEKVNGQNVAVIKIPTFYIGLTADVRKLLEQMKAKKATSLIIDLREN
GGGALTEAVELSGLFISDGPVVQVRDAYNRIRVHEDPDNAQVYTGPLLVM
TNRFSASASEIFSAAMQDYNRGIIIGQDTFGKGTVQQSRSLNFVYDLDQE
PLGFIQYTIQKFYRINGGSTQLKGVTADINFPAIIDTKENGEEKEDNALP
WDKIPAATYSQVSHARDAVEVLKSKHLDRISKDPEFIALAEDLKIRDERS
ERKYLSLNYEKRKAENDKDDARRLKALNERFAREGKKALKDINDLAKDYE
APDFFLKEAEKMASDLAKFETDKESIQAKAMSLENKADTKDVKAESKKTA
EVKTETVKSKEDVRPETK
>MS1192 prfA, PrfA protein
MKPSIISKLDSLNERYEELEALLGDASVISDQDKFRAYSKEYSQLEEVVK
TFSRWKQLNSNIEEAELLLDDPEMKEMAQMEIEESKNELEEVEQHLQILL
LPRDPNDEYNAYLEIRAGTGGDEAGIFAGDLFRMYSRYAEMKRWRVEVLS
ENESEQGGYKEIIALVSGDNVYGQLKFESGGHRVQRVPKTESQGRIHTSA
CTVAVMPELPESEMPEINPADLRIDTYRASGAGGQHINKTDSAVRITHIP
TGMVVECQDERSQHKNKAKALAVLASRLVQAEQDKLAAEQATTRRNLLGS
GDRSDKIRTYNYPQGRVTDHRINLTVYRLDEVMNGKIDELIQPIITEYQA
DQLAALSDQP
>MS1542 prfB, PrfB protein
MEQPDVWNEPEKAQALGKERSALETVVNTIKKLDQGLEDVDGLLELAVEG
EDEETFNEAVTELDELEQQLAKLEFRRMFSGEHDACDCYIDLQAGSGGTE
AQDWTEMLLRMYLRWAESKGFKTELMEVSDGDVAGLKSATVKVSGEYAFG
WLRTETGIHRLVRKSPFDSNNRRHTSFAAAFIYPEIDDDIDIEINPADLR
IDVYRASGAGGQHVNRTESAVRITHIPSGIVVQCQNDRSQHKNKDQCMKQ
LKAKLYEMELQKKNADKQALEDSKSDIGWGSQIRSYVLDDSRIKDLRTGV
ENRNTQAVLDGDLDRFIEASLKAGL
>MS0449 priA, PriA protein
MILLMKFVRVALAVPLMRFFDYILPEQMQPVIGGRVLVPFGRQKRVAIVV
EFAQETDIPKEQLKPVLNVLDDAGLFNDDMWNLLKWGAGYYQFSLGDVLF
SALPVKLRNGESVVEKNKILWKLTALGEQAMVSGELKRAKKQLEALTELT
KNPLEKGNNEFSAAIWSQLKEKRFVEEVTQPLQIIPWQIRLGGKEIMRAE
QRLTLNKQQALALSRLLFHQGFAAWLLDGVTGSGKTEIYLQYIEEILKQD
KQVLVLVPEISLTPQTVQRFQARFNVDIDVIHSNMNDSQRLLVWQRARTG
QSAIVIGTRSALFTQFKRLGLIVIDEEHDNSFKQQDGGWRYHARDLAIVY
AKQLDIPIVLGSATPSLESLNNVKNRKFKHIVLSHRAGAGSGLKHEVIDL
KRQRIQHGLSDTLLRKMASHLEKGNQVMLFLNRRGFAPVMLCHECGWIAT
CTQCDKPYTYHQHQRVMKCHHCEIQKPVPMQCGACGSTHLVTTGIGTEQL
EFVLQQQFPQYEVTRLDRDSTVRKGALENHLSAIKQGKSRILIGTQILAK
GHHFPDVTLVALVNVDSALFSLDFRAEERLAQLYVQVSGRAGRAEKQGEV
VLQTHYPDHPLLQQLLHDGYHAFANSALQLRRQMGLPPFSAQALFRAQSK
SSEEAEQLLQQIASYFYDWKNRQNMPDLQLLGPMPAPFSKKAGRFRWQLL
LQHPSKSVLQHALGQFNFENEVKSSQARWILDVDPQDLS
>MS0470 priB, PriB protein
MKTTILRMLKSNLSINNRLSLEGFVTEQPKRTKSPNGIEHCRIWLEHRSE
QIEAGLKRQAWCKMPVHISGTQLVQKTQSITVGSHLLVVGFLTLHKTSKG
LSQLVLHAEHIEYL
>MS0533 prmA, PrmA protein
MAWVQIRLNSTNEKAETISDYLEEIGSVSVTFMDSQDTPIFEPLPGETRL
WGNTDVIALFDAETDMQQIVRLLRQEGHLDENTAYKIEQIEDKDWEREWM
DNFHPMQFGKRLWICPSWREVPDPNAVNVMLDPGLAFGTGTHPTTALCLE
WLDGLDLAGKTVIDFGCGSGILAIAALKLGAKEAIGIDIDPQAILASRNN
AEQNGVADRLKLYLSEDKPANMKAEVVVANILAGPLKELYPVISELVKEK
GNLGLSGILATQAESVCEAYQAKFDLDAVVEREEWCRITGKLK
>MS1604 proA, ProA protein
MTDLIQMGKQAKQAAFALSQLSQQEKNHALALIAERLEAQQERILAENAK
DIQAARENGLSESIIDRLLLTKERLTGIADDVRHVISLADPVGKIIDGGV
LDSGLKLERIRTPLGVIGTIYEARPNVTIDVASLCLKTGNAVILRGGKET
QHSNKILVEVIQNALQQAGLPEMAVQAITDPDRALVMELLHLDKYVDMII
PRGGAGLQALCRDNSSIPVIVGGIGVCHIFVEQSADQDRSLAVIENAKTQ
RPSTCNTVETLLVQESIAEEFLPKLARRLKTKEVKFHADSTALSILQGVS
ADVKPVTEQQLRNEWLTYDLNVVIVKGIEEAVEHIREYGSEHSESILTES
QKLANQFVAQVDAAAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMG
LEALTTYKWVCVGDYTSRA
>MS1862 proB, ProB protein
MKFGTSTLTQGTPKLNRAHMIEIVRQLAQLHQEGYRLVIVTSGAMAAGRH
YLNHPKLPPTIASKQLLAAVGQSQLIQTWEQLFAIYDIHIGQLLLTRADI
EDRERFLNARDTLHALLDNRIIPVVNENDAVATAEIKVGDNDNLSALVAI
LVQAEQLYLLTDQQGLFDSDPRKNPQAKLIPVVNEITDHIRSIAGGSGTT
LGTGGMSTKITAADIATRSGIETIIAPGNRENVIADLAHGEAIGTKFTVQ
TDKLESRKQWLFAAPSAGILTIDQGAENAILEQHKSLLPAGIVNIEGRFS
RGEVVKIRTQQGKDIALGMPRYNSDALYLIQGKKSQNIEKILGYEYGSVA
IHRDDMIVLNK
>MS1799 proC, ProC protein
MKNKLLTFIGGGNMAQAIVFGLLNKGYSAAKLIVCDRNEAKRNLFAQKGV
EVNLTNVEAAEKAEVVVLAVKPQAMAETCGPLSAVDFSGKLVISIAAAVS
VSRLSALLPTAKNIVRVMPNTPALVSEGMAGLFASAGLNGEYQDFAEDLL
NAVGKTCWLQKEEDMHAVTAGSGSSPAYFFLFMEAMEKTLSSMGISPENA
RTLVQQSALGAAKMVENNPQLPLSTLRENVTSKGGTTAAALAVFNQYQLD
KIVQQAMEACVARSQEMEKLF
>MS1798 proP, ProP protein
MNVRPFTWLALSYFGYYCAYGVLVPFLPVWLKSQNYGTELIGAVIASSYL
FRFLGGIFFPSRVKRANQILPALRLLAWANVFVITAMAFVSESFWLIFIA
IAVFSMVNAAGMPLTDSMATTWQRQIRLDYGKARLIGSAAFVVGVTVFGS
LIGAIGEQYIISILIGLFGLYAVLQMVPPQPKPADEDKNSAKSAVGFGEL
LKNPTHLRLIIAAMLIQGSHAGYYVYSVIYWTNRGIAVETTSLLWGLGVI
AEILLFFFSGRLFRNWSVNAIFYLSAAAAALRWGAFSYTDALWQIALLQC
LHSLTFAALHYAMVRYIGMQPQNAMVRLQSLYSGLASCASVALLTALAGI
IYPISSHWVFLVMMICALIALFVIPRKPTNA
>MS0191 proP, ProP protein
MSNKVNSYGWKALMGSAVGYAMDGFDLLILGFMLSAISADLSLSPTQAGS
LVTWTLIGAVAGGIIFGALSDKYGRVRVLTWTIVLFAVFTGLCAFAQGYW
DLLIYRTIAGIGLGGEFGIGMALAAEAWPARHRAKASSYVALGWQVGVLA
AALLTPLLLPIIGWRGMFLVGIFPAFVAWYLRAKLHEPEVFVQKQAEVAT
GKRQSPFKLLIKDVATAKVSLGVVVLTSVQNFGYYGIMIWLPNFLSKQLG
FSLTKSGVWTAVTVCGMMAGIWIFGRLADRIGRKPSFLLFQIGAVISIIA
YSQLTDPAIMLFAGAALGMFVNGMMGGYGALMSEAYPTEARATAQNVLFN
LGRAVGGFGPVIVGAVVSAYSFKIAIALLAVIYVIDMIATVFLIPELKGK
ALK
>MS2054 proP, ProP protein
MASGEANYRSLAWIAASALFMQSLDATILNTALPTIAADLHHSPLEMQLA
VISYALTVALFIPISGWVADKYGTLRVFRFAVGMFALGSLACAMSSSLIM
LIFSRVLQGFGGALMMPVARLSIIRSVPKQELLPVWNLMATAGLTGPILG
PILGGWIVTYTSWHWIFLINIPMSLLGIWLANRYMPNVTGSLQKLDWAGF
FFLGGGLVGVTLGFDLISEEFIAKWQATVIVILGVILIITYCFHAQKRER
LALLPLSLFKIRTFRVGIMANMLIRLCASGIPFLLPLMYQVVFHYSADKA
GMLIAPIALSSMLVKPLCGRILTKLGYRTALISASIVLTLSIAVMSFLHI
DSPVWILIVNVALYGGCISIVFTAVNTLTISELSDQDASAGSTFLSVVQQ
VGIGLGIAVSALILSLYRYFIGESAVQLQQAFGYTYLTSASFGVLLVLVL
SGLKKEDGAHLHK
>MS2374 proP, ProP protein
MSGEKTSRYVLGVTLVATLGGLLFGYDTAVISGTVSSLDTVFIQPKGLPE
ISANSLLGFCVASALIGCIIGGACGGYLSSKYGRKKALLIAALLFLISAF
GSAYPEFGLKTINETNNIPYYLSNFLIQFVIYRIIGGIGVGIASMVSPMY
IAEITPARIRGKMVSFNQFAIIAGQLIVYFVNYFIALNGDNTWLNMLGWR
YMFLSEMVPAALFLILLFFVPESPRWLVLQNKFSQAEITLLKLLGERSGK
TELQNIVSSLEHRVVKGAPLFSFGLGVIVIGIALSVFQQFVGINVALYYA
PEIFKSLGASTNNALLQTIIMGTINLSCTTIAIFTVDKYGRKPLQIIGAL
GMAMGMFVLGMAFYANLSGTIALTGMLFYVAAFAISWGPVCWVLLAEIFP
NAIRSQALAIAVAAQWIANYIVSWTFPMMDKSSYLVERFNHGFAYWVYGL
MAILAALFMWKFVPETKGKTLEELELLWNKK
>MS0499 proP, ProP protein
MNLREHIDNNPMSAYQWTVVIIAAIMNLLDGFDVLALAFTATAIRGDLGL
SGAELGYLFSAGLLGMAAGSLFLAPLADKIGRRPLLLISVTLSALGMLGS
AYSASYGALGFWRLITGLGVGGILVGTNVLTSEYSSRKWRSLAISIYASG
FGIGAVLGGMFAVVLQEEYSWHAVFLAGFILTAVCLIVLLIWLPESIDFL
MTQQPRNAQIRLNKITKKMGLKGQWTLPEKVLASASKLPLTQLFNKNYRK
STALIWIAFFAIMFCYYFVSSWTPALLKEAGMTTEQSVSVGMMVSLGGTC
GSLLYGLLASRWKAKQMLVQFTVLSAFSVIIFILSSSILWLAMLFGILVG
GFMNGCISGLYTLNPSIYAANIRSTGVGWSIGVGRIGAILAPLAAGVLLD
YGWDKQSLYIGVGFVLLIAAIALSLLRIKTTLVKC
>MS1178 proP, ProP protein
MPNKAETSPAKLRLKAFLKRIKIMNTTENSKQKPVNVVAFAFLLTAFLTG
IASSFQTPTLSLFLAQEIQVSPFMVGMFYTSNAVLGIVLSQILAKYSDSQ
DDRRKIIIFCSLLAIGGCITFAYNRNYYVLMFFATFLLSLGSSANPQAFA
LAREYADYTKREAIMFTTIMRTQISLAWIVGPPLSFSIALGWGFEYMYMV
AASAFLLCAIIAKALLPYVPRKAVVPLTKPDEVAGLPAKNKKQSDKQSIR
LLFITCFLMWSCNGMYLISMPLHVINELHLSERLAGILMGTAAGLEIPVM
LIAGYLTKYLTKKSLILTALFMGLFFYIGMLFAEQTWQLVALQAFNAIFI
GIIATLGMVYFQDLMPGKMGSATTLFSNAAKSSWIVAGPFVGIIAQIWNY
SSVFYISIVLVAVSLFSMSKVKSV
>MS0785 proP, ProP protein
MQNKFAVYLAAIGHLVTDMAQGALPALLPLFIKNYGLTYQEAGGLIFANT
VLASIAQPFFGYLADKRSMPWLIPLGMMLSGCCIAAMGFVHSYPGLFFFA
MIAGIGSALFHPEGARLVNRMSGGEKGKAMGIFAVGGNAGFAIGPMFAGL
AYLFGAQTLSIFALINTIIALIIFLQLPKLTVENVVNKAKNTASTTLQND
WRSFAKLSVIIFVRATNFTVLNAFIPIYWIHILHQQETDANFALTIFLSM
GVAITFIGGLLSDRLGYVRIIRYAYLIFLPTILIFTQSENLWLSFILLIP
LGLGVFTQYSPIVVLGQTYLAKSVGFAAGITLGLGITMGGIFSPIVGWIA
DHYGLQIALQTLSVLSLLGLIFSYRLKITDTEKPEKK
>MS0797 proP, ProP protein
MSQNHFFSHIFNRNMLICIFTGFSSGLPLYILTSLIPTWLRSTEIDLKTI
GFFTLTSLPFIWKFLWSPFLDRFVPPFLGRRRGWMLIFQLLLLISLGLFG
FIDPHTNQGLSLLIGLATMVSFFSASQDIVLDAYRREILSDQELGMGNSI
HVSAYRIAGLVPGSLSLILSDHFSWQAVFIITALFMLPGLLMTLFISHEP
QIELKSNRTLAENIVEPFKEFFQRKGLWGAIGILTFIFLYKFGDSMATAL
ISAFYLDMGFTKTQIGLVVKNASLWPMIIAGIIGGMITLKIGINKALWLF
GLVQIVTILGFAWLAQLGPFEKVDSFAIFALTVVVMAEYVGIGLGTSAFV
AFMARATNPVYTATQLALFTSLSALPRAVFNSFSGVLIENMGYYHYFWLC
FFLAIPGMLCLIWVAPWKEK
>MS1530 proP, ProP protein
MNTETKQPALIVPRLSLMMFMEFFIWGSWSVTLGIVMTKYDLSTLIGDAF
SMGPIASIISPFILGMLVDRFFPSEKVLAVLHLIGAAILWFIPEFITGQQ
GGTLVFALLAYMLCYMPTVALTNNIAFHSLADSEKSFPVIRVFGTIGWIV
AGLFIGQADLSASPAIFQVAAICSLILGLYSFTLPNTPPPAKGKPFSMRD
LMCADAIALFKIPHFLVFAICATLISIPLGTYYAYAAPFLDAVGFEKIGS
LMSMGQMSEIVFMLLIPFFFKRLGVKYMLLAGMLAWFLRYAFFALGVSEE
IRWAVYLGILLHGICYDFFFVVGFMYTDKVADEKIRGQAQSLVVLFTYGL
GMLLGSQISGGLYNNMFADNTDVSTWSTFWWIPAISAVVISVIFFIFFNY
KEDKREA
>MS0807 proP, ProP protein
MLMTSQNKINAVPSNQNFYLNNRNYWIFSGYFFVYFFIMATCYPFLGIWL
GDINGLSGEDRGTVFAMMSFFALCFQPVFGYVSDKLGLKKHLLWVLGISL
LIYAPFFIYIFAPLLKVNVWLGSLVGGAYIGFVFQAGAPASEAYIERVSR
RSKFEYGRVRMFGMFGWAICASIAGVLYATNPNLVFWLGSIASLILLLLI
ALAKPEQTSTVQIAEKLGANKNPVNLRQAFALLKLPKFWALLAYVMGIAC
VYDIFDQQFGNFFNTFFESHEQGIKMFGYVTTAGELLNALIMFFVPLIIN
RIGAKNALLIAGTIMSVRIIGSSYAIEAWHVVVLKTLHMFEVPFYLVGLF
KYIANVFEVHFSATIYLVACHFAKQIGNMLVSPLVGAWYDTYGFQDTYLI
LGCIAAGFTLLSVFTLTGKSLSSQS
>MS1407 proP, ProP protein
MMTSSRPNLTLLLILGALMACTSLSTDIYLPAMPTMAKELQGNTELTITG
FLIGFAIAQLIWGPISDRIGRKIPLFIGMALFAVGSVGCALSQSMAEIVF
WRVFQAVGACVGPMLSRAMIRDLYDRSQAAQMLSTLTIIMAAAPIIGPLL
GGLLLKISSWQAIFWLLVVIGILLFLSIIKLPETLPPAKRAAGSFWSAFG
NYRILLKNRAFMRYTLCVTFFYVAAYAFITGSPFVYIDYFKVDPQYYGFL
FGVNIVGVALLSAVNRRLVRHYPLESLLRVSTMIALCAVLILVVLVFMDL
DGIAGILSVAVPIFIMFSMNGIIAACTNAMALDSVQPEIAGSAAALLGSL
QYGSGILSSLLLAYFSDGTPHTMAWIIALFVGLCAVIGWGQRPRSA
>MS0392 proP, ProP protein
MSTAKKRNFIFIATLGILSMLPPLGVDMYLPSFLNIARDLQVDPERVQYT
LTFFTFGMAAGQLFWGPVGDSYGRKPIILLGVIIGAVAAFFLTGVNSIEN
FTALRFIQGFFGSAPVVLVGALLRDLFDKNELSKTMSMITLVFMIAPLVA
PIIGGYLVLFFHWHSIFYVICAMGILSAILVFFIIPETHHQDNRIPLRLN
VVVRNFVTLWRRKEVLGYMFSSGLGFGGLFAFLTAGSIVYIGLYGVPVDQ
FGYFFMLNIGVMTLGSVINGRVVHRVGAERMLQIGLTVQLIAGIWLLIVA
CFDLGFWPMALGIAVFVGQNSLISSNAMASILEKFPTMAGTANSVAGSVR
FGLGATVGSLVALMKMDSAAPMLFTMGICVIVAVCCYYFLTYRSL
>MS0820 proQ, ProQ protein
MLGIIFSYITCCVQDRKQMTEIQTDVQAESQKLTNNKEIIAYLAEKFPLC
FSVEGEAKPLKIGLFQDLAEALKDDERVSKTQLRHALRQYTSNWRYLHGC
RLGAERVDLQGNPAGVLEQEHVEHAQQQLAEAKAKFAEKRAAEKAANTKQ
VKKRPARKPSDKAMKATRKPANDKVRKAKVELKEIDFATLQKGSQVKVKV
GDSAKQAVVLDVIKDNARVELDNGLVLTVATDRLFA
>MS2063 proS, ProS protein
MVNYRQFFNLKNNRNPIIMRTSKYLFSTLKETPNDAQVVSHQLMLRAGMI
RPMASGLYNWLPSGIRVLEKVKNIIREEMNKSGAIEVLMPVVQPAELWQE
SGRWEQYGLELLRFNDRGNRDFVLGPTHEEVITDLVRREVSSYKQLPLNL
YQIQTKFRDEVRPRFGVMRSREFIMKDAYSFHTTKESLQATYDVMYQTYS
NIFTRLGLDFRAVQADTGSIGGSASHEFQVLASSGEDDVVFSTESDYAAN
IELAEAIAVGERAQPGAAMQLVDTPNAKTIAELVEQFNLPIEKTVKTLIV
KGATEEQPLVALIIRGDHDLNEIKAEKLPEVASPFEFADEADIKAKIGAG
VGSLGPVNLNIPVIIDRSVALMSDFGAGANIDGKHYFNINWERDVALPKI
ADLRNVVEGDPSPDGKGTLLIKRGIEVGHIFQLGQKYSEAMNATVQGEDG
KPLVMTMGCYGIGVTRVVASAIEQHHDERGIIWPSDAIAPFTVAIVPMNM
HKSESVQAYAEELYQTLLAQGVEVIFDDRKERPGVMFADMELIGVPHMVI
IGEKNLENGEIEYKNRRTSEKQMIAKDQLLDFLKANVNV
>MS0549 proV, ProV protein
MTTSVKISVKNLTKIFGSHPKSAFKLLQNGKTKEQIFAETGSTVAVNNVS
LDIMAGEIFVIMGLSGSGKSTLIRLLNRLIEPSAGHVFIGDDDIAEMSEK
ALRAVRRKRISMVFQSFALMPHMTILENVAFGLELSGVNSKNRRRMALET
LARVGLEAYADVYPGELSGGMQQRVGLARALANDPEILLMDEAFSALDPL
IRTEMQDELLRLQENSERTIVFISHDLDEAMRIGNRIAIMQDGQVIQVGR
PDEILQNPANDYIRSFIQGVNVSNVLSAKDIASKRHLLNIVQKSEDETPH
VAFKLLEQHERDFAVVLDRYGYYKGMVSVDSLQQARSNRQSLSQSFIEIT
PLSPEQSISDIINDVATTREPLPVVDDKGHYYGVVTKVKVLQTLDRGTEA
>MS0550 proW, ProW protein
MTTENIRTADPWEATLQAAQQDNAYAWLQGSEQSQDFNWMYPFDHTLVPF
GDWVESLINWLVTHLRSFFQFISAPIDYILSLFQTSLNVLPPTVMIILFT
LLVWQFTHFRLALATLLSITLIGAVGAWNEMMITLALVLTSVSFCLLIGL
PLGIWMARSTRASAIVKPVLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVT
IIFALPPIVRLTILGIQQVPEALIEAAQAFGASKKQLLYKVQLPLAMPSI
MAGVNQTLMLSLSMVVIASMIAVGGLGQMVLRGIGRLDMGEAATGGLGIV
LMAIVLDRLTQKIAENMHSQHKVRWYERGITGLFIRKK
>MS0551 proX, ProX protein
MAYPMKLTILFSLALFASNAVRADDKAIQPLQSPLAEETFQTLIVVKALE
ELGYRVNPPKEVDYNVAFTSIANGDATFMAVHWLPLQADKYANAGGDRKL
YRQGTFVEGAVQGYMIDKKTADTYNITNLAQLKDPKLAKLFDTNGNGKAD
LIGCSPGWSCEYTVSQHIDGYGLSRTVEVTQGNYSALIANTIAQYQNGKS
ILYYTWTPYWVSGVLVPGKDVVWLQVPNRPDPGKTVADTNLANGKNYGFT
VSSMHIVANKTFTDAHPDAARLFAVMRLPAGDISAQNMAMRNGQNSSQDI
ERHAEAWIKFHRVQFDEWIKQAKSAKN
>MS1536 prsA, PrsA protein
MPDIKLFAGNATPELAKRISERLYISLGDATVGRFSDGEIQVQINENVRG
SDVFIIQSTCAPTNDNLMELIVMVDALRRASAGRITAVIPYFGYARQDRR
VRSARVPITAKVVADFLSSVGVDRVLTCDLHAEQIQGFFDVPVDNVFGSP
VLINDILKKTDLENPIVVSPDIGGVVRARAVAKLLNDTDMAIIDKRRPKA
NVSQVMHIIGDVSDRDCILVDDMIDTGGTLVKAAEALKERGAKRVFAYAT
HAVFSGSAAQNIANPALDEVVVTDTIPLSAEIKALGKVRSLTLSAMLAEA
IRRISNEESISAMFDA
>MS1864 psd, Psd protein
MLGEYNIMNSLEKKQITYGQRLKIAFQYAMPQIYLTQIAGWFANKRWGAV
THFVIKMFAKKYNVHMAEAAKPNFSDYATFNEFFIRQLKEYARPINQNTD
ALCLPADGKISQCGHIDDELLLQAKGHSFSLRDLLAGDEELTRLFKDGEF
VTTYLSPRDYHRVHMPCNGTIRKMIYVPGELFSVNPFLNTHIPNLLARNE
RVICLFDTDFGPMVQILVGATITASISTVWEGVINPPRTGDIRTWTYEGQ
SAVSLAKGQEMGAFQLGSTVINLFPKNAVKLADYLQVDTVTRVGEILAYK
K
>MS2183 pspE, PspE protein
MFKEITPQQAWQLMIEENATLVDIRDEQRFTYSHAKGAFHLTGQSYGKFQ
IQCDFDDPVIVSCYHGISSRNVAAFLVEQGYDNIYSIIGGFEGWQRAGLP
IETAY
>MS2248 pspE, PspE protein
MFPVVSYLSIILMIKEFMMNITKITARQLQEKLAQGALLIDIRDADEYSH
ECIEQAVSQPLTGLKPEICNNSPCVIFHCQSGMRTQANMALLAKASAAAA
EVYILDGGLNAWKKAGFATVVNKAQPLPLMRQVQIAAGSLVLLGVILGYS
VSPWCFLLSGFVGAGLIFAGVSGFCGMAVLLSKCPWNK
>MS2215 pspE, PspE protein
MEEFMPMATEFAKNHTLLIAAWVAIFVIVIFQLVKSFTSKVKILSNAEAT
SLINNEDAVVIDLRSIDEFKRGHIAGSLEFIPTDIKNRNLGKLEQHKDRH
VILVCANGFTARSSAQLLTKQGFAHVYVLNEGIMGWKSQNLPLVK
>MS0998 pta, Pta protein
MSRTIILIPISAGVGLTSVSLGLIRALEQKGTKIGFMKPISQPRSGEDML
DRTTSIVRTSTTIETTEPVMLSEAENLIGQNQTDVLLEKIVAQHQQISKD
NDIVIVEGLIPSRKNSYANSVNYDIAQALDAEIILVSAPATETPAQLKER
VEAAAASFGGKSNPNLLGVVINKFNAPVDESGRTRPDLTEIFDSFQHSHN
NIKEIYKLFENSPIKVLACIPWSADLIATRAIDLVKHLGASILNEGDMNR
RIRSITFCARTLPNMIEHFKAGSLLVVSADRPEILTAAALAATTGIELGG
ILLTGGYKIDCEIKKLCNPTFENTKLPVFRIEGNTWQTALSLQSFNLEVP
VDDKERIENIKQYTSGQFDADFIHSLASASVRARRLSPPAFRYQLTELAR
AAKKRIVLPEGDEPRTIKAAVLCAERGIAECVLLAKPEDVKRVADSQGVK
LGNGITVIDPASVRENYVARLVELRKAKGMTEMAAREQLEDTVVLGTMML
EAGEVDGLVSGAVHTTANTIRPPMQIIKTAPGSSIISSIFFMLLPDQVLV
YGDCAVNPDPTAEQLAEIAIQSAESAKSFGIDPRVAMISYSTGTSGSGAD
VEKVKEATRIAQEKRPDLLIDGPLQYDAAVMEDVARSKAPNSKVAGKATV
FVFPDLNTGNTTYKAVQRSADLVSIGPMLQGMRKPVNDLSRGALVDDIVY
TIALTAIQATQC
>MS0556 pth, Pth protein
MSEIKLIVGLGNPGDKYADTRHNAGEWLVERLARRFNFNLKDEAKFFGKT
ARAVIGGEEVRFLIPTTFMNLSGKAVGALATFYRIKPEEILVIHDELDLP
PGVAKIKQGGGHGGHNGLKDTIAQLANNKNFYRLRIGIGHPGDKNLVSAY
VLSKPSPIDRSAIDKALDEAASCMEILLKDGITKATNRLNGFKA
>MS1509 ptsA, PtsA protein
MISGIPASPGIVFGKALVLKEEKIVLDMQKIAEDQVETEVARFYEGRTAA
VEQLSAIRDRAEKTLGEEKAAIFEGHLMILEDEELEEEIIDYLRSNKVNA
GVAASKIIDQQVAMLADIDDEYLKERAGDIRDIGNRLIKNILGMKIVDLG
EINEESILVAYDLTPSETAQLNLDKVLGFITDIGGRTSHTSIMARSLELP
AIVGTNNATAMINSGDYLVLDAINNAVYVNPAQDVIDGLKAQQAKLAEEK
AELAKLKDLPAVTLDGHRVEVVANIGTIRDCEGADRNGAEGVGLYRTEFL
FMDRDQLPSEEEQFIAYKEVVEAMNGRQVVLRTMDIGGDKELPYMNLPKE
MNPFLGWRAVRIALDRREILNAQLRAVLRASAFGKLAVMFPMIISVEEIR
ELKSVIETLKQELRTEGKAFDENIQIGVMCETPSAAVNAKFLAKEVDFFS
IGTNDLTQYTLAVDRGNEMISHLYNPMSPSVLSLIKQVIDASHTEGKWTG
MCGELAGDEKATILLLGMGLDEFSMSAISVPRIKKLVRSVNFAEAKALAD
KALQLPTAAEIEKLVADFLAEKTLN
>MS0784 ptsG, PtsG protein
MLVLARIGENFCLIYKRGVAMNYPKIAQQVIEKLGGKENIANLAHCATRL
RLTMNDESKIDKQAIEDIEGVKGQFSTSGQYQIIFGSGTVNKVYAEMNTI
MNGSPSADSTGESQQAKGPQQGLIQRLIKGLADIFVPIIPAIVAGGLLMG
INNVFTAKDLFEEGRTLLDLYPQYKDLADLINTFANAPFVFLPVLLGFSA
TRKFGGNPFLGATLGMLLVHPALTNAYGYAEALAGGNLQLWNIFGLEIEK
VGYQGTVIPVLIAAWVLATLEKFLVKVVPSVLNNLVTPLFSLFITGFLAF
TVIGPFGREAGEFLSQGLTWLYDTLGFIGGGVFGALYAPIVITGMHQTFI
AIETQLLASTAATFIFPIAAMSNIAQGAACLAVAVLNKDAKTRGLALPSG
ISALLGITEPAMFGVNLRFRYPFYAAMLGAGSAAAFIAFFNVKATALGAA
GLIGIASIRAGDWGMYSVGMVISFCVAFAAALVLGARANAKE
>MS1237 ptsG, PtsG protein
MAKINKVDPKNVDKLIVAVGGRENIATVTHCITRLRFVLNDESKVDAKTI
EELPMVKANFATGGQYQVVIGQEVGDYYQVLLEKTGLASVDKEQVKAAAR
KNQKWYESLISHMADIFIPLLPALISGGLILGFRNVIGDIKMFEEGTKTL
VDISAFWASMHSFLWLIGEAIFFFLPVGICWSIARKMGTSPILGITLGVT
LVSSQLMNSYALGSQIPEVWDFGLFSIEKVGYQAQVIPAIMAGLTLSYIE
RFLNKIVPDFLNLIIVPVTSLILVVFLAHSIIGPIGREIGNGVAFVVKAA
MTGEFAPIGAALFGFLYAPLVVTGVHQTSLAIDMQMIQSIGGTPVWPLIA
LSNIAQASAVVAVLIMAKKASVREVAVPAALSAYLGVTEPAMYGINLKYR
FPMLCAMTGSACAALVCGFAGVLASSIGVGGLPGILSIQHQFWGTFAIAM
LVAIIVPILLTMAIYKRKEAAGTLE
>MS1717 ptsN, PtsN protein
MVKFTEILSPENIRQGIICSSKKRLLEVISDIVTKRFNLQEEEIGYHIEQ
LECFETLLSREKLGCTSLGNGIAMPRAKLPIGDKPVAVFLQLASPVNYEA
PDKRDVDLVLAILIPEKCCAAYSPYLPELAERFSDKMLCKQLRAAQSADE
IWQIFQYMDNCLHEHTDDTATEEK
>MS2180 ptsN, PtsN protein
MFNLPENNIHLSAQAGNKEQAIELAAKALEQAGYVESGYLQGMLGRELQT
STFLGNGIAIPHGTLETRNMVKNTGVQIFQFPQGIEWGDGNIAYVVIGIA
ARSDEHLALLRQLTHVLGDEDTAAKLATLQDAKKFRAILMGEDDEFAVKT
ENISLDVDTQSLLTLVAINAGKLEQQSAVENSFVSDVIASPALPLGNGLW
VTDSPLGNLKNALAFSRAKNAFSVNGKNVQGVVTVSAKDDAVNETLARLL
SEQVQQTLLAGNAEKIIAALNGIQAEQAVTAEQVTTQAVPAAGTVIGTFT
LRNENGLHARPCANLVNLVKKFDAKITVENITRGTAAVSAKSLMKVVALG
VTQGHRLRFVAEGAQAQQAIEAIAKEIAAGLGEPVSAVPPAEPDTIEVAN
PATPEVEQPKSDSIEAVFVINNENGLHARPAATLVNEVKKYNASVAVRNL
NRDGGLVSAKSMMKIVALGATKGSRLHFVATGEEAQQAIDGIGAAIAAGL
GE
>MS0021 ptsN, PtsN protein
MLKQSLIDNNSIKLHQKAANWQEAIKIAIDLLVKSGAVEARYYDRIVECI
KEMGPYIILAPGLAMPHARPEDGVIRTAFSLVTFDTPIHFEGEDDPIRMM
VALAGSDSDKHMEGLMEITQILEDEDSETGVNIQKFLDCNTEAEVFAVID
AALSE
>MS0149 ptsN, PtsN protein
MITSKQKRSFMLKQFLPLSHIQYVDSVENWQQAVQLSAAPLLAEQLIEPR
YVERIFQIHREIGPYYVIAPQIAMPHSRPEDGSNAQALSLVVLKQGVNFG
SDNDPVQLVLMLAAKDSESHLEMLSAVAELFSDEEAVQQIIQSTTVSEIA
EIVHRY
>MS1621 purA, PurA protein
MGKSVVVLGAQWGDEGKGKIVDLLTDRAKYVVRYQGGHNAGHTLIINGEK
TVLRLIPSGILRANVTCLIGNGVVLSPSALMQEMGELESRGINVRERLLI
SEACPLILPYHVAMDHAREAALGKKAIGTTGRGIGPAYEDKVARRGLRVG
DLFDKEQFAEKLKNILDYYNFQLVNYYKVEAVDYQKTLDDVMAVADIITG
MVADIGAILNTARKNGDNILFEGAQGAMLDIDHGTYPYVTSSNTTAGGVA
TGSGLGPRNIDYVLGIIKAYCTRVGGGPFTTELFDEVGQEIARKGNEFGA
VTGRPRRCGWFDAVAIRRAIQVNSITGFCMTKLDVLDGFDEVKICVGYKL
PNGEVVDYAPLAAKDWEGVEPVYETMPGWKENTFRVTDVDQLPVNCLNYI
KRIEEVTGVPVAILSTGPDRVETMILQDPFTA
>MS0297 purB, PurB protein
MQLSALTALSPIDGRYQDKTTALRGIFSEFGLLKFRVTVEVRWLQKLAAT
AQINEVSSLSQEANDYLNQIVTNFAIEDAERIKEIERTTNHDVKAVEYFL
KEKSAALPELAAVSEFIHFACTSEDINNLSHALMLKTAREEVILPEWQKL
IDEITRLANEYKEIPLLSRTHGQPASPSTVGKEMANVAYRLRRQYKQLEQ
IEVLGKINGAVGNYNAHLSAYPEINWHQFSEEFVTSLGVNWNPYTTQIEP
HDYIAEFFDCVARFNTVIIDFDRDLWGYIALNHFKQRTIAGEIGSSTMPH
KVNPIDFENSEGNLGLANAVMSHLAQKLPVSRWQRDLTDSTVLRNLGVGL
GYCLIAYAATRKGISKLEVNEQHLRDELNQNWEVLAEPIQTVMRRYGIEK
PYEKLKELTRGKRVDEQAMHDFIEKLDIPAEEKARLQQLTPATYIGAAVQ
LVEKL
>MS1481 purC, PurC protein
MAELSLKKIYSGKVRDLYEIDDKRMLMVASDRLSAFDVILEDPIPRKGEI
LTQISNFWFKKLAHIMPNHFTGDTVYDVLPKEEADLVKNRAVVVKRLKPI
KIESIVRGYLTGSGLKDYKQTGTICGLQLPQGLVEASKLPEPIFTPSSKE
EVGDHDINISYAECERQIGKELAAQVRDAAIALYKEAAAYALTKGIIICD
TKFEFGLDENGTLTLMDEVLTPDSSRFWSVDTYREGTNPPSFDKQFVRDW
LEQSGWNKQPPAPKVPADVIQKTVDKYQEALDLLTK
>MS1296 purD, PurD protein
MNILIIGNGGREHALAWKAAQSPLASKVFVAPGNAGTARESAVENVDISA
TDVPALVKFAQDNNVGLTIVGPEAPLVVGVVDAFEQAGLTIFGPCQSAAQ
LEGSKAFTKDFLARHNIPTAEYQNFTEVEPALAYLREKGAPIVIKADGLA
AGKGVIVAMTLAEAEAAVKDMLSGNAFGEAGSRVVIEEFLDGEEASFIVM
VDGKNVEPMATSQDHKRVGEGDKGLNTGGMGAYSPAPVVTQEIHQRVMEQ
IIYPTVRGMAAENNVYKGFLYAGLMIDKNGQPKVIEFNCRFGDPETQPIM
MRMQSDLVELCLKACKGELDQIKSEWDPQAALGIVLAAEGYPGDYRKGDE
ISGIPVQASQDEKVFLAGVAEKEGKLVTNGGRVLCVTALGNSVLSAQQKA
LKLAEQVNWTGRFYRRDIGYRAVAREQNG
>MS1033 purE, PurE protein
MSNTHAQIAIVMGSKSDWSTMQEATGMLDQLNVPYHVEIVSAHRTPDKLF
SFAENAQAKGYKVIIAGAGGAAHLPGMIAAKTLVPVLGVPVKSSMLSGVD
SLYSIVQMPKGIPVGTLAIGPAGAANAGLLAAQILAAWDSELSARLQKFR
EQQTNAVLNNPDPRN
>MS1003 purF, PurF protein
MCGIVGIVSQSPVNQSIYDALTVLQHRGQDAAGIVTIDDENRFRLRKANG
LVSDVFQQVHMTRLQGNAGIGHVRYPTAGSSSVSEAQPFYVNSPYGLSLV
HNGNLTNSDELKSKLFKLARRHVNTNSDSEALLNILAYYLDHMQTEHLSP
EDIFYAIKKTHKDIRGAYACIAMIIGHGMVAFRDPHGIRPLILGKREESG
KTEYMFASESVALDTAGFDVVRDIEPGEAVYITFDGKLYAEQCAENPVLT
PCIFEYVYFARPDSTIDGVSVYAARVHMGERLGQKIANEWADADIDVVIP
VPETSNDIALRIATILGKPYRQGFVKNRYVGRTFIMPGQKQRISAVRRKL
NTISSEFKDKNVLLVDDSIVRGTTSEQIVDMARAAGAKKIYFASAAPEIR
YPNVYGIDMPTKHELIAYGREPEEIAKLIGVDKLIFQDLSALTQSVQQEN
PNIKEFDTSVFTGHYVTGDISTEYLDNIAQQRNDAAKRKRAKDATNLEIH
NEG
>MS1297 purH, PurH protein
MQIRPIRQALLSVSDKTGIVEFAQGLVQRGVKLLSTGGTAKLLADSGLPV
TEVSDYTGFPEMMDGRVKTLHPKVHGGILGRRGTDDEVMQKHGIEGIDMV
VVNLYPFAQTVAKPNCTLEDAVENIDIGGPTMVRSAAKNHKDVAIVVNNA
DFHMILAEMDQNQNSLTLETRFDLAVRAFEHTAQYDSMIANYFGQLVKPY
FAAEEEDKDAKCGQFPRTLNLNFIRKQTMRYGENSHQNAAFYVEKEVKEA
SVSTAKQLQGKALSYNNIADTDAALECVKSFDEPACVIVKHANPCGVALG
KDVLEAYNRAYQTDPTSAFGGIIAFNRELDEATATAIVDRQFVEVIIAPT
VSAAAVEVVKRKKNVRLLACGELSKPQARLDVKRVNGGLLVQDADLGSVS
IDDLEVVSKRKPTKQELEDMLFCWKVAKFVKSNAIVYAKNNQTIGIGAGQ
MSRVYSAKIAGIKAQDEGLEVKGCVMASDAFFPFRDGIDAAAEVGIECVI
HPGGSMRDQEVIDAADEHNMVMVLTKMRHFRH
>MS1032 purK, PurK protein
MQKSALYPTVYVLGNGQLGRMLRYAGAPLDINVQPLAFNASVFDLPKDSI
ITAEIERWEETPLTTMLGNHHKFVNKNVFVKTADRLTQKSLLDELALPTS
PWCLVENHQQWADIFTNVGEKVVVKRRMGGYDGRGQWIITEENKTLITDE
LLNEVIAEKFIPFDYEISLVGARFRNGDTRFYPVTHNLQQDGILRYSVTD
ETFPQQARQQVQAEAMLSKIMAKLDYVGVMAMECFVVGDKLLINELAPRV
HNSGHWTQLGCSVSQFELHLRALLDLPTPKLTTFAPSVMVNLIGTDHNKL
WLDTPFSQLHWYGKEVRTGRKVGHINISHPDKNVIITQLEKLAGELPNDY
QSGLNWAINKLK
>MS1806 purL, PurL protein
MFQIFRGSPALSEFRLNQLSARFQKADLPVKSCYAEYLHFADLSAGLSAE
ETDELEQLLHYGPTLAQHESKGECFVVIPRVGTISSWSSKATDIAHNCGL
DKVVRLERGIAYYFEFERTLSAEQQQRLVSHIHDRMMETVVRAPEQAAVL
FDSQDPKPFTTVDILNGGRKALEIANVELGLALASDEMDYLVENFTALGR
NPNDIELYMFAQANSEHCRHKIFNADWVIDGEKQEKSLFKMIKNTFEKTP
DHVLSAYKDNAAVMEGSKVGRFFADQDGQYRYHNEDAHILMKVETHNHPT
AISPFPGAATGSGGEIRDEGATGRGAKPKAGLVGFSVSNLVIPGFEQPWE
NELSKPSRISSALDIMIEGPLGGAAFNNEFGRPALLGYFRTYEEKVNSFA
GEEVRGYHKPIMLAGGIGNIRAEHVQKGEIPVGAKLIVLGGPAMNIGLGG
GAASSMTSGKSKEDLDFASVQRDNPEMERRCQEVIDRCWQMGEGNPIAFI
HDVGAGGLSNAMPELVHDGGRGGKFELRNILCDERGMSPLEIWCNESQER
YVLAVAPENLAVFEELCQRERAPYAIIGEATEEEHLTLHDNHFDNNPIDL
PMSLLLGKTPKMTRDVKSTQVNNSPVDQTNIELKEAFHRVLRLPVVAEKT
FLITIGDRTVTGMVARDQMVGPWQIPVSDVAVTTAALDTYHGEAMSIGER
APVALLDFAASARLAVAESITNIAATNIGDIKRIKLSANWMSAAGHEGED
AGLYEAVKAVGEELCPALGLTVPVGKDSMSMKTTWSENGEQKTVTAPLSL
VISAFARVEDVRKTVTPQLRTDKGETALLLIDLGEGKNRLGATALAQVYK
QLGDKPADVVNVELLKGFYNAMQTLVQQGKLLAYHDRSDGGLIVTLAEMA
FAGNCGIRAEISALGDNDLGILFSEELGAVIQVRESDLAAVREVLTQHGL
IHLTKDLGLVTEYDEFEIKRGTKVVLSEKRSELRGIWAELTHQMQRLRDN
PECADQEFAAKKDPANQGFSAHLTYDINEDVAAPYIATGKKPRIAVLREQ
GVNSHVEMGAAFDRAGFEAIDVHMSDLHTARQNLKDFNALVACGGFSYGD
VLGAGGGWAKSVLFNTALRDQFQAFFEREDTLALGVCNGCQMISTLADII
PGTENWPRFVRNTSERFEARAALVRINESNSVWFQGMAGSHMPIAVSHGE
GRVEFKNDSQLQGLRDQGLIIAQYVDNNIRPTEVYPANPNGSVDGITALS
NTNGRVAIMMPHPERVFRTVSNSWHPEDWSEDGAWMRLFRNARVFFK
>MS0626 purM, PurM protein
MSKQSLSYKDAGVDINAGNALVDRIKPHVKRTTRPEVIGGLGGFGALCAL
PTKYKEPVLVSGTDGVGTKLRLAIDLNKHDTIGIDLVAMCVNDLVVQGAE
PLFFLDYYATGKLDVDVATDVVAGIAEGCVQSGCALIGGETAEMPGMYHA
GDYDLGGFCVGVVERAKIIDGSKVKTGDALIALGSSGPHSNGYSLIRKVI
EVAGVNPATEQLAGRPLADQVLAPTKIYVKSVLELIEHVDVHAIAHLTGG
GFWENIPRVLPEDVKVVINENSWEWQPVFKWLQEQGNITRHEMYRTFNCG
VGMVIALPQADAEKALQVLKAAGENAWLIGQVEPLNAGEEQVIIR
>MS0627 purN, PurN protein
MKKIVVLISGQGTNLQAIMDACKAGKINAQVAAVISNKADAYGLIRAKNS
GIPTAVFERKNYADNSQMDRAISDYIDGIAADLIVLAGYMKILTAGFTRH
FAGKILNIHPSLLPKYPGLNTYQKAIEAGDSEHGTTVHFVNEKMDGGAVI
LQAKVPIFPDDRIEDVEERVKIQELQIYPLVVKWFVDGRLKEAGGKAYLD
GQLLAENGYAAE
>MS0148 purR, PurR protein
MSLANNSNKNRRSTGKVTLADVAKEVGVGTMTVSRALRTPKMVSENLRQK
IHEAVQKLGYVPNSAARELASVSSRNIVIVTSSLVSVENNLILNSLQKEL
QPLDLQIIILVANKKGWLRELINNSPLAVILLNLQCPSTEAQWIRNSGLI
CLEIGSKQANPLGINVCVDSKSAVQKVISFLVAKGYRDIGLLCAQQEQAI
FQQYLACWHSALHANHLNSHQILHCSEPVSFSAGAKLFNEAISTWGCIDA
FVFLSDELACGALFEAQRQHIGIPYDVAIIGLGDLEISQTTYPALTTLNI
PYAKLGETAGKKLAELLQTEKDPQTECIQLISTLRERESG
>MS1531 purR, PurR protein
MSVQKIAKLAGVSVATVSRVLNDSPSVKAVNKEKVLAAIKALNYQPNLLA
RQLRTSRTGMILAMVSNIANPFCAAVVKGIEREAEKNGYRILLCNTESDL
ERSRSCLQLLSGKMVDGVITMDAISELPELQNIIGDAPWVQCAEYDPDSS
VSSVSIDDISATEFVIDQLVKTGKKRIALINHDLSYQYAQHRELGYLDGL
KRHGLAYCEIIYADELDYLSGKEAVLSLLKNAQRPDAILAISDVLAAGVI
NGLNELNVAIPEDIAVVGFDGIDISQITTPSLSTIQQPCKEIGEMAFSLL
LQQIDSTSSVKRVHHLLPWTFIKRQSS
>MS0284 purR, PurR protein
MATMKDIARLANVSTSTVSHVINNDRFVSEKIREKVMAVVKELNYQPSGL
ARSFKTKETKTIGMLVTASDNPFFAEVVHAVERYCCQQNYNLILSNTEGS
PQHLQHNLQMLINKQVDGLLLMCSETHTQDNMPINLPIPAVIMDWWPSEL
TADKIFENSELGAYLATKHLIHHQHKRIAIVNGDLRKPIAQNRLIGYKKA
LTEANLPIDETLIFEGKFDFQTGFDALERLLKTDCPPSAIFACCDAIALG
IYQAAWRHNLIIPRHLSVIGYDDTILSQYIAPPLSTIHQPKTELGKLAVQ
TLLERIKNPQKTYRTFVLDPVLVERESVATRKES
>MS1317 purR, PurR protein
MKLEELAKLAGVSRTTASYVVNGKAKQYRVSDKTIEKVQALIKEYDFKPN
AMAAGLRAGKSNTIGLIIPDFENLSYAKIANQLEKSCRENGYQLLITCSN
DNVANELECAKHLFQRQVDALFVSTVLPADNHYYQQNNAIPIIGFDRHID
SEGVDNVLTDDKHDAYELAVSLFDKADYQRILFLGALPELPMSKAREEGF
KQALGKKQVQVDYLYASQFRKENAEQLVSEWIEKNGIVPDAIFSTSLTLL
QGLLMSFIKRNEAFPKDLVIATFGWHEMLELLENKIVCSVQDHSKVVQAL
LDLALHKMRIKKLKQPHPVIQRRLAYHNWQ
>MS1238 purR, PurR protein
MKYTINEIAKLCNVGKSTVSRVLNKDPKVRSETREKVQRVIDRLGFQPNR
SARAMRAGQEPVVGVIVSKLDSGSESQTLRAILQALQAEHITPLIVESRF
EAEQVRHHFQLFRERQVNAVILFGFFPLPLEIVREWQGSLVVIARTYPNI
SSVYYDDEQAITRLMTELYRQGHRRIAYLGIQDSDETTGKLRTQSYLQFC
RSHNIRPNSVSVELSAESAYLHCAELFTRPVDALVCATGRLALGAFKFSQ
QSGRVFPIAYVGYNELLQYMMPNALSLDFGYCQAGLKAVELLMRQLRGKS
STEHYLVSTHQP
>MS0644 purR, PurR protein
MITIRDVAKQAGVSVATVSRVLNNASSSEKARKAVQSAVEKLGYSPNANA
QALALPTTDTIGVVVTDVTDAFFAILVKAVDQVASSYNKTILIGIGYHNA
EKERNAIDTLLRKRCSCLVVHSKALSDEELANYLEQVPGMVIINRSIQGY
EHRCVSLDNQRGTFLATETLIRLGHKRIGYIGSNHHINDEEERRQGYIQA
LQHHRLPQIDDAIIQSSPDFEGGEEAMIKLLSYHSDLTAVVAYNDSMAAG
ALSVLNENNINVPRQFSIIGFDDMPISRYLIPKLTTIRYPIDLMANYAAR
LALSLVNEGIETPLHAQFNPTVVRRFSTENCNNP
>MS1063 purR, PurR protein
MATIKDVAKMAGVSTTTVSHVINKTRHVADETKQTVLDAIKALNYSPSAV
ARSLKVNTTKSIGMVVTTSETPYFAEIIHAVEEQCYRQGYSLFLCNTQND
PDKLKNHLEMLAKKRVDGVLVMCSEYKDDSRDLLKSFSYLPIVIMDWGPV
NPDTDLILDNSFEGGYLAGKHLVDNGHKKIGYLSAELTKVTAKQRYQGFI
KALSEANVEMKSEWLFEGSFEPEDGYECMNRLLALEDRPTAVFCCNDIMA
LGAISAITEKGYRVPDDFSVIGYDNVHSSRFFAPPLTTIHQSKARLGERA
LRLLFERIAHKDAKRETIEIHPELVIRKSVKKIA
>MS1242 purR, PurR protein
MTKHKRPTLQDIANHLGITKMTISRYLRNPASVAEETGKRIAKAIEEFGY
IPNRAPDILSNAKSRAIGVLVPSLTNQVFADVIKGIEEITDEAGYQTMLA
HYGYSEKKEEQRIESLLSYNVDGIILSENSHSERTKKMLQVANIPVIEIM
DTSEIGIQQVIGFDNIAAAQAMVETMIKRGYKKIVYFSARLDKRTQLKMQ
GYQQAMKKYQLSPRIIATKEHSSFTHGAELLHQALKQYPDIDGIFCTNDD
LAIGALFECQRLGIKVPKQIAIAGFHGHDVGQSITPQLATVITPRLQIGR
IAAQELLARLQNIPAQSSIINLGYQIHLGESI
>MS2375 purR, PurR protein
MKSGLKHHRIALLFNANKVYDREVIEGVGQYIQASQCLWNIFIEDDFVYR
KESLHNLDIDGIIADFDDPETVAMLEHTEIPVIAVGGSYQNPAFYPHYPY
VATDNYALVETAFLHLKQKGINQFAFYGLPNETPKHWSEERKNAFMQLMA
DYGHQTYIYLGEQAHSDNWLEVQSKLCDWISRLPPHTGIIAVTDARARHL
LQACEYLNIAVPDELCIIGIDNEELIQYLSRVSLSSVVQGTNQIGYQAAK
LLDQLLKGRPVSQTPILVPPLRVEQRRSTDYRSLHDPLVIQAMHYIRHYA
TQGIKTEQVLDHLRISRSNLEQHFKAEMNKTIHQVIHEEKLDRAKNMLKF
TDVPIQEISDICGYPSLQYFYAVFKKEYGQTPKEFRER
>MS0808 purR, PurR protein
MLMVSLKDVAKEAGVSLMTVSRALKSPDKLSPKTYKVVKEVIDRLGYVPN
LAAQHIRGVAANTIGVLSLGTATTPFSVEILLGIEQTVRQHGWNSFVINT
FENDSQAMEDAVEQMLSHRPSAIIIARNGLKNVSIPEKLRSFPLVLANCQ
TQDMAVAAYIPDDYQGQRVVVDRIVAKGYQRPLFLHIPKNYIATAKRRQA
FEDAWANHSGQKPVQFFMRRDGEDYFEGAQPLIDYLEKPDPLPFDVIICG
NDRIALVAYQLLLAKGYRIPEDVAVCAYDNMVGIAQLFIPPLTTVELPHY
QMGQEAALHLIEGRKDRDIHQLPCPLIEGESC
>MS0420 purT, PurT protein
MTMTTLGTALTPKATKVMMLGSGELGKEVVIELQRLGVEVIAVDRYKNAP
AQQVAHRSYTISMLDGEALKALVEKERPDYIVPEVEAIATATLVELEQKG
FTVVPTAKATQLTMNREGIRRLAAEELGLPTSNYQFVDNFTDFKSAVENI
GIPCVVKPIMSSSGHGQSIIKSFDQIQQAWDYAQQGGRAGAGRVIVEGFV
KFDYEITLLTVRHIGGTSFLAPIGHRQQNGDYRESWQPQAMSEIALQKAQ
QVAEKITSALGGRGIFGVEMFVCGDEVIFNEVSPRPHDTGMVTLISQELS
EFALHARAILGLPIPEINLISPAASKAIVVEGKSTQVQFGNLEQVLAEPN
TNIRLFGKTEVDGHRRMGVILSRDISVEKALEKAFRAYDKLEINL
>MS1323 purU, PurU protein
MIEKKILLTDCPDDKGLIAKITNICYKHQLNILHNNEFVDFETKHFFMRS
ELEGIFNEATLRADLEFSLPEGANFRLIDAQKRKRVVILVTKEAHCIGDI
LMKNYYGGLDVEIAAVVGNHETLKELVERFDIPFHCVSHEGLTRVEHDKL
LAEKIDEYAPDFIVLAKYMRVLNPDFVARYPNRVVNIHHSFLPAFIGAKP
YQQAYERGVKIIGATAHFINNELDQGPIIMQNVINIDHTYSADAMMKAGR
DVEKTVLSRALDLVLHDRVFVYKNKTIVL
>MS1551 putA, PutA protein
MTKDLNVFDKHYGLLINGEWTDGSEGKTLTAHNPANGAELATFIDATDAD
VDAAVTAAQEAFKTWKHTTAAERAAILNKIADVIDENTELFALQETLDNG
KPIRETRAADIPLAADHFRYFAAVIRSEEGSANQLDDEDLSLILREPIGV
VGQIIPWNFPFLMAAWKIAPALAAGCTVVIHPSSSTSLSLLSLAQKINHL
LPKGVFNVITGKGSKSGEYMLHHTGFNKLAFTGSTEIGRKIGVAAAEMLI
PSTLELGGKSANIFFDDMPFDKALEGAQKGILFNQGQVCCAGSRIFVQEN
IYDKFIAALKEEFKKVKVGLPWEDDTQMGAQVNSNQIKVISKYVDIAREE
GCEIIIGGEKATDPALAKGEFFQPTLILAPDNTKRVAQEEIFGPVAVVIK
FKEEADVIHMANDSEYGLGGAVWTHNINRALRVARALETGRVWVNCYNRL
PAGAPFGGYKTSGIGRETHKMMLDAYTQVKNIFISTREEREGMY
>MS2132 putA, PutA protein
MINIPSLIQAQRNFFAKGATKSLSFRKEQLLRLKALLEENTQAIIEALKT
DLNKPADQVMLAEISPLIHEIDYMLENLDRLAAPKDVESPETLSFFGMGE
YHSQIIYEPYGVTLNISPWNYPIQLSISPIIGAIAAGNTVVLKPSEFTAA
TSALLNRLVAQYFVPEFFVVIEGDVAVNQALLAEKFDYIFFTGSVPVGRI
VMAAASKHLTPVTLELGGKSPFIVDKSANLEQAAESLIFGKTFNSGQTCI
APDYLLVQQDVKAEFVAILKQKLQQKFDDNPFENYAKVVSERHYLRIKSF
LNDGKIVAGGLFNDETHQMLMTVLDGVTWESPVMQDEIFGSVLPMLTFNG
FDEAIERILAQPKPLALYCFTETEENATAVLSQVSFGGGAVNSCFLHFFN
HNLPFGGVGDSGMGSYHGDRSFYELSHEKAVVTRKIV
>MS1741 putP, PutP protein
MNVDYLVMAGYFALIIAISLLFKKMASNSTSDYFRGGGKMLWWMVGGTAF
MTQFSAWTFTGAAGKAFNDGLSVIAVFVGNMVAYACAYWYFARRFRQMRV
DTPTEAIGRRFGTSNEQFFTWVIIPLSVINAGVWLNGLSVFASAVFDADI
TMTIYVTGISVLIISLLSGAWGVVASDFVQMLVVAVISVACAVVGLVVIG
GPGEIIDRFPGGFVSGPDMNYPLILICTFLFFIVKQLQSINNMQDSYRFL
NAKDSKNASKAAIFALLLMLVGTIIWFIPPWVTAIIYPEAASLYPQLGKK
ASDAVYLVFAKNVMPAGTIGLLMAGLFAATMSSMDSALNRNSGVFVRSFY
APIIRKGKADDKELLRAGQIVCVINGILVILMAQFFNSLKHLSLFDLMMQ
VATLLQSPILVPLFLAIIIRKTPKWAPWATVLFGMFVSWSVVKVFTPEYV
ASWFGVEDLTKREISELKVIITIAAHLIFTAGFFCLTTLFYNEAKDTNNE
RRIAFFKDVDTECVAEEGQDEIDRLQRKKLSTLVMLMAAGLLLMILIPNP
LWGRALFACCSLAIFAVGYGLKRSAEV
>MS1786 putP, PutP protein
MNLGVIFPLVIYLAFIFGAAIYAYVKRQRGDFLTEYYVGNRSMTGFVLAM
TTASTYASASSFVGGPGAAYKYGLGWVLLAMIQVPAVWLALGALGKKFAM
LSRETNALTINDLLLYRYKNKYLVWIASIALLIAFFAAMTTQFIGGGRLL
ETTIGINYTQSLLIFALTVGLYTFIGGFRAVVLTDTIQGTVMILGTMILL
GAVIYAAGGTEAAITKLTEVDPQLVSPYGPNNMLDFQFMTSFWVLVCFGV
VGLPHTAVRCMAFKDSKALHSGMLIGTIILSIIMFGMHLSGALGRAIVPD
LTIPDQVIPTLMIKVLPPIVAGIFLAAPMSAIMSTIDAQLIQSSAIFVKD
IYLAAKPEKANNQKLISRFSSLITLTITVILVFLSLNPPDMIIWLNLFAF
GGLEAAFLWVIVLGIYWNKANAYGAIASMVVGLGSYIYLTVAKLKLLDFH
AIVPALVFGLIAFLIANKIGERKQIKA
>MS0777 putP, PutP protein
MFGLDPTLITFTIYILGMLAIGVLAYYYTNNISDYILGGRRLGSFVTAMS
AGASDMSGWLLMGLPGAVYVSGLIEGWIAIGLTIGAYLNWLFVAGRLRVH
TEFNNNALTLPEYFHSRFGTSHNLLKIISASIILVFFTIYCSSGVVAGAK
LFQNLFGIPYATALWYGALATIAYTFIGGFLAVSWTDTIQATLMLFALIL
TPVVIVVSLGGIDGFSASMQSAEIDMQKDFTDLFTGTSTLGLFSLAAWGL
GYFGQPHILARFMAAYSAKSLHKARRISITWMIICLIGAISIGFFGIAFF
HANPQIAEVVTKEPEQVFIELAKLLFNPWVAGILLSAILAAVMSTLSCQL
LLASSAITEDFYKGFIRPKAGEKELVWLGRIMVLIIAALAIWIAQDENNK
VLKLVEFAWAGFGSSFGPVVLLSLFWKRMTSSGAIAGMLTGAIVVFSWKS
VIPATSEWSGVYEMIPAFSLASLMIILVSLLSPAPNKEIVETFEKANLAY
KNAE
>MS1197 pykF, PykF protein
MIISIALLTISAYNTAQFFLHGCFLVFSSNMKKGYLNQTNKIFTEYLMSR
RLRRTKIVCTMGPATDKGNNLEKIIAAGANVVRMNFSHGTPEDHIGRAEK
VREIAHKLGKHVAILGDLQGPKIRVSTFKEGKIFLNIGDKFILDAEMPKG
EGNQEAVGLDYKTLPQDVVPGDILLLDDGRVQLKVLATEGAKVFTEVTVG
GPLSNNKGINKLGGGLSADALTEKDKADIITAARIGVDYLAVSFPRSSAD
LNYARQLAKDAGLDAKIVAKVERAETVETDEAMDDIINAADVIMVARGDL
GVEIGDPELVGVQKKLIRRSRQLNRVVITATQMMESMISNPMPTRAEVMD
VANAVLDGTDAVMLSAETAAGQYPAETVAAMAKVALGAEKMPSINVSKHR
MNVQFESIEESVAMSAMYAANHMRGVAAIITLTSSGRTARLMSRISSGLP
IFALSRNESTLNLCALYRGVTPVHFDKDSRTSEGAKAAVQLLKDEGFLVS
GDLVLLTQGDASSSSGTNLCRTLIVE
>MS0691 pykF, PykF protein
MNEKYVPNQFRQKLLKGETLIGCWCALGNPITAEVLGLAGFDWLLFDGEH
APNDVLSFIPQLMAVKDSASMPIVRVPKNEPVIIKRVLDIGFYNVLVPYV
ESKEEAEEAVSATRYPPEGIRGVSVSHRNNGYATIPDYFKVINDNIGVIV
QIESQKGVDNVDEIAAVNGVDCLFVGPGDLSAALGYLGQPNHPEVQKVIQ
HIFATAKKHGKPCGILAPVEADARRYLEWGATFVAVGSDLGVFRGATKAL
SEKFKG
>MS2103 pykF, PykF protein
MNVFDKAFLHNKFKAAVLEHKTQIGFGLVSGSAVNAEIVAGSGYDFIWID
GEHGPNTVTTIIDQARAIAPYGSHVIVRPLEADRALIKQLLDAGIQSIIA
PMVESGEQAEYIAQSMYYPSRGKRGFGAPAVRAGRWGRLPEYIKHAEDEL
FLAVQIESKKGVENLKDIVTTDGVDAVFLGPADLAVDMGYFGDFSGEEMQ
ATIEKLIKDIRALGKPVGTIAGSPEEAKRYIDWGASFVVVGVDTIFLAHM
ADSVLGACRSIVK
>MS1035 pyrD, PyrD protein
MLYPLIRKGIFALEPENAHDLAIKMLHLAGNPILNKLLKALLACPSGNEK
TVMGIKFKNPIGLAAGADKNGDAIDGFGAMGFGFIEVGTVTPLAQDGNAK
PRQFRIVEAEGIVNRNGFNNYGVDYLVENVKKAKFDGVIGINIGKNKVTP
VERGKDDYIFCLNKAYNYAGYITVNISSPNTPGLRQLQYGDALDDLLKSI
KERQAYLAQVYNKYVPIAVKIAPDQTEEELVQIADTLRRHKMDGVIATNT
TISRDTVAGMKNADQTGGLSGKPLQHKSTEIIRRLQQELKGEIPIIGSGG
IDGVQNAQEKIVAGAELLQVYSGLIYHGPGLVKALVEAIR
>MS0251 pyrE, PyrE protein
MQNYKQEFIKFALSRNVLRFGEFTLKSGRVSPYFFNAGLFNTGADLARLG
EFYASAIQASGLNYDVIFGPAYKGIPIGTTVSVALFNKFNLDKPVCFNRK
EAKDHGEGGNLIGSPLQGRILLVDDVITAGTAIREAMDIIAANNARLAAV
VIALNRKERGKGELSAIQEVERDYRCDVLSIIDLDDLMQFIENEPEYSQY
LPAMKAYREQYGVA
>MS1472 pyrF, PyrF protein
MKAKFSKEEVNMSNKIIVALDYETEKEALQLVDQIDPSLCRLKVGKEMFT
TLGTNFVKLLQDRDFDVFLDLKFHDIPNTVARAVRSAADLGVWMVDLHAS
GGLRMMEEAKKILEPYGKDAPILISVTVLTSMEDLDLLQIGINASPMEQV
IRLAHLSQRAGLDGVVCSPQEVEILRQHLGKEFKLITPGIRPVGSEFGDQ
RRVMTPPAAIEAGSDYLVIGRPITQAANPAEVLRSINASIANLIA
>MS0255 pyrG, PyrG protein
MIGYNHFISLLIPTLGFTMATNYIFVTGGVVSSLGKGIAAASLAAILEAR
GLNVTMMKLDPYINVDPGTMSPTQHGEVFVTQDGAETDLDLGHYERFIRT
KMTKRNNFTTGKIYSEVLRKERRGDYLGATVQVIPHITNEIKARVIEGAA
GHDVAIVEVGGTVGDIESLPFLEALRQLAVQVGREKTIFMHLTLVPYIPT
AGEVKTKPTQHSVKELLSIGIQPDVLICRSDRMVPPNERAKIALFCNVPE
KAVISLKDVDSIYRIPALLQSQGLDDLICQRFRLACKEADLSEWEQVLYR
QANPTGDVTIGMVGKYVELPDAYKSVNEALKHAGLTNRLNVHIKYIDSQD
VETKGIDVLKGVDGILVPGGFGYRGVEGKILTAQYARENNIPYLGICLGM
QVAFIEYARHVAGLTQANSSEFDKNCPQPVVGLITEWQDADGSVEQRSEN
SDLGGTMRLGAQQCHLIEGSKARELYGKETIEERHRHRYEVNNTLLPQIE
AAGLKVTGLSADKKLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGF
VKAAKENQKK
>MS1930 pyrH, PyrH protein
MRVKMNKPIYKRILLKLSGEALQGDEGFGIDPSILDRMALEIKELIAMDV
EVGVVIGGGNLFRGAKLAKAGMNRVVGDHMGMLATVMNGLAMRDALHRAD
VNAKLMSAFQLNGICDTYNWSEAIKMLREKRVVIFSAGTGSPFFTTDSAA
CLRGIEIEADVVLKATKVDGVYNCDPAKNPDAKLFNKLTYAEVIDKELQV
MDLAAFTLARDHGMPIRVFNMGKPGALREVVTGETEGTIIS
>MS0628 pyrR, PyrR protein
MEKIIIDESQFMRTISRISHEIIEKHQNLNDLVIVGIKRRGAEIADLIKR
KINELSGQSLPSIDLDITFYRDDLEYVEPASQSPVYSGASEFISVQNKTV
ILIDDVLFTGRTIRAALDALVDFGRAAKVELVIFVDRGHRELPIRADYVG
KNVPTSRSEKVQVRTMKFDGCYEVALISK
>MS1763 qRI7, QRI7 protein
MRILGIETSCDETGVAIYDEDKGLIANQLYTQIALHADYGGVVPELASRD
HIRKTAPLIEAALQEANLTAKDIDGIAYTCGPGLVGALLVGSTIARSLAY
AWNVPAVGVHHMEGHLLAPMLEDADNRPQFPFIALLVSGGHTQLVKVEGV
GKYEVMGESIDDAAGEAFDKTAKLLGLDYPGGAALSRLAEKGSAGRFVFP
KPMTDRPGLDFSFSGLKTFAANTINQAIKNEGELSEQTKADIAHAFQTAV
VETLAIKCKRALKETGYKRLVIAGGVSANKQLRQGLANLMDDLKGRVFYP
APQFCTDNGAMISYVGYLRLKHGERTDLAIEVKPRWPMIELEAI
>MS1558 queA, QueA protein
MHLSDFYFDLPDELIARYPKPERSSSRLLRLSGENGDISHHTFSDVYDLI
NEGDLLIFNNTRVIPARMYGRKASGGKIEVLIERVLTESRFLAHVRSSKA
PKAGTELILGEDKLGEGKGVKAVMIGRQDALFELEITEKSTALLDILQKI
GHMPLPPYIDRPDEDADQERYQTVYSKIPGAVAAPTAGLHFDEELLAKLK
AKGVNFAFVTLHVGAGTFQPVRVNNIEEHHMHAEYVEVPQQVVDAIVATK
AAGKRVIAVGTTSVRSVESAALAAQEKGFAQIIEPFFADTSIFIYPGKQF
RVVDCLITNFHLPESTLIMLVSAFAGYKNTMNAYKSAVENRYRFFSYGDA
MFITKNNHVKGLD
>MS0014 racX, RacX protein
MVSITGIIEIIHIINDKVKDFEMKTIGILGGMSPESTASYYLEINRAVNL
ALGGNASAKLLISSVDFEEIVQCQKAGDWQKAGKILAEQAKLLEQAGADG
ILLATNTMHKVARPIIDNISVPFLHILDAVADSIKAKGLNKVALLGTAFT
MSDNFYRDGLIERGITPVVPDEETQKEIHRIIFEELCIGKILPQSKTFYL
KTIEKLTALGAEGVILGCTEIGLLINQADSTLPFFDTALLHSEMAVDFVL
EK
>MS1941 radC, RadC protein
MTKRYLLEELQQNQEFNSTDTARIYLQTALEQREREIFLVLFLDNQHRLI
KQEEMFLGTINSAVIHPREIIKTALYCNAAAMILAHNHLRESPNRVNRIA
ILRKGSVRRQI
>MS1940 radC, RadC protein
MEFSTQSNPSIKNKGERLMLQIEKENLMPREKLLKFGANTLDNKELLAIF
LRTGIKNCPVMQLSEAVLTHFGSLRQLINADRHNFAPLRASASLNLSNYR
PVRK
>MS1939 radC, RadC protein
MDKLSDADALKGAKLCRSALINCRNEPKCVKTASESCITGQFFIPVRKNI
ANNSLLSKVLAPNFNSFSRGIRFSFSICSIRRSPLFLIEGLLCVENSIMP
TELDQFFETDRKNKIFSSLSFEVECQNLFVFILCFLNYGFWVPEE
>MS0578 rarD, RarD protein
MFMSISAKLKGWHYAFACYAIWGTFPIYWYPLNSSAMPADQILAQRIVWS
VVFAVFLLIIFKQSRAVLRAFTKPKILAIFFLSSFLIALNWLVYLWAITN
HHVLDASLGYFINPLFNVFLGRLVFKERLNKPQLLALCFATAGILWLAIP
AGQIPWVALLLAGSFGFYALIRKLAPMEALAGLALETLLLSPFALAYLFF
CYTQNTLVFSELNSLQLGVLLGSGAATTIPLLWFAMGARQISMSLLGMLQ
YISPTLQFLCGSLLFGEALSITRLIGYSLVWIGVAIFLLAMRKKMQNK
>MS1442 rbfA, RbfA protein
MSREFKRSDRVAQELQKEIAVILQREVKDPRIGMVTVSDVEVSRDLAYAK
IFVTFLFNNDDEAIKQGMKALEKAAPYIRSLVGKAMRLRIVPELRFEYDR
SLVEGMRMSNLVTNVVRSDKERHIDEENGED
>MS0032 rbn, Rbn protein
MRRVVMTQLKLLFAVFYCRFQQNKLTQAAGALTYSTMLAIVPLVMVVFAI
FSAFPMFNEAAAELKTFIFDNFSPSAGDTVGQYIDEFVVNSKSMSAVGII
SLIAVALLLINQIDRTINDIWNSKNRNFIFSMTIYWTLLTLGPIFIGMSF
AINTYIRSIIAFEGDLGLPFGLKLLSFVPFLLTWLSFSLIYTLVPNTKVN
FRYAAVGALVAAIFFTLGKKAFAWYMATFPSYQLIYGAMATLPITLLWIQ
LSWLFILLGAQLTAVLGDMRLIKSGDLNLTAIKEKTE
>MS0643 rbsB, RbsB protein
MKKTVLSAVALAVGLGAGISTAQADTKIGVTIYKYDDNFMALMRKEIDKE
ATNLKDVQLLMNDSQNAQSIQNDQVDVLISKGVKALAINLVDPSAAPTVI
SKAKPDDIPVVFFNKDPGEKALAKYEKAYYVGTDPKESGTIQGDLIAKQW
KANPALDLNKDGKIQYVLLKGEPGHPDAEARTKYVIEQLNANGIETEQLF
IDTGMWDAAMAKDKTDAWLSSSRANDIEVIISNNDGMAMGALEATKAHGK
KLPIFGVDALPEVLQLIKKGDIAGTVLNDGVNQGKAVVQLANNLAQGKDA
TEGTQYKAENRVVRIPYVGVDKDNLSEFLK
>MS1612 rbsB, RbsB protein
MASKLLKNIFKFSALISALPALAFAADKPQIALLMKTLSNEYFISMRQGA
EETAKEKNIDLIVQVAEKEDSTEQLVGLVENMIAKKVDAIIVTPNDSIAF
IPAFQKAEKAGIPIIDLDVRLDAKAAEAAGLKFNYVGVDNFNGGYLEAKN
LAEAIGKKGNVAILEGIPGVDNGEQRKGGALKAFAEYPDIKIVASQSANW
ETEQALTVTTNILTANPNINGIFAANDNMAIGAVTAVENAGLAGKVLVSG
YDGIPLAIEYVKQGKMQNTIDQLPKKQVAIAIEHALKQINKQEIPPVYYV
DPVVVDKEESKNY
>MS0639 rbsB, RbsB protein
MKKLVLNTVAISVLLGSGLAVAQTEPLIGVTIYQYDDNFMNLMLSEINKE
SANFKDVRFLMNDSQNSQAIQNNQIDILLAKKVKVLAVNLVDPPAAKTVI
AKAKKHNVPVIFFNKDPGAKLLASYNHAYYVGSSPKNSALEQAKLIAKHW
NANKQFDLNQDGKIQFAMLTGQPDSTAAEVRSKYVIEELHNLGIQTEALF
VDTAMWNGNMARDRMELWLNDTKGKQIEMVIANNDAMALGALESLSAQNK
QLPVFGIDALPETLTLIKTGKITATVLNDGAYQSKVLVELARNLALGKNA
AEGMPWKPENNSILSPDIAIDKDNVEQYRK
>MS0063 rbsB, RbsB protein
MKLSKRTFLKSLVAVSILAVTGLNSNPVYSSSAEPIKLGFLVKQPEEPWF
QTEWAFADKAAQALGNVQIIKIAIPDGEKTLNAIDNLAANGAKGFVICTP
DPKLGPAIMAKARAYDLKVIAVDDQFLNAAGEPMTNVPLIMMAASEIGQR
QGEELYKEMQNRKWDVKDTAVLAITADELDTARRRTEGSIEALIKAGFPK
EKIYKSPTKSNDIPGALDAANSMLVQHPEVKNWLIVGMNDNTVLGGVRAT
EGQGFKPENVVAIGINGVDAVNELSKPRATGFLGSLLPSPDIHGYRSVEL
LTKWIREGVEPEKYIAVQDVVFLKRDNFKEELSKKGL
>MS0201 rbsB, RbsB protein
MRKFLKTTLVSAVFTFMIGSAYAQLTPLNSDTEQDRINWTELESKLGSFP
TLKEGLKIGGVSKTLTNEYWRSLGEGYKNFADKHKVFVAYQAAANEGDQL
GQLSIAETMITEGYSALLFSPQTDVNLQPAAEVAQSKNIPVVNVNDAVMP
TATHYVGNVQKDNGVRVANWFIEHSAEGGKVAVVEGQPGVFAAKQRTEGF
TETINKSGKFEVVASVPANWSREQAYNVAMTILQRNPDLIGFYVNNDGMA
LGVVEAVKAAKLQGKVAVFGTDGISDAYKSILAGELTGTVDSFPVLTGEV
AMEVVLRLTAGQKLPRVVTTPQALITLENAKTYSEADAKTIREILSK
>MS0283 rbsK, RbsK protein
MDKIMKKLTVLGSINADHVISVPHFVKPGETLTGSNYHIAYGGKGANQAV
AAARLGADVDFIACIGDDDIGKAMKAAFERDGIHTHTISTIPHQTTGIAM
IQVAESGENSIVISAGANAHLTEELLAQHQESIAHADCLLMQLETPISAV
EKAAVFAKRNGTKVVLNPAPAQPLSDHLLAHIDMITPNETEAEILTGVQV
TDEQSAAEAAQVFHHKGIETVLITLGSKGVFFSSRGVQRIIPGFRVKAVD
TTAAGDTFNGALITALLEDKSMEEAIRFAHGAAAISVTRKGAQPSIPSRE
ETLAFLNEQH
>MS0565 rbsK, RbsK protein
MIMKKIAILGECMIELNGEPFGRMRQTYGGDSLNTATYLARVSRREQFEI
SYVSALGKDKLSLGMLAHWRNDGINTDCVLLDEKRQPGLYLIQLDEKGER
TFLYWRNQSAARYLLQHEGYTEVLARLATADMIYLSGISLAILPENDRTL
LIRQLGNLKKAGVKIAFDSNYRPALWDSFQQTQACYQALLPLVDLALVTF
DDEQSLWRDENVQQTISRLVQLGVGTVVVKSGEHGAVFYHNGETQQVATE
VVQRVVDTTSAGDAFNAGFLNGYLQQKSLVDCCRQGNKLAGIVIQHKGAI
IDKTATQHFIREFN
>MS0197 rbsK, RbsK protein
MPNKVVVVGSLHYDIVVESTHRPVKGETVIGKRWYPKFGGKGGNQAVAAA
KAGCRVFMVSAVGPDNFAPFLLEHLNKSGVNTDFVQKISGVGSGMSVAIM
DSEGDYGAVVVSGSNLEIDINRLDNETLWDNAKMLILQNEVSDSINFEAA
KRASRRHIPVCLNAAPAKKLSAEFTKLIDILIVNAVEAEAMCGLSVNSLD
SALQAALKLSQDFSRVIVTAGGDGVAYADKESNGKIASIKVKLISTLGAG
DCFVGHLCTALSENNTLRDAVAYANQKAAEHVSTVQE
>MS1233 rbsK, RbsK protein
MHMTNKIWVLGDAVVDLIPDGDNHYLRCAGGAPANVAVGVARLGVPSAFI
GRVGKDPLGEFMRDTLNQENVNTDYMLLDPKQRTSTVVVGLTDGERSFTF
MVNPSADQFLQISDLPQFQAGDWLHCCSIALINEPTRSATFTAMKNIRAA
GGKVSFDPNLRESLWKSQDEMIDVVMEAVSLADVLKFSEEELTLLTHTDS
LEKSFEKITALYPDKLIIVTLGKEGALYHLHGKKEVVAGKALKPVDTTGA
GDAFVSGLLAGLSQTENWQQPEQLVTIIRQANASGALATTAKGAMSALPN
QQQLAEFLAN
>MS0545 rbsK, RbsK protein
MKNLTLIGECMIELNGEPFGVMRQTYGGDTLNTATYAARVASPEKLNVGY
VSALGTDKLSQGMIERWQADGINTDLVLRDEKRSAGLYLIQLDKQGERTF
LYWRNQSAARYLLQHPDYNRVLSALKNTDMIYLSGISLAILPENDRTLLI
EQLGELKKSGLEIAFDSNFRPALWDSREQAQNCYKALLPLVDVALVTFDD
EAMLWADNDEQATITRLSSFNIPKIIVKQGRLGATVCEKGKQTFVPTIPV
EHVVDTTSAGDSFNAGFLVGYLQGKPLNECCKQGNQLAGIVIQHQGAIIE
KSATEHLRNAFA
>MS1800 rdgC, RdgC protein
MYWFKNAMIYKLTKELDWSEDKLQQNLAQCAYHPCGQSDMSKFGWTTPLR
GAELFCFSVGKQILLVAHKEEKIIPAHVIKRELDNRINELEEKENRKLKK
TEKQALKDDVVSVLLPRAFSKNQQTAIWIDTEKNLIYVDAASSKRAEDVL
ALLRKSLGSLPVVPLAFANEPSMVMTDWIIKNDMPQWLVPLEEAELKAAD
DRGIIRCKNQALDSEEMISHLQAGKFVTKLALEWEEHLTFVLNDDGTLKR
LKFADMIREKNDDILKEDFAQRFDADFILMTGELAKLTENLIEHFGGEKN
RL
>MS2243 recA, RecA protein
MATNDEKSKALAAALGQIEKQFGKGAIMKLGDTQALDVESISTGSIGLDV
ALGIGGLPMGRVVEIFGPESSGKTTLTLSVIAQAQKAGKVCAFIDAEHAL
DPIYAAKLGVDVKELLVSQPDNGEQALEICDALVRSGAVDVIIVDSVAAL
TPKAEIEGDMGDSHVGLQARLMSQALRKLTGQIKNANCLVVFINQIRMKI
GVMFGNPETTTGGNALKFYSSVRLDIRRVGAVKDGDEIIGNETRVKVVKN
KLAPPFRQVDFQILYGEGISKNGELIELGVKHKLVDKSGAWYAYNGDKIG
QGKANAMKWLAENPTVAAELENKIRAELLANPEQALLADIETNSEEKEDF
E
>MS1099 recB, RecB protein
MNSTLLIEASAGTGKTFTMASLYLRLLLQAGENCFFKPLEVEQILVVTFT
EAATQELRERIRHRIHLAKKQLTQYAENKNKQVFYGTENEILADLVDSLE
LPVAIQRLKIAEQNMDLAAIYTIHGFCRRMLVQYAFNSGIHFNLQLVKDE
TELLTRFSNELWREHFYNLSFSLTNFIHRNLKSPTDVLQKIRKFVTSENL
NVELNEPHLLQLEFNRFLSQYIEKNINEIKQLKTAWIESENEIQRLIEKA
KTQKLIKGASYKANHLPGRYEKIRQWAQDETDFSIPEPLSKYFSQSAVDS
YLTKNEPVNHAVFKQADSAVERAQSTELYVKVILYHYIQWMRDKLDRYKA
SHQEKSFDDLLRLLKEAVVSPEHGNELVKLIRYQYPFAMIDEFQDTDAQQ
YQIFSKIYIESAQAETGFIMIGDPKQAIYQFRGADIFTYLKAAQQAKYHF
TLGKNYRSEGNLIHAVNQLFNFSSAQPFLYENIEFSSVEPGKAQGRFILN
EQQEAPLGVYLGEEPSDEQLAETCANCISQWLQLALRERAGIQTAEKFLP
LEPKDIAVLVRNAKEAELIKNALQARQISSVYLSDKSNVFDCNEAKELLL
ILQACLNPFSERNIVNAIATAIFCLTGADIQHIKQHETDWEKWIDRFVGY
QRSWRQQGVLAMLHQLFLAEQIPQKLINMPNGERRVTDLFHLAELLQEAT
TLNESDAALLRWFERQIRGENTQDENIIRLESEQQLVKIVTIHKSKGLEY
NLVWLPFISAKAKVNPQHISTYYNAQAQAVQWDMDACHNDEVIKERLAEE
MRLLYVALTRAKYHLAMALPDNFTKNWNALLYALTRGEIGTQAKLTDEYQ
TKPLLDDFAQRISPANIHYYQTDEIQGGGYQQKDNHAQYVAQEFHGKIER
DWTISSFTSLTQMHEWNSQKGRHEAFSPIVTTESAVNFSLILDEAKDIDL
TFLPKINEDKNNFSDIVTGYRQGYSPFDFPHGINVGTALHRFFEKNEFNQ
PIIDEYVKNLCQTIQLSEEWKQPLIQWIEAILTTPLFNGEPLNLAQLDKK
DCIKEMQFYLKLEGRFKLHSFNRLLQKYHTIKREPYLFDEIQGMLRGFID
LVFRHESKYYVLDYKSNFLGKDMAFYARSQLTDVMKNHHYDLQYLLYTLA
IHRYLKQRVTDYDYDSHFGGVIYCFLRGMNGRNPDYGIYSAKPARELIEG
LDNLF
>MS0728 recC, RecC protein
MQLKINSLKITALLVIRKFIVFTVYHSNRLDVQKDILIELMQLLPPDDPF
QTEIILVQSPGMAQWLQLKIAEKKGIAANLKFPMPASFIWQQYINVLEDV
SQQTQFNKDAMTWRLMQLIPQFLSEPCFQALENYLKNSPYSEQQKLYQLA
RKVADLFDQYLVYRPNWIHAWENNQPESIEQAIGTYQKDDNPELITQIKR
DIKWQGILWRALIDEVQRGAGYKVRHRANLHQAFIDKLRSAKPENLPQRI
FIFGISALPQSYLETFEAMSRYCDIHLFFNNPSREYWGDIVDDRFLQKLQ
TRQRFDHYENNHTALLSSATLTNMQQENYEFSPDNEKLLVGNPLLAGWGK
LGRDFFYLLTDLMTRAEEHNREIIAFVDLDDKTLLSQVQGHILDLIPMAV
KKLNKPQEDNSLTIHACHSVMREVEVLHDYLLSLFELDKNLTPKDIVVMV
ADIDKYAPYIQAVFGQYQKDLQTNQFYQADKRYIPFSISDNKLTESDVLI
ASFLMLLNLKESQFSAEEVLAYLDIPAIRMRFQIELEDLETIREWVKNSG
IRFGLEKRTDNSLKNYNAWQSGLERMLLGYAMRAENGIWQDSLGFDDSHG
LQGKLAGLLAAFIERLYQWQQFLRNPHSYEEWGQALLELVDHFFLENEQS
LEAILYLKEIIQQLHEQLDEVNFTSKLEIDVIAEVMAEQLNDKNTSLKFL
VGKVSFCTLLPMRAIPFKAVCLLGMNDGEYPRQQTPNSFDLMQYHRQKGD
RFRRDDDRYLFLEALLAAENYFYVSYVGQSIIDNQQREPSVLVSQLLDYL
AENLANNDEEIEQIRTSLVQYHSMTIFSPDNFSAMHRSYAKEWLPLVNRN
QYPVPDFTQQISGEIDEVREIDILQLVQFVQHPVKFFFEKRLGVYFQQTD
EQIPETENFTLDNLDNFLIKDELIRFADDETDNYFERLKLEGILPYGHFG
DIYKRRLQNEAAELKNKISAYLSQEPAHQFVEITLDMGEQSVLLTGHLDH
LYQPFAQRVKWRVGEVKDKHIIENWLYYLLQLCTTDNVNPPLYYGKNGCI
GFKTLEKSTALSILKLYVKAYLQGLKQVQIVPTYKIDDYLKSCQPETEFD
TLSAFNNLRDLFKSSNNYTNEKEDIYWTRVFQQATELNSDKEKLMQIQQT
TRDWFGLMLNSVEKVKL
>MS1098 recD, RecD protein
MLEILAKLQQENVITAGDYHFAKMIAEKCEEGTDKSSRTKNNLTALLAAL
CNYSHQQGNTCLFLEEQIKSNLFGLAYRALEQDYLQQIDEKIGYLPVAQW
QQILKSHIAFTTEPKTKIAPFVFQFNALYFYRVWQDEYLVARYLKSAVKN
SKVLAEQPDTKIIHQLIGENTGLNQGQKIAIATALRQQFCLISGGPGTGK
TYTVARLLVALQQLHQGKLQIKLAAPTGKAAARLTESIENALQQMTLSAK
LKHCIPTEAMTIHRLLGGRSFKFNAQNPLPLDVLVIDEASMIDLALMSNL
LQALPSHARLILLGDKDQLASVEAGAILGELGQFLEQGYSASFIDYLNRV
TDSHLAFNSVQGDEIRDYLSHLTESRRFDEKSAIGHLAKAINSAEIDRSL
QLFSQLDDIEYVDFNRYFANGIQPESSAEYLAYCVNLVVERAVREYRDYL
LEIETRSAKSELTEQDIEKIFAGFKKVRFLSALRLGELGVEKLNLSIAEG
LRRQNLIQFKNSRDWYQGKPVMIIQNDANVGLFNGDIGLFIQGKVWFELG
ENHYRRISPSRIPSHETAFVMTVHKSQGSEFNHAFLVLPTENVPVLSREL
VYTAVTRAKQRFTLFATDNIWKSAVRKQVKRQSGLGRLLIENI
>MS0487 recF, RecF protein
MAIARLIVENFRNISAVDLEFDHGFNFLVGNNGSGKTSLLEALFYLGHGR
SFKSSVTTRVIRYDQPHFTLHGRIRELQHEWSVGLQKQRKDGNTIVKING
EDGNKISDLAHLLPMQIITPEGLTLLNGGPSYRRAFLDWGLFHHQPNFHS
AWSALHRLLKQRNAALNQTYDYNMLKPWDMELAKLAHQVSQWRADYAEAL
SPEIEQTCRLFLPELDIHVSFHQGWEKDTDYAQLLTENFERDKAIGYTVS
GPQKADFRFKSNGLPVEDVLSRGQLKLLMCALRLAQGEHLMAQKNRHCIF
LIDDFASELDETKRALLAQRLQNSNSQVFVTAISPEQLKQMQPEKHRTFQ
VVNGQIEQLL
>MS1735 recG, RecG protein
MTTQLLDAIPLTSLSGVGAAVSAKLSKIGINNLQDLLFHLPIRYEDRTRI
TPISDLRPEQYATIEGIVQTCEIQFGRRPILTVSLSDGTSKIMLRFFNFN
AGMRNGFQPGARVKAFGEVKRGRFMAEIHHPEYQIIRDKQPLQLEENLTP
IYSATEGLKQNSLRKLTDQALELLDKIQIAEILPDQFNPYPFSLKEAIRF
LHRPPPDVSVESLEKGTHPAQVRLIFEELLAHNLAMQKVRLGTQQFQALP
LHFQTDLKQRFLATLPFEPTNAQVRVTQDIERDLAKDYPMMRLVQGDVGS
GKTLVAALAALTAIDNGKQVALMAPTEILAEQHAENFRRWFEPFGIEVGW
LAGKVKGKARQSELERIKNAEVQMVVGTHALFQEEVAFSDLALVIIDEQH
RFGVHQRLLLREKGEKAGNYPHQLIMTATPIPRTLAMTVYADLDTSIIDE
LPPGRTPIKTIVVSEERRAEIVARVHNACTNENRQVYWVCTLIDESEVLE
AQAAEATAEDLHRALPHLRIGLVHGRMKPAEKQAIMASFKAAELDLLVAT
TVIEVGVDVPNASLMIIENAERLGLSQLHQLRGRVGRGSTASFCVLMYKP
PLGKISQKRLQVLRESQDGFVISEKDLEIRGPGEVLGTKQTGIAEFKVAN
LMRDRKMIPTVQHYARRLIVEYPDVADTLIKRWLNNREIYSNA
>MS1539 recJ, RecJ protein
MIYSGLIKCISVILDCIYPVNKLIQRRTIPHGSAVCADPLLDRLYRSRHI
KNSQQLDRTLHSMLAPNQLQGIDQAVQLLITAREKQQKVIIVGDFDADGA
TSTALTVSALRQLGFTDVDYLVPNRFEQGYGLSVAVAEMALAKGVELLIT
VDNGVSSLDGVAFLKGRGVRVLITDHHLPPEILPAADAIVNPNLADCHFP
SKALAGVGVAFYLMLALRAKLRESGEFNEKTQPNFTELLDLVALGTIADV
VPLDQNNRILAHQGLARIRAERCRYGIRALIEVANKDISQLSASDLGYSI
APRLNAAGRLDNMSVGVELLLADSMEQARALALELDGLNQTRKEIEQGMK
AEALEICRNLTALKTELPTGIALYQADWHQGVLGILASRIKDQFHRPVVA
FAQDQNGLLKGSARSIEGLHMRDALERINTLYPDMIVKFGGHAMAAGLTI
KEELFADFQRSFNQVVTDWLDKDMLQGIVWTDGDLPQTMMNMNTAELLKQ
AGPWGQAFPEPIFDGEFRILQQRLVGEKHLKMLVEPVNGGPLFDAIAFNI
DTRYYPDLSIRTAVLAYKLEINEFRGNRDVQLLVDYIQPRS
>MS0741 recN, RecN protein
MLTQLTINNFAIVRHLDIELSEGMSVITGETGAGKSIAIDALGLCLGQRT
EAAMLREGQERAEVCATFQLKADSPAARWLTDHELQDQDNPEECILRRLV
NQDGRSKAFINNTPVSASQLKEFGQYLVHINGQHASQLLLKNDFQLQALD
NFCAHNHLLEQMKTDYLNWKELQSQVKTFNQKCVENEAKKQLLQYQVNEL
NEFNLRPNEYQELEEEQRRLSNSEQLTQLSQSVLQILTENETVNVDSLLY
RTTQHLEDLAELDTRYVDAQALLQEALIQVQEAASEIQHLSANIEEDPQV
LREIEQRMNQAVQLARKHNVKPEELTQLHKQLKLELNQLVDFSESENELL
AQEQQAYEKMSASATKLHQSRRQGAEKLAKQVTKSVKQLAMENAEFFINL
TADYSKISVNGADNVIFNLQSNLGQSPQPLAKIASGGELSRIALAIQVLT
SDKTAIPTLIFDEIDVGISGATASVVGKLLRKLGHSCQVLCVTHLPQVAC
NGHHHFMVEKSTVEGKTETKMTALSSQQRIKALAKLLGGQHITDSVLANA
QEMLALVS
>MS0239 recO, RecO protein
MLHRKPYSETSLLVDLFTEESGRLTVLAKGARAKRSALKSVLQPFTPLLL
RWTGKSSLKILTKAEPAAIALPLQQTALFSGFYVNELITRVIEPETPNPQ
LFQDYLHCLTSLAVSQNFVEPALREFEFKLLNILGYGVDFLHCAGSGEPV
DENMTYRYREEKGFIASLIKDNLTFFGRELIAFERQDFSEKSVLQAAKRF
TRVALKPYLGNKPLKSRELFTQTILHLK
>MS2081 recQ, RecQ protein
MTAELSNRSEAIKPELIKSAVENPEISTALDVLHSVFGYQTFRKGQQEVI
QAALSGRDSLVVMATGNGKSLCYQIPALCFAGLTLVISPLISLMKDQVDQ
LLANGIAADFLNSTQSLEQQQQVQNKAISGELKLLYLSPEKVMTNSFFQF
ISLCNVSFIAIDEAHCISQWGHDFRPEYTQLGGLKGCFPHAPIMALTATA
DSTTRQDILQNLSLNEPHLYVGSFDRPNIRYTLVEKFKPMEQLCNFVAAQ
KGKSGIVYCNSRSKVERIAEALKKRGISAAAYHAGMESSQREAVQQAFQR
DNIQVVVATIAFGMGINKSNVRFVAHFDLSRSIEAYYQETGRAGRDDLPA
EAVLFYEPADYAWLHKILLEEPESPQRDIKRHKLEAIGEFAESQTCRRLV
LLNYFGENRQTPCNNCDICLDPPKKYDGLLDAQKILSTIYRTGQRFGTQY
VIGVMRGLQNQKIKENQHDELKVYGIGKDKSKEYWQSVIRQLIHLGFVQQ
IISDFGMGTRLQLTESTRPVLRGEVSLELATPRLSSITMVQAPQRNAVTN
YDKDLFARLRFLRKQIADKENIPPYIVFSDATLQEMSLYQPTSKVEMLQI
NGVGAIKWQRFGQPFMAIIKEHQALRKAGKNPLELQS
>MS1506 recR, RecR protein
MQTSPLLENLIESLRCLPGVGPKSAQRMAYHLLQRDRSGGMNLARALTEA
MSKIGHCEHCRTFTEEDICSICDNPRRQNSRLLCVVEMPADIQAIEQTGQ
FSGRYFVLMGHLSPLDGIGPREIGLDLLQRRLQQEQFNEVILATNPTVEG
DATANYIAELCNQQNIKVSRIAHGIPVGGELETVDGTTLTHSFLGRRTIG
>MS1262 rfaE, RfaE protein
MAQYSAQFNQAKVLVLGDVMLDRYWFGATNRISPEAPVPVVRVQENEERA
GGAANVAMNIASLNVPVQLLGLTGLDETGKALTTLLQTQKIDCDFVRLAT
HPTITKLRILSRHQQLLRLDFEEDFKNVQSTELLTKLENAVKNFGALILS
DYGKGTLNDVQQMIQIARKAKVPVLIDPKGTDFERYRGATLLTPNMSEFE
AVVGKCDTEEDIIEKGLKLIADIELSALLVTRSEKGMTLLRPGQEAYHLP
TEAKEVFDVTGAGDTVISVLATALADGRSFEEACFLSNVAAGIVVGKLGT
STVSTIELENAIHGRSSTGFGIMNEDELKVAVQLAKARGEKIVMTNGCFD
ILHPGHVSYLENARKLGDRLIVAVNTDDSVKRLKGETRPINDLPSRMAVL
AGLSSVDWLVAFDEDTPQRLIGEVLPDLLVKGGDYKPEDIAGSKEVWANG
GDVKVLNFENGCSTSNVIQKIRELKD
>MS0445 rfaF, RfaF protein
MNLKNILRNLRLSLGKLLIDKKIQGDVNVFPPQRILFLRQDGKIGDYIVS
SFVFRELKKANPNIHIGVICTQKDAYLFEQNPHIDQLYYVKKRDILDYIT
CGLRLAKLQYDVVIDPTISLKNRDLLLLRLINARNYIGYKKSNYQLFNIN
LEGEFHFSELYRLALEKIGIQVQDMSYDVPYNSQSAIEISQFLEINQLKN
YIAVNFFGGYRHKVMNKQNIEKYISYLTSQSDKPLVLLSYPEVIPMLKDA
AKSYTNIFIHDTTTIFHTIELIRHCALLISTDTSTVHIASGFNKPMIAIY
KEDPIAFIHWKPISQAKTHILYYKDNINELSPEAIKPEWLL
>MS2260 rfaF, RfaF protein
MNILVIGPSWVGDMMMSHSLYQTLKKQYPDCAIDVLAPNWCKPLLARMPE
VRRALTMPIGHGAFNLTERFRIGKSLRNQYDMAIVLPNSLKSAFIPLFAK
IAVRRGWKGESRYFFLNDLRSNKNDYPMMVQRYAALGYEKSAVPDAQNLP
IPTPYLTVEKSAVEKTKEKFSAQLALAENRPAIGFCPGAEFGPAKRYPHY
HYAKLAEMLIEKGYSVRLFGSAKDNEVCQQIRGGLPESLQAYCVDLSGQT
ELNQAVDLISDCVAMVTNDSGLMHIAAALKRPLVALYGPTSPQYTPPLSD
KAVIIRLIEGGLIKIRKGADSAEGYHQSLIDIQPEMVMEKLAQLLN
>MS2259 rfaF, RfaF protein
MVKMKVLIVKTSSMGDVLHTLPALTDAQKALPDVRFDWVVEENFAEIPHW
HCAVDKVIPVAIRRWRKQPFSAETKNQWKNYRTFLQTEQYDAVIDAQGLL
KSALLVTRPARGEKYGYSFSCAREGLAAFFYDHKFNIPYQQHAVERIRRL
FAQSLSYALPDEKGDYGIASHFRHAAKAIDNPYIMFIHATTRADKFWINA
EWTKLARLLAAKGYHIHLPWGNEREYEQAQQIAQNIPQVKILPKLSLSAL
AEEIAQSSAVVSVDTGLAHLTAALDIPNITLYGATDPKLIGTYGKNQHHL
TKETMRAISAEEVFERFFGVV
>MS1962 rfaF, RfaF protein
MALFNQAPKSLCVLRLSAIGDVCHALAAVQQIQKYWPETEISWIVGKTEA
QLLAGIPNAELIVYDKKSGWKGVLALWRQLKHRRFDALLNMQTAFRASVL
SFGIKARYKIGFGKQRAREGQWLFTNRKVRDPQNPHVLDGFMAFVEYLGV
PVEAPHWQLAVSEQDKEAVKPYIDPARKNLIISPCSSKAEKDWLIERYAQ
VANIAHQHNVNVILCGSSAKREVEILQKITALCDFQPVNLSGKTNLKQLV
ALISMADLVISPDSGPAHMATTQGTPVIGLYAYHNPLRTGPYNNLANVVS
VYEKNVRKEYGKPSDQLPWATKLTGKNLMSQIQVEDVVEQMKKLAVI
>MS0439 rfaF, RfaF protein
MQKLLVIRNDKLGDFMLIFPALALLKKSYPQLKISALVPAYTAPIAEICP
YIDEVIIDAKNKKNPAEFDRTLQLIRAQKFDAVISFFSTWYNAKLVWKAG
IKYRLAPATKLFQFLYNHRLTQRRSRSEKAEYEYNMDLARQFLRDHNIPI
IEPSAPYLQFAKSAVENQKILLSEQLNIKQDKKWLFVHSGSGGSANNLSL
QQYADLIQGILREFDCYIILTAGPNEEEKARQLANLVSRPNVVVYAKNNG
LVDFARSLACADLYISGSTGPLHLSGALNIPTIGFYPSRLSAIPRRWRPI
NAPDYHIAFCPPFKKEVEKNLTVISIAECLKDIIPFIRTHWH
>MS1494 rfaG, RfaG protein
MQDKKHILIVAPYVTFPDEMGMNRFIYLAKLLSAEFDVTLLTSKYCHFLK
EHREQTPVLDNVNVVLLDEPGYRKNVSIQRLISHHRFCRNFEDFLKNYRR
KIDLVYSAYPLIKTNYILGKYKQSKNFKLIIDVQDVWPEAISGPIAFFST
SIGKMLMKPITRYANKTYGYADALVAVSDTYLNRADVNHLPDELKSAVYI
GGDFLFTKSTDKKVTDKLTATYLGTMAGSYDLETVVRSAPLCSENVEIRF
IGTGPHEASLQALNHQLGGHVKFLGVHPYSEAMRMLADSDVALNAIKASS
EGSITNKLSDYICCALPIVSCQKHPEVEKLLAKGGGIQYTAGDYRELAET
LNKLAEDRTVLDKMSQVNLSLAKEKFLRERSYKEIEKLIKNII
>MS1497 rfaG, RfaG protein
MSQAKLRVLLVSDMGHIGGTEIATLIAATELNPLTESVTVFGKTGPLFDR
INKLGIRQINADCHTKNPLKLLNYVCQLVKTVNDNQIDVIHAQMARPLLF
IWLAKKFFKNKQVKIFWTSRGLDHETYQKVVPFFAKMGVRGLGNCKLEQQ
KLIRYGYPETQTSYIYNAYRLTPTVKPMKSLDKRQFVIGTLSALREGRSV
ELFLDLAKYCLTQYSERQFQFVIGGDGPHRATLEKISADLGIDQQVKFVG
NVSDVSTFMDGVDVFVSPLVVDGDSGAGLSNSIVEAMIMKVPVCAFRAAG
IEEIVINGSTGHLIEPRNIPAMAEAVCWTVDNKATTESYVNNAYNLIIRE
CDPKKYAQKLLKLYKEL
>MS1496 rfaG, RfaG protein
MHVLILPSWYPLHKDDLNGSFFREQAYALARSGIKVGVIAPQFRSLRLGK
NAVLGRYDQEFWQDGDINTYFQHGVFWFPKVPYLDLKRWVKAGLALFESY
VKEQGMPDILHVHSLLHAGPLALEIHRKYRIPYCVTEHSSVFGRGLVKDW
EWDHLKRSEASASKLLAVSKSLADLLKQKLNGKEWTVFPNLLDDLFVESP
VDSSVRKYQLCAVAFLYAKKGFDVLIKAFAKVVEEYPQLKLMIGGDGPER
AKLEALIKSLKLENNVSLLGALSRQEVCQLMKESLCFVLSSYIETFGVVV
IEALSQGTPVVSTLCGGPESILTEGDGLFVKTGDEKELAKGILEFLANQE
KFDNQQIRRRCIDTYSEKPFVNRLTAIYQDILDKT
>MS0447 rfaJ, RfaJ protein
MNIIFNCDENYAPYLSVVIKSILDNTTLSTQFYILDFNISEESKSCIKNL
IQNINKKNSFQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYLNE
LNKAIYLDIDIIVISDLSRLWHIDLADNLVGACLDPYIEYENQDYKRKIG
LQDSQPYINAGVLLLNLKALREFNLYQKAIDWNKDYPNIQFQDQDILNGV
LKGKVLFLDSRYNFTVNHRNRIKLAHKGKLLLSSLEKATKPICILHYVGS
HKPWLPTTTMVKSCLFDQIYNSIRNKPPHWNKKYQSVPLKFQLKRILREI
EDKLVYKII
>MS0243 rfaL, RfaL protein
MLKLNKGFLINGVVGLFFIICLSVRSGYSISPILLALVGLGYLIYDLIKK
RKWQISPDEKWLIYSYGLYFSLFVLSLLIHQGRLKELDSPVKIVFLLTLL
LLFSRFPIKFSTLIYAIPMGSMIAGITALIDRFYLHSQMAYAPRIMHIQG
GDIAMSLGMFSLVCCIYFFIKQQKKWMLFCLLATLSGMLGSILSTARGGW
IGVPFVLCFIFWAYRKYLSRTFFVSVISILVVAVGVAVSIPNTKIMKRIN
AAQHDITSYVDNKKGSDSTSVGARFEMWKSAFLMIKEKPVFGWGIEGVNQ
MRKEHQKQGIISKYASQFTHAHNQYLDDFSKRGVLGFIALLAVFLVPMRF
FRKNLSDRLEVKVVAIFGMVHVISVMFYCFSQGFFSHNSGNIFYFFPVIL
FYAVILNLTAKKSAESTKS
>MS1593 rfbB, RfbB protein
MKILVTGGAGFIGSALIRYIINQRQDEVINLDKLTYAANLDSLETVSLNP
RYSFERADICDRAALDRIFADHQPDAVMHLAAESHVDRSIDGAGIFIQTN
IVGTYTLLEAARHYWNRLDTERKKTFRFHHISTDEVYGDLADKNALFTEE
TPYSPSSPYSASKASADHLVRAWHRTYGLPTIVSNCSNNYGPFQFPEKLI
PLMILNALEGKPLPVYGNGLQIRDWLFVEDHVRALYKILTEGRVGETYNI
GGNNEKSNIEVVKTLCTLLEELVPNKPAGVMKYEDLICYVTDRPGHDLRY
AIDSSKINRELDWRAEESFESGMRKTLQWYLTNKSWWRRILNGSYHLERL
GLNN
>MS0657 rfbX, RfbX protein
MVKSTKRVFNFIMNKINTEHKKRLFSNFFSLTVLQIVNYALPLLTLPYLV
RVLDVETYGLVMFAQSFILFFNILVDFGFNLSATKEVSIHRDDKNKLIEI
YSSVMVIKFLLILSSFIILSIIIFSFERFSLNKGVYFLSFLWVIGQALFP
VWYFQGIEKMKYITIVNIIAKFLFTGCIFLFVKENADYLLIPLFNGLGIL
IAALVALWIVHVSLKQKVTWQPLSKLWIYFKESSTFFLSRASLTMYTSAN
AFVLGIFSNNTIVGYYSIADQLYKALQAFYTPLSQVLYPYIAKERNIVLF
KKIFNMAVFLNCMGIAILYFITVDVFALLFTQKIGIESINVFNIFLIASL
IVVPSILLGYPFLGALGFAKEANLSVIYASIIHILGLVILILFNKISLYS
VAYMVLVTELFVFMYRISKIRGRRLWRKQL
>MS1495 rfbX, RfbX protein
MVRLISTVFVRQILVGILQVITLIVIARGLGTGQMGQYTLAILLPTLFSQ
IITFGLQSINIYAIGRKMINENQALYANLIFLSGLSVLTSLILGVVVYYF
GQYFFNEVPVNLLYLALASLLPQTFFTVLPSLIQAVQNFKWFNIVCVAQP
LVIFVVSMVAILLSDNVSSILTAYVLSHWISFFILLGIILKLIKVETCSL
KRFFSDFIGYGLKSHLSNIITLLNYRSSLLILGYFTTPVIVGIYSVGMQL
AEKLWLPSQAVSTVLLPRLSNKLGEGGDEKEVAKLTLDSARLTFIVTLII
GIAFACLSSIVVRILFGVEYDKAVYVILLLLPGILAWTPSRILANDLAAR
GFAELNLKNSYWVFGINTALSLCLVPLWGLIGASVATSIAYSMDLVLRLI
AFNQVTQSRAFLHIIPRISDFGTVINFIKGLRNAR
>MS1670 rfe, Rfe protein
MLVWFAKFLEQYYSGFNVVSYLTFRSVLALLTALLLSLWIGPKMIRRLQI
FKFGQEVRNDGPESHFQKRGTPTMGGLMILATITVSTLLWGDLSNPYIWF
SLFVLLGYGAIGFVDDYRKIKYKNTDGLIARWKYFWLSLVSLIAIFGMYA
LGKDTDATRLVVPFFKEIMPQLGLFYVVLAYFVIVGTSNAVNLTDGLDGL
AIMPTVFVAGAFAIIAWATGNVEISKYLYIPYIKYTSELVIFCTAIVGAG
LGFLWFNTYPAQVFMGDVGSLALGGALGTIAVLVRQEFLLVIMGGVFVME
TVSVILQVGSYKLRKKRIFRMAPIHHHYELKGWPEPRVIIRFWIISLMLV
LFGLVTLKLR
>MS0911 rfe, Rfe protein
MLWLSLILTFIVSFLTILVIKPVALKVGLVDKPNYRKRHQGAIPLLGGIS
LFMGNLCFYLLQWDSTRLPYLYLFCVLVLLVIGLLDDRFDISAALRAGIQ
AVLAVLMIYVGHLSLEHLGQIIGPFQVTLGPIGFVITVFATIAVINAFNM
IDGIDGLLGGLSSVSFAAIGILMLMDNQFDLAVWCFALIIALIPYVMFNL
TIFGKERKIFMGDSGSTLIGFTMIWILLLSTQGKGHPMNPVTALWVIAIP
IIDMIAVIYRRLRKGKSPFKPDRLHVHHLMVRAGLTSRQALLVITFFAAL
CAGFGILGEVYYINEWIMFILFIVLFFLYLYSITKAWKVTRLIRRMKRRA
QRRKHS
>MS0535 rhaT, RhaT protein
MLQKYRGEIILFIVSLIAASGWFFSKFSMAEFPALGFIGLRFFLAAIFFF
PLAYPQLKRLDKPQLIKSALVGLCYAVYIMLWMLGLINSAHFGEGAFLVS
LSMLIAPLLSWLIFGHLPYKSFWLALPAAFTGLYLLSSGKGGLHFSFGSL
IFLISSLVAALYFVLNNQYARDIPVLSFTTIQLFIVGTCCGTLSILFEQW
PTSISMTAWGWFLCSLVIATNLRMLLQTYGQKYCHVATAAIIMILEPVWT
LFFSILILGERLTLHKAFGCLSILAAIMIYRLPAILRNQASANKE
>MS1825 rhaT, RhaT protein
MLMPHFTQSKGYGYFCLILATFFWGGNYMFGRILSHVIPPIILNYLRWLP
AAIILLLLFAKYLPQQRHIIRKNWQILTALALLGVLIFPVFLYQGLQTTT
ALNASIYLAVVPIVVMFLNRICFKDTIRFPVFIGALISFIGVLWLLSHGE
LSRLLTFNVNRGDLWAIGSAVSWSVYCSIIRLRPKEIGNSVMLTAQVGIA
MIIFTPVFLSQLNTENLQIISELTYGQWMIILYLIIGPSILSYGFWNYGM
TIVGGTKGAAFTNATPLFAAALGILVLGEQLHGYHLISSLLIVIGLTLCN
KK
>MS1753 rhaT, RhaT protein
MLFQIIATLIWASAFIAAKYTYEMMDPVLMVQCRFFIASIIMLPGFFAAY
KRVPKERLKIMWLLALINFPLMFLLQFIGLYFTSAASAVTMLGMIPLLTV
LIGFLFFKRRINKIDLLLSLVALAGIILTVVGGGEDNLINPWGCLLVLGS
AVSFCFCLYLSKDVMQEMAPKDYTNVLVILGSILCLPFTCVLVRDWSIVP
SVKGMISLFYLGIGCTWLAVVLWFKGVQKTPTYISSILTTLEPIFGVILA
ILILDERLSTVSAMGILLTLGAAAVSVLIPVLMKKSP
>MS1754 rhaT, RhaT protein
MFYLIAAVLIWASAFIAAKFSYTMFDPALTVMLRLILSALLVLPTFFRSY
RKIPKQYRLQLWGLGLLNFPVVLLLQFTGVHYTSVASAVTMLGTEPLVVT
LLGHIFFHKPARLLDWLLGIVALTGIVFVVYGSESGGEVTLLGCTLVLLG
SIAFSFSIHLAQSVMKAVEAKAYTDVIIMTGAISCVPFSLLLVQDWQIHL
NIEGISAILYLSVGCTWLAYRLWSKGLRVSSANTASILTTLEPVFGVLLA
ILLLGEHLTLTTLFGICLVISAAGISVLSSMLINYIKNKVTIL
>MS1595 rhaT, RhaT protein
MNQQPVLGFIFALITAMAWGSLPIALQHVLTVMGAESIVWYRFFVASLAL
FLLLAWKKKLPALSQFTSRYWKLSLIGVLGLAGNFFLFNSSLNYIEPAIT
QIFIHLSSFMMLICGVFVFKEKLGAHQKAGLLILILGLGLFFNDKFDMLF
GLNMYSTGILLSVSAAVVWVAYGMAQKLMSRQFTAQQILLIMYTGCVIVL
CPFAQFSQIQGLSGFALGCFIYCCLNTLIGYGAYAEALNRWDVSKVSVVV
TLVPLFTILFSRILHGLDPAHFAMPHLNTVSYIGAFVVVLGAIISAVGYK
LFKYKR
>MS1597 rhaT, RhaT protein
MKQQPLLGFLFGLIAACMWSSLPLFVQQVVKVMDIQTSVWYRFVLSAVGV
LLLLCFSGKFFTFKRISPKNTLLLLLAIAGLSVNFYLYNLALKYIPPTTS
QVLSPLSSFMMLFAGVLIFKEKMARHQKIGLAVLSLGLILFFNERLDDFL
QLNTYFKGVVMVIASSFVWVIYAIVQKVLLSHLSSQQILLMIYIGCTLVF
FPNADIKQIYQLDGFQLVCLVFSGVNTIIAYGCYAEALDRWEVSKVSAIL
TQIPIFTLLFFHLAVMIAPNYFVAVELNWISYLGAFCVVSGAMLSALGHK
LKMLKERD
>MS0885 rhaT, RhaT protein
MLRPSCREKIIMVNNYNLALIKVHFTAVLFGLTGVLGVIISADSDVIVLG
RVIIAFLALSVYFLIKREKLTALSTKDVANQSLSGALLTAHWVTFYVAVK
VGGVAVATLGFAGFPGFVALFERLFFQEKLKRRELILLIAVTIGLILVTP
QFEFGNQSTQGLLWGIFSGAIYGILAILNRKNINKLSGTQASWWQYLIGS
ILLFPFAAHKLPAVSVTDWFWIACLGLLCTSLAYTLFVSSLNIINARTAA
MIISLEPVYAILIAWIWLGEQPGLRMIIGGLIILLSVGVVNFRR
>MS2326 rhaT, RhaT protein
MGGSILLGIFWHFVGATSAACFYAPMKKVTNWSWETMWAIAGIFSWILLP
WGISYWLLPDFGSYYSFFGSDILLPVFLFGAMWGIGNIGYGLTMRYLGMS
MGIGIAIGITLIVGTLMTPIIQGRFGELLASTGGQMTLIGVVIAVVGVAV
VSYAGLLKEKAIGVTAEEFNLKKGLALAVMCGIFSAGMSFAMSAATPMHE
EAARLGVDPLYVALPSYVVIMGGGAIINLGFCIIRLITRPELSFKADMSV
VKGLLISNILFSALGGIMWYFQFFFYAWGHANIPANYGFMSWMLHMSFYV
LCGGIVGLLLHEWKDTGKKPTRVLCIGCLIIVLAANIVGLGMAN
>MS1837 rho, Rho protein
MVTLAHSKLLPTIKQTSQNFIKSNQKDSQQIIMHLTELKNTPVSELVALG
EGQMGLENLARLRKQDIVFAILKQHAKSGEDIFGGGILEILPDGFGFLRS
ADSSYLAGPDDIYVSPSQIRRFNLQTGDKIEGKIRPPKEGERYFALLKVD
QVNDDKPEVSRSKILFENLTPLHANSRLRMERGNGSTEDLTARILDLASP
IGKGQRGLIVAPPKAGKTMLLQNIAQSITHNYPECELIVLLIDERPEEVT
EMQRSVKGEVIASTFDEPASRHVQVAEMVIEKAKRSVEHKKDVVILLDSI
TRLARAYNTVTPASGKILSGGVDANALHRPKRFFGAARNVEEGGSLTIIA
TALVDTGSKMDEVIFEEFKGTGNMELHLSRKIAEKRVFPAIDFNRSGTRK
EDLLTTPDELQKMWILRKILNPMGEVEAMEFLIDKLMVAKTNEEFFEIMK
RS
>MS1681 rhtB, RhtB protein
MEFWHGFLIITGIHILAAMSPGPDFIYVSQQTLSRGRAAGIICALGVAFG
LGVHILYSVLGLAVVIASAAWILTTIKIIGGIYLIYLGYKGLKARAKNQV
QIIEKVEVQQENRLKTLWKGFLCNVLNPKAPVYFVSVFTVVLSPNMPVWQ
LAIYGVWMMFLQFVWFASVAFLLSIPKVNKQFQKAGHWIDRILGFVMVGL
GIKVISS
>MS0972 rhtB, RhtB protein
MLNLIIVHFFGLVTPGPDFFYVSRMAASNSRRNVICGIIGITLGVAFWAA
SAMLGLAILFTTMPVLHGVIMLLGGGYLAYLGLLMVRSRTNATFAPLSAE
ELNKTTTVKKEIMKGLFVNLSNAKAIIYFASVMSLILVHITQVWQMLLAF
AIILVETFIYFYLISVLFSRPFAKKFYSRYSRYIDNVAGIIFLIFGMILA
YTGVMEMMG
>MS1533 ribA, RibA protein
MSKIQRVAEANLPTEFGLFRIVGFEFPDTKKEHVALVMGDISNSEENPVL
ARIHSECLTGDALHSLKCDCGFQLSTALRQISEAGRGVLIYHREEGRGIG
LINKIRAYSLQDNGMDTIEANLALGFAADERNFKVCADIFELLGICKVRL
LTNNPAKIDTMKKAGINVVERIPLNVGENRYNTGYLDTKAKKMGHYIVHD
NEKHYLDCPYCQSEIPNKK
>MS0172 ribB, RibB protein
MNQSLLASFGSSEERVIAALDTFKQGNGVLVLDDENRENEGDLIFPAETI
TTEQMAKLIRYGSGIVCLCITDELCQKLELPPMVAANTSVNKTAFTVTIE
AAEGVSTGVSAADRVTTVKVAVADNAKPSDLHHPGHVFPLRAAENGVLAR
PGHTEAAVDLARLCGYKPAGVICEITNDDGTMARTPELVAFAQKFGYAVV
TIEDLIAYRTKYNK
>MS1313 ribC, RibC protein
MEKIMFTGIVQGTAQIQSIISKRDFRTHIIKMPQKLLADLEIGASVAHNG
VCLTVTDIKGDLVSFDLMTETLRITNLGDVKEGDFVNIERAMKMGSEIGG
HLLSGHVYCTAEVVEIIPSENNLQLWFKLPTNEVMKYILTKGFIAVDGIS
LTIGEVKDNEFCVNLIPETIHRTLIGQRKIGSKVNIEIDPQTQAIVDTVE
RYLASKV
>MS1374 ribD, RibD protein
MSEFTEQDRAFMQLAIELAEKGQFTTTPNPSVGCVLVKQGEIVGKGFHFK
AGEPHAEVMAMREAGKNAKGATAYVTLEPCSHFGRTPPCAKGLIEAGVVK
VIAAMEDPNPSVAGQGLKMLQQAGIETAVGLLQEQAERLNRGFLKRMRTG
LPYVQLKMAMTIDGRTATSTGESQWITGESARIDVQQERAKASAILSASG
TVLADNPSLNVRWEQFPEQLKADYKKETVRQPVRVIIDSKQQISSQLNLF
KIDSPVWLAGIQPRDLTDFPANCEIICLMPEKEQSLLQALMIELGKRQIN
SVWVEAGAKLAGALIEQNLVDELVLYIAPKLLGDEAKGLCHLPHLTKLAD
APLWRLQSMEKVGDDIKMIYMRK
>MS1752 ribF, RibF protein
MQLIRGLHNLRRDFAGCALTIGNFDGVHLGHQAILQHLREKANQLKLPMV
VMLFEPQPREYFVSADAKQQAPARLMRLRDKLHYLQQQGVDYVICVKFDR
TFAKQDPNLFIETYLVNRLHVKFLSIGDDFRFGANRRGDFSLLESAGKKY
GFSVEDNRTFSLDKLRISSTAIRHALAHDDLKKAEELLGRAYSIFGKVVH
GQKLGRTIGFPTANIRLQRQVNPLQGVYAVRIQCPCGRAFQGVANIGQRP
TVNGVEQRLEVHLFDFDENLYGQNIAVTFCHKIRNEMKFPSLNALKQQIA
RDVLVAQQYFRQNS
>MS0976 ribH, RibH protein
MKVLEGGLAAPNAKVAVVVARFNSFINDSLVEGAVDALKRIGQVKDENIT
LVRVPGAYELPLAVRRLADSKKYDAIVALGTVIRGGTAHFEYVAGGASNG
IGHVSLESNVPVAFGVLTTENIEQAIDRAGTKSGNKGAEAALVALEMVNL
LAQINA
>MS0300 rimI, RimI protein
MQFKIKPMLPEHYQQVYRLWTSIEGMDMSDADDNFEAISAFLAFNPDLNY
IAEINGKVVGVIMCGFDGRRATLYHAAVDPDYQKQGIGFALAEHLESALK
TKGISKGRLLAFKSNESATLFWQKAGWTLQQKLNYFSKKFI
>MS1590 rimI, RimI protein
MTEISPIQAEDFDRLFEIEQAAHLVPWSMGTLQNNQGERYLNLKSSVQNH
IAGFAICQTVLDEATLFNIAIDPVCQGQGIGKALLSELIKRLREKNVATL
WLEVRESNQTAKRLYDRLGFNEVDIRKNYYPTPDGGRENAIVMALYL
>MS0757 rimK, RimK protein
MKLLMLCREPNLYSCRRLKMTAENAGHKMDILDPNRFLLKIQENRPHFAL
YYQPNQGTPYLLPDYDAVIPRFGTQSTKMGCSVLTHLAAKNIPCLNNPAS
FALARDKWLSLQALAAANIAVPVTVFAGQDFQAGSAVEKVSSPTILKTLN
GSQGIGVILADRSQSAVSIMETLTLSHIPVLLQDFIGEAGASDIRCFVIG
DKVVAAMQRSGQKGEFRANCHRGGITQQITLSDDEKLIAVRAAQALGLDV
AGVDLIQSKKGLLVLEVNASPGLEMIEKTSGIDVATQMIAYLEKKIAGL
>MS0441 rimM, RimM protein
MNKMDRQRIETVGKLGSTYGIRGWLRIYSSTENAESIFDYQPWFLKIKDQ
WQAIELETWKHHNHELIVKLKNINDRETAQTLANVEIGVDLSVFPALEEG
DFYWHDLIGCQVVNLQGYAMGTVSEMMETGSNDVLVVRANAKDAFGKQER
LIPFLYEQVVKRVDLTTKTIEVDWDAGF
>MS1830 rlpA, RlpA protein
MKLKFLLTVLLAVMTTACSASSNTAQVNNTKKHYGIAGPKLEHKGAAKSS
NTYVVNGRKYTTQTSRNAKNYSKEGKASYYHNKFHGRRTASGEKYSNQQY
TAAHKTLPLGSYALVTNLRNNKKVIVRINDRGPFSKTRIMDLSHAAANEL
GLIRAGVGNVRVEALHVDRSGQISGAGASTLVKNARTDEARDRIK
>MS0337 rlpB, RlpB protein
MLKQLKKFTFVTAIAALTACGFHFQNGQLIPQELQTLTLESSDQYSDMAM
AMRKQLQLNNINLVEASPDVPVLRLNKTSTDDEVVSVFKQGREAEKMLML
EVSASVKMPNRAAYPISAKVNRTFFDNSRAALAKSSEKEIIWNDMREQAA
RQLISKMVALQHQIKNDKQ
>MS0231 rluA, RluA protein
MALIEYNPPTEPWLDIVYHDNHILVVNKPSGLLSVPGNQPRYYDSAMSRV
KDKYGFCEPAHRLDMATSGILLFAMSKAADSELKRQFRERETKKYYEALV
WGHLEQDSGEVNLPLVCDWENRPRQTICFERGKSAVTKYEVLQRLPNNTT
RVKLTPITGRSHQLRLHMLALGHPILGDKFYAHPQAKTMAPRLCLHAESL
TIKHPISGEEMTFFRLADF
>MS1821 rluA, RluA protein
MIFMAQITMSAEVQPHQMGQRLDQTLAELFPDYSRSRLKTWIEDELVLLN
GKVANIPREKVYGGEQVEITVEIEDENRFEPQNIPLNIVYEDDDILVINK
PKDFVVHPGAGNPNGTVLNALLYHYPQITEVPRAGIVHRLDKDTTGLMVV
AKTIPAQTQLVRALQKRKITREYEAIAFGIMTKGGTVDEPMSRHPTKRTL
MAVHPMGKPAVTHYRVMEHFRNYTRLRLRLETGRTHQIRVHMAHIAHPLL
GDQTYGGRPRPPKNASEEFMSVLRNFQRQALHAIMLRLEHPITGELMEWH
APLPEDFVELVNALKADYQLHKDELDY
>MS1072 rluA, RluA protein
MLEILYQDEHIVAVNKPAGMLVHRSWLDRHETQFVMQTLRDQIGRLVYPI
HRLDRPTSGVLLFALSSETANLLCQQFETKQVEKSYLAVVRGYLTGSERI
DYPLKIQLDKIADKFAQEDKAPQPAVTDYEGLKTVEKPYATPRYATSRYA
LVRLVPHTGRKHQLRRHMKHIFHPILGDTQYGDLHQNRTLTEHTEVSRLM
LHAEKLSFIHPIKRQRTEITAGLDEQWKKLMALFEW
>MS1170 rluA, RluA protein
MQEFKIIHKHRDFIIIDKPNGVSVHRDEAEIGLTGLLAKQLSVAQVWLVH
RLDKVTSGLLILALNERSAAEFSLLFARHEISKTYLALSKQKPKKKQGLI
IGDMEKARRGAWKLCQTKQNPAVTRFESVSCEPNLRLFILKPQTGKTHQL
RVAMKSLGSPILGDELYGGNSEKNDRTYLHAFRLEFTYQGEPFRVQSLPQ
TGEYFLRESVREKIDSV
>MS1624 rluA, RluA protein
MTTNPLNEKIINATVKMLQISEDESGQRIDNYLLAKLKGVPKSLIYRIVR
KGEVRVNKGRIKPEYKLQTGDTVRIPPVRVAEKEQAPISNKLNKVAALEK
QIIFEDDCLLVLNKPSGIAVHGGSGLSFGVIEALRSLRPEARFLELVHRI
DRDTSGILLVAKKRSALRNLHEQLRIKTVKKDYLALVRGQWQSHVKVIRA
PLLKNELSGGERIVRVNEQGKPSETRFAIEERYPTATLVRASPVTGRTHQ
IRVHTQYAGHPIALDDKYGDKEFDQQMQKLGLNRLFLHAYSIKFEHPKSG
EELRLTAPLDENMKGILKKLRENKA
>MS0368 rnc, Rnc protein
MNHLDRLQRQISYEFKDITLLKQALTHRSAATKHNERLEFLGDAILNYTI
ADALYHQFPKCNEGELSRMRATLVREPTLAILARQFKLGEYMALGHGELK
SGGFRRESILADCVEAIIGAISLDSSLVSATQITLHWYEKLLREIKPGEN
QKDPKTRLQEYLQGHRLALPTYDVKDIKGEAHCQTFTIECHVPNLDRTFI
GVGSSRRKAEQAAAEQILTALEIK
>MS1357 rnd, Rnd protein
MTYQIKEIQNPPHFMLITDDEALADVCARASTKSAIALDTEFVRIRSYYP
KLGLIQLYDGEQVSLIDPQEIQDFSPFKQLLADPKILKVLHACHEDLEVF
QHYYQQLPAPMLDTQIMANFLGFQNSMGLASLIKHYFNLEIDKGASRTDW
LARPLSNRQLAYAAADVWYLLPLYCKMQNALEQTRWQSAVEFDCNLLLEK
HRIVKNIDKAYLSISGAWKLNSEELMRLKLLASWRQEEAVKRDLALNFVV
RGENLWLLAQNNPKHTSEMLKLGLNPQEVRIHGKKMLQILERAERIDAEY
YPPEISRLADDMRYKQGLKNLQQKLKTIAPPDLNAEVIAGKRSLESLMKW
VWLKHKDPNKLPDLMRDWRAEFGSELAKLL
>MS1571 rnhA, RnhA protein
MYQIMRKQIEIFTDGSCLGNPGAGGIGVVLRYKQHEKTLSQGYFKTTNNR
MELRAVIEALNLLKEPCAVTLHSDSQYMKNGITQWIFNWKKKNWKASNGK
PVKNQDLWMALDNAVQAHTIDWRWVKGHSGHRENELCDQLAKQGAENPTL
EDIGYQPD
>MS0423 rnhB, RnhB protein
MAEFEYPQGFELIAGVDEVGRGPLVGAVVTAAVILDPNNPIDGLTDSKKL
SEKKREKLAEEIKQKALAWALGRAEPEEIDALNILQATMLAMQRAIKNLK
IQPHFVLIDGNRIPQLAIPAQAVVKGDSLVAEISAASIIAKVSRDHEMEV
LDKQYPQYEFAKHKGYPTKVHLAKLAEFGVLPQHRRSFSPVRKLLENE
>MS0483 rnpA, RnpA protein
MIKLNFSRELRLLTPAQFKYVFEQPLRASTPEITILARRNDLQYPRLGLT
VAKKHLKRAHERNRVKRLCRESFRLLQHELPNYDFVIVAKHGIGKLDNPT
FTAILSKLWQRHIRLAKKSLSN
>MS2330 rpe, Rpe protein
MKSYLIAPSILSADLARLGEDVENVLKAGADVIHFDVMDNHYVPNLTFGP
AVCKALRDYGITAPIDVHLMVKPVDRLIPDFAKAGADYITFHPEASEHID
RSLQLIRNSGCKAGLVFNPATSLSYLDYVMDKIDVILLMSVNPGFGGQSF
IPATMQKLQEARRRIDESGFDIRLEVDGGVKINNIAEIAAAGADMFVAGS
AIFDQPDYRKVINEMRQELAKVQK
>MS0252 rph, Rph protein
MRPNNRAVNEPRPIKITRHYTKHAEGSVLVEFGDTKVICTATVEDSVPRF
LKGQGQGWVTAEYGMLPRSTHSRMLREAAKGKQGGRTMEIQRLIARSLRA
MVDLTALGERSITLDCDVIQADGGTRTASITGACVALTDAINALVENGTL
KTSPLKGLVAAVSVGIVNGEAVCDLEYVEDSAAETDMNVVMMEDGRMIEV
QGTAEGEPFSHEELLTLLNLAKQGCNMIFDAQRRALAADC
>MS1744 rpiA, RpiA protein
MNQLEMKKVAAKAALQFVKPDMIVGVGSGSTVNCFIEELGAFRDQIKGAV
AASKASEELLRKQGIEVFSANDVSSLDIYVDGADEINPQKMMIKGGGAAL
TREKIVSSLAKNFICIVDSSKQVDILGSTFPLPVEVIPMARSQVARKLVA
LGGSPEWREGVITDNGNVILDVHNFIIMNPIEMEKELNNVAGVVTNGIFA
LNAAHTVIVGTPDGAKIIE
>MS0383 rpiB, RpiB protein
MKIAIGCDEAAYRLKVEIMKHLDELGIEYDDFGAGEGDVVLYPDVAEAVA
VAVAEGKYQRAILTCGTGIGMCITANKVPGIRAAVCYDVFSTERSRKSND
AQIMCLGERVIGVELAKSLIDVWFKCEFAGGGSAPKVKRINEIDAKYNKR
>MS0564 rpiB, RpiB protein
MVSTLIELKNLLFNKEINMKIALMMENSQAAKNAVVLKELKGVVDPKGYS
VFNVGMSDENDHHLTYIHLGIMASILLNSKAVDFVVTGCGTGQGAMMSLN
LHPGVVCGYCLDPADAFLFCQINNGNALALAFAKGFGWGAELNVRYMFEK
AFTGVRGEGYPIERKEPQVRNAGILNEVKKAVAKDNYLDTLRAIDPELVK
TAVSGERFQQCFFENCQDKEIEAFVREVLAK
>MS0198 rpiR, RpiR protein
MAQIDPKSIGAHIRTRKQQLTPLERKVLDCILAKSDFDEKTSLKEIATEN
QVSEAIVVKIAKKLDFSGYREFRSGLAYYKQLEVANLHNDISADDTATQV
IKKVFETSIQALQETMSILDISEFERCVKILVEADHIDLFGIGGSAQIAK
DMAHKFLRIGIKASVYDDSHMMLMAGAVSHPGNVVLAISHSGTTIDVIEP
LQLARQNGAKTIAITNYAISPIAECADVVLTSTSQGSLLLGENAAARIAQ
LNILDALYVAVAKQNLDISEDNLRKTRYAVKHKRTK
>MS0208 rplA, RplA protein
MAKLTKRMKAIKAGVDSTKAYEINEAIAVLKQFATAKFDESVDVAVNLGI
DPRKSDQNVRGATVLPNGTGRSVRVAVFTQGANADAAKEAGADLVGMEDL
AEQIKKGEMNFDVVIASPDAMRVVGQLGQVLGPRGLMPNPKVGTVTPNVA
DAVKNAKSGQVRYRNDKNGIIHTTIGKASFSAEALTQNLQALLAALVKAK
PTTAKGIFIKKVSISTTMGAGVAVDQNSL
>MS2045 rplB, RplB protein
MAIVKCKPTSAGRRHVVKIVNPELHKGKPYAPLLDTKSKTGGRNNLGRIT
TRHIGGGHKQHYRLIDFKRNKLDIPAVVERLEYDPNRSANIALVLYKDGE
RRYILAPKGLSAGDQIQAGVNAPIKVGNALPMRNIPVGSTVHNVELKPGK
GGQIARSAGSYVQIIAREGNYVTLRLRSGEMRKVLAECTATIGEVGNSEH
MLRVLGKAGANRWRGVRPTVRGTAMNPVDHPHGGGEGRNFGKHPVSPWGV
QTKGKKTRHNKRTDKYIVRRRGK
>MS2048 rplC, RplC protein
MIGLVGRKVGMTRIFTEEGVSIPVTVIEIEANRVTQVKTLENDGYSAVQV
TTGSKKASRVTKPEAGHFVKAGVEAGRGLWEFRTEGEEFTLGQEINVDIF
ADVKKVDVTGTSKGKGFQGGVKRWNFRTQDATHGNSLSHRVLGSIGQNQT
PGRVFKGKKMAGHLGAERVTVQSLEVVRVDAERKLLLVKGAVPGATNSDV
IVKPAVKA
>MS2047 rplD, RplD protein
MELQVVGANALAVSETTFGREFNEALIHQVVVAYAAGARQGSRAQKTRAE
VSGSGKKPWRQKGTGRARSGDIKSPIWRSGGITFAAKPQDHSQKVNKKMY
RGAIKSILSELVRQDRLVVVDKFEIDAPKTKVLVQKLKDLALEDVLIITA
SLDENLFLAARNLYKVDVRDVQGIDPVSLIAFNKVVVTVDAVKQIEEMLA
>MS2036 rplE, RplE protein
MAKLHDYYRDQVVNELKTKFNYASVMQVPRIEKITLNMGVGEALTDKKLL
DNAVADLTAISGQKPLITKARKSVAGFKIRQGYPIGCKVTLRGERMWEFL
ERLITIAVPRIRDFRGLSAKSFDGRGNYSMGVREQIIFPEIDYDKVDRVR
GLDITITTTAKSDEEGQALLAAFNFPFRK
>MS2033 rplF, RplF protein
MSRVAKAPVSVPAGVEVKLDGQLLTVKGKNGELTRTIHNFVEVKQDNNEL
TFSPRNDGAEANAQAGTTRALVNAMVIGVTEGFTKKLQLVGVGYRAQVKG
NVVNLSLGFSHPVEHTLPAGITAECPSQTEIVLKGADKQLIGQVAADIRA
YRSPEPYKGKGVRYSDEVVRTKEAKKK
>MS0472 rplI, RplI protein
MQVILLDKIVHLGNVGDQVNVKSGFARNFLIPQGKAVMATKANIEHFEAR
RAEIEAKVAAELAAAQARAAQIAALEAVTISSKAGDEGRLFGSITTREIA
EAVTAAGVEVAKSEVRLSTGPIRTLGDHEVKFQLHGEVFTALNVIVVAE
>MS0209 rplJ, RplJ protein
MALNLQDKQAIVAEVNEAAKGALSAVIADSRGVTVDKMTELRKTAREAGV
SMRVVRNTLLRRAVEGTEFECLTDTFTGPTLIAFSNEHPGAAARLFKEFA
KANDKFEIKGAAFEGKIQDVDFLATLPTYDEAIARLMGTIKEAAAGKLVR
TFAALRDKLQEAA
>MS0207 rplK, RplK protein
MAKKVQAYVKLQVAAGMANPSPPVGPALGQQGVNIMEFCKAFNARTESLE
KGLPIPVVITVYADRSFTFVTKTPPAAVLLKKAAGVKSGSGKPNKEKVGK
VTLDQVRQIAETKAADMTGATIETKMKSIAGTARSMGLVVEE
>MS0210 rplL, RplL protein
MIVMSLTNEQIIEAIASKSVSEIVELISAMEEKFGVSAAAVAAAAPAAGA
AAAEEKTEFDVVLKAAGANKVAVIKAVRGATGLGLKEAKDLVESAPANLK
EGVSKEEAESLKKELEAAGAEVEIK
>MS1283 rplM, RplM protein
MVIKLMKTFVAKPETVKRDWYVVDATGKTLGRLATELASRLRGKHKAEYT
PHVDTGDYIIVINADKVAVTGRKETDKVYYWHTGYVGGIKQATFKEMIAR
RPEAVIEIAVKGMLPKGPLGRAMFRKLKVYAGSQHEHAAQQPQVLDI
>MS2029 rplO, RplO protein
MYLNTLAPAEGAKHSAKRLGRGIGSGLGKTGGRGHKGQKSRTGGGVRRGF
EGGQMPLYRRLPKFGFTSMKAAVTAEIRLNDLTKVENNVVTLESLKAANI
ITKDIQFAKVVLAGEVKGAVTVRGLRVTKGAKAAIEAAGGSVEE
>MS2041 rplP, RplP protein
MLQPKRTKFRKVHKGRNRGIAAGTDVSFGTYGLKAIGRGRLTARQIEAAR
RAMTRAVKRQGKIWIRVFPDKPITEKPLEVRMGKGKGNVEYWVALIQPGK
VLYEMDGVSEEIAREAFALAAAKLPIKTTFVTKTVM
>MS2022 rplQ, RplQ protein
MRHRKSGRQLNRNSSHRQAMFRNMASSLVSHEIIKTTLPKAKELRRVVEP
LITLAKEDSVANRRLAFARTRNIETVAKLFNELGPRFAQRAGGYTRILKC
GFRAGDNAPMAYIELVDRPETAAAVEE
>MS2032 rplR, RplR protein
MDKKSARIRRAARARHMMRENGVTRLVIHRTPRHIYAQVIAPNGSEVLAA
ASTVEKAISEQVKYTGNKDAAAVVGKLVAERALAKGIKDVAFDRSGFKYH
GRVQSLADAAREAGLQF
>MS0443 rplS, RplS protein
MSNIIKQLEQEQLKQNIPSFRPGDTLEVKVWVVEGAKKRLQAFEGVVIAI
RNRGLHSAFTLRKVSNGTGVERVFQTHSPAIDSISVKRKGAVRKAKLYYL
RERSGKSARIKERLGA
>MS1056 rplT, RplT protein
MARVKRGVIARARHKKVLKAAKGYYGARSRVYRVAFQAVIKAGQYAYRDR
RQRKRQFRQLWIARINAAARQNGLSYSKFINGLKKASVEIDRKILADIAV
FDKVAFAALVEKAKSAL
>MS1599 rplU, RplU protein
MRVKCRDIISARYSGVFMYAVFQSGGKQHRVSEGQVVRLEKLEKATGETV
EFDSVLMVVNGEDVKIGAPVVTGAKVVAEVVAQGRGDKIKIVKFRRRKHS
RKQQGHRQWFTEVKITGIQA
>MS2043 rplV, RplV protein
METIAKHRYARTSAQKARLVADLIRGKKVAAALEILTYTNKKAAALVKKV
LESAIANAEHNDGADIDDLKVTKIFVDEGPSMKRVMPRAKGRADRILKRT
SHITVVVSDR
>MS2046 rplW, RplW protein
MSQQERLLKVLKAPHISEKATNNAEKSNTIVFKVALDANKVEIANAVAQL
FEVKVDSVRTVVVKGKTKRHGAKTGRRSDWKKAYVTLAEGQELDFVEGAA
E
>MS2037 rplX, RplX protein
MAAKIRQNDEVIVLTGKDKGKRGKVTQVLPNGKVIVEGVKIITKHEKPVP
ALGKEGGLVKKEAAIDVSNVAIFNPKTNKADRVGFRFEDGKKVRFFKSNN
EII
>MS1116 rplY, RplY protein
MAFKFNAEVRSAQGKGASRRLRHNGQIPAIVYGGNEDAVSIVLNHDELNN
AQAHDSFYSEVITLVINGKEVAVKVQAMQRHPFKPKLVHIDFKRA
>MS1598 rpmA, RpmA protein
MATKKAGGSTRNGRDSEAKRLGVKRFGGESVLAGSIIVRQRGTKFHAGSN
VGMGRDHTLFATADGKVKFEVKGEKSRKYVSIVTE
>MS1942 rpmB, RpmB protein
MEIIMSRVCQVTGKRPAVGNNRSHALNATRRRFLPNLHTHRFWVESENRF
VTLRLTAKGMRIIDKKGIDAVLADIRARGEKI
>MS2040 rpmC, RpmC protein
MKAQELRAKTVEELNAELANLAGEQFKLRMQAATGQLQQTHQLKQVRRNI
AQVKTVLTEKAGE
>MS2030 rpmD, RpmD protein
MTMAKTIKVTQVRSSIARLPKHKATLRGLGLRHMHHTVELIDTPAVRGMI
NQVSYMVKVEE
>MS0448 rpmE, RpmE protein
MRFSMKQGIHPEYKEITVTCSCGNVIKTRSTAGHDINLDVCGNCHPFYTG
KQRVVDTGGRVERFNKRFSIPGSKK
>MS1869 rpmF, RpmF protein
MAVQQNKKSRSRRDMRRSHDALTTAAVSVDKASGETHLRHHVTADGYYRG
RKVINK
>MS1943 rpmG, RpmG protein
MAAKGAREKIRLVSSAETGHFYTTDKNKRNMPEKMEIKKFDPVVRKHVIY
KEAKIK
>MS0484 rpmH, RpmH protein
MKRTFQPSVLKRSRTHGFRARMATKNGRQVLARRRAKGRKSLSA
>MS1055 rpmI, RpmI protein
MYQTKQCGVFLTMPKIKTVRGAAKRFKKTASGGFKRKQSHLRHILTKKTT
KRKRHLRHKSMVAKADQVLVVACLPYV
>MS2027 rpmJ, RpmJ protein
MKVRASVKKICRNCKIVKREGVVRVLCSDPKHKQRQG
>MS2023 rpoA, RpoA protein
MQGSVTEFLKPHLVDIEQVSPTHAKVILEPLERGFGHTLGNALRRILLSS
MPGCAVTEVEIDGVLHEYSSKEGVQEDILEVLLNLKGLAVKVQNKDDVFL
TLNKSGIGPVVAADITHDGDVEIVNPEHVICHLTDENASINMRIRVQRGR
GYVPASARVHAQDEERPIGRLLVDACYSPVDRIAYNVEAARVEQRTDLDK
LVIELETNGAIDPEEAIRRAATILAEQLDAFVDLRDVRQPEVKEEKPEFD
PILLRPVDDLELTVRSANCLKAETIHYIGDLVQRTEVELLKTPNLGKKSL
TEIKDVLASRGLSLGMRLENWPPASIAED
>MS0212 rpoB, RpoB protein
MGYSYTEKKRIRKDFGKRPQVLNVPYLLTIQLDSFEKFIQRDPEGQQGLE
AAFRSVFPIVSNNGSTELQYVSYKLGEPVFDVRECQIRGTTFAAPLRVNL
RLVSYDRDAAPGTIKDIKEQDVYMGEIPLMTDNGTFVINGTERVIVSQLH
RSPGVFFDSDKGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFARIDR
RRKLPATIILRALGYSTEEILDLFFEKIQFEIQDNKLLMALVPERLRGET
ASFDIEANGKVYVERGRRITARHIRTLEKDNVTKIDVPTEYIVGKVSAKD
YIDLESGELVCPANMEISLDILAKLAQAGYKSIETLFTNDLDFGPYISET
LRVDPSSDRLSALVEIYRMMRPGEPPTKEAAEALFDNLFFSAERYDLSAV
GRMKFNRSLGLAEGVGNGVLSKEDIVGVMKKLIDIRNGRGEVDDIDHLGN
RRIRSVGEMAENQFRIGLVRVERAVKERLSLGDLDAVTPQDLINAKPVSA
AVKEFFGSSQLSQFMDQNNPLSEVTHKRRISALGPGGLTRERAGFEVRDV
HPTHYGRVCPIETPEGPNIGLINSLSVYARTNNYGFLETPYRKVVDGQVT
EEIEYLSAIEEGNYVIAQANASLDEDFRFTDAFVTCRGEHGESGLYRPEE
IQYMDVSPQQVVSVAAALIPFLEHDDANRALMGANMQRQAVPTLRADKPL
VGTGMEKPIALDSGVAVVAKRGGIIQYVDASRIVVKVNEDETIPGEAGID
IYNLIKYTRSNQNTCINQIPCVNLGEPIGRGEVLADGPSTDLGELALGQN
IRVAFMPWNGYNFEDSMLVSERVVQQDRFTTIHIQELSCVARDTKLGAEE
ITADIPNVGETALSKLDESGIVYVGAEVKGGDILVGKVTPKGETQLTPEE
KLLRAIFGEKASDVKDSSLRVPNSVSGTVIDVQVFTRDGVEKDKRALEIE
EMQLKEAKKDIAEELEILEAGLFSRVRNLLIDGGVDAKELDRLDRTKWLE
QTLNDEAKQNQLEQLAEQYEELRKDFEHKLEVKRGKIIQGDDLAPGVLKV
VKVYLAVKRRIQPGDKMAGRHGNKGVISKINPVEDMPYDENGQPVEIVLN
PLGVPSRMNIGQILETHLGLAAKGIGEQINRMLKEKQEIEKLRGYIQKAY
DLGGGSQKVDLNTFTDEEVMRLAQNLRKGMPLATPVFDGAEEKEIKDLLE
LGGLPTSGQITLYDGRTGEKFERPVTVGYMYMLKLNHLVDDKMHARSTGS
YSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVNG
RTKMYKNIVSGTHQMDPGTPESFNVIMKEIRSLGINIDLDEE
>MS0213 rpoC, RpoC protein
MKNFHRTLNKFNSDRSKSVKDLVKFLKAQSKTSEDFDVIKIGLASPDMIR
SWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKHR
GVICEKCGVEVTQTKVRRERMGHIELASPVAHIWFLKSLPSRIGLLLDMP
LRDIERVLYFESYIVIEPGMTDLEKGQLLTEEQFMDAEDRWADEFDAKMG
AEAIQALLRDMDLEHECETLREELQETNSETKRKKITKRLKLLEAFMQSG
NKPEWMVMTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKR
LLDLVAPDIIVRNEKRMLQESVDALLDNGRRGRAITGSNKRPLKSLADMI
KGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFRPFI
YAKLESRGFASTIKAAKKMVEREDAIVWDILAEVIREHPILLNRAPTLHR
LGIQAFEPILIEGKAMQLHPLVCAAFNADFDGDQMAVHVPLTLEAQLEAR
ALMMSTNNVLSPANGDPIIVPSQDVVLGIYYMTREKVNAKGEGMLLQDPR
EAEKAYRTGRAELHSRVKIRITEYVKNAEGEFEPQTTLTDTTIGRAILWM
IAPKGMPYSLFNQTLGKKAISKLINECYRRLGVKASVMFADQIMYTGFAY
AARSGSSVGIDDMVIPEKKYEIISAAEAEVAEIQEQFQSGLVTAGERYNK
VIDIWATANERVAKAMMENLSTEEVVNREGNLEKQSSFNSIFMMADSGAR
GSAAQIRQLAGMRGLMARPDGSIIETPITANFREGLNVLQYFISTHGARK
GLADTALKTANSGYLTRRLVDVAQDLVIVEDDCGTHEGIVMTPLIEGGDE
KVSLRELVLGRVAAEDILKPGTEEVLFPRNTLLDEKVCDILDENSVDSVK
VRSVVTCDTDFGVCAKCYGRDLARGHLINQGEAVGVIAAQSIGEPGTQLT
MRTFHIGGAASAAAKESSVQVKNSGSIRLTNVKSVTNNEGKLVVTSRNTE
LTIIDAFGRTKEHYKVPYGAVLNKGDGEAVTAGETVANWDPHTMPVVSEV
AGFVKFVDIVDGLTVTRQTDELTGLSSIVVQDVGERATAGKDLRPAIKVV
DAQGNDIFIPGVDVLAQYFLPGKAIVTLDDGAEVQVGEPLARIPQESVGT
KDITGGLPRVADLFEARKPKEPAILAEITGIVSFGKETKGKRRLVITPVE
GEAYEEMIPKWRQLNVFEGEMVERGDVISDGAETPHDILRLRGVHAVTEY
IVNEVQEVYRLQGVKINDKHIEVIVRQMLRKGIITKAYDSEFLEGEQVEV
ARVKIVNRKREAEGKPPVEFERELLGITKASLATESFISAASFQETTRVL
TEAAVAGKRDELRGLKENVIVGRLIPAGTGFAYHQNRIKNRGQANVVEEQ
EVKFSAADEAEIEAEFNMIAEDPAASLAEMLNMADDAE
>MS1760 rpoD, RpoD protein
MNQRRTSYMDHNPQSQLKLLIAQGKEQGYLTYAEVNDHLPEELVDTDQIE
DIIQMINDMGIQVLESAPDADDLMLSETIADEDAVEEATQVLSSVEAELG
RTTDPVRMYMREMGSVELLTREGEIDIAKRIEDGINEVQSAVAEYPEALD
YLLKQYEQVEEGSVRLADLITGFVDLNAEEASEEISDLEEVLDDEDGDIP
ADALNDEEEDEESDEGDTSTDDSDNSIDPEVAREKFSALKDQCVKTLEFI
EKYGRTDNKVKEQIQVLSDIFTQFRLVPRQFDTLVLSMRSMMKQVRAEER
QIQRLAVDYAKVPKDDFQKAFIGNETSEQWLESLLQSKKTYVEKLQQRAP
EISKSIVRLQQVETDTKLTVQQIRDIGERIAQGELKARRAKKEMVEANLR
LVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWI
RQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREASPEELAE
RMGMPEDKIRKVLKIAKEPISMETPIGDDDDSHLGDFIEDSTLELPLDSA
TAQSLKVATHEVLEGLTPREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRE
RIRQIEAKALRKLRHPSRSETLRSFLDE
>MS0025 rpoD, RpoD protein
MTKETQTMMLVPQGSIEAYIRAANEYPMLSAEEEKELAERLYYQEDLEAA
KKLILSHLRFVIHVARGYSGYGLPQADLIQEGNIGLMKAVKRFNPEVGVR
LVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFS
DNELDLVANELGVTKEDVIEMESRMTGADVGFDLPTDDSEEETFAPSMYL
EDKSSNFAAELESENFETQAIDQLSNAMENLDERSKDIIQARWLDDTKAT
LHELAAKYNISAERVRQLETNALKKLKSAVSF
>MS2228 rpoE, RpoE protein
MLLTRGYMAEQLTDQALVERVQQGDKKAFNLLVSRYQNKVAGLLTRYVSR
NDIPDVVQESFIKAYRSIESFRGESAFYTWLYRIAVNTAKNYLTAQGRRP
PNEDILAEEAETYDVGGNLRDVDTPEHEMLSAELKKVIFDTIDGLQEELK
TAITLREMEGLSYEEIADIMDCPVGTVRSRIFRAREIIESKIRPLIQR
>MS1737 rpoZ, RpoZ protein
MARVTVQDAVEKIGNRFDLILTAARRARQLQLHVREPLVPEDNDKPTVIA
LREIEKGLIDNNIMNAQERQEALEQEKVELNAVSLLSE
>MS1476 rpsA, RpsA protein
MSESFAQLFEESLKELETRQGSIVSGTVVAIQKGFVLVDAGLKSESAIPV
EEFQNAQGELEVQVGDVVNVALDAVEDGFGETKLSREKAVRHESWIELEK
AYEEQATVTGLINGKVKGGFTVELNGVRAFLPGSLVDTRPVRDTLHLEGK
ELEFKVIKLDQKRNNVVVSRRAVIESENSQDREEILANLAEGAEVKGTVK
NLTDYGAFVDLGGVDGLLHITDMAWKRVKHPSEIVNVGDEITVKVLKFDK
DRTRVSLGLKQLGQDPWAAIAQNHPVGSKLTGKVTNLTDYGCFVEILDGV
EGLVHVSEMDWTNKNIHPSKVVSLGDTVEVMVLEIDEERRRISLGLKQCK
ANPWLQFAETHNKGDKVEGKIKSITDFGIFIGLEGGIDGLVHLSDISWNV
AGEEAVRNYKKGDEVAAVVLQVDSAKERISLGIKQLEEDPFNNFVAVNKK
GAVVSATVVEADSKGAKVELNGGVEGYIRAADLTDEVNAGDVVEAKYTGV
DRKARIVHLSVRAKDQAEEAAAVASVNNKQEDVAIPNAMAEAFKAAKGE
>MS1933 rpsB, RpsB protein
MICGGKTPIKKENIMAQVSMRDMLQAGVHFGHQTRYWNPKMKPFIYGPRN
GVHIINLEKTVPMFNGALAELTRIASNNGKILFVGTKRAATEAVQAAALD
CQQYYVNHRWLGGMLTNWKTVRQSIKRLKDLETQSQDGTFDKLTKKEALV
RTREMEKLELSLGGIKDMAGLPDAIFVIGADYEHIAIKEANNLGIPVFAV
VDTNSNPDGIDFVIPGNDDATRAIQLYVTAAAAAVKEGRSQQTATEEKFA
EEVAAE
>MS2042 rpsC, RpsC protein
MCQIVNRRGIAMGQKVNPHGIRLGIVKPWSSTWFANTQDFASNLDGDFKV
RKFLNKELANASVSRITIERPAKSIRVTIHTARPGIVIGKKGEDVEKLRN
AVAKIAGVPAQINIAEVKKPELDAKLVADSIASQLERRVMFRRAMKRAVQ
NAMRLGAKGIKVEVSGRLGGAEIARSEWYREGRVPLHTLRADIDYNTSEA
HTTYGVIGVKVWIFKGEILGGMAAIAQQPEQQPAAPKKAPRGKGRK
>MS2024 rpsD, RpsD protein
MARYLGPKLKLSRREGTDLFLKSGVRAIDSKCKIDTAPGQHGARKPRLSD
YGSQLREKQKVRRIYGILERQFRNYYKEANRLKGNTGENLLVLLEGRLDN
VVYRMGFAATRAEARQLVSHKAIVVNGRVVNIPSFQVSVDDVVAVREKSK
KQARIKASLELAEQREKPTWLEVDAAKMEGVFKRVPERSDLSADINEHLI
VELYSK
>MS2031 rpsE, RpsE protein
MANIEKQAGELQEKLIAVNRVSKTVKGGRIMSFTALTVVGDGNGRVGFGY
GKAREVPAAIQKAMEKARRNMINVALHEGTLQHPVKGIHTGSRVFMQPAS
EGTGIIAGGAMRAVLEVAGVRNVLSKAYGSTNPINVVRATIDALANMKSP
EMVAAKRGKTVDEILG
>MS0469 rpsF, RpsF protein
MRHYEIVFMVHPDQSEQVPGMIERYTGSVKEAGGQIHRLEDWGRRQLAYP
INKLHKAHYVLMNVEAPQEVIDELETTFRYNDAVLRNVIIRTKHAVTEAS
PMVKAKDERRASAEVENNDFEDAEE
>MS0163 rpsG, RpsG protein
MGILKIKNGEIAMPRRRSIEPRKILPDPKFGSELLAKFINVLMVDGKKSI
AESIVYNALDTLAQRTNKDALVAFEEALENVRPTVEVKSRRVGGSTYQVP
VEVRPARRNALGMRWIVEAARKRGDKSMALRLANELSDASENKGSAVKKR
EDVHRMAEANKAFAHYRW
>MS2034 rpsH, RpsH protein
MSMQDPIADMLTRIRNGQAASKVAISMPSSKLKVAIANVLAAEGYIESVK
VLEGAKPELEITLKYFQGKPVVESIQRVSRPGLRIYKRKDELPKVMGGLG
VAVVSTSKGVMTDRAARQAGLGGEIICYVA
>MS1282 rpsI, RpsI protein
MTAANQNYGTGRRKSSSARVFIKPGSGKITINQRELDVYFGRETSRMIVR
QPLELVEMTEKLDLYITVKGGGISGQAGAIRHGITRALMEYDESLRPVLR
AAGFVTRDARRVERKKVGLRKARRRPQFSKR
>MS2049 rpsJ, RpsJ protein
MKGDGVSFLLAKNYWSSGLMQNQRIRIRLKAFDHRLIDQSTAEIVETAKR
TGAQVRGPIPLPTRKERFTVLISPHVNKDARDQYEIRTHKRLVDIVEPTE
KTVDALMRLDLAAGVDVQISLG
>MS2025 rpsK, RpsK protein
MAKTPVRARKRVKKQVVDGVAHIHASFNNTIVTITDRQGNALAWATAGGS
GFRGSRKSTPFAAQVAAERCAEAVKEFGLKNLEVMVKGPGPGRESTIRAL
NAAGFRITNITDVTPIPHNGCRPPKKRRV
>MS0162 rpsL, RpsL protein
MATINQLVRKPRVKKVVKSNVPALQACPQKRGVCTRVYTTTPKKPNSALR
KVCRIRLTNGFEVTSYIGGEGHNLQEHSVVLIRGGRVKDLPGVRYHTVRG
ALDCAGVKDRKQGRSKYGVKRPKA
>MS2026 rpsM, RpsM protein
MARIAGINIPDHKHTVIALTAIYGIGKTRSQAICAAAGIAENVKISELSE
EQIDKLRDEVGKFTVEGDLRREVTLNIKRLLDLGCYRGLRHRRGLPVRGQ
RTKTNARTRKGPRKPIKK
>MS2035 rpsN, RpsN protein
MAKQSMKARDVKRVKLAEKFYAQRVELKRIISDVNSSDEERWDAVLKLQT
LPRDSSPSRQRNRCSQTGRPHGVLRKFGLSRIKVREAAMRGEIPGLKKAS
W
>MS0699 rpsO, RpsO protein
MSLSVEKKAAIVAEFGRDAKDTGSSEVQIALLTAQINHLQAHFAEHKKDH
HGRRGLLRMVSRRRKLLDYLKRTDLAKYSETIARLGLRR
>MS0440 rpsP, RpsP protein
MVTIRLSRGGAKKRPFYQIVVADSRCPRDGRFIERVGFFNPLAAGNAERL
RIQLDRVNAWLEKGASLSDRVAALVKEAQKAA
>MS2039 rpsQ, RpsQ protein
MTDKIRTVQGRVISDKMDKSFTIAIERKVKHPLLGKFIRRTTKLHVHDEN
NEARIGDTVEIKECRPVSKTKSWTLVRVVEKAVEA
>MS0471 rpsR, RpsR protein
MARYFRRRKFCRFTAENVVEIDYKDIATLKNYITESGKIVPSRITGTRAK
YQRQLARAIKRARYLALLPYTDNHQ
>MS2044 rpsS, RpsS protein
MPRSLKKGPFLDLHLLKKVEKAVESGDKKPIKTWSRRSMIIPSMIGLTIA
VHNGRQHVPVYVSDEMIGHKLGEFAPTRTYRGHAADKKAKK
>MS1756 rpsT, RpsT protein
MTLANIKSAKKRAVQSEKSRQHNASQRSMMRTYIKKVYAAVAAGEKAAAQ
AAFVEMQKVVDRMASKGLIHANKAANHKSKLVAQIKKLA
>MS1762 rpsU, RpsU protein
MPVIKVRENESFDVALRRFKRSCEKAGILAEVRSREFYEKPTTIRKRENA
TRAKRHAKRVARENARNTRLY
>MS2229 rseA, RseA protein
MEFTMQKELLSAYIDGEQVGNDITLELCNDAELQQSWSNYHVIRSVMRDE
SEVFLGADFTAKMATLIDQEDAITLSQPTPDEVENLPFMQKLKALFAPMV
QVGVAAGVCLVAVLGVQSFNANNNAQTTADTPVLQTLPFSNNVQEVSYNA
PTKDAVTQEQLEQKNKRIGAMLQSYELQRRVYADSVQNQQH
>MS2230 rseB, RseB protein
MVNLFPVDEGTSKMFKKLTALFFILPFSFSVFGQDNLSPKQLLTEMLAAQ
NKLNYEISYVQIAGAEIDTYRYRHVYNEGKSYAQLATLEGGKQEIIQRDN
LISYFHSNYSPFSIRGSQIIDNLPNIVNADFSRIEKHYDFINMGRNRIAD
RLVQTVRILPKDNFRYQYVVFIEEKTHLLLGSDMLDQDGNLLERFRVVNF
YIDDQMTQPLTDALSKLPTPPVLDKATPPKNKLSWQAGWLPQGFAVLNNY
LTKTDEDTIESRLYSDGLFSFTIYVSNNILPENQENVWKQGSFTIYSESM
KDKEVTIIGQIPLTTAKRIVQEIKSN
>MS2231 rseC, RseC protein
MLTESAVVIDYRDGIAKVKCQSKTACGSCAAKNACGSAALSELTGEPGEH
ILIISTITPLKIGQQVEIGLQEQSLLFSAFLAYVIPLMTLLIGTFVATAI
FSNELISAAFIFISTALSFLAVRFYAKKLNKKSAFEPILLRVLN
>MS1588 rsmC, RsmC protein
MISLESQVLERHLPLFADKSVLLAGGVNDDFPQKIQSQCRSVKIWSWYFD
YVNQIQGKSAVDFSVIFTGRADLIVYYWTKNKQEVQFQLMQLLANAPVGQ
EVLIIGENRSGVRSAEKMLAHFGDIGKIDSARRCGLYHFTLQKQPNFELE
NFWKTYRSPQLGELIVYSLPGVFSANELDVGTQLLLSTVKDNIRGDVLDL
GCGAGVIGSMIKLKNPPAKVTMTDIHAMALASAERTLLENKLSGQVLASD
VFSHVEGKFDLIISNPPFHDGIDTAYRAVRELISNAKWHLVPGGELRIVA
NAFLPYPDLLDEYFGGHKVLAQTNKFKVYSVIG
>MS0145 rssA, RssA protein
MKVGLVLEGGAMRGMFTAGVLDIFLDENIHIDGAVTVSAGALFGINLPSK
QRGRVLRYNKKYLNDKRYMGLHSLLTTGNIVNRDFAFYELPYTLDPFDQQ
TFAQSDMDFWVTLTNVETGEAEYFKIQDAFEQMEVLRATSAMPFVSKMVE
INGKKYLDGGIADSIPLQKCFDLGYDKVIVVLTRPLEYRKTPSSKTLFKL
FYPNYPQLAARWAQRYADYNQTVERIIKLNDEQKIFVIRPSESLNISRLE
KDPEMIQRMYELGLKDGKAAIAGLREYLAK
>MS0391 rsuA, RsuA protein
MRKIAALAAFYYFRIFMRLDKFLAENTGLTRSQAAKILRQGNVQVNGQVV
KSGSLKITPQDEVLFEGESLEWLEDGVYIMLNKPQGYVCSHDDGEYPTVY
QFFDYPLAGKLHTAGRLDADTSGLVLLTDDGKWSHRVTSPKYHCQKTYLV
TLADPVESNYRQACEQGILLRGEKEPTKPAILEIIDDYNVNLTISEGRYH
QVKRMFAALGNKVVGLHRWKIGNIELDEDLPEGEFRVLTAEEIQYF
>MS2342 rsuA, RsuA protein
MNLNNPIKSGFKSDLKKSTGKSAFYNKKRKSAVSFHQNSVTKRQKSKQLA
FSETQVILFNKPFDVLTQFTDENGRATLKDFIHIPDVYAAGRLDRDSEGL
LILTNNGEIQHRLADPKFKMEKTYFVQVEGEPTETDLAKLRQGVELKDGM
TRPAKVRLIAQPDFIWQRTPPIRERKSIPTSWLEIKISEGKNRQVRRMTA
NIGFPTLRLIRYAVSSFTLDNLANGSYRLLTDNELERLYKTLKLTKE
>MS1038 rsuA, RsuA protein
MKATRKLMKFQTEKLQNRNFRQKPSENRPQFERGGRPERKERGAFESRFR
ADDSRQASSFKNDRRDERRNARGDERRSSFSGDESSNRRNERKPAERKPL
PMRKPKPAHPVEKKATVEGEKLQKVLARAGQGSRREIESIIEQGRVSVDG
KIATLGDRVTVHDGLKIRIDGHLVNLTAAQREVCRVLMYYKPEGELCTRH
DPEGRATVFDRLPRLTGSRWIAVGRLDINTSGLLLFTTDGELANRLMHPS
QEVEREYSVRVFGQVDDAMIHRLRKGVQLEDGPANFKAIKAVGGTGLNQW
FDVTLMEGRNREVRRLWESQGIQVSRLIRIRYGNIQLMKTLPRGGWEEMD
LAKVNYLRELVGLPPETETKLDVTNLRRRAKTGQIRKAVKRYSEMNKRYK
KS
>MS0712 ruvA, RuvA protein
MANFIVNVKVCMIGRLQGILLEKQPPEILLDVHGIGYELLLPMTSFYNLP
EIGQETVLFTHLVVREDAHLLFGFSAKTDRTLFRELIKTNGVGPKLALAI
LSAMSVNEFAYAIEHEELSKLVKIPGVGKKTAERLLVELKGKFKGIKQPD
FFVESSHVGAVDPVTTSPEVPAEEAVAALMALGYKASDAEKMVKRIAKPH
LTSEQLIREALKAAL
>MS0713 ruvB, RuvB protein
MIEADRIISSNAQLGDEYIDRAIRPKLLTDYVGQPQVREQMGIFIQAAKL
RQDALDHLLIFGPPGLGKTTLANIVANEMGVNIRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPAIEEVLYPAMEDYQLDIMIGEGPAARSIKL
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYSVEDLTSIVARSAGCL
NLEMSDGASHEIARRSRGTPRIANRLLRRVRDFADVKNAGIISEDIAKSA
LSMLDIDQAGFDYLDRKLLSAVIERFDGGPVGLDNLAAAIGEERDTIEDV
LEPYLIQQGFLQRTPRGRIATSRTYRHFGLDKLTE
>MS0711 ruvC, RuvC protein
MFALFIWSFMAIILGIDPGSRVTGYGIIRQTGRTLEYLGSGAIRTQVEDL
PTRLKRIYAGVTEIITQFRPDMFAIEEVFLAKNPNSALKLGQARGTAIVA
AVNQNLPVFEYAARLVKQTVTGSGSADKVQVQDMVTRILRLSDKPQADAA
DALAIAITHAHTIQHSLQVATSAKSTENHEKTTALLRTRYSRGRFRLKI
>MS1964 sPS1, SPS1 protein
MRKDMLQVQHENHFFLFNFDENRPNQEHFFESYFWQKQNRIIGSAKGRGT
TWFIQSQDLFGVNTALRHYYRGGLWGKINKDRYAFSSLEETRSFAEFNLL
NRLYQAGLPVPKPIGAHVEKLAFNHYRADLLSERIENTQDLTALLPNTEL
TAEQWQQIGKLIRRLHDLQICHTDLNAHNILIRQQNNDTKFWLIDFDKCG
EKPGNLWKQENLQRLHRSFLKEVKRMRIQFSEKNWADLLNGYQN
>MS0132 sUA5, SUA5 protein
MELAQIVERLKKNEVVAYPTEAVFGLGCNPNSKSAVEKLLILKQRPVEKG
LILVAHKLDLLLPFIDESRLKQSHWQLLTQQYDCPTTWVVPAKLSVPKFI
TGQFDSVAVRLCTHPAVAQLCEQTGFALTSTSANLSGLPPCKTAQQVRSQ
FGEFFPVLDMAVGNAVNPSEIRDIFSRQIFRRG
>MS1037 sUA5, SUA5 protein
MSQFFYIHPENPQVRLINQAVDILRNGGVIVYPTDSGYALGCMIGDKRAM
DRIVQIRHLPEGHNFTLVCSDLSELSTYSLVTNTAYRLIKNNTPGRYTFI
LTATKELPRRLMTSKRKTIGIRVPDNQIALDLLRTLGEPILSCSLMLPNE
EHITQSDPEEIRDRLEHQVDLIIHGGYLGQEPTTVVDLTEETPVILREGS
GPLDPFI
>MS1471 sUI1, SUI1 protein
MNDIVYSTETGRIKPEKTKQERPKGDGIVRIHRQTSGRKGAGISLIVGLD
LPDDELKKLAAELKKRCGCGGSIKDGNIEIQGEKRDLLKQLLEQKGFKVK
LAGG
>MS0497 sUL1, SUL1 protein
MQVRLFFYRCLSFCSIYIRISMLKKWFITKNVFLAVRPFSALKDSFREGY
TTQKLVKDIIAGLTVGVIAIPLSMALAIASGVPPQHGLYTAIVAGIIIAL
AGGSRFNISGPTAAFVVILYPVTQQFGLSGLLMATLLSGIILVIMALFRL
GRLIEYIPLPVTLGFTCGIGITIGTLQIKDFFGLTIDKMPEHYIGKVQAI
ITALPTINWADAMVGIVTLLVLINWHKLRLPVPGHLPAVIIGTLLSLVLI
HFGYHVASIGSAFEYTLPDGSTGHGIPSVLPQFALPWNIPNAQGEVIEWN
FAIIQNILPAAFSMAVLGAIESLLCAVVLDNMTDTKHHSNNELLAQGLGN
IAAPFLGGITATAAIARSAANVKAGGQSPIASIVHALLVLFALLFFAGAL
SYLPLSSMAALLLMVAWNMANVPQIIHLARRSGRNEIAVLTTCLVLTVIF
DMVIAISVGVLLASLLFIRTIAEMTKSFEIAHPEDLDDVLVYRISGPLFF
AAADNLFADLHEKTVHTDHEIRHIVLQCSAVTVLDAGGIHALTRFVQHML
PHQELYMCNMQFQPLRMIVKSNMLTEIQKINFSTDLKETYNKIRLVEAED
KKSEE
>MS0909 sacC, SacC protein
MRSFLPHFSLFYFHQGIMMIIFNNGKYKSILAAEQGELERIKSEVEKDRD
FRPYYHLAPSTGLLNDPNGLVFDGEKFHLFYQWFPFDAIHGMKHWKHFTT
EDFHIYTEADPLIPCELFESHGCYSGGALPVGDKIAAFYTGNTRRAADNQ
RVPFQNLAIFDRTGKLLSKRPLIENAPKGYTEHVRDPKPYFTKEGKIRFI
CGAQREDLTGTAIIFEMDNLDDEPRLLGELSLPAFDNQKVFMWECPDLLK
VGDNDIFIWSPQGKRREARRFQNNFHAVYAVGKLDDRTFNAAHIAELDQG
FDFYAPQTFAGLENQKHAVMFGWCGMPDLTYPTDKYKWHSMLTLPREITL
QGNRLVQRPIKEIYQNLTALSQISLQQQAEIQDLDRAYIKFDAENTAFNI
RFFANEQGQTLSLSYDGELVCLDRSQTEETEWMKKFASQRYCEIKNLRQV
EIFFDRSIIEIFLNDGEKALTSRFFIANRQNSVKTDRTLRLNVGYPKEIE
YK
>MS0923 sanA, SanA protein
MRCKMQGCKKIQSKIVSFLAKSSLKRLVQTCAILAGIAIVSLATLDQMIG
YSVRNDIYTDITKVPHRPYGVLLGTAKYFARNTPNLFYVNRLNAAEALFK
SAKIDYLLLSGDNRTLQYNEPRTMFKDLRKKGIGEEFLYMDFAGFRTLDS
IIRAKEIFNATPMVIITQRFHCERALFIAKFHHIDAICFAAEYPKDYPFV
RFREVFARLLMLWELFIEKEPHFLGTPEPLPPALPNY
>MS0003 sapB, SapB protein
MQSAQFMILLKISIAMCLGAFIGLERELKHKPVGVKTCVIISITTCILTI
VSIQSAEYYAEVSNNIRTDPMRLAAQVISGIGFLGAGVILRKNNDAISGL
TTAAIIWAAAGIGIATGAGFFFDAIIATLMILVAIRLSPYVMKLAHHRRQ
EKDIEVSFTFHLASIQAIGNITELFMSHQCKIEDIAIKDLYNGEVNLTFQ
CDIEDHNMLRDVYLHSKVLPDVLAVHLETA
>MS1081 sbcB, SbcB protein
MTDFSFFIYDFESFGVNPADDRPAQFAGIRTDKDFNIISDPVMFYCKQTN
DYLPAPEAVMVTGITPQECNEKGISEPEFAARILAEFSQPNTCIMGFNNI
RYDDEMTRYTFYRNFIDPYEYSWKNGNSRWDLLDLVRACYALRPEGINWP
LDEEGMPSFRLEKLTKANGIEHENAHDAMADVYATIAMAKLIKEKQPKLF
QFFFENRGKKEIEKWIDTAEMTPLVHVSGMLGNYRGNCTWIAPLAWHPIN
QNAVIACDLAQNIDDLLNKSAVELRENLYTQKTELENDGVLPVPLKLVHI
NKCPIIAPAKTLLPENAQRLGIDRQFCLDNLKKLQKSLDIRDKVIEVFNE
ERKFDDSDNVETELYSGFFSKADKNNMTILRTLEPEKLADSGLQFEDKRI
PDLLFHYRARHFYKTLNRGEQIKWQKYRRQKLEKSAVQFMESLQHLGEEN
SNHPDKLKLLQQIYDYGIKLLA
>MS0290 sbmA, SbmA protein
MLKNIRLNFNLSIGSGMNYSQELLTSLLWIFKAIGITAVLFSLTVYVLVK
TTRWGRQFWMLAAGYISPKRSKKPIGYFVIIVFFNLLSVRLDILFSEWYK
AMYNALQESHEKMFWIQMVVFSVLATIHIANVLLTYYLTQRFTIQWRTWL
NNEMVNRWTENQAYYKAQYVYNKLDNPDQRIQQDVLSFVSNSIEFATGVI
SSVVSIVAFTVILWGLAGPMTVVGITIPHAMVYLVFIYVLITSIFAFRIG
RPLINLNFTNERLNANYRYSLIRLKEYAESIAFFRGEKMEKNVLFKQFNQ
VIGNVWKMVHMTLKLSGFNLAVSQVSVIFPFIIQASRYFSKQIQLGDLIQ
TAQSFGRVQTALSFFRNSYDSFTGYRAVLDRLTGFYSAVNQANSASHISI
EDSESAVVFDKLTVKKPTGEALIKDLSLNLPQGASLLIKGPSGAGKTTLL
RTIAGLWSYSEGIVRCPQHHALFLSQKPYLPQGRLIDALFYPELAPENLD
LAQAAEIMRKVQLGHLTDRLEQENDWTRVLSLGEQQRLSFARVLICRPLV
AFLDEATASMDEGLEESMYRLLKTELPDTTIISVGHRSTLQIHHTQHLVI
NPQDQSWALS
>MS1316 sbmA, SbmA protein
MNWQTELNNSFSWLITTLIWVSLAFTFFALLLRKTDFGEKFWLVTKPCIE
QSNKFKTIGLILFLFLLILLEVRISVLNSFFYNGLYSALQDKKADAFWFF
ATINAMLVGFKIIHSIINYLIRQIFEIRWLEKFNDDMLSRWLDHKNYYRL
KYEKDLPDNIDQRIEQDAREFITGTVDLVDGILGAIVSIIEFTIILWGLS
GLLVLFDISIPKGVVFFIYTFIIIATALSVWIGYPLIKLNFNKEKLNGDY
RYSLIRIRDNAESIAFYDGEQKERQYLNERFKAIIKNRWAIVRQMLGLDG
FNTGVTQIAMILPLMLQAPRFFAGQATLGDMHQTVQAFNRLMRALSFFRL
FYEQFTLYQARLNRLYGFIGKLNELDTHLIPNPIECSQLVALENFGLKDA
KGNVLFEGINLELSAGDALLIQGASGTGKTTLLKAIAGIYPFETVGRSKR
PCNGKILFLPQRPYMPQGSLREAICYPNIDPHHPELESYMLKCHLDKYIF
ALDQENDWQAILSPGELQRVAFIRIFLTKPDVVFLDETTSALDEPTEHSL
YSKIRQALPGMIILSVGHRCTLQQFHTKHLVIGLDKSSRTI
>MS1255 sbp, Sbp protein
MLNVAYDVIRDFYKEYNLEFRQAYKAQQGQDLMISQSHGGASKQTLSVAS
GLPADVVTLTQSSDVDILVKKGLVDSHWQQALPNHSVPFGSVMVFLVKKG
NPKNIHDWHDLIRDDVSVIFANPKTSANGRFAYLSAYAYAKAQGDEQQAQ
AFMKKILARVPVLESGARGASISFTQRNLGDVLIAPENEAALAAKALGEN
SFSVIYPSYTAYTPVYVAEVNANTKINGTHEQAQAYLRNLWSEAAQELAV
KHHFRPTNEKILQKSTALFPPVNSFDVNQVFGDWAIINQTHFADNALFDQ
LYIAAQHKDK
>MS1311 sdaA, SdaA protein
MISVFDMFKVGIGPSSSHTVGPMKAGKQFIDDLITQGNIGKITRIHADVY
GSLSMTGLGHNTDITIIMGLAGYLPHNVDIDSIADFISRVKQTALLPVAG
GSYTVDFDFKQDMQFHDSFLSLHENGMTLTAFMNDEIAYRQTYYSIGGGF
IVGEAHFNQAQNEEVPVPYPYNNAADILRHCHDTGLPISTVVFRNEVALH
GKESVEHHLSLIWQTMQDCIKHGLKTEGLLPGPLKVSRRAPALHRLLQAN
SNLNNDPMQVIDWINMFALAVNEENAAGGRVVTAPTNGACGIVPAVLSYY
EKFISPLNAETVERYLLVCSVIGSLYKMNASISGAEVGCQGEVGVACSMA
AAGLTEILGGNPEQVCIAAEIAMEHNLGLTCDPVGGQVQVPCIERNAIAS
VKAINAARMALRRSTNPRVTLDKVIETMYETGKDMNAKYRETSKGGLAIK
VVCS
>MS0977 sdaC, SdaC protein
MLHIILTLNRKFKMKNKTFGSALLVAGTTIGAGMLAMPLTSAEMGFTYTM
ALLFLLWILLSYSALLFVEVYQTVQRKDAGIATLAEQYFGMVGRVLATLS
LVIFMYAILSAYVTGGGSLLAGVLPFLGEHAAPISIIAFTVILGIFIVIS
TGAVDGLTRLLFMIKLVAFVLVLTMMLPLVQGENLMAMPLKEFLIISASP
VFFTSFGFHVIIPSINNYLDGNIKRLRAAIIGGTALPLVAYILWQMATHG
VFPQAKFVEIINNDPTLNGLVDATYHVTGSNLISGSVRLFSTLALVTSFL
GVSLSLFDCLDDLLKRINIKAGRLALGVLTFLPPLAFALFYPEGFIAALG
YAGQMFTFYGLVLPVGLAWRARKLHPNLPYRVIGGNLTLLIALLLGLLIM
NVPFLIEGGYLPKVIG
>MS1895 sdaC, SdaC protein
MYYYNSSKSYLTWKTFMEKSMKNKKQPSLLGGAMIIAGGTIGAGMLANPI
STAGVWFLGSLLILIYTWFCMMSSGLMLLEANLHYPTGSSFDTIVKDLLG
KGWNILNGLSLAFVLYILTYAYITSGGGITEGFLNQLLSSEQSAVEIGRS
SGSLIFTFVLAVFVWFSTKAVDRFSTILIGGMVISFFLSVSGLISSANAD
VLLNSATSQDTQYLPYALVALPVCLVSFGFHQNVPSLVKYYNRDAGKVSK
SVFVGTFIALIIYILWQLAIQGNLPRAEFVPVIEKGGDIAALLAALSKYI
QTDYIALALNFFAYMAIASSFLGVTLGLFDYIADLCGFDDSKAGRTKTAL
ATFLPPLLLSLQFPYGFVIAIGYAGLAATIWAAIVPALLAKASRKKFNKP
SYSCFGGNLMVYFIIIFGVLNILSQLAMQFGWLPEFKG
>MS1652 sdhA, SdhA protein
MQTVNVDIAIVGAGGGGLRAAIAAAEANPNLKIALVSKVYPMRSHTVAAE
GGAAAVIKEEDSYDKHFQDTVAGGDWLCEQDVVEYFVQHSPVEMTQMERW
GCPWSRKQDGDVNVRRFGGMKIERTWFAADKTGFHLLHTLFQTSIQFPQI
QRFDEHFVLDVLVDDGHARGVVAMDMMEGKLVQINANAVVIATGGGCRSF
KFNTNGGIVTGDGLSMAYRHGVPLRDMEFVQYHPTGLPNTGILMTEGCRG
EGGILVNKDGYRYLQDYGLGPETPVGKPENKYMELGPRDKVSQAFWQEWK
KGRTLKTAKGVDVVHLDLRHLGEKYLHERLPFICELSQAYEGVNPVNEPI
PVRPVVHYTMGGIEVDFNSETRIKGLFAVGECASSGLHGANRLGSNSLAE
LLVLGRVAGEYAAQRAVEATAANQTAVDAQAQDVVRRLEDLFNQEGTENW
ADIREEMGTAMEEGCGFYRDQASMQTAVDKIAELKERCKRIRIQDRSSVF
NTNVLYTVELGYILDVAQAIANSALERKESRGAHQRLDYVERDDTNYLKH
TLAFYNENGAPRIDYSPVKITKSQPAKRVYGAEADAAEAAAKAKENANG
>MS0327 secA, SecA protein
MLKTIATKIFGSRNDRVLRKLNKVVKKINGLEPAFSALTDDELKAKTAEF
RARLEKGESLESLMPEAFATVREASRRVLGMRHFDVQLIGGMVLTNRNIA
EMRTGEGKTLTATLPCYLNALTGKGVHVVTVNDYLANRDAETNRPLFEFL
GMTVGVNIPGLPPEVKRAAYQADITYATNSELGFDYLRDNLAHSKEERFQ
RQLHYALVDEVDSILIDEARTPLIISGPAEDSSELYIAIDKLIPLLVKQD
KEDTEEYQGDGDFTLDLKTKQAHLTERGQEKCENWLIENGFMTENESLYS
PAKIGLVHHIYAALRAHTLFERDVDYIVKDGEIVIVDEHTGRTMAGRRWS
DGLHQAIEAKEHVKIQGENQTVASITYQNYFRLYEKLAGMTGTADTEAFE
FQQIYGLETIVIPTNRPMIRDDRTDVMFESEAYKFQAIIEDIKECVARSQ
PVLVGTASIEKSELLSNELDKAGIPHNVLNAKFHAQEAEIIANAGYPGAV
TIATNMAGRGTDIVLGGNWRAEAAKLENPTEEQLEALKAAWQERHDVVMK
AGGLHIIGTERHESRRIDNQLRGRSGRQGDPGSSRFYLSLDDSLMRIYLN
EGKLNMMRKAFSTPGEAMESKLLAKVIASAQAKVEAHNFDGRKNLLQFDD
VANDQRHAIYAQRNDLLDHEDISETIKAIREDVYNEVIDQYIPPQSLEEQ
WNIAELEKRLKQDFALDLPIQQWLEEDNQLHEDNLRERIIASAVEEYQHK
EEIVGAETMRNFEKGVMLQTLDELWKEHLAAMDQLRKGIHLRGYAQKDPK
QEYKKESFQMFTEMLDALKLTVIRTLSRVQVRTQEEAQAEAAQQAAAESK
DYADDSASGERSVAQTTQRIGRNDPCPCGSGKKYKHCHGNRAAHEA
>MS2214 secB, SecB protein
MAEENQTPATATEEQQAVLQIQRIYVKDISFEAPNLPHVFQQEWKPKLNF
DLSTEAKQLGEDLYEVVLNISVETTLEDSGDLAFLCEVKQAGVFTISGLE
DMQMAHCLTSQCPNMLFPYARELVSNLVNRGTFPALNLSPVNFDALFMEF
LQRQEQESQNAESSTEVQH
>MS1563 secD, SecD protein
MLNRYPLWKNLMVILVIAIGVLYALPNIYGEDPAVQISGTRGQTATETTL
TDVQTLLTSNQLEPKSIKLEEGSILARFHNTDDQLLAKDKITEKLGQSYS
VALNLAPSTPAWLSSIGGNPMKWGLDLRGGVRFLMEVDMNTALSKRQEQL
QDSLRTELRKEKIQYSAIKNTDNFGTGVTLVKPEQLSDAGRFLRKQHPNL
NITESADNTLNLALSEQALTEARENAVEQNLGILRKRVEELGVSEAVIQR
QGAERIVVELPGIQDTARAKEILGATATLEFRLVNQNVSPEAMVRNIVPT
DTEVKFMRDGQPVALFKRAVLGGEHITNASSGLDQQTSRPQVSVTLDSEG
GEIMADTTRLNIKKPMATLYVEYKDSGKKDENGKVILEKHEEVINVATIQ
GRFGSQFQITGIDSPAEAQNLSVLLRSGALTAPIQIVEERTVGPSLGAQN
VAQGLNAGLWGLAIVIVFCLVFYKVFGLVASLALCANMVLVVGLMSLLPG
ATLTMPGIAGIILSVGMSVDANVLIFERIKEELRNGRPIQQAINEGYNGA
WTSIFDANLTTILTSIVLYAVGTGPVKGFAVTLALGVAISMFTAITGTRM
LINWIYGGKRVEKLSI
>MS0204 secE, SecE protein
MALAIDKKKKNAPEEVEQKSKGLNTFLWVLVAVVIAVAAFGNVYYAEQFS
TAVRVVAVVVLLAVALGIAAVTNQGKVALAFFGESRTELRRIVWPTRPEA
MQTTLIVIGVTVLTSLILWGFDSIIVSIINFLTDLRF
>MS1564 secF, SecF protein
MTVTTNTKQKHEYKGIGLPFSLVHFMKYRKFGYLFSIIVIALSLFSIFTK
GFNWGLDFTGGVIIDTHFSQPADLEQVRSTLKTGGIESALVQTTGSANDV
AIRLPASASDANIGNNIKNMMTSLDKDIQIRSVEFVGPNVGEELTQGAIY
ATLATLILLLAYVGMRFEWRLGLGGILGLAHDVIVTIGLFSFLQIEIDLT
FVAAILTVVGYSLNDSIVVFDRVRENFRKIRRLSSEEVINISLTQTLSRT
LMTSVTTLFVVFSLLFFGGPSIYSFSLALLIGIGFGTYSSIFVAIALAFD
FGLDREHMVVKVVEKEDFQEGL
>MS0729 secG, SecG protein
MYQVLLIVYLLVSIALIGFILVQQGKGADAGASFGGGASGTVFGSAGSAN
FLSRTTAILATIFFVISLVIGNINSHKNNVQQGTFDDLSQAAEQIQQKTV
PAPVENKNADIPQ
>MS2028 secY, SecY protein
MAKQPGYQSRSTQSGSSELKSRLLFVLGALIVFRVGSFIPVPGIDAAVLA
QLIEQQKGTIIDMFNMFSGGALSRASIFALGIMPYISASIIIQLLATVYP
ALNELRKEGESGRRKISKYTRYATLGLATLQAIGISTGLPNMLPGLVPNL
GFGFYFTAVISLVTGTMFLMWLGEQITERGIGNGISLIIFAGIVAGLPSA
IGQTIEQARQGQMHLLVLLLIAVIVFAVTYFVVFFERGQRRIKVEYAKRQ
QGRQILGGHSTHLPLKVNMAGVIPAIFASSIILFPATLTQWFGEGSSLEW
LTDLSMLLHPGQPLYLIVYAIAIIFFSFFYTAMQYNPRDTADNLKKSGAF
IPGIRPGEQTSRYIDKIMTRLTLIGGLYITFVCLVPYIMMSAWNVQFYFG
GTSLLIVVVVIMDFIVQIQSHLMSTKYESALKKANLRGFGQ
>MS2338 selA, SelA protein
MTALFQQLPSVDKILKTPQGEQLVTEFGHSAVVNCCRHLLAQAREKIKIE
KKLPHFFTDFNHTIAEVNRYLANQQQVKIKSVHNLTGTVLHTNLGRALWA
QSAQQAALTAMRQNVALEYDLEAGKRSHRDNYVSELLHELTGAQAACVVN
NNAAAVLLMLATFAQGKEVIISRGELIEIGGAFRIPDIMAQAGCKLVEVG
TTNRTHLNDYRRAINENTALLMKVHSSNYQICGFTCEVSEQELVELGKEF
NIPVVTDLGSGALTDLSRYDLPKEPTVQEKLVQGADLISFSGDKLLGGPQ
AGIIVGKKELIQQLQSHPLKRVLRCDKVILAAMEATLRLYLQPEKLTEKL
TSLRLLTQPLEQLRQQAEQLKAKLENLLKDDFLLQIESSLAQIGSGSQPM
AKIPSIAVTIAEKNSEKLTALLARFKKLSTPIIARVENDKIRLDLRSVTA
IETLLITLEELNQDQ
>MS2339 selB, SelB protein
MIIVTSGHVDHGKTALLQALTGMNTAHLPEEKKRGMTIDLGYAYLPVGDK
ILGFIDVPGHEKFLANMLAGLGGIHYAMLIVAADEGIQAQTKEHLAILRL
LQIEKIMVVISKADRASSAKIDELKTKILTDYPFLAESPFFVTSAVNGRG
IAELREFLTALPNPADKDKPFRYAIDRIFTVKGAGTVVTGTAFSGKVKID
DELYLSNGGKVRVKNIHAQNRQNTEGLAGQRLALNINADLDRTQIERGDW
FFSQAPFEPTERFTIQLTAETSLTENQPVHVYHAASRTTGKLALLIEKAI
YPGQQTFAELILDNPLFLAYGDRIILRSGDAKQLVGGGKVLEINSPKRHK
RSEQRLQWLVQLQQAHSADERIALYLQDKAVEARAITWIEQLTELQLNEI
INKNNDIRFQHWCFNQNYQHRQNHKILTALSFYHDQHEDQLGLGKARLYR
IAALNQPEKLIYHFIDELLEQGKLQQTRGWLHLAEHKIQFSTEERGLWQL
VLDEFEKQKGQPLWVRDLAQNLGFDETLMRNFLYKAGKLGFLTAVVKDRF
FLTEHIYGYARLIKQMIEQNGAVSVNQLRDELQIGRKLAVQLMEYFDRSG
YLRRKGNIHILRDTEAFDL
>MS1241 selD, SelD protein
MLGTILHSQLEQFVDPHLLVGNDTNDDAAVYDIGNGTCIISTTDFFMPIV
DDPFDFGRIAATNAISDIFAMGGKPIMAIAILGFPINVLPAEVAQKIVDG
GRFACREAGIALAGGHSIDAPEPIFGLAVTGIVPTEKVKRNASAEAGSKL
YLTKPLGIGILTTAEKRGKLKPEHKGLATEVMCQMNLIGSQFSQLESVTA
MTDVTGFGLLGHLAEICEGSNLVADVHFNKIKMLDGVPYYIEQGCLAGGV
TRNYESYGIKIGAITEFQKAVLCDPQTSGGLLVAVKPEGETQLLELAAQA
GIELIEVGELRRRVDNSDPVIIRILD
>MS0863 seqA, SeqA protein
MLREDRMKIIEVDEELYQYIASQTKSIGESASDILRRLLNLPVSGVNLTA
VDLTQSTMNSTNEEKGTQLPAEKNVVAETPKPSSEQEIRTPARKQSTQSI
QHIVTKVKNLLQSEAFQEESKMVVRFLNILSVLYRTNPESFAQATEQETS
QGRTRTYYARDEATLLAAGNHTKPRQIPDTPYWVITNTNSGRKMLMLERT
MQFMELPEELIDEVRPYFAVV
>MS1743 serA, SerA protein
MTNKVSLDKSKIKFLLLEGVHQNALDVLHAAGYTNIEYHKKALEPDELKE
AIKEAHFIGLRSRTNLTADILEHANKLIAIGCFCIGTNQVALEAAEEKGI
PVFNAPFSNTRSVAELVLGEILLLMRNIPAANAQVHRGEWNKSAAGSHEV
RGKKLGIVGYGHIGSQLSIIAESLGMNVFFYDVETKLPLGNAQQVSTLEE
LLSSCDIISLHVPELPSTKNLMSAERIAQLKPGSILINAARGTVVDIDAL
AEALEQGKIHGAAIDVFPKEPASAAEAFESPLRKFDNVILTPHIGGSTAE
AQENIGTEVASKFVKYSDNGSTLSAVNFPEVSLPEHRTAKRILHIHHNRP
GILNKINQVFVDENINIAAQYLQTDAKIGYVVIDVETDDSTDLLQKLKSI
EGTIRARVLF
>MS0068 serA, SerA protein
MIMKVVISHRLHDNGMKVLEDANAQVAITNDGNPKIMLPELLDAEGLIIR
IGSIDRETMLQAKNLKVIGRPGVGVDDVDVKTATELGIPVVIAPGSNTRS
VAEHAFALMFACAKDIVRSDNEMRKGNFAIRSSYKAYELNHKTLALIGYG
RIGSILAQMSKAIGMNVKVYDPFVKQGTIEQEGYIYCTELDDVIRDSHVI
SIHVPLTNETRNLIGEHEFSLMNEHTILINCARGEVIDEPVLTKVLQEGK
IHSAGLDVFACEPVDINSPLFQLDNVIVSPHMAGQTKEAASGVATMAAEG
VVAVINGEKWPYVCNPEAYNHPRWNK
>MS1758 serB, SerB protein
MQTSEFINLTLKDIKQHYSPFPNKLINNQPQTEGRDYFILFGTNLEPAKL
QAFQQKCGENFQIFDCWNNLHNIVVLLKGHWQKSYETHAHDLTLDAAKID
FNANLAEQGLLVMDMDSTAIQIECIDEIAKLAGTGEEVSAITAAAMRGEL
DFEQSLRRRVSTLKDAPETILQEVRLQLPLMPGLKETVRILQQHNWRVAI
ASGGFTYFADYLKELLNLDAAVSNQFDIENGKLTGRVKGDIVHAQYKADT
LKRLAREFNIPLENTVAIGDGANDLLMLKQANLGAAFHAKPKVQQQAQVV
VNFADLTALLCLLSAGEKIKHLS
>MS1573 serC, SerC protein
MSNVFNFSAGPAMMPPAVLKKAQEELLNWQGQGTSVMEVSHRGKYFMELI
TQADKDFRELYNIPENYKILFLQGGARGQFAAIPMNLANNKGKALYLNTG
HWSATAAKEARNFTEVDELNITEQIDGLTRVNRLDFSDIAEQYDYVHYCP
NETITGVEINEIPNVGNAVLVADMSSNIMARKLDISKFGIIYAGAQKNLG
PAGIVIVIVREDLIGHARKATPSIWNYEVQANADSMINTPPTFAWYLCSL
VFKDLLANGGIDTVEKRNAQKAALLYDYLDQTVFYHNTIAKENRSVMNVT
FTTGDDQLNAKFVAQATEAGLQALKGHKVFGGMRASIYNAMPVEGVEALI
AFMKKFEAENA
>MS1450 serS, SerS protein
MIDPNLLRNNLAEVAATLKLKRNFILDTKELAELEEQRKALQVETETLQA
KRNARSKAVGAAKARGENIAPLLAEMDDMGHELATVKAELDEILAELNTI
ALTIPNLPADEVPLGKDDSENKEISRWGTPRQFDFEIKDHVTLGENLAGG
IDFAAGAKLSGARFAVMKGQVAKMHRALAQFMLDLHTEQHGYTETYVPYL
VNHTTLYGTGQLPKFGEDLFHTTPLEGEVPYALIPTAEVPVTNLVRDEIL
NTEDLPIRMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVDPDKS
MEALEELTAHAEKVLQLLGLPYRKMLLCTGDMGFGSCKTYDLEVWVPAQD
TYREISSCSNMWDFQARRMQARCRSKTDKKTRLVHTLNGSGLAVGRTLVA
VLENYQNEDGSVTVPEVLRPYMGGLEVIGK
>MS0390 sfcA, SfcA protein
MDEQLRQAALDFHEFPVPGKIEVTPTKSLATQRDLALAYSPGVAMPCLEI
QEDPAKAYNYTAKGNLVAVISNGTAVLGLGNIGALAGKPVMEGKGVLFKK
FAGVDVFDIEINEKDPEKLVEIIAALEPTFGGINLEDIKAPECFYIEQKL
RERMNIPVFHDDQHGTAIISSAAVLNGLRIINKKIEDVRLVASGAGAASI
ACLNLLVSLGMKRENITVCDSKGVIYKGRDENMDATKKLYAIDDNGTRSL
ADAIPNADIFLGCSAAGALTQEMVKTMGPNPLILALANPNPEITPPEAKA
VRPDAIVCTGRSDFPNQVNNVLCFPFIFRGALDVGATTINEEMKMAAVRA
IADLALAEQSDVVSSAYTDESEVTFGPEYVIPKPFDPRLIIRIAPAVAKA
AMDSGVATRPIQNFDAYIEKLTQFVYKTNLFMKPVFNQAKADKKRVLLTD
GEETRILHAVQEISTLGIAYPVLVGRLDVIEAQIKRLGLKIQAGVDFEVL
NTDNEEIYQQCWSLYHNKLKRHGVTEAMAKRRMLTNSTAIGSALLELGYA
DAMLCGLVGTYSSSLSLLKEVIGIKENVDIPATVNGLVLPSGNLFIADTF
VNLAPTAEELAEITLMAAEEVRRFGIEPQVALISHSNFGTSEDQSAVKMR
EVLQLVKTQAPDLIIDGEMHANVALNENLRREVMPDSPLKGAANLLIMPD
MESARISLNLLQGTATPITIGPILMGMKKPAHILTSVSSVRRIINMVAIA
AVKAQQN
>MS1695 sfp, Sfp protein
MTTFIAFGNIQQHYPLRQIPPEFLTEELGRKPTENIRVKRRHRSRWIAHF
LLWELCKKAQIPTALLADIQRSVSGRPYFTPPHIDFNISHSGDWVAVILS
VNTPQSIVGIDIEHPKKMRNYTALLAHFASQREQNWFAGEADADSAFYRC
WCLREAILKSQGVGIVQLSEVFHDPQNLRLQSAYCPSGRLIFTDELPFYF
ACFAANSQLEQAQYFCWENNGFSPVSLNNAIKYSVNKT
>MS1230 sfsA, SfsA protein
MRLPPLQAAKFIRRYKRFMADVELANGNILTIHCANTGAMTGCAEKGDTV
WYSDSKSTTRKYPCSWELTELSNGNLVCINTHRSNQLVQEALQNKVIKEL
AGYSEIYPEVKYGEENSRIDFLLKGEGLPDCYVEVKSITLVKNNIGMFPD
AVTTRGQKHVRELLAMKKQGYRAVVLFAGLHNGFDCFKTAEYIDPDYDKL
LRQAMKEGVEVYAYAGKFDKIQEIPTALSLAEVVPLCFN
>MS0150 sgaB, SgaB protein
MKIMAVCGHGLGSSFMMEMNIKKALKTLGKEAEVSHQDLASVTANEADLF
VMGADIANSSGLPADKVVVVKNIVSVKEFEEKLAEYFNQ
>MS0022 sgaT, SgaT protein
MKNKEVYMETLYNLFLGFNAQVLSKAPFLLGIVACFGYILLKKDTTTIIK
GTIKTIVGFMMVQVGSGVLTTSFKPIIEKLSEFHHLAGAVIDPYTSMQST
IETMGENYGWVGYAVLLALALNILLVVCRRITGIRTIMLTGHIMFQQAGL
VAVFYMIIGASMWETVIYTAVLMALYWGISSNIMYKPTQAVTGGAGFSIG
HQQQIASWVATKLAPKLGDRNDSVDNMKLPKWLHIFHDSISATALVMTVF
FGIILLSFGLDNLQTMAGKTHWFMYILETGLKFAVAIQVIVTGVRMFVAE
LSEAFKGISERVIPNSVLAIDCAAIYAFSPNAMVFGFMWGAIGQFFAVGV
LLMVGAPVLIIPGFIPMFFSNATIGVFSNQFGGWKAVMKICFVMGIIEVL
GSAWVIQLLASQGTTFNGWMGMADWALFFPPVLQGIVSIPGFFFIILALA
MVYMYFASKKLRADEAAAAAAGKTLEQMDGYGLDDIDEPEAETVEQSDNE
TVTQSAVKPVRILAVCGSGQGSSMMMKMKIKGYLDKRGIPNIMDSCAVTD
HKGKLDSTDIIVCSKHLADEISANDKISVLGVQNMLNPNSFGDELLALIK
KYQN
>MS0151 sgaT, SgaT protein
MDSILFFILDILKVPSVLVGLIALVGLVAQKKAFPDIIKGTVKTILGFLV
LGGGATVLLSSLTPLGSMFEHAFNVQGIIPNNEAIVSMALEKYGTATALI
MAFGMVANIIVARFTRLKFIFLTGHHTFYMACMIGVILTVAGFEGVQLVF
VGALTLGLIMAFFPAIAHFYMKKITGSNDVGFGHFGTIGYVLSGAIGQAV
GKGSPSTEEMDLPKNLSFLRDSSISISLTMMIIYFVLAIASGNEYVSTHF
SNGQHYLVYATIQAITFAAGVYVILQGVRLILAEIVPAFTGFSEKLVPDA
KPALDCPIVFPYAPNAVLVGFLSSFMGGIIGLVLLGQLNWVLILPGVVPH
FFCGATAGVFGNATGGRRGAILGAFAHGLLITFLPVFLLPVLGSLGFANT
TFSDTDFGGVGIVLGNMAQFMSKDMIMIVIVAIFLLLVGYNYLAKKPVKT
EE
>MS0047 sgaU, SgaU protein
MRKHKLGIYEKALPKGISWQDRLSIAKACGFDFVEISIDETDERLARLDW
TPEQRIELVSAIIKTGVTIPSMCLSGHRRFPFGSHDEATRQKAYEIMEKA
IKLAVDLGIRTIQLAGYDVYYEEQDEGTLQRFREGMEWATELAASNEVTL
AMEIMDTKFMSSISRWKKWDEIIKSPWFTVYPDVGNLSAWNDNVEEELTL
GMDKISKIHLKDTYKVTESCKGQFRDVPFGEGCVDFVNVFRILDKLNYRG
AFLIEMWTEKSDEPIAEIINARRWIEQKMKEGGFQC
>MS0056 sgbH, SgbH protein
MSKPLLQIALDSTTLEKAVADAKQAESSVDIIECGTILACAEGMKAVSVL
RALHPQHILVCDLKTTDAGTVLAKMAFEAGADWLTVSAAAHPATKAGCKK
VADDFNAANPDLNVKKEIQIELYGNWTLEDAESWLESGIKQVIYHRSRDA
ELAGKGWTEEDLELMKRLSALGMEISITGGIVPEDIHLFKEIKNAKAFIA
GRALVGEKGRQTAAAIRDKINEYWN
>MS0020 sgbH, SgbH protein
MSKPLLQIALDSLSLEKAVADAKKAENSVDIIEIGTILACAEGMKAVSTL
RALHPNHILVCDLKTTDGGAILAKMAFEAGADWLTVSAAAHPATKAACKK
VADEFNAAHPELKVKKEIQIEIYGNWTLEDAKQWVELGVTQAIYHRSRDA
ELAGKGWMPEDIEKMKQLESLGLELSITGGIVPEEIHLFKDIKKAKVFIA
GRALVGEKGQQTAAAIRSEIDKYWV
>MS0173 sirA, SirA protein
MKYNTMNEIISNHTLDALGLRCPEPVMMVRKQIRHMQDGEVLLIIADDPA
TTRDIPSFCQFMDHTLLNSETESLPFKYWVKKGL
>MS1561 sirA, SirA protein
MQYRLDLTGYICPLPLLMARQVLDKLEKGAILTLFLNHTSAVTDFVSLCE
QQGYQLISTENSADKFILTIKK
>MS1198 sixA, SixA protein
MYLLGVRMKIFIMRHGEAEMLAKSDKARHLTENGKNQALQQGLWLKSNNI
NLDLVIVSPYARAIETLDQINQAYDNNLTDKTEIWDGLTPYGDAEMISDY
LATIAEEQPEMSVLLVSHLPLVGEIVAELCGKNPISYHAATIAQIEWDTE
KGVIEQIKYYGR
>MS1359 slp, Slp protein
MIYQIISEDKMKKWLIFPLIVLLSACVPAPEGLERDEFTIQSLRQIEDSD
YVCQCRKVRLGGKIISAEALKNQTKLEILSLPITTYSAKPVIESATDGRF
IAYLDGFADPASLKDQYITVAGILKEQWRGKIDEADYLYPIIKVTAYKQW
RLAKEYYYEYDDWHDYRFRRFGHFRHWGWDPFWRPELKLRYRLY
>MS2199 slpA, SlpA protein
MKVAKNIVVGIAYQVRTEDGVLVDEAPTNQPLEYLQGHNNLVIGLENALE
GKAVGDKFEVRVKPEEGYGEYNENMVQRVPKDVFVGVDELAVGMRFIADT
DMGPLPVVITEVSENDVVVDGNHMLAGQELLFTVEVVSAREATPEEIAHG
HIHSGDHDHGHGGCGCGGHGHEHDHDHSHGGCGCGGHGHHHDHGHDHHHG
EGCCGGHGKKEGHGHGNCGCGGHGH
>MS1326 slyB, SlyB protein
MKKMTLALAVLVSLGLTGCANTDVYSGDVYTGAQSKEARSISYGTIVSAR
PVKIQANNQGVIGTVGGGVLGGITGSTIGGGSGRAVASAVGAIAGAVAGS
KIEEKVSQVDALELVIKKDDGKEIVVVQKADASLKAGARVRIVGGSTLNV
SAI
>MS0156 slyX, SlyX protein
MAIVQHLFGITMQNSANLEQRIAELEMKITFQEGIIEELNQALIEQQFVI
DKMQLQMRHVANKLKDLQPANIATQAEETPPPHY
>MS0041 smf, Smf protein
MAQYSAEQLSEFDAAEWRKIGWNDQQIQTWLNPNMRYLEPALRWNEQPEQ
HILHYRQENYPELLKQIHSAPPLLFIKGNPELLTQPQIAIVGSRNCSDYG
EYWAKHFASELSATGFVITSGLALGIDGFCHQATVEQQGQTIAVLGSGLQ
HIYPARHKKLARRIIETNGALVSEFFPTHPPIAENFPRRNRIISGLSLAT
LIVEATERSGSLITARYALEQNREVFAIPGNIQNQYSQGCHTLIKQGAML
VERISDILENLPHFSINYRPPAKVRSQVQTAQLAAPEVQVSYPELYKHIS
SLPISIDDLINATGLNVNELLVQLLELELQNLICQQNGLYQRN
>MS1912 smpA, SmpA protein
MQIKPIITALLLALSVTSCSTVSKVVYRVDVPQGNYLESAAVSQLQVGMT
REQVQYILGTPVLNDPFSTNTWYYVYLQQRSYETPEQHTLTVNFNQQGTV
ESFDLDKPLPDQEKQVVNNANITMPESQSTSWWQFWK
>MS1145 smpA, SmpA protein
MKLTHLLLSAVAATSLIACGNLSDVTDEGTSENLVWPKIDESRFNHDGSQ
FGSWPNWDSVRMIERGMNKDQIRNLIGSPHFSEGLYGIREFDYAFNYREN
GVHKICQYKILFDKNMNAQSFFWHPNGCNANSSFSLSADFLFDFDKDSLT
ERGKKVVDSVAEQLKASKAKTVKVAGFTDRLGSEAYNLELSQRRANQVKE
WLIARDVKADIDAIGYGSAQQIKPCTGLKAGKALRDCLRPNRRVEISSSG
TVLKKFEENSKKSVGPAVFYQK
>MS1505 smpB, SmpB protein
MIYMTKKKVKPGSNTIALNKRARHDYFIEEELEAGLSLQGWEVKSMRAGK
ANISDSYIIFRDGEAYLFGATIQPLTVASTHVVCDPTRTRKLLLNQRELA
SLFGKANRDGYTIVALSLYWKNAWAKIKIGLAKGKQQHDKRNDIKDREWK
MQKERIMKNANRG
>MS2306 sms, Sms protein
MAKAPKTAYVCNDCGAEYSRWQGQCLACKAWNTISEVRLVSAKQPANRND
RFSGYAGETQAKIQTLAEINLQETPRFSSGFKELDRVLGGGIVPGSAILI
GGHPGAGKSTLLLQVMCGLARNMTALYVTGEESLQQVAMRANRLGLPTDR
LQMLSETSVEQICSLADQLKPQILVIDSIQVMHLADIQSSPGSVAQVREC
ASFLTRYAKTRQVAIIMVGHVTKDGTLAGPKVLEHAIDCSLLLEGESDSR
FRTLRSHKNRFGAVNELGVFGMTEQGLREVKNPSAIFLSRGEEQTPGSSV
MVLWEGTRPLLVEIQALVDHSMLANPRRVAVGLEQNRLALLLAVLHRHGG
LQMADQDVFVNVVGGVKVTETSADLALLLALISSFRNRALPQDLVVFGEV
GLAGEIRPVPSGQERISEAAKHGFKRAIVPYGNKPKSAVENMQVFTVKKL
ADALDILDSLDY
>MS0776 smtA, SmtA protein
MIDFRPFYQQIAVSELSSWLETLPSQLARWQKQTHGEYAKWAKIVDFLPH
LKTARIDLKTAVKSEPVSPLSQGEQQRIIYHLKQLMPWRKGPYHLHGIHV
DCEWRSDFKWDRVLPHLAPLQDRLILDVGCGSGYHMWRMVGEGAKMVVGI
DPTELFLCQFEAVRKLLNNDRRANLIPLGIEEMQPLGVFDTVFSMGVLYH
RKSPLDHLSQLKNQLRKGGELVLETLVTDGDEHHVLVPAERYAKMKNVYF
IPSVPCLINWLEKSGFSNVRCVDVEVTSLEEQRKTEWLENESLIDFLDPN
DHSKTIEGYPAPKRAVILANK
>MS1338 smtA, SmtA protein
MKSELICYKKMPVWNKNSLPKMFQEKHNTKAGTWGKLTVLQGKLKFYTLN
EDGSIVNEHIFSANTDTPFVEPQQWHKVEALSDDLECYLEFYCTKEDYFG
KKYNMTATHSDVLKTAKIITPCKVLDLGCGHGRNSLYLALKGYDVTSWDH
NAASIAFLADSAAKENLQIQTAVYDINNANIQENYDLILSTVVFMFLDRE
AVPAIIDNMQKHTNAGGYNLIVAAMSTEDMPCPIPFAFTFGENELKNYYQ
GWEFVEYNENIGELHKTDKNGNRYKMKFVTMLAKKVK
>MS0203 smtA, SmtA protein
MTTFMNNKTKSAGFTFKQFHVSHDKCAMKVGTDGILLGAWASLQGNRYLD
LGTGSGLIALMLAQRTQTDCHITGVEIDPSAYRQATENVRQSPWADKIQL
EQQNIVDFTRTCTKKFDTVLSNPPYFEQGVDCRDKQRDTARYTQTLSHSD
WLNLAADCLTNTGRIHLILPYAAGKNLQKQTALFCARCCEVITKSGKIPQ
RLLLTFSKQPCTTEQSRLVVYNEQNQYTEQFIALTRDFYLNF
>MS0467 smtA, SmtA protein
MSLNLNQVSLLQNVTRYWNNRAEGYSRHNQQELQSIKRLKWQQLLLAHAP
KKQNLKVLDIGTGPGFFAIIMAQAGAQVTAIDATSNMLEQAKYNAAQAMV
DIRFVRGDVHHLPFADESFDLIISRNVTWNLSEPEQAYKEWHRVLKCGGN
LLNFDANWYLFLYDEQRRRAFEQDRASTIRLNIPDHYADTDTSAMEAIAR
KLPLSRQLRPHWDMNALLNIGFSQLMADTRIGEFLWDDEEKVNYRSTPMF
MIVAQK
>MS1894 smtA, SmtA protein
MKESVYDSEGFFELYQKLRANPGSLNEIVEKPTMLSLLPDITGKTLLDMG
CGTGGHLQMYLRLGAKRVVGIDLSASMLKQAEIDLGKLCENRLQFSSGSF
SLHHLPMEQLDQLPEAQFDVITSSFAFHYVENFPALLTKIANKLTARGSL
VFSQEHPVVTAYQGGERWEKDENKQQIAYRLNFYRDEGKRERSWFKQPFL
TYHRTISTIVNNLIQVGFTIEKMAEPMLADQAEWQTEFKDLQHRPVLLFI
RAKKS
>MS2368 smtA, SmtA protein
MNIQLICETENSQNFTALCKEKGLTHDPASVLALVQTETDGEVRLELRKL
DEPKLGAVYVDFVAGTMAHRRKFGGGRGEAIAKAVGVKGNELPSVIDATA
GLGRDAFVLASIGCRVRLVERHPVVYLLLQDGLRRAYADPEIGEMMQKNM
QLLPVHHITELNPFEDFADVVYLDPMYPHKQKSALVKKEMRVFQYLVGAD
SDSNLLLEPALKLAKKRVVVKRPDYAEFLAEKAPQFSRETKNHRFDIYSV
NV
>MS0706 smtA, SmtA protein
MSKDTIFSTPIEKLGDFTFDENVAEVFPDMIQRSVPGYSNIITAIGMLAE
RFVTADSNVYDLGCSRGAATLSARRNIKQANVKIIGVDNSQPMAERARQH
IHAYHSEIPVEILCDDIRNIAIENASMVILNFTLQFLPPEDRRALLEKIY
RGLNQGGLLVLSEKFRFEDETINNLLIDLHHTFKRANGYSELEVSQKRAA
LENVMRIDSINTHKVRLKNVGFSHVELWFQCFNFGSMIAIK
>MS0945 smtA, SmtA protein
MWHAKHATELKLPTSWQQIPNGTLYCNALNRYFSHWLSNILGDQILKLGG
LSAEIGLDLPMRHQLVISPEIPQNLTALCLHPCTSVVRSKVTELPLIEES
IDACLLANNLNFCADPHRLLREITRVTTESGLLFISLFNPLSILAFKRQF
HQTPYEKFPFRQYPTWLIIDWLELLNFDILQCENLALQHRQHFSLFSPLT
VIIAQKRTCSLSSQAQKIQFHQEDVFSPEAAFKRINE
>MS0389 sodA, SodA protein
MAYTLPELGYAYDALEPHFDALTMEIHHSKHHQTYVNNANAAVEAAVKNV
PALAEYLDACPGKILKNLDKVAAENRTAVRNNVGGHANHSLFWKALKTGT
TLQGALKDAIIRDFGSVEAFQAEFEKAAATRFGSGWAWLVVQEGGKLAVV
STANQDSPIMGKEIAGCEGFPLFCLDVWEHAYYLKFQNRRPDYIKEFWNV
VNWDFAAERFEKKLAECGCAK
>MS1704 sodC, SodC protein
MFIFHTMIVSLWHLPTFYLFTGAYMKKTVILLGLFTLSGAAIAEEAKNVQ
TEIKSKVIEVSLLDPVKGDKAIGQVVVTESPYGLVFTPELNGLTAGLHGF
HLHQNPSCAAGEKDGKKVAGLGAGGHWDPKEAKRHGFPWEDNAHLGDLPA
LAVNADGTASNPVLAPRLKSLDEIADKSIMIHVGGDNHSDHPAALGGGGA
RMACGVIK
>MS0886 soxR, SoxR protein
MNINEIVKKTNLTAKSIRFYEEKGLITAPQRALNGYRQYNQKHVEELNLL
HQARLVGFSLPECKELLELYKDPHRRSADVKAKTLARIAEIDNQIGKLQQ
MRQQLQTLANQCPGDGSEHCPIIEGLSKPNCCDHHAEKK
>MS0468 soxR, SoxR protein
MRIGQLAKAVGCTIETIRYYENQGLLAKPQRSANNFRYYTNDHLQQLSFI
CYCRSLDMSLHEIKMLLNLDRSSGQRAEEINLLLDKHIRDVAKRLHELAH
LRMELIKLKQKCSEMTGENLMQNIFSGGNIRFRKIK
>MS1385 soxR, SoxR protein
MNSQKKFYTISQLAEKLAITTHTLRFYEKEGLLPSVQRDQNGNRLFIQAD
VEWLELLICLKNTGMPLKEIKRFVEWLNYGDSTIEQRLQLFQAQVTKVEQ
QIAELQRHLEILKYKRQFYQCAKELGSVQAVLDTQLQQQFAEQNILLPVS
PLSMAENE
>MS1433 soxR, SoxR protein
MMMKINELSKKSGINLETIRYYEKTGLLPEPKRAANGYRVYDQQSLSQLN
FIKSCRWLGFSIDEIKQLNELKNTPKHHCVADEMILSHLKQVEEKIARLL
EIQTFLQNLVNHEEHSVEECRAISGLSQER
>MS0179 soxR, SoxR protein
MEQTLKQGIFMHIKEFSTKIGLSIDTLRYYEKEGLLNPARNKSGYRNYGK
QDLEWIAFILKLKAMGVPLTQIKEYARLRYLGDTTIPERYAILQAHNQKL
VEQEKEIKKYQQFLAHKLSIYEKVMKKQN
>MS0241 spoT, SpoT protein
MVAVRVSHLLNPKDFIIEDWCAGLGLTPDVEKNIVRAWYYAQEKAQQLFQ
NSHWYLRDGVEMVEILHGLNMDADSLLTAMLFPIVNAKIVNQEQIKEDFG
PHIWKLLKGVIEMNNIRQLNTTDSNAQVDNIRRMLLAMVDDFRCVIIKLA
ERITYLRDAEKRYSKQDKVAAAKECSNIYAPLANRLGIGQLKWELEDYCF
RNLQPEQYRIIAIKLNERRLDREQYIADFVQRVSQYLDESVTGAEIYGRP
KHIYSIWRKMQKKHLDFSQLYDIRAVRIIVPALQDCYTALGIVHTHFKHL
PDQFDDYIANPKPNGYQSIHTVVLGEGDKPIEVQIRTKKMHDDAELGVAA
HWKYKEGNTGSLSAYEEKIIWLRKLLAWQHDISNSGEVVPELRTQVFDDR
VYVFTPKGEVVDLPAGSTPLDFAYAIHSDVGHRCIGAKVGGRIVPFTYQL
QMGDQIDIITQKNPNPSRDWLNPSLGFTHTAKARSKIQAWFKKLDREKNI
PIGKEQLENELNRLAITLKQVEPIALPRYNLKSIDDLYSGIGSGDIRLNH
LINFLQAKLIKPTAQEADEEVLRQVTKTANSAANQQKNEKNKGYVIVEGV
GNLMHHIARCCQPIPGDDIEGYITLGRGISIHRTDCEQLAELKAAHPERV
VESIWGENYNSASGFNLSIRVIANDRNGLLRDITTVLANDKISVANVTTR
LDSKRQLATMDLEIQLKNVQILGKVITRLTKLDDVIEVKRL
>MS1736 spoT, SpoT protein
MYLFEPLNKIIQGYLPSEHIDLIKRAFVIARDAHEGQFRSSGEPYITHPV
AVASIIAEMRLDHEAIMAALLHDVIEDTPYTEEQLTTEFGKSVAEIVEGV
SKLDKLKFRTRQEAQAESFRKMILAMTKDIRVVLIKLADRTHNMRTLGSL
RSDKRRRIAKETLEIYSPLAHRLGIEKVKNELEDLCFQAMHPQRYAVLNK
VIQVARNTRQELVHPILVTIQQRLEEVGINAQVFSEEKPLFYIYQNMRLR
NQQFRSIMDISNFRIIVDSIDNCYRVLGQMHQLFKPRPGQIKDYIAVPKA
NGYQALHTSTIGPHGVAVEIQIRTEEMNLIAELGVTAHWVYKPGGKNDTT
TAQIKAQHWLQSIIELQQSAGNSFEFIESVKSDLFSDEIYVFTPKGRIIE
LPAGATPIDFAYAVHTSIGSTCVGAKVDRETYPLSQALRSGQTVEVITSP
NATPNANWLNFVVTGRARAKIRQTLKTLRLEEAINLGRYQLLHALAGKHL
EDLDPAIVHHVLTELNLDTMDDLLAEVGLGNQLSTVIARRLQGESLAIYT
DIEEVNNQERLPIKGMDGLLVNFAKCCHPIPGDSIVAYANPGKGLVVHHE
NCRNLKKRTTQSVPFIKVEWEQCDHSAEFEAELHINMVAQQGALANLTAA
ISAAQSNIHSIWTEESEGRICHVTLTLSAKDTKHLANIMRKIKSLSGVQS
VERNINE
>MS0474 spoU, SpoU protein
MSENIYGIHAVNSFLATAPERLIEVYVLKGREDKRLQPLLKELHQLGISV
QFLNRQTLDNKANGEVHQGIIARVQPAKELNENDLERILSNNKDPLLLVL
DGVTDPHNLGACLRTADAAGVCAVIVPKDKSAQLTSIARKVACGAAEVVP
LIRVTNLARTLRDLQQSHNIWVVGTAGEATETLYQTKLIGPLALVMGAEG
DGMRRLTREHCDQLISIPMAGSVSSLNVSVATGVCLFEIVRQRLS
>MS0248 spoU, SpoU protein
MNDKSNKAAFQPAKFQRPSNQKRFHERTVGERQEKFGQSRAQFSRSQDND
RFAPQDRRSNKDFDRKERQNPAKNDRTFERRNERPIPQETKITETKLGNV
KVVMKRSGVSEQPRVKKTGSLSPRAPEKIKKNRAEEMKVYGESACLALFA
ERPESIVRVWATVEMAHKIGDMFSYLAANKKVYHVVERAELELVSGTEHH
GGICMLVKKARPFTLTGYLDIPRQQDALIILDNVRNPQNIGGIIRTCAFY
GVKGVIVDNAELLNSAAAMRVAEGGMEYIHQLQTESPDDALAKLRKAGYQ
VVHTTTNKQAKGVHKLQLAKKVVFVLTESENPALVQSGDEVINLSFANPL
KTGLNVAVAAGVLLAKLDK
>MS1107 sppA, SppA protein
MNIVFLFFVLLLAAIVSLTTMVKEKPNLTGDQGALLVNLNGYLADEREDG
LNWRNALKKLNDEQVASQYSTFDVVYAIENAANDERIKGLVLDLNYLDGG
DLPALDYVGKAIRDFQKSGKKVIAYADNYSQSQYFLASYADEIYLNPIGE
VGIEGLSAQNLYFKSMLEKLEITPHVFRVGTYKSAVEPLLRDDMSPEAKA
NTEQWLGTMWSNYQERIAENRNIAKNSVLPEAGVYVDELKALNGDITAYA
KKHKFVTQVASRLKLSQNLTALFGENEQNEPKTVDFDTYLAALPDRLKGD
SSDFVQAKNKIAVINIEGTIVDGETNEQGVGGDSIAQLLRKAYKDKNVKA
VVLRVNSPGGSAFASEVIRQEAENLQTAGKPVVVSMGAMAASGGYWISST
ADYIVADKNTLTGSIGIFAVLPTLENTIKKAGISADGVTTSALVSPSGFS
PLTAELKDSLQLQIEHGYERFLSVVSKGRSLTKQQVDNVAQGRVWLGEDA
YKMKLVDELGDFDTAVRKAQELANGKLAESEKTDTFSVEWITDENTGLLG
GLMKNITQSSQNVIQNAVLKTMGLPKEVKQLQKQLGILTQFNDPKGQYLY
CLNCSEVK
>MS1146 sppA, SppA protein
MWSEILVGYGIFILEILTILLVIAGIVAAIMTLKQQKNPQTGELKLTDLS
EQYQDNVKKLKDFRLTDEELKQAEKARKKADKQKAKENKAKNKKGEKTEE
SLKPCVYVMDFKGDIRASETAALREEISAILNVANPATDEVLLRLESPGG
VVHGYGLAASQLARLKQKGIKLTVAVDKVAASGGYMMACVADKIVAAPFA
VIGSIGVVAQVPNIHRLLKKHDVDVDVMTAGEYKRTVTFVGENTEKGKQK
FQQELEETHDLFKQFVTANRPLVDIDKIATGEHWFGQQALALNLVDEIAT
SDDLILDAMQDKSVIGVKYAVKKSLIQKLGKQAEESSDKLLLKWLKQGNK
TLM
>MS1436 sppA, SppA protein
MSDSTSNFTIDDYDHYFYVGPINMVGYYRLCKEISKHKNKDKVLLCLVTY
GGDPDAGFRIGRALQHHYNGEVTIYIPNVCKSAGTLTTIAAKHIIMDNKG
ELGPLDVQLRKTDELGASNSGLDIFKTLDTLEDRANTAFNKYLRAVRFGQ
GLSTKMSVEIATRLVDTIIKPIAEQIDPMKIGEHQRATDIAIEYGNRLNQ
TSKCLKDDMQSLDKLIRGYPSHGFVIDRKEARTLFSCVTAANEDIIHRYQ
TIHNATESNPNIIGSDLYVEYLEDEQNETSNANNESSSNSTPNDKSGKST
RSSTKS
>MS1088 spr, Spr protein
MRTKIKTFCLTISVAILSACSSNISSITYKGRIDDPIMAIVLLSEQQREW
AGAPYVLGGVSRSGVDCSGFVQTTFMDRFNIALPRTTAAQSGYGQKISLS
DIQTGDLVFFKTGRGPNGYHVGIYVKNDKFLHASTKGGVIYSSMNSPYWK
NAYWQTRRI
>MS1333 spr, Spr protein
MMIKKFLLVTAALVMTACSNSSRLDAAVYPSADETNDTQLTELIGSLKTN
KPQYDVRSNSIHSTKNAQINNKKLMQVYSAWAGTRYRLGGTTTRGIDCSA
FMQEAFSTAFGIDLPRSTSEQRSVGKKIQKSELKQGDLVFFRGNRHVGVY
LGGNRFMHSSTKEGVTISSLDDGYWSRTYTQSRRVL
>MS1698 sprT, SprT protein
MFIYKKSAEFNRTFKSDFKYRKIATQLEGLFIFYKLDLNRLQKINSSCIS
GKNTPRDHAESGYGGERKVDESGIGIGK
>MS1699 sprT, SprT protein
MENISELTGFRHLKMQVQRRLTNCLTLAETHFHRSFPMPTVTYQVRGMKA
GVAYLQQNEIRLNRTLLLENSAEFIGQVVPHELAHLLVYQVFGRVKPHGV
EWQTVMNNVFDLPANVYHRFDVKSVQGETFTYQCQCRTHQLSVRRHTRIQ
RDHAVYFCRKCRSCLSFVSG
>MS1950 srmB, SrmB protein
MRYNFPQFYNLSHLRIFMPQPQFEDFDLSPELLKALAQKGYARPTAIQSE
AIPAAMDERDVLGSAPTGTGKTAAFLLPAIQHLLDYPRRKPGAPRVLVLT
PTRELAMQVAQQAEELAQFTKLSIATITGGVAYQNHGEIFNKNQDIVVAT
PGRLLQYIKEENFDCRAVEILIFDEADRMLQMGFGQDAEKISAETRWRKQ
TFLFSATLEGELLVDFAERILTDPVKIDAEPSRRERKKINQWYYHADSYE
HKVKLLARFIADEQVSKGIVFVRRREDVRELSEILRKRGIRSTYLEGEMA
QTQRNNAIDKLKNGIVTLLVATDVAARGIDIEDISHVMNFDLPYNADTYL
HRIGRTARAGKKGTAVSFVEGHDYKYLGKIKRYTEELLKPRIIEGLEPRT
KAPKDGEIKTVSKKQKAYIRQKREEKRKTTQKKAKLRRQDTKNIGKRRTP
KAVSEAQAKEIR
>MS1836 srmB, SrmB protein
MSLDHLSQQRFADLPLNAKVLEALESNGFEYCTPIQALSLPISLAGKDVA
GQAQTGTGKTMAFLTATFHHLLEHPVKTNHPRALIMAPTRELAVQIAHDA
ERMVKTTGLKTALAYGGDGYDKQLKAIEAGADIIIGTTGRIIDYVKQNII
ALSHIQVVVLDEADRMFDLGFIKDIRYLMRKCPSPKQRLTLLFSATLSYK
VRELAFEDMNDPEYVEVEPLQKTGHRIKEELFYPSNEDKMPLLITLLEEE
WPERCIIFANTKHQCEKIWGYLAADGHRVGLLTGDVAQKKRLSLLKQFTD
GALDILVATDVAARGLHIPDVTHVFNYDLPDDREDYVHRIGRTGRAGESG
VSISFACEEYAMNLPAIEEYIGHHIAVSQYDSDSLIRDLAKPYRLKPSLP
ASNRHNRNGAKPFKKRF
>MS0495 srmB, SrmB protein
MTETKITFGDLGLPEFILSAVSDMGFETPSPIQQACIPHLLNGRDVLGMA
QTGSGKTAAFSLPLLAQIDIEEKHPQMLVMAPTRELAIQVAEACELFTKN
AKGVHIATLYGGQRYDIQLRALRQGAQVVVGTPGRILDHIRRGTLNLSEL
KFIVLDEADEMLRMGFIDDVETVMAELPAQHQTALFSATMPEPIRRITKR
FMTDPQEVKIQSTQRTNPDIAQSCWYVRGYRKNEALLRFLEVEDFDGAII
FTRTKTGTLDVTELLEKHGFRAAALNGDMTQQLREQTLDRLRNGSLDILV
ATDVAARGLDVERISLVVNYDIPLDAESYVHRIGRTGRAGRSGSAILFVE
PRERRLLSNIERLMKKPIEEVDVPNHEALQARRREKFKAKITKQLEHHDL
EQYRLLLEGLFTPDQDQEDIAAAMLMLLQGKQKLILPPEPPMEKRGRRER
DDRRGERGDRRERRPEERRGYGNPQPMDLYRIEVGRADGVDVRHIVGAIA
NEGDINSRNIGHIKLYDEYSTVELPQGMPKELLQVFGKARVLNKQMRMTF
VSEAGETVGRERHEGRRNDRRDNGFRREERRFNDRGNRSFNERAPRREFR
ERNDRRDRRDRRS
>MS0694 srmR, SrmR protein
MSTYLLDAKLAQKIVQRTMDIIDCNINIMDAKGKIIASGDVNRIGEIHDG
ALLVLSQGRVVDINEAVIHSLHGVRPGINLPLRVDGEIVGVIGLTGEPTT
LKEFGKLVCMTAEMMLEQARLFNILAQDTRLKEELVLNLINTDKITPSIV
EWANRLGVDLSIPRVACIIEVDSGQLGIENARSELQNLQTLLKIPERDNL
VAVLSLTELVVLKPALNSFGRWEVDDHLERINQLLSRMNEKAKLNVRISL
GNYFTTEDSISLSYHTAKTTLTIGKARYPKQRIYNYQDLILPVLLDQLRD
GWQKEELERPIKKLKLMDNNGVLLKTLLAWFENNMQTIATAKALYVHRNT
LEYRLNKIADLTGLDLNSTDNRFLLYMALHVAV
>MS0585 ssb, Ssb protein
MAGINKVIIVGHLGNDPEIRTMPNGEAVANISVATSESWTDKNTGERREV
TEWHRIVFYRRQAEVAGEYLRKGSQVYVEGRLRTRKWQDQNGQDRYTTEI
QGDVLQMLGGRGQTADAGFAAPQPNQSFSRPQASAARQQPATRPAPAAEP
AMDNFDDDIPF
>MS1141 sseA, SseA protein
MAGMVGDLDDYDRLLANISLSETDTMPHLRIFMKYTALFAIFSLFLTACN
DNKVQPIDTAELLQNLNNPQYVIIDSRNDSLYNGFKDKHASRGGHIKGSI
QFTCSWFDSIEAGKFDSFAESKGITKNKTLVIYDSNPDNLACISAEFAAK
GYKVRTFSDFISYVNAGYPLESLLNFQYSVSPEWVYSVLQGEKPESYTND
DFMLFEVSWGALENAKAYTQHIVGAYHFDTDWVEGEAPVHNLLEPATIER
NLLKNGITKDKTIILYSDNPLAAYRIFWALKWAGVEDVRVLNGNLSTWMD
SGFPTETKVNIPQPVNNFGGHIPTNPQLSIAQPQQAYARQQQGLKLISSR
AWEEYIGEVSGDDAIQATGEPQGAIWGFSGSAPSNVADFYDPDDTLRNPK
EIEALWQELGIVQGDQLAFYCGTGWRASVPWFMTQLLGWRNTAVYDGGWN
AWQMTELPVQKGAPTGLLKPDAKNDSGRMLKKTNSCRG
>MS1140 sseA, SseA protein
MNIKLTLIATAVALTLSACDDKSVKEIKTDELLKNLDNPEFVIIDGRSDS
LYNGFKDGDAKRGGHIKGAVQFSCNWLAHIADDKFEKFAKDKGLTKDKTL
VFYDSNSEQLNCLSDKFAEKGYKVRVFKDYLSYANSDNPLEAFPNFEYSV
SPEWVNAVIKGEKPESYQNDDFIVFHVGWGPVEKSEEYKQHIPGAFHFNT
DWVENDPVWNLSDPKIIEQNLLNAGINKDKTIILYSDNQLAAYRIFWALK
WAGVKDVRVLNGNLTTWTKAGFATETAVNTPTPVSQFGAEIPLNPQINIS
MPQEAIARQKQGLKLISNRAWDEYTGKISGYSYIPGKGEPQGAIWGFAGT
DASNMADYYDVDGTLRNPKEIFALWKEQNINQGDPIAFYCGTGWRAGVSW
FMTQLAGWDNAYVYDGGWNAWQMDSVFPVQKGAPNNMAKPDSKNDFGQK
>MS1908 ssnA, SsnA protein
MKNHVRSFKTYIRDEIIKKGGWVNAHAHADRAFTMTPEKIHIYHNSNLQQ
KWDLVDEVKRTSSVEYYYARFCQSIELMISQGVTAFGTFVDIDPICEDRA
IIAAHKARDVYKNDIILKFANQTLKGVIEPTARKWFDIGSEMVDMIGGLP
YRDELDYGRGLEAMDILLDKAKSLGIMCHVHVDQFNTPKEKETEQLCDKT
IEHGMQGRVVAIHGISIGAHSREYRYELYKKMREAQMMIIACPMAWIDSN
RKEELMPFHNALTPADEMIPEGITVALGTDNICDYMVPLCEGDMWQELSL
LAAGCRFPNLDEMVNIASINGRKVLGLDR
>MS1280 sspB, SspB protein
MKNKMEYKSSPKRPYLLRAYYDWLVDNEFTPYLVVDATYYGVDVPQEYVR
DGQIVLNLSSGAVANLQLTNDAVMFNARFQGVPREIYIPLGAALAIYARE
NGDRSDVRT
>MS0832 sstT, SstT protein
MNISRLFSFLFHGNLVKRISIGLLLGIIFALVSPSLESALGFHLAEKMGL
LGQIFVRSLRSVAPILVFVLVIAAIANKKVGSKSNMKDIIYLYLIGTFLS
ALTAVFASFMFPTTIALATNEAELSPPGKITEVLTALIFNVVDNPITALF
NANFIGILAWAIGLGITLRYASETTKNVMNDFAEAVSKIVHFIISFAPIG
VFGLVASTLADKGLSALLDYVQLLAVLVGSMLFVAFVINPIIVFWKIRRN
PYPLVWECIRVSGVTAFFTRSSAANIPVNMELAKRLNLDEETYSVSIPLG
ATINMGGAAITITVLTLAAVFTLGIEVSIPTAILLSLVASICACGASGVA
GGSLLLIPLACSLFGISNDIAAQVIGVGFIIGVLQDSTETALNSSTDVLF
TAAACMSEERKNS
>MS1355 sucA, SucA protein
MQKNSPLEEWLASTALGGANQSYIEDLYEDYLRDPAGIEESWRKTFDSLP
KSTAVEQPHSQIRSYFQQLARDSNSQSGASVIDPNVSKRLVKVLQWVNAH
RNRGHLHANLDPLNLWQRLDAPTLDYKFFGFTDNDLDEVFDIGNYVYNRD
KITLRDLAYALKNTYCSTIGLEFMHVNDLEARTWLQRKVENLLNKPLFSK
DEQVKFLEELTAADGLERYLGAKFPGAKRFSLEGSDAFIPLMKEIIRHGA
RNGVKEIVMGMAHRGRLNMLVNVLGKKPAELFDEFAGKHQGNGTGDVKYH
QGYSSDFMTDAGLVHLALAFNPSHLEIVSPVVSGSVRARQKRIGDDHFTK
VLPITVHGDSAVIGQGVVQETLNMSSTRGYTVGGTIRIVINNQIGFTTSN
TRDTRSTEYCTDIAKMIEAPVIHVNGDDPEAVAYAARMAVEYRTLFKRDI
FIDLVSYRRHGHNEADEPLATQPVMYKLIKQHPTPRKVYADRLVAEGVIT
ESKATELMNNYRDGLDRGDCVVPEWRPLDTQKMDWTSFLTQEWTPYNGKF
DPQRFKDLARKVCEYPENHPIHPRVQKIYADRLLMANGEKLFDWGMAETM
AYATLLDDGHHIRISGEDSGRGTFFHRLAVLHNMNERKAYIPLMNLHQGQ
GHFEVWDSVLSEEAVLAFEYGYATAAPKTLTIWEAQFGDFANGAQVVIDQ
FISSGEQKWGRMCGLVMLLPHGYEGKGPEHSSARLERYLQLCAQQNMQVC
VPSTPAQIYHLLRRQMLRTVRRPLVVISPKSLLRHPLAVSTTEELVNGTF
QNVIPEIDDLDPKQVKRVVFCAGKVYYDLLEQRRKNNQTDIAIIRIEQLY
PYPHEEMRDILTAYSHVKDYVWCQEEPLNQGAWYCSQHNFVANLPADGKL
RYVGRPASASPAVGYLALHNEQQKALVAEALAI
>MS1352 sucC, SucC protein
MNLHEYQAKQIFAQYGLPVSEGCACQSLEEAIQAVKKLGGGQWVAKCQVH
AGGRGKAGGVKLVKSEEEVRSFFEKFLGQRLVTFQTDAKGQPVNAIYMEA
CANVKKELYLGAVLDRSSQRIVFMVSTEGGVNIEEVAEKTPHLLHKMPID
PLVGAMPYQGRELAFKLGLQGKQIQQFAQIFCQLGKMFVEKDLSLLEINP
LVILDNDQLHCLDAKIVVDGNALYRQPELNAMRDPSQEDAREAAAEQWHL
NYVALEGNIGCMVNGAGLAMGTMDIVKLHGGQPANFLDVGGGTTKERVAE
AFKIILSDQSVKAILVNIFGGIVRCDLIAEGIVAAVNEVGVSVPVVVRLE
GNNAPLGREILAQSGLNIIAATSLTDAAVQVVNAAEGK
>MS1351 sucD, SucD protein
MAILINKETKVICQGFTGGQGTFHSEQALAYGTKLVGGVSPGKGGSIHLG
LPVFNTVKEAVEQTGATATVIYVPAPFCKDAILEAIAAGLKLIVCITEGI
PTLDMLQVKQKLDESGVVMIGPNCPGIIVPDECKIGIMPGYIHKKGRVGI
VSRSGTLTYEAVKQTTDEGFGQSACVGIGGDPIPGSNFIDILKLYQADPK
TEAIVMIGEIGGSAEEEAAEFIKAHVTKPVVSYIAGITAPKGKRMGHAGA
IISGGKGTADDKIAALQSAGVICVKSLADIGAALKTVLK
>MS0511 sufI, SufI protein
MENYSRRRLFKKTLIATALVATPAPLLAASRQPLVIPPLLESRRGRPVIL
STESSQTALVDGKLVEVWGFNGRYLGPTVRVKQGDFVKLNYRNNLTQLVA
LNIQGLQTSGELLGSIGHSLKPGEGWAPIVPITQSAGTCYYHSCTLASSA
YQNYRGLVGMWIIDDDESRKANLPNKYGVNDIPLILQDQRINSAGTQLFQ
QNEPHFYGERLFVNGQEAPYLNIPRGWVRLRILNASLSRGYDLRMDDERD
VLIIAQDQGFLPQSKTVKQFFVGPGERVEILVDLNEGENVSLIVGAKRGL
LDKAKLLFNSNGELADNTVLELRPEGLLSVFNGKPSFQFSEAAVLPTQIK
QERSFHLDATNGMINQKRFDPRRIDVNAKQGSVERWTISSSIPTGFRIQG
ARFVIESIDDKATDAAELVWKDTVWINGKVRILVKFDNLSSNTQPFIFGS
SDLMQADKGAIGLIVVQ
>MS0876 sufI, SufI protein
MPKPVNRTTPANVVVKLEAADRMMEIMPGVKFKYWTFNGSTPAPFIRVRE
GDTVEVHLSNPINSGLPHSLDFHASAAPDGTAMVSSTKPGRTTVYRFKTL
SSGLYVYHCASIPGAGTHIGKGMFGLMLVEPKEGFPPADKEFYIMQNEFY
TNGSFGEQGLQVFSTEKAAYELPDYVVFNGHYGSMQGEKALKAKVGEKIR
FYVGNAGPNKASSFHLIGKTFDTVYVEGGTLQNHNVQTTLIPSGGAMISE
VTIPVPGQYSFIDHSIFRADKGARGTLMIEGDENPEIFSGKLRDEPYDKR
NPDSDIDTGFKH
>MS1729 suhB, SuhB protein
MEIIMNPMLNIAIRAARKAGNVIAKSYERRDDIQTTLKSANDYVTNVDKA
AEQAIIDVIRTSYPDHTIITEESGALEGKDSDIQWVIDPLDGTTNFVKGL
PHFSVSIAIRVKGRTEVGVVYDPIRNELFTAVRGEGAKLNDLRLRVDAKR
ELEGAILATGFPFKQARHMPLHFAVMNSLIESCADFRRTGSAALDLCYVA
ANRVDGFFEIGLKPWDCAAGDLIVREAGGLVTDYNGGHSYLTSGHIVAAS
ARVVKEILNRLQPLLGDEFKK
>MS2203 sun, Sun protein
MKNSAKTTALLSPKKAFKRPQTPAHKKIQSVRALSARIILQVLDQGKSLS
ALIPELQSQVKAQDLPLLQEICFGVCRVLPRLEQIIKKLVDKPLKGKTRI
VHCLLLVGLYQILYTRIPAHAAVDEVVNATSALKSENFRGLVNGVLRRFL
REQQEILAVVDKNWQTLHPEWFVNKLKKAYPNWREIIEANNQKPPMWIRV
NQQLCTTENYRTLLLTEQELDSFKDENPNALRLAQPTAVQNLPHFTEGWV
TVQDVHAQWSALLLEAKNGDLILDACAAPGGKTTHILELAPQAKVIALDV
EQSRLNRVAENLARLNQQAVLICGDATKPDDWLPNAAERLIGKKTDLQFD
RILLDAPCSATGVIRRHPDIKWLRQEQDIGQLAALQQQILTALWKKLKPN
GILLYATCSLLPQENSEQIRTFLANTPDAKLEPLPFATQENEIGKQFIPS
EFSGDGFYYAKLRKTDKKGG
>MS1843 surA, SurA protein
MVMEKMHNASNSIFSKIIFALISVAFVVSGMAGYMVATADTSAVKINGEE
ISQQAFQQQYNDEYQRLSQQLGAQFSAVADTPEFSEGLRKSVLNRLIDQE
LLRQYVTDLKLVASDASVKQEIVTTPAFQADGKFDNNAYQQTLRANNMTA
DMYAEYVREALRLDQLQSGLAGTVLMLPAQQEEFAKLFFQKRTFRLAKLP
LTAEMAKQTVTDQEVADYYNANKSAFMVPELVKVQYLDITRAAAEKAVKV
TDVEIQQYYQDNKAQFVSKAQDRLAHIQFAKETDALDAYQALQNGADFAA
LAKEKSLDKPSAVNGGELGWLNAGDLPKAFEEAAAALQIGQYSQPVKVDN
QYHIIKLEDRKEPKAQSLEEVKDLIASQIRQDLLNNQFYSLEKQVAEKAF
EDQSSLEAAAKVAGVEIKETDYFSRKDIPAALNFPSVASAIFDGDISQGN
QNSEPMNVADQHSIVVRVVDHKAEGTKSLEEAKAEITAYLKRQKAETVML
EQANKTVEELNLGKQPALNFAAAETWVYAENKDPALNNAIFSMAKPEQDK
TTYAAAKADNGDVVIVALTAVENGEVSAEQGAQFAAQVMQAEQTDLQANL
LKSLRNKAKIEVNEEFMKQSQD
>MS0629 surA, SurA protein
MTVATKWRLFQNKLLSLNFLLKSVSNERNLLNFNNGIMMKPGNLKSLLLV
MIGMFAVSVNVQAVEQVVATVDGTPILESQLKRALGKQANNATNRAKALD
KIIDDMLVQKAVKEANVHISEGQLDKIVENIAAQNNMTYGQLLDALDYQG
IGITKFRNNIRNQLMMAEVRNRSIGKNIDVTREQVETLSKQMLEQAKTQG
KKAQVTGTEYQVRHILLKLNPLLNDAQAKAQLNQICSDIQSGKTTFAAAA
KDYSKDYLSGANGGDLGYAFPEIYDPAFGQVIKATKKGVISAPFKTQFGW
HILEVTDTRQGDMTEAAYRQKAYETLVNQQLQDDAKDWVKALRKGAEIKY
LVK
>MS2272 surE, SurE protein
MLFFMGFMQNIPHKILIIIKEKNRKIMNILLSNDDGYHAEGIQILARELR
KFADVTIVAPDRNRSAASGSLTLVEPLRPRHLDDGDYCVNGTPADCVHLA
LNGFLSGRMDLVVSGINAGVNLGDDVIYSGTVAAALEGRHLGLPSIAVSL
DGRRYYETAARVVCDLIPKLHTRLLNPREIININVPDIPYDQIKGIKVCR
LGHRAASAEVIKQQDPRGESIYWIGPAALPEDDEEGTDFHAVNNGYVAIT
PIQVDMTSYNSMSALQDWLESE
>MS2356 tag, Tag protein
MKKRCPWAEGSQLYRDYHDNEWGKAEFDSRKLFEKICLEGQQAGLSWITV
LKKRENYRRAFHQFCPEKIVRMTDQDIDKLMLDKGLIRHRAKLMAIVKNA
KAYLLMEKCGENFSNFVWSFVNNQPQINDCPDLTAVPAKTECSKALSKAL
KKRGFVFVGETTCYAFMQSMGLVDDHINDCFCKHK
>MS0659 tagB, TagB protein
MTRFVKLSIAILTAMVGWILYFISGFVTRDTKKIVFGTHTGTFSGNVKAL
YLDESYKKDAIKIFIYRNENIKASLEALDGKPLYFSYLSFKGIYHTLTAG
TFVYSSYASDINYWLSKNAKLFNVWHGTPLKKIERDVTTGFYSIRNKYEF
LFKYIYPHLFVRPNQLLVCSEYEKACFKTAFDVSNEAFVEAFPPRLNTLK
EDYINEINKNFIIYAPTWRDDSSFQFYKNCDLNALNEVMEQKGLTFLIKP
HPSDKMKHLDKKYSHIKLAPLSEDFYVLVKKAKLAVTDYSSVMFDCMYCN
IPVVLFCPDLKSYMLNSRDFYCDIKELPFPLMESGNDFIRILNNNFTACN
SDKFLPYKNTLT
>MS0658 tagD, TagD protein
MAKTIITYGTFDLFHIGHLRLLQRLKKLGDKLIVAVSTDEFNEGKGKKTV
IPYEQRAEIVANIKCVDLVIPETAWEQKITDVQKYDVDVFAIGNDWEGKF
DFLKEYCDVVYLERTKDISSTQLKQTLKSFSISKDEILKAFDILEQLKRD
LE
>MS2093 tas, Tas protein
MKMRKLGSQGLLVSEMGLGCMGMDHGYGKPVDRQAMITLIHKAIELGCNL
FDTAPIYGFDNEELLGNALKDHRENVVIATKFGVLDMELVDGQPVPVLDS
SPASIREQLDGSLQRLQTDYIDLFYQHRVDPKVEPEVVAQIMKALIAEGK
IKYWGISNAPADYIRRAHAVCPVSAIEDQYSMMWRKPEQDLFPMCDELGI
GFMAYSPLGNGFLSGKVAQNTEYQEGDFRGQMGRFKPEVMAQNQALLDLI
AEIAERKNATSAQVVLAWELAQKPYIVPIPGTTKLHRLEENFKGAEIELS
AEELADLNTALSKLDINETFF
>MS1418 tas, Tas protein
MKKHKESTMKNLNLPKIMLGTWSWGAGMYGGDQVFGNSIEAKDVKEVFDL
AVKNGLNAFDTATAYGLGASEEILGELMSAYQREELIISTKFTPQLAEMY
DNSVQKMFDASAKRFNTDYIDIYWIHNPTDVERWTSGLIPLLKAGKVKAV
GISNHNLAQIKRVNEILGAEGYKLDAVQNHFSLLYRASEEAGILDYCKQN
GITFFAYMTLEQGALSGKYSPENPMPAGSGRGETYNKVLPQLVKLTDKMR
EIGEKQGASVAQIAVAYAIAKGTLPILGATKPHHITDAAKAMTIALSADE
VTELEELAKATGVDTKGAWEEPMA
>MS1422 tas, Tas protein
MKYTKLGNSDLNVSRICLGCMGFGDAATGQHSWTIGEPDTREIIQYALEN
GINFFDTAIAYQLGSSERFVGKALRDMTKREDVVVATKFLPRTQQQLADG
VSGEQAILSSLDQSLQNLGMDYIDLYIYHIWDYNTPIEEVLQTLHKAKQS
GKVREIGIANVYAWQLAKANALAEREGWSKFISVQNHYNLIMREDERELF
GLCAEDNIALTPYSALASGRLSRLPNETSKRAVEDTYAKGKYDATAEQDS
VIINRVAELAEKYGVSMTEISLAWLLTKVDAPIAGATKKSHIDGAVGAVN
LTLSAEDLVYLEACYQPHNLVGIMAQNSYKTKDVKQVWSR
>MS0500 tatA, TatA protein
MGGISIWQLLIIVAIIVLLFGTKKLRTLGTDLGESVKGFKKAMNEDEPKD
AEFKSLNKDESATAGSEKVKDKEQA
>MS0501 tatA, TatA protein
MFDIGFSELLLIFIVGLVVLGPKRLPVAIRTVMGWVRTIRGLAANVQNEL
AQELKLQELQESIKKAENLNLKNLSPDLAKTVEELKASAEKMKADLDKAA
AETNTTIDEQIQILREENAQTQSNDVATSDTVEKSIADEFSIKNDENPTA
LSSVVSSVDSIQNGQSDLELDAQAEVDRQLAAMMDKYAPPDDVAENPIST
EKTS
>MS0502 tatC, TatC protein
MSSVEESQPLITHLVELRNRLLRSIMFVLIVFCGLVYFSNDIYHLIATPL
LEQMPQGSTMIATNVAAPFFTPIKLTGIAAVFLSVPYILYQVWAFVAPAL
YQHEKRLIYPLLFSSTVLFYTGVAFAYYVVFPIVFGFLTKTAPDGVAIAT
DISSYLDFVLTLFLAFGICFEVPVAIILLCWTGVTTPEDLKEKRPYIIVA
AFIIGMLLTPPDIFSQTLLAVPMCLLFEIGLLFARFYRPKEDETDNGSSE
LTKHKE
>MS0571 tatD, TatD protein
MFIVDSHCHLDSLDYEKLHSNVDEVIEKAKARGVKHLLSIGVALNRFQAM
KTLLAHRDEVSFSCGVHPLDLAGETFDRQRLERYAKDEKVIAIGEIGLDY
YYDQDRKNEQLDAFSQQIEVANQLNKPVIVHTRDAREDTIRLLRENHAEK
CGGVIHCFTENLEFAKQALDLGFYISCSGIVTFKNAEEIRDVVRYVPADR
LLVETDSPYLAPVPYRGKQNQPAYTREVCEYVAALKGVSAEEFALITTQN
FERLFKINVL
>MS0625 tatD, TatD protein
MAFFDTHTHLDYLQRTTNTPLSALMENALNADVQKILIAAVMARDFENIL
NMTELFPRHLYCGLGLHPLFIKNHQKSHLDELETYLQKNPQNLTALSEIG
LERSVSELISDELWRRQCDFLEAQLYLAKQYKLPVNLHSRKSHDQLFTFL
KRIRLPKCGVLHGFSGSYQQAKNFVDLGYKIGVGGVISYLRANKTRQAIA
KLPLDSLLLETDTPDMPVFGFQGEANRPERLVQTFRYLCELRSEPPAQIQ
QQIWRNSCEMFAVK
>MS1370 tbpA, TbpA protein
MQNTTCEKVAQAFSKKYNVKTQFVRNSTGTVLGKIKTEKDNPQGDVWYGG
TLEPHLQAADLGLLEKYRSPNQKDILPIFKDLTEKRGEYTSVIYLMELSM
GINSKKLASLNIEPPKCFADLLDPRFKNQIQYADPRVSGTGYSFLIALVS
LWGEEKAFDFLAKLNKNIAQYTKSGLATSNLASGEVAVDISFFHTYVREK
EKGAPVEGVYPCEGTAYTLGATSIIKGARNLDNAKLFTDWALTPEAQEVH
WREADSYQLPANIHAQYYPGMHVPANPKIIDIDFIRFGSNEQSKRLIERW
VNGILSNQPQ
>MS1526 tbpA, TbpA protein
MIKGFFMSKLKTFLFLTALFISDYGLSAVQPLHIYAEEYFAADWGPAPEV
KAQFERAYPQCQVTIQSFDSRTTMLNRLRLEGKNTKADIVLGMDNHQLEA
AEKTGLFAPAKVDFSRLSLPVEWKNTTFVPYEFSKYAFIYDKSKLTNPPE
SLKELVEREDLKVIYQDPRTSSIGRGLLVWMNKIYPPEQVEKAWQQLQKH
TLTVGKGWSETYGTFLKGEGDLVLSNNTSPIYHLLTDQKENYAATEFSEG
ETLQIDFAGKIAGKHNICADEFLAFLLKPEIQNIIITKGVMLPVVEGDLE
PHYAALKNAVMQGKTIDTLSVSAEQIKHWIEVWQRALSK
>MS1583 tbpA, TbpA protein
MKIKKISLAISTALLGAGLMFSAQANAKGRLVVYCSATNEMCEAVTKSFE
KQYDVKTAFIRNGSGSTFAKIEAEKNNPQADVWYGGTFDPQAQAAEMGLL
TPYRSKNIDDVMPRFQDPAKIKGNYASAIYMGILGFGVNTERLKKLGINE
VPKCWKDLADPRLKGEIQIADPQSSGTAYTAIATFVQLWGEDQTFEFFKK
LHPNISQYTKSGITPSNSTARGEATVGIGFLHDYAVQKAAGAPIEMIVPC
EGTGYELGGLSIIKGARNMDNAKLFVDYVLSKEGQEVAWRKGNSHQTLTN
IKAEQSPTAFDPTKLNLINYDFEKYGASDERKRLIEKWVQEVKLAK
>MS1585 tbpA, TbpA protein
MKISKIALSLSTVLLGSLMFSQNVAADTGRLVVYCSAQNTMCEQETLAFE
KKTGIKTSFIRGGTGSILAKIDAEKANPQGDVWYGGTLDPHSQAGEMGLL
VPYKSPNLQYIPDELKDPAKVKGNYSSAIYLGVLGFGVNTERLAKLKIPV
PKCWKDLTDPRLKNEIQAADPQSSGTAYTALATFIQLWGEEQAYVYLKEL
HKNVSQYTKSGNTATRNTARGEASIGIGFLHEHSLEKEKGAPVELIVPCE
GTGYEIGGVSIIKGARNLENAKKFVDWALSKEAQELSWQKGETHQILTNS
QAKQSPYALDFKSINLINYDFDKYGSSDLRKRLITKWVDDVKLAK
>MS2189 tdcF, TdcF protein
MILKETIMATTIHTENAPAAIGPYVQAVDLGNLVLTSGQIPVNPATGEVP
ADISAQARQSLENVKAIIEQAGLTVADIVKTTVFVKDLNDFATVNAEYER
FFKENDHPNFPARSCVEVARLPKDVGLEIEAIAVRK
>MS0525 tdh, Tdh protein
MMRSLVCKEPFHLILEERAKPQPKDEEVQLKVAAIGICGTDIHAYAGNQP
FFEYPRVLGHEASGVITELGKNVDKFKVGQRVALIPYVSCGKCGACLSGK
TNCCENISVIGVHQDGAFSEYLTAPAKNILPIADSVDFTTAALIEPFAIS
AHAVRRAQITKGDDVLIVGAGPIGLGAAAIAHADGANVVIADTSEERRKH
IQANIPVPTVNPINEKVEDYFNGRLPQIVIDATGNQKAMNNAVNLIRHGG
RIVFVGLHKGTIEFSDPDFHKKETTLMGSRNATLEDFEKVQHLMSERKIS
ANMMLTHTFKYDELAEIYEEKITKNQSLIKSVVLY
>MS0956 tdh, Tdh protein
MMEIKTLSCVVRGPKDVGVMEQSINYDESSKEQTLVKITRGGICGSDLHY
YQYGKVGNYEIKHPMILGHEVIGTVVKTNAPDLYVGQKVAINPSKPCLTC
KYCLSGDTNQCETMRFFGSAMYNPHVDGGFTQYKVVDNSQCIDYPQDVSD
DIMAFAEPLAVTIHAAKQAGDLAGKRVFVSGVGPIGCLAVAAIKASGAKE
IVVSDLSRRCLDLALEMGATKALNAKDDFSEYMAHKGEFDVSFEASGHPS
SIERCLAVTKARGTIIQIGMGGAIPEFPIMTLIAKEICLKGSFRFIEEFN
TSVEWLSSGKVNPLPLLSATFPYTELEKALIIAGDKDNISKVQLSFE
>MS1764 tdk, Tdk protein
MAKLYFYYSTMNAGKSTTLLQSAYNYNERNMNTLVYTAAIDDRFGAGRVT
SRIGISQQAKLFNRESNLFEEIKRHLASEKLHCILIDEAQFLTKQQVYQL
SDVVDLLNIPVLCYGLRTDFQAELFEGSQYLLAWADQLEELKTICHCGRK
ANFVLRLNERGDVVKDGEQIQIGGNERYLSVCRFHFKQKTGKLHN
>MS0026 tehA, TehA protein
MSDTKPFPIPVNYFSMVLGLAGLGLAWRYAAIILPLPAIIGESVLTVATA
IWLALIVIYIYKWKAYPEQAKAELYHPILGCFVSLIPITTILVAMGALPY
SRETAVVLTASGIIGQLGFAMFRSAGLWRSGHPQEAATPVLYLPTVATNF
VSANALGSLGYSEIGALFLGGGMLAWFFLEPAVQQRLRNLTALPENIRPI
IGIQLAPAFVCCSAYLAINDGEIDLLAKGLVGYGLLQLFFLLRLMPWIAT
KGFTMPFWAFSFGLASMAGVGLHTAHSSSSPYLELLGLAMFGFASCCITL
LTLGTLSLIRKEKFLIKN
>MS0634 terC, TerC protein
MFEWIADPEAWIALLTLTALEIVLGIDNIIFISILVGRLPESQRQMGRIL
GLGLAMCTRILLLLSLAWVMRLTTPLFTLFEQQISGRDLILLFGGLFLVA
KSTHEIHATMHPDEGEDETKGKKISFLGTLMMIAILDIIFSLDSVITAVG
MANNVEVMIIAIIIAVGVMMLAAKPIGDFVENNPTLKVLALSFLILIGVT
LIAESLDFHIPKGYIYFAMAFSVTVEMINLRTRKHLTIKE
>MS0948 tesB, TesB protein
MSEVLDNLIHLLKLEQLDDALFRGGCQDLGFRQVFGGQVVAQALSAAMQV
APKDRLLHSCHAYFLAPGDSQRPIIYDVETLREGRNFTALRVKAIQYGHP
ICHVTASFQVEESGFEHQVKMPEIGSPEEFMSESDALKKAAQYIPESVRD
KFTAERPFDIRAKYINNPFLGTELPPEQYIWVRANGKSPLDQNIQKCLLA
YFSDFHCILTALHPHAKGFLQKGMKVATIDHSIWFHRSFDLNDWLLYAIE
SNNAFAARGLARGQIFDKAGRLIATTQQEGLIRYVE
>MS2301 tfoX, TfoX protein
MNRTNKDTQWIRTILNSFLENEVTAKHLFVGYGLFYRKVMFGIVIDDNFF
LKAENQLVEYVEKLGAVSWDIFNKNTNLAISSYYRLPRALVDNEEEFKTL
VILSIKQQQRKILDLNIAKKERIKELPNLSIKHERLLAKIGINNVKEFKS
AGISNCFVKLKVHGFSVNVELFWLFQAALKNKHVSLLTKSEKKSALLVLN
RKLVEAGFREIKHECLI
>MS1559 tgt, Tgt protein
MKFKLKTTSGAARRGELTFNRPQGEFSVQTPAFMPVGTYGTVKGMTPEEV
RATGAEILLGNTFHLWLRPGQEVMRKHGDLHDFMQWHRPILTDSGGFQVF
SLGKLRKITEEGVKFQNPINGERIFLSPEKSMEIQYDLGSDIVMIFDECT
PYPATFDYAKKSMEMSLRWAKRSRDRFDELGNKNALFGIVQGGTFEELRK
VSAENLINIGFDGYAVGGLAVGEPKEEMHRILEFTTPLLPADKPRYLMGV
GKPEDLVEGVRRGIDMFDCVMPTRNARNGHLFVTDGIVKIRNAKYKDDTS
PLDPHCDCYTCQHYTKSYLYHLDKCGEILGARLNTIHNLRYYQRLMEQIR
TAIEQDRFDDFVQEFYARMDKPVPPLQKA
>MS0480 thdF, ThdF protein
MMTKETIVAQATPIGRGGVGILRVSGPLATEVAKAVVDKELKPRMANYLP
FKDEDGTILDQGIALYFKSPNSFTGEDVVEFQGHGGQVVLDLLLKRILQV
KGVRLARPGEFSEQAFLNDKLDLAQAEAIADLINASSEQAARSALKSLQG
EFSKKINQLVDSVIYLRTYVEAAIDFPDEEIDFLADGKIEGHLNDLIGQL
DKVRSEAKQGSILREGMKVVIAGRPNAGKSSLLNALAGREAAIVTDIAGT
TRDVLREHIHIDGMPLHIIDTAGLRDATDEVERIGITRAWNEIEQADRVI
LMLDSTDPDSKDLDQAKAEFLSKLPGNIPVTIVRNKSDLSGEKESIEEQE
GFTVIRLSAQTQQGVSLLREHLKQSMGYQTGTEGGFLARRRHLEALEHAA
EHLQIGRVQLTQFHAGELLAEELRIVQDYLGEITGKFTSDDLLGNIFSSF
CIGK
>MS0677 thiD, ThiD protein
MVIPQVLTIAGSDSGGGAGIQADLKTFQMRGVFGTSVITAVTAQNTLGVF
DIHPIPLASIQAQLRAVAKDFSISAVKIGMLGNTEIIQCVADCLEQYQFS
HIVLDPVMIAKGGATLLEQSAVAALKNLILPKACLITPNIPEAERITGTQ
IKNEADIFNAAQIFHELGANTVVIKGGHHNNSQSKLCKDWVFTQKGYFTL
EAPRFATPHTHGTGCTFSACLTAELAKGKPVEQAVRTAKNYITAAIGHPL
NIGHGHGPTNHWAYQNEQD
>MS0676 thiE, ThiE protein
MNKIKSMLSVYFIAGSQDCRHLPGEPTENLLTILQRALEAGITCFQFREK
GEQSLACDLQLKRRLALKCLQLCRQFQVPFIVNDDVELALSIQADGIHVG
QKDTAVETILRNTRNKPIIGLSINTLAQALANKDRQDIDYFGVGPIFPTN
SKADHSPLVGMNFIRQIRQLGIDKPCVAIGGIKEESAAILRRLGADGVAV
ISAISHSVNIANTVKTLAQK
>MS1004 thiF, ThiF protein
MSELTHAEELRYNRQIVLKSVDFDGQETLKDSKMLIVGLGGLGCAASQYL
AAAGIGHLTLLDFDTVSLSNLQRQVLHNDERLDMPKVESAKIALQAINPH
IEINTINGLLSEEKLAEIIPHFDLVLDCTDNVAARNQLDLCCRQAKVPLI
SGAAIRMEGQVSVFTYEPGTPTYGHLSRLFGENALSCVEAGVLSPIVGIV
GSIQALEAIKVRLKIGKNLCGRLLMIDGLNMSIREIKIPSI
>MS1062 thiI, ThiI protein
MKFVIKLFPEIMIKSESVRKRFVKILTGNIRNVLNKYDDGVAVVKHWDYI
EVRSKNEENRAILIDVLGRIPGIHHFLEVDEKPFTDMHDIFEQTLADVGA
SLENKTFCVRVKRKGKHEFSSLDVERYVGGGLNQAIETAKVKLSNPDVTV
RIDIENDHMMLIKARHEGIGGYPIGTQEDVLSLISGGFDSGVSSYMFIRR
GSRVHYCFFNLGGAAHEIGVKQMAYHIWNRYSGSHKVRFVAINFENVVGE
ILEKIDNGQMGVVLKRMMVRAASKVAERFGIQAIVTGEALGQVSSQTLTN
LRLIDEAASALVLRPLITHDKEAIIAMAKQIGTDDIAKSMPEFCGVISKN
PTVKAIKEKIEKEELNFNFDVLESAVQNAQYLDIRQIAEQTEKDVVKVDT
VSVLSANDVILDIRSPEEHDERPFELAGHEVKHLPFYKLSSQFGDLDQSK
NYVLYCERGVMSKLQALYLKENGFANVRVFAHGNIN
>MS1423 thiJ, ThiJ protein
MTTSIKPVLCVVTSAPIKGKSGIPTGFYLAELTHALDEIEKAGLKTVIAS
VRGGQPPIDGFDLTDPVNAKYWNEGDLYERLANTPALSELNGADYSAVFF
AGGHGTMWDFAQSAEVHRIVSEVYTSGGVVSAVCHGPAALVGAKLPNGEF
VVNGKNIAAFTNAEEVEVEGDKLVPYMLQTELEKQGAIHHAAPNWAENVI
VDGQLVTGQNPASAKGVGAALAKVLLEK
>MS0974 thiL, ThiL protein
MAEGEFDLINRYFALSKQIPRQDVILSIGDDCAITQLKADQRLAVTTDTM
VENSHFYPTIPPRALGYKAVATNLSDLAAMGATPTWVSLALTLPKTDSHW
LNEFSKGMFEILNRYDVTLIGGDTTKGEIPTVTITAQGLLGDYALCRHQA
KIGDWIYVSGTLGDALAGFYLNRDIYEGKKSAVGFDEDFFIQRNLYPIPR
IEFGKTLAKYRLANAALDISDGFMGDLMHILERSQVSAVVNLEDLPLSPQ
LSAYYGREKAELMALQGGEDYELCFTVSDENKTKLDQYLAPLNVPYTCVG
QICAAEDNPVIRLQRYKQEVNLSIHSFDHFK
>MS1586 thiP, ThiP protein
MRPIITLKIARGCMNSAKIPFIQTANFWILLSALAFLILPSQALDYGLLE
STADEFYDAMGWSSLNLTMLWFLPLLGFLLTPRLGLSATAQAKAELGLVA
FTTLFAFVSATVYKVSMGYSVIILLASLTALATFALAKLKIMQGDKFIIA
SLLAIILLIFFFIVYPTVAILISMFYDGDTFTPQQVLRILSQSYIIRVIT
NSLALSGFVGVVSTIFGLAFALYTTRIAKRTAFIGKIFSILPIVTPPFVV
GLGVTLMLGRSGYVTEFLSDNFGFTNQNWLYGFNGIAIAQILAFAPISFM
ILDGALKSIHPSIEEASYTLRANRYQTFYNIIFPLLRPALANSFLIVFVQ
SLADFSNPLVLGGSFDVIATQIYFYIAGSQLDYASASTLGSMLLIFSLLI
FIVQYMWIGNRSYVTVSGKSYRGDVQDLPGGLKWTIIGMLAFWVIFNLAL
YGSIFYGSFTVNWGVDYTLTTRNYQLMFGQGFSDGAWPSLLNTMLYAGIA
APLTALFGLLIAYIVVRKDFQGKKTLEFLTMLCFAVPGTVAGVSYILAFN
NAPMYITGTGVIIIISMVMRDLPIGMRAAIAGLGQLDKSLDEASLSLKGS
SFKTIWYIVFPLLKPALLSALVTSFVRAMTTVSAIIFLVTADTRVATAYI
LNRVEDGEYGVAIAYGSVLIIVMMAIILFFDWIVGDTRISRSKAKKMN
>MS1525 thiP, ThiP protein
MNKWFSRRHIMRPSQYAAGLSVLLLLVCVYGGALQAVFNTGEPYPWRNLW
QDDYLHRVLLFSFGQASLSAFLSVFIGGIFARAFFYQSFKGKDFLLKIFS
LTFVLPSLVAIFGLLGIYGSSGWAVKLLQAVHIQWRPDIYGLSGILIAHL
FFNIPLAVRMFLQGLHNIPNQQRQLAAQLNLRGWQFIRLIELPYLRQQLL
PVFMLIFTLCFTSFAIVLTLGGGPKYTTLEVAIYQAIIFDFDLAKAALFA
LLQFVFCFTLFGLSTLFSAAPETNLSYKELWIAKQSSAVKIWQILVLILV
GLFILLPLVNIVAAAFSAEEFISAWQDPQLWRAMGFSFTIAPLSACLSLL
MSMGLLLLSRRLVWLHLTKIANVIMNLGMLILAVPGLILAVGLFLLLQKM
EFGTAHLFVVMVMCNAFSAMPFVIRILAPAMNNNMQYYEKLCQSLGIRGW
QRFRLIEQHKLAQPIKYAFALACTLSLGDFTAIALFGSQQFSSLPYLLYQ
QLGSYRGDQAAVTALVLLLMCLLIFILVEGAKNSDKNENANDKT
>MS1702 thrB, ThrB protein
MLRIYAPASSANISVGFDTLGAAISPIDGSLLGDVVQIEDIPAGFELESA
GYFVRKLPKEPQKNIVYQAYVLFSERLKLRNGHVKPLRLTLEKNMPIGSG
LGSSACSIVAALVALNMFHNEPFSKMELLEMMGELEGRISGSIHYDNVAP
CYLGGVQLMVQSLGNICQQLPFFDSWYWVLAYPGIEVSTAEARAILPKSY
TRQDVIAHGRHLGSFVHACHTQQDVLAALMMKDVIAEPYRESLLPNFAEV
KQASRDLGALATGISGSGPTIFSIAPDLAVATKLANYLENHYLQNNEGFV
HICKVDNQGTRALG
>MS1701 thrC, ThrC protein
MNLYNIKHPEEQVNFAQAVRQGLGKDQGLFFPEVIPALDNIDELLALPLV
ERSQKILGALIGEEIPAEKLNTMVKNAFTFPAPVAKVEEGVYALELFHGP
TLAFKDFGGRFMAQALATVRGDGKITILTATSGDTGAAVAHAFYGLENID
VVILYPQGKISPLQEKLFCTLGGNIRTVAINADFDACQALVKQAFDDEEL
RRAIGLNSANSINISRLLAQVCYYFEAAAQLSPSERSNLVVSVPSGNFGN
LTAGLIAKTLGLPIKRFIASTNANDTVPRYLAKGKWEPNATVATLSNAMD
VSRPNNWPRVEELFKRNGWALSELHSGAVSDAQTEETLRDMNAKGYLCEP
HGAIAYRVLKQDLQAGETGLFLCTAHPAKFKESVERILNTQLPLPQALAK
HAELPLLSDVMENDFAALRAYLLKK
>MS1053 thrS, ThrS protein
MPIITLPDGSQRQFDNPVSVLEVAQSIGAGLAKATIAGRVNGERRDACDM
ITEDSTLEIITAKDEDGLEIIRHSCAHLLGHAIKQLFPDVKMAIGPTIDN
GFYYDVDLEHSLSQEDLDALEKRMLELAKTNYDVVKRRVSWQEARDTFEK
RGEPYKMAILDENIAKDDHPALYHHEEYIDMCRGPHVPNMRFCHHFKLQK
VAGAYWRGDSKNKMLQRIYGTAWADKKQLNEYLTRLEEAAKRDHRKIGKA
LDLYHMQEEAPGMVFWHNDGWTIFRELETFVRTKLKEYDYQEVKGPFMMD
RVLWEKTGHWQNYGDLMFTTQSENREYAIKPMNCPGHVQIFNQGLKSYRD
LPIRMAEFGSCHRNEPSGSLHGLMRVRGFTQDDAHIFCTEDQIESEVTSC
IRMVYDIYSTFGFTNIAVKLSTRPENRIGDDAMWDRAEQGLANALAHNGL
QYEIQEGEGAFYGPKIEFALRDCLDREWQCGTVQLDFALPGRLNASYVAE
DNDRRTPVMIHRAILGSIERFIGIITEEYAGFFPSWLAPVQAVVMNITDS
QAEYVQKVTKTLSDAGLRVKSDLRNEKVGFKVREHTLRRVPYMLVCGDKE
ISEGKVSVRTRRGADLGTYSVEEFVEILKNQVRSRELKLLGEE
>MS0405 thyA, ThyA protein
MRQYLDLCQRIVNEGCWIENKRTGKRCLTVINADLTYDVANNRFPIITTR
KSYWKAAIAEFLGYIRGYDNAADFRKLGAKTWDANANENQVWLNNPHRKG
TDDMGRVYGVQGRAWRKPNGETVDQLRKIVNNLSRGIDDRGEILTFLNPG
EFDLGCLRPCMYNHTFSLLGDTLYLTSYQRSCDVPLGLNFNQIQVFTFLA
LMAQITGKKAGQAYHKIVNAHIYEDQLELMRDVQLKREPFPSPKLEINPD
IKTLEDLETWVTMDDFNVVGYQCHEPIKYPFSV
>MS1849 tig, Tig protein
MFGVKPSPSAHSIRENSIRGKQRMTTIETTQGLERRVSITVPAETVTTAV
RDELKRVAKNARVDGFRKGKVPAQIIEKRFGASVRQDVLNDLLPRHFFDL
AFKEKVNLAGRPTFAVENYEEGKDLQFTATFEVYPEIQLQGLENIKVEKP
VVEITDADVDNMVEVLRKQQATWAETDNAATKDDRVTIDFVGSIDGEEFQ
GGKANDFVLAMGQGRMIPGFEDGILGHKAGEQFDIEVTFPEDYHVENLKA
KPAKFAITVKKVEVMVLPELTADFIAKFGPNTKTVDDLRAEIRKNMQREL
KNALTARVKNQVIDGLIEQNQIDVPFAAVDQEIEVLRNQAAQRFGGNGEQ
AAQLPRELFEEQAKRRVQVGLLLAEVISSNELKADEEKAKAMIEDIASAY
EQPAEVVEYYSKNNELMNNIRNVVLEEQAIDAVLAKAQVTEKASSFDEVM
NPQA
>MS0057 tktA, TktA protein
MTDHKKLANAIRFLSMDAVQKAKSGHPGAPMGMADIAEVLWRGFMKHNPT
NPKWADRDRFVLSNGHGSMLIYSLLHLTGYDLSIEDLKNFRQLHSKTPGH
PEYGYAPGVETTTGPLGQGITNAVGMAIAEKTLAAQFNREGHDIVDHHTY
VFLGDGCLMEGISHEACSLAGTLGLGKLIAFYDDNNISIDGHVDGWFSDN
TKGRFEAYGWQVIDNIDGHNPEQIAAAVKLARAETEKPSIIICKTIIGYG
SPNKSASHDCHGAPLGDEEIALTRKALNWEYAPFEIPADIYAAWDAKSAG
QSAEAVWNEKFAAYEKAYPELAKEFKRRVNGELPANWAAESQAFIEKLQA
NPASIASRKASQNAIEAYAHILPEFLGGSADLASSNLTLWSGSKPIRAKQ
NVDGNYINYGVREFGMSAIMNGIALHGGFIPYGATFLMFYEYAHNAVRMA
ALMKQRSLFVYTHDSIGLGEDGPTHQPVEQTASLRLIPNLETWRPADQVE
SAIAWKAAVERKDGPSALIFTRQNLAQQDRTSEQLANVARGGYILKDCAG
TPELILIATGSEVELAVKAAEALTAEGKAVRVVSMPSTNVFDKQDEAYRE
SVLPSSVTKRVAIEAQIADFWYKYVGLEGRIVGMNRFGESAPADQLFKLF
GFTVENVVAKAKEIL
>MS0682 tldD, TldD protein
MEISQNQTALLKQQEQALRDAVSYAVEIAQKAGASAEVAVTKVNGLSVST
RLKEVENVEFNNDGALGISVYLGQQKGNASTSDLSKDAIKNAVEAALAIA
KYTSPDECAGLADKELMAFEAPSLALYNPAEVDVDQAIELALQAETAALN
YDKRIVNSNGASFNSHNGVRVYGNSYGMLQSYLSSRYSISCSVLSGIDDE
LENDYEYTVSRDLNALESPVWVGENAAKKAVARLQPRKITTQEAPVIFLN
DVATGLIGSLAGAISGGSLYRKASFLLDHLGRQILPDWFHISERPHLTGR
LASTPFDSEGVKTQSREIVEQGILRTYLLTSYSGRKLGMQSTGHAGGIHN
WLVRPNANGDLDSLLRQMGRGLLVTDLMGQGVNMVTGDYSRGAAGFWVEN
GEIQYPVAEITIAGRLKDMLRDIVAVGDDIEQRSNIQTGSILLESMKISG
N
>MS0780 tldD, TldD protein
MLNKVVESLLTPSNLSVKDLPNIFDQLAHRHLDYSDLYFQLSQDESWVLE
DGIIKEGGFHIDRGVGVRAISGEKTGFAYSDQINLTSLQQCANAVKGIAP
AEQGRIITPTGFNRVNPILRYAAVNPLDTLTKEQKIELLYLVDKTARGMS
PYVSRVSASLSSIYEEVLVAATDGTLAADIRPLVRLSVSVLVEKEGKRER
GSAGAGGRFGLNWFLESFEGEVRAVSFTKEAVRQALVNLEAIPAPAGLMP
VVLGAGWPGVLLHEAVGHGLEGDFNRKESSLFSGKIGELVTSPLCTIVDD
GTLENRRGSLTIDDEGTPSQRNVLIENGILKGYMQDKMNARLMGVAPTGN
GRRESYANLPMPRMTNTYMLSGDSKFEDLIGSIDRGIFASHFGGGQVDIT
SGKFTFSTTEAYLIEKGKITRPVKGATLIGSGIEVMQQVSMVADNMEIDH
GIGVCGKEGQSVPVGVGQPALKIERITVGGTN
>MS0569 tmk, Tmk protein
MNKNMNGKFIVLEGLEGAGKTTARQAVVEQLNALGITDLLFTREPGGTPL
AEKLRNLIKYETEEPVTDKAELLMLYAARIQLVENVIKPALAQGKWVIGD
RHDLSSQAYQGGGRQIDSHLLETLKKTVLGDFEPDFTLYLDLSPAIGLAR
ARGRGELDRIEQQNLAFFDRTRTRYLELVKDNPKAVIINAEQSIERVTAD
IKTAVKNWVNSISL
>MS0722 tolA, TolA protein
MKNNRQNKERSAFLTSLVLHILLFGLLILSSFYHTVEVMGGGEGDGEVIG
AVMVDTGAAAKEWGRIQQDKKGQTDKQKQKKVPDQVDGEQHSPEPEQKIE
EQVEKQKQQELEKQKQLEQQKQAEQQRQQELKAQKEAAEKAKAEAEAKAK
REAEAAKLKADAEAKRLAAAAKQAEEDAKAKAKAEAEAKAKAEAKEKAAE
EAKLKAQAEAKAKAEAEEKAKAEAKAKADAEAKAKAEAKAKADAEAKAKA
DAKAKSDAKAKQAALDDFLNGGDVGGGSAMKGGNANKAGSQGSGAGLGAG
DGGKVGDQYGAVIKREIKRRFLKDPSFAGKVCSIRIQLARDGTITGYQKV
SGPDDICTAALSAVARTKKVPAAPSDDVYNKYKNPIIDFDLK
>MS0723 tolB, TolB protein
MMKFLTRMLSAFAVLFFAISTAQADDEVRIVIDEGVDGARPIAVVPFKTN
GSVPADIAEIVTADLRNSGKFNPIPVSQMPQQPASASEVTPDAWAALGVD
AIVVGQVTATGNGYNIAYQLVDTVGASGGAGAVLAQNSVTVGAKWIRYGA
HTVSDEVFEKLTAIKGAFRTRIAYVVQKNGGSKPYEIRVADYDGFNQFIV
NRSSQPIMSPAWSPDGKRLAYVSFENRKSQLVVQDLGSGARKVVASFQGH
NGAPAFSPDGSRLAFASNKEGQLNIYVMGANGGQPTQLTSGSGNNTEPSW
SPDGSSILFTSDRGGSPQVYRMSSSGGAASPVGGRGSAQISSDGKTLVMI
NGNNNVVKQDVTSGASEVLSTSFLGESPSLSPNGIMIIYSSTQGLGKVLQ
LVSADGRFKARLPGTDGQVKFPAWSPYLDKN
>MS1131 tolC, TolC protein
MLKINKLALAVVLSTALAGCANLDDSYQAAQDDFKQYEEVTKQFNVKDNW
WSLYNDAQLNRVVEQALVNNKDLAKAAISVNSALYQANLLGADLVPSFNG
STSSSASRPIDRHDNSTISHGGSLNVSYTLDLWRRLADAASAGEWSYQAT
QQDMEATRLSLINSVVVTYYQIAYLNDAINATNDTINYYSQINGIMQNRL
AQGVEDRASTDQAQQAVLTARNSLISYQTAKKTAEQTLRNLLNLKPSEPL
NINFPNILNVQTAGVNLNVPVSTIGNRPDVRGYLYRLNSAFKDAKATQKS
WFPEITLGAGLSSSGTRVNNAFNNPVAAGTIGISLPFLDWNHVKWNVKIS
EAAYDTARTNYEQSITTALNEIDTNYFAYTQAQQNFTNLKQKYDYDKRIA
QYYKNRYDAGVSDLKDWLSAINTERASQVSILNAKYSVIQNENAIYSSMA
GYYSPKF
>MS1304 tolC, TolC protein
MKSMHKLSLIAVLVTLAACSSTNVDLNSQIEMPARFEQTAQATGTSEIAQ
WWRNWNYPQLTALIEQGLQQNLEVAMARSRLAEAQANAAYTDADLGPSVS
ASGSASGSRARVDNPLTGGSSTSSGSYQYAAVTASWELDFFGKKRSDRDA
AEAQALSAQDQVYAAQMLVAGQIAESYFNIAALQQRQAVLQQYADVLGKL
KTYVQGRFNAGQANANDVLQTESRLSSIQANLATFDSQIDSNRRAIAILT
GKPAQGFRLSPAVKNPLINLPAAPAGVLPGEVLARRPDLHSYRNQVQAAA
AKLASAKADLYPRFDIQFMGGTGRIDVNSDISELKGWAGLVSGGISLPIF
TNGRIQANIDVADARLKTALLQYDKALIQALADVDNSYQAQFALNRQIRL
LQTAAAQTQKSAVNAEKLFQYGEKTLDNTLSERINALNAQEQLIQARLTH
AKNLVSLYKALGGGWVK
>MS0515 tolQ, TolQ protein
MTTALFDFLKNYSDYIILGLLGLMSFIMIWFVIERFIFLSRVKVHAYSNI
HSLDIDLNRHLTIISTIGANAPYIGLLGTVVGILLTFYDLGNSGGDIDAS
AIMLHLSLALKATAVGILVAIPSMVFYSALGRKVEVNRLKWKALNSQKNI
GA
>MS0720 tolQ, TolQ protein
MSGDLNFFELFIKASIVVQIVILILIAFSVISWTIIIQRSRVLTTALKDS
SAFEDRFWSGEDLTKLYEGLSNRRDGLTGSEQIFYVGFKEFSRLKQANPE
APESIIEGSSRAMNLTMNREIEELEGRIPFLATVASISPYIGLFGTVWGI
MHAFMGLSGAKQATLQMVAPGIAEALIATAIGLFAAIPAVMAYNRLSLRL
NKLEQNYGNFIDEFTTILHRQVFGSKTH
>MS0513 tonB, TonB protein
MRRPSIAGFLGSLLFHGGIAATLFFSFKENDNANGMAAQIIDTNISMEMM
MATMVEETQPTAEPEPQTKEEIVQKEAVEDPTLKKEKPKEKPKEKPKEKP
REKPKEKAKKVPPPVQQGIKADKIVLSNANANSKATGLSNTINSDNPNLA
GQGTGSSEVDAYKIALRREIERHKRYSQRAKMMRKQGTVIVAFNIGNDGS
LSNARVVKSSGTEDLDNSALEAVKNAKSIGQKPAGMANAISVPIAFTIR
>MS0131 topA, TopA protein
MSEPLFQHTKTEECCPQCGSPLQIKQGKKGKFLGCSAYPACDYLKPLSNQ
SESRIIKQLDECCPQCGHPLLIRQGNFGMFIGCGNYPQCHFIVHEDEQPP
AEESVACPECGKGELISRRGRQGKYFYACNRYPHCKFTLPGKPYLQDCPQ
CGGHICLLKKENETYRTFLCVNKSCRHQFDRKKEKT
>MS1096 topA, TopA protein
MSKSLVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTAGASTGEKAKP
VSTKGLTAEEKQALKTEKEKNALVKRMGIDPYHGWKANYQILPGKEKVVA
DLKSLAKKADHIYLATDLDREGEAIAWHLREVIGGDDNRFSRVVFNEITK
NAIKQAFEKPEHLNLDRVNAQQTRRFLDRVVGFMVSPLLWKKVARGLSAG
RVQSVAVKLVVEREREIKAFQPQEYWEVAVVTKTADNQKITLDVAEYKGK
RFDPKNETEAQSAVDFLAKSDYIVSALETKPTTSRPRAPFITSTLQQTAS
TRLNFSVKKTMMLAQRLYEAGYITYMRTDSTNLSRDALNMARSYIERNFG
EKYLPEKPNFYSSKENAQEAHEAIRPSDVNISMNDLQGMEKDAVRLYDLI
WRQFVACQMPAAQYDSTTLTVKAGDYELKAKGRILRFDGWTKVLPQLGKS
AEDQELPALNVHNKLALDEIQPSQHFTKPPARFTEAALVKELEKRGIGRP
STYAAIISTIQERGYVRTENRRFYAEKMGEIVTDRLNQSFAHLMSYDFTA
SMEDMLDQIATGKKDWKTELNQFFKDFSGQLTTAELDELEGGMKPNSLVL
TDIQCPTCGRPMAIRTASTGVFLGCSGYALAPKDRCKTTINLIPEAELLN
VLDDASETKALMERKRCPKCDTAMDSYIIDPHRKIHICGNNPNCEGYLIE
QGTFKIKGYDGPIVECDKCGSDMHLKLGRFGKYMACTACDNTRRILANGE
VAPPKEEPIAFPELKCEKADAYFVLRNSAVGVFMSAHNFPRVRESRPAKV
AELAQYRERLPEKLQYLADAPQQDPEGNPAIISFSRKEKHQYVTSEKNGK
KTKWIVDYIDGNWIERKK
>MS0730 topA, TopA protein
MRLFVAEKPSLARAIADVLPKPHQRGDGFIKCGKNDCVTWCVGHLLEQAE
PDAYNPMFKQWRLEHLPIVPKKWRLIPRKEVAKQLKTVENLIHQADQLVN
AGDPDREGQLLVDEVFNYANLSTDKRNAIQRCLVSDLNPAAVEKAVKKLQ
PNTNFIPLATSALARARADWLYGINMTRAYTIRGRQAGYNGVLSVGRVQT
PVLGLIVRRDLEIENFQPKDFFEVLAHIQTEDETPQKFTALWQPSKACED
YQDDDGRVLSLGLAENVVKRITGQPAEVTEYTDKREKETAPLPYSLSALQ
IDAAKRFAMSAQDVLDTCQRLYETHKLITYPRSDCRYLPNEHFAERMPVL
NAISTHCKEYQPLPEVLNTEQKNRCWNDKKVEAHHAIIPTAKNRPVNLNS
QELNIYTLIARQYLMQFCPDAEYRKSKISLKIAGGNFVAQARNLQIAGWK
ELLGKEDENEQLEPSLPIVKKGQQLFCEKGEVISKKTQPPKPFTDVTLLS
AMTGIARFVQDKELKKILRETDGLGTEATRAGIIELLFKRGFLYKKGRNI
HSSEAGRILIQALPDMATQPDMTAQWEAQLDGISRKQASYQQFMATLTEL
LPELVQFVNFSALRKLSAVANNPKPKNFKKKAKIAQSTETKKEV
>MS0587 torC, TorC protein
MSKRKKISMWAAAVLLVIGALLLLGSQYVMKATSSTEFCVSCHSMEYPAE
EWKASGHFSNTKGIRAECADCHIPHDGIDYVKAKVIALKDVWFTLTNKIP
DRATFEEQRGELAQRVWDEMKANDSATCRSCHNEDAMIVSEQSDSAQKMH
KLAKETNQTCIDCHKGLVHFMPETHAVASVQENVPPQAVQIVDNQPLYAS
NVSTATLIDGGEARLLPYAELANWKEEDNNFIGTIEGWQQTGAESLIYKE
LGKRINVAVLNEEAKTHVNVVNTVHDEVTDSDWKKVNINVSVPKSAVTSN
LESLNQYGHNLNQTHCSGCHAAIGADHYTANQWIGVVNSMKDRTSMTANE
VRALTIYLQRHAKDMH
>MS2277 torC, TorC protein
MIKKFWNWFRSPSKIAIGAVVLLSALGGILAWGGFNAGLEYTNTEEFCSS
CHMNDVVPEYRQTIHYSNRSGVKAICADCHLPHEFIPKWTRKIQASREVF
AHFTGKVDTKEKFEAHRLEMAEREWARMKANNSQECRNCHNFEDMDFTQQ
KTVAQEMHAAAEQQGKTCIDCHKGIAHNLPHMEKVQKTFIPEDMIKPQEP
AVQ
>MS0837 torD, TorD protein
MSETIINNFSLISRLFGNLFYRSPTDSILDGVFGWLQQKGLEQVWPLDTD
EDVRQALDSVQMTIAKEVLAQEYERLFAGEQPKIDSRISAYGLNVDEFIN
FRQTRRMPEVESADNFSLLLLTASWIEDNLDSISAQQELFESFLLPCASK
FLTHVETYALLPFYRSLALLTREILAAMADELEENE
>MS2335 torD, TorD protein
MVKNTALLSLKQQKSAMNFKEILMDNALLQWISTGGRLLGAVFYYEPKDK
RVQPVLDFFRQPDWTKDWATLANPALINALIEKSAQQDLSQAYQYLFIGP
NELPAPPWGSVYLDKESVIFGDSLLALRDFLTVHQIEFIQTQNEPEDHLG
LMLMLAAYLAENKPELLEEFLTKHLFSWVYRCLDLIFAQTDYPFYQAMAL
LARQTLKGWQQQLDLQVDQPQLYR
>MS0324 tpiA, TpiA protein
MARRPLVMGNWKLNGSKAFTKELIEGLKAELAGVEGCDVAIAPPVMYLAE
AEAALAGSKIVLGSQNVDVNVKGAFTGDISTEMLKDFGAKYIIIGHSERR
TYHKESDEFVAQKFGALKEAGLTPVLCIGESEAENEAGKTEEVCARQIDA
VINALGVEAFNGAVIAYEPIWAIGTGKSATPAQAQAVHAFIRGHIAAKSQ
AVADQVIIQYGGSVNDANAAELFTQPDIDGALVGGASLKAPAFAVIVKAA
AKAKA
>MS0379 tpiA, TpiA protein
MKKYYFGTNLKMYKGIADTTRFLAQLSELTHDIRANNDIELFVIPSYTAI
QSAIQTTLATSHDNPIIIGAQNMNPNDNGQYTGDISPLMLKEIGTQLVMI
GHSERRHKFGETDRQENEKVLSALKHDLTTLLCVGETLEQKNYNISDEVL
RTQLKIGLSGINTNQLAKLRIAYEPVWAIGESGIPATADYANEKHAVIKQ
CLIELFGDAGKDIPVFYGGSVNAENSNELFGQQYIDGLFIGRSAWDAENF
FKIIERIVNK
>MS0978 tra5, Tra5 protein
MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRGSAGKRTIQSHLKEKG
IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT
TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH
AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP
MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI
LAKTTYRRLLN
>MS1602 tra5, Tra5 protein
MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG
IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT
TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH
AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP
MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI
LAKTTYRRLLN
>MS1577 tra5, Tra5 protein
MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG
IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT
TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH
AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP
MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI
LAKTTYRRLLN
>MS2299 tra5, Tra5 protein
MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG
IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT
TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH
AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP
MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI
LAKTTYRRLLN
>MS1804 tra5, Tra5 protein
MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG
IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT
TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH
AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP
MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI
LAKTTYRRLLN
>MS2195 trkA, TrkA protein
MKIIILGAGQVGTTLAENLVSEDNDITLVDNELLRLEDLQDKHDLRVVAG
SASSPRVLREAGAPDADLLVAVTSSDEVNMVACQMAYTLFHTPTKIARIR
NSEYLREKDKLFHNDMIPIDHIISPENLVTEEIIRLIDYPGALQVAHFAD
RRISLVVLKAYYGGPLVGYAISMLKEHLPYIDYRIVSILRHDKLIRPQGS
TIIEAGDEITFISATVHIKAIMAEIQRLDKPYKRIMIVGGGNIGAGVAKQ
LEAGCSVKLIERNAEKAKSLAEKLSNTLVFHGDASDQSLLFEEHIENIDV
FISLTSDDEANIMSALLAKRLGAKKAMVLIQRMAYINLIQGGTIDIAVSP
QQATISALLTHVRKGDVKNVVSLRHGLAEALEVVVHGDAATSNVVGRKVS
ELKLPQGVILGAVLRNEEVIIAKKQVVIEENDHVVIYLSDKKNISEIEKL
FQPSAFFI
>MS0175 trkG, TrkG protein
MHILSIVRIVGILVMCFSVAMLAPAFVALIYGDGGGKAFMQSFVISLIVG
TTLWWSCHSHKQELRSREGFIIVVAFWVVLGSLASIPFMLFEYPDLTVAS
SFFEAFSGLTTTGATTIVGLDDLPKAILFYRQLLQWMGGMGIIVLAVAII
PLLGIGGMSLYRAEMSGPMKEQKMRPRIAETAKILWFIYASLTILCALAY
YLAGMSPFDAISHSFSTVSIGGFSTHDASIGYFNDSWINLITVVFLWISA
CNFALHFRAFSEINKGGFFKIYRNDPEFRFFVSIQVILILICSAVMLSHS
YFETTWENIEQVIFQSVSISTTTGYTTSDFSAWPSFVPMLLIIASFIGGC
AGSVGGGVRVARILVLYLQGKRELKLFVHPNLVYPIKWGKRILDERVIGS
IWAFFSAYLLVFIICLLGVIACGVDVFNAFNAVLACINNLGPAMGIVNSN
MVEIPDSAKCILTIAMVCGRLEIFTLLALFSPTFWKA
>MS0395 trmA, TrmA protein
MTTKCPHFQTQQCQSCQWINRPYDEQLNEKQIHLKQQIAPLDQSQLRWSA
PFQSRQSGFRNKAKMVVSGAVERPVLGILKDQNAPQSAVDLTDCLLYSAG
FKPIFPVLKDFIGRAGLVPYNVAKQKGELKYILLTESGYQGDIMLRFVLR
SENKIPLIRRELAKLREKLPQLKVISANIQPQHAAILEGEKEIFFTERQV
LEERFNRIPLFIRPQGFFQTNPQVAEGLYGTAQQWVKDLPVNKLWDLFCG
VGGFGLHCAKALQEKNPDIELTGIEIAPSAIYCAGLSAQKCGLKKVNFQS
LDAANFALNQDENKPDLVIVNPPRRGIGKPLAQFLNQMQPQFILYSSCNA
ISMTKDLLELTHYQLQKIQLFDMFPHTSHYEVLTLLIKR
>MS0240 trmA, TrmA protein
MVLLYTPPQKINKLQREMEVEILDLDYQGLGVAKIQGKTWFVENALPGEK
VRIKIKEEKRQFGLATTKKILEASAQRQTPKCQYASRCGGCQNQHIPVEM
QREAKQKALFRRLLKLQPEGIEFMPMIVGEAFGYRRRVRLSMLFDGKLKR
LEIGFRQKNSAQIVHIEQCEVIEPALNKILSKLTALLSRFSQPKNLGHIE
LVAADNGVAMLLRYSGKLTENDRTLLLDFAVREELMLFLQDDEKTEQIYG
QPPFYQLADNLQLQFDIRDFIQVNSLLNQRMITAALDWLDVQKQDHVLDL
FCGMGNFTLPLSRRVKSAVGIEGISAMVEKAKANAERNQCQNVQFYRADL
DQNFADEVWATEPFNKILLDPPRTGAAFALNALCRLKAEKILYVSCNPAT
LVRDAEILLNSDYRVKKVAMIDMFPHTGHLESITLFEKQS
>MS2367 trmA, TrmA protein
MQQLPIEKYSELLTKKQQKLTALLAPFNAPELSVFVSPVQNYRMRAEFRV
WHDKGDLYHIMFNQQTKQRYRVDCFPIASLLINRMMENLIPLLKEQEILT
KKLFQIDYLSTLSNKIIVSLLYHKTLTEEWQAAAQALKVRLEKLDFDVQI
VGRATKQKICLERDYADEVLPVNGRNYVYRQIENSFTQPNAAVNCKMLEW
AIGCTKNSSGDLLELYCGNGNFSIALAQNFRKVLATEIAKPSVAAAQFNI
AENGIDNLQIIRMSAEEFTQAMNGVREFNRLKGIDLKSYECNTIFVDPPR
AGLDPDTVKLVQNYDRILYISCNPNTLCNNLTELTKTHRIEKAALFDQFP
YTDHMESGVWLIRK
>MS0442 trmD, TrmD protein
MWIGIISLFPEMFKAITEFGVTGRAVKQNLLQVSCWNPRDFTHDKHKTVD
DRPYGGGPGMLMMVQPLRDAIHAAKAEAGDGVKVIYLSPQGRKLDQTGVT
ELAANEKLILVCGRYEGIDERLIQTEIDEEWSIGDYVLTGGELPAMTLID
AVARFVPGVLGKQASAEEDSFAEGLLDCPHYTRPEVLDGYVVPPVLMSGN
HEEIRKWRLKQSLERTWLRRPELLEKLALTDEQKKLLKDIIEAYHIRQSK
TSDNG
>MS0301 trmU, TrmU protein
MLRLNRSYGRTMQNLTNLSSRTYDQHFPKLSAEQLAENAKKKVIVGMSGG
VDSSVSAFILQQQGYQVEGLFMKNWEEDDDTDYCTAAADLADAQAVADKL
GMKLHKINFAAEYWDNVFEHFLAEYKAGRTPNPDILCNKEIKFKAFLEYA
AEDLGADYIATGHYVRRRGDDENARLLRGLDSNKDQSYFLYTLSHKQVGQ
SLFPVGDIEKPIVRAIAEDLGLITAKKKDSTGICFIGERKFKDFLARFLP
AQPGEIRTVDGKVIGRHDGLMYYTLGQRKGLGIGGIKGMDENPFYVAEKD
LVNNVLIVAQGHDNSALLSSGLIARQLHWVDRQPIRENLRCTVKTRYRQT
DIPCEIQPIDDETIRVIFDEPQIAVTPGQSAVFYQGEVCLGGGVIETQIK
>MS1154 trpA, TrpA protein
MARFETLFAQLNAKKQGGFVPFVTLCDPDLERSFDIICTLVDNGADALEL
GFPFSDPLLDGPVIQAANNRALNAGCSTAESFKLLEKVRSKYPEIPIGLL
LCANLIYAQTLDGFYRRCAEIGIDAVLVADIPLLAAEPYIQAAKKHGIQP
VFICPPNADENTVKGVAEHSEGYTYLVSRAGVTSAENQSHAANLDSLVEQ
LKAHNAPPILQGFGIAKPQQVKEALNMGVAGAISGSATVKIIEANLDNHE
KCLADLAEFVKNMKAATL
>MS1153 trpB, TrpB protein
MTDTILDPYFGEFGGMYVPEILIPVLKQLEKAFVEAQQDPAFQTEFLDLL
KNYAGRPTALTLCRNLTKGTKTKLYLKREDLLHGGAHKTNQVLGQILLAK
RMGKTRIIAETGAGQHGVATALACAMLGMPCRIYMGAKDVERQSPNVFRM
RLMGAEVFPVTKGSSTLKDACCEAMRDWAANYENTHYLIGTAAGPHPFPT
IVREFQKMIGEETKAQILQREGRLPDAVIACVGGGSNAIGMFTDFINETS
VRLIGVEPAGKGIETGEHGAPLGHGKPGIYFGMKSPIMQTEDGQIEESYS
ISAGLDFPSVGPQHAYLNSIGRAEYPSITDDEALEAFKELAQHEGIIPAL
ESSHALAYALKMARQNPMREQLLVVNLSGRGDKDIFTVDKIFSERGML
>MS1152 trpC, TrpC protein
MNLNDKPTILQKIVADKIQWIKAKEQVFPLASFKEKITKSDRSFYQSLGK
GTHQNPVFILECKKASPSKGLIRNEFNPADIAQVYKNYASAVSVLTDEKY
FQGDFSYIKQVRDIVTCPVLCKDFMISEYQVYLARYYQADAILLMLSVLD
DETYKKLAALAHELGMGVLTETSNQQELERGIALGAKVMGINNRNLHDLT
VDLARTPPLAQQIPADRIIVSESGIYSHQQVQQLKPYVNAFLIGSSLMGS
DDLNNAVRSVIFGENKVCGLTRPQDVQEVYRQGALYGGLIFAENSKRCVS
LRQAQELVTVAPLRFVGVFQNQQIDFIVKIATQLNLYAVQLHGAENEEFI
AALRIQLPHQIQIWQAVSIDVAQQSAVKIDRISAVDRYVLDSKTANRQGG
TGVAFDWSKIPAEIKNKSLLAGGITPENIELALAQHCLGIDLNSGVESAA
GIKNPEKLTAVFNKIHRF
>MS1151 trpD, TrpD protein
MRIKTRNFIMQTQQILTQLFDNQPLSQEQAAFIFGNIVKGELSNEQLAGA
LIALKIRGETIDEITGAVTALLAAAEPFPAPDYPFADIVGTGGDNADTIN
ISTASAIVAASMGLKIAKHGNRSVSSKTGASDVLTALGVNIRMSTEQARK
ALDEIGIAFIFAQQYHLGFKYAGPVRQALKTRTIFNILGPLINPANPKRQ
LLGVYSPELLKPYAETNLRLNHEHSIIVHGCGLDEVAIHGLTQVAELRDG
KIEYYNLSPKDFGFEPQPLESLRGGAPEENAKILTALLQGKGSEQQAQAV
AMNTALLMKLFGHEDIKQNAQQVLEQLTTGKAFETLTKLTTY
>MS1149 trpE, TrpE protein
MPNAYIQTLSNPVQYQQDLTAVFATVGKTNSLLLESAEISSKNSLQSLLI
INAALKVSCLGQIVTFTALTANGSHVLPLIKEKLQGKTKSLSVQQNKLIA
EFFPIDQNLDEDSKLQSLTVFDGLRVINQLYQHSKQPVFLGGLFAYDLVA
NFIPMNNITLQDDGLSCPDYVFYLAEQLLRLDHPSQQATLQTFCFNDSEL
QNLQQSAVEIDKDLRNLKPLSAIQQGSTDISTNHEDEKFKQIITALKHHI
YIGDVFQIVPSRRFILQCPNTLATYRQLKENNPSPYMFFMQDEEFTLFGA
SPESALKYSADNRQLEIYPIAGSRPRGFDAKGKIDPELDARLELEMRLDH
KEQAEHLMLVDLARNDVARVCESGTRHVKELMQVDRYSHIMHLVSRVVGK
LRPELDALHAYQACMNMGTLTGAPKIKAMQLIYQFEKQKRHSYGGAVGYL
SSDGNLDTCIVIRSAFVQNGIAYVQAGCGEVLDSDPQMEADETRHKAQAV
IKAILQTNAQAN
>MS2193 trpE, TrpE protein
MNFASFIRQANRLGRQKTAFFFLIDFERQKPLISPLESAVENGIIFSVEG
NTNFYRPVELPRQKIRFSSEPVSFERYAAGFALVQQELQKGNSYLLNLTY
PSKINTNYNLAQIFQATKAPYKLLLQDQFVCFSPESFIRIRQNQIFTYPM
KGTIDAALPQAEQQLMQSEKEGREHYTIVDLMRNDLAMVAENIRVRRFRY
IDKISTNRGEILQTSSEITGNLTADWQNRIGSILAALLPAGSISGAPKEK
TVSIIRQAEGGKRGYYSGIFGIFNGEELNSAVAIRYIEQKDGQLYFRSGG
GITSQSRLQEEYEEYCQKVYLPIHCVE
>MS1566 trpR, TrpR protein
MYISRNMEQWTKFIETLRIAFNDGKEQDLLTLLLTPDERDAIGLRLQIVA
QLLDKKIPQREIQQNLNTSAATITRGSNMLKLMSPDFMEWVKKHTNETEN
T
>MS2332 trpS, TrpS protein
MTKPVVLSGVQPSGELTIGNYLGALRQWVKMQDDYECLFCIVDLHAITVR
QDPEQLRKATLDVLALYLACGIDPEKSTIFIQSHVPEHTQLAWVLNCYTY
FGEMNRMTQFKDKSARYAENINVGLFTYPVLMAADILLYQAAQVPVGEDQ
RQHLEITRDIAQRFNAIYGENQFTVPQAFIPKAGAKVMALQEPTKKMSKS
DDNRNNVITLLEDPKSVAKKIKRAMTDGDEPPLVKYDVQNKAGVSNLLEI
LSVITDKPIPQLEKEFEGKMYGHLKTTVADEVVAMLTQLQDRFAHYRNNE
ELLNKIAAEGAKKARARAKETLEKVYNAIGFVAAK
>MS1175 truA, TruA protein
MRTDYKIMKIALGIEYNGKNYFGWQRQEKVHSVQAELEKALSFVANEKIE
VFCAGRTDSGVHGTGQVVHFETNAIRPEKAWAFGTNANLPDDIAVRWAKE
VPDDFHARFSATARRYRYVLYCNKLRSAILPYGITHTHLDLDEHKMQEAG
RFLLGENDFSSFRAAQCQSNTPWRNIHHLNVIRRGNFVIVDIKANAFVHH
MVRNIVGSLMEVGCGNQPPEWIEWLLAQKNRKLAAPTAKAEGLYLVQVTY
PEHFELPQMPLGPLFLADEL
>MS1441 truB, TruB protein
MSRPRKRGRDIHGVFLLDKPQGMSSNDILQKVKRIYQANKAGHTGALDPL
ATGMLPICLGEATKFSQFLLDADKRYQVIAKLGERTDTSDAEGQVVETRS
VNVTEQKILDSLPHFRGDIMQVPTMFSALKHKGKPLYEYARAGIVVEREA
RPISIFELNFISYEAPYLTLEVHCSKGTYIRTLVDDLGEYLGCGAHVSML
RRTAVSDYPADKMLTWEQLQQFAQDEDLAALDARLLPVDSAVSKLPVLSL
SEEQTKAVGFGQRVKFDNLQQLQGQVRLFSPQNVFLGVAEIGKDNVIRPS
RMVNL
>MS1813 trxA, TrxA protein
MKRKVFLFLPLIVLLVICIFLIMGLKQDPKKIASALIGKPVPEFFQADLL
DNNRIISNKHLPKQPYLLNVWGSWCYYCQQEHPLLMELAEQRIPIVGLNY
RDKKQGALEMLTKKGNPFALVIDDSRGELAMKLGVDGAPETYVIDENGVI
RYRYSGAVDKTILQKEILPEFNKLRN
>MS1626 trxA, TrxA protein
MSEVLHATDASFEADVLRSDVPVLVDLWAPWCGPCRMVAPILDDLAAELA
GKVKIVKINIDENQGTPAQFGVRSIPTLLMFKDGQLVGTQVGALPKNQLA
AFVEKNL
>MS1299 trxA, TrxA protein
MRNFLLFLILSLTTFVAHSGLFTGKPQFLKPHEAFILSANKQDAQINLHW
KIADNYYLYKKELRITGENSKIGEIIYPQADKHQDEFFGETEIFRHELFL
AVPVNEQNAASRLEVTYQGCTKGFCYPPETTVLELASLPVGQESQTLTAQ
DSLSQNLLKSKYAVFGFFLLGIGLAFTPCVLPMLPLLSAIVIGQGKRAST
GRSLLLSFVYVQGMALTYTLLGLIVAAIGLPFQVALQSPYVLVTLSAVFV
LLALSMFGLFNLQLPSSLQTKLALFSQKQQSGALGGVFIMGMIAGLIASP
CTSAPLSGALLYVAQTGDLFFGAITLYLLALGMGMPLVLITVFGNRILPK
SGAWMEKVKTAFGFVLLALPVFLLARVLPGIWENLLWSLLTVSFFAWLSF
SMPKGKLGRSLRILFLILAMIAVRPLQNFIWGDYSAPAGNPQSAVEKSEI
SSSTRFKQINNYAQLKQALTSNPKAIAMLDLYADWCVACKEFEKYTFSHP
DVKHKFEQVLLLQVDMTKNSPENAELMEKLSVLGLPTIIFFDRQGNEIVN
SRITGFLNAKQFLSLIEKYL
>MS0607 trxA, TrxA protein
MNKKLYLPLLIFLILVGAFFIQLRQNASGGDPKLLESALVGKPVPEKVMQ
GLFDNKDYTSEVFKQGKPILLNVWATWCPTCYAEHQYLNELAKQGITIIG
VDYKDDSAKAVKWLKDLDNPYQLVLKDEKGSLALDLGVYGAPETFIIDGK
GVIHYRLAGDVNEKVWKNTLLPIYNQLFEDGK
>MS2059 trxA, TrxA protein
MALSNNFRFILRIYRMKKLFLACFAALATVTFQVQAADLTEGKQYEVLAL
EHSAQPEVVEFFSFYCPHCYSFEMQYKIPEKIKQAIPANASFKQYHVNFL
GSQGENLTRAWALAMAIGAEDKIRAPLFKAAQANSLRSMDDIRQIFIDNG
VTAEQFDGSINSFAVTALVNKQTNLAEQFKVRGVPDFYVNNKFHINMEGL
SHDNFVQDYVDTVNELLSK
>MS1538 trxA, TrxA protein
MFSMKIKFCCYLFLVCLAPAVFAQKNTAGTHAAQAIASPGNRAAKNEFQD
GQDYFSYSTPIHTENRRDGKILIQSFFDYDCRVCVNTLDILELYSKINPN
KVVVEEYPIATKETTFSAQVYYSLKRMNHEDIAELLLFETTDIERYRELT
KFENLLAYLKQQNVDEKLFTDIYQSAEIRRQVSEAIYRTEKYGVFTYPFV
VIGGKYVLTNSTLYNDDYTFAVLDFLVHELSAVSTATHSK
>MS0951 trxB, TrxB protein
MSDIKHSKLLILGSGPAGYTAAIYAARANLNPVLVTGLEQGGQLTTTTEI
ENWPGDFAETTGPELMQRMLQHAEKFDTEIVFDHINRVDFSSRPFKLYGD
VQTFSCDALIIATGASARYLGLPSETEYKGRGVSACATCDGFFYRNKPVA
VIGGGNTAVEEALYLANIASEVHLVHRRDAFRAEKILIDRLYKKVEEGKI
ILHTNRNLDEVLGDNMGVTGVRLKDTQSENTEEIKIDGLFVAIGHAPNTA
IFADQLELNNGYIVVKSGLNGNATATSVEGIFAAGDVMDHNYRQAITSAG
TGCMAALDAERYLDALEA
>MS1932 tsf, Tsf protein
MAEITASLVKELRERTGAGMMECKKALVEANGDIELAIDNMRKSGQAKAA
KKAGRVAAEGVILARIAEGHGVLVEMNCETDFVAKDAGFLSLANAVADYA
VANKGVTIEALQAQFEEQRAALVAKIGENMTIRRVAEIEGKVIAQYLHGA
KIGVLVAGEGSADELKKVAMHVAASKPEFVNPEDVSADVVEHERQIQIDI
AINSGKPKEIAEKMVEGRMKKFTGEVSLTGQAFVMDPSQTVGSYLKSVNT
SVANFIRLEVGEGIEKVEADFAAEVAAMQKV
>MS0165 tufB, TufB protein
MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLSKHYGGAARAFDQI
DNAPEEKARGITINTSHVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMD
GAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELV
EMEVRELLSQYDFPGDDTPIIRGSALKALEGEAQWEEKILELANALDTYI
PEPERAIDQPFLLPIEDVFSISGRGTVVTGRVERGIIRTGDEVEIVGIKE
TAKTTVTGVEMFRKLLDEGRAGENIGALLRGTKREEIERGQVLAKPGSIT
PHTDFESEVYVLSKEEGGRHTPFFKGYRPQFYFRTTDVTGTIELPEGVEM
VMPGDNIKMTVSLIHPIAMDQGLRFAIREGGRTVGAGVVAKIIK
>MS2187 tufB, TufB protein
MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLSKHYGGAARAFDQI
DNAPEEKARGITINTSHVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMD
GAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELV
EMEVRELLSQYDFPGDDTPIIRGSALKALEGEAQWEEKILELANALDTYI
PEPERAIDQPFLLPIEDVFSISGRGTVVTGRVERGIIRTGDEVEIVGIKE
TAKTTVTGVEMFRKLLDEGRAGENIGALLRGTKREEIERGQVLAKPGSIT
PHTDFESEVYVLSKEEGGRHTPFFKGYRPQFYFRTTDVTGTIELPEGVEM
VMPGDNIKMTVSLIHPIAMDQGLRFAIREGGRTVGAGVVAKIIK
>MS0263 typA, TypA protein
MCGVPKIFFLSLKITDIANLFETSASPILFVKQLRTFQMAELDIHKLRNI
AIIAHVDHGKTTLVDKLLQQSGTLETARNGDSDERVMDSNDLEKERGITI
LAKNTAINWNDYRINIVDTPGHADFGGEVERVLSMVDSVLLVVDAFDGPM
PQTRFVTQKAFAHGLKPIVVINKVDRPGARPDWVVDQVFDLFVNLGATDE
QLDFPIIYASALNGVAGLEHEELAEDMTPLFEAIVQHVEPPQVELNAPFQ
MQISQLDYNNYVGVIGIGRIKRGTVKPNQSVTIIDSFGKTRNGKIGQVLG
HLGLQRYEEDLAQAGDIVAITGLGELNISDTVCDINAVEALPALSVDEPT
VTMFFCVNTSPFCGQEGKFVTSRQILERLNKELVHNVALRVEETPNPDEF
RVSGRGELHLSVLIENMRREGYELAVSRPKVIYKEENGHKQEPFEQVTID
IEEQHQGAVMEALGIRKGEVKDMSPDGKGRTRLEYVIPSRGLIGFRNEFM
TMTSGTGLLYSSFSHYDDVKPGEIGQRKNGVLISNATGKALAYALWGLQE
RGKLMAEHGQEVYEGQIIGIHSRTNDLTVNCLQGKKLTNMRASGKDDAIQ
LTTPIKLTLEQAIEFIDDDELVEVTPQSIRIRKKLLTEMDRKRANRTTTS
TSTH
>MS1102 tyrA, TyrA protein
MEALKEIRAEIDQLDRELLEVFAKRLALVKKVGEIKHQQGLPIYVPEREA
DMLAARRSEAEKMGIPADLIEDVLRRVMRESYANEHEHGFKTVNPAIKKI
VIVGGKGKLGGLFGRFLTASGYFVEALGSKDWDNAKAILAGANAVIVCVP
IVKTLETIERLKPYLTEDMLLTDLTSVKRRPLEKMLEIHQGAVVGLHPMF
GPDIASMAKQVVVRCDGRYPERYQWLLEQIQMWGARIYQADAAEHDHSMT
YIQALRHFATFANGLHLSRQPVKLANLLALSSPIYRLELAMIGRLFAQDG
SLYADIIMDKPENLEVIESLKQSYEDSLKFFENGDREGFIKTFNKVREWF
GDYSEQFMKESRQLLQQANDYRHNSL
>MS1031 tyrB, TyrB protein
MQITILVSIKEKLISKHNISKESPMFKNITPAPADPILGLGEAFKAETRE
NKINLGIGVYKDADGVTPIMTAVKKAEGQLFENEKDKNYLPIEGVAEYNA
YAKELLFGKDSEIIASNRACTVQTLGGTGALRIAAEFVRRQTKAQNVWIS
KPTWPNHNAIFNAVGVTIREYRWYNPETKALDWDNLLADLNNANPGDVVL
LHGCCHNPTGIDPTPEQWKALAEMSAKNGWLPLFDFAYQGLANGLEEDAV
GLRTFAETHRELLVASSFSKNFGLYSERVGAFTLVADNADVAAVALTQIK
SIIRTLYSNPSAHGARTVATVLANPELRKEWEDELTSMRDRIKQMRKQLV
ELLKEFGAQEDFSYIIDQKGMFSFSGLTAEQVDRLKEEFAIYAVRSGRIN
VAGITEANIRYLAESIVKVL
>MS0762 tyrR, TyrR protein
MFTVKGYDEGNYFIRSIVGKTMSKNTAKRSAHFTVNQYENFTDVVALSPK
MAALVEKAKKFALLDAPLLIQGETGTGKDLIAKACHNLSARKDQKFIAVN
CAGLPDTDAESEMFGRADGDKTSTGFFEYANGGTVLLDGVAELSLNLQAK
LLRFLNDGTFRRVGEEQEHYANVRVICTSQISLQHYVDEGKVRSDLFHRL
NVLSLQIPPLRERKEDLAVLTENFVRQISRRLGVRTPEFDGQFLQYLKDY
QWPGNVRELYNALYRACSLAEHNKLTIDGLNLSENETVPLTLEQFGNESL
EEIMNNFEASVLRKFYEQYPSTRKLASRLGVSHTAIANKLKQYGIGK
>MS1232 tyrS, TyrS protein
MSDINVVLAELKRGVDEVLSEADLIEKLKENRPLKIKLGADPTAPDIHLG
HTVVLNKLRQFQNFGHEVIFLIGDFTGMVGDPSGKNKTRPPLSREDVLRN
AETYKQQIYKILDPQKTRIVFNSDWLGKLGTEGMIRLASNYTVARMLERD
DFKKRFTEKQPIAIHEFIYPLLQGHDSVALEADVELGGTDQKFNLLVGRE
LQKSAGQKPQVAMTLPLLVGLDGEKKMSKSLGNYIGVTDAPNDMFGKIMS
ISDDLMWDWYDLLSFRPLTEIAQFKEEVKNGRNPRDVKILLAKEIIARFH
SEADADTAEQEFINRFQKGAMPDEMPEFTFEGEIGLANLLKEAGLVASTS
EANRMVQQDGVKIDGEKVEDAKTTISASTHVYQVGKRKFARVTVR
>MS1582 udk, Udk protein
MSDSANSSCIIIAIAGASASGKSLIASTVHRELRDQVGSDDISIISEDCY
YKDQSHLDFATRTQTNYDHPNSMDRDLLLEHLRALKAGKSVDIPQYSYVE
HTRMKEVTHFTPKKVIILEGILLLTDERVRNELSLSLFVDAPLDICFIRR
LKRDMEERGRSLESVIEQYRKTVRPMFLQFIEPSKQYADIIIPRGGKNRI
AINMLKAQILHLLGRK
>MS2318 udp, Udp protein
MSEVFHLGLTKAMLKGAKVAIVPGDPARSERIAKEMENAEYLNSTREFTS
WLGYMDGEPIVVCSTGIGGPSVSICVEELAQLGVRTFLRIGTTGAIQPHI
NVGDILITTGAVRLDGASQHFAPLEYPAVADFACTNALYNAALSQGIRPY
VGITASSDTFYPGQERYDTFSGKVYPKFQGTLKQWQDLNVMNYEMESATL
FTMCAALGLKAGMVAGVIVNRTQQEIPNEATIKSTEQKAVAVAVEAAGKM
AKA
>MS1992 ugpQ, UgpQ protein
MKHKLKTLAIGLAFATLAACSSQTAQQTTPMNNQEKLVIAHRGASGYLPE
HTLESKALAFAQQADYLEQDLAMTKDNHLIVIHDHFLDGLTDVAKKFPNR
HRKDGRYYVADFTLKEIKSLEMTENFKTENGKQVQVYPNRFPMWKSHFTI
HTFEEELEFIQGLEKSTGKKVGIYPEIKAPWLHHQEGKDIAVATLKVLQK
YGYTKKTDPVYLQTFDFNELKRIKTDLLPKMGMDVKLVQLVAYTDWHETE
EKNAQGKWVNYDYDWMFKDGAMAEVAKYADGVGPGWYMLIDDKNSKAGDI
KYTPMVADIAKTKMELHPYTVRKDALPAFFTDVNQMYDALYNHAGATGLF
TDFPDLAVKFLGKDKNKQ
>MS1991 uhpC, UhpC protein
MENFMFGPFKPAPAIAELPADKIDSTYRRLRWQVFMGIFFGYAAFYFVRA
NFDLAQKGLIEAGMYTKTELGIIGTGAGLAYGLSKFVMAGMSDRSNPKVF
LPFGLLLSGLCMTLMGLIPWATSGILIMFVLIFLNGWFQGMGWPPCGRTM
VHWWSKSERGTIVSIWNCAHNVGGMVPGMMVLLASAVYFSNTGVQATAKD
VWQQALYYPGIAAMIAAIPVYFVMKDTPQSCGLPPIEKWRNDYPDDYNEK
TYEHDLTTKEIFVNYVLKNKLLWYIAIANVFVYLIRYGVLKWSPVYLGEV
KHFNIKGTAWAYTIYELAAIPGTLLCGWVSDKIFKGKRGLTGFIFMILTT
IAVFALWKNPATPEAELAQYAGLPFYKNPYQLMDFILMTTVGFLIYGPVM
LIGLHALELAPKKAAGTSAGFTGLFGYLGGTVSASAVVGWAADKFGWDGG
FYVMITGGILAVILMFIVMVAEGKHKAKLSDHYGK
>MS2284 uhpC, UhpC protein
MLSFLNEVRKPTLDLPVEERRKMWFKPFMQSYLVVFFGYMAMYLVRKNFN
IAQNDMIETYGLTKTQLGMIGLGFSITYGLGKTIVSYYADGKNTKQFVPF
MLILSALCMLGFSASMGGSSIALFLMVAFYALSGFFQSTGGSSSYSTITK
WTPRKKRGTFLGFWNLSHNVGGAAAAGVALFGAHVFFNGHVIGMFIFPSI
IALIIGFIGLRYGSDSPEAYGLGKAEELFGEEISEEDKDAEQNQLTKKQI
FVQYVLKNKVIWLLCFANIFLYIVRIGIDQWSPVYAYQELGFSKDAAISG
FALFEVGALVGTFLWGYLSDLANGRRGLLACVALVLIVFTLEFYQFANNE
TMYLVALFALGFLVFGPQLLIGVAAVGFVPKKAIAVADGVKGTFAYLIGD
SFAKLGLGMIADGTPIFGLTGWDGTFAALNSSALICIGLLAFVAIAEEKK
IRRLKKAEA
>MS2287 uhpC, UhpC protein
MNTTFSPEINRTYRYWRIHLMIAMYIGYAGFYLTRKSFNFAVPEMINNLG
IDKNDIGMMATLFYITYGVSKFFSGIFSDKSNPRHFMAVGLIMTGVANIF
FGLSSSVLIFTAVWIINAWFQGWGWPACSKLLTTWYSRNERGRWWSIWNT
AHNAGGALIPLLIGYVTIHYSWRYGFAIAGIVAISIGLFLFWRLRNTPES
LGLPSIGHWRNDELELAQEAEAPNLSWRETLNRYVFLNKYIWLLALSYTL
VYIVRTAINDWGNIYLTEKYHYDLVSANSALAVFEIGGFFGSLVAGWGSD
RLFSSNRGPMALIFAIGIFFSITALWLLPTENYILQTALFFVVGFFVFGP
QMLIGMAAAECSHKKTPGSATGFVGLFAYLGAAIAGYPLALIMQHFHWTG
FFVFIACSASGIALLLLPFLKAQN
>MS0373 ung, Ung protein
MQTWKDVIGTEKTQPYFQHILQQVHAARDAGKTIYPPQHDVFNAFKLTEF
DQVKVVILGQDPYHGPNQAHGLAFSVLPGIVPPPSLLNIYKELENDIAGF
QIPRHGYLVKWAEQGVLLLNTVLTVERGLAHSHANFGWETFTDRVIAALN
RHRENLVFLLWGSHAQKKGQFIDRDRHCVLTAPHPSPLSAHRGFLGCHHF
SKANNYLQEHKITEIDWQLDTQLS
>MS1880 upp, Upp protein
MKLVEVKHPLVKHKLGLMRAADVSTKHFRELATEVGSLLTYEATADLETE
IVTIEGWCGPVEVQRIKGKKVTVVPILRAGLGMMDGVLEHIPSARISVVG
MYRDEETLEPVPYFQKLASDIEERLAIVVDPMLATGGSMIATIDLLKQKG
CKHIKVLVLVAAPEGIKALESAHPDIELYTASIDDHLNQDGYIIPGLGDA
GDKIFGTK
>MS1927 uppS, UppS protein
MKELDLNNIPKHIAIIMDGNGRWAKQQGKMRIFGHKSGVRAVRRSVSYAC
QIGVQALTLYAFSSENWNRPEQEVNALMTLFMQALDLEVKKLHKNNIKLK
VLGDISGFSPKLQEKIARAETLTANNDSLTLNIAANYGGCWDIVQATRQI
AQQVKDGSLTISEITEELFQRNLVTKEQPPVDLLIRTSGEQRISNFLLWQ
IAYAELYFSHVLWPDFNEQEFNRAIYVYQQRERRFGTS
>MS0575 uraA, UraA protein
MNNNLLYSVEDKPPFGLSLLLAAQHLLAALGGIIAVPLVIGNVLKLPTED
TITLVNAALLISGVVTIIQCRGIGPIGIRLPSVMGTSFTFVAAALAIGFS
EYGVAGIMGASLVGSLVMIIGSFFMPYIRKLFPPVVTGTVVMMIGLSLIP
VAVDWFAGGQVGDENYATPENLLMATFVLVIVVTLVQWGKGIFSAAAIVI
GMMTGYVVALCLGWVSFDGVNNAQTFAVPQPLHFGLAFPISGIIGMSIAY
LVTIVESSGNFLALGNATQTEITGKHLRGGVLCDGLGSALAAIMSTTPFS
SFSQNIGVISLTGVASRHVVALTGVLLALAGLFPVFGALIVSIPLPVLGG
AGLMMFAMIIAAGIQMLDNIPRSKRNGLIIAISIGCGLAVTTRPELLDKL
PHFFKEVLGSGITVGSLLALILNLILPEDKIPENH
>MS1879 uraA, UraA protein
MTNQTNAPIEVQSKAKQAFVGLQMLFVAFGALVLVPLITGLNANTALLTA
GIGTLLFQLCTGKQVPIFLASSFAFIAPMQYGIQTWGIAVTMGGLAFAGL
VYVALSALVKMRGAGALQRIFPPVVVGPVIIIIGMGLAPTAVDMALGKNS
AYSYNDAVLVSMVTLLTTLCVAVFSKGMMKLIPIMFGIAVGYILCLFLGL
IDFQPVLNAPWFSLPEITTPEFKLEAILYLLPIAIAPAVEHVGGIMAISS
VTGKDFIRKPGLHRTLLGDGVATTAASLLGGPPNTTYAEVTGAVMLTRNF
NPNIMTWAAVWAIGISFCGKVGAFLSTIPTIVMGGIMMLVFGSIAVVGMS
TLIRDKVDVTEARNLCIISVVMTFGIGGMFVNVGELSLKGISLCAVVAIV
LNLLLPKAKNQME
>MS1658 ushA, UshA protein
MNRPFRFLKLTAALSLFSAAAMSYQADKTYQFTLLHFNDLHGHYWHDKNG
QYGLAAQKTAVDRIRNEVEAKGGSVITLFAGDLNTGVPESDLQNAHPDID
GLNAIGYDAMVLGNHEFDNPLQLLDMQEKWAKFPFLAANIYHKNTDKTLV
KPYTMLKRSGLNIAIVGLTTEDTAKLGNPEYMKDLRFDNPISTAKKVVAE
IDKQENPDVKIALTHMGYYYDGNYGSNAPGDVTMARRLEKGTFDVIVGGH
SHDTVCVDAKGVFIRDYQPTQACKPDYQNGTWIMSAGEWGKFLGRADFEF
KNGEVKLVRYELIPINLKKKVETAAGKTEYQLYGEQIPQDEKLLATLKTY
QDKGDQLLSVKIGDVAGKLVGDRNIVRFHQTNLGRLVAEAQRRAAGADVG
IMNSGGIRDSIQSGVITYRDILKVQPFGNIVSYFELSGAELIDYLNIVAL
KEVDSGAYPQFSGISMIIDRTAKQVKEVKIQGEPINLSKTYRISLPNYNA
LGGDGYPVMDKNPTYVNTYKVDAEVLKAFIAENSPIDANKFEPKGEITYR
>MS0064 ushA, UshA protein
MERRRFIQLGASAMLVLGTSRYVWALGDNKAQLRIIATTDVHSFLTDFDY
YKDAPTEKFGFTRAASLIEQARKEVSNSILVDNGDLIQGNPIADYQAAVG
AKQGKPHPAIQVYNAMKYDMGTLGNHEFNYGLDYLNEVIKQADYPIINAN
VVKIGTNEPMFRPYVIQEKDILDQAGNKQKIKIAYIGFTPPQVTVWDKAN
LAGKAESRDIIKTAQKYIPMLKGKGADIVIALAHTGPSDEPYHEGMENAA
FHLADVKGIDAVIFGHSHRLFPNKEFEKSANTDIAKGTVKNVPESMAGYW
ANNISVIDLALVEKNGKWMVVDGSAALRPIYDVTAKKATVENHEKITALL
QPVHEATRKFVAQPIGQANDNMYSYLALVQDDPTIQIVNQAQKAYTENVV
KNLPELAGLPVLSAGAPFKAGGRKNDPTGFTEVDKGRLTFRNASDLYLYP
NTLVVLKVNGAELKEWLECSAGMFKQIDINSDKPQFLLDWEGFRTYNYDV
IDGVSYQFDITQPARYDGECKLINKNANRVVNLTFNGKPVDPKAEFLIAT
NNYRAYGNKFPGTGDAHIVFASPDENRQILANYISAESKSKGEVTPTADK
NWRIAPIHSKVKLDIRFETSPTEKAAAFIKQNAQYPMQLVGKDEIGFAVY
QIDLSK
>MS0349 uspA, UspA protein
MYKHILVAVDLSDESSVILKKAADIAKRHEAKLSIIHVDVNFSDLYTGLI
DVNMSSMQDRISTETQQALLELSEQAGYPITEKLSGSGDLGQVLSDAIDQ
YDVDLLVTGHHQDFWSKLMSSTRQLMNNIKIDMLVVPLRDED
>MS0329 uspA, UspA protein
MYKNILIAIDLSNLDSAKYVVDTCLKLTEDNPQAIFRVVTIIEPMDDSFI
SAFLPKNFDKSVLEEANKALHEFTEKAFPKGAKVQHIVSYGTIYEEINHL
ADEKNVDLIVMLASSQPNAKGLSANTVKVARNTDKPVLILR
>MS1076 uspA, UspA protein
MEKMEVVMKFNNILVILNPENDKQYALARAIRLVKEQKSDKPVKVTLFLP
VYDLSYEMSALLSSEEREEMHKGVIEQRYQQDVLPYIEKYQDATMIEFSS
KVVWNSNEAEALVAELDENTYDLVVKYTKEEESLTSILFTPIDWQLLRKC
PAPILMVRDGDWKHQRRILVAVNVSGDADYHEAFNQQLVELSMDLADNLE
RGNVHLVGAYPPTPINMAIDLPEFHTSEYTSGVRGQHLINMKALRQRFGI
DEDHTHVLEGFPEEVIPEVADKIGAELVVLGTVGRTGLSAALLGNTAEHV
ISKLKCNLLAIKPNKIED
>MS1240 uup, Uup protein
MGFYMSSQFVFTMHRVGKVVPPKRHILKDISLSFFPGAKIGVLGLNGAGK
STLLRIMAGVDKEFEGEARPQPGIKIGYLPQEPKLDPQQTVREAIEEAVS
EVKSALTRLDEVYALYADPDADFDKLAAEQAKLEAVIQAHDGHNLDNQLE
RAADALRLPEWEAKIENLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDA
ESVAWLERFLHDYEGTVVAITHDRYFLDNVAGWILELDRGEGIPWEGNYS
SWLEQKEKRLAQEQAQESARQKSIEKELEWVRQNPKGRQAKSKARMARFE
ELNSGEYQKRNETNELFIPPGPRLGDKVLEVEHLTKSYGERTLIDDLSFS
IPKGAIVGIIGPNGAGKSTLFRMLSGKEQPDSGSITLGETVVLASVDQFR
DAMDDKKTVWEEVSNGQDILTIGNFEIPSRAYVGRFNFKGVDQQKRVGEL
SGGERGRLHLAKLLQRGGNVLLLDEPTNDLDVETLRALENAILEFPGCAM
VISHDRWFLDRIATHILDYGDEGKVTFYEGNFSDYEEWKKKTFGAESTQP
HRMKYKRIAK
>MS0840 uup, Uup protein
MALISLTNGYLSFSDAPLLDHADLHIEPRERVCLVGRNGAGKSTLLKIIA
GDVVMDDGKIQYERDLIVSRLEQDPPSHAQGNVFDYVAEGIGHLADLLKE
YHHISTLLESDYNDNLLSKLAQVQSRLEHENGWQFENKINEVLGKLELNP
NTLLSELSGGWLRKAALARALVCNPDVLLLDEPTNHLDVDAIEWLETFLL
DFAGSIVFISHDRSFIRKMATRIVDLDRGKLVSYPGDYDLYLTTKEENLR
VEALQNELFDKRLAQEEVWIRQGIKARRTRNEGRVRALKMLREERRQRRE
VLGSAKLQLDTSSRSGKIVFEVEDASYAIAGKQLLSHFSTTILRGDKIAL
VGPNGCGKTTFIKLLLGELQPTSGHIRCGTKLDIAYFDQYRADLDPEKTV
MDNVADGKQDIEVNGVKRHVLGYLQDFLFPPKRAMTPVKALSGGERNRLL
LAKLLLKPNNLLILDEPTNDLDIETLELLEDILADYQGTLLIVSHDRQFI
DNVATECYMFEGNGQLSKYVGGFFDAKQQQENALTSKMASEQAKPKKMQP
ESAVEKSEISTANNNQKTIKLSYKEQRELERLPQLLEELEKMIENLQNEV
GNPDFFQQSHEYTSAKLQELADKEAELENAFIRWEELEEKKKGNLS
>MS0137 uup, Uup protein
MIFFSNLTLKRGLNLLLEEANATINPKQKVGLVGKNGCGKSSLFSLLKKE
NQPEGGEINYPADWAVSWVNQETPALNISALDYVIEGDRTYCRLQKELKL
ANEHNDGNAIARIHGQLDIIDAWTVQSRASALLHGLGFSQEELGRPVKSF
SGGWRMRLNLAQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLV
LISHDRDFLDPIVNKIIHIEDKKLNEYTGDYSSFELQRAEKLAQQNALFR
QQQDKIAHLQKYIDRFKAKATKAKQAQSRMKALERMERIAPAHVDNPFTF
EFREPLSLPNPLVMIDKASAGYGEGESAVEILQKIKLNLVPGSRIGLLGK
NGAGKSTLIKLLAGELTARSGVLQLAKGVQLGYFAQHQLDTLRADESALW
HLQKLAPQQTEQELRNYLGGFAFHGDKVKDPVKQFSGGEKARLVLALIVW
QRPNLLLLDEPTNHLDLDMRQALTEALVDYQGSLVVVSHDRHLLRNTVEE
FYLVHDKQVEEFNGDLEDYAKWLNDLNVQEKSAVKNTEVSKESNNENSGQ
NRKEQKRREAELRQQTAPIRKQIAKFETEMDKLTAQLTEIEVRLADSGLY
QTENKEKLTALLTQQVQTRKALEEAEAHWLTAQEELETLLAE
>MS0586 uvrA, UvrA protein
MDVIDIRGARTHNLKNINLIIPRDKLIVITGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLFARVGEPRCPTHNLALTAQTISQMVDKVLTLPEGRKMML
LAPVVKARKGEHVKILEHIAAQGYIRARIDGEICDLSDPPKLELQKKHTI
EVVVDRFKVRADLATRLAESFETALELSGGTAVVADMEDAKAEELVFSAN
FACPHCGYSVPELEPRLFSFNNPAGACPTCDGLGVQQYFDEKRVVQNPAV
SLAGGAVKGWDRRNFYYYQMLTSLAEHYHFDIEAPYEELQKNIQQVIMNG
SGKEEIEFKYMNDRGDVVVRRHPFEGILNNMARRYKETESMSVREELAKN
ISNRPCSDCGGSRLRPEARHVYIGQTNLPDISEMSIGEAYSFFEKLALAG
QKAQIAEKILKEIKERLSFLVNVGLNYLSLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLNTLIHLRNLGNTVIVVEHDEDA
IRLADHIIDIGPGAGVHGGNVIAEGTAEQIMQNPNSITGKFLSGEEEIEI
PQKRTAVDKKKFLHLNGAAGNNLKNVNLALPVGLFTCITGVSGSGKSTLI
NDTLFPIAQNVLNRADNIEYAPYKSIEGLEFFDKVINIDQSPIGRTPRSN
PATYTGLFTPIRELFAGVPESRARGYNPGRFSFNVRGGRCEACQGDGVLK
VEMHFLPDVYVPCDQCKGKRYNRETLEIRYKGKTIHQVLDMTVEEAREFF
DVVPMIARKLQTLIDVGLSYIRLGQSSTTLSGGEAQRVKLATELSKRDTG
KTLYILDEPTTGLHFADIKQLLEVLHRLRNQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEIIATGTPEEVAQNPLSHTGRFLKPILAKK
>MS1371 uvrB, UvrB protein
MSHKINSKPFILHSEFKPSGDQPQAIEILAENLNDGLAHQTLLGVTGSGK
TFTIANVIAKLNRPAMLLAPNKTLAAQLYAEMKAFFPENAVEYFVSYYDY
YQPEAYVPSSDTFIEKDASINDQIEQMRLSATKSFLERRDTIVVASVSAI
YGLGDPDSYLKMMLHLQTGAIIDQRQILVRLAELQYTRNDQAFQRGTFRV
RGEIIDIFPAESDDRAVRIELFDDEIERLSLFDPLTGTGFGAVPRFTVYP
KTHYVTPREQILDAIEKIKSELADRREYFIKENKLLEEQRITQRTQFDIE
MMNELGYCSGIENYSRYLSGRNEGEPPPTLFDYMPSDALLVIDESHVTVP
QIGGMYRGDRSRKETLVEYGFRLPSALDNRPLRFEEFERLAPQTIYVSAT
PGPYELEKSGTEIIDQVVRPTGLLDPEIEIRPVSIQVDDLLSEARQRADR
NERVLVTTLTKRMAEDLTDYLDEHGIRVRYLHSDIDTVERVEIIRDLRLG
EFDVLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARN
LKGKAILYADRITNSMEKAITETNRRREKQMKYNEEHGITPQGLNKKVGE
LLDIGQGGSNKSRNKPRSQKAAEPATTYAIPMTAKEYQQQIKKLEQQMYK
FAQDLEFEKAAAIRDQLHKLREQFVENG
>MS0937 uvrC, UvrC protein
MFDSKKFLANVTHDPGVYRMFDDKDTVIYVGKAKDLKKRLSSYFRANLSS
KKTEALVASICRIETTITTSETEALLLEHNYIKTFQPRYNVLLRDDKSYP
YILLTKERHPRITSHRGSKKVTGEYFGPYPHAGAVRETLSLLQKLFPIRQ
CENSVYANRSRPCLQYQIGRCLAPCVSGYVSDEEYNQQVGYARLFLQGKD
QQVLDHLIGKMERASRALNFEEAARYRDQIQAVRSVIEKQFVSNERLDDM
DIIAIAYKLGIACVHVLFIRQGKILGNRSYFPKVPENTSLSELTETFVGQ
FYLQAHQGRTIPNSIIVDRKLEEKAELESLLTDQAGRKVSIQDNIKGNKS
KYLHLAQMNAQAALALQLKQSSLIHERYKELQQLLGIEKIHRMECFDISH
TMGQQTIASCVVFNEEGPLKSDYRRFNIEGITGGDDYAAMEQALKKRYDK
DLELEKIPDIIFIDGGKGQLNRALKVFHELQVKWDKNRPHLIGVAKGVDR
KVGLETLIISKQEREINLPADSLALHLIQHIRDESHNHAISGHRKKRQKA
FTQSGLETIEGVGAKRRQALLKYLGGMQGVKNATQDEIASVPGISVALAE
KIFEALHH
>MS0413 uvrD, UvrD protein
MKLNPQQQQAVEYTSGPCLVLAGAGSGKTRVIINKIAYLIEKCGYLPKQI
AAVTFTNKAAREMKERVAHSIGKELSKGLIVSTFHTLGFDIIKREYKHLG
FKANMTLFDEHDQMALLKELTEDYLQQDKDLLRELISVISNWKNDLIMPA
QAAKIARDEKQQTFAKCYERYANQIRAYNALDFDDLIMLPTLLFKTNEQV
RSKWQEKIRYLLVDEYQDTNTSQYELIKLLVGSRAKFTVVGDDDQSIYSW
RGARPQNMVRLRDDFPNLQVIKLEQNYRSTQRILHCANILIDNNQHVFDK
KLFSTIGEGEKLQIIEAKNEEHEAERVVGELIGHRFTNKTKYKDYAILYR
GNHQSRLLEKVLMQNRIPYKISGGTSFFSRLEIKDMMAYLRLLVNQDDDA
AFLRIVNTPKREIGAVTLEKLGSLANEKHISLFEAIFDFELIQRVTPKAY
NALQTFGRWIVELSDELVRSEPERAVRSMLAQIHYEEYLYEQAVSPKAAE
MQSKNVATLFDWVNDMLGGDEFNEPMTLNQVVTRLTLRDMLERGEEDDES
DQVQLMTLHASKGLEFPHVFLIGMEEGILPHQTSIDEDNVEEERRLAYVG
ITRAQRTLRFTLCKERRQFGELLKPEPSRFLLELPQDDLQWERDKPPMTE
EQKQEKAVANIANLRAMLKRN
>MS1368 uvrD, UvrD protein
MMDISELLDGLNDKQREAVAAPLGNYLVLAGAGSGKTRVLTHRIAWLIAV
EGISEGSIMAVTFTNKAAAEMRQRIESTLSQHSSRRLFGMWVGTFHSIAH
RLLRAHYLDANLPQDFQILDSEDQLRLLKRLLKLHNYDEKMFPAKQACWY
INNKKDDGLRPHQIDDNNDKQEREWINIYRIYQDTCDRAGLVDFAEILLR
AYELFLKKPVILQRYRQRFQQILVDEFQDTNKIQYAWIRLLAGETGNVMI
VGDDDQSIYGWRGAQVENIQRFLDDFHKAKTIRLEQNYRSTGNILQSANQ
LISNNSNRLGKDLWTEGDKGEPVGIYAAFNELDEALFVSSQIKIWWEDGG
ELNDCAVLYRSNSQSRVIEEALIRAQIPYRIYGGMRFFERQEIKDALAYL
RLIANRQDDAAFERVINTPTRGIGDRTLDVLRNLTREREITLWQATQLAI
GENKLAGRSATALLRFCELINSLAQETEEMPLFAQTDFVIKHSGLYEMYK
QEKGEKGEVRIENLEELVSATREFIKPDDAEDMSDLSAFLTHASLEAGEE
QASPHQSCVQMMTLHSAKGLEFPRVFMVGVEEGLFPSFMSLEEPGRLEEE
RRLAYVGITRAKQKLTICYAESRRLYGKEERHIPSRFINELPQECIQAVR
LRGTVTRAYNQSAVGSVKISPLNDSGWKTGQKVKHGKFGTGTVINVEGSD
NNTRLQIAFQGQGIKWLIAHLANLEKL
>MS0530 uxaA, UxaA protein
MKQFIKIHRQDNVAVALQDLASCTLLDVDGQQIELKENIGRGHKFALTAV
AKNQDVVKYGYPIGHALRHIEPGEHIHTHNVKTNLKDINDYKYLPESTAL
SEQMPDREVQIYRRKNGDIGIRNELWIVPTVGCVVGIANLIKKRFMQLND
LNDIDGVFTFNHSYGCSQLGDDHENTKTMLQNMVKHPNAGAVLVIGLGCE
NNQVGAFKESLGEYDENRVKFMISQHYDDEVEQGVELLQQLYREMRQDRR
ETGKLSEVKFGLECGGSDGFSGITANPMLGHFSDYLISHGGTTVLTEVPE
MFGAERILMSHCKDEETFQKTVDMVNDFKKYFIEHNQPIYENPSPGNKAG
GITTLEDKSLGCTQKAGHSQVVDVLKYGERLTIQGLNLLSAPGNDAVATS
ALAGAGCHMVLFSTGRGTPYGGFVPTMKIATNSELALKKKHWIDFDAGRL
VYDMSMQELLKDFINLVVEIVNGKPTKNEINEFRELAIFKSGVTL
>MS0695 uxaA, UxaA protein
MGENYMNTQQKALYIKVNPTDNVAIVVNSNGLPAGSQFEDGVTLIEHIPQ
GHKVALVDIPKDSEIIRYGEIIGYAVKDIKQGSWIDESLVTLPKAPPLET
LPLATRKAPKLEPLEGYTFEGYRNKDGSVGTKNMLGITTSVHCVAGVVDY
VVNIIEKELLPQYPNVDGVVGLNHLYGCGVAINAPAAIVPIRTIHNIALN
PNFGGEIMVIGLGCEKLQPQRLLEGTVDTYPIELKDATIMSLQDERHVGF
EAMIKEILETAKQHLEKLNRRKRETCPVSDLVVGGQCGGSDAFSGVTANP
AVGFAADLLVRAGATFMFSEVTEVRDAIHLLTPRAETVEVGKRLLEEMKW
YDDYLDMGQTDRSANPSPGNKKGGLANVVEKALGSIAKSGSSNIVEVLSP
GQRPTKKGLIYAATPASDFVCGTQQLASGITVQLFTTGRGTPYGLKAVPV
IKLATRTDLANRWFDLIDIDTGTIATGKETIEQVGWRIFHEILDVASGRK
QTWSDKWGLYNQLSVFNPAPVT
>MS0544 uxaC, UxaC protein
MKNTPALLMVSYCRLTAVFPLTAAFKEDLPMKQFMDEDFLLSNDVARTLY
YDYAKDQPIFDYHCHLPPKEIAENRQFKDLTEIWLAGDHYKWRAMRSAGV
DENLITGNASNYEKYQAWAKTVPLCIGNPIYHWTHLELRRPFGITNTLFN
PQSADKIWQECNELLQQPEFSARGIMRQMNVKFSGTTDDPIDSLEYHKAI
AEDRDFDIEVAPSWRPDKAVKIELPQFNDYIKQLEQVSDTEINGFDSLKK
ALSKRLDHFDKRGCKSADQGMEIVRFAPVPDEKELDRILQLRRNEQPLTE
LQISQFSTALLVWLGAEYCKRNWVMQMHIGALRNNNTRMFKLLGADSGFD
SIADRTFAEQLSRLLDAMDQNNQLPKTILYCLNPRDNEMIATMIGNFQTG
GIAGKIQFGSGWWFNDQKDGMERQLQQLSQLGLLSQFVGMLTDSRSFLSY
TRHEYFRRILCEMIGRWVVNGEAPNDMNLLGNMVKNICFDNAKAYFK
>MS0537 uxuA, UxuA protein
MEQTWRWYGPNDPVSLADIRQAGATGIVNALHHIPNGQVWSVEEIEKRKA
IIEAAGLTWSVVESVPVHEEIKTQTGNYKTWIENYKQTLRNLAQCGIDTV
CYNFMPVLDWTRTDLAYELPDGSKALRFDQIAFAAFELHILKRPGAEQTY
TAEEQKQAKAYFDKMSDADIKQLTSNIIAGLPGAEEGYTLEEFQGQLDRY
KDISPEKFRTHLAYFLNEIIPVAQEVGIKMAVHPDDPPRPILGLPRIVST
IEDMQWYVDTCDLPANGFTMCTGSYGVRADNDLVKMTEKFGDRIYFAHLR
STCREDNPLTFHEAAHLQGDVDMFNVVKALLTEEYRRKANGETRLIPMRP
DHGHQMLDDLKKKTNPGYSAIGRLKGLAEFRGLEMALKKVFFEK
>MS1468 vacB, VacB protein
MFQNNPLLSQLKQQLHDSKPHVEGVVKGTDKAYGFLETEKETFFIAPPAM
KKVMHGDKIKAAIETIGDKKQAEPEELIEPMLTRFIAKVRFNKDKKLQVL
VDHPNINQPIGAAQAKTVKQELKEGDWVVATLKTHPLRDDRFFYAQIAEF
ICSAEDEFAPWWVTLARHEQSRYPVQGQEVYSMLDTETRRDLTALHFVTI
DSENTQDMDDALYIEPVTAPNDEQTGWKLAVAIADPTAYIALDSQIEKDA
RKRCFTNYLPGFNIPMLPRELSDELCSLMENETRAALVCRLETDMQGEIV
GEPEFILAQVQSKAKLAYNNVSDYLEQVENAWQPENESTQQQINWLHQFA
LVRINWRKKHGLLFKEKPDYSFVLADNGHVREIKAEYRRIANQIVEESMI
IANICCAHYLAKNAQTGIFNTHVGFDKKFLPNAHNFLMANLSNEENQQEL
AERYSVENLATLAGYCRMRHDIEPIEGDYLEFRLRRFLTFAEFKSELAPH
FGLGLTGYATWTSPIRKYSDMVNHRLIKACLANRECVKPSDETLARLQEA
RKQNRMVERDIADWLYCRYLADKVESNPEFRAEVQDCMRGGLRVQLLENG
ASVFVPASSIHPNKDEIQVNTDELALYINGERRYKIGDIVNIRLTEVKEE
TRSLIGNLV
>MS0473 vacB, VacB protein
MARKTTKKTTALLDPNYQQELEKYGNPVPSRDFILQVIREHNTPMSREEI
LKVFAIQDDERVEGVRRRLRAMENDGQLVFTKRNCYVLPEKLDLLRGTVI
GHRDGYGFLQVEGVKEDLFIPNTQMKRVMHGDYVLAQREGLDRKGRREVR
IVRVLEGRKKQIVGRFFLEEGIGYVVPDDSRINRDILIPNENRLGARMGQ
VVVVELKPRTASFSQPVGIITEILGDNMAKGMEVEIALRNHDIPHTFPPE
VEKQIKKFTEEVPEEAKSGRVDLRSLPLVTIDGEDARDFDDAVHCRREQD
GWHLWVAIADVSYYVRLRSALDTEARNRGNSVYFPNRVVPMLPEILSNGL
CSLNPQVDRLCMVCEIKLSDKGVMKDYQFYEAVMNSHARLTYTKVARILE
GDEELIERYQELVPHLQELHNMYNKLLEARHQRGAIDFETIESKFIFNEM
GRIESIEQVVRNDAHKIIEECMIMANIAAANFMERHQEPALYRIHAGPSE
EKLISFRSFLAECGLSLEGGMKPSTKDYAKLLEQVKERPDAELIQTMLLR
SLSQAVYNADNIGHFGLALEEYAHFTSPIRRYPDLTLHRGIKYLLAKAQG
VKRKTTDTGGYHYSLDEMDVLGDHCSMTERRADDATRDVADWLKCEYMQD
HVGDEFEGIISSVTGFGFFVRLKDLFIDGLVHISTLDNDYYRFDAAGQRL
IGENSGAVYRIGDIVKVRVEAVSLEQRQIDFALVSSERKPRREGKTAKDN
AKKTMRYAESFAKQRKKAAATSKGKKKSAVKKSKNSVNKKANKKRTY
>MS2239 vacJ, VacJ protein
MKKITLIATALFAGSILTGCATIDPATGERQDPLEGFNRTMWSFNYDVLD
PYVLKPAAKGWQALPSPLTTGLSNVAKNLEEPVSFVNRLLEGEVKKAFVH
FDRFFINSTFGLGGLIDWASYSDPLKIENDRTFGDTLGSYGVEPGAYVML
PAYGASSPRELTGTAVDTAYTYPFWHWVGGAWSLVPTVVKAVDKRAKAMD
KEELLNQAQDPYITFREAYYQNLEYRATDGNVKAKDSGLSQDDLNSID
>MS1556 valS, ValS protein
MTQKLQMADRFDASAVEQALYNHWEQKGYFKPSYDAGRPSYSIAIPPPNV
TGSLHMGHAFQQTLMDTLIRYHRMQGDNTLWQAGTDHAGIATQMVVERKI
AAEENKTRHDYGREAFIEKIWDWKAYSGGTISQQMRRLGNSIDWERERFT
MDEGLSEAVKEVFVRLHEEGLIYRGKRLVNWDPKLHTAISDLEVENKESK
GSLWHFRYPLAKGAKTAEGLDYLVVATTRPETVLGDTAVAVHPEDERYQS
LIGKTVVLPLANREIPIVADEYVDREFGTGVVKITPAHDFNDYEVGKRHN
LPMVNVMTFNADIREEAEIIGTDGQPLTTYEAEIPQDYRGLERFAARKKV
VADFDSLGLLEKIQPHDLKVPYGDRGGVPIEPMLTDQWYVSVKPLAETAI
KAVEEGEIQFVPKQYENLYYSWMRDIQDWCISRQLWWGHRIPAWYDEQGN
VYVGRSEEEVRSKNGLNSSVALRQDEDVLDTWFSSALWTFSTLGWPQQTK
ELAMFHPTNVLITGFDIIFFWVARMIMMTMHFIKDENGKPQVPFKTVYVT
GLIRDEQGQKMSKSKGNVIDPLDMIDGIDLESLLAKRTGNMMQPQLAEKI
AKATKKEFPEGIQPHGTDALRFTLSALASTGRDINWDMKRLEGYRNFCNK
LWNASRFVLTNDKLDLSTGERELSLADKWIQAEFNKTVQNFRNALDQYRF
DLAATELYEFTWNQFCDWYLELTKPVFANGTDAQIRAASFTLVNVLEKLL
RLAHPLIPFITEEIWQKVKDFAGVEGETIMTQPFPAFDEALVNDEAVAQI
SWIKEVITAVRNIRAESNIAPSKGLDLLLRNLPDTEQKTLENNRTLMQIM
AKLDSVKVLAQDEEAPLSVAKLVGSAELLVPMAGFINKDTELARLNKEIE
KLIGEVKRIEGKLGNEAFVAKAPEAVIAKEREKMQDYQEGLEKLRAQYLS
IENL
>MS0675 vanY, VanY protein
MGFGAENLTGKSRSHLLNLPCPLSNNHFLQPQALKAFQALQKSAVKNGFN
LQPASTFRDFARQQLIWNGKFNGERKVHDDQGNPLDLTALSCWEKAQAIL
RWSALPGASRHHWGTEIDFFDPDLLPQHQQLQLEPWEYEQDGYFFELSRF
LQQNLPQFDFVLPFMQTPKGKEIGREPWHISYLPLAEKLEKQFTPEILLN
AWENEDIAGRQTLIAHLPEIFERFIY
>MS1094 vapI, VapI protein
MLSPRKISEIATGKRPITADVAVRLALFFGTDAESWLNLQSHYDIKKSEE
EIKTDIESILDSSIDGYLNI
>MS0417 wbbJ, WbbJ protein
MATEKEKMLAGLAHLPMEEHLSALRLQTKELLFDFNMLRPSNKLEKTHLL
RKILGKAGKNIHVNSPFHCDYGCNIEVGDNFFANYHCVILDNGGVKIGND
VMFAPNVSLYTVGHPLDAELRNQGWEQAKPIIIGNNVWIGGNVVILPGVV
IGDNVVIGAGSVVTKDIPANSLALGNPCKVLRQITAADREYYQQTFMQNN
>MS1499 wbbJ, WbbJ protein
MTYYQHPSAIIDEGAEIGEGSRVWHFAHICGGAKIGKGVSLGQNVFVGNK
VRIGDHCKVQNNVSVYDNVYLEEGVFCGPSMVFTNVYNPRSLIERKSEYK
DTLVKKGATLGANSTIVCGVTVGAYAFVGAGAVINRDVPDYALMVGVPAK
QIGWMSEYGEQLELPLSGQAETKCPHTGAIYRLEGHELKKL
>MS2128 wbbJ, WbbJ protein
MVGRNAHPTSKGNMMDYTLNLPLNQLIAQNSELFSKIHQVVDKNAPLVAE
LNSGFRTQNEIRAILNEMTGTEIDASFHVNLPLYTDFSAHIRIGKRVFIN
TAVMLTDLGGITLEDDVLIGPRVNIITVDHPIDPAQRRGVIVKPVVIKKN
AWIGAGATILAGVTVGENAIVAAGAVVNKDVPANTIVGGIPAKLIKEI
>MS0664 wcaA, WcaA protein
MKVSLAVPVFNEEDTIPLFYQKVRNYEELQAYDVEIVFINDGSSDKTEEI
ITALSAQDPLVQAVQFSRNFGKEPALFAGLEYSTGDVVIPIDVDLQDPIE
VIPELIKEHQKGFDVVLAKRVDRQTDSWFKRKTALWFYKLHNQISKPKIE
ENVGDFRLMSRRVVEAIKQLPERQLFMKGILSWVGFDTAVVEYNRAERVA
GTTKFNGWKLWNFALEGITSFSTFPLRLWTYIGLFISACSFLYGSILILG
KLIWGNTVPGYPSLMVAILFLGGVQLIGVGVLGEYIGRIYSESKQRPRYI
VKTQKGNNNE
>MS0902 wcaA, WcaA protein
MKFSVLMSLYIKEKPEFLRASLQSLAEQTLPADEVVLVLDGEITPELEKV
LDEFKEKLPFTFVPLVQNMGLGKALNEGIKVARNEWLFRMDTDDICYPER
FAKQAEYIERHPDVVLFSTQIAEFDNDPAQIISVRRVPVGYEEIVRFNKM
RSPFNHMTVAYKKSVLQEVGGYQHHLFLEDYNLWNRIIATGYQVGNLPDI
LLYARTNGDAMIGRRRGLTYAKSEWKLYKLKRQLHIQGAVSGFLTFLMRT
LPRLMPVSLLKNLYKLMRK
>MS0903 wcaA, WcaA protein
MFSIIVPSYNRNTEVNALLASLENQTVKNFEVIVVDDCSQNFIKIDRTFS
FPVTLIRNETNSGAAQSRNVGANTAKNDWLLFLDDDDRFADNKCEVLAKT
IVENPQANFVYHPAKCNMVNEGFSYVTSPLPPAQLTLDNMLLANKIGGMP
MLGIKKDFFFELGGLSTELKSLEDYDFVLKLVSNPNLKAILVDQPLSICS
FHTKRASVSTNTANTEKAIEIIRANYVKTVRQTHNFSLNALYMLAYPNAM
NLSRKAATYYFEMFKKSHSLKHLIIAVVTFISPSLAINLKRFV
>MS0438 wcaA, WcaA protein
MPTISVAMIVKNEAQDLAKCLDTVKDWVDEIVILDSGSTDETREIALSYG
AKFYTNTDWQGFGKQRQLAQQYVTCDYVLWLDADERVTPELRHSIQSAVE
KNEDSTLYQIPRLSEVFGRKIRHSGWYPDYVLRLYKTHVAQYGDELVHEK
VHYPVNVKVEKLTGDLEHYTYKDVYHYLIKSAGYGKAWAEQKAAAGKSTS
LFNAVTHALGCFVKMYILRAGFLDGKQGLLLAILSANSTFNKYADLWVRT
KTK
>MS1480 wcaG, WcaG protein
MRSVSIVGLGWLGLSLARHLKNLGWDVKGSKRTHEGVEQMRLMRFETYFL
ELTPEINADPDDLTNLLSVDTLIINIPPSEYFFDPKLYVEGIENLVNEAL
LCNISHIVFISSTSVFPNVSANFDEESVPQPDSEIGRALLEVEQRLFELK
DIDVDIIRFAGLVGYDRHPVYSLVRKESAISGGNTPINLVHFDDCARAIQ
LLLEMPGYQRLYHLAAPKHPSKVEYYTKMATKLGLNPPQFLCDEKDPQRI
IKADKICRELDFVYQYPDPDEFI
>MS0146 wcaG, WcaG protein
MIIVTGGAGMIGANIVKALNDMGRKDILVVDNLKDGTKFINLVDLDIADY
CDKEDFISSVIAGDDLGDIDAVFHEGACSATTEWDGKYLMHNNYEYSKEL
LHYCLDREIPFFYASSAATYGDKTDFIEEREFEGPLNAYGYSKFLFDQYV
RAILPEANSPVCGFKYFNVYGPREQHKGSMASVAFHLNNQILKGENPKLF
AGSEHFLRDFVYVGDVAEVNLWAWENGVSGIFNLGTGNAESFKAVAEAVV
KFHGKGEIETIPFPDHLKSRYQEYTQANLTKLRAAGCDFKFKNVAEGVAE
YMAWLNRK
>MS1492 wcaJ, WcaJ protein
MIKRLFDIVVALIALILFSPLYLFVAYKVKQNLGSPVLFKQTRPGLHGKP
FEMIKFRTMKDGVDENGNILPDAERLTPFGKMLRATSLDELPELWNVLKG
DMSLVGPRPLLMEYLPLYNERQAKRHEVKPGITGYAQVNGRNAISWEQKF
ELDAWYVEHQSLWLDLKIIAKTIQKVIAKDDINAADDATMPKFEGNKKS
>MS0662 wcaJ, WcaJ protein
MCECTLQMICKGTISMKKILFCKFTLALTDFLSLSLSIVLAFYSLELWTG
ELSRYVPTEQIEERFYIHIILSFIGMGWYWIRLRHYTYRKPFWFELKEVL
RTLFILGIIELAIVAFSKLYFSRYLWVLTWLIALIIVPTCRVVMKKILIK
SGWYLRDAIIIGSGQNAVDAYNALMSESYLGFKIKYFITSHENKAIETLD
VPSINENAQELWKAATNKSDQFIIALEEDEADSRDAWLRYFSSHKYRSVS
VIPTLRGLPLYSTDMSFIFSHEVMLLRVHNNLAKRSSRFLKRTMDILGSL
TIIILLSPILLYLYFSVKKDGGNAIYGHPRIGRNGKTFKCLKFRSMVVNS
KEVLEELLERDPEARAEWEKDFKLKNDPRITKIGAFIRKTSLDELPQLFN
VLKGEMSLVGPRPIVKEELERYQDDVDYYLMAKPGMTGLWQVSGRNDVDY
DTRVYFDAWYVKNWSLWNDIAILFKTVNVVLNRDGAY
>MS1501 wecC, WecC protein
MGYIFMNLNSYKVGVIGLGYVGLPLAVEFGKHRFTVGFDISPNRVKELSE
GKDRTLEVSSEALKSVTHLSFTSDLEQLKQCNFFIVTVPTPIDDVNRPDL
TPLQKASESIGKVLKQGDIVVYESTVYPGATEEVCIPVLEKVSGLKFNQD
FFAGYSPERINPGDKVNTLTKIKKITSGSTPEVADIVDQMYASIIEAGTH
KASSIKVAEAAKVIENTQRDLNIALVNELSIIFERVGIDTLDVLEAAGSK
WNFLPFRPGLVGGHCIGVDPYYLTHKAEEVGYNPQVILAGRRINDNMSQY
VAQETIKLMLNNNIDVAHAKVGILGVTFKENCPDIRNSKVVDVVTELKNW
GVEVVVADPWADAQEVKHEYGLDLSSVDENNPVDALIVAVGHKEFRDLSA
ETLRSYLRTSKPVLADVKSLFDRDALAQQGITVFRL
>MS0323 wecD, WecD protein
MKIFKAEQWNLEVLLPLFEEYRLSHGMVENPERTFTFLNNRIRFSESIIF
IATNERQQAIGFIQLYPRLSSLQLQRYWQLTDIFVQDVANQNEIYAGLIE
KAKEFVCFTHSTRLVVEQDQQHQGIWEKEGFKLNTKKALFELKL
>MS2102 wecD, WecD protein
MMQTIDQFIAQYIPAAYALNLRVVESSPQRVVIKAPFECNSNHHHTIFGG
SQALLATLSAWSLVYLNFPEANGNIVIRSSQIRYLKPAPSDIIAVSICPD
SLAMNLAKQMLTQKGKAKITIQCQLYCDDIIVSEWTGEFVLSHTPF
>MS1489 wecE, WecE protein
MVGITDVFRFLTLYIKRFIMLNTAFEPWPSFTQEEADAVSRVILSNRVNY
WTGTECREFEKEFAAYVGTKYAVSLTNGTVALDLALKALNIGAGDDVIVT
SRTFLASASSIVTAGANPVFADVELDSQVISRRTIEAVLTPNTKAIICVH
LAGWMCDMDPIMDLAKEKGIYVIEDCAQAHGAMYKGKSAGSIGHIAAWSF
CQDKIMTTGGEGGMVTTNDKGLWNKMWSYKDHGKDFDTVYNKQHPPGFRW
LHNSFGTNWRMMEVQAVIGRIQLTRMADWTAKRINNMERILNAFDGSPYF
SVYRPNQDYVHAAYKCYVQVNPQALPAGWSRDRIMQVINEQGVPCYSGSC
SEVYLEKAFDNTPWRPAKPLQNAKSLGETSLMFLVHPTLSEDSLVKTCAA
ITQVIHALSENK
>MS1498 wecE, WecE protein
MEFIDLKAQQQRIKAQIDAGIQKVLAHGKYILGPEVAELEEKLAAYVGAK
YCITCANGTDALQIAQMALGIGAGDEVITPGFTYIATAETVALLGAKPVY
VDVDPKTYNIDAEKLEAAITPRTKAIIPVSLYGQCADFDAVNAVAKKYNL
PVIEDAAQSFGASYKNRKSCNLTTISCTSFFPSKPLGCYGDGGAIFTNDD
ALANVIRQVARHGQDRRYHHIRVGVNSRLDTLQAAILLPKLAILDDEIAA
RQRVAENYTRLFNQAGVNTTPFIEAHNQSAWAQYTIQVDNRADVQEKLKT
LGIPTAVHYPIPLNKQPAVADSRIHLPVGDLIAERVMSLPMHPYLTPEEQ
QKIVQSLV
>MS1487 wza, Wza protein
MHRQVITLSVSLRMLFGPNSIRMINFYYRLNMNKLIKSLLLTGLGLSLSS
CSVFLPDSHKSPISQRPAQVVGKNVDLAKAIDAYLITPQLLKTLTPVNAS
AQSNLSLDNELKNYQYRVGVGDVLNVTVWDHPELTTPAGSYRSSAEAGNQ
VHANGTMFYPYAGNIKVAGLTVGQIRSRLTKALSNYIAEPQVEVNVASFQ
SQKAYVTGEVKSPGQQFITNVPLTLLDAINKAGGLADNANWHNVTLTRNG
RDEVISVEALIQRGDLTQNRLLKSGDIVHIPRNDTMKIFVVGEVVQSQLL
TIGRNGMTLTEALSASGGIDKLSSDATGIFVIRGQRGKQEFVQDNNGEKI
EKVANIYQLDVTNPTAYILGTEFYLQPHDVVYVTTAPVSRWNRVISQVVP
TISGFNDLTEGVLRIRTWP
>MS0746 xerC, XerC protein
MMKDSALIELFLNELWLGKGLSDNTVQSYRLDLTALSQWLQGQGKSLETL
DSSDLQAFLGERVDQGYKATSTARMLSAMRKLFQYLYQESYRTDDPSAIL
SSPKLPGRLPKYLTEQQVGDLLNAPSTDIPLELRDKAMLELLYATGLRVT
ELVTLSTDNINLEQGVVRVIGKGNKERIVPMGEEASYWVGQFILYGRPML
LNGQSSDVIFPSKRALQMTRQTFWHRIKHYAILADIDTDSLSPHVLRHAF
ATHLVNHGADLRVVQMLLGHSDLSTTQIYTHVAKERLKRLHEKYHPRG
>MS0523 xerC, XerC protein
MQTYLQKYWNYLRNERQVSSYTLTNYQRQMDAVMKILQENDIQNWRQVSP
SVVRFILAQSKKSGLHEKSLALRLSALRQFLAFLVLQGELKVNPAIGISA
PKQGKHLPKNINAEQLNKLLDNNSKEPIDLRDKAMLELMYSSGLRLSELQ
GLNLTSLNFRSREIRVLGKGNKERILPFGRHASHSVQEWLKVRLLFNPKD
DALFVSSLGNRMSNRSIQKRMEIWGVRQGLNSHLNPHKLRHSFATQMLEA
SSDLRAVQELLGHSNLSTTQIYTHLNFQHLAEVYDQAHPRAKRRK
>MS1850 xkdP, XkdP protein
MLKKEGKMGLFDFVGNIGKKIFNREDEASKAVTEHIAEDNPGVENVNVTV
ENGVAKLEGSAKSASALEKAILMAGNIAGITSVKADGVNILNGEVLAGDD
EFYVIQKGDTLWAIAEKHYGNGIKYKAIVEANKEVIKDENKIFPGQKIRL
PKSL
>MS0560 xseA, XseA protein
MMNENIYSVSQLNYSVRQLLEGQLGLVWLTGEISNFSQPVSGHWYLTLKD
ENAQVRCAMFRMKNMRVAFRPQNGMQVLVRANVSLYEPRGDYQLIIESMH
PAGEGFLQQQFEALKIKLAAEGLFAQNLKKNLPHFAKTVGIVTSPTGAAL
QDILNILQRRDPSLKIIIYPTAVQGKDAANEIVQMIELANLRNEADVLIV
GRGGGSLEDLWCFNEETVARAIFRSSIPVISAVGHETDVTIADFVADVRA
PTPSAAAELVSRNQQELFQQLQYKRQRLEMALDRLFNEKQQHLQRFLLRL
QNRHPSARLLAQRQQTGQLEHRLNSAIRRLLDKNHYKLTALCERLEKNPL
PYLVRQQNYHIVQLATNLDFALKRLIVSKQTSLSALCGKLDGLSPLKVLA
RGYSIAETEQGETISSVNQVETGDKIKTRLRDGVIVSKVI
>MS1061 xseB, XseB protein
MARKPKESSTVDFETTLNQLETIVTRLEAGDLPLEEALKEFENGIKLAKL
GQERLQQAEQRIQILLQKSDTAELTDYQPTDE
>MS1048 xthA, XthA protein
MYFINRNNMKIISFNINGLRARPHQLDKIVEQYQPDIIGLQEIKVADEMF
PHELVDHLGYHVYHHGQKGHYGVALLCKQAPKAVHKGFSTDTEDAQKRLI
MADFETAFGALTVVNGYFPQGESRDHETKFPAKEKFYADLLNYVKNEHNP
ESNIIIMGDMNISPTDLDIGIGEDSRKRWLRTGKCSFLPEEREWYQRLYE
CGLEDTFRKLNPWTNDKFSWFDYRSKGFAENRGLRIDHILANSKLAERCV
DTGIALDIRAMEKPSDHAPIWATFK
>MS2373 xylA, XylA protein
MTNYFDKIEKVKYEGADSTNPFAYKHYNANEVILGKTMAEHLRLAVCYWH
TFCWNGNDMFGVGSLDRSWQKMSDPLAAAKQKADIAFEFLTKLGVPYYCF
HDVDIAPEGNSYQEYVRNFNTIVDILEQKQAESGVKLLWGTANCFSNPRY
MSGAATNPNPEIFTRAAAQVFNAMNATKRLGGENYVLWGGREGYETLLNT
DLRREREQIGRFMQMVVEHKHKIGFSGTLLIEPKPQEPTKHQYDYDVATV
YGFLKQFGLEKEIKVNIEANHATLAGHTFQHEIATAAALDILGSIDANRG
DPQLGWDTDQFPNSVEENTLAIYEILKAGGLTTGGFNFDAKIRRQSINPY
DLFHGHIGAIDVLALSLKRAAKMVEDHTLQNIVDQRYAGWNGELGQQILA
GKSSLEALAQAAQNLDPNPVSGQQEYIENLVNGYIYR
>MS2329 xylB, XylB protein
MTILNIAAVDLGASSGRVMLASYSTENHKISLEEIHRFKNQFVSQNGHEC
WDLAYLENEIVNGLRKISNSGRTLHSIGIDTWGVDYVLLDQNGEVVGPTY
AYRDHRTDGVMQKVQAELGKEVIYRKTGIQFLTFNTLYQLKAMTDENPAW
LSQVKDFVMIPDYLNYRLTGVINREYTNATTTQLVNVNIDSWDTALLDYL
GLPASWFGRIRHPGHQVGLWENRVPVMSVASHDTASAVISAPLSDENAAY
LCSGTWSLMGLDTTTPCTDECAMNANITNEGGIDGHYRVLKNIMGLWLFN
RLCTERDVTDIPALVKQAEAELPFQSLINPNAECFLNPSSMVEAIQQYCR
EHNQVIPKTTAQLARCIFDSLAMLYRKVALELAGLQGKPISALHIVGGGS
QNAFLNQLCADLCGIDVFAGPVEASVLGNVGCQLMALDQIHNAAEFRQLV
VKNFPLKQFKKRPHFMPASDFEEKWCEFCALN
>MS1609 xylB, XylB protein
MNYYLGIDCGGTFIKAALFDENGNIRACERENVAVISEQSGYAERDMPEL
WQACAEVIRRTVKSSEIPPHLIKSVGISAQGKGAFLLDKDKRPLGRAILS
SDQRSLPIVKRWQAEGLEQQIYPISRQSLWTGHPVSILRWLKENDVPRYD
QIRHLLMSHDYLRFCLTGELYCEESNISESNLYNMATGQYEPQLAHLLGI
EEIMDKLPPIVKANQIAGFVTEQAAQACGLTAGTPVVGGVFDVTAMTLCL
ADNQPHKLNVVLGTWSIVTGISNEIDDKQALPFAHGRYVEAGKFLIQEAS
PTSAGNLEWFVKQWKLDYQQINQMVAALPPAQSAVIFLPFLYGTNAKLGM
TAGFYGMQAHHSQAHLLQAVYEGVLFSLMIHLERMCQRFPQVQTLRVTGG
PAKSAVWLQMLADLTGKTLEIPEVEEIGCLGAALMAAEGVNGDSLALKQH
QALRVIQPNPANFDAYQHKYRQYRKLTKLLEQMA
>MS0048 xylB, XylB protein
MNYYLGIDCGGTFVKAALFDETGNLQGIARENVPVISDKAGYAERDMPQL
WQVCAEVVRKTIAESKISPDLIKGVGISAQGKGAFLLDQNQQPLGRAILS
SDQRSLDIVKQWQKEGIPEKLYPLTRQTLWTGHPVSILRWVKEHEPERYA
RIGSVLMSHDYLRFCLTGELHCEETNISESNLYNMEKGEYDPVLADLLGL
KGIIEKLPPVIQSNRIAGYVTEQAAKVSGLAVGTPVVGGLFDVVSTALCA
GLEDETKLNTVFGTWCVVSGITDHIDPNQSLPFVYGRYAEENKFIVHEAS
PTSAGNLEWFVKQWNLDYQRINEEIASLPPAASSVLFVPFLYGSNAGLGM
QAGFYGIQSHHCQAHLLQAIYEGVLFSLMYHLERMLKRFPATKVLRVTGG
SAKSEIWMQMLADFTGMTLEIPEIEETGCLGAAVIAMQALNNNLTVTEIL
NKGIKVVRPNPDNFDLYQKKYQRYVTLTAALKAML
>MS2372 xylB, XylB protein
MYIGIDLGTSGVKAVLLDESQKIIATTQHPLPISRPHPLWSEQNPKDWWY
ATNLAMLALAQQQNLSAVKAIGLTGQMHGATLLDKQNNVLRPAILWNDGR
SSAECEELEKLVPRSRKITGNLMMPGFTAPKLKWVDKHESAIAEKISKVL
LPKDYLRLMMTGEYASDMSDASGTMWLDVAKRDWDKSLLNACGLDENNMP
KLFEGNQITGYLHADLAKNWKMNAVPVVAGGGDNAAGAIGIGLYQTGQAM
LSLGTSGVYFVVTDKFTANPQKAVHSFCHALPDRWHLMSVILSAASAVDW
VKKATGIADIQTLFQKAEKSAVNSEAIFLPYLSGERTPHNDAYAKGVFWG
LSHNDDQTTMAKAVIEGVSFALADGIDVLHETGVTADNIALIGGGAKSAY
WRQLLADISGRTMEYRTGGDVGPALGAAKLAQIALNPHEDIADFCQPLPL
EAVYHPNAERTAYYAEKRAKYAELYQRVKGL
>MS1411 xylB, XylB protein
MHTNVKKLIESGQACLGIELGSTRIKSVLIDTSGNILAKGGFEWENHFVD
GIWTYPLDEVWAGIAQSYRDLCQDVQAKYAVKLNRLAAMGVSAMMHGYLP
FDAQDNQLAEFRTWRNNTTATAADQLTELLQYNIPQRWSVAHLYQAVLNQ
EPHTKEVAYITTLAGYVHWQLTGQKVLGIGDASGMFPIDSTTKDYDAAMV
ETFDRLIADKNLNWKLSGLLPKVLVAGENAGVLTEKGAKLLDPSGNLQAG
CVLCPPEGDAGTGMMATNSIKVKTGNVSAGTSAFAMVVLEKPLSKVYRDL
DMVTTPAGDPVAMAHSQNCSSDLNAWFQLFGEVLRSFDVSFTQDQLYGKL
FDKALDAAPDAGGLLSYCFYSGEHGVDLTTGCPLFLHPANAEFSLANFVL
VQLYTSFGAMKLGMDTLTQKEHVKIEKIFAHGGLFKSKAVAQHVLAAALN
VPVAVLETASEGGAWGIALLAAYTRQAPLSLTEYLDLVVFADSADNVVEP
EPARVKGYASFIRRYQDGLSIEREAADFANKK
>MS0059 xylB, XylB protein
MNMQDAKNLIASGGASVGIEFGSTRIKAVLISTDGTILASGGFTWENHFI
DGIWTYPQSEIWQGLQQAYRDLANQVQEQYGITLTRAKAIGISGMMHGYI
PFDKQGNQLVAFRTWRNNITAKSSQKLTALFNYNIPQRWSISHLYQAILN
QEEHVGEIDYLTTLAGYVHWQLTGEKVLGVGEASGMFPIDPQTGSYYQIM
LNQFDGLIAAQSYPWKIANILPQVLTAGQPAGHLTEQGAKLLDPTGRLQA
GIPFCPPEGDAGTGMVATNSIKTNTGNISAGTSAFAMIVLEKELSKVYEQ
LDMVTTPTGKLVAMAHANNCSSDINAWIRLFGETLKAFGAEVETDKLYET
LFRKALEGDADCGGLLAYGFYSGEHSVGLAEGCPTFMHPANSRFTLANFI
RTHLYSAFGAMKLGVDILIQQEKVNIAQILGHGGIFKTPNVASKILASAI
NVPIAVMKTANEGGAWGIALLANYLDAHKNGQSLDDYLEQCIFSQAEVSV
SYPDSETSKGYEEFIEYYKRGISVVQAAVNTFNE
>MS1562 yajC, YajC protein
MDAQQGSPMSMLIIFAIFGLIFYFMIYRPQAKRNKEHRQLMSQLAKGTEV
LTSGGLVGKITKITADSDMVVIALNETNEVTIKRDFIVAVLPKGSIKSL
>MS0481 yidC, YidC protein
MDSRRSLLVLALLFISFLVYEQWQMDYNTPKPVATEQAQAVSSNAEMPAS
TSSTEGTVDNVAQGKIISIQNDVFTLKVDTLGGDVVESSLTNYAAELNSD
ARFILLQNKPNEVYVAQSGLIGKNGIDTKAGRAAYQVEAEQFTLADGQNE
LRVPLTLEKDGVIYRKVFVIKAGSYDIEVNYEIQNQTNEAIEVQPYGQLK
HTLVQSSGSMAMPTYTGGAYSSADTNYKKYSFDEMKDKNLSIDTKAGWVA
VLQHYFVSAWIPNQDADNQLYTSTANGLGFIGYRGPVVNVPAGGSETIKS
ALWTGPKLQNEMGAVANHLDLTVDYGWAWFIAKPLFWLLNVIQSIVSNWG
LAIIGVTIVVKGILYPLTKAQYTSMAKMRMLQPKLQEMRERFGDDRQRMS
QEMMKLYKEEKVNPLGGCLPLLIQMPIFIALYWTFMEAVELRHAPFFGWI
QDLSAQDPYYILPILMGASMFLLQKMSPTPVADPMQQKIMNFMPLIFMVF
FLWFPAGLVLYWLVSNIITIVQQQLIYRGLEKKGLHSRKK
>MS1768 zipA, ZipA protein
MDLNTILIILGILALVALVAHGLWSNRREKSQYFENANTFGRANRQDEPI
STPESYKQARNVAPAFTQPKQAVIHQEPPVQQPLNTEPEPITQETPVRAE
PQSVDQIKITLPNVEPAPAESAPIYEMRPSRRNTEPQYYQQPSEPYYQQP
VQQNLARQTIADIEATVDPNEGVNSSSEYLRTQLQEASQEGNQIFTQSPL
SRAPLQQPIEFDQPAQQEKESDNNEDEDVSFVMLYVAAAENRQFQGTVLV
QALEDLGFSLGEDNLYHRHLDLTVASPVLFSAANITQPGTFNPYTLHEFF
TDGVAIFMRLPSPGNDRTNLKIMIRSAKTLAQQLGGFVLTEQQELFTDAA
EEEYLAKIK
>MS0888 zntA, ZntA protein
MQQTVQIDDRAEQTTLMLEGMSCAACVRKVEKALLAVPEVAAAQVNLAEN
TALVYGNGDVELMLQAIENAGYHAEVVEDENSRREKQNVQAEHEINQRKW
QSIVALIVGFGLFFWGIFGGTTVATAENHWNWVMVGLIVLVTMLCTGWHF
FERAWKNLLKGNATMDTLVALGTGVAWLFSMFISLTPDFFPDGSRHLYFE
SGVMIIGLINVGKMLEAKAKQRSSKALERLLDLTPKTTRIIDELGEREIP
LKDVKTGMRIRLQTGDRVSVDGIVLQGSGWIDESMLTGEPIPVQKQEGDK
ISAGTLVTDGALQFRAEQVGNKTMLANIIRLVRQAQSSKPEIGQLADKIA
GIFVPVVICIAVFAALIWYLVGPEPQISYALVVLTTVLIIACPCALGLAT
PMSIIGGVGRAAELGILVRNADALQKASNVDTIVFDKTGTLTKGEPKVTA
LLTFNHFDESVALEYAASIEQGANHPIAKAVLSLAQEYNLNLEHEPEDFR
TLKGLGVSAKVDQQNILLGNYTLLQQHNIDASAANRFFQTESEKGATVIF
LAVNNVLAGVFAIRDPLREDSVEAIQRLHKQGYHLIMLTGDQEKTAQAIA
KEAGIDQVIAGILPEGKANVIRQLQEQGGKVVMVGDGINDAPALAQADVS
IAIGSGSDIAIETSELTLMRHSIHAVADALALAKGTLHNMKQNLFFAFVY
NVICIPVAAGVFYPLFGFLLNPMFGGAAMACSSITVATNANRLLKFQPKD
>MS0887 zntA, ZntA protein
MAIFLALSELSCGHCIKSVTKAIEAISGENSAEVTLNYAKINSDKDPQLF
IDAITAADFKAQLATPSFELELDGLNCGHCIKSVSKALSAVENMEVFDVD
LKKARIYGNAKPEDVVKAIVDAGFNARLARLV
>MS1457 znuA, ZnuA protein
MRFLELSTSYLLNNNFIFIKRGIFMANRLILKQVLLAAILSSAACAVNAK
VVASIKPLGFIASSIADGVTETDIVVPTGASPHDYSLKPTDVKKLKSADL
LLWIGEDVDSFLAKSIGALDNRKVITIAKLDSVAPLLGKATHHHKEHDHE
HHHADHEDEHQHDHDEHDKTGLSTNWHIWYSPEISKMVAAELAEKLTERF
PAQKELIAENLLDFNRTLNERSDKIKLQLAAVKDKGFYVFHDAYGYFNDA
YGLKQTGYFTINPLVSPGAKTLATIKEEIAEHKVSCLFAEPQFTPKVIES
LSKGTGVHVGRLDPMGDKIQLGKRSYADFLQFTADSYGECLAK
>MS0789 znuB, ZnuB protein
MFEIIFPAWLTGILLSFITAPLGAFVVWRKMAYFGDTLSHSALLGVALGI
CLDINPYLSILILTLILAVAMVWLESNTQFSVDTLLGIIAHSCLSLGVVT
VGLLQNVRVDLMGYLFGDLLAVSYEDLIYIGIGVLIVLISLIYFWKPLIS
TTVSPELAQVEGINIRRMRFILMILTALTIALSMKFVGALIITSLLIIPA
ATARRFARTPESMAIIAVGLSIVAVSLGLTLSAFYNTAAGPSVVICSSFI
FLISLLKKEKI
>MS0790 znuC, ZnuC protein
MFSTFFKVRDISSMQINSIKIPLIELQNIKVVFGAKTALQNINLSIYPNT
VITIVGPNGGGKSTLLKVLLKLLSPTDGKVIHHRDLRIGYVPQKIHLEQS
LPITVEKFLSLKKGISKAEIQDAIELLSIKHLIHSSMQKLSGGEMQRVLL
ARALLNKPNLLVLDEPMQGVDLSGQIELYQLIHQTREKLNCAILMVSHDL
HIVMADTNEVVCINRHICCAGSPEKVSNDPTFIHLFGDQFSQNVAFYTHH
HNHQHNMHGDVCCIGNKHSVQCINNGR
>MS0016 zwf, Zwf protein
MKAENNCIVIFGASGDLTHRKLIPALYNLYKIGRLSENFSVLGVARTELS
NEKFREKMRSALIEHEKADGDELNNFCSHLYYQAVNTSDAADYVKLIPRL
DELHDKYKTCGNTLYYLSTPPSLYGVIPECLAAHGLNTEEFGWKRIIVEK
PFGYDMKTAKELDVQIHRFFDEHQIYRIDHYLGKETVQNLLVLRFSNGLF
EPLWNRNYIDYIEITGAEELGVEQRGGYYDGSGAMRDMFQNHLLQVLAMV
AMEPPAIINADSMRDEVAKVLYCLHPLEPNDLQHNLVLGQYAATQLNGER
VKGYLEEKGVPPDSNTETYMALRCKIDNWRWAGVPFYVRTGKRLAARVTE
VVIHFKKTPHPVFSQEAPENKLILRIQPDEGISMRFGLKKPGAGFEAKEV
SMDFRYSDLNGASLLTAYERLLLDAMKGDATLFARTDAVHACWKFVQPIL
DYKANGGRVYEYEAGSWGPVEADKLIAQHGRVWRKPSGQMKKKV