Gene list
Applied filters:
Organism: Mannheimia succiniciproducens MBEL55E, MBEL55E
Gene type: CDS
Number of genes found: 2384
# Mannheimia succiniciproducens MBEL55E, MBEL55E >MS2121 unknown MVAKRRIIGFYLYLHKIYKKPTALCKINAVRRFFRYANVKISLHFLLKVA >MS1285 unknown MVKSKSALNSQFFDRTFIEIVYFMAGILLIFYV >MS0576 unknown MRLFRTLVKFEIKSAVGFERIFVIAQSVRFYIWGFGFVFGMYK >MS0539 unknown MKTLKQAKFLKQEGNRIDIQCERDYVLHLFVLEQDIIRVAFTRKNSFKLD RTWAISPNCEDVPFAGRERFSTEGFSLPTYQLNFEHDVIEIVTEKLKVRV HQPLTLEWQYNKDGNWLPLIQERKTGAYLFGINNNKISHFIERSLDENCY GLGEKAGDLNRKGRRFEMRNLDAMGYNAEKTDPLYKHVPFYITRKNDVSY GIYYDNLAQCWFDLGNELDNYHIAYKSYRAEDGDMDYYVILGPSTLEVTK KYTALTGGTIFGPRWGLGYSGSTMSYTDAEDAQEQLKKFVDLCKEHDIPC DSFQLSSGYTSINGKRYVFNWNYDKIPEPLKMSGYFRDAGMQLAANIKPC MLQDHPRYKEAQELGLFIKDSESELPERSVFWDDEGSHLDFTNPATVQWW KDNIKEQLLERGIGSTWNDNNEFEIWDDNAKCVGFGKETPIKLIRPLHPL LMMKASYEAQKEFAPHQRPYLISRSGCAGMNRYVQTWSGDNRTNWTTLRY NIRMGLGMSLSGLYNVGHDVGGFSGDKPEPELFVRWVQNGIMHPRFTIHS WNDDKTVNEPWMYPAVTNIIRDTIKLRYKLMPYIYNAFWQSHQDLEPMLR PTFLDHEHDMKTYEETDDYLFGKDLLVASVVEKGQRQREVYLPQNNAGWY DFHAHTYYEAGQTVTVGAPLERLPLFVKAGAILPLSERTAYSCAKQDTSR ELLVFPFIREGEATSTIFDDDGETYRYQHNGYLQLTLKLSCNKDSVNLNI QKQGTWTPAYNALKITLPETETRPLTVNGKVFKSGTELSLGDIKES >MS1771 unknown MKKYHYFIKLKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKA RFSQGIEQAGLALMAENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQ KYKRNQELVSGLVRHYYYPSTYFKDNLKISDKYDYQCNNLQGPNGEQLKS IACEISGKFERPSWLYLGKNNGLSFAETTTINANKIYIQKNLDEIIPIDL MLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQNNTEEGKVI SQYNRIGFTSFAFGAQQQNNTAQCYLPYEIKPSITIRNNNYYGGYYNTTM QYSELLSYVGSNQQRYSYATLAQYFDAFVDYDKTIESINSFDGKDLSSLM YFSKNSWCLGSANTRINSTYIWAGKNESADLVSRFNRVPALGATLSSSGL LIGANLLMNTNPDENAQPSKLGANTQRIILVLSDGEDQINNASSSLNITS TLINQGMCEKIKSKLNSLQDKTYLEQPTRIGFVAFGYGPSGTQKAAWEKC VGKYYYVANNKEELLESFRKIIGLVEEVGHSTYKEPTYYSN >MS1596 unknown MASRSNKVFLGEIRLKVKNLPLKQSNKSTPTADKTNLYQTEV >MS0129 unknown MKTSIIFNLPADVINELHKRLRESNYSGFIELENWLKNLGFNVSKSGIHR YAQKLKSLDGFIGRSGSFDLAVQLNNSIDDNTPLNLLYQELGKLEYQKQQ ILQKISAMEAENQI >MS0110 unknown MTDFFKHNLRNKNMTDKQQDKTTKKKVERKFKGVERFTLSYDASDSEYAE HKINAHNLVKVVNEMITLIERSDKLLNGKQKTVEIFLQAPESGVIVKKGS LQIPFAVELYEYICTVKDIVTTIETKDIFTALGLGIPSASVGYGVFKDIF 
RTKGEPVIDVKTQDGSNEVELLTENTKIKTTKETAILMQDDEIRRAIKDL TVAPLANKVDAVFKIKREETTEEQGEITTEETVAVEIESGKEIETLTKLS ERIAQEPEVELKPEQLITITQINFSSGESGWKMRLDGKERAVVLQDVAFM ASINADQASFRKGDWLKVNLKRVKTFGTQTKTTYIITEVLEHLVGKDRKL IEKQDE >MS0684 unknown MFIKWEIAMNESKQTNEQTQYNEINFRRRSVLRTLAGSVLLSVTGSTLAK QCEITSPEILSPRYPDPLIEVSDPSFNKYRLYSSSVERLATGFRWAEGPV WFGDGQYLLFSDIPNNRIMRYDNITGQTAVFRENANYSNGLARDKQGRLL ACEHLTRRLTRTEYDGSVTVLADSFEGKPLNSPNDIAVQSNGAIWFTDPT FGINGYYEGEKAKAEQPTAVYRIDPQTEKLERVLNDLLMPNGIAFSPDEK HLYIVGRFSETPALREIFRYDVSTDGKLNNRTHFFDGGENGTLDGIAVDE DGNIWAGWGSINNSKNGLSGDMDGVIVINPQGKQIAHIHLPERCANLCFG GSKRNRVFMASGHSLYALYVETRGT >MS1842 unknown MSMSKYLAQNFQIFNRTFMFKKIVQWLLSS >MS0618 unknown MTIAIIIATHGVAAEQLLKTTEMLIGEQENVATIDFVPGENAETIMGKYQ EKLATTLSHCDQVLFLVDTWGGSPFNAANRVAEGNENMDIVTGVNVPMLV ETFMARDDNPSLQELVAIALETGRTGVRALRYEEPEEAPVEQAQPVPAAA PTAQPNVVTNKEGHLEIGLARIDDRLIHGQVATRWTKESRVTRIVVVNDD VAKDSVRSTMLKSVAPPGVTAHVVNVDKMIRVYNNPEYAGERMMLLFTNP TDVVKLMQAGVEFKSINIGGMAYKDGKQMITSAVAVDSQDIDAFKILDAK GIELDVRKVSNDARQYMMDLLKKNNLI >MS1165 unknown MVGTPQSSGINPGIPMYSNDSDSSTTKATLTEGRITLNKDSNPTQVTAQS LGINTQLEGANRQVAAPKDIHRELKDQQILSRAAGDVAGAVSSYVDSRKE ALKEEQQAALEEAKKAQARGDVTTAEAKLAEANALENEANR >MS1399 unknown MNSEQTQLVEHIISLEKQALDKWFKGDTSGYRELWSKQNFSYFDIVHPER IDSYDNISAFLDSIEGKLFADSYEFKMPRVQLSQDMAILTYQIFAKTNLI DMRYNCIEVFQKEGDEWKVVHSTWSAIRPMDWDFSTMKAAI >MS1458 unknown MERVDNYEQRFGGIGRLYTPQGLERLRQAHVCVIGIGGVGSWCVEALARS GVGKLTLIDMDDICVTNINRQIHALTGNIGKLKTEVMKERVELINPECKV EIIDDFISPENLAEYLHSDYDYVIDAIDSVKTKAALIAYCKRNKIKVIMV GGAGGQTDPTQIQIADLSKTVQDPLASKVRSLLRKNYHFSQNPKRKFGVD CVFSTQPLIFPQMSEGCGISASMNCENGFGAATMITATFGFFAVSRVVDK LLTKQ >MS0095 unknown MAKVTATVKGSPLLHNGKRYDIGATIELDEAQAENLGIYLDIVKPVGDGN KQTGNKQTDGAKKTQQKDAGKGDESKVE >MS1159 unknown MVVPLSKLTKGNVAYIDSIVANQAFGELDTLVGRRLADLGFSKGVPVEVV AAGVFGKGPLAVRLSNLSQFSLRAAEASKILCHINK >MS2120 unknown MQTKPFGKHPEGQRLARIEQSVHYKAGKFVNHLPTEVQTSDKPLWKIWYD FLFQQIDHLTPNRPLPVVKTDLQQLSREKNFIVWFGHSSYLIQLDGKRFL VDPVLVSGSPLSFANKMFQGTNLYQPQDMPDFDYLVITHDHWDHLDYEAV 
IQLKNKMKEKVITSLGVGAHLEYWGYPAERIIEMDWNEKTELENHFKITA LPARHFSGRGVVRNKTLWSSFMLEVPGETIYLGGDSGYDPIYQEIGQRFN ISLALMENGQYNKDWANIHIQPEQLTLAVKALRPKRLMTVHNAKFALARH DWRAPLEQIYRNAQKENFNLFTPKIGDVFYFSEQGEADSPNFREPWWQSV E >MS1284 unknown MRWENRHIYRVGGLYIKYKENASHKINNFDKSAVKKLRI >MS1045 unknown MLLFVYLGAKMKKLALAALILGSSLALTACDQAKEQASQTTETVTETAKD VKDNAVEKAGEVKDNAVEKANEVKDAAAEKMDAAVDATKEKVAAAKEAVA DKAEEVKNAVSDKAAEMKEAVSDKANEVKDAAEK >MS1068 unknown MADWILILSVLILRRFRKIYAWQDNKNIGDIMSTSHYVSPKGSMDQLSHM EIDLLTKRAQSDLYKLFRNSSLAVLNSGAINDDSRALLNKYPNFEISIIC KERGVTLKLDNSPESAFVDDKIIRNIQYNLFAVLRDILFVNALMQRFGLD AERGNSFITNQVFSILRNAKALSLNEDPNLVVCWGGHSINQTEYAYCRAV GLELGLRELNIVTGCGPGVMEAPMKGAAIGHANQRYKQSRFIGITEPSII ASEPPNPIVNELIIMPDIEKRLEAFVRMGHGIVIFPGGPGTFEEFMFILG IKLNPENRAQKLPLILTGPKESADYFATIDRFVLDTLGEEAQSLYTIIID DAVAVARHMKAEMVEIRDFRCKISDSFSFNWSLKIEHQFQQPFLPTHENM ANLNLHLNQSTVDLAANLRCVFSGIVAGNIKPATQDQIAEKGKFQLYGEP RLMEKVDNLLQDFIVQHRMKLPTDEAYEPCYEICK >MS0788 unknown MPYIRRQTEVKVRSFFTKFLFNGALGSVYKR >MS0253 unknown MIYSMTAFARHEIKKDWGDAVWEIRSVNQRYLENFFRLPEQFRGLENNLR EKLRQNLTRGKIECSLRIDSKKQTSAELNLNKDLAEQVIQSLKWIKQQAG EGEINLNDVLRYPGVVEAPEQDLDAISQDLLNAFDELLKDFIAMRAREGE KLHTVIRQRLDAISVEADKVRAQMPEVLQWQRDRILQRFEEIQLQPDPSR LEQEMVLLAQRIDVAEELDRLQMHVKETASILKKGGAVGRKLDFMMQELN RESNTLASKSINADITASAVELKVLIEQMREQIQNLE >MS0990 unknown MIWMSAIFLRQSNIESVRNIIDFIVRCKSILGKDEGENASWAFLNNFNIL SEDEKEKIKMNLSEDVINFLRLSLEHHYLLFDDYPLAFLFKDYKCGMDRS NAINLLKEDVSALFDRYSEHSTKVQTTAFYSMAITGKIVLNASMNIPDFN SIFSDPESDEAKIVAAFVRSSLNVGNDIISSSNGKNDWSKSFWKQCFDME ECS >MS1644 unknown MSYDANDALNEIEEALSELERVAEDLINNNPNKESELRGQGVHQATKHLR FRIRNIRRGEAI >MS1439 unknown MSLQVNSVAIMLVVLILLGVLSNNNSITISALILLLMHQTFLGKYIPFLE KNGLKVGIIILTVGVLAPLVSGKVQLPAFKEFLNWQMFLSIVIGIAVAWF AGRGVNLMSSEPIVVTGLLIGTVLGVAFLGGIPVGPLIAAGILAVILGK >MS2099 unknown MLKNSQKFGKEFCGICDSSSVYSKIFAKTYRLPIQCKNAVCNSISNGISK RLNRI >MS0995 unknown MNSQVKNMNRKLENIKFVITDVDGVLTDGQLHYDANGEAIKSFHVRDGLG VKMLMESGIPVAVLSGRDSAILRKRIADLGIKLAFLGKLEKESACYELMK EVGVTPEETAYIGDDSVDLPAFNVCGVAFAVADAPDYVKDCADYVLDLRG 
GKGAFREMSDMILKAQGKTDVYSSAKGFLKIVTNMAQ >MS0929 unknown MPNGKSSAIYIAFQNTRTKFRGIVLGAVYLL >MS1303 unknown MRRWLKNVFFLSGKEFRSLFSDPILVILIIYMFTAALYTVATTISTEVKN GSVAVINNDHSTLSYRLQSSLIPPYFRKVHEITANQADRLMDMGEYTFVI DIPPNYEVDILAGRNPQIHLSIDATAMTQAAIGSNYISQIFSREINDFLR LKNNKTFTPIKTAVNVLYNPNYTSKWFMGAMQIVGNLNLLTMLLVGAAII RERERGTIEHLLVMPVTSSEIAIAKIIANGSVLLVVVGLSLRFVTGGLLG VPLPAQAIPLFILGSLIFIFAIASLGIMLAIFAPTMPQFGLLCIPVYVVM YLLSGTTSPIENMPELAQWITQLSPTTIFGSYAQDVIFRGASLDLVWDKL VKMAAIGFVFLAVALGQFKTMLSRQG >MS0512 unknown MIIGPFINAGAIVFGGLIGAALGGRVPERLRTNLTMLFGLCSMCMGIVMI AKVAQMPAMILALLLGTILGELILLEQGINKLASKTKTIVEKILPNNQKK GVSHEEFLQKFVGIVILFSFSGTGIFGSMNEGLTGDSSILIVKAFLDFFT AIIFGTTLGSTIATAAIPQTVLQIALAYSAVLIIPLITPEMRADFAAAGG MLMVATGFRICGILHFQVANMLPALFIIMPISAIWLQMMG >MS0877 unknown MKDERNFYEKNNYCDFACHSNRTFNQCLGG >MS1440 unknown MEIAFLLAGKIIELTIIVLLGYALVKSKLLKSQDSYPLSIIGLYLISPSV MINAFQIDYSPQILNGLLLSLTMAVFLHIILIITGVILKRLLNLDPIEHA ASIYSNSGNLIIPLVVSMFGQQWVIYATCFIVVQTFLFWTHCRSIICGKG SISILKMFKNINILSIFLGVFLFAFQIKLPPLISGTLSSLGQFIGPNAML IAGMLIASIPLRNIITSKRIYLVTFLRLILIPIFLLIIIKLCGFDNWVEN GETIAMISFLATMSPAAATVTQMALIYGKNANKASAIYGVTTMLCVFSMP LIIALYQLI >MS2191 unknown MMLKNIFAGLTVLLLSACTLVTYQPVDTISHVNAKQGYRMRNAIQQPDGN LIILMFSGGGSRAASLGYGVLEEFKNAAVRPTAKGTTLIDNVDLVYGVSG GSVLATYYSLYGRDAVPKFEENFLKKNFQREIISQVFSLSNLPRITSPQF GRGDLLQEQLDQTLYKGATFGDLERKRKGPFVVVSATDMNLGQKITFTQE FFDGLCIDLSKMEISRAVAASSSVPLLFSPLTLNNNGGNCHFDIPELIQI SQNISNDAQKSKNLEELKNTLSLYQNSKERPFIHLVDGGLTDNLGLSGLI DIYDVAGQEGMYREAVKNQLKNIIVINVNAQNEVSSEIDKTANVPGTRDV INTIINVPIDRNSQVSLRRFREFTDEWNKSMANKPPKQRINMHFVNLSLK DLPESQLKKEVLNISTSFYLLHSDVNKLKRSAKILLQQSKEYQDVLRALQ >MS0568 unknown MNLRVFLLMMKKCIRFIFLLLLMFAAAGFWGYNYIQKLVNEPVNIKAEQL LTLERGTTGKKLFALLEKENIIADNILFPLLLKLQPQFNNVKAGTYSLEG VKTLGDLLTLLNSGKEAQFALRFTDGETWKQVKKSLENAPHLKHELKDKT DVEVFHQFKEMLPEFEVQNAYKTLDGWIYPDTYNYTPNSTDVALVKRSVE RMVKTLEKAWAERDEDLPLNNPYEMLILASIVEKESGISAERGKIASVFV NRLKAKMKLQTDPTVIYGMGESYQGNIRKKDLESPTPYNTYVIDGLPPTP IANPSEDALNAVAHPERTDFLYFVADGSGGHKFSRSLIEHNKAVQEYLLW LRRNKNK >MS0002 unknown 
MCGLQEELQKRLGIEEKIVHYYVLIRLSMN >MS0093 unknown MSATQPILNDIAQYLKENLPEWDVELFPNNPGTYSLSHINGAVLISYLAS KFEKPRTTEAVLQTRHVQVALTVLTRDLHDDEGALNLLDKLRLLMVGFRP VNCTECWLVDEFFNGTDEETGIWQYQLILQTETQQVQQIQTQDLPKFVTA HLRRADQSVRPD >MS2313 unknown MNKNKRNISQSIGFSDKFTKTVVYIFLQIFY >MS1363 unknown MKTDFLSSLIFSVGVTLPTILLLILGMLIRKKKMIDDRFCEQSTKVVFNI TLPVLLFFSVYGKHVDYISQMAVLSVGIIGTISLFLLAELFAARFIAEKR ERGTFVQAIYRGNSGILGLAFCISAFGDSAAVPASIYSAAVIFLYNILAV ITLTRSLSTGSVSVVSIMKGVIKNPLIIAILFALIANSISLQLPAPLLST GNYLANMTLPLALICTGATIDLSVFSNKTSNVVLMGSLGRLVVTPVFMIL IGKVFGLDGMLLGVVALMNTTPVASAAYAMVRAMGGNSVTVANIIGITTV GSMITSSLMLLILSQAGWI >MS1976 unknown MLSYRHSFHAGNHADVVKHIVEMLIIENLTQKEKGFYYLDTHSGVGRYRL FSQESEKTAEFEEGIARLWQRDDLPEEVQRYVDLIKKLNYGGKELRYYAG SPLIAAQMLRPQDRGLLVELHPTDFPLLRNNFKEFKNISVKRDDGFQQVK ATLPPKERRGLVLMDPPYEMKEDYDLVVNTIVEGYKRFATGVYAVWYPVV LRQQSKRIVKGLEASGIRKILQIELAVRPDSDQRGMTASGMIVINPPWQL EAQMKKILPYLTNVLVPEGTGSWSVNWIAPE >MS2096 unknown MTIKSVEISKAYRLVQLGSTTMLSAKHDGDADVMAAAWVGLGGPNKIIAY IGTQAYTRKLVEQNGYFVVHIPTVQQMETVLYVGEHSKHTMPNKLDNLPL FYQEGVDIPMVEGSAGYLLCQVIPNPQQEQNYDSFMGEIVAAWADDRVFD GRHWTFDTAPDELRTVHYVAGGQFYAMGKGTKFDHGPGQD >MS1465 unknown MITKQMSEVLQQNCGKNYRTLYLFFSVKYG >MS2015 unknown MQYEHIHEKFRHLVTADNQERIAFLDEPRWLGYGVAKDIMDNLVSLMNKP KRPRMLNLLIVGDSNNGKTTLIRRFFDLYGQAYIDSDSNAIYPILLAEAP PSANEKELYISLLERFYVPYKPTDTIAKLRYQTIHLFREFRVKMLIIDEF HSLLVGTPRLQRQVMNAIKMLCNELQIPIVGVGTRDAIRVLHTDPQHASR FDVAELPTWKLDKDFQKLLFQFQGILPLKKCSNLHSPELATKIHTISGGN LGNVHRLLTVCAVEAITSGTEQITLDIIEKNSWVQPTQGFRKIIG >MS1834 unknown MNLVDFLVDKLDALKATEIECIDVRGKSSVTDNMIICTGSSSRHVASVAQ KLIDESKQAGFESFGEEGKAVADWIVVDFGQAIVHIMQGDARQMYQLEKL WA >MS0289 unknown MSDIAITISILSLAAVLGLWIGQWKIKGVGLGIGGVLFGGIIVSHFSEQN GLQLDAHTLHFVQEFGLILFVYTIGIQVGPGFFASLRKSGLRLNALATLI VALGSLIVVIINKAFDVPLDIILGIYSGGVTNTPSLGAGQQILTELGMQN ITQSMGMAYAVAYPFGICGILASMWLVRLIFRVKVDDEAKKFTQESGQQT ESLQKINIRVANPNLDGLCLRDIPGFDERGVVCTRLKREENISVPKADTT IFLNDVLHLVGDSHSLQRMCLIVGEKIELEPSKLVGNIPFRTGCGYQ >MS1328 unknown MQKFNQIHSLFEHLPANYGEFSDFEQKIATLAQEMKVDLSLYEIDHLSIR 
VNTEDKAKSWLTTLLNYGKILSNNLVNGRAIYLIELEQPLLFMGQRVFII ELPFPKNKHYPVESWEHIEFVIPFLPNESSIEWVERVQQQFLWNQSGNLT IKVDEPKVDGEQLPNPSIAVSFADKSQNHVCIKVHPFNIKNIIKVS >MS0748 unknown MNHIYKVVWSKTTNSLVVVSELASSQGKAASVVSKGYKLSSVFKKSFQLT ALSALLISVMPAAQAAIAVGASTVTNWNGAVSVSLNGASATGASVPYNYH TPNNENYPDQGNNSNSSNIYSGTLSAAQSIAIGINATSQSGSIALGDNSR ATGGLSLALGAFSQTNQAGAIALGTSALASGFNSFATMRQAAATADFAIA MGTAANANATNSIAMGSSALALGNQSIAIGSAAMEKKVGSAGGESYRTDY VGTTNTKAQGDRTIAFGVNTSTTSNDSIAIGSNSKTNSGTGAIAIGWCSS TSYQDSVAIGSNATANGGYSLALGYNATSTNLTSISIGWNAAASNTGGGH SQGAVAIGPKTTALGNQSVVLGASASAVEQATAIGNDSKANGFGSIVIGG DDTGYSRNPNSDPYTPTALGGERIGYLANTATGDNSNYRSSLSSGIGSVV VGVHGQALSNGSTAIGVYSTAGDNGITFTNDTTSTTAIEATAIGALSRAK SIRSSAIGYSAEALGNYSTVVGANSTANGTSSLALGHNSTAYSTYSLAAG YNASANLSNSTAIGSSSNASGLNAIALGTGAQALNTNTISIGTGNIVSGE NSGAIGDPNNITGSNSYALGNNNVIYANNSFAVGNSIYISDTAQNTLAFG TNISVPTSTKTNNTLIGTSAKIQGGESSIAFGTNATVSNSVQSSAIAIGN QSKVEAAVGGIAIGNGSTISSSANNGSIALGQKTNVTGVSSIALGNNASV TGTQQGSVAIGNNTNVTNTGQGTVAVGSDTNVTVGNAVAIGDHVNVKGQR SIAIGSSSNVAEGVVNATTIGTGSNVTQNDGTAVGYNAIVSNYNGLALGA NATSTAQRAVALGADSVAGREGWDQAAYDPYIPANANTSQSAAITATKAT NNYGAVSVGSDTVKRQIINVAAGSADSDAVNVAQLKAAIGSVNTSWNIQE NGTQKDIVNAGDNVSFANGTGTTANVSVDSTGKTSTVKYSVNKSGLSVAT DGTVTAAANGDNFATAEQVAKAINDSEKTTTVEKGSDKVSVTGTTTGTKT NYVVDLSNAAKSSLDKADSALQSWTAQVNGANAKVVNQTNNTVNFVNGTN TIVKADANGNISVSTADNVTFNTVNASSFNAGNISIGTNGINAGNTTITN VANGINASDAVNVSQLNATNANVTNNTQNITKNAADIQSTKDGLNATNAT VAGNTANITNNTNAIANNTAAINKGINFGNGTTDNNFALGDTINVTSDSN IVVNTEDDGVKLSLADNVTVGNVTVNNTFKAGDVTINSTGIDAGNHAITN VANGTQDSDAVNLSQLNATNANVTNNTQNITNNTAAIANNTANISNNTNA IANNTQNITKNAADIQSTKDGLNATNATVAGNTANITNNTNAIANNTAAI NKGINFGNGTTDNNFALGSTINVTSDSNIEVSTVADGVKLALASSIAVDN VTVNDTFKAGDVTINSTGIDAGNHTITNVVKGVNATDAVNLSQLNAGKSS VEAGDNVAVTSTSDANGTVYTVNANISTVSNGSDKVTVTSSSTGNHTTNY AVDLSEAAKASLEKADSALQSLTTSADGTKAQTLDKDNSNANFISGSNIR LTPSADGITIATAENVTFTNVNTTNFKAGDVTINSTGIDAGNHTITNVAN GTQDSDAVNLSQLNATNANVTNNTQNITNNTAAIANNTANISNNTNAIAN NTQNITKNAADIQSTKDGLNATNATVAGNTANITNNTNAIASNTATINKG 
INFGNGTTANNFALGSTINVTSDSNIEVSTVADGVKLALASSIAVDNLTA NNSVKVGNVALTQAGINAGNHAITNVTNGTNATDAVNLSQLNAGKSSVEA GDNVAVTSTSGANGTVYTVNANTSTVSNGSDKITVTQTDAGNHTSNYAVD LSEAAKSSLNKADSALQSWTAQVNGADAKVVNQTNNTVNFVNGTNTIVKA DANGNISVSTADNVTFNTVNASTFNAGNVSISNSGINAGNTTITNVANGT NASDAVNLSQLNATNANVTNNTNNIANNTKNITNVTNLVNQGFNIGADNG TDDNVKLGEKVDFNGDGNIVTTVTNNAIAFALSNTLNLTDAGSVTMGDTV VNGSGMIINNGSTNNQTVSLTKDGLNNGGNTITNVANGSNATDAVNLSQL NAGKSSVEAGDNVAVTSTSDANGTVYTVNANTSTVSNGSDKITVTQTDAG NHTSNYAVDLSDATKASLDKADNALQSWTAQVNGADAKVVNQTNNTVNFV NGTNTIVKADANGNISVSTADNVTFNTVNASSFNAGNISIGTNGINAGNT TITNVANGTNASDAVNVSQLNATNANVTNNTQNITKNAADIQSTKDGLNA TNATVAGNTANITNNTNAIANNTAEINKGINFGNGTTDNNFALGDTINVT SDSNIVVNTEDDGVKLSLADNVTVGNVTVNNTFKAGDVTINSTGIDAGNH AITNVANGTQDSDAVNLSQLNATNANVTNNTQNITNNTAAIANNTANISN NTNAIANNTQNITKNAADIQSTKDGLNATNATVAGNTANITNNTNAIANN TAAINKGINFGNGTTANNFALGSTINVTSDSNIEVSTVADGVKLALASSI AVNNVTVNDTFKAGDVTINSTGIDAGNHTITNVVKGVNATDAVNLSQLNA GKSSVEAGDNVAVTSTSDANGTVYTVNANISTVSNGSDKVTVTSSSTGNH TTNYAVDLSEAAKASLEKADSALQSLTTSADGTKAQTLDKDNSNANFISG SNIRLTPSADGITIATAENVTFTNVNTTNFKAGDVTINSTGIDAGNHTIT NVAAGTNKTDAVNLGQLEQFIGDNSYNWNLSDGTNNSAVADNSTVAIEGS ANGDSANTSGIVTMLDGTNVSVDLSDKAKESLDKADSALQSWTAQVNGTD AKVVNQTNNTVNFVNGTNTIVKADANGNISVSTADNVTFNTVNATTFNAG NVSISNNGINAGNTTITNVANGTNASDAVNVSQLNATNANVTNNTKNITN VTNLVNQGFNIGADNGADDNVKLGEKVDFNGDGNIVTTVTNNAIAFALSN TLNLTDAGSVTMGDTVVNSTGMIINNGSTDNQTVSLTKDGLNNGGNTITG VANGSNATDAVNLSQLNAGKSSVEAGDNVAVTSTSDANGTVYTVNANTST VSNGSDKITVTQTDAGNHTSNYAVDLSDAAKASLDKADNALQSWTAQVNG ADAKVVNQTNNTVNFVNGTNTIVKADANGNISVSTADNVTFNTVNASTFN AGDVSFNTSGINAGNHTITNVANGSNASDAVNVAQLEANTTRYYSVNSTV AGNRNNDGATGINAMAAGANAVASGDNATAIGQGTKANSAAAIAIGNNAN ATSSRNDSVIAIGNNAQSTGSYSIAVGTNSVANHTWSMAMGISAKAIDDY ATALGSSAQATSQWTTALGAGANATGSAATAVGSNTTATAGGATVVGYNS SVTGANTTALGNNINVDTEGSVVLGNGSTAASATTETTATVNNLTYSGFA GADNVATGDYVSVGSVGEERQIKNVAAGNVSATSTDAINGSQLYATQNVI GNVANSVVNNFGGNATVDQNGNITFTDIGGTGANTIHDAIQNVSNVANMG WNVQANGDTATKVAPGNTVQFINGQNIEIDRDGTNITVATADNVTFTNVN TTALTAGPVTINSTGIDAGNHTITNVAAGTNATDAVNLAQLESYVGDNSY 
NWNLSDGTNNNAVADNSTVTITGSANGDGANTSGIVTELNGTNVSVDLSN KTKADIQQGVDANTTVNTKGITFAADSGTATERKLGETLAINGDGDLINT TVSAGKVEVAASDKLKGAVNNATTALQSWTAQVNGTDAKVVDQTNNTVNF VDGSNINITNNNGTIKVATTDNVTFNTVNASTFNAGGVSISNSGINAGNT TITNVANGTQDSDAVNLSQLNATNANVTNNTNNIANNTANITNNTNAIAS NTVAINKGINFGNGTTANNFALGSTINVTSDSNIVVNTTNAGVQLGLADN IAVDNVTVNNTFKAGDVTINNNGIDAGNHAITNVTNGTNATDAVNVSQLN ASKTSIVEGNNVNVTAKTDTNGTVYTVNANTSTVSNGSDKITVTQTDAGN HTSNYAVDLSDAAKASLDKADSALQSLTTSADGAKAQTLDKDNSNANFIS GSNIRLTPSADGITIATAENVTFTNVNTTNFKAGDVTINSTGIDAGNHTI TNVAKGVNATDAVNLAQLESYVGDNSYNWNLSDGTNNNAVADNSTVTITG SANGDSANTSGIVTELNGTNVSVDLSNKTKADIQQGVDANTTVNTKGITF AADSGTATERKLGETLAINGDGDLINTTVSAGKVEVAASDKLKDAVNNAT TALQSWTAQVNGADAKVVNQTNNTVNFVNGTNTIVKADANGNISVSTADN VTFNTVNASTFNAGGVSISNSGINAGNTTITNVANGTNASDAVNLSQLNA TNANVTNNTNNIANNTKNITNVTNLVNQGFNIGADNGTDDNVKLGEKVDF NGDGNIVTTVTNNAIAFALSNTLNLTDAGSVTMGDTVVNGSGMIINNGST NNQTVSLTKDGLNNGGNTITNVANGSNATDAVNLSQLNAGKSSVEAGDNV AVTSTSDANGTVYTVNANTSTVSNGSDKITVTQTDAGNHTSNYAVDLSDA TKASLDKADNALQSWTAQVNGADAKVVNQTNNTVNFVNGTNTIVKADANG NISVSTADNVTFNTVNASTFNAGNVSISNSGINAGDTTITNVAAGNVSAT STDAINGSQLYATQNVIGNVANSVVNNFGGNATVDQNGNITFTDIGGTGA NTIHDAIQNVSNVANMGWNVQANGDTATKVVPGGTVQFINGQNIEISRNG TNITVATADNVTFNNVNTTTLTAGPVTINNSGIDAGATQIKNVAAGTEDT DAVNYKQLKDAVSNSSTTWNLTDNNDTANSTTVGNDSTVSFNNGTNTVAV VNGTNVSYSLADNIALTNNGSVTVGNTMVDNTGISVGDNVTVTNTGFVAG NVTVKQDGINAGGNKITGVADGDISANSTDAVNGGQLYNVIQNATAGVKT EVEAGKNIVVTNSTGANGQTVYTVETAKEVDFDKVTVGNVTINKDTNKVS GIANGDISATSSDAINGSQLYTANQNVADHLGGGSKVDENGNVTAPTYTV VTNPSTNATTTANNVGDAINGLNTAISKPLTFAADSGSNSEMRLGSTVSI KGGVSDSTKLSDNNIGVVSDGKGNLTVKLAKDISGLNSVTTGDTTMNSEG ITIKNGAAGSSVSLTKNGLNNGGNRITNVAPGEVSQDSTDAVNGSQLHAT NQQVVRNAQAINQVANHVNKVDRNLRAGIAGAMAAGGLYHATLPGKSMVA AGVGTYRGESAIAVGYSRLSDNGKLGVKFSVNGNTRGDTGAAASVGYQW >MS0087 unknown MRDQSVATSAKLKTLGKMTALGVTGALTGIKASGNAVLGLAEPAMKFESA MADVQKVVDFKTPEGFKNLSNDILDMTRTIPMAAEELAAITASGGQLGVA EEDLKSFTTTIAKMSVAFDMSADASGDAMAKIANVYGIPITKLGNLGDAI NELSNNSPARAADIVNAMSRVGGTAKQFGLSENAAAALTNSFISLGKAPQ 
VAGTAINGMLTKLMTAEKGGKAFQGALNQVGISAKQLKRNIAKDGQAALV DFLKRLEKLPKDKAMGVLVDLFGREYADDVAVLAGNVNVLDKSLRTLQET DANGNLKYLGSMEKEFASRSATTENGLKLLSQSTDEFFKVVGARFLPIIN TVSGGLAKLMHRVTDFAKEHEGLVDTFIYVGGAIAGVVTGFSALSAVIGV SGMAWIGLSKPIGMFVSVLGTVFKWLKLGGLLFATLGVKVLDMALTFGKA MFMMGRALLTNPIGLAITGIALGAYLIWDNWSWLSAKFGSLWQTVTGYFS AAWDNIKGFFSSGIGNITATILNWSPLGLFYQIFRPVMSWFGVDLPNSFS GFGKNIIDGLVNGIRRAWNGAKDWVIGLGQSIKGWFTGEMKIHSPSRVFM EYGDNIAQGLAIGVAKNAVLAADAVLAMGDKMKNAAPKSIPSPVTQPMQV KTPVQDMADFAVDMAKNAITVPIKPVAQSVTPVIKKSAKQVSKPALIQSA VKSEQVLAPSVQTSEMPTPVTMKNARLQYIQRMQTLDKMQSTVKSEPLFT PVKTIAEPILSKEKGFFGSLWDDVKFGANVVGNLLGLSQPSLKTPDFNPS AGGRDSLIFSDYEPLNRNAVSQRTVNQDAGGIVVNFNPTINVNGNAPQGV TEQITQALQMTAHDFERLLNRVLDQRQRRAY >MS0005 unknown MKINLMNLNVNMNRAILFFSGDLDYYKTKSAVFFTALCCSDE >MS0412 unknown MQKNLIILTALLATPGCGTIVQLANPSHKYEAYDGTKYDWQQAQKWGMPI LDLPLSFLLDSALLPYVLSQE >MS0012 unknown MILNYLCSALSKNKDILSNIINKRGFMMKKLLTICVITTILAACSMPNGR SKPQEGNMQIEEALKACQQAMTSNSTREDFDACMLKKGFERPANKNQPKA N >MS0857 unknown MFTSIQREVNQFINRGLDRTLRIAVTGLSQSGKTAFITSLINQLINIDNV TNGHLPLFEAARQQRIVGVKRIPQINLNIPRFDYEANLNSLMASPPQWPQ STRGVSETRLAVRYHNSGLFSHIKEKSTLYLDIFDYPGEWLLDLPLLNLN YQQWSLEQQNLRQGLRAELAQTWLEKTKKLDLTAMADEDILAQIAKDYTA YLQACKEQGLHFIQPGRFVLPAELEGAPVLQFFPLLHLAEKDWKKLKEEA KPNSYFAILNQRYDYYKNKVVKGFYENYFVHFDRQLILADCLTPINHSRQ AFQDMQEGLQQLFKNFHYGKRRLINRLFYPRIDKLMFIATKADHITSDQI PNLVSLMRQLVQDGGRHVAFEGIETGFTAIAAIRATKQVLVEQEGKTFKA LQGIRSKDKRQVTVYPGSVPSRLPSIDFWQQQKFDFDQFEPQPLESGEII PHLRMDSVLQFLLGDKLA >MS0310 unknown MRHIKWRRFPILVKIGRLILSKINIYCFKLNNI >MS1383 unknown MLSRNPLFPNKVTRYELTPETVDCVVFCSKNYRLILPDLHKITDRFNTYF HYTITAYGKDIEPGVPTA >MS2018 unknown MTFISSFYRGTIPKPGEISLAHNGVLFLDELPEFERRVLDALRQPLESGE IIISRATAKIQFPAKFQLIAAMNPSPTGNYQGTHNRTSPQQIMRYLNRLS GPFLDRFDLSIEVPLLPKGALQSLDNRGETSAQIRQRILQVREIQLTRAG KVNAHLSGKEIERDCKLSTQDSIFLENALTKLGLSVRAYHRILKVSRTIA DLDNELHISQRHIAEALGYRAMDRLLQKLNNND >MS0620 unknown MYKNRRLTLIPHKNRKVRSKIRKFRLSGENHGNSTALW >MS0615 unknown MTNTILLICIILFFAYAFYDQFGMDKRKGETKLKVRLKKQAKIDAVIFVI 
LIACLFAYQSKESLLNIDSFTIFLLATAVVLAVYTAFIRSPMLILKEKGF FYSNLFIEYDKIRQLNLAEGNIFVVDLTNGKRLLLPIADERDKEKVVTFF GGYKEHKQENK >MS2359 unknown MYNHNENQENSTALLELIDGFEQNPAELFQTEMEKVAENMKDLPFYREDI PCFCPKFVQFENQWIGMALTPWMLSVLVLPGPNQQWKARTVGDKIALAFP YKTLNFTVSSLDNVPQYLSCSLHSPLEANLSKEHAVQLTKDCLTMLLSLP IKQKAPSDLNRRNMFGAMLK >MS0983 unknown MSKIKTSFNVVEEKSAHLAVLIDADNASAQTIKAILEETTKFGEATVKRI YGNFVGDSGKWKAVINEYAIKPMQQFAYTKGKNATDGFMIIDAMDLLYTN RFDGFCIVSSDSDFTALAIRLKEQGVTVYGFGKKQTPEAFLNACSQFIYV ENLLPELNDDKQVDITLPNASNTQKKVQQTENSTPTVSIDNQKIQSTELP IETIRKVFEQFDSEWVAMTAFGSTWKRLQVDIDPRSYGCKKFTDLVKKYP DVFDYKMETDSDTTQEHMYVKLKI >MS1853 unknown MNFDRTFFNKKTTYSLPYRISTRTFLHKVGGGA >MS0952 unknown MLQSPDLINKQEIIMQALKQYLIEITEQNLNDTLQLSKEHPLVLVFYAPS HQPSVEFTTLLERYAEQYQGQFALAKVNCETQQAIAMQFQIRNLPTAYLF KEAQAVDAFQNVISEEELKQRLSQILPKEEEIKFNLALDFLQAEDYDKAL PLLKEAWELSERKNSDIALLYAETYIAMKKTEPATEILNKIPLQDRDSRW HGLQAQIELLIKAADTPEIQQLQADFSKKPTTEIALKLAVQLHQANRNEE ALELLFNILKQDLSAQNGEVKQQFLSILSAIGNNDPLTNKYRRLLYSLLY >MS1078 unknown MRDQIKADAKIIGFFVLVFDYSNPYFYSYKSYN >MS2110 unknown MANQMVMAFLGRLWEFLTNLWWLPLALLFLAFLKSRWFKGRFGEKAIQSR LSGLDKKVYRPFHDLIVPSHNTTTQIDHIYVSCFGIFVVETKNYSGWIFG SEKQARWTQTIYRKKHSFQNPLRQNYAHIKALASLLELPESVFHSVVVFL GGCEFKTQMPENVCYIGQVEHYIRNIRMVMLDNTEVDRICTILQNKKYAV NNATRVVHKNNLRQRHQSYN >MS0989 unknown MKIENDLIRFIQSYETYLVSSFSKLWSAVKVEYDKLEAYSVIGGLLSRQV TLAIQMTRSPNILNGHSAPLFLRSMTDLHITLAWIMLDLEERSKKYILYG LGEEKLLVEHYKKRIDDSPNNPENELMEGMIETRLHWIDSQRRDFLVEVN LGSWSQLDYRKMAQEANCESLYTFAYKPFSQGAHNMWPHVSRYNCKYCES PLHKYHLIPDLFEAPADLDYLFRSCKYVHMAYEIFINKFGLDFSELMPLD WWDNYFMEIDVEENAING >MS1057 unknown MRSIFGKIFMRQQINKWKYHFNWIKHYISPSLVVIITVFSAYLVYQQNYL MKESKRPFLAFTPKIIVLPNNPQMIEAKIVVGNYGEGSAIIDEFKVQING QTYQSNYSSKWREILSHNQISHSCPLSEGWLIKNAVLKAGDEEDNFLRLL YPMAEVPNGKEPCITPFIDLIKAGKLTLSVKYHSIYNISYSQENIIEYDF SALQEQFKSP >MS0594 unknown MVCKTVAVVTSTIGRESLERAIRSVHTQTYPCRHYVFVDGEQFHSSAKAI LDKYPHVIALYLPMNTGANGWFNSYINAAAPFLVKEDILCFLDDDNTYRP NHIQTIVDCFNAESNLDFAYSLRNFVRPGGAFVVRDDYQSLGRYVHKLVS 
GCTYNIHVQGKKIPVVCRFSKQNLIDVNCMALSLVCARRVANIWCERGYG NDKAVTDYLLANTKGEMTGRYTVDYTINYRDAFSTYDEFAQYLSENFAKE IAEKFLTLFSQENIDAYFGERPWAKE >MS2138 unknown MNPIYKLDNLIKTLATNLSPVILLLTRVIIGYMFLLHGLQKLTGGVELTS LMGVGGIIETLGGIFIILGLFTRFTAFILAGQMAVAYFMFHASAETLFNP VENQGELAVLYSMTYLILMITGAGKISLDAKFNK >MS0455 unknown MDNTSAKAEAGSVFSFILTVMVEELGALVEDK >MS0378 unknown MNVSLINKRNHFKKCGQNFENFDRTLPSRLP >MS1714 unknown MNIRWNIILGTIALVLLAWFYTLNQDKPDLTRLIKAPESPEYTGHKMETT IFSPAGKKQYQAYSDTARHYDQDGHTEFVNPVVFALEVETENQGKQSWKL TAKSATLTKDNLLYLNGEVVAQSLDPISRLQRIETEAAVVNLKNQDITSD NMVTIRGLNFTSSGLKLTGNLKQQAATLKEQVKTHYEISNQ >MS2061 unknown MWWIILLKSAVNFYEEKGLNNIQAFCFGIQLKISRLR >MS0168 unknown MILFAGDPHGYFKHLYPFVRGKEDIALIILGDLQLTTVEELDKLSQYCDL WYIHGNHDSKTVAAFEALWGSKWKNRNLHGRVAEIQGKKIAGLGGVFRGQ IWMPPNKPLFLDPIHYCQYCSQEKIWRGGIPLRHRSSIFPADIENLSKET ADILICHEAPKPHPSGFTVLNELASQMQIKHLFHGHHHENFDYSELAPQT PFAITNVGFRSLCDEKGNYLLKNIDDRKNKP >MS0491 unknown MRQVIMILAAYGDIAQLGERLNGIQEVVGSIPIISTKFKALNLFQGFFVF ACYVQNSVKINRTFIVKKCG >MS1712 unknown MMVDLISALGRNVINSVKALGRAGFMLFGSLVGKPQIKKHFPLLIKQLYV LGVQSLLIILLSGFFIGMVLGLQGYVILVDFAAEANVGQLVALSLLRELG PVVTALLFAGRAGSALTAEIGLMKATEQLSSLEMMAVDPLRRVIAPRFWA GVIAMPVLTVLFTAVGIIGGHLIGVEWKGIDSGSFWSVMQNAVRTLDLWD GFIKSLVFAFTVTWIALFNGYDCIPTSEGISQATTRTVVNASLLVLGLDF VLTAIMFGAG >MS2197 unknown MTDLQELRAETREIITDLLNDGSDPDALYIIEHHIAHYDFDKLEKIAVDA FKAGYEVSEAEEFEDENGKVIYCFDIISEVELKPEIIDLQQKEILPILQK HNGIYDGWGTYFEDPNASDDEYGDDGEFFDDEDDFDDENERPVH >MS0270 unknown MLVRLGLYRIFQVYKIKKSATNRGFSVISELIFI >MS1382 unknown MESRDIGAYDSCPSGCKYCYANKSSAKARACSNITIRIRPYCSGICVKRM LSLKARKKAF >MS0868 unknown MAGLTDKGTFMEVTIEITVILFTVAVIAGFIDSIAGGGGLITIPALLMTG MPPALALGTNKLQACGGSFSASWYFIRRRAVDLSAVWLILLMTFIGAVIG TILIQLVDASLIKKVIPFLVLAIGLYFLFTPKLGEQDARQRLSYGVYAFT AGVSIGFYDGFFGPGTGSILSLACVTLLGFNLAKATAHAKVFNFTSNFAS LIFFLIGGHILWSVGLVMLVGQFIGAHFGAKMVLSGGKKIIRPMVVIMSF IMTVKMAYDQGWFS >MS1310 unknown MRLSHHSCKNITDFALINAHCRYFSAEIVFQQKCGEIL >MS1772 unknown MWCKNMNVLHSDTTISPLRRFIRQQRGSVTIEFVFMLILLILILAFMTDL 
AMLRSTTGRLDNISYSLANILRERTQLYDGKENLATENVNRDVNNFKLLA KRMAFGDKNSNKEIYVVLEYLAPQNSVYRIIGDSAKCEPYDSLQGLENLS PRSEINDTRKIPLYQVTVCVPNYSFFSALVPGVAKNMKETIRSSSITVSR >MS0994 unknown MKTPNMYGMTENMGKCSFKFKLLTENKDFLREKRKDLKSVKCGRKLGNFT KKARIFLRLRASCI >MS0384 unknown MYQKIRRFLSTCYLRRTIMALRNEEGHIDLKLMERMMKIPGGLVIIPLLL AVAIKTFFPQFFEIGGFTTGLFQKGQPAMMGIFLILCGASINIRQVGMPL YKGVVLTSSKFFLGVALGLLVGHLFGPEGIWGLSPIVLIAAITNSNSSLY ISLSSQFGNSTDTGAISILSLNDGPFFTLIALGASGLANIPFMAVVATLI PLLIGFVWGNIDTKFRELCGKAQPIIIFFMTIAIGSGTDVSTILKAGASG IILGILSTLTAAVFFYVFNIFLPKRERNAMGAAIGTTALNSAMTPAAVAD ADPTFTPHVPLATAQCATASIITLFLCPFVVAFFDRQMRKNKLGIYSAEG WAGKALAEREALAKSAS >MS0880 unknown MPILAYVNLNVVSKMQYSEVVKKKEGNLLQKSAVKIYRFL >MS1742 unknown MTNKNLASLFYLALFSASSAVAAVNPSDLIWKSAAFGQSTDLNFGSTILP EKVGVNKTTVDGHPVQEGALATKFTIESRGGKLANSHEGLTYYYTELPIN TNFVISADVRLEQLGPETGAKPNRQEGAGIMVRDIVGKPRAEPQPMGYEE FPAASNMVMNLLRSNTKAHNGKVNINASYREGIKEPWGTAGNKLVREDYA EGIDFENNPLRLTLEKNDQGFVVTYIQDGKEYKKVLDKVNPGILANQNAD KQYLGFFASRNAKITVENVDLKLTDGKKVEPAKFTPKAMPLIVNIESSTK ATGSDYVFQARANYEGSFVLQRGNKTLFKSPVVKAGEYVQHKLKLNQNKT DLKVQFIPKAKLKENGFEQNISIEKHQLQNPKLLYVATNGSAAGNGSAEK PLDLVTALELLPPGGTIQLQPGEYAAVTLDTTMSGLKDSPKTLKGIEGKV KFIGEVLHKASFWNMENIEVSGASLIVHGSHNNFSHIVTHGAPDTGFQIT SPEKIGRSLWASHNTVTDSISFNNMDDSQINADGFAAKMRIGDGNSFIRC ISHHNVDDGWDLFNKVEDGPNGVVTIKDSIAFMNGQTLKLKSKSASIGNG FKLGGEGLPVNHVIKNSISFRNNMDGFTDNFNPGTFTVENNVAIDNKRFN YLFRKSPYENGPKQGVFKNNRSFRFYQESKYTDVVNGSLLNNNEFLTTTE MTPAKTELLQKLQALSKVEFSEDNTGLEEVKKIQALLR >MS0742 unknown MLMFANRTFLCNIRANFIDSSIYKKTSMDMSQDSVKSLQSTFKTVGVVGR PRNDSTLQMHKNIFHWLCEQGYQVLVENEIGKALNLSENHLASLDQIGQH AQLAIVIGGDGNMLSHARILCKYNTPLIGINRGNLGFLTDIDPKNAYAQL EACLNGEFFVEERFLLEAVVKRHGETVARGNAINELVIHPAKIAHMIDFH VYIDDKFAFSQRSDGLIVATPTGSTAYSLSAGGPILTPQLNAIALVPMFP HTLSSRPLVVDGNSKISVNFAEYNIPQLEISCDSQLALDICCNDVVHIQK SPYKLRLLHLHNYNYYNVLSSKLGWLKKLF >MS1960 unknown MLSDVFYDPEEIMAKKTNQVPIAENSQTAYLHNRGTIQDNAVKALLRTPL FRSRIEKKLKGKGSYQRKAKHAGRYFEKPDDKSFGYKSFIIGFLLGPAYL L >MS1449 unknown MFLSRISFYFTLVCGLIIAMVIAPSAKANMFSVSETEINQYLSKKGEIAD 
KIGFPGLFAMDYKVQNLTAKVGQNNDGRVELSGTIDGLLNLQKNDYVGKI DLTVDTIPYYDAEKGAVYLRDLRITNWTGSPQQYMEKLEPMMPFLSRSLA ALMATMPIYTLDESKPRDMLIKKFAKGIRVEKGQLSLDAGIL >MS1425 unknown MQMKTTANLTALFNLFWLKTKIKWDFTYLAVYS >MS0355 unknown MQGLLLDEPLERSNVLSTKWTPKAQKCGQF >MS1375 unknown MHCPFCSTEETKVIDSRLVSDGYQVRRRRECTKCHERFTTFETAELVVPK IIKNNGMREPFNEDKLRRGIQHALEKRPVSADDVEKAISHITHQLRATGE REVPSKLVGSLVMEELKKLDKVAYIRFASVYLSFENINEFSNEIEKLKD >MS0944 unknown MTEQTFIPGKDAALEDSIAKFQQKLTALGFNIEEASWLNPVPNVWSVHIR DKDCPQCFANGKGGSQKAALASALGEYFERLSTNYFFSDYYLGQDLANGE FVHYPTEKWFPIEDDSSLPEGILDEFLLNYFDPNRELTPELLVDLQSGNY DRGIVAMPYVRQSDQQTVYIPQSIIANLYASNGMSAGNTKYEARVQGLSE VFERYVKNRILKEAISLPPIPQEVIEQYPTIAASINKLEEEGFPILAYDA SLGGKYPVICVILLNPNNGTCFASFGAHPNFQVAFERTVTELLQGRSLKD LDVFSPPSFNNGDVADLANLETHFIDSSGLISWDLFKDEADYDFVHWDFS GTSHEEYDNLMNIFNEDKKEVYIMDYNHLDVYACRSIVPGMSDIYPADDL IYANNNMGMEWREILLDLPHFHHDKETYLELLEELDEQAIDDATRVREFI GLVPPPKSGWVTLRTGELKSMLHLALGDLEMALEWANWTYNMNSSVFIPE RANYYRCLISTLELFLDESRTPIQYRNVFEKMYGKTAADFAWNAVQGGNP FYDLLADDEHLNKFQAHQKLLKAYEKLQTAKRENWK >MS1469 unknown MLTTLLQTHITFAFLSLMLLIVRGYMQLQGKDWRTVKLLKITPHLADTLL VLSGVALVFVFGYGLQMWLIGKVLLLVLYAFFSAKFFQKNAVKSNILFLI LALSAFLAAMYLGYFH >MS2305 unknown MNYKNVIGFGVAGNFAGHLEQAGEANDFLAVKTQEAVQPKAIFPFYVPSE KAGFLSVYPLSSNKLRFPDNSGDNLQIEPEIAILCHVIYQNNQVVKLIPY SFGAYNDCSIRRPNANKICEKKNWGAETKGLSDTLIPLTSFELGGEIDKY RIACFHRREEKTEIYGLDSPALGYSYFHTKLLDWIVDRMNNQPDQGPMNN IAELLAIADYPTEAIISIGATRYTEFGESHYLQKGDTSIVVVYNGEKYTK QQILEMAQAQSFPDDISALIQQVIK >MS0114 unknown MENQLAELKNEIEALRTAQEELQLLLGAQKLLFNAVAATLDKEKKQAISQ AIYEMLNSHAVFSAQEPVVLAARDHLLTFANLMAQQANEQSPE >MS0482 unknown MIRFYQLAISPMIGPRCRFTPTCSCYGIEAIKTHGALKGSWLTLKRILKC HPLSKGGYDPVPPKINNNVEKK >MS1046 unknown MHSLTANCVSWGKTSKLNRILLNYLYKEKS >MS1828 unknown MSETKTVNLSELPQQKLKDLLEFPCSFTFKVVGAARPDLIDDVVMLVQQH AKGDYNPRNAVSSKGTYHSVSIDIIAEEIEQIERLYEELAKIEGVRMVL >MS0520 unknown MKSAVKKRDFLSVDFANDFKKRSNRITGFNKISEIVFFEKTPRLYIVIIK PRRLLRG >MS1448 unknown MTTKKQAVFSRLVNELVQKNQGKRIFSFDFENQTYWVKQPEKLTGVWKIL 
KPHPKQSFREELHILKNLYERGAPVPQVILSGEDFFVLKDVGPTLNHWIE NAGLNLTPAEKNQILVDAIKALTSLHKKGVTHGRPAIRDIAWRQGKVTFM DFESHSRSLNLQWHKIRDVLVFIHSLCRSKHLSGEQIQYLINKYEEYCES DLWQDVLNLVAKFRFLYYILLVFKPVARMDLIAIYRLFQYLLPLTEENK >MS1369 unknown MLVVLFSLVKEQINDNFTIITSSQNFDKIDRTFMRIILISSLRRTG >MS0071 unknown MFRVVGEINMKKYISDKNFLAGFIFFFVSAFYLISAFQIETKNLVSVEAD FMPIIYGSLLLTTSIVLMITSFFKIRNTVVNKENKETDWKRIFSVIGLVF VYVLLMQYIGFIVTSIPFLFCLSVLLTPLYIKKNYIVYSIFSIVLPILAY FLFSYYLNLTMPSGFLF >MS0444 unknown MATMNIVLLTFGSRLENHYQASFAILSFLKDPAVKRVIMVTDRPEFYAFF GNKIEFIQINEDTLTQWQGEYQFFWRVKIKALEKVQKRYPAEHLLYIDSD TFLATDLAGIQDKLSQNQLFMHKLECALGDEIDNTTKKMHNSLKDKTFAG IRLDSQSTMWNAGVIALPANKAKEIIALSLRLCDEICATDCTRRLVEQFS FSIALNHYGGLNACDHIIGHYWGNKNEWNKLISAFFVNALLKNLSLQDCI NEVAEFDWNRLPIHKKQRSTNSKLKALTDKYFPDKNISYFSK >MS1803 unknown MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK SLRR >MS1397 unknown MMKKSLFLTALSLAILTGCQNVGSQALQIEKQGSFTVGGSYVTHKGTFKQ ENFIAPEGQRAYGDFAYVKYQTPTNAKKYPLVFQHGGAQSSRTWESTVDG REGFDTLFLRKGYSTYLVDQPRSGKSNLSTKAITPDTPWASNPMYADKTF WILSRMGHYDSHNQPVANAQFPAGEAAYQAFQQAWTIGSGPLDNDLNADV LTQLVDQTKGAILVTHSMGGTIGWRTALRTDNVKAIVAWEPGGTPFIFPE NEMPKITKARFEALSGAAMGVPMNEFLKLTKIPIVLYYGDYIQVGSDNVG EDKWGTELAMAKQFVATINKHGGDATLVHLPEIGIKGNSHFLMGEKNNQQ LADLMADWLKQKELDK >MS0299 unknown MNISVKNRKKTTALFRNEKHQHSRLSAELAF >MS0848 unknown MLTAFLFRIGLNFFYRADYCNGNKIQQILTALLTLFCYLFY >MS1309 unknown MRIAGTFLRKLFFSKSAVKFCEILSDKIQIGS >MS0358 unknown MTTKTTDETFTVDCPICKKAVIWSPQSPYRPFCSKRCQLIDLGEWAAEEK AIPCENADFAMDPENNEDWAKH >MS0847 unknown MADSRIVLDAREQSTSLLSTHKVLRNTYFLLGMTLAFSAFVAYISISLGL PHPGIIVTLVGFYGLLFLTNSLANSGWGILSAFAFTGFLGYTLGPILNVY IGAGLSETVVLALSGTAAVFFACSAYVLTTRKDMSFLSGMIFSLFIVLLL GMVASIFFQTPALHLAISGLFVIFSSAAILFETSNIIHGGETNYIRATVS LFVSIYNLFLSLLQLLGIFGGDD >MS0828 unknown MHWFYKKGGITPTPQAIKSLKIYAKLTALLINRVPL >MS0867 unknown MKKYLKTLVFSTALLTGLTTVNAAPEDWQRIKRPIPSSNGQAEPIGSYSN GCIIGAQAMPARGDGFQVIRMNKNRFYGHPEMISYLQRLGKKVEQAGLPT MLIGDIAMPGGGRFLTGHASHQMGLDADIWLRMGSMSDQDALNSDGKGLL 
VVDRAEQRVDENIWNQNHFNLIKLAAQDSKVSRIFVNPAIKVKLCQTERQ DRSWLQKIRPWFGHDSHIHVRLTCPYGANYCENQPAIPRGDGCGAELYSW FEPQKPSSGATTKKTLPPEPFLCQQVLNSPNRSEWQD >MS0331 unknown MYMFVLFGLNSEHSQHINYFLYTGELSMKKLLKLSLVAGLAMTALAVQAE ERFITIGTGGQTGVYYVVGQSICQLVNRDTAKTQIKCNAPSTGASIANLN AIADKQMDMGIAQSDWQYHAYNGTSAFEGKKNEKLRAVFSLHAEPFTLMA RDDSGIKTFDDLKGKRVNVGDPGSGTRATINVIMAEKGWTDKNFKVAAEL KPAEMASAMCDNNLDAITYNVGHPNGALKEAAASCDSHLVPVTGPEIDKL VSEHSYYAKAVIPGGLYKGTDNPVETFGSYATLVSSTDVDADKVYAVVKA VFDNFDRFKRLHPAFANLKEEDMIKNALSAPLHEGAERYYKERGWLK >MS0947 unknown MTVQSGLLLEHCKAAIYLESEIHNLSLIPKACRQFNQALEQLREQYPDAM LGAVVAFGDNAWKRLSQNSAPELKSFVALGKGNLAAPATQQDLLVHIQSL RPDVNFSAAKAAMDAFGEAIRVMQEIHGFRWVEERDLTGFIDGTENPQGD DRPVVGTIAEGEDADGSYVMTQRYEHELAKWEKLSQHKQEQVIGRTKPDS EELDEVPETSHVGRTDLKEDGKGLKILRQSLPYGTASGTNGLFFIAYCAT LHNIEQQLLSMFGEKDGKTDRLLGFTKPVTGSYYFAPSLEKLLSL >MS0011 unknown MTKQIAVLIGSGSTTSFSKLVVSHLQKMAPASIQLNIVEIADLPLYDRDL DENSPAQYTRVREQIANADGVILVSPEHNGAISAMLKNAIDVVSRPMGQS KWFGKPAGIVTVAAGMAGGVRVADQLRTIASGSFIGMPVYQQNACVGGLF NGVFDQNGEITIDAVKQMLQQFIDGYAEFVAKF >MS2009 unknown MPVFNAHVAQGKLTKEQKQGLADAFVLAIHDALNAPMEDQFVIINEHPQD NIFIHPTFPNMQRTDKRMVVTVDVSTTRTLEEKRKLTELVTKYAVEKAGI GQDDISLLIYALPLENMSFGRGILMPDDAEAMVKRTRS >MS1715 unknown MKLAINKILLTSALVMTSLSAFALKDDTNQPINIVSDNQSLDMEKSIVTF TDNVVITQGSILIKANKVIITRPPEGSKQKETVEAFGNPVTFHQMLDDGK PADGKANRVHYDLGKEFLTLTGNAQLKQLDSTIDGDVITYDVNKQQLKAS STAKSRVKTVLIPSQLNEKKK >MS2379 unknown MTHLIVATHGKFSQEIVNSAAMVFGEDENTHVVTFLPGEGGDDLVAKYKA IIATLPENEPVLFLVDLFGGSPYNAAARVAAEYENSDIVTGISLPMLLEV LDAKDGASLPELVETAKEVGLAAVKSFRQPKEEAKPAVKAEVAPAAAPAP RDPNLKGNMNISLLRIDSRLIHGQVMTSWAKAVKCEAIFAISDEVANDAI RRELLLQIVPEHLKGYVITVDKAIKVWHNPKYAGKNIIWLVTNPSDIVRL IEGGVKIRNVNVGGMTFNEGDQLISQAVAINQTDLAAFYKLLELGVDMSL QQVAANKKEPLDKKRLDEIKF >MS2343 unknown MKYLTFNFFLKKNLYMFSKIINTFIICFTLCGCSLINNALEEMQKNIDSN QQQINSMKKVAIIYNAKEIEAIVVTPAKYANKSPLIESIKKNYEIKTQNY TLQFNEKVKAEQEKSSINQYFAHELSKKIEQAGYQVTLVDKADMTNLNAY LAKTPKIDGVIKLNIMLGYTSPDNSFLFEPSTLINYEVYNKTAQRLSHGK VGAIGGWISDKYMTFNGLYEDADNARAKLKKYLMDNLDSTAQEILKVTVS NLN 
>MS1117 unknown MKKSIKMSIWSYSLFSVNSSIKYDYILPFLGQNSQLILCK >MS0177 unknown MIEKEKKTTALFYYHLGLVIKGEKRGTSEGVERSALTVVPIV >MS0540 unknown MEWTTIITLAGSFFLLLAIGVPISFAIGVSSLITIMLAIPFDAAIAVISQ KMASGLDSFSLLAIPFFILAGNIMNRGGIALRLIEFAKVLGGRLPGSLAH VNVLANMMFGSISGSAVAAAAAVGGTMAPLQKKEGYDPAFSAAVNITSSP TGLLIPPSNTFIVYSLISGGTSIGALFLAGYIPGILMGLGIMIIAYFIAK KNKYPVSPKPTFKEVTHRTLDALPSLGLVVVIIGGIIAGIFTATEASAIA VVYTLILSMVIYKEISLKELPQIILDAMTTTSIVLLLIGASMGMSWAMAN ADIPYTISDALLSVSENPIVILLIINLTLLIVGTFMDMTPALLIFTPIFL PIVTELGMDPVHFGILMAFNLSIGICTPPVGSTLFVGCSVAGVKIDKVIK PLLPFYAILILTLFLVTFLPQLSLWLPQTLLGY >MS1898 unknown MVSVYFLSKRFLLIKTTTPRLTRGVENYGQFKNPVNYTALLTQI >MS0302 unknown MKLFLITFGIFILIIFGMSIGYIIKKKTIKGSCGGITALGMKKMCDCEEP CDNLKDKVAKGEADASELDRFNKEPQFYEVK >MS1716 unknown MSILYAENLAKSYKGRQVVSDVSFTVKSNEIVGLLGPNGAGKTTSFYMVV GLVRHDQGKIRIDDEDISLLPMHNRAQKGVGYLPQEASIFRRLSVYDNLM AVLEIRKDLTKEQRHARAEELIDEFNIGHIRDNLGQSLSGGERRRVEIAR ALAANPKFILLDEPFAGVDPISVIDIKKIIKDLRDRGLGVLITDHNVRET LDVCERAYIVSAGKMIATGTPTDILNDEHVKRVYLGEEFKL >MS0079 unknown MASLILTPEWAEEIYQLETSDPVMGGPDGIDNRQAIQLGKRTEYLKQDVE KRAPIASPTFTGTPKAPTAKTGTATEQLATTQFVSNAISALVGSAPETLD TLAEIAQAMGEDESLKETLLAEIGKKATNEAFSTLKNLLIGIPFPYPLSA VPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFIRGWDNGRGVDSS RELLRSQGAELSAHTHYVTVTRYANSSGEFGAKISTFSAINNSGWLLSGA DGLLLAANKSGEIVSEKNSVANLISNTGGNETRPRNVAFQYICLAK >MS1133 unknown MSDFYHVINWNKLMEHNLTDIINIFNQCFESEYNTKLEKGGEYPIYLPAF LDENGVKSERPYNVIYFARGFYSSALHEISHWLVAGKERRKLEDFGYWYE PDGRSTQRQREFERVEVKPQAIEWVLATAADFRYFASADNLNGNPGDTAP SNKRFIIR >MS1639 unknown MRHFIFKNYKDSFMQKYDIHVCLVSAQAAPNLLPTLDKGFKPKKAVFVVS TRNDIKEKANSLHLAFKQNGIDVDIMNLSDEFDFQRMEIELLDLLTKYKN ENIALNVTGGTKLMAIAAQNAFSGVKPIFYINTDREEIIFISKEGDNYIP VQKLNTETFISTYLSGYGITILKNKDNFNFHKLGMFTERFATRQKNYKDI ISSLNSLACAADNSNLEASLAGYNNDLRLIIEDLADEDLVKLNGDIVDFK NESTRSFLNGNWLEYFTYKQANSITDVIDVDWNVEVVDSKYEKNKIGVNN ELDVVFMAKNKCHIIECKTMNFENEENSAKLQGYLDKLKSLKDYGGSLTK VCLVSFYPIPAHVKRRAEKDNIEIIDDYRIKDLKEKLQNWIREK >MS2376 unknown MNFKDFIKAPPAEGYLKNSSKLVTALFIIAGLCYYFSKQYGGYAAVICLV 
IAFMVMFGQKLMLSQITKDFNEMYFAKKQFEETQNLDYIRFIQARATQIL VDNKVLSEKAKRELGFLLQYAEGKLKK >MS0942 unknown MKKVRSNFTDFYLKKSSIFDRTFYLKSGAGYLL >MS1376 unknown MKVTSSAIKNGAFEDKYGKRGSQFTPNGMPSYSIPFEITGAPEGTKSFAV VLEDKDAVTASGFVWIHWLIANLERTSVLENESQTATDFIQGANSWSSVL AKLDITEASAYGGMAPPNCLHRYELFVYALDTKLDLQPGFKFNELHFAMQ GHILAKAEIMGTYDV >MS0261 unknown MLMGICALAFDFGTKSIGCAVGQSITGTAQALPAFKAQNGIPNWDSIEKC LKEWKPDILVVGLPLNMDGTEQEFTSRARKFANRLHGRFGVKVELQDERL TTTEARTEIFQRGGYKALNKSKVDGISAALILESWFERHS >MS1945 unknown MAGYFALTYLGKITPKLTSLFAPVTLGKLLGKLCILR >MS0650 unknown MATIRRHNESRRNLPTLDDVSVYLNKIPYGGIYAIEAEAYLIDEVVMKLI DLIDPQL >MS2157 unknown MHSQRKTMIINTGGRTDTVHYYSKWLLKRFEEGYVLSRNPLFPNKVTRYE LTPDKVDCVVFCSKNYRPILPDLHKITDRFNTYFHYTITAYGKDIEPGVQ TIEKSVETLKRLADIVGKQRIAWRYDPVLLTEKYTIERHLETFDYLAREL TPYVDRCIFSFVEMYKKLAVNMPEIILLTDEDKHRLAKAMGEIAQRYGLY LQTCATEGDFSAYGIHGSGCMTLDIIGRANGVNFKSLKHKGNRQHCGCVE SRDIGAYDSCPSGCKYCYANKSPAKARAMQQYHDPDSPLLLGHLRETDVV TQSPQKSFLAPQQMDLIGLWG >MS0498 unknown MKTSLSIFNDQQKRRALILISLFHILIIAASNYLVQVPFEIHLPFTALGA KENFSFHSTWGTITFPFVFLATDLTVRIFGAKEARWIVFAVMFPALIISY VISTLFSDGQYQNMSALMTFNSFVFRIALASFCAYAFGQLLDVFVFNRLR RLKTWWIAPSSSMTFGSLADTFLFFAVAFYQSSDPFMAEHWVELGFVDYL FKLFVGIVLFVPAYGVALNFILRNVLGITPQHS >MS2141 unknown METKPAEQQAIVDIGCENLGISVKTEDGTLTMYHATQKMRRVLT >MS2014 unknown MGTTNAGLPKNYRIKLGVIKTPLYSNESISSWLIRAALDCGTEPITFTGF YWNKWRLWTYDLDRGFEPIAQHIYADITELSLNQQVNLVNHSLYSVLRPI NGKNTLIKGQAKWVLSRGSRNRSFRVGQSYCPCCLEETPYLRNEWRFAWH FGCLKHKVLFGSKCSCCGGLYQPHLLSAEKRQLNYCHQCGEKLQVITTPL NEVEIATMETLDKVFTTNSGECFGKRVDAQVYFAVLRYFINLVRRTAVAK STHAFARFVEECGISQAEICQTRTALAFEQLPVEERKNLLVNAIKILNLS SKDFIQATQQSAITQKAFAFENYPMELDTLFKYASKGKTVSRKTVTNKPK TDSVLSMNRQWERLKRQLKIAA >MS1405 unknown MQNILILGATGSLAAQIIPTLLAETDDNLTLFARNPSRLAQFKSERVQIV QGDMMNIEQLSEALKGKDMVYAGLAGNLEPMAKNLVTAMKSAQVKRLIWV SSMGIYGETGEDHGAILDPYRRSAQIIEQSGLDYTILRPGWFTNGQEIDY QLTHKGENFKGHSVSRKSIADFVLKLVQHPELEIKQSVGIAKE >MS1243 unknown MSFANQKLSDIALTVPGAIQLFREYDLDYCCGGAVELAVAVQEKNLDINE INARLTELQNNPVNAEERDWTSASFDELIDYIVPRFHDGHRSQLPELITL 
AEKVEQVHGDRPDCPTGVAAELRNMLTDLTQHMMKEEQILFPMIKAGNYM MARMPIQVMEMEHAEMGDQLEVLKSLTDNLTPPADACTSWLALYSGIEHF IDELMLHTHTENNILFPRVRNAA >MS1051 unknown MWKSVSQVLADQFGAYYNIKDKEKIHSGEVHEAWLINDGIQPVFVKLDEK SYRSMFRAEADQLQLLARTNTIPVPQVYGVGCSQNHSFLLLEALPLEPIT AETMGEFGVKLAQLHAQHGSEKFGLDFDTWLGPVYQPNEWKLNWATFFAE QRIGWQLQICKEKGIELGDIQSIIDMAAGKLVKHKPKPSLLHGNLWIENC GLVKGKVTTYDPACYWGDRECDLAFTELFEPFPQEFYDNYNRTYPLDKGY QQRKPLYQLYYLINFSHRFKGHYITLTQKLLNYLMSEEE >MS0051 unknown MTVVIFLAVLLGAIILGIPVAFSLLVCGVALMVHLDLFDSQILAQQIVSG ADSFSLMAIPFFILAGELMNEGGLSKRIIDLPMKLVGHKRGGLGFVAILA AMIMASLSGSAVADTAAVAAMLLPMMKTTGYPEARSAGLIGTAGIIAPII PPSIPFIVFGVASGVSITKLFMAGIFPGIMMGICLGMLWWWQAKRLNLMT FSKATRQDLCISFKNSIWALLLPVIIIGGFRTGIFTPTEAGAVATFYALV VSMFIYRELKFKDLYKVLLAAAKTTAVVMFLVAAANVTGWLITVAELPAM LTELLEPLLGNPTVLLLVVMLAVFVIGMVMDLTPTVLILTPVLMPLIEEA GIDPVYFGVLFILNTSIGLITPPVGNVLNVITGVSKLPFDQAAKGIFPYL IMMILLLLLFVFVPSLILVPLSWML >MS2309 unknown MPIDRTDLLELVTQQDKLANYAKDIAGRMIGRQFKIPTEIQADFMAFVCR SLDATVQAHRVIEEMDELLETGFKGRELNLVNTMITELDKIEDDTDQMQI KLRLMLRQIEDRFNPIDVMSLYKTFEWIGVLADQAQRVGSRIELMLARS >MS1859 unknown MDLLITLIDDMFFASIPAVGFALIFNVPPKALGYCAILGAIGHASRTLLM HFGVSIVFGTFFAAGLIGFIGVRIAQHYLAHPKVFTVAAIIPMIPGVYAY KAMIAIVRINSLGSSPELFNQMVDNFVKGCFILGALVFGLALPRLLFYRG TPVV >MS2384 unknown MKTNNNPVRRSFLLFNNLSDNLCGHLKDSFKQKI >MS1161 unknown MRFVWCSIHLRDTISLLLLSVLLINRNLFIMKKTTLAVAIGILAISSTAS ANWYVQGDVGYSKIKASGMDDLDFKDNVFDQRISAGYDFGDIRLAVDYSH IGKAKDHYTLFRGEQWETSGSTSVETNSFGISAIYDFNLNTSLMPYVGVR LSENSLKFEDHWRDNSASESYSETKTKFGYGALAGVQYHLTDNLLLNVGV EYNRLGKVEEVKIHQYSAKAGLRYNF >MS2207 unknown MLNQQISQVIAAELSVQPKQILAAVQLLDEGNTIPFIARYRKEVTGGLDD TQLRHFETRLIYLRELEDRRQTILKSIDEQGKLSDELRAKINATLSKNEL EDLYLPYKPKRRTKGQIAIEAGLEPLADLLWSEPEHEPESAALAYVDANK GVPDTKAALDGARYILMERFAEDAQLLAKVRQYLQHNAVLVSKVIEGKET DGAKFQDYFDHQELLKNVPSHRALAMFRGRNEGFLQLSLNADPDAEEGSR SSYCEEIIREHLAVRLTGLPADKWRAQVIAWTWKIKVSLHLETDLMGSLR EKAEDEAIDVFARNLTALLMAAPAGAKTTIGLDPGLRTGVKVAVVDSTGK LLATDTIYPHTGRMNEAMVSLYQLGKKYHAELIAIGNGTASRETERFAKE 
VIKQSTDWSAQTVVVSEAGASVYSASEFAAAEFPELDVSLRGAVSIARRL QDPLAELVKIEPKAIGVGQYQHDVNQSQLARKLDAVVEDCVNAVGVDLNT ASAPLLTRVAGMTKVLAQNIVAYRDENGCFESRQQLLSVPRLGPKAFEQC AGFMRILNGKNPLDASGVHPETYAVVENILQVTEQSIRDLMGNSNALRRL DATQFTNEKFGLPTVQDIFKELEKPGRDPRGEFKTATFMEGVEEITDLKA GMILEGTITNVTNFGAFVDIGVHQDGLVHISSLSDKFVEDPHQVVKTGDV VKVKVLEVDVARRRIALTMRLDESAVKNSEKSDRTLSTKSGQDRNRRDNR QPQRNQFANNVFADALKGWKK >MS1547 unknown MAKSVDAADSKSAALKSVSVRVRPLAPNSRDRFLAVFFIARISYFS >MS1767 unknown MLINIHYSQVLRKQTALLTNAKVRSFFRKF >MS1485 unknown MIKMPFEKIQTAFKIKRIRSGNNRPCLRLEV >MS0034 unknown MLNYKPLTEKSGFFCILRTLMTQYIIAQTNKGVQLGITAKMANRHGLIAG ATGTGKTVTLRKLAEAFSDDGVPVFLVDVKGDLSGLTVKGTLQGKIAERV EQFNLGGENYLSGYPVSFWDVFGETGIPLRTTISEMGPMLLSRLLNLNAT QEGLLNLVFRVADDKGLLLIDLKDLRAMLKFVAENAKEFQVEYGNVSAAS VGAVQRALLTLENEGATNLFGEPALNLEDWLQTRDGRGVINILNSEKLIN SPRMYSAFLLWLMAELFERLPEVGDPEKPKFVMFFDEAHLLFDGVPSALV DKVEQVVRLIRSKGVGIYFVTQNPLDLPDTVLGQLGNRVQHALRAFTPRD QKAVKSAAETFRANPQVDVVETISTLGVGQALVSFLDEKGMPTPVEIVGI FPPKSQLTPLTNEQRTDWVKDDELYPHYRDLVDNESAYEILNDQSVQAQV QQQVQDEENSDFFSGMISSIFGTKKKSRQTVAEQMVSSVAHQVGRNLRNQ VTKQILRGILGAITKK >MS1393 unknown MGVYNKVPFAKFLPKPTACKNTSFFDRTFIRKENKAYFNKRIAPKEFGKE ANF >MS0023 unknown MIVLDFLLCRFLKDKEQNMSKVSEITRESWILSTFPEWGTWLNEEIELEQ VPANNFAMWWLGCVGLWVKTPQSANICIDLWCGRGKATKQVKDMVRGHQM ANMAGVRKLQPNLRNSVGVLDPFAINEVDAIVATHYHNDHIDVNVAAAVV NNPKLDHVKFIGPQYCVDMWTKWGVPAERCVVVKPGDTVKIKDLELVALD SFDRTCLVTLPARGAEDNGGELNGICPSDEEMGLKAVNYLIKTPGGNIYH SGDSHYSIYYAKHGKDYDIDVALGSYGENPLGIQDKMTSIDILRMAECLR AKVVIPVHHDIWTNFMASTNEILELYRMRKDRLQYQFHPFIWEVGGKYVY PRDKDLIEYHHPRGFDDCFEQEPNVPFKSIL >MS0736 unknown MTDKTNIIREPEGSLILRTLAMPSDTNANGDIFGGWIMSQMDMGGAILAK EIGKGRVVTVCVDKMTFLRPISVGDVVCCYGKLVKIGRSSMQVKVEVWIK KVYDGVRDRHCVTEALFTYVAIDKEGKPRAVPREDNPELEQALALLNNHT TEEN >MS2154 unknown MQKLKIETQSGTLLDGVLFSQTPSKTVIIAITGIHGNFYSNPFYYNIGHT LSQSGIDFIYAQTRNAFGKTDFVNPKTGQPESIGSWNEDFAKTIEDLTAY VDFAEQKGYQHIVLAGHSLGANKVIHYLAETQDKRVAKFILLSPANVTHL TNAISEQQRAYIRHQVEKGNSQRLLPFELFGWLPCIADTAFQWLYSPLLN 
NVHVEPNSDFSQVAKIQHTGALLIGTLDRFTYGDPPGFLRNINNHFQSAD KNTLIFIENTGHTYQQKEQEVADKLLDLVKDWGY >MS0288 unknown MSVKKSNLSRPNWLAISRSERVVGTNEKVLGKRIRTLGIHQRYGIMISRL NRAGVELVPTADSILQFGDVLHMVGNVETMDAAISIIGNAKQKLQQVQML PVFIGICLGVLLGSLPIHIPGFPVALKLGLAGGPLVVALILARIGSIGKL YWFMPPSANLALREIGIVLFLTVVGLKSGGNFVNTLTQGDGVTWMGYGVL ITFVPLMAVGIIARIYAKMNYLSICGLLAGSMTDPPALAFANAIKEENGA AALSYATVYPLVMFLRIISPQLLAILLWVA >MS1482 unknown MENVVSQIIAQYANVCILIFVRIFSVNSQQIKVRSNFVYFYRVA >MS0086 unknown MYFMLGDIALEAIDLTEFSETFAAEFAEHAVLKGKPRLQAMGEKLNELSF AIRLHHKIGGVESRYQALLTAKAEQNALALIWGRGKYKGNYVITQLSSTT LFTDKYGNALCREMTISLKEFVGDSEDSLFGDALNFGSNSLLGSILPSGV VSTLSTVKNAVSRGVELYNQGKRLVDEVQNTVAVIRKFADDPATALGYLP LALRNLDGALGSFGEITGLSDTLSGLSDLLPSAVKFSHKIDDIYTDLQIL KDSFTNASGNDWSNWFTPADNALSSVNESFDYLAKPVAEMTAWIVLRADD EPDTEKDNDDTDLA >MS0082 unknown MGAMNTQIQSTHWQLAPETDGVSVVSGVDDIHLCIANILSTQKGTDILRP EFGSDHFKFIDYPEDVAVPNFVREITQALQKWENRIVIDEVLVDGEAPHF TFTVSWSLTDDVYREIYRTQVQQ >MS0718 unknown MIMINSVYLLVNKGSWRALSFILAILLTACFFFNIHQFTSELRTANPIWV VLILWSTVILWIHGMGFDIRSDLGKYLFFPIFGYLISFVALFQHFLY >MS0234 unknown MCHNLSRIIDLKFLLMINYAIILHNYLKINNYLDYRHETTA >MS1075 unknown MRKNRPHFCFYRLERRLIFYFIRLNRQQITF >MS1925 unknown MSFLWSLLSFIIAISVLVSVHEYGHFWAARKCGIKVHRFSIGFGKVLWRK VDKHGTEFVVSMLPLGGYVKMLDERNEEVPEALKSQAFNNKSVLQRAFVV MAGPLANFLFAIIAYWAIYTIGIPSVKPVISAVQPQSIAAQAQLPVDSQI VAVDGTATPDWETVNMVLASKLGNRQVQLTLTPFGENMEFRKTLDLSRWK YDPEKESAFGSLGIEPVSGKVEMKISKIMEHSPAQKAGLQIGDMIRQSDG EEINWQAFVKLVQQGKSIPLQIEREGVLFDVILTPEFTDKRWLVGISPTF EPLNDKYRSELKYDMLEALQKGVEKTAQLSWLTIKVIGKLFSGDLSLNNL SGPISIAKGAGMSSSIGLVYYLSFMALISVNLGIMNLFPLPVLDGGHLIF LAAEGIMRKPVSERIQNIGYRIGAILLLMLTAFALFNDFLRL >MS2107 unknown MGFNEKINMQKMQQFKKKVNARLGVKLNITT >MS1679 unknown MLAERRLELYFAENPPHFFDEMAKSAVILSPENFHNAKNLLGREFDQILF DGRTSLNLDALAIAAGTLRAGGRLLLWLDKNPHVDPDSLRWSGAEQAVET PNFYAHFNRLLQVYGCDNGIQAQNNQSVSTQKTNIASTATAEQQQIIRQI LQADSDIFILTAKRGRGKSALAGLLAKELRNSAQYHKKPFNVYLTAPNKS AVETLQLFAGEKITFIAPDELCRRIGQNARQFSQDWLLIDEAAMIPLELL FQLTSTFKHILCCTTIHSYEGTGRGFLLKFLPNLHRSFQQFELIRPLRWA 
ENDKLEKFIEELLMLEAEDRLIQPPYSIKSAVKIRQISQNELVEHITDFY GLLTLAHYRTSPLDLRRLFDAVKQHFLIAEWECYLLAGVWALEEGGFSDK ALIRAICRGERRPKGNLVAQSLAFNCNLPEACALKSLRISRIAVQPDWQG RGLGLQLVEKLAQTAQADFLSVSFGYNEELAHFWQKCGFILVNIGEYKEA TSGCYSAIALRPLTAAGEDLVKRAQQYFRRNLAFTFHPLHDKLSVEKSSA EKITQLNGQDFGILENFADYHRTFYSSQGAIYRLFIRLGADTSPHVG >MS0705 unknown MKVIIFSLGFLKIMKKMTALLSVGENYLMLCSRYRHEVKNNLQFI >MS0238 unknown MRIPRIYHPDSLTNIKTCRLTDEAANHVGRVLRMQAGERLELFDGSNHVY TAVILQADKKSVTAEIRDCQLDDRESHLKIHLGQVISRGERMEFTVQKSV ELGVNVITPLWSERCGVKLDAERMDKKIQQWQKIAVAACEQCGRNIVPEI RPMMKLQDWCAEQDGMLKLNLHPRAKYSIQTLPDIPAEGVRLLIGSEGGL SPQEIAQTERQGFTEVLLGKRVLRTETASLVAITALQLCFGDL >MS0088 unknown MLFTRLSPAIALPTLAPIIKAIDKLDDMLYSSFRFLRRQSDMLKFIDTLF LVFAWLLMTVAGLAMLSVGLYYYPLMTAVLFGLLLLAPVLIAADKYLVNI NMPASAPKWLQARHGALTNIQQTLWRTAETELKKAHH >MS1647 unknown MKFMQTHKIYLTPISPIHIGCGEDFEPTNYVIDNEVLFNFDPANLALNNR QKTELLNRVNRLDLLSIQRFFLENKEKVLSSTYYFADVAEGLANDYKNKV GKVAQRESDGNKVINNLSIERTAFLPVKHLPYIPASGFKGALATALLDQA HQAKNNPRVNKNDHGKLFKEYIGEFAESKLRFVKFADFSPLVQAESKIYY ALNFKKKVGKIGGEGRAMALRRECIKSGQYRAFLSELALMQGDANKMQIA DYFTLLKNFYLPIFKQEAELLAERNLVNRHYLKQLEQLFNLPNVALIRLG KNGADSKTYQADGIAQIKIMGAKGTPLNFKDSSTTVWLAGTNQQQQNDLL PFGWAIIEADPTAENEPLKQWCDAQPKSKFNRSVILAKREEQKAKQAQLK AEEEAKQQAKLAEEKAKAEMLNSLSDNQRLIMDFVEKLKNTSERQADNTG SPLLKEAEALINQAIEWENAERQFACEQITVELLKSGIRITGLKQGYKRP ASISRTSVDMSAP >MS1727 unknown MKLTSKGRYAVTAILDIAINAEDGPVTLSDISERQNISLSYLEQLFAKLR RHGLVKSVRGPGGGYQLGQPSGQISIGMIIAAVNENISVTKCLGQGNCQG GKVCLTHHLWAELSDRIENFLNEITLEELVSKQHSQKTHTDFDNLLVVDN >MS2271 unknown MKIFGAMYDKTMQWSKHRFAAFWLCLVSFIEAIFFPIPPDVMLIPMSMSK PKSAVRLALYTAVSSVVGGMIGYAVGYYAFDFVQGYITQWGYQQHWDTAI SWFQQWGILVVFVAGFSPIPYKVFTIAAGVMQMAFIPFVITAFVSRAARF LLVAKLAAWGGEKFAAKLRKSIEVIGWAVVVLAVIAYLILK >MS1514 unknown MTELTHYNQYIADENAMIAFGQQLIQAINKLDNNKPVVIYLNGDLGAGKT TLSRGMIQGLGHQGNVKSPTYTLVEEYHLQNKHIYHFDLYRLSDPEELEF MGIRDYFGTDTICLIEWAEKGIGLLAEPDLIVNIRYADNARDIDLIAQNA QGEQIITLLAAK >MS2303 unknown MMALSQEKRLIEAPVNGGRNYNGPKVAKFLVG >MS0124 unknown 
MFDVLEQLKLQIHQAIVQLEQAEKALHKQKMTHASIYVENAKGILMKLGG RIK >MS2276 unknown MRLFILILSAILLLFQYDLWFGKNGYLDYKETAEEIAMHKAENTKLSQRN QVVAAEIRDLKDGVEAIQERARLQYELVKPNETFYRIAKENKDNR >MS0655 unknown MFAQSYNRRVYIISFFILRLSLMNNTASMSPQNNNDEIDLIDLIKVLWQK KLVVILTSFFFALIAAIYAFTAKEQWTSKATTIAPKVADMGYYLSLRSEY ANILNIKEFTSKDVVDNLFNNFRVALFSNNMKREFFAQSKWFQNYANENA KDEDAKQKLLSDILDKSLIVTIPDIKKNPNALGINISFAAETPKEAQEVL TEYINFINATVLTEDKIDFLADIKIAIDNLELQKDKIQRDTESVRQVQLE NLTTALDIAKSAGIKEYSKTSGNVSIPQFALGDAQIPFTDSKLSDGSYLF MLGEKYLQAQVDTLTNNKVVYPVSFYTIEKQVSLLNSLEQKANTDSKVTS YYYLTSPDYPTTRDWPKRVLLLLIGAVLGGVLGCLWVLGKQIFSQK >MS0879 unknown MCYFKCLFKIKELFNMQTFLKFTNFMSKTFALWVLVFAFLAFQFPAQFAI FAPYIPYLLGLVMFGMGITLTFNDFGEVFKHPKSVFIGVAGQFVIMPAIA FCLAKIFNLPADLAVGVILVGSCPGGTSSNVMTYLSRGNTALSVACTTIS TLLAPFLTPAIFYILASQWLDINAGAMFMSVLKMVLFPIFLGLIVRAIFK KSISEISRTMPLVSVISIVLILSAVVAVSKDKIVESGLLIFGVVVLHNCL GYLVGFFGARLFKLNIADSKAVSIEVGMQNSGLGAALAAAHFNPIAAVPS AVFSFWHNVSGPILANIFANIKNDDKK >MS0758 unknown MIIYLHGFSSSRPDDYENVMQLKMIDPDVRVISYSTVHPRHDMTYILNET HKLVSETQDDKPMICGVGLGGYWAERVGFLCGVKQIILNPNLFPEENMEG KIDRPEEYLDIKTKCIEDFREKNQSRCLVFLSKNDKVVDPKRSEALLSHY YEVIWDDTDAHQFKHIAPYIQRLKEFKAA >MS0318 unknown MRNNMSEQKITFADQKRKTVETAEFTEDGRYKRKVRSFVLRTGRLSEFQR NMMNDNWADFGLEHQNNYFDFAEIYGNTNPVILEIGFGMGKSLVEMAEQN PERNYLGIEVHTPGVGACIAYAVEKQVKNLRVICHDATEILQDCIADDSL GGLQLFFPDPWHKSKHHKRRIVQPNFVDNVMQKLQQSGFIHMATDWENYA EQMLDVLSQSKALTNTSKTNDFIPRPDFRPLTKFEQRGHRLGHGVWDLYF VKN >MS0387 unknown MIRFPRFNLRSSTLIAIVALYFTLVLNFAFYGKVLTQHPFTGKPEDYFLL TVPFFVFFTLNAVFQILAVPLLHKIIMPLLLIISAAIAYSQVFLDVYFTT DMLENVLQTTSAESTRMITWQYVLWIIGFGIIPAFLYLSVKINYHTWFKE LGIRLGAILVSAVVIFSISKFFYQDYAAFVRNNKPTVNLILPSNFITAGV NEIKRIHDANRPYEKIGLDAQQEKPDPYRHFTVIVVGETTRAQNWGLNGY QRQTTPKLAARGDDVINFNHVTSCGTATAVSVPCMFSYLTKDQYNGSKAE KMDNLLDVLQRAGVNIFWLDNNSDCKGVCLRVPNETVNMTLKDYCTEGEC LDEVLLRDFDKILNETTKDTVLILHTIGNHGPTYYERYTPEYKKFVPTCD TNQIQTCSNEQLVNTYDNSILYIDNFIDSVISKLENRDDLESAVYYVSDH GESLGENGMYLHGAPYAIAPEQQTRVPMVFWFSKTWKKNEGVDLNCVREK AKTREFSHDNLFSTVIGMMDMNLKTSVYQPEFDILASCKRH >MS1692 unknown 
MSDFQQYLNFNEEENKRRQLEVYADTHISALTERRIAAISAPQQWLVLGE PFCPDCRVFVPFVQKFAELNPNIKIKYVARKNYHERSRFDSDEQQKLVVE THNIPSLFRIENDTTRLVLKEFPEFFKRRAEQAPDQKDQLKADYRAGKFN EELERELVKLFTV >MS0107 unknown MLKELLSTDGKLSTTSTVQLIGALCVFGLTVYAVVTGQPYAESLLNNVLI YLFGATTAKGVVTSYQAKIKGAINGQITGVSESRKTA >MS0354 unknown MSDQIQPNYLLPLSFCLSVSRKSVKIDRTFAPSASILC >MS1139 unknown MRSFFFNFYKILVKLTALFGADSGFDGIGEAQGARRGAVGLVNKPQKNSR KRRTIRFSSLITCI >MS1108 unknown MKGKLITFNVFYESKFLTEEGGYTNDNEIFKVLLERA >MS1456 unknown MENPFTAKWSTQGNTLCLGHWDINYQGLPLVLPEERRDQDMGTKGIYNFI DPDDELYLEGLDEDDWLLENIDWLSDVFIEHNIPLEEENMRLFYKAVNKA DWRCGSCGGCI >MS0075 unknown MFKQAPLPFIGQKRMFLKHFERLLEDIPNDGEGWTIIDAFGGSGLLSHVA KHLKPEATVIYNDFDGYAERLAHIDDINRLRQAIYPLLANCAKSKKVPND IKTQIIDVIKGFDGYINEHILCSWLCFSGQQVKTLDELFKEDFWNCIRKS DYPSADGYLDGIEVVSESFHTLLPKYQTDPKALFVLDPPYLCTQQASYKQ ENYFDLIDFLRLVHLTRPPYVFFSSSKSEFVRFIEAMIEDKWDNWQAFEN YERVIVKTSSSYSGKYEDNMVFKF >MS2200 unknown MQSLKGLFRFGLATPTLKKCCFMTALCIANQSAVSFCKFFLLMQFCKTI >MS2088 unknown MSGLQVGEQVVQKAGALINEGDQVEVVLSKGAE >MS0078 unknown MTVQFDNQGFAIESGFMTVHVIDAQGVYVHSEEQYISEGGSLSANAVLSE PKAARQGFAVQWTGKVWQYVEDHRGEVYYNTQTKAEVTISELGKIPENLT ALQPSDPNCEWNGEVWVLRAEKQAELKAQKLQQFIDGVDNKASRIYSIWT RFEIEYAQREAAAVAFKAANYQGEVSRFIADFATKAGIDNVTATNLILVQ AEGLRKLLVELANQRMRKYELKKPNLTEDEMQTIYDDIIQQMDNLAEAYN NG >MS1431 unknown MSEPIVATPALIKRMGKPKDVFDLAANFPVGYILNPQTGKVWDWMLGR >MS1384 unknown MNKVTEMFKSNPYFVQIKVFYDYQHRRAHRYRPLLQ >MS0987 unknown MILSSLARYYQRLAKETDSVGNPKVPSYGFSEEKIGWVLVINQDGQLVDV VPNLSDGKKPQPKLLNVPRPEKRTSGVKANFLWDKTAYVLGVESNKDKAT AKEQPFVISQKTFEAFKQSHLELFKDSQDLGLQAVCHFLEKWQPEHFSQP PCLTEMLDANLVFKLDGCSGYIHQREAAQTLWADLLKDDNAEQGICLISG ANAPIARLHPAIKGVFGGQSSGGSIISFNKESFASFGKEQGSNAPVSEVS AFAYTTTLNYLLRRENNHCLSIGDTSTVFWAEADNSANAEAAEGFFASVF SPPDDEQESQKVFNILEQIAKGRGIKEVSPDLAPDTRFYILGLAPNAARI SIRFWLDTTFGQLAEHLAQHWQDLAIEPCPWKTFPSIWRLLLQTAVLSKT ENISPVLAGEMTRAVITGSLYPMSLLSQLIARIRADGDINGLRIAMIKAV LQRRFRKGFIQEEIPMSLNTESTNPAYLLGRLFAVLERIQTQALGDLNAG IADRYYGSASSVPYSVFPRLLSGAKHHLSRLRKDKAGMAVNLDKDLAEVI 
GALPDVFPRHLSIDEQGCFAIGYYQQKQRYFTKKETTESTEN >MS1845 unknown MKFQTGSFYMGIEGGTSSSNNKSAVKNTALFIK >MS1823 unknown MLSILGRINELFIAFHSCYERKVIFKVSPMSFFAILYMLATYFLGSISSA ILICRLVGLPDPRQSGSGNPGATNVLRIGGRWAALAVLIFDILKGMIPVW CGYYLGLTPFELGMVALSACLGHIFPIYFKFRGGKGVATAFGAIAPISWG IAGAMLGTWGIIFLLSGYSSLSAVIAALVTPFYVWWIRPEFTFPVALVCC LLVYRHHENIQRLWRGQEDKVWAKFKKKEDSQGE >MS0912 unknown MSERIAKKQGSVLKTLSCILLSACIGGAAGYGSSYLAKNLWTVQSSLEKP ALTELGNYYSLYSTYQLLNNEKANQDPTGDIFNRFKQLAGSYEHAKVFWE NTDYYKQKLTDDSQHDSQLLDQLSREIKLLDTNAATTQLSLELDNPKRAR ELLTEYIDYTGLANRKNIYGELIVKWKTLFDQVNSAANLNVADTERQKWK SMLSMMQSVKPLDDQLVSYHFIQKPGQAEISSPNRICWAGIGSTIGAFFG LFIGLFIRRK >MS2264 unknown MAKKPGKADEKDEDVTRLDKWLWAARFYKTRTIAKEMIDGGKVHYNGQRT KPNKTVEIGATIKLRQGNDEKEIKVTALSTQRRGAPQAQLLYAETEQSIE NREKNALARKMNAMPHPDHRPNKKERRDLIKFKNQG >MS1298 unknown MDVNMSEKTFEKLTALLTENRASFRVIEHPRAGKSEEVAKMRGTELGQGA KALLCVVKGNGIKQHVLAILPANKKADLQKIALALGGTRASLASPAEVHE LTDCVFGAIPPFSFHEKLKLVADPGLFGIYEELAFNAGTLERSLLLNTQD YQRIANPQLIEFATEN >MS1231 unknown MKSPEPCFLLDYLRLRKKLAVNDGEKKSVSFVGKETPEMLE >MS1484 unknown MQKLVKWVVALSIMSATFTLSAKDFAIYDMMDYVGKPQDLTADKISRAML IYESELVKPDPTGKRKHGVLNLEKVIELARRSHREGYTVISTDIESWFGN KGGQLLSPEELKRDFELMFNIFKNENPNAIISNYGLPTETLSVIRFYRGD VPYQVSLDKWKEFNKRRNKSGVIADYANPVLYIVNPDIATWEKDVIHTVQ EIKKRYPNKKIIGYIWPQYYSAKKSGYFKQFIDPKTWREMLEITYKYTDG VMIWSDKRDENDKIVRWEDPRVQAIMAETKAFIRAHDKDIKVEGKKKK >MS2004 unknown MKAVKVEYQGKLRQQITHLSNGQTVITDAGKSVGRHGENISPADLLAASL AGCAMTIMALRAEQLGADFSGCYAEVEKEADMQQFQVTKIVIHFYLKAGF SDEVRQAVENATRDLCIVGRSLRADLVQEFHFVYQ >MS0226 unknown MIYENMLQNIKIYFSMTDMQIRHRHIRRNQR >MS2300 unknown MVLMDSLSKKVVYHKIVNAERVIYYRKAINELREKDYKIQSITCDGRRGL LKDILNTPIQMCQFHQVAIVIRRITRKPKSEAGKELKILIKTLKTSSKNK FYINLHHWYLKHKNFLNERSSIPDKAGKYPFKHRNLRSAYSSLKRHEEFL FTFEKYPELKIEKTTNRLEGLFSELKRKLALHNGLSKKNKIMFIKDFLNE KS >MS1148 unknown MNISVWHSALPFALKYLCTIKLVQEFFVNKKFNFI >MS0388 unknown MQSFYTEIFSGNKKARNFMNRAFLANRIKLFSATAFR >MS0905 unknown MRYKMKKFLVSLEKDIQRRELFFSQRNTQDFEVFNAINTMTQDLTSLGNL FDIIKFAQYYGRNVTKGEIGCTLSHLAIYQKIADDETINERDYALVCEDD 
ALFAENFQQVIQEIVKQPMGADIILTGQSKILEFNHIELEINYPSTFKFL QKKIANSGYRYSYPYRNYFAGTVCYLITKAAAKRFLAELTNGRLPFWLAD DFILFNEKFKLNTAIIRPLLAIENPVLTSNLENSRGSLNNNLFKKLLKYP LKKLLAFKRNL >MS2086 unknown MCSHLYTFILCLNKENHMTAYVVFIRDEMKDQAAYDRYLQLGVPTLAPFG GEILVANGAHEAFEGADFDGSVVLRFPDMASARAWYTSPEYEAVKSMRYC NADWQPLFAFHTIDSNAGYSGAAIQR >MS1688 unknown MKHSMIKTVAFAFITAAFSMQAMAASPMEREIVRNISNQTHQPVKAETNR YEALKTELDAKLAQLSQASDMTVFNALATDAKELAQQTKAAVLAQTFDFG EASDNRRAELDADYSGWKLNKLIDSLDQAATKADLAAAKTEIRSHI >MS2381 unknown MRKQLYITHGYTANSQSHWFQWLKNQLIPHQIHTNIFDMPDSSKPNPQIW LAHHQTYINQCDENTVFIGHSLGCIATLRYLQRQKKKIKGLILVAGFDEP LDNLPELTSFTLQRIYYPELIANIPQRIVIGSSNDEVVAPKYTQKLAANL QASYLTVENAGHFLARQGFTEFPLLLKECLNIFNG >MS1410 unknown MKSAVKIFEFFAYFIRIVYFSSLPHSDGKYMQ >MS1432 unknown MLICNLAVTENLEELQQILTEDCYCPNLANYEIREFEPNMIATDMAEFEG K >MS1138 unknown MEGLLIVLVPMLLGYLIKTKNTGLLQSINKTVMVLLFIILFVMGVSLGQL DDLATKLPVIGISALVFIVCILTCNIIGLLAYDKLSPRPLKHLGTDIPPR GKLLLDSLKLCSMVIFGFLFGLCTKGGFDLPLHASTYVLVALIFFVGIQL RNNGISLREVLFNKRGIYTAIIMIITALIGGIVASLWLGLPVTQGLAIAS GLGWYSLSSVVINDAWGPVFGSIAFFNDLSREVFSLFIIPFFMFNYRSTA VGLAGATALDCTLPVIQRSGGMEVVPLAISFGFVTNIVPPVLLVFFSSIP L >MS1308 unknown MRLRLLLQHLVMDFQFTRYLGSVEAKCSMGHEAVANWFNSEVRSDSQKIY TALSVLAQAKKQSYEQEIRLIGAEYSLFINADEVMVKANNLDMTDGSEQD LEEDFHYYDEESIAFCGLEDFENFLTSYLNFIA >MS2077 unknown MRKIHIFLPHFFFKDLYLSLFDSIYLEVLCNTFNWLPI >MS0914 unknown MNNNGNNMTTKTERQTWSSKITYIMTVAGATVGFGATWRFPYLVGENGGG AYVLLFCLAMILIGIPMILVENVIGRRLRVNSIDAFGDKLQDENISGGWK IIGYMGLLGAFGIMAYYMVLGGWVMNYIISLISGILDISTPITKETAKEF YDFSIGNSPLHIALYTFIFVIINYIILAKGIIGGIERAVKFLMPLLFVFL IGMVIRNVTLPGAMDGIIYYLKPDFSKITPKLFIMVLGQVFFALSLGFGV LITLSSYLSKEENLIQTAVITGFTNTIIAILAGFMIFPSLFSFGIEPNAG PTLVFQSLPIVFSHLWSGTFFAIVFFSLLLIAALTTSITIYEVIITALQE KLKMRRSKAILLTLGGIFLLGNIPSILGDNLWKDFRPFDKSIFDAFDFIS GNILFLLTALGCAVFVGFVLKDKAKAELSPTPDSLFTTVWFNYVKFVVPL IIIVIFVSNII >MS1126 unknown MKKLFLVMLAATFVTACANKDIYFNGSEGSNSGLKYDHNTDSLSINK >MS1625 unknown MRNSEPNKNSRFFAALWIGIVLALGIFAYGIYSYFDILAWEQSGQMPHIG AFSALIYNLFGAVGILLGYGLLALIVFVQGWRAYKRNR >MS2147 
unknown MQTSNALDNLKSIAKNNKKRLAGTFGLVAAENVLFLTYPVFGSFAVNAMM SGDVWASLSYSLLVLIIWSIGAMRRAVDTRAFARIYAELAVPVVASQRAK GLDTSSVTARVALSRQFVDFFEQHLPILIMSAFQIIGSALMLLILEFWAG VTACAILAFFAFLMPKYAKTNDLLYLKLNNRLEKEVDVIERNNGYQLNKH YGWLAKLRIRISNREAAGYLWIGVAMALLFGVTVVQIATTQGVKAGHIYA VITYLWQFAMSLDDMPRLLEQFSNLKDIGKRVEV >MS0816 unknown MTALLIGRINQKSAVIFFQIFSRIISPLIEQFS >MS0010 unknown MRNLLQNSNLNEIIKRPDLSPVVFLWLILQLTLPSFKLSNKDNSRL >MS0958 unknown MGLKDIFNGFLSQKSGYFLLNNRLNISCKNSEKLTALLIIKLTG >MS1390 unknown MEFPLSITANINSHEGNFSRELELRSALTFIVGPNGSGKTHLLKGLKESF SGFTEKKVRFLSAGRLGPLEQYRSNYDQFDRSNESDNARHGNKNEREYRH KIENINGDLHTLSARPDILIKVRERLQKLFKRNIDVDWDAGSLKISFSRL GATNTYYSSGREASGLLHLVGILSALYDDEVGVLLIDEPEVSLHPQLQAF LLKEIQRAAGIPNDDDYKKLIIMATHSTEMLKISNSNSLLNFIFCNDLKE NPIQIAQNAGELNNKKVKGLIARLGQEHKLALFSKTPLLVEGPSDVIICN ALSDKLYLNLEAAGSQILPINGKEAMPETVKLLRLMGKNPTVLVDADAFA DGLNLVNAYFNNTEIKEKANELASKQGNADILSWAKQVYDDFCNAVTNNW NEISEQAQSHPYFSLSDDVDKKDDIDKKNKRSALCTLFVSENLAKEWTNI KNRLDVLFSIFQECGLFILKKGAIESYYSTAQFESDDKVDKSVAESENID SLPSDKIDSLREEYKDVIDCLMYASNSEKIDESRAIRDELLSFITPIHAR YSEGETSFNKPSTIFSYGINNRDELEISMSSKVLDVKGFPIILRKNDNVT TVVNSALGLK >MS2324 unknown MRLNGYIGLFIKESTMIRKGFVMQVNPDCHAEYKKRHDEIFPELVEELKS HGAHHYSIFLDKQRNLLFGYVEIENEQRWNDVAKTAACRKWWAFMRDVMP SNPDNSPVSQELEQVFYLD >MS0802 unknown MSLEKRFELIERGSTVRQEIIAGLTTFLAMVYSIIVVPGMLSKAGFPAES VFIATCLVSGLGSILIGFWANAPMAIGCAISLTAFTAFSLVLGQQVSIPV ALGAVFLMGAVFTLISATGIRAWILRNLPASIAQGAGIGIGLFLLLIAAN GVGAVVSNQAGLPVKFGEFTSFPVMMSLIGLAFIIGLEKLQIKGAILWVI IAITIVGLIFDPNVTFGGEVFKMPSFGEQSLFAALDIQGALQPAILPVVF ALVMTAVFDATGTIRAVAGQANLLDKDGQIINGGKALTADSVSSLFSGLF GTAPAAVYIESAAGTAAGGKTGITAIVVGVLFLLMLFFQPLATLVPGYAT APALMYVGLLMLSNVSKLDFDDFVGAMSGLICAVFIVLTANIVTGIMLGF AALVIGRIVSGEMKKLNVGTVLIALALVAFYAFGWAI >MS2266 unknown MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS >MS1910 unknown 
MNMAINSKYQDKQVDEILKDIIEVLEKHKAPVDLSLVVLGNMVTNLLTSS VGANQRTVLAQAFSDALLNSVKTKHH >MS0904 unknown MSALINLFYIYDPWLFHIVRMSLVSGLFALLWFGYKWYKKELKRFVLPLD SLAVCIALILLSVLPVLINGTTEFGVIGMYVKLLVLFSLGIVIYNLFYTS SNGKDQLIRDLKLGIGAQSLLGFLALAGIPLFITISLATNSDMGGELSRF IGSEQEYRLYNFTSSAFFPLSAFYLMLLHFLLAYDDNENNGTALKSVYVF LLLFIGLISGRTFFIFSVISLLLYFKPRYIPAILAFTLLVLFFAYNYPAN PYVAHALEIVINLIQGGSQISSSSDTLVNKHLFMPELKQLIMGDGQYYVI GRTANSYYGGSDSGFIRQALYGGVGYILLCFLFTAYFVKRIADNWFNGSW KFILSALFLLSILNIKADTFAYPGIMFVFLMFVSLFGDKGKIIIVERK >MS1876 unknown MINSPKELFLKTTQMIRELEMLSIDLIHLLQDSYKSFI >MS2160 unknown MAKPTALLKQTKQEKKMWDSNTLKQICQADDLKIAPFHPDMTSTGTPTWI WEVAVDGRLFVRAYYGTNSRWYQSALAQKAGKIHAIDQVFEVKFEPIKDE ALNQKIDDAYRTKYSSSRYMSHMISAGSRAATVEVIPA >MS0296 unknown MSHLIVKEQKTIIRNAFFTFLYFTLAIGLAIGILYTDIFYLQNMIEEESL VEYTQSLSLTILTLMFSRHAYRSPQWRGGFVLITGFFLCMLIRESDALFD NLIRHGSWAYFAIITALVCIIYAFTHRQSTIDGLAQFAKQKEFHSFIIGL LTVLLASRLIGYGGLWRFILYNDYPHIVKNIIEETTELFGYLIMLFSCLS LTRHFK >MS1814 unknown MPDRRNPIFRQFIHKVASNSQIAKLSSTNAEK >MS1207 unknown MPKKYYFQIGQDVVLTILLLSLTGYHLFEEVTHEWVGLGFFALVLLHIGL NFWWIKKLNQGEFDAYRMVKTGLNFVLFFVFLTACISGILLSKHIFAEFP FHLTDDFTRKIHMLSTHWIQIIIAIHLGLHWKTLADFLAQILRWDLNKPL TKWILPTIWTMLSVYGLWAFIGRDLFPYLMNQVDFAFFDKAESKAIFYFD YFAMLILFAYSTRVLVWLIFFKNEKK >MS1070 unknown MYLVEVFFKNINEDNLPQQIPLINQLIDQWRYNGQIIGREIPVFVANQEN ERGLATRVICPEQQSLLPEYNNAEVNRCLANIENCGLILHSFQIVGEDLN SDITYEDKKPDWQILYTTYLQVCSPLHSGDRLAPIPLYKQLKDVPHLSMD VIKWQENWQACDQLQMNAVALESQALREISDINSRIFKHGYSLTKEIEEH TGVPTYYYLYRVGGKNLASESARHCPICHGDWKLAQPLFDQFHFKCDHCR LVSNISWNFL >MS1809 unknown MLIRRRKPSLKTKNKLSLSLSKRRNLRLKHLKRRKAKIAARHEYNFVVCQ DC >MS1380 unknown MMTIQSSEILETIRMVADQNFDVRTITIGIDLHDCITNDIDQLNQNIYNK ITTIGKDLVETAKILSAKYGVPIVNQRISVTPIAQIAAATKADSYVSIAQ TLDRAAKAIGVSFIGGFSALVQKGMSPSDEVLIRSIPEAMKSTDIVCSSI NIGSTRAGINMNAVKLAGETIKRTADITPEGFGCAKIVVFCNAVEDNPFM AGAFHGSGEADAVINVGVSGPGVVKEALANSNATSLTEVAEVVKKTAFKI TRVGELIGQEASKMLNIPFGILDLSLAPTPAVGDSVARILETMGLTVCGT HGTTAALALLNDAVKKGGMMASSAVGGLSGAFIPVSEDEGMIAAAESGIL 
TLDKLEAMTAVCSVGLDMIAVPGSTPAHTISGIIADEAAIGMINSKTTAV RIIPVSGKNVGESVEFGGLLGYAPIMPVKEGSCEVFVNRGGRIPAPVQSM KN >MS0663 unknown MMGRINMLQQFTRYFSVGIFNTLIHWLVFAFFYYVFSLDQANSNLIAFIV AVTFSFFMNAKFTFKQQVSSVKFVSYTCFMGLLSYATGLCADYFNFPAII TLIGFSVISLICGFLYSKFIVFKG >MS0316 unknown MSRTVFCEYLKQEAEGLDFQLYPGELGKRIFDNISKRAWGEWMKKQTMLV NEKKLNMMNADHRKLLEQEMVNFLFEGKDVHIEGYIPQATN >MS0314 unknown MDKTNVLFYVIKKIAQKRPHFFLFLSLKIDNIANKSNYYFLC >MS0206 unknown MKIHRTFSKIRTLTMSEGTCSYFCVAHYAYVLNITGEPISALSPA >MS1982 unknown MLFYGSHYALLKNKKLRQRSQKLTNKVKLCEYLFFIGIKLTN >MS1000 unknown MANLASKGHVSEYFCPADIVLKNPIFKSLIYISSNFYY >MS0582 unknown MSPRLTANGLIMIDTNIVKKADVNIIIIAITAL >MS0054 unknown MRSSSQIKQFILNFISLETYHNEYGIQFQYMKRISGRAKMKVSYEELKSE FKRVLLSRNVREDIAEECATVFADTTQAGAYSHGVNRFPRFIQQLENGDI KPEAQPTKVLSLGAIEQWDAHQAIGNLTAKKMMDRAMELASQNGVGIVAL RNANHWMRGGSYGWQAAEKGYIGICWTNALAVMPPWGAKECRIGTNPLIV AVPTTPITMVDMSCSMYSYGMLEVHRLQGRQTFVDAGFDDNNNPTRDPAT VEKNRRLMPMGFWKGSGLSIVLDMIATLLSNGESTAAVTEDKDDEYCVSQ VFIAIEVDRLIDGKTKDEKLNRIMDYVKTAEPVDPNQPVRLPGHEFTTIL ADNKANGIPVDDTVWAKLKSL >MS1745 unknown MFLFIFLIGIKVGLLFLLRKRLRILFGFKFMWRFE >MS0825 unknown MKTFKSMLALCLALGVSGSVLAVDNANPFATVKDGTKIMAKEKVGATKNT VNEQLSGGQKAVKNSAGIKSTA >MS2291 unknown MDGAGEMFFHLNTSKVRLQKCYFLNVGVFQ >MS1725 unknown MKVYTGRTDIKLFEHLIDEPNQTVTHLVLKAGQAVPEHKVSQTVIVVPIK GRIDFSNREESQEIYPGRIVQMIPDEWHALKALEDSELMVVKSTLAA >MS0492 unknown MRNFAIISLFSELVYLIGRLLSKIITNKRKTLV >MS2182 unknown MSANIATFYEMVNAPTRLSQYFLPHFYCFSDQNAKTIWHIFNSQYNERGY FLN >MS2142 unknown MTSFKQRNRRKINMYIINITVNADLPEEKQKEMFPLHVEWFKKHFQEGKF LMLGPFIDTDKHAGVIIASTESREELDAILKEDCYYPDFAKYEIREFEPK MIAENMADFIEK >MS0612 unknown MNLIHSKEGEMNITKFPVILTLAVFSAGIALSPAYARGKNIFTENTERAE KTYMSYGKSQQLDPNNSKDVNELANSVEFEVYEISENHSSHTIFESNAGI CRGYQSSNGVELTDSTTYYVDDASDDYYASITGATIYAHASPKNVQYAPI FNIHDPKILKEIHQDEEKYGKDLATKNVNARENILSKGICR >MS0330 unknown MSEKIQSVDYDDLRDLVASNDEGGRNPAGFPKKLIVGTAILWSVFQLYYT SPFPFWLQEVLTQNNIDLNVVVDDTKARSVHLAFALFLAYLSFPALATSP KHRIPIIDWICATAGAFLGAYYLFFYQSLVTRFGAPNLQDIIAGCIGIVL 
LLEATRRSLGLPLAVIAVIFLLYNFFGQYLPTSWIISHRSGSLSQIINQQ WITTEGVFGVALGVSTKYVFLFVLFGALLDKAGAGNYFIKTAFAYLGHLS GGPAKAAVVSSALTGLVSGSSIANVVTTGTFTIPMMKRVGFTQEKAGAVE VASSVNGQLMPPVMGAAAFLMIEYINMPYNELILHAFLPALISYIALVYI VHLEACKMGLKGLPRTDPAKPFLVTLIRAIGTFLTLCIIYFVLELTLGWL KTAVPNEAFLIVCLLLLIVYILLIRRVASFPDLEPDDPNAKIVVLPATKP TVNAGLHYLLPVVVLMWCLMIERMSPGLSAFWGILALSAIIITQRPLLSL FRKENTDKFIQLKEGVQELIKGLETGARNMIGIGIATATAGIIVGVVSLT GFGVQLSGIIEILSMGNVLLMLILVAIFSLILGMGLPTTANYIVVSSLMA LVIVEVGKQNGLIVPMIAVHLFVFYFGIMADVTPPVGLASFAAAAISGGS PIKTGATAFYYSLRTAILPFLFIFNTDLLLLDVGWAKGILVFITATIGVM AFTAATMGYFFTKNKKWEGFALILAAFMLFRPGFFMEYVSPTERHIEPAQ LVQEIENAAAGQNLTIKVAGLNPYGKEIEFYSKLSIPAGENGEEKLKAMG LTLLNTGEKIQINGNETDKILIDNVEIDSPAAKAGLNWDQTIIDVEVPKN SLPKELMFIPALLLVSALAWNQRRRRNS >MS1007 unknown MNKSRFAFQLIKSVVIVGATISQIQKKSSN >MS1142 unknown MGYYSVFSYSAGNFASCPASALFILSEISVF >MS0230 unknown MSRQKKSRNIVDVMPQRKSDKSQISPASYARPSKKLTRYELDAKAREDKK KKKHKGLTSGSRHSRSEQHNNQQMQEKRDPRLGSRKKVPLVVEFVNNPEK GQFIQPVQVQPAEEKVKKLDPMLELEQLENNECLNQLLDALDEGKTISAE DQKFVDECLDRIAQLMDELGIEDEEESEDDLLRTFEKIDINQFK >MS0407 unknown MLTFFIITLIVGSIVGFLAGIFGIGGGLVIVPTLLYLLPMVGVPDEKLMA TALGTSFATIIITSLASAYRHNKLGNVVWEAVKYLAPTLVIATFISGLFI GKLPKDISSKLFACLVVYLAAKMVLSIRNKKSKTPAKPLTPQSTILGGIL IGIASSAAGIGGGSFIVPFLNSRGIEMRKSVGSSSFCGAFLGLAGMLSFM IGGWSVEGMPDWSLGYIYLPAVLGITLTSFFTSKFGAEMANKLPVASLKR YFAIFLILMAIKMLIG >MS1905 unknown MKYQWIFFDADETLFSFNAFAGLQKLFADNGLKFNEQDFTQYEKVNKPLW VKYQNAEISAEQIQTIRFEPWEQKLGKSAVEINQDYMLALADLCKRNHSH PGKTGKTGNYYKRLYRLATSSSAKNRFSTIFPVYYYFARTRHSQTGRPNL RA >MS2261 unknown MMKSDKPIECVGCNTFDVGSILNNDELEAKIEKVFAGKEEAEQGLAALTA KARDIESEPCKISSEITPVDGGYKLTASFEFSCQAEVVIFQLGTRSF >MS2052 unknown MNQNQPHFYRGRFSVAPMLDWTTRHCRYFHRQFSRHALLYTEMVTAPAII HAKYDHLEFDPAENPVALQLGGSDPEQLQHCAKLAEQRGYTEINLNVGCP SDRVQNGMFGACLMAKADLVAECVEKMQAEVEIPVTVKTRIGIDNLDSYQ FLCDFIQKVHSKGCNEFIVHARKAWLSGLSPKENREIPPLDYNRVYQLKR DFPQLSISINGGIKTIEEMTAHLQYVDGVMVGREAYQNPALLGQIDRALF DLNAPIVTPREAVEKMFPYIERQLSRGVHLNHIVRHMLGAFQNCKGARQW RRLLSENAHKTGAGIEILETALHFVEE >MS0077 unknown 
MANQVYLALYKNKRSWAKEPWKAFADAITRNFTKGDFSHCELVVERRQFT SGSHYEHEVIYDCYSSSVQDKGVRCKQINVRDGKWVLIPLQNVTEEQIKH YFEQTKGKHYDWWGALGVVLGIKQKRSKYFCSEWCFNAIFATEEGWRFSP NQLAVMFKKGY >MS2106 unknown MHIDFLIKSQTTLLPIGHSELCQNNVYILG >MS1841 unknown MQKRDRTFKRCYNERPILSILITDFKENMSDTSLSLHPIAIINTPYKEKF SVPRQPNLVPDGVGTVELLPPFNQPEAVRGLEAFSHLWLIFQFDKVPQGK WQPTVRPPRLGGNRRIGVFASRSTHRPNPLGLSKVELRKIEISNGKVLLH LGSVDLVDGTPIFDIKPYIAYVDSEPQAKSSFAQEKPQAKLKVEFTPSIQ NIIQKIEQKRPHFGRFLTDVIAQDPRPAYQAGKPSEREYGITLYEFNIRW RIRQNSADVAEVFDIEQTGNI >MS0737 unknown MKQLLEFIPLILFFVVYKLAGIREAAIALIIATIFQMLILKLKYGKIEKQ QIIMGIAVVFFGTLTAYFNKVEYLQWKVTIVYALFALILLISQYGFKKPL IEKLLGKEIQLPEKIWNKLNLAWAGFFILCMLINIYISQYCSEEVWVDFK SFGIIAMTFIATLFTGIYVYRYLPKDDQNK >MS1103 unknown MLPRKYPLMSAVIKQNFQKIDRTFYKKKSRA >MS1886 unknown MNMTFGWLASLAGKIGLDMLNNSPDRLLKIGRIQNQLESERIIETEKAKV YAEHEASRLKQKLSEVEPEKQSKIVGKIAILNNQIELLTQQQNTINSFIE TIKDIGDIPKESLKEPDNDWLREWTKNAGRFSNEDANRLWGKVLAGEMKK PGTFSYRVLDGLRNLSKDDANLILQIIPFITNGLVYRSNDLIFNMGTNWG NWYQLEEIGIVRHVGSVSTSASTPVDQYTPMYIRGISYALVLSSETQKTI SEPIIILTELGNAIMQLIEPTFKNDINLVHKQKEYMGNLGNYLKKEYQVT YSIIKIPNN >MS0521 unknown MRALKKISQLLAKNTALVIILTALFTFIVPEAFTWVKGDAQVLVLGIIML SMGMTLGAKDYQILAKRPLDILIGTVAQYTIMPFVAISIAQAFNLSPGLT LGLVLVGTCPGGVASNIMSFLCKGDVAFSVGMTTVSTIIAPVMTPLLLNY LVGETIDMDGWGMFKFMLLVTILPVGLGSLFNMGCHKQKWFNDVRSVMPG VAVIAFACIVGGVVAFQGERFLESGLIMLMAIGCHNITGYILGFAAGRVF GMNTAKKRTLSIEVGVQNAGLATGLSAKFFPTNAESVVACAVACVWHSVS GSVLANIYQWWDKKHGEPVTEIHEIKKPVTESV >MS0710 unknown MAGHSKWANIKHRKAAQDAQRGKIFTKLIRELVTAAKIGGGDAGSNPRLR AAVDKALASNMTRDTINRAIDRGVGGGDDTNMETRIYEGYGPGGTAVMVE CLSDNANRTISQVRPSFTKCGGNLGTEGSVGYLFNKKGLIIIDAGADEDA LTEAAIEAGADDIQPQDDGSFEIYTAWEELGDVRDGIEKAGFKIAEAEVS MIPTTSVDLDAETAPKLLRLIEMLEDCDDVQNVYHNGEISDEVAALL >MS0100 unknown MSKFTFEEQAKYFEKKLNLKTDNYLDVLGEEHDYFFMVAGANRNEVLTAL REAVDAAVLKGETLDGFRRRFDDIIANTGWEYNGGRNWRTRIIYDTNVYG AYNRGRLQQHLDMAEDMPYWEYQHNDNAHPREQHMAWDGLVLRYDDPWWR YHYPIKAYGCHCTVVAHDEADLRRYGKKVGTAPEIEFEQKTVGIRSGNPR TVTVAKGTDVGFTPWNFDRIKQRRNASIDSVLMQKLITAAPKFASLLVEN 
ILERPLAVTMLNAAMKDMVDTVAAEKVARGQLKYVGVLAPEIIEKLTALD KAPQTAVIAVRDEDVLHALRDSKQAKGISLPVEFWETLPEKLRNPQAILL QAKEQQRDKNAKDVLLFVYDTEQGKVAIKMDYEVKLKGQLSKKKLKHSLN MVTTGSLFKDTTALHDFDVLWGSLD >MS0089 unknown MADLAFWFGFSHSELEEMTLNEIERWLKQAKRQIDANYTKAAV >MS2253 unknown MSLEILDQLEGKIKQAVETIQLLQLEVEELKEKNQQAQQANDELRSENEQ LKGEHNNWQERLRSLLGQIDNV >MS1050 unknown MTIKAIIFDMDGVLIDSEPVWKQAGIDIFNAEGIPVTYDDMLALTGIPSL GIVKAVYEKYQRSPVPVAEMAQRLNDHAISLILAQKPLIDGVQETLQKLT ALGYKLAVASASPRILLEEITQSCGIDQYFSYLSSATELSHNKPHPAVWL HAAEMLGVEATECIGIEDSVVGMVSVKAASMKCIVVPGVLGSDDPRWALA DIKLATLREIDETVIGKLDSI >MS2125 unknown MGDLFVKFNFAWPWMGLAMAAVLSVLMISTDIFRSDENSHRWTDPVWLAW LVVPLYMFHQFEEFALSYNVATDSYNVVTEVCRLHGYKSYEPCPIPAVHF PFVNVLFAWIAAPLAAIMSKRNSLVGLSLYGFIFAEGVLHLTFGLLDHQP FLNHGGLITGSLLFIPISLWVIFIGVKANIMSCNAMICTIFAGVIGQICL FNAYSIFPDFGVTGMLIMDAIAVFIPLVLAALVSRRII >MS2111 unknown MGIPAHQSTSGEKQEIFYRLFSLLILLQILFIFLKSA >MS1918 unknown MDNQTISDQSPLSLGEQLRRAREKLNISIDEVAAKLNLRSAIIQAIENDE FVQKSIPSTFMKGYVRNYAKFLKLPDGLLTSSMPNFAEEPKNDLNKNSRT KHSVNPHAAHSRWVGYLTTLVVLFVAGMTALWWWENYQQSNNERDNLVQN YVATEDRTAERSDNVVEIPAIQTIPEASTPVPEANTNESVEIAPVVANTP VVTNETQPVQQTAEQTNTAQAMLQQHSTEPEQAQPTDNETTEPATVTAGD LQIEVTGVNCWISVKDAKRKVLAQKEYKQGEILTFNEGSPYSVIIGAPGN VKITYKGEAYPLKVDGRVAKFKLQ >MS1867 unknown MQAPTRRCGGRNGVKWRWCYQNTAKISLPTAVIFSVTFLIKRQLWKNSGF FTTSQDFSAKK >MS1132 unknown MPKRAEILRKALVEFYGTEDSIDLAKFDVGKI >MS0180 unknown MLHNQQLLEEGLKKLKEIDGSQADKVMDALSDIAPDLGKYIISFAFGEIY NRPRLDLQQRELITLAALASQGGCEKQLHVHIHASLNVGLSRKQIVETFI QCIPYLGFPKVLNAVFVAKEVFSERDGTENFEKNDRTFK >MS1488 unknown MKSFIASLVIASRWIKLCGLMLLDFFVFPVLLWMCYALRLLDLSAEIVPN FYLGEFWISLFAVACLFVCSVYHFVIRTFNETLIVRLLIASVMTVIGLLL LGHFTDIFVPASVAIMFGFMMFLWIWLSRSAIRLTVRYILNPRVTSKRIA IYGAGIGGQQVVQTLLRSDEHLPLFFIDDDKNLRNRRVGGLKIYSAKAAL DALERYEIDEILIALPSISRARKNEIVEFLSQSHRRIMELPSLTKLVDGQ INISDIKEVDIVDLLGREPVDPVPELFSKNIQGKVVMVTGAGGSIGSELC RQIIRNQPKTLLLFEISEYALYAIEQDLRGIIRKESLVEMEILPLLGNVQ NKQRLVEIMKAFNVETLYHAAAYKHVPMVEYNVVEGVQNNIFGTYNTAKA AIEANVDSFVLISTDKAVRPTNVMGTTKRIAELCLQALAQEQGSAHHTLF 
SMVRFGNVLGSSGSVIPLFKKQIAQGGPITVTDKRIIRYFMTIPEAAQLV IQAGAMAKGGDVFILDMGEPVKIVDLARNLIKLSGLTIKDGDNPNGDIEI RFTGLRPGEKLYEELLIGDDNVEQTYHERIMTAKEDYLPPDKLRELIRQL EEACDNNDCEQVRRLLLNAPTGYHPVSELADVVWTKQHSDD >MS0697 unknown MDDLDYFLCRLPIPRFWSYAMRLEKLKKPVDVFISTFSIIVMVLLVICVT WQVFSRYVLQIPSTITDEIARFSMIWVGLLGAAYTVGLQKHLSIDLFTHN LTPRNKAFSNLFINFCIMGFSLGVMIFGGLTLVSNVYASGQLSPSMQIPM AYIYLALPLSGLLMLFYSILFFIDNLHSLKEYD >MS2164 unknown MNIVHNPFPLCVIISLSIRPFFIRLYYEFPNSLLDKNE >MS0246 unknown MSMQPKFLLLKSLFYIIPLALTVSGCFDKTAEKLQETPQKSTALSTQESF PPIKNNYDFAMKDDKIGQNLKANVDYYMLALSWSPSFCWTQYEKYGNHLP DSAEYQCGIKKKYGWVIHGLWPQSATARTVAGHPRLCKGDLPQVEENVVR QYMAESPSPNLLQAEWEKHGACAFDRAEQYFAKQQALYRTLTLPTVEMKG KELFSWLRKNNPQLRHAYLGASRDELYICYDLNWQVINCPKQ >MS0727 unknown MTLMRKITKGMTSVSLLITMVLFSVIMLSILQWSGYQRKSAVEIYQYFQA VQIAENQKQRLFLGLGCESQVVQNGIQFRLLCVGEKITVSYPMGKLTL >MS2119 unknown MRLKFDKFWGVDFTPLFITYHLSNCDKCLIRKANDFIKIYLDCTNQPYFT IGGECGYFYAN >MS1129 unknown MNIIEIKELNRYFGEGENTVHVLKNISVNIEKGDFVAIIGQSGSGKSTLM NIIGCLDTATSGSYKIDGKETNELTSDQLSDLRSQKFGFIFQRYNLLSAL TAAENVALPAIYAGKSQSERLARAEELLKKLGLDGKEKNKPSELSGGQQQ RVSIARALMNGGEIILADEPTGALDSHSGENVLEILRQLHSEGHTIIMVT HDKNIAASANRIIEIKDGEIIDDTQKHPVQNTVNNQSKAKSRFGFSKDQL MEAFQMSVSAIIAHKMRSLLTMLGIIIGITSVVSVVALGNGSQQKILSNI SGLGTNTMTIFNGTGFGDRRAEQMQNLTVNDANALAKQSYVQNVTPNSSS SGLLIYGNQSFTSTNLKGIGEQYFDVEGMTLKQGRSITAQEVRDNAQVAL LDESSKKSIFPNDNPIDKIVMFAKRPFRIIGVVADRQMGAASSSLNIYAP YTTVMNKVTGGTKIDSITVKIADNVNTAVAEKSLTEYLTVRHGKKDFFIM NSDTIKQTIESTTGTMKLLISSIAFISLIVGGIGVMNIMLVSVTERTKEI GVRMAIGARKSNILQQFLIEAILICMIGGISGIMLSLIIGGIFNVFMTDF TMVFSTFSIVAAVLCSTLIGVIFGYMPAKNAAQLDPITALARE >MS0505 unknown MKTLLLLSTMLLMTACSNSVSVLPLPSTAKPAVKTAVMDKTTQKGTATLY RCKDDKEVRVVRNINTGNKSKKRQKSGSVINLTFNNVTQKLTSTVSESGN SYTNIHWHWFERGDANMLTTSVGKVLAEQCIIQKASPLEALEKDTNK >MS0202 unknown MEKLRFATRLNSFASKAHSYWPSIKGKPTIRQMIERASKVKGLTDVDLNY PQHLNEAPKELGKFITDCGLNVNGMAMRYSTNPEFQLGAFTHPDEKVRRE AIELTKRGIDCGRELGTNLMTIWLGQDGFDYSFQADYNKIWDDLIYAFRE VAEYAPDCDISIEYKPNEPRSFSILPDVSTTLLAISDIGAKNIGVTLDFA 
HVLYADEMPAFAAAMIARRTKIMGLHLNDGYHKRDDGLMVASANPKATLE LIWQLRKAGFDGAYYFDTFPDASGLDPVHECEVNIQTVNQLVKIADQLNS VDQLNVAIANQDAVSSQGIINQFLLGR >MS0098 unknown MIKITLDDTQAVKKLQSVAAQLKAPRRLYALLGEELKKIHDDRFKTEKDP NGKPWTPLAAKTLARKRKRGKSLKILRQDGNLANKTAYNILDDGVEFGSP EVYAALHQFGGKAGKGRQVTIPARPWLGVNKENEYYLLKKAVSHLQKSLG KIK >MS1256 unknown MHRILGEQNEKMDCYFSYCAYLGWNLVENSRTTSAENRIA >MS2251 unknown MMKSAVKKTKFFNLACQHQLLNYIFGAIEWKPLSTAF >MS1200 unknown MKQNLFSLLIREENFSLKADGIQSVRKKCGGNLSRIY >MS2220 unknown MCLYRVILAHPRINHKKCGQKSLFFEGWLCQRLPMGKYPYNDYEYVLKIH IFKFN >MS1656 unknown MNYKISPKMTALFIFLSIKKPFIAIKGLILFILD >MS0029 unknown MFEQIYEKNTALLCGILFDDIKLMKVGGDFNEEF >MS0759 unknown MKIKPLFLSLFCLAPAVLNAQWANVGKADYNWGPFLVYTVSFDTENGEYQ DHQSPLMFSFDYAKPVEGKNFSIILIKEMTSLGATKEQTEKWLKELSAIP MPDFLPNDRLSYIALENTGYFILNDQVLDHYFDAEFNQYFIQVWLSGKTG FARLQNQLLGKEKGTVTESYPRAPAVVPLTEEDADPQLPPNYQLTDRTII NC >MS1470 unknown MNLGYTNGSKEVSGASKKHRTLIRFNLTVKVRCFTEDYGKFSRRLI >MS1377 unknown MNLGYADIPEKKNDPLYGLQILFHDFYHKLSLLLTL >MS2304 unknown MIEKEKKTTALFYYHLGLVIKGEKRGTSEGVERSALTVVPIV >MS2302 unknown MLTLIKLFFDSQNYIKSAVNFPTVLRLRASVTVMTRRFS >MS2127 unknown MKKIFAFLIALFCTTSVLGAPMNIEIQIANSNEKITASLADNQTARDFYA QLPLSMQLEDYANSEKIGHGIPKRLSIADSPKGYAGKKGDLTYYAPWGNL AIFYQDSHVGYANGLVYFGKLTAGLETLSKLNGEVVTIKKAE >MS0985 unknown MRIISLSALQHYAFCPRQCALIHNEQLWAENFLTAQGNALHERVDSGEPE TRKGVRFERSVHVSAEQLGISGILDMVECEIQTGKLKPVEYKRGKPKPKP SDEIQLCAQALCLEEMTGKKVEEGALWYMQTRHRHPVIFSAELREKTLQV INEVKTLLESGITPPPNYSKSCKACSLIDLCQPKLLERDKSGKYVVGLFW E >MS2126 unknown MLAGYTYYEKQSKAQTFAEVFANSNLSGELTDDFE >MS0700 unknown MKNLTALIEQLQAKVQQLTLQFAAFSDKKIYAKFDRTLFSEDFESGQFYF DQIQHTLAQIAGLKETEIPQIQFFSEKLLAQCTALSDAINQNNGRKTAPT PKIPSQREKIKHELNQLPPRERLVRYYEALQALNEKINELEDKRDTAHNE QQKAGYQHQIDITLPRRKRCLEAIEVLEEYLSFKEN >MS0105 unknown MVGLLLSFLGCCFGFAKMLVAQFQNAMEERHANQQRVNEKVEDLERMVNK MNSSMPLVYVLRDDYIRGQTVLEAKMDAVHKTLSDLYKIESAK >MS1921 unknown MAYTSLEEQEINEIKNFWKENGKTIIVSVIIAIAGVFGWRYWQSYQLSQH HQLSDQYQQVIYEFRQDPAAQKDNLAEFIAQNGKSGYAALALFEQAKTAV 
EKQDFSQAETALKQALNNAPDEIFASIAALRLANVQFQQKDFDGALVSLN LVKDTSWDSRKQILNGDILLAKGDKAGAKAAYQQAQKNASALEQQWLQVR LNNL >MS2294 unknown MTDNNILIFMKNTNIVHFLNKNYFHIFSNNLI >MS0803 unknown MVGQFNQLEGKNMKLLAKLGAAALLAFTLAACSDPAADLKKLQAWDRDNA AAQQQIQAELQQALSTVKEPSELEPVLASYKAKVQDLVKSLDQLDIKSNE IKALKEKTKAVFLESQDVTADSLKVLVVSRTEETVNALKAKTEALNKNVE ELMKLQNDLQAKFGDKTAETKPAEQAPAQPAEQAPAQPAQQPAEQAAPAQ PAK >MS0249 unknown MKDFQKIYGIITKKTFFLSIYTLNIVNYKVAITHNSVEEEKLC >MS1642 unknown MEPISLPEYAGSTLRGAFGRALRKIACMTKQADCKGCPLYRSCPYTNIFE TPAPTSHELQKFSQVPNGYIIEPPEWGEKIYLTGTELRFNLALFGRLIEQ LPLIAFAFKRAFEYNVGRGKAHLVDIAKFSQNMTACQSILKEGNIIEHEK QIILPESLPNYLTIQIETPLRIQENGKPLRENQINADRFFIGLAKRISLL SEFHHQPLNLDFELIKNDLQAVKYEKNLTWLDWTRYSSRQDQKMKLGGVV GSWQFENLSPELIQLLYFGQWLHCGKNATFGLGKYRITNL >MS2002 unknown MLEILQRHKRLRLNIGYAKGGEAVSAAGKNFLKTDRSFYAFVPNQAAYKS KR >MS0433 unknown MFVLRYLVMWTNFIADFSKQLTPEVWALIGSSTLETVYMSFSATLFAVVL GLPLGVYNYLTKPNQALANTKVNRFLEWVINIGRSIPFIILLFNLMPVTR FLVGTTLGTTASIIPLGVCALPFFARLTSNALGDIPSGLTETAKAMGTTV WQLVTKFYLPEALPILIKATTLTLVTLIGYSAMAGAVGGGGLGNTAISYG LHRNMPYVLWISTIIIVVIVMLCEKYGNKLADHFDHR >MS0295 unknown MAIVSVPVEKSYRLLNIGATTLVSAKAEDIENVMSVAWSCALDYGPLSKV TTVLDKQAFTRGLVEKSGLFAIQIPVANQAELVVKLGTTSRHNNPHKIDD VEIFYPDGFDVPLVKGCAGWIICQLIRDENNQQNHDLFIGKVLAAYADDR VFKDAHWIFEQAPNELRTLHYVAGGQFYLIGESLEVK >MS1903 unknown MEIMAFPLRPFPLLLAVFISLLAIWSAVEPVSRAVWYAEVVPVFAVFMLL IVTYPWFQFSNPAYFCMSLWLILHLIGAHYTFELVPFKWGSDLLAGWLGE GRNHFDRVAHYIIGFYSFPMAEFLLRRKLTGPIVAGFFSLFFIMSVAAGY ELIEWQYAVIAGGQEGIAFLGSQGDVWDAQKDILADTLGALTALLLFYFI RPDKKYS >MS1049 unknown MDQNWQTYRTLVNDHIAIFSANLAIFEQFSSEAAKLSKVVQFSIGYCADE NGLPQAEEHQLLFRNILRALTNLSALSDTLYAGHIVSNGKAKLYFYTNDT DAFIQVLNGLGYTDDLDIQDDPNWDIYFDFLLPSPLESKMNATEEILDLL VRNGRDLADIFLVEHTFYFEDKENLLEFIESAELDDVSFNALKYTDEPVP VNDEEMLYMAKIEQELTLNNNEIFTLVEKFEHLAHQYFGEYVGWECDELE PNRGQLN >MS0366 unknown MKKSVLLFIAASMLVACSTGSISQRTLPADPLLPPSLVQPGFVRMPHNLH YYADINSVWVDSDSKNMIHFDAVINLRKGPHVYSDRDKIAKSMRQAKVVN CDTMKLTHLKTDYYSEFWGTGDPVTPEHQKMRTVDLRKGSSLYTLAQVLC INLYRK >MS1901 unknown 
MFKSAVKISGFFTALFSCPRILQIFHSLSPINRVN >MS2317 unknown MADPHIHSPMDAWDYLTVCIYRSGFVLAAIFTALLPYYPDIAQTGLLVAA VFCASSLHLYLKNFRLILQFATWIALLCRLFSQPELAFGGALLTLGGLCF KEYFCFRILGLNLQPVFVALLWGSVVFEFSLAINILSAISAVLFLLLSIQ KWRMPLHFDIGDKTKYQV >MS1838 unknown MRSFFGLNLFCLLHLYFKNKKCGKNFRIFYRTFANSI >MS1391 unknown MDFVLQRKQAVIPIEVKAEENLKAKSLKVYVEQFQSEKAIRFSMADYREQ DWLVNVPLYACLNFNY >MS0814 unknown MLNAIDLTASELYKEKAEFTLEFKNLPYFFIFRRNYYAI >MS1381 unknown MQQYHDPDSPLLLGYLRETDVVTQSPQKSFLATRQMDLIILRKD >MS1212 unknown METLDKIKKQIAENPILIYMKGSPKLPACGFSARAVEALINCQVPFGYVD ILQHADVRAELPKYANWPTFPQLWVEGELIGGCDILLEMYQAGELQTLLK EVAERHKEQV >MS0906 unknown MNSNLIRLIFITLLSLGLTLISSFVLARLLSVQDRGLHQLFITAVSYVVT FATGGSGFALALSMRKKQYAGWQNYFIAFLALSVLAAIIAIYCFDFTAFH VLFVLNVVLTAILTMTLEKSKIDANLRVYRQLTLQQPVLLVAVYGICYLL LGEQPLEIAIELFTLFSAMQALACLYYLKKINADFKRKNEIQPIQKRFFL KTWFKQNLLQIFGATTASLDKFLIVYFLGNYTLGLYTVCIAFDSLITKFI NMLADYFYSGLLNNINRIKSVLILILLMAVGAVILVPLLAEPIIIFFFSA KYAEVAPVLILFIINAIIGGLSWVLSQNMLLLGKQVLLFTRQIIAIAVFV LLFYLFKDYQLYGVAYAFIGASLTRLIISVIYYLKYPITDVKPEKSAV >MS0326 unknown MMKISTKGKHNFWSQLLVSMIAIFALPCAQGLNYSDAVTNENYQAQRTSI KQPAAKFSALIQQQVAVQQRQAQQCNVDCPKFAKIEPHFCLSPSYFHAPI RGSPLV >MS1548 unknown MMNTLDRYIGKSILGAIFATLLTLVGLSGIIKFVEQFRSVGKGSYDSMQA FLYTVLTMPKDIETFFPMAALLGALIALGNLASRSELVVMQSAGFSRMKI GFAVMKTALPLVLLTMVIGEWGIPQTEQFARDMRSKAISGGSMLSVKNGI WAKDGNDFIYIKRATEDANLNNIYIYSFNDNRQLQRVSHANKASYENGSW VLKQVNESQISADEIKTKNYLNRPWKTSLTPDKLGIFTVKPTSLSISGLS SYISFLKETGQDSKKFELTYWRKLFQPISVGVMMMLALSFIFGPLRSVTA GARIVTGICFGFVFYVINEIFGPLSLVYNVAPIIGALMPSLLFLVITWWL LSRKRD >MS1700 unknown MLIYKPSLLKCGRFFENFFSSHKISKNFTALLILSLWKIFLN >MS2076 unknown MIQFIWRFYATHSTGSRFNIQSLNSRILAVKKLAEIAAGIAYIR >MS1502 unknown MIKIMQKNAFFIAQYKKYSSDFMGTDHFYSGGLHKTDLLLRQGLK >MS1718 unknown MRQFMELIIISGRSGAGKSVALRALEDMGYYCVDNLPINLLPELADILST SQQSAAVSLDIRNLPHSPETLDTLLQQLADAQHQVRIIFLEADRSTLIRR YSDSRRLHPLSMQDLSLEAAIEAEAGYLEPLLQNAELVINTSEISTHELA QRLREFLKGKPDKELKIVVESFGFKYGLPLDADYVFDVRFLPNPHWNPDL RPMTGLDQPVIDFLGKYSEVNNFIYSTRNYLETWLPMLEQNNRSYLTIAI 
GCTGGKHRSVYIAQQLGEYFQAKGKKVKIQHKSLEKHHKKNSA >MS0139 unknown MKKNTNSTRSNQSNSKPNQSKGEVRIIAGKWRGRKLPVLNAQGLRPTGDR VKETLFNWLMPYIADAVCLDCFAGAGSLGFEALSRRAQGVTFLELDKQAA TQLKKNLQTLNVPVEQGQVLNQNSLDYLKFGQNLPQFDLVFLDPPFHLGL ADKAIELLGQNNWLKPDALIYVETERDKPLLTPPHWQLLKEKTTGQVSYR LYQA >MS0693 unknown MERDRMKIVIAPDSYKESLSAMNVANIIEKGFKQIFPDATYVKVPVADGG EGTVDTMVEATNGKRIELDVVGALGSQQKAFWGISHDNSVAFIEIAAACG IEQVPMEKRNPLITTTYGVGELILSALDSGVRHFIVGLGGSATNDGGAGM LQALGVKLLDEQGKSLGYGGAELARLSKIDFSTMDCRLAECKFDVACDVT NPLVGENGASATFGPQKGATPQMVKQLDEALSHYADIIKQDLNIDVKDLP GSGAAGGLGAAFAGVLKGELKSGIGIITQLLDLESKIKDADLVITGEGRI DHQSINGKVPVGVAAIAKRYDLPVIGIAGSLGKDIHVVYDYGLDAVFSVL NKVCSLPEALDPTNAAENLEITARNIAATLKMKIS >MS1348 hypothetical protein MERHPMVERVDYPGLASSKDYELKQKYTPNGLCGVLSFELKGDKQTAMKW LDSLQIISREVHVADIRSCALHPATSTHRQLSDEEMRAANITPGFIRLSI GIENPEDLLADLQNAFDQIK >MS0352 unknown MTLNITSKQMDITPAIRTHVEERLAKLEKWHTQLINPHFILSKIPNGFQV DASIGTPIGNLQATAQSDDMYKAINEVEEKLEKQLNKLQHKDESRRASER LKDSFE >MS0043 unknown MPDKFHNFSSRIVKFVFIYNKNLVKNDRTL >MS1678 unknown MKRRVLMKNENIKEKNGWFKRVIEKYNRFCKDFGYDQATCRSCGVPEIKA DENGNLLKKEPKNKK >MS2166 unknown MNNDLTTSAIARNNVLNNKYALAELETNLQLGGLSFEGETVFTKQQAAQI LDVTERTIDNYIASSGDELEKNGYRILRGKSLKNIRLAYVDEMNFVDISP KAPSLGIFTFRALLNLAMLVTESERAKFIRSRMLDIVIDVIAQKSGGKTT FINQRDVDYLPAAYQEESYRKQFTNALRDYLEMSNVKYGIYTDKIYQIIF CENTKEYRQILKLAEKDKTRETMYAEVLKAIGSFETGLAAGMKQKSEMLG RKLTPTELNELLAEAASNPFLQPFILDARTKMSSRDLGFREVLHEKLEKY IQAIPENDFERFLGERSRSLKEQLEDAETLAVLQRLKDR >MS0640 unknown MESVSRHIITRQKLVKKDRTFKVRSIYRRILLFFLSIF >MS0823 unknown MKMNEINHNRRKWLALGGIILGATILPNSVLAAASTPSPRILRLRNINTG ERFSSEIVNGKLLSSSALNQLNWLLRDRRNNHTYRMDPNLFSKLYQIQGN LGLRNTEIQIICGYRSAATNSAMHRRSRGVASNSFHVKGQAIDFRIDGVS LANVKRSAESLSNGGVGYYPRSNFVHVDTGPVRTWSGS >MS1008 unknown MSILLCMLVCLLEFMKFIGDFCNILCNDDIFLQAYSSL >MS1578 unknown MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK SLRR >MS2003 unknown MFINKPINKRNKCSAFLLRKTDVQRICTICLRPTCRAFSVSSETNVRRRM 
PVFRRQNRGDYQRRHRIFVKTNAETLPVFQAKGAVQYEYFSRGKRQKMHY WSIPDEDVEEREKLQQWFDLGIKALAGA >MS1435 unknown MNPLRQNVDKTEKCGGNFENFYRTLGFVISIAICDN >MS2016 unknown MNLPRYGGIMMSNSGFTEKRYHHRLDRGRIILQKGNIYLNREDGEQYELV DYMDEPSQLLVRNLNTRTTKVVSIHQLENFKMNERTDLSVDLTAISNEYW EKAQQKYEAIKPLLGMDQHRPYAVKARAEDVGVNPRTLYRWLQAYNSIGS IAGLVDQKRGWQQGNSRLTPEQDKLIVQVINEFYLHKQRPTTEQTIREIR RRCKIEKVESPSKETIRIRILHISEEERLRKRGQREKARNKFKPKPNSFP DADYPLSVVQIDHTPVDLIIVDSKYRKPIGRPFLTVAIDIYSRMIVGYYL SLDAPSVTSVAMCIARGILPKERLLLDLGLQGSEWNAFGYPVKVHVDNGP DFQALDLSKSCSAHGIHLEFRPMGRPEYGGHIERVIGTFMKEVHSLAGTT FSNIKERDSYDSEKEAIMTLDEFEKWLVHYIVNVYHKRVHSALGISPEQK WKIGIFGDENEVGCGYPQLPVDEQTLLLDFLPSITRTIQHNGVTIDGLRY YDVALNMYISDSDESGKSKEFLFRRDPRNISKIWFYDPKLKRYFPIPFAN QAMPEMSIWEYREVRSRIANKGDKYINEQQVLDGLTEMREMVAESAQRTK KARRQAERQKMHKASKPIIETKVETKAVVPVVVTSNLLALDDESLSFGEV D >MS0104 unknown MMEKIRREGMRWNLLNALHKARPYTTHEQFLREVMASIYPNVTPLEIRQQ LEYLADRKLIELNKQPSGAWYADINRLGVDIVEYTIDCQAGIARPEKYWE >MS2133 unknown MWNFCSGWFEFEILPKLTALLENRAKIIEHYAIVF >MS1155 unknown MSVRAYCMYYVSLYLSPKRLKTAYVVVVTVAAVKKVVINKSGLS >MS1025 unknown MSIVFYFKTNFIHFNMLVAKSAVKNLKILTALFCYHFSAAC >MS1904 unknown MRVILAPMQGVLDAFVRQLLTEVNDYDLCISEFVRVVDQLLPEKVFYRLC PELKNAGKTTTGTPIRVQLLGQFPEWLAENAVRAVELGSFGVDLNCGCPS KTVNGSHGGASLLKQPELIYQATRAIRTAVPKHLPVSVKVRLGWDSADFA FDIADAVQQGGANEITIHGRTKADGYKAERINWEKIGELRRKLAIPVIAN GEIWNWQDGQNCLAVTGCEDLMIGRGALNIPNLSRVVKRNEEKLPWHKVI RILQKYAHLENIHDTGFYHVARIKQWLQYLKKEYPQATMLFDYIKTCHNA DELRIKMEHLQ >MS0986 unknown MMSKVALQSAITNKNNAISPKKKPPNLPKIKELIMSIQNRYEFVYFFDVT NGNPNGDPDAGNMPRLDPESSKGLVTDVCLKRKIRNFVELANENQAGYEI YVKEKSVLNLQNKRAYEALEIEPEAKKLPKDEAKARDITAWMCKNFFDIR SFGAVMTTEVNSGQVRGPVQLAFAQSIDPIIPLEVSITRMAVTNEKDLEK ERTMGRKYIVPYALYRVHGFISANLAAKTGFSEEDLQKLWQALQLMFEHD RSAARGEMAARKLIVFKHDSALGSVPAHKLFDSVKVERINGESGTPATGF ADYQISIEKDKFNGVSVEELL >MS0478 unknown MIEWRYLIQDNNSELNMISYSSLNQQLKSADIGATASELHGLLSGLICGG INDDSWQPLLYQFTNDNHAYPIALLNEIKEIYQDIGQKLADMDNFSFELW LPEDNEVFARADALSEWTNNFLLGLGLAQPKLDKETDEIGEALDDLHDIC QLGYDEEDNEEDLSDALEEIIEYVRTIASLFYTHFHRPQAQEKPVLH 
>MS1289 unknown MFSLKRQQGASFEQQARLFLESQGLQFIAANQNFKCGELDLVMLDGETIV FVEVRQRKNDHFGSAVESVDWQKQQKWINAASLWLATQNHSLEDTDCRFD LVAFGATASNVQWLKNFIE >MS0984 unknown MSAKLNKFNNEIIPLWDEVLKCVNEIDGVKIPDVYAENFAYLKKIITFLS EAMSCIDADYLPNDSLNNIKAYLVNIKSYLTNSQNYSNSHVQNVENRLDE LLKIIFPFILHKGKAIKGLRLGLNEYSKAITDYVENKFSEIKVTQENIDA IENKLNDELGKFSALREELEEYGESIFSENGVKDKIEELLNNSESKLSEI EELHVSIYGEDGLKQEIDNFYSNISNQNEAINELKEDSSVTLQSLEDFYN KIFGKEDENGKKVGGLKQEIEQRKIELDNFKQKQQERYEELNKQIENLLP GATSAGLSNAYNEMRNKFSGSAKWYGWGFYGSLIVLSVVIYCVRDLLIIK EIPLDKGLGISLLALLGNFAVKLPFILPALWLVIFVSKRRSEAERLTQEY AHKESLAKSYDSYKQQIEKLSEEDQNELLPVLMDNMIKAIALNPAETLDK KHQSDSPISEVLKDKNFLTSIADRVKDMSSNSK >MS0404 unknown MDQQISFDEKMMNRALFLADKAEALGEIPVGAVLVDERGNIIGEGWNLSI VNSDPTAHAEIIALRNAAQKIQNYRLLNTTLYVTLEPCTMCAGAILHSRI KRLVFGASDYKTGAVGSRFHFFEDYKMNHGVEITSGVLQDQCSQKLSRFF QKRREQKKQQKATALLQHPRLNSSEK >MS1648 unknown MKTYRFTLSPKSAFGTPLVGDSLFGQLCWAIVNRFGEAHLTELLAGYTEQ RPFMVVSDCFPQGYLPLPTLPSRFWQTDESHKADRKKLKKVQWVRVEDTQ QQAVKFWQEFAISADFKFEKESQDQYHNTIDRSTGTTGEDIFAPYATELT WYLQTQQLDLYIVLDEDRFDLDDLKQVLKDVGDFGFGRDVSIGLGKFSLA DEVQAVEFSPQNANCYFTLANSTPQDLGLNKENSYYQITTRFGRHGDIQA LSSSPFKKPIILAKVGAIFTPNEYKVRSFLGNGLGNISNTQPNAVHQGYA PVLNLFVDFENKEKQ >MS1576 unknown MMKKWLLTAISGVFLTACGSSSKSGNDLKYIAYQDLDGKTQQVAFLKTLS TENNADPKTSISGEALQKAKFGNLDTHQKIGDIYTIYDAQANPMNVIFVI PSRGKSFSPHKADDMAQLAKEKSFDFYEFGKARIAHSQFSAKSAICRDYK AKSGVDVKIATTYYLDSGGENYLATLVGAQASRKNGEIRKFTYSPSFNID NKKLQEQIQREVSSHGEKVAKSNVIEKLSVLENIVCR >MS1106 unknown MRICKNISLRQKTSQKSAVKFQEFSPHFYQTQ >MS2252 unknown METFIHGFLVCGGLIIAIGAQNAFVLKQGLLKNHILAVILTCFICDIVLI SLGVLGLGSLISESREATVALGIVGALFLTVYGARAFRSAYLGNSSLEIQ SQRQDNTSSAWKAVLATLAITLLNPHVYLDCFAIIGGIAGTLTPDQKILF LCGALCTSFLWFFSLGYGARLLIPLFKRPITWRILDFVIGSVMWLIAFGL AKYAYQLA >MS1162 unknown MFFQAGSSKVEYQGIAGAFSQGNWGVAKALSTAVLGQVSDKGRDSGITTS SVNTKNILIRDGENQQRLFGENVEETVRKLNRENLHQTVNKTDVEKVKSD LERDLDVATALVKNISDSGDELYYNAEKNEDSSFTVSKKTPDCEHISCLD IENDNSQQLKALIYSDNILTEEQAKLLSKISIAGMLNFTREEKVASAILY GDDLASLDDLGVILNRGSAGYWNEFLYAGFERFRAWVNMPTVFGASNATK 
DHAQIAKKLDEYNAYAAANGKPQYKLQDMAHSLGVSENKNMLNWSNYLNQ DYKNTELDYLHAAGSYPSEEIDRQAKSIFAGVTTRYIGVDGDRVYSGILG GYLIGNNKNAMPNNGISGLEAHSEANQNINNLKYIYDENNSEQSKVMERT KKLLKLTYPKDSIREFNTIKKDGDGL >MS2098 unknown MEKSFAEFAIVQAFIAKFLQKPTVCPFNAKMPSVTQFQTAFQSG >MS1179 unknown MNISLFSNIMHRVLFEISSQGVTWNLFKTHQSMQNSF >MS1636 unknown MRDRYLIVYDISSSKRRYYTHKYLSAYAVGGQKSFYECWLTNRELVEFKQ KLINCIDKQEDKLFIFQLNKDTQPQLFGCASLPKFNQPYLII >MS0027 unknown MDFKTNLDALPAIDRLSGLDVIKDNEVIHHIPAVAGKLGSLRVYNALAAQ FNGKLDRTSAQKGVEIFAEHSADAKQNPNKHPNIDLLFDVINNDLTYKLQ PIEK >MS2380 unknown MRLKNENFYNEEDCLIFLIKIPDFRPHFVKISR >MS1389 unknown MQSDSELSDGISTFTTPPLSPKFLQKPLGKR >MS0452 unknown MTHTIEYIKDLMKKRTFTVKNLTNFTVNERSFILIVFKYLARKSAVNFRQ IF >MS0382 unknown MPVIGVVADDLTGATTTGVLLARSGSKTTVCFNTEAAIKSNAEVPSDSLL ISTSSRPLLKHEAYKHVKEATQVLKNMGVQYFTKRIDTTMRGGVGVEIDA MLDTLPENTIAVVVPAMPQSRRILVGGYSVIDGVALTNTPVARDVRTPVR EGYIPALLASQTRRKVGLVSLTDVMHGVYEIRIALMDQITQGKQVIVVDA ITLEDVSNIAQACLMLDNPILAVDPGPFTAALGYQRGLIHREEPNIPQTD ATCAEDKTVLVVAGSATPVTKLQMDTLCQDPRNISISVDPVLLIEGGDIA DTEAKRVVGLVKDYMANDVRPRAILLETALHGPLLNLDAEDAKRKFVRGE SAERINAGLGMIVSDIFKQIGHQRFAGIFATGGDTMVNVCNQLNVSAIEM IDYVIPQSDIGRLVGIFDNKMPIVGKGGLTGDEYTACKIVDRLFLEACRD K >MS1027 unknown MCARRVGILAHQKIITIVNGGQECPPYRKIMSGFNYEKNLPHQEQGIQAV LGVFDHASRRFHQPDENPQIVFGQNQYTANLQKVQNENGIDRTLSLNSDG INVLDISMETGTGKTYTYTKTMFDLHRMLGVFKFIVVVPTLSIKAGTQQF LQSQSLAEHFEQDFGSDYQGVRLKTYVVESQKATKGKKTHIQTAIDAFVK AENRQEIHVLLINAGMINSPSMSNAGDVALKDLFDNPVEAIAAVRPFVIV DEPHKFPTRESAKTWKNIKQLNPQYILRYGATFNEQYYNLIYRLTAVDAF NDGLVKGVRVFQEEMQGGMEASIKLLSLDGKEAVFELTENGKSKKFALSK GDDLAQIHSAIFELKIDALNKTTLVLSNGLELKRGALLNPYSYAQTMQDA MMQRAIAEHFKLERELLVERTTKIKPLTLFFIDDIKGYRSGNEISGSLKE KFESWVKAEAERRLKNETNEFYRAYLQQTLADLSLVHGGYFSKDNSESDD KIEQEINEILHDKQALLSLDNPRRFIFSKWTLREGWDNPNVFQICKLRSS GSQTSKLQEVGRGLRLPVNELMERVREPQYKLNYFVDSSEKDFVAELIGE VNQHSFSETIPQKFDEALEQKILQKYPEIEPLDLMFELVEKGIIDRKKVF TENGYTRLKVAYPQAFEQTLKKDKIGKAGEGKDTIKMRVGKYEELKALWE LIHHKAILQYKIGSENEFLALFTAYLRENLTKFKQAGIRTAINETYINNG 
IMLNRRKENLENDDFIRFNTMSYREFLSELAVSAKIQMNTLHQAFYALRD ELNISEFMNQQTINQIRGGFNQFLLNHSFSKFELGYQLVNNRIHPTKFTD EKGCAKEVNRADLGIFGDTEKRPSENYLFDEIFFDSEIEHQNIADNEIEN VTVFTKIPKNSIKIPVAGGGTYSPDFAYIVKTKTGETLNFVIEAKGVESS DILRKSEERKIKHAEKLFTKIAEKVQVKFLTQFEGDMVAELIRRNI >MS1997 unknown MTEFKLNYHKTHFMTSAANIHQLPKDEGMEIAFAGRSNAGKSTALNALTN QKNLARTSKTPGRTQLINLFEVEPQYKLVDLPGYGYAAVPEQMKLQWQKS LGEYLQHRECLKGVVILMDIRHPLKDLDQQMIEWAVSSDLPVLLLLTKAD KLSQSARSKQVKTVREAILPFQGDVQVEAFSAQNKIGIDKLAAKLDSWFS SLLTE >MS0102 unknown MSKQQTVELDLNPIMQALSRTPMVLLGYQKRWCEDTNPVKVVEKSRRIGL TWGEAADCALLAASNSGMDVWYVGYNKDMALEFIRDCANWAKFYGLAAGE IEETEEVFKEGDEKESILAFTIRFASGWRITALSSSPSNLRGKQGLVIID EAAFHPCLSELLKAAMALLMWGGRVHIISTHDGVDNPFNELIQEIREGKK PYSLHTITFEDAMKDGLYERICLRTNRAYSKEGEQQWEAEIRASYGEDAA EELDCIPKNSGGKWLSRALIESQMHSHTPLVRKEMARDFELIDEPVRAKE IAQWLQEEIQPLLDDLDKNRPHFLGEDFARKGDLTSLVIAAQQPNLTNEI QFIVELGNMPYAQQEQIVLYILKALPLFSGAAFDGGGNGGSLAEKARDAF GESLIHIIQLSEKWYKENTAPFKAALEDGTLTKLPKNADVLADLRAFEIV RGVPRIPDKRAKSVDGGKNKRHGDTAIALLLLHFATRQDVRLPVVAVTRR ARRSQTISEGY >MS0386 unknown MKCITFNSKIKFITKSAVKFHKNLPHFSLCPFFCYTTKSSV >MS1675 unknown MSEQNTFSSPEHITVLLHEAVDGLALKDKGIYIDGTFGRGGHSRLILSKL TENGRLIAIDRDPRAIAAAEEIQDSRFHIEHNSFSAIPYICEKLGLVGKI DGILLDLGVSSPQLDDAERGFSFMKDGPLDMRMDTSKGLSAAQWLQQVTE EDLAWVLKTFGEERFAKRIAHAIVNYNKSAVQNGTEPLTRTLPLAELIAQ AVPFKDKHKHPATRSFQAIRIFINSELDELESVLHSALTVLAPEGRLSVI SFHSLEDRMVKHFMRKQSKGESIPKGLPLREDQINRNRTLKVIGKAIQPK ESEVFANPRSRSAVLRVAERIG >MS1757 unknown MHIIKEKLAKSLMFVVIIALCITVMSIILFGINQFKIGSQLASVNQVSNL SHLLVRQQANLFSMLLVNNAGNEQLTDNLENLTKDKFVLDASIYGKNGEL LAQTRNTLDLREQLGLNEESSKHHVVNRQQIVEPIYSPNGIEGFLRVTFD SKYGQTTQNKINQIFHRLYGELIIVFLAGVILASSVHYFLSHYRRARRSQ ITEQINTVKEIKNSSALVFHRRRRRYR >MS0786 unknown MNSDLKEKLMTTPFKPELLSPAGTLKNMRYAFAYGADAIYAGQPRYSLRV RNNEFNHETLKQAIDEAHSLGKKFYVVVNIAPHNSKLKTFIKDIQAIVDM NPDALIMSDPGLIMMVRENFPDMDIHLSVQANAVNWATVKFWKQMGLTRV ILSRELSIEEIAEIRRQVPDIELEIFVHGALCMAYSGRCLLSGYINKRDP NQGTCTNACRWEYKIEEGTTDDVGNIVPKDNVQKYEPEIVVKNVSPTLGE GATTDKVFLYTEPNRPDEQMTAFEDEHGTYFMNSKDLRAVQHVEKLTQLG 
VHSLKIEGRTKSFYYCARTAQVYRKAIDDAAAGKPFDTSLLDTLESLAHR GYTEGFLRRHTHDEYQNYEYGYSISERQQFVGEFTGKRNAQGMAEVAVKN KFLLGDEVEMMTPKGNIVFKINRMLNRKNEEVEAGLGDGHFVFLDVPADI ELDYALLMRNLTGGNTRNPHQK >MS2174 unknown MIMFKKLLIATALCASFSAMADDSFTLKVKGVENGKFQNKHLLSAEYGFG CAGENISPEIEWKNAPKGTKSFVLTVYDKDAPTGLGWVHWEVVNIPANVS KLPAGIDAKDNNLPKGALQTRTDFGVPGYGGACPPENEKHRYEFTLTALK VEQLPNVTADSTPALVGFFTNANAIAKAQVTVETAR >MS0831 unknown MMNYVAHAKDQALTAHHDLFSYHPMPFYEDTEQTRSRFHKKLDLNLYCIK RPQQTCFIRVQNPDLMAWGIEQGDMLVVEKNDSLSIGDLIVIEVNQKLEI FEFIAYDKNEFVFLSLSSKLNNIRTANWSTLPIIGTVTNTIHQMKPKNTI SFAA >MS1860 unknown MQNQSEVSSQKQREITRLCVHTALLLLQHGAESALVVALTTRLGLALGVD SVECALTPNAVIVTTLTDGHCLTTTRKNIDKGINMKVVTDVHHIVIAAEH RIYSLEQVKSKLENMKPIKYNRYFVVLMIGLSCASFAHLSGGDNLICLIT FIASSIAMYIRQELSVRHFNPLIVFCCTAFVASMISGLALKFQLGNDAQI ALASSVLLLVPGFPLINSLADILKGHVNMGLARWSIATVLTFGACMGIVF ALNVLNIANWGY >MS0312 unknown MIMKLSNKFSLAALTVASVLLAACQAPSSVLTFAPHAPNTTLNVSNQNAV VAVVTKDERSQKQVSSYVRDGALFPLTASPEVDTIFQQVMQQNLNSKGFR LGSANAANTHMLVSVKDFYAKVEEGNLRHKINSKIQLQIHVQGVKGNFTK SIGTTRTDEGAFTVSNEDIQKALDAALKDVVNGIYADQDIGNAIRQYSN >MS0769 unknown MSHFTSIFIGVQFSNIFYAFLRKKSVKLLNFLSG >MS1387 unknown MKQLENVRIFGGEQQVWQHQSATLNCTMNFAIFLPKQAKTEKLPVLYWLS GLTCTEQNFITKAGAQRYAAQHKVIIVAPDTSPRGDDVADNESYDLGKGA GFYLNATQQPWAKHYQMYDYIVNELPALIAEHFPVNGKQAISGHSMGGHG ALTIALKNPQRYSSVSAFAPIVAPTQVPWGQKAFQHYLGDNQTQWTQYDA TALVNAETRLPIRIDQGDKDSFLTEQLRPELFLDACRAHHVACEYYLRQG YDHSYYFIATFIGEHIAFHAKALYQDSEALPL >MS1866 unknown MRGKEWSEMALVLSKYGKNFAPHSGDFFRYISY >MS0701 unknown MLTNEVVISILVLLILSLLRINVVIALVISALTAGLVGGLGITKTIETFT GGLGGGAEVAMNYAILGAFAVAISKSGITDLLAYKVIKRLGNRPTGSSIA GFKYFILAVLVAFSISSQNLLPVHIAFIPIVVPPLLSIFNKLKLDRRAVA CVLTFGLTATYMLLPVGFGKIFIESILVKNINEVGAALGLQTSVAQVSMA MSIPVLGMILGLCTAIFISYRKPREYIVKIAEPTTAEIEQHIANIKPFHV MASIVAVLVTFGLQLFTSSTIIGGLAGLIIFAVCGIFKLKESNDIFQQGL RLMAMIGFVMIAASGFANVINSTGGVTELVNSFSQSVGADNKGIAAFLML VIGLFITMGIGSSFSTVPIITSIYVPLCLTLGFSPLATVAIVGVAAALGD AGSPASDSTLGPTSGLNMDGRHDHIWDSVVPTFLHFNIPLLVFGWFAAMT L >MS1454 unknown 
MIRKLTQGFTPKHYLVEILFGLTALLGFYLIIAWSSYSPLDTTWSVSSFQ PEIINKAGKFGAWVIDLFFVLFGYVGNLLPFLLLIAPIYFIRTKRVDSLT WTRFSLRMFGFILLVCGLTTLAALTLSNSNYHLAGGVLGGSIVKLVYPSF GKFGLLMSAVVFSIIGFIFCSGASLIRLLMRFYNWLTEKNEESSLVQAQN DEEILQQEDEDIQDWIDGDIDRQQDLIQSAEDLQSHRDMITPAHRGINIM GLSTPSQFTENTEDDETPNPENFGGYAVDEIDNLPEVTISSQNANIDLPN ENNFTPMWQKQKSLAENMPEFLDGENTGVVLSEEEITRDLLTQVHIPEVK LTPAKLQHPLTENSAVTKQAAYGMGESESFEDDNMADLAAQFARQEAERE RIRLEKAQAMGLADLPEPQVSLQPTQPNLFESDEEAETEETNGLRTISID QAIQLFGDHKPLIKPTTELPSLDLLDKRTSHVQEITPEEIHETSQRIEQQ LRNFNVKATVKDVLVGPVVTRYELELQPGVKASKVTNIDTDLARALMFKS IRVAETIPGKPYIGIETPNAYRQIVSLREVLDSDEFRHSKALLPMALGKD ISGKPIIIDLAKTPHLLVAGSTGSGKSVGINTMILSLLYKVKPEEVKFIM IDPKVVELSVYNDIPHLLTEVVTDMKKAANALRWCVDEMERRYQLLAKLR VRNIEGFNERIDEYRAENIAIPDPLWKPGDTLDSVPPILEKLSYIVVIVD EFADLMMVAGKQVEELIARLTQKARAVGIHVILATQRPSVDVITGLIKSN IPSRIAFTVVQRNDSRTILDQNGAEALLGRGDMLYLGNGTTDLVRVHGAF MSDDEVVRVADDWRARGKPNYISEILESTGDDDDDNGLSGEGSEDLDDLF DEVMEFVIRTGTTSASSIQRRFRVGFNRAARIMDQLEEQGIVSEMRNGKR EILARNPDY >MS1125 unknown MEILKILTALLSFFIFHLVKIMQLDREFWKHKSLLEMNEKEWEALCDGCG KCCYRKFIEGGGRRERLYFTRVACNLLDCETGKCRDYANRFKLERDCTKL TKKNLPDFGWLPKTCAYRLLYENKPLFDWHPLISGRAESVIEADILIKNG IHEKEVIDWFEFVIDEE >MS2078 unknown MQRAKIALKVNAAYIFNSVPNRVRFFYAENPYFSTALFL >MS1010 unknown MSHYIYLMQNGGINPTLRRNMPNYRRDFTTGGLYFFTVVLKDRSQDYLIK YINEFRQAYKITQERYPFETVAICVLPDHFHLLMQLPENDSNYSVRIGFL KSQFSKLLPLQCRKVSESDQKQGDAGIWLRRFWEHLIRNDEDLANHWDYI YYNPVKHGYVQYVKEWQFSSFHRDVDKGIYPKDWSGCPDLIIKGEM >MS0555 unknown MGFKCGIVGLPNVGKSTLFNALTKAGIEAANYPFCTIEPNTGVVPMPDPR LDALAEIVKPERTLPTTMEFVDIAGLVAGASKGEGLGNKFLANIRETDAI GHVVRCFENDDIVHVSGQINPADDIDTINTELALADLDSCERAIQRLQKR AKGGDKEAKFELSVMEKLLPVLENAGMIRSVDLDKEELQAIKGYNFLTLK PTMYIANVNEDGFENNPYLDRVREIAEKEGAVVVPVCAAIESEIAELDDE EKVEFLQDLGIEEPGLNRVIRAGYKLLNLQTYFTAGVKEVRAWTVAVGAT APKAAAVIHTDFEKGFIRAEVIGYDDFIQYKGEQGAKDAGKWRLEGKDYI VQDGDVMHFRFNV >MS1343 unknown MILDMINSYVFSLFIQLYNEQKNKSYIKISLSIKIPKKVQSNGIKYDG >MS0844 unknown MKFRLTALAVAALLTSTASFAGVVTTSSNVDFLAIDGQKASKSLIKQARS 
FNITDTNQHQVVVRVSEIIRGGSESNLFESDPIVVTFQGTTEDIQISAPT LRSERDVEKFKQSPVISVTTASGAAVQTKQEYLTQEGFLPSVNLVENLSN YNASGAKAAVASFATTTMPTAMGTTGAGKVAKGKVTVQGENAAEQMLQYW FQQADKETQTRFLNWAKKQ >MS1014 unknown MKMYKTLKKLTALLLVTQSAWAQEQFEEKFVSLTLCSDRLLMEIARPDQI AAMSPYSKNPLMMPDKTNRDKPTIEPRLTALLPYLDKTVLINEHFYPQLT ADLKKLDVKIIPINDSPQTPEQLFELIIRLGKLTQNEEYAERLVTELKTQ HFNLNQPLPETLILSETGIVDAFLPQYQTLLQLLGLTPLKTAISTQNFAL EKLLLSQPNLLITLTDKQGYNEQAKLLSHPLLEKLFKNRPHFTLPMKYTY CFDHGVWQGAKVIYNQPHNSPL >MS0678 unknown MRDNHYFLLYRLKNNLVLMDIQKTMLLNSKGVKRISGEWQNSGVTMSY >MS0907 unknown MNKMNQTLLSLKQELKKILTLIEDKNDVLYFDYPMHLNVGDLLIYAGTER FFKDYGINIRLRRSLQAFEINEVRRYVNKNTTILCHGGGNFGDLYPLIQK LREDLVINFPENRIIVLPQTAHFSSQEALEKSAAVFSKHKNCYLFARDTA TEKLMRAFSANVQLCPDMAHQLYGTLPFRTKEQQKSAENPQNILYFLRKD IEASHIEKAVQSRLSAAAVVKDWEDILLPKDMRFEKFCSKLGKLANILNL GFMKDLLNHIWYKYSLNVIERSRKEFSKYDLVVTSRLHGHIFSCLLGIPN RVCDNSYGKNSGYYNQWTKNVDYAEKYE >MS1861 unknown MISQLFNANLKFTYKIAQIRKAEKWNKNKTRKKPL >MS1416 unknown MYIINIAVNDHVSAEQHDKLFAEHAEWFKKYFQAGTFLMLGPFKDQANAG VIFAVTESRAELDRILAEDCYYPNLASYEIREFEPKLIASNIAEFTGK >MS1996 unknown MIFHLVNETFCIDVQKSEKNTALWVTKVAKCGRFLRIFIGLLT >MS2137 unknown MEHEVCHRHLTSQDKCGKAGEQAQDDEDSAQGFDDAAYAH >MS1781 unknown MAHASSVCLLILTTKKYSFIFMLTNQLTIGLYLIAVLQLIYLSWTDIKSR IIGNRVIISLFFTMVALSWLKYEQVFVLQGAIGLAVCFILFMLKVMGGGD AKLIAVLMLSIPPAQLISFFFLTAVFGLLLIIIGWLFFRQSIKQKGLPYG VAISSGYLATLWLFAS >MS1327 unknown MNMKKVTLTIAMIVGLGLTACSGSQKQYDDGYAGEILFSQYEGSNLKLTV RYNNCDGKEGKVENLVITQPYDSDLPVGACVRVSTAEDGTKNIRNISRSV SRSWLSRTGIIR >MS2184 unknown MGKNTMDNQQDSMPEGFSKFSWAIAAFCLPVFLWPLALLVSTNLEKNPAL SQQQSMSMSMFLWLYPLLLAVMARICYKLHQGSPKSAKRLLMTSAVIFYG ILFYVARVGFSG >MS1947 unknown MNLQKLLLTRRFISTMAYFAMQSVFFIYLPQSFASDWAAVCSEPMQEPAY >MS1258 unknown MIHIQTKFFKKITALFLQCKEQKIISYNFYEYLND >MS0653 unknown MSKLKALRFNFLGLFADCLLPPKTDGLINGYGK >MS0080 unknown MGNGKMAKLQYPAIIETDKKFTALADLGKRLNSLDKSQIMTSFTYLVPTA FLELLAEKWSVTGYDGWLLAESEDAKRKLIKRAVELHRYKGTPWAIREII RQLGFGEVEFLEGLFDKRRDGSFVRDGAYFHGDRSKWAHYRVILKTAITN 
EQAALLRKTLRVFAPARCVLASLDYRTVALQHNGKATRNGQYNRGTA >MS0340 unknown MLMIAKKEYYYGLDQLRALLMLIGVLTHAASVISPFYRWDYHSDRYQDAL IHNIVHVAHFFRVEAFFLIAGFFSAMVLLKKGKHYFLKGRYLRVFMPLIS SILLINTFEVWFVVRHDITPWENIGIGNFIVHAWFLLTLMIISLVCLLPV DKFLDYLSGFNRFIKLGLFIFYMYLPFGIKFVLNMFVPMADHPLFYSFYG YLIEKTLYYSIYFFIGYIIYRSEAVRIFFNKKTVKALLWGITVTGLTYQT LTIGEAKESLPFTMRAINVFIQHASAISVSLLLFNFFFSASFPPSRSVAF LVRSAIIVYLFHHPVLIVLGYYFDVPGMTPFVYFMILVTCGYLLSFLSYL IINGNKLTRFLFGLK >MS1226 unknown MNLSGCNKHQHYLNFIGTGLYLSLIFYISMLHGTRLFAVFLNIPQCLQYI LFILLEKI >MS1093 unknown MIMHFACPDTKRFFNGERFVRFISCERLAIRKLQQLNAATSLEFLTKLPN NKLETTLYNHVSYYNLKINEQWSLLFLWDHNSPTDVKLVDMKEV >MS0259 unknown MLHLVLEDRHIIGQEKRLGVHSTEQDHLAVVFEDDGETGYFYAINTQEAQ PVVDSLSVYNVNGIESLQEPRQVQICWSEDGNRAFLLVNGYPHAAFDFTR LIGYNHSKYPLPELGSMWSHENITDKLVEEWLTP >MS0375 unknown MKKILILAAVSFLAGCVGSSTLPQKNAEKLPHIEVPKEIVHNGKTYYLRA QQDLGSVARYVYLENKENLKNWKSEIEILNDRNTEQRSIADRIALREKVY KNTGVEHFQLMEKDDSLYAFVIYAPSAQHDDWQVDVAKGENVMGCGFAQY QYSLKIPKTKKLMNMGKVKLIGYLKKYAVDKEMERLSTTKWNWVCRNNE >MS2366 unknown MVNYENNTLNKKKKFENYQKNNKLFTNIYL >MS0915 unknown MLKGGDFSLFQCFFIDKTKGFRSLLEKIQKNTRFFE >MS0965 unknown MKSAVKKPEILNIYEILKREILTLVMRRNYMPNITNNKKAFLVFH >MS2124 unknown MGFNKYYLKMEEHFLYVPYYNHQRRIRVLLPKDYYKEDWQSYPVLYMHDG QNIFYSKESYSGYSWKIIPTIKYHKEFPKIIIVGIDNATVDRLDEYAPWR TDVGNTAEARNTGGKGAEYGQWVVETVKPFIDGHYRTKPQRENTLLAGSS MGAIITAYMGAAYPHIFGHLGVFSLASWFSENEFLRFMHEHPIDRASRVF IQVGTKEGDDADAQYISNMNQAYIDSTLYYYQALIRTGHPLDNIRLKIMA NEIHHEKYWASHFVDFLRFSLMGK >MS1064 unknown MIRRVAMSDFGYAMMMVVLSLVIVVGLAVAIF >MS1115 unknown MFRKYGFIFKFTNRFKQYVKVRSKFLKFLKFTENQPHF >MS0109 unknown MNNSEFLTYAILTLGLVMAIPMFVRMGEILSQKVRLMLFPVKKVKIRRWH NDIFMGYGELDLTSSEPIIAQLDRIDAELKIRKENER >MS2298 unknown MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ GYVLAGSKPISPEQQRIRELENRIKELEEDKDILKKATAILMSLENKNTK SLRR >MS1156 unknown MEQHIIVGLIVGACILYVLRKFVFKSKTAKNSLCGGCDRCGGKKGCH >MS1214 unknown MVPEVLAVSAEVDLPVAVLVAVASVEVAPVEVGKVDKFSRF >MS0001 unknown MKGGDMSAIKDRLKDIDCALSDLERERKEILLDAGAPEIIGLKDDINALT 
VSLEYIDDEILPLLQQLSIDPDAYKYLSEDIKLSLLRDLPESVSAIKAII NKLTPVKHCIESFNHQNDIGF >MS0415 unknown MRNMKLAQHLALLKNRQLINDELSKIMLEIEQRLISHWHVDVTTKQVEMG LLHLAMALGRIKRGYAAQALHKDIFAEIQSAVCFPKVLKIHSDILALIPF PIPESEQTHLIANWYSLVIAQPWVLNIT >MS2224 unknown MLFEMAWQTTKINFFDRTLIKKSMTKASLTGKSGYN >MS0321 unknown MKTINLCLSFRLASVYLRKRKSMSSISFLISTALGLYIFVLMLRMWLQYC KVDFYHPVSQGIVKLTNPVLTPLRKAIPTVKNIDLAALFFVFVLGMVKVP LLYIANGQWAAEIIRQEWLQYVLIGALTVVAAFGKMIFYVIFFGAILSWF NRGNDQFSYLLYQLGEPVLSPIRKILPRTGMIDFSPMVLAFGLFFADKVL YDIFGILWQLAS >MS0103 unknown MAPRSSIEKLPEDVRRWLERALTENGFSGYVELEELLKEKGYQISKSAIH RYGQKIERRFKAIKDSTEAARIIAEGAEDKEDKRSEALMGLLQSSLFEAL VDIEDAKEDEKMSPMEKFQALSFAGKNVASLIQASTKLKAYQAEVKQRAE AAAKEVERVVKKGGLSDEVADEIRRKILGIATK >MS0184 unknown MRNPIHKRLENFETWQTLTFMACLCERMYPNYQLFCKVTEQSENGKVYQN ILNLVWESLTVKGAKINFDNQLEKLENIIPDVNDYEFFGVVPALDACEAL SELLHAIIAGSVLEQAIKVSQLSLQTIVTMLETQEDHELTETELKASEDI QQELDVQWQIYRTLKEQEERSVNLILDLKNELREAGISNIGIEIEH >MS1623 unknown MWLIRIVYLQNSIFKIQFCFIRANDCRHIKACD >MS0111 unknown MQQFLSAIHGGQFGQVINNYYAAPDCWQALSTNELHNAIKITKQKRQLAH RNKWKNPAVIGGMSCTLFAMVVWVGNLWYLFSDYSRLSTPNSIFSYIAGG LLLASLLCLYFAAPQIRREKIFIDRCNHVIDICEQLIHERKYD >MS0660 unknown MASLRFLKFLLIIIFLGLCIVTIDNIFIVSDFVLFGIFLFLLFLEVIINP KRNFINSLVLLAVFLNILGVFAIEHSEGSYYLYEVEQWITEEGSIPLLLI YQFFFLQGVFLFVQERKIENWNITNIHFKSFFILSLTFLLLLTFGLIAKY SPAPVLRVDRFVYDKEILGKFGQITNILFYYSLGLGILYFKEKKKIYLFL LFFIELAFLLKGHKFWNLIEILFLFLIPYTSNIIRAKVISLILAVGSVMT LFIVAAISINVYYFPSFDPVDYARQRLSQEGQLWWSTYKGYEPKLRTDEL YAELKNYTNLDEYNQYDIGMYKVMRLNTTPERFEWKLDKLSRYVYSTPPL LYYYLGAGLGILGMFLLGMAFSFWGNLLIYTINRGDLFLSLIVSRFFYIF RKAAKDGDIYKFFSIEFMLLILISLSFYIFYKYKDDLYKRKSLIAVNGV >MS0666 unknown MGESKLAKIEHLIAEINKLHCYFTNDYFKLGKYQKIDFNNGLTKVPLEHI LSYRLNLHESNNDYLYCADLYDIAYFYRVKTSESILDKIERFKQRSEG >MS0547 unknown MNWFVKPIAKTAKKRPHFWGHRWNLSGLRFKIPGKS >MS1822 unknown MLKDKMMDALQADWIAPGNIHGFTTYRQGGVSQEPYTSFNLGNHVGDDPN AVKINRNLLVENFNLPQLPVFLTQTHSIRVITLPFEGTDLDADAVYTAQP NQVCVVMTADCLPVLFTNKAGTEVAAAHAGWRGLCDGILEETIKCFQCPR DEIIAWFGPAIGPNAFQVGEDVMKQFVAQDNKAKQAFIADPNTEGKFLGN 
LYQIASQRLHNMGVTNISGGEHCTYEEQDKFFSFRRDINTGRMATVIWFE >MS1166 unknown MLLKKMKILHKSFLHLHKMDILDIASILAFADNN >MS0822 unknown MYGKSTFKLAQLAILISGLCSSCAISDYVSPHSSNKESIDVDLANQQIEQ EKMAEDARISAEKQRQAEAKLTEIIGERDLQFKSAVAKIYADNEYALLWQ DKDAEKKFLREYAAMVASGISVRSARSLEAISATNAGDNPVYDILLTDAF LDYMYYAKNVFNSAQNWLYTINGYKPAKPGETDVEEWLSAVKNGQNFAYV NSLTTNNSIYQQTIDKIGSSDFDDDKSVNSAILYKLALNAQRLRVIPNFS NGIFVNIPSYQLNYYRDNQLILNSRVIVGKKERRTPVMYSKLSNVVVNPP WNAPTRLINEDIVPKIKKNPGYLSAHGYSILDSKGNKVNPNSINWAAIGS KFPYRIRQDAGDNSALGRFKFNMPSSDAIYLHDTPNHNLFNKQDRALSSG CVRVEKSNQLASILLKEAGWSEDKKQRVLNSKKTTSAPIYSDNPVYLYYV TAWVENGQVNTLPDIYGYDIVQQPSYVNWHTVKKYL >MS1896 unknown MSKKHQILPQTRWTATSFWSLEFRSLSVLLLSFVIVGIGDGLLLLSNLGS APWTILSQGVALQGGFGVGWASLLISIMVMLAWFPLKLKLGLGTLLNILV IALFLGITTAYVPAPTSLLGRLVFVFIGVFCFGVGTAFYLTCHQGAGPRD GLMVGLCQRFHWRIGIVRTSIEVTVCLLGFLLGGTVGIGTVVFALSIGWV VQLSLMVINRSPCLLDNT >MS1854 unknown MRWQGRRESTNVEDRRSERSGISMGGKKTGVLGFIILLVGAYYGVDLSGL VGTSSNIGEVGSSLSQNEEETLEKLSRVVLADTESTWQDYFARSGQKYSA PTMVLYNGATPSACGTGQSAMGPFYCPNDHKVYLDLSFYNDMKNQLGAGG EAAFAYVIAHEVGHHVQNLTGILPRISRLQQSNPAQANQLSVNLELQADC FAGVFGYQAVKNNMFEASDLEVAFAAAEAVGDDRLQKRSQGYAVPDSFTH GTSQQRLTWFRKGLQTGDPTQCNTFTN >MS1016 unknown MTVQINKLDPDAAIDIAYDIFLEMAPENLDPADIMLFNLQFEERGAVEFV ETADNWDEEIGVLIDPDEYAEVWVGLVNENDEMDDIFAKFLISHREDDRE FHVVWKE >MS1362 unknown MLGTGAEILNIRAELPPLGKSRFNLIYISHCVIRLSFPIKFALYCIKPNE QKGNFMESKLIKQITPAISLHQYNEIPVIKLNHAVGQAEIALQGAHLFSW KPAYCPQDVLWLSEIEPFKLGTAIRGGIPICYPWFNNAGTPSHGFARISL WQLSDYEVSAEKVRLEFSLFSEQRLIIAKIQFVFTGECEITFTNYAEENA QAALHTYFRVGDIRQLELYNLPTRVFNSLTQTEENVPSPRTIGELVDCVY SAELGATLIQDNQLNRKINVEHINASDIVVWNPWHKPTGGMSETGYQTMV CVESARINKRLNSGERLGVKISLR >MS1164 unknown MCGDKKRQATVIASALSLAVAGKRAEAIAAGAVSPYVKEVIKKATDSPEM QALNIPLHVLWGEVEAELAGGKAQTGAIAAGVGEVGAAVLAKSVYGKEAS ELTVEEKQTLLNASKALAGVASAATSANGNAASTLAETSIGMTVAENAVE NNYLSQLSDNRRIWLREQLNRDDLSSVQREKYEQEFIQLEQDNHTSDILV AKAKYNPESMTQSDWELYQNYATRYYFESIRTEKPENVIADLDNILSNQY IKGYSYPYATAEKYRHELPSRWSLFGTNKSADEQFYTDIYSKYQNRKTYQ ESFDGRVAQSTAEALSYAGTMLSAGTVASVASKVGKFTSNGINKASSAIG 
TFATKYPKAAEGIVVGSISTGFDLYNGDASPEKTAMNYILGRGLAGKSWD KQLSVNAIYKGVISVNENRSDKDIVLGQVSNAIALGSGESVEGLLNLVGQ KGISKQIISNIVSGYVENKIDNRSKDSKEIRKEGDK >MS1190 unknown MKYYQKALYDEMTSFYLITCDDEGDKLGRIRSLIGGLVRKARQVIGQEGD DKVRIHQLLQLFYGDWGFHCDPEHYFEAENLYLAYVLETHSGMPVSLGAL LLYLADSLKLPLYPVNFPTQLILRAEVDNEVAFIDPWNGHYLSQAHLQKL YEGAFGFGAEISSEELERADVNTLLNRFRQLAKNALIRENRNDAAYRYIA SLLRYHPEDPYEIRDRGLVLAQMGCYQAAAEDLQYFVDQCPQDPTSFLLT AQLAELKDHFSELH >MS1040 unknown MRLFFMNFQEGKMNKIIKFSVVLIILLFLGFWFYTIYMTKLTGCSMKSGD GFFQDRLICDNQEIVPTGYLSSTLLEPKLIARGVTIYQENGKACYTDEQK FYIYNIEDKTTQVLNLEEFIKINAVSFKLPSEFYTLPADYLKDYANNCAK >MS1225 unknown MIFSVELIEKSFSFFIELVLHLRRFDFLSFRVICST >MS1643 unknown MELVASDGEGIYHCSATNLLRHFFTFFSVSFFLKSIFDFKGIRI >MS1473 unknown MLELLFLLLPIAAAYGWYMGHRSAKKDQEDVSNKLSRDYVTGVNFLLSNQ TEKAVDLFLHMLQKQEEENEIDSNSQFEAELTLGNLFRSRGEVDRALRIH QNLDRSSYYTFEQKLLAKQQLAKDFMSVGFFDRAETLYIMLVDEPEFAEG ALQQLAVIYQKTKDWKKAINVAEKLAKISPQEDNIELAQYYCEYARTLGE ESKEQPKEILQQALTVSPSCVRASMLLGDFLIQEEQYAKAVPVLENVLTQ NASYVGEVLPQLKECYQHLNQLDNFELFLIRANQEYKHNSSVALALADLI AEQDGRAAAQNKVYQQLTQNPSLFLFHRFVQYQVDDAEEGRGKDSLVLLH RIVGERIKQSFGYRCTNCGYQSHKLLWCCPSCRQWEKVKPIRGIESQI >MS2058 unknown MKCHRLNEVLELLQPYWSKDSDLNLIQILQKIADEAGFEKPLAELSDEVI IYHLKMHGTDKLEPIPGIKKDYEEDFKTAILKARGIIK >MS1339 unknown MYINSIFFYIDYSKLKIGVTMELKEFALKLRKNLTEEESILWYHLRKKQL AGFHFRKQAVIAPYVVDFICYKAKLIIEIDGEQHFLPSALVYDEKRTFYL KSKGFRVIRFTNYEIKRELDSVLDKIWYELTGEF >MS0726 unknown MRTCKRGFATLMIVFIIAGLAVSTMLFTDDQLHYHRGIMAQRSAYVSQMA QLQNLAIEQMPVICQQIPDGLPDNTTSYTLPISLSTASSNKSAVEISHFL RCRRYSLLATKPTKKFESYSTAVNEENIELFRHRFNQSYINEDTGQKVFL YWLDETTESLILSGDTNAVVIAKEPLKIEGKGRLRGVVISDYPVELEGVQ LSYNKYVMDFIYREFSLWKLAERSWSDFDAENN >MS1074 unknown MLVILPVSDHLLKVKSMFDNALLSLSHEQQQQAVEKIQVLMQRGMSSGEA IALVAKELREAHDNEKINSEKTKSAEK >MS1691 unknown MKQYQYRITLEYLEDNQGNPKDEKIQFTAANHDDIFKIIELSKQREGFTS DMAEQFTVGLKLMGEVMMAHRDFPLFREIKPHFLEIMKLVKGKGKAE >MS1634 unknown MRYLIGYDITDSKRLQRIYRRMIKFATPLQYSVFLFNGTKEQLDKYMQTV LRLYNKKEDDLRIYPLPVQAKYWQIGKNPMPEGIVLSTFVF >MS2095 unknown 
MKDLTAREFGYGHPTPLFMIGTYDEDGRVNFMNSHWGALNHGGYINLNIN TNKKTHLNIEKMKAFTVTLATEKLMPYADFFGTYSGFQYPDKFEKSGLTA HKAKYVNAPIIDGSTLVIECELVEILYQEHIHTIIGRVKNVSVDESVLDA QGKVDASKLGMIFFDSFSRGYFTLGERVGDAWSIGQSILNS >MS1507 unknown MFGKGGLGNLMKQAQQMQERMQKMQEEIAQLEVTGESGAGLVKVTINGAH NCRRVEIDPSLMEDDKDMLEDLIAAAFNDAARRADELQKEKMASVTAGMP IPPGFKMPF >MS2148 unknown MASGWRFKLPTSGKKHQKFAKPHQQTAKIAKCLTENKENHMTAYVVFIRD EMKDQAAYDRYLQLGVPTLAPFGGEILVANGAHEAFEGADFDGSVVLRFP DMASARAWYTSPEYEAVKSMRLEATLGRAVLLEGVA >MS2240 unknown MEMTSTQRLILANQYKLMGLLDPANAQKYARLETIVKGGFSLELKELDNE FLAISEAECQTVLETLEMYHALQVSYENLADKSDLTAHRLQFIGYDAIRE RKYLNYLRFITGIEGKYQEFMRCAPGCDSQTPMWDKYNKMLDMWKACPHQ YHLSLVEIQNILNA >MS2227 unknown MVTTMAEYDKLRLEWDCRRGMLELDKIIMPFYLEQFDNLTETQKATFVRL LACTDLQLFSWLFKRARASDTELQQMVDLILEKQGVVINN >MS1272 unknown MKNIRTFISIFLILLPLWAQAQREVKCRVVRVSDGDSLTCLARNNKQIKV RLLDIDAPERRQPFGNKARQQLAQLIFKREITLRISGYDRYNRTLATVFN EKNENINLKMVQLGLAWAYNQYSENPEYGKAEALAKKRKIGLWRETNPIE PSRYRRELYKRNIQNKKQRTEKN >MS1274 unknown MNLQEQLKNAKNWEERYRLIIQAGKNITKPTEQELAEMQPLSGCEAQVWF KISQNSDRTLHFQAYSDARIINGLLWILSLAVNGKPTEQCRRFDLTSYYA ELGIAQRLTSTRLNGLKQIEGCIHQAGN >MS0116 unknown MPYFTTTQQGEEMNHTDFQPLPYPQTPESARAYFNLHGINRSEWARYFGI DQQAISDHLRGRLKGTWGKSHKVAVLLGLKPNPETKVTA >MS2017 unknown MRKTHSFQQVRKIKPTWMSVSGHIPFKNGVSIPYESTLERDFLMYFTYLP SVDKIVSQPTTLPFVKNGITYTYTPDFFLSFTDGRKPMLIEVKPKAKWQK HWKEWKEKWKAAICFCQENGYVFHVYDEDRIRHLALFNLNYVQRYKRIQH EQEDINVILAQVKLMGNTTIDYLLSRFFAGSLYRMKGLQIIYHLLATKQL HCNWFLPLNEFTEVWGNNDE >MS0881 unknown MKFADFSLCKFKCCIQNAIQRSRKEKRREFITKKCGQNLSFFMTALNLFY MRTTGRYFP >MS0374 unknown MLAIISPAKTLDYQSAVPKFEISQPQLTQYSQQLIDICKQLSPAQIASLM SISDKLAGLNAARFADWQADHNEQNARPAIYAFKGDVYTGLDVESLTSDD VLFAQQHLRMLSGLYGLLKPLDLMQPYRLEMGTKLANKKGKDLYAFWGNV ITQTLQQALDEQGDNILVNLASDEYYKAVQASQLKARIIKPVFLDNKGGK YKVISFYAKKARGLMCRYIIQNRLTEAEQLKEFNLAGYWFDEAASTKDEF VFKRDLGE >MS1660 unknown MFNHINYFSIYCISKKSNLQLCYLSPRKNAGLNKSLLLPKLVGLAGCSGC VVGC >MS1759 unknown MPSFDIVSEITMHEVNNAVENANRILSTRYDFRGVEAVIELNEKNETIKL 
TTESDFQLEQLIEILIGACIKRNIDSTSLDIPTESEHHGKLYSKEVKLKQ GIETETAKKITKLIKDSKLKVQTQIQGEQVRVTGKSRDDLQAAIQLVKGA ELGQPFQFNNFRD >MS2241 unknown MKNLTKSALFISFVCTSPLALSAPDDSKTEALQKLEQQCNALKDSNIMNT SIKSVKWFAGGNLPPDEQASFTGASNSNIEAAPHCVVNGEIEKRIGADGK EYAIGFQLRLPSNWNNKFLFQGGGGLDGFIAPAIGSIPTHGSTATPALMR GYAVVSMDSGHTGARDPSFAKDQQARLNFAYASTGKVTTVAKQLIEQMYK EQPKHSYFMGCSNGGREAMHAAMRYPLEFDGVVAGNPGFRLSYAAVGEAW DNQQFMKYAPTNEQGEKIVANSLTQEDLDIVSKAVLKRCDAKDGLADGVI NAWEACDFKPEMVEKEIGKDKVALLNAVFGGAKNSRGENVYASWPYDAGI NSKGWRAWKIGDSQTAVPNGRNFTMGVESLTNYFMMPISPDFDPMQFDFD KDTQKVAQIAGMNDADETELTTFQARGGKMIIFEGVSDPVFSAHDLRDWY NKLNQDMKDANQFARVFMVPGMTHCGGGPALENFDPLTALEQWTDENKAP DFILAKAGEEFPNKEKEMPLCPYPQVATYKGGDKNKASSFECR >MS1511 unknown MTKRKLTQNQKRRIHSNNVKALDRHHRRAKKEIDWQEEMLGDTQDGVVVT RYSMHADVENSQGEIFRCNLRRTLANVVVGDHVVWRRGHEKLQGISGVIE AIKPRENEIARPDYYDGLKVMASNIDRIIIVSSVLPALSLNIIDRYLVIC ENANIPAVILLNKVDLLTDEQWREAEEQLEIYRKIGYETLMISAISGKNM EKLTALLADGTSIFVGQSGVGKSSLINYILPEVNAQTGEISETSGLGQHT TTSSRLYHLPQGGNLIDSPGIREFGLWHLEPAQITNGYREFQYFLGTCKF RDCKHIDDPGCALREAVELGKIHPVRFDNYHRLISSREENKSQRHFMEQD IR >MS0268 unknown MIKNSPESSKCGQKRENFCQKIPLRITPDGVLSDYDAAETI >MS1635 unknown MPTLYIDRRTTELKVNGDVLICYEKGERIATIPLASVDRLYMKGDINLQI SLLSKLGEKGIGVVFLQGRKNKPMQFLPQPHNDAYRRVTQTYLADNKLFC LTLAKNIVLNKCIKQCQFLAKFIEHNPKIITFIAELQKLFNLIVKQENID SLRGIEGRMGAIYFAAFADILPRSLGFNGRNRRPPKDPVNAVLSLTYTLL YSEATLAVYGAGLDPYIGFFHTLHFGRKSLSCDLMEPIRPSVDEWIAECF TAEVLKIDQFSQTNEGCILGKEGRVIFYTAFEKVVSEWRKIFEKQAYELV HLICGYQTEYHQDQFDDYTINMAHILGNEKCDI >MS2021 unknown MRAIDTECKINFTGYWPYLENTGLSVLYGLVI >MS0981 unknown MRKLQNTLYITTQGSYLHKERETLVVEQDRKKVAQLPVHSIGHIFCFGNV LVSPFLMGFCGENNVNLAFFTETGRYLGRLQGRQSGNVLLRRAQYRISEQ NPIPIARNIIAAKIQSAKRVLQRRLRNHGEHEEVQAAVMALNFSLQQLKQ AENLDLIRGIEGDAAARYFGVFQHLLAEKNGFGFDGRNRRPPRDGVNALL SFLYSILGKDISGALQGVGLDPQVGFLHADRPGRDSLAQDLLEEFRAWWV DRMVLSLINRGQIKPQDFVTEDGGAVNMKPEARKLLFQSLQAKKQEKIVH PFLQEEVEIGLLPYIQAMLLARHLRGDLAEYPPFLMR >MS0142 unknown MMTTKTIAITAATGQFGTIALDLLVQRKANVIALVRSPEKISNAQARKFD 
YANIEGQVEALNGVDTLILVSGNEIGQRFPQHNNVIQSAKKAGVKHIIYT SLLGASNENTVKSLAGEHVATEQALKESGVPFTILRNTWYTENYTGSIGA ALANNAFYGSAKDGKIASATRADLAEAAVNVALSEGHEGKTYELAGSTSW TLADLAAEISKQTGKQIPYIDIPAQDYAAALVKAGLPEGFAGLIAEWDVD VSKGALYSEDKTLEQILGRPTTSLADAVKAAL >MS0559 unknown MSRKFCLPKHISAEDFLRDYWQKKPLIIRNGLPEIVGMFEPEDILELAQN EDVTARLLKQFSEDSWTFTPSPLTERDFTELPEKWSVLVQNMEQWSAELG RLWNLFGFIPQWQRDDIMVSYAPAGGSVGKHYDEYDVFLVQGYGQRRWQL GKWCDPSTEFKPNQPIRIFDDMGELVVDEVMNPGDILYIPSRMAHYGVAQ SDCLTFSFGLRYPNLSDLMERIQHGFCYQNPEIDLNEFSIPLRLNQSAQP TGKLSETEIQAMKRQLLEKLTSSPQFDRLFRQAVASAVSSRRYEMLVSDE ISEPEEVLTALENGAKLLQDNNCKLVYTSNPLCIYANGEWLDELNSVEAE ILKRLADGEALALTDLMQLIQQTDERDLAMDLLLDGICNWLDDGWILLN >MS2378 unknown MTTMEIILVTLVAAICGMGSVLDERQTHRPLVACTLIGLVLGDLQTGIIV GGTLEMLALGWMNVGAAMAPDAALASVIAAILVIKGGQDKGTAIAIAIPV AAAGQVLTIFVRTLTIFLQHKADDYAAQANFRGIEFCHFAGLSLQALRVA VPTLAVALVAGTDTVTAALNAIPEVVTRGLQIAGGFIVVVGYAMVINMMR AGALMPFFFIGFVIASFSNYNLVGLGMLGACLALIYIQLNPRFNQAQLPA SSTSQKQLADDELEGL >MS0528 unknown MDVIIPISILLILFVIGTPVAFCIFCSTLTYFLMSHQPMVILIQRLAGGL ESVTLLAIPFFIMAGVFMNHTGISERLLKFCEVLTGHMNGGLAQVNVALS TLMGGLSGSNIADAAMNSKLLVPQMVARGYSASFSAAVTAAGSLITPIIP PGIAMIIYGYVNNVSIGRLFLAGVVPGTMLCILMMILVSIISKKRGYLPI REKRASCKEVIVSAKDAVLALLLPIIIIGGIRMGVFTPTEAGAVAVIYAL ILGMFIYRNMDAKKLWLATRESALGAANVLLIICVAVAFSKFLTWERVPQ ALASWMTTVVDSPIAFLMLVNVALLVLGMFLEGNAIMIVLAPLLAPIAHS YGIDPIHFGIVFIFNGAIGTITPPLGTVMFTTCSITEVPIEKFIKDVLPF WGLLLLELVLLTYIPTITTWLPNLVYGVAQ >MS0799 unknown MSYKLKSNRGNEYELVSFGIEKICGFFGQSKT >MS0616 unknown MTTEIKKVTKSDLNSVVLRSNLFQGSWNFERMQALGFMYSISPVIKRLYP DPNSQERKDAIKRHLEFFNTQPFVAAPVLGVTIAMEEERANGKPIDDAAI NGIKVGLMGPLAGVGDPIYWGTARPVFAALGAGLALSGSILGPLLFFVLF NLVRLATRYYGVTYGYKKGLDVVQDMSGGLLQKLTEGASILGLFIMGALV QKWTSINVPLVVSTIQKQDGTTEITTVQSILDSLMPGLLPLLFTFACMWL LRNRVNALWIIVGFFVIGIFGAWTGILA >MS2108 unknown MGYRVNSVLGTKFRIWATARLKDYLTKGYAINQQHLSQNAHELEQALALI QKTAKSSGLTLESVWWTLSAVIRKHFYCLQAAEKR >MS0600 unknown MRDKDDIENIKIKSFNSTYFSFIENDVALMRKRFHLTV >MS1856 unknown MFKVSKEFSFDMAHILDGHDGKCQNLHGHTYKLQVEVMSAQLHQSGAKKG 
MVVDFSDLKTVVKKFILDPMDHAFIYDNTSERECKIARLLVELDSKTFGI PVRTTAEEMSRFIFNRLKHDAGLPVSAIRLWETPTSFCEYRE >MS0557 unknown MNWLTLAFGSAFFAGLTAILGKLGVEGINSNLATFIRTIVVLFVSAGVIS MRNEWQLPQHIAVRPLMFLILSGVATGLSWLCYYRALQLAPASWVAPIDK LSVVIAIVLGIVILGEPISIKLITGSILILAGVLVLAL >MS2233 unknown MGFSARIKICIKGNLSVSFPTMLVSYQTFFSYILGLFYATNSNF >MS2062 unknown MIAVYAIAKVKADKITAFEDVVKELVAKSRGDQGCISYACGSVQGKENTY TFIEQWQSMEDLKLHTQQPHFIEAGAKFADILSAELEINVVDYLA >MS0286 unknown MTDKIEKAKNSTREATPQSAVKNSEKTRKWCRRIFCIFCIVVLVPLIGLL GALSFESGQQGLLKLTDKMTDSLSFEQISGNLQDGLELHNIRYQSSGIDT LVEKARFQLDFNCLWRREICVEDISLQKTDIHINTALLPPSESERKTDSG EMSRIYLPFGLTVKNVAVSELALSIDNNYLNLGVFKTAATLNNRRGLTLL PTIINDFSFVSKTSAEQQAEAEKKAEDEAEQAQPVDWAKIDEILTPALLG NLNQITLPFDIHVEDIQGQNWQYESFVDERSQQQVIVSRFQLQADATNYD VELKTFDIVSNLADLQAQGQIRLNEDFPLNLVLHGDIHQDKASVLPMKRL DLELSGNLKNQTALLLTTQGDVDATLKGTVELGKEKMPLDLQLTSKKAQY DFAVANLKPLKLQDVNAKITGNLLDYQAEISGQVEGMGAPKTEVDLLGSG KLYQAEVKQLKLHGLEGRIDLQGDVDWQDGAKWNAELDLNKINIGAYVKD FPAVLTGKVSTSGLANSKTWQVSVPTLDLTGSVSQRPLVLKGGINLGQEA LLDIPNLLMTYGENKLIAKGLLSDKSDFNLDINAPNLKGLLPDFSASLVG KAVLTGDMAEPNLDIDLKGDQIQFQDFYLAKFNVQGKVNSVPQIEGNLAL DVSGFNYGDINIHSVKLTAKGNEKAHELQLRSEGDPIAAQLNLSGGFDRA LQQWKGTISQTDIKTPIGDVTNNQFAVNYEHKSAKATISAHCWHNPDVEL CFPQSFTVGQNGEIPFEMKKLDLNLVNKLTEQENMLAGILTGKGKFAWFA DKPVKLDASVTSNAIYFSQKVDGKNFKLDMAKLNVNANLENNNLAVTSAI HLQNQGNVAADIKLLDIDQVGKLSGSLKMSGVNLDLINQILSNKERISGD VGAALTFAGDLNKPLINGSLDIKNMNAVVQNMPFDITDGNLALRFYGTRS SLQGYIQTPDSRLNIDGNADWQDINHWHTAVRAKANEFKLDIPSMAKLKV SPNVEMKASPTLLELTGNVDIPWGRIAIESLPDSAVSVSSDEVILDEPPR TRIVKLATETDGMVIRSDLKINIGNDVNLEAYGLKTNLNGRLLVKQEKGQ LGLYGMINLRRGRYASFGQDLLIRKGQISFNGLPSHPMLNIEAIRNPEAM EDAKVIAGVKVTGLADSPSVDVFSEPAMPQDQALSYLLTGRSLENSGEAG SGGSVGAALLGMGLAKSGKAVGSIGETFGIQDLNLGTSGIGDSSKVVVSG NLTPRLQVKYGVGLFNGLAEFTLRYRLLPRLYLQSVTGVNQAVDLLYRFE F >MS1424 unknown MSHKSALSPEQIYSVSPTLATYTKTLISDDMWNRPILSKHDRAMITVAAL IARQQTMGMKHYFNLAMDLGVSAKEMSEVVLHMAFYAGWSNAFAAVDILK DIFAERGISPDQLPTLEPEMLPMSQALPDNDFFMGLIDQNIRSFVPKLAD NSTDVLYHQVWLRPDLNPRDRNLISVTALIAQGLYDFVTVYSLRAKAVGI 
SKEEMQELLAHLAFYAGTPYIVPAIPHVAKAYE >MS1807 unknown MYLNELNKIPLIIKEIPAKRKRLLMYFYRLERLSAFRENRHNFPLFIYWV NL >MS1286 unknown MQSEMLLPAIGGFIAGVILTYLVLRLTKGSIKNQAKTENALQQAKAELAE QKKQLERHFAESASLLKTLSEDYQKLYRHLAASSTTLLPEFKELFNGSTV NQDKPRIEPTISDLKTITEENEDQPRDYSEGASGILRVER >MS2383 unknown MSTTAIIMMIISLIVIWGGLALAVLRLPKE >MS1818 unknown MNSVLKKFAKLSALFGVMLFATTVQAEQAKVDAKQESPVAAFDQTLDNVR DPNKYCAQCHNLDTSKDQAVGTNHAGKFHGTHLTKNNPATGKPITCVSCH GNISEDHRKGAKDVMRFESDIFSTEQPMYSVQEQNQVCFSCHKPDDLRKK LWAHDVHAMKAPCASCHTLHPAKDAMKDIEPKERVKICVDCHSEQRLRKE AADAAQSATEQKDKQ >MS2112 unknown MVGRNAHPTFQMAFSISNSQREMLYTSPQHGKTTMLEQFIAQLYQQKNAQ QAV >MS0835 unknown MDWLFAWSVWHWLILGFLLLIGEILIPGIFLLWWGISAIIMAAIMALFTT LTLTVLGISYAVLALLLSLVWWKYQHNKDKSDEARSVLNQRNHAMIGAIG AVQEIALNGVGRGYFGDTTWRIQGKELKVGDRIEVLKVRGITLIVRKLGN >MS1067 unknown MRLAWVAFNTILHKEIRRFTRIWVQTLVPPVITMTLYFVIFGQLIGGRIG AMNGFSYMQFIVPGLIMMSCITNSFGNVASSFYSTKFARNVEELLISPTS THVIILGYMAGGMARGLFVGTLVTVVSLFFVEFNIHSWTIIFITVLLTTA TFSLGGLINAVFARSFDDISIIPTFVLTPLTYLGGVFYSISLLPAFWQGV SKLNPIVYMINGFRYGFLGISDVGLGYTFAVLITFITALYGFVYYLISHG VGLRS >MS0070 unknown MFILIAMVFIAIFLLVETLKSKMIYFNQILRI >MS0874 unknown MKLKYKLCIALFAWVSAFHVAAAPQTHAEVSNVTTELNDIQIRLKAQQSA DKGDWKTVYTLLLPLAQRGDSQAQVNLGILFSSGRGVEKNLEKAYWWFNE SAEQGNAKAVTYIGLMYLEGVGVKQDTKHAIRILEKAGRVDYPRAMLALG NAYYMEKNLQKSFLWFERAAMKGVSEAQFKLGMMYEKGEGTHKDEEQAVY WYQTSLKANDDIAEFAKERLSALGRLR >MS1082 unknown MIAFGYITALSLSYFLLAPDFKGLSFTEYFIQSEAKPIFLTLGLLLPIGF IVMSKAVEYGGIVRTDAAQRLALFLQIIAAVILFGETLNNMRVGGVIVAF FALFCLLTKPTKSIENALKAVFALAAVWLIWGVTGILFKKIALMGGAFPT TLFVTFSIAAVLMFTYLLIKRTFWNASSLVGGIILGCLNFGNILFYIYAH QYFKENPTIVFATMDIGVICLGMIVGALVFKEKISKINMLGIVLGITAIL LLRV >MS1931 unknown MMLSPSEILKKTTALFAATICLYFACKLILMGTGFYPQPKLTDILLFAIL IVIFNSSKNLFYFLLLPFIIAHALYAPVGITFGAPSYQYIASVFATDLME SREFLSQLSIKNYLMPVGIIGLTLAFRWITQKYDLKLHKNKMFLASITAF MLLANSPFKFIDEISTSGTQVISELQRLNNMTIESEWGDSQLINSNYDDY VLIVGESARKDYHHAYGYPVKNTPFMSKANGVLIDGMTAGGTNTIASLKL MFTQPNTQTKEGNYSLNFVDLIKSAGIKTYWISNQGYLGEFDTPISAIAN 
KSDEKIFLKSGDSLNSNTSDFELLPKFTQVLERPSTGKRFIVVHLYGSHP ITCDRLNDYPKLFDDDKIAKKYFNVNCYISSIKKTDEVIKRIYDALAENK AKTDRTFSMIYFSDHGLAHQITEDNIVIHNSSGKSKRHYDIPLFKISSDD TKRHEYRVFKSGLNFTAGLAYWVGISNAKLAVREDLFSNEPDKDDYGLKA EIDKIDVPEDKAVVIPGTH >MS0477 unknown MTKIFLYGIIAKNFCITGSHNMSSKTIELNFLGQVLRLNCPEEQHDSLRE AAKLLDSRVTEMKDRTGILQVEKALAIVALNLSFELLQEQHKTHKTENVL QNQIEQLTRSLESISASTPTQQASYSID >MS0679 unknown MKTKLFIRIVSLVTKNDIRPSVLKCGHFSAFFKNRKKTTALLSDD >MS0143 unknown MIFKEIKMNNFFERGNVLAAACPSRQILQHLTSRWGGLVLIALRSGTKRF SELRKTIDGVSERMLTQTLQQLEEDGMLVRKSYNTVPPHVDYTLTEFGAQ ASEKMFELVDWLESNLNDILTHKVSKQ >MS1011 unknown MHKLKLILLTSTLFGLFACSNTQKTQIKTGYLKDNISQEELSNPTQYKRY YYSCQNFETGTESYLSTYFPLSRESRMKDNFGIYFQLDNGKVQPFDHIAN KPLNARASRFEVIYRSYHPIQGAYIDLIASENSSVYYKDYRGMRSPWLDC KES >MS0747 unknown MKELFATTARGFEELLKLELSSLGATECQVAQGGVHFMADDETQYRALLW SRLSSRILLPIVKTKIYSDLDLYSAVVRQNWLAYFDERVRFLVDFNGTNR EIRHTQFGAMRVKDGIVDYFERNGKARPNVDKDYPDIRIHAYLNRDDLVL SLDLSGEALHLRGYREDSGAAPLRETLAAAIVLRSGWKQGTPLVDPMCGS GTLLIEAAQMEAKIAPQLHRMHWGFDFWRGHNQAAWEKVKREAVAMAEAE FNKNPNPHFYGFDLDHRVLQKAQRNAQNAGVAHLIKWKQGDVAALKNPTP EDKGTVICNPPYGERLGTTPALIALYSVFGQRLKEQFPGWNASIFSSEQG LLDCLRMRSHRQFKAKNGPLDCIQKNYQISDRTLSPENKSAVENAGEFKP NANVATDFANRLQKNIKKIEKWAKQEGIEAYRLYDADLPDYNLAVDHYGD HIVVQEYAAPKNIDENKARQRLLDAVTATLAVTGVETNKLILKVRQKQKG ANQYEKLANKGEYFYVNEYGAKLWVNLTDYLDTGLFLDHRLTRRMVGQMA KGKDFLNLFAYTGSATVHAALGGAKSTTTVDMSNTYLNWAEQNLILNEAD GKQHKLIQADCLQWLANCAQQFDLIFVDPPTFSNSKRMEDSWDVQRDHIK LMGNLKRILRPNGTIVFSNNKRGFKMDFEGLTRLGLKAEEISAKTLPLDF ERNKQIHNCWIVEFV >MS2065 unknown MLFCPVSQIVRQVRPFILTQRALAQIQWQDVPFSQTVKTTLTEQQKQAFT AQFAGIASPVAAYRIPANQGTLEIEIESPVIDQTLFVPTAVVLDGNFNVA ATYPSSSFKLQEEGGLKGNRLSAELNLTPAMNQDYIYLLIYTTQQDLAKT TMMPHPAKVYAKATGRQPPAINDIEVAHSLNGQVQINVSSANGTKFIGLP TEIFSSNKASTPVGKPAASPATAAQNPNAVVTVVDKDTEAYFNQAVTKAL KAKDVNKALNLVNEAEKLGLKSPRQIFLKNVNSN >MS1428 unknown MNMNKKIVMILKILLAVIVLLTGAVWAFMTYHPVWGGTPDEGSMARIRAS KAYNATLGKFENQEPTQLLTTDEKPSITTWITRLMAADEGKNPSEPLPSA AFDKNVLKDGEMVWFGHSTVLFKLGGLNVITDPVFHNASPIPYIGISPFK 
TEHSYSVESLPELDIVLLSHDHYDHLDYRAIQELDSKTKHFIVPLGVKAH LQRWGVADDKITEMDWDEQTKIGTLAITLVPARHFSGRTLNIKDPTLWGG YIIQSPELKYYYSADSGYGKHYRETIAKHAPFDFVMIENGAYDKKWALIH ETPEEALQALKDIGATKVLPIHWGKFDMANHVWTDPINRLMKDVASQPEI SVATPKIGQIFHTQGDLPAEQWWQGVR >MS1640 unknown MQIDRFERHLDPSSIQSGDVVIGTLPIHLAADICQKGAKFYFLSVNVRAE QRGTELTCEQLVEQGCSIEAFYIQKL >MS1987 unknown MTEQNLLSSLAHMISEQRNPNSMNLDSLSPLELVTLINNEDKQVPLAIEK VLPQIAQAVEKIVRTFQQGGRLVYIGAGTSGRLGVLDASECPPTYGVKPE MVVGLIAGGERALRHPIEGAEDNAEQGKADLQQINFSKKDILVGIAASGR TPYVIGALNYAKSLGAITISIASNPDSAMASIADIAIDTLVGAEVLTGSS RMKSGTAQKLVLNMLTTASMVLMGKCYQNLMVDVQASNEKLRARAIRIVM QATDCEKEVAERFLKAADNNAKLAIMMVLTNLDKQQASVLLQRHQGKLSR ALSQ >MS0291 unknown MYCVQCEQTMVTPKGNGCSFSQGMCGKTAETSDLQDLLIATLHSLSAWAL KAREHNIIIHEADAFAPRAFFATLTNVNFDSARIAGYAQQALIYRNQLIK AVNEVEPNPNIDHPLANIELNGISVEQLALQAKQFALDTDRQQIGEEAHG VRLLCLYGLKGAAAYMEHAYVLDKFDNDIYAEYHGFMSWLGTQPGDLNEL LEKALAIGSMNFKVMAMLDAGETEHFGNPVPAMVNVRPVKGKCILISGHD LKDLKELLEQTEGKGINVYTHGEMLPAHGYPELKKYKHLVGNFGSGWQNQ QKEFARFPGAIVMTSNCLIDPNVGDYADRIFTRNIVGWPGVTHLEDHDFS PVIEKALQCDGFPYTELEHLITVGFGRKTLIDASDAVIDLVKAGKLSHVF VIGGCDGDKEERHYYTDLAYALPKDTAVLTLGCGKYRFNKLDFGTIDGGL PRLLDAGQCNDTYSAIMLAVTLSQKLGIGLNELPLSIVLSWFEQKAIIVL LTLLALGVKNVYSGPSKPAFLNDNVMALLHEKFGLSGLTTPEQDFGHIIN KNL >MS0534 unknown MYPSHLKFLSLKSAVNFSIVLPASSYSLFADA >MS2090 unknown MQTSNALDNLKSIAKNNKKRLAGTFGLVAAENVLFLTYPVFGSFAVNAMM SGDVWASLSYSLLVLIIWSIGAMRRAVDTRAFARIYAELAVPVVASQRAK GLDTSSVTARVALSRQFVDFFEQHLPILIMSAFQIIGSALMLLILEFWAG VTACAILAFFAFLMPKYAKTNDLLYLKLNNRLEKEVDVIERNNGYQLNKH YGWLAKLRIRISNREAAGYLWIGVAMALLFGVTVVQIATTQGVKAGHIYA VITYLWQFAMSLDDMPRLLEQFSNLKDIGKRVEV >MS0451 unknown MRLILDKFSENFINCAHKKAHLILLNTLSEKFIKI >MS2109 unknown MQTLITPEKYIFRRYFLQNSSKIRPLARNFLRDKMNVV >MS2262 unknown MNYMQDNDKLYRYLFQDRAVRGEWVRLNQTFIDTLNTHHYPNVVRNLLGE MMVATNLLTATLKFNGDITVQIQGDGPLRLALVNGNHRQQIRALARIDGE IRDDMSLHQLIGKGVLVITIAPQEGERYQGIIALDKPTVTECLEEYFQRS EQLQTQLLIRVGEYEGKPVAAGMLLQIMPDGSGSPDDFDHLATLTATVKD EEIFGLPAEELLYRLYHEETVELYEPQAIQFHCGCSQERSGSALLLINDD 
EIDEILEEHNGSIDMQCECCGTHYFFNKEAIEKLKKSGEEPVTTH >MS2019 unknown MLSLFGQLKLFLSSLTGLRLTDIYTMNGLLRKISSKWLTESIIKTLPIFI SSNLVAIVIWKLQISHLAMPLILGVIAGGLVDLDSSIGGRIKNLIFSLIA FAISSLGAQISLGYGWIFIPAVMVSAFILVMLGALGQRYSTIAFGTLVVA IYTCLSYNPEMPWYGNMSMILMGATIYGLVSITVYLCFPNRVTQENLANS YDALGEYLQAKSEYFDPDDDNLATKQINLAEANRKVMPAFDQTRVSLFYR LQGHNHQVRTRRLLRYYFSAQDILERASSSHYQYHELFQELNNTDLMFRF QRVMELQAAACQKIATALRRRETYTHSPRGKKALQGLLDSLNYYNKQGLP NTYRWQMIAENLRNIENQLSQIEQDNISVEASDNELVKSIRLTGENVSGI QNMFRVIRGQCTFSSQLFRHAVRLSILMVICSALVQIFNLDSKGYWIALT AIFVCQPNYVATKKRLIQRIIGTVLGVIVGYSFQYLSPSLEALLGLTVLT GSLYYFFRLSNYGSSTFFITLLVFVSLNVIGVGANEGILPRLFDTLLGTA LAWIAVSFLWPDWKYLNLHNNLKSTLSACTEYMRHIIAQLQFGYNDQLAY RVVRLEVHNNISSLSAVISNMYSEPGKYQKALEFAPKLLGITYTLLSYIS TMGAYRAASRELDHNTEFSALFFKYGKQTTEILDCLTDKKCSTDNINERI KIIDENLARFNKYDKQSNGIEQVLVQQLRMILQLLPQLGVLVKTENSYFQ LES >MS0554 unknown MLKKYRTFIKVRYFYIKFFETPVSPDFYTK >MS1618 unknown MRSPFCQNMRTFEGFSQLQKPADKSKQNLKLSLIKLGKSVKF >MS1633 unknown MFEQEAWQPSAPIKTLFTRAKIIREIRKFFTERGLLEVETPVLSEFGVTD VHLSTFNTEFIAPIGENSKTLWLMTSPEYHMKRLLAAGSGAIFQICRVFR NEEAGSRHNPEFTMLEWYRPHFDMYRLINEVDDLLQQILDCEPAESFSYQ FVFQQYVGLDPLSAPRAELVAKAREHHFMCDENEERDTLLEFLFSTVVEP QIGQTRPAVIYHFPASQAALAQISSEDHRVAERFEFYFKGLELANGFNEL TDANEQLIRFERDNRQREKMGLPQRAIDKRLLAALEAGMPNCAGVALGVD RLLMAALNANRIEEVMAFGVNNA >MS1271 unknown MRQKIFLFVRSLIILYLILFIGEGIAKLIPIGIPGSIFGLLILFIGLTTQ IIKVDWVFFGASLLIRYMAVLFVPVSVGVMKYSDLLVSHASSLLIPNIVS TCVTLLVIGFLGDYLFSLNSFTRLRKKAIKKRDINNVNNKGEAS >MS1329 unknown MYLVKFLHFFSLYLFQLAKARYHSGLFFEIK >MS1483 unknown MSYNNRNDCFFMQHDKNTQNSTALLSAVNLLKKCGQKSICKRLHIEL >MS0509 unknown MKKTYFISDLHLSENRPKLTALFEHFMHNIAPQAQAVYILGDLFDFWVGD DEKSPLISRVQAQIKQLTEKNIPCYFIHGNRDFLLGEKFAESCGLQLLPD YKIVNLYGTDTLICHGDTLCTDDVNYQTFRRKVHQKWRQRLFLLLPLKVR IKIADKIRRQSRHDKKMKSAEIMDVNGEFVCQIFERFNVRQMIHGHTHRQ NIHQIPPHFKRIVLGDWHDDYASILEVSEQETYFLPQTAR >MS1490 unknown MTRKYWLIIMKNKALVLDLDDTLYAEIDFLYSAYKHIASRLAPERSETLF NRLVELYHRGENAFQYLVEQYDVDLSTLLDWYRFHVPQIRLFPHVADQLN RLKEDFRFALITDGRSVTQRNKVKALGIEPLLDFIVISEEVGSEKPSLNN 
YRLVQDALHCRDYIYIGDNPKKDFVTPNKLGWKTICLKDRGTNIHRQDFE ILEEFRPHFYMSDWSELPTFLDF >MS1398 unknown MTALLRWKSMKQTSLFSQNNTQNQPLASRLRPTSLDEFVGQKHLLEPGKV LQQMIVQDELSSMIFWGPSGVGKTTLAQIIAHQTNAKFITFSAVVSGIKD IKKIMEEAETDREMGEKTIVFIDEIHRFNKAQQDAFLPYVEKGSIILIGA TTENPSFEINSALLSRCKVFVLEALSNNDIVLLLKQALNHPQAFIPLEVN ADEKLLQAIAEFANGDARIALNTLELAVKNVEKQGNSVHLSENLLADILN NRQIVYDKTGEEHYNIISALHKAMRNSDPDAAIYWLSRMLEGGEDPVYIA RRLIRFAGEDIGLADTNALTLTTNVFQACRFIGMPECDVHLTEAVVYLSL APKSNAIYQARCKVREDVKNTRNDPVPLHLRNAPTKLMKNLGYGKGYKLA HHYEDKLTTMQTMPDNLLGKQYYFPTEEGNEQRFKARLAQIKQWKAEHK >MS0418 unknown MFEWMFEADFWQQHSLWFMFVSSFLSATVLPGNSEIIFLALVSANLFTAQ DYFSPPVFNLLSLATLGNTLGGLTTYWLGRVFPKPELRDQSNKKVRWVFA KFQRYGIFVLLLSWLPLIGDLLCAVAGVMRLNWFASLCCIFIGKALRYVF LLYLAVGYTFW >MS2308 unknown MHENFDQQLIFSELIYGNKQYLRIICTFAFKTATTTFRQSHGSL >MS0108 unknown MSAPLTFQQVFDRVVGHEGGYVNDPHDPGGETNWGITKYTARENGYTGSM KAMTREQAYKIYEKAFWQRYHCEKLPEAVAFQFFDAAVNHGVGNASRMLQ RAVNVADDGIIGKVTLSAVEKMPISDLLLRFNAERIRFYTKLKNFPRYGK GWMNRIAGNLAYAAIDNEV >MS0094 unknown MYVTIDELTTAFARKTLVQLSNDEPTATEPNLTVLDTAIKVAEERIDAAL RSRYTLPLTQVPTLISQHALTLARYWLYARRPETKMPETVKETYTQAVKE LEQIANGKLHLGIAESAVEKSNDLLPDNSEYEVRATQRINTDGY >MS1649 unknown MKLTNIIEIKAKLVLKTGLHIGAGDSEMHIGGIDNSVIKHSITQSPYIPG SSLKGKIRTLLEWYSGEVKSEPLSINNVASANNSENVKNILRLFGFAGHS ENNKELCQELKSSRLAFWDCALNEDWEKMIREDNQLLTEAKSENTIDRIT ATAGNPRQTERVPAGAEFDFKLALRQFEGDSEELVKLVLKGLRLLELDSL GGSGSRGYGKVEFQGLTVGGKEEKLPENPFA >MS1427 unknown MQVISGHWGEMLPFYLQRLDDSIPQAATGLKRSITQTFKEQVFVTPSGML TLPHFNFIYELVGADRILYSIDYPYQTLDGARAFIENLPISQAEKELIAY KNAEKLFGLG >MS1629 unknown MIIFKGLYIGTSSLYYFCRCNPSPAAFFVYFQPHFILAF >MS1110 unknown MFLIYRNSLNRNRGFPSLPDKICAKSTALLPAP >MS1137 unknown MYLRQLDISGFRGIKRLSIHLRPDMVLIGENSWGKSSLLSALSLILNVDN GLYHFVPTDFHRADNMKDITLLFTFSESSINEEHEKFNPVYRHIFVPHED GFERIYLRVSGDINEQNQVQTYYSFLDQQGQPIDVENVDFLVKELTHDHP VYRFRDARLNRHKANSQPLKYAENIDAVSRELYAVTELVKYYFVETQEYA QMSSDPGVLWDLAQSLCYRLEQRKNPELQQRLVNAITSLFEHNGKLNPGS HRFMRPILLLEDVATRLHPRMVAIVWKLANYLPIQRITTTNSVELISQVN 
LRSICRLVRYDDRTRAYQLNRRDLGKEDLRRLSFHVHHNRSRALFARTWI LVEGETEVWILSELAELLGIDLDIEGIRIVEFAQSGIRPLIKYARAMGIE WYALTDGDEAGKKYTETVKTMLLEHELLSNRVTTLPRQDIEHFFYSSGFE NVFIRLARWEPQGGHYPIHKIIQKAIQRTSKPDLAITLSNEMANRGRDSI PLLFKRLFSKVVSLTRTQES >MS1292 unknown MEIKMNNEITSEKSSWRSKLGALGPGILMASAAVGGSHIIMSTQAGAIYS WQLVPIIILANLFKYPFFRFGAQYTLDSGNTLLEGYLQKGKFYLWFFFLL NIFATIINTAAVGLLCAAILTFVLPFPVPVPILSLIVITVSSGILLLGKY RMLDGLSKLIMIALTVTTVMAVLTALFRNRIQGVAQADYVAPSPWNLGPL GLGFIVALMGWMPAPIEISALNSMWVVAKRRLTKVTYRDGIFDFNVGYIG TAVLALVFLALGALVQFGSGEQVQMVGGKYIAQLINMYASTIGDWARGLI AFIAFMCMFGTTITVIDGYSRTNVESLRLLLKRKESSPKYLNLAVILAAL SGLAIIFYFNNAVGPMLSFAMITSFVFAPLFAWLNLSLVLKGEHKVRGGL FWLSIAGLIFLISFAGLFIANQAGWLA >MS0147 unknown MYPVDKHIKGTGKRVNKKSSKIHRTFLALPDKIYYNRPNFRK >MS1665 unknown MALLSTLSDRKSAVKKTEILNYAFVQSSTS >MS2123 unknown MHFERRSQWSSELGREMYFNVYGHTGKPVIVFPSSGGNQEEYANFGMIDA CRSFIDRGLIKIYTPDSYDKESWLATWKSGHDMALAHNAYDRYIVHELVP LIRHESQWNGTMIATGCSMGAFHSVNFALRHPDLFDTTIALSGVYDARFF TGEFYGDPTVYFNSPIDSLWGQNDDWFLNQYRRNHFIVAVGQGAWENEHV SDTVRLQEAFNAKGIPAWFDYWGEDVDHDWPWWRKQMPFFLSKLEEQGII >MS1474 unknown MIKYIFAFVIIIAIILVAITVGANNDQVITFNYIIAQSELQLSSLVAILF GFGLILGWLITGFFYLKLKLKNITLTRQVKRQTQQINELTTSLDKAAQ >MS2363 unknown MPKNKRLTIKAKQITARGAKNKIDAEINRPPTGRGLLIIILLLVMFWFFT VHIAVSYKG >MS1584 unknown MGPRSETRQINNYKIFYFLTALLSKVRFNFPLF >MS1710 unknown MLKATFKTWVSKILLLATALFAVQNALADESPYGLTRQAAEKLFADIKAN QPKIKQDPNYLKTIVRQDLMPYVHVNYAGSLVLGQYFKSTTPAQREQFFA AFDQFIVQAYAQALTMYSNQDIQVQPQQTVSDSQASVRVKLLQKGQEPLN LNFQWRKNSKTGKWQVYDMTAEGVSMVDTKKQEWSSILRKNGIDALTAQV QRAAAVPVSLGKK >MS0527 unknown MNMPISKKKFWLSDIDEILASFFLALIVLLSGYGVVMRYFLNTPSAWVEE ICVVFFIWFTFLASSALCKNNELIRIDYLLTKIPAKVANFIDGVIQPLIM IFCLGFMIYLGFKLLPMSKMRFTPALQISYVYIYAIIPISALFMLYYELR KIVYYFKINKRN >MS0714 unknown MISKIKVSLVALCAGLFFVSVNTSAAETQTQVPQQCQKLFSATERLIEEA EKQPGTHTQVSKIKNKLNQSKKQILEMELATQIKSCDHGLARLNRLNQQD QITN >MS2246 unknown MSENKTQVNPSVERFEQAVADKSYESACTELLSILGKLDSNFGNINDIEF QMPKQLAEANLQQDKIVYFCTRMATAITTLFSDKELNISESGAQRFFLFQ 
RWMSLIFASSPYINADHVLQVYNQNPDRISSEVHLEANRSALLKFCILYF PESNLNINLDTLWNVDANICVSLCFALQSPRFIGTATAFSKRALILQWLP EKLAQLPNLNNVPSSITHDVYMHCSYDVAENKHWVKNALNQVIRRHVLEA GLQDRDVKKLGYRNGKPVMVVLLEHFHAAHSIYRTHSTSMIAAREHFYLI GLGNESVDQKGREVFDEFHEVAGNNLIEKLAFLRNLCEENGAAVFYMPSI GMDLLPIFASNIRYAPIQVIALGHPATTHSPFIDYVIVEDDYVGSEQCFS EKLLRLPKDALPYVPSALAPEKVDYNLRENPDVVNIGIASTTMKLNPYFL EALKAIRDRAKVKVHFHFALGQSQGITHPYVERFVKTYLGDSATAHPHSP YHQYLEILRGCDMMVNPFPFGNTNGIIDMVTLGLVGVCKTGPEVHEHIDE GLFKRLGLPEWLIANTVDEYVERAIRLAENHRERLALRRHIIENNGLKTL FTGDPRPMGTVLLAKLKEWASENQVQLEIAE >MS1580 unknown MSKKLSVAVLVGIFALLAFLYGQKNKAETQLLTLLQKQGIKVNSLDFSFL PHPTLTANKVRYLVPESSRLVAFEQVAAEFSGASLLLGDFKISNMRFNDG EIRSEPQSPPVLYSLNFSLKPAALYLNRLENLLHFFKTKEVLDGGNNQWL YELNLTAKNPSNDNLHFATTFKLLTRGIALKDTNASVDLNELTYSDNKQF TLTADKIYLTTQQSAVENYEFSAENLKLNNENLGRVQGEWLASGINPQGY LVNLTSSICNYCNSMIDVRSVNPQNSIIRFKTEFFPLETLLGILKLPVLL SGKSDVTAELYLSEEQPTIGDFNLNVLNGKLKGVNLLSLIGQYLPINYDE GKLKNLETGFIQYNAQFRWRGRNLHIDNMLLQTEDLILKGRGYADLQTMK CDAMVNIGVNDAQYKQLTLPIRFFDDCTSPQYKIEINKDFRHQLRNFIKD KFN >MS1429 unknown MKKLLIALLLGVTTMAQAEYRMSLFELTVKPENQQAIEAIGKHNLGTSIQ TEAGTLAMFHTVKKDEPSKNVILEVYQDDQAYQVHSQAEHFKQFVEVAKT AVIERKAEALNSQFLAEKRPLADFENGNYLINLATVRVKSAQNDAFKAIV VDEMKQAMAKESGVLLMYAATLKEQPNEWRFFEIYADQAAYAQHRQTPHF QAYLKGTNGMIESKGVVELQGKTLVTKGVFQSK >MS0830 unknown MKREVLLNKKVSKKPANHTDFSSKEKHFFVLYFN >MS1196 unknown MENLSMKYQKLENQEAHWKWLYLIKKNREGENITRYEERSLQQSKVHDLL ESQNYPEKIEEWIANHMAESLIIKLDQAIRARRKRFFNAEKLSTKKKSID LEYGVWLRLSKYSKKMKMTLSETITYMIDERESKALYESQMSAMKAGLKD LLNK >MS1773 unknown MNKIRKLLSCRKGVSSIEFTLTVGLFFMVVFMILELARLTLFTSYWDYLL TESVRITKNQRAENNDYASLFRTVLEQQHQQQNNAVLAFFDVRDEKIDVK VEYAESVDDLVNEVFRQPTIVNGVAVSPTGADASIARYSLSYSYRFLVPL PFISEQWINPMFNREIFVVQEYERPSFRYNN >MS0673 unknown MDDIMSNYRRDFSPGATYFFTVVINQRSDGLLIKYINEFKQAYQDVVSYY PFETIALTVLPDHFHLIMQLPENDSDYSKRISSLKYNFSSLLPTYYRNMN LSRQFKREAGIWQRRFWEHLIRDDRDLDNHIDYVYYNPVKHGYVSQVMDW KYSTFHRDVKNGIFELDWGSYISESVRNLYLD >MS0735 unknown MMYYVIFAQDKPNTLEKRLEVRPQHLARLEQLKTEGRLLTAGPNPSEDGK 
SVTGSTVIAQFDSLAEAQAWAQQDPYVDAGVYGEVIIKPFNKVF >MS1957 unknown MNEQQFKNELEKLTEDKNRTYMYIIYGLFILAVVFKPLAIIGAVFAFMKR EELSVLAQTHCNYLIKTFIVAFIGSFLIFVPVIFWFIFAWYVYRVASGFQ NFYGNREVNGESWFK >MS1719 unknown MKWTDAQEIAENLYDLYPDVDPKTVRFTDMHQWICQLEEFDDNPEASNEK ILENILLRWLDEYE >MS0980 unknown MMILITYDVSLENEGGERRLRHIAKHCLDYGIRVQYSVFECEVTPAQWVE LKDKLLNTYDKETDSLRFYQLGSKWKHRVEHYGAKRAIDMFRDILII >MS0656 unknown MNNKDIKIIIATHKKHFMPSDEIYLPLHVGKLGKTDLGYQGDDTGDNISA KNPNFCELTGLYWAWKNLANDYLGLIHYRRFFSVKSRSERKNNPLETLYL TSEEASQLLEQYDVIVPSKRNYYIETLYSHYANTLHAEHLDVTRKIIADT CGEYLDSFDSVMKQRGGYMFNMFIMSKELVNDYCSWLFPILFELEKRIPA EQYSAFHARFYGRVSELLFNVWLKQYSQSKPLKVKAIPFVYGEKINWLKK GFAFLMAKFFGKKYEKSF >MS0402 unknown MLNKFRLNRRNFFVLIPLIKADKSAVVFCEDLTIL >MS0281 unknown MEIFTNAISYIDLNSIIAIFAAGLFGLFVGAIPGLTATMAVALMVPFTFF MEPIPALALMISVGASSIYAGDIPGALLRIPGTPASAAYVDDSYLLVKQG KVNRVLGLGLMSSVIGGVIGTIILALAAPSLAQFALKFSSFEYMWLSLLG LSCATLIAGKFITKSLLTLLFGILISTIGFDEFTGQARFTFGFVSLYEGV SFIPAMIGLFAISGAIEYYATRYKGQNPINTDITELEQTKNSLNLFKGIA KPLIKRKGSILRSSVTGTLIGALPGAGADIAAWISYALSKKTSKTPEQYG KGSEDAIIDASSSNNASLAGSWIPSLVFGIPGDSAAAIIIGVLYMKDMQP GPSLFLFQPDKLYAVFILFLIANLALIPLALIVVNFLKKIIQINKDILYP IVIIFSMVGAFAINNSPEAIIVMLIMGVIGYFLQKNHYPISPIILGMILG PMLERNLLASLTKSDGNLIAFVERPVSAVLGCCFLLVVILQVWGIVKNFN EKN >MS1130 unknown MIFNINYFLFSGKVRSKKFKNFDRTLILKFL >MS2319 unknown MIRHQLKSAVKNTALLCLLFLISLPAFSAKLAIVIDDLGYHPREDAAILA MPKEISVAIIPSAPYARQRNQLAYEQGRDILIHMPMQPISQMNIEAGGLS IGMDAQQVAHNVQQAKNIVSHAIGMNNHMGSAATADRPLMTELMAELRKQ HLFFLDSRTIGRSVAEKIAKESGVRALQRHIFLDDSDVYGDVQRQFQQAI HYARKHGTAIVIGHPRKNTVAVLRQGLANLPPDIQLVSMGSLWRDEKVVP PAPFILIFSDKPAPTSVAPFEPVPLLRGVPR >MS0613 unknown MKKTIIAAFVVSAGVIACSSPVENRPQAPLDMQTVRHYQNKVYGGNTVPA AQRVKEQPVVDTPMNVSDTRRQDRLDTRQTVRPGNVVIVPSIGYGYHHHR YRW >MS0991 unknown MTNTKKVYYAHSEKDLPHEQWQTFSSHAENVAKLAAQFAEIFDAYQLAYN TGLLHDLGKYTPAFDKRLHGGPSVDHATAGAKIAIERWGFPLGKILAFCI ASHHTGLVNGDGEGDNRSTLKQRLSVPFGKGNLPELDPIWQSELPLPEKL TFPALKPDPYYQPFALAFFIRMLYSCLVDADFLDTEAFYANLKQQDIDRG NAPSLDQLHQQFNRFISDFRERKKALQPQTEEEQRNAKLNRLRSQILDHA 
IAQAQQEPGLFSLTVPTGGGKTFTSMAFALEHAKKYGMLRIIYVIPFTSI IEQNAQEFRKAFGEFGEAAVLEHHSTFDDEKLLDKDTKDKLKLASENWDM PIVVTTAVQFFESLFADKSSRCRKLHNIANSVIILDEAQMLPLNLLLPIM QSIKELARNYHSSIVMCTATQPAIQTQHGFYRGFENVREIAPNPTALFAD LRRTSVQHIGMQSDKDLIDKLTENQQILIIVNNRRHARSLYEQAKQLDGT FHLTTLMCAKHRSQMLEQIRQHLQAGRSCRVIATSLIEAGVDVDFPLVMR AEAGLDSVAQAAGRCNREGKKLAEQSFVWVFQPEQQWKAPTELGLLSAAM RSTVRCYGDNLLSVEAISHYFSAVYEQKGKDLDNKQILAKCHAAGKTLDF PFQTIAKEFCMIESHMLPLIIPFDKEAEKRIEELRHAEKVGGLLRKLQPY TVQIPQKSLEALFKAGRIEAINEQQFGNQFYSLIGLDLYDEVAGLDWGDL GFITIENSVF >MS0685 unknown MSDCTMCCERDHRKGELKAKSAVENLKVLYKHLFLGIGSAKSRETEQNF >MS1605 unknown MLLEVLFYIGLIVEAMTGALSAGREKMDIFGVIVIAFMPALGGGLMRDII LGNYPVNFIANPHWVLIVAVTALATIFIAPLITHFNRSFRTVFLVLDGLG LILFSIFGTQIALEMGFGLTVASISAILTGAFGGVLRDILCNRIPLIFQK ELYAFIAFFTACLYIGLQHLGLSINLTVMITLTVGFIFRLLAIYFSWGFP VFDYHEEEMSPKEIMPRLPKRRKYKEK >MS0696 unknown MDALLSWEWFVPAVMFFSFFVLVFVGVPISFSIGIATLAAAAFMLPFETT LIVSGQKIATGLDSFSLLAIPFFILAGSLMNSGGIATRLINFSQILVGRI PGSLGHTNVMANMMFGSISGSAVAAAAAVGGTMAPLQAKAGYDPAYSAAI NVSSCISGLLIPPSNVMIVYALTAGGISVATLFMAGYVPGILMGFGIMAM NYIIARKRRYPVSDKPTFAEVVKYSLDAVPSLLMVVVVMGGILGGVFTAT EASAIAVVYTFILSVIIYREIKLTQLPKIILDAIVTTSIVLFLIGVSVAM SWAMTNADIPYMVNELLISVSDNLIVILLIINMLLLVIGIFMDMTPAVLI FTPIFLPIVKELGMDPVHFGIMMIFNLCIGLCTPPVGSALFVGCSVSGVK LQDLIKPMLPFFAVLVITLLMVTYIPQLSLFLPGLFEL >MS1713 unknown MGENTLVEVNNLTFKRGERTIYNDLNLKVQKGKITAIMGPSGIGKTTLLK LIGGQLHPEQGEILFEGKDICQMSNSELYKVRQRMGMLFQSGALFTDIST FENVAFPIREHTNLPESLIRQIVLMKLEAVGLRGAAELMPSELSGGMARR TALARTIALDPELIMYDEPFAGQDPISMGVIVSLIKRLNEALNLTSIVVS HDVQEVLSIADYAYIIANKRVIAEGTAEQLLQSTDPQVVQFINGQEDGPV HFHYPSQDYEEELFGRGINK >MS1420 unknown MLKQMFITAMLFVATMAYAETVRPAYYVAEFQPTDREGIKAYSAQVESTF KPYSGRFIVRGGEADVKEGFGVQGRLVIIKFDSLKQAQEWYNSSAYQKII PIRQRSGNSRTYIVEGLPDNNSK >MS0751 unknown MSLKTSNLAFKERVNHEVNNEIMRKAVVKAQETIGANRQKMVDELGHWEE WRDLAKQIRNHVLQNLDAYLYQLSENVIKNGGHVFFAETAEEATNYIRRI AREKNAKKIVKSKSMVTEEIGLNAVLEQDNIQVIETDLGEYLLQISGDKP SHIVVPAIHKDRHQIRKDLHEKLGYEGAETPEDMTLFVRKKIRQDFLEAD 
IGISGCNFAVAETGSVCLVTNEGNLRLATTLPKTHIAVMGMERLAPTFQE VDVLITMLARSAVGAKLTGYNTWLTGPRLEGETDGPEDFHLVIVDNGRSD ILASEFKEVLRCIRCGACLNTCPAYRQIGGHGYGSIYPGPIGAVISPLLG GYEEFKDLPYACSLCTACNSVCPVRIPLAQLINKHREKMVAQNLRPPLEK LSILGFNFANSHPAVWKVGVNMGAKLMNKLIKDGKAPISVGALGEWTKAR NLPQSDGESFRDWFKKRGSN >MS1047 unknown MHSVLFSYHINSKYHKGNIMLCAIYKSKKKEGMYLYVAKRDYFDEVPETL KMAFGTPNFVMLFNLLGEKKLVRAENQEVLKHIQEQGFYLQMPPKQESLF EQFKAEQKAKQTKNKTALKVR >MS2060 unknown MAESFSVERRFFDDKNYPRGFARHGDYTIKESQALEQYGQAFKALDSGER APVTDEEELFVSFCRGERPAATFFEKTWNKYRSRISATKRVYTLSGVVGD GLDEFNAND >MS1447 unknown MKKRLILLCTLVLLGGCTVGGGFGVGNNGAGVGISTGIGF >MS1999 unknown MFYINNDNFKREPYRGYCPKHLEKHFTKPRLDIRGNGYYSPNNEYVVAWY ESDFIFKSDKAQAHAKVKFVNLTCPKYTGKNDAEYLSVLYISVPLNYLHY FTIGSIWKGGVAKEQFAFEEFYITVEAENQDGKRENLSIVSFGESEENGL EKPFDYTIYTIPTEYHNYGDDLNRLLSISYQNQKFIIHPLHIFMMHYGYS TDIKRILATYPLDEVRERLFIDKVVENFDVERYVVLPKYFVKKDAIFLYH LKYDEETTGIRVKNLVSQFRLNVRNQKHPIEIGFWHTQKVELKIRGIRLG NAVLCAEIIGLNQPEGEDITLVLSQSKREKSGNKTEEQNNEVNNVPITRV YTREPELDELPLTDNSPDNRTVEYNKRQFELLGRQRQIRALKKAREESNQ KMKALTPDEIDSLGVGESDGRNGKVGLAFCFLDDTPVGKSSKLYKLWQHA KAVAEWNHGEAHWYTPNLGFRNDGELLPVSLETNKCFDYPEIAIIIRLQI YGETFFLIDFSMKQRDISIRGLGYKPDPNEDFLYESDMSKSELSELLSAV HHHEHLPKEYIEKKNKEKTKLTTFNHSEAESSNWAYNAVQKLTSRAITKP TYF >MS0681 unknown MKKRGKKPELDWETEEQEEIIWVSKSEIKRDAEELKKLGAKLVDLTKTNL DKIPLDGNLLEAVELARRSVKEAKRRQLQYIGKLLRNTDVEPIRDALDKI ENKHNQQQAMLHKLELMRDELVSKGDEGLVALLIDYPQMDRRHLRNLIRS AQKEKEQNKPPKAYREIYQYLKDFIIEE >MS1780 unknown MNYRVLFFISFLILAMGLSGLFFMLPEESPNTPQQQTTSEKTAPKSQISI LIAQTTRQIPQGTLLQAEDYALSELTVDSDDARANFDLKAWLANNENSSL QGYLAKQTLQVGSFLSPDLLLSPQHPDYLLSSLDPMQEVAYRIYIKAENG YIFDTLRAGSHVSVGSQQIAGGKNNKERTELIKLVGDTVVLQSKIYGQDE KLMDRNIVGYISVKLNAQQLQKFYSLPKGANLILFPNNIWKQGEPNHRGI FIRELRGQ >MS1392 unknown MRSKNEVFLQAVGFGRNFANGTLLYTPNNILEKM >MS0119 unknown MTETSEKSTALSNESYLRHSMVIMDKWGNGEAYDEKLIVDRGKHCQRTMV ESMLEFGRVLIILKEHMVHGKFQETLEHEFDVTPRAAQKFMQATLKFCGE GLQDTTPKLVQLGKSKLLELVTQDDDDLKELAEGGTVAGLKLDEVDRMSV QELRKALRNAKAEKEAMGKVLANKDNKINELDVELAKKKKDIETRSPDRK 
GGDLRKETSQIAYGAEAILRGQVRPAFDALLEHTEESGMDHTQFMSGVVA EIELILIELKETYGLNDVPSVEADDWENQSDKSLGSVLDEIIADQQAM >MS0073 unknown MKKVISLICASILSLGIVGCDKKIDSTNVAQSDDSWPQKTITLIVPFGAG GDTDFHARNLASHLEKELGARIIVNNVSGANGNAGMKQVVSAKPDGYTAL FFHESMLTNKVVGLAQQAHEALSPVAATIVDDSYVIAANAKSGLKNLTDL INKAKSEPGKLIYASSVGGYSYYLGRVLEQKTGIDFNIVDAGGGSDRNAA LLAGKIDVNVNPYGVMKSYFDSGDFIALATINNERNKLFPNVPTAKEQGF DWNAERYYFLSFPKGTDEKIISKMEQAIKTVVDNPDFRKKTEDAYSVSPT FVGTKDLLNHLDSALKEFETNKDLVNN >MS1651 unknown MSMQIKSIQLAFSVLYHYFDECVKKALSNLPIQRVDIPDSLLAQAESLVL GKLPSEKKKDLPLTSIFEGISQQNNSQQYLYDFKPLSPDSIFPDLQRNEG HQPFELWQHLAKAVEEIPTSHRENINLWLDHFDTALQCYTSQITCPYDQS ISFYDFTKAVAAFVVASMDKSADKNRPFLLIQGDFFGVQDFIFSGGSQSN KQAAKLLRGRSFQVSLFTELAALKVLNACDLPATSQMMNAAGKFLIIAPN TPEIHKKLDDVQKELNEWCIKNTYGLIGLGIAKMSAGKVDFEQKNYEKLI KLLFENLETQKLKRLDLTDTTQSVQEESYPNGVCEMNSFFPALPNSNRSI MTEDQVKIGELLAKKQRIIVCDVGTEINNSYRTQTLKLDMFGYNVIFTDS RKDTKDFGHPVKLYQIHRFWDFSLAKNTKDELWNGYARRYINAYVPFDEQ EQIKTFDEIAQADEGINALMTLKGDVDNLGTIFQKGIQPANIAKMAALSR QMNQFFSLWLPAYCAEYSPNMYTVFAGGDDFFLIGPWHSTQKVAFEMQQA FKRYVAENPEIHFSVGMVMSKVGLPVPRLGDLAEMALEKAKSIDSGKNAV TIFNRTVKWTDWQQLCDLEDEIHRLAKDYNISTSYLYSLIRLCEQANDKN NIESTMWRSHFYYRTARYVIDKLNQEKRDKALNEITISLGENGISQYKIN FAIPLTNYFYQKR >MS1630 unknown MRYFIGSFTFLTANGIIFLTIFRIQPLFRLSALILILLLFSAFISVGLAL TYKLLKSFINSSILNRTLRAVYPIGMLILVGLSIYNAYTPKVIHYQIELD KPLKAMRIAVASDFHLGKLFGSEQIDKLARIIEREKADLVLLPGDIMDDN LNAYLAEQMSSHLAKLKAPLGVYATLGNHDFFGQQQAIADEINKTGIKVL WDEAVTINNEFVIVGRNDDLNKARPTTKRLLQNVDTNLPVFLMDHRPTEV TEHSALPIDVQVSGHTHNGQIFPANLIIKAMYRLGYGYEKIADGHFFVTS GYGFWGIPMRLGSQSEIFIIDVKGKN >MS0536 unknown MKNNPLKTIDRTIKVRSVFSKFFKVPYATKISR >MS0190 unknown MKKDRTLNKSAVFFCEILLPNQAELPKLRLNFPAGKVSFFFLFFHFKGCL RATKGLHNVK >MS1044 unknown MHKLIIIRGHSGSGKTTFALKKIAEFKRQYPVGHVFHIENDHYLIENDKY IWTEQRFRQARLQAQKTIYRAFRFCRKHNAPDCLIVISNVGVNKQEIQCF VHQAEKQNMQVEIYRLRHFYPNTHHVPEDTVMSMYRHLCANPIEGEIIID >MS2198 unknown MKLKSLFCLCLALPLMAAANNEAPQNPQNIVSFSAETEKEVPRDLMQVSL YLHEEGNNLKNLNKVIAEKLNKGLTLIKQQPAIEIQSNNRQTQVRYNNKN 
QKDGWIATAELVLQSKNFSQLSQLIEDLSPLFAIGNIEAALSKEAIVAME DEMTDSVLAKFQAKATLIQHSLQAKGYRLLDINIDSLNEHYASPMVNHVA MKMAVAEQAAPVQLESGKTRLKAIARGRIELIKE >MS0123 unknown MNECEQIKQVWKQEYDEAAEAAAKTERSGNYYQAAELWKKAKEKALNLSQ KEWCKRRYQYCISWASRREK >MS1247 unknown MKKVSLATILALSTMGMAFFANAADNVQTNAPAANAPAYEVMPCGMVREY NPMCNGAMCNRGYPDGRANMRRGFKNMPGNAPYMGMQMGQRGGFVANQSV TRVADAGKWEDDQMIVLEGNIIKRVGRKDYVFKDGSGELEIEISRRAWHG DIFSADDRVRLVANVEKSWGKTEVLAVHIEQIRPDVAAPSQGNKTGNNQ >MS0192 unknown MSLFSLWVMAFGLSMDAFAVSICKGLAMEKFQWCGALKAGLYFGLFQAVM PLIGFLLGVQFSEYITDYDHWVAFFLLALIGVNMLRESLSDEDDEDSCSN DFNFKTMMTLGFATSIDALAVGVTFAFLSVDIYSSVVTIGLITAALSIIG VKSGHFLGKKIKTKAEILGGLILIGLGVKILMEHTLFG >MS0018 unknown MNNQKALRELTLRGMILGALITVIFTASNVYLGLKVGMTFASSIPAAVIS MAVLKMFKGSNILENNMVQTQASSAGTLSSVIFVLPALLMMGYWQDFPFW QTLLICVSGGILGVIFTVPLRNVMVVKSDLPYPEGVAAAEILKAGDEAGK ESGVKEIMAGGIIAAVVSFLTNGLRIITDGASLWFKGGAAIFQIPMGFSF ALLGAGYLVGMMGGIAMLVGTLFTWGAAVPYFTATTPMPADMGIADFAMS LWKSKVRFIGVGVIGIAAIWTLLVLMKPMIQGMSQSFRALKDKNNINLDR TSQDLSPKAMIYTILGSTVLIIIALVSFLQPVGLPTSTTFLFVVLCTLLA VLIGFLVAAASGYMAGLVGSSSSPISGIGIISIVLISLVLIVVGHSLGLM DSKDGQRFLTALTIFTSAIVFCVATISNDNLQDLKTGYLVHATPWRQQFA LIIGCIVGALVITSVLEILYHAYGFAGAMPREGMDVSQALSAPQATLMMT ISNGIFSDNLEWTYIFVGIGFGLSLIIIDTLLKKSSQGRLALPTLAVGIG IYLPPVVNVPLIIGALLSWLIQRHLRHYAKRSGKDISELNKKAERFGTLF AAGLIVGESLIGVIMAFIIAASVTSGGSDAPLALELADWDSMAEILGLVA FIIGIAIFTRRVLKAKKA >MS0106 unknown MAKRIRKTQMKYLKKLRKRWQGWRFAKQNPVVAESRSYITLALKLGVENP VMRKRRRNP >MS0851 unknown MSDYRVEYVIAKFIAIANYVNMLFGRCLYAKKIIFRFLIFARHRMGSILC GL >MS1989 unknown MFKTYFELKRTNHGTKLINIAQIIKRKKSNI >MS1360 unknown MSLTLLALDTSTEACSVALLHHGEKTHLDEVAQRSHTKRILPMVDEILAQ SGLRLNQLDALVFGRGPGSFTGVRVGTGITQGLALGADLPVIPVSDLAAM AQAAYELHQAEQVITAIDARMNEVYFAQLIGEKVRSEFGEFLQWNEVIAE QVCSPEQAIAQLRANRTQGDWLNVGTGWAAYEALTKTPFGKISAIQLPSA LYMLSLAVPAWYNRQYVKAVDVEPVYLRNEVTWKKLPGRE >MS1782 unknown MLTNLTTKTYIATTEAIRRFKQDHKGVTAIEYGLIAVVMAAFIVYVFADD TSFVQSLKEKFSDVSKSVGNATFKE >MS2340 unknown MAGLFTQPGRRRYGVAIFAGIIGGLISAFVKWGAEHPFPPRSPIDLFTAA 
CPQPVLDALNSGAIVMDQALQQCSRAFLNPPHVFLRDVFGIDPTAPAFMF ADQAFNWIGVTHITFSLVFAIGYCLVAEVFPKIKFWQGIGAGLIACVVVH YIVFPAMGLTPPVAEWPWFEHVSEIVGHVFWFWSIEVIRRDLRNRITHEP DAEVPLDQPYR >MS0733 unknown MACIIADFPQDEKYKGRNIKVRSKNGNFQQKT >MS1404 unknown MTTYSQPAIIWPEKYTPGETDNYASNEVIIKDLSVLDVWEYLIDTKAWPT YYNNAENIVVGDGSQTKLAANATFVFDTFGFHVSSKVEEFELSNDGNLAR LAWSGTFGEGDEFSDVYHAWLIENLPNNRVRILTEESQIGKLPQQLAQTL PNPMINGHQAWLVGLANSAKNKTSY >MS0982 unknown MIIRRNMMSLPQVYSLDLGRKVNAKEADIAYQKGLIRSQKNFRCPHQLCG IAITCANLERPKQERKVDPYFKSVEYHKPSCPFAEEERRIKLHEADKNSL YENVASGEILVNLTEPAPKKQDSSDISEVEKGSFSRATQSSDSEKEKASI NHTKTLSVLVSSFLNNENFQITLPKPYQEKIFLKDAFIKIDGQNLSNLEQ NCWRIYYGKAWINKLSNGDYRIVFDNKMKDPDLRKNAVCPSFFIPKDWID NSPYEKFSKSQMDKLADNKWHREVFIFSDVPSLSHTKEYINFMLEGLPFL EMIYLKK >MS1541 unknown MREAKSAILFFTGLISNIFLFLSCQNKPQL >MS2314 unknown MPFTFAHPVTVLYFPRNSRYFHFPALVLGTMSPDFMYMLHWKTDVGGHTL FGSEWVNLPLCLLFYAVYRLILATPIKQHLPAFCGSNVPQVTFKNPLAWL IVFLYSAWIGMATHIALDELTHDGGYFVQLFPILQTKIIFHIYDWLQYGI GAVGLISIILYQRRMAGKYPYRSSRSAKQKWFFWLSVVSLTVIIFYLSNP LYPLVWNEVASIVLRIINSFFISLTIHGVIFTVMKKRSLKIG >MS1216 unknown MSKMARTTRSCLHLWYVVYHSFFLKWIVMSLFSRIPFDKKLIESAIARFE QESSAELRVYIERNLPESENLSCVDRALQIFMQLEMDKTQAHNGVLIYIA HKSHKCAVIGDLGIHQFVGDNFWQQQCQLMISYFKDDEYTQAVIAAIESI GKELAIHFPVKPDDKNELPNEVIING >MS1234 unknown MKMNKKFAQIVKNPAFRNMVLKTIFNVTNVMSATKYLR >MS1401 unknown MSVLHFIGIDVAKKKFDVAYLKDKERQMVKTKVLDNKPAGFNQLLDWIKK NVSNDFSTIHITLEPTGVYHEALAYFLHDNGFVVNLINPARLPKFAEYKG FVHKNDRGDCKLLALLGAENPHEYWQPEPLSIRQLKAKLSRLEALKSDLL RENNRLEQAESGNLPDEVLQSIHHIRKALQDSIKALSQDIDDHINGNPEL KKDKALLKSIPGVGDVITKQMLVVYHSKHFQKAADMAAFLGLIPKERTSG TMKGKIMLSKRGSPQIRALLFLPAVAAKSYNPDIKAHYERLLAKGKTKMQ AIGAAMRRLVHICFGVLKNKSVYQPQTILA >MS0229 unknown MTWTYILSILAIIIIVAMAGYSIYIFRELHKQKRRFAQARQARIARLHES ITIIAKAMQSGECNHSEGVIRLKMLLEPLGQKEIDAYHSMHRLYQTVKDM PTHDSRRALKRNERMKLDLARETAEAELEEKIKSELEQLLADIASYQKIK Q >MS0690 unknown MVSFPQIYLFSAIALFQSACNQHYNLVYHFTNEFQ >MS0214 unknown MLKINYERRLVMKEVSSKSLNDDELALLDNLLLEYANEESDEGIFTLSEL DGYLTAIISSPMLIQPSTWIPAIWDNDLPEWENEQEMAMFFDLLFRHYNS 
IIMMLQTGLEYYSPCFEYSNFTDGDYPIVDDWCFGYMRGVKLADWQNLPT KLQPYLKLIEDQTHLHSSLDDYVSPSLQEQNELADRLIEAAVKIYRYFR >MS1833 unknown MKIQLIAVGTKMPDWVKVGFEEYQRRFPKDMPFELIEIPAGKRGKNADIK RILEQEGKAMLSACGRGKVVTLDIPGKPWTTDQLARQLESWKNDGRDICL LIGGPEGLSPECKAAAEQSWSLSPLTLPHPLVRVVVAESVYRAWSLTTNH PYHRE >MS2217 unknown MVCYSPICSYNYINKCKAVLLAPLFFTPESKCTQKSAVKNFRNFYRTF >MS0130 unknown MKQNTVAKNKEISMLDLYPLPQDPFQIATVVLAVICLIFLFIMARRKRDV QELQQDLNKNILDFNQLLEKFDTLTAAKNQLDQDVIKAQTTAEGLQIRLQ ERNELIQGLQTELNEEQLRHETLTGSMNTLKERFGVASALVTNLQQQLVE SQNAVARKEQDLNKIQEKTTALSQELTELKTTLSEKEKNFAEQQQAFAQS KQQLSAEFQNLANRILEEKSQSFSQSNQIALDALLKPFREQIDGFQKRVN EIHSESLKGNANLESEIKRVLNIGNQMSQEANNLTSALKGEKKTLGNWGE MQLERALQLAGLVKGEHYEAQAHFKDAQGKNNYPDFVVHLPDNKHLVIDS KMSLVAYENAVSTDDENKRQHFLREHVKSVRNHMDDLWRKDYTNLIGMRS PNFVLMFVAVEPAYIEAMKADLNLFNYGYEKNVILVSHTTLMPILRTVAN LWRIERGNAEAREISERAGDIYNQICLVAERLAKLGNTLSTVNGHYNSAV TALVGNQGLVGKVERFKDLSAKANKAMPAVEMLHSDLDTEKLLVVKAED >MS0009 unknown MNNNYKKISPILTAVLLSACTAQVPLPKTCEDFINEYAKLSVDTKKIIPE TLLGEDMRDYILADRYTLREKYQDSVNSSYQSIKTNLGRNAAEMSLKAIE QSCYIGTEQIKALDFMQ >MS0930 unknown MCSSSHKENKFVKIYISSGKFICQMAKVLLFISPFKIPEQNLEVLC >MS1731 unknown MWVKTPNKTGYNDLNALSVQIMKTALLILLKGLNN >MS0517 unknown MDSIPLSTLFITLFILLILSAYFSSSETGLLSLNRYRMRYLAEKGHKGAK KTETLLKKTDKLLSLILICNNLVNISASAIATIIGMRLYGDMGVAIATGV LTFVMLVFSEIYPKTIAAIYPEKVAFTSSHLLILLMKLFSPLVFFMNIII QGLIKITGLKTETKAHSISPEELRAIVNESGKFIPSAHQKMLLSILDLEE VTVDDIMVPRNDISGIDIDDDWKAIMRQLNHAPHGRVVLYKGDMEQNVLG MLRVREAYRLMMDKNEFTKEMLIRAVDEIYYIPEGTPLTAQLLNFRHRKE RIGLVVDEYGDIKGLVTLEDILEEIVGEFTTSTTPSINEEITKQSDGSMI IDGSANIRDINKMLNWHLNTDEARTFNGLILEYLEEIPQEGTVCEIEGLQ ITILEVSENMVKQAKVVKL >MS2066 unknown MKKTLLAVAIGGAMFATSAAAVDFHGYARSGIGWTSGGGEQTALKVNGGG SKYRLGNETETYAEFKLGQELFKDGNKSIYLDSNIAYSIDQQVDWEATDP AFREINVQFKNFAEDLLPGATLWAGKRFYQRHDVHMNDFYYWDISGPGAG VENIDLGFGKLSLAVTRNTEGGGTATYGQDKVYYIDNNGQIQYRYEDRKA DVYNDVFDIRLAELNVNPNGKLEIGFDYGNAHTKNGYHLEPGASKNGYMI TLEHTQGEFFGGFNKFVAQYATDSMTSWNTGHSQGGSVNNNGDMLRLIDH GVVQFSPKVEMMYALIYEKTDLDNNQGKTWYSAGIRPMYKWNKTMSTLLE 
VGYDRIKEQSSGKKNDLAKVTLAQQWQAGDSIWARPAIRVFGTYGHWNDK FNITDRTNAGYKAKDAEFVAGVQFEAWW >MS1650 unknown MNIKFGKDKSPEIFSSIAEQTAEQIKSNKDKNKTTQLRKFYDELAMWNER VQLAREDKEAKFQELVPFIKMLKAKVAYAEGRKHIDKNFSDVFNRCIDQA NNAETLRDAKLFMEAVMGFCKLEELKR >MS0717 unknown MFYLAWVVGVLLAILASVMITIRIEKSGKFDE >MS0161 unknown MLYTFSKADYAPRELADLLARLTTQDAVLLWQDGVLLALKYGDYFVKHSS QVYLFEPDIRARGLSALIQQKNKSFNRIQMPQLVQLTTRYFPQLAL >MS1147 unknown MLFSLIKYGGLFYYKFYKNLAKFTALLCLINTA >MS2362 unknown MGTGFACGGWALAWTVYIFNKGKYHPLVRPALLASLFGYSLGGLSITIDM GRYWHLPYFFLPGQFNTNSVLFETAVCMTIYICVVTLEFAPVWLGFFGLK KLFKKLNKIMFFVIALGALLPMMHQSSMGSLMIVAGHKVHPAWQSYEMLP VFSLLTAFIMGFSIVIFEGSVVKASLAGQAPDERHLFSQLTKTAAVLIAL FLMFRFGELIYHNKLHYVLGLFKFEAWMFWAEVWLMTLPLLALFLGERRN DGRWLFVSALSMLLGAALWRLDYSMIMYNPGNGYKYFPSGQELLISIGFV SIEVCAYILIIRLFPVLPVLKEANKETSEYIIAEKAALSENLAQKNS >MS1001 unknown MIFPENRIIKATKFAQKFMPFIAVFSVVWQQFYAKSDLVALAIAVLCAIV ALCIPLQGLYWLGKRAQTGLPAQSAVKFFEISKLLEKKNVTTSQIERPTY QHLADLLAKAQKHCTKEFWEEL >MS0174 unknown MEYLIPKSAVVFEEEIKKSRFITYLRHTEGLVEAKAFWQDVKLRHPGARH HCWASVAGAPNNSQKLGFSDDGEPAGTAGKPMLSALQGSQIGEISAVVVR YYGGILLGTGGLVRAYGNGVQQALKLLETTVKIERQVYGLYCDYGQVNWL QLLCERYNVLIENQLFQENVWFQLAISDDKLEPFKQELTERSAGQLTIEP AE >MS0572 unknown MKSMRKFIKYFLLTVVFVFHVVLFAGINYVFPHYETTKITGVEVKRVDKD GPITKANPADGPTRDVYYIYTQQPDKQKPMVYRNEDTRWGFPFYFKFGSA DLQAKASTFAQDQRLVEIKYYGWRIVMFDEFRNAVSMREVTEDSGSHPIL SYIFYFLGIITLFFSIQLIRGWFDSEA >MS2005 unknown MDTTYFGRAFGIMVLYDSISKQALFVEAVKYETNALYAAALAELKAKNIE IQSIVCDGRKGLMQLYPDIPTQLCHFHQVQILNRYLTRNPKTDAGKALRQ LALSLKYSTQSSFQAAFEAWYRQHKAFLNERSLNEKTGKSSYTHRRLRSA YFSLKRNLPYLFVFEDYPDLDICNTTNLLDGKFADLKQKLRCHQGMKRDA KIKFIKDYFSYK >MS1865 unknown MGAKLLPYFDNANAISLHSFPRNVGEVQQGGGKILTISHCMTAPLFTRVN KKSDCFVIKVRSILEKFCW >MS0781 unknown MNKNSHKKDRTFMYGLRYCLICTANGDSFDF >MS0589 unknown MIDWDFSFNYSALFEISVKFTVKKTDFLIHLAQFFANSI >MS1294 unknown MQTLTGLRRLMVINVLVIYLAAVNIAAYFLMKIDKKRAKNKEWRIEEILF FSFCFMGGFIGIHLGMVHFRHKTKN >MS0052 unknown MKALAQFVGKAIETICVIILATMSVLVFLNVVLRYGFNSSINITEEVSRY 
MFVWLAFLGAILAFNENQHVSVTVFVEKLSPSAKKLLHLITDVIMLFCCY LIVDGSWIQFNLNLNNLAPISGLPQGITYLASTVAGFSIGILILARIATN IAALVKGETK >MS2158 unknown MKAMKKLSKILFIASALSLPTTMFAADTQSAAQTTQGVKQMTLSARQLSL AQIGAFTATGDMESLKTAVNQALDSGLSVNEIKDAMVQLYAYTGFPRSLN ALNALAETVKEREAKGLKSEQGKTATPLPANTDILALGSQTQTELTGQKV DISALSPEIDRYLKTHLFGDIFASDLLNWQEREIVTVGALSHLQGVESQL NAHIGISKKNGVNDEQIAAIKAIQPSGLPQLSQFPIGEPNDAYAQYFTGK SYLYPVSTEQVKMFNVTFEPSCRNDWHIHHATKGGGQMLIVTAGRGYYQE WGKPAQELKPGDVVHIPANVKHWHGAAKDSWFQHLAVEIEGENTSNEWAE RVSDEEYAKLK >MS0813 unknown MIKVMVDKLKFFYLKKRNFVFIFLFVGKMKLEPNYFLLKNCFNNA >MS0385 hypothetical protein MCKNTEYLFSRHFIHMHIILFILLIILIKFLPHFGVLFAFPLTAVFVAIL AMIVAALGPWEFYKFKTTVDPRHLNKTSMLVTSGIYRYSRNPMYLSLVLF LFSEILWLGNWLGIVGIVIFVTYLNLGQIKREEAALAEKFGKTYLAYKQR VRRWI >MS2307 unknown MLIKIFVHITLMFRKNTRFKLKKIYNARVIDFNPIQIRNSKGTDMSDEIE LKLAVSPRAADILVQEIARYPILAQKKTFLANCYYDSADGYFAHQKMGLR VRRENDRFTMTLKTNGNVLGGLHIRPEYNVELESDAPDLSKLSIFNETLP KLPADLQVQPVFNTDFERHIWLLEGENREQIEVALDRGEIKSGEKTEIIS ELEFELKKGNVADLLSFVAGLNLTDGVRLSALSKAKRGYQLAYNQSRKPV DWLDKWRDILKSEENHGNLTAQLKALFHHEQQLVEETVALKADYFARNFL TSVERIGAFFNLYHHYIEQPNLLGRIVNEKLAQGKNVDDSVISELTESNN YLFNQIRDLIRLHSETKDNLLALTKLIALLHEAGYVRRMLNLIRLTME >MS1900 unknown MKYLQNAGTRKKCGKKSGNFYRTFEQLSLIA >MS0113 unknown MYSTVKHIVPQGEKRNMTNTVKTANQIKLEFHQQGKTISSWAKENGYSRT DVSRVINGLAKGQRGKTLEIAVKLGMVIL >MS2168 unknown MLDIKPILAIKGTSMLIWNDIERNELMYHLVGFDLENSNVIQNKSNSDYI ATLADGQAEKNLYNNELDSGSTIPLVSKEQENDPVTDLVTPLAETLVRLM EVLGNEEKGIVQLGIELSVADKKNIRKTYIEPALKLGLIERTIPEKPTSP NQKYRKIKH >MS1446 unknown MATLEQNLQQMLQGSVEDLGCELWGIECQRAGRFMTVRLYIDKEGGVTVD DCADVSRQVSAILDVEDPIADKYNLEVSSPGLDRPLFTLEQFQRYVGQEI SVHLRIPMLDRRKWQGKLERIEGDMLTLIVDDQEQSFALSNIQKANVIPK F >MS0883 unknown MYRNKLLFKITDSFVTGVMTMQNFEYYTPTKIVFGKQTEQQVGELIKEQG CQKVLIHYGGNSAKASGLLDRVKASLDNAGIAYTELGGVVANPLLSLVYQ GIELCKKEQVDFILAVGGGSVIDSAKAIAYGVAEPDKDVWELYDRKRQAT ACLPVATILTLAAAGSEMSESSVITKEEGDIKRGYSNNLSRPVFSILNPE LTMTLPKYQTASGNVDILMHTMERYFTPHDTMEITDGIAESLLKTVMKNA QILAKDPQNYEARAEIMWSGSLSHNGLTNCGGGNGDWATHMLEHELSGMF 
GVTHGAGLAAVWGHWARYVYQALLPRFERFALRVMGVAPAENAEQTALKG IEAMENFFRSIDMPTNLSELGVNATAEQIAEMAKKCAIASKGCIGAAKPL YEQDMAAIYTAAQNA >MS1347 hypothetical protein MTAPPASWGSEILPIPIALTDLFHIQQRKITMKFETQCLHAGYSPKNGEP RVQPIVQSTTYTYDSAESIGKLFDLQEAGFFYTRLANPTTNAAEEKLAAL EGGVAALCTASGQAATFYALMNLVESGDHFISTTNIYGGTYNLFAHTFRK MGVEVTFVNQDDNLDELRKAIRPNTKAVFGETISNPTLRVLDIEKFAALA QAANAPLIIDNTFATPYFCRPFKYGANIVVHSTSKYLDGHAVALGGAIID GGNFNWEQEKFRQFSQPDITYHGLVYTRTFGKAAYAVKARVQLMRDLGAT PAPQNSFLLNLGMETLPLRMKQHYANAQAVAE >MS2140 unknown MIVEIYQDEAAYQRYRETAHFKAYIVQTKDMLLDKKLHELTGMTLMNKGR F >MS0522 unknown MKFYRTLEDFKVISFDLDDTLYDNSQVILDAERHSVDFLREISQIPQLDG GYWRYWKNKTALDFPLLAEDVTQWRIKTIVELLRAHQKSAVEIERISHAA MEDFFEWRHKMQVPQQSFEVLNKLKRQYKLAALTNGNVTPSRAGFDQFEL VLTGGVQGRAKPHQDLFRQTAGYFNVRPHEILHVGDNLVTDVQGAIQAGC QAVWINLSDKKIQHFSEATLVPTFEITDLNELLFFRNL >MS0065 unknown MSRLDMQKLILADDFTGANDTGIQFVKNNIKVDILLDISKGYSGKSDVLV FNTDSRAVSIQEAKERVTRVLSLYEGMSVYKKIDSTLRGNIGAEIEACMD ATNTLIAFICSALPDAGRIIKNGICYVNDVPLLETEFATDPKTPIISSSV KEIITSQTDIPVIEVMHDELCRPMVVNAKIKQAIAHNQKVIFSFDATTNQ DLVRIINLSNSLDESVLLIGSSGLAGCMTMRKAILPMLFVVASMSEKTTQ QVNYIRHDETNFVIDLDTELLLSSNQYNDSVIKQALAQFELGKNVIIKTD SSIEARNNVDNLSEKLALTRAELGDHICMKLSALTKEILIKNFYQLSAIF LTGGDIAIAVAKALNADSYHIAGEVENGVPFGYFLNSPLSRIPVITKAGG FGSDAVLKNTIEVIKNLS >MS1601 unknown MTDDINEIVKSAVNFQRISQKKQKARKDPNAPYVRPKLELPEGHNKLLLH TCCAPCSGEIIAAVKASDVQFTIFFYNPNIHPHREYLIRKDENKRFADKN NIPFIDADYDRDEWFKRTKGLEHEPERGARCTKCFDMRLERTALYAHENR FPVIATSLGISRWKNQEQVYDCGRRAAARYEDVIFWDFNWRKDGGSARSD KLRKEERFYKQEYCGCVYSLRDTNKWRESRGLGKIEIGTVYYSVDE >MS0091 unknown MSIAINQIVNANVYIDGNSQIGKAQQIKIPDIEFEMVDHKGLGLFGTIKL PSGAKAIEGGVNWDSYYPEVRAKLYNPFKNFQLQCRSNLQVFNAQGLAAE EPMVTIMNVSSVKIGGTDVESKENAKFDDTFAVHSIKQTVAGKEILFIDV FANIFRVNGEDVLSKYRTNVGQ >MS1036 unknown MTDKIYDLHCHSTASDGILSPSEIVQRAHEQGVQSLALTDHDTISGLTEA RRQAELLGVEFINGVEISTSWENKVIHIVGLNFDENSPEMTALLAKQAQL RLNRALTIGEKLAKAGVANAFEGASALAKGEVTRAHYARYLVQIGKVANE NQAFKRYLSQGKSCYVKAEWCDIPAAISIIKQAGGIPIIAHPLRYTMTAR WIKRLIADFKNWGGEGIEVSGCGQTADQRQLIARWANEFELLASVGSDFH 
FPCGWVELGKSLWLPENVTPVWSQFGDKPKYLQNTCKS >MS0033 unknown MEKPTALLTQPCLCQSGKQYTDCCAPLHTRQTLPANAEQLMRSRYCAYVL QLIDYIVETTVPSQQQLLDRTILQQWAKTTNWIGLEIVSHREKLSKIHSA VEFNAFFATDEGKQVHNERSLFVQINGRWYFVDPTVPLPNNKQPCVCGSG KKFKACCGGLL >MS0232 unknown MVLMPAGLSIKSAVKIDKVLYNPRFLYFPPYKRISWH >MS0224 unknown MMKFNRIRNIFMKNKLIFWAALSGFFSIAFGAFAAHGLSKILEPQALNWI DTGLKYQFFHTLALLCLGCFQLLYMPQANVPACRYRLLNLIGFSWFAGIL FFSGSLYALALGGAHFLVWLTPVGGIAFLVGWAGLIWLSLRH >MS2238 unknown MRPLINPTKSLIVFSFSELNSAYSKGQYNAIQAKFANV >MS0617 unknown MEISTLQVILVFLVSCVCGAGSILDEFQTHRPLIACTLIGLVLGDMTTGI IVGGSLELLALGWMNIGAALAPDAALASVISTILVIVGGQDISTGIAVAI PLAAAGQVLTYVVRAITVGFQHAADKSVEDGNLARLDWIHFGALMLQAMR IAIPALIVALTAGTDVVQTMLNAIPPVVTTGLKIAGGFIAVVGYAMVINM MRAGHLMPFFYAGFVIAAFTDFNLVALGVLGTIMAVIYIQIHPKYNKSQQ VVVAAASNNDLDNRLD >MS1372 hypothetical protein MYLPSLAIIGGLILLIWSADRFVDGATATARAFGMPQLLIGIVIIGFGTS APEMIVSALSALNGNPGIALGNAYGSNITNIALILGLTALISPLAVNSQA LKQELPMLIFITAISALLIYDNEVSRLDAFVLLFIFFIYMSWSIINGLKN KNDSLAREITEELAEQEEMSLKQALMWLLVGLVLLMTSSQLLVWGAVEFA HYFGVSDLVIGLTIVAVGTSLPELASSLAAAKKDQVDLAVGNIIGSNLFN TLAVVGIAGVISPMQIGPEVFNRDMLVMSALTVALLIFGLGFGRSKKAGK INRFEGLLFFVCYIVYNLYLFQTAV >MS0125 unknown MTIHSAKLQLVVTADKDDLNIKTGVDCYDLPHQLTEIMSDLLVKIPVLIR SAWFYITDNYADAENGFDVTLTFHFEKEQGDDWSASAKSTHPGTVEDLLL GMAKMIFQEDPIIDELIEKELEELDLPEYVQHFDPTC >MS0128 unknown MTIAKFDNEDFRNKAPDLLADLAKHSVNIIKQHADIEDDLAENIGMLIAM KIGESWGGLNIYMPKAQTLFFCEREKQIYNDFTGNNHAYLARKYKLSLQC IYQIVKRVQKDEINKRQYQMFRED >MS0134 unknown MIFLLHLGTFMYKTFKKLTALLGALAALSACHSAVQPAKTVFLAGATGVI GEPLGKALVAKGYHVYGTTRSAEKAKQLEADGITPVVLDIYDAAAVEKAV VNAKPDVVISQLSSLPKGLKEEEMAEGLKRDNRIRIAGTRNLIAATEKAG TPKFITQSFVFYAESATPPIEESALLSTKDPVYGESTAAMMNLEKQTLAG KFTPVVLRYGWIYGGKSGFNAPIEGYSTIHIDAVVDATVRAVEADLKGIY NVSEASPFINIDKFRKAVPGWKDK >MS1955 unknown MKYAFIDYENLHSLDGLELQNYKRIFLFIGANQTNIRLTEKFDDEINVTF VTIKDVSSNNVDFHIAYYLGKLDATVDKNIEFHILSKDQGYNGICSFIRH QRENRHCSRIAPAVSEPLALPKPDESSKQKIEIIFKEYKSFMVKREKKHL PTKTQSLRNNIHNQTSLKGLEKQDVNNVIIKVINKLSQEKLLKITDSKVS YP >MS1980 unknown 
MLGVIADDFTGASDIASFLVENGLSCVQMNGVPKAPLADKVDAVVISLKS RSNPVNEAIEQSLNAFNWLKANGCSQYYFKYCSTFDSTEKGNIGPVTDAL LDALNDDFTVITPALPVNGRTIFNGYLFVGDTLLSESGMRNHPITPMKDA NLMRLMDAQSKGKTGLVAYSDVIQGAARVKERFAELKAQGYRYAVVDAVD NAQLAVLAEAVADLKLVTGGSGLGAYMAARLSGGQKGANAFVPAKGKTVV LSGSCSVMTNKQVNAYKAKAASIYLDVESALTNANYADELYREVVKHLDE PLAPMVYATVPPEQLHEIQAKFGGDKASHAIENTFAKLAQRLKNEAGVVN FITAGGETSSIVVQQLGFTGFHIGKQIAPGVPWLKALDENISLALKSGNF GKEDFFEYAQGMLL >MS1026 unknown MITETLFNAENITANSPQLEQLKQLFPNCFDTSGHFLLEKFQAEIAQHTD ISHEFYSMNWLGKSYAKLLRNLPPETLLAEDVEHNSKEENAHSQNVLIQG DNLEVLKHLKNAYRNSVKMIYIDPPYNTGSDGFVYQDDRKFTPEQLATLA NITPDEAERILNFTDKGSNSHSAWLTFMYPRLYVARELLKEDGVIFISID DNEVAQLKLLCDEVFGEGNFVAKLPTIMNLKGNNDEFGFAGTHEFTLVYI KNKNSVEDLNGIPLENEDLAEYSKEDEIGKYKQGATLMRTGEAGSRNARP KGYYPIYVNTELTRMSLERQKEDDFEVYPKTTKGKDMSWRRSPETLSKTF SEFIIKKTSSGISFYKKQRLEEDLEKGKKPKSLFYKPQYSSGNGTTLLES LFGKRIFNNPKPIELLKDFISIGMGKNDLILDFFAGSGSTAHAVMQLNAE DGGNRQFILVQLPEQTDTKSEAYKAGYKTIFDITKARIEKSAVKIREDFP DASGAKSIDSGFKIYQTTDNFNAVAEDEFNPNQAQLPNLTSLTESQIQTL LTTWRVYDGAKLTEIVQAVDLGGYIAYLCDKRLYLLHEHFNSQHLLTFIQ KLDNDTAFNPNRVIVFGNHIESAMQQELNQALASYSNRKNISLSLIVRA >MS0707 unknown MEKIVSLLIIPAIWVKKCGYFRRSFDFFQL >MS2263 unknown MHIIHNFPCLLKRTELSAVRILLLYWDNYRNFNILQNSEKNDRTF >MS1018 unknown MEKIMENQSFLQNFFKLNQHKTSTKTEIIAGITTFFTMVYIVFVNPSVLG DAGMDKQVVFVTTCLIAGFGTMAMGLFSNLPIALAPAMGLNAFFAYVVVG KLGYSWEVGMGAIFWGSVGLLILTLLQVRYWLMASIPLALRVGIGAGIGF FIALIGFKNMGLVVANPATLVALGELHDPKVLMGILGFFIIVVLAARNIF SGVLVSIVVVTALALQFDENVIYRGLVSMPPSLDAVVGKVDIAGALDIAL LGIIFSFLLVNLFDSSGTLLGVTDKAGICDERGRFPKMRQALYVDSVSAV VGSSIGTSAISTYVESGAGVSVGGRTGLTAVVVGVLFLLTIFFSPLAGLV PAYATAGALVYVGILMASSLIKVQWEDLTEAAPAFITAAMMPFTYSITEG IAFGFISYCVMKVGTGRWKEVNAPVWVVSVLFLIKFIWIG >MS0117 unknown MKRMLEKLAMWFLHRNGYIVRSKAECNLVPDFVLRMQEKQVKPIQPVDWA EEGETK >MS0285 unknown MKIFSDKITHFVSVWLLKAVMFLALLISSPAIAESAVELKVEGIANEKLR ENVQLYLATLDKEDADGSERYQNKVKENIDKALRVYGYYGSTVAFNQQPR SNAPDLLIARVDIGKPTLIEDTDIVITGDALHDEYFKRLEKKVPAKGTVL DHETYEDYKTELQKLAVQRGYFDADFPVHQLQVMPSTRQAWWRMDFNSGS 
RYRYGEISFEHSQIREDYLRNMLEIKSGDEYLINDVSNMTNNFSSSGWFQ SVLVRPELHEDSKTIDLHLLMYPKKKNAMEVGLGYSSDVGARAQIGWTRP WINNRGHSLHSDLYVSSPKQTFEITYKMPLLKNPMRYYYEFSTGIENEDD TKTDTKSLAATFAALRYWNNATGWQYSLGTKIRYDEFTQADQEHKTFLLY PTTSVSRSRISGGLFPIRADTVSATVDLGRKLWLSDVDFFRVRANAGWIK TFAPNHRFLTRGEIGYLHTNELERIPPALRFFAGGDRSVRGYGYKKISPR NSKGKLIGASRLATGTVEYQYQFVPNWWLATFADAGLAANSYSTSELRYG AGMGVRWASPVGAIKFDIATPIRDKDDSKNIQFYIGLGTEL >MS0591 unknown MLSYRYYLYQLLNQKCGENYRTFFYLIIMRLLIKINLH >MS0489 unknown MNKPIKYKPLIFLSNGVLRLLGNIIKILSYPFHAIFPKKRFTIPEFSPAF RPSNKQSKINKTIWQTNYSNKVTLPVYCNYLVNRALSWSYEYRYVSTEAR EEYIKANADTRVYEAYSKLTDGAAQADFWRIFTLYNEGGVYMDIDGHLVW CLADIIDENDTEVVITRRDKYTNFFLASAKGNRFLKDTLDIIVNNIEQRK IDGGVFTLTGPTTLNMALKGKNVNSRRDKFTCAQGTFTNEYFQYMDKKKG KWNHTKNEDLLKK >MS0416 unknown MLDVGLISYFSLVDLSIAFQHSKRNKIKKCIDHKFIGKYKKIVRGRKMYD LITMNQYDALIFDMDGTIIDTMPSHAKAWEKVGEVLGYPIKGDVMYEFGG ATTKIIAQETMRRYGVPAELLEQVVTMKRQFGQEMVLQNATLLPTMQVLE HFLGKKPMALGTGSHKAMVDMLLQRFDLNDYFSAVVMAEDVQKHKPDPET FLRCAELMKVDPVRCLVFEDADFGVTAAHAGGMDVFDVRINQIMKVS >MS0779 unknown MAEVKNLLIGPAIHRILRLTCGYFRQQRNRL >MS2245 unknown MMTVKERLFHAVLFEAGAIILSVLFIWLTTGKSGMVESASMILISFIAMV WNMIFNWIFDKFFTFPKQYRTAKLRLFHTVAFETGLLIFTIPVIAYFLAV DWFTAFLMDVGISITIMLYGYFFNWGYDHMRATLINKR >MS0397 unknown MIMKKFFLFATALLLASCSAQKPNLVSTQKPILNIAANLAQSIEANAGAH SAWVKNKSQQPIAFNYNLYWYDENGITQLFSTQQEKYQGALLLQPQQKAE INLTKPTAESVNYRLYLFSGNN >MS0344 unknown MLNPASCDLFAIPYFQFAQLKKYCPELIPQIKADYKREWNEWKTCILQVS EGLGSPFAEPHIEKWCNGWQVRAHFFAYFKYEFNKNSAAILSVLLNRRRL QVSLDWHCYRADRSQINLSQYNQWTEDFDFRQFADFDIWRGDESEYADFR RVKQLTSQDLSLRSDEDFWCIGKNVEKADLADIDAVDFISRTIRELLPLY EKCHQ >MS0115 unknown MLILTCTGLTVANGQDISALTSKQFQTTCVGD >MS1961 unknown MQTTIEVNMSVKTELLFSNTWNVRISDPGEEGAHSHFFETIYITLEAYID GDNVSYEFTRKVEDEVKIKRNFTQLDELFKFLADYLDAVSLGNLGVKIGQ LGLVK >MS2172 unknown MKLKALTSALILATTLSGGIAMAKTQSATVAEMPAQTIQLTQEWDKVFPK SDKVEHRKVTFKNRYGITLVGDLYLPKNAQGKLQAIAVSGPFGAVKEQVS GLYAQTLAERGFVTIAFDGSYTGESAGLPRDLASPEINTEDFSAAADFLG SLENVDREKIGVLGVCGWGGFALNAAVGDPRIKVVATSTMYDMTRVMANG 
YNDSVDNDARYQMKQDLNNARWEAMSHDYANTGAPVLPSEKELNADTPKF VADYVNFYKTKRGFHPRSVGSNGSWTTTTPIAFINMPILQRAGELRAPAL IVHGENAHSRYFSEDAFKTLGSKDKELHIVKGASHTDLYDNQANKIPYDK FEQFFKANLK >MS1235 unknown MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS >MS2155 unknown MLKITTFTDPMMGLSYESEPFFRKLETHFAGHIEFHTVMAGLVRNVYEFV NPADLAISEAMAIERYLPHLAAIYNAEQSISGMPISMENLDLFSTDRTSS IPLNLAYKTVQQLAPEKADEFLYRLRFATIVEVRPTTKLNELARVAGQVG INEQTFLNAYHLDDVKASLTEDFQRFQQLGIRGLPAYLLEYQGKRVVVNG VLDDRQFFTLIAQLTQNNISPQKPEISQSAVKNLIEKHKLISPIEIQYAF GLANVNNIMPYLNPLLMNGEIKRIEVQDRGKLSSLNYSFFSLYEPYRIL >MS1855 unknown MQQHKQIGRICTLLIRGSKTSHAKKLRKTMNITNPNGDRKAVVIFSGGQD STTCLLKAIADYGVENVEAVTFQYGQRHAIELEKAKWIAQDLGIKQTLID TSVIKTITANAMMDNIKITKDEAGMPNTFVDGRNALFLLYTAIYAKGQGI RDIITGVCETDFSGYPDCRDVFIKSMNVTLNLAMDYQFNIHTPLMYLTKA QTWQLADELGALNYVREHTHTCYLGVEGGCGSCPSCILRENGLQQYLASK Q >MS2255 unknown MEKLFDINEQGLSVRCKLFYEKDVHSIENIVLILHGFGSSKEVKSNAKFG ERLITKYKNYGAIAFDLPCHGADARKKLSVAECLTYIQLVVNYAKEKLNA QNLYAYATSFGGYLTLKYIAERENPFRKIALRAPAIQMFHTLTANMTDDE RHKVAKGKEIMLGFERKMKIGKEFLDELEQGDIQQYDYLDYADDMLILHG TADEIVDIATSQTFAENNVIELIAVEGADHPFSNPQLMDLAIGRIVEFFH >MS1020 unknown MAVTPMFNHQYLTESNHIVAIGGGHGLGRVMSALNFLKENLSGIVTTTDN GGSTGRIRLHQGGIAWGDLRNCLNQIIDVPTTASAVFEYRFAGTGDLAGH NLGNLILTALANMQIRPTEAIDLIRNFLRVRSAIIPMSDIPVDLAATLKN GEQVIGEVEIDKLPEPPASLYLHPQVEATPEAIAALRQADIILLGPGSFL TSIMPVLLMDEVKAELRQSQAKKIYIDNLGLELSPAANLSLAERIRWINQ AVGKDIIDGIITKPEFAQNCGQIRAKIMARRLNAGDVSYRHDRALLCQAI DDLVAELNK >MS1705 unknown MEIANNLKQIHKNIVSICQNAGLPSNSVKLLAVSKTKPVEDLEQAYQAGQ RAFGENYVQEGVEKIEFFQAKHPDMEWHFIGPLQSNKTRLVAEYFDWMQT VDREKIAIRLNEQRPANKSPLNVLIQINISDEESKSGIKPADMMALAEII ENLPHLRLRGLMAIPAATHDVAIQAQSFSAMHKLFVELQQSLPNQRIDTL SMGMTDDMTAAIKCGSTMVRIGTAIFGSRN >MS0928 unknown MAILGTTRRDELIFRRLCVENRLFIFYKQTLYSIFLNYLPNVDFS >MS0037 unknown MLLKPMFKNNHFMTALLIRQRKRPQTRTFLL >MS2256 unknown 
MKNTFRLAGETVVLEQLVQYHSYWLLFKHIFVKNYPRGIL >MS2114 unknown MLFISPCVITISKISNCLKSVIQFSGSFYLLKGKTMSLPYILIALIAGTA LASQAAINSKLAQAMLGQPLVSAFISFASGTIALLLLCLWKADLSASLRE LPNVEPWKLIGGVLGAGLVLTTILLAPKLGITNMLFFIIVGQLCAAAVID HFGLLGMAQRSFQLSQFIGLLIIACGLGFYFFGNKIVN >MS0099 unknown MEQEKINAALARVQEAGYKSSLMLALAEWAEQKLRQGETLDVASLSAWAA DPTRKKAYSFAVNRFLAEFSDSASKDK >MS1396 unknown MPSLKANQQYKEQLMTNIEQLIERQALKDLVDTFSNLADEKNVAAQMPLF TEDAIVNTYIGGELVFEMAGRAQIEQVFSDYLAPFHAVYHLNGQHTVTFQ DETNATAINYCQVALVSKQDGKEMLLSHYVRYNDTYTKIDGKWLIAKRIA NFMISENRELGVTA >MS2038 unknown MRLKNRIFLTALLRLFLVKRKHNNPENSVDFSLLNC >MS1711 unknown MRQTIKYEFGVGLFLLIGIAALIFMGLKVANVQGFSETKSYQVFATFDNI GGLKVRAPLKVGGVVIGRVTNISLDEQNYLPQVTIAINEEYNQIPENSSL SIKTSGLLGEQYIALSVGFDDGETAMLKEGDKIVDTKSAVVLEDLIGQFI YGDKDKKDDSAEPQAAE >MS0097 unknown MPRNGGNMTQIEIFKAGKRLDAHGTEVDITVEDLQDTVKFYNPEFHEAPL VIGHPKLNNPAWGWVKGLSLDGDVLKADVDEVDAEFAEMVKSGKFKKVSA AFYLPNSPNNPHKGVLSLRHVGFLGAMPPAVKGLKQVEFAEDDDFLEFSD WGQASLFSRLREWIIGKFGIEDADKALPHFEVEWLKEDAMRDQIQKQVQS EQVTPEPIFNEPQKPEGETGMTPEEIEALKAENEKLKAEKAKAEAAQAEA ALAAEKAGNAEFAEGLVKQGKLAPVVKDALVRALDNLADLKAGKDPEFGE GEEQDVLSQFKTALSQSPKIIEFGEVATSDKTKDTPPDEVEYAETDDPTR IELDRRIRAYMKEHNVDYVTALGAVK >MS2080 unknown MSTILLSYGSQSRKDPATAEQMETKKLKRRGNQRVKNSRKPTALFIGIC >MS0734 unknown MTTPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGHANISG YDFIVIDTGGIDGTEEGVEEKMAEQSLLAIEEADVVLFLVDARAGLTPAD IGIAQYLRQRQNKITVVVANKTDGIDADSHCAEFYQLGLGEIAQIAASQG RGVTQLMEDVLAPLAEKMKTDESAVENDENSEQEKDEWEHEFDFNSEEDA ELLDEALAEENEEPENKNIKIAIVGRPNVGKSTLTNRILGEDRVVVYDLP GTTRDSVYIPMERDGQQYTIIDTAGVRKRGKVHLSVEKFSVIKTLQAIQD ANVVLLTVDAREGISDQDLSLLGFILNAGRSLVIVVNKWDGLSQYTKDQV KSELDRRLDFIDFARVHFISALHGSGVGNLFDSVQEAYACATKKMTTSML TRLLQMATDEHQPPMINGRRIKLKFAHPGGYNPPIIVIHGNKIDKLPDSY KRYLSNYYRRSLKIVGSPIRLQFQEGSNPFAGKRNKLTPNQLRKRKRLMK FIKKSKR >MS0636 unknown MNLSYANGSRGASKMSKNRPDFYDQHTSIATRTATTGKIHRRFNLWRIGG >MS2209 unknown MKSILAKGLILALLTTSVSAYALNRQQHDTVVGAALGGVAGAVLGNDVTS TVAGAALGGVVGSQWNANKQRDDHYRVGDRRHHRDFDRLRYEDRRHHPKE RYFAHHKPKHSDYREMRRHRH >MS1264 unknown 
MLKDEDINLFRESIKGAKKLAQNTFVAPKKVNVKKKSEQREIREKSDTLF YFSDEYEPLLNEEDAVKYLREGEDTYLLKQLRRGDFSPELFLDLHGLTKE QAKLELASLIQACLDEHVYCASIMTGYGTYTLKRQIPRWLVQHPNVRALH RAPKEWGGDAAILVLIDS >MS0567 unknown MFVYNREIPLDDLGGGVQRKILAYSENIMSVEVHFEKGAIGSLHSHPHEQ LTYVLSGSFEFTIGDETKIVNAGDVLYKQPNVMHGCVCLEKGVLLDTFTP MRKDFIK >MS0532 unknown MPYTGRGRVLSTLRGIEKVRIGSYQLKNRILLAPMAGITDQPFRKLCAAY GAGLTFSEMMSTNPQVWHTEKSRLRLAHHQAAGINAVQIAGSDPKEIAKA AQINVDYGAEIIDINMGCPAKKVNRKMAGSALLQYPDLVRQILEHVVNAV SVPVTLKIRTGWNKEHRNCVEIAKIAEQSGIQALTIHGRTRECLFEGNAE YDNIKAVKRQVSIPVIANGDITSAEKAKSVLEYTGADAVMIGRGALGNPW LFKSVESLVETGSIVFEPSLDEKCGVILQHIQSLHQFYGEEKGYRIARKH VAWYLQGIQPSSNFKQTFNAITEPQEQLIALEEFFNSIRNG >MS1239 unknown MTSCGYDIFSFLWLKSGRILGYIGFLYKMAMGFEELFI >MS1998 unknown MSLAIVYTRASMGIQAPLVTIEVHISNGKPGFTLVGLPEKTVKEAQDRVR SALINTQFKYPAKRITVNLAPADLPKEGGRFDLPIAIGMLAASGQIDADK LRRFEFIGELALTGNLRGVHGVIPAILAARQAKRYAVIAAQNANEAALIS DQESFFATSLLEVVQFLNEQNKLPSTSDLTPQSAKNGSSTITKDLTDIIG QQHAKRALIIAAAGQHNLLFLGPPGTGKTMLASRLTDLLPEMTNQEAIET ASVTSLVHNELNFTNWKQRPFRAPHHSASPAALVGGGCEN >MS1112 unknown MMSWYYIFNAMLFMYPALMAVYCIISASYYYFFIEGKLKKPKYSKMKLED VPLVSIMVPCYNEADNLDDAIPYLLKLKYPKFELIFINDGSKDDTGKIID RWAQKDSRIVALHQENAGKASALNHGLTVAKGKYVGCIDGDAVLDYKAVD YMVQALESNPQFGAVTGNPRVRNRSTILGCLQTSEFSSIIGLIKRAQSVM GTIFTVSGVCCLFRVEAMQKIGGWSTNMITEDIDVSWKLQTSGYNIVYEP RALCWTLMPETIRGLFKQRLRWAQGGAETIIKYFPQVWRLKNRRLWPMYI EYFLTAFWAYSLIIVLCINTYLQITEETFEISIFRPLMTVLFLTFFLQYM FSLFLDSRYEKGLLRYSLYCIWYPYVYWLLNMVTLVFGIPKAIFRNKSKL AVWTSPDRGV >MS1973 unknown MKNWINLLPWRQQLIAQKNRKFIYKISLFFTALLILEMGISLFHGNLTRQ LSEKQQQFYRQQDEFAKLTRQVSQLRRSYEQTEEQNLISSDSVSLFLSWL ARLPLNEGELTEFLLQQNSIHLHGYAENQQEFDSIHQYIVQTEWIYESKL THFSTSANGLLAFSFAIEWGRNGKASMD >MS1676 unknown MFRGAQAINLDTKGRIAIPTRYRPELLAENQGQLICTVDIRQPCLLLYPL KEWEIIEQKLCQLANFDPAQRSVQRVMSGYATECELDSAGRILLSAPLRQ RAKLEKTIMLVGQLNKFEIWSETEWQAQIERDLELGLSGELATSDALKML SL >MS1069 unknown MDYKDNSLKTLKLGQKTDYIANYDRTLLQPVPRALNRDGLGITKQQPFSV GADIWTAYEISWLNIKGLPQVAIADVEIDYRSTNLIESKSFKLYLNSFNQ 
TKFSDMSEVQRTISEDLSICAEGNVRVQLHSLSNYSHERIADFAGECLDE LDIEISDYGFNAEILQNCTALSTEIVEETLVSHLLKSNCLITSQPDWGSV QIHYQGKRIDHEKLLRYLVSFRQHNEFHEQCVERIYCDIMKYARPEKLTV YARYTRRGGLDINPFRSNFEAIPQNLRLARQ >MS0800 unknown MFMLFTTIIGIALGVLDVLFGFYDHQTGQGFLSGIYSLAVLIPTIAVSAR RLHDTDRSAWWLLLGFIPVIGILILIVFWCFDGSFTTNRFGVNPKQDFLY EKNKRTQSDIISKS >MS1777 hypothetical protein MLTKEQQVFFRNEVLSNLDIEKLDEIQSEYGKLVDELVQIVYQVSNQHGH YLTALDASGMAEIIADEITGYGPLRELMEDDTINDILVNGPDDVWIERAG ILEKTNKQFINNEQLTDIAKRLVARVGRRIDDGSPLVDSRLPDGSRLNVV VPPIALDGTSISIRKFSKNKKSLQELVNFGSMTLEMANFLIIAARSRVNI IVSGGTGSGKTTLLNALSNYISHTERVITLEDTAELRLEQPHVVRLETRI AGVERTGAITMQDLVINALRMRPERIIVGECRGAEAFQMLQAMNTGHDGS MSTLHANSPRDATSRLESMVMMANASLPLEAIRRNIAAAVNIIVQASRLN DGSRKITNITEIMGMESGHIVLQDIFTYQPSKYRDENGKIIGEFISHGLL SNSVVYQNAQIFNLSNELQSIFEGLQ >MS2258 unknown MIALAEIGTNFNRSTNFGGDNTGFLLKNDIKKLVILVSQHDKKTEKPTAL LSK >MS0342 unknown MNSPQKSSNAKFWAICTTALTAAVASTLCCIGPLIYLAFGLSSAWLMDLS EYSYLQIPMLIISLVTFSYGFWLLNFSDKIICTKYLSRRTLQILYWIMAP VILFFLSYPYVLPYILELLE >MS2020 unknown MAIFRKYGAFGIVWFSHLERIFIILIKEEDIFELIIKFLLNEIFKILTAL LMKTAFN >MS0092 unknown MAFHHGSETKRVNGGSVAVSTVDGAIIGIVGTAPMGAVNELTVCLTKKDF SQFGTILDQGFTLPDAFDILARYASGQVYVVNVLDPAKHRTTVTDEVLTQ DSDTLVATTAKKGLISVTNVKLGGSLLTEGETYSVNLESGEITLTVAAGE QDLTASYVYADPEKVTEDDIKGGVDSLTGKRQGFELLRDGFNLYGADAKI LICPEYDKTASCAAALATLADQMHAKAYVQLPKGTSLSKAIQGRGSLGTI NASASNENVRHFFPYALGSSNNLESLATHAAGLRMKVDVDEGYWFSTSNH ELSGVIGMEIPLTARVDDIQSETNRLNAVGITTIFNSFGTGFRLWGNRSS NYPTETHISCFEVASRTGDIIDESIRQAELQFIDKPIDDALIDSFIETID TFLRSQKSLVGYSVGLDYDYDLVDAFSQGQIPLIYDYTPKIPGERISNKS VMTRTYLANLVSQR >MS1466 unknown MERLLARPVTKIRLPVKKPILKSYLVVEKSCVV >MS0339 unknown MIGINRNFARIGGALGYQYGFMHKNPEHFVIDV >MS1443 unknown MYSSKQVKSAVKNFQILTALFSICLYKRYNRH >MS0558 unknown MAQINIEITYAFPEHYYLKKFTLDEGTTVQSAILQSGILQQFTDIDLREN KIGIFSRPVKLTDSLNDGDRIEIYRPLLADPKEIRRKRAAQQAKDQEEKK KAEKSANKEN >MS1071 unknown MTIHLQTKKHLQQLQFAMQSLDLWQTVPPAEEAFLSTEPFAIDRMTATEW LQWIFIPRMYALLESGTELPAQIAISPYIEEALKETDNLSLLLSPIIEIE QLLQKS >MS1971 unknown 
MRIKILFKLFFIIVSVIRLLTFQSWAESDPFDKTKRNFSQNTDMLVEKTN QCHQSAAVWAENTEFKQLKIVGVLQYEQERKVFLMDAERHIFTAGQGDFL AKERMQLQAINTREVDFMVWNNPQDCGQGELMKIKF >MS1092 unknown MALGWYELKLAKDGQFMFNLKAANSQVILTSELYRSRAAAENGIASVQKN GGDEKNFEFRENKNGEPYFILKAQNHQEIGRSEYYSSKAAAQNGVNSVMN NAATTVIKDITKS >MS2267 unknown MKMNKKFAQIVKNPAFRNMVLKTIFNVTNVMSATKYLR >MS0838 unknown MALILNILNFVLGGFATTLGWLIATIISAVLVVTLPYSRGCWEITKMSLM PFGNDIIHVKYLEPRSSLSNSLGSVLNVLWFVLFGWWLCLLHILTGITQC LTLIGIPTGLAHFKLARISLFPVGQRVVPKEMAQLVYRHQAEREFENQRN A >MS0897 unknown MNKSKLALIIIVALGAVWTGGAWFTGKTAEAEYKTQVENLNKQLTSAKSA VAFSVKIENVRFDRGLFSSEMSYDILIESTQDKTQTWRLPFAGTLYHGPL PLNLVRQFDFSPAVFSSHSQLIKNELTTPWFDYTKEQNPITADISLNYMQ EFDAALNLAAGDIKLEDIAVNWSNAFIKYAATPKKQGEFNYRYDAAKLTL TDKALADIKLADKAETQSDLSAIDIELQNLDGLMQIIPATNDITTGLYKG KIENLTYTYHFADNKKTANTIYFNNFNYDYAANENEGMLNYDIHNRADSL KINDKNLGGVQLDLQANHLPTNLVAQLIHGIKQQADEKEINEILLKILEN QPHFRISPLALQNTAGKFTGNFNIELAHADFANALKKGNVLSLFKQFSLN IDTDKPALVEFLSTLQQLSGVAKEKADGYAQQQVDRIANTLKKQNVIVEE GGSAKFNLAIVNDKLMLNGNEIPEEYIGLMLFGLMMQSK >MS1430 unknown MNPTVVVSNIKTLLLAVLSGRIFAPVMQHDCQPYLDSGELEIVFPNLESQ MWGIYLYRPYQTITPKRVLVVFEILERLLKMHSEQ >MS1732 unknown MMALSQEKRLIEAPVNGGRNYNGPKVAKFLVG >MS2169 unknown MEIEHKMDKCHQVHYLAYPYYLFEYYKENASE >MS1215 unknown MDRFSRMWRKLQKSAVCFLLIFVTVNVWAADFPGSPNPFRYVNDYTNTLS ENDKNYLENKLINFSRETSSQIAVVMVKTTGEYAISDYAFTLGDNWGIGR KQLNNGVLLLVAKEDRKVFIATGQGLEGALPDAFLSQIIRRVILPNFRQE QYASGINGALDYIIAASKGEYDAAAEQNDEGFEQYIPFLMVLVFVLFVLF GELNGRRKPYISPTTNHQLEQVILQSARRRRGNSGGFGSGGFGGFGGGGS SGGGFGGGGFGGGGAGGSW >MS0118 unknown MGGRRRNKMSEKINSTQRALRILKALKGRTLTGLSNKELADRLNESPVNI TRSLQALIAEGLVVKLEETGRFALSIQMLQIAVTHQRDTEKMQARMAEMD QRVNAGAF >MS2181 unknown MNEAIFLTKRALMLKKLSFIAMLAMLSACSLSSYVPFMGNDKPVINLDKD KIDQKSYAVAYASTVQSYNGRITEDYDVNSFASGANDWYLGRILVPTEQI RARLGSGLDSKLHAYYSGVIFAADLQTNFSRLSATCWSKVDTQSMTQGIY DAVIDLRKGKVRGENDEYITKGSEELLNLCK >MS2129 unknown MKFKLKALTATLFLGSSLLGANAMAQLPQNATAIEVPAQSIQLTQEWDKI FPKSDKVEHRKVTFKNRYGITLVGDLYVPKGATGKLPAIAVSGPFGAVKE 
QSSGLYAQHLAERGFVTVAFDGSFTGESSGLPRNTASPEINTDDFVSAVD FLGSLDNVDREKIGVLGICGWGGFALNSAISDPRIKAVATSTMYDMTQVM ADGYEIKMEPNPKVPYERTSPMTTEARYKMKQDLANARWEAAANGYSLNG KAEDHLTPQDKITAETPRFVREYSNFYKTKRGFHPRSVNSTTGWNTAMTP SFINMPILQRAGELKAPALVVHGEFAHSRYFGEDAYKALGSKNKELYIVP GANHTDLYDDVNGKIPYDKFEQFFKANLK >MS1041 unknown MFMKSSYPFSNTWEKLLIGFFCTPIILGILLFINEVTGFQLVCISLIGTI CLWGVFITVKILQINTQQSHQCRFKEF >MS0427 unknown MKKSLSLLAVLAFSFGLIACDGDNVRSEMQMMGRYNSELISAASAEEFHK ASENLQKFSLEAMNKRPSTVKSDEEFKAYQQGMQHFIDVVQQADQLAQQG KFEEAKDLTKQLLEMKNQYHAEFKNK >MS0217 unknown MSNTSAASFNSALICSKYIQFSFYEQVKNAENTAPDKCFAIL >MS0076 unknown MKKNLVALAVVAMAAQAHNPKSKQASKQASKQASKQASKQAR >MS0090 unknown MSVSKPLNLFKISPTAVGVNSIVKLNNNPQWSFFMSQKLDDLLVFRTIQL DYPIKDGEGNTVTELKMRRAKAKDMRRMSAQKTEAEQEIFMFAQLVGLVP EDIDELDIADYGKLQKAFTEMVQGKSA >MS0621 unknown MCQMLAMNCNTPTDIVFSFEGFRRRAGMTDSHSDGFGIAFFEGKGVRVFR DDQPGAVSPIADCVKQYHIKSLNVIAHIRKATQGVVNIENTHPFIREIWG ENWVFAHNGNLNALPDLSSCYCTPIGDTDSEAAFCYIAAKLKERFCRKPT ENEIFDTIKELAAELAQHGTFNFILSNGQWMIAHCSTNLHYLTRQAPFGV AQRIDDDGIIDFSNYAKDTDKVTIITTFPLTKDEIWAKMEHGGMVMFKDG VKIREAIGTPKEAVDDGTLGCTKIAA >MS1868 unknown MQKVKLPLTVDPVKDAQRRLDYVGYYAADQLVRLNESVVKVLSDAQVTLS FFIDPQKLVVMKGQAQVEVELECQRCGQTFNQTLECTFCYSPVANLSKID ELPEIYEPIEFNEFGEIDLIGTIEDEFILNLPIVPMHSSEHCEVSAQEQV FGELPEELAKKPNPFAVLANLKQK >MS0126 unknown MKVKCSACGAVYSLDALIANQSASQALNAALMVSGELGEALIRYLGLFRP AKTSLTFDRVATLLNELTPMIQAGKITRDGREFPAPTEAWIYAK >MS1438 unknown MKNLKLSIATIAVASLLSACTSQYATEKHEQLKLQNQAALGIVWMQQSGE YQALAHQAFNTAKTAFDQAKKTKGKKKAVVVDLDETMMDNSAYAGWQVKN GEDFTQETWTKWVNARQTAAIPGAVEFANYVNNHGGTMFYVSNRLENGER QGTIDDMARLGFPGVSEKTLILKDGKSAKSARYKTITDQGYDIVVYVGDN LNDFGDATYRKPNAERRDFVAQNAKQFGTKYIVLPNPNYGDWEGGLDSNY YKGDVKNKVDIRLNSIKAWDGK >MS1451 unknown MSNLSFDFVENDFKPLAARMRPTTLEQYCGQQHLLGNGKPLRKAIEAGHA HSMIFWGPPGTGKTTLAEIIAHKINAEVERISAVTSGIKEIREAIERAKQ NRLADRRTILFVDEVHRFNKSQQDAFLPHIEDGTIIFIGATTENPSFELN SALLSRARVYILKSLTNQDILHVLEQALADKERGLGNENLDLEEGILELL ADYVHGDARLALNCLELMVDMADESEKGKKIDRTLLTEVLGERQARFDKQ 
GDRFYDLISAVHKSIRGSAPDAALYWYARIITAGGDPLYVARRLLAIASE DVGNADPRAMQVAIAAWDCFTRVGAAEGERAIAQAIVYLAVAPKSNAVYN AFNQAKQLAKESADFDVPVHLRNAPTKLMKNLGYGAEYRYAHHEPNAYAA GENYFPEELKDTVLYEPTNRGMEIKIQEKLAWLRELDKQSSVKRYK >MS0936 unknown MLDKIIGAVISNALGGNSTNNSSNSSLISNVLGSLLQSQGGMEGIFNKLQ QGGLDNLLESWIGTGRNQPMQANQVSEVFGEDTISSVARQAGVPASQAQD ILSQALPQIIDMLTPNGREGGVRTDSLTQATQQVQQDNGFGLDDLIGGVL GSVLGGGQQQSAQPEQQRSQGGLEDLLSQMLNTQTRSSRTPTASTNDELA QDIGSILDGFFKQR >MS2177 unknown MDFNAILNQVLSAAQETVKKTASGNSTTDKVAKIGGGAAAIGVLSMIFGR TGGAGLAKLGSLAALGSLAYQAYQDYQHKQSQVVPVTEMEFTQSVQQSAE LSKVILQAMIAAAAADGAISDREQQAILSQAGDDAEVQQWIRQEMYQPAT VREIAQQVGDNQALASQVYLAARMVCADLARKEIVFLANLAQALGLDEAL VEQLEKQAGF >MS0940 unknown MMHKMMPILFGEIKNEKTDNDSVIVGFTCTSQRTRFDVIRKL >MS1426 unknown MMTRRTFLTASGLMASGLFLPKICKSETLLQRRRPMKIIAVEEHVLDADL GKASMPAALAQAPYLPDWGKTVQDGYNLDRSRPQIEQNALINPKGFDMGE GRLKEMDLAGIDMQVLSYGGFPQFALKEQSAALNRAANDRLAEAVAKHPD RFAGFATLPWGQPQEAVKELKRAVNELGLKGALLNGRPSEHFIDHSDYEP LLAAFHELNVPLYLHPGVPVQAVQQAYYGGFSPEI >MS0083 unknown MQTHNFGATYQEGIVTEVDAAKHKVRCKIPALEDLETAWLPFLTPNAGGN QFYCLPDKDELVALLLDARGEGGCVLGAIYNDQDPTPVANAEIWCHKFKN GTEISHNRKTGDVVVNTKGHVTVTAGAGATINADTVVNGKLHATGKITSG EEVSAPKVKQGTVELGTHTHGSSPQPNK >MS0665 unknown MNKYLKSDFIFSLFLSIAIMFICLYFEKSFFFVDDAQNEFLPFTRQIGNV WLNGEIPFILKNTFIGSNTMIDIHRAIFLPQNIFLSILSVKITSLKIISI IAAFINLFVMSFSALKLSEAFSLTKAAGIVLAFLFCINPIFLYFYLESWW NAAAGQAWFVASLASVAWLMRAFSIKRLLLNVITVLSIFASGWPHSVLVY GFLALIFSIFLYLNKRHNDLILFVLISFSIILIAIPLYSEYVISGDLINR QSFKFNNVGNFLSTTLNQLLLTFNVTYYHFMHRYGGYSITHIPMGYSSIY ILLLICFGSLKNIARNPNSLFLLVLCTVFFILTQTPTEIGPFRYPFRFTP YFSEVLTMLSIFSLEKLGIVKTRARVFLVVLLLSISLLLSIFSLEENFGK YAILQFLFFAVTTWYVVRYNSISLKSGLPYTAFIFLLMLLAKDSVIGYLS FPDLKNSINMENNYSQGGYILSLTNGKRPKNNLEDLNSTHFMLYGLKSIN GASPVGNKYISKTISTRSSQAFFNAKETILGLSKTYKDKCYFDLFGIDTV ILNKKDNSSLISQKLSDCGFSERKVKSHDVIYFLRNDFNAKGSVSYHSDT LSINQQISLKNNSEFYQLSGLKGDELIFNRVYWYGYRAYINDKEIPLLNY DGLLRIILDHDYQNGVLRLEYFPKSWKYALLIALSGFLLLLFSVGYMQRM RKWVSLN >MS0773 unknown MKKAGLTLALLLTGCGILGPSYSGETTAGALLKSDTERNINIFFRAIHQC 
SPEKIHTQINSAKPATQNSVEQAQETWTVTGCGKTEVFNIQYVGDGVGGT YIRMSKKN >MS2270 unknown MKKIVLLLAVITTLTACSSADTPTPRDENQLADGIMIPVEGTGAIAGGSF MPEIEQQSMPDSMK >MS1464 unknown MNNLALEKLISQKLNSATIADYAPNGLQVEGRPEIRKIVTGVTASQALIE AAIARHADAVIVHHGYFWKNEDPCIRGMKGKRIKTLLINDINLYGYHLPL DVHAELGNNAQLAKLLKIENLQPLESDSVSIPVFGELAQPISLEDFARRI EKSLQRKPIVCNAEELTQNPPHLIRKIGICTGGGQGYIDLAAARGCDTFI SGEISEQSTHSARELGIHYFACGHHATERYGIRALGEWLAQEYQFEVEFI DIDNPA >MS0269 unknown MKTNLKLGLLIGALGLFSTGAMAAHLPDEIYQPRGAKVIKADRQGKGEFE VEFRLDAREHRIPVLAEKAISHARYHGFRLVESEIEHDDADLKFKRGDQE MDIEIELKDHHRIEYKAELDLDKN >MS1268 unknown MKILITGATGLVGKALTRQLLKQSHQITALTRAVNTAQKLFPEVDWVSSL STYKNLDQFDAVVNLAGEPIFDKKWTDEQKLRLKNSRILLTQQLTQLINR GKRPPVFISGSASGFYGNAGSQLLTESALPATSFTAELCQAWEAAAQQAD TRVCVIRTGMVMSPRGGALARMLPLYRFGLAGKLGSGQQFMPWIALKDMV RGIIFLINNPNAVGAFNFSSPNPVTNKEFNRLLGSRLKRPHFFSVPACIL RLFLGERACLLLDSQNVYPKKLLDLGYTFQFEHLETYFSKTLKQKRKK >MS1884 unknown MSHILAVKVAQVENLSLSDGSTIETAIRKKAVDKVRVHQLGAEGNDVGDK KHHGGVDKALFFMAQKSLEKLTALLKLDYDYLQDSRFGENFVVSESDENS VCIGDQYRIGSALVEVCQPRKPCNTLSKNTEVPETRKTVVETGLVGWYVR VLEDGVIAQGDKLELVKRPYPEMTVALVHGLLSQPAKNLDKTVLDKAIAC APLAEGYKKTLYKQAEKLAQQSSESAFFNTPEF >MS1331 unknown MNKALLPVLVSSIFMLSACNEEKNIELAAQLQHYQQQVDQLKTELENANN KLTQTQNELTAQQQAFPALKTTEEKIFTRNEEISFTENRPTGSGIINYYI DTVKTSIPWLDKLLISQAIDILNQDAEPKDKLTINDSDSDQQKAVLTEKL ENNYQRDLDILTANKLPGIDYIIETSYLGQRENLVSFSLFRHAYYGGERS SFYTRYLNIDSETQSIIRLSDVIPPVKQKELKELLWNSYANALGNNKPYI KKQNFYIAKDFYFTPDGMNFVYSPSSIAPFSAGEITLQLYWNEINTLING QYIWHDIK >MS1638 unknown MKTYNKRILLSVTGMSPAVVTETLYALVTEKNFIPTEIQVITTIQGKNKL LSALLGIEGGRKERKGALAEFIEDYGSQYGFSAIHFDESCIHIIEDTSGE KLPDIRTPQENEFAADNIVKLVGSLCQGEESQLHVSIAGGRKTMGFFMGY ALSLYGRKQDSLSHVLVDEQFETLPNFYYRKPYSHIIINRDGVELDASKA NVMLAEIPWVRLGLGVPEGLKHQAISYSESVKNAQALLSQQSITFLAPLE DRLVKFGSKVIKLAPRGYALLLGLVVAKDAGWQFGIREEKHTIDTYLKIY SQIKEDEEMQKRLAGMDNDLKDVLSESRTDIRKKITENFSLGKGAESDYI PSSSRKTGNYELNIDLDNIDISAIQNELARLKIL >MS0496 unknown MCLTANYLSGRVKNFRIFDRTFNRLKFSASIKALN >MS1916 unknown MTEKINLMNLTRQQMREFFKELGEKPFRADQLVKWIYHFGEDNFDNMTNI 
NKKLRDKLKAVAEIKAPEIAVEQRSADGTIKWAMQVGDQQIETVYIPEAD RATLCVSSQVGCALACTFCSTAQQGFNRNLTVSEIIGQVWRASKVIGEFG VTGIRPITNVVMMGMGEPLLNVANVVPAMELMLDDFAYGLSKRRVTLSTS GVVPALDNLSGMIDVALAISLHAPNDELRDEIVPINKKYNIKMLIDSVNR YLSVSNANHGKVTIEYVMLDHVNDSIEHAHQLAEVLKNTPCKINLIPWNP FPQAPYGKSSNTRVDKFQKTLMEYGFTVIVRKTRGDDIDAACGQLAGDVI DRTKRTAAKRQFGQNIDVQLQ >MS0260 unknown MLGVLNMTRQNIFIILAFSNEINKMELQDKLLIAMPNLQDSYFSQSVIYI CEHNEQGAMGLVLNQVTDLSIAELVAKLNFMMADGRHYPETYVFAGGPVS MDRGFILHTATERTFEHSYRVTDNLQLTTSEDVIETFGTPEAPEKYLVAL GCATWTSGQLEKEIADNDWLVVPANNHILFDVPWAECWTAAQQLLGFQPA NLVAEAGYC >MS1100 unknown MATLGATRRDELIFPIFNDLKKCKNLPHLIRFQRP >MS0979 unknown MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK SLRR >MS2159 unknown MKKILVLTGSPHPNGASSRLADEFVKGAKEAGNDVFRFDAGLQPLGELHF LQLDASERTIADNDIVSREVLPKLIEADVVVFVSSLYYFGMNAQLKAVID RFYSINHELKDDKQSAVIMAGYGEGDDLKPMKDHFNIIQKYMRWQNIGTI VAEDSWNAAKLAKHLQEAYALGKSISA >MS0127 unknown MRNKLLQLVHIGKTQLGMDDETYRSLLSQQFYQNSAKNISYSGLIKLVKL LQSKGAKIQLPKSKSTLSPLQRKVWAVWKSTADNPTSQALNAYVARIGVD EPWNMMNNSQASFVLETLKKWQERKGN >MS1637 unknown MEARRRLGYDGNRVIHEYANKNLYRWRINARSLSYCL >MS0938 unknown MAKTRGSARDSIKIYTEKCGENSLIFYRTLDEKI >MS1911 unknown MSITVNQIVLHQIIKPASANIPANNNNETENGETATQNTQLETVLRQELL PITAEAEQFMLELHQAYQNKTKGYGVFQEQSRFAQSLNRLLERETDFLPF SYEAAKLLSSELAKYAFAESGTFVLCRYNFLATDYLFIALLDSKASVLVD EKLEIHRTQYLNINQFDIAARINLTDLRVNANSNRYLTFIKGRVGRKIGD FFMDFLGADEGLNPQVQNQCLLQAVSDYCQKGELSAEQSQAVKKQVFDYC KGQINAGDEIELTELSETIPTLNQQPFADFAAEQDYGLENNIPPVRSALK SLTKFSGSGKGVTISFDAELLDKRIYWDDMQDTLTIHGLPANLKDQLQRL LKNHN >MS0167 unknown MMKKVITYLKTLMIEKTSLDQLKAFYLRFKFTAPI >MS1641 unknown MGSGYIVAKMQPLDWENIGLQICNLSEKFDRLLKSNFYCLREQNDHLVYF KTRRRN >MS0943 unknown MAFSSLFTLLDDIASVLDDVSVMTKVAAKKTVGLISDDLALNANQVSGKD IGAERELPVVFSVARGSLINKVILIPLALLLSVYFPSAINILLMAGGAYL CFEGVEKLLHKFIRQEQHEEIKDSGDEKDKIKGAVRTDFILSAEIIIIAL GSIQQADISTKILTLSAVGLGITVFVYGLVGLIVKLDDIGLWLLRKNGKF SQKSGEFLLFIMPWFMRSLSVIGTIAMFLVGGGIFVHYLPEIHHFVEQFR IYHQLAWLVEGLTGMIIGAIACAVILPLLKLFSRKAH 
>MS1187 unknown MNFPVSLYIALRYWRAKSADRFARLVTNLASSGIVLGVMALIIVLSVMNG LEKHQKQQVLSGIPHAVLMPQEGYLDLQAAQPSMPDFVRQAVPINSTNVI LQTAQGVSAGHIIGVQKPSDDLILDYLTQQQLSELLPAGEFKILIGNRLA DKLRLNIGDKVRLMITENSQYTPFGRVPTQRLFTVSDIYFSDNSEVSGYE IFANLSDIGRLMRIRPEQVQGYRLFLDDPFQITALPQFFSADKWKLEDWR SQKGEFFQAVRMEKNMMGLLISLIIIVAISNIITSLSLMVVDKQGEIAIL QTQGLNKRQVRRIFILQGFLVGLVGTIIGTILGVLITLNLADIIELFGQR GIFLPTSLELGQIIVIVAFSLLLSLLSTIYPAYRAAKVEPAEALRYE >MS0872 unknown MGRLKRFLLIFVLAFFAAGAYFFYTVQIFEQPKISFNSAHPSSLTPQNQY CFAVNSPLQIIRQNRFKFVVWNIHKGLDEGWQQSLQQFAQEADFLLLQEV ASTQQLAQEIPQFSTALYVTSFSYLGRESGVSILAKTMPQRICGGAEKEP WILIPKVGNAMTFPLQNGQSLLVVNLHLVNFEFHPTSYRNQLENMMRLVA KHQGPIILAGDFNSWNQPRLNLVRRFAKQYQLNEVNYHPDERLRFLTNPL DHVFVRGLNVITSTTVKTSSSDHNPIFVEVALDKPNSK >MS2232 unknown MQQIQISDAAQGHFRKLLDQQEEGTNIRIFVVNPGTPNAECGVSYCPPNA VEATDTEMKYATFSAFVDEVSLPFLEDAEIDYVTEELGTQLTLKAPNAKM RKVADDAPLIERVDYVIQTQINPQLASHGGRITLVEITDEGYAILQFGGG CNGCSMVDVTLKDGVEKQLVELFAGELKGAKDITEHQRGEHSYY >MS0356 unknown MLKNGGGTTKNPEFTYILHRKSAVEILKNYAKIYRTFIFTIKVKFG >MS1589 unknown MNRRDHLLQELGITQWQLRRPDVLKGAINIAVEEHIRLLVIAECTLSARD FFIQDVLRSAEIKLQDCLFLTFSQAAHLTVQHPVNYWLLSDEQGIIEQTL TFCTLQNSLWQTPDLPRLKLDRRAKQALWKQIQTSL >MS0085 unknown MTTQTLLKHIVKQGERWDNLSYQYYGDALEYGRIIDANPHISFCEVLPTG VTIYIPVLNVKPTSNENMPPWLRGTNE >MS2210 unknown MTSKIIKAVVIGALATSVSACGLHGQQRDTATGAVIGGVAGNIIGGNTVS TVAGAALGGVVGSQWNKHR >MS2135 unknown MLQLRKSNERGHANHGWLDSYHTFSFADYFDRNHMHFSDLRVINEDFIQP TMGFGTHPHKDMEILTYVLQGAIAHKDSMGNVKTFTAGEFQIMSAGTGIY HSEFNPSESELLHLLQIWIMPNELGVSPRYDQKQFADKEGATLILSPDAE GESFKVYQDMKLWRHQYKAHQKVELGLNSRRNYWLQVVKGNLTVNDIALA TSDALGISAEELATIETSDEVEFLLFDLR >MS1645 unknown MKIKLYFPDESIATIKRMGLRQMSPETLRWTADHPSSSYGMGALLRGKSG EILDGKSFAAMVHAFGAWIETDSEDTSRRVHNALVTAATGTEESVKVAKE >MS2122 unknown MSRALNFVMISPHFPTNFETFAVRMREKGINTLGIADTPYEQLSETLRNN LTEYYRVDNMEDYEQVYRAVGYFAHKYGRIDRVESHNEYWLELDAKLRTD FNVFGYKNDDMLAIKTKAQMKEVFRKSGLKVAKGRVFKDDEDARKLAKQL KFPVIVKPNSGVGASDTYKIKSAVELEDFFGYKNPNVEYIMEEFIDGDIV TFDGLTDHDGKIVFYSSLEYSEAVLDTVEKDGDMFYYVPREISPKLVKLG 
EQCVEAFNVRERFFHFEFFRVKKSGELLPLEINCRPPGGLTIDMWNYAND FDVFREYANVVTENKFYSDITHPWNVVYISRKANQNYVNSIDDVCQKFGD NIISVQTVPGVFAKVMGEHGILVRTKTIEQMREIVQFAQAKQ >MS0873 unknown MKLLISNQHGAIVMALMPFFYGMLLSQPVWAHIFLLLAWFSLYLLSYPFL NLFKGRNLAQYKTWVWIYACAVIIFVIPALIYNWKILYFALTIALLSSVS VYFVKQKNERAFLNNLNGIVIFAVAGMGAYYFADSVWDYKIWQVACYPSL FFIGTTLYVKSMMRERKNPLYLKLSIIFHIGCILVFLFVQQYILTLAFII PLVRAIYLPAKKLSVKQIGLIEMAVSLLFFVILLWATI >MS1709 unknown MQAEKHLKWAAEQNDDRIAFRLDGELSRDTLLPLWNEFQKREQRSSFLSE RQIADKNISWDLSQVSRIDSAGFALLCDLLHYCQAKKNADKTLLLENVPP QLLTLADLVGLADWIKPYLK >MS1353 unknown MAILGTARKDELTFRQLCVENRLFRMTKRHVFRLDLTKNKLRRSRL >MS0476 unknown MEKNAFGILQPKLDVRNVLPLNQLDIIFTPLVAFDKSANRLGMGGGFYDR TLQNWQNKSFLPVGLAHQCQQVEKLPVESWDIPLYDILSA >MS0350 unknown MIIILSLVMQKKFALDHVLQHFLWVGELYGLCYDRQKLMKKLLNKKKDDM SRNIQELKNIVAKLRDPDGGCPLGSETIL >MS0504 unknown MTALFVCYAHKVKQTNFKEKSISNSRCFLND >MS0122 unknown MAKKAVRIKAETHEINLQTQDDVALAIKEIGDLEREQVRLSTLQADEKAA IDEKYTAELTALKDKVKPLQKAVQAYCESRRNELTNGGKQKTAYFTTGEV QWRAKPPAVIARGIDVILESLRNSGLFRFIRTKEELNKEAMLAEPDIARS IDGVTIREGVEEFVIKPNDEEVRT >MS1697 unknown MKKTLLAIIAALAMVSAAQANVYVEGNAGYSKIKSGEVSDHRFSPNVALG YDTGDMRYAIDYTHYGKSTDGNSEVKAHGFGVSAIYDIEVGSPVKPYIGA RLSANDIDAKEEKRSGGSRIIKETDSYKLGYGALAGVQYQVAKDVSLNGG VEYNRLGKANGHNINQYGAKVGVRYDF >MS0279 unknown MKKSFVKTLLATSMLFSETAMAAYPEKPITIIVPWGAGGNTDTIARLVAK GLQEELKTNVNVINRTGGSGVVGHNAIKTAKADGYTLGVVTVEIALMKHQ KMADLSYKDYTPIARLGVVPGGVQVAKDAPYKDINELLAAVKANPGKLKA SGSGLNSIWHLNLLGILKSAGLPEDSVKFVPSQGASAALQELVSGGIDFT TSSPGEAQSMTDAGMVRHLAITTPTKSELYANIPVFQEATSYKWTLNGWN VLTAPQGLPDDIKLVLEKAMEKVYATGELQKFANKQGFEASALYGSELEK FMADEDQKFGDLLSTK >MS0761 unknown MKKSKMNDKIIFNQSENPKDSKEPTDFVAKQEFIDITDADVRLDPEDLTG EFSLGQEGELLTENLTESLTPKPRWWKKLLILTAVLFFGATVAQSVQWLI DTWQQQQWIYFVFALVSLFVVILGFSALFREWRRLAILRRHIDLQRQSET LLQKSAVNFEQDLPAQDSESGKQLCLKIAESMNLEPQYPALNQWQKQINE SYSAQEVAYLFSQTLLKPIDAKAIKLVTKSAVEAGTIVAISPLALVDMFF IAWRNIRLVNRIARLYGIELGYASRLRLIRLVLLNVAFAGATELAQEIGM DWLSQDIAAKLSARAAQGIGVGLLTARLGIKAMEFCRPLVFSKQEKPKLT AIHRELLSTLKSTVFTSSKIKDKEKM >MS0096 
unknown MAGQKVATRLTDPVLTQYALGYHNNEFVGELLLPIADVPKEGARLPKFGK EAFVTENDERELHAASNKITPAKVTTEDIALGEKDLAYPIDYREGKEADF DYEQFAVDLVMEKMALNRELRIKALVTNEAAYGAKNKIVLSGTSQFSHAD SQLFKVFDDAFEAVRMASGKSVNRIVISSNVWTAIRNHKEVLDILKQRGL KSLSPSLFAELIKGEGQDDLQIAIGRASYTTQLDQDTQPVWENDIVMAYV PQKAADGKHKMYKPSFGYTFRRQGAFVVDKYDEVGGKVYNARATDINKEY LLMTDAGYLIKSAV >MS0541 unknown MTKFISLLERILAVFCVVLCIALVISVVWQVFSRYVLNAPSTVTDELARF LFIWVGLVGAAYGLGKKKHLAIDLLLMKLEASPKKYAFLQLIINLISIFF ITVIMCYGGMKLVLDTIAAGQISPVLGIQMGLVYLALPVSGFFMLIFSAR DLFAELRQLSAQN >MS0254 unknown MTKLIHLTQYKLIELTGVDSEKFLQGQLTCDVTKLKTGDSTLTAHCDPKG KVSSVFRLIRVAQEQFYLLFRTDLLPAGLDQLKKYAVFSKVAFAEPEVQL AGVIGENCGQFSASFVVNSGNAAILINPAERLEFNASAEAWDCVEIQRGY PILSAKTQNEFIPQALNLQCIEQAVSFQKGCYIGQETVARAKYRGTNKRA MFIFKARSQIIPEIGGEIEMRLENGWRKTGVILSAVNFGEVLWLQVVLNN RLEDGQQFRLPADETALELYPLPYELV >MS0579 unknown MRSKIRKFLPHFYCNQALPDITTGAISAPIYFAFSFSSPTKISQRQSKPN YIQSIG >MS0921 unknown MTTIYYILIAIAVLALIFGIILGFASVKLKVEADPIVDKIDAILPQSQCG QCGYPGCRPYAEAIANGDIITKCVPGGQPTVIKIAELLGVDAPDAEFTED NTPKVAFIHEDMCIGCTKCIQACPVDAIIGTNKSLHTVIPDLCTGCELCV APCPTDCIKMIKVEKNIDNWDWKVNPDLVIPVMNTTDGEKKLVVGK >MS0988 unknown MANRIRLHIWGDYACFTRPEMKVERVSYDVITPSAARGILSAIHWKPAIN WVIDKIYVLKPIRFESVRRNELGAKISESKVSGAMKRKSVADLYTVIEDD RQQRAATVLKDVAYVIEAHAVMTSKAGVDENTTKHIEMFKRRALKGQCFQ QPCMGVREFPAHFALIDDNDPLPLSQLSESEFNRDLGWMLHDIDFEHGNT PHFFRAELKNGVIDVPPFYAEEVKR >MS1608 unknown MVNLVIVSHSKKLADGVAELAGQMVTGGCKIAVAAGIDDEENPIGTDAVK IMSAIEEVFSADGVVILVDLGSAILSAETALDLLDPEIAEKVAISYAPLV EGALAAAVSASTGDDLQTVLAEAKAAGDLKLQQENK >MS1114 unknown MNKVSLLTLLIGGALAVQYANGSPIDERRENIIKYSRLGDGQLVEGTKQL IDLYNKTKDKKVRDDLITLLVRQNRDAEALSISETYKLTDFSSNELEYLA RAARNERQFSKSLAFYNQLNNLDTKNPNGLLGLALVSTDMAKFEQSKLYL SRYKHRFGTDEQYNQANAYFLDSSEPLITRFHRWNSELDTNPNDIELVKK LYRLAAQLNISPVQEQLIAKYPEVFTDNDKSWLLHDQAVRISKNSPNKQQ LNTAYSMLDKVYIKVPEDNSLKQQSLQDMVVVGSKLKNDDSNRAKNSYEL LTESNQPIPNYVKEAYADYLVASGSPFAALSLYKEVEQSHLAEGGEVPFT LGIKIVQALNDAAKYPEARDYLENNIGEPSLMVLDFTRSRKIENPDYGNY FSTKVSSLVAQGDLSSAMQLIDERLSVTPGDGWIMLTKAELEAARARTDD 
AADWVHKAQAFLPEDTAWAEVAQANLALSVNDWRTASRLVNTWTTEEKDN ANWFMEQYDQAKSARLVASGGISHRTSPAGENESNQEYYLYSPKTDDGHD VYIHYLTTKSPDDGLPFEQQRVGAGVEANFYPFMVNAEAGKGIKLNDKAY FAATIQYQLNQHWQFSLNGGLNSANTPIKAIYQDTYAKDLGFSVNYKYSD RFEAGAGITAMKFDDENLRKNLSFWSNFNLFKHNRWNLNGSLYGSYERNK AIPGAYYYNPLKSRSLEDNFDLSYYQPFDHSITLTHHFKAGGGYYWQDSF ASSKTWSVAYGQEWRLGKKLNISYDVGRKRSIYDGSPEFNNFINLTLSVS F >MS1974 unknown MKFVKNRAKEVNVGIYGIVRDGKQQLDVVWLDERDKVYQEKQLLPAAYSQ YEMINLIHKSLGYERLNAKFISVIPPHHIWSRSLFLPTILTHQECDQQCA YTLQNELPVPLDSVFYDYSATEVAEGTYLKIYAVMQKVAQEQVAGCAPYH INILDNAAFAVKRAFNFVMPEDFPEDTLFLYRDESISLAIQTKTEIEQRI LQLNQTGLSDLYTVFCRRYNEQPAHCYAYSNIERRDSPHWRLVETPYPFM ALGAALWSAEERKKEESEKTAESLH >MS1787 unknown MNIQKRYLQAEKEARWSLGLTILYVIGWCVCAYLPKGSAGPLGFPLWFEL SCIYLPILFVVIGYWMIKIVYQDIDLDHSGSSGKDKSAGENS >MS0194 unknown MTNGKRHNSPTYVSNKNVNFCKKGGYFSDLVYKNKSASWKNHQKRPHFFP IALIIDK >MS0770 unknown MQNLIKKAIEKIRNQVNKQFRRSINRKNQRLLTNHEMSVIASNCNGAFIL HDLAEQFRSPFVNLYLEPADFVKYLQNIHHYMQADLQFIKTDKAYPVGKL EDLTVYFMHYHSEQEARNKWIERTKRINLDNLFILMTDRDGCRYEDLSAF DKLPFANKIVFTHKKYTEFSSALYIPGFEAQSQVGDLFEFSGWNGKKFYD QLDYVNWFNTGKY >MS0357 unknown MKIQHTEDQQQGEFFILSETGEKVAKLTYFYQSPRVINANHTYVSDSLRG QGIADKLYQALIQLIKEKRLELIPSCSYIAKKWRRDHQKS >MS2328 unknown MSTVQQAYELAKKQFADIGVDTEQALALLDQLPISMHCWQGDDVSGFEQG AGALSGGIQTTGNYPGKARTPQELRADLDKAVSLIPGKKRLNLHASYLEA DHRVDRNEVKPEHFANWVAWAKANNMGLDFNPTYFSHPLSAEATLSHQNK EIRDFWIEHGKACRKISEYFGKELGTASVMNIWIPDGSKDFVVDKFAPRQ RLVEALDEIIAEKIDAKYHLDAVESKLFGIGVESYTVGSNEFYAAYAVSR GTALCLDAGHFHPTEVISDKISAVMPFVQHLLLHVSRPVRWDSDHVVLLD DETQAIAGEIIRNQLFDRVHIGLDFFDASINRIAAWVIGTRNMQKALLRA LLEPTDELRALENARDFGSRLALLEEQKSLPWQAVWDMYCERHNVPVGRR WLDEVRAYEKTVLSQRV >MS1775 unknown MVVNEESNMILGTLLFYLALTLSGFLVFFLAVSNKKKLSRNQDIISGMYP KEKNNKETQKNKNQQQVELEQLIITNNKFLNILSTIDKNIKVKLFITLIL TGIYALFNLDAERKSLAIAGAVIFVLVILIPGSLANMILKRKIKNMMTDL PGFVDLVAICVQTGMTINAALLRVAEDFKILNPDLSYVMLRIIRKAEIIG LPSALDTLAVSLPTREIRMFTTVLQQSLNFGSSIYSHLLQLSSDMRELQL LTIEEQLGTLSAKMSVPLILFIMFPIIILIVAPGVMRVFPNV >MS2113 unknown 
MKKDLIYRKRYLERVRPFIGKSLIKVFTGQRRVGKSYLLFQIMQEVQASD SQAHIIYINKEDLAFSHIKTAQDLAEFVLIEKKSGKKNYVFIDEIQEISE FETALRSLLLDDELDLYCTGSNAHLLSRDIAGSLSGRAIEINVHSLSYFE FLEFMRLEDSDKTMSQFLKYGGLPYLKDLPLQDNIVFEYLRNIYSTIAVR DIINRYALRNVQFLEQLTQFFASNIGNLFSAKKISDFLKSQRISANTVQV QNYAEYLANAFLIHKVPRYDIEGKRIFEIGEKYYFEDLGLRNALIGYRVQ DRGKLLENTIFNHLQIAGYDVKIGGLGTQEIDFVAEKDGERIYVQATLTI NEEKTLEREFGNLLKIQDNYPKYVVTMDEFDGNTFEGVECLSLREFLMLL MDSND >MS1946 unknown MMKQYYIGVMSGTSLDGVDLALMDFTLNPPKLMATDFTPMPEKIREKLTA LLRSGETSLRNLGEIDHQLGLLYAESINRFLQKVRLKSEDICAVGCHGQT VWHSPNCEFPFTMQIGDMNLVAAKTGITTVGDFRRKDMALGGQGAPLVPA FHQDLFFAAERLTVVLNIGGISNISVLEENCPTVGYDVSVGNALLDSWIE LHQGKRYDKDALWAKNGKISTALLTDLLAEPFFQQAPPKSTGRELFNLAW LNKKLEKFTALSQPMPSPQDVQRTLVEFTALSIANELKKLQKSDRTNLLL VCGGGARNPLIMQRLTALLAEWQVSTTSEFGLDIDYVEAAAFAWLAYRRI HNLPSNLPSVTGAKSEVSLGVIFPK >MS0913 unknown MSLKNCKWWDKFHLPVKKPEFSGFIHTLFGYYFLRINNPINKPKNAPIVE PIPAQHMRLGEDISACPGF >MS1778 unknown MLLLDKNQSNQNGARKIVVLSDSEEMQNNVSQLLRTRGFENVEQRKRHFL SADIAFSPEDIIGMIIDIKDETDVSLIAEHITAIVPQNLWICAVGNSDSI TLAQNLADTGILYFHADTQLHLMMEKITSSKISIPHTRHTVNVCVLGCKG GIGSSLIATHIANQIISKKKVPVLLAQGPNGSQDLDLAFDKKLQGDIAKY DEYLDIFNGVPQGLNDKVTEKYNFVIYDQPIFNIDKDLYPEFFKYSNSFV LVVERRIGALRVAKQFLEQCDRLRSLTNQAVRVFVCISDHKPKSEKLMAK SDIETLLGATVDAVIPFIKNTEAKTILALNLSKAHKKSFYTLAMKIIGVL SRNNLNNENKSLFKGLYRLLFNR >MS2382 unknown MRMTKSNSTRETFSGRRAFIFAAIGSAVGLGNIWRFPYTTYENGGGAFII PYLIALLTAGIPLLFLDYAIGHRHRGGAPLSYRRFSKHFEAFGWWQVMVN VIIGLYYAVVLGWAATYTYFSFTMAWGDKPIDFFIGEFLKMGDITQGVSL EFVGMVVGPLIAVWLVALGVLALGVQKGIARTSSILMPVLVIMFLILVIS SLFLPGAAKGLDALFTPDWSKLSNPSVWIAAYGQIFFSLSICFGIMITYA SYLKKEFDLTGSGLVVGFANSSFELLAGIGVFAALGFMAAASGHEVSEVA KGGIGLAFFAFPTIINEAPFGQILGVLFFGSLTFAALTSFISVIEVIISA VQDKLRIRRAKVTFIVGVPMMIVSTLLFGTTTGLPVLDVMDKFVNYFGIV AVAFVSLIAIVANEKLGLLGDHLNETSSFKVGFIWRLCIVITTGILAFML FSEGAKVFAEGYEGYPSWFVNSFGWGMAVMLVIVAVLLSRLKWKNEVQVS GE >MS0766 unknown MHSIEIKGRILIIFCSITSKIDILIKKYYIKLNNI >MS1287 unknown MNNSYGTLYIVATPIGNLQDITQRALDIFTQVDLIAAEDTRHSGLLLSHY GIKKPFFALHDHNEQQKADALVEKLRQGTNIALISDAGTPLISDPGFHLV 
RKCRQTGLKVVPLPGACAAITALCASGIASDRFCFEGFLPAKSKARKDKL QNIAEEDRTLIFYESTHRILDTLEDIEAILGAERYIVLAREITKTWETIT GDTVANLRKWLAEDPNRTKGEMVLVIEGKAKSDDAEEISPQAIKALALLA KELPLKKAAAIVAELYGYKKNALYQYGLEYLD >MS0997 unknown MLDHKMKNYYLQANLLLAILERNKNKCYFNSLIFCFPSSAFSKGTFFMSV FYDLFQ >MS0861 unknown MRMIFYFDKFSDLKMAKQDADYITLDLFANVPKIGRPKTNPLSREQQIRI NKRNQLKRDKSSGLKRVELKLHTDLVRQLEDLASQQQISRAEVIEKILQN YFNIQENR >MS0414 unknown MLHLSGSDAPITANELLAIEKRLNIVLPQEMKNLYLKFNGGQPTEYVHDD NYLYPIWAFSCLSEIEDDLQLIDENWCPNGFAPQELLPFAYNAVGGFFAL SLRKQDFGFVYFILIEEKIEIIGKWKNFAIFLNSFIEKTQIDEN >MS0393 unknown MALHKCPECRHKISQNAMICPHCGFSFETASLEKYKQTLEQRRLHNQQIN KKSAKLQFIWLIIFALFIALAGYFTS >MS1083 unknown MRSKISKFFTALCLYFKLKKIYKDMRYAFSHFCNFLQCFRFSFIQTK >MS0186 unknown MQGFFVTKFNQIKYLHLDLSSIKFRIETYFGVFLL >MS0362 unknown MKNDIKTLSLESSQSAKTNPIRLYSGGIGHSGKTKPAGAKFI >MS0573 unknown MGLFEAIFILFLLIVISAIISSSEISLAGARKIKLQSLANEGDTRAEKVL KLQEHPGRFITVVQIGLNMVAIFGGMIGESALRPYIQQTIHQYTNAPWVD GAASCASFVVVTAAFILLADLMPKRIAITYPEQVALRTVGVMSFCIVIFK PLVLLFDSVANGLFRLLKISTVRHDSMTSEDIVAVVDAGAEAGVLKAQEH YLIENIFDMQERTVTSTMTTRENIVFLNRTFDRQKVMETLTKDSHSKVLI CDNGLDRILGYVESHTLLTLYLREEQVSLTDQRILRKPLFIPDTLSLYEV LELFKSSGEDFAVIVNEYALVVGICTLNDVMSIVMGELVSSEEEQIVRRD EDSWLIDGATPLEDVMRALNIESFPDWENYETISGFMMYMLRKIPKKTDF VLYDKYKFEIIDTENFKIDQLMVSIRKDLNEQN >MS0894 unknown MNKLALYCRIGFEKETAAEITEKAAEKGVFGFARVNNDSGYVIFECYQEG EADRLAREIPFNQLIFARQMIVISDLLENLPPTDRITPIIEEYNRIGSLV NLHRTTELFVETADTNEAKELSVFCRKFTVPLRQALKKQGYLAFKEVKKS GLTLHIFFVKPNCCYVGYSYNNNHSPNFMGILRLKFPPQAPSRSTLKLHE AILTFLSPEEERKCMNESMYGVDLGACPGGWTYQLVKRGLFVYAVDHGKM AASLHDTGRIDHCPEDGFKFQPPKRSKIDWLVCDMVEQPIRIAALIAKWL VNEWCRESIFNLKLPMKKRYAEVQNCLQLITNELDKAGFKYHIQAKHLYH DREEITVHISVKK >MS1906 unknown MRSKLIKIICLRSRICVSETIHTLEKQAKLAIITNGFTALQHLRLQRTGL AQYFQFITISQELGIAKPDARIFEHSLQQADIEDKSQVLMVGDNLHSDIL GGKNAGLDTCWLSYDKANDSDIAPTYSIKKFNELLDVVAA >MS2297 unknown MKMNKKFAQIVKNPAFRNMVLKTIFNVTNVMSATKYLR >MS0865 unknown MNMIRIIRSFGLALSLVFAFVGSTFAADLPTEKSLQAQIEQLQKDEQTEV NKALVQNLQDAQELLAQIAKQKADNEKLNKDIDRSTRTLAESKANIERFK 
KQEKTVEQLKEDFRKLSLTTLQDRSESATENLQNLQAELLTLNANLSGQK TAPERAQAALTENLKQSQALNSQLSNVNIEKTLQTKLTAQLALLELKNAY NQILLYGNNALTNLYTSQVNEKTLEQTQLQKQLTALQDIINEKNLEKTQE QVEKATESQQKSAATNTNPVIVRELDLNTAVTKDLLEQTTKLNALSQDNL RVKGILDNLQQTQHNIEEQISALQGTLVLSRIINKQQQSLPQDSMIKGLP KQIADLRVKIFDITEFKNNVNNAPAYIASLEKSDKVTFTDAEKNQLTDIL AARDKVLTDLLKQLNGQLNVSINLELNQQQVQTISDALQSKLKQQSFWVK SNSNIDLKWLQDAPMLIRYQLRGIGNTFDFSNWRDNLVPAVFWILLLIAL TAIIHRKKEKIKQQLTRINNKIKSLGTDNQWNTPLAIFWTIILCLPSTFM FLAVFILVTYICFQDPTQVWPWGLKMSVYWLSFAFLLAMLRPNGIGYTHF GMPKQSNETFRKILKQSVWVIALLLNTSIFTNLEMGVTYDVLGQTMTIIV LIVTIFIVAPGFRKAISTYQEATNNDKQGTHTYVLYLIRAVLLLAPIILI VLIAVGYYYTALVLIEHLVATYFAVITWVIFRNIIQRAFSVTSRRLAAKR LQEKREQARAKAEASEHPEVDSGEVILEVKEETLAVSEVKQQISKITDFL LWLCLFGLLYWVWSDLVTVAYYLDGVTLWKQSVTTESGTVMESVTLFNLL IAILVLFATYVLVRNIGGVLEVLIFSNLKLSQGTPYTITTLLTYAVIALG ASFAFGTLGMSWSKLQWLFAALSVGLGFGMQEIFANFVSGIIILFERPVR IGDVITLGEFNGTVSKIRIRATTLVDFDGKEVIVPNKAFVTERLVNWALS DTVTRVIIRVGVAYGSDLELTKKLLLQAADDCDKVLKTPSPVVYFLTFGA STLDHELRVYVGNISDRNPTIDFLNNRINTLLAQHNIEIAFNQLDVFIKN QNADEEVKLGNEQLKLQK >MS1193 unknown MFKWQKGILIALMMVLSGCSAKPSQSLDIEKSNKPKLIVGTTSKDSREFI SCIDSKLANNENVRSHKKKSYQIKNGKTDKYSIISQKGYSYLLSVNCSKI QQTVMNFFYFPQQKENEILEPILACLSAVNSVNLKTYPIESVIRDMPNKF TALR >MS1924 unknown MKKLLIASLLLGSTSAIAAPFVVQDIRVDGVQAGSEGKVLAGLPVRVGQR ATDGDIANVVKTLFARGYDNVKAARDGNTLVISVEQQPVIADVTIDGNSS IPTDALKQNLDANGFKAGEVLNREKLEAFRQGIQEHYESTGRYNAKVETI VNNLPNNTAEVKLQIKENDVALLKGISFEGNQAFDSDTLQEQMELQPDAW WKFFGNKFENNQFGKDLETISDYYHNHGYAKFRVTDTDVQLNDEKTEARV KVGVNEGDLYTIKDARIVGDVAGMQDELQPILKTIHVGEMYRRGELQSVE EQIKAKLGERGYANATVNVHPDFDEENKTIAVTFIVEAGRRYSVRQIRFE GNTVTADSTLRQEMRQQEGSWLSSQLVELGKVRLDRTGFYESVEHRTEEV PGSDDELDVIYKIKERNTGSINFGIGYGTESGFSYQASVKQDNFLGMGSS VSLSGSRNDYGTSVSLGYTEPYFTKDGVSLGGNIFYEKYDNSDSDTEASY ARTTYGVNTTLSFPVNENNSYYMGLGYAYNKLKNITPEYNREKYMKSMGY DETGDWRFKAHDFTFSTGWTFNNLNRGYFPTKGVKATLGGTVTVPGSDNK YYKLNADVVGYYPLERSQTWVLSGKATVAYADGMGGKKLPFYQNYNIGGI GSLRGFSYGGVGPNAIYIDSNGNYTQLDSDVVGGNAMATASAELIVPTPF VAEKNQNSVRTSFFVDAGSLWNTHWKAEDKARFPTLPDYSDPSRIRVSAG 
VGFQWQSPIGPLVFSYAKPIKKYDRDDVEQFQFSIGGTF >MS0934 unknown MDSRLLEIIACPRCQGRLQLDKENERLICRFEHIAFPIVQGIPVLLVEEA VSLAEDPKDIT >MS2001 unknown MYLDSRYWQHNPRVADGADAFVQAFTQLAQSKPQARGTIKRVIAEGDYVV LHVHRQDTPDDLGRAVVDIFRLDKDGKIIEHWDVGQAVPEKTASGRSMF >MS1810 unknown MKNYSETIIIGAGAAGLFCAGQIGKAGKSVTVFDNGKKAGRKILMSGGGF CNFTNLEVLPSHYLSHNPHFVKSALARFTQWDFIAMVAAQGIAYHEKESG QLFCDNGAEDIVKMLEARCTENRVSIQLRQRIDLVEAVHNDENARFKIQS GGQTWYCKNLVIATGGLSMPALGASPFGYQIAEQFGLNVLSPRASLVPFT YRENDKFLTALSGISLPVRVTAQNGKSFSNNLLFTHRGVSGPAILQISNY WQPNESVEIDLLPTDSIEEYLSQLKASSPKLQLKTALSRILPKKLVELWF ERQLLQDETLANLSKVRLKNLENLIHHWQFQPNGTEGYRTAEVTMGGIDT KEISSKTMESQKVKGLYFIGEVLDVTGWLGGYNFQWAWSSAYACAVGITQ TE >MS0171 unknown MYAILLQIKTFKTIFSPETLRYTMRAFSGQGEIPYWWYMFT >MS1213 unknown MCQNRLNLSTLPTSTGATSTEATATKTATGRSTSAETAKTSGTKTA >MS2150 unknown MLTQSPAFNPQSQLSRCGAKNQAIGCNGNFMARCTTGREQSVSLEVFFIK LPEKPYS >MS0081 unknown MTEAIKIINDDVKIVLAETIADYEKRTGKTLRPAHIERSIIQSYAYREQL VRQGINHAFLQTFPQFATGLALDLCGEPMGCYRLSDLPAEVTLRFSVEGD HDAVVIPEGTLVAATDNVVFATDTEVRISSTESYVDVVGICQITGAVGNG WQLGQVKTLKSTLDAKVTVSNIDVSDNGIDTESDDDYRKRILLAPEAFTT CGSVAAYEYHTRSVSQYIADVDIATPVGGTVQVTILTKQGLPSSILLNKV KDHISGEKLRPLCDTVVVSSPERVAYSVVANLDLLETVAESDVKVQAEAA LRAFISSRTQLLGADIVPLDIQAALKVAGVYNVTLASPTLTKLTKQQWAE CESITININGERQDG >MS1544 unknown MFFHNFHNVLGKGHFVHKISLKKNRTLHKKVRSIFR >MS1402 unknown MRYNSSFQITLSRQMNKPNITIQPIQASHYADYVALIGKQLGEGYFKQAD FEALANNPQAICFEAVDEQNQVVGVITSVTLDRESALALLKIQAQNTPDY VLQSDRIGIFKTIAIDENRKGCGIGSALVRKLLESFKQAGLNSIACVAWQ YGETENIRGIMQAFDFTCYEKIANYWLDDPEPFICPACGEPPCRCQANIY FRQI >MS2377 unknown MSENKKQLTARDIRATYWRSTFLLGSFNFERMQAMGFCVSMIPTIKRLYS QKEDQAAALKRHLEFFNTQPWVGSAIMGVTAAMEQERANGATDIDDAAIS GVKVGLMGPLAGVGDPIFWGTLRPVLAALGAGLAISGSLLGPLLFFIGIN ICRALTRWYGFKYGYAKGTEIVSDMGGGRLQKLTQGASILGLFVMGSLVS KWTSINIPLELSRYHNAMGEEVVTTVQSVLNDLLPGLAALLLTFFCMYLL RKKVNAMYIIFALFGVGILGYWLGILA >MS0084 unknown MSNVPKPDFSLCYEKTNITADIEPRLVQFTYTDHLEGQSDELTVEFEDIS GKWVRQWFPTQGDKLRAAIGYKDSLLVDIGEFEIDEVEYRYKPSTINLKA LSTGISKANRTLKPKAYENTTLAQVVAKVADSLKLKLVGKIKAIPIKRIT 
QYQERDVEFLARLAREYHHSFKIVGSQLIFTDKTELGKSEPVLILEERDT ISLSLRDRIKDTAKAVDISGFDASGKKVVKKRKKATALRPNLKQVKASSE DTLKVVTRGETQEQIDARGEAALAEQNDNQTAGNITLIGNPELVAGATIL LKNLGVFSGKYLIKSSRHSFGRNSGYTTEIEVRMLEFIADDLITLGMEKT NANA >MS1963 unknown MIKKCHKVLTLLIVFWSRRYFKVEYGYSLYQ >MS1848 unknown MQNFDRTFLLFRSYIGTCRRFKLEKFYFSGKISKNLPIS >MS0141 unknown MKQPAIFVGHGSPMNVIEENNPFNQKFAEITRTFAKPKAILCISAHWYSK ELEVQSGANPKMIYDFYGFPPQLSRVQYPASGNPRLAAQIQQLLAPEEVR LNPDRGYDHGAWAVLKHLYPEADIPVIQLSLDRTKPASWHFALAQKLKSL REQGVLILASGNIVHNLSALSYEHINRLDAGYDWAYEFRDQINRAIAGNN IELLTHIERLGRPAMLSVPTPEHYLPLLYVVAMREEQDNVELFNDHLVGG SLSMTSVFIG >MS1160 unknown MILISFLNFVYFIKRLKFPMIQASNLSGLKKCGRISPFFYDLD >MS0101 unknown MKKNLVSEIATRARSIDFWAFGYYLPNPDPILKKMGKDIAVYRELLSDGQ VRSGVRRRKAAVKKLEWRITTTNNAKVDEQLERIFSRLKMNHIITEMLNA ALYGYQVSEVMWGERDGLFVPLEIIGKKPEWFVFDEDNQLRFRTKENWVT GELLPEDKFLLTTQEATQDNPYGLGDLSLCFWAATFKKGGLKFWLEFTEK YGSPWLVGKHSRQAQQPDKDRLADSLEAMIGSAIAVIPDDSSVEIIESSG KAASADTYEKFLKHCKAEINIALLGQNQTTEQESNRASAQAGLEVAEDIC ADDRAMIEETFNTLLQWIVKYNFNVEQLPQFEFFEQAEINTTQVERDTKL HGMGVRFSKTYFQREYGFEDGDIEIQQAQSAVKNPQVSEFAEHNQQGLHP IADGIIEQLEIEGESQVDDWLQTVKDRLAKADSLEDFRNQLDSLIPELTF AEYGKLLAMASTVSELAGRQSVNDERKVKGDE >MS0298 unknown MATNYYDITLALAGVCQSAKLVQQFALEGKADEEAFNTSLYTLLQTTPKD ILSVYGGHERNLKLGLETLLEQLNGSTEDITRYWLSLLALSGKLEKNAQA KSELARRIQYLPTQLEHYDLLDEQMLANLASMYVDIISPLGNKIQVKGSI EVLQQTSMHHRIRACLLAGIRSALLWRQVGGSKWQLLFSRRKIFNMAKQI YSSL >MS0343 unknown MKKLILATALSSVAAFTQAQIVPNANSATHTYEFTQSYDLQVPKGSSGET KLWVPLPFSNDYQDVKSVEFDGNYQQAYITENNQYGAKTLFALWDKDAQK RDLKVKLVVTTKDREPMKQGLLENYQAPENIEYSVDVQQYLKPTQHIKTD GIVKQFADKIVGKESNPLKKAEMIHQWIVNNMERDNSVLGCGDGDVEKIL TTGVLKGKCTDINSVFVALARASGIPAREIFGIRLGQAVKMGEYSKGAFG SAKDKVANENGGQHCRAEFYLAGFGWVPVDSADVAKYRLTENKSVEDKDT QAVSQYLFGNWEANWMGFNHARDFNLYPMPELAPLNNFGYPYAEVGGDPL NSYDAKKFGYEFTSKEL >MS1206 unknown MGYFQVPKVRSKIFKFFTALLSSLHYATQILKYKYLKNIFVQKDYFFSFL KNINQTNTLVLYANKINIAK >MS2289 unknown MIIRIAKKQDYPQIIDIYNQAIPSRRITADLEPVTMESRKDWFEFHLHSE RHPIWVLENSIIKNNQEEKQILGWCTFSPFYPRAAFDNTVEISIYLDNKA 
KGNGYGSKILQFMKEQMMCRDINTLMAYVIEENNISRKAFEKQGFKLWGR YPNIANMGDCYQTFLMYGYQSGIKNS >MS1909 unknown MQFIKNGRQYREATSQKISWGHWFALFNIIWAILFGSRYAFIIDWPSTLW GKLYFFISILGHFSFVVFAGYLLIIFPLSFIIKNERTFRGLSVIVTTICL TLLLIDTEVFSRFNLHLSSVVWNLLVNPEDGELSRDWQIFFAPMPLILLV QMLYSRWSWNKLRSLERQKWMRKVGIFFVTMFVATHLIYAWADAYIYRPI TMQKSNFPLSYPMTARTFLEKNGLLDKTEYAQTLEQEGRPEAFNIDYPKH KLAYMPIERKPNILLINISGMRYDSVIESKMPNLTEFAKQSAQFMNHYST GNNSNLGLTGLFYGLNASYTDSILHNKTESELFKKLQAEHYQMGLFSANN FKDSLFRQALFQKVNLPRIKAGNQSAVKNWLIWLNKAHLDQAWFSYLDLD VLTAVQNADPKSKEEETEIYDNQLGNVDVQLQIVFEQLQERGLLDKTIVI ITADHGHAFQLSDKEHIDYFGLDEIQVPMIIRWNALLNEQQSKLTSHVDL VPTLMQNVFKVENPITDYAQGESLINISRKADWILVGNYRWNVIISPNGN QYHIDRKGQYQKYNVDYEKESSLRPPLGLFLEVFTQSRSFMAK >MS0170 unknown MRYSLFLHIMKYKYDKNAKKLTALCIRDIFVL >MS1935 unknown MIIPWQELEPDTLINIVESFILREGTDYGMEELSLAEKRDNLLKQIHSGK AVIFWSELHETIDIKTTT >MS1603 unknown MRRTFSAEYKAEAVKLVIERGYSVSQACRELGVGETALRRWISQVQAEQQ GYVLAGSKPISPEQQRIRELENRIKELEEDKAILKKATAILMSLENKNTK SLRR >MS1513 unknown MMGSYYTNIYLPCNPKMKKTDFLPKKYQILLFWQTNMPPIIFGHHPI >MS2273 unknown MLELAYLQTLPQQRALLKADYADFIVKEDLGYAMTGEGEFVALYVRKTDA NTLFVGEQLAKFVGLSPRNMGYAGLKDRKAVTEQWFCLQMPGKAMPDFSR FNMAGVEILQVTRHSRKIRTGSLNGNHFEILLRNAVETDELKVRLENIKN FGFPNYFTEQRFGKDGHNLTQAMRWANGEIKVKDRKKRSFYLSAARSEVF NLVVSERIRQGLANQVLAHDILQLAGTHSWFTADGKEDLALLQTRLENHD LQLTAPLIGETQQLACELENKLVERHQSLISLMKRERMKPARRPLLMQAR DFHWEFVENGLKLKFYLPAGSYATALVRELVNIDENE >MS1560 unknown MRSFSIFLGVMIRPTGNVMINIQPSALMQSATWAQPIEMLFACHGRVKNF CRQLGMLPDYLAENGVNQAVKNDVKQIITYFNVAAPLHHKDEESDFFPAL LHYVPEAKTDILKLEAEHIGIHGIWEQLGVQLQELIDEKRTTIEQSLLDD YRAAYERHIALEEPLFELGQKHIPAEQLTAMGKIMAERRKVKNS >MS2083 unknown MQIFIYQELKMKKFISLLILTALCASVTACGVKGPLYFPEQEQPKQEQAE >MS1453 unknown MKFWLEIPIIKNLYRLIRKVDEKNCFKNDRTFRIGIKFDRLGGCGR >MS2000 unknown MNAEQRVIVTPTFWGKFFGKIKSVELQQNKVIVTDKKNNITEHDLGKTFD FPAIQKSFFGTKLFFKDDSTEVVLSKLAKKQTDSLLLEIEKVVASNIKVK VKEGFQHFANLAENQYLRDSDIPTLNDRVRLSVLSYGDNKEHFQKYFDES LVKKIQYISSLLGFVQLLHNTVSISFRHNKLPICI >MS0801 unknown MKSIVKVLGILVIAGSVGACSNMSKTQKNTAIGAVAGGVVGHAIGESTGA 
TLGGAALGGLIGSQVK >MS1257 unknown MFFALKEKCGYFFEKFCLDMNHGSKNETGIAGL >MS1972 unknown MLLNGGAMVKQVWIKQLWHRCIQASVLKQNTGLLILALFGLFLPLNRLYS SWEQLIRLENNINEQQRQTIYQQRLLQSLEKKAKNDLLTPQSAALLSQIN QYVQSSSVNVKIQNAQWHFSSSAVLQLRMEGDFLSLNQFITDILQKFETL RLSSLKLFKPDENLAAYLTLRLQLTKE >MS1320 unknown MRLYVAFYKIYKIKFLFKFMTKITFLTFFSDENQQK >MS1765 unknown MQIMKKIMALLVAGAFIFTLSACETTKGVGKDIQNAGQKMEQVFN >MS0135 unknown MIKLLKPVDVKSNEKQALLWSWLYVFALFLAYYTLRPIRDELGAAGGVTQ LTWLFTGTLVAMLMLTPLYGYLVKHWKREKFITISYRFFMLNLVVFAMLM AMATGDVLVWTGRIFFIWVSVFNLFVVSVFWSLMADIFNTDQGKRLFGFL ATGSTIGGIAGSAFVSFFADVFSNYILLLMAILLLEMSVLAAKKLSKLGE IELRASNSAGRFNQEIGGGVLDGLKRTFQSPYLLGISGFILLYSITSTVL YFQQAEIVNSTFSDRAERTAFFANIDLWVNSLTLFFQFGLTGRMMKYIGI LPVLSLLPLFSVISFAALAMNPTIVVFVLVQVSRRVANFAFARPSREVLF TRLSREDRYKAKNVIDTLIYRSGDQIGSWGYAGLGALGLSLTGISWLTVP VCVLWFGLSIWLAGKEDVD >MS0446 unknown MKINSFIKLFSLESPMKIELPKIFIISLKNSPRRDVIAQRFNALGIKFEF FDAIYGKDLSQEELSKIDREFAVKRFSTKKPLTLGEIGCALSHIAVYEHI LKNNIEQAIIFEDDAIIHHEFKKIVEETLSKVPSRREIIFFEHGKAKSWF CKRSIHEGYKLVRYRSPSKNSKRCIFRTTSYLITLSGAKKLLNHAYPVRM PSDYLTGGLQITQINAYGIEPPCVFCGVDSEINAIEDRYN >MS1394 unknown MKLSILNLVPVREGQNYQQAMASMVTLAQYAEQIGIERYWIAEHHNTKNL ASSATALLIQHTLAHTETLRVGSGGVMLPNHSPYIVAEQYGTLETLYPNR VELGLGRAPGTDMRTAYALRKGREHSDFPTEIAELRGYFENTNPVSAYPA AGLKVPFYILGSSTESAYLAAELGLPYAFASHFAPRMMEMAVEIYRKQFK PSPHLAAPYVILGVNAIVAQTDEQARQLATTQTQFFLNVVTNAQQNLQPP LASADDVWKRHLSAQFPPHFGPVDFQEIPLYNQERAVVEQMTACSLIGSS ASVTHQLNTLRDQVHFDEIMAVSYIFDEQLQRLSYKMLKEIVDKI >MS0313 unknown MKKISLFLTALLAASSALAANNQAAPQQENAKTEFMFGAKAANDPVGIWQ KDGRHFSKKDLSKQFCWTLTNFRSDSGNVNITITLTSPKNTNFNLGEHIS KNTTTHIFNFTYPITQTYYNCWAFEESDPEGKYTLTVKANNTTFPTQVFT LTK >MS0750 unknown MDFQHNREQFLNRLAAKMGKARSFSPQAMEEPVNRYPTERLTELSQAQLC EEFVNFAKVMMVDVKVCPESDVVSSALSLCEKYGGNSVILNDDERLTRLG ITQALQEKYPCHIWSPETGQQNIDKAEKANIGVVYAEYGLAESGGIVLYS QPERGRAVSLLPEKSIVVLRKSQVLPRVAQLAKVLHDKAQKGERMPSCVN IISGPSSTADIELIKVVGVHGPVAKIYLLIDDL >MS0325 unknown MRKQKIVNIKDLVESSDLSKIMQKGLFLNRLNQQLQQWFPSQFKGMYRLA NFTENGLHIEVANAVVRQGFLFRRQELLQLVQKEYPEITRLNFKINPELN R 
>MS0158 unknown MVIRRKCGSFFAGNIMLSIKRPFTDEDRAILNSYKAVVEGVSALLGSHCE ILLHSLEDLDNTAVYIANGHNTNRQAGTTLSEADLQSLQAMENGMVLKPY FTRHKGNNGLMKSTSIAIRNGKRQIIGLLCINLNLEVPVSQFIQAFIPTQ DYPVTTAGNFASSVEELVLQTVETTIEEITADRLVANNNKNRQIVTTLFE KGIFDIKDAINLVAERLNISRHTVYLYIRQIKQDDQK >MS2055 unknown MMFPPSHGLFNVGMKKSAVKKTKILLNKVRK >MS1887 unknown MKKLSLALSILLLAGCMGTELSTKDKTYNASTDARIRIFGQNGRPSTLTI EHNGNKEKITIGGGVGQAFSSLVGAKGNESIGMPESVYSKDPSQFSNIGS TPFFKEFIIPANAKVNVKNEIMSAPHIFKDVTTGKTTTTYYKCSGGKEIS FVAEAGKNYEVIPSSSTNECGVTLNELN >MS2013 unknown MGTLETTVKNRSMKITKIRPQKRNSCSICGKAQVTRKFQEEYYCANCYAQ WFKKKTCKQCGQLKRIHREGELCLECEKLTDCVRCGKTSGTFEIGMISRY GAVCSSCTRYFREEIECSECGKMTRDRYRSLVTNESVCLQCYRRYTFATC KNCRRYRKVHNQEKQLCKKCDEKLLSTCSKCKGEMPSGYGNVCPDCARRS LLFNMIRLNGHILRNKAVKTAYKKFIFWYMRKCGISVVLHKGSDFMRFFI DCDDIWQKIPDYAELVTHFKPNGLRANLTVLRWLLDTNQVVVDEALKDDL AEMQRIQSLFNKLKESIPCIASYYKLLQRRYDDGKTSLKSVRLALQPAID LISSQAVTDYPTQEQLNNYLSEKTGQIAAITGFINHLKSAYRRELKIDRK LIQQMKAKQLKKHYSKRLVELYKQTELTTAEQMDLLSVVLYSLHGIEIKK PKFDAIVLIDGVAYYRDNTKDYFLPQDIYLRIKPQF >MS1143 unknown MLIGLFIGLLFGFFLQRGQFCFVSGFRIIYTQRNFRFLTALLIAVSIQSI GFFSLSGLDLITIPNTPMPLLATLIGGLLFGIGMVLANCCASGGWFRTRE GAVGSWIALICFALTMAATQTGALKQWINPLLLETTTLDNIYNTFNLSPW ILVTVLVLITVVMIVYHIKHPRYQFPQEPTTALIPHRIFTKHWHPFTAAV WIGLLGVLAWLVSEQYGRSYGYGVAVPTANVVQYIVIGQQRYLNWGSYFV LGILLGSFIAAKLSGEFEIRLPEPKAILQRMLGGVIMGIGASLAGGCTIT NALVSTAYFSWQGWLATLMIMIGCWLTSVLVKPTQCRI >MS1747 unknown MKQKIVLATGNKGKVREMSDVLADFGFEVVAQTDLDIESPEETGLTFVEN ALLKARYAAKVSGLPAIADDSGLVVEALNGAPGLYSARYAGIDGETADAE NRRKLLRDLADVPVGKRQAKFVSCIVMLRHETDPSPIIAEGECIGEIIFA EKGENGFGYDSLFFTPEKGCTFAELETVEKKKISHRARALAVLKSKLGA >MS1907 unknown MSKSTCRAYFFIGGFAMDKMVIWLLALIIAAPVLVLAVSPTLNKMGNQVG NMGNNNSAAFSQQGNSVHGQDGSIYNRVGNTTYSNKGTVYYNTGEHTYAS DGSYCTKIGAVTQCNKPTK >MS1646 unknown MSNILNWPDYKVLQVSELEHDYQVHAEVSEPPTQCPHCNHPEIVGFGRRD EVIMDTPVHGRRTGIMLNRRRYRCQSCRKTFLEPVPHKDEKRQMTNRLIQ YIERESLRRTFSSVAEDVGVDEKTVRNIFNDYCERLEKTLNFEMPQWLGI DEIHIIKPRCVITNIQQQTIVDMLDNRNKTTVTRYLSKRTDRDLVRYVAM 
DMWRPYRQAVETMIPDATVIIDKFHVVRMANESLERARKAIRSALTPQQR RGLMRDRFVLLKRRHELTDAEYMRFSGWTLNYPEIGQAYELKEAFFEIWD CQTRHQAQEAYYSWLRQITPEMKAHYDPLIKAMGNWHDDIFAYFDHPITN AYTESLNNLIRVVNRVGRGYSFEALRAKILFTEGFQKIKKPRYQRQRIPE GAMGRMPFYGVAEAGPSTNYGADISTLVREIEAGRL >MS1136 unknown MHNSIKKVRSVFQKFYLTSHHNHSADSKTSGNLTALYHNKLYK >MS1740 unknown MEVVIMAKGKKIQLTFESFIDSDTNVKVTRLTPKDVTCHRNYFYQKCFTQ DGKKLLFAGDFDGNRNYYLLDLQSQEAIQLTEGKGDNTFGGFISHDDKFF FYVKNESSLRKVDLATLEEKVIYTVDENWKGYGTWVANSDCTKLVGIEIL KSCWQPLTDWDKFKAFYHTNPTCRLIKVDILTGDLEVVLQDNVWLGHPTY RPFDDSIVGFCHEGPHDLVDARMWFVNEDGTNVRKAKEHQEGESCTHEFW VPDGSKMIYVSYFKGQTERVIYSVDPNTLENTRLITMPPCSHLMSNFNGN LLIGDGCDSPVDVADSDSYNIENDPFLYLFNIEKQRTVKLAKHSSSWQVL DGDRQITHPHPSFNPNDSAVLFGSDFEGRPAIYLADISQLKD >MS2311 unknown MIMQKLIKLFFSGILLTLSIQAAQAETQYVTENLNTYLRKGAGDNFKIAG AIQAGEAVSVLDRKEKYSLIRDSKNREAWILTAELTDTPSSKEENPRLKA QVQELTSKLNRLDADWQQRTTEMQRRTKDADQKSSQLLEENSQLKRELEI TKNKNRDLEAMLDAGKREIAIQWFIYGGSVLGVGLLIGLIIPLILPKRRR RDGWA >MS0738 unknown MNQFITLLLSTWGILSIHQISRRQSVDYMQTAKSTLGLIFGVIILNILIA LPLMGGLINIIPAAINPAAASAGIIGFALMIFGVYVYVRLCLAPIHYTVS KTNIFASLQQTWQLGNKRTSTLFLYCLLVYFIVPFIAQQVAFLANNTFLN IITTLIISFLSVFTLVVTYRFYILFTQKA >MS0725 unknown MMIYRGETLVGLLISMTLSAFLILIAVQFYVYVQHTNLQVMQRLELQAEL QSILQIIAKDLRRTGFDLPYSEPEKIKFDHFSKESPNSCVIFTYGLGESD KTKLKKQNTEEDTKVVLGYRLYNQRLEAIPQAKKTNTERNEKTLVEGCSL RLGWEQLIDSDKFAVSQLQFKWLVEKKGIEIYLKGYLKQQKSLFYETSII LPIMNEVMWDENL >MS1572 unknown MENVNKQSFQDVLEYVRLYRLRNKLLRDIGDNDRKIRDNQKRVLLLDNLS QYITNDMSVEDIRAIIENMRDDYEGRVDDYMIRNADLSKERREIKEKMKA QKKAHAELLKKADD >MS0120 unknown MAILPEVLMNIALDVKRAKARGDKLEPIYQRGCELTKLSRATLIRQLKPY LPPSGRKVRSDKGTNQLELAELKTISAAWLENRRNQYKKRMLPLDELLAM LRANGEIKAEFVDKATGEIRPYSESAVSRALINARLHPDQLLKPKPAIRM RSLHPNHCWQIDPSLCVLYYLKRDHKQTENGLQVMEAKRFYKNKPANVAS VESDRVWRYVITDHTSGVIYVEYVYGGETSENLCNTFINAMQRKPHGDEP FCGVPKMVMLDPGSANTSKMFDNLCYQLGVKLQINEPGNPRAKGQVEKGN DIVERQFESRLRFKSVANLDELNERAHEWMRAFNATKKHSRHGMPRYKAW LHITKEQLVLAPSLDICRELMVSKLVERQVDGQLQVKFEGLTYDVSGVPN LNVGDKLRLGKNPYRPDCIQVECFEQVFDENNEMSLKPYWFVVEPIETDK 
FGLDVNAAVIGESYKSHAKTTLETNRETVERLAYGATDDDGVKAAKKANK PLFDGRIDPFKTIDERPDVMFIPKRGQEHELTTNARRVEQKPVGLVECAK QLKARFPQWNGKHYKQLATHFADGVPAELLETWLQDEKLPEILNPETKIL KLSAA >MS0319 unknown MAIKRNQRQRKKMHLAEFQELGFLVNWQFAENTAIEQVDEVVDRFIRDVI QPNGLAYEGSGYLQWEGLVCLEKLGKCDESHRELVKNWLESNGLQQVEVS ALFDIWWDYPVKEA >MS1279 unknown MVTGVMFEPEPIYDELDKKPAELTHDQPLGFTDVVDNAKKEAKTTKTANK KTKDKKSASHLRIVK >MS1785 unknown MVINMNNAQVDELVVKHLKANPQFFVQHIDLLDQLIIPHAQKGTLSLVEM QLERQRERIKELEAELALFADLAHQQQDIFLALMPLQKRLAQCKNFPEGV EEINKWARNFELQQAKILLFNDCWQKNPSVGEEFWIDRKGFELIRLERIG LRHFYLGELTNKEKSLLFLPEELPVGSIACCLLGMKKNQHKSTALLLFSA RDTAHFHNGQDTAFLKHLVSIVELHLHRWLMIYQQAE >MS1332 unknown MVNIYGMTLNNRPRSLIMSKTTLSLLISATLLLSACNDEEVRSLKEQLQT SRQQIAQLQAELQQTNATSTIKADSAQPEAAISPTEDTAIQGKIIQDEIP TLYVKPVTVFDKTEKFNFNVSKKPKNNEPLYEESHIHYAMHTVETGIEWL DSLLYQNLMADITIEDADKQKEFESIPNAKDRYAAFIEYFYNNALPEIKN GTTLGSDYYINLDYVGQRENILTFKVANYMYDGGAHGMYSTDYINIDSRK KAVIDLNTLVNKDKQNQLKELLWKSYLAYTNNDNSDIPFTEKQDFDISQQ FYFSSEGVNFVYPPYALGSFAEGEITLSISWSDAKNIINKDYLREGFTIQ E >MS0409 unknown MTSEDPIYEKLNETTSIRGFITACVAIFDESVDQLINRVFRKTDFAVKSV VDSLFINSGPLFDLSIRLKVLLGLGIISHETFMDINAFIQLKEALNNDGK EYEFFDPIIISFIQGLNVRQDKSFLNLDTKIDGTKDSLLYQVKVLRREKL IRSYLILSVTDLYDQLQVESPL >MS1628 unknown MTRFTMKKNIKIILLILYILSPIDLVPEAFVGLLGLSDDLIALILLIKQI LKK >MS0739 unknown MTINFQQILQDSWNFIRNQRKFTLMLTLTFCLVTLILNIFGSSLFQSVTE TAINEPIDKNELSTMMQRVQKAVTFYYYM >MS0042 unknown MTALYNFIQAIRLKSAVILWIILLKPYYVYTKYRNWALRQFSNYWHNILQ NN >MS0962 unknown MVFPEYKVRIPQFKCFNQKIQFFDRTFFIILGLTMSITLSTKQKQFLKGL AHHLNPVVMLGNNGLTEGVLAEIDNALNHHELIKVKIAGSDRETKQLIID AIIRETGSGAVQTIGHILVLYRPSEDIKIQLPKK >MS1211 unknown MKTLCKSSTRFDKQALSLGFSELILQENAARGVAELVRQKLQTGEKILFL CGGGNNGSDAIACARMLSGDYECELYFITENLNANAQAQLNIALKVGVNR VSQPDLANIGCVIDGMFGSGLSRDLDAEIIRLLDRINAHNALKIAIDFPS GLDGNGNIRGACFKTDFTLAMGALKIGLFSDVAKDMTGEVSLVNLGLSDN RFITNQEDFLLERTDLNLPSRTLQNVHKGSFGHAFVALGQMPGAGIMAAT AALIMGAGKVSVVGKAENLTPQIMQKNNFDGAGAVAIGIGLGNADIDISA IKHLPCVLDADLCYRAEIREFLPNPSAVFTPHPKEFSGLLRNLGLADISI 
EEAIKNRFELAREFSRQIKGVLLLKGANPIIAYNGTLYLCNLGSNKLSVG GSGDVLAGIILGYLAQGFTALEAAQNAVLAHAKSAENYQGNDYSMTPLDL IDGLRYL >MS1616 unknown MNKFMSEEQNSKFLTALFSIMPVVLLLGIDIYTMFLQTEAKAISHFNLGV LAAQFICSLVFLKGRICNGQRGRLTQAVMYFAVYWLVWLLLSLFSSYHFI LTDMLSVAGLLMLFSMWRQPLEPGSRRLMLNMGALAGLLGVICFFVQLAE IPVLHWVQYNFFGQALAGVILANLLLVISRNRLQSFMALLPLVMSVLLVL NSIFTLAVLAYGQLGSAVVFANNFAFVLYFLLHLVMIAILAFHIFRKAKL SYNTLMLLLVISLSLPLWASFAYLE >MS0072 unknown MFELLLQGISSLFTITGLSCLLGGVFIGIIFGSVPGLSATMALALFLPIT YALDPNMAVILLIALYIGGISGGLITAILTGIPGTPSSIATCFDGYPMTK RNQAFKALGVGVTFSFLGTIFSTIVLIFLSPILAKIAIKFGAYEYFAVAL FSLSMLVGLSGENIWKGLISGLMGCMFATVGMDSIASVNRFTFGSEEIAY GFDVLPVLIGLFAINEIIAKADTVKTEHQNMQVITAIRMEKGLGFSLKEF FGQIKNFFVCSSLGTGIGILPGIGGGAANVMAYTVSKSISKHPEKYGTGI IDGVVASESSNNAAVGGALVPLLALGIPGDTVTAILLGGLTLHGIIPGPL MFTENVGTVYAIFTAMLFGSVVMFIMEFYGLRLFTKILSIPKHFLLPAIF LFCIIGAFGVRNNFFDVWATVLFGIIGFSFYKLSIPAAPFILGFIIGPLA EINLRRGLMFSQGDFTAFFKAPIALTFFILTGLVIVFAVKSRIKKHN >MS1522 unknown MFKVQRIYDFEPMENDCAVFVDRLYPRGVNKEKFAHCLWLKDVCPSHELR RFYHENPQENYGEFVLRYQLELGNELPQKGLIMLKRLEKEHPQVILLTAV KDVRHSHIPVLLKALAAIVEFL >MS1954 unknown MQPYSKSLKELSQKLRSDQTDAERKLWQRINREQLLGFRFNRQKPLLNYI VDFYCPKAKLIIELDGSQHYEPDYQGKDALRDAELNSLGFTVMRFSNDEV YYEIEAVVDQIYLFLESIDHDRAD >MS1796 unknown MGLCLKLPQASGKVFSIYAYFKRNKRFTKKFF >MS0908 unknown MLMFELTKLITNMLLPPFNILILLVLSFLFLAFKFKKLAALCALSGLTIL YVFSIPYTAQLLNDSLTTEDNLTVEDYRSAQAIVVLGAGLRDSKELYNKI TVPGIALERMRYAAYLHKETELPILISGAGPNGNSEAKIMGQEFFTFFGV QPKWLEERSTNTKQNALYTREMLEREGIKRIILVTNQWHMQRAKLLFEAQ GFEVLPASVGSGVTPESYELNAMHFIPQAGAMAANMQLLKEWLGFIKEKL >MS1549 unknown MILTRYLTKEVFKSQVAILFILLLIFFSQQLVRVLGSAANGNVPADLVLS LLGLGMPAMAQLMLPLCLFIAILLTFGRLYAESEISVMRACGVGQRILVK VALGLSVLTAALAAYNVLWVSPWAIQKQGQIVEDARANPNMSALSAGQFM TSNDSDFVLFIDNIKDNKISNIYLFQTKEKGNSKPSVIVAENGELQSLPN GDQILSLQNSQRVEGSAALPDFRITNFTEYQAYLGHRNVDSDENETTELP LAELLALKTPAAKAELNWRISLILAVPLMALLAVPLSKVNPRQGRFAKIL PALLLYLIYFLLQSSLKSAGGAGKLDAGLLMPLVNLFFLLLGIMLNSWNS AFMYKIRHLFSKKSAI >MS1733 unknown MFLPELTGLCLFSHDDSRKALSFCSLKNWKQAEKLRDFLKRKSEYKKSWP 
GKSGQVFSNNERHRYKNT >MS0211 unknown MKFLTALFYSFFLFIAKKYSKRYNNQPFLRNVAGGKVSSAFCVIGIT >MS1350 unknown MLNKVFLSGVALLLAGCAAEQAPIPAQFAGADYQLSDKDAKQWVALGKRA ESCIYPNLTRIQQEHFAKEDSYIYSQYVFFYPLEDVIGSDAVKIIEADQQ SMDYATYQFKKFKQSDELPKLDELTTAQCNTLRIKAREDLAVVKGQRISA MVEDTNTTGGTSNANKVGTEDNKFFFDIIKWGSALLL >MS0215 unknown MFIYAGHKTKSAVQNFQIFDRTFIPPFVRLYPLH >MS0651 unknown MSLRLQFPGGIIFSALAIPGKIWYFLSVNPTETYLPSMNEEHSTKTTQDT IQKKNFFQSLFDRFFQGELKNRDELVEVIRDSEQNELIDQDTREMIEGVM EIAELRVRDIMIPRAQIVFIQTDQDLESCLDTIISSAHSRFPVVGNEKDN VAGILHAKDLLKFLRTDAEEFQLESLLRPVVIVPESKRVDRMLKEFRSER FHMAIVVDEFGAVSGLVTIEDILEQIVGDIEDEFDEEDIADIRQLSRHTY AVRALTDIDDFNQQFGTHFEDEEVDTIGGVIMQAFGYLPKRGEEITLENI HFKVTSADSRRIIQLRVTVTDEQLAEIEKSAEEKEE >MS0372 unknown MIKGIQITKAANDNLLNSFWLLDSDKGEARCLAAKAEFAEDQIVAINELG QIEYRELAVDVAPTIKVEGGQHLNVNVLRRETLEDAVNNPDKYPQLTIRV SGYAVRFNSLTPEQQRDVITRTFTESL >MS1185 unknown MNTPFFISWRYQRTKQKNRLVSLIALFSSIGIALGVAVLILGLSAMNGFE RELNNRILAVVPHSDITAYQDGRINDRQDLERRLMANSDIKAVSPYVSFT ALVENGAKLKVVQVHGVDHKMLDNVSSLGKFVLNNGWQQFAERGGLVLGS GIARDLDVSEGDWVTLLISQNTDGDQLSQPARERVQVTGILRLDGQLDHS YALIPLATAQEFLGFAKNEISGIEMKVADPFKVQQLNFANLNDYPQMLNL QTWINKFGYMYRDIQLIRTVMYIAMVLVIGVACFNIVSTLIMAVKDKAGD IAIMRTLGANNGFIKRIFIWYGLQAGMKGCLIGIILGVILSLNLTSIIKA VESLLGHKLLSDGIYFVDFLPSELHWQDVLLVLVAALMLSLLASLYPANR AAKLQPAQVLSGH >MS1986 unknown MFIEPNKKPKIYTALFTQAYYHIFIRFILNNL >MS1493 unknown MNINVISIFKLLLLFALGLVILSPALSTQIGVPRLDSALCFLFFFLAVIT PFLRDMETDFFKLQFPVYVLFFFGFLSVLNAFSTEKLVDLFFFGIVMFLF HYSFLTFNRGDGEAGIRHLLLGISLIVLAGFFIEALLGFQLVSGNEELTV TDKAFKGFFFNTNDQSVIMISLAVAVGFFYIIRENNWKIKLIGYALIFIM GLAIVISASRSVLLSYLIMLMLILFLNASAYFKAVYLFFACVIALFIFNL SWLQEVFILLAKIDWLERPIERFSLVIFSMGDDKSVGYRTEIYTTFLDNF KILWLGYGPRDYIQYFDQIKLSFPLGYTNPHSFFIELYLAFGIFAFLAFI YFLLNSIIYVMNTRLLAWKERIFILFVFINFCWIVWVPSSILRLPLVWYP LFLVLVYTVLVKNGTFVSPKLVGRRRSS >MS1176 unknown MQAGNPVLCNTTLFVKDPKPNSVLLRNLMMSFIFSHNK >MS0178 unknown MAVKKCGKFPYIFAPTPVNGGDDAPVQLVGMV >MS1706 unknown MKIKSKSAVKIFDFLTALFLTTKNGTLGVPFLLYKYYKLVLIGKTFDSCA >MS2167 unknown 
MNVPILYFDVVHSIQIHDWIIEKSGGLAGLYPDGTGKLESVLEHIQNDLY YPNFEDKLVHLIYSINKLHAFLDGNKRSSIVLGSYFLELNGYDYCVKEFT IKMENIVVWLAESKISKELLLKLVCSILNNEEQYSDELKYELICATSDDF GN >MS0436 unknown MKANNNPVRRSFLLFNNLSDNLCGHLKDSFKQKI >MS0732 unknown MINLKIDGFDVRVDEGTTILEAAKSVGINIPTLCYLKDVSDIGSCRVCVV EVEGFEKLPTSCNTLAQEGMVIRTQTDKVVKSRRMALDLILSHHNLICFS CPSNGACELQNVAHQCGISESSFPNFRLPGIEVPHVEDNPFLGYRPDLCI HCQRCINTCANVSGCSSIKLASRGIFRAIETPFGKDWKETTCESCGNCAE ACPTGAIYKKEAKSYRSWEIQRVRTTCPHCAVGCQYDLLVKDNKLVGAEG VDGPSNGGRLCVKGRFGSYKFVMSGDRLTDPLIKDRATGKFRKASWDEAL DLVASKFMTLKRQYGGDSLAGFACSRSPNEDIYMVQKMVRTCFGTNNTDN CARVCHSASVEGLARTLGSGAMTNPIYDITHDVDAILLVGSNPEEAHPVI GMQIREAVRNGTKLIVVDPRDIGLTKQADIHLKLRPGTNIAFANGMCHIF IKEGLIDEKFIAEHTEGFKELKKIVKDYTPEYVAEICGIDADDLRAAARI YATAKKAPIIYCLGVTEHSTGTEGVMSMSNMAMMVGKIGREGCGVNPLRG QNNVQGACDMGASPNQYPGYQSVKDPEIRAKFEKAWGVKLPAHIGLHATD VFPAAIKGKIKGLYICGEDPVVTDPDTNHVINALKSLDFLVVQELFMTET ALLADVVLPGRSYAEKDGTFSNTERRVQRVRKAITLPGNSRLDTDIICEL MRRMGYNQPNLTASEIFDEMASVTPSFRGISYERLEKEPTQSLQWPCTDQ YHPGTPIMHVGKFARGLGLFYPTVYTPAKELPDAQYPMMLTTGRILYHYN TRAMTGRTEGLMEIAGHSFIEINSADAKRLNIENGERVRVTSRRGTITTE ARVSDKTNEGETWMPFHFADGNCNWLTNAALDQFARIPEYKVCACRIEKL PEDEAFNMKGKYITQKMVAAQWRKKMDKSIAKLVR >MS1948 unknown MGDIYEFTKTITNPAIYFHYGVFCHAVGIFYLFAAIICFGLGSCLFGTNA RACLLAPSDFYSEKPIVMANILVKFDNGENRGFIFGSSRLVLSFLTMLVM NLIPHLFLI >MS1356 unknown MPSDGMYKIIRRYFFNKRNKCYFFVKKARQMQGKFLFS >MS0841 unknown MFDRKREETTALFYYHLGLVIKGEKRGTSEGVERSALTVVPIV >MS1680 unknown MMKTVTIDLQIASEDQSNLPTLEQFTLWATNAVRAEHFEPEITIRIVDEA ESHELNFTYRGKDRPTNVLSFPFECPEEVELPLLGDLVICRQVVEREAQE QGKPLTAHWAHMVVHGSLHLLGYDHIEDDEAVEMESLETEIMTGLGFEDP YSYDEE >MS0280 unknown MFRLITPIALFFFGLFVSIYSYQSYGDFAEYGAAFYPTAVGVLVSFFSLV DFIMELRIKDKYVFQHFDFFQDGKIILLIIAIISFYIFVADYLGFIITTS LILIFLTLPFLEKYKLLTALLLIILSIGIYLLFARVLLVGLPSGIIFE >MS1460 unknown MVARRAHNPKVVGSNPAPATKFKRKALILLMPFCYLPYIEKGDKKNRP >MS0322 unknown MSAIEQTAEGLRLRIFLQPKASRDKIIGIHDDELKIAITAPPVDGAANAH LLKYLSKAFKVPKSAIILEKGELNRHKQLFIPEPKLIPEELQPLL >MS0398 unknown 
MSAGISLRIILLKIVSSAILCFLIFGKNLPHFLSEISLNSNLSRKPKE >MS0421 unknown MRNSSMMNLLNRRNSVKNNRTLAYFYSKCGQKFMIFYKLISSLS >MS0475 unknown MNRRVQGPTTHNGISGSSNHLLLCKPFLSFANKFELKFDKLAKN >MS0434 abc, Abc protein MIKLKNISKIFDVAGKKLNALDNVSLDIPKGDICGVIGASGAGKSTLIRC VNLLERPTSGSVFVDGQDLTQLSEAQLIAERRNIGMIFQHFNLLSSRTVY ENVALPLTLEHMAKEKIHEKVTALLALVGLTDKKDVYPANLSGGQKQRVA IARALASDPKVLLCDEATSALDPATTQSILKLLKEINRTLGITILLITHE MDVVKNICDQVAVIDKGQLIEQGSVSEIFSNPKTELAQEFIRSTFQANLP EEYLAKLTDTPKRSDSYPIIRFEFTGRSVDAPLLSQTSRKFNVSFNILVS QIDYAGGTKFGFTIAEVEGDEDSITQAKIYLMESNVRVEVLGYVD >MS1552 abgB, AbgB protein MELTQQQLVQWRREFHRFPETGWAEFWTTSRIADYLEQMGFEILLGNQII NRDFVRGRQQAVVEKGLANAVAYGAKQKWLEKMDGYTGCVAVLDSGKPGK TLALRFDIDCVNVMETKAPEHIPNKEDFASLNDGFMHACGHDGHITIGLG TALWLSQNKDKLSGKVKIVFQPAEEGVRGAAAIAASGVIDDADYFSASHI GFCADSGTVISNPKNFLSTTKIDIRYQGKPAHAGAAPHLGRNALLAAAHA VTQLHGISRHGEGMTRINVGVLKAGEGRNVIPSKAEIQLEVRGENKAVNQ YMVDQVMRIANGIAVSFDVEYETEIMGEAVDMINDTELVGLVEEIVLAHP KVHSANANYAFNASEDATVLGRRVQEQGGKAIYFVLGADRTAGHHEAEFD FDEDQLMNGVNIYTALVQRLLG >MS0768 accA, AccA protein MRKKMSQQNQTEYLDFELPIAELEAKIESLRSVTDQDSKIDLDDEIKRLQ KKTAELTKKTFADLDAWQVSRMARHPNRPYTLDYISRIFTEFEELAGDRA FADDKAIVGGLARLDGRPVMVIGHQKGRSVKEKVLRNFGMPAPEGYRKAL RLMQMAERFRLPIITFIDTPGAYPGVGAEERGQSEAIARNLREMSTLTVP VICTVIGEGGSGGALAIGVGDKVNMLQYSTYSVISPEGCASILWKSAEKA STAAEVMGLTASRLKELELIDNIVTEPLGGAHRQYDEMAQALKQRILSDL EDLDILDKETLLDRRYQRLMNYGYV >MS1789 accB, AccB protein MFQYCKVRLLFFIFSNKTDQNIMAMDIRKIKKLIELVEESGIMELEISEG EESVRISRGAAAPSAVQYTLPAAAPAPVAAPHAPVAAPVAAPDAVAELSG HIIRSPMVGTFYRSPSPEAKAFVEVGQTVKMGDALCIVEAMKMMNRIEAD KAGVVTAILVNDGDAVEFDEPLIVIE >MS1788 accC, AccC protein MGLSHLFTFVDQISNWNTLMLEKVVIANRGEIALRILRACKELGIKTVAV HSTADRDLKHVLLADETVCIGPAPSVKSYLNVPAIIAAAEVTGADAIHPG YGFLSENADFAEQVEVSGFTFIGPTADVIRLMGDKVSAINAMKKAGVPCV PGSDGPLGTDMVKNKQIANRIGYPVIIKASGGGGGRGMRVVRNDESLEES IAMTKAEAKAAFNNDMVYMEKYLENPRHVEIQVIADTHGNAVYLAERDCS MQRRHQKVLEEAPAPGITEEIRRDIGQRCANACIEIGYRGAGTFEFLYED GKFYFIEMNTRVQVEHPVTEMITGVDIVKEQLRVASGLPLSVKQEDIKVH 
GHAIECRINAEDPKTFLPSPGKIAHLHSPGGLGVRWDSHVYAGYTVPPHY DSMIAKLIVHADTREGAIRRMQNALAETIIDGIKTNIPLQNLILEDENFQ KGGTNIHYLEKKLGMGE >MS1174 accD, AccD protein MWLNKQNFYDLIKRLKMSWIDKIFSKSPISSSRKANVPEGVWTKCTSCEQ VLYRDELKRHLEVCPKCGHHMRIDARERLLALLDKDGVTELAADLEPKDI LKFRDLKKYKDRLTAAQKDTGEKDALVVLSGTLYGLPIVAAASNFGFMGG SMGSVVGAKFVAAAEEAMEKNCPFVCFSASGGARMQEALFSLMQMAKTSA VLAKMKEKGVPFISVLTDPTLGGVSASFAMLGDINIAEPKALIGFAGPRV IEQTVREKLPEGFQRAEFLLEHGAIDMIVKRSDMRDTLASLLTKLMNKPS PFNAEELSDTE >MS1336 aceE, AceE protein MSQMINDVDPIETSDWLLAIDSIIREEGVERAQFIIEELMQHARSKSVAL PTGATTEYVNTIPPSEQPPYPGNLSIERRVRSAIRWNALMMVLRAQKKDL ELGGHISTYQSAASIYEVCFNHFFKAATEKNGGDLVFFQGHAAPGIYARA FVEGRISQEQMDNFRQEAKANGLSSYPHPKLMPDFWQFSTVSMGLGPVNA IYNARFLKYLNNRGLKDTTDQTVYAFLGDGEMDEIESKGALTLAAREGLD NLIFVISCNLQRLDGPVNGNGKIVQELEGLFFGAGWEVIKVMWATGWDKL FAKDTSGKLTKLMMEVVDGDYLTFKSKNGAYIREHFFGRYPETAALVADM TDDEIWALRRGGHDTEKMFAALARAKKSDKPVVILAQMVKGYKIPEAESK NTAHQTKKMSHASLKSFRNHFDLPLTDEQIDNYEYITFAPDSEESKYLHE RRAALNGYVPARLPKFTTEFKVPALEDFSQLLEEQPRAISTTMAFVRVLN TLLKNKDIGKQIVPIIADEARTFGMEGLFRQVGIYNPHGQNYVPSDKELV AYYREAKDGQVLQEGINELGATASWLAAATSYSVNNLPMIPFFIYYSMFG FQRVGDMMWAAGDQLARGFMIGGTSGRTTLNGEGLQHEDGHSHIQAGVIP NCVSYDPAFAFEVAVIMQDGINRMYGEKQEDVFYYITTLNETYDQPAMPA GVEDGIRKGIYKFETVGKGEAAIQLMGSGAILRHVRQAAQILADDYGIAS DVFSVPSFTEVAREGADVARWNLLHPTETQRVPYIAQVMSDKPAVAATDY MKLYAEQVRAFIPAQSYHVLGTDGFGRSDSRENLREHFEVDAHYVVVAAL NELAKQGKLEKQVVADAIAKFGLDVDRINPLYA >MS1354 aceF, AceF protein MSNFDIITPDLPESVADATVVKWHKAVGDKVRRDEVLVEIETDKVVLEVP ALNDGIIESIIEPEGATVVSKQLLGKAALLPVGEVTVRAETPTVAPQIED SAVASSADTLGPAARRLIAEHDLNVNEIKGSGVSGRITREDVEAVIAQKA ASVAAKSAVENTVISSPAAVRTEKRVPMTRLRKRVAERLLEVKNSTAMLT TFNEVDMQPIMQLRKKYAEKFEKQHDTRLGFMSFYVKAVVEALKRYPVIN ASIDGDDIVYHNYFDISIAVSTPRGLVTPVIRNCDKLSMAEIERQIKALA EKGRDGKLTVDDLTGGNFTITNGGVFGSLMSTPIINPPQAAILGMHAIKD RPVAIDGQVAIRPMMYLALSYDHRLIDGKDSVGFLVTVKELLEDPTRLLL EI >MS1335 aceF, AceF protein MPTLRRMINMSKQIQIPDIGADEVTVTEVMVKVGDTVTEEQSIINVEGDK ASMEVPSPEAGVVKEILVKVGDKVTTGSPMFVLESADSAPASAPQAAAVA 
PAAAPTTSAVIEIHVPDIGSDEVNVTEIMVKVGDSVAEEQSIINVEGDKA SMEVPAPQAGVVKEILIKEGDKVSTGSLIMKFEVAGGAPAAETPATTVQA APAVSAVQDVNVPDIGGDEVNVTEIMVKAGDSVAEEQSLITVEGDKASME VPAPFAGVVKEILVKSGDKVSTGSLIMRFEVAGSAPAVQAAAPAQAAPAP VAPAPQAAPAQSLAPVNQDSIATSASYAHATPVVRRLAREFGVNLDKVKG TGRKGRILKEDVQEYVKNALKALESGATASTGAASGAGLGLLPWPKVDFS KFGEVEEIELTRINKISGANLHRNWVMIPHVTHFDRADITDLEAFRKEQN VLAEKQKLGVKITPVVFIMKAAAKALEAYPRFNSSISEDGQRLTLKKYVN IGVAVDTPNGLVVPVFKDVNKKGIIELSRELMEVSKKARDGKLTASDMQG GCFTISSIGGLGTTHFAPIVNAPEVAILGVSKSEMAPVWNGKEFMPRLML PLSLSFDHRVIDGADGARFISYINGVLSDLRRLVM >MS0999 ackA, ackA protein MCERMSFNVNPKDRLFMSQKLVLILNCGSSSLKFSILDPKTGEEKLSGLA EAFYLDDARIKWKLHGEKGNAELGKGAAHSEALNFIVNNIFPLDPTLKDG IVAIGHRIVHGGEKFTSSVIVTDEVVKGIEDAIQFAPLHNPAHLIGIKEA FKIFPHLKDKNVVVFDTAFHQTMPEEAYLYALPYSLYKEHGVRRYGAHGT SHYFVSREAAKRLGVAEDKVNVITCHLGNGGSVSAVRHGQCIDTSMGLTP LEGLVMGTRCGDIDPAIMFYMHDTLGMSVEEINTTLTKKSGLLGLTEVTS DCRFAEDNYDNEDESLRVPAKRAMDVYCYRLAKYIGSYMAVIGERLDAIV FTGGIGENSAHVREITLNHLKLFGYQLDQEKNLAARFGNEGIITADNTPI AMVIPTNEELVIAQDTARLCIKD >MS2369 acnB, AcnB protein MANFLQEYQQQVDERAKEGVVAKPLNADQTAQLIELLKNPPQDKAEFLLD LFKNRIPAGVDEAAYVKASFLSAVTKGDVACPLISAKSAVEILGKMQGGY NIEPLLSALDNPELAPAAAKELSGILLMFDNFHDVRERAEQGNPYAKQVL QSWANAEWFTNRPKLAEKITVTVFKVSGETNTDDLSPAQDAWSRPDIPLH ANAMLKMPREGIIPDQPTLVGPIKQLESLKQKGFPLAYVGDVVGTGSSRK SATNSVLWFMGEDIPYIPNKRAGGIVLGGKIAPIFFNTLEDAGALPIEVD VSALNMGDVIDIYPYAGKICVHNTDQVLAEFSLKTDVLLDEVQAGGRIPL IIGRGLTHKARLALGLNESEIFKKPQAVQASEKGYTLAQKMVGRACGVEG IRPGQYCEPRMTSVGSQDTTGPMTRDELKDLACLGFSADLVMQSFCHTAA YPKPIDVVTHHTLPDFIMNRGGVSLRPGDGVIHSWLNRMLLPDTVGTGGD SHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESVLVRFTGNMQPGITLR DLVHAIPYYAIQQGLLTVEKKGKKNIFSGRILEIEGLENLKIEQAFELSD ASAERSAAACSIKLNKEPIIEYLNSNIALLKWMIAEGYGDARTLERRIKA MQTWLDDPQLLEADKDAEYAAVIEINLDEIKEPIVCAPNDPDDARLLSDV QGDKIDEVFIGSCMTNIGHFRAAGKLLNKFKGMIPTRLWVAPPTKMDAAQ LTEEGYYSIYGKSGARIEVPGCSLCMGNQARVADNATVVSTSTRNFPNRL GQGANVYLASAELAAVAALLGKLPTPEEYLSYTVDLQQDKDDTYRYMNFD KIENYMKKADKVIFRQAV >MS1875 acpP, AcpP protein MSIEERVKKIIVDQLGVKEEEVKSEASFIEDLGADSLDTVELVMALEEEF 
DIEIPDEEAEKITTVQSAIDYVQNNQ >MS2089 acrA, AcrA protein MKKKYLILLLALALTACGEAQTEVAETTSRMKVNVVQVQPTQLNYRLLLS GSIQAKDDVSVGTSLQGLQVLDVKAEVGDWVEQGQVLATLEQSQVQSQFR QNDALLQRAKANLVSQQSTLKEAEATLKRYQQLIKTDAVSHQELDQQRAK AESARAAIQAAKAEIAQVQAQLDDSRHQRKKAEVLAPTSGIVTQRLAQAG NLTDSNALFHIARDGVLEAVVRASADEISVLETGLVANVQMLDKVTSGLI RLISSQIDSATHTAKIHIALQEKLQVPFGTPINAVVQLPEMTAQIAVPFS AVNFGADGNHFVMVVNADGTVVRRKITLGEVS >MS1128 acrA, AcrA protein MKKKVLAIAVLAALIAGGAYYFMNGSKKAPTYLTEDVQRSNVEKTVVASG SIESSNEVDVGAQVSGKVVKLYVTLGQEVKKGDKIADIDSTTQINSLNTA KAALASYQAQLKAKQTGYNVALSSYNRLSKLYTQQSTSLDNLNSAKNTLD AAKAEVDALKESIKQAEIQVNTAETNVDYTKITSPIDGTVISTPVSEGQT VNANQTTPTIVTVANLDKMLIKPEISEGDITKVKAGQQVTFTILSDSTTT YDAVIDSVDPATTTTTDASATSSASSSSSSSSSTSAVYYYANMAVDNPNR VLRIGMTTENTIKIARAENVLTVSNMALKKQDNKYYVNVLNAQNQPERRE VQVGVQDDFHTEIKSGLTESDKVILSQIEDGEKVGNLGRGPRMF >MS1301 acrA, AcrA protein MKAKYVVAILAAVAVAGGTIWYNAQLKAEKLAGIAAVNGRLELKRLDIAT LYAGRVEEMYVQEGDEVQPGQNLARLSSSISQTQVDAANAQKQRAQEAVT RAVAQIDSQQQQLKVAKLELDNAQKLRRDNLVSASELERRQANYRAAVAA VNTAKAAKAEADAAVNQAQAQLEQALSQNSDMLIKAPKAGWVEYQIAEVG NVLGIGGRVVSLLDPTDTYINVFLTSAQSNQVKVGEEARIVVDGMNAVFP AKITYVAADAQFTPKSVETTEERAKLMFKVKLQIPAEIALQYNKLLKGGM TALGYVKYGQEALWPENLTVKLPQGE >MS0454 acrA, AcrA protein MTDSQLAKPKRSHAFLLKVGLAVAVLVFALVIGLNKFKEIMIGKAIANMP ETANPVTALTVGSSEWTPVIETTGLVRPNQGAMLSSQASGTIKRIYVKSG QAVKKGDVLVELDNAVEEATLKASEAQLPSVRLTYQRYANLIKSQSVSQT ELDSAKAAYDQLVANINSLKASIERRKILAPFDGITGIVQVNEGQYISAA TEIVRVEDISSMKVDFSVSQNQLEDLHIGQKVTATSDARTGETFAAKVTA IEPAVNKSTGLIDVQATFAPEDGKKLLSGMFTRLRLALPTERNQIVVPQV AITYNMYGELAYVLMPLSDEDKEKLKDNENLSKMYRAQQITVFTKDRQGI YAQLKGNEVKVGDILVTGGQQRLSNGSLVVISDKDGVGTVQPAEKTNL >MS2087 acrB, AcrB protein MNFRISAWAIRNPIPIIVLFLLLTIMGIRSFQALPINADPNISFPAVNIT ISQTGASPDELENSVTRRVEDAVAGMAGVRHITSSITEGTSTTSVEFRLE TDTDRAVNDVRNAITQIRGDLPQNIDNPIVERMDTEGAALGYYAVQSPNM NQTELAWFIDDAVSCELLAVNGVQQVKRLGGEKREIRVALQSTKLNALEI TAEQVSQQLAQTNANVPAGRVEWFNQEQSVRVIGSQINLDDLANLPIALS DNRKVKLSELATITDSHAEMRSRTRLNGREVLGFQVFRSKGSSDTVVESG IQQALKKLIETYPDIHLTEVHNSVDTTRENYDVAISTLLEGAALTVLVVW 
LFLRNWRATLVAAIALPLSILPAFWIMKLLGYTLNSISLLAITLVIGILV DDAIVEIENIETHMQQGKRPFQAALDASDAIGLAVVAITASIVAVFLPVS FIDGMTGQYFGQFGTTVAAAVLSSLMVARLAIPLLAAYLLKPHISKHHTA QHVGRLKKSYLSLLAKALQFRKTTLLMGGGLLLMSAMIIPQLPTGFVPKG DTGMSQIDITLPPSSPLAQTDDMLQQLDRIIREFNEVDLVFTTAGSSEIN KGEVLIKLKPYKERSVSQKEFEDKLRDELVKFADIRANFRNEMAGRDVSI LLTGNDPVKLDQTAAELKKQMQEIKSIENVQINAPLVKPELQVKLRKNEA AQAGISSQAVGNLLQIATLGTTDGNAARFNLPDRQIPIRVTLSENERNQP EVLQHLRVASSNGGTVILNTIADIQFGAGSASLERFDRERRIAVEADLAV GQTIGTALSQINELPIMQKLPDGVRVPSAGDAEYMDEMFSQFGFAMATGV AMVLLVLILLFKDFLQPFTILTALPLSIGGAALGLLLYGAALDMSSVIGI LMLMGIVTKNSILLVDFVIEKRQQGVERTTALIQSGAERVRPIIMTTIAM VAGMIPAVFAGGASAAFRAPMAIAVICGLTASTLLSLVFVPVVYSLMDDM RNYLAPKLAKLTSVTEEDRVV >MS0456 acrB, AcrB protein MKFTDIFIKRPVMAIAISMLIVILGLQAISKLAVREYPKMTTTVITVSTT YAGADAGLIQAFVTSKLEEAIAQADNIDYLSSTSAPSSSTITVKMKLNTD PASALADVLSKVQSVRSELPSGIEDPTLTSSTGGSGIMYISFRSDKLHPS QVTDYIERVVKPQLFTVEGVAKVQVYGAAEYAMRIWLDPQKLAGQNLSAT QVTTALSNNNVQTAAGSDKGYFNIYRNKVETTTNTVEDLGNLIVYSDGDK LVRLRDVADVELNKESDDTRAAANGSDAVVLSIEPTSSANPLTVADNIKP LYETIKKNLPDSIESNILYDRTVAINSSINDVIHTIIEAVVIVLVVIMMF IGSLRAIFIPIVTIPISLIGVIFMLQMFDFSINLMTLLALILAIGLVVDD AIVVLENVDRHIKEGETPFRAAIIGTREIAVPVISMTIALVAVYSPMALM GGITGTLFKEFALTLAGAVFISGIIALTLSPMMSAKILKHESSKFEEKVN RTLSKLTTGYTYILGLVMQARKAILLFAVIIFATLPILFSSLSSELTPAE DKGGFLGMVTAPSNVNVDYVQQATKPYEEILNNTPEKQYSQVIAGAPNTN QALVITTLKDWAERSRSQAEVMAELTKKAAAIPEVSISAFAFPEIETGEQ GPPVVFVLSSPGSTKELAQTAETFLDKIRKSGKFVYSNLTLKYDVAQMRI QVDKEKAGTYGITMQQIASTLGSYLSEATITRVDIDGRAYKVISQVKREN RLSPESLKNYYISASNGQSVPLSSLLTVELEPQPYSLPRFSQLNSAEIQL VPSPTTTTGDAIAWLKDAAQDLPQGYSYDWKGEARQLVQEGNSLATTFIL AVLIIFLVLAIQFESIRDPFVILVSVPLAISGALLTLNLLSFLGVTGVTL NIYSEVGLITLVGLITKHGILMCEVAKEEQLNHGKTKMEAIMTAAQLRLR PILMTTAAMIAGLIPLLYASGAGAVMRFSMGVVVVAGLAIGTLFTLFVLP VIYTYIGSNHKPLPEFDENAPRIGSSH >MS2295 acrR, AcrR protein MEQKLSPKQKGRPRTFDREKALESALFVFWNQGYTNTSIADLCNAININP PSLYAAFGNKSQFFIEILDYYRRVYWDVIYAKMDVEKDIHRAIHIFFRDS VNVVTVANTPGGCLSAVATLNLSAEETKIQQNMRQLKSDILKRFENRLKR AIVDKQLPSQTDIPALALALQTYLYGIAIQAQAGTSKDDLLKVASKAGLL LPKLI 
>MS1936 acrR, AcrR protein MAEQLTLDSIEPEPEKQSAKIEKRSIKERRQQVLTVLTHLLHSEKGMERM TTARLAKEVGVSEAALYRYFPSKTKMFEALIENIESSLFSRISYSIKMET NTLNRVHDILQMIFDFARKNPGLTRVLTGHALMFEEAKLQARVALFFDRL ELQFVNILQMRKLREGKTFPIDERTIATYLVTFCEGQFMRLVRTNFRHMP NQGFEQQWRFIEPLFE >MS0153 acrR, AcrR protein MINMAGVRAIQKEKTRRALIDAAFNQLNAEKSFSNLSLREVAREAGIAPT SFYRHFKDMDELGLTMVDEAGLTLRQLMRQARKRIEKGGSVIVISVETFF EFIAHSPNVFRLLLRESSGTSQAFRTAAAREIKHFVDELAEYLANKNNYS EYVAYVQSEGMVTIVFTAGANALDMNNKERELLKERLILQLRMLAKGAHH HMMERERHNTHLPATGKS >MS0453 acrR, AcrR protein MRQSETDMAEQIFAATERLMAKDGLHHLSMHKIAKEARISAGTIYIYFKS KEELLEQFAWRVFSLFQTALEKDYDETLSYFEQYKKMWLNVWYFLQDNPN IVMNMQQYQSLPGFFDICKEMDYNSRWATFCQKAQQAGAVCELSVSILFS LSMESAMNLAFKKLYINEFLADEELMTIIERTWRSIQK >MS2211 acrR, AcrR protein MKKNLNFVVKESITEALLRLMAKKNFDEINITAITELAGVSRISFYRNFD SKEDVLIKYMYVRAKELYKPFESQDVSVRDKLIGMFKSIEGMEDIINLLY AQNLSHIFLQYFNFVRGAKPEQENLDAYQNSIVVGVCFGALDEWIKRGRQ ETPEQMVDLLQNVIWGFVKE >MS1300 acrR, AcrR protein MKQDIRITKTLGLIRHVFLELLEEKGFEHIVVQDILDRAQINRSTFYKHF QNKHAVALMLVDEIKQLLTENFENRFSIPTTEFAQKMVPIFWQHRDLIHL IGKIENPRIHLYKDLALVIKEEYIKQAVREQPQSSEELDFQGYLFAIVSL GTIRYFVEKGELPDPSVIVGDIESVFNLLIIK >MS0845 acyP, AcyP protein METYMLKKQFVVYGIVQGVGFRYFTWKKATEIGLNGIVKNQRDGSVYILA EGSASQIDSFRDWLSHGPPSARVDRVEENDYSGTHSFGLFSVEH >MS0637 ada, Ada protein MDSIYYSYYSSPVGNLLMIAQQGKLTNLDCELEQTAPNPKWILNNELPLF RQVKSALDRYFSGEKEDFSDIPLNPQGTTFQQSIWQALRRIQLGKTTSYG ELARLINNPKAVRAVGGAVGSNPISIIIPCHRVLGKNGQLTGFGGGLPMK RFLLNLEKIRYVDKGVEYVKQKLLKKYTA >MS1386 adhC, AdhC protein MSNTIKSRAAVAFAPNEPLKMVEIDVERPKKGEVLVKITHTGVCHTDAFT LSGADPEGVFPVVLGHEGAGVVVEVGEGVTSVAVGDHVIPLYTAECRECE FCKSGKSNLCVSVRETQGKGLMPDGTTRFSYNGQPIFHYMGCSTFSEYTV VADVSLAKINPQANPEEVCLLGCGVTTGIGAVHNTAKVQEGDSVAVFGLG GIGLAVIQGAKQAKAGRIIAIDTNPAKFELAREFGATECLNPNDFDKPIQ QVIIEMTKWGVDHTFECIGNVNVMRAALESAHRGWGQSIIIGVAGAGQEI STRPFQLVTGRTWKGSAFGGVKGRTQLPGMVEDAMKGIIRLRPFVTHTMP IERINEAFDLMHEGKSIRTVVHY >MS0796 adk, Adk protein MEISMKIILLGAPGAGKGTQAQFIMNKFGIPQISTGDMFRAAIKAGTELG KQAKALMDEGKLVPDELTVALVKDRIAQPDCANGFLLDGFPRTIPQADAL 
KDSGVNIDYVLEFDVPDEVIVERMSGRRVHQASGRSYHIVYNPPKVEGKD DVTGEDLIIRADDKPETVLDRLAVYHKQTQPLVDYYQAEANAGNTKYFRL DGTKKVEEVSAELNSILG >MS2161 aes, Aes protein MKILLKLTALLLSLGVALNVAADGGKNHPSYPLLASEFRDLSLLESVRFT PEQLQDKAKLTELNAAFLQAAEQSEVQPNEKITAPAQGAQPAVDLYIYRP ATAKNEKLPVIYFMHGGGYLFGNARQNNAALAELADLNKAVVISVEYRLA SQTPYPADIDDAYHGLAYLFKNGQKLNADTNKVVIMGESAGGGLAARLAL KVRDKGEFKLAGQVLIYPMLDYRTGTSQSLYNTPYTGGYVWTAEYNRIGW ETLRGGQTIAQAEMPYYSAATATDLAGLPPTYMMVGSLDLFANEDMDYAN RLVQAGVPTDLQLVSGVYHAFEIFNPNASQTLAYKLARTNAIQQMLAK >MS1419 aes, Aes protein MNFAKNLQKSTALFNVCTGNANMKKLLLAPLLLAFCLPSLAESYRTLDQV SPAYQEAAKMLKMDFADPNVRENAQKQNIQRANESYQPTAHWTVPAQGSQ PAVELYVYKPKSVAGKLPVIYYIHGGGYILGNAKAAGDNLQAIAEANKAA VISVEYRLATVAPFPADLNDAYHGLSYVYKNAGKLGLDKEKVVLMGESAG GGLAARLALFTRDKGEFTPEGQVLIYPMLDYRTGTPESPYDTKNLGEFLW TESANRLGWATLRGNQTISDEQLPYFSPAFAKKLSGLPRTYMMVGDLDLF VAEDLNYASRLIQAAVPTELQVFPGLFHAFEAFNKDGKQTKEYEQSRNQA IQEMFSHPVK >MS0155 ahp1, AHP1 protein MSNMEGKKVPQVTFHTRQGDAWVDVTSAQLFDNKTVVVFSLPGAYTPTCS SSHLPRYNELTPEFKKLGVDDVICVSVNDTFVMNAWKCDEDADNITVLPD GNGEFTEGMGMLVDKEELGFGKRSWRYSMLVKNGVIEKMFIEPNEPGDPF KVSDADTMIKFIKPDWEPKPSVALFTKPGCPFCAKAKALLTEKGYPFEEI VLGKDATVTSVRAMSGRATFPQVFIGGKHIGGSDDLEAYFANK >MS1521 ahpC, AhpC protein MVLVTRQAPDFTSAAVLGNGEIVDNFNFKQHIAGKPAVIFFYPLDFTFVC PSELIAFDHRYEEFKKRGVEVVGVSIDSQFTHNAWRNTAVDQGGIGQVQY ALAADTKHEIAKAYGIEHPEAGVALRASFLIDANGVVRHQVVNDLPLGRN IDEMLRMVDALQFHEQHGEVCPAQWEKGKEGMKDSPEGVAKYLKQNADKL >MS0348 alaS, AlaS protein MKTTAQIRQSYLDFFHSKGHQVVESSSLVPHNDPTLLFTNAGMNQFKDVF LGMDKRPYTRATTAQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGD YFKHDAIAYGWEFLTSPQWLGLPKEKLYVTVYETDDEAYDIWNKEVGVPA DHIIRIGDNKGAPYASDNFWAMGDTGPCGPCTEIFYDHGEHIWGGLPGTP EEDGDRYIEIWNIVFMQFNRHADGTMEKLPKPSVDTGMGLERIAAVLQHV NSNYDIDIFQTLIKKVAQLTGEKDLTNKSLRVIADHIRSCAYLIADGVMP SNEGRGYVLRRIIRRAVRHGHLLGAKETFFYKLVPTLADVMEHAGEIVNQ KRALIEKTLKAEEEQFARTLERGLLLLDDALSQVKDNVLSGDVAFKLYDT YGFPLDLTADVCRERNITIDEKGFEREMQAQRARAQASSNFGVDYNNVIK VDGQTEFKGYETTSLSSAKVVALFTDGKSVERVQSGENAVVILDRTPFYG ESGGQIGDTGYIATDLAAFRINDTQKYGQVTGHIGQLESGSLSVGDTVSA 
QVDTERRLAVAANHSATHLLHAALRKVLGDHVAQKGSLVSESALRFDFIQ PEAISKEQIIEIEAIVNRQIRENISVTTEVMDIEAAKQKGAMALFGEKYG DLVRVVGMTGFSIELCGGTHVKRTGDIGLFKVVSESAIAAGIRRIEAVTA ENAINWLNNQQNILNQSADLLKSDTASLVEKIQQLQDKAKKAEKELQQLK EKAAMQAGSDLAKSAVKINDISVIVQQLDGIETKSLRVMVDDLKNQLGSG VIVFASVVEDKVNLIVGVTADLTGKVKAGELVNLMAQQVGGKGGGRPDMA MAGGSQPENVGAALSACSDWLESNL >MS1182 alr, Alr protein MKPATVKISSVALKHNIQIIKQKAPHSKIIAVVKANAYGHGVEFVSSTLE NLVDGFGVARLAEALSVRSNGVTKPILLLEGFFSPKDLPILSVNNIQTVV HNQDQLDAIKRANLENPIKVWLKIDTGMHRLGVSLEEVDYYYNELMNCPN VDEVGFVSHFSRADETDSDYTNIQLNRFLDATKNKKGNRTIAASGGILFW EDSHLEYIRPGIIMYGVSPINIPSSEYGLIPVMTLTSSLIAVRDHKKGEP VGYGGIWVSERDTKIGVVAIGYGDGYPRNVPAGTPIYINGRRVPIVGRVS MDMVTVDLGPDCKDKVGDEAVLWGKELPIEEVAEITGLLSYELMTKLTPR VLTEYVD >MS0767 alsT, AlsT protein MNAKRYFGVLNDFVIMVEQGIHWLVDNVEGPLWDATIVILLGVGLFFTIT TGFVQIRLFPHSLREMWFGREVQGDSLTPFQAFATGLASRVGVGNISGVA TAIALGGPGAVFWMWLTALIGMSSAFAESSLAQLFKIKEADGSFRGGPAY YITQGIGSRWLAAAFAIALIFTFGFAFNAVQSNSIVEATRNAWLWDEHYV GMGLVLLTALIIFGGIKRIGKFSARIVPVMALVYLLIAVSILLIHYDRIP SVISLIIRSAFDFSAMAGGVFGAMLSKAMLLGIKRGLFSNEAGMGSAPNV AATADVKHPASQGLIQMLGVFVDTMVVCTCTAIIILLSDNYGGEQLQSIS LTQNALKYHMGEFGLHFLAFILLLFAFSSIIGNYAYAESNIRFIKNNPVV VNLFRAMVLFFVYFGAVNSGGIVWAFADTVMAVMAMINLVSLIILSPIVW LLLKDYHRQAKQGIVPVLDIMLHPRLLKLRLDQRLWNRR >MS0353 alsT, AlsT protein MSLETILSSIDSFIWGPPLLILLSGTGLYLTLRLGFLQIRHLPRAFAYMF KKEEGNHQRGDVSAFQALCTALSATIGTGNIVGVATAIQAGGPGAMFWMW LVALLGMSTKYAECLLAVKYRVRDKNGFMAGGPMYYIERGLGIKWLAKLF AVFGVLVAFFGIGTFPQINAITHAMNDTFSVPVTISAAIITILVAAIILG GVKRIAAVSSYIVPFMAVLYVTTSLIILLINADKVPSALALIIESAFNPE AALGGALGFTVMKAIQSGVARGIFSNESGLGSAPIAAAAAHTKEPVRQGL ISMTGTFLDTIIVCSMTGLVLVITGAWQSSDMAGAAVTNYAFSQGLGTNI GATIVTVGLLFFAFTTILGWCYYGERCFVYLVGIKGIKLYRTAFIILVAC GAFIKLDLIWILADIVNGLMAFPNLIALIGLRKVIVSETKDYFMRLKTNN YSLDDNEEQIVNS >MS1515 amiC, AmiC protein MKRFFLLFLTALFFAINPAWAAVWTIAIDPGHGGKDPGAISRNLGIYEKN VTLSIAQELKGILDRDPNFRAVLTRRADYYISVPQRSEIARKNKANYLVS IHADSSENPALKGASVWVLSNRRANDEMGQWLEDHEKQSELLGGAGSVLS NHGSEKYLNQTVLDLQFGHSQRTGYELGRSILRNFAKIADLSRTSPQHAS 
LGVLRSPDIPSVLVETGYLSNATEEAKLSSPSYRKRIAYIIYQGIVDFRK RHLGGEINTSSKIAGQMPTEKTQSANTAKAKDNFKDDKETVQDSGVRHKV KSGETIAKIAGKYNVTSEEIITLNKLKRKDLYIDELVKIPAEKTQSAKTA KAKDNFKDDNETVQDSGVRHKVKSGETIAKLARKYDVKSEDIVTLNKFKR KDLYIDELVKIPASAKNRQKNEPVTNTKNTEKAENSAKTAKSELVNGSYT VKNGDTLFSIANRFGVKQEDIIELNKLKNANIFVGKKLKIPTGAKLKEDT KQQTKTNKTTKKETQAEPKTIEKATVSYTVKSGDSISRLANKFDVKAAEI IELNNLKNKELHIGDKIKLPANAKNVVTESKKTSVNKNTAVKGTKSSTKN TTNKKTAKQDTKKK >MS0365 ampD, AmpD protein MMTVKHPIKIKNGWLSGVRKIISPHFDSRPSQADISLLVIHYISLPPEQF GGGYIEDFFQGKLNPETHPYFQTIYQIRVSAHCLIGRDGRVTQFVSFNDR AWHAGESCFQGREKCNDYAIGIELEGSNEQPFTEVQYQRLAELTNIIRHY YPKITEDRIVGHCDVAPGRKIDPGQYFEWTKYFDLLKESQ >MS2071 amyA, AmyA protein MKKSLLYLCLFTSSAAFAQGWQHTHFQHFNDNAESNLFQSQTPLGKGNYP LTFTLDNQCYQPQSAVKLNQTVSLIPCSGEAPQLRLFRQGNYIAQIDMRS GTPTLRISVEQRAENDNNTVKSCPVWNKQPIEIDVSSTFTEGESVRDFYS GQTAKVKNGKVTMMPAENAGGLILLEKSADKKTEVFDWKNATVYFVLTDR FHNGNPANDNSYGRHKDGMQEIGTFHGGDLQGLTAKLDYLQQLGVNVLWI SSPLEQMHGWVGGGNKGDFPHYGYHGYYHLDWTKLDANMGTEADLKNLMR QAHRRGIRVLFDVVMNHTGYATLADMQEFNFGDFYLKPEEMAATLGKKWT TWQPQKGQNWHSFNDFIKFGDSKAWQNWWGKDWVRADIGDYDSPKFDDLK MSLSALPDLKTESESAVKLPRFFQHKNTNAKELANAKVRDYLITWLTDWV RRYGVDGFRVDTAKHVEKPTWLALKQASQQALKEWQQKNPQESFGDDFWM TGEAWGHGVFKSDYYQNGFDAMINFDFQDQAKNALDCFARIDPVYQDMSN KLKDFNVLSYLSSHDTRLFFHSDSERNAVKQKTAANLLLLSSGAVQIYYG DESGREFGATGSDPVQGTRSDMNWKELQNDQSKQALHQHWQKLAQFRQRH RAVGAGVHQTLKSEGYFAFSRTLGEDKVMVVWAGN >MS1236 amyA, AmyA protein MNDNWWKNGVIYQIYPKSFQDTTGSGTGDIQGIIKRLDYLQTLGIDGIWI TPMYVSPQIDNGYDIADYRNIDPSYGTMADFEQLIAEAHKRDIRIVMDMV FNHSSTFHQWFKQGEDPNSEYHDYYIWREQPTNWQSKFGGNAWKWSDKAQ KYYLHLFAPEQADLNWENPKLRAELYDICRFWAEKGVDGLRLDVVNLISK PEKYEDDFEGDGRRFYTDGPKIHQYLQELNQNALKPFGLMTVGEMSSTKL EHCQRYANLDGSELSMTFNFHHLKVDYPNGEKWTYAKPDYVELKSIFNYW QKGMHGKAWNALFWCNHDQPRIVSRFGDEGELRTLSAKMLAMLLHGMQGT PYIYQGEEIGMTNPNFSSIEEYRDVESLNAYQILQNQGKSAVEILQILAQ KSRDNSRTPVQWDASPNAGFTSATPWIGVAKNYPQINVEQALADRDSVFY TYQKLIALRKQLAVLTDGDYSDLLPNHESVWLYQRSTAGERLTVAANLSN QPQFIEIKPQGQVLINNYADITQEDSGICLKPYQALYFLA >MS2050 ansB, AnsB protein 
MKLTKLALTMSLGLGVSFANAAELPNITILATGGTIAGSGATSVSSSYKA GQLTVQTLIEAVPEMKDLANITGEQVVNIGSQDMSDEVWLKLAKTINAKC NETDGFVITHGTDTMEETAYFLDMTVKCEKPVVLVGAMRPATEKSADGPL NLYNAVVVATDKKSAGRGVLVAMNDKVLGARDVTKTSTTAVETFNSPNFG SLGYIHNSKVDYERSPESKHTTATPFNVDNLTALPKVGIVYAYSNMPTEP LKALLDAGYEGIVTAGVGNGNVNQANSAILEKAAKDGVAVVRSSRVPTGY TTRNGEVDDNALGFAASGTLNPQKARVLLQLALTQTKDINKSNNILMISK SGRST >MS0548 apaH, ApaH protein MTRRDYEKIDGSAYANIYAVGDLHGCYELFMRELESVKFDTTRDLVISVG DLIDRGPHSLSCLRLIRNSWFKAVKGNHECMAIEGLLGQDEHYQRLWLYN AGDWVLSLNPTERAEVLDLLKFCAGLPLVIELNDEGFKTVIAHADYPYDQ YRFGRPLTQEQAVWERRRIEMRDETEIKGADAFIFGHTPLKRVMQLGNRL YIDTGAVFFGNLTLLRLK >MS0631 apaH, ApaH protein MATYFVGDLQGCYDELQRLLEKVRFDPTQDLLYLVGDLVARGDKSLECLR LVKSLGKSAQTVLGNHDLHLLATAFGIKKVKSRDRVDAIFHAEDFEELIH WLRHQPLLVYNAKQNWVMTHAGISPDWDINTAQACAKEVENVLQQGDYCH LLSQMYDSRPDLWSADLTGIERLRYIINVFTRMRFCYRDHRLDFDCKSPV DKAPEELTPWFNLSNPLYKQVDIIFGHWASLVDTPTPHHIYALDTGCVWN NRMTMLRWEDKQYFCQPALKDYAFNG >MS0303 apbE, ApbE protein MKLKQTFTWLSAVIMAISLAACKKDPEIITLSGKTMGTTYHIKYIDDGGL TQNAEQAQEQIESILKDVNDKMSTYIPNSELSRFNQYKEINQPVEISADL AKVIKEAVRLNKITEGALDVTVGPLVNLWGFGPEKRIDKQPSATQLEERR AWVGIEKLALTEQAGKFTLAKAVPELYIDLSSIAKGFGVDQVADYVESIG AKNYMSEIGGEIRAKGKNIEGKDWQIAIEKPNFDGSRSVQDILGLKDLAM ATSGDYRNYFEENGMRFSHEINPQTGKPIQHKLASITVLSPSTMTADGLS TGLFVLGEEKALEVAERENIPVYLIVKTESGFDVKMSSAFKNLLNSSKEG K >MS0716 appB, AppB protein MFDYESLRFIWWILIGVLLLGFVVTDGFDMGVLTLLPFAGKKEVEKRIMI NSVAPHWDGNQVWLLTAGGAMFAAWPIVYAASFSGFYIAMILVLAALFFR PVGFEYRAKIDNPAWRKAWDWGLFLGGFVPSLVFGVAFGNLLQGVPFEFN DLLQVKYTGTFFELLNPFAILCGLISLSMLITHGAAWLQMKTTSDLRDRA RAITQVGAFATLITFILAGVWLLYKDGFVLNSTVDHFAPSSPLGKTVSLE TGAWFNNYYEMPVLWIFPALVVVGALLNIASSKADRSGFAFFFSALTMLG VIFTSGIAMFPFVMPSITHPDMSLLMWDSTSSELTLSLMFGLALVFVVIM LIYTIWAYAKMFGRLDGNFIEENKNSLY >MS2117 apt, Apt protein MSEKYVVTWDMFHMHARKLAERLLPASQWKGIIAVSRGGLFPAAVLAREL GLRHVETVCIASYDHDQQGDLKVIHKAETDGEGFIVVDDLVDTGNTAREI RNMYPKAKFVTVFAKPAGAPLVDDYVIDIPQNTWIEQPWDLGIGFVPPLA RK >MS0870 apt, Apt protein MDRTLGNKMNEQLQLIKSSIKSIPNHPKEGIIFRDITSLIEVPEAFQATV 
DLIVGNYKNQGITKVVGTESRGFIFGAPVALALGLPFVLVRKPRKLPRET ISQSYQLEYGEDTLEMHVDSVKAGDNVLIIDDLLATGGTVDATIKLIKRL GGDVKHAAFVINLPELGGEERLRSLGVEPFTLVNFEGH >MS1209 ara1, ARA1 protein MQWLKYKHRCKYSLGGNKMQTFKLNNGVEIPVLGFGVFQIPPEETEQAVI SAIHAGYRHIDTAQAYMNETETGAGIRNSGVVREEIFVTSKVWIENYGYE AAKASLDRTLARLDIGYIDLMLLHQPFNDVYGAWRALEEYLAAGKIRAIG LSNFTADRVLDVGLYNKVMPAVNQIEINPFHQQQAQVEGLLSEGIVPEAW GPFAEGKFGIFENPVLAKIGQKYGKSIAQVVTRWLVQRGVVVLAKSTRPE RMAENLNVFDFELDADDFAQIAALDVGKSQIISHTDLAMVRQFKEWVFNV >MS2075 ara1, ARA1 protein MLTFVKQGLELGVDTLDHAACYGAFTSEAEFGRALALDKSLRAQLTLVTK CGILYPNEELPDIKSHHYDNSYRHIMWSAQRSIEKLQCDYLDVLLIHRLS PCADPEQIARAFDELYQTGKVRYFGVSNYTPAKFAMLQSYVNQPLITNQI EISPLHRQAFDDGTLDFLLEKRIQPMAWSPLAGGRLFNQDENSRAVQKTL LEIGETKGETRLDTLAYAWLLAHPAKIMPVMGSGKIERVKSAADALRISF TEEEWIKVYVAAQGRDIP >MS0687 ara1, ARA1 protein MKKITLKNGDKLTLLGMGTWFIGDNAHYRQEEIAALRYGIEHGINLIDTA EMYGNGRAERLIGEAIAPYDRNSLYLISKVLPNNANKRKMEQACNNSLKA LNTDYLDMYLYHWRGTTPLAETVECLEALKNKGKIKAWGVSNFDLEDMQE LLALPNGNQCQLNEVLFHLGSRGIEYALKPYQDKLAIPTVAYCPLAQAGS LQRNLLRHPEVTTIAEELNCTPYQLLLLFVLAQPNMIAIPKAGQVRHMKE NIACLDMQLTQQQLARLNNAFPSPTHRIHLDIV >MS0058 araA, AraA protein MEFLKKLEVWFVVGSQDLYGDEALKQVNANAEQITRYLNDQNPFIQIKLK PLATTPEDILSLCQAANYEENCVGVIAWMHTFSPAKMWIGGLTRLNKPLL QFHTQLNKNIPWNEIDMDYMNLHQTAHGDREFGFMVSRFRKPRTIVVGHW QSESVKQKLDRWMRVLAAIYDQQHLKVARFGDNMREVAVTEGDKVEAQIK FGYSVNGYGLYQLVNSINTVNDEDITALVKEYEASYQLADSLKDGGEKRQ SLIDSARIELGLKAFLDKGGFKAFTDTFQNLAGIKQLPGLPVQRLMAQGY GFGAEGDWKTAALVRAIKVMSYGLPNGCSFMEDYTYNLDDNNEIVLGAHM LEVCPSIANNKPILDIKPLGIGGKEDPARLIFTSKSGKATASTIVDLGNR FRMITADMQAVDKPQDMPNLPVGHAFWKLEPNFDIGTQAWILSGGAHHNV FSLDIDADMLRTFAEYFDIEFIHINVKTELPNLKNELRWNEAAYK >MS2173 araC, AraC protein MPKPLILSRKNLANLGSVIQQRKLLYTRMAVDEPTLLYIQVGQKTLRWRG QELTIQAGEMVLLAAGQTFDVLNNPDAKLGFYQAGWIALEQRVVDEFADL FGVETYVQELAKIQPLAPLKAHFDVVRQALENDEAPELVLKLKLFELLAW LKAEHLSFVPHEKHNLLRQIRKMIASNTAFEWTAETIARQLHLSETSLRR ALQKSDTTFREVLTDVRMSRALTLLQITKWQVARIANEVGYDSPSRFTVR FKQRFGFLPSDIRENLSQPVQNEQQKLVRIGVKK >MS2323 araC, AraC protein 
MTDILQLSHHSYFISEESPITVERRHYQPPFPLHRHDFNEIVIISAGNGI HFWNDEIHPITTGNVLYIESGDKHKYGEVDKLKLDNILYRPEKLSLFPIM KDYIPHNNEKKSLRINQETLVQLQSLISQLEIESKKTNKSSMHLSEAIFL QILILICRTQQQENKAYSDISKLESLFSALNQSISQEFYLADFCRQHQLA VSSVRRIFKQQTNMTIAQYLQKLRLCRAATLLRNTSESVANIAIRCGYSD SNYFSSVFGKTFSCTPTEYRSRFIKK >MS0060 araC, AraC protein MKYQREVQQETNPLLPGYQFGSYLVAGCTPIEKGNEVDFAIRRPNGMKGY IINLTTKGEGTVFEGDRAFTCCKGDLLLFPPNAEHLYYRSQSSESWHHQW IYFRPRSFWANWLQWSHISDHVGRLTITDPTTYEEILALFKKIEREYNAK DIFSEAMSMCLLEQLLIKCIKLDPVNSQRMLDPRILETCHFISANLHINH KITEIAEHIHMSPSRLTHLFAQQTGSSIIKWREEQRMIKAQHLLHTSGAP IYAIARQLGYDDQLYFSRLFKRYSGLSPSDYRNSR >MS1400 araC, AraC protein MANIRQNQSISELHYQPHKHHPYGIELFTVASLRARSAEVVMEKNYLYQC DMIIVVTQGSGTLWQDFEPVACMQGSVLWIKQGQACSFGNDKHWDGWVLM IKNKPLLSEFDYQINTLWLSENELENVEQSLKQLKQDSEKPYSIVHKQLI HHQFYAFLWRLISLTPNQTILYSPRLRSRFDSFQSLLESYFHEWHHVHQY ATALACSEKTLSRACLEITQQPAKTVINNRLLLEAKRLLVQSNQSIASIS LQLGFNEATHFVKFFKREAGITPQKFRELG >MS2105 araC, AraC protein MSGILFLLITCLFIQIIMFSEQSFARLLDVIPHNQTYHSPIKGLIIHHSD HPFSYDNVIQEPSICIVIRGEREVQLGNQCYLFDNRHFMFCPVNVPMCGK VLQATAEEPFVVMSMKIDLQAVNKILLEQTALLAKNSENPTAFGQWHLDA ELENAFERLLLLHENTKDITFLAPLIQQEIYYRLLTGEQGDKLKQMVSFG SNTQKIAKATEYLKAHYIETITVESLAELCGMSLSGFHNHFKKHTTLSPL QYQKSLRLMEANRLISQENLPISTAAFQVGYESPSQFSREYKRYFGKAPS VR >MS2131 araC, AraC protein MLNWLIRQTLKLKSGEKGTMGIETPVPELFVFHSETDLRDVSQLQESGIC LILQGRKDVRVGDQHYRYQAGEFVCYTVDLPIMTEYLTDDGGYLDLRLFF DLPLMREIIDELNRQNFSFAPASQQKIVSTASPELIRAFEMLICLTENSQ DLPIMLPLIKKAIYFYLLTGEQGGTLRQIALQNSNSQRIVETVGWLKEHY NESFDIEQLAAASSMSISGFYAQFRRLTGMSPLQYQKNLRLTKANALLKL GQKNISEIAFEIGYDSLPQFSREYKRYFGHSPRSDLSRAG >MS1229 araC, AraC protein MRFCWYTSDNNKSVVNQINMKTSHLAKQTSTELADKSGSEIISPLSLSLD ARPFNVEIQQPPGNMPAYHWHGHIEINIPFDDDVEYSFNEHSTLINAGHI SIFWASIPHRLTDKHNCRTMAVFNIPVYQFLSWQLSQNLINHITHGIIIQ SKNPRLVSLFEVQRWEQELKLEDPNRHKLVYDEIQLMIKRVSLDGWLLLL EPPKKNNHQLSGSKHAQNYVRTMLDYIANHYNAPLTVQSVANAVGLNTNY AMGLFQSAMQLTIKQYIIMMRINHAKALLSDTNRSVLDISLTTGFSSMSR FYDNFLKYTGVSPNKYRKQIRADDNWSAQGLIPTTQAIKGASTGEKLIMT GEHFNQSEEF >MS2322 araC, AraC protein 
MIQKLLARDFFNNKEQPIILEPRAPQEIFPEHTHDFDELVIVKHGSGRHI LNGYPHDLYPGVVLYIQAQDHHSYENLQDLCLTNILIQSNNNFKYLNNID ILLNGLKPENSSYQLINKKTAEYIDSLLEKINAIDESYNLQNECLFFQVL SSIQAHQFNDSGYGNTEEKGRQMIRWLENNFEKEIDWEELAEKFALPIRT LHRYIKSQTGHTPQNYVTKLRLAQAYYQLKYTEKNIINIAYDCGFNDSSY FSTCFKNEYSIAPRELRI >MS2327 araD, AraD protein MQNIINSWFVQGMIKATYDMWLKGWDERNGGNVSLRLLDDDVVSYKDEFY QNPRHVEITQNITALANQYFIVTGSGKFFRNVIIDPADTLAVIKVDEQGK GYYIMWGLVNGGVPTSELPAHLQSHIVRMKVSGGKDRVIMHCHATNLIAL TYVLELDPKVITRELWEMSTECLVVFPDGVGVLPWMTPGKDEIGYATAQE MAQHPLVLWAFHGVFGTGPTLDDAFGLIDTAEKSAEILVKVLSMGGKRQT IQTDEFKLLAERFGVTPMDGVL >MS0046 araD, AraD protein MLKELRERVLQANLELPKHKLITFTWGNVSEIDREKGLVAIKPSGVDYDV MTVDDIVIVDLDGNHVWGDKKPSSDTATHLELYRQFPEIGGIVHTHSRHA TAWAQAGEDLIALGTTHGDYFYGAIPCTRKMTAEEIAGEYELETGKVIVE TFRKRGINPTDIPAVLVNSHGPFVWGKDGFNAVHNSVVLEEIAYMNAFSK LIRPNVQSMQQELLDKHYLRKHGKNAYYGQ >MS1979 araD, AraD protein MTDLEQKELMVQLGRSFYERGYSVGGAGNLSVRLDENRVLVTPTGSSLGR LKVERLSVLDMDGNVLEGDKPSKESVFHLEMYRKNPKCNAIVHLHCTYLT ALSCLQGLDPTNAMKAFTPYYVMRVGKMQVIPYYRPGSPEIARELSERAL SGKAFLLANHGVVVTGADLLDASDNTEELEETAKLFFTLQGQKIRYLTDD EVKDLENRGK >MS0061 araH, AraH protein MIMTSTTQEKSAGSFSKIWNAYGMLLIFAVIFVCSCVFIPNFATVVNMKG LGLAISMSGIVACGMLFCLAAGEIDLSVASVIACAGVVTAVVINMTQSVT IGILAGLGLGIAVGLINGFVIAKLKINSLITTLATMQIARGFGYIISDGK AVGITKEEFFELGYQDIFGVPLPIIFTVICMVVFGFLLSKTTYGRNTLAI GGNQEASRLAGINVDRTKLIIFVVSGFVSALAGVILAARMTSGQPMTSVG FELVAISACVLGGVSLNGGVAKISFIIAGVLILGTIENAMNLLNISPFAQ YVVRGLILLIAVIFDKYKQKFIKS >MS0199 araH, AraH protein MSAIKLNVRDAGTLVGLVIIFVVFSFLSPVFFTVPNLLNILQQSSLNAAI ALGMTLVIISAGIDLSVGPTAALSAVLGASLMVSGVPVPIAVLGALCIGS LGGLFNGVLIAYAGLQPFIVTLGSLSLYRALSLIYTGGNPIFGIPAEFRA FMNGSLFGIPSSILIVASIALILWVVLNKTPLGEYIFAVGGNEEAARVCS VPVAKTKVAVYMISGFLASVAGLVLVGRLGAADPTLGNLWELDAIAAAAI GGASLMGGKGSIIGTILGAVILGALRNGLTLLNIQAFYQLLATGLIIIVA MLIDRATRGK >MS0641 araH, AraH protein MAGQKNKTWDFFKQNAIYFVLLILLGVIIAQDPSFLNLMNFSNILTQSSV RLIIALGIAGLLVVQGTDLSAGRQVGLAAVVAATMLQAIDNLNRVFPNLP EMPIFVVILIVCSIGAVIGLINGFVVAILNVTPFIATMGTMIIVYGFNSL 
YYDAVGGSPIAGFSENFSSFAQGFFRFGSFKLSYITIYAIIATILMWILW NKTRFGKNIFAIGGNPEAARVSGVNVTRNLLVIYMLAGVFYAFGGMLEAG RIGSATNNLGFMYELDAIAACVVGGVSFAGGVGTIIGVVTGVLIFTVINY GLTYIGVNPYWQYIIKGSIIILAVAIDSLKYAKKK >MS1610 araH, AraH protein MFSFKKLISKLGIGLVLLFMIIGMSLTSQAFLSTNNIFNILLQVSVICVI SVGMTYVILTGGIDLSVGSIVALSAVCLGLFTHWGVAWLGENPSQGALLA VVLLSIVGAVLVGALCGYVNGVVIVYGKVTSFITTLGMMGIARGLALTLS DGKTIYNFPEQLRFFGNGRLAVTENFSIPIPVIIALIVVLISFYVLTQTV FGRQIYALGGNREAVRLSGINVNKLEIKTYVINGALAAVGAVILVGRLNA AQPIAGTGYELDAIAATVIGGTSLMGGVGSVVSTSIGALIMGVLQNGLTL LNVTSYLQRLIIGMVIILAVFLDQLRRGEVSTGGLRRIFFRE >MS1579 araJ, AraJ protein MLNRKLVNRVEYFRVIVMAFAAFVFNTTEFVPVALLSDIADSFQMPVSNT GLMITIYAWIVSLCSLPCMLMTARLERRRLLISLFILFIASHILSAFAWN YEVLLIARAGVALTHSIFWSITAALTIRIAPKNKKTQALGLLALGSSLAM VLGLPLGRIIGQAFGWRTTFTLIGVFAALILILIVRLLPKIPSQNAGSLK SLPVLARRPMLITLYIFTILVISAHFTAYSYIEPFMIQIGRVSANKATAV LLIFGVSGVVASVLFSRLYRIAPIKFLLSSVAILTLALICLYGVSGISGA IFALVFIWGVAISALSLAMQMKVLQLAPDATDVATAIYSGIYNIGIGGGA LIGNQVMQHLGLANIGYVGAVLGAVSIIWFILMFLKFSRVPLNIVNQ >MS0754 argB, ArgB protein MRSTELVQWFRQSTPYVNMHRGKTFVIMLDGNTIASSNFINIINDISLLH SLGIKLIIVYGARVQINSLLAQNNVTSVYHKNIRVTDPRTLELVKQAVGQ LSYDITARLSVRLPHSPVLNVVSSNFILAQPIGVDDGVDYMLSGKIRRIE IDNIKHHLDNNAIVLLGPIAPSVTGETFNLPFEEIATQVAIKLKAEKLIG FSSTQGILDPQGISIPDLLPQDAAKYLNQYIQQGEYHCSQARFLQAAIEV CKAGVKRSHLLSYEEDGSLLQELFTRDGVGTQLSVDNSEDIRIATVQDIP GLIELIHPLEQQGILVKRSREQLEMDIANYTIIDRDGVIIACAALNQYPE ENMAEMACVAVHPDYRSSSRGDILLEAIQKRARQLGIEKLFVLTTRTVHW FQERGFRLANVEDLPKEKRDHYNYQRRSKILIQPLNEEE >MS0236 argB, ArgB protein MKPLVIKLGGVLLDTPAAMENLFTALADYQQNFARPLLIVHGGGCLVDDL MKRLNLPVQKKNGLRVTPADQIDIIVGALAGIANKTLVAQAAKFKLNPVG LCLADGNLTQATQFDPELGHVAMVVAKNPALLNNLLGDAFLPIISSIAVD DNGLLMNVNADQAATAIAALINADLVMLSDVDGVLDANKQRLTELNSAQI EQLIEDKVITDGMIVKVNAALDAAKILNCGVDIANWKYPEKLTALFAGEI IGTRINP >MS0235 argC, ArgC protein MAQKAIVIGASGYTGAELARILTHHPEFELAGLYVSTNSADANKSISTLY PQLKTICDLPLQPLPEDLTEIAQNADLAFFGTAHEVSANLAPVFLQNNCK VFDLSGAYRVNSESFYQEFYGFEHKHPELLKQAVYGLAEWNADKIKTTDL VAVAGCYPTVSQLSLKPLIEEGLLDVNQLPVINAVSGVSGAGRKASLTSS 
FCEVSLNAYGVFNHRHQPEIATHLGTDVIFTPHLGNFKRGILATITAKLK AGVSDEQIKRAYAKYYANKPLVRVYEQGLPSIKAVEFSPYCDIGFATKNN HIIIVGAEDNLLKGAAAQAVQCANIRYGYNEVLGLI >MS0783 argD, ArgD protein MILVAGPNVLRFAPALNISQQEVAEGFKRLDQALQKFA >MS0782 argD, ArgD protein MSQYTRKTFDEVMIQNYVPADFIPVKGKGCKVWDQQGRDYIDFTSGIAVN ALGHCPDEIVDVLKKQGETLWHSSNWFTSEPTLELASKLVEHTFAERVMF ANSGGEANEAALKLARRYAVDNYGYQKDTIISFKKSFHGRTLFTVSVGGQ AKYSDGFGPKPAGIVHLPFNDLDAVKAMIDDHTCAVIVEPIQGESGIIPA TKEFLQGLRRLCDENNALLIFDEVQTGVGRTGYLYAYESYDVVPDILTSS KALANGFPISAMLTTTKIAASFKPGVHGTTFGGNPLACAVGAKVIETIAN PAFLENVQKTSALFISELNKLNEKYHLFNEVRGQGLLIGGGIN >MS0829 argD, ArgD protein MTITTPVKAVLASNQYFLDRQNAMESNVRSYPRKLPFAYAKAQGCWVTDV EGNEYLDFLAGAGTLALGHNHPVLIQSIKDVLDSGLPLHTLDLTTPLKDA FTEELLSFFPKDQYILQFTGPSGADANEAAIKLAKTYTGRGNVIAFSGGF HGMTHGALSLTGNLGAKNAVQNLMPGVQFMPYPHEYRCPFGIGGEAGAKA VERYFENFIEDVESGVVKPAAVILEAIQGEGGVVPAPVSFLQKVREVTQK HGILMIVDEVQAGFCRSGKMFAFEHAGIEPDIVVMSKAVGGSLPLAVLAI KKEFDAWQPAGHTGTFRGNQLAMATGYASLKIMREENLAQNAQQRGEYLT QALRELSKEFPCIGNVRGRGLMMGIDIVDERKPQDAAGAYPQDGELAATI QKFCFKNKLLLERGGRNGNVVRVLCAININQAECEEFIKRFKQSVTDAIK AVRG >MS0674 argE, ArgE protein MKNTIINLAQDLIRRPSISPDDQGCQQVIAERLTKLGFNIEWMSFNDTIN LWAKHGTTSPVVAFAGHTDVVPTGDENQWNYPPFSAQIVDDMLYGRGAAD MKGSLAAMIVAAEEYVKANPNHAGTIALLITSDEEAAAKDGTVKVVESLM ARGENIDYCLVGEPSSAKQLGDVVKNGRRGSITGDLYIQGIQGHVAYPHL AENPVHKATKFLTELTTYEWDNGNEFFPPTSLQIANIHAGTGSNNVIPGE LYVQFNLRYCTEVTDEFIKNKVAEMLQKHDLTYRIDWNLSGKPFLTKPGK LLNAVVESLESVAGIKPKLDTGGGTSDGRFIALMGAEVVELGPLNATIHK VNECVSCRDLATLGEVYRQMLVNLLGK >MS0233 argE, ArgE protein MKRLPKFLDMYSQLIALPTISALEPEFDQSNKALIELLADWLATLGFKTE IIPVENSRAKYNLLATYGEGEGGLLLAGHTDTVPCNEELWTTNPFKLTER DGKFFGLGTADMKGFFAFVIDAVRQIDLTKLTKPLRILATADEETTMLGT RTFIRHTHIRPDCALIGEPTSLRAVRAHKGHVGKAVRIIGKSGHSSDPAK GINAIELMHEATGYLMQMRNELRDKYHHDAFEIPYPTMNFGAIHGGDAVN RICGCCELHFDIRPLPKMRLEDLDEMLQQKLAPMFEKWGDRISIEALHEP TPGYECEHSAQVVQVVEKLLGEKCEVVNYCTEAPFIQELCPTLVLGPGSI EQAHQPDEFLSAEFIEPTRDLLTKMIMHFC >MS1555 argE, ArgE protein MSVNMKRIQTIIEKLASISSVPGELTRLAFSAEDEAAHNYLIELCKPYDL 
SIRRDQVGNLFIRKSGIEDHLPAVTFGSHIDTVVNAGKFDGPLGSVGGLE ILFQLCEQGVQTRYPLELIIFTCEESSRFNYATLGSKLMCGIANRESLSR LRDKQGNSLEEAMATIGLDFTEVDQVKRNAEEFKCFFELHIEQGPRLANE RKTIGVVTGIAAPIRCIVKIQGQADHSGATAMHYRRDALLGGAELALAIE RAAIDAGHSTVATVGNLNAKPGVMNVVPGYCELLVDIRGIHSEARESVFT VLQQQIEQVTAKRGLSIELQLISKDQPILLPDQMVQQISRAAQDLGYAYE IMPSGAGHDAMHMATFCPTGMIFVPSKNGISHNPLEFTSWEEIEAGIKVL QLVVLEQAEKV >MS1073 argF, ArgF protein MPFNLKNRHLLSLVNHSPREIKYLLDLARDLKRAKYAGTEQPRLKGKNIA LIFEKTSTRTRCSFEIAAYDQGANVTYIDPTSSQIGHKESMKDTARVLGR LYDAIEYRGYKQETVEELAKFSGVPVFNGLTDEFHPTQMLADVLTMIEHS TKPLNEIKYVYIGDARNNMGNSLLLIGAKLGMDVRICGPKSLLPEENFVS ICEEISKETGARLTVTDDIDLAVKDADFVHTDVWVSMGEPIEAWGERINL LMPYQVNTDLMKRTGNPNVKFMHCLPAFHNCETKVGREIAAAYPNLANGI EVTEDVFESPMNIAFEQAENRMHTIKAVMVASLA >MS1479 argG, ArgG protein MSNTILQNLPLGQKVGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPD EDDYNAIPKKAMAYGAENARLIDCRKQLAQEGIAAIQCGAFHISTGGVTY FNTTPLGRAVTGTMLVAAMKEDDVNIWGDGSTFKGNDIERFYRYGLLTNP NLKIYKPWLDDQFIDELGGRFEMSQFLIANGFDYKMSVEKAYSTDSNMLG ATHEAKDLEDLSTGIKIVKPIMGVAFWDESVEIKPEVVTVRFEEGVPVEL NGKRFDDVVELFMEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMAL FHIAYERLVTGIHNEDTIEQYRINGLRLGRLLYQGRWFDPQALMLRESSQ RWVAKAITGEVKLELRRGNDYSILDTVSPNLTYEAERLSMEKVEDAPFDP IDRIGQLTMRNLDVTDTRNKLGIYSEAGLLTAGKDAVVPQLGSK >MS0237 argH, ArgH protein MALWGGRFTQAADQRFKDFNDSLRFDYRLAEQDIEGSVGWSKALVSVGVL TTDEQQQLERALNELLIEVRSNPQAILQDDAEDIHSWVESKLIDKVGNLG KKLHTGRSRNDQVALDIKMWCKAQVTELQYAVRDLQAKLVETAENNQHAV MPGYTHLQRAQPISFAHWCMAYVEMLERDYSRLADAYNRMDSCPLGSGAL AGTAYPVDREQLAKDLGFAFATRNSLDSVSDRDHIIELLSTASLSMVHLS RFAEDMIIFNSGEADFVELSDRVTSGSSLMPQKKNPDACELIRGKAGRVI GSLTGMMVTVKGLPLAYNKDMQEDKEGIFDALDTWHDCLTMAAFVLEDIR VNVERTREAALKGYSNATELADYLVAKGVPFRDSHHIVGETVVYAIKVHK GLEDLSIEEFRQFSDVVGEDVYPILSLQSCLDKRSAKGGVSPLRVAEAIA DAKARIAAKK >MS1267 argR, ArgR protein MAIEKTDNLLTVFKDLLSQERFGSQSEIVSALQDLGFSNINQSKVSRMLT KFGAIRTRNTRMEMVYCLPNELSVPNTSSPLKNLVLDIDHNDFLIVIKTS PGAAQLIARLLDSVGKTEGILGTIAGDDTIFITPTKGTGIKELINTIQQL FENSL >MS1330 argS, ArgS protein MNIQWILSDKIKRAMIAAGAEQNAEPLVRQSGKPQFGDYQANGIMGAAKK 
LGLNPREFAQKVLEQVDLSDIAEKTEIAGPGFINIFLNKNWVAQQADTAL NTPNFGIKTAHPQTVVIDYSSPNVAKEMHVGHLRSTIIGDAVARALEFMG NHVIRANHVGDWGTQFGMLIAYLEKMENEHADAMQLSDLEAFYRAAKETY DNDEEFAVKARSYVVKLQSGDEYCRTMWKKLVDMTMQQNQRNYERLNVTL TEKDVMGESLYNPMLPAIVEDLKKQGLAVEDDGALVVYLDEFKNKDGDPM GVIVQKKDGGFLYTTTDIAAAKYRYHTLHADRALVFSDTRQSQHMQQAWL ITRKAGYVPDSFSLEHHNFGMMLGKDGKPFKTRSGGTVKLADLLDEAVER ATLLINEKNTALSEQEKAAVIEAVAIGSVKYADLSKNRTTDYVFDWDNML SFEGNTAPYMQYAYTRIRSIFNKTDVNPTALSAAHIEIRNDKERALAIKL LQFEEAVQTVAKDGTPHILCNYLYELAGVFSSFYEHCPILNAEEPVKLSR LKLAKLTEKTLKQGLDLLGIKTVEKM >MS1575 aroA, AroA protein MEKLTLTPISHVEGTVNLPGSKSLSNRALLLAALAKGTTRVTNLLDSDDV RHMLNALKQLGVNYSLSEDKSVCEVQGLGKAFAWQNGLALFLGNAGTAMR PLTAALCLANADSVPAEIILTGEPRMKERPIKHLVDALLQAGADVQYLEQ EGYPPLAIRNTGLKGGKVKIDGSVSSQFLTALLMAAPMAERDTEIEIIGE LVSKPYIDITLNMMKIFAVDVDNQNYQRFVVKGNQQYQSPNIFLVEGDAS SASYFLAAGAIKGKVRVTGVGKNSIQGDRLFAEVLEKMGAKITWGEDYIE AERGELNGIDMDMNHIPDAAMTIATTALFAQGETVIRNIYNWRVKETDRL SAMATELRKVGAEVEEGEDFIRIQPPASDQFKHAEIETYNDHRMAMCFAL VALSNTAVTICDPKCTAKTFPTFFDEFSAIATV >MS1968 aroB, AroB protein MVCVNVELKERRYPIYIGENLLTDTGVYPVKMGDKVMIVSNPTVAQYYLT PVTETLEKLGCQVSHVLLPDGEKYKTLDSLNMIFTALLKENHGRDTTLIA LGGGVIGDVTGYAAASYQRGIRFIQIPTTLLAQVDSSVGGKTAVNHELGK NMIGAFYQPCTVIIDTRTLVTLPKREVNAGLAEVIKYGAILDLPFFEWLE AHIDNLVALNQQDLQYCIARCCQIKADVVARDETEKGDRALLNLGHTFGH AIETHLGYGNWLHGEAVAAGSMMAAVLSEKLGDLSYSEVARLEKLLARAN LPTVSPDTMQAEDYLPHMMRDKKVLAGKLRLVLLKTLGQAYVASDTDKSL VLDAIRVCSQNN >MS0866 aroC, AroC protein MAGNSIGQLFRVTTFGESHGIALGCIVDGVPPNMALSEADIQPDLDRRKP GTSRYTTPRREDDEVQILSGVFEGKTTGTSIGMIIKNGDQRSKDYGDIMD KFRPGHADYTYQQKYGIRDYRGGGRSSARETAMRVAAGAIAKKYLREQFG VEVRGFLSQIGDVKIAPQNISEIDWAQVNDNPFFCPDQSAVEKFDELIRQ LKKDGDSIGAKLTVVAENVPVGLGEPVFDRLDADLAHALMGINAVKAVEI GDGFAVVEQRGTQHRDEMTPQGFLSNHAGGILGGISTGQPIIATIALKPT SSITVPGRTVNLNNEPVELITKGRHDPCVGIRAVPIAEAMTAIVLLDHLL RHRAQCGLK >MS0133 aroE, AroE protein MQLKSSLIVNQHLRLEQITEDDAEPVFRLICRQRDYLSRWLPGVGLTSNV SSTLKFIRSLKPLEQVFTIRRDDEIIGLVSFNKADYSNLKLEIGYWLSQS EQKQGIMTQCVQTMIDYAFNQLYFNRIQIKCAIGNTASKGIPQRLGFQLE 
GIERQGLLLLSGEFADFEIYSMLAQDWKNKQDKQIMDTYAVWGNPIAQSK SPAIHKIFAEQTGQNMKYIAMLGDEQHFERQLQEFFAQGAKGCNITAPFK ERAYRLADEYSERALTAGACNTLKKLENGKLYADNTDGAGLVSDLQRLGW LKPNQQILILGAGGATKGVLLPLLQAQQKILIANRTLAKAEELAEKFSPY GEIRAVELKTIPPYRYDVVINATSLGLTGKTADIQPEILQQAGAVYDMQY AKETDTPFIALAKSLGVNNVSDGFGMLVGQAAHSFRLWRGIMPDIEVLLN RGI >MS2315 aroE, AroE protein MINKDTQLCISLSGRPSNFGTRFHNYMYEKLGLNFVYKAFTTNDIEHAVK GVRALGIRGCAVSMPFKESCMPFLDEISPSAKAIESVNTIVNTDGYLKAY NTDYIAISKLIAKYQLKPTACVIIQGSGGMAKAVAAAFKNAGFDNLKIYA RNATTGGYLAKLYGYQYIDSLYGQNADILVNATPIGMKGGGKEESIISFP EAMIDQASVAFDVVAMPAETPLIKYARQQGKTVISGAEVAVLQAVEQFEL YTGQRPGDELIAEAASFARANS >MS1104 aroG, AroG protein MKDSIHNVHIIDEKVLITPAELKQKLPLPIALRTQIETHRREIADIVHKK DDRLLVVIGPCSVHDTKAAIDYAKRLKALSDELKDQLYIVMRVYFEKPRT TVGWKGLINDPRIDGTFNVEEGLHIGRKLLLDLAEMGLPLATEALDPMTP QYLADLFSWSAIGARTTESQTHRELASGLSMAVGFKNGTDGSLATAINAM KAASMGHSFIGINQQGQVNLLHTEGNPDGHVILRGGKKPNYQQEFVNQCE EELAKAGLETAIMIDCSHGNSNKDYKRQPSVAKDAVNQIVAGNKSIIGLM IESNINAGNQSSEQKVSEMKYGVSITDACIDWETTDNLLRKIAAALKNRA E >MS1184 aroG, AroG protein MISFKVRLNFSIFRMIYRELIMPTKNKNNIRVANDDTRIANIEQLLPPVA LLEKYPASNVAVKTVRNARNKAHQIIHGEDDRLLVIIGPCSIHDPKAALE YANRMAKMREKYKDTLEIIMRVYFEKPRTTVGWKGLINDPYLNDTYALND GLRIARKLLSDINDLGLPTAGEFLDMITPQYVADFMSWGAIGARTTESQV HRELASGLSCAVGFKNGTNGGVKIALDAIGAAEASHHFLSVTKFGHSAIV STKGNLDCHIILRGGDKGTNYDAENIAKVCANIEKSGRIGHVMIDFSHAN SSKQFKKQVEVCHDVAKQIAQGSNQIFGVMVESHLVEGRQDLVNGKAETY GQSITDACIGWDDTEIVLQELSDAVAARRKVNGK >MS1969 aroK, AroK protein MRLRILLLFIENFKKNNTMAEKRNIFLVGPMGAGKSTIGRQLAQLLNMEF IDSDNEIEQRAGADISWIFDIEGEDGFRKREERIINELTQKQGIVLSTGG GAILSKETRNHLSARGIVIYLQTTVDKQFERTQRDKKRPLLQGVEDVRKV LEDLAQVRNPLYEEVADITLPTDEQSAKLMASHIVELIDNFNS >MS1790 aroQ, AroQ protein MSQLSRILLLNGPNLNMLGAREPKHYGTLSLAAIEANVQALAAKNNIELE CFQANSEEKLIDKIHQSFKKVDFILINPAAFTHTSVALRDALLAVAIPFV EIHLSNIHKREPFRHHSYFSDVAEGVICGLGAKGYECAFEFAVEFLAKKA >MS1959 arsC, ArsC protein MSVIIYHNPRCSKSRETLKLLQDQNINAEIVLYLEKRFSVSELQSLMKKL NIHSPKEMMRIKDALYQELQLNNEHISEQELLEAIGNHPALLERPIVING DKAKIGRPPEAVLSIL >MS0672 arsC, ArsC protein 
MITVYGIKNCDTVKKALKWLTDNNIEHKLHDYRTDGLDPEFLINAEAQFG WQTLVNKRSTTWRNLDSQIKENMEKHTALSVLAEQPTLIKRPIILQDGIA LIGFNIKEYKKAFG >MS0220 artI, ArtI protein MQVVLKRRKQNNSDNIYLTINQGSYMKKLLLAAALAGTTFAAQARDITFA MEPSYPPFELTNAQGEIIGFDVDVAKAICKEIEANCNFKSQSFDALIPSL KAKRFDAAISAIDITETRAKQVLFSDAYYDSSASFIAVKGKADLNSAKNI GVQNGTTFQQYTVAEAKQYSPKAYTSLQDAILDLKNGRIDIIFGDTAVLA DMLAKEPELTFVGDKVTNKKYFGNGLGIAVNKSDKALVENLNKGLAAIKA NGEYQKIYDKWMTAK >MS0704 artI, ArtI protein MKKLLLSTLLITTAFAVSAKDISFAMEPTYPPFEFTNEKGEIIGFDVDIA NALCKEMQANCTFKSQAFDALIQGLKQKRFDASISGMGITEARKKQVLFT EPYFSSSAAFIAKKGTDFTKVKTIGVQNGTTYQNYIIKEKPEYEVKAYAS FQDALLDIQNGRIDAIFGDIPVLVDMIKKTPELAFAGEKIDNKTYFGNGL GIAANKANQELIDEFNQALIKIRQNGEYQKIYDKWMTAK >MS1277 artI, ArtI protein MFKKLVLLATGMFAVATTTQAVAADSLLDRINNKGTITVGTEGTYAPFTY HDASGKLTGYDVEVTRAVADKLGVKVEFKETAWDSMMAGLKAGRFDIVAN QVALTTPERQATFDKSEPYSWSGAMMAVRADDDSIKTLDDIKDRKAAQSL TSNYGELAREKQAKIVPVDGLAQSLLVVQQKRADFTLNDSLAILDYLKKN PNSGLKSAWEAPAEEKLGSGLIVNKGNDEALAKISAAVIELQKDGTLKKL GEQFFGKDISVK >MS0900 artI, ArtI protein MKKATLATLIAAMFVTATAQAQTSPDTLTKVLETKELVVCSPGDYKPFSF DNNGKFEGVDNDLMDKLAQSMGAKVTIVKTTWKTLMDDFTANKCDIAVGG ISITLERQQKALFTEPYFINGKTPIVRCENVDKYQTVEQINRPEVRIIAN PGGSNEKYARNELSNANLTMNAENLTIFQQVIDKKVDVFVSEAAEAIVKA HEHKGVLCAVNPDKPLKPAQNGWLIHNGDYRFKSYVDQFLHLEKMSGNLD KTINKWLPRD >MS1808 artI, ArtI protein MIERLQCHHFPNTESSHLKGLLLRIIAAFALVLWAIDMVFPWQQMMRSEE NRYNAIQQRGKLVVGTVNNPVSYFIGNEGQAGLEYELSRAFADYLGVELE MKAMDNGEQLFDALEDNEIDIAAANLLYQAKKAETFQLGPAYYSASWQLV YRKGESRPQSLSQIKDKLIIARGSELPLILQGYQTKYPNLKWQLENNQTQ EELLLQVAQGKIKYTVANSIDVSAVQQVRPEIAVAFDVTDEASVHWYLPN NSYNELQAALLDFMNTALEGGLIARIEEKYFNHFSQFDYVDMRQYVQAIN DILPKYAPLFDRYKGDLDWRLLAAIAYQESHWNENATSPTGVRGMMMLTK DTAERMKIADRTDAEQSIKAGSEYLHWLISQVPDSIPKEDRIWFALTGYN MGLGHMLDARRLTKNLGGDPDNWLDVKKNLPLLAEKRYYPNLKYGYARGY EAFQYVENIRRYMNSIINYYRVQENADNKDKPSETDENLPLPLTDNQEKQ E >MS1684 artI, ArtI protein MKIFKKTTALLAAALLATGLTACDNKDSGAASADNNAVSAIERIKKADKV RIGVFSDKPPFGYVDKDGKVQGFDVEIAKAVTKDLLGDENKAEFVLVEAA NRAEYLLSNKVDITMANFTVTPERKEVVNFAKPYMKVALGVVSKQDAPIT 
DVAQLADKTLLLNKGTTADAYFTKNFPKNKSLKFEQNTETFQALLDGRGD ALSHDNTLLFAWAKENPGYVVAIKNLGDLDYIAPAVKKEDTDLLQWLDGE IEKLAKDGTLNKAYQKTLQPIYGDEIKEADVLVEYQ >MS1687 artM, ArtM protein MSIIMNWQYIWNALPRFVDATILTLELSFWAILFSVIIGVICAVVMSYRV RGLQTIVKAYIELSRNTPLLIQIFFLYFGLSKIGVKLEGFTCAVIGLAFL GGSYMAEAVRAGIESVSKGQVESALSIGLTPMQTFRYVVFPQAFAVATPA IGANCLFLMKETSVVSAIAIAELMFMAKEIIGMDYKTNEALFLLVVFYLI ILLPVSVFIGYLERRLRRAKYGA >MS0222 artM, ArtM protein MFREYFMEIARGIPTSLLLTAVALAVAFVLALFLTFLLSMENKPVKRVIN IFLTLFTGTPLLVQFFLIYSGPGQFQWIVNSALWPLLSNAWFCAMFALAL NSAAYSTQLFHGAVKAIPKGQWESCAALGLSRLQTLKILIPYALKRALPS YSNEIILVFKGTSLASTITIMDIMGYARQLYGTEYDAITIYGIAGVIYLV ITGLMTLLLRKLEHKVLAFERLEVEKA >MS1686 artM, ArtM protein MGLTLLFEGNNLQRLLAGLGITAEIAFVSVFFACILGIVMGVVMTSRNIF VRGFCRLYLEIVRIIPLLAILFIVYFGVAKWFNVHLSGVTVCILVFIFWG TAEMGDLVRGALTSIEKHQTEAAYALGLSKIQTFIYILLPQSLKRVTPGA INLFTRMIKTSSLAMLIGVLEVIKVGQQIIETSLFRDPTSALWIYGVIFA LYFAICYPLSLFSKYLEKRWEN >MS1276 artM, ArtM protein MLNNLLLSIPFMTESRVDLVISAFWPMVEAAVLVSIPLAVSSFIIGMIIA VAVALVRVTPVNGVIHRLFLVIVKVYISIIRGTPMLVQISVVFYGLPALG IFIDPIPAAIIGFSLNIGAYASETVRAAISSVPKGQWEAGYTIGMSYMQT FRRIIAPQAFRVAVPPLSNTFIGLFKDTSLASVVTVTEMFRVAQQMANMS YDFLPIYIEAGLIYWCFCWVLFVIQAKVEKRMERYVAR >MS0221 artM, ArtM protein MFFEYLPLMSTATLMTLGLAVCSLIAGLVLAIFFVVLETNKFVCVRKPTA IFVTLLRGLPEILVVLLIYFGSTELVEKLTGEYIEFSPFLCGVIALAIIF AAYASQTLRGAIQAIPLGQWESGAALGLSRGYTFVNIILPQVWRHALPGL SNQWLVLLKDTALVSLIGVDDLMRQASLVNTNTHQPFTWYSFAALLYLII TLVSQFFMRKLEMRFTRFERGVK >MS1101 asd, Asd protein MAILFLLFHDFLPPYFLLQLKICRIERLFIAERIMSTSLNIAIAANFDLC EKIASYLEESLLEVEKLSIVEIYPFSEEQGIRFNGKAVAQLPVDEVEWSD FNYLFFAGDLAHIPLLAKASEAGCLTIEMNGVCSALADVPVVIPGVNEEQ LRDLRQRNIVSLPDAQVTQFALSVRSLLNNASNAQIVVSSLLPASYYDAD GVHKLVGQTAKLLNGIPPDEEEMRFAFDVFPAKSLNLNAQLQRVFPQLEN VVFHQIHVPVFYGLAQMVTVKAEFEPEQDSILAEWSTNDLIRYHQDKVMT PVLNGEAENNEDEVHLQISALESVEGGIQYWLVADNQRFSQAFLAVKLLE SIYRQGY >MS0006 asd, Asd protein MKNVGFIGWRGMVGSVLMDRMQQEQDFANLNPVFFTTSQAGQKAPVFGGK EAGNLKDAFDIEELKKLDIIVTCQGGDYTNEVYPKLKATGWDGYWVDAAS ALRMEKDAIIVLDPVNQHVIADGLKNGIKTFVGGNCTVSLMLMALGGLFE 
RDLVEWISVATYQAASGAGAKNMRELVSQMGLLEKSVSEELANPASSILD IERKVTAEMRADSFPTDNFGAALAGSLIPWIDKLLPSGQTKEEWKGYAET NKILGLSDNPIPVDGLCVRIGALRCHSQAFTIKLKKDVPLEEIEQILASH NEWVKVIPNDKETTLRELTPAKVTGTLSVPVGRLRKLAMGPEYLAAFTVG DQLLWGAAEPVRRILKQLVA >MS0036 asnA, AsnA protein MKKSFILQQQEISFTKNTFTEKLAEHLGLVEVQGPILSQVGNGIQDNLSG TEKAVQVNVKMITDAAFEVVHSLAKWKRHTLARFGFAEGEGLFVHMKALR PDEDSLDQTHSVYVDQWDWEKVIPEGRRNLDYLKETVREIYAAILETEAA VDKKYGLKSFLPKEITFIHSEDLVKDYPGMTDKERENELCKKYGAVFLIG IGGVLPDGKPHDGRAPDYDDWTTTSEGEYKGLNGDILVWNPILNRAFEVS SMGIRVDETALRKQLSITGDEDRLKFDWHQDLINGRMPLSIGGGIGQSRL AMLLLQKRHIGEVQSSVWPKAVMEQYENIL >MS1042 asnS, AsnS protein MTKIVSVAEVLQGRTAIGEKVTVRGWVRTRRDSKAGLSFLAVYDGSCFDP IQAIINNDIVNYESEVLRLTTGCSVIVTGTVSKSPAEGQAVELQAETVEV VGWVEDPDTYPMAAKRHSIEYLREVAHLRPRTNIIGAVARVRHCLAQAIH RFFNEQGFYWVATPLITASDTEGAGEMFRVSTLDLENLPRDDKGAVDFSQ DFFGKESFLTVSGQLNGETYACALSKVYTFGPTFRAENSNTTRHLAEFWM VEPEVAFATLADNAKLAEDMLKYVFNAVLKERMDDLKFFEKHIDKDVINR LERFVASDFAQIDYTDAIDVLLKSGKKFEFPVSWGIDLSSEHERYLAEEH FKSPVVVKNYPKDIKAFYMRLNEDGKTVAAMDVLAPGIGEIIGGSQREER LDVLDARMAEMGLNPEDYWWYRDLRKYGTVPHSGFGLGFERLIVYVTGLQ NIREVIPFPRTPRNANF >MS1984 aspA, AspA protein MAATRKEVDLLGEREVPADAYWGIHTLRAVENFNISKVTISDVPEFVKGM VMVKKATALANGELGAIPADIAKAIVAACDEILTTGKCLDQFPSDVYQGG AGTSVNMNTNEVVANLALEKIGHQKGEYNVINPMDHVNASQSTNDAYPTG FRIAVYNSILKLMDKIQYLHDGFDNKAKEFANILKMGRTQLQDAVPMTVG QEFKAFAVLLEEEVRNLKHAADLLLEVNLGATAIGTGLNTPAGYSELAVK RLAEVTGLPCVKASNLIEATSDCGSYVMVHGALKRTAVKLSKICNDLRLL SSGPRAGLNEINLPELQAGSSIMPAKVNPVVPEVVNQVCFKVMGNDTTVT FAAEAGQLQLNVMEPVIGQAMFESIDILANACVNLRDKCIDGITVNKEIC ENYVLNSIGIVTYLNPFIGHHNGDIVGKICAQTGRSVRDVVLEKGLLTEA ELDDILSVENLMNPTYKAKLSK >MS0708 aspS, AspS protein MMRSHYCGGLNRENIGQEVTLSGWVHRRRDLGGLIFIDMRDREGIVQVCF DPKYQQALTRAASLRNEFCIQIKGEVIARPDNQINKNMATGEVEVLAKEL SVYNAADVLPLDFNQNNTEEQRLKYRYLDLRRPEMAQRLKTRAKITSFVR RFMDDNGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQ LLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFLTAPEVREIME KMIHGLWLNTINVDLGKFPVMTWTEAMQRFGSDKPDLRNPLEITDVADIV KDVDFKVFSGPANDPNGRVAVIRVPNGASVTRKQIDEYTQFVGIYGAKGL 
AWLKVNDVNAGLEGVQSPIAKFLTEEKIKAIFDRTSAQTGDILFFGADKW QTATDALGALRLKLGRDLALTQLDQWAPLWVIDFPMFERDEEGNLAAMHH PFTSPKDFSPEQLEADPTGAVANAYDMVINGYEVGGGSVRIFDPKMQQTV FRILGIDEQQQREKFGFLLDALKFGTPPHAGLAFGLDRLTMLLTGTDNIR DVIAFPKTTAAACLMTEAPSFANPQALEELSISVVKTDKE >MS2348 atpA, AtpA protein MQLNSTEISELIKKRIAQFDVVSEARNTGTIVSVSDGIIRIHGLSEVMQG EMIALPTGRFAMALNLERDSVGAVVMGPYTDLAEGMEVQCTGRILEVPVG RGLLGRVVNTLGQPIDGKGEIKNDGFSPVEVIAPGVIDRKSVDQPVQTGY KAVDSMVPIGRGQRELIIGDRQTGKTALAIDAIINQRDSGVKCIYVAVGQ KASTIANVVRKLEENGALANTIVVAASASESAALQYLAPYAGCAMGEYFR DRGEDALIVYDDLSKQAVAYRQISLLLRRPPGREAYPGDVFYLHSRLLER AARVNEEYVENFTKGEVKGKTGSLTALPIIETQAGDVSAFVPTNVISITD GQIFLESNLFNAGVRPAVNPGISVSRVGGAAQTKAVKKLAGGIRTALAQY RELAAFAQFASDLDDATRKQLSHGEKVTELLKQKQYAPLSVAEQAVILFA VEFGYLDDVELNKIADFETALLDYANRTNTEFMQELTKSGDYNDEIKNTL KGILDNFKANNTW >MS2352 atpB, AtpB protein MSGQTTSEYIGHHLQFLKTGDSFWNVHIDTLFFSVLAAIIFLAVFRSVAK KATSGVPGKLQCMVEILVEWINGIVKENFHGPRNVVAPLALTIFCWVFIM NAIDLIPVDFLPQLAGLFGIHYLRAVPTADISATLGMSLCVFALILFYTV KSKGFGGLVKEYTLHPFNHWSLIPVNFVLESVTLLAKPISLAFRLFGNMY AGELIFILIAVMYSANAAIAALGIPLHLAWAIFHILIVTLQAFIFMMLTV VYLSIAYNKAEH >MS2345 atpC, AtpC protein MTTFNLTIVSAENKIFEGAVKSVQATGIEGELGILAGHTPLLTAIKPGIV KFTYNDGIEEVIYVSGGFLEIQPNIVTVLADVAIRGSDLDQDRILAAKKK AEDNIVAKSGDLNHEMLTAKLSKELAKLRAYELTEKLVKNKR >MS2346 atpD, AtpD protein MSAGKIVQIIGAVIDVEFPENAVPKVYDALKVAEGGLTLEVQQQLGGGIV RCIAMGSSDGLKRGLSVSNTGKPISVPVGTKTLGRIMNVLGEPVDEQGPI GAEEEWAIHREAPSYEEQSNSTELLETGIKVIDLICPFAKGGKVGLFGGA GVGKTVNMMELIRNIAIEHSGFSVFAGVGERTREGNDFYHEMTDSNVLDK VSLVYGQMNEPPGNRLRVALTGLTMAEKFRDEGRDVLFFVDNIYRYTLAG TEVSALLGRMPSAVGYQPTLAEEMGVLQERITSTKTGSITSVQAVYVPAD DLTDPSPATTFAHLDSTVVLSRNIASLGIYPAVDPLDSTSRQLDPQVVGQ EHYDVARGVQGILQRYKELKDIIAILGMDELSEDDKLVVARARKIERFLS QPFFVAEVFTGSPGKYVSLKDTIRGFKGILEGEYDHIPEQAFYMVGSIEE VVEKAKNM >MS2351 atpE, AtpE protein MENIMESVITATIIGASILLAFAALGTAIGFAILGGKFLESSARQPELAS SLQTKMFIVAGLLDAIAMIAVGISLLFIFANPFIGLLQ >MS2350 atpF, AtpF protein MNLNATLIGQLIAFALFTWFCVKFVWPPIIKAIEERQSSIANALASAEKA 
KQDQADSQAAVEQEILAAKEEAQKIIDLANKRRNDILEEVKTEAENLKAT IIAQGHAEVEAERKRVQEELRVKVASLAIAGAEKIVGRTVDEAANNDIID KLVAEL >MS2347 atpG, AtpG protein MASGKEIKTKIASVQSTQKITKAMEMVATSKMRKTQDRMAASRPYSETIR NVISHVSKASIGYKHPFLVEREVKKVGMLVISTDRGMCGGLNINLFKTVL NEIKKWKEQGITVEVGVIGSKGIAFFRSLGLKIRAQHSGMGDNPSVEELL GIANDMFDAYKDGKIDALYLAHNQFINTMSQKPSFAQLVPLPELDTDNLG ERQQAWDYIYEPDPKMLLDSLLTRYLESQVYQSVVDNLASEQAARMVAMK AATDNAGNLINDLQLVYNKARQASITNELNEIVAGAAAI >MS2349 atpH, AtpH protein MQNYKKVELMSELTTIARPYAKAAFDFAVEQSATDKSAVEKWTEMLGFAA QVADNEQIRDFFANTFSVQKAADAMVSICGEQLDQYGQNLIRLMAENKRL TVLPAVFDEFQRYVEEHNATAEVQVISAQPLNATQEQKIAAAMEKRLARK VKLNCSVDNSLLAGVIIRTDDFVIDGSSRGQLNRLANELQ >MS1797 avtA, AvtA protein MELFPKSNKLEHVCYDIRGPVHKAALRLEEEGHKILKLNIGNPAPFGFEA PDEILIDVIRNLPTAQGYCDSKGLYSARKAIVQYYQSKGIHGATVNDVYI GNGASELITMAMQALLNDGDEVLVPMPDYPLWTAAVTLAGGKAVHYLCDE EQDWFPAIDDIKSKITSRTKAIVIINPNNPTGAVYSKELLLEIAEIARQN GLLIFSDEIYDKILYDGAVHHHIAGLAPDLLTITMNGLSKAYRICGFRQG WMILNGPKDKARGYIEGLDMIASMRLCANVPMQHAIQTALGGYQSINELI VPGGRLYEQRNRAYELLNQIPGVSCVKPMGALYMFPKIDIKKFNIYDDEK LVLDLLAQEKVLLVHGRGFNWHAPDHFRIVTLPYVHQIEEALNKFARFME NYHQ >MS1248 avtA, AvtA protein MRYDKMSPFIVMDIVREAAKYPNAIHFEIGQPDLAPSEKVKKALQSAVEN NKFSYTESLGLLALREKICQYYDRTYHVKITPNRVLLTPGTSGAFLIAYA LTLAQDDKLGLTDPSYPCYKNFAYMMDIQPEFMPVDKHNCYQLEVGQLKG RNIKALQISSPANPTGNIYTAESLKSLNDYCMENHIDFISDELYHGLVYD QNAATALQFNPRAYVINGFSKYYCMPGMRLGWIIVPEDKVREAEIIAQNI FISAPTLSQYAALEAFEEEFLTATKQVFQQRRDFLYDALKDLFTIEFKPQ GAFYLWADVSKYTDDSYQFAKKMLHEIQVAATPGIDFGENGTKHYLRFAY TRDIEHLREGVERMKQWLKNK >MS0764 azlC, AzlC protein MSEIVSKTPVRDAAKAAFPYSAPMIAGFIFLGIAYGLYMKQLGFGVLFPV FMALLIYAGSVEFIVAAALVAPFSPLNVFLICLMVSGRQIFYGISMLEKY GGHLGKKRWYLITSLVDEAFSLNYMAKIPSYIDKGWYMFFVSLYLQIYWV MGAGIGNLFGAMLPFDLKGIEFAMTALFIIIFAENWLKEKSHESSLLGLG ITLTSLIIVGKEQFLIPSLLGIWIMLTLSRPKLSSKLKRIE >MS0765 azlD, AzlD protein MTLTEQIITVGMGILGVHICRVLPFLIFPPNRPIPEYIRYLGKVLPAAMF GMLVIYCYKNVDIFSGFHGFPEFLAGLITLALHLWKKNMFLSMAVGTGLY MFLVQAVFVN >MS2286 baeS, BaeS protein MNAIITLIYSWFIMLCAYFSVWAISDYLLGNSLLALLFLPFALRLGINLH 
TPKIFWLVSYCAEFCILSLLMYASPNEYYLPAIILSIASLPVTFIGQKYY QGNEARKLAVQGVIAVFVSLLNGIVSFSLSISFFYTFLTSLTGMLLIMPA CFLGYDYLFKRKWIPLTASLVHKPISLRAKHILIYILLFLLNIFIQVDIP EEFHRFALFFLAIPIILLAYHYGWQGALLGTLLNSIALIASTGSFSRGEL TDLFLSISAQTVTGIFLGLAIQRQRDLNNSLMVELNRNRTLTRQLINTEE SIRKEISRELHDEIGQNITAIRTQASIMKRLETSPKIEKVGSMIEQLSLN IYDTTKGLLNRIRPKMLDDLELQQALQNLFLELDLENHGISTALFWENKQ NEPLDHIQEITLYRLCQEGLNNIVKYSHASQVIISVLIRKDIELIIQDNG DGFNPETVKSGFGLQGMKERVDILCGKFQLISKERSVSPQQHGTTIKITL PRL >MS1244 baeS, BaeS protein MHEQQRERNMRGRFARHLAPLRAPMSYDDDALAFAIFTRSGEMLVSDDAN GAKFQFAPDRGFTETKLSEGNESWRIFWLPSKDKNLIIAVGQEMDYRNNL INNFVLGQMWIWIASLPLLIGLIIFVIHHELRLLNRVSAEVRERSPEDNH LIDTADMPTEVLPLLQSLNGYFARTAETFRRERRFTSDAAHELRSPLAAL RIQTEVAQMLTDEPELQTQALDNLTKGIDRASQLIEQLLILSRLDNLSQL NELEPIYWEQLIPAVISEQYSHAQQRNIEIKFDRKALPQVKKGQPLLVSL MLRNLIDNCIKYCPEGSVIQVNLNEDTVVIEDNGYGVSDEDINKLGQRFY RPAGQNEKGSGLGLSIVHRIAELHHYEFIVENIKDQSGKCIGFRSIIKLN >MS2288 baeS, BaeS protein MKSSSKKGLLMKSLLSFKRLTKHSVTTFIAHYLSLIIILAGIITTFSFAI MGSNKSDAELINVAGSLRMQSYRLIYTMEYEPEKVDIGLRQYRISLHSNP LVTIHHHLLTPGDVKQSYSDLVRRWQEMENLARTNQQEQYRNQITSYVNQ LDQFVYSLQRFAEKKVIIAVLVIILSMLLIVGIASYLIWHTHQEIVKPLN QLMRASTQIEMRQFQHILLDTKRDDELGRLAKSFTHMSNELHKLYANLEE KVTEKTQKINQVNRSLAMLYYCSQELSASDLNRNKLLHVLKHVMATEHLR AFELDLIELRQWNITLGEPSVALSWQEQTIGSEDNKLGSLRWQAGLPCPD TRTMENIAQLLGRTLYFNQTQRQQQQLLLMEERSIIARELHDSLAQVLAF LQIQLTLLKHNLNKDDDKAKRQSLLIIKDFEQALSDGYIQLRELLATFRL TVQEANLKLALEQVVDSLRNQTDIQMTVDCSLPSQTFNPQQLIHALQIVR ESTLNAIKHSKADLIEVIAHTNEEGEQELIVRDNGVGIASLNEPDGHYGL NIMQERSQQLNAKLTISNRATGGTEVKITLPNTLA >MS1245 baeS, BaeS protein MNLLKMNSIRLRLIVILSFIALVIWGVTSALNWHYVRQEVNNMFDIQQYL LAKRLSSSYCNRFCMNSSASVI >MS1914 baeS, BaeS protein MIIEKLSNHLQSITTQIFAIFWFTFTLLLLLAFFIPSLDNRVYSALTSEQ LETYQKQIVTSIRTNQISRLLVAPAKFAIDSTAPIRPILMDSNKRIIGAL PDELPTVQQFVLQSANVSAPMVRNFNNIQLAGPFIVHLNTPENEPFLLYF VKTINPQKEGVNFIFDHPALIVFLIMLFSSPILWWLAWRIAHPLRRLQHF AGLVSKGDFTLHKELEESGVYELRQLSKSLNQMTESLDNLLSTRQALLYS ISHELRTPLTRLQLAVALIRRRQGESRELTRIETEAERLDQMINDLLQLS 
RNQLKSELERERFPITEIWQDVLEDTKFEAKQRNIHFKATCKIPDVAKYA INGNRSALASALENILRNALKYTNTSIEATFTLENNYLRIDIDDDGIGVP ESEYSKIFTPFYRLDTARTRSTGGTGLGLAIVLNIIKQHHGEIGANKSRL KGLCITMKLPLDK >MS1730 baeS, BaeS protein MKNVKFFAQRYIDWVTKLGRLKFSLLGFILIAILALCTHIFLSLMITGQI HWESLLYSVVFGVISAPFVIYFFALLVERLELSRQNLTNLVGELQQEIRE RTTAEQRLAQAIRDKTTLMATISHELRTPLNGIVGLSRILLDSKLTEEQY NYLKTINVSAVSLGHIFNDIIDLEKLDGSRIELYKKETDFHALITDVYNV AQLMAEQKHLKFILQVDKDLPNWLLLDYTRLSQVLWNLISNAVKFTDKGT VTLKISRLSENRYAFAISDTGPGIPENELNKIFTMYYQVKANFNKHKAAG SGIGLAISKSIARLMNGDLVVESEIGKGSTFILTIQADEVSKPVSDGTAD LDLSLSILLVEDIELNIIVAKSLLEKLGHQVDTAMTGQEALTKFERNNYD LVLLDIQLPDMTGFDIAKILRTKYEDGVYDYLPPLIALTANVMQNKSDYQ KQGMDDVLRKPLSLDSLNQCLSEYFGDEIGVSSAQNSVMTKAAELPDDFD YPLLDDLVEMLGASFVLKNLALFKQTMPEYIDELLTIYQNYQKDKEKKKD VAACMHKIKGAAASVGLKHIQLLAEKGQHDEADIWRENIKRWIDEIEQSW FEDVTKLEHWLAKK >MS0264 bcp, Bcp protein MNTLKIGDFAPHFSLSNQHNETVSLTDFQGKKVLVYFYPKALTPGCTTQA CGLRDAKAELDKSGVVILGISPDSPKKLAQFAEKKELNFTLLSDENHQVA EQFGVWGEKKFMGKIYDGIHRISFLIDEKGVIEQVFTKFKTGEHHQVVLD YLTQNS >MS0227 betT, BetT protein MFIEDNRPNNMEHLMSLYQQLRATSTLKAPIFMPTVIFVLLVTVFCSVFP EQAQITLNTVKQSIFTHFSWFYILAGSIFFLFLIFLCGSRLGDIRLGADN DEPEFSFTSWIAMLFAAGMGIGLMYFGVAEPVLHYVKPVQENLTEAERMK EAMMTTFYHWGIHAWAIYAVIALALAYFGFRYKLPLTVRSGFYPLLKNNL SGFWGHLIDIVALCSTIFGLTTTLGYGAMQVNAGFNNLGLIDSNSFVVLA VIMIVSMMLAVISAISGVGKGVKILSETNLVLGGLLLIFVIIAGPTLWLF SGLTENLGYYFSSLLELSFRTFAYEPEHQSWLSGWTILYWSWWASWAPFV GMFIAKISKGRTIREFILGVLFVPSLFNILWMTSFGGSAIWIDQQTHGAL AAISDNTEALLFGFFDQLPFGQIASVIALLVISIFFITSADSGIFVLNNM ASQGSGKAPKWQSVFWGALLAVLGLSLLYSGGLASLQSMTLIIALPFMAI MLVLCFGLWKGLMVDTQYSSKKFTQGSVLWTGENWKERLEKIVNPTDRKD IRRFLNQIARPAFNDLVKEFLEHGLNAQMNFIDGKNPKIEFEVVNENLRN FLYGIRLQSRQLSDLVVDDDNLPNLEESKIYEPITYFFDGREGYDVQYMT QEELIVDVLKQYERFMNLAMDKSHNLMTADVENMAE >MS2104 bfr, Bfr protein MNKMATEKQIEILKGMAKAWFGNSQQHSIHAEIIRQKGFSKLADKIQAEA EGEWKEAQRVNARLLELGVTPTLAINNYPIITDIREQLEYDYNEGLKGMA ELNAMIADFADDYITRRMIEEFIVDEQEHTNWLAEHIGLIEEIGYQNYLI QQL >MS0396 bglX, BglX protein MSTLLIDLKGQELLAEEAELLAHPLVAGLILFTRNFYDRSQIQALIKDIR 
RRVKKPLLITVDQEGGRVQRFREGFTQLPAMQSFAAMISDPALQLTTAKE AGWLMAAEMTALDIDLSFAPVLDLGHECKAIGDRSFCEEVEPAVRLASAF IDGMHQAGMATTGKHFPGHGHVLADSHLETPYDERPSAVIFERDIQPFQQ LIAQNKLDAVMPAHVIYRHCDSQPASGSKYWLQDILRQKLGFDGTVFSDD LGMKGAGFMGDFVARSEKALSAGCDLLLLCNEPEGVVQVLDNLKLEENPP HFAARQRRLQSLFKKKAFSWNELTKTRRWLENSKKLTALQQSWLDSK >MS1523 bioB, BioB protein MKTTLQLYSNTPHPQVEYWSVCKVEALFETPFLELVHRAALVHRENFNPQ AIQLSTLMSIKTGGCPEDCGYCPQSARYQTGVQKQELLNVEDIVEKAKIA KSRGASRFCMGAAWRGPKPKDIAKMTEIIKAVKALGLETCGTFGLLDDGM AEDLKEAGLDYYNHNLDTSPEHYNKVIGTRGFDDRLNTLGKVRKAGLKVC CGGIIGMNESRKERAGFIASLANLDPQPESVPINQLVKVEGTPLSDAQEL DWTEFVRTVAVARITMPKSYVRLSAGRQGMSEEMQAMCFMAGANSIFYGD KLLVTGNAEEDCDRLLMEKLDLEPETTENRYLSQNN >MS1006 bioD, BioD protein MSVFFVTGTDTSVGKTIVSRAIIQAMQNAGIQIVGYKPLACGQDDPVYTD VQESGQTDYDNMDNRDVLVLQDSTNEEVSYQDINSYTFAHTMPMLSQEGK HIDINKINTDLKRLSSRYQSVLVEGSFGWLTPINQDYTFASWAAEHKMPV VLVVGIKEGCMNHALLTVQSIEQMGLPLLGWVANRINPMLGHYAEIIEDL SKRIKAPLLGKIPYMHKPETQELGHYITDIDRLSYMKTEILK >MS0775 birA, BirA protein MFYVDVKKYCGRIIQGWQSFGNGFFKKNFYFFELYDKKTSYIFSGKFMSS LLEILADGQPKTFKKLTALLSLSQAQLLDETERLQTLGIQIKASPQTLQL IPQLDLLDGARLSKALFPHRVVIQPVIDSTNQYILNHLAELKKGDLCLSE HQTAGRGRRGRQWLSPFAGQLILSIYWTLNARKPLDGLSLVIGMAIADAI KSAGGKEINLKWPNDLLLNGRKLAGILIEIANRQQDQLNLVIGIGINLSL PKLKAQIDQPWAELCEILPQLDRNELLIRVVKHLYLYLAAFEREGINAVF REKWAETDYYFNKEVNIITEKQTITGINQGIDENGYILIKTKNGELLKFN GGEVSLRKPA >MS0892 bisC, BisC protein MQVTRRKFFKICAGGMAGTSVAALGLMPTAALAAPREYKLLRAKETRQSC TYCAVGCGMLMYSIGDGAMNSRGKLTHVEGDPDHPVSRGALCPKGAGVLD FVNSPNRIQYPEYRAPGSDKWERISWHDAIHKIAKLLKDDRDANWESANE EGTPVNRWLTTGFLTASAASNETALISQKWARAFGLLVLDNQAST >MS1030 bisC, BisC protein MFATHLLGVIMQVSRRKFFKICAGGMASSSAAMLGFMPTQALAAPREYKL IHAKVARNNCTYCAVGCGMLMYSLGDGAKNARGKLFHVEGDPDHPVSRGS LCPKGAGVLDFVNSPNRLKYPEYRAPGSDKWVRLSWEDAIHRIAKLMKED RDAYFEEKNAQDTTVNRWLTTTMFCSSATSNETGILTHRWARSLGMVTIN NQAATCHGPTVPALAATFGRGAMTNHWVDIKNANLVIVMGANTAEAHPVG FKWVIEAKKNGAKLMVVDPRFNRTAAVSDFFAQIRAGSDIAFLLGVIRYL LEHDAIQHEYVKHYTNAALIVADDFEFNDGLFSGFDESTAQYDRTSWAYA 
TDESGQPLRDLTMQHPHCVLNLLKKHVERYTAETVENITGVKQATFNQFC ETLAETASPNKTATFLYALGWTQHTVGAQNIRAMAMIQLLLGNIGMAGGG VNALRGHSNVQGASDMGLTPVGLPGYLQLPNEKDVSLEKYLERVTPKTLV QGQTNFLQNTPKFVVSLLKSFYGDNATAENEWGFHYLPKYDQVYDQLKMI EMMNEGQINGFLCQGFNPVSSLPNKNKVVSALSKLKYCVVFDPTETTTSN FWQNHGEYNDVNPAEIQTEVFRLPTVCFAEEDGSIANSGRWLQWHYKAAE PPAEAKPDVDILAEIREAILEMYEKEGGRGLEPLKATAWDYVNPLEPKAE ELAKQNNGYALADLYDTAGNLIAKKGELLSNFGQLRDDGTTACSAWIYTG QWTEKGNQMDNRDNSDPSGLGNTLGWAFAWPANRRIVYNRASADLTGKPW DPKRQLVKWNGKNWNYIDIADFGTAPPNSEIMPFIMQNDGLGGLFCLNRL ADGPFPEHYEPMETPIGTNPLHPNVISSPVARVMENDKPNFGTSNEFPYV GTTYSLTEHFHAWTAQVQLSMITQPEAFAEISEELAQEKGIKQGDVVKVH SKRGYIKMKAVVTKRIKPLTVNGQTVHTIGFPIHWGFSGVGKKTFVTNTL TPPVGEVNSLTPEYKAFLVNIEKTTEAL >MS2281 bisC, BisC protein MGNIMELNRRDFMKANAAVAAAAAAGITIPVKNVHAADDDMGIRWDKAPC RYCGTGCSVLVGTKDGRVVATQGDPDAEVNRGLNCIKGYFLSKIMYGADR VQTPLLRMKDGKFHKEGDFTPVSWDQAFTVMADKIKAILKEKKDPNAIGM FSSGQTTIFEGYAKVKLWKAGLRSNTIDPNARHCMASAAVAFLRTFGMDE PMGCYNDIEKTDAFVLWGSNMAEMHPILWSRISDRRLSSDKVKVVVMSTF EHRSFELADTPIIFKPHSDLAILNYIANYIIQNDKVNWDFVNKHTKFKRG ETDIGYGLRPEHPLEVAAKNRKTAGKMYDSDFEEFKKIVAPYTLEEAHRI SGVPKDQLETLAKMYADPQQNLVSFWTMGFNQHTRGVWVNHMVYNVHLLT GKISKPGCGPFSLTGQPSACGTAREVGTFVHRLPADMVVTNPKHVEIAEN IWKLPKGTISNKPGFPAVQQSRALKDGKLNFLWQLCTNNMQGGPNINEEI FPGWRNPDNLIVVSDPYPSASAVAADLILPTCMWVEKEGAYGNAERRTQF WRQQVKGPGQSRSDLWQIVEFSKYFKTEEVWSEELLAQMPEYRGKTLYEV LYLNGEVNKFQTPTNVPGYINDEAEDFGFYLQKGLFEEYASFGRGHGHDL ADFDTYHQVRGLRWPVVDGKETLWRYREGYDPYVKAGEEVSFYGYPDKKA IILGVPYEAPAESPDEEYDLWLCTGRVLEHWHTGTMTRRVPELHKAFPNN LCWMHPTDAKKRGLRHGDKVKLITRRGEMISHLDTRGRNKCPEGLIYTTF FDAGQLANKLTLDATDPISGETDYKKCAVKVVKA >MS0588 bisC, BisC protein MKKVNNSRRNFLKSSSLGFAGASMATATTGGITGLLSVTANAAETNSKTV VTAAHWGPLGVVVENGKVVKSGPAIAAPIENELQSVVADQLYSEARVKYP MVRKGYLDGNQDRSLRGHDTWVRISWEQAFDLVAKEMKRVRETYGASGIF AGSYGWYSSGALHAARTLLHRYMNITGGFVGTKGDYSTGAAQVIMPHVLG TIEVYEQQTSWEVILESSDTIVLWGANPLATMRIAWTSTDQKGLEYFKKF KETGKRIICIDPVRSESCEYLGAEWIPINTGTDVPLMLGIAHTLVNENKH DKEFLKNYTTGYDKFEEYLLGKIDNQPKTAEWAEKICGVPAQTIKQLAAD 
FSAKRTMLMGGWGMQRQRHGEQSHWMMVTLASMLGQIGLPGGGFGLSYHY SNGGVPTARGGILGSITANPSTQAGAKTWLDDVSKFSFPLARISDALLNP GKTIQYNGTEVTYPDIKLIYWAGGNPFVHHQDTNTMVKAWQKPETIIVNE VNWTPTARMADIVLPATTSYERNDLTMSGDYSMMNIFPMKQVVEPQFEAK SDYDIFAELAKRAGVEEQFTEGKTEMQWLKGFYETAFNAARANRVLMPKF DDFWNENKPITFNPTDSAKKWVRYAEFREDPLLNPLGTPSGKIEIYSNTI AKMNYDDCKGYPSWMEPEEFAGNVTAEEPLALVTPHPYYRLHSQLAHTSL REKYAVKDREPVLIHKDDAAARGIANGDIVRVFNKRGQVLTGAVVTDGVI KGTVAIHEGAWYDPLDLGQTERPLCKNGCVNVLTRDEGTSKLAQGNSPNT CIVQVEKYTGEVPEVTVFKQPKIA >MS2337 bisC, BisC protein MLYGKFQRISWEEALDTIADNLKRIVKDYGNEAVYNNYATGIVGY >MS0891 bisC, BisC protein MTNHWVDIKNANVVVVMGGNAAEAHPVGFRWAIEAKKQNGAKLMVVDPRF NRTAAVADFYSPIRSGTDITLLSGVIKYLLDNNAIQHEYVKHYTNASFLI NEGYGFEEGLFTGYDEAERSYDKSTWSYQLDENGQPKRDLTMQDPRCVIN LLKKHVERYTPEMVERVCGTKQKAFLEFAETIASTAVPNRTMTILYALGW THHSVGAQNIRAMAMIQLLLGNIGMAGGGINALRGHSNVQGTTDMGLFPS MLPGYIPLPTETDTSLESFLNRITPKTAAEGQTNYWQNTPKFVVSMLKTF YGENATKDNEWGFHNLPKQYKKKMDHLQYIDLMDQGKITGYLCQGYNPLA SYPDKNKISSALRKLKFLVVMDPLKTDTSEFWQNHGEYNDVNPAEIQTEV FRLPTVCFAEEDGSIANSGRWLQWHYKAAEPPKEAKPDVDILSEIREVML EMYEKEGGPSIDTIKAMTWNYQNPLEPKAHEIAKESNGYALEDLYDANGN LIAKKGELLSSFAQLRDDGTTSAANWIYSGQWTPKGNQMDNRDNSDPSGL GNTLGWAFAWPANRRVLYNRASADLAGKPWDPKRPLIKWNGKNWNYIDIA DFGTAPPNSNVMPFIMNNEGISRLFALDKMVDGPFPEHYEPIESPIGTNP LHPNVISNPVARILDNDKASFGNASEFPYVGTTYRLTEHFHWWTKNADLN MIAQPQPFVEISEDLANEKGIAQGDVVKVTSKRGYIKAKAVVTKRIKSLD VDGKKVHTIGIPLHGGFIADGRKSFLPNALTGRVGDANTQTPEFKTFLVN IEKTTEAL >MS1708 bolA, BolA protein MEITMETQEIERILKQALNLDEVYVQGENAHYGVIVVSEEIAKLSRLKQQ QTIYAPLMDHFSSGEIHALTIKTFSPEKWKLERMLNVVN >MS0311 bolA, BolA protein MSKQQELTERLTRQFSPLFLQIENESHMHSSDRGGESHFKVVIVTDEFEG KPKVVRHRMIYQFLAQDLENGIHALALHTYTPKEWQSLGKIIPKSTNCLG AG >MS0488 brnQ, BrnQ protein MNKNTFIVGFTLFAIFFGAGNLIFPPKLGLESGSEFWSAITGFILSGVGL PLLGIIVSAFYEGGYKTATTKISPWFSVIFLMAVYLSIGPFFAIPRTAAT SYEMAILPFIGKSSSLSMLIFTLFYFAISLWFALNPSKTVSRIGAILTPI LLFAILALVVKAFFILIDNDPSEVIFTLRESNNSFLFTGIIDGYLTMDTL ASIAYSVIVIAAIQSKGIKHGKELTKQTLLAGIVAAIALAAIYLAIGWIG NRVHISAETISLLQERNQDIGTYILNKITAQAFGNFGRSLLGVIVSLACL 
TTAIGLIVSVSEYFNEIYHKISYKTYVIIFTLIGFIIANQGLSAVISKSV PILLVLYPISMTIILLLSVNIFVKVPLVAQRLSIALTTLVSIGSVAGLEQ ANNLPLKDYSMEWIPFAVTGALLGCLIHVFYKSES >MS0793 btuC, BtuC protein MYSAVLFSYICYLSGIGMFSYKNKIHSKLIRQLGILGMLSLVAVVAYLFY RLPNRWEYALYHRSLSLVAIVVTGAAIALATMIFQTIVNNRILTPSILGL DSLYLLIQTLIIFLFGSKTLLGINQTLLFFLSTGAMVMFALGLYHFLFKR ERQNIFFLLLVGIIFGTFFQSLTTFMEVLIDPNEFQIAQDIGFASFNRIN LDILWIALGILLVVIFYTCRYLRYFDVLALGRDQAINLGVDYQAVTRRLL IIVAILTSVSTALVGPLTFLGLLVMNVTFEFIRGYQHKILIPAAMLISVM TLVMGQVLVSQVFTFNTTLSIIINFTGGVYFIYLLLRANKKWQ >MS1013 btuC, BtuC protein MDTQRIDIKSVPASIRLFNNKMNKLNMLLGILLTLLITLAATHRLGDFSA LLNPDGVLTDMRSLVLWEIRLPRILLALLTGAGLALAGNAMQGIFQNPLA SPGLLGSANGATTASVFILYYFSAPFTILLCGGVLGALLSFLLVYLMAKN RGSTMMILSGVAINMLLGSLIALLLSNAESPWALAELYRWLQGSLVWAKT DTLLMCLPIVLAGLFCLYSQRRYLDLLTFGEETAATMGVDPKRSFFITTL GVALLVGATIPQTGTIGFIGLIAPHLARMMLKKRPSQLYLTSALFGALLL LIADLCILYIPLFSHIYIGTLTAIIGAPFLIWILLAQQKMLAK >MS1203 btuC, BtuC protein MVKKLNIALFALAVLFFTWLSVVQLNNNDALAYLLFANYTLPRVFMAILA GCALGIASSLLQQVINNPLASDNTLAVSSGAQFSLFLVAIFVPNWLGAGS MFIALIGALVALALVFLLAWRRTISPLLMILAGLVVNLYLASFSAVLMLF YPEESRGLLLWGAGSLVQESWYDSLQLLWQFTIALILIFIFAKPLQILTL NDNNAKSLGVPVNLIRFLGLVISAFLVAIVVSRVGMLGFVGLAASSIVRQ FSTTNLLKRLILSAYMAAMLLLLTDLTLQLFAYYRQIELPTGAVTALLGT PLLLWLMFNISNNGRLVSQDESLSLGKQPVKAAGVIISLLLLLSILCALF IGKNASGWYWDSVMLTLRYPRLLVAMAAGIMLAVAGTLLQRLSHNPMASP ELLGITSGTAFGILTVIFFVATPTRGQFWFAGILGGFLVLLFIMLINQRN QLLPEKILLTGISIAALFDALQRIVLAGGDYKWQQLLAWTSGSTYHATPQ LATGFLSIAVLLFLLALPLDRWLALLALQTPVAQALGLDITKVRWILIIF SAFLTALSTLLVGPLSFIGLLAPHLAHFCGWHKPKAQLIGAVLLGTLVMT IADWLGRQLLFPYEIPAGLVATLIGGAYFLFMMRKI >MS0792 btuC, BtuC protein MIHRRYLILLLLLLSIISLFLGVSSVNLKGLLYFNSEQWQILLISRVPRL ISILIAGSALSICGLVMQQLSRNRFVSPTTAGTMDSARLGILISMLVFPT ASMLFKTVIAIVVSFLGTLLFMTILSRLKFKDSIFVPLVGIMFGNIISSV TAFIAYQQDILQNLSGWLQGDFSLIMSGRYEILYFSIPALITAYLFANRF SIVGMGQDFAVNLGLNYQQVLYLGLAIVATVSSIIIVSVGVIPFLGLIIP NLVTLYLGDNLKKILSHTALLGAVFVLFCDIFGRIVIYPYEIAINAVVGV FGSAIFLYLLFKRYRHV >MS1113 cDA1, CDA1 protein MGLFMFKHCMRLVTYLSTSLLFAVQALAANNHFGILCYHNIIDESVQSEK 
YYPQTISAQKLISQFNWLRTNGYIPVSMQQILDARNGGKALPEKSVLLTF DDGYQSFYTVIYPLLKAYNYPAVYAIVTDWIETPANKKVTYGDEKLDRKE FVTWQQLREMKDSGLVEIASHTHDLHHGVKANPAGSNVPAVITPAYINGK YETESQYEARLRKDFQRSFSLLKQHLGAAPAAMIWPYGRFNEKAAAIAEE AGFKVHMSLVDTINNTPDQFHLGRLLLDNETSINTIENYLKNKNKDVLVQ RSLRIKLDDVYDPNPAQQSKNLDALIERIYRQDIERVYIQAFSDTDNDGV ADALYFYNQQLPVRADLFSRVVWIIKTRLGKAVYAWMPISAFKGKNNTQQ IKSIYRDLALYSKINGILFDDNLSSDNKFTDLKPLDAASLRLTDELKDIV YPYPLGGREDFATMRMISAPVNMSDESEKQFNQNLAELNRHYDAVIVSAA PYVKGSELTQSGARNWLGNIIKKTVPQVAKDRLAFELQTVDWRTQQAITD DELIDWMRDIQTKYHFYNFGYYPDNFQENQPKLNEIRPHFSINTNLGLK >MS0939 cDC9, CDC9 protein MMLLENYKNQDITGWVMSEKLDGVRGYWDGKQLISRQGGVLAAPDYFLEN FPPFPIDGELFSQRDQFAEISSITRSQQDKGWHKLKLYVFDVPEAPGDLF TRLATLKNYLKTNRTSYIEIIEQIPIRDKNHVRQFLQQVETQKGEGVVLR NPNAPYENKRSTQILKLKSHLDEECTVIAHHKGKGQFANALGALTCKNQR GKFRIGSGFTLEDRVNPPAVGSVITYKYRGLTKTGKPRFATYWRKREDLQ ETP >MS0778 cafA, CafA protein MNSVELLVNVTPNETRIALVDTGILKEVHIERQAKRGIVGNIYKGRVTRV LPGMQSAFVDIGLEKAAFLHASDIVSHTECVDESEQKQFIVKDIAELVRE GQDIVVQVVKDPLGTKGARLTTDITLPSRYLVFMPENSHVGVSQRIESED ERARLKALVEPYCDELGGFIIRTAAEGATEDELKQDADFLKRLWRKVMER KAKYPTRSMLYGELALALRVLRDFVGAGIEKIRIDSKLCFTEVNEFCEEF MPELVDKLVLYSGNQPLFDVYGVETAIQIALDKRVNLKSGGYLIIEQTEA MTTIDINTGAFVGHRNLEETIFNTNIEATQAIAQQLQLRNLGGIIIIDFI DMQTDEHRNRVIQSLEEALSKDRVKTNVNGFTQLGLVEMTRKRTRESLEH VLCGECPACQGRGHVKTVETVCYEIMREIIRVHHLFSSEQFVVYASRAVS EYLINEESHGLIAELEVFIGKQVQVKTEVYYNQDQFDVVVM >MS1622 cafA, CafA protein MKRMLINATQQEELRVALVDGQRLFDLDIESPGHEQKKANIYKGKITRVE PSLEAAFVDYGAERHGFLPLKEIAREYFPDDYVYQGRPNIKDIIKEGQEV IVQVNKEERGNKGAALTTFVSLAGSYLVIMPNNPRAGGISRRIEGDERLE LKDALSSLDVPEGVGLIVRTAGVGKSSEELQWDLKVLLHHWEAIKQASQS RPAPFLIHQESDVIVRAIRDYLRRDIGEILIDSPKVYEKAKAHIKLVRPD FISRIKLYQGEVPLFSHYQIESQIESAFQREVRLPSGGSIVIDVTEALTA IDINSARSTRGGDIEETALNTNLEAADEIARQLRLRDLGGLIVIDFIDMT PVRHQREVENRMREAVRQDRARIQISRISRFGLLEMSRQRISPSLGESSH HICPRCQGTGKVRDNESLSLSILRLLEEEALKENTKQVHTVVPVNIASYL LNEKRKAIHDIEKRHNVEILVVPNKEMETPHFSVFRVRDGEEINTLSYNL VKVYEEQETTFIADEPFTTRVTETAAVTTENVLESAALSMTISEPAPTIE 
VKKEEQPSLFVRIVAAIKGLFASEPKVEEKVEEAPQPNTRNRRNNQEHRN SRRNRNERNNRGNNEEPADKVQSEKSAEKAERPARSERTRRNNRNRNAAA DDSLNNESVIEAVTNETTDDEAKAPQQRRQRRDLRKRVRVTEEQVTVAAE PVDKPSLAESVPVEPVVEENVYQERDRQDNRRRLPRHLRVNNQRRRRNAE QVSAMPLFAAVASPELASGKVWIESPTAPAKPKESAFLSVDELLEQQSEV KQPGVTTPAPATQVIFDKADNDIAPLASFVTQPANESVQKKVQESLDRLE QTNGQQTEVKEDDNASNMTLSDVTATESAVEKTEVLNLSNYRFSGRLGTI SAVKHTKAEMTLAKAADEVLPPFEIVQWQDSRYYFHGKGSAGHNSAVSHV FTAATQAKAE >MS1805 cah, Cah protein MMKKLLSTAAALLVSGFILSGCSTTEKHWGYTGDVSPEYWGGLSDKFKTC AVGQKQSPVNIQVQKATDKDLPALNINYLASKATVVNNGHSIQTDLTDEN STLTINGKVYTLKQFHFHSPSENTIDGQYLPLEGHFVHVAKDGGIVVVAV LYEIGGENAQLADIWAGMPEKAGEKVKLKAKFNPATLISSKQSYYSFEGS LTTPPCTEGVDWIVLKAYCSGQLI >MS0864 caiC, CaiC protein MISMLMFPWLNYANSAQYQNKTALRDDLQGEVFTWPQLAQRIEQTRLSLQ RQGLSMGQGIALCGKNSLDLLCFYLAGLQLGLRVLGINPAFPVEKINRLC ELNDISLRIDFSSSQYHCRRLKNSAKDDRTFTLTEGYTMTLTSGSTGLPK AAVHSVNAHLANAVGVSELMRFGANDSWLLSLPLYHVSGQGILWRWLQQG GELVLPQADFYASVIGVSHVSLVPTQLQRLLSYLAKHPNKFVCTKHILLG GSQIPLELTRQANRLGIQCYSGYGMTEMGSTVFAKESDETAGVGLPLKGR EYRLVDDEIWLKGAGLAEGYWIDQKIRTLTNKQGWFQTKDKGQWLNNELV LLGRLDNMFISGGENIQPEEIENIIQGYELVNQVFILPRDDAEFGQRPVA MIQFNLDADTENNFKSAVEKLKIWLSDKIERFKQPVAYFPLDVEKARQEG TIKISRNLLKTELMTLLGK >MS1358 caiC, CaiC protein MEKGWFKNYPEGSPREIDTSEYHSILDMFDKAVREHPDRPAYINMGKVLT FRKLEERSRAFAAYLQNELKLTRGERVALMMPNLLQYPIALFGVLRAGLV VVNVNPLYTPRELEHQLQDSGAKAIVVVSNFASTVEQVVFNTDVKHVILT RMGDQLSFGKRTLVNFIVKYVKKLVPKYKLPHAVTFREVLSVGKHRQFVR PDLARDALAFLQYTGGTTGIAKGAMLSHGNIITNVFQAKWIAESFIGDRR RERIAIIPLPLYHVFALSVNALLFVELGITAVLITNPRDVDGMVKELRKY PFTAITGVNTLFNALLNNENFKEVDFSSLKLSVGGGMAVQQSVAQRWHDL TGNNIIEGYGMTECSPLIAASTILTDKHDGSIGVPVPNTDIRIMRDDGDE AELGEPGELWVKGEQVMQGYWQRPEATAEVLKDGWMATGDIVVMDKNYIM RIVDRKKDMILVSGFNVYPNEIEDVVMLNPKVLEVVAIGVPHEVSGETIK IFVVKKDESLTRDELRAHCRNLLTGYKVPKEIEFRDELPKTNVGKILRRV LRDEELAKRNAQ >MS2237 carA, CarA protein MSEPAILVLADGSIFRGTSIGAAGHTIGEVVFNTSMTGYQEILTDPSYFK QIVTLTYPHIGNTGTNSEDLESNGVYAAGLIIRDLPMIHSNFRANQSLSD YLKDNNVVAIADIDTRRLTRLLRDKGAMAGCIMSGEVDEQKALELALSFG 
SMAGKDLAQEVTAQQSYRWTQGEWVLGKGYAEQQNASFNVVAYDFGVKHN ILRMLAERGCKLTVVPAKTSAEEVLALNPDGIFLSNGPGDPEPCDYAISA IQTLLATKKPIFGICLGHQLLGLASGGKTKKMAFGHHGANHPVQDLDTQK VMITSQNHGFEVDEHSLPANVRVTHRSLFDNSVQGIELTDQPAFSFQGHP EASPGPHDVAYLFDKFIDAMKQAKA >MS2236 carB, CarB protein MAMSTKPSGASKFVYKTANNFLKVLSRENNMPKRNDINTILIIGAGPIVI GQACEFDYSGAQACKALREEGYKVVLVNSNPATIMTDPNMADVTYIEPIH WQTVEKIIEKERPDAILPTMGGQTALNCALDLSKNGVLKKYGVELIGATE DAIDKAEDRGRFKEAMAKIGLNTPKSFVCHSFDEAWKAQEEVGFPTLIRP SFTMGGSGGGIAYNRDEFQAICERGFEASPTHELLIEQSVLGWKEYEMEV VRDKADNCIIVCSIENFDPMGVHTGDSITVAPAQTLTDKEYQIMRNASLA VLREIGVDTGGSNVQFAINPENGEMIVIEMNPRVSRSSALASKATGFPIA KVAAKLAVGYTLNELRNDITGGLIPASFEPSIDYVVTKVPRFAFEKFPKA DDRLTTQMKSVGEVMAMGRTFQESIQKALRGLETGICGFNLKTEDMEKLR HEISNPGPERLLYVADAFGIGWSIEDVHHYSKIDPWFLIQIQDLVLEELA LEKKTLADLNKDEIYRLKRKGFSDKRIAQLVKSDETSVRSLRNAFNIHPV YKRVDTCAGEFKSDTAYLYSTYEEECEAAPSDRKKVMILGGGPNRIGQGI EFDYCCVHAALALRESGFETIMVNCNPETVSTDFDTSDRLYFEPLTLEDV LEIIHVEKPWGVIVHYGGQTPLKLANALHANGVNIIGTSADSIDAAEDRE RFQKILHDLNLKQPANRTARNTQEAVGLANEVGYPLVVRPSYVLGGRAMQ IVYNDEELNRYMREAVSVSNDSPILLDHFLNNAIEVDVDCICDGEQVIIG GIMQHIEQAGIHSGDSACSLPPYSLSMEIQDEIRRQTAAMARALNVVGLM NVQFAVQNDVIYVLEVNPRASRTVPFVSKATGQPLAKIAARVMAGISLKE QGIQGEVVPQDFYAVKEAVFPFIKFPGVDTILGPEMRSTGEVMGVGATFA EAFLKAQIGAGERIPRTGKVFVSVDNNDKPRLLPIVKRLQEQGYGLCATF GTAKFLRENGIAVQTVNKVREGRPHIVDAIKNDEIALIINTAGGMAESVA DSASIRASALKQRVPLYTTIAGADAISLSVANLDIHDVYSVQGLHAGLTK >MS1491 carB, CarB protein MNILVTSAGQRVSLVQAFKKELSQLVSDGKVLTVDLNPELAPACYVADGH FQVPRVTDAGYIPTLLKICEENNVKLIIPTIDTELLILSEHLQRFKEKGI FISVSDTEFVRKCRDKRLTNQLFIEHNIAVPKQFEKGQFEYPVFVKPYNG SLSKGIFVAEKPEDISPEQLENPELMFMQYISPAEYDEYTVDCYFDKNSE LKSAVPRKRIFVRAGEINKGVTRKNAIVTQLSEKLSRLPGARGCLTIQVF YKESTAEILGIEINPRFGGGYPLSYLAGANYPRWLIQEYLFNQPIPAFDD WEADLLMLRYDAEVLAHHYEK >MS1302 ccmA, CcmA protein MADTLLAVQVDKLKHSYGKTTALCDLSLQIPRGKIIGLIGPDGVGKSTLL SLIAGVKIIQSGSVTVFGLNVAEKKARDLLSHKIAFMPQGLGKNLYLTLS IYENIDFHARLFGLPKAHRKARIERLLNATGLAPFADRAAGKLSGGMKQK LSLCCALVHSPDLLILDEPTTGVDPLSRRQFWQLVEDLRRETPGMTVIVA 
TAYIDEAEGFEQVIAMDDGKLIAYKPTKQLIAETESENLEQAYVKLLPAD KRGSGKGLTIPPFEVDANEPPVIVAKGLTKRFGDFTSVDNVSFTIPKGEI FGFLGSNGCGKSTTMKMLTGLLDPSEGTATLLGQPIDASNIDTRKRVGYM SQAFSLYEELTVRENLELHAKLFQIPPAQWNTYVHSAMEQFDLADLADEK PSSLPLGIRQRLQLAAACLHKPELLILDEPTSGVDPAARDMFWEYLIKLS REDRITIFVTTHFMNEAARCDRISFMHRGRVLAVGTPEELRTGKNAATLE EAFIIYLEEQADDITAPSNETGQSAVKNDEVLPPAEGLWAWWSLIWTFAV REGKELLRDNIRLFFALLGPIIMLIAMASSISFDINPMKFAVLDHDNSSA SRHLVEYFSGSRYFIRQADLHSVDEINSNIQSAKVKMVLEIPTDYGKKLL NWQQPEIGVFIDGAFPSTAENLNGSVIGVLTQYQREISKHIDMSVSSTVL LEPRFVYNQDFKSIFAMTPGIIMLAMILVPSMMTALGVVREKEMGSIMNL YGSPASPLQFLLGKQIPYIILAFVSYLAAVCVAIIVFKVPIKGSVLAMFF GVILALLATTAFGLFVSAFVKTQIAAIFATAIISMIPALNFSGMIYPVTT LPDTIYTAARTFPGYWLQLVSLGGFTKGLNFTDFFDCYLALSTIFAVYIT LATLLLKKQEV >MS0601 ccmA, CcmA protein MQSVNQLKIDRLACQRGDKILFTDLSFNLQSGDFVQIEGHNGIGKTSLLR ILAGLAQPLSGKVRWNSEEISKCREEYYYDLLYLGHHAGIKPELTAWENL KFYQQAGHCRQGDEILWNVLEKVGLLGREDIVASQLSAGQQKRIALARLW ISQAPLWILDEPFNAIDKNGVKVLTGLFEQQAEKGGIVILTSHQEVPSSA LTVLNLAQYKFTDNE >MS1066 ccmA, CcmA protein MQYSRIGYIFKLFMRIYMYALEIKGLTKQYKNGFKALHGIDLCVKEGDFY ALLGHNGAGKSTTIGIISSLINKTSGQVKVFGYDLDSQLGLLKQQIGLVP QEFNFNQFEKVLDILANQAGYYGIERSEAEKRAEVWLKKLDLWDKRNQQA MRLSGGMKRRLMIARALMHKPRLLILDEPTAGVDIELRRTMWTFLRELNE QGTTIILTTHYLEEAEMLCRHIGIIQQGRLVVDMPMKDLLAKLETETFIF DFAPNSPKPIIRDYRLKQIDVDSIEVEMPREKGLNHLFEQLSNQGIQVLS MRNKANRLEELFVSMSLNKPTDEVK >MS0602 ccmB, CcmB protein MIFFEIIKRELRIAMRKQAEILNPLWFFLIVITLFPLVIGPDPVLLSKIA PGIAWVAALLSALLSFERLFRDDFIDGSLEQLMLTAQPLALTALAKVIAH WILTGLPLILLSPVAALLLSLDVRIWWALVLTLLIGTPILSCIGAIGVAL TVGLRKGGVLLSLLVVPLFIPVLIFSASVLDAATLNLSYAGPLAILGAIL AASATLAPFAIAAALRISLDQ >MS0518 ccmC, CcmC protein MVTGLRIMSFALFSALFYIISILFIAPMLAKAQSGEQIQRPNKNWFILTA LFAVICHFISLFPFFSNLFSGENFTLMEIGSLISVLIAILATVAIALKIK TFWFLLPIIYCFATINVTLAAFAPSHVIQNLAQDLGLLLHILLAMFAYAV CFIAMLQSIQLAWLDRKLKTKQMVISPLLPPLMMVERHFFRVMLSGEILL TLTLLTGAVYLADFFGNENIQKAIFSFLAWIVYAVLLIGHWKYRWRGKKM IIYTISGMILLTIAYFGSRAMLGMN >MS0603 ccmC, CcmC protein MWKWLHPYAKPETQYKLCGKFIPFFAVIALLLLSVACIWGLAFAPADYQQ 
GNSFRIMYVHVPSAIWSMGVYGSMAVAALIGLVWQIKQAHLSVIAMAPIG AALNFLALVTGAVWGKPMWGAWWVWDARLTASLILFFLYLGVMALYSAFQ DRNTGMKAAAILCVVGVINLPVIHFSVEWWNTLHQGASITKFEKPSIATP MLIPLILSIFGFMALSIWLTLVRYRVELLKEDRKRPWVKALIK >MS0604 ccmD, CcmD protein MFFESWSDFFYMGGYGFYVWLSYGITFITLLILAIQSYRGKKIVFREIQR EQQREQRLQATKSRGTL >MS0605 ccmE, CcmE protein MNPRRKSRLTIILFVLLGVTIASSLVLYALRQNIDLFYTPSEVISGKNDD PDTIPEVGQRIRVGGMVVEGTVKRDPNSLKVSFNVNDIGPEITVEYEGIL PDLFREGQGIVAQGVLKEPKLLEATEVLAKHDENYVPPDLSEKMEQVHKP MGISNQDMQGESDRDRLDKAVNSVEEGKK >MS1815 ccmF, CcmF protein MIPELGFIALLIALLSSFLLTLIPLVGMIKRNTNLLSYAWNFSYLFAIFS TISIACLAYSFSVNDFSVEYVAAHSNSQLPLFFKIAATWGGHEGSMLFWL FSLSLWTAAFAFFSRKIDPVFSARTLSILGFICLSFAIFILFFSNPFIRQ FPLPPEGRDLNPMLQDIGLIIHPPLLYLGYVGFAVNFAMTLSVMLSGHVD AAIARWTRIWVLLSWFFLTLGIMLGAWWAYYELGWGGWWFWDPVENASLM PWLIGLALLHSLIVTEQRGIFSYWTILFSLFAFAFSLLGTFIVRSGVLTS VHAFAVDGERGTALLLIFFLLTALALTVFALKVNLRQSAVRFSVFSKESF LLLANVVLTIATVSVFLGTFYPMVFSAMGWGSISVGAPYFNSIFVPLLLI MLIAMVFVLATKWQKMNRTLLRQKSILLIPALLIAYLIIHFTVRQDESLR FHFSAFVLLSLAIWLLLATLWINWRKIGLRRSGMILAHCGVAFAVIGAVM SGYFGSEIGVRLAPQQSQMLNGYEFRYIGFTNELGPNFTSEKAHFEIYKN NQKLTALYPERRYYEVRTMNMSEVGIQWGVLGDIYIVMGDKLAPNEFSFR LHYKPFVRWLWLGGILMALGALVAAVSLVQRKNAMAFSSAIKKE >MS0606 ccmF, CcmF protein MIAELGNFALALGLAISVLLAVFPLWGAEKGNKQLMSLARPMTYGLFICL TFAFGALFYAFAVNDFSIQYVVNNSNSRLPLQYRLSAVWGAHEGSLLLWI WLLSVWSVAVSLFSRQLPQEAVARVLGIMGLVTIGFLIFILFTSNPFART FPNLPIDGKELNPMLQDVGLIFHPPLLYMGYVGFSVAFAFSIASLMTGKL DTAWARWSRPWTMAAWVFLTVGIVLGSWWAYYELGWGGWWFWDPVENSSL MPWLAGTALLHSLAVTEKRGAFKAWTVLLAILAFSLCLLGTFLVRSGVLV SVHAFASDPTRGLYILAYLIVVIGGSLTLYAYKGGQIRSRDNAERYSRES MLLLNNILLMAALVVVLLGTLLPLVHKQLGLGSISIGAPFFNQMFLVLMT PFALLLGIGPLVKWRRDKFSAIRKPVIISLFLMIILGFALPYFIGNKLSL SAVLGTMMVVIITLLSLYELKQRAAHRDNFLIGITKLSRSHWGMFLAHLG VAMTVWGVTFSQNFSVERDVRMSVGDSVNIAGYEYKFQGIRDANGPNYLG GTAQVDILKEGKLEGSLFAEKRFYTVSRMTMTEAAIDWGFTRDLYVALGE QLEDKSWAMRLYYKPFIRWIWFGGVFMALGGVLCMFDRRYRFSKILNK >MS1812 ccmH, CcmH protein MKKILFSILVFVSLSLQAEMVDTYQFKNVEDRTRAVALAKSLRCPQCQNQ 
NLVESNSPIAYDLRIEVYKMVDEGKSNQQIVEVMTSRFGNFVLYKPPFEL TTALLWCLPIGLLLLAVLLMVRYLRRRSENREICTALSERQRRELAELLA KNKDKK >MS0608 ccmH, CcmH protein MKKLTALLIMLVAVVASPCFAAIDAFNFSSAQQENDYHALTNELRCPQCQ NNNIADSNATIAIDMRAKVFELLQEGKSKQDVVNYMVERYGHFVTYNPPI TVATILLWILPALLICFGLAFVFRQKGKTLIKNSSQDISTENSTVENLSD EQQKRLKALLKNKE >MS1835 cdd, Cdd protein MKNTIIKGLTDLVEQKRDNLIRQVVVQLEAQGYKAVLEQATVQQFCRQFA LSPVEFALRCLPVAACYALTPISQFNVGAIAIGQSGSFYFGANQEFVAAS MQQTVHAEQSAISHAWLAGEKAIAHMVVNYTPCGHCRQFMNELNSAERLK IHLPHSQNNLLHNYLPDAFGPKDLNIQNVFFDGQSHPFNYQGHDPLIRAA VEAASQSYAPYSQAFSGVALQLGELIICGRYAENAAFNPTFLPLQSALNY QRLQGLIDVKVSRVVMAEAKADLTSLPMTQSLAGAHLGLDIEYISL >MS1926 cdsA, CdsA protein MLKERILSAIALIAVVFAALFLFSPFYFALCLGAVVTLGVWEWTQFAKIK TEVWRYVISAIAGTFLFLWIYSHHSYLNAGRVFDGLAEPLLLAAVIWWIA AFFLVINYPKSASIWSKSLILQIIFAFFTLLPFFIGVLKLRLDGYIIDAH HGVVLLLYVFILVWAADSGAYFVGRKFGKHKLAPKVSPGKTWQGVIGGLI TACVLAFIFQTIAGESLFNRGSTFSLTLLSVATVAISVLGDLTESMFKRE SGIKDSSQLIPGHGGILDRIDSLTAAVPFFAYFYFFVL >MS1111 chb, Chb protein MMIFSSCKTPNHFLCAIFIGFGLNIPQTYAETVTQQFQKAFSSAEVSEKK ESGLALDIARHFYSAETIKNFIDTIHKNGGTFLHLHFSDHENYAIESTIL DQRAENARRDENGFYVNPKTGKPFLTYEQLKDIMDYAKQKNVELIPELDS PNHMTAIFNLLAEKNGKDYVQKLRSKWTNEEIDITNPDSIAFMTSLIEEV VWIFGNSTKHFHIGGDEFGYSEENNHEFINYANKLSAFLKEKGLKTRIWN DGLIKSTIDQLDPNIQVTYWSYDGNTQNKQAARQRRTMRISMPELIERGF SVLNYNSYYLYFNPKESPNISKDSDFAMRDVIKNWDLSIWDEKNTQNKVA EPNKISGSALAIWGEYAGSLKGDSIHKATENLLKAIIYKTNAAGDSTGTI SRKLQQLDFAQINANSYIDLMQVRNNESVTLENYPQTVHLLQTNALSGKK RVLWVSGSHVHKIRLEPQWQKTGLNEKRNGKSYTAYKYQDNILWLDDNIT EQ >MS1015 cirA, CirA protein MCCFHLIIMKKIDVYHENFPLENSIRLDRIFFILAKESAMKLNLITSALL FSTIHSSFAVEPIKTELTPIEVYSAYAIPVNQDQTASSLTVLTEKDFAGR NAAYVSDVLKTVPGVAFGINGGRGATTSLFLRGADSNHTTVIIDGVRMNP INGNGFDFGGLALSNIERIEVLRGEQSALWGSNAMGGVIYITTKSGLYKE KPFNVDFDLGTGSRNTRDASATISGYNKGFYYALHGDSHRTKGISALSSN RFNYTALDGTKVTTGGAGEQDGFHRDNASIRLGYDDANKGLELLAQHSSQ SAHYDNSLAEERLFNDYMRTRETLFKLAGYWGSEHELFKHSLSASHLKTD NDTFSLWASAYDAKRLNTNYQLDINFDRDGATTQSFSILGEYQKSKYDST SYTDEKALNEKSIAAEYRLFHENGHSLSLSGRYTDNSKYDDTFTGRIAGA 
YRLSPNLKTHASFASAVQNPTFIEYFGYYGSYAANEGLKAERSRGGDIGL LIESTDKHHSLDITYFARNVDNFISSELVDPVYYIYRSINLEGTTKIRGV EIAYNGQLTDNLTAYANYTFTRSKDSQGDSLVRRPKHQANAGLNYQITEK FGSNVNIAYVGKRIDNYYEETYPYAVHAVNMPSYTLLNLGVNYQLTSNIN IYANLNNLFDKKYENILGYGQDGRNVYVGLKGSF >MS1315 cirA, CirA protein MNFKFNLIYTALFSGLAFSSYATETNQEVNTELEQINVATELEKAKAAGN KQKDIVNLSLLGRQPAFTSPISVVNYDEKAFEDKQPRNVVDAIAKTDASV MNFGGETNTLSGIYVRGLQLDARQISVNGLAGLYSTYNSPTAAVSSAQLI KGASTATTGMDPEGSAGSAMNIETKHATDNPINKIGFGWFSNNRLQESFD FGRRFGENNAWGVRVNGKYRDGDTARHGYDEINKEFAVAADYRGDKFRAA IDYMYAKRATNGGRARVQDIQNLDFAMPKAPDGKINLIPSWSGQTTEDQT VMGTFEYDLPYNMMLSGGLGHMESKYYGAFGQIRMTNTEGDYSIRQMRAI DYRIRTTSANLKLQGELETGSLYHMWNTSFDFVQRQRDFDQSPVLSNFST NIYNPVFPSVTAYSALQQSTDEKSRSYSWALADTVGFFDNSLRLTLGGRF QWIKQHNYKNDSKGDKNRFSPMVTLAYVPNPDLVFYGNYLEDLEPGYVDE DGNMAKPVVSRQIELGVRKNWGDLFTTTASIYQIRRPGIVTTNLAKNNAD FTVGEEQGEERNRGIEFNIYASLFNNTVRPSLGITYNKGELIDYSTYAGA IKTGTQVASPRIISKANIDWDTPFIENLTLNAALQYYGKSYQDIDKKYKL PAYTTVDLGAKYLIKLNETQTLTLRAAVENVFDKNYWQVQRGKYDRSFAV VGMPRTYWLTAEYTF >MS1205 cirA, CirA protein MKKTFIYSTVAQTVLLTIAGTAIAAEDGTEQLDTIDVVTEGSMFRMGEVP FHQAKSAVAITREQLDSQNVDKLDEVAKYQAGFANQVFGNDTNTNWFRVR GAEVSQAVNGLPTFSYGFFTPYVNSFGLEAIEVTKGADAMTFGAAKSGGL INYVTKRAHKDQIGHGEFKTTFGSHNQYGLAADYTGTLTDDERLRYRVVA SYLGRDGDWEGTDNQTLYIAPTLTWDISDKTRFTLLTGYQRDHGTPSSNF LPQEGTLVPSPRGYIHRRTNLGDPVKDTETNRQYNIGYEFSHDFNNGLSF NSSYSYSHIDNYHRGAYAYPSAYNADWSPLAPSAAGYSLSRAVVFNDGKA ISHTTNNYLTWNYDNAWLKNTLVVGTDYRHNKVDALYSLYGTTSNTNLFT PSVGWNQAQDVSAAPHVQIKSRQLGFYLQNNARFADKYVLGLGIRHDRAE QREYTSTQKVKDNHTSYSASLMYEAPFGLNPYFSYSESFNLPTGLSGDET LYDPNITRQYELGLKYIPTWLDGTISVAGFRAKDTGALVGNGLGATISSA DPIYRKGFEVQADVNFTSNWTGTLAYTYTKSESKDSAGKKTRQPLIPTNM VAAKTAYSFTEGLLKDLTVGVGLRYLGHSVTSKGSLYSHARLPSATVVDL MARYAITPNWIAQVNIDNVGNRRYVSGCDYYCYYGAERRATANLSYKF >MS0516 cirA, CirA protein MKKLKISLLPLTAFVAATVHAETLDTIDVVSDNFSPQAENIAAKGVTKVR QATKMSDVIRGVPGVNVNGARSTVERYNIRGVSEEYLNVTVDGARQNGYS FHHNGNYGIDPEILKRVDIDVGSNSVSTGAGSLGGSMKFETVDAADMLEE GENFGGKVKYGYGSNGNSNQGTAMLYGRRGNLDLLGYFNYRHQRDGEDGN 
GLKNKNKGHLSNYLFKTKYNISNEQWIKASAERYTNTALSCYRANMGMCL GDVPQPGEPGYVETNHGKAYTELTRKTYTLSYGFNPEHNNWVNIKANAYN TETEVASMGSPKSKVRTVGGTLSNTSEFELGVTSHQFLVGGEYYNSKAQA LGSVNNAYVADMDSTSVYVEDKIALGNLMIIPGVRFDHYKADLASDFDKS YHRFSKALGLKYSLTDNLIVFANYTELFKGPDAGEIYLRGTRAYDGNLEA ARGDNKEVGFSYAKDGLFSDIDGFSFTAKYFKTDYDNINQTVSASRCVNT SAISSGSIYCNLGKVDIKGVEAQAKYRYEDTSFSVSYARARSEQKSTGLA AFADTGDRYNFTLSQYISSAQVELGWNTMYVRAIDVDDSTLKESYAVSNM YVSWSPAQAKGLELTFGIDNIFDKAYKDHSTQYYGSVDLDPGRNYKLSVS YKF >MS0278 citB, CitB protein MHGIAWGYCFIIVKVRNMSEKTKVIVIDDHPLMRRGIKQLIELEEQFEVV GDAGSGNEGVELAIKTSPDLIILDLNMKGLSGLDTLKVLRQEGVDARIVI LTVSDSKADIYALIDAGADGYLLKDTEPDTLLAQIKQIAQGEIILSDSIK NLLVERHPAHEPIHALTDREMDVLQLIATGLSNKQIAAQLFISEETVKVH IRNLLRKLNVHSRVAATVLYLEYKGS >MS2285 citB, CitB protein MRKFYRTFFQNATIRRSKAYPGEITMIKVALIDDHVIVRSGFAQLLSLEE DIEVVGEFGSAKETRQNLPRIKADVCIIDISMPDESGLDLLKSIPSGIHC IMLSVNDSEMIVKKALELGAKGYLSKRCSPEELVQAVRTVYTGGVYLMPE LTVKLVTNKNNNPIQQLTKRELEICELLIRGLGAKEIGEQLGLSFKTVHA HRANAMSKLDVKNNVELANLFHQYS >MS0424 citT, CitT protein MNEQLLIWFQSPLLWVVALLLGAVFLFMQNKLHMDVIALLVMLLFCLSGI LSLDEVFAGFSDPNVILIALLFIVGEGLVRTGVAYQVSEWLMKVANNSEI KVLILLMLAVAGLGSFMSSTGVVAIFIPVVLMICQQMNISPKRLMMPLSV AGLISGMMTLIATAPNLVVNAELARIDNLRFSFFSFTPIGLTILVIGIFY MLLVRRWLSSSTEDLKQKQKRDSITDLIEEYQLHQRTKRFVVKSNSQFIG HAVEDLHLRSNYGLNILAIERWKHFRPLFIAASLGKTEIKEKDILLIDVA NPDLDLSAFCHLYHLEPTEIRNTHFNEQLKSVGMVEITPVPDSVAIGRSA AELRFRSNYGLNVIGIKRNGELLQGHLVEEPYKLGDQLLVIGDWKLIRKL PDRTKDFFVLDYPSEIERAVPARSQSMHAILSVVTMVVLMVSGVVPNVVA ALIACLMLGKFRCIDAKSAYDSIHWASLILIVGMMPFSIALQKTGGVDLV VNFMINTVGNMGKHWILISLFILCAVVGLFISNTATAILMAPIAINMAHQ LNLSPVPFAMTVAIAASAAFMTPISSPVNTMVLGPGGYKFGDFIKIGVPF TILVMLVTVFLVPVLFPF >MS1378 citT, CitT protein MSTDTQENESSKNRRNMIILIADIGLFFILLNVLPFDEAPRKGLSLLAFV AVLWLTEALHVTITALFVPILTIGLGLFSTKEALVAFADPTIFLFFGGFA LAAALHIQKIDRLIANKIMTMAKGNLCVAVLFLFFATAFLSMWMSNTATA AMMIPLAMGIMSNMDREKEHNTYVFVLLGIAYSASIGGMGTLVGSPPNAI AASQVNLTFADWVKYGVPVMLLLFPIVIGLLYFNFRPKFNQTFDYQFEQI QLTTPRIITLSIFVLVALLWVLGSEINPYIASLLGLGGKIASFDSIVGVS 
AALLLCICRVVNWEQIQHHTEWGVLYLFGGGLTLSAVLTHTGASKIMADG IVAIIEGKHYYIICLIIASFIVLLTEFTSNTASAALLVPIFISIAESLNM SPLGFALIIGLGASCAFMMPVGTPPNAIVFSTGMVKQRDMLRSYKINLSC IIIVSAIGYLFWL >MS0970 citT, CitT protein MNIQTSAPKVEVRLGFKLQGLLIAVLVGIAILLIPTPEGLSTKAWGMFAL FVATIVAIIAKAMPMGAATLVALVISGLSGLTPLSPAKGEVGMLSGFSNG TIWLIAIAMFLSRAVIKTGLGKRIALYFVARFGKRMMGVAYGIALADVVI GPGIPSASARGGGIMYPIMQSIADAYNSKPGPTARRAGAFLAIAVSQIDT IVCTMFLTAMAGNPLIAELAKSQGVEITWMTWFLGAIVPGIVSLILLPYF VYLIYPPELKDTPKMAEMAREELNSMGKMSQAEWILALDFILLLLLWTVG DLVFHIPATVSAFVGLVILLLTNIMSWKNIISETAAWDTMFWFAVLVMMA NALNKYGTISWISTHIADSVGSFSWPVAFTILVLVYFYTRYFFASAMAHI SAMYLAFVAAAIAVGTPPIIAAIGLGYTSTLSMSLTQYAGGPGPALYGSG YNSTGQWWGVSFAVSILSLAIWFSVGGVWMKLLGWW >MS1915 citT, CitT protein MRVNFMNTQTSVSPPSIFSRNSLIFMADVIIFALLLAFLPFEQNVNKGLA LLVFVGVLWLTEALNVTVTAVLVPLLAIGLGLVTTKNALVAFADPTIFLF FGGFALATALHIQKLDRLIANRIMALAKGNLFIAVLYLFSVTAFLSMWIS NTATAAMMLPLAMGILSQLDREREHNTYTFVLLGIAYSASIGGMGTLVGS PPNAIVASQLHLTFSDWLWYGMPVMIILMPLMIGCMYVIFKPRLNIRFTQ DFEKIEMTTPRIITLLIFILTAVLWVFSSSVNPMLSGLLGLPKDIASFDS VVALLAAALICISGVASWKQIQDNTEWGVLLLFGGGLTLSAVLKDSGASK VMADGIVFLIQGGHFYVIGLIVTAFIVFLTEFTSNTASAALLVPIFISIA QALGMPEMGLALLIGLGASCAFMLPVATPPNAIVFGTGEVKQSDMIRAGV VLNILCIFVIGTIGYLFWFG >MS1783 clpA, ClpA protein MNIEKFTTKFQQALAEAQSLALGKDNQYIEPVHVLSALINQQDGSVAPIL TSAGVNVGALKAELNSEINKLPQVSGNGGDVQISRQLLNLLNLCDKIAQR NNDKFISSELFLLAALEEKGSLGDLLKKCGAKKESLEQAIKTIRGGQSVN DQNAEESRQALEKYTIDLTERAESGKLDPVIGRDEEIRRAIQVLQRRTKN NPVLIGEPGVGKTAIVEGLAQRIVNGEVPEGLKGKRVLSLDMGALIAGAK YRGEFEERLKAVLKELAQEEGKVILFIDEIHTMVGAGKTDGAMDAGNLLK PSLARGELHCVGATTLDEYRQYIEKDAALERRFQKVFVDEPTVEDTIAIL RGLKERYELHHHVQITDPAIVAAATLSHRYISDRQLPDKAIDLIDEAASS IRMEIDSKPEPLDRLDRRIIQLKLEQQALKKEEDEASRKRLDMLEKELSE KEREYAELEEVWKSEKAALSGTQHIKAELESARTQMEQARRAGDLNKMSE LQYGTIPALEKQLAAADSAEGKEMSLLRNRVTDEEIAQVLSRATGIPVSR MMEGEKEKLLRMEEELHKRVIGQGEAVEAVANAIRRSRAGLSDPNRPIGS FLFLGPTGVGKTELCKTLANFMFDDENAMVRIDMSEFMEKHSVSRLVGAP PGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHHDVFNILLQVLDDGRLT DGQGRTVDFRNTVVIMTSNLGSDLIQENKDLGYEGMKEIVMSVVGQHFRP 
EFINRIDETVVFHPLAKENIKAIAQIQLARLTKRMEQHGYAINFSETLLD FISEVGYDPVYGARPLKRAIQQEIENPLAQQILSGKLLPAKPVTVDYEDG KVVAKQ >MS1847 clpP, ClpP protein MALIPMVVEQTSRGERSYDIYSRLLKERVIFLSGEVEDNMANLIVAQLLF LESENPEKDINLYINSPGGSVTAGMAIYDTMQFIKPDVRTLCVGQACSMG AFLLAGGAAGKRAALPHARVMIHQPLGGFRGQASDIQIHAQEILKIKQTL NERLAFHTGQPFEVIERDTDRDNFMSAEDAKNYGLIDSVLVKR >MS1846 clpX, ClpX protein MTKETETTCSFCGKSQDEVGKLIAGVDGYICGECIDLCHDLLHDEETREQ QSAEEAVETEEKLPTPHEIRAHLDDYVIGQDYAKKVLAVAVYNHYKRLRS NHGIADVELGKSNILLIGPTGSGKTLLAETMARMLNVPFAMADATTLTEA GYVGEDVENVIQKLLQNCDYDTEKAQRGIIYIDEIDKITRKSENPSITRD VSGEGVQQALLKLIEGTVASIPPQGGRKHPQQEMLRVDTSKILFICGGAF AGLDRVVQKRIHKGSGIGFDAEVKGKEDEVSLTDLLKQIETEDLIKYGLI PEFIGRLPVVAPLSELDEKALVQILTEPKNALTKQYQALFGLENVELEFT PEALNAMAKKALERKTGARGLRSIVEGALLDTMYDLPSLEGLVKVVVDEA VINEHSAPKLEY >MS0250 cls, Cls protein MLVNKIKRAKQRLERLPYLAQSVADFEVIYNPAQFKQTIIQLIRSAKNRI YITALYWQFDEAGQEILNELYAAKQKNPALDVKVLVDWHRAQRNLLGAAK GTTNADWYCEQRAKHQAQSMFFGVPINTREVFGVLHIKGFVFDDTLLYSG ASINNVYLHHKDKYRYDRYQKITNPALADVLVAFVNQYLLDPNAVHALDD AARPATKEIRMHIKAFRKKLALEAGYWINNAVAFSNDQLTVSPLFGLGTS GNVLNNCIEDLFMLVKEKLVICTPYFNFPRTLKGKISHLLKQGKKVEIIV GDKVANDFYIPPTEEFKIAGALPYLYEKNLRAFCKKFAAQINEGLLVVRT WKDGDNTYHLKGVWVDEDYILLTGNNLNPRAWRLDAENGLLVHDPKHELR VQMEEELRQIRQHTTVLRHYSELQKMNQYPEPVQRLLKRFERVKVDKLVK MIL >MS1171 cls, Cls protein MMIKSLCLCIILNSCIKFSKSAVRMNINLDSIIAYIVPVIMWTLIVTITL RQIIKRQSSSAMLSWLMIIYIVPVVGILAYLVLGEINLGKRRANASKQLL PKYMKWFAGLKNQQHLLINDQQPSLASPLFALAQRRLSIPCINGNELHIL DTPESIIQNIIDDIHQAQYSINMVFYIWSNGGLVEQVQQALIQAKQRGVK IHILLDSVGSRAFFKSENYQKMTALGIEIEEALHVNLLRVFLRRIDLRQH RKIIVIDNQISYTGSMNMVDPNFFKQDSHVGKWIDIMVRIDGPVSAVLNG LHSWDWEMERGQGLYVPLPSPQHPMDNYNIHAVQILATGPGLPADLMEQS LATAIFAAKESITITTPYFVPSQNIVDALQIAALRGVNVSLILPVHNDSL MVRWASRTYFDDLLTAGVKIYNFTEGLLHTKSILVDNKMALVGTVNMDIR SFSLNFEVTMVVEDQTFANEISLLHENYMNGATLLDEQRWLNRPVFTRII EKLFFLFSPLL >MS1477 cmk, Cmk protein MTKNIVITVDGPSGAGKGTLCYALANRLGFALLDSGAIYRVTALAALQCK ADLTNEAELAELAAHLDIEFLPEAGEVKVMLGGEDVSGLIRTQEVADAAS KVAVFPQVRSALLQLQKDFATPKGLIADGRDMGTVVFPTAQVKLFLDASA 
EERAKRRFKQLQNKGISGNFDQILAEIKDRDFRDRNRPVAPLKPADDALL LDSTTLSIEEVIAQALSYIHQKVKI >MS2188 coaA, CoaA protein MNIESQSSVSEKFSPFLTFTRKQWAELRKSVPLKLTEQDLKPLLGFNEEL SLEEVSTIYLPLARLINYYIEENLRRQTVMNRFLGNTNANVPYIISIAGS VSVGKSTSARILQSLLSNWPENRKVDLITTDGFLYPLEKLKKENLLHKKG FPVSYDTPKLIKFLADVKSGKPNVSAPIYSHLTYDIIPDKFNKVDRPDIL ILEGLNVLQTGSRKAEQTFVSDFVDFSVYVDADEALLKEWYIRRFLKFRE SAFTDPNSYFKDYAKLSKEEAVETAANIWNTINGLNLRQNILPTRERANL ILRKGADHAVQEVKLRK >MS1951 coaD, CoaD protein MTKTVIYPGTFDPITYGHLDIIERSAVLFPQVLVAVASNPTKKPLFELAE RVRLAEESVAHLPNVQVIGFSDLLANVVKERHITAIIRGMRTTMDFEYEL QLAHLNRALTDGVESLFLPSTEKWSYVSSTIVREIYLHRGDVSQFVPPPV LTALMEKNR >MS0359 coaE, CoaE protein MAYIVGLTGGIGSGKSTIADLFMELGVPVVDADEVSRRLVEKGSPLLSKI ATHFGADILTNGGELNRSKLREIIFNRPEQKNWLNALLHPAINEEMQRQL QAQQAPYVLFVVPLLIENNLMSLCDRILIIDVSPQTQLERATKRDKNQRE LIQQIMNSQVSREKRLTFADDIINNDEDFAQNGDRIKQKVLELHQRYLQL AQQKSSTYDNKNDR >MS2235 cof, Cof protein MTIPNLRDKIKIVFFDIDETLIMKFEDILPDSVLPVIRKLKQNGIIPAIA TGRSRCSLPTKIKALIAEEPIELFVTMNGQFSVFQNKVIEKHPIPTEKVQ HLVDFFDAQQIDYAFVSDNNVAVSKITAKQKSALDPILTDYIVDKDYFKH NEVFQLLPFYDQSQDELVKNANILDGLRVVRWDKDSVDLFDAEGSKARGI ASAIKRLGFEMENVMAFGDGLNDLEMLSTVGVGVAMGNARDELKKVADFV TDRIEDHGIYNFLVKAGLIED >MS2344 cof, Cof protein MQYKAIFSDIDGTLLNSRHQISSKTESVIKLAVSKGIPFIPVSARPPYAI TPYTEQLQTNQGIICYSGALILDKNLRELYSVQIDQADLAALNQILADYP YLSINHYAALDWFSNDLDNYWTKQEADITGLFPKQTPSNLTKVHKILVMG EADKIKPLEQKLKQKLPHLSIHLSKPEYIEIMNKAATKAKAIGFMERHLH VSADEVIAFGDNFNDLDMLEYAGLSVAMGNAPDEIKQVAKKVTASNDEDG IALVLNEIFNL >MS2225 cof, Cof protein MKQLPFRAIVSDMDGTLLNANHVVGDFTINTLEKLAQKGVDIVMATGRGY TDVASTLSKMKIKNAAMITSNGAQIHDLQGNRLYSNYLPEDVAFEVMQLP FDADRVCMNTYQNNDWFINIDLPQLRKYHQTSGFMYEVVDFKKHHGRDTE KVFFIGKKPADLMEIEQELTTRFGNYATITYSTPVCLEVMNKNVSKATAL AHLIEQREYSLSDCIAFGDGMNDIEMLTEVGKGCIMQNADPRLLQLLPDN ERIGLNKDESVASYVRAVFGIY >MS0842 cof, Cof protein MAYQVLAFDLDGTLLNSQGIILPSSKKAIEAARAKGMQVILVTGRHHTAV KPYYYELNLETPIVCCNGTYLYQPQTDEVLRSNPFSKTQALQLIDIAERQ KIHILMYSRNAMNYMELNPHMEKFQKWVQSCPQNVRPDVRQVSSFRDIVN NEDIIWKFVMSAPNRELMQQTVNMLPQDQFSCEWSWIDRVDISNKGNTKG 
SRLLEYLRSVNMNPEQVVAFGDNQNDLSMLTSVGLGVAMGNADEIVKQQA KCIIGTNNENSIADFIEGLK >MS1748 comEA, ComEA protein MTTLFLILCIGSKKYTVFLCMNRLFPESNVVATEQRYFNFKRESLMKLSV RKFLLSCLAAGSLLSAGTAFAADKVPASAETQAIKTSETAKPADNIGNTV NINTATAEEIKQTLIGIGAKKAEAIIQYREKHGNFTNVEQLLEIQGIGEA TLDKNKDRIKL >MS0826 comEA, ComEA protein MKEKTKSTKNQLSESAKERMETAKNSVTSTKDKAASMKPTVKNALNSSSK VNINTADAKTLQSLTGIGEVKAKAIVDYRKKVGKIKNASELSNIDGIGDA TIEKITPYLNF >MS0931 comEC, ComEC protein MMKLDLFLFCFIVNTLCLLVLPESFLLDFPLFLHFLFPLVIAAFIYWFKY RRLWRGFYYLFCGLIAVFYIHFQALSLFRAADGVKYLPAKVQTDFVIDEI LYQRDYRNIIVKAQLAPEFKPQRIYVNWQADQAVKTGEKWRGELHLRAVS SRLNYGGFDKQKWYYAQGITAWAKVKSAVKISEDLSLRQQLFNHYLAQTE RLRQQGLLMALAFGERAWLQEDVWQIYRKTNTAHLIAISGLHIGLAMLLG MGVARLIQFCLPTRYISPYFPMLSGLVFAAVYAGLAGFAIPTLRALIALV IVSLLKLLRGYCNVWQLFLRVIGVLFIFDPLMVLSNSFWLSVCAVFSLIL WYQIFPLNLLEWKGKSVTDGKFAWLFGLIHLQLGLFCLFSPMQLMTFQGI SLAGFWANLIIVPLFSFLLVPVILFALFSNGAWESWRIADWLAQWFTHLL SYFQDYWIGVSNQTSWLICCLLCLLLLTVVHFIYPLKKQIPEKNELLTQF KTKKISLKSDRTLSPVLRKYLVSVATLFLASGAMLWLYQQWRQPDWRFET LDVGQGLANLIVKDGRAVLYDTGAGWKNGSMAQSEIIPYLQRQGLILEKV ILSHDDNDHSGGIADILQAYPSINILQPSMVNYEKTEQNSFNFDRTFCKQ GLNWQWHGLNFQVLAPAKIAERANNTDSCVLLIDDGQYKLLLTGDADLAA EQQFVAHLGKVNVLQVGHHGSRTSTGEALIKQIKPDFALISAGRWNQWGF PHPVVTQRLKRHKSAVYNTAFSGQISFEFYPNKIEVKTARSNYQPWFRQI VGGERD >MS2234 comFC, ComFC protein MNWFAFRCIYCQRKLAIGSHGLCCSCNKQIRRFNYCGVCGSELAENTLGC GNCLQNRPAWHRMVIIGAYKMPLSSLIHRFKFQNSFYFDRTLARLLYLAI RDARRTHGLMLPEVIIPVPLHHFRHWRRGYNQADLLAGQLAKWLNIPCNN RLIKRVKHTRTQRGLSAAARRVNLQKAFRFADKKQACPYKSVALVDDVIT TGSTLNALAGLFVQQGVEQIQVWGLARA >MS0341 copZ, CopZ protein MKKTLLFLTALLFSGSGFAAERNVTLHIEEMNCQLCVYLVNKELRNIDGV ISTKANFNTRLVKVVADEKVTDEMMINAIDKLHYHAVVKK >MS0320 corA, CorA protein MINAFALENARLTRLDEDNLSTLNKAIWIDLVEPTSEEREILQDGLEQSL ASFLELEDIEASARFFEDEDGLHLHSFFYCEDEEDYADLASVAFTIRDGR LFTLRDRDLPAFRLYRMRSRYQRLDECNAYEVLLDLFETKIEQLADVIET VYSDLERLSRVILDGKQGEAFDDALGTLTEQEDMSSKVRLCLMDTQRALS FLVRKTRLPANQLEQAREILRDIESLQPHNESLFQKVNFLMQAAMGYINI EQNRVMKFFSVVSVMFLPATLVASTYGMNFEFMPELGFKYGYPMAIGLMI AAGVTPYMYFKRKGWL >MS0346 
cpsG, CpsG protein MTKLTCFKAYDIRGRLGDELNADIVYRIGRAFGQFLKPTTIVVGGDVRLT SKELKSAVTNGLLDSGVNVIDLGEVGTEEIYFATSFLKADGGIEVTASHN PMDYNGLKLVREGSRPISADTGLADIQRLAEENNFPAVTQRGVYKQQSVL GEYVEHLLSYINLDNLKPMKLVINSGNGAAGHVIDAIEAQFKARRVPVEF IKVHNNPDGTFPHGIPNPLLHENRQDTIDAVLANKADMGIAFDGDFDRCF LFDETAQFIEGYYIVGLLGQAFLQKHKGAKIIYDPRLIWNTIKLVEENGG EAVMSKSGHSFIKEKMRAIDAIYGGEMSAHHYFRDFNYCDSGMIPWLLVM ELVCTTGKTLGQLVNDSIDTYPSPGEINSKLADAKTAIARVRAAYEKDAV SVDETDGISIEYPTWRFNLRSSNTEPVVRLNLETRGDKKLMTEKTEEILA LLRQ >MS0771 cpsG, CpsG protein MTTIFTIAQNWLEQDPDLETKAELAQLIDNAKAGDENALKELTARFDGRL QFGTAGLRGRLQAGSMGMNRVLVAQAAGGLADYIKQYDHNAPSIVIGYDG RKNSDIFARDTAEIMSGAGIKAYLLPRKLPTPVLAYAINYFDATAGVMVT ASHNPPEDNGYKVYLGKENGGGQIVSPADKDIAALIDKVAAGNIKNLPRS QDYVVLDDQVVDAYIEKTASIAQEPRTDINYVYTAMHGVGYEVLSKTLQK AGLSQPHLVKEQVYPDGTFPTVSFPNPEEKGALDLAVKLAKKQNAEFIIA NDPDADRLAVAIPDAKGNWKSLHGNVLGCLLGWHLAKKYHAAGKQGVLAC SLVSSPALAEIAKKYGLQSEETLTGFKYIGKVKGLLFGFEEAIGYLVDPD KVRDKDGISASVAFLDLVLYLKKQGKTILDYMNEFNREFGAYVSGQISIR VSDLTEISKLMTALRNNPPSEIGGFKVTQFIDHLKTERNNDILVFTLENG SRLITRPSGTEPKIKFYLDAKGKDALDADNVLAQFDESVRALLRREQYGK QDC >MS0967 cpsG, CpsG protein MAERKYFGTDGVRGKVGTFPITPDFALKLGWAAGKVLASQGSRQVLIGKD TRISGYMLESALEAGLAAAGLSAAFIGPMPTPAVAYLTRTFRAEAGIVIS ASHNPYYDNGIKFFSAQGTKLPDEIEEAIEAMLEQPIDCVESAELGRASR IKDAAGRYIEFCKGTFPTELSLSGYKIVVDCANGATYHIAPNVMRELGAE VIEIGTSPNGMNINEKCGATDIKALKAKVLETKADVGLAYDGDGDRIMMV DHLGNVVDGDQILFIIAREDLRAGKLKGGVVGTLMSNMSLEISLKTLGIP FIRANVGDRYVLEKMVENDWKLGGENSGHIIIADKNTTGDGIIASLAVLT AMAQHKLSLNELASAVKLFPQVLINVRFSGGTNPLESDAVKAVAAEVEKR LAGKGRILLRKSGTEPLIRVMVECSDAELARKSAEEIVEAVKAN >MS2226 crcB, CrcB protein MIMWQSLILISSGAALGASLRWGMGLILNPLFAAFSFGTLIANYLGCFII GLIMAMIWQHPQFSGEWRLFMITGFLGSLTTFSSFSAEVMENFIQQKWLI GLGIMSAHLFGCLIFTGIGVLITRWLN >MS1934 crp, Crp protein MLEQVNAHQTNVLPQTQPVQPASPMDPTLDWFLSHCHIHKYPAKTTLIHA GERADTLYYIVKGSAAVMVKDEEGKEMILSYLSQGEFFGEVGLFEEGQVR SAWVKAKNACEIAEVSYKKFRQLLQVNPEILMYLSAQLSRRLQNTSKQVS NLAFLDVTGRIAQTLLNLAKMPDAMTHPDGMQIKITRQEIGQMVGCSRET VGRILKMLEDQNLIAAHGKTIVVFGTR >MS1077 crp, Crp protein 
MPKFIEKQMKVLTPDAAKTGRRIQSGGCAIHCQDCSISQLCIPFTLNEHE LDQLDNIIERKKPIQKSQILFKAGDELTSLYAIRSGTIKSYTISETGEEQ ITSFHLPGDLVGFDAITNMSHPSFAQALETAMVCEIPFDILDDLTGKMPK LRQQMLRLMSSEIKSDQEMILLLSKMNAEERLAAFIYNLSKRYAARGFSA REFRLTMTRGDIGNYLGLTVETISRLLGRFQKLGILSVQGKYITINDMVQ LVELSGTNRTKIKLVD >MS1273 csdB, CsdB protein MFDTTGFRSHFPYFQHPDRVIYLDNAATTLKPQSLIDATVKFYQSAGSVH RSQYDEEQTALYEQARSQVRQLINAESDKAIIWTSGTTQAINTVANGLIP YIQSDDEIIISEADHHANFVTWSMIAQKCGAKLRILPIQDNWLIDENALL EALNKRTKVVVLNFVSNVTGTEQPVEHLIRLIRKHSSALVSVDAAQAISH VKIDLRKLDADFLSFSAHKIYGPNGLGVLSGKLTALELLQPLIYGGKMVD RVSKQQISFAELPYRLEAGTPNIAGVIGFNAVLSWLNQWDFEQAEHHAVQ LAEQTKVRLKNYEFCQLFNSPKPSSVISFVFKNIAGSDLATLLAEQNIAL RTGVHCAQPYLSRLGQHSTLRLSFAPYNTQQEVDAFFTALDKSLALLEE >MS1095 cspC, CspC protein MEVGVVKWFNNAKGFGFINAEGSDADIFAHYSVIEMDGYRSLKAGQKVNF EVVHGEKGSHATKIIPILE >MS1144 cspC, CspC protein MSKLNGLVKWFNSDKGFGFITPADGSKDLFVHFSSILGNNYRSLNEGDRV EYNVENTQRGPAAVEVAVIK >MS0166 cspR, CspR protein MLDIVLYEPEIPQNTGNIIRLCANTGFRLHLIEPLGFTWDDKRLRRSGLD YHEFAHIKKHKTFEVFLESEKPKRLFALTTKGGPAHSEVKFELGDYLMFG PETRGIPMAILDSMPMEQKIRIPMTENSRSMNLSNSVAVTVYEAWRQLDY IGAVNLNRK >MS0347 csrA, CsrA protein MLILTRKVGESLLIGDDISITILNVRGNQVKIGVNAPKDVSVHREEIYQR IKQAEEKESTS >MS1677 cstA, CstA protein MLWFLLCVAILLLGYFIYGKIVEKIFVINPDRKTPAYSLRDGVDYVPMTK KKIWLIQLLNIAGTGPIFGPILGALYGPVAMLWIVFGCVFAGAVHDYFSG MLSIRNGGANVPYLAGKYLGRPAKHFMNIIAILLLLLVGVVFVASPASLL TNITSDLMNSGSVGAAAVNDEAGAAKGNILIMWTAVIFIYYIIATLVPID KIIGRIYPFFGALLLFMTCGMLFGLFFEGIPFFRTLGGDISLADFFTNMH PKNAPIWPLLFITIACGAISGFHATQSPLMARCTENEREGRFIFYGAMIG EGIIALIWCMVGLSFYNDQAGLAEAIQIGTPSKVVYDAAIGMLGVFGGIL AVLGVVVLPITSGDTAFRAARLLIADFFKYDQRNLTKRLTIALPLFAIGF WVSTIDFSVLWRYFGWANQTTAMVMLWTAAAYLFRHQKFHWVCTIPAVFM TLVCSTFLLNAPIGFGLDYQLSVWLGGAVTAVAVIAFFMLLKPISADEQD >MS1002 cvpA, CvpA protein MIDYIIIGIIVFSIVVSLLRGFVREVMSLASWVVAFVIASQFYPYLANFL TQIESEYLRNGTAIGILFILTLIVGAIVNYVIGQLVDKTGLSGTDRVLGA CFGFLRGVLIVSALLFFVDTFTNFDQNDMWKESKLIPHFGFVVEWFFEQL QANSSFLNSTLNK >MS0277 cyaA, CyaA protein MKYDLQFAKKQVDDLHRLRVERVLQGSTADFQHVFQLIALLLHLNHPALP 
GYVTDAPAGVAHFKLSDYQKNFLAQQFPTGFDFVRLEQESNAHQQEKTPI YGVYVMGSIASISQTAKSDLDTWVCHSPDLTPYALNKLQQKTQLLKIWAK KFNTDITLFLMDEFYFNHYRYSNTLSVENCGSAQHMLLLDEFYRSAIRLA GKPLLWLHLNVENEADYGKEVQRLQQTKQINRADWIDFGGLGAFSANEYF GASLWQLYKGIDSPYKSVLKIVLLESYSQEYPNAKLISMQFKQQLFNLKP VKEQCFDAYLAMLERVTEYLTKLKDEKRLDFIRRCFYIKVTETVRERPLA PWRAKILKNLTAQWGWSEETIKHLNRIHTWKIRSVRETHNKLIRVLMLSY RNLVNFARKHNVNASIAPQDISILTRKLYTAFEVLPGKVTLMNPQLALDL SEKNLTFIEVTEEHGVKPGWYVVNQMPSVVYPSQNRYIEYNPILIKLIAW TYFNGLLTSKTKVHISSTHVDIEKINQCITDLRVSFPVKASPPTDEELTH PCEIRSLAVMINLTKDPTPYSDINRTEIQQSDLFSLDGENESLIGSVDLL YRNKWNEIKTLHYEGDKAMLSALKVLSNKIHRGSGVPESVNVFCYNQYYQ EEISELVVGLLNKCISIQLGTTQLPMSSVPRMTGKNWKLFFEEHDATLHQ PQTEPVFISQVIAEQKQVKVKRNQPYKHLLNYPRQIDSFASEGFLQFFFE DNEDETFNVYILDENNRLEIYRQCDGSKEQKIREISQIYNLSGSDQNDNH YKIIKRDFNYPQFYQLKHQQKGILILPFSGSCMV >MS2082 cyaY, CyaY protein MNIAEFHQNIDQIWDSIEEQLENQDIDADCERQGAVFTITFENRTQIVIN KQEPLLELWLASKLGGFHFSYKNGDWLNYEGKRFWDCLAQACAAHGEEVS FA >MS0715 cydA, CydA protein MLDVVELSRLQFALTALYHFLFVPLTLGLSFILVIMETLYVATNKQVYKD MTKFWGKLFGINFALGVTTGITMEFQFGTNWSYYSHYVGDIFGAPLAIEA LLAFFLESTFVGLFFFGWDRLTKAKHLLATYCVAFGSNLSAMWILVANGW MQSPVASEFNFETMRMEMTSFMDLWLNPIAQSKLVHTLAGGYVSGAMFVL AISAYYLLKGRDIGFAKRSFSVASVFGFISIIATIIMGDQSAYEVGNVQK TKLATMESEWHTQEAPASWNAFAIPNDAEMKNDFEFQIPYLGGIMATRSL DQTYPGIHDILIENEGRVRNGMVAYGLLEELRAQKKAGQVNEETKAQFLA TRDDLGYGLLLKRYTDKVVDATEEQIKQATRDTVPNVAPVFWSFRVMAAL AGVIMVLLSGAFIQNLRNATTKIPLLLHALLWCLPLPWIAIECGWFLAEY GRQPWAIYEVLPVGVANSSLSTGDLWFSIGLLCGLYTLFIVVEMYLMYKF GRLGPSSLKTGRYYFEQSSKAGA >MS1065 cynT, CynT protein MKKIEQLFANNHSWALRMKEENSSYFKELADHQTPHYLWIGCSDSRVPAE KLTNLEPGELFVHRNVANQVIHTDLNCLSVVQYAIDVLNIEHIIICGHTN CGGIKAAMANQDLGLINNWLLHIRDIWYKHSHLLGNLSPEKRADMLTKIN VAEQVYNLGRASIVQDAWKRGKKLSLHGWVYDVSDGFLIDQGVLATSRES LEISYRNSIARLKTLDEEDIFRKGNKENNDEIIG >MS1340 cysA, CysA protein MMLEINVKKRLGQLVLNARLTIPGQGITGIFGISGSGKSSLINLVSGLIH PDEGNIRLNDRTLIDTANNICLAPNQRNIGYVFQDARLFPHYSVKGNLCY GIKRFNQQEFNRIVRLLGIEHLLARYPLTLSGGEKQRVAIGRALLSNPEM LLMDEPLSALDLPRKRELLAYLEKLSQEINIPILYVTHSLDELFRLADFV 
VLLDEGKVAAFDSLENLWQSPLFEPWQEQGQKSAVLSLPILNHNFSYKMT ALLLGEQQLWVKLLNGDEGKTVRICIRSTDVSITLTVPEKTSIRNILSGK IITLLPKGNQVDVKIALGKDEIWASVSTWAAEELQLQIGQSVYAQIKAVS VM >MS1261 cysA, CysA protein MSIKIENLEKHFGSFHALKNINLQFKQNQLTALLGPSGCGKTTLLRIIAG LEFADSGKILFEHRDVTDLSAKDRGVGFVFQHYALFQNMTVYDNVAFGLR VKPRKERPSKEEIQQKVTALLKLVKLDWLANAYPNQLSGGQRQRIALARS LAVQPKVLLLDEPFGALDAQVRKELRRWLRDLHQELNVTSIFVTHDQDEA LDVSDRIVVMNQGQIEQIDEPNQIYHAPQTPFVTQFVGDVNVFHGHIDEG NLVIGEFSHKIDPATNTTQPVNNQSATAYIRPYELTISRHADNALATGKI THINAIGFIVRIEIESAQSDQPIEVILTKAAYSQSQYKVNEQIYLVPDKL NLFQQMNI >MS2212 cysE, CysE protein MLREVWNNIRNEAKELVEHEPVLASFFHSTILKHKNLGGALSYILANKLA TSTMPAITLREIIEETYQDDPRIIDSAACDIHAVRQRDPAVGLWATPLLY LKGFHAIQSYRITHHLWQQNRKSLAIYLQNQISVAFDVDIHPAARVGCGI MFDHATGIVVGETAVIENDVSILQGVTLGGTGKESGDRHPKIREGVMIGA GAKILGNIEVGKYAKIGANSVVLQPVPEYATAAGVPAKIISKDRSAKPAF DMNQYFIDDAEALNI >MS1254 cysG, CysG protein MMNYFPVFADLNNRPVLVVGGGTIAARKVNLLLKANAEVRITAQKLNAEL TALVEQDRIIWIAKEFHGEQIRNVFLVVAATDDEQLNEQVFQVAESRQKL VNVVDDQARCSFIFPSIIDRSPIQVAVSSGGAAPVLARLLREKLEALLPQ HLGVMADISGKWRHKVKQQLKTITERRRFWESLFNGRFSRLLKNRQIEAA KKELELQLTKDYQGGSVSLVGAGPGDAGLLTLKGLQEIQQADVVLYDALV SAEILDLVRRDAELIFVGKRAQGRQVAQQETNQLLADLALQGKRVVRLKG GDPFVFGRGGEELEVLAQQGIPFSVVPGITAAIGATAYAGIPLTHRDYAQ SAVFVTGHRKADASDIEWQTLARSNQTLVIYMGTLKAATIAQSLQQYGRA ASTPVAVISQGTQETQHTQIGTLKNLAELAEKAPTPALIVVGEVVSLHEK LAWFGEDKFAQKRPHFTLDSLRIERVA >MS1253 cysH, CysH protein MIIKPNFWQIPQPTATDFAALAEKEQLLAQRIHEIANRHQHAKFASSLAV EDMVITDVIAKSKAKITVFTLETGRLNPETLALADTVKKTYPDLDFRLFR PNPIAAEKYDREKGRFAFYESVELRRECCFIRKIEPLNRALADADAWLTG QRREQSVTRTELEFHEWDQSRGIDKYNPIFDWHEMDVWAYILKYDIPYNE LYKQGYPSIGCEPCTKQVKAGEDIRAGRWWWENKDSKECGLHK >MS1252 cysH, CysH protein MTTQNQIENGHLDWLEAESIYIIREVVAECSHPALLFSGGKDSVVLLALA RKAFQLEGRDLVLPFPLVHIDTGHNYPEVIQFRDEQVKKLNARLVVGHVE DSIAKGTVVLRKETDSRNAAQAVTLLETIEANGFDALMGGARRDEEKARA KERIFSFRDEFGQWDPKAQRPELWSLYNGKLHKGENMRVFPISNWTELDI WQYIEREKLELPPIYYAHQREVVERNGLLVPVTPITPKQPGDESKVVSVR FRTVGDISCTCPVASTAATPAEIIKETAVTEISERSATRMDDRTSEAAME QRKKQGYF >MS1249 cysI, CysI 
protein MSDKKQKGLEWQDNPLSDNERLKEESNHLRGTILDDLEDGLTGGFKGDNF QLIRFHGMYEQDDRDIRAERQEEKLEPRKFMLLRCRLPGGIIKPEQWIEI DKFARDNNYYQSIRLTNRQTFQYHGVPKTKLQDMHRLLHKLGLDSIATAS DMNRNVLCSSNPVESELHQEAYEWAKKISEHLLPRTNGYLDVWISGKKVQ SSDSFLGQEDEPILGNRYLPRKYKTAVVLPPLNDVDLYSNDMNFVGIKDE KTGKLAGFNVLVGGGLSFEHGNTKTYPNIALELGYVPVEDTLKAAESIVT TQRDFGNRADRKNARLRYTIQNMTLEGFREEVERRMGRRFEAIRPFEFTE RGDRIGWVKGIDKKWHLTCFIESGRLVDKPDLPLMTGMLELAKVHKGDFR ITANQNIIIANVAEEDKRQIEDIARQYGLIRKITKLRENAMSCVSFPTCP LAMAESERVLPEFIDELDKIMAKHHVEQDYIVTRITGCPNGCGRAMLAEI GLVGKAVGRYNLHLGGNIAGTRIPRLYKENITLDEILSELDGLIARWATE RDQGEGFGDFVLRVGIIKPVVNPVVDFWDENLIPTVAV >MS1250 cysJ, CysJ protein MSNTTNPLPPETEQLLAKLNPIQLAWLSGYAWAKAQGEDAGTNVTNKNAA STLVTEDKPLNVTVLSASQTGNANGVANQLAERLKAEGVNVTRKALKEYK AKTIGDEQFVLLVTSTQGEGEAPEEGVPLYKLLHGKKAPNLANLEFAVLG LGDTSYPNFCQAGKDFDKRFEELGAKRLLARADADLDFKSTADKWIQDVV EAVKAKSAVSASVVASVVSASSAQSAVNYSKENPYTAKLITNQKITARDS AKDVRHFEFDLSGSGLQYKAGDALGVWAENDPDLINEVLGLLKIQPDESV QLNGKSLDIHGALLSRLELTQNTPAFVKGYAQLANNKKLTALVSSDKKLA DYVNDTPIVDVLHDFPAKISAQQFADLLRPLTPRLYSISSSPEEVGEEVH LSVGVVRFEHEGRARTGVASGFLADRVEEDGEVKIFVEPNDNFRLPQDKS KPIIMIGSGTGIAPFRAFLQQRQAEEAEGKNWLIFGNQHFATDFLYQAEW QQFVKDGYLHKYDFAWSRDQAEKIYVQDKIREKSTALWQWLQEGAHVYVC GDASKMAKDVENALLEVIAREGKLTPEDAEEYLNDLREDKRYQRDVY >MS1770 cysK, CysK protein MTIFADNSYSIGNTPLVRLHNFGHNGNLVVKIESRNPSFSVKCRIGANMV WQAEKDGVLTKDKEIVDATSGNTGIALAYVAAARGYKITLTMPETMSLER KRLLRGLGVNLVLTEGAKGMKGAIAKAEEIVASDPNRYIMLKQFENPANP AIHQQTTGVEIWQATEGKVDVVVAGVGTGGTITGISRAIKLDQGKQITSV AVEPAESPVITQILAGEEIKPGPHKIQGIGAGFIPKNLDLSLIDRVETVD SDTAIKTARRLMAEEGILAGISSGAAVAAADRLAKLPEFQDKLIVAILPS ASERYLSTALFEGIEG >MS1251 cysN, CysN protein MKDYIMSNLNQYAPLRFITAGSVDDGKSTLIGRLLYDSKALLSDQLLSLD KSKSNGEVIDFSILTDGLEAEREQGITIDVAYRYFSTAKRKFIIADTPGH EQYTRNMVTGASTANAAVVLIDASQLDFSKEEVELLAQTKRHSAILKHLN TPHIIVAVNKMDLLNFEQNKFNAITAAYTKLAKQLGLKEVVFVPVSALQG DNIVHKSDATPWYEGEALLTVLENLPTDDHQSEKAEDFHFPVQLVSRLDQ DKQDDFRGYQGRIESGSIRKGDKVRIEPTGYETRITEIYSPNGLVQSAKV GEQVTLRLADDIDISRGDTFLAENSATVATKALKATVCWFDQRALNPARK 
YLLKHTTLTVFAKVSSVDRVLDVQTLSHSAQADSLKMNDIGEVQISLQKP ITATTYAQNIATGSFILIDEATYHTVAAGMILEI >MS0017 cysQ, CysQ protein MQITQQLLDDVLKIASLAGEHLKTFYAKSVNVEIKTDNTPVTEADLFLSQ FLIEKLTVLTPDIPVLSEENCNIPLAERQKWQSYWLIDPVDGTQQFINHT GQFSIMICLVQDNQPQLGIIHAPIIGKTYYARRGLGAFLIENGCCRKLPP LQPHNHQHIKITIGSSNPEKIRQSVQPPYKADLLLYGSSSLKSGLVAEGV ADCYVRLGNTGEWDTAAAEVLLNEVGGKIFNLQYRPLTYNQRETLVNPHF VMANAQLDWKKIFRFDL >MS0622 cysS, CysS protein MSGRDPRRLIKTQILEPSMLKIFNTLTREKEEFKPINPNKVGMYVCGVTV YDLCHFGHGRTFVSFDVITRYLRYLGYDLRYVRNITDVDDKIIKRALENN ETCDQLVERMIAEMHKDFDALNILRPDVEPRATKHIPEIIAMVETLIRRG HAYVAEDGDVMFDVESFQKYGALSRQNLEQLQAGARVEIKSVKKNPMDFV LWKMSKPNEPSWDSPWGKGRPGWHIECSAMNDKELGNHFDIHGGGSDLMF PHHENEIAQSCCAHDGEYVNYWLHTGMLTINEEKMSKSLNNFFTIRDILT KYDAESVRYFFLTAQYRSLLDYSEENIGLARKALERLYTALRGCETVEIP AEDQYVIDFKTAMDDDFNTPGALAVLFELAREINKLKTEDQTKANQLASR LKQLAGVLGLLEQAPETFLQGDAADAEVSKIEALIKRRNEARAAKDWAAA DAARNELTAMGVVLEDGAKGTTWRKL >MS1259 cysU, CysU protein MPGFRRGLTVTILWLTSMIVLPLILLVITALQLKGTEIWQIITSTRVISS ILLSFKMALAATVVNIIFGFLLAWILVRYNFRGKSLVNAFIDLPFALPTA VAGIALASLYAPTGLIGGILAKAGVQIAYTPSGIAIALIFVSLPFVVRAI QPVLANFDPSFEEAAHILGASKWTTLTKVIIPALLPAIIGGAGMGFARSL GEYGSVIFIAGNVPLVSEIAPLIIMSKLDLYDVQGASVVALLMILISFIL IFLVNWLQWAINKRITQVK >MS1260 cysU, CysU protein MEKTDWQKWGLISVGLLFFTIILLFPLLTVFYYALEQGIDLFIKSIQEEE AQAAIWLTVKVALIVLPINIFIGVVMAWTIAYFNFKGKSFLTALLDLPFS VSPVVVGLMFLLMFGIDSFFGQWLATHQIRVIFALPGIVLATLFVTFPLI VKSLIPTMNAQGNSEEQAALILGANSWTLFRKITFPKIKWALIYGVILSN ARAMGEFGAVSVVSGHIRGLTNTIPLYVEISYNEYQFVAAFACASLLALL AIFTLGLQNTLTWLQKRKFNRH >MS1341 cysU, CysU protein MRRKFNAAYFIKISFMLSSLINYFQFSPNEINAIRLSIKVAVVAICCSLP FAIFVAWLLARKNFWGKSLVNGIIHLPLVLPPVVIGYLLLISMGRNGIIG RYLLQWFDFSFGFSWYGAALASAIVAFPLVVRSIRLALESVDFKLEQAAR TLGASSWRVFFTVTLPLALAGVLAGVILGFARSLGEFGSTITFVSNIPNV TQTIPLAMYSFIETPGAESSAARLCIVAIFISLISLLLSEWLAKRTQTKL GQIDVRN >MS1769 cysZ, CysZ protein MLFPTALCMLALIRFLIYIFLGSFMKKEKEIKSGFHYFVMGWHLIGQQGL RRFVVMPVLLNIILLSGLFWLFVSKISDMIEGVISFIPDWLSWLSGILLA LSILMILLVFYFIFNTLSGFIAAPFNGLLAEKAEAMLTGESGENMTTMEF 
IKDTPRMLAREWQKLLYSLPKYIGLFLLSFIPLIGQSLIPVLTFLFTAWM MAIQYCDYPFDNHKISFPTMKFKLNENRIQNVTFGTFVTLCTFVPFINFV IIPVAVCGATAMWVDTYRKQLYLDKNLQKSTAVSTASTEKPGSDIARHSN NIRNR >MS1434 czcD, CzcD protein MNAQAKKDKQAHHEHHAHSQVHTEHEHSQVPKNKMILGISLAIISCYMVV EFIGGYLFNSLTLMADAGHMANDSLSLFLALVALFLSAKAQKWFALLNGT SLVFVAVMILIEAFKRWQAPTEMAALPMMTVAIIGLLVNILVAWIMLKSD QENLNIKAAYLHVLADLFGSVVAIIAGLSAWLLDWQWVDVVASVILSALV LRSGLSVIKQAITALRSDGEEFSMDTHSH >MS1607 dAK1, DAK1 protein MALTKQQILQWLENCNRTFNERRDYLTELDTAIGDGDHGLNMQRGFSKVM DKLPTIKDKDIGTILKNVGMTLLSQVGGASGPLYGTFFIKGAQSAVGKEE ISFEELVQVLKDGVAGIVSRGHAELGDKTMCDVWLPVVNQLEQEDGNQPL DALLKSAVEKANISLQATIPLIAKKGRASYLAERSAGHQDPGATSTTYML EALYNAVK >MS1606 dAK1, DAK1 protein MKKLINSVETVLDEQLQGLAKSHPQLVLNTEPVYVRRADAPVAGKVAIIS GGGSGHEPMHAGFVGEGMLDGACPGAIFTSPTPDQMMECAMAVDSGEGVL LLIKNYTGDVLNFETATELLADMGNQVATVLVDDDVAVKDSLYTAGRRGV ANTVIMEKLLGAAADKGYNLNQLAELGYKLNNQGHSLGIALGACTVPAAG KPSFTLAENEMEFGVGIHGEPGIERRPFENLDKTVQQMFDTIIENGNYER KMRRWDCQANQWNEVTEQKQALQPGDRVIALVNNLGSVPLSELYGVYNKL TECCEKFGLTIERNLIGSYCTSLDMQGMSVTLLKVDDEILSLWDAPVNTP ALRWGK >MS0960 dacB, DacB protein MSKFTKNLFASSLLFSNIAFAQIDVQPLTQILPQGASIGFIAENINQNKI IADHNGQTFMLPASTQKVFTALAAKLALGDEFRFETSLQTQGKVQNNQLD GDLIVKFTGDPDLTTGQLYGLFATLKKQGVNQINGNLILDTSVFASHDRG SGWIWNDLTMCFNSPPAAVNLDNNCFYVNLDANKSVGEFVQFNVPTQYPI QVFGQVRVVGAEEAPYCQLDAVVHDNNRYQIKGCIARQTKPFGLSFAVQD TDAYAAAIVQRQLRQAGIQFSGQVQQPHQPQQGTVLAQHLSKPLPELIKK MMKKSDNQIADSLFRTIAYHTFKRPASFQLGSLALKRILSTQAKIKFGHS IIADGSGLSRHNLVDPNTMLQALNYIARNEDKLHLMDSFPVAGVDGTISG RGSLINPPLIKNVLAKTGSLKGVYNLAGFMTNARGERIAFVQFINGYSTG ELENKTKRAPLVQFESKLYNALYAD >MS1829 dacC, DacC protein MLKNALKKTSLAIFAGLLALPLTVSAEDVNFGIVPPQVNAQTYVVMDYNS GAVLASLNPDQRQYPASLTKMMTSYVIGDALKQGKIHNTDTITVGESSWG KNFPDSSKMFLNLNQQVTVEQLNRGIIIVSGNDACVAMAEHVSGTTDNFI NSMNKYAEQFGLKNTHFTTVHGLDDANQYSSARDMAIIGAHIIRDLPEEY KIYAEKDFTFNKIKQPNRNGLLWDKTINVDGMKTGHTSQAGYNLVASATN ADNMRLITVVMGVPTYKGREVESKKLLQWAFNSFDTFKTLEAGKAVTNQD IYYGNQGKVQIGVLQDRFITVPKGRNADLKARFELDKKYLEAPLAKGQVV GKVIYQLDGKDVAKVDLQAMQDVEEGGIFGKAWDWLVLTIKSLF >MS0849 
dacC, DacC protein MELEQLGFRLFSKNTLAVISFVTEAIGRMLLGERLSSTLL >MS0850 dacC, DacC protein MVYDFTHNKVLESRSPNSILPIASVTKLMTANVFLENNRNPNCSSSITDE DYDHIKGTRTKLPKYTPISCNELLKAMLVHSDNYAAHALSRSAGMSRAQF IKKMNQKAQQLGMNSTRFSDSSGLSSSNVSSPMDLVKLAKYSLDKQLIKT LSNTRATYIRAGRHNVFMQNTNKLVREEMFDAAINKTGYIRESGYNLVFV NKHQCNRSTIGVISLNNASSAYRSNFTKHKLEEYGCLAANDVEINEFNDQ DFENYDEQGLAQLIEQVAK >MS1592 dadA, DadA protein MLKVTTAHIHFNRDNTPVSEQFDDIYFSTADGLEESRYVFQEGNNLWRRW LQFGENHFVIAETGFGTGLNFLAVTALFREFRTQYPDSPLKRLFFISFEK YPMSCADLRSAHQAYPQFNSLAEQLRQNWLQPIVGCYRFHFEETVLDLWF GDIADNLPQLGDYMVNKIDAWFLDGFAPSKNPEMWNENLYKQMFRYTKPA GTFATFTAASAVKKGLESAGFSLQKRKGFGKKRECLQGFKPLNAEQNPAV HTPWLLSRSATLSENTDIAIIGGGISSLFSAISLLQRGANVTLYCEDEQP ALNASGNKQGAFYPQLSDDDIHNIRFYIHAFAYGQQQLRWAIQQGIEFEH EFCGVALCAYDEKSAVKLAKISDYDWDTSLYQPLNQQELSEKAGLPLPCG GGFIPQGAWLAPRQFVQNGFAFAQKCGLKLKTFEKITALSQSEKGWILHN DKNEQFHHETVIIANGHKLKQFTQTARIPVYSVRGQVSQIPTSSQLLKLK SVLCYDGYLTPADQAKQFHCIGASHVRDCEDRDFSLQEQQENQAKIQLNI AEDWTKEVNTADNLARTGIRCAVRDRIPLVGNVPDFERQADEYRNIFNLR RRKQFIPQAAVFENLYLVGALGSRGLTSAPLLGEILASMIYGEPIPLSED ILHCLNPNRSWMRKLLKGTPVK >MS2290 dadA, DadA protein MLKFSYQEHIKTYYYDTRNQDFTQPTLTGGQSADVCVVGAGFGGLSAALE LAERGKSVIVLEGARIGFGASGRNGGQAINGFEDGMDAYIDDMGLEKARK LWEMSLEAIDIIEQRIAKYNIQCDWRKGYATLALNHRRMDDLVTIEQTSR EIFAYDYMQLWNKAELKQYLGSDIYVGGLYDGNSGHLHPLNYCLGLAKAC LDLGVRIFEQSPVIDLDVGKSKVIAETAEGSVTAENVVLATNAYVTSLPK RIQRGTARKILPIDSFIIATEPLDQETANAVINNGMSVCDNNLLLDYYRL SADNRLLFGSDSSSNKDMVQVMRNNMLHVFPQLENVKIDYGWAGPIDMTI NAKPCLGRIASNIFYAHGYSGHGVALTGLAGRLIAEAIEGDDERFAIFES LKSPSVYGGRIVKNLATKIGVKYYKWLDKYR >MS1967 dam, Dam protein MSHSGKTKHGLKHRSFLKWAGGKYRLTDNINNLFPKRRKCLVEPFVGAGS VFLNSQFERYILADINADLINLFNTVKTDVDAYIEALKPVFFHAEANSAG YYYARRDDFNNSTDPFFRSVLFLYLNRFGFNGLCRYNSLNEFNVPFGAYK SHYFPEKELRYFAEKAKSAVFICADFNETFKLADDESVIYCDPPYAPLLQ DSNFTKYAGNDFSVTHQQALAELAKQTVNERNIPVLISNHDTAFTREIYH GAKFKRIKVQRTISQAAERRVKVNELIAVFK >MS0265 dapA, DapA protein MSSTRPLFYGSIVALITPMDGHGEVNYDELKKLVEYHIASGTHAIVSVGT TGESATLSIDENVKTIQKTVEFAAGRIPVIAGTGANATSEAITMTKLLNN 
SGVAGCLSVVPYYNKPTQEGMYQHFKAIAECTDLPQILYNVPGRTGSDMK PETVGRLSKIENIVAIKEATGDVSRVKQIKELAGEDFIFLSGDDATGLES IKLGGQGVISVTNNLAAADMAKMCELALAGNFDEAEAINQRLMGLHHDLF IEGNPIPVKWAAYKLGLIKEPVLRLPLTTLSEAAQPKVLEALKQAGLI >MS0067 dapA, DapA protein MFKPQGIIAPVLTALDDNEKFNPEVYKNYINYLIKAGIHGIFPLGTNGEF YGFNEAEKLEIIKTAIEAADGCVPVYAGTGCVTTKETVEFSKKVVDLGVD VLSIVSPYYIAVTQDDLYRHYATIAENVTAPILMYNIPARTGNNIDYKTI KKLAQYENIIGVKDSSGNFDNTLKYIENTDSRLSIMAGSDSLILWTLLAG GTGAISGCSNVFPELMVSIYEYWKQGDFEKANEAQKKIRDFRNVMQMGNP NSVVKRAAQLRGLGTGPAKEPSNCANNPVIDKALQDVFKLYD >MS0282 dapA, DapA protein MKKINLEKTMSIQGIIPVMLTPFMENNEIDYDGLRKLTDWYIDNGSDALF AACQSSEILFLSLEERVKITKTVMDQVQGRIPVVASGHISDSFEQQVEEL TAIYNTGVDAVILITNRLDPNNEGTTVLKSNFEKLLAALPKDIVLGLYEC PVPYRRLLTDGEISYFAGFENMVVLKDVSCNLETVKRRIQLTKNSNLKIV NANAAIAFEAMKAGSEGFSGVFNNIHPDLYAYLYKNKNSSDPMVQELANF LAICGAAESFGYPNFAKLMHTKIGTFKHYNSRVIKDDIKVKYWAVEELLD HIMQGSERYRNKLNLR >MS0971 dapB, DapB protein MTLKLAIVGAGGRMGRQLIQAVQAAEGVELGAAFERKGSSLIGADAGELA GLGELGIKVAEDLAAEKDKFDIIIDFTRPEGSLEHIKFCVANNKKLILGT TGFDDAGKQAIGKAAEKTAIVFASNYSVGVNLVFKLLEKAAKVMGDYSDI EIIEAHHRHKVDAPSGTALSMGEHIAKTLGRDLKVNGVFSREGITGERKR TDIGFSTIRAADVVGEHTVWFADIGERVEISHKASSRMTFANGAVRAAKW LANKQIGLFDMTDVLDLNNL >MS1177 dapD, DapD protein MSNLQSIIEAAFERRAEITPKTVDAQTKAAIEEVIAGLDCGKYRVAEKID GDWVTHQWLKKAVLLSFRINDNQLIDGAETKYYDKVALKFADYTEERFQQ EGFRVVPSATVRKGAYIAKNTVLMPSYVNIGAFVDEGTMVDTWVTVGSCA QIGKNVHLSGGVGIGGVLEPLQANPTIIGDNCFIGARSEIVEGVIVEDGC VISMGVFIGQSTKIYDRETGEVHYGRVPAGSVVVSGSLPSKDGSHSLYCA VIVKKVDAKTLGKVGLNELLRTIEE >MS1784 dapF, DapF protein MQFSKMHGLGNDFVVVDAVTQNVYFPEEVIKKLADRHRGIGFDQMLIVEP PYDPELDFHYRIFNADGSEVAQCGNGARCFARFVTLKGLTDKKDIAVSTT NGKMILTVQDDGMIRVNMGEPVWEPAKIPFIANKFEKNYILRTDIQTVLC GAVSMGNPHCTLVVDDVETANVTELGPLLENHERFPERVNVGFMQVINPN HIKLRVYERGAGETQACGSGACAAAAIGIMQGLLENKVQVDLPGGSLWIE WQGEGHPLYMTGDATHVYDGVIKL >MS1581 dcd, Dcd protein MRLCDTDIERYLDEGIIEITPRPGNEKINGATIDLRLGNSFRVFRDYSAP YIDVSGPREEVSAQLDRVMSDEIIIRDDEPFFLHPGVLALATTLESVRLP DNIIGWLDGRSSLARLGLMVHVTAHRIDPGWEGRIVLEFYNSGKLPLALR 
PNMIIGALSFEILSNHAARPYNRRRDAKYKNQQSAVASRINQDE >MS1199 dcp, Dcp protein MSNPLLENTPLPQFSKIKPEHIQPAIEQLIQDCRITTENLLKQPQLSWDN FCQPLSEVNDRLSKAWSPVSHLNSVKNSNELRDAYQACLPMLSEYGTWVG QHQGLYNAYVQLKNSPEFAGYSPAQKKAVENSLRDFKLSGISLAPEQQKR YGEIVSRLSELSSQFSNNVLDATMGWDKVITDEEQLKGLPESALQAAKQS AQNKGVEGYRFTLEFPSYIPVMTYCENRELREEMYRAFVTRASDQGPNAG KWDNSAIMEEILTLRVELAKLLGFNSYTELSLATKMAETPAQVLSFLDDL AMRSKPQGEKELADLYAFCEKEFAITELEPWDISYYSEKEKQALYAINDE ELRPYFPEQRVISGLFELIKRIFNIRAVERQGVDCWHKDVRFFDLIDETD EVRGSFYLDLYAREHKRGGAWMDDCIGRKIKADGALQKPVAYLTCNFNAP VGDKPALFTHDEVTTLFHEFGHGIHHMLTKVDIGDVSGINGVPWDAVELP SQFMENWCWEEEALAFISGHYQTGEPLPKEKLTQLLKAKNFHAAMFVLRQ LEFGIFDFRLHDNYKPGKANQILDTLNAVKDQVSVVKAVDWARTPHSFGH IFSGGYAAGYYSYLWAEVLSADAFSRFEEEGIFNAVTGKSYLDEILTKGG SEEPMVLFERFRGRKPTLDALLRHKGIAN >MS0542 dctP, DctP protein MILFTVLNKRYRANYYFGIPFAANDRRKTIDIPPILCMENLHMFMKKKVL TLAISGLLAATVSFSVSAKTTLKLSHNNDKTHPVHISMQAMADEVKKLTD GEVVIRIYPNSQLGNQRESMELIQSGSLDMAKSNASELEAFEPIYGAFNV PYLFKDSEHYYKVLRDPEIGGKILDSTKGKGFIGLTYYDAGSRSFYAKKP IKTPADLKGLKVRVQPSPSALEMMKLMGASATPLAFGELYTALQQGVVDA AENNPTALTLMRHGEVAKFYSEDEHTIIPDVLLISEKSWGKLTPEQQKIV KEAADNSMMSHKDLWAKATEEEIQKAKDTMGVEFVKVDKQPFVDAVKPMH DKALADPVIGPIVQKIDAAR >MS2257 dctP, DctP protein MKLKSVFSNSVLAKAEMTLRLGLEPSIESPQGVGAKEMAKVADELSKGKI AIEFFPDQQLGTGPQMIEMVKKGELDIFQGGAGLYSSIEPRLNVFDIPYL FDSVEQAYKVLDSDFGKEILATLEPANLKGLSFWENGIRSVTSNVKPINT PEDLVGLKIRVMPANQVHVDLWQGVGAKPEPLPYGEIYGKLKSGELDAQE HPIAPIYTGKFYEVQKYLSLTQHMYGPLIQVMNLEKFNALPKETQDILLK ASYAGAVKMRQFSNENAAKFIDDMKNKGMLVNEVDTTPFKAKMRPLVEKP YVEKNGDDWLKKINASIEADRKK >MS0698 dctP, DctP protein MKLKTFILSSLSIVLPLCAVSTNAIAAKVTLKLAHNLEQSHVIHKALDYM AKEVKEKSNGELILRIYPNRQMGDARETIELLQNDALDMTKANSSELEPF VKEMAVFTCPYLFNNDEHFKKVLYGSAGKSITDKTKNSGFTVLSSYVGGS RNFYTKKPIYSPADLKGMKIRVISTPTTNRIIELLGGSPVPVPLGEVYTA LQQGVIDGAENNIPSYTSTRHVEVAKYFTEDQHTSMPDYLVIANKVWNKL DENQQKILLDAAKESEIYQQKLWDEETIHSRREAEKIGTTFIQVDKQPFR DALIPLYNDFKQNPVFSQIIADIEAEAK >MS0526 dctP, DctP protein MKLFSKSIKTILSVGLLGFTINAQAETEIMVAYGNQPGEPIDKAMHFWAD 
KVKEKSNGDIVFKLFPSSQLGSETEVMEQAKFGSNIITISDYGALMDIVP DLGVINAPYISQSFEKKSKLLQSDWFKDLSAKLDQNDIHIIVPDVVYGTR HLLTKKRVTKPADLKGVKVRVQHSRLFLETIKAMGGVPTPMSLSDVYPGL SEGIIDGLENPAVVLFGGKFYEVAKNLSLTAHTKHMSPFVAGTAFWNTLT PEQQQIIVDTSREMVVYGAGLINEAEKDALDKLKAAGVTINEVDLPVFEQ SVGGVISNGFPEWSPNLYKNVQEKLEQF >MS0049 dctP, DctP protein MKLFSLNKLSALIAGVALLSAVTAQAETSLRFGYEAPRSDTQHIAAKKFN DLLKEKTNGEIKLNLFPDSTLGNAQTMISAVRGGTVDLEMSSSSNFTGLV SELNVIDIPFIFKDRTHAYQVLDGEIGQKLLSQLDAHGLKGIAFWEVGFR GFTNSKHPVTKPEDIKGLKVRTNQNPMYIKAFSILGANPVPMPLSELYTA LETKAVDAQEHPIGIVWSSKLYEVQKYFSFTNHGYTPLIVVMNKAKFDGL SPELQKAIVDAAQEAGKYQRQLNLDNEQGIVEKMKKAGIEFVDNLDTAPF KAAVEQETRKAFIDANGDSLIKQIDALGK >MS0050 dctP, DctP protein MKLFNLKTLATLVAGVALMSATAQAEISLRFGYEAPRSDTQHIAAKKFAE LLKDKTKDEIKLKLFPDSTLGNAQTMISGVRGGTIDLEMSGSPNFTGLEP KLNVIDIPFIFKDREHAFKVLDGEIGQGLLKDLESQGLKGLAYWDVGFRA FSNSKHTVTKPEDIKGLKVRTNQNPMYIEAFSLLGGNPVPMPLSELYTAL ETRAVDAQEHPIGIFWSSKLYEVQKFLSLTNHGYTPLIVVMNKAKFDGLS PELQQAVLDSAKEAGAFQRQLNIDNEKEIIGKVRKEGVEVTEQIDQAPFK AVIEEKVRKTFIDKYGKDLVEKIDALAQ >MS1877 dcuB, DcuB protein MLYLEFLFLLLMLYTGSRFGGIGLGVISGIGLVIEVFILRMPLGKAPIDV MLVILAVVTCASILEAAGGLKYMLQIAERVLRSNPKRVTILAPMVTYVMT FMLGTGHSVYSVMPIIGDIALKNKIRPERPMAVSSVASQLAITSSPLSAA IAYYLTQITKMPGYEHITLLNIISVTVPATFVGTMAMALYSLRRGKELED DPEYQRRLKDPTWRDRILNTTATSLDAELPRSAKMAVWLFVLSLVTVVVI AMLPEIRTVGVPVDGKPVKAISMSFIIQMMMLCFGGIILIATKTNPQSVP NGVVFKSGMVACIAIYGIAWMSDTYFSYAMPEFKAAVTTMVESYPWTFAF ALFAVSVVINSQAATAVMMLPVGISLGLPAPVLVGLIPATYAYFFIPNYP SDIATVNFDVTGTTKIGKYYFNHSFMIPGLIGVTTACLVGYALAHMIIV >MS2216 dcuB, DcuB protein MSAMFLIQFAIVLLCILMGARAGGIGLGVFGGLGLAILSFGFGLKPAGLP IDVMFMIMAVVSAAAAMQAAGGLDYMIKIATNILRRNPKYITFMAPAVTW LFTFLAGTGHVAYSVLPVIAEVARHNGVRPERPLSMAVIASQFAIVASPI AAAVVAVVAYLEPQGITLANVLSVTIPATLLGIFLACVFVNKIGVELKDD PEYQRRLQDPEYVKANHADVNMDEIQLKPTAKLSVGLFLLGALLVVVMGA LPELRPSFDGKPMGMAHTIEIVMLTIGALIIFTCKPDGTEITRGSVFHAG MRAVIAIFGIAWLGDTLMQAHMDEVKGMVSGLVETAPWAFALALFILSIL VNSQGATVATLFPLGIALGIPAPILIGVFVAVNGYFFIPNYGPIIASIDF DTTGTTRIGKFIFNHSFMLPGLLSMAFSLGFGLLFANMFL >MS1553 dcuC, DcuC protein 
MDELKPVIAVAGIIATIYLLIKKYETRTVLIGVGLLMSLLTLNPMGALDA FAKSMTSGGLIMAICSSMGFAYVMKYTQCDTHLVHLLTKPLGGLKFFLIP VATVITFFINIAIPSAAGCAAAVGATLIPILKSAGVRPATAGAAILAGTF GSMMSPGSSHSAMISEMSKLTITEVNLTHAPYTMVAGAIGAVMLTLLALF FKDYGDEHRQAYLREQKEAEDKFVKVNVLFALAPLVPLVILVIGGTSLQQ VSWLGWTQMGVPQAMLIGAIYGILVTRISPVKITEEFFNGMGNSYANVLG IIIAAGVFVAGLKSTGAIDSAIGFLKHSNEFVRWGATIGPFLMGIITGSG DAAAIAFNSAVTPHAVELGYTHVNLGMAAAISGAIGRTASPIAGVTIVCA GLAMVSPMEMIKRTALGMVLAILFLALFML >MS1664 ddlA, DdlA protein MKPLKEQKIAVLLGGTSAEREVSLTSGDAVLTALRNQGYDAHPIDPKEYP VAQLKEQGFERAFNILHGRGGEDGVIQGVLEQIGLPYTGCGVMTSALTMD KMRTKMLWKGFGLPIADMEIVTRDTVDELNPLEVVERLGLPLMVKPSREG SSVGLTKVNAVEELKNAVDLALTHDDTVLIEEWLSGIEMTVPVLDDQVLP AVQIIPEGEFYDYDAKYISDNTRYICPAPMSEESLQELQKLVKRAYDVVG CRGWSRIDVMTDANGNFRLVEVNTTPGMTSHSLFPKSAATVGYSFEQLVV KILELSA >MS1118 dedA, DedA protein MDILIDFFINYGYLAVLLVLIICGFGVPIPEDITLVSGGIIAGLGYANPH IMVFVSMFGVLAGDSVMYWLGRIYGVRILRFRPIRKIMTLQRLRMVRDKF EQYGNRVLFVARFLPGLRAPIYTVSGITRRVSYPRFIFLDFLAAIISVPI WVYLGYHGGNNHEWIEAQIRKGQMGIYAVLAIVVIFVGWKVYKSKKAKAD KTN >MS2201 def, Def protein MSVLNVLIYPDERLKTIAEPVTEFNDELQTFIDDMFETMYQEEGIGLAAT QVDVHKRVITIDITGEKTEQLVLINPELLDGEGETGIEEGCLSLPGLRGF VPRKEKVTVKALNRQGEEFTLHADGLLAICIQHEIDHLNGIVFADYLSPL KRNRMKEKLVKLQKQISRHQA >MS1373 degQ, DegQ protein MLKKIIQSAVIGLACAGFILAVLPRFSSTGQPFYSGEDVVLSFKNAVRAA SPAVVNVYNQSLSSSSVDEKFQVNNLGSGVIMSKDGYILTNMHVIQNADQ IVVALQNGALFEATLIGTDKLTDLAVLKIRADNLTTIPQNPDRSIHVGDV VLAIGNPYNLGQSVSQGIISATGRNAIGDSVGRQNFIQTDASINRGNSGG ALINSVGELVGISTLSFGKDPSDVAEGLNFAIPMNIANDVLQKIIRDGRV IRGFFGVQSDIIFNDGSDSEPGVKVKSVVSNGPAAKAGIQPNDVILEFNG EKANSPAQMMQVISNVRPGSVVKVLIERAGNQLELPVKIEEFPDTLPQ >MS0993 degQ, DegQ protein MKKTSFTLTAIALGLSVLAAPTVSVADFSSFFGGDKSESAEQTSANKAAS NVAQSAVNSPFITNSLAPMLEKVLPAVVSIAVEGNQKIAKRSFDIPEEFK FFFGEDFFGDNSSKSSSRKFRGLGSGVIINAEKGYVLTNNHVIDNADKIT VLLQDGREFKGKILGKDSQSDIALVQLENPKNLTEIKFADSDKLRVGDFT VAIGNPFGLGQTVTSGIISALGRSTGSDSGAYENYIQTDAAVNQGNSGGP LINLNGELIGINTAILSPSGANAGIAFAIPSNMANNLAQQIGEFGEVRRG MLGIKGGELNADLAKAFHVDAQQGAFVSEVLPGSAADKAGIKAGDVIIAM 
NGQKVSSFAEMRAKIATSGAGKEIELTYLRDNKKENVKVTLQADDQAQST ANAEAVIPALEGAELTNFNENGKKGVKLSKVAENSPAAQRGLKTGDLIIG VNRIAIEDLTQLRKAMENKDSVIALNIERGNNNFYLLIQ >MS0152 deoC, DeoC protein MHPQELAKFIDHTALTAEKTAQDIIKLCDEAIENQFWSVCINPCYIPLAK EKLAATNVKICTVIGFPLGANLTSVKAFEAQESIKAGAQEIDMVINVGWI KSGEWDKVRSDIQAVLQACNGTLLKVILETCLLTPDEIVKACEICRDLKV GFVKTSTGFNKDGATVEDVALMRQTVGDKLGVKASGGIRDTETAMAMINA GATRIGASAGIAIIKGLQDNSGGY >MS1899 deoD, DeoD protein MTPHINAPEGAFADVVLMPGDPLRAKYIAETFLENAKEVTNVRNMLGYTG TYKGRPVSVMGHGMGIPSCSIYTKELITEYGVKKIIRVGSCGAVRNDVKV RDVIIGLGACTDSKVNRIRFKDNDFAAIADFDMTQAAVQAAKAKGINYRV GNLFSADLFYTPDVEMFDVMEKYGILGVEMEAAGIYGVAAEFGAKALSIC TVSDHIRTGEQTSSEERQLTFNDMIEIALESVLLGDQA >MS1938 dfp, Dfp protein MFPKLWFLGAGGVMRNLNGKRIVVGITGGIAAYKTIEFIRLLRKSNAEVR VVLTAAAAEFVTPLTLQAISGNPVAQSLLDPQAELAMGHIELAKWADAII IAPASADFIARFTVGMANDLLSTVCLASAAPIFLAPAMNQQMFSQAVTRQ NLKSLAERGVKLIGPNSGFQACGDVGAGRMSEPAEIYAALCEALFLRQDL LGIKVAITAGPTREAIDPVRYISNHSSGKMGFAIAQAFADRGAEVTLISG PVNLAAPDKVNRINVVSARQMWQQSMKSAVENHIFIGCAAVADYRVAEVS EQKIKKTDDNDELTLNLIKNPDIIADVAHLTENRPFVVGFAAETQNVEQY AKDKLQRKNLDLICANDVVGGSVFGAEQNTLHLFWQNGEKVLPTDTKKAL AKSLVQEIIELYRK >MS0242 dgkA, DgkA protein MEKTTGLTHFIKSAGYSIQGLKSAIKYEAAFRHELAAGLILIPAALYLAN DKFEMALMIGSYLIVLVTELLNSALEAVVDRIGSERHELSGRAKDQGSAA VFVAIANCVMIWLILLIF >MS1791 dgoA, DgoA protein MTTRCYHLYRYSIPVDSQLILRNRFLKRREGLLVQIKCKENEGWGEIAPL PEYSRETLEQAQEQAIQWLKDWDAARSRNEKLSFDGLYPSVAFGLSCALA EMKGSLQTEGNYQVAPLCYGDPDELYEPLDQMQGEKVAKIKVGMYEANRD GMIADMLLEAIPDLYLRLDANRSWTPAKAAMFAKYVKPEHRPRIQFLEEP CKTREESRRFAEQTGINIAWDESVREPDFEVVAEPHVTAIVIKPTLVGSL EKCVSLIEKAQNLGMKAVISSSIESSFGLTQLARIARQYTPDVTPGLDTL DLMEYQVVRRWPGSGLPVADFNSGFITEINF >MS0689 dgoA, DgoA protein MSTPVITEMQVIPVAGHDSMLLNLSGAHSPYFTRNIVILKDNSGNTGIGE VPGGEKIRQTLEDAKPLVIGKTLGEYKNVMNTVRQTFNDRDAGGRGLQTF DLRTTIHVVTAVEAAMLDLLGQHLGVTVASLLGDGQQRDAVEMLGYLFFV GDRKKTNLAYQSQENDLCDWYRVRHEEAMTPESVVRLAEAAYEKYGFNDF KLKGGVLDGFEEAEAVTALAKRFPQARITLDPNGAWSLDEAIKIGKQLKG VLAYAEDPCGAEQGYSGREIMAEFRRATGLPTATNMIATDWRQMGHTISL 
QSVDIPLADPHFWTMQGSVRVAQMCNEWGLTWGSHSNNHFDISLAMFTHV AAAAPGDITAIDTHWIWQEGNQRLTKEPLQIKGGLVEVPKKPGLGIEIDM DQVMKANELYKSMGLGARDDAMAMQFLIPGWTFDNKRPCLVR >MS1269 dgt, Dgt protein MQKIQLNNIWQQRFITDKPREKDHRPPYRRDRGRILHSAAFRCLQAKTQI HAVGENDFYRTRLTHSLEVAQIGNSLVAQLKFGDSFEHLAAQLNADKTAL QQSLKPLLPSNDLIESLCFAHDIGHPPFGHGGEVALNYMMREHHGFEGNA QTFRIVTKLEPYTLSAGMNLTRRTILGLVKYPTLLDLSSPDYAKSDLQSN GDPRYVKINDWRPGKGLFYDDLPMFEWLLQPLSDADRTLFGQFQQPRQSP SDMLKTKFKSLDCSIMEIADDIAYAVHDLEDAVVVGLVNRQQWQEAEVEL KNCRSNWIQSNIEQITEKLFSDQHYQRKNAIGALVNYFITHIRWKMTADF AEPLLRYNAELPKGVLDALNVFKRFVFKYVIRDVETQRIEYKGQRILTEM FQIFESDPERLLPRNTVKRWQNATDEGKHRIICDYIAGMSDAYALRLYQQ L >MS1361 dinG, DinG protein MANIDQIKAAFSERGQLSSNIKDFRPRSEQLEMAEAVGKAIENKGVLVVE AGTGTGKTFAYLTPALLSKKKTIVSTGSKNLQDQLFKRDLPTIQKALNYS GKIALLKGRANYLCLERLDQVIAQGVLGDKSVLVDLSKVRKWNNATKTGD LSECVELAEDSPILPQLTSTTESCLGSDCPNYGDCYVAAARKRALAADLV VVNHHLFCADMAVKENGFGELIPNAEVIIFDEAHQLPDIASQYFGQSITS RQLFDLCKDINIVYRTEIKDMPQLGVASDHLLKMVQDFRLLLGEGNNRGN WREWLVKPDVQKGFKVLQEKLDFIADVVKLALGRSQTLDSIFERISALKA QLVRLSDTSVTGYCYWFETFNRQFGLHITPLTVSDKFGEQMNNHESAWIF TSATLEVGGSFNHFRQRLGIRATDEKVLQSPFNYPEQALLCVPRYLPGSN QNHTMTKLAEMLLPVIEANKGRCFVLCTSYFMMKGFAEYFREHSGLSILL QGEISKTKLLEQFVSEEHSVLVATSSFWEGIDVRGDALSLVIIDKLPFTS PDEPLLKARVEDCQLQGGNPFNDIQIPEAVIALKQGGGRLIRDVTDSGAV IICDSRLVTRPYGETFLKSLPNAKRTRDLNKVVEFLKSIQQNRT >MS1135 dinP, DinP protein MHKLRKIIHIDMDCFYAAVEMRENPALRDKPIAVGGSVQQRGVLTTCNYP ARKFGLHSAMPTGQALKLCPDLILLPVNITLYKQVSHQIKQIFHRYTDNI EPLSLDEAYLDVTDCVQCSGSATWIAEEIRRAIFNELHLTASAGVAPLKF LAKIASDQNKPNGIFVITPGEVDNFVKTLPLSKIPGVGKVTGQKLLQMGL KTCGDVQKLDLTVLLNRFGKFGQRIWQYSHGIDEREVQSHWQRKSVGVED TLLRNITDIEQGIVELERLYPILEQRIKRACPDIPFERFRKLGVKLKFED FQVTTLEKSAVEFKRENFIVLLRQIWQRRQGRAIRLVGLQVTIPEQKAEQ QMSLW >MS1722 djlA, DjlA protein MNNPFALFDLPIEFQLDQNRLSERYLALQKALHPDNFANSSAQEQRLAMQ KSAEVNDALQILKDPILRADCIIALNTGEQQNTEEKSTQDMAFLMQQMQW REQLEEIENTQDIDGLMTFSAEIEQSNKEKISEISTALSMKDWQQAKLIN DRLRFIKKLMTEIERIEDKLADF >MS1902 djlA, DjlA protein MERTMNFIGKILGFIIGYRFGGLFGGIAGLILGHIADKKLYELGSVNSSF 
FSKKITRQSLFMQTTFAVLGHLSKAKGRVTEEDIQLANNLMSQMQLDVAN RQLAQNAFNRGKEADFPVREVIREFRIGCGQRADLLRMFLHIQVQAAFAD SNLHNNEKELLFVIAEELGLSRFQFDQMLAMEMAARQFTQGGFYRQQQYQ QQSHQQYNQENYQNSYRTSSGPTVEDAYKVLGVNAGDNQQTVKRAYRRLM NEHHPDKLVAKGLPKEMMEMAKEKAQQIQAAYDLICKVKGWK >MS0927 dksA, DksA protein MTTNTNKASLSLLALAGVEPYKEKAGEEYMNEAQLLHFKKILEAWRNQII QETTRTVSHMQDEAANFPDPADRATQEEEFSLELRTRDRERKLMKKIEST LKKLETEDFGYCDSCGVEIGIRRLEARPTADLCIDCKTLAEVREKQMGY >MS1568 dltE, DltE protein MAILITGASAGFGKAACITLVKAGYKVIGAARRLEKLTELKQQLGENFYP LQMDVSQTAEIDSALASLPADWAEIELLVNNAGLALGLEPAYKVNFDDWL TMINTNIIGLTYLTRQILPQMVERNKGHIINLGSIAGTYPYPGGNVYGAT KAFVKQFSLNLRADLAGTAVRVSNIEPGLCGGTEFSNVRFKGDDEKAANV YKNTLSIQPEDIANTILWIYQQPAHVNINRIEIMPISQSSGALNVVRE >MS2336 dmsC, DmsC protein MATIIVIYQGFGLSQIHSSAQQAVALVPDFAVNQVIRLCLLAAAGMVLLK SKQPLLLSIAVILALFAEMIGRELFYSLHMTVGMA >MS1878 dnaA, DnaA protein MSEHQLPLPIHQIDDETLDNFFVGHNDLLVDSLSKNIACLKQQFFYVWGA EGSGKSHLLKAVSNQFLLQNRPAIYVPLSKSQYFSPAVLENLEYQDAVCL DDLQLVVGNEEWEIAIFDLFNRIKEKENTLLLISANQSPNALPIKLPDLA SRLTWGEIYHLNVFTDEEKILVLQRNAHERGIELPDETANFLLKRLDRDM HTLFDALLKLDKASLQAQRKLTIPFVKETLGL >MS0485 dnaA, DnaA protein MERDLSQLWQNCLLQLQDQISSSDFGLWLRPLQADTSMPNTIVLYASNMF VKSWVENNYLAQITKIAQDLSNNTDLVIKVQEGSKPAARKVVAQQEIANT PVQHSAPMPENEPQAAFRSNLNQHHLFENFVEGKSNQLARAVGQKVANRP GDKSANPLFLYGGTGLGKTHLLHAVGNGIIAGNSNARVVYIHAERFVQEY VKALKAERIENFKKFYRSLDALLIDDIQFFAGKDGTQEEFFNTFNSLFEG EKQIILTSDRYPREIEKIDDRLKSRFSWGLSIAIEPPDLETRVAILMKKA EEKNIYLPEEVAFFIGQKLRTNVRELEGALNRVHANADFTGKAITIDFVR ETLKDMLALQDKLVTVENIQKMVAEYYRIKVSDLKSKNRSRSIARPRQLA MALAKELTNRSLPEIGKAFGDRDHTTVLHACRTIAALRDDDNNIQEDWSN LIRTLSA >MS1183 dnaB, DnaB protein MVYDIAVFSVLIESFFMARQPSQSPDKQTAQINIPPHSIEAEQAVLGGIM LNNSHWENVVEHVITEDFYTAAHRLIFREMEELARQNHPIDLITLDQALK NKGVVEDVGGFAYLAELSKNTPSAANIIAYADIVREKAVLRELIGVGNTI AQSAYSPKGREVKEILDEAEREVFKIAEKRSAENEGPENILNVLERTIDK IEFLSKNQHANGGVTGVTTGFKDLDKKTAGLQPSELIIVAARPSMGKTTF AMNLCENAALSSEKPVLIFSLEMPADQIMMRSLASLSRVDQTKIRTGQIT EDDEWARISSTMGMLTNKPNMYIDDSAGLTPTELRSRARRVYRENGGLSL 
IMIDYLQLMRAPGFDNRTLEIAEISRSLKALAKELEVPVVALSQLNRTLE NRTDKRPVNSDLRESGSIEQDADLIMFIYRDEVYHETTEENHNVAEIIIG KQRNGPIGRVRLTFQGQYSRFDNYAGGHQFNDDDY >MS0574 dnaE, DnaE protein MPEPRFVHLRVHSDFSMIDGIAKVKPLVKTCVQENMVAMALTDFTNFCGL VKFYGEALGSGIKPIMGADVSVKSDLCGDEHFELTLLAKNNAGYKNITLL LSKAYQRGYEDVPYIDQDWLAEYNEGIIVLSGGRKGDVGKKLLKTGAADE VESAVGFYQKYFPDHYYLSLSRTGHNEEETYIKTALKLAEKHNLPVVATN DVVFLKSEDFEAHEIRVAIHDGFTLDDPKRPKLYSDRQYFRSEQEMCELF ADIPSALENTLLIAQRCNVTIRLGEYFLPQFPTGELSTEDYLIKRAKDGL EERLKVLFPDEKEREEKRPAYDERLDTELGVINQMGFPGYFLIVMEFIQW SKDNNIPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP DFDVDFCMDNRDKVIEHVADMYGRGAVSQIITFGTMAAKAVIRDVGRVLG HPYGFVDRISKLIPPDPGMTLAKAFDAEPQLQQIYDSDEEVKALIDMARK LEGVTRNAGKHAGGVVISPTLITDFSPLYCDSEGKHPVTHFDKNDVEYAG LVKFDFLGLRTLTIIKWALDMINARMDRDGKPHIDINHIPLDDPESFNLL LKSETTAVFQLESRGMKDLIKRLQPDCFEDIIALVALFRPGPLESGMVQN FIDRKHGREEVAYPDAQYQHECLKPILEPTYGVIVYQEQVMQIAQELAGY TLGGADLLRRAMGKKKPEEMAKQRSVFEKGAIEKGIDGELAMKIFDLVEK FAGYGFNKSHSAAYALVSYQTLWLKTHYPAEFMAAVMTSEMDNTDKIVGL YDECLRMGLTVTPPDINTGKHHFSVNDHGEIVYGIGAIKGVGEGPIEALV SAREKGGIFKDLFDLCARVDLKKINRRTFESLIMSGAFDKLGPHRAALSK NLEDALKASDQHAKDEAAGQADMFGVLTESPEEVEIAYANTPRWSEKQIL DGERETLGLYLSSHPISRYLKELSHYSPNRLKDLVPNIRGQVSTASGLVV ASRFAVTKKGNRLGIATLDDRSGRLDITLFAEALEKFGEKLQKDSVVVVS GQVSFDDFTQGLRMSVRDLMTLDEARSRYAKSLAISLSQQQITPQFLKRF KSVIEPYSGGTMPINVYYQSPQGRALLKLGIQWYIKPTDELLSELVNMLG ESAVELEFE >MS1761 dnaG, DnaG protein MGVPIPRSFINDILAKADIVDVVNSRVKLKKAGTNNYQACCPFHHEKTPS FTVSKNKQFYHCFGCGAHGNAIGFLMEYDKLEFLEAVEELANFLGLEVPR EAGSDKKFEKSQPHYQNKRNLYELMHDIAEFYRQQLPHSIPAQAYLQKRG LSEEVIERFAIGFVPDSFNAVLRRFGTTKAEQQKLFDLGMLSRNDRGDIY DRFRNRIMFPIRDRRGRTIAFGGRVLTDERPKYLNSPETLTYHKGNEIYG LYEALQINDSPEMLLVVEGYMDVVALAQFGVNYAVASLGTATTAEQIQLI FRASEQIVCCYDGDRAGREAAWRALENALPYLQDGRQLKFVFLPDGEDPD TYIRQYGKDAFEDYIQKALSLSDFMFTHLIEQVDLSSKEGKSKLAALAVP LIKRIPGQMLRLYLRNILAQKLGIIDQTQLESLIPSKIEQPEAAIEKSPA VKRTPMRLLIGLLLQNPQLAQLDYDLEPLKSLNEPGFELFYALTKLCRDN MGITMGQILEYWRDSQYSKPLEILAIWDHLVTDDKIQETFLETLLYLYVR FTDQNIERLIAKDRSTGLSPEEKQELAQLLARPQQNNS >MS0899 dnaJ, DnaJ 
protein MGERAHPTLVTKIMAKQDYYETLGVQKGADEKEIKRAYKRLAMKYHPDRT NGDKAAEEKFKEVNEAYEILMDKEKRAAYDQYGHAAFEQGGFGGGAGGFG GGFGGFGGFEDIFSEMFGGGASRQRVVRGEDLRYDIEITLEEAVRGTTKD IKINTLAACDHCDGSGAEKGSKVETCPTCHGHGRVRRQQGFFMTETTCPT CQGSGKKIEKPCKHCHGDGRVHKKKNLSVKIPAGVDTGNQLRLSGEGAAG ENGAPAGDLYVVIHVKDHHIFERDGSNLYCEVPISFTMAALGGEIEVPTL DGRVKLKIPAETQTGKLFRMRGKGVTSTRAGYAGDLICKIIVETPVKLNE EQKELLRKFEESLEGQSKQRPKSSSFLDGVKKFFDNLGK >MS0898 dnaK, DnaK protein MNLTRRIKMGKIIGIDLGTTNSCVAVMDGDKPRVIENAEGERTTPSIIAY TQDNEVLVGQPAKRQAVTNPKNTLFAIKRLIGRRFEDQEVQRDVNIMPFQ IIKADNGDAWVDVKGDKLAPPQISAEVLKKMKKTAEDFLGETVTEAVITV PAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALAYGLDKGKGNQTIAV YDLGGGTFDLSIIEIDEVGGEKTFEVLATNGDTHLGGEDFDNRVINYLVD EFKKEQGVDLRNDPLAMQRLKEAGEKAKIELSSAQQTDVNLPYITADATG PKHLNIKLTRAKLEALVEDLVARSMEPVKVALSDAGLSVSQIDDVILVGG QTRMPLVQQKVAEFFGKEPRKDVNPDEAVAVGAAVQGGVLAGNVTDVLLL DVTPLSLGIETMGGVMTTLIEKNTTIPTKKSQVFSTAEDNQSAVTIHVLQ GERKQASANKSLGQFNLEGINPAPRGMPQIEVTFDIDADGIIHVSAKDKG TGKEQQITIKASSGLSDEEIQQMVRDAEANAEADRKFEELVQARNQADAL VHSTRKQLTEAGDKLSADDKAPIEKAVNELEAAAKGEDKAEIEAKIQALI QVSEKLMQAAQQQAQADAGAQQAQGNNGGDDVVDAEFEEVKDNK >MS1721 dnaK, DnaK protein MALLQIAEPGLMAAPHQHKLAVGIDLGTTNSLVATVRSAHTEILLDEKDR PLVPSIVHFGDNNEITVGYEAGELASIDPQNTVISVKRLIGRSLEDVQAR YPNLPYRFEASENGLPLISTRKSAVSPVEVSSEILKKLTALAKRRLGGEL QGAVITVPAYFDDAQRQSTKDAAKLAGLNVLRLLNEPTAAAIAYGLDSGK EGVIAVYDLGGGTFDISILRLSKGVFEVLATGGDTALGGDDFDHLVADWI TEQSGISPQDDKQKRQLVELATRLKIQLTDNETVAIQYQNWHGKISRNQF NQLIQPLVKRSLISCRRALKDANVTADEVNEVVMVGGSTRVPFVREQVGE FFKRQPLTSIDPDKVVALGAAVQADILVGNKPDSEMLLLDVIPLSLGIET MGGLVEKIIPRNTTIPVARAQEFTTFKDGQTAMTVHIVQGEREMVADCRS LARFTLRGIPPMAAGAAQVRVTYQVDADGLLNVTAMEKSTGVQSSIQVKP SYGLTDDEITQMLKASMDNAKQDIDARLLAEQRVEAKRVIESVLSALSHD RDLLNDEELSAIKKALVELDKLQQQNDTLAIKQGIKDLDAATQEFAARRM DKSIRSALTGHSVEDI >MS0486 dnaN, DnaN protein MRERAMQFIVSRDNLLKPLQQVCGVLSSRPNIPVLNNVLLQIADDCLTIT GTDLEVELSTQAKLISGTEGKFTIPAKKFLDICRSLPDEAEIHVTFEEER AIVRSARTKFNLATLPAEEYPNLADWQSEVDFTTEQATLRRLIEATQFSM ANQDARYFLNGMKFETEGNLLRTVATDGHRLAVCTIALEQDLQNHSVIVP 
RKGVLELARLLEATDAPARLQIGTNNLRVQLANVVFTSKLIDGRFPDYRR VLPRNADHILEADWDVLKQAFVRAAILSTERFRSVRLQLDQNQMKITATN PEQEEAEEIIDVSYSGNEMEVGFNVSYILDVLNALKCQRVRMRLTDASSS CLIENCEDASAEYVIMPMRL >MS1570 dnaQ, DnaQ protein MKVEIDLNRQILLDTETTGMNQFGAHYEGHCIIEIGAVEMINRRYTGRKL HLYIKPDRLVDPEAIKVHGITDEMLEDKPVFTEVAQEFIDFIKGAELLIH NAPFDVGFMDYEFRKHHIDVKTADICSVTDTLQLARQMYPGKRNSLDALC DRLGIDNTKRVLHGALLDAEILGDVYLVMTGGQTSLFDDNEPELADIHSA KAHILAQNADKVAHHLSLLQPTDEELQAHLEYIKLINKKSKDNCLWEKRL GSDSNEETQH >MS0702 dnaQ, DnaQ protein MVTETQNETETTEKIDYNLLKNRFRGYYPVIIDVETAGFNAKTDALLELA AITLKMDENGLLVPDQKCHFHIKPFEGANINPESIKFNGIDIDNPLRGAV PESEAITGLFQMVRKGQKNAGCQRSIIVAHNATFDQSFVMAAADRTKIKR NPFHPFSSFDTASLSGLMFGQTVLVKACQAAHIAFDGKQAHSALYDTERT AELFCYMVNHLKALGGFPHIAEN >MS0871 dnaX, DnaX protein MSYQVLARKWRPKNFAEVVGQEHILAALSNGLRENRLHHAYLFSGTRGVG KTSIARLFAKGLNCMDGVTAEPCGKCAHCKAIEEGNFIDLIEIDAASRTK VEDTRELLDNVQYKPVQGRYKVYLIDEVHMLSRHSFNALLKTLEEPPEYV KFLLATTDPQKLPVTILSRCMQFNLKALDQKQISHHLQHILKEEEIPYEM TALDKLAKAARGSIRDSLSLTDQAIAMSNGNISRDVVRVMLGLLDDNQPI EILYALQQGNGENLMKVIQAVADKGGNWDELLIEVGETLHQIAMQQLLPS TSNDETQIGFLAKHIAPEDVQLFYQIVVNGRKELAFAPNPRIGVEMTLLR ALAFHPKLVQSQPSQQEQLSNVQTYVQSAVKKTENLVDMPVVSQSIKAKY ESPAHSAAANAEQPSSAALSALEQIQKLRSQASGNGEKKNVNVTSSPLTE TDSSSLSDLSETSPKVTALPVVTMQNKSKKQADLLDRLVNLSNSKNTETE NAEDSAENTENDSEDETNLAETYRWEWTNPELAQEETAVRPSDIKKAILQ EKTPEVITKVIAMADERDEWTKTVSQLHLDELKLVKQIALNSVVLIQHEN EMKLGLRSAQKHLVRDKSVEILQDALTKFYGKTINLTIDFNDDESLFTPL DHRRQIYQELSEQAKEDLLKDKKVRLLQDMFDAKLDMDSIRPV >MS0465 dppB, DppB protein MQHYFIRRLIMMIPLMLLISFVAFSLMNLVPSDPAETMLRINNITVTDEA VKEARQALGLDKPFLLRYALWLYALLQGDLGKSFLSNQNVWDEITQAFPA TFYLAVTAFAVIFLLSLTLSLLCMLMLNSLWDKIIRGILFFFTALPNYWL ALLFIWLFSVRLNWLPSNGLEQKSGIILPALTLSLGYIGVYVRLLRGAML NQLQQPYVFYARTRGLSEKQILFKHILQNSLHTSYIAMGMSIPKLLAGSV IIENIFALPGLGRLCIQAIFGRDYPVIQAYILLMAMLFLVGNFVIDWLQH RRDPRIKRGY >MS0855 dppB, DppB protein MLYSFLRRLFLLLIILVILSAVSYTIFMRDPINQVFAEPYFYSGYFTYVD SLLKGDLGITYNGGDSLLMLILTVLPPTLELCFAAMIVAFLFGVPFGLLG AFFNKNIFGKAINAVSSLGLSVPVFWIAPILLYVAAIQHWQISAVGQINL 
LYEIKPITGFATIDVWFVDEPYRTKVIQNVFQHLILPTLVLAITPTMEIT KLIQQRTEYILAQNYVKLSITRGWPIWRILTKYVLRNSLPLVIPQIPRLI TFVLAQGMLIEGVFGWPGIGRWLIDAVSQQDYNSISAGVIVIGLFIIVIN ALTEILTFILDPFNKKGWYAR >MS1367 dppB, DppB protein MFKFILKRILMVIPTFLAITLVTFALVHFIPGDPVEIRMGERGVDPIVHA QMMEQMGLNDPLPEQYLNYIKGVVQGDFGRSFRNNEPVLKEFFTLFPATV ELAFFALLWSLIAGIFLGVIAAVKKDSWISHTVTALSLTGYSMPIFWWGL ILILYVSNFLGLPAGGRLPDEYWIDFDTGFMLIDTWNSGEPGAFVAAIKS LILPAVVLGTIPLAVVTRMTRSSMLEVLGEDYIRTAKAKGLSTTRIVIVH ALRNALIPVITVVGLIVGQLLSGAVLTENIFSWPGIGKWIIDAINARDYP VLQGSVLIISTIIIVVNLLVDVIYGVVNPRIRHN >MS1366 dppC, DppC protein MTTEITSSTPQTPLQEFWYYFRQNKGAVIGLTFIAAVFFICICAPFVSPY DPIVQHRDALLLPPAWMENGSLSYFLGTDDIGRDILSRIIYGARLSVFIG LLIVILSCIFGVILGLLAGYYGGLLDVIVMRLMDIMMAIPSLLLTIALVT ILGPSLFNAAIAIAIVSVPSYVRLTRASVLNEKNRDYVVASRVAGAGVLR LMFIVILPNCLAPLIVQMTMGISNAILELAALGFLGIGAQPPTPELGTML AEARSFMQAASWLVTIPGVAILLLVLAFNLMGDGLRDALDPKLKQ >MS0464 dppC, DppC protein MSGFIKQLRSDIFAQCCLFILTMIGLAGIFAPWICTFDPATIDMQAKLLP VSAQHWLGTDHLGRDIFSRLIWGVRSTVFYGLFAMLLTMMLGILIGMTAA IGGKKTDEFIMRLCDVLLSFPGEIMILALVGMLGPGIEHILVAVILVKWA WYARMIRGTVMQYTHKNYVHYSQAIGVSPWRIIRRHLLPVATAELIILAS ADMGAVILLISGLSFLGLGVQPPTPEWGAMLSDAKNIMLLYPQQMLPAGL AITLTVTAFNGFGDFLRDVLDPDNPLKGTNNE >MS0854 dppC, DppC protein MQDKEPYEFRQTETLKAIWHDFRKDRIALFGLYIFILLILTAVFAPWIAP YASDDQFVGRELMPPSWFPNGEVTYFFGTDDIGRDIFSRLINGVSYTFGS AAIIIIFTAVVGGMLGILAGMSSGMKSRILGHFLDAFLSVPILLIAMIIA TLMEASLLNAMLAILLALLPHFIHEIYQAIQQELKKEYVLMLRLDGASNM ELLRETILPNISVRYIKELIRSFVVAILDISALSFISLGAQRPTPEWGAM IRDSLELIYLAPWTVILPGLAIIMTALVVIIFGNGLCKAISKHYE >MS0463 dppD, DppD protein MNKPIIRFDNFSIENPDSDRPLIAPLNLTLPPYRTLALVGESGSGKTLLG RSILGLLPEQLNTTGNIYFQDKKIISVTGTPTVDDKQKTNEIATLEIRGK AVSFIMQNAINAFDPLFSLQDQFCETLQKHTALSYRQALIKAQQSVSKVK LSSALLKRLPSQLSGGQLQRMMLALTFALEPELVIADEPTSALDSLTQFE LLPLFKQMAKERSMIFITHDLALVQELADDIAVLKRGEIVEFRAKSILFS HPQHPYTQYLLAMRAKLNQPFARLVRKKQ >MS0853 dppD, DppD protein MALLDIRNLTIKVNTPNGYVSVVDNVNLTLNEGEICGLVGESGSGKSLIA KVICNTSKDNWIITADRFRFNDVELLKLSPYKRRKLVGKEISMIYQEPLS 
YLDPSKKIGQQIMQNIPSWTFKGKIWHWFGWRKRRAIELLHRVGIKDHKD IMNSYPNEITEGEGQKVMIAIAIANQPRLLIADEPTNSLESTTQLQIFRL LSSMNQNNGTSVLLASNDMAGISEWCHSFIVLYCGQNAESGPKENILETP HHPYTSALLHSMPDFSQPMPLKSKLNTLRGTVPLLEQMPMGCRLGPRCPF AQKKCIKKPPLRRIKQHEFACHYPLNLLETNRKEKDTITPLTLSPESKIS >MS1365 dppD, DppD protein MSLLNVNQLSVHFGDGKAPFKAVDRISYSVNKGEVLGIVGESGSGKSVSS LAIMGLIDYPGRVSAEALSFDGVDLLSLNEKQKRKIVGADVSMIFQDPMT SLNPCYTVGYQIMEALKAHQGGSKKERRERTVELLKLVGIPAPESRLDVY PHQLSGGMSQRVMIAMAIACKPRLLIADEPTTALDVTIQAQIVDLLLTLQ KQENMALILITHDLALVAEAAHRIIVMYAGQVVEEGRAEEIFKRPKHPYT QALLRSLPEFAEGKSRLQSLQGVVPGKYDRPQGCLLNPRCPYATEHCRRV EPDLIQLGEGKVKCHTPLNAQGEPSNV >MS0670 dps, Dps protein MNTKTISFPSLTLTEKSQALTADINKNATHSVPGIDVNTGHSIAEALQAR LQGLNELALILKHAHWNVVGPQFIAVHEMLDSQVDEVRDFVDEIAERMAA LGVAPNGLSGNLVANRQTPEYPLGRASAQDHLRIIDKFYSFNIESHRVVL AHYGELDPISEDLLVAQTRALEKLQWFIRAHLDNGNGSI >MS0429 dsbB, DsbB protein MLSFFKTLSMGRSGWLLLAFSALVLELVALYFQYGMQLQPCVMCVYERVA LGGILFAGIIGAIAPSSWFFRFLGIIIGLGASVKGFLLALKHVDYQLNPA PWNQCAYLPEFPQTLPLDQWFPYLFKPIGSCSDIQWSFLGFSMAQWILVM FAFYSILLAIILISQVKAGKPKHREIFR >MS1540 dsbG, DsbG protein MKKFVTALSLMAISMAATADNAQITTQLKKLGATNIEVKDSPISGIKTVV TNEGVLYTTEDGKYVLQGKLFELTDKGPVDVTGKALLATLESYKNEMIVY PAKNEKHVVTVFMDITCGYCQKLHSEIKEYNDLGITIRYLAFPRGGLGTK TAKEMEAIFTAKDPAFALDEAEKGNPPKELKAVNITKKHYELGVQFGVRG TPSIVTRSGELIGGYLPPKELLSALESVK >MS0846 dsrC, DsrC protein MLNINNTQIETDPAGYLLNLNEWNEDVAKAIAEKEGVVLTEAHWEVIYFV REFYQEYKTSPAIRMLVKAMAQKLGEDKGNSRYLQRLFPDGPAKQATKLA GLPKPAKCL >MS1019 dsrE, DsrE protein MQKLLFILNESPYGTEKTFNGLRHAVNLLEEHGKEVEVKVFCFSDAVLAG LSGQNPNDGPNVQQTLEVLAGLGAEVKLCTSCTKARGITQLPLVKGVSLG TLDDVSDWTLWADKVINF >MS0159 dsrE, DsrE protein MRYVLSVRQPVYGSQGAYLAYQFAQELICQGHLISQIFFSQEGVSNGNGL VYPANDEFNLVKAWQTFSKKHNVPLHLCIAASQRRGVVDKLTALDPAQTN LAEGFVLAGLGEFSKAMLEADRVITL >MS0160 dsrF, DsrF protein MKLAFVFRQSPHGTAISREGLDALLAATAFCDEEDIAVFFMADGVLNLLA NQQSDLILQKDIASAFKLLDLYDIGQRYICAESMDDFALSYDDLVINCEK IDRTLMLQKLQQAEKIITF >MS0031 dtd, Dtd protein MIALIQRVSQAKVDVNGQTVGQIGGGLLVLLGVEKEDSKEKADKLAEKVL 
NYRIFGDKNDKMNLNVQQTDGELLVVSQFTLAADTGRGLRPSFSKGAPPQ LANELYQYFVQKCGEKVRVETGKFAENMQVSLTNDGPVTFWLKV >MS1937 dut, Dut protein MKKIDVKILDSRIGNEFPLPAYATSGSAGLDLRALIEEGFDLQPGETKLI PTGLSIYIADPNLAAVILPRSGLGHKHGIVLGNLVGLIDSDYQGPLMVSM WNRGEQPFRIEVGDRIAQLVFVPVVQAEFNIVTDFTQTERGEGGFGHSGK Q >MS1928 dxr, Dxr protein MRMPAGFLINAMKKQNLVILGSTGSIGKSTLSVIEHNPEKYHAFALVGGR NVDLMVEQCVKFQPEFAALDDENAAKQLAEKLKSAGKKTKVLAGQKAICE LAAHPEADQVMAAIVGAAGLLPTLSAVQASKTVLLANKETLVTCGQIFID EVKRTKARLLPVDSEHNAIFQSLPPEAQQQIGFCPLKELGINKIVLTGSG GPFRYTDLTEFDNITPEQAVAHPNWSMGKKISVDSATMMNKGLEYIEARW LFNAGAEEMEVIIHPQSIIHSMVRYIDGSVIAQMGNPDMRTPIAETMAYP GRIVSGVTPLDFYQLSGLTFLEPDYERYPCLKLAIEAFAAGQYATTAMNA ANEIAVEAFLNRMIKFTDIARVNAKVVELIQPQQINCIDDVLAVDKQSRL VAKEVIVSLKA >MS1059 dxs, Dxs protein MQNYPLLSLINSPEDLRLLNKDQLPQVCNELREYLLESVSRTSGHLASGL GTVELTVALHYIFKTPFDQLIWDVGHQAYPHKILTGRRERMTTIRQKDGI HPFPWREESEFDVLSVGHSSTSISAGLGIAIAAEKENAGRKIICVIGDGA ITAGMAFEAMNHAGALHTDMLVILNDNEMSISENVGGLNNHLARIFSGSI YSSLRDSSKKILDTMPPIKNFMKKTEEHMKGVISPISTLFEELGFNYIGP IDGHNIEELIATLSNMKTLKGPQFLHIRTKKGKGYTPAEQDPIGFHGVPK FDYHTGQLPKSTATPTYSQIFGEWLCETAEQDEKLIGITPAMREGSGMVE FSNRFPDQYFDVAIAEQHAVTLAAGLAIGGYKPVVAIYSTFLQRAYDQVI HDVAIQNLPVLFAIDRAGIVGADGPTHQGAFDLSFLRCIPNLIIMAPSNE NECRLMLHTGYCCGKPAAVRYPRGNAIGVELEPLRKLEIGKSNLVRQGQD IAILNFGTLLPNALDVAEKLNATVVDMRFVKPLDHERINELAKTHRTLVT LEENTIQGGAGSAVSEVVNIQQHHVNILHLGLPDEFVAQGTQQEVLKELK LDATGIEEQIKNFLRIA >MS2206 ebgC, EbgC protein MYIGDLNRNDYQRDLPKVLADVCDYLKTLDLSALENGRHEINENIFMNVM TPTSDAAENKKSELHHRYIDIQLVISGLDGMEYSVTEPALEKYEEYHQEE DYQLTAAEIADKNWIVVRPNQFVVFYPYEQHKPCCNVNGQAELKKLVVKV PVALL >MS0053 ebgC, EbgC protein MIFGHIAKVNPKQYPQAIRFALDYLAKTDFDSMEAGRYPLKDDKIYVQVL DLETKPKAEYLPEVHRNYLDVQYLHSGTEIMGVSTDLGNNAVAVEYNPER DILYYAEAENEQELHCQPGNFAVFFPEDAHRAAIYNGSEKIRKIVVKIAM SEI >MS0566 eda, Eda protein MAYTTAEIIEKLGALKVVPVIALDDAEDILPLAATLAENGLPVVEITFRS AAAEEAIRLLRQTNPDILIAAGTVLTPDQVVRAKNAGVDCIVTPGFNPNI VRLCQELNIPITPGVNNPMAIEGALELGVSAVKFFPAEASGGVKMIKALL GPYQQLQIMPTGGINVNNIRDYLAIPNVVACGGSWFVEKSLIANKHWDEI GRLVREVLALVR >MS0546 
eda, Eda protein MAYTTEQIIEKFSALKVIPVIAVEEAQDIIPLVKTLSENGLPVAEITFRS AAGEEAIRLTRQHFPDVLIAAGTVLTPAQVVAAKNAGADCIVTPGFNPNI VKLCQQLELPITPGVNNPTAIEAALELSINAVKFFPAEASGGVKMIKALL GPYANLQIMPTGGISTANIKDYLTIPNVVACGGSWFVDKALIKAKNWAEI GRLVREAVELVR >MS0508 efp, Efp protein MRIIMATYTTSDFKPGLKFMQDGEPCVIVENEFVKPGKGQAFTRTRIRKL ISGKVLDVNFKSGTSVEAADVMDLNLNYSYKDEDFWYFMHPETFEQYSAD SKAVGDAEKWLLDQAECIITLWNGSPISVTPPNFVELEVVDTDPGLKGDT AGTGGKPATLSTGAVVKVPLFIQIGEVIKVDTRSGEYVSRVK >MS0266 elaA, ElaA protein MLFMRIHPMNWQCKTFNQLSNIELYQILQLRSDVFVIEQQCIYRDMDNKD LLASHLFLSKDNQIVAYCRLLPKGVSVADAAIGRVIIHEKYRGRHLAHKM MGKAIDIIIHEWHENKIYVQAQEYLQGFYQSLGFKATSDVYLEDEIPHLD MYWES >MS0144 emrE, EmrE protein MNPWILLAISILLEIIATSLLKLSDGFTKLIPTVGSMLLYGLSFYCVSIV YRTLPVGIVYAVWSGVGIVLTAIIAYFAFGQKIDGSGLVGMLLIVGGVLI INVFSRSV >MS0256 eno, Eno protein MAKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGLAAAPSGASTGSREALE LRDGDKSRFLGKGVLKAVGAVNNEIAGAIVGKDASNQAEIDQIMIDLDGT ENKSKFGANAILAVSLATAKAAAASKGLPLYAYIAELNGTPGVYSMPLPM MNIINGGEHADNNVDIQEFMIQPVGAKTLREALRIGAEVFHNLAKVLKAK GLNTAVGDEGGFAPNLGSNAEALACIKEAVEKAGYVLGKDVTLAMDCASS EFYNKENGKYEMKGEGRSFTSQEFTHYLEELCKQYPIVSIEDGQDESDWE GFAYQTKVLGDKVQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSL TETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSR SDRIAKYNQLIRIEEALERAGTPAPFLGLKAVKGQA >MS0367 era, Era protein MTETKPENIVQHNETTAAEQETYCGFVAIVGRPNVGKSTLLNKILGQKIS ITSRKAQTTRHRIVGIHTEGPYQAIYVDTPGLHIEEKRAINRLMNRAASS AISDVDLIIFVVDGIHWNADDEMVLNKLRASKAPVVLAINKIDNIKNKDE LLPFITELSGKFNFKEIIPISAQRGNNVHNLQKVVRQSLRKGVHHFPEDY VTDRSQRFMASEIIREKLMRFMGEELPYSVTVEIEQFKMNERGTYEINGL ILVEREGQKKMVIGQGGQKIKTVGIEARADMERLFDNKVHLELWVKVKSG WADDERALRSLGYMEEY >MS2051 eriC, EriC protein MFNLKHFKTRLGFIIHKKLRQTHRISHKSIEFICLLSGAALVALFSLAFA KLSDLGLQWNARWSAHYPLAVWFILPLGLAALSWFTAKFTPYVGGSGIPQ VIAAISLPHGKNKNRLVEFWQTLWKIPLTFLAMLIGASVGREGPSVQVGA AVMLAWGNFCRKHNFAFRGLSANELIAAGAGGGLAAAFNAPLAGVIFAIE ELGRGFVLRWERRILLGVLAAGFILVAIEGNNPYFPQYQGAYSSQYIYLW VILCGVICGFFGGVFARLLAKGAAGVSPAKIRGWVRRHPIYTALLLGLML AALGSYTKGQTYGTGYHVVTQALSGELLQPESVGIAKLAATVATYWTGIA 
GGIFTPSLTIGAGIGSQIAAYAGDLIDPRLLVLLCMSGFLAGATQSPVTA SVVVMEMTGSQPVLIWALIGCIVASFISRQINPKPFYHLGAARFRQRVQE ESKLKNNDVNP >MS2190 eutG, EutG protein MIMSNAVENTVSPAQAEVNSLVEKGLVALEQFRQLNQEQVDYIVAKASVA ALDQHGALALHALEETGRGVFEDKATKNLFACEHVVNKMRHWKTAGIISD DDVTGITEIADPVGVVCGITPTTNPTSTAIFKSLIALKTRNPIVFAFHPS AQQSSAHAAQIVRDAAVAAGAPENCIQWIAQPSMEGTNALMNHPGIATIL ATGGNAMVQAAYSCGKPALGVGAGNVPAYVEKSADIKQATHDIVMSKSFD NGMVCASEQAAIADAEIYEEFVNELKSYGVYFVNKKEKTLLEEFMFGVKA NGANCAGAKLNADVVGKSAYWIAQQAGFEVPKKTNILAAECKEVSPKEPL TREKLSPVLAVLKSRSTEEGLTLAEAMVEFNGLGHSAAIHTKDAALAKRF GERVKAIRVIWNSPSTFGGIGDVYNAFLPSLTLGCGSYGKNSVSNNVSAM NLVNIKRVGRRRNNMQWFKVPSKIYFERDSIQYLQSVPDMRRVVIVTDRT MVDLGFVQKIAHQLESRRDPVSYQLFADVEPDPSIQTVRRGVDLIRNFKP DTIIALGGGSAMDAAKVMWLFYEQPEIDFRDLVQKFMDIRKRAFKFPSLG KKARYIGIPTTSGTGSEVTPFAVITEGNKKYPIADYSLTPTIALVDPALV MTVPAHVAADTGLDVLTHATEAYVSVLANDYTDGLALQAIKLVFRYLEKS VKENDPEAREKMHNASTIAGMAFANAFLGMNHSLAHKLGGHFHTPHGRTN AILMPHVIRYNGTKPTKTATWPKYNYYKADEKYQDIARLLGLPAATPEEG VKSYAKAVYDLAVRCGIKMSFKEQGLEEQAWMDARHEIALLAYEDQCSPA NPRLPIVADMEEILTNAYYGYDESKY >MS1802 eutG, EutG protein MGVVLMSTYYFLPTRNVFGENAVEEVGELMRSLGGNRPLIVTDGFLAQSG MAEQLATILRGAGLEPIIFGGAEPNPTDKNVESGIAFYHDHNCDCIISLG GGSSHDCAKGIGLIASNGGRIQDYEGVDRSTNPMVPLMAVNTTAGTASEI TRFCIITDTARKVKMAIVDWRVTPQIAVNDPLLMKGMPAGLTAATGMDAL THAIEAYVSTAANPLTDAAALMAISMIQQYLPKAVANGDYMKARDKMAYA QYLAGIAFNNASLGYVHAMAHQLGGFYNLPHGVCNAILLPYVEEFNLIGN LNRFRDIANAMGENIQGLSTDDAALKAIAAIRRLSKQVGIPANLKELGVK PEDFDVMAENAMKDVCMLTNPRKATKQQVIEIFQRAYDGN >MS0069 eutG, EutG protein MTYSLLHTNKVIAGAGCVAQITDVVNSFDATNVVIITDQGVFNAGLINEP KMLLEQAGVNVHVISDTPPEPPVDKVNDIYKVAMQFNVEMVIGIGGGSAM DTAKLVAILLNNHVALRDVVDGKVKFKNRGIPTLMIPTTSGTGSEATQNS IVLVPERELKVGIVDEKMLPNCVILDPKMTTGLPKHITANTGIDALCHAI ECYISKKCSPFTEMFALKSIELIAKSIRIAYEDGHNLQARENMLLGSYLG GVSIATSSTVAVHALSYPLGGKYHIPHGLSNAILLPDVMKFNLDACVEKF ARIAKAMDLNIAGLTEQEAAEAMIEELYALIRDLNIKCDLKTVGITEDIL DELVDAGYSVRRLLDNNPKEMTKQDIRGIYKKIL >MS2325 eutG, EutG protein MSNRFILNETAYFGAGSIQHIVTEVQKRGFTKGLIVTDKSLIQFKVVEKV 
TALLEGANLAYEIFDEVLPNPTMNVVKAGLAKFKASGADYMIAIGGGSPQ DTAKAIGIIVNNPEFADIRSLEGTAPTKKPAVPTIAVPTTAGTAAEVTIN YVITDEENKRKFVCVDPHSIPVVAVIDSEMMASMPPTLKAATGVDALTHA IEGFITLGAWELSDMFHLKAIEIIARALRSSVKGEQQGVEDMALGQYVAG MGFSNVGLGLVHGMAHPLGAFYSTPHGVANAVLLPHIMAYNADFTGDKFR SIAKAMGVRNTEILSIEQARVAAVEAVKTLNKDVGIPATLREVGMKEEDI PELAKAAFADVCTGGNPRPTSVNEIERLYRAIY >MS0514 exbD, ExbD protein MKKFDEINIIPFIDIMLVLLAIVLITASFISQGKIQVNVPKASSTVAFKA DDLAKLLTVTEKGEIYFNDKPIQLTELEQEINGWDKEQKVTLKVDAKSSF QDFVSITDLMAKNDIKNVAIVTVKEKGK >MS0721 exbD, ExbD protein MRLIFEEFYEVPMAYRRKERNIKSEINIVPFLDVLLVLVLIFMATAPIIS QSVEVDLPDAVESQNVSNEDKVPVIIEVAGVGQYAISIAGQRTENLTEEM VTEQTRAEFEKDPNTMFLIGGARDVPYEEVIKALNLLHLAGIKSVGLMTD PI >MS0121 exeA, ExeA protein MMNGDAMLKLKQVLIDKGVSLRQLAQKMNVSPATVAQLVNHNQRVKQQWA EFETNLARNLTALGITAPLKDLLKDEATGESLATEPAASAPKTKQDIEDD IMLLAKQALFPATKKHFSLFRDPFAEDIRSADDVFSSADVRYVREALFQT AKHGGFMAVVGESGAGKSTLRRDLIDRINQENAPITVIEPYIIAMEDNDV KGKTLKAAHIAEAIINTLAPLESVKRSPEARFRQLHKVLKESVKSGYSNV LIIEEAHSLPIPTLKHLKRFFELEDGFKKLLSIVLIGQPELKVKLSERNT EVREVVQRCEVVELAPLDSELENYVAFKLAKVGKKVDDIFDEDAFAAVRQ RLVAVSRNKTSASLLYPLAVGNLLTAAMNLAESLGVPKVSGEVVMGV >MS2265 fAA1, FAA1 protein MNRSELDFHFVNRVRQQAKMLNQATALRHKVNGGWVDISWEEFQFQIDRV SLALLAHGIDVQDKIGIFAHNMPQWTIADLGALQIRAVTVPIYATNTAKQ AEFIINNAEIKILFVGEQEQLDTILEIKNNCPTLEKIILMKSTAEFSPNE SLLSWHSFMGKSADTDPNRLLERLNDARLTDLFTLIYTSGTTGDPKGVML DFSNLAHQLKSHDLALPDVVGREDVSLSFLPLSHIFERAWVAYVLHRGAV VCYLESTNEVRNALTELKPSLMCAVPRLYEKMYSAIQDKVIHAPLHRRML FQWAINQGQKFAHTQKSTWRHKIADKLVLSKLRNLLGGNIKMMPCGGAKL EGKIGEFFHAIGINVKLGYGMTETTATVSCWADKHFNAASIGRLMPNAEV KIGENNEILVRGGMVMKGYYNNSAETAKAFTEDGFFKTGDAGEFDENGNL YITDRIKELMKTSNGKYIAPQYIEGKLGKDKFIEQIAVIADAKKYVSALI VPSFEALEDYAKQLNIKYQDRLELIKHSEIIKLFEKRLEELQQELAHFEQ VKKFTLLPQAFSIKMEEITPTLKLRRKVILERYRRQIEAMYS >MS1194 fabA, FabA protein MTDSCTLNKKSSYTYEDLLASSRGELFGPKGPQLPAPTMLMMDRVIEMNE TGGNYGKGYVEAELDIKPDLFFFGCHFIGDPVMPGCLGLDAMWQLVGFYL GWIGGEGKGRALGVGEVKFTGQILPTAKKVIYRIHLKRVINRKLVMGLAD GEVEVDGRVIYTATDLKVGLFQDMSTF >MS0460 fabA, FabA protein 
MTTENRPAKIIEAHEIMTLLPHRYPFLLVDRVVDFEEGQWLKAYKNISVN EPCFTGHFPGQPILPGVLILEALAQSMGLLAFKTHEIKGGELFYFAGIDD ARFKRPVLPGDRLELFVEVIKERRGITSFTGVASVDGEVACEAKLMCARR >MS1591 fabB, FabB protein MKRAVITGFGVISSIGNNKEEVLASLKAGKSGIEIVPSFVEMGMRSHVAG TVKLNPAELIDRKIYRFMGDAAAYAYLSMKEAIEDAGLSEDQVSNERTGL VIGAGIGSAHYQVKAADAARGSRGVKAIGPYAVTKTMSSSVSACLATPFK IKGVNYSISSACATSAHCIGNAFELIQLGKQDIVFAGGAEELSWEGAAQF DAMGAVSTKYNETPEKASRAYDADRDGFVIAGGGAVVVVEELEHALARGA KIYAEIVGYGATSDGYDMVAPSGEGAERCMKQAMANIDTPIDYINVHGTS TPVGDVKELGAIRNVFGEAKPAISSTKSMTGHSLGAAGAHEAIYTLLMLH NDFIAPSINIETLDEQAEGLNIVTETKENAGLQTVMSNSFGFGGTNATLV FKRYAK >MS1873 fabD, FabD protein MKKFAMVFPGQGSQSVGMLAELAEQFPVVQETFKQASEVLGYDLWQLVQQ GPAEELNKTWQTQPALLAASVAIYRIWQQQYPELKPEVMAGHSLGEYSAL VCAGVIDFQDAIKLVELRGKLMQQAVPEGTGAMYAIIGLDNESIINACKA AEQGEVVSAVNFNSPGQVVIAGSKAAVERAAAACKEAGAKRALPLAVSVP SHCALMKPAADQLAVSLESISFKAPEIAVINNVDVKAENDAEAIRTALVR QLYSPVRWTEIVERMAKNNIEVLLEMGPGKVLTGLTGRIVKELSAQQVND AKSLETVKEILA >MS0543 fabG, FabG protein MITMPFNFWYMEKDKMTLAKKHNFKDKVVVITGAGGVLCAYFAKEIAKTG AKVALLDINLESAQKFADEINAQGYIAKAYKTNVLELDSIKQTRDAIAAD FGTCDILINGAGGNNPKATTDNEFHELDLPPTTKSFFDLDKSGIEFVFNL NYLGTLLPTQVFAKDMVGKKGANIINISSMNAYTPLTKIPAYSGAKAAIS NFTQWLAVHFSHVGIRCNAIAPGFLVSNQNRALLFDEQDNPTARAHKILT NTPMGRFGEAKELMGGILFLMDEEYASFINGVVLPIDGGFSAYSGV >MS0563 fabG, FabG protein MVNTLLIHFLHRRIYMNLFDLTGKVALVTGCNTGLGQGMALGLAQAGCDI VGVNLVEPLDTKEKIEALGRKFVNIEANLMKQEGLTDVVEKAVSVFGKID ILVNNAGIIRREDAIDFSEQNWDDVININLKTVFFLSQLVAKQFIAQGHG GKIINVASMLSFQGGIRVPSYTASKSAIMGITRAMANEWAKYNINVNAVA PGYMATDNTAALRADEARSKEILDRIPAGRWGTPNDLVGPCVFLASAAGD YVNGYTVAVDGGWLAR >MS2145 fabG, FabG protein MQRFEQKTALVTGAGTGIGQAIAVRLAQEGAKVLVVGRTEKTLQETTALH PNIAYAVADIEKDDDVQKIVQQLNQKYGGLDILINNAGWAPVTPISQVKI EEYDKVFGINVRALVNLTLQCLPMLKARKGNIINMSSAICRNHLPNMSMY AGTKAAVEIFTKIWAKELGADGVRVNSISVGPIETPIYDKTDLSNDGIQD HIDRIRKTIPLGAFGKSEDVANVTAFLASDEARFITGSDYSVDGGFGA >MS1412 fabG, FabG protein MSILEKMKLTGKTAFVTGGARGIGKSVAIAFAQAGANVVIADFDIAEAEK TAAEIAKEEGVKSIAVQTDVTDQASVNHLMDVIKQQFGKLDIAFCNAGIC 
INVPAEEMSYEQWLKVINVNLNGVFLTAQAAGKLMIEQGTGGSIINTASM SAHIVNVPQPQCAYNASKAGVIQLTKSLAIEWAKHNIRVNSLSPGYIGTE LTLNSKDLQPLIKEWNAMAPLHRLGKPEELQSICVYLAGDTSSFTTGADF IVDGAFTCF >MS0955 fabG, FabG protein MIKIIFLKCNFHLNEEQKMSELFSLKNKRILITGSTRGIGNLLANGLAEH GAEIIIHGTRLETAEKIAADFNTKGFKAYAVAFDVTDSKAAQDTIDYIEK EIGPIDVLINNAGIQRRYPFCEFPEKDYDDVISVNQKAVFIISQAVARYM VKRQRGKIINIGSMQSELGRDTITPYAASKGAVKMLTRGMCVELARYNIQ VNGIAPGYFATELTKPLVENQEFTSWLCKRTPAGRWGDPKELIGAAVFLS SKASDFVNGHLLFVDGGMLAAV >MS2175 fabG, FabG protein MFKKIILTLFSGLIFTEVTMAQTKYGVGSYNTEEVAAEMEYIEKHIRPLN PKPTKRIFITGSSAGIGELTAKMLLAKGYEVVAHARDAKRAADVKRDLPE IKHVVIGDLAKPDEVDKIADQVNALGRFDVIIHNAGVYRGENIFQINLLA PYVLTAKITQPQTLIYVSSNMHNGGELRLDAFNAGNVGYSDSKLQLLTLA KSLAVRWSKVRVNAMHPGWVGTKMSGGSAPDPLRQAYETLVWLAEGTDPA AQTSGGYFFNKQPDSHYRRDSEDSAQQAVLWQALEKITGVKLPE >MS1406 fabG, FabG protein MWELQRSKKMKQKEVIVAIGSGSIAQAIARRVSIGKQVLLADIKLENAEA AAKTLREAGFEVSTTVVDVSSRASVQALVQTAVDLGAVKGVIHTAGLSPS QASPEAILKVDLYGTAVVFEEFGKVIAAGGSAVVIGSQSSHRLAIDEISQ AQADELATLEPEKLLELPLVQEINDSLRAYQISKRGNALRVQAEAVKWGK RGARINCISAGIIYTPLAYDELTSSERGEFYRNMLAKSPAGRGGTPDEIG ALAEFLFNSSYISGSDILIDGGVTASYKYGELKPA >MS2144 fabG, FabG protein MNNMKKLLILVGAGKGLGNAIAKEFASHDFRVALIARNAENLTAYRQEFQ ALGYEVMTQVADALYPETLTKAINAIQAEWGTCDALVYNVGITELDNDRP ITNELLMQRYQIDAASAYHCAMLVATPEFAAKQGAIIFTGGGFAKTFQPI LALKPLCIDKAALNAMNIVLHHLLAPQGIFVGSVLVSNVIQPNDPKYAPD VIAKAYWKMYCERDEFELLY >MS1421 fabG, FabG protein MKLQNKVALVTGGGTGIGRAIAKQMAEAGATVIIIGRREAQLQESARQHA NIHYIVADVLNSDDITRTLNEIQQRFGKLDVVVNNAGIAPVTPIENVNLA DFDRTFALNVRAVIDVTSQAIPYLKSTQGNIINITSGLVNNPMPMNSIYT ASKAAVLSMTRTWAKELAPYGIRVNSVAAGATKTPLYDGLGLSETEAKDY EATVEHIVPLGRFAEPDEIAPAVVFLASDDARYATGAHYGVDGGFGI >MS2163 fabG, FabG protein MNNIQGKVVIITGASSGIGEATAYKLAEQGAKIVLAARREAQLKAIADNI KAKGGEAVYRVTDVVKPEDNQALVELAKSAFGKVDAIFLNAGLMPSAPLS ALETDNWNRMIDVNIKGVLNGIAAVLPTFEAQKSGHVLATSSVAGLKVYP GGTVYCGTKWAVKAIMEGLRMESAQAGTNIRTATIYPAAVQSELVAGITD ETTSQGYRQLYDTYEIPAERVANVVAFALSQPDDTNVSEFTIGPTTQPW >MS1874 fabG, FabG protein 
MQGKIALVTGATRGIGRAIAEELATKGAFVIGTATLEKGAESISAYLGEK GKGFVLNVADQESIESVLEQIKKEFGDIDILVNNAGITRDNLLMRMKDDE WFDIIQTNLTSVYRLSKAMLRTMMKKRFGRIITIGSVVGSSGNPGQSNYC AAKAGLIGFSKGLAKEVASRGITVNVVAPGFIATDMTEVLTEEQKAGILA NVPAGHLGEPKDIAKAVAFLASEDAGYITGTTLHVNGGLYMA >MS1871 fabH, FabH protein MYSKILATGSYLPAQVRTNADLEKMVDTSDEWIYTRSGMKERRIAAADET SATMGANAAAKALEMANLDPQEIELIIVGTTTNSHSYPSAACQIQGILGI KDAISFDVAAACTGFVYALSVADQFIKSGQVKKALVIGSDLNSRHLDETD RSTVVLFGDGAGAVILEASEQQGIVSTHLHASADKEDMLSLPHIERGEDK SGYITMQGNATFKLAVGQLSSVVEETLEKNNLQKSDLDWLIPHQANIRII SATAKKIRYGYVTSCINHRKIRQ >MS1872 fabH, FabH protein MDMSQVVLTIEKYGNNSAATVPVALDEAIRDGRIKRGQLLLLEAFGGGWT WGSALVRF >MS1467 fabI, FabI protein MKAQGAELAFTYLNDKLQPRVEEFAKEFGSDIVLPLDVATDESIQKCFAD LNKVWDKFDGFVHAIAFAPGDQLDGDYVNAATREGYRIAHDISAYSFVAM AQAARPFLNKDASLVTLTYLGAERAIPNYNVMCLAKASLEAATRVMAADL GKDGIRVNAISAGPIRTLAASGIKNFKKMLSAFEKTAALRRTVTIDDVGN SAAFLCSDLSSGVTGEVLHVDAGFSITAMGELGEE >MS0638 fadL, FadL protein MKKAINKTFLASCILCAAGQASAAAFQLAEISTSGLGRSYAGEAAIADNA AVIATNPALMSQFKTNQFSAGGIYVDSQIRMNGTVSANLAGQTVAQAPAS KTSVVPGSLIPNMYFVSPLNDKFAVGAGMNVNFGLKSEYEDDYAAGVFGG KTELTALNLNLSGSYRVTEKLSAGVGLNALYAKAEVSRNAGILADAVGNV ASNPQALSAIVAQRPDLASKMGALSGLASGLQRDTLLTHLQDKTAWAFGY NLGLAYDLNERNRFGIAYHSKIDIDFKDRNAVSYLPYGTTPYIGEGGLVL HLPSYWEFSGYHKLTDKFAMHYSYKYTEWSRLKNLHATYADRSLRNDGLA FHKDEEYKDNSRIALGATYEVDEKLTLRAGVAYDESAAPRTHASASIPDT NRTWYSLGATYKFTPALSVDFGFAHLRGRKLDFSEEQSLAGGLVTVKADY KSKATANLYGLNINYSF >MS0538 fadR, FadR protein MTDNAELRSYKKIGSILKQELIDGLYQIGERLPPERDLAEKMNVSRTVVR EAIIMLELENLVEVRKGSGVYVINMPLTSEENQDDTYEDVGPFELLQARQ LLESGIAEFAAIQATRSDILRLKEILNKERMTLAEDDKDYTADEEFHSAI AEITQNEILIKLQKELWKYRTKSSMWQGLHAHITDQEYRKSWLQDHQNIL NGIQRKNPALAKKAMWQHLENVKQKLFELSDIEDPDFDGFLFSVNPVVVG L >MS0431 fadR, FadR protein MFTQKSANSSPSVLKARSPAALAEEYIVKSIWSNFYPPGTDLPAERELAE KIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNDVWQTSGLNILDVLVRL DSTMSPTLIANMLSARTNIAIIYIPRAFKVSYEKALASFDGLENLPETAE SYTAFDYEILHKLAFISLNPIYGMVLNSLKGLYTRVGSYYFAIPEARALA KKFYIELRELGKAHRLDEIPSLFRQYGRESSLIFEAAQDGLAQYLIEN >MS0244 fba, Fba protein 
MAKLLDIVKPGVLTGDDVSKVFAYAKEQGFAIPAVNCVGSDSVNTVLETA ARVKAPVIVQFSNGGASFYAGKGIKPASGARADVLGAIAGAKHVHTLAQE YGVPVILHTDHAAKKLLPWIDGLLDASEKEFAKTGRPLFSSHMLDLSEEP MEENMAICREYLARMDKMGMTLEIEIGITGGEEDGVDNSGVDESKLYTQP EDVLYVYDQLNPVSSRFTVAAAFGNVHGVYKPGNVKLKPSILGASQEFVA KERGLQAKPINFVFHGGSGSSTEEIREAISYGAIKMNIDTDTQWAAWDGI LQFYKANEAYLQGQLGNPEGPDSPNKKYYDPRVWLRKMEESMSKRLEKSF QDLNCVDVL >MS1615 fbp, Fbp protein MKTLDEFIVDRQAEYPNAKGALTGILSSIRLVAKVIHRDINRAGLTNNIL GFSGIDNVQGEHQMKLDLFAHNMMKQALMAREEVAGFASEEEENFVAFDT ERGRNARYVILTDPLDGSSNIDVNVSVGTIFSIYRRVSPIGSPVTLEDFM QPGNRQVAAGYVVYGSSTMLVYTTGNGVNGFTYDPSLGTFCLSHENMQIP ATGKIYSINEGQYLKFPMGVKKYIKYCQEEDAATNRPYTSRYIGSLVADF HRNLLKGGIYIYPHATNYPQGKLRLLYEGNPIAFLAEQAGGIASDGYNRV LDIQPSQLHQRVPLFVGSKQMVEKAQDFMHQFKED >MS0719 fcbC, FcbC protein MNNTFQFPVRVYYEDTDAGGVVYHARYLHFFERARTEFLRTLNFSQNQLL HEQNIAFVVKSMTIDYRFPACLDDALIVESEVVEVKGATILFSQILKRDE LVLTTATVKVACVDLGKMKPAALPAEVKAAISK >MS0893 fdhD, FdhD protein MIEITKRTISFFKNLTFIKQIDKDTVSDSNNSRFEFIQKEETLAVEMPVA LVYNGISHTVMMATPSNLEDFALGFSLAEGVIDRVSDIYGIDVEETCNGV EVQVELATRCFVRLKDLRRTLTGRTGCGICGSEQLEQVTKKLAKLDRTFC FELKKLDGCLALLQQAQTLGKQTGSTHAVGFFSPQGELLAIREDVGRHVA LDKLLGWHAKQGKPQGFVLTTSRASYEMVQKTASCGIEMLIAISAATDLA VRMAEECNLTLIGFAREGRATVYTEKVRLKI >MS0843 fdhE, FdhE protein MQSSQKCGKIYRTFLLGIITMSIRILPEHEIKKTATSFEQPALLFANPQN LYERRAKRLRKLSESHPFAEYLNFAAEVSEVQLKILKMHPLPQDERLTKE NFSLDNSIQPLHTKNWKRDVIWREYLAEILAQIKLKATNQITTTIDWLEK ASDTEIESLADKLLAEDFSSVSSDKAVFIWAALSLYWLQLAQQIPHTTNM ESGENLHVCPVCNAAPVASVVHFGAAQGLRYLHCSLCESEWNMVRAKCSN CNQAEHLEYWSIDEEMAAVRSESCGDCHSYLKILFQEKDPHVEPVADDLA TIYLDIEMEEKGFARSGLNPFMFPSEEA >MS0889 fdnI, FdnI protein MSKVEFTNDTKIVRHKYPARVSHWFLVIAFFMTMFTGVAFFFPDFAWLHE ILGTPQLARAIHPITGIIMFIAFIILAFIYASHNIPERNDIRWLKGIVEV LKGNEHGVAYNGKYNLGQKMLFWTLNLAMITLLVTGIIMWRRYFSGYFSV TTLRIAILLHSASAFALFTGILVHIYMAFWVKGSIRAMVEGWVTVRWAKK HHPKWFNHEIKPEIERYMLEKDKASK >MS1028 fdnI, FdnI protein MSKVEFTNDTKIVRHKYPARVSHWFLVIAFFMTMFTGVAFFFPDFAWLHE ILGTPQLARAIHPITGIIMFIAFIILAFIYASHNIPERNDIRWLKGIVEV 
LKGNEHGVAYNGKYNLGQKMLFWTLNLAMITLLVTGIIMWRRYFSGYFSV TTLRIAILLHSASAFALFTGILVHIYMAFWVKGSIRAMIEGWVTVRWAKK HHPKWFNHEIKPEIERYMLEKDKASK >MS0969 fdx, Fdx protein MKIYQVRIENYQLTFSHNNKASLLSELEALGLKPEYQCRSGYCGSCRVKL KKGRVSYKELPLAFVNPDEILLCCCQAEEDLEIELLS >MS1720 fdx, Fdx protein MPKVIFLPHETLCPEGMVVDAAAGDNLLEVALEAGIEIEHACDGSCACTT CHCIIREGGDSLNETTDQEDDMLDKAWGLEVDSRLSCQCQIADEDLVVEI PKYTINHAREENH >MS1202 fecB, FecB protein MRILTALLLFFSLSVNAQIKIATLDWTVAETLIALNNAPVAVGDKASYKI WVGKPALAENTQDLGLRLQPNKESLARLSVDRFINSDFFASIEPSLTAKA PVSTVNFYQPGDTWQNIENATRQIGELIEKSEQAEQLITQTNTQLAKIGQ TLTHFRDRPVAIVQFIDTRHLRFYDSHSLFGTILNKLGLTNAWNHSGGVW GSENLSITALATLPKNTRLVVVKPHPANVANALKYNSLWRNLALAEDPLL LPAIWSFGALPSAVNFAQNLQSALLNQRSETW >MS0795 fecB, FecB protein MIDKENSMKKTLITLAAGLVAAFGVVSAQAADIGLETFGGKQIVPENPKR VVVLDFAALDTIREIGAKETVVGISKGRIPQYLAEFDTDKYANAGTMPEP AFEKINEMSPDLIIASARQKKVLARLKEIAPVFYMENDYENYYPSFEQNL LALGKIFNKESAVKEKLAQLDNRMTALAKLTAGKSALVTIVNESRISAFG DKSRYALVYQKFGFTPIDKNLSSSTHGNSVGFEYIAEKNPDYLLVVDRTA AITDKANNAQTVLDNALIKPTKAAKNNHIVYLNAENWYLAFGGLQSMDTM ISEIESAVK >MS1158 feoB, FeoB protein MNKDKLTCFALVGAPNCGKTVLFNGLTGSNAKVANYPGVTVERREGLFVD DPSVSIIDLPGTYSLRTTTLDEAVARDEILGKYGRKIDGIIAVADATNLR MTLRMVLELKMLGLPMVVSLNLIDVARSRGLTIDEKKLSELLGVPVLETV ATRKEGIRGVKNAIANLPQNSAGIDSNQVAQLLESLDSNALYAQVEDILA QTVKTEMVMPKWHQRLDRLTLHPVWGFVLLLLVLLLVFQAVYSWSEPVMD FIEDFFANLGEWVATLMPEGILQDLILNGIIAGVGSVLVFVPQITILFAF ILLLEDSGYLPRAAFLLDNLLSTSGLSGRAFIPLLSSFACAVPSVMSART IQDPRERLVTIAIAPILTCSARLPVYALIIGAVIPDQTVWGIFNLQGLVL FGLYFIGVFSAGLVAYIMKRLARRKGSIRQFPLLMELPTFRLPNFRHIFS SLWDKVKAFLKRAGTVIFALSVILWALVTFPGAPEGAAGAAIDYSFAGML GSLIQPIFAPLGFTWQMCIAMIPGIAAREVVVAALGTVYAVGASGSEEAI QNALVPIVHEQWGIPTALAFLAWYVYAPMCMATLAVIKREVNSMKKTLMI AGYLFVLAYIFSFVVYQISIRIF >MS1157 feoB, FeoB protein MVLADITEGTAQAKLDNSGMNARPDNPLVDNKLSNRKAARGK >MS1012 fepC, FepC protein MIEIKNLSLPYGLHNINTRIPAGKLIGIMGANGAGKSTLLKAMAGILPLT NGEIWFGGQKLSAMSAAQKNQQFAYLAQDSRVHWDLSVYDVIALGLPYQL QATAEQTKVRSVSEKFSISHLLEKPYRQLSGGEKARVQLARCRIKDAPLL 
LADEPIASLDPYYQIDIMQQLKALTPERTCVVVIHHLDLAYRFCDEVILL HQGNLIASGETQAVLNAENLAKAFSIRAEINLATKGISGIEKIGG >MS1201 fepC, FepC protein MFQLEQASFAIPNRALLAPTTLTFRRGKVYGLIGHNGSGKSTLIKLMAKQ NPLSSGEIFVRGKALRHWGSREFAREVAYLPQHLPTATQLTARELIQMGR YAWNGLLKSNKEKDKSAVENALILTHTEKFAEQQIDVLSGGERQRIWLAM LLAQQSNFLLLDEPLAALDIAHQVEVMKLIKKLSRELNLGVVIVIHDVNL AAAFCDELVALHSGKLLVKGTPGQIMTTETLQRIYGLELNVIPHPQTQVP VVFY >MS0794 fepC, FepC protein MAIEIKHVNKSYGSKKVVDSVSLVIPKGKITSFIGPNGAGKSTVLAIISR LLNADSGDVLLNGKLLNEQKSADIAKQLSILKQSNHINLRLTVEELVAFG RFPYSKGNLKKNDRTFIDNAIGYMDLEEFRHQYIDELSGGQRQRAYIAMT LAQDTDYILLDEPLNNLDMKHSVQIMQVLRKLVTELNKTVVIVIHDINFA SCYSDYIVAMKNGKLVRQGSIAEIMQTSVLEEIYGMEIPIQEINGNKIAV YFKN >MS0519 ffh, Ffh protein MFENLSDRLSKTLRNITGKGRLTEDNIKDTLREVRMALLEADVALPVVRE FINKVKESALGEEVNKSLTPGQEFLKIVQKELESAMGESNESLNLASQPP AVILMAGLQGAGKTTSVGKLAKFLKERHKKKVLVVSADVYRPAAIKQLET LAQSVDADFFPSDVKKNPVDIAKAALADAKLKFYDVLIVDTAGRLHVDGE MMDEIKRIHEVLNPIETLFTVDAMTGQDAANTAKAFNEALPLTGVILTKV DGDARGGAALSIRQITGKPIKFLGVGEKTDALEPFHPDRVASRILGMGDV LSLIEDLQRSVDHEKAEKMAQKFKKGDQFTLEDFREQLIEMKKMGGMMSM LDKLPGAKNLPEHVKNQVDDKMFVKMEAIINSMTLKERANPDIIKGSRRR RIAMGSGTQVQDVNKLLKQFDEMQRMMKKMRSGGMAKMMRGMKGMMGGGL GALGGLGGMFGKR >MS1167 fhaB, FhaB protein MNKRCYRIIFSKTLNCLVVVSELAKTVGKAVAEFSNKLLPMRFFRQKTPD FSLHFAAFICFIGLGIIYVPQAMAKPLEIHADRSAPSGNQPTVLRTANGI PQVDIQTPSAGGVSRNVYSQFDVAEKGAVLNNARKSTNSQLAGWVTANPN LVRGEAKVILNEVNSKDPSQLKGYVEVAGKKADVIIANPSGLHCDGCGVI NAGRTTLTTGQVELENGNVKGFNVRGGKVEVAGKGMDTSRVDYTDIVAGK VKVDGGIWAKELKVTTGKNKVDRTNSKVVYVGNDSTAPSSSENMDKQPIA YAVDVSELGGMYANQIHLVATEQGVGVNNAGKIGASAGNVHIDSNGKITN SGYLGAQQDIAVTANNNIENKGSIYTQQGDIKLKGRDISQQGNIIAEGAA QKKGRVQITANRDIQQSGDTLAENYIDYQAKNIKVTNNATIVAGLDFTQN TLADKSEAGKNARFNAEKSAVINGKILSSNRTEIKAADINLTNSQLHSNH LSATASVGSIIASDSNIYTEKSAVFSTPVSLVTQNAQLNAGHISINATQA DNTQGTWINRDEQDLNLNLQRGLTNTRGQIATNGQLLFNGEQIDNQAGLI SADSYQISAATFDNAKGKLIQQGVNPFNLTVNGTLTNDKGVIGYQMQNLN TANNSMENPVHSDNIPTNTAENITANTSTSEKPIHSGTDAIDVSDKIINI VSNVNVTEKLNNTDGYILSGSQTVLWGEGKLSNDSGTLNLSEFTWESNKN 
INNQLGLISALNALSLHAAELNNNQGTIRSGKNILLSTHALSNNGGLIQS GGNVTINTHGYNLDNSNTLSSNGNKGIVTSGELNLSNINQLNNEQGYIVS TQAQRIQTEKLNNTRGTLATNNTQSLKVTGTLQNQDGNLYSGQLTLDSNL LINRKGNITAASALDITVKGALDNTQGIIAAKENSVIQVATLDNQNGLIG IEQGKLNLTAATRLTNQRGQIISQGDLRLMGGDLQNNQSGTIKSLAKLTV NTGNQQINNQQGTLSSAGELTIQSGYLDNRQGLIYAQKSLFVDTHNQNLD NRNSGTKGILSLSDIILQNIGQLNNSRGQIQAQQDFSVNADTINNTGNGL LYSNTDLSLKAKSLDNRQGTVQALGNVTFDRFSTVNNSVEKNKSGSLIQA GRALTVSALNIDNQNTKTAETVPTQGLVGQAVVLSSDAVNNRQGGIYAAQ ILSANIKNLINNQKGEMLSGGTLNAVGSALKIQNSEGVIASVGKLAIDAA QISDIGKIQSKNDADIALKQDLTLNGGIEVEGSLKLKTNNLTNDGSLLTG KGLHIQTGKLVNNEKAILSSGTTLLNANSITNYGLINGSNTSLKTIDLNN LGTGRIYGDQLSIQARELNNFELNGKSATIAARKRLDLGVGTLTNRNDSS LISLGEIHIGGTLDSRGYATGKATAVYNPNGLIEAQDNIYINSGLVSNTH NYFRTALKLISQETVTEYQGSGDPTIYEEGTPGLYVFNHESDHLHVPTGA YYESWYKYHYLKSVQRTEVLPAEYEPGRIYSGKNITIRGDRVDNINSRII AGGKLDIPANILNNKEETGVEIVTKAGCAEGSRSEACRNLPAELRGYGDD VSLRKHSNNSPYGLHSYWRHHEKGRDSTGHSRQDYTPPQEIKEGIPLEVA AYKEYSLPSFGKANISAINVPQSIDVQVKSAVQNEKEFKPSSLSSDIAVT EQDVTVNNQTTGSIVKDNAVVRTQNRAIALPRSSLFMVNPQSGSGYLVET DPAFTQYRNWLSSAYMMNALNLDSDSMHKRIGDGFYEQRLVQEQIAELTG RVYLSGYSNQEEQYKALMTNGITAASQFQLTPGMALTGEQIARLTSDIVW LVNKTVTLADGSTATVLYPQVYVVVQKGDINGYGALLSGDITAIQSSEMT NSGTLAGKNLLAISAENITNRFGKMTADNTLVSAEQDLVNIGGTIEAAQI LNVNAGRDIVVNTTTHRTQNANGETVTLGIRGGFYLTGKDGRQMWVNAGR DINLTAGEIINNAQNALTAIQAGRDILLNSGQQSDRYENIRDAENYLKTA NRRDIGTMISSQGSGKTVLQAGRNIEAKAAAVSAGGDVLLNAGNNVTLSS GEEYHYVDSAFKDTGRGFLNKTTTKTRDISEQTFAIGSQLDGNNVDIFTQ QGDVNIIGSDVVAENELNVAAKNINIAAATNQVYSENLKQVKKSGLMGSG GIGFTIGSRSQKHVYGENTITQSDARSTVGSVNGNVSMTAENHVNIEGSD VIAQTDKSIDIIGKSLTVEAGRDVIDSTETHEYKKSGLTVSLSTPVTDMA LNARNSLRCSKEVKNERLSNLYQVKAAQEAVMAAQAADSTIDSINALIGD GQMVEGDVSNPSLKISIGVGSSQSKQTSRSQQISYSGSELSAGNINLKSS AGDINLFGSTINATKAVLDSANNINLFSLQDSYRNRSDNENSGWNAGVFV GMNGNSFGIGIEGSAQSGKGRENTDTITQKNSYINVRQAVIRSGRDTNLK GAVINAERLTADIGGNLLIESRQDSNVYNSEQSQSGANFAVAVYGTGTNV NVNASMDKAKLNYAQVEEQSGFKVGRDGMDINVRGNTHLKGGLIESEAAA NKNRFSTNTLTTEDIENHSEVSVQSVSGGLSTNMMANAVNAMRAAISVLG TANKDDHSTTQSAVSGNIDLNIKNGEKPTALSQDTMNANKQVNRYDIEEY 
KEKAELAQVIGEIGQNGITIVLQPKLDKAQQEKDEAEAILKNPNSTAGER REAQIQFNQAQTTLNQYGKGGDIQMAIRAVTGVLQGIAGGDVNAAIVNGL SPYANLAVKEATTDSLTGEVNLVANLMAHAVLGAIEAQITGNNAIAGAAG AVTAEATATLLAKSLYDVGKVDSSGRIKTVNDLTEYEKDSLLVLSQVAAG ITGGVIGDSTQSAVVSGDIGKRAAENNLFGTVLNNPQINWQAVAEGEKIK RERDEEIRAYIKKEHPVIYQTAEGTYYFMSATGKAIYVAREMVIELAPMV IAPEIAAGTKVYAAVSRIALSGGANVVAQKVSGQEFNWAEFGGAVVSGAI TPSLKTTKEAIRFNAGVGMAVGLANGGDGLESATYSGIGTYMGGKISNPT WSAIVSEVIQKIPTINESLSKEHEGK >MS1163 fhaB, FhaB protein MLSAQAKTFAPLGDVHVNNHTELTGGLVTSTDKAEVEGKNRFSTGTLNAT DIQNQAETSGSAYKVSGSADINGGWTGDKKEALSAAIGYGEVDENQTATT KSGINTANIDIRDKQEQVAKTGQTAEDMLTQVKTEISTDMATQNSGVLEN HFDKDTVQKELDYQVKVTSEFQEITLPEIDRQMANKAAEYREEEKIFRQA GNEQAANEMAANAEKWEMGGEYKQRVDAIANAVGLALGGVGVEGTLTGAA SPYINEAIKAQLPEDKNRAANVIAHVIWGAVEAKLQGASATTGALSTAVG ELSAPIVSEVLYGTSDPNLLTEEQKQFVSNLSRIAATATGAISSRAEGNR SVQVAKDAVTSGKVAENAVENNYLSQLSDNRRIWLREQLNRDDLSSVQRE KYEQEFIQLEQDDHTSDILVAKAKYNPESMTQSDWELYQNYATRYYFESI RTEKPENVIADLDNILSNQYIKGYSYPYATAEKYRHELPSRWSLFGTNKS ADEQFYTDIYSKYQNRKTYQESFDGRVAQSTAEALGYASTMMSAGTVASV VSKVGKFTSNGINKASSAIGAFAVKYPNISGAISDGMISSGVHVGYKLST GQDVNEYEVLGAFAGGALTRNHTLGNQIRINIGVATVSSLSKDPSGNSLG KDYFGAIVAPIINKPFSTKDSTLGNIVGGSLGEYGGDLDNRVKDYKEVKK ILSDGEYSK >MS1169 fhaC, FhaC protein MRVFTGVILSLCSACVLAVDSPNLNQLNVQSDAALQQRQEEQNKALQRQQ VADPNIRLENRLEPSEGFPEKENPCYQISHIILTDFSPEISDFSVIPPSS IPSSRFYWALNAIYSTRDFSLPHCLGSEGINILLKRIQNRLIEQGYITTR VVVQPQNLQNGILVITVIPGKIGQIQLQDESSFPYATSATLWFAMPTNNG EILNLRHLEQGLENLKRNTSADANMQLSAVEDEVGASDVIIRYKQGFPIH LTLGLDDSGTKATGRLQGTATLSWDNMFSLNDLFYASFTKSIKRHSDNVD EPHGSKNVSLYYSVPWKNWLLTLSGYQYRYHQSIAGAFENYQYSGKSTQL RMNLSYLLYRNSSRKSYISFGGWARKSFNYINDVEVEVQRRRMAGWDIGL KHIEYLGDATLQISANYKRGTGAYKALPAPEEYFDEGTSRPQIITVGIDL NYPFNIGEQPWKFNTSWNAQWNQTPLIQQDKFSIGGRYTVRGFDGELYLS GERGWLWRNELAWNVFNKGQELYLGIDKGNVYSRFDDLPGNSLVGGAIGL RGKIWGLDYDYFVGVPIDKPAGFKTSHVTTGFNLNYRF >MS0531 fis, Fis protein MLEQQRSPSDALTVSVLNSQSQVTNKPLRDSVKQALRNYLSQLDGQDVND LYELVLAEVEHPMLDMIMQYTRGNQTRAATMLGINRGTLRKKLKKYGMG >MS1863 fkpA, FkpA protein 
MSKQFFDSVALDSVSAKGGYGVGLQIGQQLLDSRLNVEAEAVAKGIYDVL NNNAPALDLNEVSKALQELQQKAQDAAQAQFKQIEEDGRAFLVENAKKDG VQVTESGLQYEILVEGNGNKPSREDTVRVHYTGTLPDGTVFDSSVSRGQP AEFPVGGVIAGWVEALQLMPVGSKWRLAIPHNLAYGERGAGASIPPFSPL VFEVELLDIL >MS0157 fkpA, FkpA protein MLFVIFVKPTEPRLSIKEIVMLKIQKFSAVALLVGAVLATSACKDDKKAQ AAAEPAKQEAPAAAQAENSRVKDPSYAVGVLIGNDLKGLVEAQKDVIAYD NDKILAGVAEALQGKIDLTNQDVVNTLKDIDEKLKVAAQTKAEEQAKQAK AESEKFIAEFKQKDGVKETKSGLLYRIEKEGEGAAIKPTDSVKVHYTGKL TNGTVFDSSVERGQPVEFLLDQVIPGWTEGLQLVKKGGKIELVIPAELAY GEQDLGTIPPNSTLHFEVEVLDVTPAKK >MS2250 fldA, FldA protein MEKSIAIITGSTLGGAEYVADHLAELLENRGFSVQVENNAAFTDVAEQSL WLIVTSTHGAGDLPDNLKPFIRQINTEDLTQVRFAVVGLGNSDYDTFCHA VDKVENALTAQGAARLCDSLRIDVLTTDDHEQCAENWLPNFVAAL >MS0176 fldA, FldA protein MKTIILYSTHDGQTKKIAEYLAQNLDKGAKVVNLTELTQNLADFDRIIIG ASIRYGRFDKNLYKFIEKHTALLQTKLGYFYGVNLTARKAGKDTPETNVY VRKFLAKIHWKPTDSAVFAGALFYPRYKWIDRIMIQFIMKITGGETDPTK EIEFTNWESVKNFAKKIQNMN >MS0860 fldA, FldA protein MAIVGLFYGSDTGNTENVAKMIQKQLGNELVDIRDIAKSTKEDIEAYDFL MFGIPTWYYGEAQCDWDDFFPTLEQIDFTDKLVAIFGCGDQEDYADYFCD AMGTVREIVEQRGAIIVGNWPTEGYSFESSRALINNDTFVGLCIDEDRQP ELTAERVNTWVKQVYDEMCLAELA >MS2202 fmt, Fmt protein MMKPLKIIFAGTPDFAAQHLQALLNSHHQVIAVYTQPDKPAGRGKKLQAS PVKQLAEQYNIPVYQPKSLRKEEAQAQFAQLQADVMVVVAYGLILPKAVL EMPRLGCLNVHGSILPRWRGAAPIQRAIWAGDKQTGVTIMQMDEGLDTGD MLHKVYCDITAEETSASLYHKLATLAPPALIDVLDELESGKFIAEKQEDS KSNYAEKLSKEEAKLDWSLSAAQLERNIRAFNPAPVAFLTVPVNEAEERI KVYRAEVLPHQNSAAGTVLAFDKKGLRIATAEGVLNIQQLQPSGKKPMSV QDFLNGRADWFVLGQVLN >MS0400 focA, FocA protein MKSEDFKLAWMASPTEMAQTGLDVGVYKATKKQAYSFLSAISAGMFIALA FVFYTTTQTASAGAPWGLTKLVGGLVFSLGVIMVVVCGCELFTSSTLSTI ARFESKITTIQMLRNWIVVYFGNFVGGLFIVALIWFSGQIMAANGQWGLT ILNTAQHKIEHTWVEAFCLGILCNIMVCIAVWMAYAGKTLTDKAFIMILP IGLFVASGFEHCVANMFMIPMGMVIANFASPEFWQATGLNAEQFANLDMY HLVIKNLIPVTLGNIVGGGVCIGLMQWFTSRPH >MS1858 folA, FolA protein MTLSLIVAATKNHVIGKDNQMPWHLPADLKWFKENTLGKPVIMGRKTFES IGRPLPKRVNIVLSRHPFEHEGVIWKESLESAVDFLKDSAEIMLIGGGQL FEQYLSQADKLYFTEIQTELEGDTFFPAINTDEWEISYEEYRPADENNAY DLRFLILERKS >MS1824 folB, FolB protein 
MIDRIFIEELTVFAQIGVYDWEQQIKQRLIFDIEMAWDSSKAAETDNVSY CLNYAEVSQFIIQYVQSKPFLLIERVANEVAEQLQKEFGIKWIKLKLSKP KAVAEARNVGIIIERGQC >MS1173 folC, FolC protein MQEKHNLQATSSLSEWLSYLENSHFKAIDLGLERIKAVANELDVLNPAPF VITVGGTNGKGTTCRLLETMLLKAGLRVGVYSSPHLLRYNERVRIQNQEL PDEAHTQSFAYIEARKTQSLTYFEFTTLSALYLFKQAKLDVVILEVGLGG RLDATNIVDNNLAVITSIDIDHVDFLGSSREQIAFEKAGIFRAGKPVVIG EPDVPAAMLAHAGLLGCELACRDKDWSFAQKADSWTWQNQKVRLENLPIC RIPLQNAATALAAVQFMPVQISEEIIRQSLQEVELAGRFQRINAERLVPL ATLVRRSVESLPQIIIDVGHNPHAARYLAGKLIELKQKTSQKITAVCGIL KDKDSEGVLSPLLPIIDKWHCVTLEGARGQSGSNLFVTLKNLANKQQIPF HGESENSVESGIISAISQMDNNEILLVFGSFHTVTGFLELL >MS1852 folD, FolD protein MTAQVISGSALAKKVKTEVGQKIEQYVAQGKRAPGLAVILVGADPASQVY VGSKRKSCAEIGINSKSYDLEESTSEAALLTLIDELNNDADIDGILVQLP LPKHIDSTKVIERIAPHKDVDGFHPYNVGRLCQRIPTLRACTPYGIMKLL >MS1851 folD, FolD protein MIVGASNIVGRPMAMELLLAGCTVTVTHRFTTNLEGYVRQADILVVAVGK AEFIPGNWVKEGAVVIDVGINRCEDGKLRGDVEFAAAAEKAGFITPVPGG VGPMTVAMLMFNTLTAYENNG >MS1043 folE, FolE protein MSRISAEAEKVRHALIEKGIETPMIALTKSKNERRIGIENRMREVMQLIG LDLTDDSLEETPVRLAKMFIDEIFSGLDYTNFPKITNIENRMKVSEMVLV NDVTLTSTCEHHFVTIDGLVSVAYYPKKWVIGLSKINRVVQFFAQRPQVQ ERLTEQILLAFQTILETEDVAVYMKATHFCVKCRGIKDTNSYTVTSAFGG VFLEDRETRKEFLSLINK >MS0925 folK, FolK protein MKTVYIALGSNLNTPIEQLNSALTALNKLPQTSLSAVSSFYQSKPLGPQD QPDYVNAVACIHTELAPLELLDYLQQIENEQGRVRLRRWGERTLDLDILL YDDLVIKSERLILPHYDMTNREFVIIPLYEIAPNLILPQGIAIAELAKNF ANHDMKICYKP >MS0966 folP, FolP protein MKLYANNKVLDLSMPKVMGILNFTPDSFSDSGRFFQLDKALAQVEKMVKA GASIIDIGGESTRPMAEEVTLEQELERVVPLVEAVRQRFDCWISVDTSKA QVMCESAKVGMDIINDIRALQEPDALETAVKSGLPVCLMHMQGQPRTMQT NPHYDNVVSEVLEFLQNRTALCLQAGMNPQNIIWDMGFGFGKTVQHNYKL LQQLSVFAAQGYPVLAGLSRKSMIGAVLDKTVEQRVTGSVTAALIAAMNG ATILRVHDVEETMDALKIWQATLQA >MS1653 frdB, FrdB protein MYTVRKLMQPKQQLKLRRTQMANQAMMNVEVLRYNPEVDKEPYLRTYQVP YDNQTSLLDALGYIKDRLDPELAYRWSCRMAICGSCGMMVNNIPKLACKT FLRDYSGHMRIEPLANFPIERDLIVDLSHFIESLEAIKPYIIGNEMPALD GQPHPSAELAKSRTKQTPAQLEKYRQFSMCINCGLCYAACPQFGLNPEFV GPAALTMAHRYNLDNRDHGKAERMPIINGENGVWTCTFVGACSEVCPKHV NPAAAINQGKLESAKDYLISMLKPKA >MS1654 
frdC, FrdC protein MTTESKRNKYVREVTPTWWKSWSFYKFYMLRESSAIPTVWFCLVLLYGVF CLTTANGFVEKFIPFLQNPVVVILNLISLALLLLHAFTLFQMTGEVMSGS LGLKSEVIQKALKVLFAIVTVVALVLVCI >MS1655 frdD, FrdD protein MVDQNPKRSNEPPVWLMFSAGGMVSGLAFPVLILILGILLPFGIISPDNI IAFSHHWFGKLVILALTIFPMWAGLHRLHHGMHDIKVHVPNGGLIFYGLA AVYSFIVLFAVIAI >MS2007 frnE, FrnE protein MKKIKIEMYSDYACPFCYIGKSHLEQALAQFEHADKVEIVHKAYELYPQT GETVTSTTQGRIEWKYHKTPEQALEMIRHIENLAKRAGIAMNYENVQNTN TFKAHRLTKFAASKGKENEMYNRLMKAYFTDNLPLADRKTLLQCAEDVGL DLAETEAFLNSNDFADSVTADETQARHIGVRSVPFFVINGVEVAGSQPPA RFLALLQQVYAANNM >MS1929 frr, Frr protein MINEIKKDTQDRMEKSLEALKGHIAKIRTGRAQPSLLDAIQVDYYGSATP LRQLANVVAEDARTLAVTVFDRSLIQAVEKAILTSDLGLNPSSAGTTIRV PLPPLTEERRRDLIKIVKGEGEQGKVAIRNVRRDANDKIKALLKDKEISE NDQRKAEEEIQKITDSYIKKVDEVLAEKEKELMDF >MS2178 fruA, FruA protein MKDKPMNIFLTQSPNLGRAKAFLLHQVLAAAVKQQNHQVVENAEQADLAI VFGKTLPNLTALLGKKVYLVDEEQALNAPENTVAQALTEAVDYVQPAQQD VQPATASGMKNIVAVTACPTGVAHTFMSAEAITTYCQQQGWNVKVETRGQ VGANNIISAEDVAAADLVFIATDINVDLSKFKGKPMYRTSTGLALKKTAQ EFDKAFKEATIYQGEETTTATETQTSGEKKGVYKHLMTGVSHMLPLVVAG GLLIAISFMFGIEAFKDENIAGGLPKALMDIGGGAAFHLMIAVFAGYVAF SIADRPGLAVGLIGGMLATSAGAGILGGIIAGFLAGYVVKFLNDAIQLPA SLTSLKPILILPLLGSAIVGLAMIYLLNPPVAAAMNALTEWLKGLGSANA LVLGAILGGMMCIDMGGPVNKAAYVFGTGMIGSQVYTPMAAVMAAGMVPP LGMAIATWIARAKFNASQRDAGKASFVLGLCFISEGALPFVAADPVRVIV SSVIGGAIAGAISMSLAITLQAPHGGLFVIPFVSQPLMYLGAIAVGALTT GVLYAIIKPKQAAE >MS1510 fruB, FruB protein MYSKDVEITAPNGLHTRPAAQFVKEAKAFASDVTVTSAGKSASAKSLFKL QTLGLTQGTVITISAEGEDEQNAVDHLVALIPTLE >MS2179 fruK, FruK protein MAKVATITLNAAYDLVGRLKRIELGEVNTVETLGLFPAGKGINVAKVLND LDVEVAVGGFLGEDNVGDFEHLFQQQGLQDKFQRVAGKTRINVKITETDA DVTDLNFLGYQISEQDWRKFTADSLAYCKEFDIVAVCGSLPRGVTADMFQ SWLSQLHQAGVKVVLDSSNAALTAGLKANPWLVKPNHRELEAWVGHELPT LKDIIDAAKQLKAQGIANVIISMGANGSLWLSDNGVILAQPPKCENVVST VGAGDSMVAGLIYGFVNNLSQQETLAFASAVSAFAVSQSNVGVSDRKLLD PILANVKITTIEG >MS1080 ftn, Ftn protein MLKKAIIDKLNEQINLEFYSSNIYLQMSAWCSNHGYEGAAAFLLRHADEE MEHMHKLFTYVSETGGLPLLGKIDAPQNEFKSLRDVFEITLKHENLVTAK INELVEVTFANKDYSTFNFLQWYVAEQHEEEKLFNSIIDKFNLLGEDGRS 
LYFIDKELATLDLA >MS1079 ftn, Ftn protein MLSANVVKLLNEQMNLEFYSSNLYLQMSAWCDQKGYTGAAAFLSAHAAEE MEHMRKLFTYLNETGSTAVIEEIEAPTHEFKSLKEVMELTYQHELHITSK INELVGKTFEEKDYSAFNFLQWYVAEQHEEEKLFNGILDKFNLVGNEGKS LFFIDQELAKLAADH >MS1662 ftsA, FtsA protein MAKIVESKTIVGLEVGTSKVVAVVGEVLPDGVVNVLGVGSCPSKGIDKGS ITDLAAVVNSIQRAIEAAESVADCQIMSVTLAITGEHIQSLNESGFVPIA DGEVTQDEIDQAMHTASSVKLPEGLSLLHVIPQEYAVDKQQNIKNPLGLQ GVRLKAQAHLIAGHQAWVNNLQKAVETCGLKVDQVVFSGLASTYSVLTED EKDLGVCLIDFGGGSMDIMVYTNGALRYSKVVPYGGNTITDFVAQSLTTS RNEAESIKINYGSAFMPSAELLEQFAKKKIEVAGLGGGAPRTFTKAQVVE VTSRCYHDLLQVVENELTQLRNELAMRGIKQELIAGFVLTGGSSQMTDIA KCATDIFESHVRVGYPLNITGLTDYVNKPQYATVLGLLQYSHHNEEESTQ MFGGSASESSFLGSIFEKCKKIANKVKSEF >MS0008 ftsE, FtsE protein MIRFANVSKAYLGGKPALQGLTFHLPVGSMTYLTGHSGAGKSTLLKLIMG MERANGGQIWFNGHDITRLSPYEIPFLRRQIGMVHQDYRLLPERSVVDNV ALPLIITGFHPKDAEKYALAALDRVGLRDRANYLPVHLSGGEQQRVDIAR AVVHKPQLLLADEPTGNLDDKLSMDIFNLFEEFNKLGMTVLIATHNLGLI QQKPKPCLVLEQGHLR >MS1832 ftsI, FtsI protein MKNMNLLTKLFATPRHDPIRDNKAERNLFARRALVAFIGTLLLTVVLFTN LYNLQVTEYDKYQTRSNGNRIKLLPVPPTRGLIYDRYGKLLAENLTFFGL YIVPEKVENLDRTFDELRDVVGLTDSDIEQFKKERRRSSRYTPIMLKSDL TEEQIARFAVNQYNYPSLDIRPYFKRHYLYGEPLTHILGYVARINDKDVE RLKKEEKDANYAGTTDIGKLGIERFYEDQLHGTAGYEQVEINNRGKVIRK LSEQPPVAGKSIYLTIDLELQRFITDLLAGQKGAVVVMDPRDSSILAMVS SPSYDNNLFVGGISGSAYKRLLEDPTRPLYSRATQGAYPPASTVKPFIAV AALTEGVITPNMTIFDPGYWILPNSTKRFRDWKKTGHGSLNLYKSITESA DTYFYQVAYKMGIDKMSEWMTRFGFGVPTGVDIQEETSGIMPTREWKQKR YKKPWVIGDTIPVGIGQGYWTATPLQLAKATSVLVNDGKVNTPHLMKETV GSEKEPYKDPLLYEDISEPTKAAWNEAKRGMYGVVNAPNGTGRKAFTGAA YRVAGKSGTAQVFSLKENQRYDASQLKRELHDHAWFTAYAPYENPHIVVS IILENAGGGSGNAAPVVRQIMDYYLLHRLPQVAKLEGITDEQSVNSAAEQ SKAAENNSEEAATSGDMPIEEPISLPHEGATE >MS1673 ftsI, FtsI protein MVKFNTSRKASAKPKKSVKKNIMPNTAVKLNKPKIIYETSFLSGRFQVAV CLIIVCLLALVARAAYIQIINVDTLTNEADKRSLRTQEIQSVRGSILDRN GQLLSVSVPMHSVVADPKFVLDENSLADKDRWKALADTIGVPYKDLVKRI EKNPRSRFEYFARQVPPSVADYVKKLRITGVVLKSDSRRFYPRAEETAHL LGFTDIDNNGIEGIEKSFNSLLIGKSGSRTYRKDKYGNIVEDISDVKKYD AHDVTLSIDEKLQSMVYREIKKAVAENNAESGTAVLVDIRTGEVLAMVNA 
PSYNPNKRNGVSEDLMRNRAVTDTFEPGSTIKPFVVLTALQRGAVRRNEI INTGPLVLNGHEIKDVAPRNQQTLDEILENSSNRGVSRLALRMPPSALME TYQNAGLGKATDLGLGGEQAGFLNANRKRWSDIERANVAYGYGINATPLQ IARAYITLGSFGIYRPLSITKVDPPVIGNRVFSEKITRDVVNMMEKVAIK NKRALVDGYRVAIKTGTAKKLENGRYVDKYMAYTAGIAPVSDPRFALIIL INDPKAGQYYGGAVSAPVFSSIMGYTLRANNITPDGVPATDKTAARTIRL NNKLSQKMNGETMRKQAN >MS0963 ftsJ, FtsJ protein MGKKRSASSSRWLNEHFKDPFVQKAHKQKLRSRAYFKLDEIQQSDRLFKP GMTVVDLGAAPGGWSQYVVTQIGDKGRIIACDILDMDPIVGVDFLQGDFR DENVLAALLDRVGEDQVDVVMSDMAPNFSGMPSVDIPRAMYLVELALDMC KQVLAKKGSFVVKVFQGEGFDEYLREIRSLFTTVKVRKPEASRDRSREVY IVATGYRG >MS1674 ftsL, FtsL protein MLENSERYPLQNIVVDDLFSANKLVVALLIAIVMTAVTTVWVTHKTRTLT SEKGELVFEKQALENEYLNLKLEETTQSDNTRIEAIATVKLGMKHIDSEH EVVILE >MS0450 ftsN, FtsN protein MVQRDYAARGGRRKKTTGLNKKLLIAVAAMVVLAFAAGLYFIKNSSQPVV EQNLQVETKPQPKSQLPSRPEEVWSYIKELESRTVPTDAVQTEKVIQLSE KQKEELKKLAEQERQAELERTKKSEQETIADKTVDEQTSSAVVSAVNDEA ALKAEQQALEKRKKEEERKKAEAVKVAETKKADTAKSGGGSYGLQCGAFK NRAQAESLQARLAMTGLNARVNTSADWNRVVIGPVGDRAAAAAAQKQASS ITNCVIIGM >MS1663 ftsQ, FtsQ protein MSVVKRKTTQKKIKLAEPKTRVFLQVKPLLVLCCVGLLYFAYINWQTLLD KLDSKPISSFALVGTPQYTTNADVRDMILKMGELKGFFGQDVDVIREQIE SMPWIKGAVVRKIWPDRLSIWVAEYAPVAFWNSEDFVSLDGVVFKLPKDR LKNDNLPRLYGPDYQSLAVLDAWKQIFNELKSKGITLKAVSIDERGSWEI VVENDITLKLGRGEWKSKIDRFMTIYPQVEIPENKKIAYIDLRYKVGAAV SFADIN >MS1668 ftsW, FtsW protein MYIFSKIKAGYQRWTTLTPTNLLYDRSLLWLFIILLFIGFVMVTSASIPV GTRLFDDPFYFAKRDAMYVILSMGICYYFIKVPMANWESWHKRVFILALI LLILVLIPGIGKSVNGARRWIPMVLFNFQPAEFAKLALICFLSGYFTRRY DEVRSRKLSAAKPLIVMGFLGTFLILQPDLGSTVVLFVITFGLLFVVGAH IMQFLVLAATGGFLFVVLVLSSAYRMKRITGFMDPFKDPYGTGFQLSNSL MAFGRGEFTGEGLGNSIQKLEYLPEAHTDFVMAVVGEEFGFAGITVMIIL LALLVFRAMKIGRESLQLEQRFKGFFAFGISFWIFFQGFVNLGMSLGLLP TKGLTFPLVSYGGSSLVIMAISIAILLRIDHENRLMRGGHARLKDD >MS1831 ftsW, FtsW protein MTEKKSIFSNIWTRLHLDFLLLVGLLVVSGYGLIVLYSASGGSETMFRSR IIQVVLGFAVMIVMAQFPPRFYQRIAPYLFFVGLIMLILVDLIGTTSKGA QRWLDLGLFRFQPSEIVKLSVPLMVAVYLGNKKLPPKLSETVIALAIIVV PTLLVAIQPDLGTSILVSASGLFVVFLAGMSWWLILAAVVGLAAFIPIMW LYLMHDYQRTRVLTLLDPEKDPLGAGYHILQSKIAIGSGGLWGKGWMLGT 
QSQLDFLPEPHTDFIFAVLSEEQGMFGITLLMLIYFFIIIRGLIIGVNAE TAFGRILTGALTLIFFVYIFVNIGMVSGILPVVGVPLPLISYGGTSFVSL MAGFGVIMSIHTHKRTLYHKGN >MS0007 ftsX, FtsX protein MSRVRARAFTLRTIIMSSKNSAPFWVQMQYVLRHVWADLVKRKYGTILTI LVIAVSLTIPTVSFLLWKNTHIASTQFYPESDITVYLHKNLSEEDANAVV EKIRQVEGIDSLNYVSRQQSLNDFRNWSGFSEELDILDDNPLPAVVMIQP APEYQDSKKREDLRANLNKIKGVQEVRLDNDWMEKFTALTWLIGHISVFC AVLMTLAVFLVIGNSIRSDVYSSQANISVMKLLGATDQFILRPYLYTGII YGFFGGFFACFFSSLLIGYFASAVQYVTDVFAVKFSLNGLEIGEVLFLLI ICAIVGYISAWISATRHIKMLDHKAG >MS0140 ftsY, FtsY protein MADEKKKSGFWSWLGLGKKEAGDSAEKQADQAPSAYEKAEQTVEETKRKI DELANQAQGIAEQVKDQVDEIKEDLADKLEQTKQDIVHQVEQVQVEAEQK FERTIEKFLNSEPQSDENQSEQEKIEAVSATEKEQQTAEATHSTDLDVNE VTVETQEKPGKGGFFSRLVKGLLKTKQNIGAGFLSLFTGKKIDDELFEEL EEQLLIADIGVPTTAKIIKNLTEHASRAQLKDTQALYQQLKVEMAEILKP VEQPLIVDTGRKPYVILMVGVNGVGKTTTIGKLARQFQQQGKSVMLAAGD TFRAAAVEQLQVWGERNNIPVVAQSTGSDSASVIFDAMQSAAARNADILI ADTAGRLQNKNNLMDELKKIVRVMKKYDESAPHEIMLTLDAGTGQNAISQ AKLFHEAVGLTGISLTKLDGTAKGGVIFAIADQFNLPIRYIGVGEKIEDL RPFHAEEFIDALFTHEEN >MS1661 ftsZ, FtsZ protein MFEPVEYGFDDEIGRTLIKVVGVGGGGGNAVNHMVNNMIHNGGTLVGENS MTSDEHGEIIFYAVNTDAQALRKSIVQQTVQIGAATTKGLGAGANPNVGR KAAEDDQEAIRAMLEGADMVFIAAGMGGGTGTGAAPIVAQVAKELGILTV AVVTKPFSFEGKKRMAFAELGIKELSKHVDSLIIIPNEKLLKVLGKTTTL VQAFSAVNDILRNAVTGISDMITSPGLINVDFADVRTVMSEMGRAMMGAG IAQGAASDGRAEKAAQDAVASPLLEDVDLSGARGVLVNITAGMDLGLDEF YAVGDTIRAFASDEATVVVGTTLIPEMSDEIRVTIVATGIGDIDEPAATL APVAQRPVTGAAPNPNQPGQAPQQPTTQPEQPARPTSFGNNNDLFKPAFL RGDK >MS0760 fumC, FumC protein MTAFRIEKDTMGEVQVPADKYWAAQTERSRNNFKIGPAASMPHEIIEAFG YLKKAAAFANTDLGVLPAEKRDLIGQACDEILARKLDDQFPLVIWQTGSG TQSNMNLNEVIANRAHVINGGKLGEKSIIHPNDDVNKSQSSNDTYPTAMH IAAYKKVVEATIPAVERLQKTLAAKAAEFKDVVKIGRTHLMDATPLTLGQ EFSGYAAQLSFGLTAIKNTLPHLRQLALGGTAVGTGLNTPKGYDVKVAEY IAKFTGLPFITAENKFEALATHDAIVETHGALKQVAMSLFKIANDIRLLA SGPRSGIGEILIPENEPGSSIMPGKVNPTQCEAMTMVAAQVLGNDTTISF AGSQGHFELNVFKPVMAANFLQSAQLIADVCISFDEHCATGIQPNTPRIQ HLLDSSLMLVTALNTHIGYENAAKIAKTAHKNGTTLREEAINLGLVSAED FDKWVVPADMVGSLK >MS0859 fur, Fur protein 
MSEENIKLLKKAGLKITEPRLTILALMQEHKMQHFSAEDVYKLLLEKGEE IGLATVYRVLNQFDEAHILIRHNFEGNKSVFELCPTEHHDHIICVDCGKV FEFNDDIIEKRQQEISREHGIQLQTHSLYLYGKCADIDKCDK >MS1503 fusA, FusA protein MSLNDYPQQVNKRRTFAIISHPDAGKTTITEKVLLYGNAIQTAGSVKGKG SQAHAKSDWMEMEKQRGISITTSVMQFPYNDCLVNLLDTPGHEDFSEDTY RTLTAVDSCLMVIDAAKGVEERTIKLMEVTRLRDTPILTFMNKLDRDIRD PMELLDEVESVLKIHCAPITWPIGCGKLFKGVYHLYKDETYLYQTGQGST IQEVKIVKGLNNPELDNAVGDDLAQQLRDELELVKGASNEFDHELFIGGE LTPVFFGTALGNFGVDHFLDGLTEWAPKPQPRQADTRIVESSEEKLTGFV FKIQANMDPKHRDRVAFMRVVSGKYEKGMKLKHVRIGKDVVISDALTFMA GDRAHAEEAYAGDIIGLHNHGTIQIGDTFTQGEDLKFTGIPNFAPELFRR IRLKDPLKQKQLLKGLVQLSEEGAVQVFRPMMNNDLIVGAVGVLQFDVVV SRLKTEYNVEAIYENVNVATARWVECADSKKFEEFKRKNEQNLALDGGDN LTYIAPTMVNLNLAQERYPDVTFFKTREH >MS0164 fusA, FusA protein MARITPIERYRNIGISAHIDAGKTTTSERILFYTGVSHKIGEVHDGAATM DWMEQEQERGITITSAATTAFWSGMSQQFQQHRINVIDTPGHVDFTIEVE RSMRVLDGAVMVYCAVGGVQPQSETVWRQANKYQVPRIAFVNKMDRTGAN FLRVVEQLKTRLGANAVPLQLPVGAEDNFKGVVDLIKMKAINWNEEDQGM TFTYDDIPADMLEACEEWRNNLVEAAAESSEELMEKYLGGEELTEEEIKG ALRARVLANEIILVTCGSAFKNKGVQAMLDAVVEYLPSPVDIPAIKGINE DETEGERHASDDEPFAALAFKIATDPFVGNLTFFRVYSGVVNSGDTVVNS VRQKRERFGRIVQMHANKREEIKEVRAGDIAAAIGLKDVTTGDTLCDPNA PIILERMEFPDPVISVAVEPKTKADQEKMGLALGRLAQEDPSFRVHTDEE SGETIISGMGELHLDIIVDRMKREFKVEANIGKPQVSYRETIRTRVNDVE GKHAKQSGGRGQYGHVVIDLYPLDPEGPGYEFVNEIKGGVIPGEYIPAVD KGIQEQLKSGPLAGYPVVDIGVRLHFGSYHDVDSSELAFKLAASIAFKAA FNKANPVLLEPIMKVEVETPPEYVGDVIGDLSRRRAMVNGQEANDFVVKI NAEVPLSEMFGYATDLRSQTQGRASYSMEPLKYAEAPTSVAAAVIEARKK >MS0457 fxsA, FxsA protein MPIIFIITLIAFLFIYGELSLLIAIGSAIGAFGVIMLLLLSVFIGGVILK SKGLFGLNFRRQIAQGEIPADSVVKSLLWMIAGILFIIPGFITDLLACLL LLLPSGLFEKWISQKFTVINSGFTAQGFGRHSHRYRYYKDQNTEVFEAEY EKEVDEKKRIK >MS0827 gadB, GadB protein MGRRPPYGTNMADISKHRQSLFCSDPQSIADYETAMSNAVKAVSNWLKNE KMYTGGSIRELRKTIGSFNPSKQGVGVNQSLDHLVDIFLNPSLKVHHPHS LAHLHCPTMVASQIAEVLINATNQSMDSWDQSPAGSIMEEQLIDWLRQKA GYGQGTSGVFTSGGTQSNLMGILLARDWAVANHWKNEDGSEWSVQENGLP AEALKKLKVVCSENAHFSVQKNMAMMGMGFQSVVTVPTNANAQMDVAELE KTLATLKAEGKIVACIVATAGTTDAGAIDDLKAIRKLADAYQAWLHVDAA 
WGGALLLSKDFRHLLDGIELTDSITLDFHKHFFQSISCGAFLLRDERNYR FIDYKADYLNSEYDEEHGVPNLVSKSLQTTRRFDALKLWFTLEALGEDLY ASMIDHGVKLTKQVEEYIRTTEGLEMLVPTQFAAVLFRVAPEGYPAEFID ALNQNVADELFARGEANIGVTKVGNKQSLKMTTLSPIATLENVKALLALV LAEAERIKDAIANGTYVPPID >MS1227 galA, GalA protein MKNNFIRLTGGNTDLIIRPEPAEILYWGKRLEIDEITEADLLSLERGVSN GGLDVDTPVTLAAENGRGYWGSSGVDGHRDGYDWAPVFKTKSAVKNGEVL LIEAVDEIAQLAFKTEIEADGNGVFKFRNSLTNLGEGKFTVNRLAVTLPV PEYADEVLSFYGRWCRELQENRTCLKHGAFMQENRHGRTSHEYAPNLVLG QPHFSQQQGEVWGFHLAWSGNHRIRADVLIDGRRFAQLENLYLPGEIVLE QGESLSTPWVYTAYSDCGLNGMSQQFHRHIRSRILHFAQPIRPVHLNIWE GVMFDHDPAHIIAMAEKAAEMGVERFIIDDGWFIGRNDDFGGLGDWFLDE KKYPNGLKPVIDAVKKLGMQFGIWVELEMISKRSKLYQQHPDWMLRLDGY DQPEERHQYVLDLVNPDAFNYILERMDWLLGENQVDYIKWDHNRRLVQPG HLGKAAVTAQTQAAYRLFDILQKRYPHVEIESCSSGGARIDFEILKRSQR FWTSDNNDALTRQKIQRGMSYFYPPEVIGAHIGGAPCQTTMRNFSFDFRG LTALFGHMGVELDPVKESAEERQGFAKYIALHKQLRPLLHSGESFRLDHH DDTTLINGVVAQDKKQAVVLISQLDMPDYKQMGKLRVPYLEANATYQVKL LSIPDYIKQGKGGHLMKVFPQWVLDNFAGKPVSIRGEWLAKAGLTIPVLD PQSAMLVEFKRL >MS0798 galE, GalE protein MTILVTGGAGYIGSHTIVELLNAGEDVVVLDNLCNSSPKSLERVKQITGK SVKFYEGDVLDRTLLQRIFAENQIKSVIHFAGLKAVGESVQKPAEYYMNN VTGSLVLVQEMKKAGVWNLVFSSTATVYGEPETIPVTENCKVGGTNSPYA TSKLMVEQILTDVVKAEPRFSMIILRYFNPVGAHESGLIGEDPNGIPNNL MPYISQVAIGKLPELSIFGNDYDTHDGTGVRDYIHVVDLAIGHLKALTRH EDDAGLHIYNLGTGIGYSVLDMVKAFEKANNMTLPHKFVARRPGDIAAYY SDPSLAAKELSWTAQRGLEQMMKDTWNWQKNNPKGYRD >MS0648 galK, GalK protein MQPKDLAKKLFSEKFNRTSELNVYAPGRVNIIGEHTDYNDGFVMPCAINY GTAVSGAKRDDHTFCVYAADLDQFDRFRLDRPIEQNPSEKWTGYVRGVVK FIQERCPEFTQGADLVISGNVPLSSGLSSSASLEVAVGKFCQQLGELPLS NTDIALIGQKAENKFVGANCGNMDQLISALGQQDHLLMIDCRSLETKATP VPHNIAVMIVNSHVKHDLVTGEYNTRRQQCEAAAKFFGVKALRDVSIQQF KEKEAELTALDGEAAKRARHVVTENQRVLDAVDALNQGDISRLGELMGQS HDSMRDDFEITTPEIDYLVELAQQVIGKSGGARMTGGGFGGCIVAVAPVE KVEEVRKIIADNYQKRTGIKEDFYVCTASQGVHLC >MS0649 galM, GalM protein MHRLSRSAFMLKTLQKTTALAPDNQPFQIVTLSNKNGMKVQFMDWGATWI SCQVPVNGELREVLLGCRAEDYPRQSAFLGATVGRYANRIANARFELNGQ TFPLTANQGVHQLHGGDGFDKRRWKIEKCGENFVTFCLNSVDGDQGFPGN 
VEVVLDYELSEDNALTVRFHATPDKDTPLNLTNHAYFNLNNAIRGCDVRG HSLQLNADYFLPVDTDGIPNAPLKSVEGTSFDFLEEKPIGLDFLQEEQQL TKGYDHAFLLNNNAEKTCAILTALDRSLSLQVFTSQPALQVYTGNYLAGV PTRLGGSYADYAGIALETQALPDTPNHPEWQQYGGITKAGETYRHWTTFR FI >MS0647 galT, GalT protein MICFSPDHSKTLPLLTVEEITEVVKVWREQLRELGQKYQWVQIFENKGAA MGCSNPHPHGQIWANSFLPNEVARADLNQKKYFEKQGSVLLLDYAKRELE RKERIVVETEHWLAVVPYWAVWPFETLLMPKKAHIKRLTDLTEEQSRDLA LALKKLTTKYDNLFEISFPYSMGFHAAPFNGEENEHWQLHAHFYPPLLRS ATVRKFMVGYEMLGESQRDLTAEQAAARLRDLSEVHYKMRK >MS0646 galT, GalT protein MGLSFPAPGKTPLARSAGKKLPRKKNQAMTRIVIFVRVMRVLRASLIPII KNLMYSEMIFRLYCLKRRRRNKQRIRYFKAAGLRVKAV >MS0645 galT, GalT protein MSISFDPTEHPHRRYNPLTDQWVLVSPHRAKRPWQGQQEKSCRGRKTKP >MS0345 galU, GalU protein MKGKTMKVIIPVAGLGTRMLPATKAIPKEMLTLVDKPLIQYVVNECIAAG VKEIVLVTHSSKNAIENHFDTSFELETMLEKRVKRQLLEEVRSICPKDVT IMHVRQGNAKGLGHAVLCGRPAVGNEPFAVVLPDVLLAEFTANQKTENLS AMIKRFNETGSSQIMVAPVDPKDVSSYGIADCNGAEFSGGESAVISRMVE KPSPEKAPSNLAVVGRYVFSATIWDLLERTPVGVGDEIQLTDAIDMLIAK ETVEAFHMTGESFDCGDKIGYMKAFVEYGIQHEKLGNEFKNYLKAFAKTL >MS1739 gapA, GapA protein MIILFNLNIIGENLMAIKIGINGFGRIGRIVFRAAQTRDDIEVVGINDLI DVEYMAYMLKYDSTHGRFDGSVEVKDGNLVVNGKTIRVTAERDPANLNWG AIGVDIAVEATGLFLTDETARKHITAGAKKVVMTGPSKDATPMFVRGVNF SAYAGQDIVSNASCTTNCLAPLARVVHETFGIKDGLMTTVHATTATQKTV DGPSAKDWRGGRGASQNIIPSSTGAAKAVGKVLPALNGKLTGMAFRVPTP NVSVVDLTVNLEKPASYETIKQAIKDAAEGKTFNGELKGVLGYTEDAVVS TDFNGCALTSVFDADAGIALTDSFVKLVSWYDNETGYSNKVLDLVAHIYN YKG >MS1919 gcpE, GcpE protein MSAFKPTIKRRESTKIYVGNVPVGGDAPIAVQSMTNTRTTDVEATVAQIK ALERVGADIIRVSVPTMEAAEAFKLIKRQSSVPLVADIHFDYRIALKVAE YGVDCLRINPGNIGREDRIRAVVDCAKDKNIPIRIGINAGSLEKDIQEKY GEPTPEALLESALRHVEILDRLNFDQFKVSVKASDVFLAVEAYRLLAKAI KQPLHLGITEAGGARAGAVKSAVGLGMLLAEGIGDTLRVSLAADPVEEIK VGFDILKSLRIRSRGINFIACPTCSRQEFDVIGTVNALEQRLEDIITPMD VSIIGCVVNGPGEALVSDLGVTGGNKKSGFYLNGERQKERFDNEYIVDQL EAKIRAKIAAQDPKNRIL >MS1379 gcvR, GcvR protein MANSVITVIGKDRVGIVYDVSKILAENQINIVNITQQLMDDYFTMIILVD TSKCSKSFPELAEFFTQESKNLALDIRLQNEEIFKAMHRI >MS0196 gdhA, GdhA protein MQTLTILLIRGKLMSSTVSSLEDFLSLVAQRDGNQPEFLQAVREVFTSIW 
PFLEANPQYRSQALLERLVEPERAFQFRVAWTDDKGQVQVNRAFRVQFSS AIGPYKGGMRFHPSVNLSILKFLGFEQIFKNALTTLPMGGGKGGSDFDPK GKSDAEVMRFCQALVAELYRHIGPDTDVPAGDIGVGGREVGYLAGYMKKL SNQAACVFTGRGLSFGGSLIRPEATGYGLVYFAQAMLAEKGDSFQGKTVS VSGSGNVAQYAIEKALQLGAKVVTCSDSAGYVYDEAGFTTEKLAALLDIK NVKRGRVKDYAEQFGLQYFPGERPWGVKVDIALPCATQNELELTDAQKLI ANGVQLVAEGANMPTTIEATEALQAAGVLFAPGKAANAGGVATSGLEMAQ SSQRLFWSAEEVDQKLHNIMLDIHANCKKYGTDANGNINYVAGANIAGFV KVADAMLAQGVY >MS2354 gidA, GidA protein MFYSENYDVIVIGGGHAGTEAALAPARMGLKTLLLTHNIDTLGQMSCNPA IGGIGKGHLVKEIDAMGGLMATAADQAGIQFRTLNSSKGPAVRATRAQAD RVLYRQAVKVALENQPNLDIFQQEATDILIEQDRVTGVATRMGLKFKTKS VILTAGTFLGGKIHIGLDNYTGGRAGDPASIALADRLRDLNLRVARLKTG TPPRLDARTINFDILAKQHGDAQLPVFSFMGSVDQHPRQIPCFITHTNEQ THEVIRNNLDRSPMYTGVIEGIGPRYCPSIEDKVMRFADRNSHQIYLEPE GLTSQEIYPNGISTSLPFDVQMKIVNSMVGLEKTRIVKPGYAIEYDFFDP RDLKPTLETKAIKGLFFAGQINGTTGYEEAASQGLLAGINAGLFVQEKES WFPRRDQAYMGVLVDDLCTLGTKEPYRVFTSRAEYRLLLREDNADSRLTP IAHELGLIDENRWARFNQKMENIERERQRLRNIWIHPRSEHLDVINEVLS SPLVREASGEDLLRRPEINYQILTALDLFKPAMDDKEAVEQVEIAVKYQG YIEHQQEEIEKQKRHENTAIPDNFDYTLVAGLSNEVRAKLEQHRPVSIGQ ASRISGVTPAAISILLVNLKKQGMLKRGE >MS2353 gidB, GidB protein MVNKLEQELTQKLEILLKQTALSISDQQKNKLVQLVLLLNKWNKAYNLTS VRDPMEMLIKHILDSVVVSPYLQGDLFIDVGTGPGLPGLPLAIINPDKNF VLLDSLGKRISFIRNAVRELELSNVVPVLSRVEEYIPDHKFDGILSRAFA ILKDMTDWCHHLPNEKGLFYALKGVYQQEEVMDMSNNFQVIDVIKLHVPE LIGERHLVKVKKM >MS1221 glcD, GlcD protein MLPRLKEVPQLTPLVSDFLDDLKAQYFEGDIASNYADRLSLATDNSVYQL LPQAILFPKSVSDVVRITKLAQQHKYLSLTFTPRGGGTGTNGQAINNNII VDLSRHMTGILELNVEERWVRVQAGVVKDQLNQFLKPHGLFFAPELSTSN RATLGGMINTDASGQGSLQYGKTSDHVLGLRAVLVNGDIIDTSAVKTERF LDNLAAKKVTFTSKRLHEEVFHRCKEKREQIVRDLPQLNRFLTGYDLKNV LNEDESEFNLTRLLTGSEGTLAFICEATLDLTPIPQIRTLINVKYSSFDA ALRSAPFMVQANALSVETVDSKVLNLAKEDIIWHSVKELLTEEANSPILG LNIVEFAGNNKNLIERQVAALCAQLDEKIANRESNIIGYQVCSDLPSIER IYAMRKKAVGLLGNAKGAAKPIPFVEDTCVPPENLADYITEFRALLDGYN LQYGMFGHVDAGVLHVRPALDLCDKEQVQLFKHISDSVAELTRKYGGLIW GEHGKGIRSYYGEKFFTPELWQELRYIKFLFDPHNRLNPGKICSALNSEQ QLYPILSPMRADNDRQIPIKMREEYAGAMNCNGNGLCFNFDVHSTMCPSM 
KVTGNRLFSPKGRAGMVREWLRLMANENVTPEQLNFHHSQVKLSELVEKV KNSVKKWRGEYDFSHEVKAAMDTCLACKACASQCPIKIDVASFRSKFFYF YHQRYVRPSKDYIVANLETVAPYMAKQPKLFNFVMKSKFMKIAAEKALGM TDIPLLSEPNLRHQLVEIGYQGKTLEQLERLSPTEKSNMLLIVQDPYTSY YDATVVADFVELCRKLGFKPVLLPFKPNGKAQHIKGFLGQFARTAKNQAD FLNRMTKLGLPLVGVDPAIVLSYRDEYNEILQQNRGDFHVITAHEWLKNQ LDSNLLKTAVKNLQKNHRTLNTHEWYLFPHCTEQTFMPNSPQEWQQIFTA FGQHLEVEKVGCCGMAGVFGHDMKNQEMSKAIYAGSWATKLTGKNIEYCL ATGYSCRSQVARLENEELKHPVQALLSLFH >MS0661 glf, Glf protein MKKYDYLIVGAGLFGSIFAYEATKRGKKCLVIEKRDHIGGNCYTQNVEGI NVHKYGAHIFHTSNKVVWDYIQQFAEFNRFTNSPIARYKDELYSLPFNML TFNKMWGVITPQEAEAKIKEQIAQESITEPKNLEEQAISLVGRDIYEKLI KGYTEKQWGRKCTELPAFIIKRLPVRYTYDNNYFYDTYQGIPIGGYTGIF ERMLDGIEVKLGVDFFTEREYYENLADKIVFTGMIDEYFGYQFGKLEYRS LRFDNEVLDMPNYQGNAVVNYTEAEVPYTRIIEHKHFEYGTQPKTVITRE HSKEYEEGDEPYYPINDARNNELYAKYKELADEKSNVIFGGRLAQYKYFD MHNIIAEALECVNAHFR >MS1120 glgA, GlgA protein MKVLHVCSELYPLLKTGGLADVLGALPAAQKEIGLDARILIPAYPAISAG IPDTGVVAEFHNSAAGHVVLRYGEFNGVGVYLIDAPNLYAREGNPYHDQW YNDYADNYKRFALLGWVGAELATGLDPWWMAEVVHAHDWHAGLTSAYLAY KGRPAKSVFTIHNLAYQGLFAYHHLFEIGLPTSMFNVNGLEFYGQISYLK AGLYYSDAVTAVSPTYAREITTPEFAYGFEGLLSTLHSQGKLVGILNGVD DNIWNPNTDGYIQDHYKLKSMTGKKKNKAALQAHFNLPEKPDALLFVMIT RLTEQKGVDLLIQSAENIIKQGGQLALLGSGAPSLESALLGLAHKHPKNI AVKIGYDEPLSHLMVAGGDVILVPSRFEPCGLTQLYGLKYGTLPLVRQTG GLADTVVDSTAENIKERRATGFVFNEANSQALSHAISRAFSLWKKQRTWF TVRTVAMEQDFSWQISARRYEELYRRI >MS1123 glgB, GlgB protein MKKLVAQSVIDAFFDGTHSDPFAVLGMHETHNGIEIRVLLPEAHRVIVID KETHKAVVELELVDERGFFNAIVPKANQFFAYELQVYWGKESQILEDPYR FHPMINELDNWLLAEGSHLRPYEVLGAHFVEYDNVAGVNFRVWAPNAKRV SVVGDFNYWDGRRHPMRFHPASGIWELFLPKVALGQLYKFELIDSNNQLR LKADPYAFAAQLRPDTASQVSALPEIVEMTEKRRAANQSDKPISIYEVHL GSWRRNLENNFWLDYDEIADELIPYVKEMGFTHIELLPISEYPFDGSWGY QPLGLYAPTSRFGTPDGFKRLIEKAHESGINVILDWVPGHFPSDTHGLAA FDGTSLYEYADPKEGYHQDWNTLIYNYGRHEVKNYLSGNALYWVERFGLD GLRVDAVASMIYRDYSRRDGEWVPNQYGGRENLEAIEFLKHTNYVLGTEL PGVAAIAEESTSFPGVTLPPEHGGLGFHYKWNMGWMNDTLEYMKLDPVYR QYHHGKMTFAMLYQYSENFVLPLSHDEVVHGKGSLITKMSGDTWQKFANL RAYYGYMWAFPGKKLLFMGNEFAQGREWNYQESLDWFLLDDGQGGGWHSG 
VQRLVKDLNKTYQNQTALFELDTNPQGFEWLVVDDNQNSVFAFERRSKSG EVIIVVSNFTPVPRDNYRIGVNEPGKYEEILNTDSAYYKGSNLGNYGEVI AEEIENHGKAQSISVMVPPLATVYLRLKK >MS1121 glgC, GlgC protein MNNAVLNQPNKYDLVKDTLVLILAGGRGSRLHELTDKRAKPALYFGGNRR IIDFALSNCINSGLNRIGVITQYAAHSLLRHLQTGWSFLPQERGEFVDML PARQQIDDNTWYRGTADSVYQNLAIIRGHYKPKYVLILAGDHIYKMDYSQ MLLDHVSSGAKCTVGCIEVPREEAKEFGVMAVNETLKVKAFVEKPQDPPA MIGKPNSSLASMGIYVFNADYLYEALDRIKTPNTSHDFGKDVMPLALNDG VLYAHPFDRSCKGRNTEGAIYWKDVGTLDSFWQANIDLVSEEPQLDIYDQ TWPIRGNPVQAYPSKFFYDEPNCKQVDNSLIAGGCMVKNASISYSVLFDN VSVNAGSSIEQSVILPQVKIGKNCMLRRCIIDRHVQIPDGMQIGVDLELD SKRFRISKNGIVLVTESMLHKLNGKSVASEAHLD >MS1119 glgP, GlgP protein MLDKDFIYESPKLTVEALKQAIVSKLVFDIGRSAQEATTRDWLNATVYAV RDFVAEGWIQTVNQFREEKTRRVYYLSMEFLMGRVLSNAMLSEGVYDTAK QALSELGLVLEDILEKEADPGLGNGGLGRLAACFMDSIATCNLPGMGYGI RYEYGMFKQTIEDGSQVEKPDAWIAKGAPWEFTRASKRYRVRFGGNLHFE GEKCIWTPSEEITALAYDNIVPGYETKSAATLRLWTANAGDIFNLANFNK GDYFGAIEERSSIENVSRVLYPDDSTWAGRELRLRQEYFLVSASLQDIIK RHKKFHGGKIANLADKVAIHLNDTHPALAIPELMHILVDQEGISWKKAWD MTRRIFSYTCHTLMSEALETWPIELMAKVLPRHLQMIYEINAEFLEYVRT YVSADVDFIRRVSLIEEGNQRKVRMGWLSVVGSHKVNGVAEIHSDLMVSS TFADFAKIYPERFTNVTNGVTPRRWIGVANPKLAALFDKYIGTEWRKDLS QLSLLKPYIGKPEIIGELAKIKFANKKRLARYVKNTLDIEINPNAIFDVQ VKRIHEYKRQILNVLQIISRYNQMIANPEKNWQSRVFILAGKAASAYYTA KQTIRLINDIAEVINNDERLKGRLKVVFIPNYSVSIAEIIIPAADISEQI SLAGTEASGTSNMKFALNGALTLGTLDGANVEILENVGEDNIFIFGNTVE QVEELRRNGYSPVTFYQQDEELRQAVDQIALGHFSPKEPTRYQGLIDSLR NYDYYQSFADFRSYADMQAKVDEKYQDQAAWFNSTLENIANMGFFSSDRT ILEYAERIWKIKPLKLEN >MS2073 glgP, GlgP protein MTFQSIVEKYCRYFDVADPKNLTLQQWYQIVAEGSLELACSQPFAKPAES RHVNYLSMEFLIGRLTGNNLMNLGYYEQIRDYLKQYQVELVDVLEQERDP ALGNGGLGRLAACFLDSMAALGQNATGYGLHYQYGLFKQSFAEGMQKETP DTWDRNNYPWHSFNPSKTRYVGFGGKIKHIQGDNYEWSPKLTIQGKAFDL PVVGYRNNLIQPLRLWQADSDQSFDFDAFNEGKFLKADKTIVNAAALTQV LYPNDNHKAGQKLRLMQQYFHCACSVADILERHFAEGYQLADFAKRQVIQ LNDTHPTLAIPELMRLLLDDYHLTWDQAWDICTNTFAYTNHTLLPEALEQ WDQRLFKQLLPRHYQIVEKINDIFHQKVRSEFGENSQVWEKLAILFDYRV RMANLCVVTCFRVNGVAQIHSDLLVTDLFPEYHKLFPGKFCNVTNGITPR 
RWIRQANPKLSDLLDRTLKQDWAKDLELLSGVEKYVDDAGFREEYQAIKR HNKIVLADEINRTLALKVNPDAIFDVQIKRFHEYKRQHLNLLNIIADYQS LKANPNQDYTPRVFVFAGKAAPGYYLAKNIIHAINNVAEIINNDKQVNDR LQVAFLPDYRVSLAEKIIPAADVSEQISMAGKEASGTGNMKLALNGALTL GTLDGANVEIAEMVGEENVFIFGHTVESVRELLAKGYHPKDYYKKDSVLK NAVDFLAHGKASNGDKETFRLMLDSLLERDPFLVFADFDSYRLAQQKIGS AYLNREAWLRSAILNTARLGTFSSDRSIRDYQQHIWLKK >MS1122 glgX, GlgX protein MLKNQTGKPYPLGATLVEVNGTKGVNFSIFSASARAIELCLFDNSGREVR FPILDKTDDIFHIWVPNVPLGTKYGFRIHGDERHNPKKLMLDPYAKMVVG KPDLTSKENQAWYLLSDERDNSKIAPKSIIIDGEFDWEQDKPLNIPWTET IIYELHVKGFSKLRADLPEEIRGTYSALAHPSVIAYLKELGITAVELLPI NFSISESHLQERGISNYWGYNPMAMFAVEPQYAATEDPVHEFKTMVKTLH QAGIEVILDMVFNHSAESERDFPTFSYRGIDEQTYYWSDAQGNYLNWSGC GNLLHLAHPYMRRWAIDCLRYWVEEYHIDGFRFDLATNLGRETPAYKAHS ELFKAMRLISGFKNTKFIAEPWDMGEDGYQMGNFPPFFAEWNDRFRDDIN RFWLWQSGELGAFAERFAGSADIFKQEGKYPHNSVNFITAHDGFTLRDLV SYNHKHNNANGEDNRDGRNENYSHNHGIEGSTDGLDEPQKTAVENARILS SQSLLCSLLLSNGTPMLLAGDEFGNTQFGNNNGYCQDSGLTWLKWSNFNL DLFEIVKKLIIVRKGIQSLVNDKWWTEGNVRWFNEFGSLMNVSDWQERGA KALQVLLDEQWLCVVNAKTELQVFSLPEGDWNMEISMTGCKNQNNQLIVD NLSFCLLRRIL >MS0189 glmS, GlmS protein MHNGIIENYEELRTLLQERGYVFQSQTDTEVIAHLVEWEFRTAGSLLEAV QKTVKQLRGAYGTVVLNEEEPEHLIVARSGSPLVIGYGVGENFLASDPLA LLSVTRRFTYLEEGDVAEITRKSVQIYTRDGQKVEREIHEGNFEADAADK GPYRHYMQKEIFEQPVAIMNTLDGRIKEGKVNIEAIAPNAAEILSKVEHV QIVACGTSYNAGMVARYWFEAIAGVSCDVEIASEFRYRKFVTRPNSLLIT LSQSGETADTLAALRLAKESGYMSAMTICNVASSSLVRESDFAFLTRAGV EIGVASTKAFTTQLTCMLLLNAAIGRLKGNLSEEQEHHIIQSLQRLPAQI ESALVFDKQIETLSEDFAEKHHTLFLGRGEYYPIAMESALKLKEISYIHA EAYAAGELKHGPLALIDSEMPVVVVAPENDLLEKVKSNIEEVRARGGQLY VFADSDAGFEDSDNFKTIVLPKVDEVTAPIFYTVPLQLLSYHIALIKGTD VDQPRNLAKAVTVE >MS0188 glmS, GlmS protein MCGIVGAVAQRDVAEILVDGLHRLEYRGYDSAGVAVLNNAHEMQIVRRVG KVKALDDAIAKNALLGGNRYCAHPLGNSRRTDRS >MS1949 glmU, GlmU protein MIIMKKLSVVILAAGKGTRMYSDLPKVLHKIAGKPMVKHVIDTAKQLSAD QIHLIYGHGADLLKSHLADEPVNWVFQAEQLGTGHAMQQAAPFFADDENI LMLYGDSPLISKETLEKLIAAKPENGIALLTVNLDNPTGYGRIIREKGSV VAIVEQKDADAEQLKITEVNTGVMVSDGASFKKWLGRLNNNNAQGEYYMT DVIGLANQDGFQVAAVSATDKMEVEGANNRLQLAALERYYQHKQAERLLL 
EGVMLIDPARFDLRGTLEHGKDCEIDVNVIIEGSVKLGDRVKIGAGCVIK NCEIGDDVEIKPYSVFEDSTIGARASIGPFSRLRPGAELAEETHIGNFVE IKKATVGKGSKVNHLTYVGDAQVGTDCNLGAGVITCNYDGANKFKTVIGD NVFVGSDVQLVAPVNVANGATIGAGTTVTKDIGENELVISRVPQRHIAGW QRPTKKK >MS0262 glnA, GlnA protein MANPNAIQRVAKLIEDNDVKFVLLRFTDIKGKEHGVSLPVNLVADELEDF FEEGKMFDGSSVEGWKAINKADMLLMPMPETAVIDPFAQITTLSIRCSVY EPNTMQSYDRDPRSIATRAENYLKSTGIADQALFGPEPEFFLFDDVRFST EMNNVSYKIDDIEAAWNTNRKFEDGNNAYRPLKKGGYCAVAPIDNAHDIR SEMCLILEEMGLVIEAHHHEVATAGQNEIASKFNTLTLKADETQIYKYVV QNVALEYGKTACFMAKPFAGDNGSGMHCNMSLSKDGKNVFQGDKYAGLSE TALYYIGGIIKHAKALNAFTNPTTNSYKRLVPGFEAPVLLAYSASNRSAS IRIPAVTSPKAIRVEARFPDPLANPYLAFAALLMAGIDGIINKIHPGDAM DKNLYDLPPEELKEIPAVCSSLEEALDSLQADHEFLIQGGVFSKEFIDAF VAIKRKEVERVNMTPHPVEFEMYYA >MS1305 glnD, GlnD protein MIGHNNFIIQSVILRFFMFQSVEGLLTPGLIKQQKEQLKQTELENFAQAD VNSLISHRTLFCDNFLIRLWRQFSLHEVTDLALIAVGGYGRREIFPLSDL DFLILTEQPMPADLAKKVEEFIQFVWDCGFDVGASVRTLEDCDSQGRADI TIATNLLESRLLTGNETLFDKLSSIVGREDFWPRKTFFEAKIQEKKQRYQ RYNNTSYNLEPDIKYNPGGLRDLHLIYWIALRHSNALSLEEILQSGFIYP EEYAELERNQQFLFKVRFALHLILKRYDNRLLFDRQVKVSELLGYQGEGN QGVETMMKAFFQSLQAISLASDILAKHYKEHFVDENGEEECQVLDDNFQM INNAIFLVREDCFVQQPDTILDLFSYLIIRPQAELHSSTLRLLHLALGQL NGYLSELPAAREKFLRLLTQPRGIERALIPMHKYGVLTAYIPEWKGIEGL MQFDLFHIYTVDEHTMRVLAKLETFLSEETAEAHPLCVKLFPSLPDRALI YIAALFHDIAKGRGGNHADLGAVDVGRFAAQHGFDCREIETMKWLVKQHL FMSVTAQRRDIHDPEVVMNFAAEVQNQVRLNYLVCLTVADICATNTTLWN SWKRSLFASLYQYTNQQFNQGMDNLLDNQEQEEQNKALALEILQSQGFTE DVQSLWKRCPGDYFLRNTPKELAWHAVLLAGVETELLVKISNRFSAGGTE VFIYCKDRPNLFLKVVAAIGNKKLSIHDAQIITSLDGYAFDSFIVTELDG SLLKFDRRRVLEKAIINSLNSNELTKLQGSENHKLQHFNVKTEVRFLNTE KTTHTEMELFTLDKAGLLADVSLVFSELNLSIQNAKITTIGEKAQDFFIL TNAKGEALSERERQSLSEKLQARLD >MS1278 glnE, GlnE protein MTMPLPSIEQTLIQLADNLITHFPEQFNSQIYQQIQKDISNIKTPVGALM RAVSMSDFVTEILQKQPHFLAECWHKTPQLADCDSYAARLSVQLADIREE TGLYKTLRDFRNQEMAKLSICQSLNSATVEEIFIRLSQLAEALIIGARDW LYQRACLDWGTPTDNQGNVQQLYILGMGKLGGFELNFSSDIDLIFTYPAN GETVGSRKPIDNQKFFTRLGQRLISALDEFTEDGFVYRTDMRLRPFGDSG ALALSFNAMESYYQEQGRDWERYAMIKGRILGADEQDPNVKTLRQLLRPF 
IYRRYIDFSVIQSLRDMKSKIEREVLRRGLVDNIKLGAGGIREIEFIVQV FQLIRGGREISLQQHELLKLLPEIEKLNLITADQHQDLLQAYLFLRRVEN VLQAINDKQTQLLPADELNRCRLISATCEFTQWDNNHRPQKIQYPIHDWE SFYQVLQQHQQKVRSVFNNLIGFNNENEADDSDNAWSDFLDADLEQGEIA DILAQQGVSEEERDEIIGRLEAFRHSVSHRSIGIRGREVLTQLMPLLLLQ IFSNKKYRTLLPRMLNIVEKILTRTTYLELLLENPQALTQLIELCAKSQL IAEQVAQHPILLDELLDREALLNPPSFEQYPAELQQYLLRLPEDDDEQFI TALRQFKQATLLRIAAADILGALPVMKVSDHLTFLAETILHTVVNLAWQQ ITARFGKPEHLQNNEKGFLVAGYGKLGGIELGYRSDLDLVFLCDEIHSGQ TVGGKKVIDSHQFYLRLAQKIISIFSMTTSAGILYEVDLRLRPSGEAGPL CCSFKAFEDYQMNEAWTWEKQSLVRSRAVYGEPALREKFELIRTGILASP RDLTQLKIDVREMREKMYRHFAGADDNKFNIKKDQGGITDIEFIAQYLVL AHAPENPNLAYWSDNVRIFDIMAEHGIITLNEAEKLKNCYTGLRNQIHHL NLLGEPPIVSKEEFADERRFIHQIWQKLFFE >MS0426 glnK, GlnK protein MKKIEAIIKPFKLDDVRESLSDIGITGMTVTEVRGFGRQKGHTELYRGAE YMVDFLPKVKLEIIIPDELLDQCIEAIMETAQTGKIGDGKIFVYNVERVI RIRTGEENEDAL >MS1685 glnQ, GlnQ protein MALLEIKELVKNYGEVTALNGVNLSVEKGEVVVILGPSGCGKSTFLRCIN GLEEIKSGSLKLADVGELGKDISWVKARQHIGMVFQSYELFAHMTVIDNI LLGPLKVQKRARAEVEKQADALLKRVGLYERKNAYPRELSGGQKQRIAIV RSLCMNPDIMLFDEVTAALDPEMVREVLDVVLGLAKDGMTMIIVTHEMQF ARQVADRIVFMDNGNIIEESEPEQFFTSPKTERAKTFLNILDYYI >MS1275 glnQ, GlnQ protein MIKVKNIHKAFGENVILRGIDLDITKGEVVVILGPSGSGKTTFLRCLNAL EMPEQGTIEFDNAAPLKIDFAAKPSKKDILALRRKAGMVFQNYNLFPHKT ALENVMEGPVRVQSKKVAQAREEALALLTKVGLADKADLYPFQLSGGQQQ RVGIARALALQPELMLFDEPTSALDPELVQDVLDTMKSLAKEGWTMVVVT HEIKFALDVADLVIVMDDGVIVEQGSPKQLFDNPQHERTKAFLQRLRSH >MS0219 glnQ, GlnQ protein MTISVKNLNFFYGSSQALFDINLTAEDGDTVVLLGPSGAGKSTLIRTFNL LEVPKSGDLTVADNHFDLSQNTDAKKMRQLRQDVGMVFQQYNLWPHFTVM ENLIEAPMKILGLTESEAQKEAMELLTRLRLEEHAHRFPLQLSGGQQQRV AIARALMMKPKVLLFDEPTAALDPEITAQIVSIIQELQETGITQVIVTHE VGVARKVATKVVYMEKGRIVETGDASCFEAPQTEQFRQYLSHD >MS0490 glnS, GlnS protein MISKFKVIEMELKALFNLDPNVKVRTRFAPSPTGYLHVGGARTALYSWLY AKHNDGEFVLRIEDTDLERSTPEATAAILDAMEWLNLTWEHGPYFQTERF DRYNEVIDQMIEQGLAYRCYCSKERLEELRHQQEANKEKPRYDRHCLHDH EHSPYEPHVVRFKNPQEGSVVFEDAVRGRIEISNHELDDLIIRRSDGSPT YNFCVVVDDWDMGITHVVRGEDHINNTPRQINILKALGAPIPVYAHVSMI 
NGDDGQKLSKRHGAVSVMQYRDEGYLPEALLNYLVRLGWGHGDQEIFTLE EMIKLFELEHVSKSASAFNTEKLLWLNQHYIRELPAEYVAQHLAWQYQEQ GIDTSKGPALTEIVSMLGERCKTLKEMAASSRYFFEEFDGFDEAAAKKHL KAAAVEPLEKVKEKLTALSGWDAHSAHEAIEQTAAELEVGMGKVGMPLRV AVTGAGQSPSMDVTLAGIGRERVLARIQKAIDFIKAKNA >MS1127 glnS, GlnS protein MNNNEILIEETRPTNFIRQIIDEDLASGKHNNVYTRFPPEPNGYLHIGHA KSICLNFGIAQDYQGKCNLRFDDTNPVKEDVEYVDSIKQDVEWLGFKWEG EPHYASDYFDQLYGYAIELIEKGMAYVDELSPEQMREYRGTLTEPGKNSP YRDRSIEENLNLFEKMKNGEFAEGAACLRAKIDMASPFMVMRDPVLYRVK FASHHQTGDKWCIYPMYDFTHCISDAIERITHSLCTLEFQDNRRLYDWVL EHISIERPLPHQYEFSRLNLEGTLTSKRKLLKLVAEGAVDGWNDPRMPTI SGLRRRGYTPAALREFCRRIGVTKQDNVVEFSALESCIRDDLNRNAPRAM AVLNPLRIVIENFTEKEVLTAPNHPNYPELGTHEMSFTKEIYIDQADFRE EANKQYKRLVLGKEVRLRHAYVIKAERVEKDEQGGITTVYCSYDPETLGK NPADGRKVKGVIHWVSATENLPAEFRVYGRLFNVPNPGAEEDILAAMNPE SLVVKHGVVEMSLANAEPEKAYQFEREGYYCADNKDSKAGNLVFNLTVSL KEGF >MS0597 gloA, GloA protein MKLEHVAIYVQDLEKAKAFFMKYFNAQPNEKYHNPRTNLMTYFLTFSGGA RLEIMTRPEIIELDKNIFRTGLIHLSMQVGGEEKVRELTERLRTDGYQVI SEPRKTGDGYYESCVLDGEGNQIEIVA >MS0610 gloA, GloA protein MLNDVIRTLPAWLNDWGKKEKKTTLFDFRQQI >MS0611 gloA, GloA protein MISLFTGFHHIAIIVSDYEKSKYFYTQILGAEVIEETYRASRHSYKLDLK FADGSQIELFSFPSSPSRLTMPEACGLRHLAFKVKDIEEAVQYLKTQQIE CEDIRIDELTGKKFTFFKDPDNLPLELYEFNSFKGG >MS0703 gloA, GloA protein MMRILHTMLRVGDLDRSVKFYQDVLGMRLLRTSENPEYKYSLAFLGYDDE DKTAVIELTYNWGVTEYELGSAFGHIAIGVDDIHATCEAVKAHGGKVTRE PGPVKGGSTVIAFVEDPDGYKIEFIENKNAKAALGN >MS0946 gloB, GloB protein MLVPIPALNDNYIWLYGRENLPVIAIDVAECKNLSAYLTQHHLQLEAVLL THYHDDHTGGVEELKRYYPDIPVYGPAETADKGATHIVNEGNIQTAHYRI EVVPSGGHTANHVSYLIDNHLFCGDTLFSAGCGRVFTGDYGQMFESITRL KQLPDKTVICPAHEYTLSNLVFAEAFAPNEKVKSAVKNQRISVESLRAQN KPSLPTTLALEKNINPFLQAENLADFIYLRKAKDNF >MS0824 gloB, GloB protein MNIDIIPVTSFQQNCSLIWDDRKNAAIIDPGGEPKKLIEKIEENGLDLKM ILLTHGHLDHIGAAPALKAHFGVDIIGPHEDDVFWFENLPQQSAQFGLFE ANAFLPDMWLNRENEVLEVGSLKLEVLHLPGHTPGHVGFFEHQNIVAFTG DVLFRNSIGRTDFPGGSYDDLISSIKEKLFPLGDDWIIIPGHGPYTTIGA EKKTNPYLK >MS2011 gloB, GloB protein MKKLVLTTLISATLGLSAIAAHAHPTYAPAKNAVKMQKTQVPGYFRQMVG 
DYEVTALYDGVGNLDMSLMAPFTQFSKAELDAMLDDEFAQRSELGGLEGT IIGFLVNTGDNLILIDAGKGEAEAPIFLDKQGRLIDSLKAAGYQPEQVDI ILPTHMHADHINGITEKGKRVFKNATVYLPLQEKAFWLDTPMDKLPSEIH PFIEAARYAVAPYLKADKVKFYNAGDEVFAGVKTVPLFGHTPGHSGFEFT SKGEKILFWGDVMHNGAVQMAHPEVAIEFDADAEAARTNRQTILTKIAAD KTLIAAAHLPFPGLGHIKTEKDGKGYRWYPVQYRPFDKH >MS1993 glpA, GlpA protein MLGCVFYLTTNQFTFTRGGIMGMSSQLYKNVGDFSPINTDVIIIGGGATG AGVARDCSLRGLKCVLLERHDIATGATGRNHGLLHSGGRYAVNDRESAEE CIKENLILKRIARHCVDDTKGLFITLPEDDLDYQKKFIEACQASGIEAEA IDPALAKFMEPSVNPDLVGAVVVPDGSIDPFRLTAANMIDAVENGAQVFT YCEVKGLIREGGRVIGVNVYDHKNKINRQFFAPMVVNAGGIWGQGIAEYA DLKIRMFPAKGALLVMGHRINGMVINRCRKPADADILVPGDTICVIGTTS DRIPYDQIDNMEVTPEEVDILIREGEKLAPSLRHTRVLRAYAGVRPLVAT DDDPSGRNVSRGIILLDHAQRDGLDGFITITGGKLMTYRLMAEWATDLVC QKLNNSKKCETSDRTLPGSNESREETSQKVVSLPTTIRNSAVYRHGSRAT RLLENERLDRSLVCECEAVTAGEVRYAVDELNVNNLIDLRRRTRVGMGTC QAELCACRAAGLMARFDVATPRQSTEQLASFMEERWKGIRPIAWGDAVRE AEFTSWIYYSLLGLNDVLPEDAQGVNNNEF >MS1994 glpB, GlpB protein MNFDVVIIGAGIAGLTCGLTLQEKGVRCAIINNGQAALDFSSGSMDLLSR LPNGSTVDSFAQSYAALAQQSPNHPYVILGKDVVLDKIQQFETLAKSLNL SLVGSSDKNHKRVTALGGLRGTWLSPNSVPTVSLEGKFPHDNIVLLGIEG YHDFQPQLLADNLKQNPQFAHCEITTNFLHIPELDHLRQNSREFRSVNIA QVLEYKLSFNNLVDEIKQAVGNAKAAFLPACFGLDDQSFFESLKQATGIE LYELPTLPPSLLGIRQHRQLRHRFEKLGGVMFNGDRALRSEFEGNKVARI FTQLHLENAVTAKYFVLASGGFFSNGLVSEFEEIYEPLFRSDIVKTERFN ATDRFSWISKRFADPQPYQSAGVVINAECQVQKDGNNVENLFAIGAVIGG YNGIELGCGSGVAVTTALKVADNIIAKESSN >MS1995 glpC, GlpC protein MNIQELIKQAKQDMQSPIAAEIFHDKSFESCIKCTACTAVCPVSRNNPLY PGPKQAGPDGERLRLKSPSFYDEALKYCLNCKRCEVACPSDVKIGDIIVR ARNKHLAQQNKPFVQKLRDAILSNTDIMGTLATPFAPIVNTVTGLKATKF VLEKTIQVSKHRTLPKYSFGTFRSWYMKNAAKEQAKFDQKVAYYHGCYVN YNNPQLGKEFIQVFNAMDIGVVLLEKEKCCGLPLSVNAFPERAKKLAQFN TDYIEKMLDENGLDVISEASSCTLNLRDEYHHILGIDNAKVRPHIHMVTP FLYKLFQQGKTLPLKPLKLRVAYHTACHVEKAGWAPYTLEILKQIPGLEV VVLPSQCCGIAGTYGFKAENYETSQAIGKTLFDNINEGGFDYVISECQTC KWQIDMSSNVTCIHPITLLAMSINQ >MS0752 glpC, GlpC protein MNVNFYVTCLADVVKAGVAKNTVLLLEKLGCKVIFLEKQGCCGQPALNSG YTKQALPGMKNLVETFEVNDYPIVAPAGSCVYAIKNYPEYFTRFNEPQWA 
ERAQKIADRFYDLTDFIVNVLGVTNVGATLTGKAVYHPSCSLSRKLGIVK EPVSLLQQVKGLTLLPIANQQTCCGFGGTFSVKMAEISGEMVKEKVAHIS EADPDYLIGADVSCLMNIAGRLEREGKKVKVMHIAEVLMQEEK >MS1990 glpF, GlpF protein MPLFFYFILYIMTHILCENREFKKIPNGLFLIKTTNHNFFKHNSQSSKEK NMNPYLAEFLGTALLVLMGNGVVANVCLNKTKGNGSGWIVITTAWAFAVY VAVVATGPYSGAHLNPAVTLGLAANGGFSWTMVPGYIIAQILGGIFGGLV VYLFYRDHFSATEDEGAKRASFCTEPAIRNYGSNLFSEIIGTVVLVSVIF YISAGSITLPGAEGATPVGLGSIGGLPVAILVWAIGLSLGGTTGYAINPA RDLGPRIALTLLSKKLKTSPDWGYAWVPVLGPCIGGLLAAIGYQIVM >MS1965 glpF, GlpF protein MKKLFAEFFGTFWLVFGGCGSAVLAAAYPELGIGFAGVALAFGLTVLTMA YAVGHISGGHFNPAVTLGLVAGGRFQAKEAFSYILAQVVGGVMGATVLYA IASGKVGFDAVNGGFASNGFGEHSPNGYSLAAVFIAEVVLTAFFLIIIHG ATDKRAPAGFAPIAIGLALTLIHLISIPVSNTSVNPARSTAVAVFQGGWA LEQLWVFWVAPIIGGIIGGIIYRVLLESKD >MS2185 glpG, GlpG protein MQLLFRSEIPSFAWQFRDYIRKKYQIELILQQEKTDMRQNVIAVYLSGNS EQTAAILQDLAEFHRNPFDERYERASWETGDVSSGSHSLKELAENSSQGI KQQLLKTGPVTLLITLICIIVYGFEISGMAEQIMQFAHFPYEFGENQQIW RYFTHSLVHLSSMHITFNLVWWWIFGGAIERYFGSTKLIIIYVLAAFATG VTQNFASGPHFFGLSGVVYAVLGYVFVADKFSPNNRFNLPSGFFNVLIIG IALGFVTPLIGIKMGNTAHITGLLVGLILAFLQEKIGKKSK >MS1988 glpK, GlpK protein MNTYFINYSSRRLTMSEKKYIIALDQGTTSSRAVLLDHDANIVEIAQREF TQIYPKAGWVEHNPMEIWATQSSTLNEVVAKAGITSDEIAAIGITNQRET TIVWEKETGNPVYNAIVWQCRRTSDITDKLKADGYEDYIRQTTGLVVDPY FSGTKVKWILDNVEGARAKAERGELLFGTVDTWLVWKLTQGRVHVTDYTN ASRTMLFNIHTKQWDDKMLEILDIPRSMLPEVKNSSEVYGQTNIGGKGGV RIPVAGIAGDQQAALYGHLCVTAGQAKNTYGTGCFMLLHTGDKAITSKNG LLTTIACNAKGEPEYALEGSVFIAGASIQWLRDELKIVHDSYDSEYFATK VPSTNGVYVVPAFTGLGAPYWDPYARGAILGLSRGANRNHIVRATLESIA YQTRDVLEAMQSDSGEKLKYLRVDGGATANNFLMQFQADILDVNVERPVV KEVTALGAAYLAGLAVGFWKDLSELQDKARVERTFTPDNDNEKRERRYKG WKKAVRRALEWAKEDVE >MS0380 glpR, GlpR protein MKLNEKEQLIIDSLKRKDVITNIELSEILQCSTVTIRSLIRSLEKKGLII RTHGGAKLCNDYLDIHIPAGNIFKEREAKLRIAEKAYQYIAERDTIILDD SSNSYYLAQVIKKYSDKYLIIITNSLPVIAELSTCSAVEIISIGGVLRGN KNAFVGDFAIEMLKNFKATKAFIGVHGIDPEFGITSIGNEQMMIKKQIFK IAQYVYVLTCSEKFGTGYLLVSAPLSQVHKIITDKNIDKNILNVIKSSVD IDLV >MS2186 glpR, GlpR protein MKQSIRHQKIVELVKLQGYISTDELVTLLNVSPQTIRRDLNELAENNLIR 
RHHGGAASPSSAENSDYSERKLFFSLEKNHIAQAVSRLIPNGSSLFIDIG TTSEAVANALLGHQNLRIVTNNLNAAHILMKNDTFKITVAGGSLRQDGGI IGEATVNFISQFRLDYGILGISSIDLDGSLLDYDYHEVQVKRAIMESSRE TVLVTDHSKFSRQAIVKLASVTDVDYLFTDQEPPKSIMELIHNSSVELRV CK >MS0024 glpR, GlpR protein MVRSNIMNEQIRHNKLLTLLGENGFLSVQEIMTALNISPATARRDITKLN EQGRLKKLRNGAEAVIQSTFQPQKKQNEIKNLDEKQRIAALAASLCQNDS SAILTCGSTMLLLGNALCNRNVQIITNYLPLANQLIENDHERVVIMGGQY NKSQAITLSLSEHNEAFAADIMFTSGKGLTAQGLYKTDMVIASSEQRLLK RAQKLIVLVDSSKLDKTVGMLFTELKNIDLIITGQEADPDFIRTLREKGV DVMLA >MS0074 glpR, GlpR protein MSVDRQNAIKLFLRSHNMATVEQLVKITNSSPATIRRDLIKLDDAGIINR THGGVSLRDSFPYQPTTNEKQYQHVTEKENIADYVVSLISPGDSVLLDAG TTTLCIAKKLVNIPLRVITSDLHIALLLSEYKQIDIVMTGGAIDKSSQSC IGQHGLDLLQNINPDFAFVSCNSWSIERGITAPTEDKANLKKCLLQNSRR KVLVADSSKYGKCSLFKVIELNRLTDIITDHNLPQSAQKALNELDLSVAF A >MS0187 glpR, GlpR protein MKRNFQQRNTQQRRHGIMQLLQQKGEVSVEQLVQLFETSEVTIRKDLTAL ESNGFLLRRYGGAILMPQDLMDESQDENLSKQKLSIAKAAAERIRDHHRI IIDSGSTTAALIKQLNSKQGLVVMTNSLSVASELRSLENEPTLLMTGGTW DTRSESFQGKVAEQVLRSYDFDQLFIGADGIDLARGTTTFNELVELSRVM AEVSREVIVMVESQKIGRKMPNLELNWQQIDVLVTDDLLSEKDKAVIERH NIEVIIAK >MS1983 glpR, GlpR protein MIPAERQKMLLNLISQQDIVSISQLVETLGVSHMTVRRDIQKLEEEGKVV SVSGGVKMLEHLSIEPTHNDKSLLSPSQKSQIGIKASEIIPEKTTIYLDA GTTTLEIAHHIVDREDLLVITNDFVIANFLMKAGKCELIHTGGSVNKSNY SSVGELAAQFLRQISIDIAFISTSSWNLKGLTTPDENKLPVKRAILQSSN KRILVSDSSKYGKVATFQICPLSEFDVIICDSDLLENAKDAINEMRIELL LV >MS2316 glpR, GlpR protein MREKKVKPRERQSAIVEFLQINGKTAVEQLAQIFKTTGTTIRKDLTALEA EKKVLRAYGSVVLVNKDEIDLPEANKTNTNLEVKRRIGQKATEFIGDGDS LLMDSGTTVLQMVPYLAKYRDLTIMTNSLHIMNALTGLERDYELLITGGT YRQKSASFHGILAESTVEKFTFDKLFIGTNSFDLDYGLTTFNEVHGVSKS MCKAAREIIVLADSSKFQRRSPNVVCPLEKINTIVTDKNLDPAIHQALIE KNINVILV >MS2254 glpX, GlpX protein MNRSLSIEFSRVTEAAAISAHSWIGRGDKNAADEAAVKAMRYMLNRIHMD GEIVIGEGEIDDAPMLYIGEKVGSGMGEQVSIAVDPIDGTRMTAMGQSGA LAVLAAGGKNTFLKAPDMYMEKLVVSAEAKGMIDLNLPIEQNLRRVASRK GKLMSELVVMVLAKPRHNEIIKQIQSLGAKVLAIPDGDVAASVQVCLPDA EADVLYGIGGAPEGVITAAAVRALGGDMQARLLPRNEVKDDTPENQQIAQ EEMRRCQEMGVAVNQVLSLNELAHDDNLVFVATGITNGDLLKGIQIKGNF 
ATTETLMIRGQSHTIRRIQSMHYLDGKDPDLYKSIAL >MS2371 gltA, GltA protein MADKKATLTVDGKNYEFDIVKGSLGYESIDIHGLSQNKLFMYDPGLVSTA VCESAITYVDGDEGMLLYRGYPIDQLASNADYLEVSYLLLFGERPTKQQY QDFSKLVKRHTLVHEQLTKFFQGFRRDSHPMAVMCGVSGALAAFYHDSID VKKEEHRELTAIRLLAKIPTLAAMCYKYSIGQPFMFPQNNLSYAGNFLYM MFATPCEPYVVNPVLERALDKIFILHADHEQNASTSTVRIAASSGANPFA CIAAGIASLWGPSHGGANEACINMLEEIGTVDRIPEFIARAKDKNDPFRL MGFGHRVYKNYDPRAKVMRETCHEVLKELNIKNPLFDVASELERIALSDP YFIDHKLYPNVDFYSGIVLKAIGIPTSMFTVMFALARTVGWIAHWKEMYK QGNFKIARPRQIYTGYTERDFPAIDKD >MS0731 gltD, GltD protein MAKFFLAPADNYDVKIGELVDKFVNKVRSFPPGTCPLVVQYASLRSSMSQ TCGKCVPCRDGIPHLSFLLRDILAGEGDDSTMRQIRELAEMIRDGSDCAI GYQPAIEILDSIEEFKEEYESHIHNKSCQKVIGQRIPCINMCPAHVDIPG YIAHIGDGNYAEAINLIRKDNPLPTACGLVCEHPCEERCRRRLIDDAINI RGLKKYAVDQVAADVVKVPQALPDTGKKVAVIGGGPAGLTCAYFLAQMGH RVTIYERQKALGGMLRYGIPNYRFPKDRLDQDLNAILSAGRIEVKYGVMV GDDIAIEDIYNSHDAMFVGIGAQKGKTLRIKGSEANNVFSAVEMLDDIGN GKIPDYTDKVVVVIGGGNVAMDAARSAVRCKAKDVRIVYRRRQDDMTALH AEIEAAIMEGIELITLAAPVAIEKDEQGNCTGLTVQPQMTGPYDHGGRPS PVAVKKPPFTIGCDVILIAVGQDIISLPFEEFGMPANRGIFQADLTTAVP DMDGVFVGGDCATGPATAIKAIAAGKVAAHNIDEYLGYHHEFPCETKAPP PKENVRIQVGRANTTERPAYIRKCDFEHVENPYTYEEAMQEAERCLRCDH FGCGVLQGGRDL >MS0030 gltS, GltS protein MTFDTYETLALACLVLLLGYFLVKRVKLLSNFNIPEPVVGGFIVAIVLTV VHEIWGLSFSFDSNLQRTMMLVFFSSIGLSANFARLIKGGKPLVMFLVVA AMLIAIQDTVGIFGSMALGLDPAYGLIAGSVTLTGGHGTGAAWAETLTND FGISGAMELAMACATFGLVFGGIIGGPVARFLLTRLHKEEVPEDENVDDV QEVFEKPVYRRKVNSRAIIETISMMAVCLLVGQFLDELAKGTAFQLPTFV WCLFTGVILRNTLTLVFKFTAPDQTIDVLGTVGLSIFLAIALMSLKLWEL AGLALPVFVILTLQVVVMATFAILVTYRVMGSDYDAVVLSAGHCGFGLGA TPTAVANMQAVTAHFGHSHKAFLIVPMVGAFFIDLLNASLLKFFVEVAAY FH >MS1295 glyA, GlyA protein MLQNHSIAEFDPVLWDAIQNENRRQEEHIELIASENYVTKAVMEAQGSQL TNKYAEGYPGKRYYGGCEYVDIVEQLAIDRAKELFGADYANVQPHSGSQA NAAVYGALLNAGDTILGMDLAHGGHLTHGAKVSFSGKIYNSVLYGITAEG LIDYEDVRVKALESKPKMIVAGFSAYSQVVDWAKMREIADEVGAYLFVDM AHVAGLIAAGLYPNPLPHAHVVTTTTHKTLAGPRGGLILSACGDEEIYKK LNSSVFPANQGGPLMHVIAAKAVCFKEALQPEFKAYQAQVLKNAKAMVEV FKQRGFEVVSKGTENHLFLVSFVKQGLTGKAADAALGEANITVNKNSVPN 
DPQKPFITSGIRVGSPSITRRGFNEADASTLAGWMCDVLESIGKDNYDQV IAETRAKVLEICKRLPVYGD >MS1953 glyQ, GlyQ protein MSAKFNVKTFQGMILALQDYWAQQGCTIVQPFDMEVGAGTSHPMTALRAL GPEPMAFAYVQPSRRPTDGRYGENPNRLQHYYQFQVVIKPSPDNIQELYL GSLKMLGFDPTQHDIRFVEDNWENPTLGAWGLGWEVWLNGMEVTQFTYFQ QVGGLECKPVTGEVTYGLERLAMYIQGVDSVYDLVYSDGPLGKTTYGDVF HQNEVEQSTYNFEYADVDFLFECFNKYEQEAKFLLKQEPRMENDKEIWVE TALPLPAYERILKAAHSFNLLDARKAISVTERQRYILRIRALTKGVAEAY YASREALGFPGCK >MS1956 glyS, GlyS protein MTTQNFLAEIGTEELPPKALKKLATAFAENVENELNQAGLTFEKVQWFAA PRRLAVKVLNLATSQPTKEIEKRGPAVSAAFDAEGKPTKAAEGWARGCGI TVEQAERLATDKGEWLVHRATIEGQPTKNLMLDIVTRSLANLPIPKMMRW GDKTEQFVRPVHTVSLLLGGELIEGEILGIASGRTIRGHRFLGEAEFQIA HADEYPQILKDKGSVIADFNERRAIILADSQAKASALGGVADIEDDLLDE VTSLVEFPNVLTATFEERFLAVPAEALVYTMKGDQKYFPIYDKNGKLLPH FIFVSNINPTDPTPIIEGNEKVVRPRLSDAEFFFNTDKKQRLEDLLPRLE TVLFQQQLGTLLDKTKRIQALAGEIATQIGADKAKAERAGLLSKCDLMTN MVFEFTDTQGVMGMHYARHDGEDEEVAVALNEQYMPRFAGDNLPNSLVAS SVALADKFDTLTGIFGIGQAPKGSADPFALRRAALGALRIIVEKNLPLDL AEIVKKSTALFADRLTNQNVVDDVVDFMLGRFRAWYQDEGIAVDVIQAVL ARRPTKPADFDARVRAVSHFRTLDSAEALAAANKRVSNILAKIEGEISSK IDRTLLLEPEEKALAEQVLALQSELAPLFAKGEYQPALDRLAGLREVIDN FFDKVMVNAEDEKLRQNRQAILNTLRNLFLQVADISLLQ >MS0218 gmhA, GmhA protein MILADSFKQGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISD VSHLSCVSNDFGYDYVFSRYVEAVGKEGDVLFGLSTSGNSKNVLNAIEAA KAKGMKVIAMTGKDGGKMAGLADVEIRVPHFRYADRIQEVHIKVIHILMM LIEFEMAKAA >MS1290 gmhA, GmhA protein MFNGLKILLNNMLEKIKDLYTENIQTQISASRLLPETIVEATTKLVSCLL RGNKIIVCGHGRSYANAQFLVANLLNRYELERPSFPSVLLTIDSAVGSAI VSDNHITTLYQRQFNAIAQQGDILVALVPNSGDESIINVINCATNKDVEI IALTGANDDHLQGLISENDLEVQTPAIKESRILEGHLFIINALCELIDHT LFTQSG >MS1738 gmk, Gmk protein MSQGNLYILSAPSGAGKSSLISALLEQDQANTMMVSVSHTTRQPRPGEEN GVHYHFVSVEEFELLINEGAFLEYAKVFGGNYYGTSLPTIEKNLAQGIDV FLDIDWQGAQQIRKKVPSVKSIFILPPSLAELEKRLIGRGQDSAEVIADR MSKAMDEISHYNEYDYVIINDDFTRALADLVHILRAEKLTLAYQTEQNQA LINQLLAK >MS0013 gnd, Gnd protein MSTKGDIGVIGLAVMGQNLILNMNDNGFKVVAFNRTTTKVDEFLQGAAKG TNIIGAYSLEDLAAKLEKPRKVMLMVRAGDVVDQFIDALLPHLEQGDIII DGGNSNYPDTNRRTKALAEKGIRFIGTGVSGGEEGARHGPSIMPGGNPEA 
WPYVKPILQAISAKTDKGEPCCDWVGAEGAGHFVKMVHNGIEYGDMQLIC EAYQFLKEGLGLSYEEMHEIFQQWKQTELDSYLVDITTDILAYKDTDGQP LVEKILDTAGQKGTGKWTGINALDFGIPLTLITESVFARCVSSFKEQRVA AAKLFNKTVSPVEGDKKVWIEAVRKALLASKIISYAQGFMLIREASEQFG WNINYGATALLWREGCIIRSAFLGNIRDAYETNPDLVFLGSDPYFKGILQ NALADWRKVVAKSIEAGIPMPCMASAITFLDGYTSERVPANLLQAQRDYF GAHTYERTDKPRGEFFHTNWTGRGGNTASTTYDV >MS0957 gntK, GntK protein MTQGKSFILMGVSSTGKTSVGTEVAHRLGLKLIDGDDLHPRANIIKMGEG KPLNDEDRAPWLERIRDAAFSLEQKSEVGVIICSALKKKYRDLIRQGNER VKFLFLYGSYELILERMRQRKGHYMKEEMLKSQFDTLEVPQADEADVIHI DIDGSFEEVVQRCITALKPYL >MS0524 gntR, GntR protein MFFIKNKDNMSRDLNLRQDIINQMIDDISSDLLTSPLPSLSALATLYNVS RTTIRHAITYLTEQKIINRIDAQLIITKKPSADDKITYIKIKKPGNNQIK KLEKYFSSAVQQKIIKPGDDFTELELAKNANVDIFTVREYLIQFSRFNLI SHISAGKWRLTKLTQHYADKLFELREMLECHALNCFMNLPKNDIRWKQMK LLLQEHRILRNNIVEKYVDFSLLDQQLHSLILSAADNPFINDFINLISVI FHFHYQWDNSNLRTRNILAVEEHLAILVKIVSQDDLGAITELKRHLQTAK NGLMNSIRLMNN >MS0688 gntT, GntT protein METAASMSQMLIGLAIGIALLLILAMKTRIHVFVALILASLTTGLIGGLP FAEVISSVTKGFGSTLGSTGIIIGLGVMMGAILEKSGAAEQMAFSIIKLI GKAKEEWALALTGYVVAIPVFADSGLIILTPLARSLSRMTGKSVIGLGLA MATGLQLAHVFIPPTPGPLAVAGILDIDMGMMIIWGMILTVPTLVMSTLY AKWLGKKIYQIPNEDGTDFERKEFKEEYIKSIENVEQIYKDKNLPGAGLS FSPIVIPLILILGNTTVNFLKIENGFADLLKIVGHPIIALIIGLLIALYG LGRRLSKAETNKAIEDGVKSTGMILFITGAGGALGYVVRDAGIGNALGEA VLTVGIPGILIPFVIAALMRIALGSATVALITAATLAAPLVPQLGLNPTL VAMSTCAGAVSFSYFNDSGFWVFNGLYGLKEVKDQFMAKTMVSFIGAFSC LALVLIFNIFM >MS0954 gntT, GntT protein MLIFIMIASVALLLLLIMKFKVHAFVALTIVSLLTALATGIPINKILPTL LNGFGNTLASVALLVGLGAMIGRLLEITGGAKVLADTLINKFGEQKAPLA LGIASLLFGFPIFFDAGLVVMLPIIFSVAKQFGGSLIRYAFPAAGAFAVM HAFSVPHPGPVAAGDLLGANIGLLTIIGLICAIPTWYIATYLFGLHLGKK YHLDLPKAFLNAMPINETAVLTPPSFKKVILILLLPLGINYAGYGVKYFS RCKSN >MS1977 gntT, GntT protein MSLKIAAILLALLYQEYCMSNEMLILIGIVSVIALLLIMIKGKVHPFVAL SLVSIAVALSSGIPMGKVVPTLISGMGGTLGSVALIVGLGAMLGKIIEKS NGADVLASWLLDKFGEKRAPFALAMTGFIFGIPVFVDVGFIVLIPIIFSV ARRIGGNMLVYALPIGLSMLTVHVLMPPHPGVVAGAQVLNADIGLVLGLG FIAALPAVLIGQTFIPLFTKNNFVAIPASSDLLEYQKQVSKNVDGLPKFA 
TVLAMIVFPLLLIMSGTVSATVLPKESIVREFFSMVGASPFALLLAVCVS SYILGIRRGWRKEQLEEILNSALAPIAGIILITGAGGMFGKVLNESGVGN ALADVLSSTGLPILALSFILAAMLRAAQGSATVAVITTATILAPAVTSAG YSDIQTALVTAAIGAGSMTLSHVNDSLFWVWTKFFGITITQGLRTWSILS TIYGSLAFLIVTLMWMFA >MS0953 gntT, GntT protein MLDTVLNTLAVAKVIDGSQLWVETLRLLGKTPIALLITLIVSIVLLKNQR SYEQIEKICDSSLGPICAIVLVTGAGGMFGGVLRASGIGEVLASTLGHTG MPVIVAAFIISSALRVAQGSATVALTTTAALISPMVAADPSLSQMDLCFI VISIASGATVLAHVNDSGFWLVSRFLEIDTKTMLKTWTVQETLIGIVGFI IAYVGSIIF >MS0335 gntT, GntT protein MIMSITVAFIIGVAVLLFLALKLKVSAFLSLLATALTIGILSGMGTTEII KDIVAGFSKSVGSIGLVIIFGTMLGNYLEQSRAAHKMALDAVRLVGTKNS SIAMSISGYLISIPVFSDVGFLILSPLIKAISKKSKIPLAALAVALSAGL LATHVYVPPTPGPLAAAGLLGIDIGRAIIWGAFAAVVMTLFGWMYAHFYL MKKSPDYYTFVETVVEEKEVDETNLPGSLASLMPLLLPIVLILLNTTCAA IFPKDSPVLSVTKFIGDSNIALVIGALTAIALLGKRIGKEKVLKIMDSSL KDAGSIIFITAAGGALGQILKTSGAGDSLAQAVVSSGLPFILIPFVISAI LKIVQGSGVVAVITSATLAAPIATQLGIDPILIFLASGAGARAYCHVNDS YFWVYTNCCGFDMKTGLKTLSNASIFMSLGGLLATFIASLII >MS0686 gntT, GntT protein MSGISLIISFIIAIIIMIWMISKLKVHPFLSLMTISLALALVAGIELNKI PGMIGDGFSSTFKSIGIVIIFGAIIGTILEKTGAALKLADMVVKLVGQKH PELAMLIMGAIVGIPVFCDSGFVVLNPIREALYKKIAANPVATAVALSGG LYASHVFIPPTPGPIAAAGALGLESNLLLVIIMGVVVSIPVLTAVYFFAG YIGKRVTLDEEAQADAAIVKNYEQLLKQYGILPGKFLSLAPILMPIVFMA LGSIAKIAEIGGNTGIIIQFLGTPIIALAIGVIFSVFLLLQTKKITEFND LTNETLKIVGPILFITAAGGVLGKVITEAGFVDYIKQNAHIISTTGIFFP FIISAVLKTAQGSSTVAIITTASIMGMYSAGDSLMSVLGLTSEIAAALCV MAIAAGAMCVSHANDSYFWVVTNFGKMTAQQGYKTQTLMTFIMGIVGIIT VYILSLLLL >MS2331 gph, Gph protein MNSQFKLIGFDLDGTLVNSLPDLALSVNSALAEFELPQAPEELVLTWIGN GADILIGRALDWAKEQSGKSLTDEQTAQLKERFSFYYAENLCNVSRLYPN VKETLETLKEQGFILAVVTNKPTRHVQPVLKAFAIDHLFSETLGGQSLPA IKPHPAPLYYLCGKFGLYPHQILFVGDSRNDILAAHSAGCTAVGLTYGYN YNMPIADSHPDWIFEDFADLLKIV >MS2321 gpmA, GpmA protein MELVFIRHGLSEWNALNLFTGWRDVNLSEKGVEEAKEAGRKLKAAGFEFD IAFTSVLTRAIKTCNLVLEESNQLWVPQIKTWRLNERHYGGLQGLNKAEA AAEHGDEQVRIWRRSYDVLPPVLDPKDPNSAHNDRRYAHLPADVVPDCEN LKVTLERVLPFWEDQIAPAIKAGKRVLVAAHGNSLRALAKHIEGISDADI MDLEIPTGQPLVYTLDDNLKVVSKRYL >MS1172 gpmB, GpmB protein 
MKKDLRLYLIRHGRTVWNEQGLMQGWGNSALTEQGVKGAQLTGQALAEVP FIAAYSSCLQRTIDTANYILGERSVPLFQHIGLNEQFFGSWEGTNVETIR QTAEFQQMVNDPKNYQASSNGGETWQQVAERAMKAMQDIIDVHHRGDILI VSHGHTLRLLLALFAGATWQNHREQGKSVAMLNTAINMVRYVQHDEDQAG KFIIERLNDAAHLG >MS0287 gppA, GppA protein MNNENLRAKATALNNVAKHEMREVREIAAIDLGSNSFHMIVARIVNGSIQ VLSRLKRKVRLAAGLDENGVLDQAAISRGVDCLALFAERLQGFKAENVNV VGTYTLRSAVNNQEFLRQAQAVFPYPIRIISGEAEAEMIYAGVSHTQPEQ ARKLVIDIGGGSTEMIIGEGFTPLLVNSRNMGCVSFAKQFFVNGEISEQN FNRARQTALERVRDLSEQYRQLGWKHVLGSSGTIKTVHQVIMANIDNDGI ITAGRLDHLIERTLKATHFDNLKLSGLIEERADVFVPGLAILSAVFDAFD IQQMRYSDGALREGVMYSLETNFQVTNIRERTAEGLAEQFNIDREQAHRV TQTAVLLAQQFTGWQSPEQAEELQEILLWAALLHEVGIVINHKNLQKHSA YILQNIELPGFDKEQQRLLATLVRYHINNFRLEDISAGRYEIQDVLSLIR LLRLAIALNKSRQATESTEEISLKTDRISSLWTLTFEPNYLRDNPLVEND LAAEQLQLKDIGINFKFA >MS2213 gpsA, GpsA protein MSIQASPVTILGAGSYGTALAIALSRNGYPTYLWGHNPTACAQMAQERQN ARFLPDISFPEALRVESDLKSAVEKSKDLLIVVPSHVFGEVIQQIKPFLH NRHRIIWATKGLERGTGRLLQNLVEQELGSQYPLAVLSGPTFAKELAAGL PTAITLAAENEQFAKEFQARIHCSKHFRVYINNDMVGVQLGGAIKNVIAI SAGMSDGMGFGANARTALITRGIAEISRLGVSLGANVNTFMGMSGLGDLV LTCTDNQSRNRRFGMMLGQGVDARTAMDEIGQVVEGYYNTKEAYMLAQKQ GIEMPITEQIYQVLFCGKDAKEAATALLGRKSKVE >MS0961 greA, GreA protein MKQIPMTVRGAELLKQELDFLKTTRRPEIIKAIAEAREHGDLKENAEYHA AREQQGFCEGRIQEIESKLSNCQIIDVTKLPNNGKVIFGATVVLVNTEND DEVTYQIVGDDEADIKSGLISVNSPIARGLIGKEVDETVSIVVPGGKVEF DIIEVNYI >MS2208 greA, GreA protein MAKSNYITRAGWNVLDQELKYLWKDERPKVTQAVSDAAAMGDRSENAEYI YGKRRLREIDRRVRFLSKRLEVLQIVDYNPKQEGKVFFGAWIELENESGE IKQYRIVGCDEFDPAKNWISIDSPVARALIGKQIDDEVRVETPAGKVLLY VNNIWYEK >MS0459 groL, GroL protein MAAKDVKFGNDARVKMLAGVNVLADAVKVTLGPKGRNVVLDKSFGAPTIT KDGVSVAREIELEDKFENMGAQMVKEVASKANDAAGDGTTTATVLAQAIV NEGLKAVAAGMNPMDLKRGIDKAVAAVVTELKALSKPCETSKEIEQVGTI SANSDSIVGQLIAQAMEKVGKEGVITVEDGTGLEDELDVVEGMQFDRGYL SPYFINKPETATVELDSPFILLVDKKISNIRELLPVLEAVAKAGKPLLII AEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIAILTAGTV ISEEIGMELEKATLEDLGQAKRVVINKDNTTIIDGIGDEAQIKGRVAQIR QQIEESTSDYDKEKLQERVAKLAGGVAVIKVGAATEVEMKEKKDRVEDAL 
HATRAAVEEGIVAGGGVALIRAASKAAASLQGDNEEQNVGIKLALRAMES PLRQIVANAGEEASVVASAVKNGEGNFGYNAGTEQYGDMIAMGILDPTKV TRSALQFAASIAGLMITTEAMVTELPKDDKLDAAAAMGGMGGMGGMM >MS0458 groS, GroS protein MAVGKGRVLENGTVQPLDVKVGDTVIFNEGYGVKAEKIDGEEVLIISESD ILAIVE >MS0743 grpE, GrpE protein MEKIMSEQEKNQENLENAEELTQKANDTENSAEQAEPADETASDALEEAI ARVQELEEQLAETAKKEQDLLLRSRAELDNMRRRAEQDVEKAHKFALEKF SKDILNTIDNLERALATPANKEDEAVKSLFDGVELTLKELLATVARFGVE PVGAVGETFNPELHQAISMQSAEGFETNQITVVLQKGYLLNGRVIRPAMV MVAA >MS1052 grxB, GrxB protein MKLYVYEHCPFCVRARMIFGLKNLPFEQEVLSNDDEATPTSLVGKKVVPI LVKDDGTAMPESLDIVKYVDENFGDKLLTEQIRPELEVQLKQIGSYYNHL LLPRFVKLGLAEYNTQSALNYFIQKKTKSIGDFAENLANTPQYLDKLNRD LTLLDNLILAQDKVNGEQLSVEDIILFPMLRNLTCVKGVIFPTRVKNYVD CMAKMSKIDLFYGNAV >MS0755 grxC, GrxC protein MFVVIFGRPGCPYCVRAKNLAEKLKNSLDDFDYRYVDIIAEGISKADLSK SVGKEVETVPQIFIDEKPIGGCTDFEALMKEQFNIVA >MS1683 gshA, GshA protein MRFDQGNLMNIQQIVKEKGLGLLFRQGTVGIEKESQRVHADGSIVTSEHP KAFGNRSYHPYIQTDFAESQLELITPPNKKIEDTLRWLSALHEVTLRTID ENEYIFPMSMPAGLPPEQEIRVAQLDNAADVAYREHLVASYGKAKQMVSG IHYNFQLDPKLVETLFNAQTDYKSAVDFQNNLYLKMAKNFLRYQWIPLYL LSATPTVEANYFKDGSPLKPNQYVRSLRSSKYGYVNAPDIIVSFDSIEKY VETLEHWVNSGRLIAEKEFYSNVRLRGAKKAREFLHTGIQYLEFRLFDLN PFEAYGINLKDAKFIHHFILLMIWLEETADQDAVELGRARLGEVAFEDPH SETAYRDEGEQIINQLIDMLKAIGAEQSAVEFAEEKLAQFANPGQTLCAR LVDAIEQAGGYQQLGGEIAKRNKVQAFERFYALSAFDNMELSTQALMFDA IQKGLNMEILDENDQFLRLQFGDHFEYVKNGNMTSHDSYISPLIMENKVV TKKVLAKAGFNVPQSLEFTSVEQAVASYPLFEGKAVVIKPKSTNFGLGIS IFQQGVHDKADFAKAVEIAFREDKEVMVEDYLVGTEYRFFVLGNETLAVL LRVPANVMGDGVHTVAELVAAKNDHPLRGDGSRTPLKKIALGEIEQLQLK EQGLTVDSVPAKDQLVQLRANSNISTGGDSIDMTDEMHPSYKDLAVGITK AMGAAVCGVDLIIPDLKKPAEPNLSSWGVIEANFNPMMMMHIFPYSGKSR RLTLNVLGMLFPELV >MS0671 gsp, Gsp protein MSEISPNIPTHDAFGSLLGYAPGGIAIYSSDYETADKNEYPDDAAFRSYL GREYMGYKWQCVEFARRYLYLNHGMVFTDVGMAYEIFSLRFLRQVVNDAL VPLQAYANGSKKSPEPGALLIWQEGGEFQETGHVAIITEVFNDKIRIAEQ NVIHYRLPSGQQWTRELPMSVTEQGYILHDTFDDTEILGWMIQTDDSTYS LPQPTAAPESLEIHAEHIENKGQFDGKWLNESDPFEKLYVTAMNGHQVSR TDQYRYFTISETAKHELIRATNELHLMYLHATNKVLNDDNLLKYFNIPKL 
LWPRLRLSWENRRYQTVSGRLDFCLDERGLKVYEYNADSASCHAEAGAIL GRWAKVAGLDNGEDPGAHLRNALADCWKHRDNTPLVHIMQDNDSEEDYHS MFMQSALLQAGCRTKIIHGTEGLHWDKRGRLLDDEDNQILSVWKTWAWET MLEQLREDATGREVAPPIRTGYPEDKVRLIDVLLRPEVLVYEPLWTAIPS NKAILPVLWSLFPNHRYLLESGFELTQNLIKNGYAKKPIAGRRGDNVTLF ADQHSRLDVTHGRFGKQEHIYQQLWCLPKVEEQYVQICTFTVGGHYGGSC LRSDPSRIIVGDSDMQPLRVLNDKDFLAK >MS1970 gspD, GspD protein MRAGRVDENKILKEYLPMESIISFGKKCGLFFGIFISSAFAGESGTFAER QFSIHLKKAPLVATLQQLALEQNANLVIDDELEGTLSLKLEKVNLERLFH SVAKLKNLSLHKDKDIYYFTKNNLIEPSSIAGELKNTENFTALSEPNLVS TTVKLHFAKASEVMKSLTSGTGSLLSPVGSVSFDERSNQLLIQDERRSLQ NIKNIIAQLDKPIEQIAIEARIVTMNDESLKELGVRWGLLEGVNSAHRIA GSLEANGFADIGQNLNVNFPTSATPAGSVALQVAKIHGRLLDLELTALEQ ENNVKIIASPRLLTTNKKSASIKQGTEIPYVAVNRKNDTEHVEFREAVLG LEVTPHISKDNSILLDLIVSQNSPGANIVYGNGNLISIDKQEINTQVFAK DGETIVLGGVFNDTITKSEDKVPILGDIPLIKHLFSKESEKHQKRELVIF VTPHILKQGESLEQIQKRFKYAPKSPEK >MS1779 gspD, GspD protein MTAALTAFMLVCAAPIFAKPMYLEQGTSKYIELDKKIDTIFVSSSEVADY EIVDDYSFMVYGKQEGTTDVTAFDANGNILYTDTLNVNALINNIVDTNKQ IKARFPNSNLQVKKLGKAYVIEGKANTQHESEEVNRIVGEAIGVAPKVTE TTLKRGNGMSDEKIPFLDKYEYNGVINNTNIDKTKQINVKLTVAEVNKNF SDSLGIKWEHLSGSVLENWTSGANGYSGGFDGTTGSIALINANRLSAFIT AVNNANNGKILAEPNISMLSGETADILVGGEVPFAQRDSDGNTSIIYKDF GVKLMVGAKVQKNDQVRIVLAQEVSTLAGNYTYTSVGDIPYFQTRRAKST FEVGNGESFIIGGLLSSSDLEGVSKVPLFGDIPILGAFFRSVTTSRETKE LVVVATVNLVTPNDAEKVIYPSFEQTGTLERFFNLTPFKNVYHKTLTTNF MKNGGFIQ >MS0363 gspE, GspE protein MIIKNSDVRLMQSIKIIAGNGEQYMIDHELWQRNQQQQHVLLRYLAVPIK EEEQKLWLAIDNVENIAACETFSFLTGKIVEPVLVSNETLKSLLQPDEPQ SLSIEETSLIYTESLSQNKENKNTDEPIIRLLNNIFESALSKNASDIHFE PQKNQLRIRFRIDGVLQQQTPVNLSLAGRIISRLKLLAKLDISETRLPQD GRFDFKTTFAETLDFRISTLATSNGEKIVLRLQQNKPVDFSFEQLGMEPA QQQKLEQALNQPQGLILVTGPTGSGKSITLYTALQWLNSASKHIMTAEDP IEIELEGIIQTQIQPQIGLSFSRLLRTFLRQDPDVIMLGEIRDDESAQMA LRAAQTGHLVLSTLHTNNAYSAISRLLQLGIKQHEIDNSLLLIIAQRLVR KRCQKCGQFSENFINCDCHQGYRGRIGIYQFLHPRWQAQKWQYVTDFPSL YQAAKNKVQQQITDKQELLRVLGSEK >MS1617 gst, Gst protein MTALFYSFIMQPKFLINGNFMIILYALTQSRAYRIAWLLEILNLPYKLEI IERDGETNLAPDALRSIHPLGKSPIIKDGDLVLTESGAIVEYLINRYGGG 
KLKPEMNSTDYWQYLHWMHYAEGSLMPLLVIKLIFRKIDEADMPFIAKPI ANKITEKVKQGFIQPQLKLHLDYIESQLAEKFWLVGDELTGADIMMSFPL QAAVSYFETNQYPHISAYVSRLNHTESFKRAEQKLGPLTFF >MS2100 gst, Gst protein MVTLHYLKQSCSHRIVWLLEALSLDYELKIYDRDPQTLMAPAELKAQHPL GKAPVLQDGDLVLAEGNAIIQHLLDRYDDENRFTPAHKTGAYSNYVYWLA VSASMFSANFLALLSTRSDLGDFAQYATAQTPLFFNHVEQTLEGKQWIVG EQLTGADFALSFPLQWGMKYVDEADYPNIVRYLAQIENHPAYVKANEKTA GELDLSKF >MS1281 gst, Gst protein MTSAANKRSIMTLFSDKSDIYCHQVRIVLAEKGVAYETEIVDPQALSEDL MELNPYGTLPTLVDRDLVLFNSRIIMEYLDERFPHPPLMPVYPVARGKTR LLMLRIEQDWYPALEKAEKGTEEERATALKQLKEEMLAIAPIFTQTPYFM SEEFSLVDCYIAPLLWRMQELGVEFGGAGAKAIKGYMAKVFERESFVQSL GNNAPKNLMDEK >MS2085 gst, Gst protein MKLYYLPGSCATVPYVALEWIGEPYEAQAVTHDYIKSAEYLVLNPQGQVP LLVDNDLVLTQNIAILTYLDNLFPEKKIFGSKTARDKAKAMKWLAFFNGD LHKAFVPLFRVPAYAEGNEELTNEIRKDAAANVIRMLSIADEYLTRHIHF GEQISVADVYLFVELRWCKMLGLDLSQFANLQAFYQRIAADVGVKTVLIK QGISE >MS0257 gst, Gst protein MKLWYSTTSPFARKVLVTLKHQQLEDKTDLLRITSSFDPDSPHNQVNPLG RIPALQRNCGNWLFGSLLICEYLDQKGACPKLIPESGKPRWAVLALHNLV DGIMENTMPMVAEKMLRPENEWWTSRHQQLMDRNVRSFTQLEQALLPFGT ELNIGTITAVCLIDWWIFRADKIGYDLAAHFPHLVTWAEDMNNKYAILAA TKPGI >MS0772 guaA, GuaA protein MNNIHNHKILILDFGSQYTQLIARRVREIGVYCELWAWDVTEQQIREFNP TGIILSGGPESTTEDNSPRAPEYVFNAGVPVLGICYGMQTMAMQLGGLTE PSSHREFGYASVSLENSTALFAQLNDDLNSSLPKLDVWMSHGDKVTRLPE GFQLTGTTSTCPIAAMSDESRHFYGVQFHPEVTHTKSGLALLTNFVVNIC GCTTNWTPENIIEDAVARIKAQVGDDEVILGLSGGVDSSVTALLLHRAIG KNLHCVFVDNGLLRLNEGDQVMEMFGDKFGLNIIRVNAEDRFLDALKGID EPESKRKMIGKVFVDVFDEESHKQTSVKWLAQGTIYPDVIESAASKTGKA HVIKSHHNVGGLPDYMKLGLVEPLRELFKDEVRKIGLALGLPAEMLNRHP FPGPGLGVRVLGEIKKEYCDLLRKADAIFIEELYNSGWYYKVSQAFTVFL PVKSVGVMGDGRKYDWVVSLRAVETIDFMTAHWAHLPYDLLGKISNRIIN EVDGISRVVYDVSGKPPATIEWE >MS0774 guaB, GuaB protein MLRIKQEALTFDDVLLVPAHSTVLPNTANLSTQLTKEIRLNIPMLSAAMD TVTETKLAISLAQEGGIGFIHKNMSIERQADRVRKVKKFESGVVSEPVTV FPELSLGELAQLVKKNGFAGYPVIDQNDNLVGIITARDTRFVKDLNKTVA EVMTPKEKLVTVKEGAKREDIIALMHSHRVEKVLVVDDNFKLKGMITVKD FQKAEQKPNACKDELGRLRVGAAVGAGPGNEERIDALVKAGVDVLLIDSS HGHSEGVLQRVRETRAKYPNLPIVAGNIATAEGAIALADAGASAVKVGIG 
PGSICTTRIVTGVGVPQITAISDAAAALEGRGIPVIADGGIRFSGDIAKA IAAGASCVMVGSMFAGTEEAPGEIELYQGRSYKSYRGMGSLSAMSQGSSD RYFQSDNAADKLVPEGIEGRIAYKGLLKDIIHQQMGGLRSCMGLTGSATI EDLRTKSQFVRISGAGIKESHVHDVTITKEAPNYRLG >MS1486 gumC, GumC protein MSKKQNDVIDLTKLLGLFWDQKRIILLSTLLCAGLGLVYSLLAPSIYMAT SSVQVEEKYTGGALQGLSSIFEQESTAGTEIAVIKSRAIVSKAVEDLNLT TEVSPVYPIPFFSKAVEKLMGDKPEITVARFVPKREDAQEYTLVIGSNEN EYSVLDEQKQLVLNGVVGEKYDNQDIEILVSQLKGSSGKRFSLKKMEKSD VLELVEALQKAVTADEKGKQTGVIELTFKGEDPEYIQKVLHSITQSYLEH STARNSAEASNSLSFLQKRLPEVRDRLTKSENELNEYRQKKASVDLELEA KSVLDTLVQLDSNLNALTIRESEISQRFTKRHPNYVALLEQRQVLLDEKA RLTKQLESLPETQKDTVRLTRNFEVDQQIYTQLSNKIQELDVVKAGAVGN VRILDEAQTLPKPVAPRKLIILVLTAIVGFLLGSGGVILKSILQNGILTV SEVSETGLVTYASVPFSKKQSALSRSKGGNRIGEGLLSDRYADDFSLESL RSLRTGLNFMLAESNKRVVLLSGVSTGVGRHFITANLADLLAKADKKVLL IDADLRNSHLHHILGVENNMGLSELLAQNIPFEQGVRHLDSRFDLITCGS RSDAPSELLSVSRCKQLLDWAAQHYDTVLVTAPPILSVTDAAIVGQHADI TLLIGRFEQTSVSEIEASRERFDNAGVEIKGFVLNGVKPRAVNKGDYFRN EYA >MS0996 gutQ, GutQ protein MDYLQNARETLATEKDALTLLSRNLDQSFNNVIDLILNCGGRLVIGGIGK SGLIGRKMVATFASTGTPSFFLHPTEAFHGDLGMLKPIDIVMLISYSGES DDVNKLIPSLKNFGNTIIALTGNKHSTLAKHADYVLDISVEREACPNNLA PTTSALVTLALGDALAVALINARHFQPMDFAKFHPGGSLGRRLLCRVKDQ MQTNLPVTALNTSFTDCLTIMNEGRMGVALVMENDDLKGIITDGDIRRAL AANGADTLNKVARELMTSNPKVINQDTYIGQAEDYMKEHRIHSLIVVDND NKVVGLVEFSS >MS0858 gyrA, GyrA protein MTELVQDITPVSIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLF SMNQSGNTYNKSYVKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY MLVDGQGNFGSIDGDAPAAMRYTEVRMQRITQELLTDLDKETVDFSPNYD GKEMIPDVLPTKIPSLLVNGSSGIAVGMATNIPPHNLGEVMDGCLAYMDN EDISIDELMQFIPGPDFPTAALINGRRGIEEAYKTGRGKVYVRAKASVEI NDKGREQIIITEIPYQVNKAKLVEKIGELVRDKKIEGIAGVLDLSNKEGI RLEIDIKRDAVGEVVLNHLYALTQMQVTFGINMVALDHGQPRLFNLKQII EAFVKHRREVVTRRTVYELRKARERAHILEGLAIALANIDPVIELIRASK TADEARENLLSRAWSLGNVAPMLEAAGVDASRPDGLAAELGAHDGQYFLS ETQARAILELRLHRLTGLEHEKIVEEYHEILLQIGELIRILTSSVRLNEV IREELELVKSTYNDERRTEITAASGDINLEDLIAQEDVVVTLSHEGYVKY QPLTDYEAQRRGGKGKSATKMKEDDFIERLLVANTHDTILCFSSRGRLYW LKVYQLPEASRGARGRPIVNILPLEDNERITAILPVASYDEDKFVVMATA 
CGIVKKTALTEFSRPRANGIIAVNLRDEDELIGVDITDGSNEIMLFSSQG RVVRFAEAAVRAMGRTATGVRGIKLALTNDISDDESAVEIEEISDDNAED TLDLNIDKVVSLVIPKNEGAILTATQNGYGKRTALNEYPTKSRNTKGVIS IKVSERNGKVVAATQVEETDQIMLITDAGTLVRTRVSEVSIVGRNTQGVR LIRTAEDEHVVSLERVAEPEEDEFDAESPETAVENSEE >MS0875 gyrA, GyrA protein MSEINYEGIEQMPLRTFTESAYLNYSMYVIMDRALPFIGDGLKPVQRRIV YAMSELGLNATAKYKKSARTVGDVLGKFHPHGDTACYEAMVLMAQPFSYR YPLVDGQGNWGAPDDPKSFAAMRYTESRLSKFAELLLGELGQGTVDYQPN FDGTILEPQYLPARLPHILLNGTTGIAVGMATDIPPHNLNEIADAAVMLL DNPKATLDDILTLVQGPDFPTEAEIISPKEEIRKIYENGRGSVKMRAVWK KEDGEIIITALPHQASPSKVIAQIAEQMTAKKLPMVEDIRDEADHENPVR IVLVPRSNRVDSEALMAHLFATTDLEKNYRVNMNMIGLDNKPAVKNLLQI LTEWLSFRRSTVTRRLQYRLDKVLSRLHILQGLMIAYLNIDEVIHIIRNE DEPKPVLMARFELSDEQAEAILNLRLRHLAKLEEHELQAEKDQLEQERAQ LEQILSSERRLNTLIKKEIQQDAKTYASPRRSPIVERAEAKAISESEMIP AEPVTVILSEMGWVRCAKGHDIDPQGLNYKAGDKYLAHACGKSSQPAVFI DSSGRSYALDPLSLPSARSQGEPLTGKLTLPAGASVDYLLIENENQQLLM ASDAGYGFICKFEDLIARNKAGKAVISLPENAKVLPPKNIENSTALLVAL TAAGRMLIFPVKDLPSLSKGKGNKIVTIPAASAKERTDLLVKLLLISENS SLVFHSGKRKITLKPEDLQKYRAERGRKGTQLPRGLTSQAEITVVEPN >MS0878 gyrB, GyrB protein MANNYSAEDITVLKDLEPVQLRPGMYTDTSRPNHLGQEVIDNSVDEALAG FANKIEVILHKDQSLEVIDNGRGMPVDIHPTEKVSGIELILSKLHAGGKF SNKNYEYSGGLHGVGISVVNALSELVEVIVKRDGQIYKIVFSNGQKIEEL QVIGTCGRRNTGTTVRFKPNPKYFDSDKFSVTRLRHLLRAKAVLCSGLEI KFTDKVNDTEESWCYQDGLSDYLIEAVQGYNALPQTPFIGDFSADSEAVS WALLWLPEGGELIAESYVNLIPTIQGGTHVNGLRQGLLDAMREFCEFRNL LPRGVKLVADDIWDRCAYVLSLKMHDPQFAGQTKERLSSRQSAVFIGGVV KDAFSLWLNQNVEIGQQLAELAINSAQRRLRASKKVVRKKLVSGPALPGK LADCSQQDLEKTELFLVEGDSAGGSAKQARDREYQAILPLRGKILNTWEV SSDQVLGSEEVHNIAIALGIDPDSDDLSQLRYGKVCILADADSDGLHIAT LLCALFLRHFPKLVQQGHVFVAMPPLYRIDLGKEVFYALDESEKEGILDR LKSKRGKPNVQRFKGLGEMNPSQLRETTMEPNTRRLVQLTYEQDETNMTE TFELMDMLLAKKRAEDRKNWLQTKGDQVDLTV >MS2249 gyrB, GyrB protein MSENTQENYGASSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDN AIDEALAGYCKDIIVTIHEDNSVSVQDDGRGIPVDIHPEEGVSAAQVIMT VLHAGGKFDDNSYKVSGGLHGVGVSVVNALSDKLQLTIRRQGHVYEQFYS LGEPNEQLKNIGETDKTGTTVRFWPSPTIFSNTVFEYEILKKRLRELSFL NSGVSIKLFDERDGANDHFHYEGGIQAFVEYLNQNKTTIHPKPFYFSIEK 
EGIGVEVALQWNDGYNENIYCFTNNIPQRDGGTHLAGFRGALTRTLNSYM DKAGLNKKGKNDKDKVETSGDDAREGLVAVISVKVPDPKFSSQTKDKLVS SEVKGAVESAMNERLQEYLEENPNDAKIIATKIVDAARAREAARKAREMT RRKGALDIAGLPGKLADCQERDPAFSELYLVEGDSAGGSAKQGRNRKNQA ILPLKGKILNVEKARFDKMLSSQEVGTLITALGCGIGRDEYNPDKMRYHK VIIMTDADVDGSHIRTLLLTFFYRQMPEIIERGYVYIAQPPLYKVKKGKQ EQYIKDEPAMTQYELAIALEDAALYVNANAPAMTGLPLEKLVADYNNTHQ MIERLHRRYPEALLKELIYYPQLTTELMKDTGATEEWTKNLIAVLTEKDT QGNSYSFRLQYDTERQVNDIILTVRTHGVDTNYTLNYQFATGNEYARIVK LGNQLKGLLEDGAYVTRGNGKLEISSFEQAIEWFVKESRKGLTVQRYKGL GEMNPEQLWETTMDPDARHMLKVTIKDAVAADQLFTTLMGDEVEPRRDFI ESNALRANLDI >MS1324 hemA, HemA protein MTILVLGINHKTASVALREKVAFSPEKRDLAFQQIAQSELAQSEVILSTC NRTEIYLHNKHISPEADQENQRWLEQCIQWFADVHQLDVDELRNCLYIKQ NQSAVNHLMRVSCGLDSLVLGEPQILGQVKQAYQYSEDYCQAQHMPMSSE FSRLFQKTFSVAKRVRTETNIGNSAVSVAYAACSLARQIFEGLKDLNILL VGAGETIELVARHLLRHGVKKLMISNRTLARAELLVEKLEHNKYIQVLSL QQLQDGLNQADIVISSTGSPIVLITAEMVKQAQQKRRNAPMLIVDIAVPR DVDERVEKLDGVYHYTVDDLHSIIQRNLSEREKASKEAETIIDAEASDFF EWMKVHQFSNLIRTYRESAEQIRQDLLEKAVQAIGQNEDPETVLQELSYK LMNKLIHSPTQAMQAMMKQGSIQGLRSFSSALGIADKKERNPQK >MS0503 hemB, HemB protein MTQQIFTGFPARRLRRLRKHDFSRRLVAENALTADDLIYPVFVIEGENRR EPVPSMPGVERLTIDQLLVEAGLLVKYGVPVIALFPVVGAERKSLMAEEA YNTDGLAQRAVRALKAAYPQLGVMTDVALDPFTTHGQDGIIDETGYVVNE ITTEVLVKQALSHAQAGADIVAPSDMMDGRIGKIREALEAEGFINTQIMA YSAKYASNFYGPFRDAVASAGNLKGGDKKTYQVDPANSDEGLQEVALDLQ EGADMVMVKPGMPYLDMVYRVKDYFGVPTFAYQVSGEYAMMMAAIQNGWL KEKECIMESLLCFKRAGADGILTYFAKQVAEWLYLEGKNK >MS0276 hemC, HemC protein MNNKNHLKIATRQSPLALWQANYVKDRLTALYPDLQVELVTMVTKGDVIL DTPLAKIGGKGLFVKELEHALLNHEADIAVHSMKDVPMEFPQGLGLSVIC KREDPRDAFVSNKYRSLAELPQGAIVGTSSLRRQCQLKSLRPDLDIRSLR GNVGTRLSKLDNGDYDAIILASAGLIRLGMAERIASFIETDISLPAAGQG AVGIECRVDDELVQSLLAPLAHQETTICVLAERAMNNRLQGGCQVPIGGF AQVKNGEVFLRALVGATDGSQIIRAEGKSAVENAEVLGVQIAEDLLQQGA DKILKSVYQD >MS0275 hemD, HemD protein MAVLVTRPAEKGIQLVDMLNKSGVAALHLPFFNITAGRELNDLPNKFNQL KPNDYVVAVSQSAVDFAAETLQNTGFHWRTDLQYFTVGQQTALHFASLSE QPVHYPFLSENSEGMLALAQMQNLKGKNVLLLRGNSGRELFPQQVLARGG 
DIDILECYQRQPIDYDNVEQTSICKRAGIQTIVVTSGELLNTLIQFVPEN EYDWLRSCQLVVVSTRIENMARKFGWTDIVVSPKADNNTLLQTILTLTS >MS0183 hemE, HemE protein MTTLKNDRYLKALLREPVDMTPVWMMRQAGRYLPEYKATRAEAGDFMSLC RNADLACEVTLQPLRRYELDAAILFSDILTIPDAMGLGLTFGAGEGPKFD RPIETKSAVENLPIPDPEQELQYVMNAVRTIRRELNGEVPLIGFSGSPWT LATYMVEGGSTKAFTKIKKMMYAEPKLLHKLLDKVADSVVLYLNAQIKAG AQAVMIFDTWGGVLGHREYLDFSLQYMHKIVNGLIRENDGRKVPVTLFTK GGGLWLDAIADTGCDAIGLDWTVNLAQAKAQVGHKVALQGNMDPSVLYAA PERIEQEVRSILADFGEGSGHVFNLGHGIHQDVPVESPKVFVDAIHQYSK PYHK >MS1346 hemH, HemH protein MKSSKTGILLANLGTPDTPSPKAISRYLKEFLSDPRVVDLPRWKWLPLLN GIILPIRSRRIAKNYGAIWTEQGSPLFAITQKQKALLTEFFQQRQQNVII EIGMTYGNPSMQYAIDNLIEQKVDKIIVLPLYPQYSSTTTAPVFDVFAQA LKRHRHIVPFEFIHSYHLDENYIEALVKSIKVRLKNDEFLLFSFHGIPKR YEQEGDFYRPQCEQTAQAVVQKLGLKKEQWRLCFQSRFGSEPWLQPYTDK FLETAAQQGITKLAVICPGFSADCLETLEEIKEENKRIFLAYGGESYHYI PALNDSPEHIACLGNLLLKRMTI >MS1265 hemK, HemK protein MIAYNIVKLILHRYVLYSHISFHIGIIMTSVTFNQELIDFIQADDITNNL HSIKDYLRWTYSNFNRSDIYYGHGQDNSWDESTQLVLSGLDLPLDLPQEL YNANLTQAEKETVINLVIKRLAKRLPVAYLTNSAWFCGLEFYVDERVIIP RSPISALIENRFQGIIVKEPKRILDMCTGSGCIAIACAEQFKEAEVDAVD LSIDALNVAEINIDRYNLSERVFPIQSDLFDNVPADKYDLIVSNPPYVDR EDLADMPEEFHYEPEMALGSGVDGLTITKQILANAANYLNDDGVLVCEVG NSMIHLIEQYPDVPFNWVELRNGGVGVFVLTKAQLIAYQTKFTD >MS1191 hemK, HemK protein MSKIQEKITMKSGIRFKQLTEDFMVIGEEITITKISPQTYQQWLAFAEDS LQDMTKQDPYANPKVDANRLLQFVTQKSKGTIIAFSDTLLTENESALLSQ YLVRRCEGEPIAYILGEQDFWSLNLEVSPDTLIPRPDTEILVEKALEFAK FRLNSPHFSGELAILDLGTGTGAIALALAAELAPISQKCGAKLRILGVDL TNGAVELAKRNALRNQLPQVEFLQSNWFEQLENRQFDIIVGNPPYIDRQD EHLALGDVRFEPLTALVAEDSGYADLRHIIERAPFHLKHQGWLILEHGWQ QGQKVRSIFNEFSQNYWQQVATMKDYGDNERITLGCWNKE >MS0910 hemL, HemL protein MTDSNTLFSRAQQVIPGGVNSPVRAFKGVGGTPVFIQKAKGAYIWDTDDK QYVDYVGSWGPMILGHNHPAILSAVIKTAENGLSFGAPTPIEIDLAELVC KLIPSMEMVRMVSSGTEATMSAIRLARGYTNRDKIIKFEGCYHGHADSLL VKAGSGALTLGQPNSPGVPADFAKHTLTCTYNDLESVKQAFEQYPQDIAC IIVEPVAGNMNCVPPQNNFLQGLRELCNQYGAVFIIDEVMTGFRVALGGA QSYYNVEPDLTCLGKVIGGGMPVGAFGGKKEIMQFIAPTGPVYQAGTLSG NPIAMAAGLACLTELQKAGNQERLAQLTEKLALGLKALADKHHVPFTVNY 
VGGMFGLFFTDKAQVTCYQDVMACDTEKFKVFFHKMLDEGVYLAPSAFEA GFMSLAHTDADIDRTLTAADKAFAALA >MS0228 hemN, HemN protein MGKHFMSDIIWDLALIQKYNQSGPRYTSYPTALEFNENYTDEDFKAAAAR YPGRPLSLYVHIPFCHKLCYFCGCNKVITRHQHKADIYLDYLEQEIKHRA ELFKTRSVTQIHWGGGTPTYLSEAQSARLMTMLRNHFSIADSAEISIEMD PREIELSMLDHLRKIGFNRISMGVQDFNKDVQKAVNREQDEEFIKALLVH ARELGFRSTNLDLIYGLPLQNVESFMFTLQKVIELNPDRLSVFNYAHLPS RFAGQAKIKDHMLPAPETKLTILQKTIETLGAAGYKFIGMDHFAKPDDEL AIAQQNGILHRNFQGYTTQEECDLLGLGVSAISLLGDTYAQNQKELKRYY ADVDATGIALHKGLVLSKEDCLRRDVIKQLICNFKLNYELIEKQYNIDFK THFTEDLALLAPLAEDGLVEITNTAIQVSARGRLLIRNICMCFDTYSRQL AKRQQFSRII >MS1746 hemN, HemN protein MTTLVLPPLSLYVHIPWCVQKCPYCDFNSHAQKGAIPEIDYIRHLVTDLQ ADLLRFKDSVQNRPLHSIFIGGGTPSLFSAESITCLLTEIKKLIDFSPHI EITMEANPGTVEAERFQGYVKAGVTRISMGIQSFNDEKLKALGRIHNARE AKSAVEIAKISGLKSFNLDLMHGLPNQTPEQALDDLRQAIALNPPHLSWY QLTIEPNTMFAYRPPVLPDDDELWDIFERGHRLLTEAGYRQYETSAYAKP DFQCQHNLNYWRFGDYLAVGCGAHGKMSFVNGDIIRYSKTKHPKGYLRGE YLYEERLINEADRPFEFFMNRFRLLEPVPKTEFVQFTGLPESAVKKQIDW ALEKNYIIETESTWQITERGKLFLNELLEVFLPDD >MS0274 hemX, HemX protein MTDKQTENVETVEIVEDKATDAIKADSSEKNSKNNRTPEPVVVKKGGSAL GLLAILIALGVGGAGYYFGQQQVQQIQQKLTALQSQQPGATAESPDFEQT KEHILKLVNENNQTNADKIAVLQREITAKDQALLSLQSQINAVSNSVKAE QPNDWLLSEADFLLTNALRKLVVDHDVDTSISLIKLADEALEKVATPQAS AIRSALNNDLKQLLALNNVDQNAIMQRLSQLANSLDELTVLNIDFDDQEN SAAVTDSVGDWQNNLKKSAVSFLNHFVRVTPTNSSKKELLAPNQDIYLRE NIRLRLQIAILAVPRQQNELYKQSLEAVSSWIRSYFDTNSDLAKNFLKNI DELSEQSIYVDVPERLESLNALDQLLNKQVHEVKKVELSVDKGLTENNET GATPETEQNVPATSEPVPVEPQQ >MS0273 hemY, HemY protein MFKVLFLMLALLAGLVAGPYLSGKQGYVLIATGNYNIEMSITTLIILFIA AMAVVYILEWAISHFFKWSNATYTFFSKRKRNKAQRQTLEGLMRMTEGDY SKAEKLIGKNAKHSAEPILNFIKAAEAAQQRGDEFSANRYLIEATELAGS DNLLVELARTKILLQQNKLPAARSSVDSVLEMAPRNPEVLKLATEIYLRS KAYSTLDNILSTVESSGLYTQKEFIDLQRKTEDGLLDEIMNEEGADGLLA WWEDQSRKRRNDFYVKLALISRLIDCNDHDSAYEISLDAFKKVDENKDLA GKFFNLMTKLQATDNSKLVKLLEKRMSSTSAEHQCCLNRALGYLYVRNND FNKASECFKQVVSGQDKLDPSDATMAAYVFEQVKDTESAKRVREEGLKAA MAVNRLSTEEVIEKPALALEQKADNKESNKIANWANR >MS0195 hepA, HepA protein 
MSFAVGQRWISESENDLGLGVIVGMDNRTVTILFPASDEQRVYALAAAPL TRVEFQKGDTVVHHEGWKAQIIDVTENNGVLIYLTIRLDTQEEAVLREMD LAHKISFSKPQERLFGAQIDRSDRFTLRYHALQQQQAQFQSPLRGLRGIR AGLIPHQLHIANEVGRRVNPRVLLADEVGLGKTIEAGMILQQQLFAGKVE RVLIIVPETLQHQWLVEMLRRFNLHFSLFDEERAADFAANEYDEERNPFE SENLIICSLDWIVAQPKRAQQILQAEFDMLIVDEAHHLVWSERQPSMAYQ VVEQLSRRIPAILLLTATPEQLGQESHFARLALLDPDRFYNYDAFVAEQK NYQPVAEAVQTLLNEKPLNTAEQNAIADLLEEQDVEPLFKVINSMAEESE RLQARQELIDNLVDRHGTSRILFRNTRQGVKGFPHRIYNQVTVEMPKQYV NAVKVMNLLGEEIGDGLFYPEQIFQKMNPEAKWWEFDPRLEWLITFLKNH REEKVLVICRHANTAIQLEQALREKEAIRSAVFHENMSIVERDRASAYFA LQEEGAQVLLSSSIGSEGRNFQFACHLVLFNLPDNPDLLEQCIGRLDRIG QTRDIRIHTPCFADTPQVVLARWYHEGLNAFEETCPMGMTIFTECGEKLK NFVKNPTQLDGFEEFVAQTRKRQQVLKQELENGRDRLLELNSNGGERAQK LAEHIADEDNSTALVNFVLNLFDVIGIEQEDLGEKSIAIIPASTMLVPDF PGLKEEGVTVTFDRRLSLAREELEFLTWDHPIVTNGIDLITSGDIGKTAV SLLINKSLPPGTLLLELIYVVESQSPKGLQLTRFLPPTPVRLLLDAKGNN LAAQVSFQALEKQLRPVKRNMANKMAKMIRPNIERLIAGGDKHIAEQARE IIQSAKQKADQTLSAELDRLNALKAVNKNIRQDEIDILAQIREQSLTQLD QANWRLDSLRVIVSNKE >MS1978 hfi, Hfi protein MKTEENKMPKFAANLTMMFNEVPFLDRFEAAAKAGFKYIEFLWPYDYPAE QIKAKLDQYGLKQILFNSLPGDIAAGEWGVSAIPGREAESHQHIDLALEY ALVLQCPTVLIMGGVVPPGQSRAKYKQTFIDNLRYASEKFKPHNINIVLE ALSPQVKENYLMKTQDDALEIIELVNRDNVFLQLDYYHAQNVEGNLARLT DRVAPVMKHIQIASVPDRHEPDEGEINYQYIFDKLDAIGYDGYVGCEYKP RTETTAGLAWFEKYK >MS0964 hflB, HflB protein MNDMVKNIILWVVVAVVMMTAYQGFSSSANGSAVDYTTFVTDVGNNQIAQ ARFEDTEILVTKTDGSKYSTVMPIYDDKILNDLLNKKVKVEGTMPEKRGL LSQILISWFPMLFLIGVWLFFMRQMQGGGGKAMSFGKSRAKMLTKEQIKT TFADVAGCDEAKEEVGEIVDFLRDPGKFQKLGGKIPKGILMVGPPGTGKT LIAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKNAPCLIF IDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFEGNEGVIVIAATNR PDVLDPALTRPGRFDRQVVVGLPDVRGREQILKVHMRKVPIGADVDAMTL ARGTPGYSGADLANLVNEAALFAARTNKRVVTMLEFEKAKDKINMGPERR TMIMTDKQKESTAYHEAGHAIVGYLVPEHDPVHKVTIIPRGRALGVTFFL PEGDQISVSQKQLESKLSTLYAGRLAEDLIYGEENISTGASNDIKVATNI ARNMVTQWGFSDKLGPILYTEDDGEVFLGRSMAKAKHMSDETAHIIDEEV REIVARNYDRARQLLIDNMDILHAMKDALVKYETIEEIQIKQLMNREAVT PPAGWEDHSSADNSSSNTAETEPSENETEENSVN >MS0834 hflC, HflC protein 
MDIMEGFPITVIVFIVLILFVVSSALKTVPQGYNWTIERFGRYIKTLSPG LNFIVPFIDRVGRKINMMEQVLDIPSQEVISKDNANVSIDAVCFVQVIDA RSAAYEVNHLEQAIVNLVMTNIRTVLGSMELDEMLSQRDNINGRLLSIVD EATNPWGVKVTRIEIRDVRPPRELSEAMNAQMKAERNKRAEILEAEGVRQ AQILRAEGEKQSRILRAEGEKQEAILQAEARERAAQAEAKATQMVSDAIV NGDTKAINYFIAQKYTEALKDIGGSNNSKVVLMPLEAGNLIGSVAGIAEL LKDVKK >MS1619 hflC, HflC protein MSLNDQDPWAKPGQNDPKQPENPSNKPDNKSGWSDRQDNKEQSPPDIEEI FGNLLKKLGGNGGQSNNGNNTNLPKNLNKLAPAAIALAVVLWGLSGLYTV KEAERGVVTRFGQLHSIVQPGLNWKPNFIDEVIPVNVEQVKELRTQGAML TQDENMVKVEMTVQYRVQDPAKYLFSVTNADDSLNQATDSALRYVIGHMT MDDILTTGRAVVREQTWKTLNNVIKPYDMGVEVIDVNFQSARPPEEVKDA FDDAIKAQEDEQRYIREAEAYAREQEPIARGDAQRIVEGATAYKDKVVLN AKGEVERLQRLLPEFKASPDLLRERLYIQSMEQIMSKTPKIMLDGNGNNL NVLPVDQILRNKNTQPAAEPSANQGTLSSNELKSAVKNGQNSHGQTQTDD RSISRQGRFN >MS1620 hflC, HflC protein MRKFLLPVLVILAAILYSSIVIVNEGTRGIMLRFGKVQRDSDNKVVVYTP GLHFKIPFIDNLKPLDARIRTLDGQADRFVTVEKKDLLVDSYVKWKISDF GRFYTSTGGGDYNQASNLLRRKVNDRLRSEIGTRTIKDIVSGTRGELMDG ARKALNTGQDSTAELGIEVVDVRVKQINLPDEVSSSIYQRMRAERDAVAR QHRSQGKEKAAFIQADVDRKVTLILANANKTAEELRGEGDATAAKLYTEA FSGEPQFYSFVRSLKAYENSFAGSDNMMILKPDSDFFRFMQPPKK >MS1519 hflX, HflX protein MNNDVNISKSAVNFTALSSISAPRSDQSDNAIVVHVFFSQDKNPEDLDEF QQLAQSANVNILQVITAARSTPQAKYFVGQGKAEEIAQAVETHNADVVLV NHSLTPAQARNLESLCQCRVVDRTGLILDIFAQRARSHEGKLQVELAQLK HLATRLVRRKTGLDQQKGAVGLRGPGETQLETDRRLIKVRIAQLQNRLAK VEKQRNQNRQTRQKADIPTISLVGYTNAGKSTLFNRITQANVYAADQLFA TLDPTLRRLQIQDVGTTILADTVGFIRDLPHDLVSAFKSTLQETTEAGLL LHIIDAADPRKLENIEAVNAVLEEIKAADLPTLLVYNKIDTLENLEPHIE YDDQHIPVAVYLSAISAEGIDLLFAAIREKLKNEILHLQLNLSPNEGKIR HQLYLLDCIRREEISDQGEFLLEIQIDKIQWLKLAKKFPQLEKCGKNL >MS1518 hfq, Hfq protein MAKGQSLQDPYLNALRRERIPVSIYLVNGIKLQGQIESFDQFVILLKNTV NQMVYKHAISTVVPARSVSHHNNPQQQQQHSQQTESAAPAAEPQAE >MS1475 himA, HimA protein MTKSELIESLVEKNHSISVKSVENAVKEILEHMSQALESGDRIEIRGFGS FSLHFRQPRVGRNPKTGAQVKLDAKCVPHFKAGKELRERVDFNA >MS1089 himA, HimA protein MTLTKVELADNLIEKHGLNKSEAKALVEDFFEEIRVALEKGNDVKLSGFG NFELREKASRPGRNPKTGESVPVSARRVVVFKPGQKLRARVEKTKPKS >MS0185 himA, HimA protein 
MNKTDLIDAIASAAELNKKQAKAALEATLDAITASLKAGDSVQLIGFGTF KVSERKARTGRNPQTGAEIQIAASKVPAFVSGKALKDAVNG >MS0112 hipB, HipB protein MNLSSLFSVRLKNERNRLGLTQAEIAKKCGVSREMWGKYERGVALAGSEV LFSLAAIGVDMDYILLGTRKEVFEEITTEALKDMPKADFSDKTGLLVQLF MQCDDNGRAAILSVAQTMAGMANKTGHQNSDSTGGQSFAGDVHGGQFSTG TINNYGEKK >MS1883 hisA, HisA protein MKKSIIIPALDLIDGNVVRLHQGDYAKQTTYSDNPIEQFASYLAQGAEQL HLVDLTGAKDPAKRQTALIGKIIAATHCKIQVGGGIRTEKDVADLLAVGA NRVVIGSTAVKERAMVKEWFNKYGAEKFVLALDVNIDASGQKIIAISGWQ EASGVSLEELIEDFQSVGLQHVLCTDISRDGTLAGSNVDLYKEICAKYPA VNFQSSGGIGSLEDIKALKGTGVAGVIVGRALLEGKFNVAEAIECWQNG >MS1890 hisB, HisB protein MTQQPTLFIDRDGTLIDEPKTDFQIDSLEKLKFERNVIPALLKLKNRYRF VMVSNQDGLGTDSFPQEDFDKPHNAMLAVFRSQGIEFDDILICPHKPEDN CDCRKPKIKLLKKYIDKKLFDPADSFVIGDRPTDVQLAENLGIRALQYHP ENLDWDMIAEKLLREPVADPKGLGQPRHAVVARKTKETDIKVEVWLDEAG VNQINTGIGFFDHMLDQIATHGGFRMNVSCKGDLHIDDHHTIEDVALALG AALKEAIGNKRGIQRFGFVLPMDECKAECALDLSGRPYFKFKAKFNRDKV GDFSTEMTEHFFQSIAYTLLATLHLSVKGDNAHHQIEALFKAFGRTLRQA IKIEGNEMPSSKGVL >MS0435 hisB, HisB protein MNKERMVKKAIFLDRDGTINIDHGYVHKIDDFHFIEGSIEALEELKNMGY LLVLVTNQSGIARGYFSEDEFLQLTEWMDWSLADRNVDLDGIYYCPHHPE GLGEYRQDCDCRKPKPGMLLQAIEELNIDPAQSFMVGDKVEDLKAAVSAN VKYKVLVKTGKTVTQAGEQLADYVLDSIADLPRIIKRLKK >MS1891 hisC, HisC protein MSISQLSRKNVQALTPYQSARRLGGNGDVWLNANEYPTSPDFNLSERIFN RYPEPQPEAVIKGYAAYADVKPENVIVTRGGDESIELLIKGFCEPEDKVL YCPPTYGMYAVSAETLGIATKTVPLTEDFQLDLPEIEKNLAGVKVIFVCS PNNPTGNVLNQADLIRLLDITAGSAIVVVDEAYIEFSPETSMIKQLGNYP HLAIIRTLSKAFALAGLRCGFTLANPELIGVLQKVIAPYPLPVPVSDIAA QALQPQGVAQMKMRVADVLANRAWLIGELKQIPSVVKIFATEANYVLVKF QDGEKVFNALWEKGIILRDQHKAFGLKNCIRISIGTRAELEKTVVALKLA >MS1574 hisC, HisC protein MVVRPNFKTTIPNNHKSAVENMTFLQQANTGVQALSPYQAGKPIEELERE LGISNIIKLASNENPFGFPESAKKAIQNQLDNLTRYPDSNGFSLKAAIAE KFNLQPEQITLGNGSNDLIELIAHTFATEGDEIIFSQYAFIVYPLITKAI NAKAREIPAKNWGHDLEAFLAAINEKTKLIFIANPNNPTGNFLTEAEIDS FLAKVPPHIVVALDEAYTEFTAKEERVNSLALLKKYPNLVVSRSLSKAYG LAGLRIGFAVSNPEIAGLFNRVRQPFNVNSLALAAAEAVLNDDDFVEKAA ENNRRELKRYEEFCQKYGLQYIPSKGNFITIDFQQPAAPVYDALLHEGVI VRPIAGYGMPNHLRISIGLPEENQRLFDALIKILNLK >MS1892 
hisD, HisD protein MQTLIWKDLTEQEKKQALTRPAISAAGNIKDAVDAIRENVVANGDKALFE LSEKFDRVKLNSLEVSEQQIEEAAQRLPEELKQAIQNAKKNIEAFHLAQV PVEADVETQSGVRCQVLTRPINRVGLYIPGGSAPLFSTVLMLAIPAKIAG CKKIVLCSPPPIADAILYAANLCGVETIYQVGGAQAVVAMAFGTETVAKV DKIFGPGNAFVTEAKRQVSQAVNGAAIDMQAGPSEVLVLADENADPDFVA SDLLSQAEHGADSQVILVTPSERLALETELAVERQLTTLPRSEIAQKALA HSRIFIAENLQQCVEISNEYAPEHLVVQVQNARDLLSNIDNAGSIFLGAY SPESMGDYASGTNHVLPTYGYTRTSSSLGLADFSKRMTVQELSPQGFKDL AKTVEVMAAAERLDAHKQAVSIRLAKIK >MS1882 hisF, HisF protein MLAKRIIPCLDVRNGQVVKGVQFRNHEIIGDIVPLAARYAEEGADELVFY DITASSDGRTVDKSWVERVAEVIDIPFCVAGGIKTIADAEQIFTFGADKI SINSPALADPDLISRLADRFGVQAIVVGIDSWFEQETGKYWVNQYTGDES RTRQTNWQLLDWVKEVQKRGAGEIVLNMMNQDGVRNGYDLTQLKLVRDVC KVPLIASGGAGEMVHFRDAFIEANVDGALAASVFHKQIINIGELKEYLAR EGVEVRR >MS1893 hisG, HisG protein MSTNKRLRIAMQKKGRLSDESQELLKQCGVKINLQGQKLIAYAENLPIDI LRVRDDDIPGLVFDGVVDLGIIGENVLEEEELTRTAAGDKVEYKMLRRLE FGGCRLSLAVDSDVEFDGPESLSDCRIATSYPQLLKRYMAEQGVPFKSIL LNGSVEVAPRAGLADAICDLVSSGATLEANGLKEVEVIYRSKACLIQRKE PLSEEKQALVDKILTRIQGVQQADESKYIMLHAPKDKLEEITALLPGVEN PTILPLAHDDTKVAVHVVSQENLFWETMEQLKEKGASSVLVLPIEKMLA >MS1885 hisH, HisH protein MIIIDTGCANLSSVKFAFDRLNIKAEISRDIATIKSADKLLLPGVGTAMA AMKILQDRNLIETIQNATQPMLGICLGMQLMTEYSSEGNVPTLSLMSGHT DLIPNTGLPLPHMGWNKVRYEQDHPLFAGIEQDSHFYFVHSYAVLPNEHT IATSDYGVPFSAALGCKNFYGVQFHPERSGKNGAQLLKNFVENL >MS1881 hisI, HisI protein MQNKINWQKVDNLLPVIIQHFQTCEVLMLGYMNQEALAKTCDEKVVTFFS RTKQRLWTKGETSGNFLNAVDMSLDCDNDTLLILADPIGPTCHTGEESCF HQFATQSEGDWTWFAKLERVLAERKFADPESSYTATLYAKGTKKIAQKVG EEGVETALAALSKDKGEIVSETADLIYHLTVMLHEQNLEWGDVIDKLKER HQGIGLHPEGSNK >MS1920 hisS, HisS protein MAKTIQAIRGMNDCLPTETPVWQWVESKVRSTLASYGYSEIRMPIVENTP LFARAIGEVTDVVSKEMFTFNDRDNESLTLRPEGTAGCVRAGIEHGLLYN QEQRLWYMGPMFRYERPQKGRYRQFHQAGVEVFGIANAEIDAELIILTAR LWKELGIEQHVTLQLNSIGSLEARANYRSALVEFLQQYVGLMNEEEKERL LKNPLRILDTKNEALQQALNNAPKLLDYLDDDSREHFARLCAILDNVGIS YEINPKLVRGLDYYNKTVFEWVTTALGAQGTICGGGRYDGLVEQLGGHAT CGVGFAMGLERLILLVQEVNKAIPLPQSAVDIYLIFAGENTASAAFRLAE KVRSELPHLRTMMHCSGGNFKKQFKRADKSGAKIALVLGESEVQNQQVVV 
KDLLGGAEQQTVALEAVIDHLKTSFKE >MS0399 hit, Hit protein MWIYSFGLRDKLLFKLISDKKCGRFFPKIRKHKMAEETIFSKIIRKEIPA DIIYQDDLVTAFRDIAPQAKTHILIIPNKLIPTVNDVTAEDEAVLGRLFI TAAKIAKLEGIAEDGYRLIVNCNKHGGQEVFHIHMHLLGGEKLGPLNAK >MS1923 hlpA, HlpA protein MKKVVKATALSLALALTSSMAMAAENIAFINAGYLFQHHPDREAVAKKLD SEFKTQADKLAANKKSIDAKIASLQKDAQNPKNRPSELKKREDEINKLMK DHDEEVRKFQVENDKRQNEERAKLLEGIQVATNNIAKDKGYTYVLDANSV VFAADGKDITEDVLKAIGGKTETKPAEATK >MS1168 hlyC, HlyC protein MKIETFDVIAPSIFSDEPLDESQLFGAFATIWLRSEYHSKAPLYRFAERI LPVLRNKQFALFIKDNEPVAYFSWAYFTQEAEEAYLHNDDVLLEAGNWCA GNNLWIIDWFAPANLTKEIKPLIERHLFPNEILTALYHKSAVNHVQKRYF KGCAVSRERFRDFIRNHS >MS0292 hmp, Hmp protein MSYRAYVIRPNRKGIKQMAKTNKNPLCINELQVYSIVQEAPKVKTINFIA QDFYPYEAGQYALVSIRNTPHITRAYSLSSTPGESRFVSITVREIDGGVG SGWLNNEVKVGDQVWFSNPMGDFSCQKVIADNYLLVGAGSGVTPIMSMTR WLLKNRPQANVSVIHSVHSPQDVIFKSEWAQLKADNPRLNLVFNASVNAT AGFESGRISKEILTKAVPNLTDYTVMTCGPQAYMDALKTMVLELGGSEQR FFTEAFFNTALAGDISSDKKTTLTVSGAKPMKTEVPVGMTLLAALEAQEQ PVVSGCRTGLCGLCKTKVTGGEYEIVSNGDLTQEEIAQGYVLACSCRVKA DMTVEI >MS0680 hmp, Hmp protein MKNIKFLLWGVLIGISALWFLADDLIPEPFTYFSFRFVVNQYTGILSISL MSIAMLLATRPRWLENYLNGLDKGYRLHKWLGISALITALTHFWFTHGTK WMVGWGWLERPLRQRQRLGQNAGAGLEQWLGGMRGIAESIGEWAFYLALI LMIVSLVKKIPYRWFVKFHKWLAAAYLALVFHSVVLIKFEYWHQPIGWVT AVLLTVGAVSALLILFNLAGKKIRYQGTIRSARPLQKIDGLDLTINVPTW QGHKAGQFAFVHALNDTEKPHPFSFASAWDPASRDIRFCIKALGDYTDTL AQRWKANDKLLIEGPYGRFTFADDAQQQIWIATGIGITPFMARLEELAQS THKQTVDLFYSYRESDPVLIAELQQKSAEAGINLHLRCSAEQSRLTSADI INTVKDLTKTSFWYCGISAFGDTLCKDLCRQGLPASRFHQELFEMR >MS1322 hns, Hns protein MNEVIKTLNNLRRLRSMAKELSIEQLENIIEKFQLVIEEKKAEELEIKRL EEERKNRLEKYRELLKEDGITADELAQILAGKNNTAKAKRAPLSAKYKYI NENGEQKTWTGQGRMPKAIQLQLNAGKSLSDFAI >MS0361 hofF, HofF protein MLNQLATLLDAQIPLKESLHILIQNCSSIPLNQWLRNLLSQLERGFAFSK SIELQGLYLSAQELQLIKVGEMSGKLSYVCSQIAHFRQQQLALQRKIQKI LLYPLVVLVISATLTLLLLIFIVPQFAEMYQDNNQDLPFITKFLLVLSHS LTHYIWYIIGVATLTFIFIKKQWRHSIWLYKCAQQLMALMPLISTIKQQA RLINFCRSLQLMLNAGIPLQQGLQAFLPQIKTWQNTGALPGDLILVEEVQ AILHWIKQGYGFSNSVGSRLFPQQAQQILQVGESSGQLSNILQKIADDYQ 
QQLDHKIDLLSQLLEPFLMLLIGIIIGVIMLGMYLPIFNMGNIM >MS0364 hofG, HofG protein MKNYRTLSQYPAIRKGFTLIELMIVIAIIAILATIAIPSYQNYTKRAAIS ELLQAGAPYKADVELCIYEKGGEKDCSSGANGIAEPAKTKGKYVDAVTVS SGTISVTGKGSLSGVSYSTKATGNASEGISWTTTCTPKDIFPAGFCDNKE >MS0724 hofG, HofG protein MRKGFSLIEFLTVLLLISISGSLTLSGWQSLGESQMLQQEQQRLLLFIKN IQARVENSNQVWHLVANRSFDQKNWCFTAQIKHDLFICDCFYPVLCPKEL LPHFYYPLFPDTVKFVGKKYYPAITAKFGGVRRTTENNCFSLISSNKQSV LSFSKMGNVSIKKPGSSSSCFNTAEE >MS0332 holA, HolA protein MNRIFAEQLSPSLAGRLAKVYLLVGQDPLLLSESQDNIIQAATKSGFDEK LEIQIDNGTNWNDLFERCQSMGLFFSKQVITLHFPENPTALLSKNLAELI SLLNSDLLLILHFGKLTKLMEKQDWFIQSEQYDRNAVLVNCQTPTAEQLP RWVANRCKAMGLIAEQDAVQLLCYSYENNLLALKQTLQLLDLLHADHKLT FVRVKNIVEQSSVFTPFQWIDALLEGKEARARRILTGLQAEDIQPIILLR SLQRELTILLQLAKPQHKTASVDSALPVAQLREGFDRLKIWQNRRPLFTQ AFQRLTYRKLYLAVQQLAELERLAKQEFSADIWDQLANIIPKICR >MS0570 holB, HolB protein MGKFNFVMTAIIYPWLQSYYERITAAFQQGYGHHALLFRAEQGIGADQLI HAVANWLMCQHSSPRPCGECHSCRLFAAGNHPDVYQLAPVENKDIGVDQV REINEKVSQRAQQNGNKAVYVQSAERLTESAANALLKTLEEPRPNTYFLL NADLSSPLMTTIYSRCQVWLINTPSEQQALNWLQLHNYSEISEIQTALRI SYGRPLLALHCLEQGWLEKRREFFRAFWLFYTRRSPLELLPLFDKELILQ QVDWLLAFLSDALKDKLNITSGWICRDLIRGIQQFNERQTVAGLLTATKI MQKVRSDLVQINAVNQELILLDGLTRLITEVFEH >MS1557 holC, HolC protein MPKQAQFYLIEKTQADNALSATEALACNLAADAWRLGKKVLIACETEEQA LNLDEALWQRDAEQFVPHNLSGEITNYATPIEISWQGKRNAQRRDLLISL QNNVPDYAQSFNHVIDFVPAEEERKAVARERYKLYRQLGFEMVMEKA >MS0683 hpt, Hpt protein MKKHHVDILISEQEVKARIQQLGAEITAYYRQQQVEKLIVVGLLRGSFMF MADLVREIKLPVEIEFMTTASYGSGMTTNHDVKITKDLDGDIKNQHVLIV EDIIDTGYTLEKVREILNLRTPASLKICTLLDKPSRREVEVPVDWIGFRI PDEFVVGYGIDYAQHHRNLGYIGKVVLEE >MS1437 hrpA, HrpA protein MMKMKTPKREFNALQKSLAEQIEDVMIVEQSRLLARIRGLGQIKKEQSQQ AAALDIEQQIQQAKLRLELRKSAVKNPIVFPENLPVSQRKTEIQKLIAQN QVVIVAGETGSGKTTQLPKMCLELGFGQKGLIGHTQPRRIAARSVAARIA EEMQTELGGIVGYKVRFNDQIGEDTQIKLMTDGILLAEIQTDRFLNRYDC LIIDEAHERSLNNDFILGYLKQLLPRRPDLKVIITSATIDVERFSKHFNN APIIEVSGRTYPVEVRYRPVAETEEQDQLQGILNAVDELQAEGRGDILIF LSGEREIRDTAEALEKQNLRHTEILPLYARLSAQEQNKIFHPGGLNRIVL ATNVAETSLTVPGIKYVIDPGTARISRYSYRTKVQRLPIEPISQASANQR 
KGRCGRVSEGVCIRLYSEQDFNNRPEFTDPEILRTNLASVILQMTALGLD DIEAFPFVDAPDKRHIQDGIKLLEELGAFEWQKSPPSAFGTSPRKRGEGN LASNSSLPPFTGGAGHSPEGGKRVLTQTGRQLAQLPVDPRLAKMLLSAVD LGCVLEVMIIVAALSIQDPRERPQEKQQSADDKHRRFADKKSDFLAFLNL WNYIQEQQKVLSKNQFRRLCQKDYLNYLRVREWQDIYHQIRLTVREMGLP INSEPAQYPQIHSALLSGLLSHIGMKEAEKQQYLGARNAHFAIFPNSVLF KKQPKWVMAAELVETSKLWGRMVAEIDPEWVEPLAKHLIKSSYSEPRWSK SRGQVIANEKVSLYGVPIVASRPVNYGAIDPQTSREIFIQSALVEGDWHT RHKFFFENQKLIREVEDLEHKSRRRDILVDDRTLFEFYDSRIGADVVSQK HFDSWWKKAAQQDPELLNFEKSFLMKEDAQKVSQLDFPNFWHQGNLKLKL TYQFEPGTDADGVTVHIPLPLLNQIEMQGFDWQIPGLRHELIVSLIKALP KSLRRNFVPAPNYAEAFLARVANFDKPLTETLSYELRRMTGVNVEVEEWK LEQIPPHLRMTFRVIDEKGKKIAESMNLDELKFGLKDQVQQSISAVADDG IEQSGIHIWNFDSLPQCYEQKKQGFTVKAFPAITDEKEAVGIKLFETEYE QSVAMQQGLRRLILLNVPSPIKYLHEKLPNKSKLGLYFTPFGKVLDLIDD CIACAVDKLIADFGGFVWNERDFERLRDFVRENLNEITVDIAQQVERLLT LTFEINKRLKGKMDFTMAFALSDIKSQLAGLIYPGFVEKTGYARLPDIQR YLQAIDKRMDKLAQDINRDRAAMLRVEQCQQAYQQLLAKLPKSKPLSTEV LEIRYMIEELRVSLFAQQLGTKYPVSEKRVLGVITEI >MS2165 hsdM, HsdM protein MTTDNLHTKQSTISSVIWSMANMLRGTYRPPQYRRVMLPLIVLARFDAIL APYTDAMKAKADELQAMGGKAPEGALYEMALTKAADPNRKQPLYNTSGYN LQRLLADQDHIAANLVKYLQGFSAKAKDIFDKFEFENEIEKLDSSNRLYA VVSQFQKDLKENGIDLSPQSISNLQMGYIFEELVRKFNEQANEEAGDHFT PREVINLMVNLIFEEDQQRLSQPHAIASIYDPTAGTGGMLSESEKHLKSY NDSIKLQLFGQEYNAESYAICCADLLIKDEPISNLVFGDTLGVKNSKNTG TGFVPHDGHQTKKFDYMFSNPPFGVEWKNEQDFINDEAKSGFAGRFGAGL PRINDGSLLFLQHMISKMKPVEEGGSRIAVVFNGSPLFTGDAGSGESNIR RWIIENDWLEAIIALPDQLFYNTGIYTYVWIVSNKKSDRRKGKVQLIDGT QHYQKMAKSLGDKRNELSPAQIADLTRLYADFKDGASGRISTKFCSKIFN NQDFGYLKLTVERPLRLNFQAGQERIEKVKTQTAFINLAVSKKRKDEAQI KAEEAEGQRQQQAILAALSTIGDGLYQNRTAFLKLLDKALKGLDFKLGAP LKKAIIEALSERDQSADICLDSKGNPEADSQLRDTELVPLPKEITLPLPV DYGEGKTDELVKQVKAHCEAYLQAEVLPHVDHAWIDYSKTKVGYEIPINR HFYQYQPPRALDEIKAEISELEAEIMAMLGNV >MS2171 hsdR, HsdR protein MITEKDFENEIERFLLAEGGYVQGKNSEYNKETALFEEDVLSFIQTTQPK RWERLAQGQKANVKAVLIKALCQELEAKGALDVLRHGFRCYGKTFQTAYF APNTSINEETQQRYDANILKITRQVVTEDGDRPDIVLSLNGIPVATAELK NVLSATHWTVEDAIYQYRKERNPKGKLFTFKKRTLVHFAVDTEEVYMTTK 
LDGEQTYFLPFNRGYNKGRGNPPIAGNVKTAYLWEQILTRHSFLEIIARF LHLSVEEKKVRTDSGLRLLQKETMIFPRFHQLDAVRQLIAHSREHGAGRN YLIQHSAGSGKSNTIAWLAHQLSSLHNRDDQKIFNSVIVVTDRVVLDRQL QATISQFEHKDGVVQKIEHNSQQLAVAIASDTPIIITTIQKFPFVMAALA RKQESGINVAISTEGKQFAVIVDEAHSSQSGEAAMELRKVLNKDGIEAAV MAEFLDDDDDETGLSDEAKKQLFIEAAKRQRQPNLSFFAFTATPKWKTKA LFDEPGADGNTPFHHYTMKQAIEEGFILDVLENYATWKQYFKLLKISEND KELSKSKAKKEMMRFVNLHPSVIAQKVEIIVEHFRTTTMHKIGGRAKAMV VTNGREHAVRYKLAFDEYIKEKGYTGIKSLVAFSGGITLKEAPEKEYTEA LMNGIREVDLPEQFASEHYQVLLVAEKYQTGFDQPLLHTMFVDKKLSGIQ AVQTLSRLNRCAKGKTDTFVLDFVNQPEDIYKAFKPFYEVTELGDIPSNE KLDELAATLDQWKIYFQPEIRQFAEIWFSAKQQPTGSEHKQLNSILDKAV ARFLAVGDEIQQGVDDLSELKNEQQNLFKSQLKSYLSLYQFVSQIMDYSD DLHEQRYVYLRALQSKLPNNSDRNKVDLSKDVVLHFYKLQKRSEGKIHLD EGGADPLKGATDVGSGRADATDELSNMVQEINGMYGTQFTIADQLFFEQI IEDALADNEIVGAAKNNSLESFTAYFADKLLDLLFQRMQGNEEISNQVMS DESLRNRVVKRLAKQIYQRK >MS2170 hsdS, HsdS protein MQKYDKYKPSGVEWLGDVPEGWEVTKIKYIAELTPKKSELTELDKECSFV PMEKLKLGNLVLDETRTISDVYNGYTYFEDNDLLIAKVTPCFENKNFVIA EKLVNGIGFGSSEIYVLRVKNCLNRYLFYRLQENTFMDLAIGSMTGAGGL KRIPSEFLNNYSIALPPLEEQTAIAHYLDQKTAYIDRLIDRQQTLLEKLS EKRTALITEAVCGRLPIAPYSASLKRGTGFDEENGSPNTAQTAPLFSKEG LGEICLKDSGIQWLGKVPEGWEVIRLRFLCNIQTGNMDTQDNEPDGIYPF YVRSPIIERSNNYTFEDDEAVLMAGDGVGAGKVFHYVQGKYGCHQRVYSL NQFQNITGRFLFYYLREFFSRKIEEGGAKSTVDSVRLPMLKDFPTCVPPL SEQTTITHYLDQETAKIDRLRTQIETVIERLKEYRMALITQVVTGKVKV >MS0272 hslU, HslU protein MSMTPREIVSELDAHIIGQNEAKRAVAIALRNRWRRMQLPEDLRQEITPK NILMIGPTGVGKTEIARRLAKLANAPFIKVEATKFTEVGYVGKEVDSIIR DLADISMKLVRQQAVEKNRMKAQDAAEDRILDVLLPPAKDQWGNVQETGN ASTRQVFRKKLREGQLDEREIEIDISTPVNVEIMTPPGMEDMTSQLQSLF EGMSPNKTKKRKMKIKDALKVMLDEEAAKLVNPEELKQKAIEAVEQHGIV FIDEIDKICKKGEHSGGDVSREGVQRDLLPIIEGSTVNTKHGMVKTDHIL FICSGAFQVARPSDLLPELQGRLPIRVELKSLTKEDFERILTEPNASLTL QYRELMKTEGVDIEFTQDGISKIAESAFRVNEKTENIGARRLHTVLERLM DGISFDASERAGEKVVIDEKYVSDALNDVVENEDLSRFIL >MS0271 hslV, HslV protein MTTIVCVRKNGKVAIGGDGQATLGNCVEKGTVRKVRRLYKDKVVTGFAGS TADAFILRDLFEKKLELHQGHLVKAAVELAKEWRTERSLRRLEAMMIVAN ESEFLLVSGSGDVIEPEFDVLAIGSGGNFAKSAALALLRTNNELSAAEIV 
KQALIIAGDIDIYTNHNHVIEEV >MS1696 htpG, HtpG protein MSNKETCGFQTEVKQLLQLMIHSLYSNKEIFLRELISNASDAADKLRFKA LSAPELYEGDGDLKVRISFDDKKGTLTVSDNGIGMTREQAVDHLGTIAKS GTKEFLSALGNDQAKDSQLIGQFGVGFYSAFIVADKVEVRSRAAGVAADK GVLWASAGEGEYSVENIEKKDRGTEITLFLREDEKEFLNEWRLREIIGKY SDHIGLPVEILTKEYDEEGKESGVKWEKINKAQALWTRSKAEISDDEYKE FYKHISHDFADPLSWMHNKVEGNQEYTSLLYVPGKAPWDLFNREQKHGLK LYVQRVFIMDDAEVFMPNYLRFMRGLLDSNDLPLNVSREILQDNKTTAAL RKALTKRSLQMLEKLAKDEPEKYAVFWKEFGLVLKEGVAEDFANKEQIAK LYRFASTHTDSSEQNVSFEDYISRMKEGQKAVYYITADSYVAAKNSPHLE LFNKKGIEVLLLSDRIDEWMLSYLTEFDGKPLQSVTKADLDLGDLADKEE ENQKEQDEKFDSFIQRVKSLLGERVKDVRITHRLTDTPAVVSTDNDQMTT QMAKLFAMSGQPVPEVKYTFEINPQHELVKKAAKVTDETEFGDWIELLLN QAMLAERGSLENPVAFIKLVNALLAK >MS0437 htpX, HtpX protein MFKKSKKIFAVSFIVATLAACADTAQINQEAASSYTQTINQARAKGVVDT SSATSKRIQSVFNQMVPYADKENTTGVKFNWQLTVVKSNELNAWAMPGGK MMFYTGLVDKLNLTNDEIAVVMGHEMAHALQEHGKQSRNVGIMTGILGAA ADIAAAATLGVDTGGLGGTVADLGVNKPFSRSNETEADEIGLFLMAKAGF NPQAAPQLWVKMQKAGGSNGPSLLSTHPSDASRQENLQRLMPEALKIYKA RNSK >MS1134 htpX, HtpX protein MMRILLFLATNAAVLIVFNIILSLTGIRGQDAMGLLIMAALFGFTGSIIS LLMSKRSALAATGAEVIEQPRNDTERWLLQTVHSQAEKAGLPKPDVAIYH SNDVNAFATGASKNNSLVAVSTALLNNMTRDEAEGVLAHEISHIKNGDMV TMTLLQGVLNTFVIFAARMIARMVANNRSSEESNSGIYFLVAMVLEVVFG FLASMIAMWFSRFREFRADAGSAELAGKQKMIAALKRLQAIHEPQEMDGK LAAFAINGKRGGFTSLFLSHPPLEKRIEALETSK >MS0869 htrB, HtrB protein MIRDGLVKRINMSEKNKRLTARVGYEPHFSWSYLLPKYWGIWLGIFVLLI FAFIPFRLRDNLAAKLGLVIAKYAKKPRHKARVNLQYCFPQWSAEQREKV IDDMFITVTQVMLGIGEIAVRSKAHLQRRSVFFGIEHIQKAKEQGYNIIL MVPHGWAIDASGIILHTHGMPMTSMYNPHRNPLVDWLWTITRERFGGKMH ARQNGIKPFLNMVRKGDMGYYLPDEDYGAQASEFVDFFATYKATLPGLNK MAKLAKAVVIPMFPRYNAKAGRYEMEIHPAMELSEEPKQSARSMNAEIES FVSPAPEQYVWILRLLKTRKDGKDIYQ >MS1263 htrB, HtrB protein MAKNNTPIFQKSFLAPKYWPFWLAVGIFRLILLLPYPLLCKIGLGLGKLF SKLSVGKRRSQIVRRNLQLCFPNWNEEKIESTLQANLESVGMAIIETGMA WFWSDKRIAKWSKIEGIEYLKNNAKDGIILVGVHFLTLELGARIIGLQHP GIGVYRPNDNPIMDWLQYRGRIRSNKDLLDRKDLRSMIKALRTGNTIWYA PDHDYGRKNAVFVPLFSVPDAATTTGSYYLLKSSPLSKVVPFAPLRNTDG SGYTVTVEPPVDFTDILHDKEAIAKRMNKVVEREIMLGVEQYMWLHRRFK 
TRPNESDKSLYD >MS2365 hyaA, HyaA protein MEQIMQRTDGLLSALTHSVTDVSRRDFMKLCTALAATMGLTSKASAEEIT NALTNPQRPPVIWIGAQECTGCTESLLRATHPSIENLVLDMISLEYHEVL SAAFGDQAEENKHRALEKYKGKYVLVVDGSIPVKDGGVYCMVAGNPIIEH IKEAAKGAAAIIAIGSCSAWGGVPSSGGNPTGAKSLSEVLPGIPVINIPG CPPNPHNFLATVAYILTYKKLPATDKLNRPLFAYDRLIHENCYRRPHFDA GRFAKEYGDYGHRHGWCLYHLGCKGPETYGNCSTLDFCDVGGNNWPVGIG HPCYGCNEKGVGFTKGIFQLANVENPTPRVEKPDVANQEGETASMTAIAL LGAATAVLAGVAVETLKELSVQRKNQLEKEKTKQENSNNTGK >MS2361 hyaB, HyaB protein MTDKKRITIDPITRIEGHLRIDCEIENGVVTNAWSTGTMWRGMENIVKGA DPRDAWMIMQRICGVCTTVHAILSVRAVEDAVGAKVPLNAQYIRNMILAA HSIHDHIVHFYQLSAMDWVDITAVLKADPEKAANMLKGVSSWGLNSANEF RNVQTKVKKLADSGQLGIFANGYFGHPAMKLSPEVNLIAVAHYLQALECQ RDANRVVALLGGKTPHIQNLAIGGVANPINLDSQAVLNLERLMYVKSCID RLNDFINQVYKVDTAIFAAYYPEWLNLGKTSGNYLAVPEYPVNAENSEFA LTGGYLQNFDLNTFRPITQQKDNFVVQGIKESGKHAWYEDDEALAPWAGL TRPKYTQWDENGKYSWVKAPSFYDDVVEVGPLAYLLTNLAAKNEVTTKHF NELKSIYDQLAGRNLEINDLHSTLGRIIGRTVHCCALNEILTQQWQLLVN NIGKGDTIAYLKANIPENGEFRGVGFGEVPRGMLSHWVVIKDGKIENYQA VVPSTWNSGPRNQHDALGPYEQSLIGTPVADPAKPLEVVRTIHSFDPCMS CAVHVVNTETGETTKVKVL >MS2360 hyaD, HyaD protein MKPLILGVGNILLSDEGIGVRAVQHLEKNANFTPHFDLVDGGTCGMELLD VMANRDYLIIIDAVIAGKRPGEIVVLKDEQVPALFSRKISPHQLGICDVL SALKLTDEYPKHLCLIGIQPESLESHIGLTKTVENAMPAVFQCLAQQLTD LGLPSPVIN >MS1029 hybA, HybA protein MSAVQEQNIIKRSATSGVTPPPQVRKDVVEVAKLIDVTTCIGCKACQVAC SEWNDIRAPQEQCVGVYDNPRDMNAQQWTVMKFSEVEENDRLEWLIRKDG CMHCAEPGCLKACPAPGAIIQYANGIVDFQSEKCIGCGYCIAGCPFNVPK MSNEDNRVYKCTLCVDRVNVGQEPACVKTCPTGAIHFGSKEEMLHYAETR VADLKSRGYDNAGIYNPEGVGGTHVMYVLHHADRPELYAGLPKDPEIDVT VKLWKDILKPVAAVAMGGLALAEIAHYVGVGPNNEEDVEDHSAHFEREDA EEEQSHHNKGGK >MS2364 hybA, HybA protein MDRRKFIKAGMLGGVASALPLNAAHAEVKNQEPIPGALGMLYDSTLCVGC QACVAECQKINGTPVNPKGEQTWSNNDKLSPFTRNVIQVWSEGEGTNKDQ PQNGYAYIKKQCMHCVDPNCVSVCPVQALTKNPKTGIVGYDPDICTGCRY CMVACPFDVPKYDYDNPLGQISKCELCNQKGVERIVQGKLPGCCHVCPTG AIIFGSREELMAEAKRRLSMTQGTDYEFPRQHVNSKDKYQAKIPAYEQHI YGEIEGGGTQVLVLSGVPFENLGLPQLDEIATGARAAHLQHTLYRGMILP LVGLAGLTFITYRNMHGKKPEHHQEEDNNE >MS1817 hybA, HybA protein 
MTACSRRNFVSGMGALILTTGTSVKLSAQGEKPNETAPKRYAMVHDETSC IGCTACMDACRETNQVPEGVSRLEILRSEPHGEFPNQEYEFFRQSCQHCT NAPCVAVCPTGASFIDPETGIVDVNKDLCVGCQYCIAVCPYRVRFIHPVH KTADKCNFCRDTNLAAGKQPACVEACPTKALTFGDMNDPNSAVARKVREN PVYRTKLTLGTEPNLYHIPFAKGEHR >MS0890 hybA, HybA protein MSAVQEQNIIKRSATSGVTPPPQVRKDVVEVAKLIDVTTCIGCKACQVAC SEWNDIRAPQEQCVGVYDNPRDMNAQQWTVMKFSEVEENDRLEWLIRKDG CMHCAEPGCLKACPAPGAIIQYANGIVDFQSEKCIGCGYCIAGCPFNVPK MSNEDNRVYKCTLCVDRVNVGQEPACVKTCPTGAIHFGSKEEMLHYAETR VADLKSRGYDNAGIYNPEGVGGTHVMYVLHHADRPELYAGLPKDPEIDVT VKLWKDILKPVAAVAMGGLALAEIAHYVGVGPNNEEDVEDHSAHFEREDA EEEQSHHNKGGK >MS1545 hybF, HybF protein MEIVEEQCHRNNVNKVTDIWLEIGPLSCVEPDAIEFCFEVCRKNTVMENC KLHFVPVPALAYCWHCEKTVEIKSHHDACPQCGGIHLQKQGGDDLRIKEI AVE >MS1463 hypB, HypB protein MCTTCGCGHPEQVRIGELQHTHSHSEHQSAVKMPDFSQSVFHSMKPSIHE HAGEQDNTQKRLLKIEQDVLGKNNRIADSNRNLFNYLNLTVFNLVSSPGS GKTSLLTATLNSLKNDRNCYVIEGDQQTENDADRIRATGVPAIQVNTGKG CHLDAQMISDAMMKLRPQENGLLFIENVGNLVCPSEFDLGEKAKVVILSV TEGEDKPLKYPHMFAASKLMILNKVDLLPYLKFDVEKCIENAKRVNPQIE VIQLSAATGEGLQDWLNWLQQ >MS2358 hypC, HypC protein MCLGVPGQIIDVGEDGFQPAVVDVCGVQREVNISLICENNTTDLLGKWVL VHVGFAMSVIDEEEAKQTLSALMTMSQLDHEVGDFAGLNKN >MS1462 hypD, HypD protein MQFVDEFRDPKLAKHLVERLTKLMKNLPQFSAKNPLYLMEVCGGHTHSIF KFGLDRLLPESIEFIHGPGCPVCVLPMGRIDLCIEIAQNPNVIFCTFGDA MRVKGRKGSLLEAKAQGCDVRIVYSPLDALNIALSNPDKKVVFFSLGFET TMPAAAVTLQQAKRRNIANFWIVSQNITIIPTLRSLLSQDQIKIDGFIAP GHVSMVIGSAPYRELCKKFRKPFVIAGFEPLDILQSIVMLVEQFADGRCE VENQYKRIVHEQGNMLAQKAMAEVFQLKARSEWRGLGEIEESGVELTVDY RRFDAEIYFNSKAQQVADDPNSRCGDVLTGKCKPADCPLFGSDCNPDNAY GALMVSSEGACAAYYQYRRE >MS1461 hypE, HypE protein MTDFITMAHGNGGAAMQQLIRDYFVEAFDNPTLAQGEDQARIPLAELIKC GEKLAFSTDSFVIDPIFFPGGNIGKLAVCGTVNDIAVGGAIPKYLSCGFI LEEGLPLSELKEIIRAMAETCRRAGVQIVTGDTKVVQKGAVDKVFINTSG IGVIPAEIDWGAHQIEAGDKIIVSGTIGDHGATILNLRENLGIKTDLHSD CAVLSSLIDLLRPIQGVKAIRDATRGGVNAVLHEYAQTQNLGMQVHEEDL PMRNEVRGICELLGLEPLNFANEGKLVIITKAEKTQEILTALHSHELGKN AAIIGEVTDDKKVRVVGIFGQTRLLDLPANEPLPRIC >MS1546 hypF, HypF protein MNTEKQSVIELRIKGKVQGVGFRPFVWLLANQYGLKGDVNNDGQGVLIRF 
IEPDCASLQQFLRDLQNKLPPLAQITEIQETTKIGENLPHFSDFTIRESE NNAVDTQIVPDAATCPACLKELFAPRNRRFHYPFTNCTHCGPRFTIIKSI PYDRPNTSMANFPFCPECEREYKNPADRRFHAQPNACPVCGPHIWLQNQH TKIADHEAALIQTLHLLNEGKIIAIKGIGGFHLACDATNRQTVQLLRSRK RRPTKPLAIMVPDLQFLTALSRAETKLLTSSAAPIVLLSKHKVPAVDELI APHLNEIGVMLPSNPLQHLLLKAINKPLVMTSANPSGQPPVLDNESAVKF LQNLADFYLCHNRDILQRADDSLVRIAFDGLETLRRARGYVPDEISLNIS NNKNILALGSDLKNTFCLLKRNKAVVSQHIGDTADEKVRSQLEENLALFQ HIYQFKADLIAVDSHTGYFSSATGRQIAQCQQIPVMEILHHHAHIRAVMA EHNCNEKVIGIALDGIGMGENQQLWGGECLLINRTEVKHLGGLPAVALPG GDLAATQPWRNWLAHIHQFVENWQELAAKSCEKYNWQSLSRAIEQKINCP TISSAGRLFDAVAYSLNIAPENLSWEGEAACRLEALAGQSQFTEKSAVKI RQILPEFADELIWTNDKNNKTFLNLAKFWQSWRNYKAQKADKAFAFHLAL AAGFAELARQQANKYQCRTIVLSGGVMHNRLLRRLLKENLQEFNVLSAHQ FPMGDGGLSLGQAAIAADFT >MS1693 icc, Icc protein MISNTYIYEADSDVIRFVQITDPHLFKDEQGELLGVNTQQSLTQVLTELK ENQFNYDFVLATGDIVQDSSEEAYLRFCKSVQQLDKMVFWIPGNHDFQPK MFDILVQEHGNLSPKKHLLLGDKWQILMLDSQVFGVPHGQLGQYQLEWLD SKLKDNPDRYSLVVLHHHILPTHSSWLDQHNLRNAHELAQVLAQYDNVRG ILYGHIHQAMDGTWKDYQIMATPSTCIQFKPDSNVFALDTLQPGWREVEL HSDGSIITRVNRIQKASFLPNMQEDGY >MS2370 icd, Icd protein MQSKVKIPQGDKIQLADNGALIVPHNPIIPFIEGDGIGVDVTPAMKAVID AAVEKAYGGTRKISWMEIYAGGKANQVYGENTWLPAETLELIRQYHVAIK GPLMTPVGGGIRSLNVAMRQGLDLYNCLRPIRYYEGTPSPVKHPEFVNMV IFRENSEDIYAGVEWVAGSPGADKLINFLQREMGVTKIRFTEDCGIGIKP VSKQGSQRLVRAALQYVIDNDRSSLTLVHKGNIMKFTEGAFKEWGYQVAK EFGAELIDQGPWMKLKNPNTGTEIIIKDCIADAFLQEVLLHPKDYDVIAT LNLNGDYISDALAAQVGGIGISPGANIGDDAAIFEATHGTAPKIAGQNKG NPGSLILSGEMMLRHLGWLEAADLVVNAVAKTIADKTVTFDFAEMLEGAT LRSTSEFAEDIIANM >MS0562 iclR, IclR protein MEKENQPEAVSSVLKVFGIIEALAEQKEIGITELAQRLMMSKSTTYRFLQ TMKTLGFVSQEGETEKYTLTLKLFEVGAKALEYADIIGLANHEMSYISRQ TNETLHLGTLDGTEIIYLHKIDSGYNLRMYSRIGRRNPIYSTAIGKVLLS GLTNKEIRELLADLTFVKHTSKTLENIDQLIEEIEKVRKQHYAEDNEEQE PGLRCVAAPIYNRFGRIIAGLSISIPTIRFEEEKLPQLVNLLQVAGKNIS EQIGYHDYPEILAP >MS0055 iclR, IclR protein MFFIVRRLKEMEKNSGNQSLIRGLRLIEILSRFPNGCPLVQLANISELNK STVHRLLQGLQQEGFVQPAITVGSYRLTSKCLSIGHKIFSSLNIINIISP HLENLNLDLGETINFSMRENDHAIMIYKLEPTTGMMRTRAYIGQHLQLYC 
SAMGKLYLAYDRPAYLKEYWQTNNDNIQTLTCNTITELPVMEKELDEIKK QGFAVDKEENEIGISCIACPIFNFQNKVEYAMSVSISTSKLNQYGIEHLL EKIKLTAEAISLELGWLPESVQN >MS1751 ileS, IleS protein MVRKMSEQKDYKNTLNLPETGFPMRGDLAKREPGMLKNWYDNDLYQKIRQ SSKGKKSFILHDGPPYANGSIHIGHAVNKILKDIIIKSKTALGFDSPYIP GWDCHGLPIELKVEGLVGKPNQKISAAQFREECRKYAREQVEGQKKDFIR LGVLGDWDNPYLTMNFDTEANIIRAFGKAVANGHLYKGSKPVHWCLDCAS SLAEAEVEYEDRTSPSIYVRFAAADESAVENKFVLTEQGKGKLSAVIWTT TPWTLPSNKAISINPELEYQIVQFGDERFILAAELVESVAQAVGVESWKA LGSAKGSDLELLQFKHPFYDYNVPFILGDHVTLDGGTGLVHTAPDHGQDD YVVARKYNIGMAGLIGNDGKFNSNAKFFAGLGVFEANGKVLEKLDEVGAL LKLEKIRHSYPHCWRHKTPIIFRATPQWFIGMETQGLRQQALSEIKKVRW IPDWGQARIEKMVENRPDWCISRQRTWGVPVALFIHKETEQLHPRTLELI EEVAKLVERKGIQAWWDLDAKDLLGDDAAHYSKVPDTLDVWFDSGSTYYS VVKNRPEFNGKEADMYLEGSDQHRGWFMSSLMLSTATDNKAPYKQVLTHG FTVDGQGRKMSKSIGNIVTPQEVMDKFGGDILRLWVASTDYTGEMTVSDE ILKRAADSYRRIRNTARFLLANLNGFDPKRDLVQAHEMISLDRWAVDCAF RAQAEIKEAYDNYQFHTVVQRLMKFCSVEMGSFYLDIIKDRQYTTKADSL ARRSCQTALWHIAEALVRWMAPILSFTADEIWGYLPGERGEFVFTEEFYD GLFALDVSESLDDAYWQQVITVRNEVNRVLEQARNDKVIGGGLEAEVTIF ANDEYSALLNKLGNELRFVTITSKAEVKTLADADVAEGEVAGLAIKAIRS ANHKCPRCWHYSDSKDANSLCSRCEENVNGNGEERRFA >MS2218 ilvA, IlvA protein MVNNLSNAPTGAEYLRAILISKVYEAAKVTPLQLMPKLSERLGNRIYVKR EDHQPVHSFKLRGAYAMISGLTQAQKEAGVITASAGNHAQGVALSAKNAG IRALIVMPQNTPSIKVDAVRGHGGEVLLHGANFDEAKAKAIELSQTEQMT FIPPFDHPAVIAGQGSIGMELLQQNGHINRIFVPVGGGGLLAGVAVLIKQ LMPEIKVIGVEAKDSACLYYALKAGRPVDLERVGLFADGVAVKRIGDETF RICQQYVDDVILVDGDEICAAMKDMFENVRAVPEPSGALSLAGLKKYAKQ HNLQGETLVNLLSGANLNFHTLRYVSERCEIGEKHEALFAVTIPEQRGSF LKFCQILGQNAVTEFNYRYADEKQACIFVGVRITGEQEKQVIIQQLKQGG YDVQDLSDDDIAKTHIRYMVGGRSSSDLNERLYSFEFPEQKGALLKFLET LGTTDANISLFHYRGHGADYGDVLAGFQINDADLPAFKQHLEKLGYAYQD VTDSPSYRYFLG >MS1319 ilvB, IlvB protein MKMKKLSGAEMVVQSLRDQGVKYLFGYPGGSVLDIYDAIHTLGGIEHVLV RHEQAAVHMADGYARSTGEVGCVLVTSGPGSTNAVTGILTAYTDSVPLVI ITGQVRSNLIGTDAFQECDTIGLTRPVVKHSFMVKHAEDIPETIKKAFYI ASSGRPGPVVIDIPKDVVNPANKYTYEYPKEVSLRSYNPNVQGHKGQIKK ALKALLVAKKPVLFIGGGVIIGNSSEKLTQFAQLLNLPVTSSLMGLGGYP GTDKQFLGMLGMHGTYQANMAMHNADLILGIGVRFDDRTTNNVEKYCPHA 
KVIHVDIDPTSISKNIAADIPIVGSVDNVLTEFLSLLEDDNLSKSQSDLT EWWKQIDEWKAKKCLEFDRTSQAIKPQAVVEAIYRLTKGEAYIASDVGQH QMFAALHYPFDKPRHWINSGGAGTMGFGLPAAIGTKFAHPDSRVVCITGD GSIQMNIQELSTAKQYGTPIVIVSLNNRFLGMVKQWQDLIYSGRHSQVYM NSLPDFAKLAEAYGHVGIQINTADELEEKLTQAFAVKDKLVFVDVLVDAT ENVYPMQITGGGMNEMLLGKPAEK >MS2223 ilvB, IlvB protein MNGANLVTECLKAHNVDTVFGYPGGAIMPVYDALYDCGINHLLCRNEQGA AMAAIGYARSTGKTGVCIATSGPGATNLITGLGDALMDSIPLVAITGQVA APLIGTDAFQEADVLGLSLACTKHSFIVQNIEELPEIFAKAFKIAQSGRP GPVLIDIPKDVQFAETLLQPIVYSVEKPTALSAKSLEKAVELLKNAKRPV AYIGGGVGMAKAVPALHEFLTATRIPTICTLKGLGAVPADNPYYMGMIGM HGTKAANYATQEADLLLVLGARFDDRVTGKLSSFATEAKVIHADIDVAEI NKLRRADVALCGDLEQALKALSFALDIEPWRADVQRLKRDFDWDYGENEG EGDINPLFLLNRVSRLKAENAIVVTDVGQHQMWAAQHMSFGKPENFITSA GFGTMGFGLPVAIGAQKARPRDQVILVTGDGSIMMNIQELGSIKRAKTPI KILLLDNQRLGMVRQWQSLFFHGRHSSTILDDNPDFVTLASAFGIRGERI EKAGEVNEALDRFFASQEAYLLHVCVHEDENVWPLVPPGACNVEMIEEMS >MS0045 ilvC, IlvC protein MSNYFNTLNLRQKLDQLGRCRFMERSEFADGCNFLKGKKIVIVGCGAQGL NQGLNMRDSGLDISYALRPEAITEKRASFQRATENGFKVGTYQELIPTAD LVVNLTPDKQHSKVVADVMPLMKQGASFGYSHGFNIVEVGEQIREDITVV MVAPKCPGTEVREEYKRGFGVPTLIAVHPANDPKGEGMAIAKAWASATGG DRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSIVCYDKLVADGKDPAYA GKLIQYGWETITEALKQGGITLMMDRLSNSAKIRAFELAEEIKEHLNFLY LKHMDDIISGEFSATMMADWANGDKDLFAWREATGKTAFENAPKADGIKI SEQEYFDNGVVMVAMVKAGVEMAFDAMVASGIYEESAYYESLHELPLIAN TIARKRLYEMNVVISDTAEYGNYLFSNVATPILAKEIVSQLKRGDLGEPT PAAEIDNVYLRDINDTIRNHPVELIGQELRGYMTDMKRISSQG >MS2219 ilvD, IlvD protein MEIFMPKLRSATSTQGRNMAGARSLWRATGMKEGDFGKPIIAVVNSFTQF VPGHVHLHDIGQMVVKQIEAAGGVAKEFNTIAVDDGIAMGHGGMLYSLPS RDLIADSVEYMVNAHCADAMVCISNCDKITPGMLMAAMRLNIPTIFVSGG PMEAGKTKLSDQLIKLDLIDAMIQSADKNVSDSDVDAIERSACPTCGSCS GMFTANSMNCLTEALGLSLPGNGSCLATHADRKQLFLDAATQIVELCKRH YEQDDYSVLPRSIATKAAFENAMSLDIAMGGSTNTVLHLLAVAQEAEVDF TMADIDRLSRIVPCLSKVAPNTNKYHMEDVHRAGGVMAILGELDRANLLH HDTKTVLGLTFAEQLAKYDIKLTRDEAVKTFYRSGPAGIRTTEAFSQDCR WETLDDDRENGCIRDKAHAYSQDGGLAMLSGNIALDGCIVKTAGVDESIL KFTGEAIVFESQEDAVDGILGGKVKAGHVVVIRYEGPKGGPGMQEMLYPT SYLKSMGLGKACALLTDGRFSGGTSGLSIGHCSPEAASGGTIGLVRNGDI 
IAIDIPNRSIQLQVSDEELATRRAEQDVKGWKPANRAREVSFALKVFGHF ATSADKGAVRDKTKL >MS2192 ilvE, IlvE protein MCRIGIFMDYPLFETVAVERGEILNLDYHQTRYEQALHQYYGRKVLPFNL QEILQKSTALLTLKRSEPLIRCRIDYNDQDYRLQCFAYQRKVFRSFQPVI CDHIDYGLKFSDRRIFAELLRQKGKHDEIIIIKQGLVTDCTIGNLLFRKN QQWFTPEAPLLNGTQRAKLLAEKRIQTLNIKRQDIAQFDEIRLINAMNPF SESL >MS0896 ilvE, IlvE protein MKDLDWKNLGFGYTKTDYRYIAYWKNGEWQKGELTKDNTLHISEGSPALH YGQQCFEGLKAYRTKDGSIQLFRPDQNALRMQQSADRLLMPRVPVDMFID ACKQVVKANEEWVGPYGSGATLYLRPFLIGVGDNVGVHPAKEYIFSIFVC PVGAYFKGGLAPSKFLISTHFDRAAPHGTGAAKVGGNYAASLYPGKYAKE HGFADCIYLDPATHTKIEEVGSANFFGITKDNKFITPISPSILPSITKYS LLYLAKERLGLEVEEGDVYVKDLDQFAEAGACGTAAVITPISGVQIDDKY HVFYSETEIGPITQKLYDELTGIQFGDKPAPEGWIVKVE >MS2222 ilvH, IlvH protein MDSSWRKSMTNELTIVAHHRPEILERILRVVRHRGFTVIKLKMNLENGKI WLDFVVEGERDICLLVHQLVKLEDIIDITTDEECECDE >MS1318 ilvH, IlvH protein MRRILSVLLENESGALSRVVALFSQRAFNIESLTVAPTDDPTLSRMTIEA SGDEAILEQIEKQLHKLVDVFKVINLSDCEHVEREVMLLKLRATGSTRDE IKRLTDIFRGQIVDVTTKSYTIQLAGTKDKLNAFVSAVKEETTIIEIVRS GLISLSRGEKNCL >MS2357 imp, Imp protein MKKNYYSLISFSIFTALYSTAGFADLQQQCLAGVPQFSGEVVKGNANEMP VYIEADKAELNHPTKGVYQGNVDIKQGNRHLITETAEIIQSGQDENVQRY AYAKGGFDYKDNIINLTGDDAKVHLNTKDTDVKNADYQFVGRQGRGSAQS AEVREDYRLLNNATFTSCLPNDNSWQIEAKEMKQYIKEEYAEMWHARFKV AGVPVFYTPYLQLPIGDRRRSGLLIPSAGSSSRDGYWYSQPIYWNIAPNY DATFTPKYMTHRGWQMNGEFRYLNEIGEGKIAGEYLGDDRYKDYIGDNKS RHLFYWAHNAKLFDNWRLNVNYTKVSDKRYFSDFDSDYGSSTDGYATQTA RLAYFQPNYNFAISAKQYQVFDEVSVGPYKALPQIDFNYYQNDLAQGLLD FKLFAQAVRFENDSTLMPTAWRYHAEPSLNLPMSNQYGSLNVETKLYATH YEQRKGSSARAEDIDRSVNRMIPQIKVDLQTVLASDKTFVDGFTQTLEPH LQYLYRPYRDQSNIGSKRNTEYLGYGYDSALLQQDYFSLFRDRRYSGLDR IASANQFTLGGTTRFYDEQANERFNLSLGQILYLNDSRIDNNSDHSTSGR ASSWALESNWKLSDQWNWRGSYQYDTRLNETSLANTTLEYNPEKNNLIQL NYRYASQAYIDQNLTSGANRYNQDIKQIGTTIAWEVSDNWVLVGRYYHDI ALNKLVEEYAGIKYNTCCWSVGVGARRHLVSKSNYTYSANKDTIYDNSFG ITFELRGLGNEQHSGIVDMLDKGMLPYVKPFNL >MS0654 infA, InfA protein MSKQSKQEKLPPPHKNGYNSHHSKKLPQKCGKNFIIFLEDTMAKEDCIEM QGTILETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTIEMTP YDLNKGRIIFRSR >MS1444 infB, InfB protein 
MTDKETQNENAPKKLSLQRRVKTTVAGGKVQVEVRKSRKIDTEAAKKAAE EAKLKAQEAAEKAAAEKAEKEAAEKAKKNAEKARVAAAVKKPEPVKVVDA EKDRIKAEEAELRRKADELARQKAEEQARKAAEEAKRLAELAADRETTEV SDDFSDYHLTSTYAREAEDEEERRKEGRGRGKNKVGKAKKGGRDDNGSKD ERNADRRNQKDVKGKGKQGKKGSSAIQQAFTKPAQAVNRDVVIGETITVA ELANKMAVKATEIIKTMMKMGEMVTINQVIDQETAQLVAEEMGHKVILRK ENELEESVLEDRDVNAEKVTRAPVVTIMGHVDHGKTSLLDYIRKAKVAAG EAGGITQHIGAYHVETNGKMITFLDTPGHAAFTSMRARGAKATDIVVLVV AADDGVMPQTIEAIQHARAASVPLVVAVNKIDKPEANPDRVEQELLQYDV VSEKFGGDTQFVYVSAKKGTGVDELLDAILLQSEVLELTAVKEGMATGVV IESYLDKGRGPVATILVQSGTLNRGDILLCGFEYGRVRAMRDELGKDVES AGPSIPVEVLGLSGVPAAGDEATVVRDEKKAREVALYRQGKFREVKLARQ QKAKLENMFSNMAEGDVAELNVIVKADVQGSVEAIVQSLQELSTEEVKVK VVGSGVGGITETDATLAAASNAIIVGFNVRADASARRIIETENIDLRYYS IIYELLNEIKAAMSGMLQPEFKQEIIGLAEVRDVFRSPKFGAIAGCMVTE GVIKRNNPIRVLRDNVVIFEGELESLRRFKDDVNEVRNGMECGIGVKNYN DVKVGDQIEVFEVVEIKRSI >MS1054 infC, InfC protein MSRAEEVELDLVEISPNAEPPVCRIMNYGKFIYEKEKAAKEQKKKQKVVQ VKEIKFRPGTDEGDYQVKLRNIVRFLEDGDKVKITVRFRGREMAHQDIGL DVLDRVKQDTAEIAMVESAPGKLEGRQAVMVIAPKKK >MS1208 insB, InsB protein MKLLRQLSVAALGLFALQAAQSKNLIVYFTVPESVPTEKLDGVSGASVII KDNERLGSAEYLAKEVQKTAGGDLFRLETVQAYPTVHQQLLDFAQEEQRK NIRPALKAKPNLNGYETIFVAYPIWWYKLPMPLYSLFEQVDFSGKNIVPL VTHGGSRLSGTDRDIAQLQPKATVKDGFEYYLYKTTGADSTMEQKLADWL VRQGYAK >MS1529 iolE, IolE protein MKTIKGPGLFLAQFIDNKAPFNRLDSLAQWAAGLGFKALQIPCNHKHIFD VELAAQSQTYCDEVKGILAQYGLIVSELSTHLEGQLVAVHPAYDTAFDGF APEQVRGNSEARQLWAVNIIKQAAVASKRLGLNAHASFSGSLAWPYFYPW PPRKEQLIQTAFDELAKRWKPILDYFDEQGVDLCYELHPGEDLHDGVTFE RFLDKLNRHPRCNILYDPSHMLLQNMDYLQFIDFYHERIKAFHVKDAEFV KSAKSGCYGGYQNWLERAGHFRSLGDGQIDFKAIFSKLTQYDYTGWAVLE WECCMKDSAVGAKEGAEFIQKHIIPVAEKSFDNFADTGDDSEQAKIMLGL K >MS1307 iscA, IscA protein MTDMTIPLTFTDAAAKKVKNLIIEEENQDLKLRVYITGGGCSGFQYGFTF DEKVNDGDLTIENDGVKLVIDPMSLQYLIGGTVDYTEGLEGSRFVVHNPN ATTTCGCGSSFSI >MS1723 iscA, IscA protein MSVEQFSVEDAEQSSQSASIGMTESAAKHVKKCLESRGKGIGLRLGIKTS GCSGLAYVLEFVDELNSDDNVFEQHGVKVIVDTKSLVYLNGTQLDFVKEG LNEGFKFTNPNVKDQCGCGESFNV >MS1724 iscU, IscU protein 
MAYSEKVIDHYENPRNVGSFDKKSSDVGTGMVGAPACGDVMQLQIKVNEE GIIEDAKFKTYGCGSAIASSSLITEWVKGKSLDEAGAIKNSQIAEELELP PVKVHCSILAEDAIKAAIADYKSKKGA >MS1600 ispA, IspA protein MHCLLSELKENRVKSTALFLFNYIKNCIMTTTNNLMDIKTIQALVSDDMQ KVNEEILAQLNSDVPLINQLGYYIIHSGGKRIRPMIANLAAKALNYQDNK HITCAAFIEFLHTATLLHDDVVDESDMRRGNPTANAEFGNAASVLVGDYM HTRSFQMMTELGSLRILQVMSAATNVIAEGEVQQLMNVRDPDTTEQNYMK VIYSKTARLFEVSTQTVAILADAGAAIEEGLQNYGRYLGTAFQLVDDILD YSANAATLGKNIGDDLAEGKPTLPLLHAMRHGNPQQAELIRNIILQGGNR DALDEVLTIMHEHKSLDYAMEHAKQEAQKAVDALAPLPDNIYKRAMISLA YLSVDRAY >MS1060 ispA, IspA protein MMRLMYQFSNDLQQAQQRINRFLEAQFDEINTRPSPLADAMKYGLLLGGK RIRPFLVYATGRMLGADTAQLDYAAAAIEAIHAYSLIHDDLPAMDNDSLR RGQPTCHIAFDHATAILAGDALQAFAFEILTKSTALSAEQKLRLIQVLSH NSGVFGMCLGQSLDLISEHKQISLSELEQIHRNKTGALLSAALKMGFICS SHFADNALEAKLDRYAAAIGLAFQVQDDILDIEGDLAAIGKNVGSDLESD KSTYPKLLGLAGAKQKARELYVAAVGELEHLPFDTTALRALAEFIINRKN >MS2275 ispD, IspD protein MTRHSRPIIAVVPAAGVGSRMQADKPKQYLTLLGKTLLEHTLEVLLSYTP IQQIILAVAENDPYLDQLDVIRQPKIKIVQGGRDRAGSVFNGLKAITQPH AWVMVHDAARPCLTHEDLDKLLQIEDDNGGILAIPAVDTIKRASAEKQII QTEDRSQLWQAQTPQFFRADLLYRALQQAFEHGLAVTDEASAMEFAGFRP HLVAGRSDNLKVTRPEDLKLAEFYLSRK >MS1535 ispE, IspE protein MKTHQFSTALFSDYKQGDSFNFPCPAKLNLFLYINGRRTDGYHELQTLFQ FLDYGDWLSIKVRNDGKIRLTPEIPDLKTEDNLIYRAAKLLQQKTACRLG ADLHLDKVLPMGGGVGGGSSNAATALVALNYLWKTGLSVNELAELGLKLG ADVPIFVHGKAAFAEGVGEKITYCEPPEKWYAVIKPNVSISTAKVFSEPD LTRDTKKKPLEQLLQQEYTNDCEKVVRKLYPEVEELLRWLVKYAPSRLTG SGACVFAEFADEQSAQTVFNLKSKQFSGFVAQGLNVSPLHKMLEQLNRQN HG >MS2274 ispF, IspF protein MIRIGHGFDVHAFGEARPLIIGGVEVPYHTGFIAHSDGDVALHALTDALL GALALGDIGKLFPDTDMQFKNIDSRILLREAFRRVQEKGYKIGNVDVTII AQAPKMRPHIDAMRAVIAEDLQCSVEQVNVKATTTEKLGFTGRSEGITTE AVALLVKSC >MS0507 kamA, KamA protein MRILTQNNPVREENWLEILANSISDPEVLLKTLSLPIDKFEKDIHARKLF AMRVPLPFVRKMELGNAQDPLFLQAMSSADEFLTADGFSKDPLEEQQVVA PNILHKYKNRLLLMVKGGCAINCRYCFRRHFPYADNQGNKANWQKALDYI SANPQIEEVIFSGGDPLMAKDHELDWLIKKLEKIPHLQRLRIHTRLPVVI PQRITGAFCKILTESRLNTVLVTHINHGNEIDEQLTRALNKLKNAGVVLL NQSVLLKNINDNAQTLKNLSDKLFRAGILPYYLHLLDKVEGASHFYVPDQ 
RAVEIYRELQSLTSGYLVPKLAREIAHEPNKTLYGG >MS1189 kdsA, KdsA protein MQNKIVRIGDINVANDNPFVLFGGMNVLESRDMAMQVCEKYVEVTNKLGV PYVFKASFDKANRSSIHSYRGPGMEEGLKIFQELKQTFGVKIITDVHEIY QCKPVAEVADVIQLPAFLARQTDLVEAMARTGAVINVKKPQFLSPGQMGN IVEKIEECGNDKVILCDRGSNFGYDNLVVDMLGFGVMKKVSKGAPVIFDV THSLQCRDPFGAASGGRRDQVTELARAGLAVGIAGLFLEAHPDPNNAKCD GPSALPLSVLEGFVSQMKALDDLVKSFPQLDTSK >MS0935 kdsB, KdsB protein MTNFTVIIPARFASSRLPGKPLADIAGKPMIIHVLEKARLSGATRVVVAT DNEEVKQAVEQFGGEVCMTSAKHNSGTERLAEVVETLNIPDDEIIVNIQG DEPLIPPVIVSQVAENLCKFKVNMASLAVKIHESAELFNPNAVKVLTDKD GYVLYFSRAPIPWNRDAFARLNSGELKQEELDLADHYLRHIGIYAYRAGF IKQYVQWEPSALEQIESLEQLRVLWYGEKIHVELAKEIPAVGVDTAEDLE KVRAILSNF >MS1952 kdtA, KdtA protein MLRFVYSFAMYILQPFVLLFILLRSIKSPNYRKRLNERYGIYANLTPPKP QGIIVHAASVGEVIAATPLVRRIQQDYPDLPITMTTVTPTGSDRVKAAFG DSVSHFYLPYDLPDAMDRFIRFVRPKACIVIETEIWPNLIRQLHNKNIPF IIANARLSARSAKRYGWVKNILNRMFNEISLIAPQDDISGNRYLDLGYRG DLQLTGNIKYDLVISDALSQQIKRLHQEWAGERPVWIAASTHEGEEGIVL QAHRSLLQKFPDLLLILVPRHPERFKAVEDLIVKGGFSYCRRSENVAPGS DTQVVLGDTMGEMMLLYGISDIAFVGGSLVKHGGHNPLEPLAFKLPVISG YHTFNFPEVFTKLRDVNGVLEIKENSTALSSAVEKFLLSPALRERYGNAG YEVLIENRGALQRLLQLLTPYLENKK >MS2136 kefB, KefB protein MVTEGANYLVSIVTFLGAAIIVVPLFKKIGLGPVLGYLAAGLAIGPFGLA LFTDSTTIIHIAELGVVMFLFLVGLEIQPKQLWGLRKYIFGMGSLQVLGA TGALTAIGLLYNFSLQFSFIAASGFVLTSTAIVMQTLSYRNDMTSDPSRR IIAVLLFEDLLIVPLLALVAILSPAQTADSLQSHSLWQHIIVSFAGLALL IVAGIWLLDPLFRLVAKTKIRELMTAVALFVVLGSALLMEATGLSMAMGA FLAGVLLSNSSFRHQLEVDIDPFKGLLLGLFFLGVGMSLDLTHVLNHWKM IVSALFLMMITKGIIIYAVARATGSTKLQSLDRAVLMAQGGEFAFVLFSS AALQGVISAEVHANMTAIVVLSMALTPLFIVIYQKWIAPKFAVREVLEND VIEEQNDIILIGLGRFGQIVNHLLRASGFQPTIIDKDAKLVSGMKKRGIR SYFGDACHPDLLHRAGIETVKLVIVAIDNTKQATKIVQHIRQINPKAKII ARAYDRHHVFELAQAGANVQIRETFDSALRTGKQALTTLGIEQEKVHRIG NMFFGKDRHSVKLMADVYDPKKPMFTNADMLKIAFEQDEELKLEIQKILD EEI >MS0630 ksgA, KsgA protein MNSKRHLGHTARKRFGQNFLHDDNVIQGIVAAIYPQKGQFLVEIGPGLGA LTEPVADQTDRLTVVELDRDLAQRLRHHPFLHQKLNVIETDAMQFDFGKL YEDEHLAEQGQKLRVFGNLPYNISTPLIFHLLKFYDKIQDMHFMLQKEVV KRLCAAPNSKAYGRLTIMTQYFCQVMPVLEVPPTAFKPAPKVDSAVVRLI 
PHKELPHPVKDLYWLNRVTSQAFNQRRKTLRNALSTLFTPEQLTALNIDL TARAENLSIADYARLANWLADNPPADVRRDEIIEENEE >MS0806 lacZ, LacZ protein MFIHRYFEDPQALHINTTPHHAYFIPQKCGQKWENFEPEQSLFYLSLNGY WDFRYYLSPQELPESPNEVNFEAKIPVPSNWQTQGYDRHHYTNINYPFPF DPPYVPQNNPCGIYRRTFELNKKENKHYLLNFEGVDSCLYVYINQTFVGY GQISHSTNEFDITDFVQAGNNEIFVVVLKWCDGSYLEDQDKFRMSGIFRD VYILEREANYLQDFFIRTDLSPNLNNAQIKVETKFLENNQNIDYALYDPS GKLLIQQQTDKFEISFDNPETWNAENPRLYTLIMSYGQEQIVQRLGFRRI QIENGILLFNGQPIKFRGVNRHDSDPVTGYHISRTQAVRDLQLMKAHNIN AIRTAHYPNAPWFSELCDRYGFYLIGESDIESHGSSMLAVRQTEPSIFLN VKNSYEHERIRQDNIDNLCYFARDPQFKEALLDRTYANVERDKNRTSVII WSLGNESGYGENFEACAAWVKSRDPGRLVHYESSIYQHSAHRNDLSNLDF YSEMYAATEDLDAYFANPANLKPFMLCEYSHAMGNSNGDAEDYFQAFHRH PGSCGGFVWEWCDHAPYRDDKPEHFGYGGDFGESPHDGNFCVDGLVSPDR IPHTNLLELKNVNRPVRAWLEQGKVYIKNYLDFTNLKEILTIRYSFSENG KVINQSELQIDCAPHQIQVLDIALPADNGNLCWLNLDYVLTQPTDLLSKN HLLGFDQLIIFKQGALPAQIFKNKTGHFKIQDTAQTLKICQGDFSYELDK NKGIFSRITYREQNLIEQPLDFNIWRAPLDNDRLIRQSWQQAGYDKTYSR AYQINWIEKNEGILIQANLALLAVSQGRILNLAVNYLLGADGQMKISLRA TRPEHLPYLPRFGLRFFLPKGQTQGQYFGYGPQESYVDKHHLAKLGIYPL NATDNYVDYLKPQENGSHYGSRYITLNSLHVSADQPFSFNLLPYSQEELT TKAHNYELKESPWDILCLDYKMSGIGSNSCGPNLKEQYRLSETDFNWGVF LQWSNPAR >MS0749 lacZ, LacZ protein MFLPNYFQNPQILHVNATPHHAYFIPHDSVESAVKNPRESSAFFTLLNGE WNFQYFASYYDLTEDFLTRHLPDKIPVPANWQNHGYDHHQYTNVNYPIPF DPPYVPQDNPCGLYQRKFCINLNKAKRYLLNFEGVDSCLFVYINQQFVGY SQISHCTSEFDVTDFLRQGENEIHVLVLKWCDGSYLEDQDKFRMSGIFRD VYLLERESHYLQDFFIRTELAEDLKSAVLKVEPFFVQAGEPAALREMAWQ LSDPQGNILLSAVTERGFEYVVNELQLWNAEQPKLYTLLFRYGSEVICQK IGFRKIEVKEGVLHFNHQPIKFKGVNRHDSDPKTGYVISREQALTDLRLI KTHNFNAIRTAHYPNAPWFAELCDELGFYLIAESDIESHGSNAVYVEMPE TSILLNVKTDPKTDEIQQKTVDEYCYFARDPNFKQAILDRTYANVQRDKN RASVVIWSLGNESGFGENFEAAAKWVKGFDPSRLVHYENSIYQHSEHTND LTHIDLYSEMYASTESMQAHFADPNNRKPYLLCEYSHAMGNSCGDAEDYW QIFNQYPQACGGFVWEWCNHSPYLTDGKMGYGGDFGDEPNDGNFCADGLV TADRQVQSSLLEMKNVNRPLRANLTAQGVELTNYLDFTDTEDFIAVHYQF SENGIVVGEGYIDDVKISPKQTALLPLNLPEDNGNLWLLDLTYYQKKETP LVAKNHQLGFDQIALFGQRIVPVARIGRVKSAVKIWQDSAVIKIQTEKAQ FVMDKRKGIVQQIMTEQGGLLREPLDFNIWRALADNDNLIKRQWQAAGYD 
RAITRAYEIQAEDFSYKAVVKAKCGLVALSKARILTLDVVYHIYANGELK IEIDAEKAPQLPFLPRFGFRFVLDEAFQQGEYFGYGETESYADKHHGAKL GLYRTTAQQNHRDYLKPQENGSHWGCSFVKLRSENEEICVTSDKPFSFNL SPYKQESLQKAKHNYDLEDSNSTVLCIDYKMSGIGSNSCGPVLKAIYRLK ENKWHCGFRLQIL >MS1728 lasT, LasT protein MLNNIRLVLVETSHSGNIGSAARAMKTMGLANLYLVSPKQGIDEQAVALS AGAEDVLRKAVIVKSFDEAVADCSLVIGTSARLRHLQSTLIEPRECGKIS IQEAHCGQIAIVFGREKFGLTNDELLKCRYHLNIPANPDYSSLNLAMAVQ LVSYELRMAWLNGRQSESMSDSVEIAQKPTALELEYFFAHTEKLYQSLGF IQNQGVMQKLRHLYNRVNLKKNELNILRGMLSAVEKRLDLLRDK >MS1188 ldhA, LdhA protein MMKSAVVFTALFLYAISHIKDELYLPKQGAFMKIVFLDSTALPPHLPIPR PDFDHEWIDYPYTGAEQTVERAKDADIVVTSKVIFSREVMEQLPKLKLIA LTATGTNNIDLIAAKELGIRVKNVAGYSSVTVPEHVLGLIFSLKHSLAGW YRDQLEGKWGESKQFCYFDYPITDIRGSVLGVVGKGCLGTEVGRLATALG MKVLYAEHRDAQSCREGYTPFDEVLKQADIVTLHCPLTEHTTNLINKETL SLFKKGAFLINTGRGPLVDEQALLDALKSGHLAGAAIDVMIKEPPEKDNP LIVAAKTMPNLLITPHIAWASDSAVTTLVNKVRDNIEEFVATGK >MS2079 ldhA, LdhA protein MTKSVCLNKELTMKVAVYSTKNYDRKHLDLANKKFNFELHFFDFLLDEQT AKMAEGADAVCIFVNDDASRPVLTKLAQIGVKIIALRCAGFNNVDLEAAK ELGLKVVRVPAYSPEAVAEHAIGLMLTLNRRIHKAYQRTRDANFSLEGLV GFNMFGKTAGVIGTGKIGLAAIRILKGFGMDVLAFDPFKNPTAEALGAKY VGLDELYAKSHVITLHCPATADNYHLLNEAAFNKMRDGVMIINTSRGVLI DSRAAIEALKRQKIGALGMDVYENERDLFFEDKSNDVITDDVFRRLSSCH NVLFTGHQAFLTEEALNNIADVTLSNIQAVSKNATCENSVEG >MS1217 lemA, LemA protein MKKWLLIIIVAVIAGFTLMSSYNGLVKAEEEIDSVWANVESQYQRRSDLI PNLVNTVKGQANFEQETLTGVIEARAKATQTKIDPANMTEEQLAQFQQNQ DSVGSALSRLLVSVERYPELKAHEGFMNLQAQLEGTENRINVARNKFNEA ARVYNQKVRQFPTKLAAMILGFKEKPYFKSTAGAENAPTVSFDK >MS0371 lepA, LepA protein MFFYVFLASFLLFISSDIFFSLTKKCGKNFANFIALSRTFGYNQPIKLFI KNNYFEHITIKMKNIRNFSIIAHIDHGKSTLSDRLIQTCGGLSDREMEAQ VLDSMDLERERGITIKAQSVTLNYKAKNGETYQLNFIDTPGHVDFSYEVS RSLAACEGALLVVDAGQGVEAQTLANCYTAIEMNLEVVPILNKIDLPAAD PERVAEEIEDIVGIDAMEAVRCSAKTGVGIEDVLEEIVAKIPAPKGDPNA PLQALIIDSWFDNYLGVVSLVRVKNGVLRKGDKIKVMSTGQTYNVDRLGI FTPKQVDKNELECGEVGWVVCAIKDILGAPVGDTLTSQHNPASSVLPGFK KVKPQVYAGLFPVSSDDYEAFRDALGKLSLNDASLFYEPETSTALGFGFR CGFLGLLHMEIIQERLEREYDLDLITTAPTVVYEVELTNGDVIYVDSPSK 
LPPLNNISEIREPIAECNMLVPQEYLGNVITLCVEKRGVQTNMVYHGNQI ALTYEIPMGEVVLDFFDRLKSTSRGYASLDYSFKRFQAADMVRVDIMING ERVDALALIVHKDNAPYRGRELVEKMKELIPRQQFDIAIQAAIGNHIIAR STVKQLRKNVLAKCYGGDVSRKKKLLQKQKEGKKRMKQLGNVEVPQEAFL AILHVGKDSK >MS0370 lepB, LepB protein MVKTVNKERLMANFFLPILLVVGFAIWKVLDHFTLPNTFSILLIILTALS GILWCYHRFVVNPKRSRQITRIEQRTGKTLSAEEKQKVEPVSEGSEFVAS IFPVLAFVLILRSFVFEPFQIPSGSMEPTLRIGDFLVVEKYAYGIKDPVF QNTLIETGKPQRGDVIVFKAPPQPNVDYIKRIVAIGGDRIRYNELDRKIT LVYGENGKPCSENCEVKEFSYSEPVENKEFQFIIGQNPDGSLMYGPSPLE TTESGDVEHKIHWYPEPISEGYRYKDYSTQDNYITEWTVPENQYFVMGDN RNNSEDSRFWGFVPEKNIVGKATYIWLSLDKKQNEWPTGIRSERIFQKIQ >MS0369 lepB, LepB protein MLMLKRLAKLVVLLSVLRGIIAYLINIVPTASMAPTFMPQDFILVNRVAY NLKIPVLGDTITPLNTAKRGDIVIFRQDGGTDEYIKRIIAVEHDHVRYDQ KSGIISVTPNYRQNNCQINHCETLLYKQQNERNYLNPETVLFSQQGEKLA LIERQEFTDETNHAILLTKVRYDQSAHYFKQDNLPLGEWIVPAGHYFVMG DFRENSIDSRFFGFIPHDNLTGKAVSVIFNPKQETRFFKTIQ >MS0599 leuA, LeuA protein MNVHNKRIRTMANNRVIIFDTTLRDGEQALKASLTVKEKLQIALALERLG VDVMEVGFPVSSAGDFESVQTIAVHVKNSVVCGLSRAVNKDIDAAAEALK VAERFRIHTFIATSALHVEAKLKRSFDDVVEMAVAAVKRARRYTDDVEFS CEDAGRTGIDNICRVVEAAINAGATTVNIPDTVGFCLPTEYGNIIHQVMN RVPNIDKAVVSVHCHNDLGMATANSLTAVLNGARQIECTINGIGERAGNT ALEEVVMSIKTRQDLFGVDTRINTQEIHRVSQMVSQICNMPIQPNKAIVG ENAFSHSSGIHQDGMLKNKNTYEIMSPETIGLKKEKLNLTARSGRAAVKG HMADMGYTEQDYDLDKLYEAFLKLADKKGQVFDYDLEALAFIDMQQGDED RLKLDVITSQTISTLPASAFVQVELDGKRINKTSNGGNGPVDAVYNAIMQ IVGMDLKMSHYNLTAKGEGAEALGQVDIVVEYQGRKFHGVGLATDIVESS ALALVHAINAIYRSQKVADLKKDLKHIHTV >MS0598 leuB, LeuB protein MSTYNVAVLPGDGIGPEVMAEAIKVLDKVQAKFGFKLNFTQYLVGGAAID AKGEPLPAETLQGCDNADAILFGSVGGPKWTHLPPDQQPERGALLPLRKH FKLFCNLRPATLYKGLEKFCPLRADIAAKGFDMVVVRELTGGIYFGQPKG RDGEGSDTRAFDTEVYYKYEIERIARAAFDAAMKRRKQVTSVDKANVLQS SILWRETVAEIAKEYPEVQVENMYIDNATMQLIKAPESFDVLLCSNIFGD IISDEAAMITGSMGMLPSASLNEEGFGLYEPAGGSAPDIAGKGIANPIAQ ILSAAMMLRYSFNLNEAATAIENAVQKVLADGHRTGDLADNSTPVSTAEM GTLIANAI >MS1105 leuB, LeuB protein MVIMTHKIAVIPGDGIGIEVINEGVKVLNCVSQLDPKIQFEFTHFPWGCE FYSKTGRMMDDDGIERLSKFDGIFLGAVGYPGVPDHISLWGLLLRIRKSF 
DQYVNVRPVKLLKGAPCPLKEKSPKDINMIFIRENSEGEYAGSGSWLYRD KPNEVVIQDGVFSRVGCERIIRYAFELARTEKKSLTSISKGNALNYSMVF WDQIFQQLSQEYPDVETHSYLVDAAAMLMITKPERFEIVVTSNLFGDILT DLGAAIAGGMGLAAGANLNPEGNFPSMFEPIHGSAPDIAGKQLANPLATV WSASQLLEFFGYKEWAARLIDAIEYLLVEQKTLTPDLGGTAKTADVGDAV VAYLQKHFA >MS0333 leuC, LeuC protein MENAMSKTLYDKHIDSHTIKELDNEGNVLLYIDRTILNEYTSPQAFSGLR EENRDVWNKKSILLNVDHVNPTRPVRDANMTDPGGTLQVNYFRENSKLFD IELFDVTDPRQGIEHVVAHEQGLALPGMVIAAGDSHTTTYGAFGAFGFGI GTSEIEHLLATQTLVYKKLKNMRVTLTGKLPFGTTAKDVIMALVAKIGAD GATNYAIEFCGEVIDELSVEGRMTICNMAVECGARGAFMAPDEKVYEYIK GTPRAPKGEMWDLAIAEWRKLKSDNDAVFDKEIHMDCSDLEPFVTWGISP DQADVISGEVPDPNLLPEGQKRKDYQAALEYMGLEPGMKFEEIKISHAFI GSCTNGRIEDLREVAKVLKGRKIAQGVRGMIIPGSTQVRARAEAEGLAKI FIDAGFEWRQSGCSMCLAMNEDVLSPGDRCASGTNRNFAGRQGAGSRTHL MSPAMVAAAAVAGHLVDVRKFVEGD >MS0596 leuC, LeuC protein MAKTLYQKLFDAHVVYEAEGETPILYINRHLIHEVTSPQAFDGLRVAGRQ VRQVSKTFGTMDHSISTQVRDVNKLEGQAKIQVLELDKNCKATGISLFDM NTKEQGIVHVMGPEQGLTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVE HVLATQTLKQARAKSMKVEVRGKVNPGITAKDIVLAIIGKTTMAGGTGHV VEFCGEAIRDLSMEGRMTVCNMAIEFGAKAGLVAPDETTFEYLKGRPHAP KGKDWDDAVAYWKTLKSDEDAQFDTVVVLEAKDIAPQVTWGTNPGQVIGI DQLVPNPAEMTDPVTKASAEKALAYIGLEPNTDLKNVPVDQVFIGSCTNS RIEDLRAAAAVMKGRKKADNVKRVLVVPGSGLVKEQAEKEGLDKIFLAAG AEWRNPGCSMCLGMNDDRLGEWERCASTSNRNFEGRQGRNGRTHLVSPAM AAAAAVFGKFVDIRNVSLN >MS0334 leuD, LeuD protein MDKFTLITAKAAPMMAANTDTDVIMPKQFLKGIDRKGLDRGVFFDLRFNL DGTPNEKFILNQADWQGSQFLVVGPNFGCGSSREHAVWGLKQLGIRALIG TSFAGIFNDNCLRNGVLTICVSDQEIEQIATTVSNPATNTISVDLEGQKV LTENGEIAFDVDPLKKEMLIKGLDAVGFTLSMKDDILAFEQSYFKANPWL KL >MS0595 leuD, LeuD protein MTRIDKMAGLKQHSGLVVPLDAANVDTDAIIPKQFLQAITRVGFGKHLFH EWRYLDAEETQPNPEFVLNFPQYQGASILLARKNLGCGSSREHAPWALAD YGFKVMIAPSFADIFYNNSLNNHMLPIKLSEQEVEEIFQWVWANPGKKID VDLEAKTVTVGEKVYHFDLDEFRRHCLLEGLDNIGLTLQHEDAIAAYESK IPAFLR >MS0338 leuS, LeuS protein MQEQYRPDLLEQEVQKYWQNNQTFKAVKDSSKEKYYCLSMFPYPSGRLHM GHVRNYTIADVVSRYQRMNGKNVLQPVGWDAFGLPAEGAAVKNKTAPAKW TYENIDYMKNQLKMLGFSYDWDREIATCKPEYYKWEQWFFTELYKKGLVY KKTSVVNWCPNDETVLANEQVHEGCCWRCDTPVEQKEIPQWFIKITDYAE 
QLLSGLDTLPEWPDMVKTMQRNWIGRSEGVEITFKIENSDETVAVYTTRP DTFYGVSYMAVAAGHPLAEKAAQNNAELARFIQECKNTKVAEAELATMEK KGMATGINAIHPITGKPVPVWVANFVLMHYGTGAVMAVPAHDQRDFEFAT KYGLPIKQVIAPMNGEEIDLTKAAFTEHGKLVNSAEFDGLDFEAAFNGIA DKLEKMGVGKRQVNYRLRDWGVSRQRYWGAPIPMLTLENGDVVPAPLQDL PIVLPEDVVMDGVKSPIKADPDWAKTSYNGQPALKETDTFDTFMESSWYY ARYTSPQYHEGMLDSDEANYWLPVDQYIGGIEHATMHLLYFRFFHKLLRD AGLVSTDEPTKKLLCQGMVLADAFYYTSPTNERIWVSPTKVMLERDEKGR ILKATDDEGHELVHAGMTKMSKSKNNGIDPQEMVEKYGADTVRLFMMFAS PAEMTLEWQESGVEGAKRFLGRLWNLVFEYNKNPVKTAPNPTALSSAQKA LRRDVHKTIAKVSDDIGRRQTFNTAIAAIMELMNKLTRAPLTDEQDRAVM GEALSAVVRMLYPITPHICFQLWKDLGNEDIIDFAPWVQADEAAMIDDEK LVVVQVNGKVRGKITVPADMAEEEIKRVALAEENVQKFLDGLNIVKVIYV PGKLLSFVAK >MS0744 lexA, LexA protein MSAFCTKKQGIYMKPIKALTARQQEVFNFLKHHIETTGMPPTRAEISREL GFRSPNAAEEYLKALARKGVVEILSGTSRGIRLLVDTEESANDEDAGLPL IGRVAAGEPILAEQHIEGTYKVDADMFKPQADFLLKVYGQSMKDIGILDG DLLAVHSTKDVRNGQVIVARIEDEVTVKRLERKGDVVYLHAENEEFKPIV VNLKEQPNFEIEGIAVGIIRNNAWM >MS0406 lgt, Lgt protein MENQFLAFPQFDPIIFSLGPISLRWYGLMYLIGFIFARWLAVKRANRPDS GWTVEQVDNLLFNGFAGVFLGGRIGYVLFYQWDLFVQEPSYLFRVWEGGM SFHGGLIGVIVAMLVTAKLQKRNFWVVADFVAPLIPFGLGMGRIGNFIND ELWGRVTDVPWAVLFPSGGYLPRHPSQLYEFVLEGIVLFCILNWFIRKPR PAGSVAGLFLLFYGLFRFIVEFFREPDAQLGLYFGQQISMGQILSTPMIL LGALFIVLAYRRRSAVKN >MS1766 lig, Lig protein MTIMDINQQIKQLRDTLRYHEYQYHVLDDPKIPDAEYDRLFHQLKALEQQ HPELITADSPTQRVGAKPLAGFAQITHELPMLSLDNAFSDEEFNAFVKRI QDRLIVLPQPLTFCCEPKLDGLAVSIFYVNGVLTQAATRGDGTTGEDITL NIRTIRNIPLQLLTDNPPARLEVRGEVFMPHEGFNRLNERALEHGEKTFA NPRNAAAGSLRQLDPKITSRRPLVFNAYSVGIAEGVELPATHYERLQWLK SVGIPVNSEVQLCDGSEKVLEFYRSMQQKRPTLGYDIDGTVLKINDIGLQ RELGFISKAPRWAIAYKFPAQEELTRLNDVEFQVGRTGAITPVAKLAPVF VAGVTVSNATLHNGDEIARLDIAIGDTVVIRRAGDVIPQIIGVLHERRPA NAQAIVFPTQCPVCGSKIVRIEGEAVARCTGGLFCDAQRKEALKHFVSRR AMDIDGVGAKLIEQLVDKELIRTPADLFKLDLITLMRLERMGEKSAQNAL DSLEKAKNTTLARFIFALGIREVGEATALNLANHFKNLDALQAASPEQLQ EVADVGEVVANRIYVFWREQHNIDAVNDLIAQGIHWETVETKEAGENPFK GKTVVLTGTLTQMGRNETKDLLQQLGAKAAGSVSAKTHFVIAGDNAGSKL TKAQELGVAVMSEAEFLAIVNAYKR >MS1826 lipA, LipA protein 
MSTAFKMERGVKYRDAAKTSIIQVKNIDPDQELLQKPSWMKIKLPANSAK IQSIKNGMRRHGLNSVCEEASCPNLHECFNHGTATFMILGAICTRRCPFC DVAHGKPLPPDPEEPKKLAETIQDMKLKYVVITSVDRDDLPDRGAGHFAE CIKEIRKINPNTQIEILVPDFRGRIEQALDKLKDNPPDVFNHNLENVPRL YRDIRPGADYQWSLKLLREFKALFPHIPTKSGLMVGLGETNEEILNVMQD LRNNGVTMLTLGQYLQPSRFHLPVARYVPPEEFDEFRTKAEVMGFEHAAC GPFVRSSYHADLQASGGLVK >MS1827 lipB, LipB protein MHFVYSLVLATYLEFYMEQKLIIRQLGIRDYQKTWHEMQEFTDNRTDKSA DEIWLVQHPSVFTQGQAGKAEHLLRSTAIPVVQSDRGGQITYHGIGQQIM YVLIDIKRLKTQGRDISVRQLVSALEQSVINTLADYGIESYAKADAPGVY IDGKKICSLGLRIRRGCSFHGLALNINMDLEPFHSINPCGYAGLEMAQLA DFVSPQEADCGKVSPKLVEHFVTILGYNKQQIFNIKE >MS0753 lldP, LldP protein MAFFLSILPIILLIYLMVKRNAWPSYVALPWIAVCVLIIHLAFFGTNIAI VSANVTASIIAVQTPITVIFGAILFNRFSEVSGVTNTLRKWLGNINPNPV AQLMIIGWAFAFMIEGASGFGTPAAIAAPILVGLGFNPIKVAVFALIMNS VPVSFGAVGTPTWFGFGPLNLNDEQILEIGSMTALIHCFAAFVIPLLGLR IMVGWQEIRKNILFIYLSVFACVVPYFLIAQFNYEFPALVGGAIGLLISV LAANRNIGLARVENNLDNNAVSFKEISKALLPTGMLIAILIITRIQQLPF KAWMNDATTWFAVRIGSLGDFEISRGLIFSLKNIFDTSVSASYKLLYVPA FIPFIVTVLIAIPLFKVSFRNATDIFGDSLKRSKNPFIALIGALIMVNLM LVGGEGSMVKTIGKTFAETTGEHWTLFSSYLGAVGAFFSGSNTVSNLTFG SVQLSTAELTGLSTTLILALQSVGGAMGNMVCINNIVAVSSVLGTNNQEG NIIKQTVLPMIIYGIIAALVALFVIPLFYNI >MS0652 lnt, Lnt protein MNNIFTYIIAILTGAAGVLAFSPFDLWGFAYVSLIGLLFVIKNPQKKTAL LSAFLWGLSFFSIGVSWLHVSIHQFGGSPLWLSYILVVVLAAYLSLYPLL FAYIVRRFNVTSLAIFPVIWTFTEFLRGWIFTGFPWLQFGYTQIDSPFYG IAPLFGVTGLTFFVMWASAVIFSGISTLIQTPKKLPVALVNALLLLSVGG LAALSSQKIFVKEVPEKALTVTLAQGNIEQNLKWDPQYLYATLDIYRKLI LEHLATSDLIVLPESALPALENQLQPFYQALQTATQEKGTEVLIGSVYHD EKSDKLLNSIVSVGNSAQPYQAGNAASAMRYSKHHLVPFGEYVPLENLLR PLGSVFNLPMSAFQSGDFIQKPLLAKGRALTPAICYEIILGSQLQQNLQP NTDFLLTVSNDAWFGDSIGPWQHLQMARMRALELGKPLIRATNTGISVFI NAQGKVISQAPQFEQTALTEKIAPTEGKTPYAALGDKPLYLLAFIFVMLR VLAIFIKRKVLKSAV >MS1452 lolA, LolA protein MQQRLAQVNYFSADFNQNVSSANGKNVQTGSGKLQIKRPNLFRMDNKAPQ ETQIISDGKTLWFYDPFVEQVTANWVENAVNNTPFVLLTSNDSANWNQYT VVQNGDTFVLKPKAKNSNIKQFDIRIDNQGVLKGFSTIEKDGQTNLYILR NITNQPLADSLFKFTVPKGAEFDDQRNKNKK >MS1534 lolB, LolB protein 
MNHLKSFFTALVAGFILTACSSLDISDTRPADVKTIDKSDIQWQQHLKQI KQIQHYSSQGQIGYISSKERFSSRFEWNYAAPTDYTLKLYSTISSTSLVM QMHNTGMTISDNKGNRRSEADAKALVREIIGMDVPLEQFAYWLKGQPDEK ADYQVGENHYLASFTYPLDGTVWSADYLNYHEEKQPALPKDILLKNANQT LKIRVDNWTF >MS1844 lon, Lon protein MSKRIQQKELPVLPLRDVVVFPFMVMPLFVGRAKSIHSLDKAMESGKQLL LVSQKQAELEDPTIDDIYNVGTIVNIIQLLKLPDGTVKVLVEGQQRANIL KLTDQDYFSATVTPIETTLGDEKELEVLRNTVLEEFDNYAKQNKKIQPEL AKALADVGDFDRFADTLAAHLPISVANKQEVLERENVTERLEYLLGTMES EADLLQVEKRIRNRVKKQMEKSQRDYYLNEQIKAIQKELGDNDNALENDE IGQLRQKIEDAKMPLEAREKVEAELQKLKMMSPMSAEATVVRSYIDWMIQ VPWAKRTKVKKDIVKAQEILDADHYGLERVKERILEYLAVQSRLNQIRGP ILCLVGPPGVGKTSLGQSIANATGRKYVRMALGGVRDEAEIRGHRKTYIG SLPGKLIQKMAKIGVKNPLFLLDEIDKMASDMRGDPASALLEVLDPEQNS HFNDHYLEVDYDLSDVMFVATSNSMNIPTPLLDRMEVIRLSGYTEDEKLN IAQRHLVQKQMERNGLKAGELAIEESAIVDIIRYYTREAGVRGLEREISK ICRKAVKNLLLNKELKSITVNADNLHDYLGVRRFDFGLADTQNRIGEVTG LAWTEVGGDLLTIETASVVGKGKLTFTGSLGDVMKESIQAAMTVVRTRAE KLGIAADFHEKRDIHIHVPDGATPKDGPSAGIAMCTALVSCLTGNPVKSE VAMTGEISLRGKVLPIGGLKEKLLAAHRGGIKTVIIPKDNVKDLEEIPEN AKNALTIIPAETIDEVLTVALENPPEGVEFIKAAPLAKVKAPKARKPVSK RSTSTVN >MS1195 lonB, LonB protein MNLSLSSEQLLSWQHLMPTLELADIPEQSISFFDLQPRANSAIQQFLQNS HRSLLVLKADDQAEYAPLLADYIQSLLPQNSQVKGVNYFIEQADSFSFAR ISVEPAQSKEDNFAAIKQVGTALYFDENQLFGSLLVHPISKDIQLNAGLV HQLNGGVLILSVAGLLARFDIWCRLKHILTTQTFDWYSMHPFKPLPCHIP SYPLELKVVLLGSREELAAFSELETELFGLGDYSELESYFSLEEPVQKVK WVQYVRTLAKQYDFPSISDKGVERLYQLYVRESEDRAVINISPLMLKNLL SKAVLVCTGTELSAVDFEKVFQITAMQHSFLRDRTYDDILHEQVFIPTEG EAIGQINGLSVIEYQGTPTSFGEPSRLSCIVQFGEGEITDVDRKSELAGN IHSKGMIIAQNCLANILELPSQLPFSASLVFEQSYAEIDGDSASLAAFCA LTSALADLPVSQSVAITGAIDQFGLVHSVGGVNEKIEGFFAICERRGLTG EQGVIIPASVIHQLSLSIEVITAVKNRRFFVWAVEDVYQASKILFQRDLT EEDKSYNGDNEPISRLIARRIEQRTDHLSRGFWGLLFGRK >MS1985 lpd, Lpd protein MTKHYDYISIGGGSGGIASINRAAGYGKKCAIIEAKHLGGTCVNVGCVPK KVMFYGAHIADAINHYAEDYGFDVSVNKFDFAKLVESRQAYIGRIHTSYG NGLSKNKVDVFNGFARFVDAKTVEVSYEDGSSEQITADHILIATGGRPSI PNVKGAEFGISSDGVFALNELPKRVAVVGAGYIAVELAGVFNSFGVETHL FVRQHAPLRNQDPLIVDTLVEVLVQDGIQLHQKAIPQEVVKNADGSLTLK 
LEDGRETVVDSLVWAIGREPATDVINLQAAGVETNDRGFIKVDKYQNTNI PGIYAVGDIIEGGIELTPVAVAAGRRLSERLFNNKPDEHLDYNLVPTVVF SHPPIGTVGLTEPKAIEKYGAENVKVYTSSFTAMYTAVTQHRQPCRMKLV CAGADEKIVGLHGIGFGVDEMIQGFAVAIKMGATKADFDNTVAIHPTGSE EFVTMR >MS1334 lpd, Lpd protein MEIFKILTALADFRKEGIPSCLPVTLAGKSACSYIGKNMSKEIKTQVVVL GAGPAGYSAAFRCADLGLETVLVERYSTLGGVCLNVGCIPSKALLHVAKV IEDAKHVEHHGIVFGEPTIDLDKVREGKNAVVGRLTGGLAGMAKMRKVTV VEGLAEFADSHTLVAKDREGNPTTIKFDNAIIAAGSRPVQLPFIPHEDPR VWDSTDALALREVPKNLLVMGGGIIGLEMGTVYSALGSQIDVVEMFDQVI PAADKDIVKIFTKRIEQKFNLLLETKVTTVEAKEDGIHVSMEAKDGKVET RVYDAVLVAIGRTPNGKLIGAEKAGIEVTDRGFINVDKQMRTNVPHIFAI GDIVGQPMLAHKGVHEGHVAAEVIAGKKHYFDPKVIPSIAYTEPEVAWVG KTEKECKAENLNYEVATFPWAASGRAIASDCADGMTKLIFDKDSHRILGG AIVGTNAGELLGEIGLAIEMGCDAEDIALTIHAHPTLHESVGLAAEVFEG SVTDLPNPKAKKK >MS1058 lplA, LplA protein MRKIIMYFIDNKEITDAGINIALETYLVENRLVNEPILLFYINSPSIIIG RNQNTIAEVNQPYLDEKNIRVVRRMSGGGAVYHDLGNLSFCFIKDDDGSI GDFAGFTRPVIEALHQLGAKNAKLEGRNDLLIDGKKFSGNAMYAKGGRMT AHGTILFDSDLEEVSKALKSRKEKIESKGIRSIRKRVTNIKPYLSLEYQH LTTRQFRDILLLKIFNVTSREQVPEYRLTEEDWQKVYALREQRFANWDWN YGRSPQFTLEYYHKFPAGLVEYKLNVEQGKIQNIRIFGDFFGLAEIAELE KALIGIKYEKQAISQIFNHFNIKQYFGNIEPEALTELLVNGIYEE >MS1288 lppC, LppC protein MTILLQRAKFKKRLMPILFPLMLAGCTNLFGSNFQDVLRNDANASSEFYM NKIEQTREVEDQQTYKLLAARVLVTENKTAQAEALLAELTKLTPEQQLDK SILDALIAAVKRDNDSASALLKTIPLAQLSQSQTSRYYEVQARIAENKTD IIEAVKARIQMDMALTDVQRKQDNIDKIWALLRSGNKTLINTTQPEGNVA LAGWLDLTKAYNDNLSQPSQLAQALQNWKTTYPNHSAAYLFPTELKSLSN FTQTQVNKIALLLPLSGNASILGSTIKSGFDDSRGADKSVQVDVIDTMAM PVTDAIALAKQNGDGMIVGPLLKDNVDVILSNPTAVQGMNVLALNSTPNA RAIDKMCYYGLAPEDEAEAAANRMWNDGVRQPIVAVPQSDLGQRTASAFN VRWQQLAASDADVRYYNQPDDAAYNLTADPAQNQAIYIVVTDSEQLMSIK GALDNSGVKAKIYTNSRNNSSNNAVEYRLAMEGVTFSDIPFFKDLDGEQY KKIEAATGGDYSLMRLYAMGADSWLLAHSFNELRQVPGFSLSGLTGKLTA GPNCNVERDLTWYSYQGGNIVPLN >MS0461 lpxA, LpxA protein MIHPSAKIHPTAIVEEGAKIGENVIIGPFCLIGADVDIGKGTVLHSHIVV KGITRIGEDNQIYQFASIGEANQDLKYNGEPTKTIIGDRNRIRESVTIHR GTVQGGGVTRIGDDNLFMINSHIAHDCIIKNRCILANNATLAGHVQLDDF VIVGGMSAIHQFVVVGAHVMLGGGSMVSQDVPPYVMAQGNHARPFGVNIE 
GLKRRGFDKPTLHAIRNVYKLIYRSDKTLDEVLPEIEQVAQKDSSISFFV EFFKRSTRGIIR >MS0422 lpxB, LpxB protein MRSIMENLIKNNPTIAIVAGEVSGDILGGGLIKALKVKYPQARFVGIAGK NMLAESCESLVDIEEIAVMGLVEILKHLPRLLKIRSDIVQKLSALKPDIF IGIDSPEFNLYVEDRLKAQGIKTIHYVSPSVWAWRQNRIYKIAKATNLVL AFLPFEKAFYDRFNVPCRFIGHTMADAIPLNPNRTEACKMLNIDENQRYV AILAGSRGSEVEFLAEPFLQTAQLLKRKYPDLKFLVPLVNEKRRRQFEQV KAKVAPELDLILLDGHGRQAMIAAQATLLASGTAALECMLCKSPMVVGYR MKAATYWLAKRLVKTAYISLPNLLADEMLVPEMIQDECTPEKLVEKLSVY LDETESAVQNRQVLIQRFTELHQLIQCDADSQAAQAVADLLEGKVNG >MS1659 lpxC, LpxC protein MIKQRTLKQSIKVTGVGLHSGNKVTLKLRPAPINTGIVYCRTDLTPPVYF PADATAVRDTMLCTALVNDQGVRISTVEHLNSALAGLGLDNVIIEVDAPE VPIMDGSASPFVYLLLDAGIEEQDAAKKFIRVKQKIRVEDGDKWAEISPY NGFRLNFTIDFNHPAISKNLSNYTLEFSAQKFVQQISRARTFAFMKDIEY LQSQGLALGGSLDNAIVLDNYRVLNEDGLRFKDELVRHKMLDAIGDLFMA GYNILGDFKAYKSGHGLNNKLLRALLANQEAWEFVTFEDKEKVPQGYAIP SQVLI >MS1922 lpxD, LpxD protein MSVYSLKELAEHIGATSRGNTDVVVDSIAPLDKAQANQLTFISNAKFRPF LAQSQAGILVVSEADIEFCSANSNLLITKNPYVAYALLAQYMDTTPKAAS DIASTAVIASSAKLGTNVSIGANAVIEDGVELGDNVVIGAGCFIGKNTKI GANTQLWANVSIYHEVQIGSDCLIQSGAVIGGDGFGYANERGQWIKIPQT GSVIIGNHVEIGACTCIDRGALDSTVIEDNVIIDNLCQIAHNVHIGTGTA VAGGVIMAGSLTVGRYCQIGGASVINGHMEICDQAIVTGMSMILRPITEP GIYSSGIPAQTNKEWRKTAALTLDIDKMNKRLKALEKKLAD >MS0933 lpxK, LpxK protein MLKMKFWYTKSWIAYLLLPFSFLFWLVSQCRRWLFQAGIIKSYRAPVPIV IVGNLSVGGNGKTPVVIWLVKALQQNGLRVGVISRGYGSQSAVYPLLVTE KTDPLEGGDEPVLIAQRTQVPVCISANRQQAIELLLQTQPCDVIVSDDGL QHYKLQRDFEIVVVDAQRGFGNGFVMPAGPLRELPSRLDSVDLVIANGKA NRYSQTVMTLAADYAVNLVTKEKRLLTEFESGSAFAGIGNPQRFFTMLQG FGIQLKQTYEFQDHQKFSAELFAKFSKNEPHFMTEKDAVKCFPFARENWW YVPVEAKITGQSAVNFIENIVERVKNGQ >MS1270 lrgB, LrgB protein MIYFYTLLTIAAFMIALLITKRIKSVLLNSFVLTVIILVAVLLAADIPYD QYMAGNAPLNNLLGVSVVALALPLYEQLHQIAVRWKAILFIVTSASLLSM FSGALLALALGASADVVATVLPKSVTTPIAMAIAQNIGGVPAVAAVGVVV AGLQGSVFGYLVLKKLQLKNSEAIGLAVGSVSHALGTVSCLEVDAKAGNY SSISLVLCGIISSLLAPLVFKLVSFCM >MS1455 lrp, Lrp protein MVNFMEKKLPKALDSIDIKILNELQRNGKISNIDLSKKVGLSPTPCLERV KRLEKQGVIMGYKALLNPELLNSPLLVIVEITLIRGKPDVFEEFNAAVQE 
LDEIQECHLVSGDFDYLLKTRVADMAAYRKLLGTTLLRLPGVNDTRTYVV MEEVKQTNFLQLK >MS0035 lrp, Lrp protein MYAIDSLDQQILRVLTKDARTPYAEMAKNFGVSPGTIHVRVEKMRQSGII EGTKVRIDERKLGYDVCCFIGIILKSAKDYDKVIKQLEGFDEVVEAYYTT GNYSIFIKVMTHTIAELHSVLATKIQLIEEIQSTETLISMQNPILRDIKP >MS1750 lspA, LspA protein MMTKSKTGLSFLWLSAVVFFIDLLTKYIVTQNFELYESVNILPIFNLTYA RNTGAAFSFLAEHGGWQKYFFIVLALAVSAVLVHLLRKNSARQKLQNSAY ALIIGGALANMADRAYNGFVVDFFDFFWREWHYPVFNVADIAICVGVGLL ILDSFKNGEKKADKQ >MS0419 luxS, LuxS protein MPLLDSFKVDHTVMKAPAVRVAKIMRTPKGDDITVFDLRFCVPNKEILSP KGIHTLEHLFAGFMREHLNGDSVEIIDISPMGCRTGFYMSLIGTPNEQQV ADAWLASMRDVLTVQDQSTIPELNIYQCGTYTEHSLADAHETARHVIEKG IAINKNEDLLLDEKLLNL >MS2084 lysA, LysA protein MDFFQYKNNKLYAEDLLVSELAEQFGTPLYIYSRATLERHWKAFDSALGD HPHLVCFAVKSNPNIAILQVMAKLGAGFDIVSQGELERVIAAGGDPHKVV FSGVAKNEKEIARALELDIRCFNVESLAELQRINEVAGKSGKIAPISLRV NPDVDAHTHPYISTGLKENKFGVSVDEAREVYRLASRLPNIKVTGMDCHI GSQLTEIQPFLDATDRLILLLEQLREDGIELEHLDLGGGLGVTYSDETPP HPSEYATALLNKLKQYTNLEIIMEPGRAISANSGILVTKVEYLKSNETHN FAIVDAGMNDMIRPALYQAYMNIIEADRTLNRESKIYDVVGPICETSDFL GKQRRLAIAPGDYLVQRSAGAYGASMSSTYNSRPLTAEVMVDGSQAHLIR RRAELTELWALESLLP >MS1613 lysC, LysC protein MANLSVAKFGGTSVANYAAMTACAKIVIADPNTRVVVLSASAGVTNLLVA LANGCEATQRAKLLAEVRQIQENILNELKDAGTVRLEIEELLTNIEYLAE AASLATSSALTDELISHGEMMSTKIFVQVLRELNAQATWVDVRTVVATNS NFGKAAPDDEQTQKNSDNVLKPLIDRGELVITQGFIGRDPNGKTTTLGRG GSDYSAALIAEVLNAKDVLIWTDVAGIYSTDPRIVPNAQRIDTMSFAEAA EMATFGAKVLHPATLLPAVRSNIPVYVGSSKAPEQGGTWVTRDPQPRPTF RAIALRRDQTLLTLSSLNMLHAQGFLANVFNILAKHKISVDTITTSEVSV ALTLDKTGSASSGAELLSSDLLNELSEVCTVKVDTGLALVALIGNDLHLS AGIAKRIFGTIEEYNIRMISYGASTNNICTLVHSAHADDVVRALHKELFE >MS1703 lysC, LysC protein MRVLKFGGTSLANPERFLQAARLIEKAHLEEQAAAVLSAPAKITNHLVAL SEKASLNQPTETNFNEALDIFYNIINGLHEKNNNFDLKGTSQLIESEFNQ LAELLEQIRQAGKVEDAVKATIDCRGEKLSIAMMKAWFEACGYEVTVINP VEKLLAYGNYLESSVDIEESAKRVDVASIPKNNVVLMAGFTAGNEKGELV LLGRNGSDYSAACLAACLNASACEIWTDVDGVFTCDPRLVPDARLLPSLS YREAMELSYFGAKVIHPRTIGPLVRSNIPCLIKNTGNPTAPGSIIDGNEP QSGELQVKGITNLDNVAMFNVSGPGMQGMVGMAARVFSTMSKAGVSVILI 
TQSSSEYSISFCVPSKLAAKAKDALNTEFAKELLDKDLEPVEVIEDLSII SVVGDGMKQAKGIAARFFSALSQANISIVAIAQGSSERSISAVVAQNKAI EAVKSTHQALFNNKKSVDMFLVGVGGVGGELIEQIKQQKEYLAKKDIEIR VCALANSNKMLLNENGLSLDNWKEDLSNATQPSDFDVLLSFIKLHHVVNP VFVDCTTAESVSGLYARALSEGFHVVTPNKKANTREMAYYNLVRENARKN QRKFLYDTNVGAGLPVIENLQNLLAAGDEVERFNGILSGSLSFIFGKLEE GLTLSQATALAREKGFTEPDPRDDLSGQDVARKLLILARESGLELELSDV EVESVLPKGFSEGKSAVEFMEILPQLDAEFAARVEKAGAQNKVLRYVGQI NDGKCKVSIVEVDADDPLYKVKNGENALAFYTRYYQPIPLLLRGYGAGNA VTAAGIFADILRTLRN >MS0763 lysR, LysR protein MKPIFLELRHLKTLLALKETGSVSLAAKRVYLTQSALSHQIKLLEEQYGL PLFERKSNPLRFTAAGDRLLQLANDILPKVVAAERDLSRVKQGEAGELRI AVECHTCFDWLMPAMDSFRQHWPLVELDIVSGFHTDTVGLLLTHRADWAV VSEVEETDGIVHKPLFSYEMVGLCAKDHPLAHKEIWEAEDFADQTWITYP VPDDMLDLLRQVLKPAGINPVRRTSELTIAIIQLVASKRGVAALPFWAAK PYLDRGYVVARKITQNGLYSNLYAAYREEDANSAYLEDFYETVKSQSFST LPGLSVLE >MS1097 lysR, LysR protein MDKLNAISVFCRIIESQSFTQAAALENISVAMASKLVAQLEEHLKTRLLQ RTTRKIVPTEAGLVYYQRCQPILLELKEADSSISDLSTSLQGNLVVSVPM DFGLKFITPTLPAFISANPNLHVEMEFSDRRVDLMAEGYDLALRIGSLQD STLVAKKLATTSMHFAASAEYLRRYGTPRKPEDLQYHQCLLYKAIGNQIY WEFANKGKIQRVKMRSKMVCNNGLTLVQLAKADLGIINSPRFLVEEELAS GELIEVLPEFKQQLLDIHAVYPHRRHLAAKVKAFVEFLSGLNLGSET >MS0154 lysR, LysR protein MNIRDLEYLAALAEYKHFRRAADACHVSQPTLSGQIRKLEDELGITLLER TSRKVLFTQSGLILVEQAKKVLREVKLLKEMASNQGKEMTGPLHLGVIPT VGPYLLPYIMPALKEAFPDLELYLYEAQTSHLLDQLESGRLDCAILATVP ETEPFIEVPIFNERMLLAVSEQHPWAKEKSIKMHALQGHEVLMLDDGHCL RDQALGYCFTAGARENSHFQATSLETLRNMIAANAGMTLMPELAMLNEGT RAGVKYIPCTDPEPKRTIALVYRPGSPLRSRYERVANAVGDAVKAILHTE GD >MS2152 lysR, LysR protein MNDKFSGIEEFLMTVEMGSFSAAAERLNLTGSAVGKSISRLEQRLNTQLF HRSTRKITLTREGEVWLASCRRMMEELEQAKLLLSSQSQQIIGEIRIDLP TTYGRSHILPKLLAIQADYPKLYLNISFQDRKVDMIAEHIDIAVRFGELA DLTDIIAKQIDCFQNQLCATPAFVSKWGKLNHPDDLTHFPCIVGNQISWR LMNEQGKSTGFPLNVQHQINDGDARLQAVLADCGIAFLPDWLIQPAVEAG KLVQLLPEFTPPPEPIYVLWQKKLHLQPKVKAIVNSLV >MS1403 lysR, LysR protein MRELRNLDLNLLKAFDVLMDEKSVSKAAQRLSVTQPAMSGILQRLRDSFN DPLFVRVQRGIVPTNRALELRQPIKQLLQSAEQLLQPKIFDPQTAELTLT IACTDYALRAVISPFLAVLKQRAPKIKVAILAINEQNLQSQLEQGVVDFG 
LVTPDFSAPDIHSKDLYQEQYVCALRKDHPVAQQGSISLEQFCRLEQALV SYQGGSFSGATDKALAKLGLTRNVTVSVQNFIVMPEFLANSDLLAVVPKR LVENLANIHYFEPPLQIDGFTKTLVWHERTHRDPAYRWLRELMAEVC >MS0336 lysR, LysR protein MLKDKKTWPLIEDLNVFLTIIRKNSFSGAAKELGQSNSYITKRINILEDH LHTSLFYRNTRNIKLTAAGEYVQNQAIAIIDKMDSLMTNIVEDKKSMFGH LHICSSFGFGRTHLAKPISLFAKQHPNLSLDLTLTDHKLDLIKENIDLEI AVGNDLNDRYFAKKLANNRRILCASPDYLQSYGLPKKVEQLSKHNCLFLK EKNSSFGVWKLFNGKILKSITVNGGLTTNNGEVILQWALEGHGIIYRSLW DAEKYLISGELVHILPEYYEDAPIWVVYPNKLSESLKTEIFVNFLTEYFA KKELTKSHDE >MS0044 lysR, LysR protein MRKKPMEFNELKLFLHLAESQNFSRSAAQNHMSTSTLSRQIQRMEDELGE PLFLRDNRRVQLTECGEKFKIFAQQSWNQWQHFKQQIHHNENELNGELKV FCSVTAAYSHLPQVLEKFRLRYPKVEIKLMTGDPALALHQVQSQQVDLSL SGRPLHLPNSIKFHYIDDISLSLIAPRIACPATQLLQHSPIDWQRIPFIL PVEGPARQRIDQWFRQQKIKHPKIYATVAGHEGIVSMVALGCGLALLPDV VIKNSPMNSQVSSLTLDIPVYPFELGVCVQKKSLELPLIKAFWDSLQTEN AG >MS2008 lysR, LysR protein MHITHCLRTLKALKLQNNVHLCTIKPKEQRRETMNLDWSDIHYFVLMVEK QTLKATAEALQVEHSTVSRRIERLEKQLNVHLFDRINKRYLLTADGQRLY TEAKKLQFNVRQFVQAAQDSLQEMTNVLVSMPPMIAHALVSPHLAAFQQR FPAIRLVLSSNTAISSLHQRQADIALRLVVPQQNDLVVRRLRDMQYGWFA HADYVKNTPESQWQYIDFGVTGPHTPWLNKQLADKSIGFVCNDFAVMQSA VMQKLGIGWLPFEYGNSSEFIQVHTSEIFIGQLHLVMHEDVRHAQKVRDV ADFLIEILRE >MS0884 lysR, LysR protein MMLDKVEAVRYFCIAAETLHFRETANRLAISPQVVTRMIAELERELGEPL FKRNTRNISLTDFGQAFLADAQQWLKATETLFQTDFKESMSGTVRITLPR LPNNDVILTELLTALSPYPDLHIDWRPDTALYNSITRQIDIGIRISLEME PHFIAKKITHIKERIVASTALLNRLGQPRDLDDLQNRFPLCAEINPQTGK AWHWFNTAEQSFVAKKPYFMSSESYSNLAVILKGLAIGVLPDYYYLPHVQ TGKLKILFPDLPIPEWKMFLYRPYQENTPLRVIHVFGLLEKILVKHYHTT G >MS2176 lysR, LysR protein MQFYSITVPKIAFSYLFHKMKTAKSFIPENEIMNRLDALKYFIVAAETLS FKSTASRFSVSPQVITRVISELEGELGEQLFKRNTRAIRITDFGSRFLAD AIAFLQQEERLFGGVKTAEESLSGLVRITLPPSDYADKILLRLLTALAPY PDIQIDWRTDFDTLKAVDDQIDIGIRISRTPEDHWVAKKITDLQEPIVAA PSLIAKTGLPKDVFDLAANFPVGYILNPKTGKVWDWMMGEQPIILTKPTV ITSDIKSLLPAVLSGRIFAPIMYHDCKSYLDSGELQVVFSNEETLIWGIY LYRPYQTITPKRVLLVFELLEKILEEGF >MS1210 lysR, LysR protein MMNYAAMLHNLPNLNELYFFVQIANAGSFTKAAERLGVTTSALSQNMRSL EKHLDVRLFNRTTRSISTTEAGEKLLAEIAPHFLAIADAVRHLDEIRDEP 
QGTIRINTSEIAANLIIYPKLQPFLLANPHIKVELVIDNRWVDIVAQGFD MGVRLGYAVFNDMIAVQISEPMKMVLVASPGYLKDKPLPKKINDLTNYHL IGSRFSSEHSQLEWEFMDKGQKVGFQPMPQFSINNDLRTQAALDGFGIAW LPEIRVHEELKNGNLVEILPQYAYTYDPFYIYYPNRKGNSKAFQMVVELL KFKK >MS1415 lysR, LysR protein MHSSIYGYLTVFHTIAAEGSIAGAARKLQMASPSISQSLKLLEQHIGLPL FNRTTRKMELTEAGHHLLASTQDAIAQLSVAVESVQDLSGVPKGVVRMTV PHVGYWLIIEPHLAEFCERYPDIQLEISINDGTVDILKEGFDLGIRFGDK VDEQMVAKKLTAPFRLGLYASSAYQQQFGLPKKIAELKNHRLVGFRFATS NRIFPLSLNDKGEEVSVEMPTPIVANSLIVAKDVIKSGIALGRFFEPLMS KQADRAAFIPVLEKHWKTFGALYLYYMQHSQKAGRVRAVIEFFTEKAQVE KK >MS2130 lysR, LysR protein MLNKFDALRYFCVAAETLNFRETANRLSVSPSVITRVVNELEAELGEQLF KRHTRSIKLTSFGEQFLLRAQHLLAESETLFKMGKNQADDLAGIVRITVP SWRNNDEIIRQLLITLESYPEIIIDWREDMGKLDMVEDRIDMGLRIGLEP DQDFVVRKITEIGDVLVASPALVKKLGQPTDLTDFERRYPMAIPINSNTG KPWTLFLNEDITLNPKNPAFYSVDNYSALQAVLLGKCAGLINDFMVKPYL EFGELIQLFPEIQIDKWQLFLYRPYQTVTPARVLKVFDLLTEILRKTYY >MS1395 lysR, LysR protein MKENLNDLRAFLVVARTGSFTKAGAQMGVSQSALSHSIRGIEERLNIKLF HRTTRSISTTEAGEQLYQRLSPLFDDIDNELNELSEFRNAVTGTLRINGN EHAFYYALGDKFVRFSQKYPEVNLELVAENRFIDIVAERFDAGIRLGSDV AKDMIAVRLTDKLPMCCVASPEYLANYGTPKTPYDLTEHQCLLHRLSNGG VMNWEFIDPKSKGRILKVQPQGTISANGGRVLENYARSGLGILWCPLDMV EEDIRSGKLIRILQQWDMDYDGYHLYYPNRRQNSPLFKALVEELRLVK >MS2143 lysR, LysR protein MPEMKKTDRFNHLISFTHAARFGSFSAAAEALDLTPAAVSKNVALLEQAL NVRLFNRTTRSLSLTEEGQVFYAESKKALALLEEAVNQITLAESQEIAGN VRISMPNVVGRNLVFPLLKSFNEDYPKIHLELDFDNKAIDFVKAGFDFVL RVGESSEGSLVARHIGMIQTCLVASPAYLKSQGVPKNMADLPQHQLLMTR LPNGKLQPWTFNEQGDNVHFLHAQPHLVLTDAEMQTQAAVQGFGITQLPV YLALPYLQNGELVTILNDSYQPLKLSLNILFPHRTLLAQRVRTTMDYLLE QLKQHEGLRMTQEELKAFSFK >MS1039 lysR, LysR protein MKIQQLRYIVEIVNQNLNVTEAANALYTSQPGISKQVRLLEDELGLEIFE RNGKHIKTVTPAGKKIVAIARELLVKTQAIKAVANEFTQPNHGVLRIATS NTQARYMLPAVIERFSKQYPNVSLHVHQGSPNQLYDALLSSEVDLAITTE AQYLFDDVVLLPCYMWNRSIIVKADHPLAKLSHVTIEDLGKYPLITYTFG FTGVSDLDQAFNSAGILPNIVFTATDADVIKTYVRLGLGVGIIASMAHTD ADTDLIRIDASHLFKSSMTQIAFKHSTFLRNYMYDFINYFSPHLTRAKVE KAERARDNTAVQKLFEGIDLEVR >MS0895 lysR, LysR protein MERKMFKRLPPLNSLKAFESAARFLSFTKAADELCVTQAAVSHQIKLLED 
FLNIRLFIRKNRSLELTELGKNYFQEISPILQKLADVTEKLKSTDNPHLT ISVLQSFGINWLVPRLNRFNQLYPNIEVRIKSAEQDEGILGNDIDVAIYY GYGNWDNLKTEKLSEDNLLILASPKLLANNPVNSKDDLKHHTLIHVHTRD NWQNMATELGISDLNIHIGPLFSHTFMALQAAVHGQGIVLANSILAQQEI DNGNLQVVLPYELKDPKSFYVVSDTNRTNDQNISAFRQWIMQEMKYN >MS2151 lysR, LysR protein MNSTEYGQLLIFQAIAKEGSISACARALRISVPAVSKALRQLENRLGVPL FQRSTRKIQLTETGVQLLEQTVQAVDTLSQAFENAKTLAKTPTGTVRITV SQVAFSLILQPVYAEFRERYPHIVLDISINNATVNLIDEQFDLGIRFGNH LEEGIVARRLTGEIREGLFISPQYAQKFGTPKTLADLAHHQLIGYRFITA NRFHPLTLMENGQPHTIEMPMSLILNDSEMAIDAIRQGFGIGRIFEAQYE RLESKIDLLPVLKKHWQTLQPMYLYYQPKSQKVKRVQVLIEFLQEKMEVL GW >MS2134 lysR, LysR protein MQKWKDNMKEISLDDMRLFVSVVQSGSLSHAGELTGIPVSRLSRRLTQLE QALGTQLLNRGKKGVSLNELGERFFEHSQQMLQQAELAIESVQKSLENPS GLLRISVAADIFYLFIQPYLATYLNENPQVNLEINLSNQKINMIQDGVDL AIRTGVIDNENVVARLWKKMEFGVFASQAYLAKYSEPQSPNDLYQHHIIS QMYTLPWRFQQGNQEVAVFPHSRLTCNDFAIVEQQLKQHSGIGILPITKN HNRSDLIRILADWQLQSVPVSLIYYRNRGAIATVRSFVEFLQRLV >MS2092 lysR, LysR protein MNKLDALKFFITAAETLNFREAAVKLAISPSVVTRTIAELENQLGEPLFK RSTRSITLTSFGELFLPKAKRLLEDSDTLFQTAKDDNEMKGVVRITLFRL PNHEQILFELLTALRPYPELFIDWRLDMMRLDTVEHRIDIGIRVGREPNP NFIIKPIAKVQHIFVAAPDLLERLGAPKDFEDLRQRYPFSGLINPETGKV WEFMLDGVNTFLPRHLEFFSTDPDTQIQAALAGRAVVQASDLACKEYLAN GRLVKVLPQIQQEKWQLYLYRPYQTITPKRVMKVFEVLEGVLRKYLG >MS1689 lysR, LysR protein MQSSIYGYLTYFHEIVIEGSIAGAARKLEVAPPAVSNALKLLERHLGLPL FTRTTRKMELTEAGQRLFESTKDMLRGLDSVMESVRDLTEKPSGLVRITT SIISYLLVIRPHFAEFCERYPDIRLEISVNDGIVDIVKEGFDVGMRFGDR LEQNVVAKKLLDPVRLGLYASESYLRKYGKPETLEDLSQHKLLGYRFVTA NRTYPLTFNQDGREISIDMPYSVLTNNLTVELDTVRQGVALGQLFEPVVN ALHDRKNFIPVLDAHWTQYPALYLFYMQHSQKAGKVRALIDFLEEKIKG >MS2116 lysR, LysR protein MNTKNTSVYALKLFLQVLELGSLSEVARRENLSASMLSRLIKQLEDDWGA ALFYRNTRAITPTETGLLLAEYARQIVSQFQAAEQAITAQTAEIAGTVRI NAPVFFGQLHIIPHLAELQARYPNLIVNLVQTDDYIDPFTDSTDIIFRLA PLNDSSLKVRILAQQHFCLAASPSYLQKYGTPKIPADLAKHHALLYKGKT GTLRWLLQEGENWQACSPKIALTSNNGNAIATACVQGMGIALLANWAASD LLKEGKVVRLLPEYNFSTQTVPVYVAMLYPQTAFISPSVRAVLDYFREIF QDKSW >MS2006 lysR, LysR protein MNLDWNDLHYFVLLVEKETLTAAANALDVEHGTVSRRIERLEKQLGLHLF 
NRINKRYLLTDDGRDLYAEAKKLQLNIKQFAQTAQDKCQSMGEVTVSAPP FVANSLITPLLAHFYRRFRHIRLILNSDSGLSNLHRSQADIALRIAQPKQ DDLVAHRLMNVEYRWFAHRDYLACTPESERQFLSLNLTGTHQQWLQTQLT GKSVRFACNDFNIMKSAVLQQLGVGLLPVCYIDSPDLAAVKNMEYFRAPL YLVMHEDVRQSQKVRMAADFLIENLRD >MS1543 lysU, LysU protein MSEQQNAELDFHGEMAVRREKLAALRAKGNAFPNTFRRDALAQDLHNQYD ETDGEQLKEKDLHVAVAGRIMTRRTMGKATFITIQDMSGKIQLYVARDNL PEGVYGEDVKSWDLGDIVGIKGTLFKTKTNELTVKAHEVQLLTKALRPLP DKFHGLSDQETRYRQRYLDLISNEESRRTFVIRSKVIAGIREYFIGKGFI EVETPMLQVIPGGAAARPFVTHHNALDIDMYLRIAPELYLKRLVVGGFER VFELNRNFRNEGVSVRHNPEFTMIEYYQAYADYHDLMDNTEELLRKLALD ILGTTIVPYGEYEFDFGKPFERITMHDAVIKYGAEKGIVKEDLYDLDRAK AAAAKLGIEIQKSWGLGSIVNAIFEEVAEHHLIQPTFLMAHPAEISPLAR RNDENPEVTDRFELFIGGREIGNGFSELNDAEDQAERFDAQVAAKDAGDD EAMFKDDDFVTALEHGLPPTAGEGLGIDRLAMLFANAPSIRDVILFPAMK HKG >MS1749 lytB, LytB protein MKIILANPRGFCAGVDRAISIVELALEIHGAPIYVRHEVVHNRFVVNGLR ERGAVFVEELNEVPDGAIVIFSAHGVSQAVRQEAKNRNLKVFDATCPLVT KVHMQVARASRKGTKAILIGHEGHPEVQGTMGQYDNPEGGIFLVENVEDI AKLGLKDNEELTFMTQTTLSIDDTSDVIVALKAKYPAIQGPRKNDICYAT TNRQQAVRELAEQSDLVIVVGSKNSSNSNRLAELASRMGVPAKLIDDSND IEPDWLKGINTIGVTAGASAPEVLVQSVIARLKELGVDSVEELEGCEENT VFEVPKELRIKEVG >MS0924 mET2, MET2 protein MDSSMSAQQVTLFTEQPLDLIFGGRLGQIDVAYQTYGTLNEDKSNAVLIC HALTGDAEPYLSPVENQAGGWWQSFMGEGLALDTSRYFFICSNVLGGCKG TTGPASINPKTNKPYGSQFPKVTVQDIVRLQKALISHLNIPHLHAVIGGS FGGMQATQWAIYYPDFVDKVVNLCSSLTFSAEAIGFNHVMRQAIINDPNF NNGDYYEGEPPENGLSIARMLGMLTYRTDLQLAKAFGRATKNEGHYWGDY FQVESYLSYQGQKFLGRFDANSYLHLLRALDIYDPSIGFDNIKEALSRIK AHYTLVAVTNDQLFKLTDLHKSKTLLEQAGVPLDYYEFPSDYGHDAFLVD YDTFEPKIRSGLE >MS0553 mHT1, MHT1 protein MPITILDGGMSRELMRRNAPFRQPEWSACALYEEPSAVQAVHEDFIAHGA EVITTDSYAVVPFHIGEQRFHTDGKTLADLAGRLAKSAVKNSGVLTTKIA GSLPPMFGSYRADLIQPERFAEIAQPLIDGLSPYVDIWLCETQSAIIEPV SIKALLPKDDRPFWVSFTLTDDELTCEPQLRSGETVKSAVEKMVDLGVDA ILFNCCQPEVIGEALAVTTATLTALNATHIQTGAYANAFAPQPKDATAND GLDEVRKDLDPPAYLAWAKKWTAQGASIIGGCCGIGVEYIETLAKNLK >MS1293 mMT1, MMT1 protein MGFNVVSRSRQIINVSFISIFTNIILVAVKVTIGFFTNSLAVMLDALNNL SDSLSSLVTIVGTKLATRAPDKKHPYGYGRIEYITAIVIGAIIFLAGATA 
LKESIAKIITPEETNYNITAIVIISIGIVVKYGLGRFVKNSGEELGSQAL IASGTEALFDSVLAIGTLFCAILSFFWNITIDAFLGAVIALGIMKSGFDR LKETLDNIIGVRADEALTSKIKCHIRRYQGVVGAYDLILHNYGPTEIIGS VHIEVPDDMTAREIHRLTRSIKADIMRDFSIEMTIGVYAANDSEPYVAKL KADLLRILHSYQEILEFHAFYVDRELKQVTFDLIFDFETKNVESLKEEII QRIKKDYSEFEFSIVVDPDFSSTGLVTACA >MS0376 mMT1, MMT1 protein MSKQYSTLVKRASLLAVFTAVTLIVVKAFAWWQTGSVSMLASITDSTLDL LASFMSLLILRFALMPADHNHSFGHGKAESLASLAQGAFIIGSALLLLLH AFQRLGEPKVIQQTGLGITVTMFSILLTFILVAYQNKVIKLTDSPAIKAD QLHYQTDLLMNAAIMLSLLLGSLDFIWADAVFAILIAVYILVNGGKMCFD AVQLLLDLALPEQEIEQIERLIREDPNIIGFHDLRTRRAGEVRFIQMHLE LSDDLSFVQAHAITDSLETRLKQAFPRVEIVIHHEPTSVVLAEQKAK >MS2068 malE, MalE protein MKNKCVKLTLTAIAGLVLSTSVMAKMTEGKLVIWINGDKGYNGLAEVGKK FEKDTGVQVLVEHPDRLEEKFAQVASTGDGPDIMFWAHDRFGGYAQAGLL SEVSVSKEFKDKFVDFAWDAETYNGKIIGYPVAIEAISLIYNKDLVKEAP KSWEEILELDKKLKKEGKNAIMWNLSEPYFTWPVAASNGAYAFKYKDGKY DVKDIGVNNEGAVKALQFVVDMVKNKNISADMDYAVAEASFNKGQTALTI NGPWSWGNIDKSGVKYGVAVLPTLNGQASKPFVGVLSAGVNSASPNKDLA KEFLENYLLTDEGLDTVNKDKPLGAVALKSYQEKLAADPRIAATMENAKN GEIMPNIPQMTSFWYAEKSAINNAVTGRQTVKAALDDAHARIQKQQ >MS2069 malF, MalF protein MSTLTQPKSTHWFKYLIAGIVLLFDFYLVGLMYLQGEYLFAILTLVILTS GVYVFTNKNAYAWRYVYPGIMGMTIFILFPLVATIAIAFTNYSGSNQLSF ERALSVLTEQRYFAGDKYNFKLYPQADNQYKIVLTNPATAQTFVSESIAL KAADVPVSEQAEPTGEIAPLRIITQNRSALQAMKVILPNDNELTMSSLRQ FSEQKARYQFDKENNILRNNENGKLYKANDETGFFQAVNESGDWLSETLE PGYTVGSGFHNFVKIFTDKGIQKPFVQIFIWTVMFSLLTVVFTVILGMVL ACLVQWEALKGKAIYRLLLILPYAVPSFISILIFKGLFNQSFGEINMILN QLFGISPEWFNDPFLAKAMILIVNTWLGYPYMMILCMGLLKAIPSDLYEA SAMDGASTWQNFSKITFPLLLKPLTPLMIASFAFNFNNFVLIQLLTNGRP DMIGTTTPAGYTDLLVSYTYRIAFEGSGTQDFGLAAAIATIIFLLVGGLA LLNIKATKMEL >MS2070 malG, MalG protein MAIVQSKSVRYRVWATHLILISFLALIIFPLLMVIGISLRPGNLAIGDII PSQISWEHWQAALGFEVTHADGTVTPPPFPVLRWLWNSIKVATITSIGIV TLSTTCAYAFARMKFKGKKTILQGMLIFQMFPAVLSLVALYALFDRLGQY VPFLGLNTHGGVIFAYLGGIALHVWTIKGYFETIDGSLEEAAALDGATPW QAFRLILLPLSVPILAVVFILSFIAAITEVPVASLLLRDVNSYTLAVGMQ QYLYPQNYLWGDFAAAAVLSAIPITLVFLLAQRWLIGGLTAGGVKG >MS1587 malK, MalK protein 
MLSHHKNKNGGAYPTLYRQYNIMTNQNDNFLVLKNINKTFGKSVVIDDLD LVIKRGTMVTLLGPSGCGKTTILRLVAGLENPTSGQIFIDGEDVTKSSIQ NRDICIVFQSYALFPHMSIGDNVGYGLRMQNIAKEERKQRIREALELVDL AGFEDRFVDQISGGQQQRVALARALVLKPKVLLFDEPLSNLDANLRRSMR EKIRELQQSLSITSLYVTHDQTEAFAVSDEVIVMNKGKIVQKAPAKELYQ QPNSLFLANFMGESSIFNGQLQGNQVTLNGYQFTLPNAQQFNLPNGDCLV GIRPEAVTLKETGEPSQQCSIKTAVYMGNHWEIVADWAGQDLLINANPEV FNPEQKQAYVHLSSHGVFLLKKE >MS0812 malK, MalK protein MENIVQSKPIIELRSLKKSYNENTIIDNFNLTINNGEFLTILGPSGCGKT TVLRLIAGFEEANGGQIILDGEDVTDLPAEHRPVNTVFQSYALFPHMTIF ENVAFGLRMQKVPNEEIKPRVLEALRMVQLEEMADRKPTQLSGGQQQRIA IARAVVNKPKVLLLDESLSALDYKLRKQMQNELKALQRKLGITFIFVTHD QEEALTMSDRIIVLRKGNIEQDGSPREIYEEPSNLFVAKFIGEINIFDAQ VLNRVDEKRVRANVEGRVCDIYTDLAVKEGQKLKVLLRPEDVQLEELDEN EQSSAIIGHIRERNYKGMTLESTVELEHNNKLVLVSEFFNEDDPNIDHSL DQRVGVTWIEKWEVVLNDENDNA >MS0584 malK, MalK protein MEQTDMAKLEIKNITKKFGDFYAANNISFTAEEGEFVTLLGPSGCGKTSL LKLIAGFHIADEGEILIGGKNVNEIPPEKRNTAMCFQSYALFPHLNVSHN ICYGLKQRKIDINEQKQRLDLAIKQMDLEIHRLKLPNELSGGQQQRVALA RAMVTRPDVILFDEPLSNLDAKLRESVRFEIKQLSKQYNLTSIYVTHDQA EALSMSDKIIVLNKGKIEQIGSPQEIYHHPINRFVADFIGIANITEAHVK EMENNLYEVNSIYGNFTVYSEIKPQSDHIYICFRPEDIEIVPASENKENM LTVDVTHTAFMGNITEIQALIRKDDKEQKLRLQLTKFPQLTENYQLSFCV PRDAIKFLESVK >MS1524 malK, MalK protein MIKLERVYFNYKTMPMNFNIHIKPQERVAIIGASGAGKSTLLNLIAGFER ADDGEIWLNGVNHTYTEPYERPVSMLFQENNLFTHLTVEQNIALGLKPDL KLSAAEQSLVRQTASAVGLSRFLDRKPTALSGGQKQRVALARCLLRDKPI LLLDEPFSALDPALRAEMLDLLSQLCNEKKLTLLIVTHQPSELQGRIDRI LTVENGHFAKNNDLK >MS2067 malK, MalK protein MANVSLRNVGKSYGDVHISKDINLEINEGEFVVFVGPSGCGKSTLLRMIA GLEDITTGELYIGEKLMNDVEPSKRGIGMVFQSYALYPHLDVADNMSFGL KLAGVKKNERDQRVNQVAEILQLAHLLDRKPKALSGGQRQRVAIGRTLVS QPEVFLLDEPLSNLDAALRVQMRVEISKLHKKLNRTMIYVTHDQVEAMTL ADKIVVLNAGGVAQVGKPLELYHYPANRFVAGFIGSPKMNFLPVKVTAVE ENQVKIELPDANHHNFWIPVSGEGVNIGENLSLGIRPEHLVPAEQAQVSL RGIVQVVELLGNETQIHLEIPEIKQPSLIYRQNDVILVNEGDEMNIGIVP ERCHLFKEDGTACQRLFAEKGV >MS2074 malQ, MalQ protein MTITTKQFRRAGIMPYFFDERGVKKWAPHNIKKALFNTFEGNIQASSTPI PAVKIFYQNRPHFLPINSADKKHPLKGRWQLQLENTQTVISGPIKTRGIN 
LPKDLPLGYHQLQLQSAGKVFNCTVIVAPQSCYQPQALREHKKLWGTFLQ LYTLKSEQNWGIGDFGDLKKFLQNLAPFQADFLGLNPIHALFPANPDSAS PYSPSSRQWLNIAYIDVNQLAEFQQSDEAQAWFNSAEVQQKLTELRQAEW LNYGEIIPLKLKGLRFAFKKFQQNPTALSSRQFAQFVQQGGESLQVQATF DALHQYLADRFDNQWGWDFWAKEYQDYHSLAVQQFRAEHQAEIEFYAWLQ FIADQQLAECDEVCRQQNMRIGMYRDLAVGVTGNGAETWNDKRLYCLNAS VGAPPDVLGPQGQNWGLTPMNPHVLQQQAYAPFIELLRANMKHCGALRID HIMSLLRLWWIPKGDSAVNGAYVRYPVDDLIAILALESQRHQCLIIGEDL GTVPKEIVGKLKNAGILSYKIFYFEFDEHGQSRDLQTYPYQAMTTLSTHD LPTINGYWRGYDFELGEKFGVYPNPKILDILQRDRVRAKTQILQRLRQHH VPVEAKISAELGSSVSNKFVHQLQTYVAQVSSGLFGFQPEDWLGMTEPVN IPGTSTQYANWRRRLTANVEDIFADTDIQHLLKEVNAIRKE >MS1124 malQ, MalQ protein MSELQQAAQLGIALSYYDIEGRLIQAKEETLQYFTALFQPSPDGKNKSTK QFHDVFVMNAQERAIYEFQRLALSPSACEYQLFDEQNKLCGVQTLSDPKS LSLPPLEAGYYLLKLKCNDAEYRIRLLVQPSTAYQPPLLERKKAWGLNVQ LYSLRSTRNWGIGDFADLRNLIKSAVKFGADFIGINPLHLVYPAVPEWAS PYSSSSRRWLNCIYLDIESLPEYTLSKLAKKWRVEHNEQIEQLRKAELVD YATVNQLKQSALALLFDFFNRSKSAQIAARRTEFNVFVENQGEALLYQGL FNVLDSIEHVDLPENENQIGWLGWRKEWQHLTSKKRKALLKEHQKQVYFY AWQQWLAEQQLAEAESLCLTEGMQLGIYGDLAVNSSRGSSDVWSDQKLYC INASVGAPPDPLGPVGQNWNLPPYNPNQLKRRGFQPVIDMLRANMRHFGV LRIDHVMGLFRLWLIPEGKTAADGVYVHYPFNELMAILAIESQRNQCLII GEDLGTVPDEVRSKLKEFQILSYFVVYFSNQNGEFPQGKDFPVNAFATIG THDVPSLAGFWHCRDLALFAQLDVLKDDLLKAKYDQRLTDKQALLDRLRL DGYLPADYQGDALNMAMHDNLNLVIHRYLAESASQLVGVQLENLLNQEVS FNLPGTSTEYPNWRKKLAVNLDDIFNDERIIALLKTINYARSQPKS >MS2072 malT, MalT protein MLIPSKLVCSFRLQNSVPRTRLIQELDKSAFYPVVLINAPAGYGKTTLVS QWIEDKKNVGWYGLDEGDNNSDRFAVYFSAALHSAINEEVDVLLEENRKA NLLALFNQLLIKASGFPQHFYLVIDDYHLIENDEIHEALKYWIRHQPANM TLILISRSVPPLSVASLRVQEQLLEIDINQLMFDHQESVAFFQARLGSEL KQQDIIELCNEVEGWPTALQLISLFAKNKSQTLQVPLQDIAKRLAKSNNF HINEYLADEVLNKVDKSTRLFILRCSVLHSMNETLVEAVTGEPNSRKKLE SLEKQGLFLQQMANSKWQTVDDSWWKFHPLFASFLNFCCQHELYDELSQL HRRAAQAWLKLGYVTEALHHAMQLSDTCLLLEILDEHAWTVFHQGELQLL EESLNSLDYAHLTEHTNLVLLKAWLVQSQHRHVEVSGILAEFSRALNENK VELSKTAQAEFNVLRAQVAINSGDENTALQLASDALKDLSENAYYAHIVA TSIIGEAHHCHGNLAEALSMLQKAERMARQHHTYHNILWSLLQQSEILLA QGFSQAAYDMLDKASEFVKENHLQKVPMYEFLLRLKGKILWEWYNLDKAE 
SMAVAGMNALQKFEDKLQCLALLTKISLVRGNLDNTSRLLNEVEQLERSH AYHHDWTASADQVRMFYWQMTNDVAAARNWLIQNPAPISDKNHFTQIQWR NIARARILLGQYDKAQEILDNLIETAEKFSLTSDLNRALIVRNRLYFLQG AKELAQQDLIAALKLTRQTNFISAFVVEGDVMAQQIRNLLQLNVLDELVL HKAQFILRNINQFYRHKFAHFDETFVSQLLKNPKVPELLKISPLTQREWQ VLGLIYSGYSNEQISDELQVAATTIKTHIRNLYQKIGVTNRNEAISYTKE LLALMGYN >MS0614 manA, ManA protein MTGIYKLTGSLQHYVWGGHDYLPELLHIKKEPNQYYAEWWLGAHSSSPST IEVEEKQLSLIDFIRQHPEVLGSQSRALFGDELPYLLKILDVKKPLSIQL HPTKKQAEIGFAEENSKGIDLKDAKRTYKDNNHKPEMMIALSDFWLLHGF KTKEKIIETLKNRPTLTALLSKLETQDMHGFYADIMQASQTELANWLLPI IESNKIAYEKGELSLENPDYWVLYAMEAMEISPEKLDSGLICFYLFNIVH TKTGEGIYQDAGIPHAYLRGQNIELMACSDNVIRGGLTPKHVDIPALLEV IDCREVVPEIIPPAPQENGAFIYSTPAKDFALENVRYDAGIKVESKAENA TIIFVMQGTLKISQKNTALFLKQGESAFICADTQYRIEGIEQGYCVLSKL P >MS1776 maoC, MaoC protein MILYYLIIIFSLLLFTYTALNLNSAKKKINQNDDSFKTINEFFKNIKFKT ALWKYYFTAGNERNLLRNVLLTLIIFFFFHTLNYLYIKVDKFIFLGVFLV LFFIIVWKLGQRRNRKEFEEMFPEVIQILNSATSSGAGLLQALERCGKDI SGQIGEEFTAIHKRLAIGEDANSVFEDSYSRYPYKEYYFFITIIRVNLDK GGQMREVILRLGRVIADSKKMEKKKSAMTSEARMSAMIVASFPVAFFIFM KFMMPENFEFLLNDPGGRMILYYVLGSESLGMGIIWWLMRKAT >MS1306 map, Map protein MAIPLRTEAELEKIRIACKLASDILVMIEPHIKEGVSTGELDRICHEYIE RVQGATPANVGYHGFPKATCISLNDVVCHGIPSEDKILKNGDILNLDVTV IKDGYYGDNSKMYIVGGETSVRSKLLCEVTQEALYVGIRAVRAGVRLNQI GKAIQQYVEKQGFSVVREYCGHGIGDQYHTEPQVLHYFGDDGGVVLKPGM VFTIEPMVNAGKKEVRLMGDGWTVKTKDRSHSAQFEHELVVTETGCEVMT IREEEEKEGRISRIMVNAEA >MS1180 marC, MarC protein MVAVINPFGVLPVFVNMTNHQTKAERNHTNLITSFSVGVILLVSLFFGKI ILSLFSISINSFRIAGGILIISIAMTMISGKLGEDKQNKDEKNADFANMN SIAVVPLAMPILAGPGAISSTIVWASQYSSWFDWIGFSFAIILFSLLCYG LFRSGPTIVNALGKTGSNVVTRIMGLILMSLGIESIVVGITKLFPGLTH >MS0138 marC, MarC protein MFDSLFVQFVVLWAVIDPIGSVPVYLSKTVGLSVEERHKVARKSVIIATI VLMFFLVIGQGLFETMQIPLSAFQIAGGLVLLLFALTMIFGEGKPETEMK MRTSLSELAVYPLAVPSIASPGAMMAIVLLTDNHRFDFFEQCLTTVVMLL ILFITYLLFLIANKIQRVIGNTGAAVISRVMGLILAAVAVNNVLVGIRDF FGIAL >MS2091 marR, MarR protein MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV 
IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE >MS2146 marR, MarR protein MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE >MS0223 mauG, MauG protein MKKYFLSAIAIAGLGYLSMVGYAHFFDKSQSEKLYAATEVPQQFKPVAKV MFDNGCQYCHSPSADIPGYANFPIAKQLMEQDIAQGLRSFRLDRMFEGMK DPSKLSEADLAKLEQVIRNDQMPAAKFLHIHWGTRPDADEKQVLLDWIKQ QREAHFLPQNTQGADAARLVQPIPDAIATDPHKVALGEKLYFDGRLSADG SIQCHTCHQLEQAGVDNLPVSEGIEGKKGGINAPTVFNAAFNKWQFWDGR AKTLADQAGGPPINPVEMGSKDWDEIIARLDQDEAFKKEFLSVFPELSQA TVTEAIGEFEKTLITPNSAFDRYLKGEQSALNDQQKRGYELFKNAKCDTC HTGTAMGGQSFEYMGIYDDYFKARGTDLTDADKGRFAETQDPYDMHRFKV PTLRNVALTAPYMHDATAKDLKEAVRIMGHYQSNKDFSDAELDDIVSFLN SLTGEFKGKLLTNEKMK >MS0351 mazG, MazG protein MIPCLIEESYEVVEAIQQKNTADLREELGDLLMQVVFLSQLAAEENKFTF DDVVNDIAEKLIYRHPHVFGDKEAADEHAALRNWNEMKAREAKNQAHTSI LDNVPFSFPALLRAEKLQKKCAKAGFDWQQVAPVIAKVEEELEEVTQEIN CPAPQQAKLEEEIGDLLFAVVNLSRHLKCQAEESLRKANHKFERRFRAVE DKLRQQNKTATESSLMEMDMLWDEVKHEEKVSSD >MS0836 mdaB, MdaB protein MKHLVIFAHPNTKNSFNKAILERVLQASQKMNVDTTVRDLYGMNFNPVVS WEELTGSFKEIIPAAIRHEQQLISEADLITLIYPLWWMGFPAILKGYFDR VFTHGFAYKTDETGTVGLIQGKKMQQFITMGNNEERYQQMGFARSLNDTL VNGLFNYVGIIDIDHRLLGDIHIISSEERQALLNEVEQKTKENLTALLEG KA >MS2139 mdaB, MdaB protein MSNILIISGHPNLANSVVNTIILDEFAKTLPQAEIRKLDQLHTNYEFDVA AEQAAIEKADVILWQFPFYWYAMPALMKKWLDDVFVHGFAHGSTAKIAGK KLLISLTTGAPLEAYQREGFFKHKMDDFFAAFETTAILCGLDFQGVQFLN GVSYVGRNEEKIAQQQAEAKVYAQTVIEKVKRL >MS2094 mdaB, MdaB protein MKTTVLVVHPNIKQSRVNAALAKGAADVAGVKVRYLYDLYPDGKIDATAE QAVLEKADRIVLQFPMYWYSSPALLKQWLDDVLAYGWAYGDKQALKGKEL MLAVTTGGGEEFYQKDGLAGHTVAEFLVAYETIASYLGMNYGKMFVTGNC LNISDDEIAAQVPRYQAVLSA >MS1417 mdaB, MdaB protein MKNVLIVSGHPNLKTSIANQVILDETAKALPNAEIRKLDELFHNGTFDIA AEQAAVLKADVLVFQFPFSWFSLPGVMKIWLDEVFEHGFAHGSTAQLAGK KIIFSTTTGAPAEVYQKDGFFKYTMEEFAAQFEIMAQLCNLDYQGLIYTN GIGYTSRENEEKINAQKAEAKKHAQRLVALIEKA >MS2162 mdaB, MdaB protein MVIFLTVCYHTSGRFLSIFCKCRYSTSGEEYMNRRNLLKAGVALAAVAAM PFGRAQAKTPSKKTLVIVSHPYPESSTFIKGLQQAAETVEGVTVRNLETI 
YGFDTRAVKGDEERRIMRAHDRVVFIFPTHWFNITPMMKAYLNETWGSVG PGLWQGKEMLVVSTAAGGSETYGKNGRVGVELADVFLPMKASALHCGMTY LPPLVFQGVRSSELANYQQQLIERLMQ >MS1266 mdh, Mdh protein MKVAVLGAAGGIGQALALLLKLQLPAGSSLSLYDVAPVTPGVAKDLSHIP TDVVVEGFAGTDPSEALKGADIVLISAGVARKPGMTRADLFGVNAGIIRS LTEKVAEQCPKACVGIITNPVNAMVAIAAEVLKKAGVYDKRKLFGITTLD ILRAETFIAELKGLDPTRVTIPVIGGHSGVTILPLLSQVQNVEWSSEEEI IALTHRIQNAGTEVVEAKAGGGSATLSMAQAAARFALALVKASQGAKVVE CAYVEGDGKYARFFAQPVRLGTEGVEEYLTLGKLSAFEEKALNAMLETLQ GDIKSGEDFING >MS0950 mdlB, MdlB protein MEKTRQRYLFKWLRAQQQPVKKLLNLNILLASTSSVILVLQTWLLATLLN DLVMLDTKPRELLPHFFGLAIGFTLRALILWLRERIGFKCGQQLRNIIRR RILEKIHQVGPAVINNKPAGSWATLMLEQVENLHNFYSRYLPQQMLSVIA PVVILIAVFPINWAAGFILMATAPLVPVFMIIVGLAAADSSQKNMTTLAR LSAQFLDRLKGLETLRLFDRAEQQTNHIEIGTEAYRKTTMDVLKMAFLSS AVLEFFTSISIAIMAVYFGFSYLGQINFGTYDTSLTLFAGFFCLILAPEF YQPLRDLGTYYHDRAAAIGAADSIVEFLEQKELHSATQTRQLNETTALEI KAENLVINSPQGQALTKPLNFSLSPLSHTALVGQSGAGKTSLMNALLGFL PYEGSLTVNGIELNRLEPTQWRKHIAWVGQNPLLLQGSIKENLLLGEIQA SEEEIAQALKQAKATEFTDKLGLDYEIKDGGTGISVGQAQRLAIARALLR QGSLLLLDEPTASLDAQSENQVLQALNQISQTQTTLMITHRIEDLKQCDQ ILVMQSGEIVQQGIFEQLQNEGYFAELLAQRNTDVA >MS0932 mdlB, MdlB protein MVSADCRRRAGLIFGSYYNIGYNAPIIFLRRYTFMQKLQENDLSTSQTFK RLWPTIAPFKIGLIAAAAALVLNALTDSGLIYLLKPLLDDGFGKADTSFL KLMAVLVIVFIFIRGITSFISSYCLAWVSGKVVMTMRRRLFKHLMYMPVS FFDQNSTGRLLSRITYDSEQVANSSSNALVTIVREGAYIISLLAVMIATS WQLSVVLFIIGPVIAVLIRLVSKIFRRLSKNMQNSMGELTATAEQMLKGH KVVLSFGGQQIEEQRFNEVSNDMRRKGMKMVVADAISDPIVQIIASLALS AVLYLATIPSIMSQNLSAGSFTVVFSSMLAMLRPLKSLTNVNSQFQRGMA ACQTLFDILDLDTEKDKGKYEAERVKGDVSFKDVSFTYQGKDQPALKHLS FDIPHGKTFALVGRSGSGKSTIANLVTRFYDINQGEILLDGVNVQDYTLS NLRTHCSVVSQQVHLFNDTIANNIAYAAKDKYSREQIIAAAKAAHAMEFI EPLENGLDTVIGENGASLSGGQRQRLAIARALLRDSPVLILDEATSALDT ESERAIQAALEELQKDRTVLVIAHRLSTIEKADEILVIDHGEICERGSHE ELLALNGAYKQLHKMQFNG >MS0394 mdlB, MdlB protein MLNKIFSWFERRVEAYPDQTPNTPENGLFKFIWSSLDGMKKWILLLAVLT VGTGVMEALLFQFMGVLVDWLGNYTPVTLWQEKGTLLWGMGFLLVFSILW SFLASAVRLQTLQGVFPMRLRWNFHRLMLGQSLSFYQDEFAGRVSAKVMQ TALAVRDTVLTIADMMVYVVVYFISSGVVLVALDGWFLVPFVVWVVLFVM 
ILRVLIPKLAKTAERQADARSLMTGRVTDAYSNITTVKLFSHGAREASYA KKSMEEFMVTVHAQMRLATSLDTLTYAANVFLTLSTAILGILLWQKGAVG VGAIATAVAMALRVNGLSRWIMWESARLFENIGTVNDGMTTLSKPHTIID KPNAPQLEVKKGEIRFDNVDFCYDPAKPLLNHFNLTIRPGEKVGLIGRSG AGKSTIVNLLLRFYEAQNGTISIDGQNILDVQQESLRRQIGLVTQDTSLL HRSVRDNIIYGRPEATDEDMINAAKRAEAADFIPFLSDAKGRRGYDAHVG ERGVKLSGGQRQRIAIARVMLKDAPILLLDEATSALDSEVEAAIQESLDK MMENKTVIAIAHRLSTIAAMDRLIVLDKGQIVEQGTHAELLAQNGLYAKL WRHQSGGFLSEHAD >MS0949 mdlB, MdlB protein MRSLIPFLTLFKYAKFPLILGVILMILGLAASIGLLTLSGWFLAATAIAG AGTLFNFFYPSSAVRGLAIGRTVARYFEKIVTHDATFRILSKLRVQVFGK IIPLSPAVLNRYRNSDLLNRLVADVDTLDSLYLRLIAPFVSAIFVIVLIT VGLSFINLPVALFIGITLLVLLLVIPTVFYKLGTKFGKKLTLSRATYRSQ FVEFIQAQAELLLFNAEDKIKQNLANTESEWQAYQQRETNLAGLSSAILL FANGLILAVTLWAAAHIDLGTGEYKAALLALFAFSALAAFEILMPIGAAF LHIGQVIASADRVTEIISQPPLVKFSGKQTALSATADLIRLKDVSFSYPE RTSCALNGLSLTIRKGQKVAVLGKTGSGKSTLLQLLVRNYNPSQGEILLA EQPIHAYSEQCLRENICFLTQRVHTFSDTLRNNLQIANKTKIDDLKMREV LVQVGLAKLLEQKSGLDLWLGEGGRPLSGGEQRRLGLARILLNESPILLL DEPTEGLDRETERRILRLLMQHSANKTLIMVTHRLTAIEQFDQICVIDDT NLVEKGSYQELNSKNNGFFKKLVERI >MS1204 mdlB, MdlB protein MNIFISTIKGYRWHLVAVLILTFIFSGFGIGVLAFINNKLMKATEQKELL IWTFIGLLILFLITSVIAQISLTALGHKFVYLMRKQLVKQLLDTGTEQLN QIGKARLLASLSGDIRNITFAFVRLPELVQGSILVLCAGAYMFYLSESLF FVTALWLSVTVWVSNIAVRRVYHHLRIVRETEDKLYKNYQSAIEGHKELS LNRERAKFYFERELEESASTQRDNSVRADSYHAFANNWTNVMVLGAVGLV FYLSLAEGWSNLETATTIALTILFLRTPLISAVGALPMLLNAKVALDKLS KLNLAPYTEDFAISNPLPRDWKEIRFENVTYSYPTAEGTMSFALKPVNLT IKRGELIFLIGKNGSGKSTFSMLLAGLNHATDGKIFVDNIEITAANQRAF RAQISAVFSDFYLFTQLLGQAGFASLKEAGQWLETLQLENKVTVENHRLS TTNLSQGQRKRLGLLIALLEHRPLLILDEWAADQDPTFRRTFYQVLLPLL REKGHTVFAISHDDSYFHLADRLLLISQGELRELFGEERETASHDAVEKL NNIIKETKI >MS1569 mdoB, MdoB protein MHSFKNYKEIMKNLKRLLGASYPIIIFLPVNLVVLSLSRLGLSLWQSERV DATQGWTELFLQGMRVDLASLCWLFFPLILLGTLFSGNNKFGKIIQWIIK LGLTVFSTFFVFMELATPAFIKTYDYRPNRLFVEYLNTPKEVFTMLAHGH LAALISTVILTALFAIIFWKFAQKLSRDLYFPKWQYRLVSFLILGLFAFI GARSSFEHRSLNPSMVAFSNDTLVNSLVLNSGYSVLFAVQQMKDESNSSE IYGKMPLEEVVNTLKGLNPRPETAYISAELPTLTHNQASYQGKPKNLVIL 
LQESLGAQFIGTLGGKPLSPNIDELAKEGWLFDNLYATGTRSVRGIEAII TGFTPTPARAVVKLQGSQDNFFTIADLLKQQGYDTSFIYGGEKHFDNMAS FFYGNGFTRIIDQADYSNPTFSGTWGVSDEDLLNKANETFESLHQQGKPF FSLVFSSSNHDPFEFPEGKIQLYEQPQATRNNAAKYADYALGEFFKKAKQ ANYWKDTVFIIIADHDSRVNGQQLVPIKHFHIPALVLGADIEPKRDHRLI SQIDIPPTLLSLIGISGDYPMIGFDLTKAENPNRALMQFDKNLAYMRDNR VAILQPNKPATGFIYDEKTGDLTAASIPENMAKEALAYALWGSYAYKNKL YKSDYLLKK >MS1409 melB, MelB protein MKQQNVWLKRIGYGFGDFGCNLVFSTMASYLMFFYTDVFGIEAAVVGTMM LSTRLLDAVTDVLMGLVVDRTNTRWGSGRPYFVIGAIPFAIFTTLTFYVP DFGTAGKIIWAYCTYIMLSLAYTVVNIPLNTIVPRLTSDINERNILVASR MICALLGTTVVMGITQPLVDFFGQGDYKQGYFITMTLYGILAMLIFFFTF TQTEEVVPPTVVRTENSSVLDDFKGLTSQTWILVLVNFFYFGLFVVRNTS VIYYFTYNLNSTSWLTFVGFFGILSGLPILLLLPRLQKIFPQRTLIIACC LLYIIGDAIAYIGKDSLTLQLVSLAVTGLGMYGIFGVTFAIQPDVIDYSE YEKNRSIPGMIASMQGFFVKFGMGVAGLSIGWILEGGGYQPNVVQTESAL FSIEVCYIWIPVIICLSIIALMYFYKLDGLRSEMTRVLDLRRKQMEYAHQ H >MS1228 melB, MelB protein MSLSMKTKLSFGLGAYGKDFAIHIVYMYLMYYYTDVLGVSGAIVGTIFMI ARVWDAVNDPIMGWIVNNTRSRWGKFKPWILIGTVLNSIVLFSLFCADYF SGTALIIYIAVTYILWGMTYTLMDIPFWSLTPTLTLDQREREELVPYPRF FSSLANFITAGTCIAFVDYVGGDDKGFGFRMFTLVIIVCFLISTVITLMN LKEKYSSDNLETGEAQQRIPLKTLVSLIPRNDQLSSLLVMALSYNIASNI ISGFAIYYFTYVVGDKEMFPYYMSYAGIANLIIIIFFARLVRLFSRRTLW ITISVSSILSCLILAYTGMAETPSVFLIILAGIFMQIGSALFWTLQVIMV ADTVDYGEYKLGIRSESIAYSVQTMVVKAGSAISAFLIGVLLTAINYVPN EVQNENTIFWMKVIMIGLPILFYSIKLFVYFRYYKLHGDLLAKVNIALLD KYRNVKED >MS1840 menA, MenA protein MTQSALKTWFETARPKTLPLALAIIFTGSAVAYWFGSFDWQITLLCLLTA TLLQILSNFANDYGDFQKGSDTVERIGPLRGIQKGNMTEGQLRNGLIVTI ILILISGFALLATAYQSLQDLIVFITLGIASIVAAIAYTVGKKPYGYLGL GDIFVFLFFGLLAVAGTYYLQAHSMNWTVFLPAGACGFLSTAVLNINNMR DIEQDKKAGKHTLVVRLGAEKSRVYHCLLLTSGVLCYALFSAINVDSRWG FLFILAVPLLVKHAGFVYKTKEPILLRPMLAQMSLLALLTNVLFSLGLVL AK >MS1792 menB, MenB protein MLYPSEEFLYAPVEWADHSEGYTDIRYHKSKDGIAKITINRPEVRNAFRP QTVKEMIHAFSDARFDEKIGVIVLTGEGEYAFCAGGDQKIRGDYGGYKDD SGVHHLNVLDFQRDIRTCPKPVVAMVAGYAVGGGHVLHMMCDLTIAADNA KFGQTGPKVGSFDGGWGASYMARIVGQKKAREIWFLCRMYDAKEALDMGL VNTVVPYADLEKETVRWCREMLQNSPIALRCLKAALNADCDGQAGLQELA 
GNATMLFYMTEEGQEGRNAFNQKREPDFSKFKRNP >MS1794 menD, MenD protein MSVSTFNRCWSKVILETLTRHGVKHFCIAPGSRSTPLTLEANRLQEQRRA LCHTHFDERGLGFFALGLAKSSQTPVAIIVTSGTAAANLYPAIIEARQTG DNLIVLTADRPDELIECGANQAILQQNMFAGYPVASVNLPRPSQDYIVSW LISTLDQACHQQAQQAGVIHINVPFAEPLYDADEDEIDVHPWLAPVQRWL NHNKPWADHQALQEEVVMHEHWDNWRTKRGVIVAGRLTQEQSMGITAWAN TMGWVVLTDIQSGVEPSLPYADIWLANKTVREKLLQADLVIQLGYAFVSK RINQFLADFKGEYWIVDESAHRVDPYHHIHTRFTAKVHHWLRAHPPLRQK PWLLEPLALSKFCASFIEQQVGGNLNEASLAHHIERILPNNGILFLGNSL FVRLVDALGKLPEGYPVITNRGASGIDGLLATAAGVGMGSNQPVVAMIGD VSALYDLNSLALFKNVNQPTIIFLINNNGGAIFDMLPVESSVKSEFYRMP HHTEFSQAASMFDLKYARPYTWADLSSVLKQAYSRKEATVIEIKVGPMDG SNTYKRLIEQISYAVIGA >MS1795 menF, MenF protein MFLTMDSLQSLQQQLIQQMDAYQPCKNQPEITALTAKIQLEQNLLAWLKA QQDYPQFYLHCRADSPNEQENHIAAIGQVRTFTTVNHAQTFVRRADFTLV GGMTFNGECDFYLPRLLLRQVDGELTATLFIDSQKDLSVEKQLAGKCLKN FTKSVALEPVVQSVRLVEKKATQAQWCEWVEQALLEIKKGSFTKVVLANE SIFSSRQPINAIDFLAESEKKNTGCYHFFFAQKADYAFIGSSPERLYLRN GQYLQTEALAGTAVMSDDEEQNQRQGEWLLKDEKNEYENMLVVEDICGNI ESFTQNIEVQSVELKRLRLVQHLRRKIFAKLTALTADEACLNAIHPTAAV AGLPKQNALRFLAKTETFERSWYAGTLGFMNRARAEFCVTLRSAFVEQNR IRVFAGAGIVAGSVPLLEWQEIERKASGLLSLLQNSGEHICQ >MS1839 menG, MenG protein MSYTGNRFILEKSNLYKGNFMRIDTSELCDVYLDQVDVVEPIFSSFGGVN EFFGKVTTIKCFENNGLIAEILEEQGEGRVLLVDGGGAVRRALIDAELAQ LAADNGWEGIIVYGAVRQLSRLENINIGIHALAPIPVGADEDTQGESDIP VNFGGVTFFPEDYVYADLTGIILSQEPLELEELGEE >MS0804 mesJ, MesJ protein MSDIFNQFQQQIYQQKILIAFSGGLDSTALLALCKKLQENRPHFQFRAIH IHHGLSPNADKWALHCENICRQFSIPLIVEKVRVDKSNGIEAGAREARYH AIANHLNIDEVLATAHHLNDQTETFLLALKRGSGVQGLSAMQKESVVFNL PIFRPLLQFTRAQLEDYVKSQKINWIEDESNEDNSYDRNFLRNIILPKMK TRWAHFDQAVYRAAQHCFEQTQLVNELLEDEFQKIFEKNDRTLSVKLFDR YSYIKQKALLRLWLARLQLAMPSQKQLEQLIRDVIFAKPDAIPQFKLANQ VIRRYQQKLYLTADFADLTDVAVPLKISQTVPLPDGLGHISLREQSGCFV FSWRHYQVQLPPCKQPIEIRFAYSGKVKLHKNGVNQDIKKVWQNLNVPPW LRNRTPLIFYGDQLKSAVGFFKVFDC >MS1087 mesJ, MesJ protein MTELETKENKKQIYNFNKLQKRLRRNVGNAIADFNMIEDGDKVMVCLSGG KDSYTLLDILLNLRHNAPVHFDIVAVNLDQKQPGFPEHILPEYLSSIGVE YKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGH 
HRDDMLETLFLNMFYGGKLKSMPPKLVSDDGKQIVIRPLAYCKEKDIEKY AVAKQFPIIPCNLCGSQPNLQRQVIKEMLQTWDRRYPGRIETMFSAIQNI TPSHLCDPNLFDFKNIKRGQLPKGVEGDIAFDKEELPQTPIIDEDTEDFV NNGQLIRFKEVN >MS1627 metC, MetC protein MTQNYSIETILAQAGNKSDARTGAVSTPIFLSTAYGHRGIGESTGFDYTR TKNPTRLVLEETIAKLENGDQGFAFSSGMAAIQVLMTLFTAPDEWIVSSD VYGGTYRLLDFAYKNNNSVKPVYVNTASVEAIETAITPNTKAIFVETPSN PLMEECNVTEIAKIAKKYNLLLIVDNTFLTPVFSRPLDLGADIVIHSATK YLAGHNDTLAGLVVAKGQALCERIFYIQNGAGAVLSPFDSWLTIRGLKTL ALRMERHQANAAAIAEFLKAQPQVKDVLYPNKGGMLSFRLQDENWVNPFL KAINLITFAESLGGTESFITYPTTQTHMDIPAEERIARGVTNDLLRFSVG LENVEDIKADLAQAFAQFK >MS1520 metC, MetC protein MSNKYSLATTLVHAGRSKRVSQGSVNPVVQRASSLVFDSIADKRQATVNR AKQALFYGRRGTLTHFALQDLMCEMEGGAGCYLYPCGAAAVTNAILAFVQ SGDNILMTGAAYEPTQDFCNKILSKMNVSTTYYDPMDGEKIAELVQPNTK VLFLESPSSLTFEVPDVPNIVKAVRKINPEIVIMIDNTWAGGILFKALEH DIDISIQAGTKYLVGHSDVMIGTAVSNARCWDQLRENSYLMGQMVDADTA YTTARGIRSLAVRFKQHTESSIKVAQWLAEQPEVKAVFHPALPSCPGHEF FKRDFTGSAGLFSFELKEQLSREKLERFMDNFKLFSMAYSWGGFESLILY NQPADIAAIRPNIKRKLTGTLIRIHIGFEDVNELIEDLKAGFERLK >MS0941 metE, MetE protein MTKLFPNATVRTSAPYRFDIVGSFLRSDAIKSARAACACGDISCADLTRA EDAEIAKLVERQKSVGLHAVTDGEFRRTFWHLDFLAGLDGVEEVDAEKFS VQFKHHNVRPKTLKIVAKVDFSENHPFVEHFRSVNELAKGTEVKFTIPSP SMLHLITNVRATNYQPIPRYENNNQQLLDDIADAYIKAMNIFYKLGCRNL QLDDTSWGEFCAEDKRAAYQERGFDLDQIAKDYVYMLNKIVDAKPAQDIA ITMHICRGNFRSTWFSAGGYEPVAEILFGSCRVDGFFLEYDSDRAGDFKP LRFIKNQQVVLGLVTSKDGTLENREDIINRIKEAAQYVDINQLCLSPQCG FASTEEGNILTEEQQWAKLNFIREIAEEVWGK >MS0787 metF, MetF protein MSYAKDIDTLNQHVADLNGQINVSFEFFPPKNEKMEETLWSSIHRLKTLN PKFVSVTYGANSGERERTHSVVKNIKQKTGLEAAPHLTGIDATPEQLKEI AQDYWNNGIRRIVALRGDIPAGYTKTPFYASDLVALLRSVADFDISVAAY PEVHPEAKSAQADLINLKRKIDAGANHVITQFFFDIDNYLRFRDRCASIG IDAEIVPGILPVTNFKQLQRMAALTNVKIPNWLAVNYEGLDEDQTTRNLV AASVALDMVRVLSREGVKDFHFYTLNRSELTYAICHILGVRPK >MS1631 metG, MetG protein MSNQHRQILVTCALPYANGPIHLGHMLEHIQADIWVRFQRMRGNEIHFVC ADDAHGTPIMLKADQMGITPEQLIADVKEKHYADFCGFNISFDNYHSTHS EENRELSELIYSRLKENGFIKSRTISQLFDPEKSMFLPDRFVKGTCPKCK AEDQYGDNCEVCSATYSPTELINPRSAVSGATPVIKESEHFFFDLPSFES 
MLKEWNRSGALQSEVANKMQEWFDAGLQQWDISRDAPYFGFKIPGTENKY FYVWLDAPIGYMASFKNLCKRENLDFDRFWNKDSNTELYHFIGKDIMYFH SLFWPAMLDGANYRKPTNIFVHGYVTVNGEKMSKSRGTFIQAATYLKHLD PECLRYYYAAKLSNRIDDLDLNLDDFVQRVNTDLVNKLVNLASRNAGFIQ KRFDGKLADKLEDESLFAEFIAQSEQIAAYYENREFGKAIREIMALTDKA NKYVDDKAPWVIAKEEGREAELQAVCSMGIQLFRVLMGYLKPVLPKLAER SEAFLQAELTWDNLAQPLLNHGIAPFKALFSRLDVKQIDAMIEASKAENA AVNATVKKEEKNSKKSTALLTDFEPIEPEISIDDFAKIDLRVAKVIKCEE VPESKKLLKFQLDLGFEQRQVLSGIKGAYNNPEELEGRFVIVVANLAPRK MKFGVSEGMILSAGTGGEDLYLLDVDAGVKAGSRVM >MS1009 metH, MetH protein MHNKIDILKASLAQRILILDGAMGTMIQQYKLSEQQFRGERFKQSSVDLR GNNDLLSLTQPLLIQAIHEKYLQAGADIIETNTFSSTSIAQADYDLQAIA YELNFAGAKLARIAADKYSSADKPRFVAGVLGPTNRTASISPDVNDPGFR NITFMQLAEAYGEATRGLIAGGADIIMLETIFDTLNAKAAVFAIEQVFEE LGVRLPVMISGTITDASGRTLSGQTTEAFYNSLRHAKPLSFGLNCALGPK ELRQYVEQLSKISECYVSAHPNAGLPNAFGGYDLGAEEMAAQLKEWAESG FLNIVGGCCGTTPEHIKAFAEAMQGVKPRPLPQIKTAMRLSGLEPLSIDD DSLFVNVGERNNVTGSAKFKRLIKEEKFGEAIEIAIDQVENGAQVIDVNM DEALLDSQKCMTRFLNIMATEPDAAKVPVMIDSSKWEVIEAGLQSIQGKG IVNSISLKEGEEKFIRQAKLIRRYGAAAVVMAFDEKGQADTEARKVEICT RAYDILVNQAGFPPEDIIFDPNIFAIGTGIEEHNNYGVDFINATGRIKQT LPYAKVSGGVSNVSFSFRGNNPMREAIHAVFLYHAIKQGMDMGIVNAGQL AIYDDLDPELREVVEDAVLNRRPDATDRLLEIAEKYRNQDSTGEDNGVAE WRSWSVEERLKHALVKGITHFIIEDTEEARQKFSLPLEVIEGPLMAGMDV VGDLFGDGKMFLPQVVKSARVMKQSVAYLEPFINATKQKGSSNGKVVIAT VKGDVHDIGKNIVSVVLQCNNFEVIDLGVMVPADKIIETAIAEKADIIGL SGLITPSLDEMEYFLGEMNRLNLNIPVLIGGATTSKEHTAIKLYPKYKYE VIYTTNASRAVTVCAALMNPESKAELWARTRKEYEKIQQSFAERKPLRSS LSLEQARANGFNPFAGEWANYQVPQPKQPGISEFKDVPIAMLRKFIDWSP FFRVWGLMGGYPDAFDYPEGGEEARKVWHDAQIMLDEFENNGKLTPSGVL GIFPAERAGDDIKIYQNSDRTLLAGVARHLRQQSERGKNSKIPYNLCLSD FIAEGSNGQQDWLGMFAVCAGTQEHALVDSFKAKGDDYNAILLQAVGDRL AEAMAEYLHFELRTRLWGYSDETFDNQALIDEKYIGIRPAPGYPSCPEHT EKQLIWDLLEVEQRIGMKLTESYAMWPAASVCGWYFSHPASSYFTLGRID EDQAADYAKRKGWDEREMRKWLGVSMK >MS1966 metJ, MetJ protein MGFFSLKYRQILRLLIGNFMADWDGKYISPYAEHGKKSEQVKKITVSIPI KVLEILTNERTRRQLKNLRHATNSELLCEAFLHAFTGQPLPTDEDLLKER HDEIPEQAKLIMRELGINPDEWEY >MS0669 metK, MetK protein 
MSSYLFTSESVSEGHPDKIADQISDAVLDEILKQDPKARVACETYVKTGM ALVGGEITTSAWVDIEYIARQVICDIGYTSSEMGFDGHSCAVLNGIGKQS SDINQGVDRDDPLNQGAGDQGIMFGYATNETEVLMPAAITYAHRLMERQA WVRKNGTLPWLRPDAKSQVTLKYENNKIVGVDAVVLSTQHSDSVTQEDLH EAVMEEIIKPVLPAKWLSKDTKYFINPTGRFVIGGPMGDCGLTGRKIIVD TYGGAARHGGGAFSGKDPSKVDRSAAYAARYVAKNIVAAGLADRCEIQLS YAIGVAEPTSIMVETFGTGKVADELLVALVREHFDLRPYGLIKMLDLIKP IYRETAAYGHFGREQFPWEKTDRAAELKAAAGL >MS0216 mfd, Mfd protein MTTHYFNLDIPTQAGDHKIVANVLTGSDGLAICEMAEQFQGLTVVVANDT KSAVRLEKILQESGKLEVRYFPDWETLPYDSFSPHQDIISSRLSALFYLQ NTRKGILILSVSTLMQRICPPQYLQHNVLLIKKGDRLVIEKLRLQLENAG YRAVEQVMEHGEFAVRGALLDLFPMGSPLPFRLDFFDDEIDSIRTFDADT QRTLEEIRQINLLPAHEFPTDDKSIEFFRAQFRETFGEIRRDPEHIYQQV SKGTLVSGIEYWQPLFFENMATLFDYLPANTLFVDMEQYQIQAERFYQDA VQRFESRKIDPMRPLLAPERLWLRIDEVNRALRNYPRISLKAEKVRTSVR QKNLPLKALPELQIQPQQKEPLQNLRHFIEKFKGHIVFSVETEGRRETLL DLLSPIKLRPKQVNSLFEAQSQTYSLQISSLDNGFIIEQENGEPIAIICE TELLGERVQQRGRDKRKSVNPDTLIRNLAELKIGQPVVHLDHGVGRYGGL VTLENAGIKAEYLLLTYANDAKLYVPVANLHLISRYVGGSEETAPLHKLG SDSWAKARRKAAEKIRDVAAELLDVYAQREAQKGFAFHYNREEFMQFSAT FPFEETHDQEAAINAVISDMCQPKAMDRLVCGDVGFGKTEVAMRAAFLAV MNHKQVAVLVPTTLLAQQHYENFRDRFANLPVNVEMVSRFRTAKEQKKIL EDLSAGKVDILIGTHKLIQSDVKFNDLGLLIIDEEHRFGVRQKEKIKQLR ANVDILTLTATPIPRTLNMAMNGIRDLSIISTPPARRLTIKTFVRQADDL LIREAILREILRGGQVYYLHNDVASIENCAEKLTALVPEARIIIGHGQMH ERELERVMTDFYHQRFNVLVCSTIIETGIDIPTANTIIIERADHFGLAQL HQLRGRVGRSHHQAYAYLLTPPPKLMTKDAVKRLEALESLDNLGAGFILA THDLEIRGAGELLGSEQSGQIESIGFSLYMELLEAAVQAMKQGREPSLDE LTQQQVEIDLRIPALLPEDYLGDVNMRLSFYKRIAGAENKPALDELKVEL IDRFGLLPEATKNLMQITELRLMAKQLDIIRIDGSQNGGFIEFSPTADID PMKFINLIKQQPAVFKFDGPTKFRFSCALEQAQKRLDFIFNLLQSLMD >MS0642 mglA, MglA protein MTAQSAQSDNQVLLTMTNVSKSFPGVKALDKANLTVKSHSVHALMGENGA GKSTLLKCLFGIYAKDEGEILFLGKPVNFKTSKEALENGISMVHQELNLV RQRNVMDNLWLGRYPLKGVFVDHTKMYNDTKAIFDELDIDIDPREKVANL SVSQMQMIEIAKAFSYNAKIVIMDEPTSSLSEKEVEHLFKIIEKLKDRGC GIVYISHKMDEIFKICDEITILRDGKWINTVPVKGSTMEQIVAMMVGREL TQRFPPKINEPKEVILEVEHLTALNQPSIQDINFELRKGEILGIAGLVGA KRTDIVETIFGVRERKSGTVKLHGKIMKNRTALEAINNGFALVTEERRST 
GIYANLSIEFNSLISNMKSYMNKWGLLSDKKMKSDTQWVIDSMNVKTPSH KTTIGSLSGGNQQKVVIGRWLLTQPEILMLDEPTRGIDVGAKYEIYQLIM QLAQKDKGIIMISSEMPELLGITDRILVMSNGKVAGIVETAKTSQEEILQ LAAKYL >MS1611 mglA, MglA protein MKMTDTNILTLKNISKSFFDVTVLEDINLDIRCGEVLCLIGENGAGKSTL CKIIAGIYSRDTGEMLYQGQPYSPTTVKEAQEAGIGFIHQELMLVPKLTV MENIFLGAEKTLSFGRMNWSEMREKTQHIIDELELDIKPDDLIADLSIAQ QQMVEIAKAVFSEYKIIIFDEPTSSISRKNTEVLFKIIHQLKTKNVAMIY ISHRLEEFKYIADRVTVLRDGRITGTMRYEETSPEDIVRLMVGRKVDFSR YRRDTVFTQEKLRVENIQSKHISPISFQVNKGEILGFAGLVGAGRTEVLR AVYGADEATGKIYIDQKEISIHSPEDAVKHKIGFITEDRKSQGLVLGMSI RENITLPILKRFWNGWQLDKKKEREVVEANRSKLHIVSKDQEQQTKTLSG GNQQKVILARWLESGVDILFFDEPTRGIDIGAKSEIYDLMRQFTENGGTI VMVSSDLPELITISDRVIVMRNGEKVKEIANRDDITEENLMHAMIGI >MS0200 mglA, MglA protein MATERLSMRNMTKKYGTVTVLEDVSFNVKAGEVHALIGENGAGKSTLLNL LSGVRDATAGEIYIDGQKVNINSPKAAKDCGIAMIHQELQNVPELSVFQN IFLGRSLKKNLGLFVDKSKESQLALEVLKSLDPGIDPSVPIKTLKVAQQQ IVEIARALLDNAKIIAMDEPTSSLTPSEFERLAELIKGLANSGVSIIYVS HKMDEIFKVCDRATILRDGRFIDCVNMSEQTEESIVTKMVGRKIEKLTHM SYATDEKILEVRNLGRDKAVKDINFIAHKGEVVGISGLVGAGRTELFRLI AGLDKPTSGEILVEGKRLKLNSVRDSIKMGIGLVPEDRKKEGILRDRSVL INIAMPSMDRFSQNGFIRKDYLGAVSHRLMMDLNLKPFDLEKTVGTFSGG NQQKVIIGRWLAAGTKIYLFDEPTRGIDIGTKSEIYNLIENLAKAGNVIL VVSSEMPEIIRVSDRVLVMKEGAITAELCGGDINEENIAQYAIGQNKIKN EEGLNYVSN >MS0062 mglA, MglA protein MQVPYLEFDNVSKSFPGVKALQNISFKCYEGKVHALMGENGAGKSTLLKI LSGNYLPSEGKLSIGGRQLVFRNTKEALLAGVAIIYQELNIVPEMTVAEN LCLGQLPHSFGIVDKAELIERTQQYLDKLDLNISPNTPLKELSIGQWQMI EIAKALSRGAKIIAFDEPTSSLSAPEIEKLFSVINELRDEGKVILYVSHR MEEIFRISDEITVLKDGQFVETFSDLSKITNDDLVRSMVGRNLGDIYHYR PREVGDVRLKIAHLSGEKLQGDFSLTVRAGEVLGLFGLVGAGRSELLKVI FGADPCVSGSIELDGKTLSIRSPKDAIEQGIVLCPEDRKKEGIVPTASVG ENINISARRLHNFFKFIINDKWEKKNAEKQRQQMNVKTPSIEQLIVNLSG GNQQKAILGRWLSEDIKVLLLDEPTRGIDVGAKSEIYDLIFKLADQKLAI IVVSSDLPEVIGVSDRIMVMRAHQITGVVERADATEEKVLKLAMVESLNV GD >MS0839 mgsA, MgsA protein MPLSAKKPRILIQGGYMETTFRHVAAQKHIALVAHDHCKEDLINWCQKNV HHLQNHQLYATGTTGHLIEKATELKINSLLSGPMGGDQQLGALIAENKID VMIFFWDPMNAVPHDPDVKALLRIAAVWNIPHAMNIASADLLINSPLINR EIELRIPDYQTYLQKRLK >MS1793 mhpC, MhpC 
protein MNTMTLVFLHGLLGTKSDWRKIIENLPHFRCVSLDLPFHGEHKFTEANNF EQCADFISHQIKSAVGNQPYFLVGYSLGGRIALYYALQSQCEKGNLQGLI LEGANLGLTCDEARKVRWKNDEFWAQRFITESAESVLNDWYQQPVFAHLN AQQRADLIEKRVTNCGKNIGKMLEATSLAKQPYLGDKVRESTLPVYYLAG EKDQKFRQMAVQEKLNLQLIANAGHNAHLENPVEFSQKLTALLRNHKIKK TDNL >MS0862 mhpC, MhpC protein MKLLNYQFHQLKQPSNQATMVFIHGLFGDMNNLGIIARAFSDAYNILRLD LRNHGQSFHADEMNYSLMAQDIIHLLETLQLTKVILIGHSMGGKAAMKTA ALRPDLVEKLICIDIGPIAYAHRWHDDVFAGLFAVKNAQASSRQEAKPIL ASYIKDEGVIQFMLKSFDGNAAEKFRFNLSALFNNYGQIMGWEEVFFDKP TLFIKGGNSDYLQSGYGTRILAQFPQASSFTINGSGHWVHAEKPEFVVRA IQRFLESN >MS0882 mhpC, MhpC protein MMLYETKGNGEPIIFLPGLFAGGWIWNSVVRNIQDKGFKTFTFTDPIPVA FEGSQQKALTELDTITENCSTPVYLVGNSLGALIALHYAFQRKDRVKGVI MSGAPGQLEMEAGVSLDELKTGKDKYTTLLGSRIFYDQSKIPPHGIEEVK YLFGTEKIFRNIVRWLYFSRKYDVPDVLQKISIPIDFIWGQYDLITPIEP WIDIAKNFPQTSMTIIKDSGHSPMVEQPELFTEALLRKISSGRTHIK >MS2156 mhpC, MhpC protein MIMTISALDFFKRDVTLPNQLDGLPHKLSDVTGLQIGSFKTNDGVSLNYW KAGSGEPLVFVPGWSSNGAEYINLIHLLKDKFTVYVLDQRNHGLSDKVKF GNRISRFAMDLHEFFNAENIEKAHLCGWSMGCSVIWGYVDLLGTSRVEKF VFIDEAPSIYCHSNWTEEERINAGAFTTSAEMMIDMYYGRGTCNMLQVNT DLFNFYNTIDALAFENSMALCDQVCPHDKDALEQVLFDHILNDWRDVLIN KIDKPTLVVSGEHSNWVESQRWIAQTVPNSEDLIYGKHEHGDHFLHLKMP QKFAGELTEFLNRMS >MS1517 miaA, MiaA protein MNQKPTAIFLMGPTASGKTDLAIQLRQELPVEVISVDSALIYKGMDIGTA KPSKEELALAPHRLIDIIDPAESYSAANFRSDALREMADITEQGRIPLLV GGTMLYYKALLEGLSPLPQADEKVRSKIEEKAQKFGWATLHKELSLIDPV SAARINPNDSQRINRALEVFYISGKSMTELTEQKGEQLPYHILQFAIAPE DRAILHRRIEMRFHKMIESGFKQEVERLYHRGDLHIDLPSIRCVGYRQMW EHLRGDYDLDEAVFRGICATRQLAKRQITWLRGWKYPIQWLDSLKNSENK EIIKRAFDLTMQNG >MS1690 miaB, MiaB protein MTQKLHIKTWGCQMNEYDSSKMADLLQSTHGLELTEEAEQADVLLLNTCS IREKAQEKVFHQLGRWKELKKKNPNLVIGVGGCVASQEGEHIRERAPYVD IIFGPQTLHRLPEMINQIRAGEKAVLDISFPEIEKFDRLPEPKAEGPTAF VSIMEGCNKYCTYCVVPYTRGEEVSRPLDDVLFEIAQLAEQGVREVNLLG QNVNAYRGPTHDGGICSFAELLRLVAAIDGIDRLRFTTSNPIEFTDDIID VYRDTPELVSFLHLPVQAGSDRILTMMKRGHTAIEYKSIIRKLRAVRPNI QISSDFIVGFPGETNEEFEQTMNLIQQVNFDMSFSFVYSARPGTPAADMP DDVTEEEKKQRLYILQQRINNQAAQFSRAMLGTEQRVLVEGPSKKDIMEL 
TGRTENNRIVNFAGTPDMIGKFVDIKITDVFTNSLRGDVVRTEDQMGLRV VQSPQAVINRTRKEDELGVGRFGG >MS2247 miaB, MiaB protein MRYFMSYSAPNIGFVSLGCPKNLVDSERILTELRTDGYNIIPTYENADLV IVNTCGFIDSAVQESLEAIGEALEENGKVIVTGCLGAKENQIREVHPKVL EITGPHSYEAVMEHVHKYVPRPERNPYTSLVPAQGVKLTPKHYAYLKISE GCDHKCTFCIIPSLRGDLDSRPITQVLDEAKRLVDSGVKELLVVSQDTSA YALDQSKENQNKTVFWNGAPIKNNLITLCRQLGTLGAWIRLHYVYPYPHV DDLIPLMAEGKILPYLDIPLQHASPKVLKAMKRPGSVERVLERIQKWREI CPELTLRSTFIVGFPGETEEDFQMLLDFLQEAQLDRVGCFKFSPVEGAVA TDMADQVPEEVKEQRFQRFMELQQQISAQRLQQKIGKTLPVIIDDIDEDG IIGRSMADAPEIDGVVYVDNRSESAVKIGDIIQVAITNADEYDLWGTC >MS2355 mipB, MipB protein MTTQLDALRNMTVVVADTGDIEAIKKYQPQDATTNPSLILSASALPQYAS LIDDAINYAKAKSTDKAQQLIDAEDKLAVNIGLEILKIVPGRISTEVDAR LSYDTAATVEKARKLIKLYNEAGINNDRILIKVASTWQGIRAAEILEKEG INCNLTLLFSQAQARACAEAGVYLISPFVGRILDWYKANTDKKEYVPNED PGVISVTSIYNYYKQYGYQTVVMGASFRNIGEITELAGCDRLTIAPALLK ELQESNADLPRKLDYKGEVKPKPAPLTESQFYWEHNNDPMAVDKLAEGIR KFAADIEKLEAMLSTKL >MS1459 mltA, MltA protein MNFIRTFIMFFTKNFVLKAATVLAATVLAACSSNTNAVKKTTESSVDPAQ FGAKYKGRSYSTSLFSSANVDNYSGVVNQGDFLTQLSNVRAYSTGISSTY YDNYNKISQWVLAGADVNQLANYGIRPQVMSGEDGYQNVLLTGYYSPVIH ARYSAQGKYQHPIYAMPSQKRFTRSQIYAGALEGKGLELAYSDSMLDNFL LGVQGSGYVDFGNGNLNYFAYAGQNGYKYQAVGRLLVEDGEIPKEKMSIQ AIRDWAERNPSRLQSLLERNPSYVFFKNDPAGKVKGSAGVPLVPLASVAS DRSIVPSGSVLLVEIPQIDNEGNWTGEHRLHLMVALDVGGAVKGNHFDLY QGIGDKAGHISGLLKHYGRVWVLR >MS0901 mltB, MltB protein MKIKYKFLALTACLMLAGCSSNNNKSAVNSAEDLSALPATAVYSNARTLN NFDDYVQFLKRKAAGQGVSSATLTTQNNIRYIDSAVRLDQKQAGNAARRQ GLPPLPPNPNGVTNYLTKHLTQAKVDKAEDNYYDVQVPLQKASSAFGVQK EFILALWGMESSFGYYQGDYDVLSVLATLAFDGRRETLFSKEFINAMKML DAGHLNRSKMLGSWAGAMGQTQFMPSSYLNYAADGDKDGTKDIWSNEYDV FASIANYLHTVGWDDTLPWGIEVSLTTPLPLSLAGTEKEKARSLNDWQAQ GVLPKNMFDADKLKALSNADLWLVRPDKEVGRTFLVSNNYRTILDWNRSN YYALSVGMFADRIKQTLGF >MS0315 mltE, MltE protein MIKVMKLKKFLVLLLIPFLYACSSDRSGNYDDAFAKDTNGLDLLTGQFSQ NIDQIWGVNELLVASRKDYVKYTDSYYTRSHISFEEGQITIETLADANRL HSAIVHTLLMGSDAKGIDLFASGDVPISSRPFLVGQVVDNFGRQINNIDV ANSFASYLLQNRLQSRRLSNGRTVQFVSIQMIANHVNVRARKYLSLVRQA 
SRRYGIDESLILGIMQTESSFNPYAISYANAMGLMQVVPHTAGRDIFKLK GRSGQPSKSYLFDPANNIDAGVSYLWILKNEYLAGITNPTSMRYAMISAY NSGAGAVLRVFDSDQEYAINIINRMQPEQVYRILTTVHPSSQARNYLLKV DKAQRSYRRAR >MS1565 mltE, MltE protein MNFSRFALALFSFGAVMPVIAAEQSLSQQREIYQKINQLLSISQSENTQN IAKALLDEMKDYPLYPYAEYKLISSNLANTDFAQIEAYLQRRKDFPLAKN LTKQWVIQHQNNQDWQGILANQDKLPKDIVSQCALLQAKSPISPVIDTNN QNALKSAVNSAQILTEKAEHPLPQAELEKLWLTGNSLPKACDPILDQWNQ TGGLTADLIRRRAVLALEQGNSGLLTHLSAQTQDTGLQNWLKTLAAIQKT PQKLKDPANPFNPDKLEPNTQNKRIAKALFPSFVRTIKDNEVGDPQRLLA QFDGWAKRFNLTAEETTDWEIAVISQLFDSPNTLLQQWRDTELKNLKADK LTERRIRMAIRNKEDIKPWLTLLSDKAKNADEWKYWTAKTLQRSTDKAEQ NQANALLSSLLNQRGFYPMLATQELGRAYQINLRNEENLAKPTASSKPEN PAPAKPSAQELTAQKYAAELSRIEELRILADTNNMNTEWRSLFARANFDE QIALTEYARDKQWFDLQVEGTILAKAWNHISLRLPNAYPQWFDLLLKNKK IDRTFAMAIARQESAWKPYVTSSADARGLMQLLPSTAKLTAQKAGLPYSN ANQLYDPFNNIMLGTAHLQELQDKYGNNRILISAAYNAGGSRVDQWLAKS AGKLTMAEFVASIPFYETRGYVQNVLAYDAYYQLLQNKKAQIFANEEYNR LY >MS0577 mmcQ, MmcQ protein MLRKISFRCSIKRQINKIGKNNMAKQDLKRHIFDYVLTQYGSEAEYLWKS YPDFAVFRHQDNRKWYAIVMNVEKEKLGLAGSGKINVMNVKCSPEMLSLF LAQEGFLPAYHMNKSHWLTIRLDGSVDKETVCFLLNGSFDLTATKQVKKK LGIMRYSEWIVPANPKYYDVENELHEGKEIFWKQSNNVDVDDIVYIYVTE PTAAIRYKCLVLEVNIPYKYRHEELQIKRVMKIRCLKEYDRKLFTRDKMA QFGVSAVRGPRHMPYSLKREIDNLTNDEVSV >MS0692 mmsB, MmsB protein MKIGFIGLGIMGKPMSKNLIKAGHSLVVLDFNKAAVDEIVALGATSAATP KEVAEQVEVVITMLPNSPHVKTVVSGENGLIEAQNTNYVFIDMSSIAPLA SREIYAELEKKGIDMLDAPVSGGEPKAIDGTLSVMVGGKKDVFDKYYDVM KAMAGSVVYTGDIGAGNVTKLANQVIVALNIAAMSEAFMLATKAGVDPEL VYQAIRGGLAGSTVLDAKAPMVLDRNFKPGFRIDLHIKDLANALDTSHGV GANLPLTSAVMEMMQSLRSAGDDKLDHSALARYYERLTGTEIKRF >MS1981 mmsB, MmsB protein MRTVKQHSSTIRRNIMSYSVAVIGLGSMGMGAAVSCVNAGLETYGIDLNP AALEKLKAAGAKDVASNGDAFAKDLDAVVVLVVNAAQANAALFSETGIAK KLKPGTAVMISSTMAAADAQAISQKLTELGLIMLDAPVSGGAAKAMKGEM TVMASGSKEAFDKLQPVLDATASKVYNIGEAIGLGATVKIVHQLLAGVHI AAGAEAMALASKAGIPLDVMYDVVTNAAGNSWMFENRMKHVVDGDYTPLS MVDIFVKDLGLVNDTAKSLHFPLHLASTAYSMFTEASNAGFGKEDDSAVI KIFSGIELPKKGGN >MS1021 moaA, MoaA protein MQSIPIKNVGTNRLVDTFQREYYYLRLSVTDVCNFKCTYCLPSGYQPPVQ 
KESFLSLDEIRRIVGAFAAMGTEKVRLTGGEPTLRKDFLAIVETISALEG IKKVALTTNGYRMEKDVERWKKAGVSSINVSVDSLDPRQFYSITGENKFH QVMKGIERAFEIGYEKIKVNSVLMKNLNDQEFDRFKNWVKDKPIQMRFIE LMQTGEMDQFFNRYHLSGQILAEKLLKEGWILRQKDRTDGPAKVFSHPDY LGEIGLIMPYEKNFCASCNRLRVSAKGKLHLCLFGEEGVDLRDLLVSDEQ QVILQSRLYAALQGKREHHLLAQGNSGIRTNLASIGG >MS0425 moaB, MoaB protein MTALSQTKLKIGLVSVSDRASQGVYQDQGIPELQAWLEAALTEEFDVETR LIPDEQAEIEKTLIDLVDNRQCHLVLTTGGTGPAKRDVTPDATLAVADRE MPGFGEQMRQVSLQFVPTAILSRQAGVIRKDSLILNLPGQPKAIKETLEG VKDKEGNVLVKGVFAAVPYCLQLISGIYVDTKPEVIESFRPKSARRC >MS1022 moaC, MoaC protein MTEFTHINSNGEANMVDVSNKRETVREARAEAFVSMSAETLAMIVSGEHH KGDVFATARIAGIQAAKRTWELIPLCHPLLLSKVEVKLTALLDTNQVRIE SLCKLTGKTGVEMEALTAASVAALTIYDMCKAVQKDMVISNVRLLEKTGG KSGHFKVE >MS1023 moaD, MoaD protein MLKVLFFAQTRELVGVDQIDVEAAFSTAEALRAHLAEKGGKWALALEAGK LLVAINQTLSSLDSSIRDGDEIAFFPPVTGG >MS1024 moaE, MoaE protein MPFFRLLPGDKMSDIKIAVQEAEFDQNSEYRWLSQSDSVGASVIFVGKVR DLNLGDEVSSLYLEHYPAMTEKALNEIVDEAKSRWDIQRVVVIHRVGLLH TGDEIVLVGVSSAHRGDAYHANEFIMDYLKTKAPFWKKEKTDKGERWIES RDSDQQAAEKW >MS2057 mobA, MobA protein MTITISAVILAGGLGRRMGGVDKGLQFWRGKPLIETVYQRLHRQIERISI NANRNREIYARFGVPVFSDRLAGFQGPLSGILTALERATTDYVLFVPCDC PNFPLNLLEKLKSAVEFSQISLAYAHDGERDHPTFCLVSTQLKNALADYL AGGERRMLYFMQMHGAVAVDFSTEKQGFININNLADLNSP >MS2056 mobB, MobB protein MIFDEMNKMTTQNSLPMLGITGYSGSGKTTLLEKLVPQLTGLGIRVAVIK HTHHDVNIDKPGKDSWRMKEAGASQVIMTCDRRWAIMTETRQPVSLSYLA GQFDSALTDLVLVEGFKQEPIAKILLHRKGMEKTLPELDEYVIATATDYP LVQNLPDLNINDVPAVARFIQRWYEEKCGQKNENFAK >MS1342 modA, ModA protein MSKIKNIFIGLTLSCAVSLFAEAKVTVFAAASMTNALEEVASDYKKVNPN EDIVFSFASSSVLARQITEGAPADIFISADQKWMDFLAEKDEIVKDSRVD LVGNKLVMIAPRTSKIEKVDLTNDKWQTALDKTYLSVGDPDHVPAGIYAK TAFTYLNQWAALENKLARAKNVRDALRLVEQGESPLGVVYATDAAISQKV RVVAIFPAESHPPVEYPAAIVKNKDNKESKAFFNYLKSDKAKTVFEKFGF SAK >MS1344 modE, ModE protein MDNTEILLTIKLHQRLFVDPKRIRLLKEIAHCGSINQAAKNAKVSYKSAW DHLEAMNAISPKPLLERNIGGKNGGGTQLTNYARRLLQLYDLLEKTQEKA FQILQDESIPLNNPLSATARFSLQSSARNQFFGKVTKLELKNGHCMVSIQ IEGLNRPLVASITEKSAVRLGLVPGKEVMLMIKAPWIKTQLEEPVDKENQ 
FLAEVRSVSDKGGEKEIILSIGENPEFCATIEKTVDVAVNQKRWLYIDPE QIVLASL >MS0833 modF, ModF protein MSKLAMRNAQFELHQNNLLSIPHFEIHSCDFWVVMGYNGSGKTAFSLALE KKLSLYNGEYQNQFDSISLLSFEKQQKILEQTFKDLNNDEAPDDFGKTAR EIILNGTDKNNLCEFYAQYLHIEKLLDRPFTKLSTGESRKVLLAQALVSE PDLLILDDPFEGLDRQSVQDWLKLLESLKGKLALVLIVNRFSDIPSIADF VAILDNKQMILAGKRQDIEGQSVYQQLKYAEDAEDVPLPGSASPLIRLPE GQNPFELKNVTIQYGDKVILNNLNWTVKAKQNWWIKGPNGAGKSTLLSII TGDHPQAFANHVVLFGKRRGSGETLWDIKQKIGYVSSQLHMDYRVNSTAI DVIISGFFDSIGVYRQVPDALRIKAMQWLSRLNMDSLAKKPFRSLSWGQQ RLLLITRAMVKHPPVLILDEPLQGLDGINRKLVKSFIEQLVSNSETQLLF VSHQDQDAPNCITHLFEFIPQEEGYCYQQKTLESADIME >MS1005 moeA, MoeA protein MNSLLSLEQALEKMLATLPSPSLDNLETLPISQAQNRICAQDVMSPINVP SFDNSAMDGYAVRLADLEQSPTLSVAGKSFAGIPFTDEWKPLSAVRIMTG AMIPQGADAVVMQEEVSVNEDGSVTFEKLPKPGQNIRRIGEDVKQGDVVL SVGAELNTVSLPLIASLGIPEIKVFPKLKIAVLSTGDELVPVGQPLNEGQ IYDTNRFAVKLMLEKLHCEVLDFGILPDNEAEFEKAFMHAQEQADLIITS GGVSVGEADFTKTVLEKLGEINFWKLAIKPGKPFAFGKLPNAWFCGLPGN PVSALVTFYQLVQPVIAKLSGAANYKRPQQFPAIAATNLKKSPGRLDFQR GLYRLNAQGQLEVEPVGLQGSHVFGSFVKSNCFIVLERERGNVTAGETVT IEPFNHLLG >MS1312 mrcA, MrcA protein MSTEQDKSAQSNTSNKPAKTLQNKKRRFTLFMAKLAFTGACLIGAYGIYL DGQIRSKMDGQIWRLPAEVYSRIESIRLEDNWSLDKIKQTLLENDYRQTT LVAAPGDFKIEDNSIVLIRRAFPFPEQAEAQRVFRLRFTDNKLSVIEDLI NLKAINEFKLSPKLIAMLQSEKEERLAIPLQNYPRLLIDALLLTEDRNFY QHEGISPLGMARAMITNIRAGHTVQGGSTLTQQLVKNLFLTNERSISRKI HEALMAILLDFRYDKNTILETYLNEIYLGQSGDIQIHGFELASHFYFGLP IREISLDQIALLVGMVKGPSLYNPWRNPGYALERRNVVLRLLLEHKIIGQ ELYDMLSKRPLGVQEKGKITRNYPSFIQTLQAELRDNLGENRENKLLGAR IFTTLDPKQQRAAEQAVVKATGELQLKTKNPDLQAAMIIADYKSGQILAI VGGTQIQYAGFNRAFMAKRQIGSLVKPSVYLAALSEPDKFRLNTPLNNRP ITITIKGSPPWSPRNYDNRFSGSVMLIDALARSLNVPTVNLGMKTGLKKV IETQQAMGWDKVNIPKVPSMLLGSLQVSPYDVTKLYQTIANNGGKVRLTT IQSITDRQGNLLYRHDNNAEQVVPEEAAYQTLYAMQQVVDRGTARSLMEN FGQYHLAGKTGTTNDARDTWYVGIDGQNLATVWVGRDDNGETKLTGASGA LYLYKDYLTRVPVKTLKLTKPKGIKFVGINSYGGWNCDNPIRTIPVWAGK DQVFCAPTPKPAPVEQAQPAVVEEAAPAQ >MS1567 mrcA, MrcA protein MRLKTLKFPFVNKKNTRTFKKKCGRFLSYFIGLTVALTFLFRFVPIPFSA YMAEQKLAHIIQLDFDYKVNYDWISLEDISPYMQLAVIAAEDQNFPNHGG 
FDWNAIKSAIKYNEKSSRIRGASTISQQTAKNMFLWHGQSWIRKGIEVPV TFMLETLWSKKRILEVYLNIAEFGNGIFGVEAASRYYFKKPAKRLTQSEA ALLAAVLPNPIIYKANRPSLLVRKKQAWIIRQMNSLGLNYLKKL >MS1975 mrcA, MrcA protein MKIVKLIFSTLLTIVILGCVAGGLLYFHIKSQLPDVQSLKTVELQQPMQI YTADEKLIGEVGEQRRIPVKLENVPKMLINAILATEDSRFYEHHGLDPVG IARAVSVAIANKGASQGASTITQQLARNFFLTPEKTIIRKTKEAILAIEI ENTLTKNEILELYLNKIYLGYRSYGVAAAAKTYFGKNLADLTLSEMAIIA GLPKAPSTMNPLYSLKRSEERRNVVLGRMLEMQFINKEQYDEAVQEPIKA SYHGAQIEFRADYVTEMVRQEMVKRYGEESAYNSGFKVYTTILSQDQAQA QKAVRNNLIDYDMRHSRYRGATPLWQSSETPWENNKIIDTLRKLPNSEPF LPAVILSVAKEGTELLLASGEKMTLNAAAMRWGGRNVSLKTGEQIWIRQR DNNEWVLGQIPEANSALVSLNSDNGAIEAIVGGFSFEQSKFNRATQSMVQ VGSSIKPFIYAAALEKGLTLSSVLQDTPISIRKPGQAEWRPKNSPDRYDG PMRLRVGLGQSKNMIAIRAMQTAGIPYVAEFLQRFGFKREQYFASEALAL GAASFTPLEMARGYAVFDNGGFLVDPFIINRIVDNSGKDIFIANPKIACT TCDEMPTIYGQTTDKVDGFKENDSVNADGNLAQTDENTNGEETDQNGENN DVPELQNQGGTINEDALNLMVEGKTDSSQVQYAPRVITGELAFLIRSALN TAIYGEQGLGWKGTSWRMANEIKRKDIGGKTGTTNNAKVAWYAGFGANLT TAVYVGFDDNKRNLGKGEAGAKTAMPAWINYMKFVLEDVPERVLPTPANI IEKSIDLGSGLLSKGGGRTEYFIKGTEPKRAFVQERGYYVPEGLPFQTPS ASEYVPIGQPAPAAPAPSRKELF >MS0590 mreB, MreB protein MLFKKIRGLFSNDLSIDLGTANTLIYVKGQGIVLDEPSVVAIRKDRVGSL KSIIAVGKDAKMMLGRTSNNIDAIRPMKDGVIADFFVTEKMLQHFIKQVH SGNFLRPSPRVLICVPAGATQVERRAIKESAIGAGAREVYLIEEPMAAAI GAKLPVSTPTGSMVIDIGGGTTEIAVIALNGVAYSSSVRIGGDRFDEAII AYVRRTFGSIIGEATAEHIKQEIGTAYIQDESEVKELEVYGSNLAEGAPR AFRLTSHDVLEAIQQPLDGIVTAMRTALEECKPEHAADIYERGMVLTGGG ALLRNIDVLLSKESGVPVVVAEDPLTCVARGGGEALEMIDKHGGDIFSED >MS0592 mreC, MreC protein MKAIFTKAPSLGLRLVLAVMLSVGMILFDGQTNIMIQTRNFIDTAVGGLY YLANTPRTVLDNVSDNLVDTNKLQIENKVLKQQLREKNADLLLLDQLKVE NQRLRLLLNSPLRTDEYKKIAEILTAETDVYRQQVVINQGRNDGAYVGQP VIDEKGVIGQIISVGEAASRVLLLSDVTHSIPVQVLRNDVRVIASGTGRT DELTLDNVPRSVDIVKGDLLVTSGLGGRFPEGYPVAVVENVSRDGSNYFA TVSAKPLASLERLRYVLLVWPAGDDIHKARAASPEDVRNAVKQRLANTAS EQKKIPVTEDDATKAPVQLNNSEENIPSPENLPEMNRNDTQVDPELKEHR EED >MS0593 mreD, MreD protein MKGNFFVQLFALLAIFIVALVLEISPWPAGFHSFKPAWLVLALTYWVLAL PTRINIGTAFIFGVVWDVLLGTVLGVHALVLSCFAYLIARYHQILRNLSL 
WQQSLLIVLLVFFVRLGVFLLELFIHSAEFDWKEIFGALISGLLWPWVFL LLRKIRRQLGLH >MS1632 mrp, Mrp protein MSIIYTDNLSAGQQAQIQTLFQQYRHPSLKKDLIALSAVKKAEKGGDTLR IELSMPFPWNSAFEQLKADLSDKLLSATESKNIKWQLTYQIATLKRANNQ PAVKGVKNIIAVTSGKGGVGKSTVSVNLALALQAQGARVGILDADIYGPS IPHMLGAPDQRPTSPDNQHITPIQAHGLFANSIGFLMDEENATVWRGPMA SSALSQLLNETLWPDLDYLVIDMPPGTGDIQLTLSQQIPVTGAVVVTTPQ DIALLDAVKGISMFNRVSVPVLGIVENMSMHICSNCGHHEAIFGTGGAER IAQKYHVEMLGQLPLHICLREDLDKGTPTVVSNSNQEIRDAFMQLAEKIG YELYFQGAVIPSEIMFREVK >MS2196 mscL, MscL protein MIAINFRRIFMSFMKEFREFAMRGNVVDMAVGVIIGGAFGKIVSSLVGDV VMPVLGILTGGVDFKDLKFVLAEAVGETPAVTLNYGLFIQNVFDFIIIAF AIFMMVKGINKLKKPVEEAPKGPTSEELLSEIRDLLKK >MS2333 mscS, MscS protein MAAEEQAKEVAQQVDVVAETSKVIDKVSNMDLNAVLHDWVIPYGTKILLA IAIFVIGKMLARGISKLLGKAALASTKDEMLQSFVTSISYFLFLLIVVIA SLSQLGINTSSLVALIGAAGLAIGLSLQNSLQNFASGVMLLIFKPFRKGD LIETGGMTGVVEEMGLLVLELRTGDNKTVLIPNGKVFSDSIVNYSDNKTR RIDFTFDVSYESNLKEAKDVVARILADNELVLKHPAPIVAVGALAANCVQ LVVRPWVKTADYWTAYWGITESVKLEFDKAGIVIPYNQMDIHISGNTASE LNNELNK >MS0411 mtlA, MtlA protein MLSANAKVKIQSFGRFLSNMVMPNIGAFIAWGFITALFIPTGWFPNEMLA KLVGPMITFLLPLLIGYTGGKLVGGDRGAVVGAITTAGVIVGTDIPMFLG AMIAGPTGGWAIKSFDKWADGKIKSGFEMLVNNFSSGIIGMILAILFFWV VGPAVKIISDWLAAGVDVLVNAGLLPLTSIFVEPAKILFLNNAINHGIFS PLGIQQSQEFGQSVFFLIEANPGPGLGVLLAYMIFGKGSAKQTSGGAAII HFFGGIHEIYFPYVLMNPRLILAVIAGGATGVFTLVLFNAGLQAPASPGS IIAVLAMTPKTSFLGVITSVIAACAVSFVVASFFVKLQKEDESGKLEEAQ AASKAMKSNTSQQVTNYDGLKKIFVSCDAGMGSSAMGASMLRKKINDAGL PIEVANCAINDLPEDARLVITHQDLTLRAKKQVPNAMHFSLTNFLDNKFY DSLVNDLKANFDEKAPVAQAKEGEIEVNGTTFSLQPEQIFLGLKANDKFA AIRFAGEQLVKAGFVQPSYVDAMFEREKLVSTYLGEGVAVPHGTIEAKDA VLKTGVVVCQYPEGVRFTDEEDGVAKLVIGIAARNNEHIQVVSAITNALD SDEAIELLTSTNDVNKVLELLKA >MS0529 mtlD, MtlD protein MKLLNRTNFPGRQHPTKIIQFGEGNFLRAFIDWQIDILNEKTDLNAGVTI IRPINTDFPPSLNTQDGLYTTIIRGLDENGNKVKESRIIRSVNNEINIYQ SYDEYLQLAHNLEIKFIFSNTTEAGISYHADDKFDDRPQVSYPAKLTRLL YERFSVVNGDKDKGFILLPCELIDYNGEQLKELVFKYAKEWNLSAEFIQW LETANTFCSTLVDRIVTGYPRAEAAELEAELGYKDTFLDTAEHFHLFVIQ GPKSLAQLLRLDQVDLNVLIVDDIRPYKERKVAILNGAHTALVPVAYMAG 
VNTVGEAMNDAELCRFVKSTMDKEIIPVLSLPQDELQQFADAVIKRFQNP FIQHQLLSISLNSMTKYRTRNLPQLISYVEKFGKLPPHLTFALAALIAFY RGERDGQTIPLQDDEHWLVNFKTWWQEQAAGEISLFQLVHHVLKQEAHWE QDLTTIPQLVETVTQQLEAILEKGMRQALREYCVD >MS0410 mtlD, MtlD protein MKALHFGAGNIGRGFIGKLLADSGMQVIFADVNDSVIDLLKSRRSYGVKI VGDSINTVERVTQVTGVNSKDETAIITLFNEVDLVTTAVGPNVLKIVAST FAKALEARIAGGNTKPLNIIACENMVRGTSFLKEQVFTHLNPDYKDKVEQ LIGFVDSAVDRIVPPVKPDAEDPLLVTVEEFSEWIVDQTQFKGAIPDIKG MELTDNLMAFVERKLFTLNTGHAVTSYYGKFKGYKFVKESIEDESVKAFV KSVMQESGAVLIKRYGFDPQAHAAYIEKILKRFANPYLVDDVDRVGREPL RKLSYNDRLIKPLRGTIEYGLPNDNLIRAIATALSYRNENDPQALELAKS LAEAGVTQTIKKYTELQDENVIARIAKAYETL >MS1222 mug, Mug protein MSVQIIETHPFPPVLPARATVMMMGTFPPKSEKRCMEFHYPNFQNDMWRI YGLIFFEDKEYFQVPGEKRFDAERIKAFLHERGIASCPTVIKAVREQGNA SDKFLKIVEPVNLTQVLQKVPNVRWLFTTGGKATEALFSLVPELKLKEPK TNEYIDFPFQGHELKLYRVPSTSRAYPLSLEKKAEAYRKFFELSGILK >MS1084 mukB, MukB protein MSEELELESEFLPEEKSETIVPATVLTQSAGIERGKFRSLTLINWNGFFA RTFDLDELVTTLSGGNGAGKSTTMAGFVTALIPDLTLLHFRNTTEAGASG GSRDKGLHGKLRPGVCYAVLDAVNSRQQRILAGVRLQQVAGRDKKVDIKT FSIQGLEISQNPTAVLTETVSQRQARVLSLTELKDRIEEQGAQFKQYHSV ADYHAMMFELGMIPRRLRSSSDRSKFYKLIEASLYGGISSAITKSLRDYL LPENLGVRKAFQDMESALRENRMTLEAIKMTQSDRDLFKHLITETTNYVA SDYMRNANERQGNIETALSFRKEWYAAKSEQDLSQHRLIDLSREAAELTE NEKALEIDHQSASDHLNLVLNALRHQERIERYQEDVNELTEKLEEQKIVV ENANEQLEESQLQFETLETEVDQIRGQLADYQQALDAQQTRALQYQQAIQ ALEKAKALCGLADLSVKNAEVYHEEFEAQVETLTDRVLQLEQKMSISEAA KTQFDKAYQLVCKIAGEIPRSAAWDSAKELLREYPTQKLQAQQTPQLRAK LHELEQRYQQQQSAVKILKDFNQRAGLSLETADELEDYHAEQEALIENLT AEFSEQVETRSTLRQKLEQLTALFEEKARSAPAWITAKSALERLTEQSGE QFEDNQDVMNFMQAQLEKEREFTMQRDQLEHKRQQLDEQISRLSQPDGSE DARLNVLAERFGGVLLSELYDDVAIEDAPYFSALYGPARHAIVVRDLNAV KEQLAHLDDCPDDLYLIEGDPAAFDDSVHSAQELAQGVVVQVSERELRYS KFPEIPLFGRAAREKYLTELEAERDKIVEQYAQRAFDVQKCQRLHQQFSQ FVGLHLALAFQPDPEQQMREINQQRNEINRELTALSTDEQQLRIKLDNAK EQMQLLNKLIPQLNVIADESLSDKVEECREQLDIAEQDEIFIRQHGMTLS QLEPIANTLQSDPENYERLKDDLYQAIDMQKQAQQKAFALADVIHRQAHF SYEDTVKTETNDLNEKLRVRLEQVQAKREQQRDQVRQKQQQFAQYNQVYI QLQSSFETKNQMLKELMDEVGELGLTVDENSEQRARVRKDELHHQLSTSR 
QRRSFVEKQLTLIESEAENLTRRIRKAERDYKQQRELVVAAKVSWCVVLR LSRNSDVEKRLTRREFAYLSADELRSMSDKALGSLRTAVADNEYLRDALR ISEDSRKPENKVRFFIAVYQHLRERIRQDIIKTDDPIDAIEQMEIELSRL TDELTGREQKLAISSESVANIMRKTIQREQNRIRMLNQGLQNISFGQVKS VRLVVNIRDTHAMLLDALSGNQEEYQDLFNDNRMTFSEAIAKLYQRLNPH IDMGQRTAQTIGEELLDYRNYLDLQVEVYRGADGWLQAESGALSTGEAIG TGMSILLMVVQSWEEESRRIRGKDIIPCRLLFLDEAARLDGKSISTLFEL CERLDMQLLIAAPENISPEKGTTYKLVRKISGNQEHVHVVGLRGFGAKE >MS1085 mukE, MukE protein MNENLQELIPTKLAAAIANPLFPAVDSQLRSGRHIGQEYLDNFAFLADFQ NELDMFYRRYNVELIRAPEGFFYLRPKATTLIARSVLSELEMLVGKVLCY LYLSPERLAQQGIFSVQEVYDELLNLADESKLLKAVNQRSSGSDLDKQKL AEKVRAAVNRLRRLGMIHTVGEQNSGKFTISESVFRFGAEVRSGDDPREA QLRLIRDGEAATPDSLSQEKSAVKNDEEIEDELDEGLGEEE >MS1086 mukF, MukF protein MLETSQTIPELVSWAKEREFSLNLPTERLAFLLAIAIYNAERFDGEMVES DLVDIFRHVSNEFEQSKETIATRANNAINELVKQRFLNRFSSEFTESLSI YRLTPLGVGVSDYYIRQREFSALRLSVQLAIVANEIQRASELAEEGTAKQ EDEYYWRRNVFAPLKYSVAEIFDSIDLSQRIMDENQQSIKEEIAELLTKD WQAAIASCERLLDETSGNLRELQDTLNAAGDKLQEQLLRIQDCVIGRDDL YFIDQLITDLQAKLDRIISWGQQAIDLWIGYDRHVHKFIRTAIDMDKNRV FSQRLRQSIHNYFDMPWYLWTAQAERLIDLRDEELALRDEDALGELPEEL EYEQLSDLHDQIVDYMQNLLIAQRERNQPIDLSLVLKEQLEGYPLARHFD VARIIVDQAVRLGMASADLSGTYPQWQEINNRGAEVQAHVIDEYK >MS1707 murA, MurA protein MEKFRVYGQSRLTGTVDISGAKNAALPILFASILAEEPVILTNVPDLKDV ETTFKILRKLGVNVECAEEPGKVLIDAGNINQFVAPYELVKTMRASIWAL APLLSRFHEGQVSLPGGCTIGARPVDMHISGLEKMGAAIELDEGYVKATV NGRLKGARIYMDKVSVGATLSIIMAATLAEGKTVIENAAREPEVVDTAIF LNAMGAKISGAGTDTISIEGVERLAGCRHRIVPDRIETGTFLVAAAISGG RITCRGTKADTLEAVIEKLREAGMQIDITEDSITLDSLGQRPKAVNIRTM PHPGFPTDMQAQFTLLNVVAEGTSKITETIFENRFMHIPELIRMGAKAEI EGNTAICHGVEHLSGAQVMATDLRASISLVLAGCIASGETIVDRIYHIDR GYERIEEKLRGLGARIERFSD >MS0028 murB, MurB protein MQSLKPFHTFAVPAQAKNIVEITALEQLQQVWDGCRQENQPVLFLGQGSN VLFLKDFAGTVLINRLMGIEHNEDEQFHYLHVNSGENWHNLVEWSLSQSI GGLENLALIPGCAGSAPVQNIGAYGVEFKDVCDYVDVLDLNQGKQFRLTN AECEFGYRESVFKHKYAQGFIVTAVGLKLAKAWQPVLKYGTLANFDKSAV GFQQIFDEVCAVRRAKLPDPKEFGNAGSFFKNPVISAGHFALLQQEYPNI PNFPQDDGSVKLAAGWLIDQCQLKGYQIGGAAVHQNQALVLVNKGDATAS DIVELAHHVRQSVAAKFDVYLSPEVRFIGELGEVNAEQAIS 
>MS1614 murC, MurC protein MKHIHILGVCGTFMGGLAIIAKQMGYRVTGSDTNVYPPMSTFLQEHNIEI IPHFEVSQLQPAPDMVIIGNAMKRGNPCVEYVLENRLPYMSGPQWLHDNL LCNRWVLAVSGTHGKTTTTGMLTWILEQNGLNPGFLIGGIAGNFGMSSRF TDSPYFVIEADEYDTAFFDKRSKFVHYNPKTLIINNIGFDHADIFDDLKA IQRQFHHMIRTIPASGRILSVATEQSVKETLDMGCWSEKQFLGKEQEWNA ERITNDCSRFAVFHLGEKVAEVHWDIVGQHNMHNALMAIAAAYHAGVKIE DACRALATFVNAKRRLEVKGEVGGVTVYDDFAHHPAEIQATLTALRDKVG GGVRILAVLEPRSNTMKMGVHKDEIAPALVRSDYVFLLQPDNIPWEVVEI ANKCVQPTKWTADLDKLVDFVVQEAQPTDHILVMSNGSFGGIHQKILDKL ANK >MS1666 murC, MurC protein MINAKKEFQQRVRNMIPGMRRVHQIHFVGIGGAGMGGIAEVLLNEGYAVT GSDIAESAVTNRLISLGAKIHFSHAASNVDNASVVVVSSAIKADNVEVVA AHEKRIPVIQRAQMLAEIMRFRHGIAVAGTHGKTTTTAMISMIYAQAGLD PTFVNGGLVKSAGTNAHLGCSRYLIAEADESDASFLHLQPMVSVVTNIEP DHMDTYHGDFDEMKRTYVNFLHNLPFYGLSVMCADDPVLLELIPQVGRPV ITYGFSEEADYRIENYEQTGFQGHYSVITPAGERIDVLLNVPGKHNALNA TAALAVAKEEGIENEAILAALADFQGAGRRFDQLGSFIRPNGKVMLVDDY GHHPTEVNVTIQAARKGWENKRIVMIFQPHRYSRTRDLFDDFVRVLSQVD LLIMLDVYPAGESPIAGADSRSLCRSIRNLGQVDPILVTDTAELPEIMDR VLQDGDLVLAQGAGNVSKLSRQLVELWTKA >MS1669 murD, MurD protein MTDYQGKNITVIGLGKTGLSCVDFLTAKKANVRVIDTRKIPAGAEQLDKS IPLHTGSLNQQWLLESDMIVISPGLSVKTAEIQTALSAGVEVVGDIELFC REAAKPVIAITGSNGKSTVTALVTEMGKAAGLSVGMGGNIGIPALSLLNE NHDLYVLELSSFQLETTYSLKATAATVLNVTEDHMNRYADLEEYRQAKLN IYHHCQTAVINGEDPLTKEDDKQSAQQQVSFAENNADYWLKTENGKKYLM AKDKLILACDEIKLTGRHNHMNALAAIALAQAAGIKNSGILTALRTFPGL AHRFQLAHMANGVRWVNDSKATNVGSTVAALTGLHIEGKLHLLLGGDGKG ADFSELEKLINKPEIFCYCFGQDGAHLAKLSSQSQLFNTMEQAIETLRPT LKPGDMVLLSPACASLDQFASFEKRGEEFTRLAKLSVAQ >MS1672 murE, MurE protein MRKLTALFGRDDRFDAIKLNRMTLDSRSVRTGCLFVAIKGHSVDGRQFIP QAISAGASAVLKECDNADEHLQVSEQNQIPVISYYHLSEHLSDIADQFYK APSQHLTLVGVTGTNGKTTVSQLLAQWAQLLGRKPAVMGTIGNGLFGALK PAANTTGSAIEVQSSLADFVQQGADFAAIEVSSHGLVQHRIEALHFAAGI FTNLSRDHLDYHHSMENYASAKKRLFSELSCQQKIINADDEIGVQWLREL PDAVAVSCNPAYQPTQENWLKVTAVSFNSQGATITFNSSWGGAILTSRLI GAFNVSNLMLVLATLLSLGYSLDELLKTVSQLTGVCGRMEMLHAAHKPTV IVDYAHTPDALEKALQAARAHCTGRLWCVFGCGGDRDRGKRPLMAQVAER FADYVIVTDDNPRTEDRHQIVQDIVAGFKRPESVNIVYDREQAIRTAVQS 
AVENDVILIAGKGHEDYQIIGHTKHHFSDQEAVKKYLG >MS1671 murF, MurF protein MIKLNVKKIAQILKAKLIGEETLTIESVSTDTRRKMPNGLFFALKGEKFD GHNYLAEAVAQGCTAVVVDHPCEIDVPQLVVKDTRLALGRLAAWLRRELE PLTVAITGSCGKTTVKEMTAAILQRTAGDDEAVLFTEGNFNNDIGVPLTL LRLTEKHEYAVIELGANHAGEIAYTAHLTMPDVALVNNVSAAHLEGFGSV EGVAQAKGEIYSGLTPDGVAILNLDSNYAHYWGGDINDREFESFAYDHVG ADYYAEKIMLSEYGSRFTLNTPKGAIKIELPYLGKHNVANAVAASALAMN VGASLEDIKRGLENPSHVKGRLFPIQLSTNLLLLDDTYNANVASVKSAIS VLSDYREAFRIFAFGDMAELGDETISCHQEVADFAKAANLDLVVTYGSES AVVSKACGGVHFSNKEALIASLKEIISHQLKENEDIVLLAKGSRSMKMED VINSLKDRFLC >MS1667 murG, MurG protein MAQRKKLLVMAGGTGGHVFPAIAVAQYLQKQGWDICWLGTKDRMEAQLVP KHGIPIEFIQISGLRGKGIKALLGAPFAICRAIMQARKIILRQKPDAVLG MGGYVSGPGGVAAKLCGVPVILHEQNAVAGLTNVWLSKIAKRVLQAFPTA FPNAEVVGNPVRQDLFSMPDPEQRFAERTGKLRVLVVGGSQGARVLNLTV PEMAARLTDKLEIRHQVGAGSVEKITALYEEKGALSADVKITEFIDNMAE AYAWADIVICRSGALTVCELAAVGTPAIFVPFRHKDQQQYLNAKYLADVG AAKIVQQAELNADVLVDLLTNLDREQLLAMAIKAKQMSAPFAAQRVAEVI IENAN >MS1734 murI, MurI protein MIMNTEIKPTILFFDSGVGGFSVYKEVKQLLPNAHYLYCFDNAFFPYSEK SEEVIIERTLTVCKKINEQYPLDAIVIACNTASTVVLPTLRQHFAIPIIG TVPAIKPAAEKSQTKHIGLLATKGTVKRTYVTSLIERYAQDCIVEKIGST KLAEIAERKLHGESVDLIALRNELTPWIQLSDLDSVILGCTHFPLIKEEI QLCLPQVKFYFEPGTAIAKRVFDLLAGITPKDKTETDNCIFYTKHFELED KFIQALRFWGFKNLKLLSILE >MS0635 mutH, MutH protein MAAELHIPVPPDLKRDKGWVGQLIETALGAKAGSKPEQDFANLGIELKTI PINSAGFPLETTFVSLAPLIQTAGVNWHNSHLRYKLSKVLWIPIQGERQI PLAERRIGSPILWQPDPQQEARLQQDWEELMDYIVLGKVHEITAKIGEVL QLRPKGANSRAKTKGIGQNGEIIETLPLGFYLRKEFTAQILQNFLRNK >MS1516 mutL, MutL protein MPIHILPPQLANQIAAGEVVERPASVVKELVENSLDAGASRIQIDIENGG ATLIRIRDNGLGIAKEDLSLALARHATSKISCLDDLEAILSLGFRGEALA SISSVSRLTLTSRTAEQKEAWQVYAQGRDMETTIKPASHPVGTTVEVANL FFNTPARRKFLRTEKTEFAHIDEVVRRIALAKPQIAFTLTHNGKILRQYK SAVEIEQKLKRVSAICGEDFVQNALQIDWKHDNLHLSGWVAVPNFHRPQN DLSYSYVNGRMIRDKVINHAIRQAYGDYLTNEQYPAFVLYLDLDPNEVDV NVHPTKHEVRFHQARLIHDFICQGVGNALQSEQADFARYDTPASADEIQE PAANWHSSLIKPNRSAAGHNIFESASDKNISGANTYSHGSAKINRFSTKF AENIPHFSTKSVSKTEQKLYGNLLTTPAEAKKNTAINAESENSFEKNVST PQQSTQLSGQFLHSLALVKNQALLLQQGQDFYLLPLAKLQKLKFELTLQQ 
PDIAQQPLLIPILFRLNERQLAQWQKQKNFFLQSGFEFDENPAQHRITLN KVPSCLRQQNLQGCVIRLLEENHEKISDFLTALCNQLQLNEIHVLADALT LLTEVELLLKTQNKIQLAQLLISVDFTQYLQ >MS2244 mutS, MutS protein MNVMENLEQHTPMMRQYLALKAENPDILLFYRMGDFYELFYDDAKKAAAL LDISLTKRGQSAGQPIPMAGVPYHAVEGYLAKLVQLGESVAICEQIGDPA LSKGPVERKIVRIVTPGTVSDENLLPERQDNLIVAVYQEKDKFGLATLDM TSGRFQISEPENAESLKAELQRLAPAELLYCEDFADMQLIEHYKGLRRRP IWEFELSTAVQLLNRQFGTKDLRGFGVEKAILGLCAAGCLLQYAKETQRT ALPHIQSITLIQNNENIQLDAATRRNLELTQNLAGGTENTLASVLDKCVT PMGSRLLKRWIHQPIRHIQKLRQRQQIISEIIQLDLIGELQPYLQQVGDM ERILARVALRTARPRDLTRLRTALEQIPTIKDILKNSPKFTALFQQIGDF DELFALLQQAIIDNPPLLIRDGGVIAEGYNAELDEWRALSDGATKYLEDL EIRERESTGIDTLKVGFNAVHGYYIQISQGQAHKAPIHYVRRQTLKNAER FIIPELKTYEDKVLKAKGASLALEKQLYDALFDRLLPHLGALQLASLTLS ALDVLTNLAERAETLNYVAPDFSDEIGVKIENGRHPVVEQVLKEPFIANP VDLNQQRHLLIITGPNMGGKSTYMRQTALITLMAYIGSFVPAESALIGPI DRIFTRIGASDDLASGRSTFMVEMTEMANILHQAGANSLVLIDEIGRGTS TYDGLSLAWACAEWLAKKLRSLTLFATHYFELTVLPEQLAGTANVHLDAL EHGDSIAFMHAVQDGAASKSYGLAVAALAGVPKNVVKLAKQKLANLEKLS QQSADQKLQDLRTINQNQGELNLMEEEDGKNAALEMLAQLDPDDLSPKQA LAYLYQLKKLL >MS1694 mutT, MutT protein MLIFCEQVQKNYKKNLKIFNFELSLPIVFAGGSVMSELQQFSQQDIEVLN EETLYSGFFKMKKVRFRHKLFAGGMSEVVTRELLYKGAASVVIAYDPVRD EVVLVEQVRIGAYDPNLSSSPWLMELIAGMIEEGESPEEVAMRESEEEAG VTIDNLEYALSVWDSPGGTVERLYLFAGRVDSSKAKGLHGLACEHEDIKV HVVSRETAYQWVNQGKIDNSSAVIGIQWLQLNYRRLQKNWC >MS0709 mutT, MutT protein MNYKNPNSVLVVIYAKNSGRVLMLQRQDDPEFWQSVTGSLAEKEMPFLTA LREVKEETGIDIKRENLTLVDCHQSVEFEIFPHFRYKYAPNVTHCKEHWF LLELPDERVPVLTEHLAYQWLEPAKAAELTKSPNNAQVIRKYLINKSA >MS2341 mutT, MutT protein MLKPHVTMACIVHCKGKFLFVEEIEYGKRTLNQPAGHLEENETILEGASR ELYEETGIRAKMQHLVKIYQWHAPRSQKDYLRFVFALELDDWAEITPHDS DITQGFWLTLEEFNYYIRQENQCARNPLVTEALEDYLAGSRYPLDILTLF NN >MS0328 mutT, MutT protein MDKKTVQVAAGIIRNEFGQIYLTQRLEGQDFAQSLEFPGGKVDVNETPEQ ALKRELEEEVGIVALNPVMFEQFVFEYPNKIIHFYFYLISEWIGEPFGRE GQEGFWIEQLDLDESQFPPANSKLIQRLLAEMNC >MS0019 mutT, MutT protein MNLLQKPEILGISVAAKSRIFEIQAVELKFSNGELRTYERFKPSSRCAVM VLPIDGEDLLMVREYAVGTERYELGFTKGLMEAGETPEQSANREMQEEIG 
LGAKQFMLLRTVNSSPSFMNNPMHILIAQDFYPSKLPGDEPEPLQLVRVP LANINELIEDPGFSEARNLVALYTLRDYLRKLK >MS0408 mutT, MutT protein MIDFDGYRPNVGIVICNRKGQVLWAKRYGQNSWQYPQGGINDGETPEQAM YRELYEEVGLTRRDVRIVYASKQWLRYKLPKRLLRYDSKPMCIGQKQRWF LVQLMSDEKNINMNCSKSPEFDGWRWVSFWYPVRQVVSFKRDVYRKAMKE FACFLFDANKTVNPLSTNNNDEKKANYSAKKPYSPYRNQDKKRKTRV >MS0258 mutT, MutT protein MNLVYFCKMYRFQMTDKLLNEPWLTWAIQIQAIAQNGLAYCQNVYDIERY EQLRDIAVEMLSYKTAIPQDKVKNLFCNEQGYQTPKVDTRAAIFKDDKIL LVQESDGLWSLPGGWCDVLESIDSNTVKETREEAGLDINTKFIIAIHDQH KRNYPPFAYAVLKTFVMCELIDGEFQPNSETIASDWFALDELPPMAEEKN TPSQVELCFQAHHSKHWVTQFD >MS0317 mutY, MutY protein MLAQSSIQAPFARSVLRWYDKYGRKNLPWQKNKTFYQVWLSEVMLQQTQV STVIPYFERFIDAFPTINVLADAPLDEVLHLWTGLGYYARARNLHKAAQT VRDQYGGEFPTDFQQVWDLTGVGRSTAGAILSSVLNAPYPILDGNVKRVL SRYFTVEGWAGEKKTENRLWRLSAEVTPTERAADFNQAMMDLGAMVCTRT KPKCGLCPLSKKCGATLTNSWEKYPAKKPKKQLPERESYFLILAQNGKVA LEQREQSGIWGGLYCFPQFEDKSTLLQYLQQLGIREYQEWSAFRHTFSHF HLDIFPIYAQYRQTERDENRSDWKKIEENGADYKSTISSTINYWYDPENP DQIGLATPVKNLLTEFQKGQHYVKNRIL >MS1388 mviM, MviM protein MGMKLGIVGTGMIVRDLMQTLHKVRLEQLAIWGRDQAKTAQFAAEQGILQ VFSDYAAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTD EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ YSSRYDRFKAGETLPVFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCTYA ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP ANAMNRFTYTPNQGEAQHFEFGDVHRMLPEFERFVDIVDRKDFAQAEKML DISAAVSEVIEQARKGAGIRFAGE >MS1414 mviM, MviM protein MDMKLGIVGTGMIVADLMQTLHKVTLEKLAIWGRDQVKTTQFASENGISQ VFADYEAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTG EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ YSSRYDRFKAGETLPAFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCAYA ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP ANAMNRFTYTPNQGEAQHFEFGDAHRMLPEFERFVEIIDRKDFAQAEKML DISAAVSEVLEQARKGAGIKFAGE >MS1528 mviM, MviM protein MKKINVGIIGTGFIGAAHIEAIRRLGFVDVIALAENNQQLAEQKAKELNI PLAYDCVDKLLANPDIQVVHNCTPNHLHFAINKKVILAGKHVFSEKPLCL TSQEADELTSLAEQQGVTTAVGFVYRNFAMVQQAADMVRDQQIGRVFAVN GHYLQDWMLLETDYNWRVDPKVGGKSRTVADIGSHWCDTVQFVTGKKIKE VFADMSIVYSTRKASKQVESFVTVNADSSYELKPVETEDYASVLVRFEDG 
SKGSFTVSQVSAGHKNDLTFDISGSEKSLHWEQETPQYLKIGYRQQANQI LCDDPSLVNPAVRAYNHFPGGHIEGWPDAFKNMMLAFYAFIAEGKDPQQD TAKFAMFKDGAQIVHIVDTIIESAQQGKWISVK >MS1500 mviM, MviM protein MKKFALIGAGGYIAPRHLRAIKDTGNTLVVAMDVNDSVGIMDSHFPDAEF FTEFEQFEAFVEDQKLKGEKLDYVAICSPNYLHAPHMKFALKNGINVICE KPLVLNSTDLNMLSEYEQKYGAKVNSILQLRLHPSIIALRDKVEAAPADK VFDVDLTYLTSRGKWYLKSWKGVDQKSGGVATNIGVHFYDMLHFIFGDVV KNEVHYRDEKTVSGYLEYKRARVRWFLSIDANNLPENAVQGEKLTYRSIT IENEELEFSGGFTDLHTQSYQRILEGKGYGLEENRTAIETVEVIRHAPII ENPANPHPFLAKVLNK >MS1755 mviN, MviN protein MSKRLLKSGIIVSTMTLLSRVLGLVRDVVIANIIGAGATADVFLFANRIP NFLRRLFAEGAFSQAFVPVLAEYQRSGELSKTQEFIGKVSGTLGGLVSIV TLLAMVGSPVVAAIFGTGWFIDWINDGPNAEKFTSASLLLKITFPYLWFI TFVALSGAILNSLGKFGVMSFSPVLLNIAMITTALLLAPQMESPDVALAI GIFIGGLLQFLFQLPFLKKAGLLVRPRWAWNDEGVKKIRTLMIPALFGVS VSQINLLLDTFIASFLMTGSISWLYYSDRLLEFPLGLFGIAISTVILPTL SRQHVNRADDVQKSAADFRATMDWGVRMILLLGVPATIGIAVLAQPMLLV LFMRGQFSLTDVQATSYALWSINVGLLSFMLIKILANGYYARQDTKTPVK IGIIAMISNMVFNLLAIPFSYVGLAMASAMSATLNAYLLYRGLAKADVYC FTKQSAVFFLKVLAAALVMGTVVWYFSPQLVIWNEMAFLTKVIRLAELIL IAASSYLLMLVILGIRKRHLLAR >MS0182 nPY1, NPY1 protein MQLIRSSDYGFWLLSQGSHIHLVNNYLPEGRAEDFHLQGKKGMVIGELDR QPLWLVEEQPNDTRAYFDLRDQLYLPERTFNLLNRGVELNHFFKTHQFCG KCGDKTMQTEDEWAVQCTNEECNYRTYPVICPSIIVAIRRGKEILLANHR RHAPKYGKGGMYTTLAGFVEVGESFEQTIHREVFEETGIKVKNIRYFGSQ PWAFPNSQMVGFLADYESGEIRLQEEEIADAKWFRYDEPYPEFPEKGTIA RALIEATLKLCAEHQDK >MS1888 nadE, NadE protein MKTAAYADYLIQWLENQRTELYGMDGYTLGVSGGIDSAVCAHLAARTGAP VQALILPAEVTSPSDVADAQATLESAGIDGQIISIAPWYDLIMQQLSPVL NSEPERVNVLKGNLMARLRMIALFTTAQSHRSIVLGTDNAAEWLTGYFTK FGDGAADVLPLAGLRKEQVFELGRYLGVPQSVLDKKPSAGLWAGQTDEAE MGVTYAEIDAYLRGETVSPQALQQIRFWHNRSHHKRMLPPKPKSPDEAEC >MS0169 nadR, NadR protein MSNFSYLQQKRKQLNLKVNDICEQANVTRAYFNQLVSGKIKNPSAAKLTA LHKALQITEQDNKKVGVIFGKFYPVHTGHINMIYEAFSKVDELHVIVCSD TERDLQLFYDSKMKRMPTVQDRLRWMQQIFKYQKNQIFIHNLVEDGIPSY PNGWRAWSNAAKALFKEKEINPTVVFSSEPQDKAPYEKYLNLEVHLVDPA RESFNVSATKIRTQPFKYWKYIPKEVRPFFAKTIAILGGESSGKSVLVNK LATVFNTTSAWEYGRDFVFDKLGGDEQAMQYSDYPQMALGHQHYIDYAVR 
HAHKVAFIDTDFITTQAFCIQYEGKPHPFLDSMIKEYPFDVTILLNNNTK WVDDGLRSLGDYKQRQRFQQLLKKLLDKYKVPYIEIESPSYLERYDQAKA IVEKVLNDEEVSELTHEND >MS2204 nagA, NagA protein MKYALTNCVIYTAKNVLYEHAVIVEKDKIQAVLPERELIPEIQRINLKGS NLTAGFIDLQLNGCGGVMFNEDISVKTLEIMQETNLKSGTTSYLPTFITS PDEDMKSAVKIMRDYLAKHKNQALGLHLEGPYLSVEKKGVHREEYIREIS PEMKAFLLDNADVISKITIAAENPAMQYAGEFVEKGIIVSVGHSNGTYEQ AKRAFAQGASFATHLHNAMSPVSSGRAMGVVGAVLDSDEVYSGIIADGLH VAFGNILIAKRAKGDKLCLVTDATAAAGADIEQFTFVGKTVYVRNGKCYE ANGTLGGSAVTMIESIRNAVEQVGIPLEETLRMCNYYPAKAMKVDDRLGS IEAGKIANLTAFTHNFDIVGTSVNGEWKFA >MS2205 nagB, NagB protein MRLIPLKNDEQVAKWSAQHIVDRINAFNPTEDHPFVLGLPTGGTPLKTYR ELIKLYQAGKVSFKHVVTFNMDEYVGLPKEHPQSYHSFMYNNFFNHVDIP EKNINILDGNTPDHDAECRRYEEKIKSYGKINLFMGGVGVDGHIAFNEPA SSLSSRTRIKTLTPDTLIANSRFFNNDVSQVPKYALTIGVATLLDAEEVM LLITGHQKALALQACVEGAVNHLWTVSALQLHRHSIVVCDEPATQELKVK TVKYFTELEAYAIHSVI >MS0015 nagB, NagB protein MNYITFPTAQAAVEKIAQEFVLYSQLDRPVHISLSGGSTPKLLFKTLATP PFNTKVRWENLHFWWGDDRMVVPSDPESNYGEVQKLLFDHIRIPVENIHR IRGEENPDQELARFSAELTACVPNLEFDWIILGMGSDGHTASLFPHQTNF ADENVALIAKHPESGQIRISKSAKLIEQAKRITYLVTGSAKAEILKQIKT TPAEQLPYPAARIRAKNGITEWYLDADAAKLL >MS1413 nagC, NagC protein MTRNEEALDIKHTNYRNIYRLFFQYNGLSKPQIVKLLNLSLPTVSNNIGE LEAEGKIREGGFFQPQGGRPAIAYQLVENAFISIGVEIQKKNVRCLALNL QGNILAQKDTALYFENEPQYIESLCNIIHTFIRSLGCLYTQILGIGFSIQ GIVSKDGQSMLYSRVLPGEHFDVKELQPYFDVPVKLFHDVKCAALTELWF SEQIDNAVYISISEHLGGAIIINNQIDLGKKGYSGALEHLQIHSEGNLCY CGQRGCLETYCSLSALLSPNETIEAFFKALRNKDELVLMRWDAFLEHLAK GLNTVYLLLERDIILGGEIAFYLIPEDLKILQEKILKLSTFPLEGDFIRI ATQQKYTSAIGAALPFLIEYLP >MS1527 nagC, NagC protein MKNGITWKNSLFLRMIMLYGFDIGGTKIELAVFNDKLERQYTERVETPKD SYEQWLDVIVNLVEKADQKFACKGSVGLGLPGFVNHETGIAEITNIRVAD NKPIIKDLSERLGREVRAENDANCFALSEAWDEENQQYPFVLGLILGTGF GGGLIFNGKVHSGQIGMAGELGHLQLNYHALKLLGWDKAPIYDCGCGNRA CLDTYLSGRGFEMLYRDLKGEALSAKEIIERFYAADKTAVDFVGLFIELC AISLGNIITALDPHVIVLGGGLSNFDYLYEALPKALPKHLMRSAKVPVIK KAKYGDSGGVRGAAALFLTK >MS1508 nagE, NagE protein MGLFDKLFGSKNSKTVEVDIYAPLSGEIVNIEDVPDVVFSEKIVGDGIAI RPNGDKIVAPVDGVIGKIFETNHAFSMESKEGIELFVHFGIDTVELKGEG 
FTRVAQEGQSVKRGDTIIEFDLPLLEQKAKSILTPVVISNMDEISALDKK VGQVVAGDSVVLSLKK >MS1408 nagE, NagE protein MSDKNFIMPMSGELLSLEQVPDSNFSQKLLGDGFAVRLSGEVVVSPFSGV VIAAFPTGHAFIIRREDGLEVLIHIGLNSAGKADAFRMQINKYDEVKQGD VLVYVDTNKLSDQQEDLISPIVFANPDIKISLHKLNQAVMVGDDSAVTLD >MS2278 napB, NapB protein MRKYLTLILAAFAGFAVAEEPSSSLTMEQIPENIAPAYTNPQKDAGNIPT TFPFQPPLVPHSVRGLQVTKNANQCLSCHSPEVSPTTGAPRVPESHFLDR DGKPTEGTSPRRYFCLQCHVQQTDVNPIIQNKFESIRAKQGK >MS2282 napD, NapD protein MSNFNLNENWYVCSLVVQARPEKLSQVKADILAIPTAEIHGEKLEEGKLV VTLESSRQLALADLIDEVKDISGVIVVSLISNYLDEK >MS0247 napF, NapF protein MALLITDKCTNCDMCLPECPNEAISVGDEIYLIDPALCTECVGHYDTPTC QKVCPINKCIITDPDHIETQDQLWERFVLIHHADQV >MS2283 napF, NapF protein MMDRELPRRQFLRGGFLKSLQSETVKQQGFLGVRPPWTVAEAQFVADCTR CGDCIAVCETQILVKGVGDFPEVRFSRGECTFCMKCVEVCRQPVFRPTEE AAWQHKVEIQTGCLANNQVECRSCEDNCERRAIRFKREIGSVAKPQIDLE LCNGCGACLSVCPVLAIKVLTTSAGE >MS2334 napF, NapF protein MTAMQGKNEQYYKAYLTYNRISRRALLRGVFHPAEQATQIREFRLAPRPP FAAAEDLFLAACNGCGACVAACPYNLIRISGQKAMLELEYAACDLCGKCA ESCSTHALHPAFKKDTQLRPHFSEHCLLKQNQFCAVCQEICPQQAISADL QLNHEVCNGCGECKLACFVSAIQLI >MS2280 napF, NapF protein MKLDPNRRQFLKNATRTAAGVCGIGVILGLQQHQANAKEGVALRPPGALA EKDFLAACTRCGQCVQACPYDMLHLASLLSPMEAGTPYFVARDKPCEMCP DIPCMNACPSGALSEELTDINDARMGLAVLLDHETCLNWQGLRCDVCYRV CPLIDKAITLDRIHNDRTGIHAKLIPTVHSDACTGCGKCEQACVLEEAAI KVLPMDLAKGLLGRHYRLGWQEKQNAGKALLEEQHPDGLRPAFDARMPEG QVEPVYQHMKVQPDVKVATPNRATYDYVPNPTTVDAPEHYPNLDLNTKGV K >MS2279 napH, NapH protein MATVKTTPNKPKDAGLEARQKLGWWHAYRFLILRRLSQLSIILMFLSGPL WNVWILKGNYSSSMLFDVVPLTDPLITAESLATGYLPEWTTIVGALIIVA FYAVFASKAFCSWVCPMNIVTDAAAWLRRKLGIRQSAKLPRNLRYVILVM ILLGSAVSGTLLWEWINPVAALGRVFVFGLGATLWLVAVVFLFDLLVVEH GWCGHLCPIGAAYGLIGAKSLIKINVVDRERCDRCMDCYNVCPEPQVLRL PLHGSESDSPIVLDKDCITCGRCIDVCPENVFAFGSRFEKQVQVKNI >MS0181 ndh, Ndh protein MKNIVIVGGGAGGLELATYLGNNLGKKQRANVVLVDRNQTHLWKPLLHEV ATGVLDSETDAVSYRAHAHNHYFNFEQGSITRIDRTNKYVELAPVTGQEG DVLVVARRIPYDYLVIAIGSKSNDFNTKGVAENCIFLDSPNQALRFQHKM LELFLKFSENNALEEIGEDDSKQRLVQDGKVNIAIVGGGATGVELSAELF 
NAAQHLSSYGYGKIQSGHLQVTLIEAGDRILPALPERISSSVQQELENLG VTVKTGTMITEATEKCLITKEGEEINADLMVWAAGIRVSAITQQFDGLEV NRINQLNVKNTLQTTVDDSIFAIGDCAFLLQKDGKPVPPRGQAANQMATI CGQNIVALFNNKPLKDFHYFDKGSLVSLSKFTALGNITTGKRSSLTIEGR LARLAYISLYRLHQQKLHGCFKTGLIILIGRLNRFIRPSLKLH >MS0668 ndk, Ndk protein MSLERTFSIIKPDAVERNLIGKILARFEQSGFEIVAAKMVRLTKAQAEGF YAEHQGKPFFEDLVEYMVSAPILVSVLQKENAVKDYRTLIGATDPAKAKE GTVRKEFAESLRRNSVHGSDSLESAAREIAYFFIDSEICSR >MS1944 nei, Nei protein MRNPNFCSGFLLCRQNPDRNTVMPELPEVETAKNGITPYLEGYLIEKIIV RQPKLRWEVSPQLAQISQQKITALSRRAKYLIIHTEQGYIIGHLGMSGSV RIVSARDPVDKHDHLDIVMNNGKIMRYNDPRRFGTWLWSANLDEFHLFLK LGPEPLSDEFNAEYLFKKSRKKQTPVKNFLMDNSVVVGVGNIYANETLFM CGLHPEKITAKLTKAQCALLVEKIKQELKRAIEQGGTTLKDFLQPDGRPG YFAQELQIYGKKGAPCPNCGTKIESLVVAQRNSYFCPKCQKK >MS2153 nemA, NemA protein MMNPKYQPLFEPYTLNNGVEIKNRLTVAPLTIYDSGKDGEMTETGRRFWQ NRFEGFGLYIMPFTNVHPSGIGFESPNAFDERHLPTLREYAEMAHSQGAK AVVQIAHSGLRADPAMTQGAELVAATGDYYGCFRTMSEQEVWDMVTNYAY AAELVLRAGFDGVEIHGANGWQIQQFFSASTNLRNDYWGGTLEKRMRFPL AIIDGIDEMRQKHNRPDFIIGYRFSPEEPGEDGITMKETLALVDALLEKP LQYLHISLWDFYKKVRRGADTHLTRMQVVHDRIAGRLPFFGSGNLYTADD MLKAYQTGWVESVSIGKSIMLNPNLVELIETGRESEIESAFDWDKADYYR YTPAMLDGTRAGTDFFPPSKQNGVRYKTNHF >MS2010 nemA, NemA protein MNKKFERLFETVTFPNGATISSRFAMGPMVIVGSESNGEIGADDLAYWQR RNDAGSLLITGATAVSDYSDAYGNGLKLHKDELLDGWKQLAAVMKAKGNR AVVQLFHAGYRAAFTYKDKGVAYSASSKEYGFLDYPVTGMTEAQIEDTLN EFAAAAKRAIDAGFDGIEIHGANRYLIHQFFSAVSNVRDDQWGGSLENRA RFALEVVKRIQEVIKQYAKADFILGYRISPEEIHREGNGFTFDEALYLID EVAKLGVDYFNVSQSGVRGFAAEPKAGAYMGQAISKVIKTRLVGRALLLA SGDLTSPDKILEAVTEYADITSNATMVLLDPDTKNKIQSGREDEVSLAVD ETTIDDLKLPKAFYKIAPMIVTSQFVPQHTKDLIYKEPK >MS2097 nemA, NemA protein MNAKFSPLFQSYTLNNGVEIKNRLVVAPMTHFGSNPDGTLGENEREFISN RANDMGMFILAATLVQDGGKAFHGQPEAIHASQLASLKETADIIKAQGAK AILQLHHGGKQAVTELLNGKDKITASDDEATGTRAATVEEIHSLINAFAN AADLAIQAGFDGVEIHGANNYLIQQFYSGHSNRRTDEWGGSRENRMRFPL AIVDEVLAVKAKHQANDFIVGYRFSPEEPEELGLTMEDTLALIDTLKGKA LQYLHISLHEFFKKARRGADTNAFRMQLVHDRIGGKLPLIGVGSLFTAEQ ILEAYNTGWAEFIALGKTVMINPTIATLIKEGKENEIVTTLDPEKADQYG IKGILWDLCKNGGAWLPPLKGKDDWHPVDV 
>MS2012 nemA, NemA protein MSRTYQQITQSRSFQMAKFRYLTEPFQIKNLQLKNRVVMPPMCMYVAKED GIANNWHFVHYVSRAVGGVGLIIVEMTNVADNARISPDCLGLWNDEQAQA LKKIVDECHAQGAKIAVQIGHAGRKALGWDDVVAPSAIICDESVTSDKSR WSYKMPRALTTEEAEQVVLQFQSAVRRAVAIGFDAVEIHAAHGYLIHQFY SPKMNIRTDKYGQDKCLFGIEVIQAAKAVMPAEMPLLVRISAQEYSDNGF PAEYGVSVAKRFAEAGADVLHVSGGGDGNFI >MS2296 nemA, NemA protein MNPIFSPLFQPYTLNNGVEIKNRLVVAPMTHFGSNTDGTLGKQEHRFISN RAGDMGMFILAATLVQDGGKAFHGQPEAIHTSQLPSLKATADIIKAQGAK AILQIHHGGKQAITELLNGKDKISASADEESGTRAATIEEIHTLIDAFGN AADLAIQAGFDGVEIHGANNYLIQQFYSGHSNRRTDEWGGSRENRMRFPL AVIDAVVAAKIKHQRDDFIIGYRFSPEEPEELGLTMEDTLALVDVLKEKP LQYLHISLWDFYKKARRGADSNTARLQLVHERIGGKLPLIGVGNLFTAQQ ILEAYQTGWAEFIALGKTVMVNPKIATMILNGQENQLITEVDENQADHYG FPDFLWNATMSATQAWLPPVKGKPWSPLDI >MS1349 nfnB, NfnB protein MHLIKLNNREDIMTISKQDVLEAFKFRSACRYYDPAKKISKADMDYILEL ARLSPSSVGSEPWKFVVLQNPVIREKIKPVTWGIKHPMDEMSHLVVILAK KNARYDSDFFRTSLEKRGLTPEQMEATLARYKSFQTDDIKVLESDRALFD WCSKQTYIALANMMTGAAMIGIDSCPIEGFNYAEVNRILAEEGLFDADEY GVSCMVTFGYRAREITKKYRKPAEDVIEWIE >MS2115 nfnB, NfnB protein MTILSTEQILSAFKNRKSCRHYDETRKISEQDFNFILELGRLSPSSVGSE PWKFIVLQDPKLREAIKPFSWGMASTLDSASHIVVILAKKNARFDTPFML EGIKRRGVTEPEAIEKTLAKYKDFQENDMQTLNDERALFDWCSKQTYIAL GNMMTGAAMAGIDSCPIEGFNYAEMNRVLSEAGLFDANEWGVSVAVTFGY RTQEIAQKARQPQEDVVIWAK >MS2149 nfnB, NfnB protein MRNKPMSTLDFATTVRERHSVRQFLPTPMTNAQIREVAEDARRSPSSTNT QPWSVHIVSGETLARLKKRIMEKFEQGELCPDFAYDQSKFDGIYEPRWRE FYKEMFAANGVTRDDSEGRKKITRRNAEFYDAPHAAFLFMPDVGDGNVNA ASDMGMYSQTFLLSLTARGFGGIPMLFLAFFADVVREELGISPDFKLLHA IAFGYPDQDAAINQFRSKRASVDETVTFYE >MS0756 nfnB, NfnB protein MFFSIIAIGVIYQTAYQEVIMSEKNFIETLLSHRSIRQFKSQQIAPDIIE QLVDVARFASSSNHLQCISIVRVMQPQLRHELMLCASGQAYVESAAEFWV FCADFNKHKQICPEAQLDYTEVMLIGAVDAGIMAQNVLAAAENLGLGGVY IGSIRNQIEKVGELLNLPEYVVPLFGMCLGYPDQNPPLKPRLPQELMFFE NQYRPLDKEMLNDYDKEVAEYYKKRSQADMDWSRNVVKTLGKPVRPQVLG YLQKQGFVKK >MS1109 nfnB, NfnB protein MDALTLLTTRRSEKKLSAPVPNNEQLELIFQAATHVPDHGKLQPYHFIVI ENDGLKKLETLLKSAVTELKLDEKRLQKAEKIASTAPMMIAVVAKINTDI AKVPAWEQMLSAGCSAYAMQLAANAQGFDNVWVTGPWVDGSDLREALGCA 
PKDKVIGFIILGTSQEKITREPKTVKTENFVSYL >MS1801 nhaA, NhaA protein MSPSINLTFKGGCNMFMAQIQRFFKMGSASGILLFFFALLAIIFANTSLN NFYFNFLDIPVSVQFGEFMINKTLLHWINDGFMAVFFVLVGLEVKREMLE GSLSRYQLAIFPAVAAIGGMIVPALIYYLITNQHPELSNGWAIPMATDIA FALGIVALLGTRVPLPLKVFLLALAIIDDLGAIVVIAVFFSEELSIQALS VAIVAIAGLITLNRMKVGHLCAYLIFGLILWAAVLKSGVHATLAGVIIGF CIPQKDSEGKSPLHTFEHILTPWCSFFVLPLFAFANAGVSLGTINTDMIF STLPLGIALGLIVGKPLGVFSFSYFSVKLGIAKLPEGIKWKQVFAIAILC GIGFTMSMFLAGLAFTDGQSDSLINTLSRLGILLGSSVSAILGYLLLKST TK >MS0430 nhaB, NhaB protein MSGYTAFFNNFLGKSPNWYKLSIIVFLILNPILYFLISPFIAGWCLVIEF IFTLAMALKCYPLQPGGLLAFEAIAIGMTSPAHVKAEIMASFEVILLLMF MVAGIYFMKQLLLFAFTRLLITVRSKIVLSLSFCLSAAFLSAFLDALTVV AVIISVVMGFYGVYHKVASGNNFDDSTDITNDEKIKKDQQVLEQFRSFLR SLMMHAGVGTALGGVMTLVGEPQNLIIAEQASWGFGEFFIRMAPVTVPVL ICGLITCVMIEKMNIFGYGDKLPRKVWGILAKFNRAQQQKMNRQERQKLI IQGIIGIWLVCGLAFHLAAVGLIGLSVIVLTTAFCGITSESTIGKSFQES LPFCALLVVFFSVVAVIIDQHLFGPIINYVLSASESTQLLLFYGFNGLLS AISDNVFVATVYINEAKNALHAGIISLEQFELLAVAINTGTNLPSVATPN GQAAFLFLLTSSLAPLIRLSYGKMVYMALPYTIVLTVVGLLAVEYILPGA TKYFSSLGWITALPV >MS1321 nhaC, NhaC protein MQLADFSTSLWSILPPILALTLAIFTRKVLFSLSVGIIVGSLMLSNGSLA QGVSYLFDSVTSLIFNFDEENHFVLNDNNVNILVFLLLLGILTALLSVSG SNQAFAEWAQKRIKGRRGAKIMAACLVFITFIDDYFHSLAVGAIARPVTD KFHVSRPKLAYILDSTAAPMCVLMPVSSWGAYIITLVAGLLAEHSITGYT PIGAFMTMSAMNFYAIFSIVMVFIVAYFSFDIGPMAHHEKLAMEQANNVQ ETNSAVQGQVRNLVLPIVGLIIGTVTMMMHTGNQALLADGKEFSVLGAFE NTTVGISLVVGGMTAVLISTILIVLAKKLSSGNYAKAVIAGMKSMVGAIL ILCFAWTINKVVGDMQTGKYLSSLISGNLTPALLPALLFVLGAAMAFSTG TSWGTFGIMLPIAAAIAVNAAPELLLPCLSAVMAGAVCGDHCSPVSDTTI LSSTGAKCNHMDHVTTQLPYALLVATATILGYLVVGFTETPLFGFITTGM ALFLLIFITKKR >MS0136 nhaP, NhaP protein MSVYAYICFLFSISIFLAFFTRKISARIQSTIAITASAMIGSLVLILFGY FGWFKVEDIAIHIMERVDFKNFLLNGMLGFLLFAGSLGIKLPLMKEQRRE IAVFALFSTLASTFFIGILIYYAAMLVGLRIDFVYCLVFGSLISPTDPIA VLAIIKNLKAPKRLSMQVEGESLFNDGVGLVIFTTLFAVAFNGQEPTFSG VFGLFLKEAVGGILFGFVMGFAIHLLITFTKEVSLEILLTLTIPTAGFML ANLLHISGALAMVTSGIIIGNWTRRQGFSERNRYFLDHFWEMIDHSLNSL LFFLIGLALLLVEFTFESSMLMLLAIPVCLIGRYVSLWIPYQIMSRFRRY 
NPYTLRILTWGGLRGGLALAMALSIPSNVVNISGNGMNLDLRDVIILMTY AVVMFSILVQGTTIEKMIETSKVIDPKRDAYVKLGGSRDYPPHI >MS1726 nifS, NifS protein MKFPIYLDYAATCPADDRVAEKMMQYLTRDGIFGNPASRSHKFGWQAEEA VDIARNHIADLIGADSREIVFTSGATESDNLAIKGAAHFYQTKGKHIITC KTEHKAVLDTCRQLEREGFEVTYLAPKSDGLVDLDEFRAAIRPDTILASI MHVNNEIGVIQDIEAIGKICREHKVIFHVDATQSVGKLPINLAELPVDLM SMSGHKLYGPKGIGALYVRRKPRVRLEAIIHGGGHERGMRSGTLAVHQIV GMGEAYRICKEEMAEEMAHVTKLRDRLYNGLKDIEETYVNGSMEHRVGSN LNISFNFVEGESLMMALRDIAVSSGSACTSASLEPSYVLRALGLNDELAH SSIRFSLGRYTTEEEIDYTIDLVKSAVKKLRDLSPLWDMFKEGIDMSKIE WSAH >MS0432 nlpA, NlpA protein MNFKKLLTVAAVTSVFALTACNDEKKADTAAPSAQNTPAQTITVGVMSGP EHQVAEIAAKVAKEKYNLNVKFVEFNDYALPNPAVSKGDLDINAMQHKPY LDEDVKKNNITNLTIVGNTFVYPLAGYSKTIKNVSELKEGAKVAVPNDPS NQGRALILLEKQGLIKLKDNTNLAATPLDIVENPKNLKITPVDTAVAARA LDDVDLAVVNNTYAGQVGLNTADNGVFVESKDSPYVNIIVARTDNKDSEA VQNFVKAYQTEEVYQEAVKFFKDGVVKGW >MS0267 nlpB, NlpB protein MKKWLLSVAVLATVTACSSSNESRQVANDSYEKNAESKINFSPLATGGVT IVGQDNKYQLPTTNISKGPAVDIRPPTTPMSIIGNSVAQFDGERASIVYP AAKSAVYNLDQVARLLKEENIEFTRQENKLLTDWAPTGRVDEVGDVKLRY LIEQLGNKEANALSVTVLEAKRNEIIFTPSVTDKQRYTSDRLNNFVGNLN HAYRTQMAQTAPVATSNGAIQAEIVTDGNNRTALGLTSSFAQSWEKLGQV LPELGFEIDEETAGRGYRVLKYKPVDDSQWARLGVNKPELEKGEYSMQLS AYGNQSAVVLMDEDKAALEGDKAQAVYKALQVLMTK >MS2320 nlpD, NlpD protein MIMWFLVSNKIPRKLTALLGLGLFFAFPLQAADLSKIQQQIKQQEQKIAE QKRTQNQLQSTLKEQETKMSGMIGELRQTETDLKETRKIISETNKQIRTL EQQERAQKEKLAKQLDAVYRSGNPSSVVEHLLSDDAKKADRMKVYYEHMN QARMDAIAEIRNTRAQLDEQKNVLNTQLQEQQTQLSTQKKQQQELQKMKN ERQSTLNKLSKSLKQDQNRLQTLKENEIALRNEIQRAAQAAQQQEKRERE AYTAKKESEEKRSNKPYQPTSQEQQLIRSNSGLSGRYAYPVVGRILHAFG SQQAGEVKWKGIVISARAGTAVKSIANGRVILANWLQGYGLVVVVDHGKG DMSLYGYNQSVSVKVGSLVRAGQQIAEVGNSGGQGSSGLYFEIRRQGNAV NPMGWLR >MS2269 nlpD, NlpD protein MVTVLQTVLVCGAAGVWTAGCVFTAGVVAVFPTAVAGAVAAGIVGWLATG LSIPAGMELCWISGCHEPLLPVLSTGCIIPGLNVPSAFSTGAGEVLELQA DNTATAIGNNKNDFFILNSLKKLFHTIRHTLLFDFRHKTAACNRSGTFHR NHNTVR >MS0791 nlpD, NlpD protein MAQHVKLARDRRKRKSRIKAAIFFMAIISIFTGTFLSLKDSVEDKNIDGD TALAQAEQFEKLTPDAGTSDRLTQHLLDQAKVLAEDNNATSYDDDLSGQD 
DEVDEIKIDPDDFDISSLPPEAQSALSDLLDVADQAKRISDQFSHTIVRG DELKDVLELSGLEPMTAEGLIASYPELKKLKAGQQMYWILDKNGELEYLN WLVSEKEERIYERLESGKFERQILEKKSVWKKEVLKGKITSSFRASLLKL GLDQRQVSQLTNALQWQFSMKKLMKDDNFAILIFREYLGDKLTGQGNVEA IHIISQGKSYYAIQAENGRYYSRQGETLGKGFARYPLQRQARISSQFNPR RRHPVTGHVRPHKGVDFGVPTGTPVISPADGVVEKVAYQKGGAGRYIMIR HGREYQTVYMHLSKPLVKAGQSVKRGERIALSGNTGISTGAHLHYEFHIN GRPVNPLTVKLPGTSNEMRDSERRQFLTKAKYVERQLKM >MS2268 nlpD, NlpD protein MFLIAYISGMDVKELAALNGMTSEPYNLKVGQTLKVANRVGGGAETETIT EQQCTEVPVEQPAVTYTAGANGTQYGSDGTITGPVKAGAGTAGAASAVAG PVVASAATSAPSPAYNSGVTAGVGTVAPATTGVVAGTASSAVQTPASNIS WKWPTSGRVVQGFSNSDGGNKGIDISGSKGQPVYAAAAGRVVYAGNALRG YGNLIIIKHNDDFLSAYAHNDSISVNDQQEVKAGQQIAKMGSSGTNSTKL HFEIRYKGKSVDPTSYLPRR >MS0294 norB, NorB protein MREEYRHQSRVNEDGNVVLSETRLQAIRQTADYYIKLYGDDPSMISSRES FAMKNNTLPDPEARQKLSDFFFWTAWVASTNRPDAEATYTNNWPHEPLIQ NVPTTENIMWSLISIICLIAGIGFLIWAYSFLRDHNETAPQAPAADPLSK LNLTPSQKALGKYVFLTLALFVVQVGLGGVLAHYTVEGQKFYGVDISQLF PYSLIRTWHIQSALFWIATGFLTAGLFLAPIINGGKDPKYQKFGVNFLFL ALLIVVVGSYSGNFFALSHQIPAEFNFWFGHQGYEYLDLGRFWQLLLFVG FLLWLWLMLRCTSHSFKQGGDKNLLAIFIASIIGVGLFYGPGLFYGEHSS ITVMEYWRWWVVHLWVEGFFEVFATCALAFIFYNLGLVGYHSATVASLMA GSLFLVGGIPGTAHHFYFSGTTTPALAAGAVFSALEVVPLVLLGSEAYEH WSYQHRTSWMQKLRWPLMCFVAVAFWNMLGAGVFGFLINPPISLFYLQGL NTTAVHAHAALFGVYGFLTLGFVLLVARYLKPDFQFNEKLMKTGFWSMNI GLVLMIAISLLPIGLYQVSASISEGLWYARSEGFLQQDFLQTLRWLRTVG DLILIFGAVLFAYEVTRLTFSRRT >MS0293 norB, NorB protein MGKYKKLWAALVVVLTVTFTILGYIGVEVYRQAPPVPQAYVSQTGETVMT KDDILAGQTAWQTTGGMEVGSLLGHGAYQAPDWTADWLHRELTAWLDIRA QATFNKSYTELDPASQAALQADIARGIPPSKQGE >MS1314 norM, NorM protein MQKITHWQEYKIEAKSLILLSLPILLAQIAQNSMGLVDTIMAGRVSAADM AAISVGASIWMPLVLFGQGLLLALPPTISYLNGSAQRHRIAHQVRQGIWI ILFSIVPLALLIYHSDTVINRMGMEEHLAQITIKYLHAMLFGLPAYLLLV NFRCLNDGLAKTKPAMIITLIGLLLNIPLNYIFIYGKLGVPAFGAVGCGI ATTIVNWIMCILMISYTKSARNQRDLKVFENIIELPNPATLKKLFKLGLP IAIAICSEVALFALTSLLLSPLGTNAVASHQIALNTSSFIFMLPMSLGMA TTILVGQSLGERSPLKAKDISYVALFIGLATATLTAFLTVVLRYQIAGIF VKDTEVISLAASLLLLNALYQFSDAVQVVVGGALRGYKDTKAILYITLFC 
YWVLGMPIGYILSRTDLITAHMGPTGFWIAFVVSLTVAAVLLFYRMYKIQ KQSDEQLLTKLEKLK >MS0920 nqrA, NqrA protein MTDVLARFNSGKLWDFDGGIHPPEMKSQSNQTPITKAPLTEDFYIPVKQH AGDAGNLLVKEGDYVLKGQPLTQGDGLRMLPVHAPTSGTVIAIAPHIAAH PSGLSELAVHIHADGKDQWREQNPIDDFLSQSAEQLIEKIYQAGIAGLGG AVFPTAAKIDSAQKKVKLLIINGAECEPYITCDDRLMRDNPDEIIEGIRI LRYILRPEKVVIAVEDNKPEAVQSIKNALQGANDIEIRVIPTKYPSGAAK QLIQILTGMEVPAGQRSSSIGVLMQNVGTAFAIKRAIINDEPLIERVVTL TGDKIPNKGNQWVRFGTPISFLLKNVGYQYDERLPVFLGGPMMGLTLPNL DAPITKLGNCILAPDHFEYDPQAREQSCIRCSACSDACPVHLMPQQLYWY ARSEDHEKSEEYSLKDCIECGLCAYVCPSHIPLIQYFRQEKAKIWEIKDK AKKAEEAKLRFEAKQRRLEREEQARKLRSQRAAEARREELANQKGVDPVK AALERLKQKQAAITEKPKISALKTVVNEKGEVLPDNSEVMALRKARRLAR QQAISTENSVLTQVDSGTQSDNSDVQKNPENSTALDGKKAAIAAALARAK AKKLAQNNTESVSDNVTAVKSAVQNTEISAPSDSAEKTETDPKKVAIAAA IARAKAKKAAQNNTESASDNVSAVKSAVQNTEISAPSDSAEKTKTDPKKA AIAAAIARAKAKKLAQQQSNKTE >MS0309 nqrA, NqrA protein MITIKKGLDLPINGKPEQVIRDGNAVTEVALLGEEYVGMRPSMKIHEGDT VKKGQILFEDKKNPGVVFTAPVSGTVTAINRGAKRVLQSVVIRVEGNDQE TFAKYSPAELVSLSSEQVRQNLQTSGLWTALRTRPLSKIPAVDAVPSSIF VNAMDTNPLCADPAVIINEYQADFTNGLTVLTRLHNKVNLCKAAGSNIAS VDNVDSHEFAGVHPAGLVGTHIHFIDPVGINKSVWHINYQDVIAIGKLFT TGELFTDRVVALAGPQVKNPRLVRTNIGANLSQLTANELADGNNRVISGS VLYGAKAEGAHDYLGRYALQVSVIAEDTEKEFFGWISPQANKYSITRTVL GHFGRKLFNFTTAENGGHRAMVPIGSYERVMPLDILPTLLLRDLEVGDTD SAQALGALELDEEDLALCTFVCPGKADYGSFLRQALDKIEKEG >MS0308 nqrB, NqrB protein MGLKNLFEKMEPAFLPGGKYAKLYPLFESVYTLLYTPGTATQSTTHVRDA IDSKRMMIIVWLALFPALFYGMYNVGHQSINAVLSLGTSVDSLAANDWHY ALAQALGVDFTAAAGWGSKMLLGATFFLPIYIVAFAVGMFWELLFAIVRD HEVNEGFFVTTILFALIVPPTLPLWQAALGISFGLVVAKEIFGGVGKNFM NPALAGRAFLFFAYPGQISGDLVWTATDGFSGATALSQWAQGGEAALQHV ASGQPITWMDAFLGNIPGSMGEVSTLALIIGAAIIVFARIASWRIIAGVM VGMIITSSLFNLIGSESNPLFAMPWYWHLVLGGFAIGMFFMATDPVSASF TNKGKWWYGALIGVMAVLIRVVNPAYPEGMMLAILFANLFAPIFDYLVVQ GNIKRRKARTA >MS0919 nqrB, NqrB protein MFKIASSPHSHSGKLTARIMLWVILAMLPAIFAQLYYFGFGVLFQITIAV VFALCLEFLVTILRKKPKLFYISDFSVTLTALILAVAIPPYAPYWIILIG IFCAVILGKHVYGGLGQNPFNPAMVGYVVLLVSFPMQMTTWLAPVQLLHE PPTFIDAYHLIFSGGTTDGFSLHQLTASIDGMSSATPLDAVKTGLKANRG 
LAEINRSPLFTQSSLAGLGWFQVNLAFLLGGLFLVWKRIIHWQIPTALLI TVCLFSLCSWLFSDNMPSPLWQLFSGATMFCAFFIATDPVTASITPKGKL VFGVLVGLLLCLIRFYGGYPDGAAFAILLANICVPLIDQYTRPRVTGYDL RGKN >MS0918 nqrC, NqrC protein MGVGQTSVKYGAILGIVALICTIISTALYFLTKDKIEAEILKQQQELLAQ VIPANYYDNDVTATCKTTESREIEKICTALLTGKVSAYAVEATAPDGYSG AIRLLMGITPEGEVLGVRVLAHKETPGLGDKIETRVSHWILSFNHQKISE DNLQDWAVKKDGGKFDQFAGATITPRAVVNQVKRSALAVLKNEQNNR >MS0307 nqrC, NqrC protein MAKFNKDSVGGTLTVVVLLSLVCSLIVAGAAVLLKPTQEIQKQLDKQKNI LMAAGLMQQGTNVQQTYAKFIEPKIVDLATGDYVDGITNFDAKASAKDPA TRVAIAPADDKAGIKVRSKFAEVYLVKDEAGNTTQVVLPMYGNGLWSIMY GFVAVQPDANTINGITYYEQGETAGLGGEIANPNWQKNFVGKKLFDAQNK VALVVGKNASSNKEHGIDALSGATLTSNGVDGSFKYWFGPQGFGPYLAKF KAEGAN >MS0917 nqrD, NqrD protein MEKEQSIWHDLLSQGLWRNNPAIVQLLGLCPLLAVSNSATNALGLGFATL LVLTCTNTMVSLFRKQIPHEIRIPIYVMIIATTVTAVQLLMNAYTYSLYQ SLGIFIPLIVTNCIVIGRAEAFASKNSVLHSAFDGFAMGLGMTLSLFLLG ALREVLGNGTLFDGIHLLLGDWAKPLRIEFFHNDSNLLLAILPPGAFLGL AVILALKNVIESRTK >MS0306 nqrD, NqrD protein MAGSNLKKLLLSPIADNNPIALQILGICSALAVTTQLQTAVVMAIAVSFV TGFSSFFISCIRNYVPNSIRIIVQMAIIASLVILVDQILRAYAYDLSKQL SVFVGLIITNCIVMGRAEAFAMKSGPVESFVDGIGNGLGYGAILLIVAFL RELIGSGKLFGVTVLETVQNGGWYQANGLFLLAPSAFFIIGFVIWGLRTW KPEQVEK >MS0922 nqrE, NqrE protein MQKQDNNALTQTNYTNMTDYILLIISTALINNFVLVKFLGLCPFMGVSKK VETAIGMGLATTFVLTVASLCTYLADSYILAPLNASFLRTLVFILVIAVV VQFTEMVINKTSPTLYRLLGIFLPLITTNCAVLGVALLNINLAHNLTESV IYGFGASLGFALVLVLFASLRERLAAADVPAPFKGASIALVTAGLMSLVF MGFTGLIRV >MS0305 nqrE, NqrE protein MEHYISLFVKSVFIENMALSFFLGMCTFLAVSKKVSTAFGLGIAVIVVLG ISVPVNQLVYTHILKDGALIEGVDLSFLNFITFIGVIAALVQILEMFLDK FVPSLYEALGIFLPLITVNCAIFGGVSFMVQREYNFPESVVYGIGAGTGW MLAIVALAGLTEKMKYADVPAGLRGLGITFITVGLMALGFMSFSGIQL >MS0304 nqrF, NqrF protein MDSNFIFGIGAFTAIVLVLAVVILIAKSKLVDSGDITISINNDPEKAITL PAGGKLLGALASKGIFVSSACGGGGSCGQCKVKVKSGGGEILPTELSHIS KKEAKEGWRLSCQVNVKSSMDVELPEEVFGVKKWECTVISNDNKATFIKE LKLAIPEGEEVPFRAGGYIQIEAEPHTVNYKDFDIPEEYHEDWDKFNLWR YVSKVDEHIIRAYSMASYPEEKGIIMLNVRIATPPPRNPDVPPGQMSSYI WSLKPGDKVTISGPFGEFFAKDTDAEMVFIGGGAGMAPMRSHIFDQLKRL 
HSKRKISFWYGARSKREMFYVEDFDQLQAENDNFTWHVALSDPLPEDNWD GYTGFIHNVLYENYLKNHEAPEDCEYYMCGPPVMNAAVINMLESLGVEHE NILLDDFGG >MS0992 nrdA, NrdA protein MCFSKVKTFMNKALMVTKRDGQVEPLDLDKIHRVITWAAEGLENVSVSQV ELRSHIQFYEGIRTSDIHETIIKAAADLISKDAPDYQYLAARLAIFHLRK KAYGHFDPPRLYEHVKKLVRLGKYDESLLSDFSREEWDEMDNFLDHSRDM TFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAASLFSKYPAETRLD YIHRFYDAISTFKISLPTPIMSGVRTPTRQFSSCVLIECDDSLDSINATS SAIIKYVSQRAGIGINAGAIRALGSPIRDGEAFHTGCIPFYKHFQTAVKS CSQGGVRGGAATVYFPMWHLEVESLVVLKNNRGVEENRARHMDYGVQINR TMYQRLIKGGDITLFSPSDVPGLYEAFFADQAKFEELYVKYEQDPTIRKR TVKAVDLFSLLMQERASTGRIYVQNVDHCNTHSPFDPAVAPVRQSNLCLE IALPTKPLNNINDEDGEIALCTLSAFNLGKIDDLDELENLADLAVRSLDA LLDYQDYPVPAAKRSSLGRRALGIGVINYAYYLAKNGVRYSDGSANNLTH RTFEAIQYYLLKASMNLAKELGACEYFNETTYAKGILPIDTYKKDVDNLT SEPLHYDWEQLRTEILEFGLRNSTLTALMPSETSSQISNATNGIEPPRGH ISVKASKDGILKQVVPDYENLSDKYELLWDMPNMDGYLHLVGIMQKFVDQ SISANTNYDPKRFEDDKVPMKILLKDLLTAYKYGLKTLYYQNTRDGADDA QEDLDDGCAGGACKI >MS0633 nrdD, NrdD protein MLAMGSFFIIKRDGSRASFEIQRIINAIKKAAKAVGIDDERFCHLVSQQV FDEIFQHNQNEIDISRIQQFVENKLMASAYPQVARAYIEYRHDRDLAREK RSQLTKDIEGLIEQSNVEILNENANKDAKIIPTQRDLLAGIVAKHYAKSH ILPRDVVEAHEKGEIHYHDLDYSPFFPMFNCMLVDLKGMLTQGFKMGNAE IEPPKSIGTATAVTAQIIAQVASHIYGGTTINRIDEVLAPYVQLSYEKHL KHAQEWNVPDQKAYADALIEKECFDAFQSLEYEINTLHTSNGQTPFVTLG FGLGTSWQERLIQKSILKNRIRGLGKNHKTPVFPKLVFTIKHGINQSPKD PNYDIKQLALECASKRMYPDILNYEQVVKVTGSFKAPMGCRSFLGAYEEN GELVHDGRNNLGVVSINLPRIAIEAKGDEQRFYEILDQRLAVTKKALMTR IARLENTKARVAPILYMEGACGVRLKADDNIAQIFKNGRASVSLGYIGIY ETINALYNQGHIYDNEMLREKGVQIVEYLSKATKEWQKETGYAFSLYSTP SENLCDRFCRLDTKQFGVIEGVTDKGYYTNSYHLDVEKKVNPYDKLDFEM PYPSLASGGFICYGEYPNIQHNLKALEDVWDYSYDRVPYYGTNTPIDECY ECGFTGEFECTSKGFTCPRCGNHDSEKVSVTRRVCGYLGSPDARPFNAGK QEEVKRRVKHM >MS0968 nrdF, NrdF protein MAYTTFSQNKNDQLKEPMFFGQNVNVSRYDQQKYETFEKLIEKQLSFFWR PEEVDVAQDRIDYAALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLV SIPELETWIETWTFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEEIIKR AKDISAYYDDLIRDSQLYNLFGEGTYKVEGKECKVTLRNLKKQLYLCLMS VNALEAIRFYVSFACSFAFAERQLMEGNAKIIKFIARDEALHLTGTQHIL 
NIMAAGQDDPEMAEIAEECKQEAYDLFLAAAEQEKAWADYLFKDGSMIGL NKDILVQYVEYITNIRMQAVGLPLPFQARSNPIPWINAWLVSDNVQVAPQ EVEISSYLIGQIDSKVDTSDFGDFDL >MS1857 nrdG, NrdG protein MGNADLFLRISGITAVENLLISNPRYPIVEIFESLQGEGFNTGMPCIFVR FGKCNLACPWCDTDYERFEYRTLQQIVEKVRSFSAKNIIITGGEPTIQPN ISLLLAQFKRDGYFLAIETNGLRAVPPQIDYISASPKAMYAEKYRRRCID FAHEVRIVMDADAENFCQQIEQKIRAERYYLSPCEIEGKMNLLETIALLG KLNQRPNKPKWQLSIQTHKLAGIE >MS0632 nrdG, NrdG protein MNYLQYYPVDIVNGEGTRCTLFVSGCTHACKGCYNQKSWSFSAGVPFDNA MEEQILKDLKDTRIKRQGLSLSGGDPLHPLNVETLLPLVQRIKRECPDKD IWCWTGYKLEELDDYQRKMLPYIDVLIDGKFVQELADPALVWRGSSNQIV HRFRSNEF >MS1819 nrfA, NrfA protein MNGVIIVNILRKTLSSLAIVGLGFAMANSAVAEEKAMATHQQAQQLQQPA PETAAKRAPTKEELTPVNPNLKIEAANEKFAADFPRQYNSWAKTAEQTEF HKEVEDDPRMIVMWGGYAFAKEFNSPRGHIYAVTDVRNILRTGSPKDANG GPQPMACWTCKGPDVPRLIAEWGEEGYFSGKWAKGGAEVVNSIGCADCHD TQSQDFKDGKPALRVARPHVLRALDTVGKTFATSDRTDQRAGVCANCHVE YYFDKSTGANNVVFPWYKGRDVDSIEKYYDEIGFKDWEHSISKAPMLKAQ HPDFETWSMGTHGKNGVTCVDCHMAKTQDKDGKVYTDHQVVGNPVKDNFQ NTCARCHDQSQDTLIKTVEQHKADVREVMLKLEDQLVKSHFEAKTAWDNG ATQEEMKDALQAIRHAQWRWDFAAASHGMHMHAPDVALKIIASGLDRVAD ARAKLAVILAKHGVQQPIQYPDISTAEKAWKVMGIDIEKERKEKEEFIKT VIPEWNKEAISKGLILTAPPTTPAK >MS1816 nrfD, NrfD protein MSTLTYPVPFHTPDLVWDSSIAIYLFLLGISSGAVQLAIAYRRSHKLEKP SENWIVRSAAVLGTIPTLIGLTLLIFHLARPWTFWKLMFNYQFNSVMSMG VMLFQVYMLFMVIWIAVLFKAEIDNLIKKFVPKLRFVTNIIGACERIFSA AEVILFILAAVLGAYTGFLLSALISYPMLNNPVLPALFLASGTSSGIAAT FLCILIAGKLKGDSHEVHYIHKFEVPIMVTELGLIVCFFVGLYFGGGQKV VALQNALSGFWGAVFWIGVMLIGIMIPLIANLFASDKWKYNAKFIILVSI FDLIGVLCLRYFILYAGQLTIAM >MS1811 nrfG, NrfG protein MTQFIIGLIIFAAIGLLLFVFFTKKVSWQQNYRQQQNIALYEQQLQSNPG EELANEFAQRLLMDEQQSEAALTLKTAVGFSRKLSALLWLVLIVMPLLYY FSLNRFDYVRQGEKAFAQQQSRLITASAEDKNIDYVLSIQNKLRKDPNNA DDWVELGQAYMLSNDYDNALLAYGNAEKLEGGKPHILGLAATTLYYQAGQ KITPQVQHIIDIALAADPKEVSSLSLMASDAFLKNDFPSALQYWQRLLDS GHTGLDRREIIRNMNLATMLQNNRMQQKAN >MS1774 nrfG, NrfG protein MFNNSMKKALFLSLIFALGGCSGLPMSDSESFVAKEKLYHSTNNYNGLIS LYREQLKTTEDNSVRYKLALTYYQKGDSQSSLDYLQPLLNEQNLYFQSAT ILQIRNLIQLQNYNEAISSASMLISKYPHNSEAYNLRGIANAQLGKYKNA 
EQDINSARNRFINDVIAINNLAMLKIINGDYKNAVNLLLPQYLNGAKEQR LVHNLVFALVKSGDTNYALDIIKKERLNTSPEDLVNALKKTEKVPNKVTT ARYKK >MS1820 nrfG, NrfG protein MRKFKSLTLIALSVLVIASCSSSEKPVEQASEQELFSTGANYLQEGNYTQ ATRYLEAVDSRFPGSSYSEQAELNLIFSTYKSQDYTKTLTTADRFLQQFP QSQHLDYVLYMAALTNSALGDNLFQDFFGVDRSTRETTSMKTAFNNFQTL VQNFPNSPYTPDALARMAYIKDRLARHELEIAKFYAKRSAWVATSNRITG MLRSYPDTQATLEALPLLQESYEKMGLTQLASQAATLVKANEGRVIKEAE KPKEPFLSLPSWLSFGSSDSSDKEKVATKSDDSFFSWPSWLSFGSKD >MS0494 nrfG, NrfG protein MIQLKKLFNFVVFLPGLFFAFALSGCVNGADDVFVSKNKIILGEQYPNVH FDQEVMIVRISQMLIIGQLSKNERADLYFERGVLYDSLGLWGLARYDFTQ ALALQPRSPAIYNYLGLYLLLDEDYDSALEAFNAVLELDPNYDYTYLNRG LDFYYMERYNLAQQDLLKFYEAKKDDPYRALWLYINELKFKPNEATQNLA RRAKDLSTEYWGTYIVQYYLNEISVKDLLDKAKVFVDPQSSQYAEILTET YFYLAKQKLNAGHAEEAETLFKLAMANQVYNFVEYRFALFELAKLKTNSE QTEQAVVQRVKTTQAPNSKELDAE >MS0609 nrfG, NrfG protein MTFWISALVFTLIMTFICFYPLLRGQTDREQETNRDSLNKAFYFDRLKEI EEDEKQGLLDNAAQLKTELQQSLLEDIPEGVTEKTDKKAYSKLWFVSAFL FLGIIAGVSYFKVGGWQSQEMMAKSYEKLPYFYERLKEEDTKPLDDTELQ QFATALRIKLQKEPNDADGWWLLGQLGTAMGNGELAHNGYSKAAELKPDN TDYKLAYARTLMFSDDKADRAKGNELLKEVIRSDHSNLQALSLLAFNYFE EEDYKMAAVTWAMMLRLLPEDDPKRDLIEKSIRSARDALAEQEQEKHKRM IPQNK >MS0916 nth, Nth protein MNKQTRIEILTRLRDNNPQPTTELTYNSPFELLIAVILSAQATDKGVNKA TERLFPIANTPEAILALGVEGLKEYIKTIGLYNAKAENIIKTCRDLIEKH QSQVPEDRAALEALAGVGRKTANVVLNTAFGHPTIAVDTHIFRVSNRTGF APGKDVVKVEEKLNKVVPNEFKVDVHHWLILLGRYTCIARKPRCGSCIIE DLCEYKDKTDL >MS1897 nupC, NupC protein MSMLTSLLGIFVLLAIAYSLSSNRKAINFRTVGGALLIQILIGAFILYVP AGRDILLSMANGVAKVISYGNEGIKFVFGGLAGDKIFEVFGGDGFIFAVR VLPSIVFFSALISLLYYIGVMQWVIKIIGGALQKLLGTSKSESMSAAANI FVGQTEAPLIVKPYISRMTESELFAVMCGGLASIAGSVMAGYAGMGVPLT YLIAASFMAAPAGLLFAKILVPQTEKFDDAIEHVELEKPANILDAAAGGA SSGLQLALNVGAMLIAFVALIALINGILGGVGAWFGMPELSLGEIFGWIF RPLAWLIGVPWEEAGVAGQMIGTKLAINEFVGYLEFTKYLTPETPMVLGD KTKAVITFALCGFANFSSIAILIGGLGAMAPNRRGDIARLGIKAVIAGSL ANLMSATLAGLFIELSGVALG >MS1445 nusA, NusA protein MSKEILLAAEAVSNEKLLPREKIFEALESAIALSTKKKYEQEIDVRVAIN QKTGEFDTFRRWLVVDEVVNPTKEITLEAAQFEDPDIQLGDYVEDQIDSV 
AFDRITMQTARQVISTKIREAERNKVVEQFRSEEGKIVTGTVKKVTRDSI ILDLTGNKEDPAKAEAVITREDMLPRENFRPGDRVRGVLYKVNPESKGAQ LFVTRAKPVMLEELFRLEVPEIGEELIEIKGASRDAGLRAKIAVKSNDKR IDPVGACVGMRGSRVQAITNELGGERVDIVLWDDNPAQFVINAMAPADVN SIVVDEDNHSMDIAVEQENLAQAIGRNGQNVRLATQLTGWTLNVMTTEEL QQKHQAEDNKVLNLFMTSLELDEDFAQLLIDEGFSSLEELAYVPVSELTA IDGLEDEDLVEELQNRAKDALTAKAVAEEEALKQAEVEDRLLNLEGMERH IAFRLAEKNIKTLEELAEQGVDDLADIEELSAEKAADLIMAARNICWFGD E >MS0975 nusB, NusB protein MTEQVKKRPSPRRRARECAVQALYSFQISQNPVETVELSFVTDQDMKGVD MPYFRKLFRQTVENIPSVDSTMAPYLDRSANELDPIEKAILRLAVYELKY ELDVPYKVVINEAIEVAKTFGAEDSHKYINGVLDKIAPALARK >MS0205 nusG, NusG protein MTETAVKKRWYVLQAFSGFEGRVATTLREYIKLNHMEDQFGEVLVPTEEV VENVAGKRRKSERKFFPGYVLVEMEMNDDTWHLVRSVPRVMGFIGGTPDR PLPISKREADLILNRVEENADKPRPKNTFQPGEEVRVTEGPFADFNGTVE EVDYEKGRLKVSVSIFGRATPVELEFSQVEKANG >MS0039 oadA, OadA protein MTKKIKFTDVVLRDAHQSLFATRLRLDDMLPIAAELDKIGYWSLEAWGGA TFDSCIRFLGEDPWVRLRELKKAMPKTPLQMLLRGQNLLGYRHYADDVVD KFVERCVANGMSVFRVFDALNDPRNMQQALTAVKKQGGHAQGTLSYTTSP VHTLDTWLNVTEQLLEIGIDSLVIKDMSGILNPMAAGELVGAIKGKFGDD VELHLHCHSTTGMAEMALLKAIEAGADGIDTSISSMSGTYGHPATESLVA TLQGTEYDSGLDIPSLEKIAAYFRDVRKKYAKFEGQLRGIDSRILVAQVP GGMLTNLESQLKQQNAADKLDAVLQEIPRVREDLGYIPLVTPTSQIVGTQ SVINVLMGERYKTIAKETAGILKGEYGRTPAPVNAELQARVLEGNQPITD RPANHIAPEMDKLAAEVKQQAAEKGIKLAENEIDDVLIVALFPQIGLKFL ENRGNPAAFEPVPTAEQAPAKAAAPVAPKAQSGAAVYTVELEGKAFVVKV SEGGDITNIAPTQTSNAVPAPQAAPVAAPASGGTPVTAPMAGNIWKVVAT EGQKVAEGDVLLILEAMKMETEIKAAQAGTVQGIAVKAGDAVAVGDTLMT LA >MS0038 oadB, OadB protein MVSSMESIISLLKGTGVMHMEWGQAVMILISLLLLWLAIARKFEPLLLLP IGFGGLLSNIPEAGLAMTALDNLLHLGSPDQIAAIAAKVGAIADPAAIKA AVSGISASEHAQLEAMAVDMGYSAGILALFYNVAIGYGVAPLIIFMGVGA MTDFGPLIANPKTLLLGAAAQFGIFSTVLGALTLNYFGLISFNLAQAASI GIIGGADGPTAIYLTSRLAPELLGAIAVAAYSYMALVPLIQPPIMKALTT EQERKIRMVQLRTVSNREKIIFPIVLLLLVALLLPDAAPLLGMFCFGNLM KVSGVVDRLSDTTQNALINIVTIILGLSVGSKLIADKFLQPQTLGILILG VIAFCIGTGSGVLMAKLMNKFSKNKINPLIGSAGVSAVPMAARVSNKVGL EADNQNFLLMHAMGPNVAGVIGSAIAAGVMLKYVSAMIN >MS0040 oadG, OadG protein MTETELFKEGLNLMFSGMGFVIIFLLILIWAIGIVSKLINTFFPEPIPVA 
QAKKTVTPTQSAVVDDIERLRPVIVAAIAHHRRTQGLN >MS0506 oapA, OapA protein MPFIIPIDFGNKTSKDIILKVLSDNRHTKILIFVTALSCATVTISLALRY IVERTPENNPSSEKPSQNELDLGFNQVEPITPKKVVKPEPSLFNKAKSIF AKKEAVEPNHFAVRKEPTFGESADNAKIENTTESTIAEKVKTVSATATSA TSASIENMNSDINTSAESFNAQDVQAEPQVETSSKDTTDTEVKSKMKKPE DWAVMQKLPRKHRRLAIALICVVILLLALLWLKPSSDTVEDFQTDNNKNL PIEFQPLDQSQAIENVDVNNTAPTTEAVEQANATAENALSAPAPTTPNLQ SESVATEAAKAETTDTTKVAEQAKPIEKVKAVDTPKATEKPRAVEQVKAK PAEKTKATEKVSVAENKPVKPAPKKPSVTDAKPATVSAAGSASKTLVIPQ GTSLMQVFRDNNLNISDVNAMTKANGAGGALSNFKPGDKVQVSLNAQGRV KTMRLANGATFTRQADGTYQYSK >MS1594 obg, Obg protein MKFIDEALIRIEAGDGGNGCVSFRREKYIPKGGPDGGDGGDGGDVYLIAD ENLNTLIDYRFEKRFAAERGENGRSSNCTGHRGKDITLRVPVGTRAIDND TKEIIGDLTKNGAKLLVAKGGYHGLGNTRFKSSVNRAPRQKTMGTPGEKR DLQLELMLLADVGMLGLPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGV ARVDANRSFVVADIPGLIEGASEGAGLGIRFLKHLERCRVLIHLVDIAPI DESDPADNIGIIESELFQYSEKLADKPRWLVFNKIDTISDEEAAKRAKDI TERLGWEEDYYLISAATGKNIPQLIRDIMDFIEANPREVEEEEKAAEEVK FKWDDYHNEQLSERGFDDEEDWDDDWSEEDDEGVEFIYKP >MS1218 ompA, OmpA protein MTKIFKRIIKMKKTAIALAIAGLAAATAAQAAPQENTFYAGAKAGWASFH DGYTQYAEDGVGSHTKSVTYGVFGGYQIFNRDNLGLAVELGYDDFGRAAL RTNGATSAKHTNHGAHLSLKPSYDLGALAPVLSGLDVYGKVGAALVRSDY KVNDGYSYGFNKSDFADHSLKTSLLLGAGLEYALPSLPELAFRLEYQWLN KVGKLENANGTRFDYTPEIHSVTAGVSYRFGQGVAAPVAEEVVSKTFTLN SDVTFAFGKATLKPEASASLDNIYGEIAQVQSPAVSVAGYADRIGKEAAN LKLSQRRAETVANYLVSKGVAQNAITATGYGEANPVTGNTCDAVKGRKAL IACLAPDRRVEVSVQGTK >MS1219 ompA, OmpA protein MRTGIGKKAAILNFSQRRVEPVANSWVSKVVAQIAITATGTGEAIPVTGN TCDAVKGRKALIACLAPDRRVKVAVKGEKQATM >MS1220 ompA, OmpA protein MKKTAIALAIAGLAAATVAQAAPQENTFYAGARAGWASFHDGVDAFHNSD GLSAKKNSVTYGVFGGYQILNQNNFGLAVELGYDDYGRIRLIETDAKRGK FTNHGVNLSIKPSYEVLDGLDVYARVGAALIRTDYKDYSIDAAGHSLKVS PTFAAGLEYALPILPELAMRLEYQWIENVGRDQKWSGYNDADFTPDIGAV TFGLSYRFGQGAAPVAAPEVVNKTFTLNSDVTFPFAKATLKPEATSTLDG IYGEIAQVNNVSVNVNGYADRHR >MS0561 ompC, OmpC protein MKKTIIALITSALFFSGGASAVTVYSAEGTKVNLDGRASFELINRTDKRS DLIDRGSRVRIHAYQDIGSGFTALANVEIRFTKDGDIGNQIYTKRLYGGF QHKLGSLTFGKQALLADSIGYSNFTYELGKITMMPKDADKAVRLLTDWFY 
GFRFGADYVFGTSEKYDDSDRELANKNRAYELAMFYNNKFGEFNVKGAVA YSQQKAGTLAKDEYDKKAMSTSVQLGYGKGAVGFDWTKGKSIEGKKDFKF RVGNNKFEEINLFEVGAKYAVTDKNNVYAEYLWGTGEIQGQDDGKFKGWF LGADHQFNKRVVTYLEGGSFKTKRSGDTLEKEKRIALGLRVYF >MS1337 ompC, OmpC protein MKKTLVALAVAATTAAMAVPATAATVYEQDGAKVELSGSFRAFLGRVGDD NRGDLKNDGSRVYVKASQDLGNGLSAFAGYQIRFEEEAYKTAQRGSDSDF GDPTTRELYAGIKHVDIGALSFGRQNTNSDDFLDDAAYYTSASLSPLTTR SDKSVKFKSAEWNGFSFGLDYLFGDSDKLDVTNDGNYKNGYAAVLFYHNA IGEHAYNLKALYSQDRYEGFGSETGVKKTQWGLHAGYNYGPFDAALSYVN YRTKFETGYFGTVGQVSLAREAGIIGDAKGNYILLDAGYRIIPESRLYVE WERLDAKADDADYTAAIRNQYTAGIDYRLHKNVVPYIEYAHTRTKFANAE TEKDNTFGVGLRVFF >MS1913 ompR, OmpR protein MAKILLVDDDTELTELLSELLSLEGFEVQIACNGEEALAKIDESYDIVLL DIMMPVLNGIETLKRLRQNFTTPVLMLTARGDEIDRVLGLELGADDYLPK PFNDRELVARIKAILRRSVLNKSASSEEETPFEERKAIEFAGLTLYPGRQ QVMYQGQDLELTGTEFALLCVLIKHPGEVLSRELLSLEALGKNLTSFDRS IDMHMSNLRKKLPTRPDDFPWFKTLRGRGYILLTD >MS1504 ompR, OmpR protein MLSPQILIVEDETVTRNTLKSIFEAEGYEVFEATDGNQMHQIIETQEINL VVMDINLPGKNGLMLARELREKTNTALMFLTGRDNEVDKILGLEIGADDY ITKPFNPRELAIRARNLLHRTMAENEKNSNTHVDAYRFNGWTLDINKRAL IDPESVEYKLPRSEFRAMLHFCENPGKIQTREDLLKKMTGRELKPQDRTV DVTIRRIRKHFEDHPDTPEIIATIHGEGYRFCGEIE >MS1246 ompR, OmpR protein MSSMMMRILLIEDDALIGNGIKVGLTKSGFSVDWFTDGKTGLQAIKSAPY DAVVLDLTLPGMDGMDILQQWRNEKIDTPVLILTARDTLNDRVTGLQRGA DDYLCKPFALAEVIARLQALIRRRYGQANPIVEHSLVKFDPNSRKVSLQG KDIPLTTREYNLLELFMMNKERVLSRSFIEEKLYNWDDEVSSNALEVHIH NLRQKLGKQFIRTVHGVGYALGKNEE >MS0740 ompW, OmpW protein MKKLALVLGISVALISGTVMAHSAGDVLIRAGGALVVPDVENSNPAWSGL DVNSNAQLGLTATYMVTDNIGVELLAATPFSHEIKLGNTLVGKTKHLPPS LYAQYYFLDKNSPVRPYVGAGVNYTTFFDEKEVLNGVTDLKLKDSWGLIT NIGLDINVTDNFYVNAAMYYAKIKSKATFKVGGVAQENKVTLDPTIFFLG VGYRF >MS0466 oppA, OppA protein MKKICTILTALFTATCVYADSTNNRLDYASTKDIRDINPHLYAGEMAAQN MVFEPLVINTNQGIRPFLAKSWRISEDGKSYLFHLRKDVKFTDGEPFNAF VAKMNIEAVLANFNRHAWLELVRQIDSVRAPDEFTLELTLKNPYYPTLTE LALTRPFRFLSPKCFNQGKTSQGVMCYAGTGPWILKKHKKNALADFSRNE NYWGELPKLNGVTWHVIPERQTMLLALLKGDIQLIFGADGDMLDMDSFKQ ISESGQFISAMSEANASRAIVLNSARTITSDQKVRQALQYAVDKAAIAKG 
VFNDTESIAETLMAKNVPYADVDVQTYPFNLLKAAQLLEEAGWNLSVGKN IREKAGKPLSLLLSYNINNAAEKEIAQLLQADFRKIGVDLQILGEEKQAY LDRQKNGDFDLQYSLSWGSPYDPASFVSSFRIPAHADYQGQKGLPNKTEI DEMIGELLITPNEQTRIKLYQKLFKTLAEQAVYVPLTYSKTKAIYSAQLE GVGFNPSQYEIPFEKMSFKK >MS2053 oppA, OppA protein MKLTTKFTLAALVLSAIGFVQAAETTFINCTSRAPTGFSPALVMDGISYN ASSQQVYNRLVEFKRGSVDIEPGLAESWDISDDGLTYTFHLRKGVKFHAN KEFTPTREFNADDVIFSFQRQLDSNHPYHKVSNGTYPYFNSMKFPSLLKS VEKLNDHQVRITLTRKDATFLASLGMDFLSIYSAEYADKMMRAGKPETID NQPIGTGPFVFAGYQVDKAVRFVANKDYWKGKAAIDRLVFSITPDAGTRY AKLQQGACDLAEFPNTADIERMKADKRIQMPSQESLNVAYIAFNTEKAPF DNVKVRQALNYAVDKNTILNAVYQGAGIAAKNPLPPTIWGYNDQVQPYEY NPEKAKQLLAEAGFPNGFETELWVQPVVRNSNPNPRRMSELVQSDWEKVG VKAKLVSYEWGDYIKRAKAGELTAGTFGWSGDNGDPDNFLSPLLAGVNAG NSNYARWKNAEFDALLDKAIGLTDKAQRAALYKQAQVIAHDQAPWIPMAH AVTYAPLSARVRDFKQSPFGYTSFYGVRVEDKK >MS1325 oppA, OppA protein MSNKMRSSLFSGKFSLVAKSAVIFCCFLSSVGCDRIKNLFSDTKQSVSEQ PAESMTSTKQIQTETVPEQHILSRGVYSDLVLNIRDVKSSEQADFMRDLF EGLVIFDIHGNIQPAVAESWETKDNKTWIFTLRQDAKWSNGEAVTAEDFA QAWKLLALSSSPLRQYLAFIHIDQAQEILEGKSDISQLGIKAQDEYHLQI SLDKPISYLPEMLAHIALLPAYSGGNSNKGELISNGAYKLAGQKADTISL VKNEFYWNAEKVSFPQVHYQKLADNTDVKKVDLVTDFRQIKMENVVNFPK LCTYFYEFNLKDQNLAKTAVRNALNSMISSHNIVRDSGLSGFAVSYFVPR NMEFESDESWQATVVEQILQQADFSEKNPLQFKLTYEQEGIHPNIANRLV RSWSQSDLISVKMEPVNWSQLQEKRAKGDFQIIRSGWCADYNDPSAFLNL LYSKNPDNKTGFSQERVDKLLEKAQQTISEPERNELYRQVLLISRQEHLF LPIFQYAKAVYLNPTLQGFDIHNPTEVIYSKDLSRKPMRQKN >MS0856 oppA, OppA protein MFIRKVTFIGFLLFSAMLPFFSWAAPRVPEILTQNGLIYCTHSSGFSFNP QTADAGTSMNVITEQIYNKLFEIKNNSSRLEPSLAQSYKISEDGKTITVY LRKGVEFHHTPWFTPSRNFNADDVVYSLNRVLGHNTSLPEFNASEQQKGM KRQYNIFHELAKKTRFPYFDSIKLNQKIESVTALDPYTVQINLFAPDASI LSHLASQYAIIFSHEYALQLNADDNLAQLDLLPVGTGPYQVKNYFRNQYV RLIRHENYWKKEAEIKNIIIDLSPDRTGRLAKFFNNECQIAAFPDVSQLG LLQENGERFQTTLSDGMNLAFLAFNFKRPLMQDAEIRRGIAQAINRHRII KDIYYNTASVANKIIPSVSWAGSDSNNHSFAYDYDPAQAKKVLQDRQLSL DMWVLKEEQLYNPSPIKMAELIKHDLTKAGIEVKVRLISRNFLMEQLRNN SENYDLILGGWLAVSLDPDSFMRPILSCGTTSEITNLSNWCSQSFEEILD RALISNSTNERAVNYHLAEQEVLSELPILPIASVKRILISNSNVQGVEMS PFGSISFEKLSFKKGEK >MS0462 oppF, 
OppF protein MSLLKVENLTKSYRTFNSLFSHLSHPALQNVSFQLEKGESVGLIGENGSG KSTLARIISGIEKADSGHVWLNGTDIYQRKNRRQQISVVFQDYFSSVNPT MTVLQAICEPLLEQKQAAAKSLEPLVVQFLKKVNLSTDCLHKYIYQLSGG QAQRVCLCRALINNPSLIILDEALSSLDIVTQVQLLELLIELKNEFQLSY FFISHNIQMICYLCERVLFFKQGQIITQSDIENLAEIKSDYAQKLIRSVI >MS1364 oppF, OppF protein MSESIKQATPLLEAVNLKKYYPVKKGLFAKPQLVKALDGVSFCLEKGQTL AVVGESGCGKSTLGRLLTMIETPTDGELYYNGQNFLENDKTTQKLRRQKI QIVFQNPYGSLNPRKKIGSILEEPLVINTDLTAAQRKARVLEIMAKVGLR AEFYHRYPHMFSGGQRQRIAIARGLMLQPDIVVADEPVSALDVSVRAQVL NLMMDLQKEMGLSYVFISHDLSVVEHIADQVMVMYLGRCVEQGRVEAIFK NPRHPYTQALLSATPRLSPKLSSERIKLEGELPSPLNPPKGCAFHTRCRL ATERCKQEQPLLKDYSDGTRIACFMVE >MS0852 oppF, OppF protein MSALLQIDNLSKSFTDNLGFFAEHKLQAVKHISFTLEKKQMLAIIGKNGA GKSTLAKMIVGIIPPTSGRILFKSTPLEFGDYKRRASHIRMVFQDVNNAF NPRLNVGQTLDDPLRLLTTLNERERNERIFETLRLVGLYPEHANVGINTL SISQKQRVALARALILEPEIIIIDDALGSLDASVKTQLTNLMLDLQERLS LSYIYVGQHLGIIKHCADKILVMDEGEMIEYGETRHVLTRPQNDITKRLI ESYFGKELDNSAWEKPEQNEM >MS2242 oraA, OraA protein MTTLAFSYAVNLLSRREYSEFEIRCKMQEKAFSEQEIEDTLAQLQQKNWQ SDKRFTENYLRARAQRGYGVNRIKQELRQLKGILPETVDEALMECDIDWS EIALNVLAKKFPDYRARQDAKNKQKIWRYMLSHGFFAEDFADFIGNGTED EFY >MS1512 orn, Orn protein MQLDNQNLIWIDLEMTGLDPENERIIEIATIVTDKDLNILAEGPVLAVHQ SDELLAKMSDWCIKTHSANGLVDRVKASKLTERAAELQTIDFLKKWVPKG ASPICGNSVAQDKRFLFKYMPELADYFHYRHLDVSTLKELARRWKPELLN GFEKKNTHLALDDIRESIAELAYYRDHFIKLDGDQK >MS2101 osmC, OsmC protein MKIFYHTSATATGGRDGHTRVDDGSIGFDLVGFQNESGKVGTNPEQLFAM GYAACFDSAMNHVAPTLNLKPTKSSTTVGVGIGQKPDGAFGLDLDITITV EGLSLEDAKTLINKAHEVCPYSNATRGNVDVRLHVNVIQSFDL >MS1291 osmY, OsmY protein MNMHKLKKLTFIIGSALLLQGCVAALVGGGAVATKVGTDPRTTGTQLDDE TLKFQVYNAVNKDEQIKQEGRIVVSSYSGRVLLLGQVPTESLKSVATSLA KGVDGVGDVYNEIRVGSPITVTQKTKDSWITSKIKSDMLLNSSVKTTDIK VITENGEVFLMGNVTQEQANAAAEVARNIAVLKKS >MS1345 paaI, PaaI protein MGIWTKDYTLSELNQIGEHCSVAHLAIRISAIEENWIEATMPVDQRTKQP FGLLNGGLSVALAETLGSIAGNLCLQEGQAAVGAEINASHLRPATSGLVT ARATPVKLGKTLQVWQIDIRNEQNKVCCTSRLTLSVINKNEK >MS2194 pabA, PabA protein MSKRLLIVNNHDSFTYNLVDLIRRLSVPMRVIEVEKLDLDEVEQFSHILL 
SPGPDVPEAYPEMFALLTRYYRHKAILGVCLGHQTLCRFFGGRLYNLRQV RHGVCGRLKVRSKSAIFSGLPEEFDIGLYHSWAVDSQNFPAELTITAECH EEVVMAFEHKTLPIYGVQFHPESYISEYGEQMLINWLNS >MS1150 pabA, PabA protein MATILFLDNFDSFTYNLVDQFRGLGHQVKIYRNDCDLALLESIALQPDTI LALSPGPGTPAEAGNMLALIQRVKSAVPIIGICLGHQALIEAFGGKVVHA GEVLHGKVSKINHDEQAMFLNLQNPMPVARYHSLKGSNLPEELVVNATYN DIIMAFRHKNLPICGFQFHPESILTVQGAKLLENSVNWLLNK >MS2293 pckA, PckA protein MTDLNQLTQELGALGIHDVQEVVYNPSYELLFAEETKPGLEGYEKGTVTN QGAVAVNTGIFTGRSPKDKYIVLDDKTKDTVWWTSEKVKNDNKPMSQDTW NSLKGLVADQLSGKRLFVVDAFCGANKDTRLAVRVVTEVAWQAHFVTNMF IRPSAEELKGFKPDFVVMNGAKCTNPNWKEQGLNSENFVAFNITEGVQLI GGTWYGGEMKKGMFSMMNYFLPLRGIASMHCSANVGKDGDTAIFFGLSGT GKTTLSTDPKRQLIGDDEHGWDDEGVFNFEGGCYAKTINLSAENEPDIYG AIKRDALLENVVVLDNGDVDYADGSKTENTRVSYPIYHIQNIVKPVSKAG PATKVIFLSADAFGVLPPVSKLTPEQTKYYFLSGFTAKLAGTERGITEPT PTFSACFGAAFLSLHPTQYAEVLVKRMQESGAEAYLVNTGWNGTGKRISI KDTRGIIDAILDGSIDKAEMGSLPIFDFSIPKALPGVNPAILDPRDTYAD KAQWEEKAQDLAGRFVKNFEKYTGTAEGQALVAAGPKA >MS0926 pcnB, PcnB protein MISKNALSVVEKLNRNGYEAYVVGGCLRDLLLDKKPKDFDVATNARPDQI QAIFQRQCRLVGRRFRLAHIMFGRDIIEVATFRANHSDIENENASKQSEE GMLLRDNVYGTLEQDAERRDFTVNALYYSPKDNLVYDYFNGIEDLKAGKL RLIGDPVTRYQEDPVRMLRSIRFMAKLDMFLDKSSAPHIRKLAHLLKNIP PARLFDESLKLLQSGQGVKTYNLLREYHLFEQLFPSLMPYFTERGDSFAE RMILTALTSTDERIADKLRVNPAFLFAAFFWYPLREKVEILKNEGGLNNH DAYALASNEILDLFCKNLAAPRRHTATIRDIWFLQLQLLKRNGKAPERTM EHNKFRAAFDLLAMRAEIEGGEAIELSAWWHEYQLSTDEQRSALVKEQDK QHPQGKKKFYRPRRRKPKVKTATNP >MS2312 pcnB, PcnB protein MQTYLVGGAVRDQLLNLPVKDRDWVVVGATPEQLLSLGYQQVGRDFPVFL HPKTKEEYALARTERKSGAGYTGFICDFSPHISLEQDLIRRDLTINAIAQ DNQGKFIDPYEGISDLKNRTLRHISPAFAEDPLRVLRVARFAARYHQLDF SIAPETIALMAEITEKGELQQLTIERVWQETEKALKEKNPEIYFQVLLQV GALKILFPELYALYGVPNPAQYHPEIDSFLHTMLVLQQAVRLTENTEFNK SAVRFAAICHDLGKALTPKDILPHHYGHEKAGIQPIRTLSNRLKVPTYYK ELAEFTCEYHSYIHKAFELKPETVIKLFNKLDVWRKPQRFEELMLVCVAD TRGRTGFEQTAYPQKDYLRQLYQTALQVNVQQVIEDGFEKQGIRDELTRR RTIAVKTKKAEILPRFVGQ >MS0066 pdxA, PdxA protein MKPILGITMGDAAGIGPEVIIKALEDKRIYDLAHPVVVGDFKIMQRALPI VKSNLKLRKVDDVDHYQSEFGYIDVIDLDNLPADLPFAKVDARAGKAAYE 
FIERAVDLTLKGKIHAIVTAPLNKEALHAGGKMFPGHTEILAKLSNTEDF SMMLTSEKLNVIHVTTHVSMRQACDLIKKERVLTVIELAQEYTKMLGFKE PRIAVAGFNAHAGEHGLFGTEDEEEILPAVKEAQAKGINVIGPIPPDTVF HRAANLDEFDMVVVMYHDQGHIPLKLIGFDSGVNVTTGLPFIRTSVDHGT AFQIAGQGIADSRSMTEALYLGAKMANIKYSNQ >MS0381 pdxA, PdxA protein MKKPVIALTMGDPAGIGPEITVDTMLSEEIHNVCKPFVIGSIPILERAAK VRGVSIKFNKIQDPSEAKYELGTFDVLETGNYDTDSIKWGEVQKLAGQMS YDWVLKSIELGMAKKIDAVSTAPIHKIAIKLAGVKEPGHTEIYQNETHSE YGLTMFSCHKLRVFFVSRHMSVIDACKYATKERVLQDVRNIDKELRNIGI ENPFIAVAALNPHGGDNGLFGREEIDELIPAVEAAKAEGINATGPVPADS VFHIGKSGKYDAILSLYHDQGHIACKTLDFEKSVTITFGLPFMRSSVDHG TAFDIAGKNIANGVSMIESTKVLAEYTAKYMNKKA >MS2064 pdxH, PdxH protein MIDLHNIRNEYSQQALSEKQCDDDPLKQLEKWLNEAIQAKVNEPTAMNVA TVGENGKPSSRVVLLKEVNERGLVFFTNYHSHKGRDLAVNPFAAVNLFWA ELQRQVRVEGRVERISPQASDEYFASRPYTSRIGAWASEQSAVISGKNSL LTKAALIAAKHPLQVPRPPHWGGYIVIPELIEFWQGRPSRLHDRIRYRLE KGEWVRERLSP >MS0805 pdxK, PdxK protein MKNVLSIQSHVVFGYAGNKSATFPMQLMGVDVWALNTVQFSNHTQYGKWT GMVIPKEQIGEIIRGIDEIGELKNCNAVVSGYLGSAEQVDEIIKAVEKVK SLNPQALYLCDPVMGHPDKGCIVADGVKEGLINLAVSHADILTPNLVELR EISGLPVENFEQAIEAVKVIRAKGPKTVLIKHLSKVGKYADKFEMLLAND EGIWHLTRPLYTFAKEPVGVGDLTAGLFLANKVNGKSDLEAFEHMANAVN EVMKTTFELNSYELQLIAARKLIVNPVSSVKAVKIA >MS1550 pepB, PepB protein MKYSVKQTALEQENKSLFIAIFENQELSPAALKLDLKLKGEITEAVKNGE VSGKIGRILVLRHGAQRIILVGCGKQNEVTERQYKQIIQKAVKTAKETIA TTIINALTEVKIKDRDLYWNVRFAVETIEEDNYIFEQFKSKKSENNSKLA EIIFYTEENHEQAELAIRHATAISSGVKAAKDIANCPPNICNPAYLAEQA NQLAGRSSLIETTVIGEKEMRKLGMNAYLAVSCGSKNEAKLSVMEYRNHE NPNAKPIVLAGKGLTFDAGGISLKPAADMDEMKYDMCGAASVYGVMNAIA ELQLPLNVIGVMAGCENLPDGNAYRPGDILTTMSGLTVEVLNTDAEGRLV LCDTLTYVERFEPELVIDVATLTGACVVALGQHNSGLVSTDDNLAQDLER AAKLANDKAWRLPLSEEYQEQLKSKFADLANLGGRWGGAITAGAFLSNFT KNYPWAHLDIAGTAWLQGQNKGATGRPVSLLVQFLLNQVK >MS0667 pepB, PepB protein MQIQLSNLPAPKSWGKNPLLSFSDNQATIHLENSEKSDRTLIQKAARKLR GQGIDDVELVGNDWSLENCWAFYQGFYTAKQDWAVEFPELGDDHEELLAR IQCGDFVREIINLPSSVITPLELAQRSARFIAGLAEEYAGKSAVDFHIIS GEELKAQNYLGIWNVGKGAENPPAMLQLDFNPTGNPESPVLACLVGKGIT FDSGGYSIKPSNFMDSMRTDMGGAALVTGALGLAIARGLNRRVKLFLCCA 
ENLVSGNAFKLGDIITYRNGVKAEILNTDAEGRLVLADGLIDASSENAQF ILDAATLTGAAKVALGNDYHCVLSMDEELTTDLFNAAKEEQEPFWRLPFE ELHRSQISSSFADISNTSSAALAAGASTATAFLSHFVKDYQQNWLHLDCS ATYRKTPSDLWATGATGLGVQTIANLLLTKATQL >MS0815 pepD, PepD protein MQYNEQLLERFFNYVSLDTQSKPGAKTSPSTQGQLKLAKILEQELYSLGL DEIEVSKHGIVTALLPGNIENSPTIGLIAHLDTSPQCSGKNVKPEVIENY RGGDIALGLGDEFISPVTFTFLHKLVGKTLIVTDGTTLLGADNKAGIAEI MTALSQLKESSVPRCHIRVAFTPDEEIGLGMKFFPIEKFSCDWAYTIDGG AVGELEYENFNAAGATVTIFGRAIHPGSAKDKMVNALTLACEFQQGFPTD EVPEKTEEKQGFFHLNSFHGDIEKVELHYLIRDFDKQAFTQRKAFLEKWV DEFNCRKQLKEPVKVTITDNYYNMYDTVSKVPQSIELADSAMKACGIVPI HQPIRGGTDGAWLAEKGLACPNIFTGGYNFHSKHELITLEGMCSAVDVIM KIAQLAVK >MS2118 pepD, PepD protein MSEIQSLQPQLLWKWFDQICSIPHPSHHEEQLAEFIVNWAKGKGFYAERD EAGNLLIRKPATKGMEHCQSVALQAHLDMVPQANEETDHDFTSDPIQPYI DGEWVKAKGTTLGADNGIGMASALAVLDSENLAHPALEVLLTMTEEVGMD GALGLRKNWLQSEIMINTDTEDNGEIYIGCAGGENADLTVPVQWQENNYE HCYQISLKGLRGGHSGCDIHTGRASAIKTLARFLANLQQNQPHFEFSLSE IRGGSVRNAIPREAFATLCFNGEPANFTQGVKSFESLLKTELAIAEPDLQ LTAQPAEKATKVFAPNTKNNVVNLLNALPNGVIRNSDVVENVVESSLSIG VLKTTEDAVKGTILVRSLIESGTNYINGLLISLTELCGASVQFSGRYPGW EPHAETPILTLTKEIYGELLGYEPAIKVIHAGLECGLLKKIYPALDVVSI GPTIVNAHSPDEKVHIPAVRTYWELLTKVLAGIPAKK >MS1554 pepE, PepE protein MQAVLSPLNMEIISGKMLRHNGESREEHLAEFLIVNPTALVYAHPESTAL HIEGRQATILE >MS1034 pepN, PepN protein MHAKAKYRKDYKKPDFTVTDIHLDFQLDPQKTVVTAHSQYQRLNPAATVL RLDGHSFQFASIKVNGKDFATYQQDGESLTLDLSDIDAERFELEVITRLV PAKNTSLQGLYQSGEGICTQCEAEGFRQITYMLDRPDVLARYTTKITADK SKYPYLLSNGNRIAGGDLEDGRHWVEWNDPFPKPSYLFALVAGDFDLLED SFTTKSGREVKLELFVDRGNLNRASWAMESLKKAMKWDEERFDLEYDLDI YMIVAVDFFNMGAMENKGLNVFNSKYVLANPETATDEDYLNIESVIGHEY FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDVGSRAVNRIKNVKFLR TAQFAEDASPMSHPIRPEKVLEMNNFYTMTVYEKGAEVIRMMHTLLGEKK FQQGMKLYIAENDGKAATCEDFVAAMEQASGVDLTQFRRWYSQSGTPELT VTDSYDEKKRSYKLYVSQMTAPTADQMDKVNLHIPLKIALYDMNGMPFSL IKDDEAVNDVLDILLEDQVFEFHNITSKPVPALLCDFSAPVKLDYDYSTA QLIALLKFAHNEFVRWDAMQMLFAQELRRNLSAYQQGEQLTFSAEILSAL QQVLENYQSNVELTTLILTLPKETEFAELFKTIDPEGIAVVCDFMQHAIA EGLQDLWLKTYHQINLEEYCIDMRDIALRGLRNLCLQYLAFTDYGNALVN 
KHYLYADNMTDKLAALAAATKAQLTCRDKVMKDFEEKWQHDGLVMDKWFN LQATRPDGNVLTLVKQLMDHPSFNFNNPNRLRALVGSFESQNLRAFHAVD GSGYRFLTDVLLRLNESNPQVAARLVEPLIRFSRYDSQRQTLMKRALERL REVENLSNDLFEKIEKALQ >MS0479 pepP, PepP protein MDLAYMAELPADEFVLRRQKLAAQLTDNSVFIVFSEVEKRRNNDCTYPFR QDSYFWYLTGFNEPNSALVIQKKGKLVETTIFVRPSNPLMEIWNGRRLGV ERAAEKLHLDQAFSIDDFARIFGKICQNSTALYHYQGLQPWADQLLAETF ISPPDYINWAPMLDEMRLFKSANEVRLMQQAGQITALGHMKAMRQTRPNR FEYEIESEILHEFNRFGARYPAYTTIVAGGENACILHYTENDQPLKDGDL VLIDAGCEFAMYAGDITRTFPVNGKFTQAQREIYQIVLNAQKRAIELLVA GNSIQRANDEVVRIKVKGLLDLGIMRGDIDELIANNAHREFYMHGLGHWL GLDVHDVGSYSKEGQNGDRNSKVRDRPLEIGMVLTVEPGLYISPKSDVPE QYKGIGVRIEDNILITEYGNKVLTAAAPKEIGDIEALMATER >MS1958 perM, PerM protein MIEMLKNWYLRRFSDPQAMGLAAILFFGFVAIYFFSDLIAPLLIALVLAY LLEMPISFLSDKLKLPRFLSILLILGGFIAVTILMIFGLIPTLINQTVNL FSDLPNMLNLSHQWVMSLPESYPELVDYQMIDSLFITIREKTLAFGESAV KFSLSSLMNLVTIGIYAFLVPLMVFFMVKDQDELIAGFSRFLPKNRTLAS KVWQEMQLQIANYIRGKLFEILIVAVVSYIIFLFFGLRYPLLLAVAVGLS VLIPYIGAVLVTIPVALVAIFQFGATPTFGYLMTAYIVSQLLDGNLLVPY LFSEAVNLHPLTIIIAVLIFGGLWGFWGVFFAIPLATLVKAVVNAWPSNE DEAIS >MS0428 perM, PerM protein MNKSVSVNQFLIGFAALVIILAGIKMAGEIVVPFLMSLFIAIICSPIIKF MTNRKIPHWLAISILFLFIVLVFFFLLGLVNSSIREFSQSIPQYRVLMSE RLNEITALIQKWNLPLNLEKETILEHFDPSSIMNFVSRLLLSFSNVLSNA FVLILVVIFMLLEAPTAKRKVALALSGNEKDASKEEKHLERILQGVISYL GVKTAVSLLTGLCAWVLLETCGVQYAVLWATLTFLFNYIPNIGSIIAAIP IVLQALLLNGFSTGFAVMTGIIAINMLIGNFLEPKLMGRTLGLSTLVVFL SLLFWGWLLGTVGMLLSVPLTMALKIMLEASPNTTKYAALLGDVEESN >MS0377 pfkA, PfkA protein MIKKIAVLTSGGDAPGMNAAIRGVVRSALFEGLEVFGVYDGYYGLYHNKI KQLNRYSVSDVITRGGTFLGSARFPEFKNPEVRAKCAEILRSHGIDALVV IGGDGSYTGAKLLTEEHGIQCIGLPGTIDNDAPGTDYTIGYQTALETAVD AIDRLRDTSSSHQRISIVEIMGRHCSDLTINAALAGGCEYIVASEVEFDQ EELIQQIERSIANGKRHAIIAITELITDVHELAKRIEERVHHETRATVLG HVQRGGSPCAFDRILASRMGVYAVDLLLQGKGGYCVGIQNEQLVHHDIID AINNMRRSFKAELLDMNERLF >MS0403 pflA, PflA protein MSVLGRIHSFETCGTVDGPGIRFILFLQGCLMRCKYCHNRDTWDLHGGKE ISVEELMKEVVTYRHFMNASGGGVTASGGEAILQAEFVRDWFRACHKEGI NTCLDTNGFVRHHDHIIDELIDDTDLVLLDLKEMNERVHESLIGVPNKRV 
LEFAKYLADRNQRTWIRHVVVPGYTDSDEDLHMLGNFIKDMKNIEKVELL PYHRLGAHKWEVLGDKYELEDVKPPTKELMEHVKGLLAGYGLNVTY >MS0401 pflD, PflD protein MAELTEAQKKAWEGFVPGEWQNGVNLRDFIQKNYTPYEGDESFLADATPA TSELWNSVMEGIKIENKTHAPLDFDEHTPSTITSHKPGYINKDLEKIVGL QTDAPLKRAIMPYGGIKMIKGSCEVYGRKLDPQVEFIFTEYRKTHNQGVF DVYTPDILRCRKSGVLTGLPDAYGRGRIIGDYRRLAVYGIDYLMKDKKAQ FDSLQPRLEAGEDIQATIQLREEIAEQHRALGKIKEMAASYGYDISGPAT NAQEAIQWTYFAYLAAVKSQNGAAMSFGRTSTFLDIYIERDLKRGLITEQ QAQELMDHLVMKLRMVRFLRTPEYDQLFSGDPMWATETIAGMGLDGRPLV TKNSFRVLHTLYTMGTSPEPNLTILWSEQLPEAFKRFCAKVSIDTSSVQY ENDDLMRPDFNNDDYAIACCVSPMVVGKQMQFFGARANLAKTMLYAINGG IDEKNGMQVGPKTAPITDEVLNFDTVIERMDSFMDWLATQYVTALNIIHF MHDKYAYEAALMAFHDRDVFRTMACGIAGLSVAADSLSAIKYAKVKPIRG DIKDKDGNVVASNVAIDFEIEGEYPQFGNNDPRVDDLAVDLVERFMKKVQ KHKTYRNATPTQSILTITSNVVYGKKTGNTPDGRRAGAPFGPGANPMHGR DQKGAVASLTSVAKLPFAYAKDGISYTFSIVPNALGKDDEAQKRNLAGLM DGYFHHEATVEGGQHLNVNVLNREMLLDAMENPEKYPQLTIRVSGYAVRF NSLTKEQQQDVITRTFTQSM >MS1478 pfoR, PfoR protein MKNRLKNFLIRQNIKFSLRRYAIDAMNFMALGLFGSLIIGLILKNTGDWL DILWLNELGALAQSSMGAAIGVGVAYALKAPPLVLLSSTTTGIAGATLGG PIGCFIAAAIGAEFGKLVNKTTPIDILITPAVTLLSGIATAQFMGPFLAS LMRETGAMIMWAVELHPIPMSILVSVLMGMILTLPISSAAIAVTLSLSGL AAGAATIGCCAQMIGFAVIGFKENRWGGLLSLGLGTSMLQIPNIVKNPKI WVPPTLSGAIIAPFATVIFQMQNIPSGAGMGTSGLVGQIGTINAMGNSPY IWLVILVLHFILPAILSLLITYLMRRKGWIKPGDLKLAV >MS1537 pfs, Pfs protein MKIGIVGAMKQEVEILANLMRNQTVTQVAGCTIYEGLINGKQVALLQSGI GKVAAAIGTTALLQLAKPDVVLNTGSAGGVADGLKVGDIVISTETAYHDA DVTAFGYAKGQLPACPATFISDEKLTALAKQVAQAQGHNVKRGLICSGDS FIAGGERLAQIKADFPNVTAVEMEAAAIAQVCHVFRVPFVVVRAISDAGD GQAGMSFEEFLPIAAKQSSAMIIGMLEQL >MS1181 pgi, Pgi protein MQNINPTSTAAWKALEAHKGTLENTTINDLFQQEKNRFADYSLTFNNEIL VDFSKNKITRETLNLLRRLAKECALDEAKEAMFSGEKINRTENRAVLHTA LRNRSNTAVLVDGKDVMPEVNEVLAKMKAFSERVISGEWKGYTGKAITDV VNIGIGGSDLGPYMVTEALRPYKNHLNMHFVSNVDGTHIAEVFKKTNPET TLFLVASKTFTTQETMTNAKSARDWFLATAKDEKHVAKHFAALSTNAAEV EKFGIDTDNMFGFWDWVGGRYSLWSAIGLSIILSVGFENFEALLSGAHEM DKHFRNTPIEQNIPATLALVGLWNTNFQGAQTEAILPYDQYMHRFAAYFQ QGNMESNGKYVGRDGKVISNYQTGPIIWGEPGTNGQHAFYQLIHQGTTLI 
PCDFIAPAKTHNPLADHHNKLLSNFFAQTEALAFGKTKETVEAEFLKAGK SLDEVKDVVPFKVFAGNKPTNSILLQEITPFSLGALIAMYEHKIFVQGVI FNIFSFDQWGVELGKQLANRILPELSGDEQVTGHDSSTNGLINQFKAWR >MS0245 pgk, Pgk protein MSVIKMTDLDLAGKRLFIRADLNVPVKDGKVTSDARIRATIPTLKLALEK GAKVMVTSHLGRPTEGEFKPEDSLQPVVDYLKDAGFNVRLAQNYLDGVEV NEGEIVVLENVRINKGEKKNDPELGKKYAALCDVFVMDAFGTAHRAQAST YGVAEYAPVACAGPLLAAELDALGKALKEPQRPMLAIVGGSKVSTKLTVL DSLSKIADQLIVGGGIANTFIAAEGHNVGKSLYEEDLIPEAKRLAAATNI PVPVDVRVGTEFSENAPATEKSVTEVQADESIFDIGDKSAEELAKIIKSA KTILWNGPVGVFEFPNFRKGTEVISNAIAEATANGAFSIAGGGDTLAAID LFGIKDKISYISTGGGAFLEFVEGKVLPAVEILEKRANG >MS0973 pgpA, PgpA protein MMAHSSSTTNPLDKLSLRNPVHLLALGFGSGLIRPAPGTWGSLAAIIIGA PILHWIGTVPFLVLILLGFALGVYLCQKTADDMGVHDHGAIVWDEFIGIF ITLLAIPQISLFWCIAAFVLFRIFDIIKPYPISYFDKRLESGFGIMVDDV LAAVYAAISLFLLHYII >MS1532 pgpB, PgpB protein MYLMLKCTLSIFRENFIMLKRLSLYTLLLCLVPIFVWISGWHWQGDAALT QFDYFLYWLTETGSSPYAIITCGVFALLFFPFAKTKKQWVAVVAIMAVSM VVTQGLKTGLKHVFAEARPYVVELAANSDISTEYFYDQTKEQRQSIVTDY YSSRAETPGWLVEHRENEVSYSFPSGHTIFAVSWLLLAAGFFRLLGQTSS GAKILLILTALWAFLMLVSRLRLGMHYPLDLLISTLIAWVLHCALFVFLE KKRLFPRD >MS0959 pgsA, PgsA protein MTMKLNFPTFLTLFRVALIPFLIIAFYLPFGWSAFLSTAIFFVASITDWF DGYLARKWKQTTRFGAFLDPVADKVIVATALVLIVEYYHVFWITIPAIVM ISREIIISALREWMAEIGSRTTVAVSWIGKVKTTAQMFALGCLLWRYQYW MEALGIILLYIAAILTVWSMIQYLKAAKDYLLEEINS >MS1657 pheA, PheA protein MALDLSEIRQQITQIDRSLLKLLSERHRLAFDVVRSKEITQKPLRDEKRE QQLLQELINFSENENYQLEPQYITQIFQKIIEDSVLTQQVYLQKKLNEQR EQSIHIAFLGKRGSYSHLAARSYATRYQEQLIELSCSSFEQIFEKVSSGE ADYGVLPLENTTSGSINEVYDLLQHTDLSLVGELTYPIKHCVLVNGQDDL SKIDTLYSHPQVIQQCSQFIRSLNKVHIEFCESSSHAMQLVSSLNKPNIA ALGNEDGGHLYGLTVLRSNIANQENNITRFIVIARKAITVSPQIHTKTLL LMTTGQEAGSLVDALTVFKKYQIKMTKLESRPIYGKPWEEMFYLEIEANT NHPDTQAALEELRQYSTYLKVLGCYPSEIVKPVDVR >MS1091 pheS, PheS protein MQNLKEITEQARAALDELHDKGLDALEAFRVEYFGKKGHFTQLMQSLRNV AAEERPAVGAKINEAKQAVLDILNAKKEAFEKAALNAQLEKERIDVSLPG RKVELGGLHPVSLTIERVTKFFSELGFSVESGPEIESDYYNFDALNIPKH HPARADHDTFWFNPELLLRTQTSGVQIRTMEKKQPPIRIMVPGKVYRNDY DQTHTPMFHQIELLYEDKKVNFTELKGVLYDFLRAFFEEDLQVRFRPSYF 
PFTEPSAEVDIMGKNGKWLEVLGCGMVHPNVLRSVGIDPNEYSGFAAGMG VERLTMLRYNVTDLRSFFENDLRFLKQFK >MS1090 pheT, PheT protein MKFSEQWVREWVNPAVNTEQLCDQITMLGLEVDGVEAVAGEFNGVVVGEV VECAQHPDADKLRVTKVNVGGERLLDIVCGAPNCRQGLKVACAIEGAVLP GDFKIKKTKLRGQPSEGMLCSYRELGMSEDHSGIIELPADAPVGKDFREY LILDDKEIEISLTPNRADCLSIAGVAREIGVVNQLAVTEPAINPVPVTSD EKVAINVLAPEACPRYLLRSVKNVNVNAETPVWMKEKLRRCGIRSIDPIV DITNFVLLELGQPMHAFDAAKLAQPVQVRFAADGEELVLLDGTTAKLQSN TLVIADQTGPLAMAGIFGGQASGVNAQTKDVILEAAFFAPLAITGRARQY GLHTDSSHRFERGVDFELQHKAMERATSLLVEICGGEVGEICEVVSETHL PKLNKVQLRRSKLDALLGHHIETETVTEIFHRLGLPVSYENEVWTVTSAS WRFDIEIEEDLIEEIARIYGYNSIPNNAPLAHLSMREHHESDLELSRIKL ALVGNDFHEAITYSFVDPKLQSILHPEQAVWILPNPISSEMSAMRVSLLT GLLGAVVYNQNRQQNRVRLFETGLRFIPDESAEFGIRQELVFAAVMTGSR LSEHWASKAEPADFFDLKGYIENLLSLTKAGPYIKFVAKEFPAFHPGQSA AIVLDGEEIGYIGQLHPMAAQKLGINGKAFACELIVDKVAERNVANAKEI SKFPANKRDLALVVAENIAASDILDACREVAGSKLTQVNLFDVYQGQGVP EGHKSLAISLTIQDTEKTLEEDDINAVISVVLSELKDRFNAYLRD >MS0619 phnA, PhnA protein MDQMPKCPKCNGEYVYHDSVNFVCPDCSYEWSGEDVAEEEETKVWKDSNG NVLQDGDDVILVKDLKVKGSSIVLKKGTKAKNIRLVDGDHDVDCKIDGQP FSLKSEFLKKA >MS1186 phnL, PhnL protein MNNGILLNCQNLTKDYIEGSVTTRVLKDVTFSMNDKELVAIVGSSGSGKS TLLHTLGGLDQPTSGEVFIKGKSLQKASQDELAKLRNTYLGFVYQFHHLM ADFTALENVLMPMLIGNQNKTEAKDRAEKMLNAVGLSHRITHKPSALSGG ERQRVAIARALVNNPALVLADEPTGNLDHKTTESIFELIQQLNEDQGIAF LLVTHDLNLAEKLNRRLIMQDGVLRPEM >MS1682 phoH, PhoH protein MTTQYKQEFTLTPQDNARLQSLCGAYDDNIKLIESEFNLDIARRNFTFII QSKDKQSKPHHEALIKSAVKLIQDLYVETAPVRGKIKELDLSDVHMAIQE SRMLLQPAQTDEKADTDESKVYTTTIKTKRGLIKPRGKNQIEYLHNILIH DISFGIGPAGTGKTFLAVAAAVEALERQEVRRILLTRPAVEAGEKLGFLP GDLGSKIEPYLRPLYDALYEMLGFERVEKLMERNVIEIAPLAYMRGRTLN DSFIILDESQNTTVEQMKMFLTRIGFNSKAVITGDVTQVDLPRSQKSGLK HAIEVLEKVEELSFNYFDSKDIVRHPVVAKVVQAYESWEAEDEIRKRKLA EQRRAERAEIAENLKVD >MS1917 pilF, PilF protein MMTFKFQQKLTALFSLFISLLLSACSSSPQVNAENLAKQQAAKARIELGL AYLHQQNINQAKQNFDKALEHAPQYYLTHSALAYFYQQLGDVKQARIHYQ KAIDLDNNQGDVHNNYGTFLCSQGQFEQAYDEFEQALQSPNYYRQTDSYE NLALCALSAKNNERFQRYLTKLEKLDPKRAAKLENISN >MS2310 pitA, PitA protein 
MELLHNYGTVIIFITAAFAFFMAFGVGANDVSNAMGTSVGSGTITARQAI YIALIFEAAGAYLAGGEVTETIKSGIIDPMDFVTHPDTLVLGMMSALFAS GSWLLIASRWGWPVSTTHSIVGAIVGFGCITAGAGAVKWSALTGIVGSWF ITPFIAGVLAYGIFFCIQKLIFDTEHPLRNAQKYGPHLMGATVFIICIVT VAKGLKHVGLNLTGLETLLISIALSLVSVVISYFYFRSKKFIKKVHKGVF GGVEHVFSILMLMTACAMAFAHGSNDVANAIGPLASVVTIVESGGDIAAN APIAWLVLPLGAAGIAVGLIVMGYKVMATIGTGITDLTPSRGFSAQFATA ATVVVASGTGLPISTTQTLVGAVLGIGFARGIAALNLTVIRNIIASWIVT LPAGAFFAIIIYYILDAIFR >MS0004 pldB, PldB protein MLNREPKFSNFALTELLPFAERCPLNYIKGKNNIKIAYRHFKHEHDGNDR LLVLVNGRAENLLKWTEVAYDFYRQGYDVLSFDHRGQGYSDRLLKNRDKG YIDEFRYYTDDMAAVIAEAYSYRQYSSCHLVAHSLGALISTYYLANYDHH IKSAVLSAPFYGLQLRRPFIDQIIINLMILFGQGKRYVPGKEGYKPANLN NNDLSFCKTRMKWMNRINRNHPAIHLGGPTFRWVHLCLNAIKGLPKIIPR IEIPILILHSDKEKIVNNKNLQKLTALFQHVQVEEIQNAKHEILFERDAL RARAIQRISKFFTDFK >MS0745 plsB, PlsB protein MQDVNLGVGKCIMTSLLNLYRKVLEAPLSFLVKNNPIPSNPIEELKLNVS QPIVYVLPYTSQTDFVIFRKNCLSVGLPDPIETNDIHGKQLPRYVFLDEG RQIFKSKGPKKETEKVFYNYLELHRAFGDLDVQVIPVSVLWGRAPGREDK GKLPQLRLLNGMQKTIAAMWFGRDTFVRFSQAVSLRYMVTEHGADQSIAQ KLARVAKMHFAKQRFSATGPRLPNRQAMFNKLLQSPVILSAIADEAKSKN MSRERAHQEAEKILKEIAADVSYENLRVLDRLLRWLWNKLYQGIDIENAD RVRQLALEGHEIVYVPCHRSHIDYLLLSYVLYHQGLVPPHIAAGINLNFW PAGPIFRRSGAFFIRRTFKGNRLYSTIFREYLGELFHRGYSVEYFIEGGR SRTGRLLTPKTGMMSMTLQALQQRQTRPITVVPVYIGYEHVLEVDTYAKE LRGAAKEKENAGLVLRVIKKLRNLGKGYVNFGEPITLSNYLNQHYPEWKN TDDEKPTWFNKAVDSISNQVMVNINNAAAVNAMNLTGTALLSSRQRALSR EQLLEQLQSYQEFLQNVPYSDDVIVPAESPEEMLKHVLGLERVGVLVEKD SFGELVRLERNSAVLMTYYRNNIQHLFALPSLVASIILHYEKIHNGELLH AVQRIYPFLKNELFIHIEKEELTLVVEKIIAEFHRQKLIDVDGDVFGIND RGIRTLQLWASAVREILQRYRITIAILQYKPDIARNALEKESQSVAQRLS VLHGINAPEFFDKAVFAEFTASLKDNGYFDEAGNAVTEKLDELADILNHI ISAEVNLTIRSAIEKAEEMPAQE >MS0510 plsC, PlsC protein MLKLIRIILVAICCVLICVLGTIFSLIRFRHPSNVGVMARWFGRLYPLFG LRVEHRFPDNVDQNVPAIYIGNHQNNYDMVTISYMVRPRTVSVGKKSLIW VPFFGILYWATGNIFLDRDNKNKAHNTMTELARRIQQDNISIWMFPEGTR SRGRGLLPFKTGAFHAAISAGVPIIPVVCSTTHKKIDLNRWNNGKVICEI MQPIDTQSYSKENVRELASHCYDLMKKRIAELDAELAQVGK >MS1870 plsX, PlsX protein MSRLTLALDVMGGDIGPRITIPASIKALEKDPMLSLLLFGDSQQINPLLE 
QVPSALKERLRVCHCSRVVENNQGLSYALRHSKGTSMRLAIEAVQKGEAQ GCVSAGNTAALMGLSKVLLQPLKGIDRPALISLLPTMDGGRTVMLDLGAN IDCNANNLYQFALMGAIFAENQLDLVFPRIALLNIGIEEIKGYKSIREAA DLLTGNSSLNYIGFIEGNLLLNGKADVIVSDGFVGNIALKTLEGAAKNVI SLIKGKSRNHLLKPLFNWLIKLLFKDSYQRLQKINPDQYNGASLIGLTSI VVKSHGAANIEAFNNAIHDAALQARQQIPEKILAGLQK >MS1889 pncB, PncB protein MQNSALLMDFYALTMANSYFEQGRQDEIAYFDYFFRRVPDEGGYAVFAGL EQLLDYLENLQFSEQEIAFLQQKQIFSEGFLDYLRHFRFRGDLWAVKEGT AVFPAEPLVVIRAPIIDCTLIEAFLLLTLNHQTLIATKAARIVSVAQGRN VLEFGARRAHGVDAAHFGARAAYIAGVDGTSNVYSDFACGIPALGTMAHA YVQSFDNEYEAFLQYAKTYPDNTVLLVDTYDTLHQGIPNAIRVHREYLAP RGYKLKGIRIDSGDLAYLSIRAREMLDAAGLTETKITVSNSLDEYLIKDL LLQGAKIDSFGVGERLITAKSEPVFGGVYKIVALENAGQIIPKIKLSETL QKTTTPGFKNLWRLYDKNRKAIADVITLHDEIIDSTKPYMLFDPEYTWKT KWVTDFVAEQKLQQWISQGGKQQPIPTLEESRAHCRQELQSLWNEVRRLE KPHGYYVDLSQDLWTLKRHLIERHSGKKE >MS0493 pnp, Pnp protein MNPIVKQFKYGQHTVTLETGAIARQATAAVMASMDDTTVFVSVVAKKDVK EGQDFFPLTVDYQERTYAAGKIPGGFFKREGRPSEGETLIARLIDRPIRP LFPEGFLNEIQIIATVVSVNPQISPDLVAMIGASAALSLSGVPFNGPIGA ARVGFINDQFVLNPTMAEQKQSRLDLVVAGTDKAVLMVESEADILTEEQM LAAVVFGHQQQQVVVEAIKEFVAEAGKPRWDWVAPEPDSALISKVKAIAE NRLGDAYRITEKQVRYEQIDAIKADVIAQITAEDEEVSEGKIVDIFTALE SQIVRGRIIAGEPRIDGRTVDTVRALDICTGVLPRTHGSAIFTRGETQSL AVVTLGTERDAQILDELTGERQDTFLFHYNFPPYSVGETGRVGSPKRREI GHGRLAKRGVAAVMPSISEFPYVVRVVSEITESNGSSSMASVCGASLALM DAGVPVKSAVAGIAMGLVKEDDKFVVLSDILGDEDHLGDMDFKVAGTRTG VTALQMDIKIEGITPEIMQIALNQAKSARMHILGVMEQAISAPRAEISEF APRIYTMKIDPKKIKDVIGKGGATIRALTEETGTSIDIDDDGTVKIAAVD GNAVKTVMARIEDITAEVEAGAVYTGKVTRLADFGAFVAIVGNKEGLVHI SQIAEERVEKVSDYLQVGQEVQVKVVEIDRQGRIRLTMRDLGSKEESQEL SVEQ >MS1224 pntA, PntA protein MLIGVPRELLDNESRVAATPKTVQQILKLGFDVIIEHDAGFKASFEDNAF EQAGAKIGDQQAVWNADVIFKVNPPTDDEIALMKEGSTLVSFIWPAQRPD LMEKLSTKNVNVLAMDAVPRISRAQALDALSSMANISGYRAVIEAANAFG SFFTGQITAAGKVPPAKVLVIGAGVAGLAAIGAANSLGAIVRAFDSRPEV KEQVQSMGASFLEIDFKEEGGSGDGYAKVMSEEFNRRAMELYAEQAKDVD IIITTAAIPGKPAPRLITKEMVDSMKPGSVIVDLAALTGGNCEYTKAGEI FVTDNQVKVIGYTDFPSRLPTQSSQLYGTNLVNLLKLLAPNKDGQIDLNF EDVVIRGVTVIRNGEVTWPAPPIKVSAQPQQKAAAQKVEKKEEKPKDPRI 
KYGVMALAAILFLWLASVAPSAFLSHFTVFVLACVVGYYVVWNVSHALHT PLMAVTNAISGIIIVGAVLQIAQGSFFISVLAFIAILVASINIFGGFKVT QRMLAMFRKG >MS1223 pntB, PntB protein MSVGFVQAAYIVAAILFIMSLAGLSKHETAKAGCWYGIVGMTIALIATIF GPATEGQLWILVAMAIGAVLGIRKALKVEMTEMPELVAILHSFVGLAAVL VGFNSFGLHSAVAMPVNLDEAAQAAFLAEQAALDNIHNVEVFLGIFIGAV TFTGSVVAFGKLSGKINSKALMLPHRHKLNLAALVVSALLMISFLNSPEN IFPVLLMTAIALAFGWHLVASIGGADMPVVVSMLNSYSGWAAAAAGFMLN NDLLIVTGALVGSSGAILSYIMCKAMNRSFVSVIAGGFGNDPVVSSDEEQ GEHRETTAEETAELLKNASSVIITPGYGMAVAQAQYPVAELTQKLRDRGV NVRFGIHPVAGRLPGHMNVLLAEAKVPYDVVLEMDEINDDFEDTDVVLVI GANDTVNPAAMEDPNSPIAGMPVLEVWKAQNVIVFKRSMAVGYAGVQNPL FFKENTQMLFGDAKDRVNDIISALN >MS0193 pnuC, PnuC protein MSLSKSLKDEFFGGWTKFEAFWLILFLAIQIGLFIYQPDSWIATIAAITG IICVVFVGKGKISNYLFGFISVSLYAYTSYTFKLYGEMMLNLLVYVPVQF IGFFMWRKHMTNKNTLNTAGVEEVIAKALTAKQWVLVILAAGLVTYAYIE WLRHLGSALPALDGVTVGVSIVAQVLMILRYREQWSLWIIVNILTISLWV GMYLENGETSLPLLTMYIMYLCNSIYGYYNWTQLVKKHQAG >MS0225 polA, PolA protein MFLLYFEIVMAQIAQNPLVLVDGSSYLYRAFHAFPPLTNSLGEPTGAMFG VLNMLKSLITQVQPSHIAVVFDAKGKTFRDELFEQYKSHRPPMPDDLRKQ IQPLHDIIRALGIPLLSIEGVEADDVIGTLALQASSAGKKVLISTGDKDM AQLVDDNIMLINTMNNTLLDREGVIEKYGIPPELIIDYLALMGDSSDNIP GISGVGEKTALGLLQGIGSMAEIYANLDKVADLPIRGAKKLGEKLAAAKA DADLSYVLATIKTDVELDLNPEQLIIGTANKDELIEYFARYEFKRWLNEA LNDESSVTKPQEQAVKINNYQATPALAKQESAVKNSVKIDRTLYETVDNQ AKLQQWIEKIRQVKLVAVDTETNALDPMLAELVGISFALENGEACYIPLA HVHQVAAQAENAQGDLFAESEQSSESRWEPVVGQLNKAECLSQLKPLLEN PEIKKIGQNIKYDLTIFANNGINMQGVTFDTMLESYVLNSTGRHNMDDLA ERYLGHHTIAFEDIAGKGKNQLTFDQIELKKAAEYAAEDADVTMKLHQTL WREVAQSPELVKLYQEMELPLVSVLSRIERNGVLIDSRALLAQSKEFSQK LTALENKAHELAGQHFNLASTKQLQEILFDKLGLPVLKKTPKGAPSTNEE VLEELAYEHELPKLLVEHRGLSKLKSTYTDKLPQMVNRKTGRVHTSYHQA VTATGRLSSSDPNLQNIPVRNEEGRRIRQAFIARKGFKVIAADYSQIELR IMAHLSADKGLTAAFSEGKDIHRSTAAEIFGLALENVTAEQRRSAKAINF GLIYGMSSFGLSRQLGIARGDAQRYMDLYFQRYPGVQTFMTNIREKAKSQ GYVETLFGRRLYLPDIQSANAMRRKAAERVAINAPMQGTAADIIKRAMID IDKAITDDPDILMIMQVHDELVFEVKEDKIEHYSALIKSLMENAAQLHVP LIVDVGVGDNWDEAH >MS0583 potB, PotB protein MKGIKKDLKAWLLLCSGLGTILFLMGSTFYIVVTQSLGLYNISGEDSRFT 
LQYWHDVLTNSVFQSSYIYSVKVSLLGAILSIIVSYPIAMWLRNELPAKV TIITILRAPMLVPGLVAAFLFVNMISYHGILNETMVFLGIWHEPKTLQND EFGWGVVILQMWKNIPFALILIGGAVNSLKTDLLDAAANLGSTSWQRFRY VIFPLTLTAVQVSFILIFIGALGDFAFYSIAGPRSTYSLARLMQMSAYEF EEWNQSAVMAMMIMLTSAFFTILVSIIIKPLAVKRGDIK >MS0811 potB, PotB protein MKMTTRKFQNSTVAVIFAWLIFFMFVPNFLVLIVSFLSKDSSNFYALPFT FENYARLFEPLYGTVVWNSLYMSGIATVICLLIGYPFAFFMAKLNPKYRP ILLFLLVLPFWTNSLIRIYGMKVFLGVKGILNEFLLFTGIIDEPIRILNT EVAVIIGLVYLLLPFMILPLYSAIEKLDLRLLEAAKDLGANGIQRFIKII IPLTMPGIVSGCLLVLLPAMGMFYVADLLGGAKVLLVGNVIKSEFLISRN WPFGSAISIGLTILMALLIFVYYKANKLLNKKVELE >MS0581 potC, PotC protein MSSAKITTKNSKIIARISLTFFVLVNFIWLVLPFLMAGLWSLVDPKQPWS YPDILPPSLSLERWQMVWENTSLPEAMFNSYTIAPTVSLITISLSIPTAY AFGRMEFRGKKIAELLTLIPLVIPGMIIALFFSRMLLDLNISNPFVGIVI GHVVLTLPYAIRILSAGFSSVPQDLIEASRDLGASKFTVFKDVYMPMLKP SFLASIIFCLVKSIEEFAISFVIGSPDFITVPTILYSFLGYSFIRPNAAV VSIILLVPNIILMMIIEKLLKGNYLSQSTGKA >MS0810 potC, PotC protein MSRVLRNIFMLVVYAYLYIPIIILVGNSFNADRYGLSWKGFSFAWYERLA NNDTLIQAAVHSVTIAFFAATFATIIGSMTAIALYRYRFRGKQAVSGMLF VTMMSPDIVMAVSLLALFMIIGISLGFWSLLLAHITFCLPYVVVSVFSRL KGFDLRMLEAARDLGASEVTILRKIIFPLALPAIISGWLLSFTISMDDVV VSSFVSGVSYEILPLKIFSLVKTGVTPEVNALATIMIVLSLLLVLLGQII GKKDKS >MS2292 potD, PotD protein MLVTATAFFSTASFAAPKQLYIYNWTDYIPSDLISKFTRETGIKVNYSTF ESNEEMFSKLKLTINKPGYDLVFPSSYYISKMVKENMLTPINHSKLTNLK QIPSNLLNKDFDPANKFSLPYVYGLTGIGINTSFVNPDEVTGWGDLWKEK FKGKVLLTADSREVFHIALLLDGKSPNTQNEEEIRNAYQRLTKILPNVAA FNSDTPELPYIQGEVELGMIWNGSAYMAEKENPAIKFIYPKEGAIFWMDN YAIPKNARNIEGAHKFIDFMLRPEHAKIIIERMGFSMPNEGVKVLLKPED RVNPLLFPPEDEVKKGVFQADVGDATDIYEKYWNKLKTN >MS0809 potD, PotD protein MSWNYQGQIFYSLSTGANSMKKLAGLFAAGLIAVAVTGCNDKESKSADAN APETAKDNGTVYLYTWSEYVPDGLLDDFTKETGIKVIVSSLESNETLYAK LKTQGADGGYDIIAPSNYFVSKMAREGMLKELDHSKLPVIQELDPDWLNK SYDPNNKYSLPQLLGAPGIAYNTQTYKGSDFTSWGDLWKPEFAGKVQLLD DAREVFNIALLKLGKNPNTQDPAEIKAAYEELLKLRPNVLSFNSDNPANA FISGEVEVGQLWNGSVRIAKKEQPGSIDMIFPKEGPVLWVDNLAIPATSK NPDGAHKLINYLLGAKAAEKLTLAIGYPTANVEAKKVLPKEITEDPAIYP TAELLRTANWQEDVGEAVELYEKYYQELKAAK >MS0580 potD, PotD protein 
MRNILRKALSLTITALAVANFAQAENLTDKSWPDIEAQAKKEGKLTVSVW YLQPQFRVFVKEFEKQYGIQVKVPEGTLDGNINKLIAEKNLEKGKMDVVV LSADRVSNVTNNGVLANIKQLPNFGKLNHFLQGVDLGETAVGYWGNQTGF AYDPLRITEDQLPQSWQDVENYIQQNPKKFGYSDPNGGSSGNAFIQRALV YVNGEYDYMTPTVDAAQVANWKKTWEWFNARKNVMIRTASNADSLTRLND GELVLVSAWQDHLFSLQKQGAITTRLKFYVPQFGMPGGGNVATIAKNAPN PAASLVFIHWLTSPEVQQKLSQEFGVRPLDSESGKRDTLFFSTPWRKAEM EAFTKEVVSR >MS0552 potE, PotE protein MSNKKIGLLSLTALVLSSMIGSGIFSLPQNMAAVAGAEAISIGWLITGIG IIFLGLSFFFISRLRPELDGGIYTYAREGFGDLMGFMSAWGYWLCATIGI VGYLTVAFEGLGVFTDSENTVIFGQGNTVASFIGSSIIVWLVHALIAGGI KEAASVNLVATFVKVAPLVLFILLGFWFFDTDIFNSDVKASALNNNIGDQ VKDTMLITLWVFTGVEGASVLSAHAKKRTDVGLATVLGILIALALYVAIT ILALGILPRETIAEMPNPSMGPLLDAMMGPTGKVIITACLIVSVLASYIS WTMYSAEIPYRGAQKGAFPKILDKLNENSTPINSLWFTGFIVQFCLILVF VFEQSYNTLLLISTSMILIPYFLIGAYLFKLAIQTNSAWYIKLTGFMASI YGLWIVYAAGLQYLLLSVVLYVPGILLFLYSHRKFHGKFKLKGFEQTILA MIFILFCYAVYRLPELLAA >MS2221 ppa, Ppa protein MADFNQILTPGDVDAGIINVVNEIPEGSCHKIEWNRKVAAFQLDRVEPAI FAKPTNYGFIPQTLDEDGDELDVLLITRQPLATGVFLEAKVIGVMKFVDD GEVDDKIVCVPADDRDTGNAYNTLSDLPAQLIKQIEFHFNNYKALKKPGS TKVTHWGDVEEAKEVIRESIKRWNEQ >MS1017 ppc, Ppc protein MTEEYLMMRNNINMLGRFLGETIQEAQGDDILELIENIRVLSRNSRSGDD KARAALLDTLSTISADNIIPVARAFSQFLNLTNVAEQYQTMSRSHEDKVS AERSTAALFARLKEQHVSQEEIIKTVQKLLIEIVLTAHPTEVTRRSLMHK QVEINKCLAQLDHTDLTAEEQKNIEYKLLRLIAEAWHTNEIRTNRPTPLE EAKWGFAVIENSLWEGLPAFIRKLNDAAVEHLNYALPVDLTPVRFSSWMG GDRDGNPFVTAKITREALQLARWKAADLFLTDIQELCDELSMTQCTAEFR EKYGDHLEPYRVVVKDLRSKLKNTLDYYNDILAGRIPPFKQDEIISEDQQ LWQPLYDCYQSLTACGMRIIANGLLLDTLRRVRCFGVTLLRLDIRQESTR HSDAIGEITRYIGLGDYSQWTEDDKQAFLIRELSSRRPLIPHNWTPSEHT REILDTCKVIAKQPEGVISCYIISMARTASDVLAVHLLLKEAGISYHLPV VPLFETLDDLDASKEVMTQLFNVGWYRGVIKNRQMIMIGYSDSAKDAGMM AASWAQYRAQDALVKLCEQTGIELTLFHGRGGTVGRGGAPAHAALLSQPP RSLKNGLRVTEQGEMIRFKLGLPAIAAESLDLYASAILEANLLPPPEPKA SWCRVMDELAVASCEIYRNVVRGDKDFVPYFRSATPEQELAKLPLGSRPA KRNPNGGVESLRAIPWIFAWMQNRLMLPAWLGAGASIRQAMESGKAAVIE EMCNHWPFFNTRIGMLEMVFSKTDSWLSEYYDQRLVKKELWYLGESLRKQ LSEDIATVLRLSGKGDQLMSDLPWVAESIALRNVYTDPLNLLQVELLRRL 
RADPEHPNPDIEQALMITITGIAAGMRNTG >MS0624 ppiB, PpiB protein MKKFKFLTALFALFFVFNANAKNVTLHTNYGDIKIALNEKKAPVSSKNFL DYAQTGFYDNTIFHRVIDGFMIQGGGFEPGMNQKKTNEAIRNEANNGLKN LRGTIAMARTSAPHSATAQFFINLQDNDFLNFTEESQQGWGYAVFGKVTE GMNVVDKIAKVATGRVGMHRDVPKEDVVIKSVTVE >MS0623 ppiB, PpiB protein MRNQIPIHFYKENTMVTLHTNFGDIKIALNHEKAPETAANFEAYCKEGFY NNTIFHRVIDGFMIQGGGMEPGMREKNTKAPIKNEANNRLSNKRGTIAMA RTSDPHSATAQFFINVADNAFLDYRAKEMFGREVVQEWGYAVFGEVVEGM DVVDKIKGVKTGNAGFHQDVPKEDVVITSVTVE >MS0360 pppA, PppA protein MSNLYLVFFACLLGYLTYYYLSNFRTKLHKDIYYAFYQIFPQKQPHFTIE QADQAGQLSPLSLANQWIYIGVSIVLCSIIQTLTKDLTLTCFYMSYLVLL FIIGKLDWHYQLIEPALCQLLLLILSGASYFRLISNSLEDVVESAVISFV IFYLVYHISKLCYKKEVFGQGDYWLISALSAGLSWRDIPLMISLACLLAL FYALIYNRLLAHTKISLVPFAPFLCTANLVTLFIKMLI >MS0818 pqiA, PqiA protein MIFDQMPNAQRYIRKYGKSAVDFPAKFLLDVDFADKNRSIILSD >MS0819 pqiA, PqiA protein MPKIDRTFPFFLLLMSKYGIKYPPLKQKRDNMAKASSLSASQNASIVRCN DCNALVALSELKKSQQAECPRCHNVLKSQDRWRLRRCAIIAISILILMPF ALTYPLLSVDLLGITVDASVWGGVWKMATEGYPYTAFLVFICAVFLPVSF ALLVILLYLAKLTHQKPRNLLFALGYIKPWVMFDVYLVALGVSAFKVRDY ATIHIDIYLIAFVLTSLLTTLLFIKINPKELWNDFYPQNQHLIKISPENP PHFCRTCEYTFEHSAFDRKSHTICPRCHSRLDTPSYVNLQNTWATLIAGI IMLFPANIFPISYTIMNNVATGDTLMSGVITFIGMGSYFVAFVVFFASIF VPVSKVFIMIYLLLSIHFQWKHSIKWQMSLFHIVHFVGRWSMLDLFVLSL MMSLVTRGQIINFSVGPAALYFGIAVFLTMLSTTFFDTRLLWNIYDKQPS K >MS0817 pqiB, PqiB protein MASPFSLRCFQPHSLIPVYFGIFMTNNQANNRVKINAENNVQAAKIKQDK RISPFWLLPIIALCIGALLFFQIIKEQGETIRITFTTGDGLVANKTQVRY QGLQIGIVKKVNFTDDLKKVEVQASIYPEAKNVLRENTKFWLVQPSASLA GISGLDTLISGNYISLQPGDGNYKDDFIAEETGPIAQVSDGDLLIHLLAD DLGSISEGASVYYKKMPVGKIYDYRFTPDQKKVEIQVVIDKAYANLIKQD TRFWNISGINANVGPSGITVNMDSLNAIVQGAITFDSPDNSPKAKQDQQF TLYPTLQAAQRGIEVKITLQNQAGLKAGKTEVFYNNLQVGTLAKLDNEDI THAKISGTLLLDPNISNELRTNTNIILRTPKMNLATLEKLPDMLRGQFFE IIPGSGEPQREFQVYKESDLLLKQADTLVFTLTAPETYGIAEGQQIFYNN LPIGEIVKQTLNEQGVEYQAAIAGKYRHLIYGDSQFVAASNLDISLGIDG LRVEAASPDKWLQGGIRLIANKNKGSALSSYPIYKDLSSAEAGITSSTLT PTITLNAQNLPNIGKGSLVLYRQYEVGKVLDIRPLKNSFDVDVAIYPKYR HLLTKNSLFWVESASQVDITARGISIQTSPLGRVLKGAISFDNSGGNNNK 
TLYANELRAKSAGQVITLTADNATNLTKGMALRYMGLEVGQLESINLDQN KNQVVVKALMNPNYMNLVAKEGSEFRIISPQISAGGIENLDSLLQPYIDI DAGKGKYKTTFAIKNNNNTDNKYNNGFPIILEASDALNITTGSPIYYRGV EVGKINRMELNELGDRVLIHLLIANKYRHLVRKNSEFWISSGYSAGVGWS GIEVNTGTVQQLLKGGISFSTPSGTVIQPQAAANQRFLLQIKKPVEAKTW NSAVLPEQN >MS0821 prc, Prc protein MKLNKSKTYLATLVVSAVIGMSGNAFAVHPTLKASDIVIPQPTEENGLAT KRATTRLTQSHYRKFQLDDEFSHKIFARYLDFLDYSHNLFIKSDIDELQA KYAALLDDELNEGKLDIAFAMYDLMAKRRYERYEYALSLLDKEPSLKDDD QIEIDRKKAAWPASVEEANKLWAARVKNDIIDLKLKDKKWSEIKKTLTKR YNLAIRRLTQTNADDITQLFLNAFAREIDPHTSYLAPRTAKSFNESMNLS LEGIGATLQQEDDVTTIKSLVPGAPAERSKRIKAGDKIIGVGQAKGEIED VVGWRLEDVVDKIKGKKGSKVRLEIEPAKGGKSKIITLVRDKVRIEDSAA KLTVEKVNGQNVAVIKIPTFYIGLTADVRKLLEQMKAKKATSLIIDLREN GGGALTEAVELSGLFISDGPVVQVRDAYNRIRVHEDPDNAQVYTGPLLVM TNRFSASASEIFSAAMQDYNRGIIIGQDTFGKGTVQQSRSLNFVYDLDQE PLGFIQYTIQKFYRINGGSTQLKGVTADINFPAIIDTKENGEEKEDNALP WDKIPAATYSQVSHARDAVEVLKSKHLDRISKDPEFIALAEDLKIRDERS ERKYLSLNYEKRKAENDKDDARRLKALNERFAREGKKALKDINDLAKDYE APDFFLKEAEKMASDLAKFETDKESIQAKAMSLENKADTKDVKAESKKTA EVKTETVKSKEDVRPETK >MS1192 prfA, PrfA protein MKPSIISKLDSLNERYEELEALLGDASVISDQDKFRAYSKEYSQLEEVVK TFSRWKQLNSNIEEAELLLDDPEMKEMAQMEIEESKNELEEVEQHLQILL LPRDPNDEYNAYLEIRAGTGGDEAGIFAGDLFRMYSRYAEMKRWRVEVLS ENESEQGGYKEIIALVSGDNVYGQLKFESGGHRVQRVPKTESQGRIHTSA CTVAVMPELPESEMPEINPADLRIDTYRASGAGGQHINKTDSAVRITHIP TGMVVECQDERSQHKNKAKALAVLASRLVQAEQDKLAAEQATTRRNLLGS GDRSDKIRTYNYPQGRVTDHRINLTVYRLDEVMNGKIDELIQPIITEYQA DQLAALSDQP >MS1542 prfB, PrfB protein MEQPDVWNEPEKAQALGKERSALETVVNTIKKLDQGLEDVDGLLELAVEG EDEETFNEAVTELDELEQQLAKLEFRRMFSGEHDACDCYIDLQAGSGGTE AQDWTEMLLRMYLRWAESKGFKTELMEVSDGDVAGLKSATVKVSGEYAFG WLRTETGIHRLVRKSPFDSNNRRHTSFAAAFIYPEIDDDIDIEINPADLR IDVYRASGAGGQHVNRTESAVRITHIPSGIVVQCQNDRSQHKNKDQCMKQ LKAKLYEMELQKKNADKQALEDSKSDIGWGSQIRSYVLDDSRIKDLRTGV ENRNTQAVLDGDLDRFIEASLKAGL >MS0449 priA, PriA protein MILLMKFVRVALAVPLMRFFDYILPEQMQPVIGGRVLVPFGRQKRVAIVV EFAQETDIPKEQLKPVLNVLDDAGLFNDDMWNLLKWGAGYYQFSLGDVLF SALPVKLRNGESVVEKNKILWKLTALGEQAMVSGELKRAKKQLEALTELT 
KNPLEKGNNEFSAAIWSQLKEKRFVEEVTQPLQIIPWQIRLGGKEIMRAE QRLTLNKQQALALSRLLFHQGFAAWLLDGVTGSGKTEIYLQYIEEILKQD KQVLVLVPEISLTPQTVQRFQARFNVDIDVIHSNMNDSQRLLVWQRARTG QSAIVIGTRSALFTQFKRLGLIVIDEEHDNSFKQQDGGWRYHARDLAIVY AKQLDIPIVLGSATPSLESLNNVKNRKFKHIVLSHRAGAGSGLKHEVIDL KRQRIQHGLSDTLLRKMASHLEKGNQVMLFLNRRGFAPVMLCHECGWIAT CTQCDKPYTYHQHQRVMKCHHCEIQKPVPMQCGACGSTHLVTTGIGTEQL EFVLQQQFPQYEVTRLDRDSTVRKGALENHLSAIKQGKSRILIGTQILAK GHHFPDVTLVALVNVDSALFSLDFRAEERLAQLYVQVSGRAGRAEKQGEV VLQTHYPDHPLLQQLLHDGYHAFANSALQLRRQMGLPPFSAQALFRAQSK SSEEAEQLLQQIASYFYDWKNRQNMPDLQLLGPMPAPFSKKAGRFRWQLL LQHPSKSVLQHALGQFNFENEVKSSQARWILDVDPQDLS >MS0470 priB, PriB protein MKTTILRMLKSNLSINNRLSLEGFVTEQPKRTKSPNGIEHCRIWLEHRSE QIEAGLKRQAWCKMPVHISGTQLVQKTQSITVGSHLLVVGFLTLHKTSKG LSQLVLHAEHIEYL >MS0533 prmA, PrmA protein MAWVQIRLNSTNEKAETISDYLEEIGSVSVTFMDSQDTPIFEPLPGETRL WGNTDVIALFDAETDMQQIVRLLRQEGHLDENTAYKIEQIEDKDWEREWM DNFHPMQFGKRLWICPSWREVPDPNAVNVMLDPGLAFGTGTHPTTALCLE WLDGLDLAGKTVIDFGCGSGILAIAALKLGAKEAIGIDIDPQAILASRNN AEQNGVADRLKLYLSEDKPANMKAEVVVANILAGPLKELYPVISELVKEK GNLGLSGILATQAESVCEAYQAKFDLDAVVEREEWCRITGKLK >MS1604 proA, ProA protein MTDLIQMGKQAKQAAFALSQLSQQEKNHALALIAERLEAQQERILAENAK DIQAARENGLSESIIDRLLLTKERLTGIADDVRHVISLADPVGKIIDGGV LDSGLKLERIRTPLGVIGTIYEARPNVTIDVASLCLKTGNAVILRGGKET QHSNKILVEVIQNALQQAGLPEMAVQAITDPDRALVMELLHLDKYVDMII PRGGAGLQALCRDNSSIPVIVGGIGVCHIFVEQSADQDRSLAVIENAKTQ RPSTCNTVETLLVQESIAEEFLPKLARRLKTKEVKFHADSTALSILQGVS ADVKPVTEQQLRNEWLTYDLNVVIVKGIEEAVEHIREYGSEHSESILTES QKLANQFVAQVDAAAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMG LEALTTYKWVCVGDYTSRA >MS1862 proB, ProB protein MKFGTSTLTQGTPKLNRAHMIEIVRQLAQLHQEGYRLVIVTSGAMAAGRH YLNHPKLPPTIASKQLLAAVGQSQLIQTWEQLFAIYDIHIGQLLLTRADI EDRERFLNARDTLHALLDNRIIPVVNENDAVATAEIKVGDNDNLSALVAI LVQAEQLYLLTDQQGLFDSDPRKNPQAKLIPVVNEITDHIRSIAGGSGTT LGTGGMSTKITAADIATRSGIETIIAPGNRENVIADLAHGEAIGTKFTVQ TDKLESRKQWLFAAPSAGILTIDQGAENAILEQHKSLLPAGIVNIEGRFS RGEVVKIRTQQGKDIALGMPRYNSDALYLIQGKKSQNIEKILGYEYGSVA IHRDDMIVLNK >MS1799 proC, ProC protein MKNKLLTFIGGGNMAQAIVFGLLNKGYSAAKLIVCDRNEAKRNLFAQKGV 
EVNLTNVEAAEKAEVVVLAVKPQAMAETCGPLSAVDFSGKLVISIAAAVS VSRLSALLPTAKNIVRVMPNTPALVSEGMAGLFASAGLNGEYQDFAEDLL NAVGKTCWLQKEEDMHAVTAGSGSSPAYFFLFMEAMEKTLSSMGISPENA RTLVQQSALGAAKMVENNPQLPLSTLRENVTSKGGTTAAALAVFNQYQLD KIVQQAMEACVARSQEMEKLF >MS1798 proP, ProP protein MNVRPFTWLALSYFGYYCAYGVLVPFLPVWLKSQNYGTELIGAVIASSYL FRFLGGIFFPSRVKRANQILPALRLLAWANVFVITAMAFVSESFWLIFIA IAVFSMVNAAGMPLTDSMATTWQRQIRLDYGKARLIGSAAFVVGVTVFGS LIGAIGEQYIISILIGLFGLYAVLQMVPPQPKPADEDKNSAKSAVGFGEL LKNPTHLRLIIAAMLIQGSHAGYYVYSVIYWTNRGIAVETTSLLWGLGVI AEILLFFFSGRLFRNWSVNAIFYLSAAAAALRWGAFSYTDALWQIALLQC LHSLTFAALHYAMVRYIGMQPQNAMVRLQSLYSGLASCASVALLTALAGI IYPISSHWVFLVMMICALIALFVIPRKPTNA >MS0191 proP, ProP protein MSNKVNSYGWKALMGSAVGYAMDGFDLLILGFMLSAISADLSLSPTQAGS LVTWTLIGAVAGGIIFGALSDKYGRVRVLTWTIVLFAVFTGLCAFAQGYW DLLIYRTIAGIGLGGEFGIGMALAAEAWPARHRAKASSYVALGWQVGVLA AALLTPLLLPIIGWRGMFLVGIFPAFVAWYLRAKLHEPEVFVQKQAEVAT GKRQSPFKLLIKDVATAKVSLGVVVLTSVQNFGYYGIMIWLPNFLSKQLG FSLTKSGVWTAVTVCGMMAGIWIFGRLADRIGRKPSFLLFQIGAVISIIA YSQLTDPAIMLFAGAALGMFVNGMMGGYGALMSEAYPTEARATAQNVLFN LGRAVGGFGPVIVGAVVSAYSFKIAIALLAVIYVIDMIATVFLIPELKGK ALK >MS2054 proP, ProP protein MASGEANYRSLAWIAASALFMQSLDATILNTALPTIAADLHHSPLEMQLA VISYALTVALFIPISGWVADKYGTLRVFRFAVGMFALGSLACAMSSSLIM LIFSRVLQGFGGALMMPVARLSIIRSVPKQELLPVWNLMATAGLTGPILG PILGGWIVTYTSWHWIFLINIPMSLLGIWLANRYMPNVTGSLQKLDWAGF FFLGGGLVGVTLGFDLISEEFIAKWQATVIVILGVILIITYCFHAQKRER LALLPLSLFKIRTFRVGIMANMLIRLCASGIPFLLPLMYQVVFHYSADKA GMLIAPIALSSMLVKPLCGRILTKLGYRTALISASIVLTLSIAVMSFLHI DSPVWILIVNVALYGGCISIVFTAVNTLTISELSDQDASAGSTFLSVVQQ VGIGLGIAVSALILSLYRYFIGESAVQLQQAFGYTYLTSASFGVLLVLVL SGLKKEDGAHLHK >MS2374 proP, ProP protein MSGEKTSRYVLGVTLVATLGGLLFGYDTAVISGTVSSLDTVFIQPKGLPE ISANSLLGFCVASALIGCIIGGACGGYLSSKYGRKKALLIAALLFLISAF GSAYPEFGLKTINETNNIPYYLSNFLIQFVIYRIIGGIGVGIASMVSPMY IAEITPARIRGKMVSFNQFAIIAGQLIVYFVNYFIALNGDNTWLNMLGWR YMFLSEMVPAALFLILLFFVPESPRWLVLQNKFSQAEITLLKLLGERSGK TELQNIVSSLEHRVVKGAPLFSFGLGVIVIGIALSVFQQFVGINVALYYA PEIFKSLGASTNNALLQTIIMGTINLSCTTIAIFTVDKYGRKPLQIIGAL 
GMAMGMFVLGMAFYANLSGTIALTGMLFYVAAFAISWGPVCWVLLAEIFP NAIRSQALAIAVAAQWIANYIVSWTFPMMDKSSYLVERFNHGFAYWVYGL MAILAALFMWKFVPETKGKTLEELELLWNKK >MS0499 proP, ProP protein MNLREHIDNNPMSAYQWTVVIIAAIMNLLDGFDVLALAFTATAIRGDLGL SGAELGYLFSAGLLGMAAGSLFLAPLADKIGRRPLLLISVTLSALGMLGS AYSASYGALGFWRLITGLGVGGILVGTNVLTSEYSSRKWRSLAISIYASG FGIGAVLGGMFAVVLQEEYSWHAVFLAGFILTAVCLIVLLIWLPESIDFL MTQQPRNAQIRLNKITKKMGLKGQWTLPEKVLASASKLPLTQLFNKNYRK STALIWIAFFAIMFCYYFVSSWTPALLKEAGMTTEQSVSVGMMVSLGGTC GSLLYGLLASRWKAKQMLVQFTVLSAFSVIIFILSSSILWLAMLFGILVG GFMNGCISGLYTLNPSIYAANIRSTGVGWSIGVGRIGAILAPLAAGVLLD YGWDKQSLYIGVGFVLLIAAIALSLLRIKTTLVKC >MS1178 proP, ProP protein MPNKAETSPAKLRLKAFLKRIKIMNTTENSKQKPVNVVAFAFLLTAFLTG IASSFQTPTLSLFLAQEIQVSPFMVGMFYTSNAVLGIVLSQILAKYSDSQ DDRRKIIIFCSLLAIGGCITFAYNRNYYVLMFFATFLLSLGSSANPQAFA LAREYADYTKREAIMFTTIMRTQISLAWIVGPPLSFSIALGWGFEYMYMV AASAFLLCAIIAKALLPYVPRKAVVPLTKPDEVAGLPAKNKKQSDKQSIR LLFITCFLMWSCNGMYLISMPLHVINELHLSERLAGILMGTAAGLEIPVM LIAGYLTKYLTKKSLILTALFMGLFFYIGMLFAEQTWQLVALQAFNAIFI GIIATLGMVYFQDLMPGKMGSATTLFSNAAKSSWIVAGPFVGIIAQIWNY SSVFYISIVLVAVSLFSMSKVKSV >MS0785 proP, ProP protein MQNKFAVYLAAIGHLVTDMAQGALPALLPLFIKNYGLTYQEAGGLIFANT VLASIAQPFFGYLADKRSMPWLIPLGMMLSGCCIAAMGFVHSYPGLFFFA MIAGIGSALFHPEGARLVNRMSGGEKGKAMGIFAVGGNAGFAIGPMFAGL AYLFGAQTLSIFALINTIIALIIFLQLPKLTVENVVNKAKNTASTTLQND WRSFAKLSVIIFVRATNFTVLNAFIPIYWIHILHQQETDANFALTIFLSM GVAITFIGGLLSDRLGYVRIIRYAYLIFLPTILIFTQSENLWLSFILLIP LGLGVFTQYSPIVVLGQTYLAKSVGFAAGITLGLGITMGGIFSPIVGWIA DHYGLQIALQTLSVLSLLGLIFSYRLKITDTEKPEKK >MS0797 proP, ProP protein MSQNHFFSHIFNRNMLICIFTGFSSGLPLYILTSLIPTWLRSTEIDLKTI GFFTLTSLPFIWKFLWSPFLDRFVPPFLGRRRGWMLIFQLLLLISLGLFG FIDPHTNQGLSLLIGLATMVSFFSASQDIVLDAYRREILSDQELGMGNSI HVSAYRIAGLVPGSLSLILSDHFSWQAVFIITALFMLPGLLMTLFISHEP QIELKSNRTLAENIVEPFKEFFQRKGLWGAIGILTFIFLYKFGDSMATAL ISAFYLDMGFTKTQIGLVVKNASLWPMIIAGIIGGMITLKIGINKALWLF GLVQIVTILGFAWLAQLGPFEKVDSFAIFALTVVVMAEYVGIGLGTSAFV AFMARATNPVYTATQLALFTSLSALPRAVFNSFSGVLIENMGYYHYFWLC FFLAIPGMLCLIWVAPWKEK >MS1530 proP, ProP protein 
MNTETKQPALIVPRLSLMMFMEFFIWGSWSVTLGIVMTKYDLSTLIGDAF SMGPIASIISPFILGMLVDRFFPSEKVLAVLHLIGAAILWFIPEFITGQQ GGTLVFALLAYMLCYMPTVALTNNIAFHSLADSEKSFPVIRVFGTIGWIV AGLFIGQADLSASPAIFQVAAICSLILGLYSFTLPNTPPPAKGKPFSMRD LMCADAIALFKIPHFLVFAICATLISIPLGTYYAYAAPFLDAVGFEKIGS LMSMGQMSEIVFMLLIPFFFKRLGVKYMLLAGMLAWFLRYAFFALGVSEE IRWAVYLGILLHGICYDFFFVVGFMYTDKVADEKIRGQAQSLVVLFTYGL GMLLGSQISGGLYNNMFADNTDVSTWSTFWWIPAISAVVISVIFFIFFNY KEDKREA >MS0807 proP, ProP protein MLMTSQNKINAVPSNQNFYLNNRNYWIFSGYFFVYFFIMATCYPFLGIWL GDINGLSGEDRGTVFAMMSFFALCFQPVFGYVSDKLGLKKHLLWVLGISL LIYAPFFIYIFAPLLKVNVWLGSLVGGAYIGFVFQAGAPASEAYIERVSR RSKFEYGRVRMFGMFGWAICASIAGVLYATNPNLVFWLGSIASLILLLLI ALAKPEQTSTVQIAEKLGANKNPVNLRQAFALLKLPKFWALLAYVMGIAC VYDIFDQQFGNFFNTFFESHEQGIKMFGYVTTAGELLNALIMFFVPLIIN RIGAKNALLIAGTIMSVRIIGSSYAIEAWHVVVLKTLHMFEVPFYLVGLF KYIANVFEVHFSATIYLVACHFAKQIGNMLVSPLVGAWYDTYGFQDTYLI LGCIAAGFTLLSVFTLTGKSLSSQS >MS1407 proP, ProP protein MMTSSRPNLTLLLILGALMACTSLSTDIYLPAMPTMAKELQGNTELTITG FLIGFAIAQLIWGPISDRIGRKIPLFIGMALFAVGSVGCALSQSMAEIVF WRVFQAVGACVGPMLSRAMIRDLYDRSQAAQMLSTLTIIMAAAPIIGPLL GGLLLKISSWQAIFWLLVVIGILLFLSIIKLPETLPPAKRAAGSFWSAFG NYRILLKNRAFMRYTLCVTFFYVAAYAFITGSPFVYIDYFKVDPQYYGFL FGVNIVGVALLSAVNRRLVRHYPLESLLRVSTMIALCAVLILVVLVFMDL DGIAGILSVAVPIFIMFSMNGIIAACTNAMALDSVQPEIAGSAAALLGSL QYGSGILSSLLLAYFSDGTPHTMAWIIALFVGLCAVIGWGQRPRSA >MS0392 proP, ProP protein MSTAKKRNFIFIATLGILSMLPPLGVDMYLPSFLNIARDLQVDPERVQYT LTFFTFGMAAGQLFWGPVGDSYGRKPIILLGVIIGAVAAFFLTGVNSIEN FTALRFIQGFFGSAPVVLVGALLRDLFDKNELSKTMSMITLVFMIAPLVA PIIGGYLVLFFHWHSIFYVICAMGILSAILVFFIIPETHHQDNRIPLRLN VVVRNFVTLWRRKEVLGYMFSSGLGFGGLFAFLTAGSIVYIGLYGVPVDQ FGYFFMLNIGVMTLGSVINGRVVHRVGAERMLQIGLTVQLIAGIWLLIVA CFDLGFWPMALGIAVFVGQNSLISSNAMASILEKFPTMAGTANSVAGSVR FGLGATVGSLVALMKMDSAAPMLFTMGICVIVAVCCYYFLTYRSL >MS0820 proQ, ProQ protein MLGIIFSYITCCVQDRKQMTEIQTDVQAESQKLTNNKEIIAYLAEKFPLC FSVEGEAKPLKIGLFQDLAEALKDDERVSKTQLRHALRQYTSNWRYLHGC RLGAERVDLQGNPAGVLEQEHVEHAQQQLAEAKAKFAEKRAAEKAANTKQ VKKRPARKPSDKAMKATRKPANDKVRKAKVELKEIDFATLQKGSQVKVKV 
GDSAKQAVVLDVIKDNARVELDNGLVLTVATDRLFA >MS2063 proS, ProS protein MVNYRQFFNLKNNRNPIIMRTSKYLFSTLKETPNDAQVVSHQLMLRAGMI RPMASGLYNWLPSGIRVLEKVKNIIREEMNKSGAIEVLMPVVQPAELWQE SGRWEQYGLELLRFNDRGNRDFVLGPTHEEVITDLVRREVSSYKQLPLNL YQIQTKFRDEVRPRFGVMRSREFIMKDAYSFHTTKESLQATYDVMYQTYS NIFTRLGLDFRAVQADTGSIGGSASHEFQVLASSGEDDVVFSTESDYAAN IELAEAIAVGERAQPGAAMQLVDTPNAKTIAELVEQFNLPIEKTVKTLIV KGATEEQPLVALIIRGDHDLNEIKAEKLPEVASPFEFADEADIKAKIGAG VGSLGPVNLNIPVIIDRSVALMSDFGAGANIDGKHYFNINWERDVALPKI ADLRNVVEGDPSPDGKGTLLIKRGIEVGHIFQLGQKYSEAMNATVQGEDG KPLVMTMGCYGIGVTRVVASAIEQHHDERGIIWPSDAIAPFTVAIVPMNM HKSESVQAYAEELYQTLLAQGVEVIFDDRKERPGVMFADMELIGVPHMVI IGEKNLENGEIEYKNRRTSEKQMIAKDQLLDFLKANVNV >MS0549 proV, ProV protein MTTSVKISVKNLTKIFGSHPKSAFKLLQNGKTKEQIFAETGSTVAVNNVS LDIMAGEIFVIMGLSGSGKSTLIRLLNRLIEPSAGHVFIGDDDIAEMSEK ALRAVRRKRISMVFQSFALMPHMTILENVAFGLELSGVNSKNRRRMALET LARVGLEAYADVYPGELSGGMQQRVGLARALANDPEILLMDEAFSALDPL IRTEMQDELLRLQENSERTIVFISHDLDEAMRIGNRIAIMQDGQVIQVGR PDEILQNPANDYIRSFIQGVNVSNVLSAKDIASKRHLLNIVQKSEDETPH VAFKLLEQHERDFAVVLDRYGYYKGMVSVDSLQQARSNRQSLSQSFIEIT PLSPEQSISDIINDVATTREPLPVVDDKGHYYGVVTKVKVLQTLDRGTEA >MS0550 proW, ProW protein MTTENIRTADPWEATLQAAQQDNAYAWLQGSEQSQDFNWMYPFDHTLVPF GDWVESLINWLVTHLRSFFQFISAPIDYILSLFQTSLNVLPPTVMIILFT LLVWQFTHFRLALATLLSITLIGAVGAWNEMMITLALVLTSVSFCLLIGL PLGIWMARSTRASAIVKPVLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVT IIFALPPIVRLTILGIQQVPEALIEAAQAFGASKKQLLYKVQLPLAMPSI MAGVNQTLMLSLSMVVIASMIAVGGLGQMVLRGIGRLDMGEAATGGLGIV LMAIVLDRLTQKIAENMHSQHKVRWYERGITGLFIRKK >MS0551 proX, ProX protein MAYPMKLTILFSLALFASNAVRADDKAIQPLQSPLAEETFQTLIVVKALE ELGYRVNPPKEVDYNVAFTSIANGDATFMAVHWLPLQADKYANAGGDRKL YRQGTFVEGAVQGYMIDKKTADTYNITNLAQLKDPKLAKLFDTNGNGKAD LIGCSPGWSCEYTVSQHIDGYGLSRTVEVTQGNYSALIANTIAQYQNGKS ILYYTWTPYWVSGVLVPGKDVVWLQVPNRPDPGKTVADTNLANGKNYGFT VSSMHIVANKTFTDAHPDAARLFAVMRLPAGDISAQNMAMRNGQNSSQDI ERHAEAWIKFHRVQFDEWIKQAKSAKN >MS1536 prsA, PrsA protein MPDIKLFAGNATPELAKRISERLYISLGDATVGRFSDGEIQVQINENVRG SDVFIIQSTCAPTNDNLMELIVMVDALRRASAGRITAVIPYFGYARQDRR 
VRSARVPITAKVVADFLSSVGVDRVLTCDLHAEQIQGFFDVPVDNVFGSP VLINDILKKTDLENPIVVSPDIGGVVRARAVAKLLNDTDMAIIDKRRPKA NVSQVMHIIGDVSDRDCILVDDMIDTGGTLVKAAEALKERGAKRVFAYAT HAVFSGSAAQNIANPALDEVVVTDTIPLSAEIKALGKVRSLTLSAMLAEA IRRISNEESISAMFDA >MS1864 psd, Psd protein MLGEYNIMNSLEKKQITYGQRLKIAFQYAMPQIYLTQIAGWFANKRWGAV THFVIKMFAKKYNVHMAEAAKPNFSDYATFNEFFIRQLKEYARPINQNTD ALCLPADGKISQCGHIDDELLLQAKGHSFSLRDLLAGDEELTRLFKDGEF VTTYLSPRDYHRVHMPCNGTIRKMIYVPGELFSVNPFLNTHIPNLLARNE RVICLFDTDFGPMVQILVGATITASISTVWEGVINPPRTGDIRTWTYEGQ SAVSLAKGQEMGAFQLGSTVINLFPKNAVKLADYLQVDTVTRVGEILAYK K >MS2183 pspE, PspE protein MFKEITPQQAWQLMIEENATLVDIRDEQRFTYSHAKGAFHLTGQSYGKFQ IQCDFDDPVIVSCYHGISSRNVAAFLVEQGYDNIYSIIGGFEGWQRAGLP IETAY >MS2248 pspE, PspE protein MFPVVSYLSIILMIKEFMMNITKITARQLQEKLAQGALLIDIRDADEYSH ECIEQAVSQPLTGLKPEICNNSPCVIFHCQSGMRTQANMALLAKASAAAA EVYILDGGLNAWKKAGFATVVNKAQPLPLMRQVQIAAGSLVLLGVILGYS VSPWCFLLSGFVGAGLIFAGVSGFCGMAVLLSKCPWNK >MS2215 pspE, PspE protein MEEFMPMATEFAKNHTLLIAAWVAIFVIVIFQLVKSFTSKVKILSNAEAT SLINNEDAVVIDLRSIDEFKRGHIAGSLEFIPTDIKNRNLGKLEQHKDRH VILVCANGFTARSSAQLLTKQGFAHVYVLNEGIMGWKSQNLPLVK >MS0998 pta, Pta protein MSRTIILIPISAGVGLTSVSLGLIRALEQKGTKIGFMKPISQPRSGEDML DRTTSIVRTSTTIETTEPVMLSEAENLIGQNQTDVLLEKIVAQHQQISKD NDIVIVEGLIPSRKNSYANSVNYDIAQALDAEIILVSAPATETPAQLKER VEAAAASFGGKSNPNLLGVVINKFNAPVDESGRTRPDLTEIFDSFQHSHN NIKEIYKLFENSPIKVLACIPWSADLIATRAIDLVKHLGASILNEGDMNR RIRSITFCARTLPNMIEHFKAGSLLVVSADRPEILTAAALAATTGIELGG ILLTGGYKIDCEIKKLCNPTFENTKLPVFRIEGNTWQTALSLQSFNLEVP VDDKERIENIKQYTSGQFDADFIHSLASASVRARRLSPPAFRYQLTELAR AAKKRIVLPEGDEPRTIKAAVLCAERGIAECVLLAKPEDVKRVADSQGVK LGNGITVIDPASVRENYVARLVELRKAKGMTEMAAREQLEDTVVLGTMML EAGEVDGLVSGAVHTTANTIRPPMQIIKTAPGSSIISSIFFMLLPDQVLV YGDCAVNPDPTAEQLAEIAIQSAESAKSFGIDPRVAMISYSTGTSGSGAD VEKVKEATRIAQEKRPDLLIDGPLQYDAAVMEDVARSKAPNSKVAGKATV FVFPDLNTGNTTYKAVQRSADLVSIGPMLQGMRKPVNDLSRGALVDDIVY TIALTAIQATQC >MS0556 pth, Pth protein MSEIKLIVGLGNPGDKYADTRHNAGEWLVERLARRFNFNLKDEAKFFGKT ARAVIGGEEVRFLIPTTFMNLSGKAVGALATFYRIKPEEILVIHDELDLP 
PGVAKIKQGGGHGGHNGLKDTIAQLANNKNFYRLRIGIGHPGDKNLVSAY VLSKPSPIDRSAIDKALDEAASCMEILLKDGITKATNRLNGFKA >MS1509 ptsA, PtsA protein MISGIPASPGIVFGKALVLKEEKIVLDMQKIAEDQVETEVARFYEGRTAA VEQLSAIRDRAEKTLGEEKAAIFEGHLMILEDEELEEEIIDYLRSNKVNA GVAASKIIDQQVAMLADIDDEYLKERAGDIRDIGNRLIKNILGMKIVDLG EINEESILVAYDLTPSETAQLNLDKVLGFITDIGGRTSHTSIMARSLELP AIVGTNNATAMINSGDYLVLDAINNAVYVNPAQDVIDGLKAQQAKLAEEK AELAKLKDLPAVTLDGHRVEVVANIGTIRDCEGADRNGAEGVGLYRTEFL FMDRDQLPSEEEQFIAYKEVVEAMNGRQVVLRTMDIGGDKELPYMNLPKE MNPFLGWRAVRIALDRREILNAQLRAVLRASAFGKLAVMFPMIISVEEIR ELKSVIETLKQELRTEGKAFDENIQIGVMCETPSAAVNAKFLAKEVDFFS IGTNDLTQYTLAVDRGNEMISHLYNPMSPSVLSLIKQVIDASHTEGKWTG MCGELAGDEKATILLLGMGLDEFSMSAISVPRIKKLVRSVNFAEAKALAD KALQLPTAAEIEKLVADFLAEKTLN >MS0784 ptsG, PtsG protein MLVLARIGENFCLIYKRGVAMNYPKIAQQVIEKLGGKENIANLAHCATRL RLTMNDESKIDKQAIEDIEGVKGQFSTSGQYQIIFGSGTVNKVYAEMNTI MNGSPSADSTGESQQAKGPQQGLIQRLIKGLADIFVPIIPAIVAGGLLMG INNVFTAKDLFEEGRTLLDLYPQYKDLADLINTFANAPFVFLPVLLGFSA TRKFGGNPFLGATLGMLLVHPALTNAYGYAEALAGGNLQLWNIFGLEIEK VGYQGTVIPVLIAAWVLATLEKFLVKVVPSVLNNLVTPLFSLFITGFLAF TVIGPFGREAGEFLSQGLTWLYDTLGFIGGGVFGALYAPIVITGMHQTFI AIETQLLASTAATFIFPIAAMSNIAQGAACLAVAVLNKDAKTRGLALPSG ISALLGITEPAMFGVNLRFRYPFYAAMLGAGSAAAFIAFFNVKATALGAA GLIGIASIRAGDWGMYSVGMVISFCVAFAAALVLGARANAKE >MS1237 ptsG, PtsG protein MAKINKVDPKNVDKLIVAVGGRENIATVTHCITRLRFVLNDESKVDAKTI EELPMVKANFATGGQYQVVIGQEVGDYYQVLLEKTGLASVDKEQVKAAAR KNQKWYESLISHMADIFIPLLPALISGGLILGFRNVIGDIKMFEEGTKTL VDISAFWASMHSFLWLIGEAIFFFLPVGICWSIARKMGTSPILGITLGVT LVSSQLMNSYALGSQIPEVWDFGLFSIEKVGYQAQVIPAIMAGLTLSYIE RFLNKIVPDFLNLIIVPVTSLILVVFLAHSIIGPIGREIGNGVAFVVKAA MTGEFAPIGAALFGFLYAPLVVTGVHQTSLAIDMQMIQSIGGTPVWPLIA LSNIAQASAVVAVLIMAKKASVREVAVPAALSAYLGVTEPAMYGINLKYR FPMLCAMTGSACAALVCGFAGVLASSIGVGGLPGILSIQHQFWGTFAIAM LVAIIVPILLTMAIYKRKEAAGTLE >MS1717 ptsN, PtsN protein MVKFTEILSPENIRQGIICSSKKRLLEVISDIVTKRFNLQEEEIGYHIEQ LECFETLLSREKLGCTSLGNGIAMPRAKLPIGDKPVAVFLQLASPVNYEA PDKRDVDLVLAILIPEKCCAAYSPYLPELAERFSDKMLCKQLRAAQSADE IWQIFQYMDNCLHEHTDDTATEEK >MS2180 ptsN, PtsN protein 
MFNLPENNIHLSAQAGNKEQAIELAAKALEQAGYVESGYLQGMLGRELQT STFLGNGIAIPHGTLETRNMVKNTGVQIFQFPQGIEWGDGNIAYVVIGIA ARSDEHLALLRQLTHVLGDEDTAAKLATLQDAKKFRAILMGEDDEFAVKT ENISLDVDTQSLLTLVAINAGKLEQQSAVENSFVSDVIASPALPLGNGLW VTDSPLGNLKNALAFSRAKNAFSVNGKNVQGVVTVSAKDDAVNETLARLL SEQVQQTLLAGNAEKIIAALNGIQAEQAVTAEQVTTQAVPAAGTVIGTFT LRNENGLHARPCANLVNLVKKFDAKITVENITRGTAAVSAKSLMKVVALG VTQGHRLRFVAEGAQAQQAIEAIAKEIAAGLGEPVSAVPPAEPDTIEVAN PATPEVEQPKSDSIEAVFVINNENGLHARPAATLVNEVKKYNASVAVRNL NRDGGLVSAKSMMKIVALGATKGSRLHFVATGEEAQQAIDGIGAAIAAGL GE >MS0021 ptsN, PtsN protein MLKQSLIDNNSIKLHQKAANWQEAIKIAIDLLVKSGAVEARYYDRIVECI KEMGPYIILAPGLAMPHARPEDGVIRTAFSLVTFDTPIHFEGEDDPIRMM VALAGSDSDKHMEGLMEITQILEDEDSETGVNIQKFLDCNTEAEVFAVID AALSE >MS0149 ptsN, PtsN protein MITSKQKRSFMLKQFLPLSHIQYVDSVENWQQAVQLSAAPLLAEQLIEPR YVERIFQIHREIGPYYVIAPQIAMPHSRPEDGSNAQALSLVVLKQGVNFG SDNDPVQLVLMLAAKDSESHLEMLSAVAELFSDEEAVQQIIQSTTVSEIA EIVHRY >MS1621 purA, PurA protein MGKSVVVLGAQWGDEGKGKIVDLLTDRAKYVVRYQGGHNAGHTLIINGEK TVLRLIPSGILRANVTCLIGNGVVLSPSALMQEMGELESRGINVRERLLI SEACPLILPYHVAMDHAREAALGKKAIGTTGRGIGPAYEDKVARRGLRVG DLFDKEQFAEKLKNILDYYNFQLVNYYKVEAVDYQKTLDDVMAVADIITG MVADIGAILNTARKNGDNILFEGAQGAMLDIDHGTYPYVTSSNTTAGGVA TGSGLGPRNIDYVLGIIKAYCTRVGGGPFTTELFDEVGQEIARKGNEFGA VTGRPRRCGWFDAVAIRRAIQVNSITGFCMTKLDVLDGFDEVKICVGYKL PNGEVVDYAPLAAKDWEGVEPVYETMPGWKENTFRVTDVDQLPVNCLNYI KRIEEVTGVPVAILSTGPDRVETMILQDPFTA >MS0297 purB, PurB protein MQLSALTALSPIDGRYQDKTTALRGIFSEFGLLKFRVTVEVRWLQKLAAT AQINEVSSLSQEANDYLNQIVTNFAIEDAERIKEIERTTNHDVKAVEYFL KEKSAALPELAAVSEFIHFACTSEDINNLSHALMLKTAREEVILPEWQKL IDEITRLANEYKEIPLLSRTHGQPASPSTVGKEMANVAYRLRRQYKQLEQ IEVLGKINGAVGNYNAHLSAYPEINWHQFSEEFVTSLGVNWNPYTTQIEP HDYIAEFFDCVARFNTVIIDFDRDLWGYIALNHFKQRTIAGEIGSSTMPH KVNPIDFENSEGNLGLANAVMSHLAQKLPVSRWQRDLTDSTVLRNLGVGL GYCLIAYAATRKGISKLEVNEQHLRDELNQNWEVLAEPIQTVMRRYGIEK PYEKLKELTRGKRVDEQAMHDFIEKLDIPAEEKARLQQLTPATYIGAAVQ LVEKL >MS1481 purC, PurC protein MAELSLKKIYSGKVRDLYEIDDKRMLMVASDRLSAFDVILEDPIPRKGEI LTQISNFWFKKLAHIMPNHFTGDTVYDVLPKEEADLVKNRAVVVKRLKPI 
KIESIVRGYLTGSGLKDYKQTGTICGLQLPQGLVEASKLPEPIFTPSSKE EVGDHDINISYAECERQIGKELAAQVRDAAIALYKEAAAYALTKGIIICD TKFEFGLDENGTLTLMDEVLTPDSSRFWSVDTYREGTNPPSFDKQFVRDW LEQSGWNKQPPAPKVPADVIQKTVDKYQEALDLLTK >MS1296 purD, PurD protein MNILIIGNGGREHALAWKAAQSPLASKVFVAPGNAGTARESAVENVDISA TDVPALVKFAQDNNVGLTIVGPEAPLVVGVVDAFEQAGLTIFGPCQSAAQ LEGSKAFTKDFLARHNIPTAEYQNFTEVEPALAYLREKGAPIVIKADGLA AGKGVIVAMTLAEAEAAVKDMLSGNAFGEAGSRVVIEEFLDGEEASFIVM VDGKNVEPMATSQDHKRVGEGDKGLNTGGMGAYSPAPVVTQEIHQRVMEQ IIYPTVRGMAAENNVYKGFLYAGLMIDKNGQPKVIEFNCRFGDPETQPIM MRMQSDLVELCLKACKGELDQIKSEWDPQAALGIVLAAEGYPGDYRKGDE ISGIPVQASQDEKVFLAGVAEKEGKLVTNGGRVLCVTALGNSVLSAQQKA LKLAEQVNWTGRFYRRDIGYRAVAREQNG >MS1033 purE, PurE protein MSNTHAQIAIVMGSKSDWSTMQEATGMLDQLNVPYHVEIVSAHRTPDKLF SFAENAQAKGYKVIIAGAGGAAHLPGMIAAKTLVPVLGVPVKSSMLSGVD SLYSIVQMPKGIPVGTLAIGPAGAANAGLLAAQILAAWDSELSARLQKFR EQQTNAVLNNPDPRN >MS1003 purF, PurF protein MCGIVGIVSQSPVNQSIYDALTVLQHRGQDAAGIVTIDDENRFRLRKANG LVSDVFQQVHMTRLQGNAGIGHVRYPTAGSSSVSEAQPFYVNSPYGLSLV HNGNLTNSDELKSKLFKLARRHVNTNSDSEALLNILAYYLDHMQTEHLSP EDIFYAIKKTHKDIRGAYACIAMIIGHGMVAFRDPHGIRPLILGKREESG KTEYMFASESVALDTAGFDVVRDIEPGEAVYITFDGKLYAEQCAENPVLT PCIFEYVYFARPDSTIDGVSVYAARVHMGERLGQKIANEWADADIDVVIP VPETSNDIALRIATILGKPYRQGFVKNRYVGRTFIMPGQKQRISAVRRKL NTISSEFKDKNVLLVDDSIVRGTTSEQIVDMARAAGAKKIYFASAAPEIR YPNVYGIDMPTKHELIAYGREPEEIAKLIGVDKLIFQDLSALTQSVQQEN PNIKEFDTSVFTGHYVTGDISTEYLDNIAQQRNDAAKRKRAKDATNLEIH NEG >MS1297 purH, PurH protein MQIRPIRQALLSVSDKTGIVEFAQGLVQRGVKLLSTGGTAKLLADSGLPV TEVSDYTGFPEMMDGRVKTLHPKVHGGILGRRGTDDEVMQKHGIEGIDMV VVNLYPFAQTVAKPNCTLEDAVENIDIGGPTMVRSAAKNHKDVAIVVNNA DFHMILAEMDQNQNSLTLETRFDLAVRAFEHTAQYDSMIANYFGQLVKPY FAAEEEDKDAKCGQFPRTLNLNFIRKQTMRYGENSHQNAAFYVEKEVKEA SVSTAKQLQGKALSYNNIADTDAALECVKSFDEPACVIVKHANPCGVALG KDVLEAYNRAYQTDPTSAFGGIIAFNRELDEATATAIVDRQFVEVIIAPT VSAAAVEVVKRKKNVRLLACGELSKPQARLDVKRVNGGLLVQDADLGSVS IDDLEVVSKRKPTKQELEDMLFCWKVAKFVKSNAIVYAKNNQTIGIGAGQ MSRVYSAKIAGIKAQDEGLEVKGCVMASDAFFPFRDGIDAAAEVGIECVI HPGGSMRDQEVIDAADEHNMVMVLTKMRHFRH >MS1032 purK, PurK protein 
MQKSALYPTVYVLGNGQLGRMLRYAGAPLDINVQPLAFNASVFDLPKDSI ITAEIERWEETPLTTMLGNHHKFVNKNVFVKTADRLTQKSLLDELALPTS PWCLVENHQQWADIFTNVGEKVVVKRRMGGYDGRGQWIITEENKTLITDE LLNEVIAEKFIPFDYEISLVGARFRNGDTRFYPVTHNLQQDGILRYSVTD ETFPQQARQQVQAEAMLSKIMAKLDYVGVMAMECFVVGDKLLINELAPRV HNSGHWTQLGCSVSQFELHLRALLDLPTPKLTTFAPSVMVNLIGTDHNKL WLDTPFSQLHWYGKEVRTGRKVGHINISHPDKNVIITQLEKLAGELPNDY QSGLNWAINKLK >MS1806 purL, PurL protein MFQIFRGSPALSEFRLNQLSARFQKADLPVKSCYAEYLHFADLSAGLSAE ETDELEQLLHYGPTLAQHESKGECFVVIPRVGTISSWSSKATDIAHNCGL DKVVRLERGIAYYFEFERTLSAEQQQRLVSHIHDRMMETVVRAPEQAAVL FDSQDPKPFTTVDILNGGRKALEIANVELGLALASDEMDYLVENFTALGR NPNDIELYMFAQANSEHCRHKIFNADWVIDGEKQEKSLFKMIKNTFEKTP DHVLSAYKDNAAVMEGSKVGRFFADQDGQYRYHNEDAHILMKVETHNHPT AISPFPGAATGSGGEIRDEGATGRGAKPKAGLVGFSVSNLVIPGFEQPWE NELSKPSRISSALDIMIEGPLGGAAFNNEFGRPALLGYFRTYEEKVNSFA GEEVRGYHKPIMLAGGIGNIRAEHVQKGEIPVGAKLIVLGGPAMNIGLGG GAASSMTSGKSKEDLDFASVQRDNPEMERRCQEVIDRCWQMGEGNPIAFI HDVGAGGLSNAMPELVHDGGRGGKFELRNILCDERGMSPLEIWCNESQER YVLAVAPENLAVFEELCQRERAPYAIIGEATEEEHLTLHDNHFDNNPIDL PMSLLLGKTPKMTRDVKSTQVNNSPVDQTNIELKEAFHRVLRLPVVAEKT FLITIGDRTVTGMVARDQMVGPWQIPVSDVAVTTAALDTYHGEAMSIGER APVALLDFAASARLAVAESITNIAATNIGDIKRIKLSANWMSAAGHEGED AGLYEAVKAVGEELCPALGLTVPVGKDSMSMKTTWSENGEQKTVTAPLSL VISAFARVEDVRKTVTPQLRTDKGETALLLIDLGEGKNRLGATALAQVYK QLGDKPADVVNVELLKGFYNAMQTLVQQGKLLAYHDRSDGGLIVTLAEMA FAGNCGIRAEISALGDNDLGILFSEELGAVIQVRESDLAAVREVLTQHGL IHLTKDLGLVTEYDEFEIKRGTKVVLSEKRSELRGIWAELTHQMQRLRDN PECADQEFAAKKDPANQGFSAHLTYDINEDVAAPYIATGKKPRIAVLREQ GVNSHVEMGAAFDRAGFEAIDVHMSDLHTARQNLKDFNALVACGGFSYGD VLGAGGGWAKSVLFNTALRDQFQAFFEREDTLALGVCNGCQMISTLADII PGTENWPRFVRNTSERFEARAALVRINESNSVWFQGMAGSHMPIAVSHGE GRVEFKNDSQLQGLRDQGLIIAQYVDNNIRPTEVYPANPNGSVDGITALS NTNGRVAIMMPHPERVFRTVSNSWHPEDWSEDGAWMRLFRNARVFFK >MS0626 purM, PurM protein MSKQSLSYKDAGVDINAGNALVDRIKPHVKRTTRPEVIGGLGGFGALCAL PTKYKEPVLVSGTDGVGTKLRLAIDLNKHDTIGIDLVAMCVNDLVVQGAE PLFFLDYYATGKLDVDVATDVVAGIAEGCVQSGCALIGGETAEMPGMYHA GDYDLGGFCVGVVERAKIIDGSKVKTGDALIALGSSGPHSNGYSLIRKVI 
EVAGVNPATEQLAGRPLADQVLAPTKIYVKSVLELIEHVDVHAIAHLTGG GFWENIPRVLPEDVKVVINENSWEWQPVFKWLQEQGNITRHEMYRTFNCG VGMVIALPQADAEKALQVLKAAGENAWLIGQVEPLNAGEEQVIIR >MS0627 purN, PurN protein MKKIVVLISGQGTNLQAIMDACKAGKINAQVAAVISNKADAYGLIRAKNS GIPTAVFERKNYADNSQMDRAISDYIDGIAADLIVLAGYMKILTAGFTRH FAGKILNIHPSLLPKYPGLNTYQKAIEAGDSEHGTTVHFVNEKMDGGAVI LQAKVPIFPDDRIEDVEERVKIQELQIYPLVVKWFVDGRLKEAGGKAYLD GQLLAENGYAAE >MS0148 purR, PurR protein MSLANNSNKNRRSTGKVTLADVAKEVGVGTMTVSRALRTPKMVSENLRQK IHEAVQKLGYVPNSAARELASVSSRNIVIVTSSLVSVENNLILNSLQKEL QPLDLQIIILVANKKGWLRELINNSPLAVILLNLQCPSTEAQWIRNSGLI CLEIGSKQANPLGINVCVDSKSAVQKVISFLVAKGYRDIGLLCAQQEQAI FQQYLACWHSALHANHLNSHQILHCSEPVSFSAGAKLFNEAISTWGCIDA FVFLSDELACGALFEAQRQHIGIPYDVAIIGLGDLEISQTTYPALTTLNI PYAKLGETAGKKLAELLQTEKDPQTECIQLISTLRERESG >MS1531 purR, PurR protein MSVQKIAKLAGVSVATVSRVLNDSPSVKAVNKEKVLAAIKALNYQPNLLA RQLRTSRTGMILAMVSNIANPFCAAVVKGIEREAEKNGYRILLCNTESDL ERSRSCLQLLSGKMVDGVITMDAISELPELQNIIGDAPWVQCAEYDPDSS VSSVSIDDISATEFVIDQLVKTGKKRIALINHDLSYQYAQHRELGYLDGL KRHGLAYCEIIYADELDYLSGKEAVLSLLKNAQRPDAILAISDVLAAGVI NGLNELNVAIPEDIAVVGFDGIDISQITTPSLSTIQQPCKEIGEMAFSLL LQQIDSTSSVKRVHHLLPWTFIKRQSS >MS0284 purR, PurR protein MATMKDIARLANVSTSTVSHVINNDRFVSEKIREKVMAVVKELNYQPSGL ARSFKTKETKTIGMLVTASDNPFFAEVVHAVERYCCQQNYNLILSNTEGS PQHLQHNLQMLINKQVDGLLLMCSETHTQDNMPINLPIPAVIMDWWPSEL TADKIFENSELGAYLATKHLIHHQHKRIAIVNGDLRKPIAQNRLIGYKKA LTEANLPIDETLIFEGKFDFQTGFDALERLLKTDCPPSAIFACCDAIALG IYQAAWRHNLIIPRHLSVIGYDDTILSQYIAPPLSTIHQPKTELGKLAVQ TLLERIKNPQKTYRTFVLDPVLVERESVATRKES >MS1317 purR, PurR protein MKLEELAKLAGVSRTTASYVVNGKAKQYRVSDKTIEKVQALIKEYDFKPN AMAAGLRAGKSNTIGLIIPDFENLSYAKIANQLEKSCRENGYQLLITCSN DNVANELECAKHLFQRQVDALFVSTVLPADNHYYQQNNAIPIIGFDRHID SEGVDNVLTDDKHDAYELAVSLFDKADYQRILFLGALPELPMSKAREEGF KQALGKKQVQVDYLYASQFRKENAEQLVSEWIEKNGIVPDAIFSTSLTLL QGLLMSFIKRNEAFPKDLVIATFGWHEMLELLENKIVCSVQDHSKVVQAL LDLALHKMRIKKLKQPHPVIQRRLAYHNWQ >MS1238 purR, PurR protein MKYTINEIAKLCNVGKSTVSRVLNKDPKVRSETREKVQRVIDRLGFQPNR SARAMRAGQEPVVGVIVSKLDSGSESQTLRAILQALQAEHITPLIVESRF 
EAEQVRHHFQLFRERQVNAVILFGFFPLPLEIVREWQGSLVVIARTYPNI SSVYYDDEQAITRLMTELYRQGHRRIAYLGIQDSDETTGKLRTQSYLQFC RSHNIRPNSVSVELSAESAYLHCAELFTRPVDALVCATGRLALGAFKFSQ QSGRVFPIAYVGYNELLQYMMPNALSLDFGYCQAGLKAVELLMRQLRGKS STEHYLVSTHQP >MS0644 purR, PurR protein MITIRDVAKQAGVSVATVSRVLNNASSSEKARKAVQSAVEKLGYSPNANA QALALPTTDTIGVVVTDVTDAFFAILVKAVDQVASSYNKTILIGIGYHNA EKERNAIDTLLRKRCSCLVVHSKALSDEELANYLEQVPGMVIINRSIQGY EHRCVSLDNQRGTFLATETLIRLGHKRIGYIGSNHHINDEEERRQGYIQA LQHHRLPQIDDAIIQSSPDFEGGEEAMIKLLSYHSDLTAVVAYNDSMAAG ALSVLNENNINVPRQFSIIGFDDMPISRYLIPKLTTIRYPIDLMANYAAR LALSLVNEGIETPLHAQFNPTVVRRFSTENCNNP >MS1063 purR, PurR protein MATIKDVAKMAGVSTTTVSHVINKTRHVADETKQTVLDAIKALNYSPSAV ARSLKVNTTKSIGMVVTTSETPYFAEIIHAVEEQCYRQGYSLFLCNTQND PDKLKNHLEMLAKKRVDGVLVMCSEYKDDSRDLLKSFSYLPIVIMDWGPV NPDTDLILDNSFEGGYLAGKHLVDNGHKKIGYLSAELTKVTAKQRYQGFI KALSEANVEMKSEWLFEGSFEPEDGYECMNRLLALEDRPTAVFCCNDIMA LGAISAITEKGYRVPDDFSVIGYDNVHSSRFFAPPLTTIHQSKARLGERA LRLLFERIAHKDAKRETIEIHPELVIRKSVKKIA >MS1242 purR, PurR protein MTKHKRPTLQDIANHLGITKMTISRYLRNPASVAEETGKRIAKAIEEFGY IPNRAPDILSNAKSRAIGVLVPSLTNQVFADVIKGIEEITDEAGYQTMLA HYGYSEKKEEQRIESLLSYNVDGIILSENSHSERTKKMLQVANIPVIEIM DTSEIGIQQVIGFDNIAAAQAMVETMIKRGYKKIVYFSARLDKRTQLKMQ GYQQAMKKYQLSPRIIATKEHSSFTHGAELLHQALKQYPDIDGIFCTNDD LAIGALFECQRLGIKVPKQIAIAGFHGHDVGQSITPQLATVITPRLQIGR IAAQELLARLQNIPAQSSIINLGYQIHLGESI >MS2375 purR, PurR protein MKSGLKHHRIALLFNANKVYDREVIEGVGQYIQASQCLWNIFIEDDFVYR KESLHNLDIDGIIADFDDPETVAMLEHTEIPVIAVGGSYQNPAFYPHYPY VATDNYALVETAFLHLKQKGINQFAFYGLPNETPKHWSEERKNAFMQLMA DYGHQTYIYLGEQAHSDNWLEVQSKLCDWISRLPPHTGIIAVTDARARHL LQACEYLNIAVPDELCIIGIDNEELIQYLSRVSLSSVVQGTNQIGYQAAK LLDQLLKGRPVSQTPILVPPLRVEQRRSTDYRSLHDPLVIQAMHYIRHYA TQGIKTEQVLDHLRISRSNLEQHFKAEMNKTIHQVIHEEKLDRAKNMLKF TDVPIQEISDICGYPSLQYFYAVFKKEYGQTPKEFRER >MS0808 purR, PurR protein MLMVSLKDVAKEAGVSLMTVSRALKSPDKLSPKTYKVVKEVIDRLGYVPN LAAQHIRGVAANTIGVLSLGTATTPFSVEILLGIEQTVRQHGWNSFVINT FENDSQAMEDAVEQMLSHRPSAIIIARNGLKNVSIPEKLRSFPLVLANCQ TQDMAVAAYIPDDYQGQRVVVDRIVAKGYQRPLFLHIPKNYIATAKRRQA 
FEDAWANHSGQKPVQFFMRRDGEDYFEGAQPLIDYLEKPDPLPFDVIICG NDRIALVAYQLLLAKGYRIPEDVAVCAYDNMVGIAQLFIPPLTTVELPHY QMGQEAALHLIEGRKDRDIHQLPCPLIEGESC >MS0420 purT, PurT protein MTMTTLGTALTPKATKVMMLGSGELGKEVVIELQRLGVEVIAVDRYKNAP AQQVAHRSYTISMLDGEALKALVEKERPDYIVPEVEAIATATLVELEQKG FTVVPTAKATQLTMNREGIRRLAAEELGLPTSNYQFVDNFTDFKSAVENI GIPCVVKPIMSSSGHGQSIIKSFDQIQQAWDYAQQGGRAGAGRVIVEGFV KFDYEITLLTVRHIGGTSFLAPIGHRQQNGDYRESWQPQAMSEIALQKAQ QVAEKITSALGGRGIFGVEMFVCGDEVIFNEVSPRPHDTGMVTLISQELS EFALHARAILGLPIPEINLISPAASKAIVVEGKSTQVQFGNLEQVLAEPN TNIRLFGKTEVDGHRRMGVILSRDISVEKALEKAFRAYDKLEINL >MS1323 purU, PurU protein MIEKKILLTDCPDDKGLIAKITNICYKHQLNILHNNEFVDFETKHFFMRS ELEGIFNEATLRADLEFSLPEGANFRLIDAQKRKRVVILVTKEAHCIGDI LMKNYYGGLDVEIAAVVGNHETLKELVERFDIPFHCVSHEGLTRVEHDKL LAEKIDEYAPDFIVLAKYMRVLNPDFVARYPNRVVNIHHSFLPAFIGAKP YQQAYERGVKIIGATAHFINNELDQGPIIMQNVINIDHTYSADAMMKAGR DVEKTVLSRALDLVLHDRVFVYKNKTIVL >MS1551 putA, PutA protein MTKDLNVFDKHYGLLINGEWTDGSEGKTLTAHNPANGAELATFIDATDAD VDAAVTAAQEAFKTWKHTTAAERAAILNKIADVIDENTELFALQETLDNG KPIRETRAADIPLAADHFRYFAAVIRSEEGSANQLDDEDLSLILREPIGV VGQIIPWNFPFLMAAWKIAPALAAGCTVVIHPSSSTSLSLLSLAQKINHL LPKGVFNVITGKGSKSGEYMLHHTGFNKLAFTGSTEIGRKIGVAAAEMLI PSTLELGGKSANIFFDDMPFDKALEGAQKGILFNQGQVCCAGSRIFVQEN IYDKFIAALKEEFKKVKVGLPWEDDTQMGAQVNSNQIKVISKYVDIAREE GCEIIIGGEKATDPALAKGEFFQPTLILAPDNTKRVAQEEIFGPVAVVIK FKEEADVIHMANDSEYGLGGAVWTHNINRALRVARALETGRVWVNCYNRL PAGAPFGGYKTSGIGRETHKMMLDAYTQVKNIFISTREEREGMY >MS2132 putA, PutA protein MINIPSLIQAQRNFFAKGATKSLSFRKEQLLRLKALLEENTQAIIEALKT DLNKPADQVMLAEISPLIHEIDYMLENLDRLAAPKDVESPETLSFFGMGE YHSQIIYEPYGVTLNISPWNYPIQLSISPIIGAIAAGNTVVLKPSEFTAA TSALLNRLVAQYFVPEFFVVIEGDVAVNQALLAEKFDYIFFTGSVPVGRI VMAAASKHLTPVTLELGGKSPFIVDKSANLEQAAESLIFGKTFNSGQTCI APDYLLVQQDVKAEFVAILKQKLQQKFDDNPFENYAKVVSERHYLRIKSF LNDGKIVAGGLFNDETHQMLMTVLDGVTWESPVMQDEIFGSVLPMLTFNG FDEAIERILAQPKPLALYCFTETEENATAVLSQVSFGGGAVNSCFLHFFN HNLPFGGVGDSGMGSYHGDRSFYELSHEKAVVTRKIV >MS1741 putP, PutP protein MNVDYLVMAGYFALIIAISLLFKKMASNSTSDYFRGGGKMLWWMVGGTAF 
MTQFSAWTFTGAAGKAFNDGLSVIAVFVGNMVAYACAYWYFARRFRQMRV DTPTEAIGRRFGTSNEQFFTWVIIPLSVINAGVWLNGLSVFASAVFDADI TMTIYVTGISVLIISLLSGAWGVVASDFVQMLVVAVISVACAVVGLVVIG GPGEIIDRFPGGFVSGPDMNYPLILICTFLFFIVKQLQSINNMQDSYRFL NAKDSKNASKAAIFALLLMLVGTIIWFIPPWVTAIIYPEAASLYPQLGKK ASDAVYLVFAKNVMPAGTIGLLMAGLFAATMSSMDSALNRNSGVFVRSFY APIIRKGKADDKELLRAGQIVCVINGILVILMAQFFNSLKHLSLFDLMMQ VATLLQSPILVPLFLAIIIRKTPKWAPWATVLFGMFVSWSVVKVFTPEYV ASWFGVEDLTKREISELKVIITIAAHLIFTAGFFCLTTLFYNEAKDTNNE RRIAFFKDVDTECVAEEGQDEIDRLQRKKLSTLVMLMAAGLLLMILIPNP LWGRALFACCSLAIFAVGYGLKRSAEV >MS1786 putP, PutP protein MNLGVIFPLVIYLAFIFGAAIYAYVKRQRGDFLTEYYVGNRSMTGFVLAM TTASTYASASSFVGGPGAAYKYGLGWVLLAMIQVPAVWLALGALGKKFAM LSRETNALTINDLLLYRYKNKYLVWIASIALLIAFFAAMTTQFIGGGRLL ETTIGINYTQSLLIFALTVGLYTFIGGFRAVVLTDTIQGTVMILGTMILL GAVIYAAGGTEAAITKLTEVDPQLVSPYGPNNMLDFQFMTSFWVLVCFGV VGLPHTAVRCMAFKDSKALHSGMLIGTIILSIIMFGMHLSGALGRAIVPD LTIPDQVIPTLMIKVLPPIVAGIFLAAPMSAIMSTIDAQLIQSSAIFVKD IYLAAKPEKANNQKLISRFSSLITLTITVILVFLSLNPPDMIIWLNLFAF GGLEAAFLWVIVLGIYWNKANAYGAIASMVVGLGSYIYLTVAKLKLLDFH AIVPALVFGLIAFLIANKIGERKQIKA >MS0777 putP, PutP protein MFGLDPTLITFTIYILGMLAIGVLAYYYTNNISDYILGGRRLGSFVTAMS AGASDMSGWLLMGLPGAVYVSGLIEGWIAIGLTIGAYLNWLFVAGRLRVH TEFNNNALTLPEYFHSRFGTSHNLLKIISASIILVFFTIYCSSGVVAGAK LFQNLFGIPYATALWYGALATIAYTFIGGFLAVSWTDTIQATLMLFALIL TPVVIVVSLGGIDGFSASMQSAEIDMQKDFTDLFTGTSTLGLFSLAAWGL GYFGQPHILARFMAAYSAKSLHKARRISITWMIICLIGAISIGFFGIAFF HANPQIAEVVTKEPEQVFIELAKLLFNPWVAGILLSAILAAVMSTLSCQL LLASSAITEDFYKGFIRPKAGEKELVWLGRIMVLIIAALAIWIAQDENNK VLKLVEFAWAGFGSSFGPVVLLSLFWKRMTSSGAIAGMLTGAIVVFSWKS VIPATSEWSGVYEMIPAFSLASLMIILVSLLSPAPNKEIVETFEKANLAY KNAE >MS1197 pykF, PykF protein MIISIALLTISAYNTAQFFLHGCFLVFSSNMKKGYLNQTNKIFTEYLMSR RLRRTKIVCTMGPATDKGNNLEKIIAAGANVVRMNFSHGTPEDHIGRAEK VREIAHKLGKHVAILGDLQGPKIRVSTFKEGKIFLNIGDKFILDAEMPKG EGNQEAVGLDYKTLPQDVVPGDILLLDDGRVQLKVLATEGAKVFTEVTVG GPLSNNKGINKLGGGLSADALTEKDKADIITAARIGVDYLAVSFPRSSAD LNYARQLAKDAGLDAKIVAKVERAETVETDEAMDDIINAADVIMVARGDL GVEIGDPELVGVQKKLIRRSRQLNRVVITATQMMESMISNPMPTRAEVMD 
VANAVLDGTDAVMLSAETAAGQYPAETVAAMAKVALGAEKMPSINVSKHR MNVQFESIEESVAMSAMYAANHMRGVAAIITLTSSGRTARLMSRISSGLP IFALSRNESTLNLCALYRGVTPVHFDKDSRTSEGAKAAVQLLKDEGFLVS GDLVLLTQGDASSSSGTNLCRTLIVE >MS0691 pykF, PykF protein MNEKYVPNQFRQKLLKGETLIGCWCALGNPITAEVLGLAGFDWLLFDGEH APNDVLSFIPQLMAVKDSASMPIVRVPKNEPVIIKRVLDIGFYNVLVPYV ESKEEAEEAVSATRYPPEGIRGVSVSHRNNGYATIPDYFKVINDNIGVIV QIESQKGVDNVDEIAAVNGVDCLFVGPGDLSAALGYLGQPNHPEVQKVIQ HIFATAKKHGKPCGILAPVEADARRYLEWGATFVAVGSDLGVFRGATKAL SEKFKG >MS2103 pykF, PykF protein MNVFDKAFLHNKFKAAVLEHKTQIGFGLVSGSAVNAEIVAGSGYDFIWID GEHGPNTVTTIIDQARAIAPYGSHVIVRPLEADRALIKQLLDAGIQSIIA PMVESGEQAEYIAQSMYYPSRGKRGFGAPAVRAGRWGRLPEYIKHAEDEL FLAVQIESKKGVENLKDIVTTDGVDAVFLGPADLAVDMGYFGDFSGEEMQ ATIEKLIKDIRALGKPVGTIAGSPEEAKRYIDWGASFVVVGVDTIFLAHM ADSVLGACRSIVK >MS1035 pyrD, PyrD protein MLYPLIRKGIFALEPENAHDLAIKMLHLAGNPILNKLLKALLACPSGNEK TVMGIKFKNPIGLAAGADKNGDAIDGFGAMGFGFIEVGTVTPLAQDGNAK PRQFRIVEAEGIVNRNGFNNYGVDYLVENVKKAKFDGVIGINIGKNKVTP VERGKDDYIFCLNKAYNYAGYITVNISSPNTPGLRQLQYGDALDDLLKSI KERQAYLAQVYNKYVPIAVKIAPDQTEEELVQIADTLRRHKMDGVIATNT TISRDTVAGMKNADQTGGLSGKPLQHKSTEIIRRLQQELKGEIPIIGSGG IDGVQNAQEKIVAGAELLQVYSGLIYHGPGLVKALVEAIR >MS0251 pyrE, PyrE protein MQNYKQEFIKFALSRNVLRFGEFTLKSGRVSPYFFNAGLFNTGADLARLG EFYASAIQASGLNYDVIFGPAYKGIPIGTTVSVALFNKFNLDKPVCFNRK EAKDHGEGGNLIGSPLQGRILLVDDVITAGTAIREAMDIIAANNARLAAV VIALNRKERGKGELSAIQEVERDYRCDVLSIIDLDDLMQFIENEPEYSQY LPAMKAYREQYGVA >MS1472 pyrF, PyrF protein MKAKFSKEEVNMSNKIIVALDYETEKEALQLVDQIDPSLCRLKVGKEMFT TLGTNFVKLLQDRDFDVFLDLKFHDIPNTVARAVRSAADLGVWMVDLHAS GGLRMMEEAKKILEPYGKDAPILISVTVLTSMEDLDLLQIGINASPMEQV IRLAHLSQRAGLDGVVCSPQEVEILRQHLGKEFKLITPGIRPVGSEFGDQ RRVMTPPAAIEAGSDYLVIGRPITQAANPAEVLRSINASIANLIA >MS0255 pyrG, PyrG protein MIGYNHFISLLIPTLGFTMATNYIFVTGGVVSSLGKGIAAASLAAILEAR GLNVTMMKLDPYINVDPGTMSPTQHGEVFVTQDGAETDLDLGHYERFIRT KMTKRNNFTTGKIYSEVLRKERRGDYLGATVQVIPHITNEIKARVIEGAA GHDVAIVEVGGTVGDIESLPFLEALRQLAVQVGREKTIFMHLTLVPYIPT AGEVKTKPTQHSVKELLSIGIQPDVLICRSDRMVPPNERAKIALFCNVPE KAVISLKDVDSIYRIPALLQSQGLDDLICQRFRLACKEADLSEWEQVLYR 
QANPTGDVTIGMVGKYVELPDAYKSVNEALKHAGLTNRLNVHIKYIDSQD VETKGIDVLKGVDGILVPGGFGYRGVEGKILTAQYARENNIPYLGICLGM QVAFIEYARHVAGLTQANSSEFDKNCPQPVVGLITEWQDADGSVEQRSEN SDLGGTMRLGAQQCHLIEGSKARELYGKETIEERHRHRYEVNNTLLPQIE AAGLKVTGLSADKKLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGF VKAAKENQKK >MS1930 pyrH, PyrH protein MRVKMNKPIYKRILLKLSGEALQGDEGFGIDPSILDRMALEIKELIAMDV EVGVVIGGGNLFRGAKLAKAGMNRVVGDHMGMLATVMNGLAMRDALHRAD VNAKLMSAFQLNGICDTYNWSEAIKMLREKRVVIFSAGTGSPFFTTDSAA CLRGIEIEADVVLKATKVDGVYNCDPAKNPDAKLFNKLTYAEVIDKELQV MDLAAFTLARDHGMPIRVFNMGKPGALREVVTGETEGTIIS >MS0628 pyrR, PyrR protein MEKIIIDESQFMRTISRISHEIIEKHQNLNDLVIVGIKRRGAEIADLIKR KINELSGQSLPSIDLDITFYRDDLEYVEPASQSPVYSGASEFISVQNKTV ILIDDVLFTGRTIRAALDALVDFGRAAKVELVIFVDRGHRELPIRADYVG KNVPTSRSEKVQVRTMKFDGCYEVALISK >MS1763 qRI7, QRI7 protein MRILGIETSCDETGVAIYDEDKGLIANQLYTQIALHADYGGVVPELASRD HIRKTAPLIEAALQEANLTAKDIDGIAYTCGPGLVGALLVGSTIARSLAY AWNVPAVGVHHMEGHLLAPMLEDADNRPQFPFIALLVSGGHTQLVKVEGV GKYEVMGESIDDAAGEAFDKTAKLLGLDYPGGAALSRLAEKGSAGRFVFP KPMTDRPGLDFSFSGLKTFAANTINQAIKNEGELSEQTKADIAHAFQTAV VETLAIKCKRALKETGYKRLVIAGGVSANKQLRQGLANLMDDLKGRVFYP APQFCTDNGAMISYVGYLRLKHGERTDLAIEVKPRWPMIELEAI >MS1558 queA, QueA protein MHLSDFYFDLPDELIARYPKPERSSSRLLRLSGENGDISHHTFSDVYDLI NEGDLLIFNNTRVIPARMYGRKASGGKIEVLIERVLTESRFLAHVRSSKA PKAGTELILGEDKLGEGKGVKAVMIGRQDALFELEITEKSTALLDILQKI GHMPLPPYIDRPDEDADQERYQTVYSKIPGAVAAPTAGLHFDEELLAKLK AKGVNFAFVTLHVGAGTFQPVRVNNIEEHHMHAEYVEVPQQVVDAIVATK AAGKRVIAVGTTSVRSVESAALAAQEKGFAQIIEPFFADTSIFIYPGKQF RVVDCLITNFHLPESTLIMLVSAFAGYKNTMNAYKSAVENRYRFFSYGDA MFITKNNHVKGLD >MS0014 racX, RacX protein MVSITGIIEIIHIINDKVKDFEMKTIGILGGMSPESTASYYLEINRAVNL ALGGNASAKLLISSVDFEEIVQCQKAGDWQKAGKILAEQAKLLEQAGADG ILLATNTMHKVARPIIDNISVPFLHILDAVADSIKAKGLNKVALLGTAFT MSDNFYRDGLIERGITPVVPDEETQKEIHRIIFEELCIGKILPQSKTFYL KTIEKLTALGAEGVILGCTEIGLLINQADSTLPFFDTALLHSEMAVDFVL EK >MS1941 radC, RadC protein MTKRYLLEELQQNQEFNSTDTARIYLQTALEQREREIFLVLFLDNQHRLI KQEEMFLGTINSAVIHPREIIKTALYCNAAAMILAHNHLRESPNRVNRIA ILRKGSVRRQI >MS1940 radC, RadC protein 
MEFSTQSNPSIKNKGERLMLQIEKENLMPREKLLKFGANTLDNKELLAIF LRTGIKNCPVMQLSEAVLTHFGSLRQLINADRHNFAPLRASASLNLSNYR PVRK >MS1939 radC, RadC protein MDKLSDADALKGAKLCRSALINCRNEPKCVKTASESCITGQFFIPVRKNI ANNSLLSKVLAPNFNSFSRGIRFSFSICSIRRSPLFLIEGLLCVENSIMP TELDQFFETDRKNKIFSSLSFEVECQNLFVFILCFLNYGFWVPEE >MS0578 rarD, RarD protein MFMSISAKLKGWHYAFACYAIWGTFPIYWYPLNSSAMPADQILAQRIVWS VVFAVFLLIIFKQSRAVLRAFTKPKILAIFFLSSFLIALNWLVYLWAITN HHVLDASLGYFINPLFNVFLGRLVFKERLNKPQLLALCFATAGILWLAIP AGQIPWVALLLAGSFGFYALIRKLAPMEALAGLALETLLLSPFALAYLFF CYTQNTLVFSELNSLQLGVLLGSGAATTIPLLWFAMGARQISMSLLGMLQ YISPTLQFLCGSLLFGEALSITRLIGYSLVWIGVAIFLLAMRKKMQNK >MS1442 rbfA, RbfA protein MSREFKRSDRVAQELQKEIAVILQREVKDPRIGMVTVSDVEVSRDLAYAK IFVTFLFNNDDEAIKQGMKALEKAAPYIRSLVGKAMRLRIVPELRFEYDR SLVEGMRMSNLVTNVVRSDKERHIDEENGED >MS0032 rbn, Rbn protein MRRVVMTQLKLLFAVFYCRFQQNKLTQAAGALTYSTMLAIVPLVMVVFAI FSAFPMFNEAAAELKTFIFDNFSPSAGDTVGQYIDEFVVNSKSMSAVGII SLIAVALLLINQIDRTINDIWNSKNRNFIFSMTIYWTLLTLGPIFIGMSF AINTYIRSIIAFEGDLGLPFGLKLLSFVPFLLTWLSFSLIYTLVPNTKVN FRYAAVGALVAAIFFTLGKKAFAWYMATFPSYQLIYGAMATLPITLLWIQ LSWLFILLGAQLTAVLGDMRLIKSGDLNLTAIKEKTE >MS0643 rbsB, RbsB protein MKKTVLSAVALAVGLGAGISTAQADTKIGVTIYKYDDNFMALMRKEIDKE ATNLKDVQLLMNDSQNAQSIQNDQVDVLISKGVKALAINLVDPSAAPTVI SKAKPDDIPVVFFNKDPGEKALAKYEKAYYVGTDPKESGTIQGDLIAKQW KANPALDLNKDGKIQYVLLKGEPGHPDAEARTKYVIEQLNANGIETEQLF IDTGMWDAAMAKDKTDAWLSSSRANDIEVIISNNDGMAMGALEATKAHGK KLPIFGVDALPEVLQLIKKGDIAGTVLNDGVNQGKAVVQLANNLAQGKDA TEGTQYKAENRVVRIPYVGVDKDNLSEFLK >MS1612 rbsB, RbsB protein MASKLLKNIFKFSALISALPALAFAADKPQIALLMKTLSNEYFISMRQGA EETAKEKNIDLIVQVAEKEDSTEQLVGLVENMIAKKVDAIIVTPNDSIAF IPAFQKAEKAGIPIIDLDVRLDAKAAEAAGLKFNYVGVDNFNGGYLEAKN LAEAIGKKGNVAILEGIPGVDNGEQRKGGALKAFAEYPDIKIVASQSANW ETEQALTVTTNILTANPNINGIFAANDNMAIGAVTAVENAGLAGKVLVSG YDGIPLAIEYVKQGKMQNTIDQLPKKQVAIAIEHALKQINKQEIPPVYYV DPVVVDKEESKNY >MS0639 rbsB, RbsB protein MKKLVLNTVAISVLLGSGLAVAQTEPLIGVTIYQYDDNFMNLMLSEINKE SANFKDVRFLMNDSQNSQAIQNNQIDILLAKKVKVLAVNLVDPPAAKTVI AKAKKHNVPVIFFNKDPGAKLLASYNHAYYVGSSPKNSALEQAKLIAKHW 
NANKQFDLNQDGKIQFAMLTGQPDSTAAEVRSKYVIEELHNLGIQTEALF VDTAMWNGNMARDRMELWLNDTKGKQIEMVIANNDAMALGALESLSAQNK QLPVFGIDALPETLTLIKTGKITATVLNDGAYQSKVLVELARNLALGKNA AEGMPWKPENNSILSPDIAIDKDNVEQYRK >MS0063 rbsB, RbsB protein MKLSKRTFLKSLVAVSILAVTGLNSNPVYSSSAEPIKLGFLVKQPEEPWF QTEWAFADKAAQALGNVQIIKIAIPDGEKTLNAIDNLAANGAKGFVICTP DPKLGPAIMAKARAYDLKVIAVDDQFLNAAGEPMTNVPLIMMAASEIGQR QGEELYKEMQNRKWDVKDTAVLAITADELDTARRRTEGSIEALIKAGFPK EKIYKSPTKSNDIPGALDAANSMLVQHPEVKNWLIVGMNDNTVLGGVRAT EGQGFKPENVVAIGINGVDAVNELSKPRATGFLGSLLPSPDIHGYRSVEL LTKWIREGVEPEKYIAVQDVVFLKRDNFKEELSKKGL >MS0201 rbsB, RbsB protein MRKFLKTTLVSAVFTFMIGSAYAQLTPLNSDTEQDRINWTELESKLGSFP TLKEGLKIGGVSKTLTNEYWRSLGEGYKNFADKHKVFVAYQAAANEGDQL GQLSIAETMITEGYSALLFSPQTDVNLQPAAEVAQSKNIPVVNVNDAVMP TATHYVGNVQKDNGVRVANWFIEHSAEGGKVAVVEGQPGVFAAKQRTEGF TETINKSGKFEVVASVPANWSREQAYNVAMTILQRNPDLIGFYVNNDGMA LGVVEAVKAAKLQGKVAVFGTDGISDAYKSILAGELTGTVDSFPVLTGEV AMEVVLRLTAGQKLPRVVTTPQALITLENAKTYSEADAKTIREILSK >MS0283 rbsK, RbsK protein MDKIMKKLTVLGSINADHVISVPHFVKPGETLTGSNYHIAYGGKGANQAV AAARLGADVDFIACIGDDDIGKAMKAAFERDGIHTHTISTIPHQTTGIAM IQVAESGENSIVISAGANAHLTEELLAQHQESIAHADCLLMQLETPISAV EKAAVFAKRNGTKVVLNPAPAQPLSDHLLAHIDMITPNETEAEILTGVQV TDEQSAAEAAQVFHHKGIETVLITLGSKGVFFSSRGVQRIIPGFRVKAVD TTAAGDTFNGALITALLEDKSMEEAIRFAHGAAAISVTRKGAQPSIPSRE ETLAFLNEQH >MS0565 rbsK, RbsK protein MIMKKIAILGECMIELNGEPFGRMRQTYGGDSLNTATYLARVSRREQFEI SYVSALGKDKLSLGMLAHWRNDGINTDCVLLDEKRQPGLYLIQLDEKGER TFLYWRNQSAARYLLQHEGYTEVLARLATADMIYLSGISLAILPENDRTL LIRQLGNLKKAGVKIAFDSNYRPALWDSFQQTQACYQALLPLVDLALVTF DDEQSLWRDENVQQTISRLVQLGVGTVVVKSGEHGAVFYHNGETQQVATE VVQRVVDTTSAGDAFNAGFLNGYLQQKSLVDCCRQGNKLAGIVIQHKGAI IDKTATQHFIREFN >MS0197 rbsK, RbsK protein MPNKVVVVGSLHYDIVVESTHRPVKGETVIGKRWYPKFGGKGGNQAVAAA KAGCRVFMVSAVGPDNFAPFLLEHLNKSGVNTDFVQKISGVGSGMSVAIM DSEGDYGAVVVSGSNLEIDINRLDNETLWDNAKMLILQNEVSDSINFEAA KRASRRHIPVCLNAAPAKKLSAEFTKLIDILIVNAVEAEAMCGLSVNSLD SALQAALKLSQDFSRVIVTAGGDGVAYADKESNGKIASIKVKLISTLGAG DCFVGHLCTALSENNTLRDAVAYANQKAAEHVSTVQE >MS1233 rbsK, RbsK protein 
MHMTNKIWVLGDAVVDLIPDGDNHYLRCAGGAPANVAVGVARLGVPSAFI GRVGKDPLGEFMRDTLNQENVNTDYMLLDPKQRTSTVVVGLTDGERSFTF MVNPSADQFLQISDLPQFQAGDWLHCCSIALINEPTRSATFTAMKNIRAA GGKVSFDPNLRESLWKSQDEMIDVVMEAVSLADVLKFSEEELTLLTHTDS LEKSFEKITALYPDKLIIVTLGKEGALYHLHGKKEVVAGKALKPVDTTGA GDAFVSGLLAGLSQTENWQQPEQLVTIIRQANASGALATTAKGAMSALPN QQQLAEFLAN >MS0545 rbsK, RbsK protein MKNLTLIGECMIELNGEPFGVMRQTYGGDTLNTATYAARVASPEKLNVGY VSALGTDKLSQGMIERWQADGINTDLVLRDEKRSAGLYLIQLDKQGERTF LYWRNQSAARYLLQHPDYNRVLSALKNTDMIYLSGISLAILPENDRTLLI EQLGELKKSGLEIAFDSNFRPALWDSREQAQNCYKALLPLVDVALVTFDD EAMLWADNDEQATITRLSSFNIPKIIVKQGRLGATVCEKGKQTFVPTIPV EHVVDTTSAGDSFNAGFLVGYLQGKPLNECCKQGNQLAGIVIQHQGAIIE KSATEHLRNAFA >MS1800 rdgC, RdgC protein MYWFKNAMIYKLTKELDWSEDKLQQNLAQCAYHPCGQSDMSKFGWTTPLR GAELFCFSVGKQILLVAHKEEKIIPAHVIKRELDNRINELEEKENRKLKK TEKQALKDDVVSVLLPRAFSKNQQTAIWIDTEKNLIYVDAASSKRAEDVL ALLRKSLGSLPVVPLAFANEPSMVMTDWIIKNDMPQWLVPLEEAELKAAD DRGIIRCKNQALDSEEMISHLQAGKFVTKLALEWEEHLTFVLNDDGTLKR LKFADMIREKNDDILKEDFAQRFDADFILMTGELAKLTENLIEHFGGEKN RL >MS2243 recA, RecA protein MATNDEKSKALAAALGQIEKQFGKGAIMKLGDTQALDVESISTGSIGLDV ALGIGGLPMGRVVEIFGPESSGKTTLTLSVIAQAQKAGKVCAFIDAEHAL DPIYAAKLGVDVKELLVSQPDNGEQALEICDALVRSGAVDVIIVDSVAAL TPKAEIEGDMGDSHVGLQARLMSQALRKLTGQIKNANCLVVFINQIRMKI GVMFGNPETTTGGNALKFYSSVRLDIRRVGAVKDGDEIIGNETRVKVVKN KLAPPFRQVDFQILYGEGISKNGELIELGVKHKLVDKSGAWYAYNGDKIG QGKANAMKWLAENPTVAAELENKIRAELLANPEQALLADIETNSEEKEDF E >MS1099 recB, RecB protein MNSTLLIEASAGTGKTFTMASLYLRLLLQAGENCFFKPLEVEQILVVTFT EAATQELRERIRHRIHLAKKQLTQYAENKNKQVFYGTENEILADLVDSLE LPVAIQRLKIAEQNMDLAAIYTIHGFCRRMLVQYAFNSGIHFNLQLVKDE TELLTRFSNELWREHFYNLSFSLTNFIHRNLKSPTDVLQKIRKFVTSENL NVELNEPHLLQLEFNRFLSQYIEKNINEIKQLKTAWIESENEIQRLIEKA KTQKLIKGASYKANHLPGRYEKIRQWAQDETDFSIPEPLSKYFSQSAVDS YLTKNEPVNHAVFKQADSAVERAQSTELYVKVILYHYIQWMRDKLDRYKA SHQEKSFDDLLRLLKEAVVSPEHGNELVKLIRYQYPFAMIDEFQDTDAQQ YQIFSKIYIESAQAETGFIMIGDPKQAIYQFRGADIFTYLKAAQQAKYHF TLGKNYRSEGNLIHAVNQLFNFSSAQPFLYENIEFSSVEPGKAQGRFILN EQQEAPLGVYLGEEPSDEQLAETCANCISQWLQLALRERAGIQTAEKFLP 
LEPKDIAVLVRNAKEAELIKNALQARQISSVYLSDKSNVFDCNEAKELLL ILQACLNPFSERNIVNAIATAIFCLTGADIQHIKQHETDWEKWIDRFVGY QRSWRQQGVLAMLHQLFLAEQIPQKLINMPNGERRVTDLFHLAELLQEAT TLNESDAALLRWFERQIRGENTQDENIIRLESEQQLVKIVTIHKSKGLEY NLVWLPFISAKAKVNPQHISTYYNAQAQAVQWDMDACHNDEVIKERLAEE MRLLYVALTRAKYHLAMALPDNFTKNWNALLYALTRGEIGTQAKLTDEYQ TKPLLDDFAQRISPANIHYYQTDEIQGGGYQQKDNHAQYVAQEFHGKIER DWTISSFTSLTQMHEWNSQKGRHEAFSPIVTTESAVNFSLILDEAKDIDL TFLPKINEDKNNFSDIVTGYRQGYSPFDFPHGINVGTALHRFFEKNEFNQ PIIDEYVKNLCQTIQLSEEWKQPLIQWIEAILTTPLFNGEPLNLAQLDKK DCIKEMQFYLKLEGRFKLHSFNRLLQKYHTIKREPYLFDEIQGMLRGFID LVFRHESKYYVLDYKSNFLGKDMAFYARSQLTDVMKNHHYDLQYLLYTLA IHRYLKQRVTDYDYDSHFGGVIYCFLRGMNGRNPDYGIYSAKPARELIEG LDNLF >MS0728 recC, RecC protein MQLKINSLKITALLVIRKFIVFTVYHSNRLDVQKDILIELMQLLPPDDPF QTEIILVQSPGMAQWLQLKIAEKKGIAANLKFPMPASFIWQQYINVLEDV SQQTQFNKDAMTWRLMQLIPQFLSEPCFQALENYLKNSPYSEQQKLYQLA RKVADLFDQYLVYRPNWIHAWENNQPESIEQAIGTYQKDDNPELITQIKR DIKWQGILWRALIDEVQRGAGYKVRHRANLHQAFIDKLRSAKPENLPQRI FIFGISALPQSYLETFEAMSRYCDIHLFFNNPSREYWGDIVDDRFLQKLQ TRQRFDHYENNHTALLSSATLTNMQQENYEFSPDNEKLLVGNPLLAGWGK LGRDFFYLLTDLMTRAEEHNREIIAFVDLDDKTLLSQVQGHILDLIPMAV KKLNKPQEDNSLTIHACHSVMREVEVLHDYLLSLFELDKNLTPKDIVVMV ADIDKYAPYIQAVFGQYQKDLQTNQFYQADKRYIPFSISDNKLTESDVLI ASFLMLLNLKESQFSAEEVLAYLDIPAIRMRFQIELEDLETIREWVKNSG IRFGLEKRTDNSLKNYNAWQSGLERMLLGYAMRAENGIWQDSLGFDDSHG LQGKLAGLLAAFIERLYQWQQFLRNPHSYEEWGQALLELVDHFFLENEQS LEAILYLKEIIQQLHEQLDEVNFTSKLEIDVIAEVMAEQLNDKNTSLKFL VGKVSFCTLLPMRAIPFKAVCLLGMNDGEYPRQQTPNSFDLMQYHRQKGD RFRRDDDRYLFLEALLAAENYFYVSYVGQSIIDNQQREPSVLVSQLLDYL AENLANNDEEIEQIRTSLVQYHSMTIFSPDNFSAMHRSYAKEWLPLVNRN QYPVPDFTQQISGEIDEVREIDILQLVQFVQHPVKFFFEKRLGVYFQQTD EQIPETENFTLDNLDNFLIKDELIRFADDETDNYFERLKLEGILPYGHFG DIYKRRLQNEAAELKNKISAYLSQEPAHQFVEITLDMGEQSVLLTGHLDH LYQPFAQRVKWRVGEVKDKHIIENWLYYLLQLCTTDNVNPPLYYGKNGCI GFKTLEKSTALSILKLYVKAYLQGLKQVQIVPTYKIDDYLKSCQPETEFD TLSAFNNLRDLFKSSNNYTNEKEDIYWTRVFQQATELNSDKEKLMQIQQT TRDWFGLMLNSVEKVKL >MS1098 recD, RecD protein MLEILAKLQQENVITAGDYHFAKMIAEKCEEGTDKSSRTKNNLTALLAAL 
CNYSHQQGNTCLFLEEQIKSNLFGLAYRALEQDYLQQIDEKIGYLPVAQW QQILKSHIAFTTEPKTKIAPFVFQFNALYFYRVWQDEYLVARYLKSAVKN SKVLAEQPDTKIIHQLIGENTGLNQGQKIAIATALRQQFCLISGGPGTGK TYTVARLLVALQQLHQGKLQIKLAAPTGKAAARLTESIENALQQMTLSAK LKHCIPTEAMTIHRLLGGRSFKFNAQNPLPLDVLVIDEASMIDLALMSNL LQALPSHARLILLGDKDQLASVEAGAILGELGQFLEQGYSASFIDYLNRV TDSHLAFNSVQGDEIRDYLSHLTESRRFDEKSAIGHLAKAINSAEIDRSL QLFSQLDDIEYVDFNRYFANGIQPESSAEYLAYCVNLVVERAVREYRDYL LEIETRSAKSELTEQDIEKIFAGFKKVRFLSALRLGELGVEKLNLSIAEG LRRQNLIQFKNSRDWYQGKPVMIIQNDANVGLFNGDIGLFIQGKVWFELG ENHYRRISPSRIPSHETAFVMTVHKSQGSEFNHAFLVLPTENVPVLSREL VYTAVTRAKQRFTLFATDNIWKSAVRKQVKRQSGLGRLLIENI >MS0487 recF, RecF protein MAIARLIVENFRNISAVDLEFDHGFNFLVGNNGSGKTSLLEALFYLGHGR SFKSSVTTRVIRYDQPHFTLHGRIRELQHEWSVGLQKQRKDGNTIVKING EDGNKISDLAHLLPMQIITPEGLTLLNGGPSYRRAFLDWGLFHHQPNFHS AWSALHRLLKQRNAALNQTYDYNMLKPWDMELAKLAHQVSQWRADYAEAL SPEIEQTCRLFLPELDIHVSFHQGWEKDTDYAQLLTENFERDKAIGYTVS GPQKADFRFKSNGLPVEDVLSRGQLKLLMCALRLAQGEHLMAQKNRHCIF LIDDFASELDETKRALLAQRLQNSNSQVFVTAISPEQLKQMQPEKHRTFQ VVNGQIEQLL >MS1735 recG, RecG protein MTTQLLDAIPLTSLSGVGAAVSAKLSKIGINNLQDLLFHLPIRYEDRTRI TPISDLRPEQYATIEGIVQTCEIQFGRRPILTVSLSDGTSKIMLRFFNFN AGMRNGFQPGARVKAFGEVKRGRFMAEIHHPEYQIIRDKQPLQLEENLTP IYSATEGLKQNSLRKLTDQALELLDKIQIAEILPDQFNPYPFSLKEAIRF LHRPPPDVSVESLEKGTHPAQVRLIFEELLAHNLAMQKVRLGTQQFQALP LHFQTDLKQRFLATLPFEPTNAQVRVTQDIERDLAKDYPMMRLVQGDVGS GKTLVAALAALTAIDNGKQVALMAPTEILAEQHAENFRRWFEPFGIEVGW LAGKVKGKARQSELERIKNAEVQMVVGTHALFQEEVAFSDLALVIIDEQH RFGVHQRLLLREKGEKAGNYPHQLIMTATPIPRTLAMTVYADLDTSIIDE LPPGRTPIKTIVVSEERRAEIVARVHNACTNENRQVYWVCTLIDESEVLE AQAAEATAEDLHRALPHLRIGLVHGRMKPAEKQAIMASFKAAELDLLVAT TVIEVGVDVPNASLMIIENAERLGLSQLHQLRGRVGRGSTASFCVLMYKP PLGKISQKRLQVLRESQDGFVISEKDLEIRGPGEVLGTKQTGIAEFKVAN LMRDRKMIPTVQHYARRLIVEYPDVADTLIKRWLNNREIYSNA >MS1539 recJ, RecJ protein MIYSGLIKCISVILDCIYPVNKLIQRRTIPHGSAVCADPLLDRLYRSRHI KNSQQLDRTLHSMLAPNQLQGIDQAVQLLITAREKQQKVIIVGDFDADGA TSTALTVSALRQLGFTDVDYLVPNRFEQGYGLSVAVAEMALAKGVELLIT VDNGVSSLDGVAFLKGRGVRVLITDHHLPPEILPAADAIVNPNLADCHFP 
SKALAGVGVAFYLMLALRAKLRESGEFNEKTQPNFTELLDLVALGTIADV VPLDQNNRILAHQGLARIRAERCRYGIRALIEVANKDISQLSASDLGYSI APRLNAAGRLDNMSVGVELLLADSMEQARALALELDGLNQTRKEIEQGMK AEALEICRNLTALKTELPTGIALYQADWHQGVLGILASRIKDQFHRPVVA FAQDQNGLLKGSARSIEGLHMRDALERINTLYPDMIVKFGGHAMAAGLTI KEELFADFQRSFNQVVTDWLDKDMLQGIVWTDGDLPQTMMNMNTAELLKQ AGPWGQAFPEPIFDGEFRILQQRLVGEKHLKMLVEPVNGGPLFDAIAFNI DTRYYPDLSIRTAVLAYKLEINEFRGNRDVQLLVDYIQPRS >MS0741 recN, RecN protein MLTQLTINNFAIVRHLDIELSEGMSVITGETGAGKSIAIDALGLCLGQRT EAAMLREGQERAEVCATFQLKADSPAARWLTDHELQDQDNPEECILRRLV NQDGRSKAFINNTPVSASQLKEFGQYLVHINGQHASQLLLKNDFQLQALD NFCAHNHLLEQMKTDYLNWKELQSQVKTFNQKCVENEAKKQLLQYQVNEL NEFNLRPNEYQELEEEQRRLSNSEQLTQLSQSVLQILTENETVNVDSLLY RTTQHLEDLAELDTRYVDAQALLQEALIQVQEAASEIQHLSANIEEDPQV LREIEQRMNQAVQLARKHNVKPEELTQLHKQLKLELNQLVDFSESENELL AQEQQAYEKMSASATKLHQSRRQGAEKLAKQVTKSVKQLAMENAEFFINL TADYSKISVNGADNVIFNLQSNLGQSPQPLAKIASGGELSRIALAIQVLT SDKTAIPTLIFDEIDVGISGATASVVGKLLRKLGHSCQVLCVTHLPQVAC NGHHHFMVEKSTVEGKTETKMTALSSQQRIKALAKLLGGQHITDSVLANA QEMLALVS >MS0239 recO, RecO protein MLHRKPYSETSLLVDLFTEESGRLTVLAKGARAKRSALKSVLQPFTPLLL RWTGKSSLKILTKAEPAAIALPLQQTALFSGFYVNELITRVIEPETPNPQ LFQDYLHCLTSLAVSQNFVEPALREFEFKLLNILGYGVDFLHCAGSGEPV DENMTYRYREEKGFIASLIKDNLTFFGRELIAFERQDFSEKSVLQAAKRF TRVALKPYLGNKPLKSRELFTQTILHLK >MS2081 recQ, RecQ protein MTAELSNRSEAIKPELIKSAVENPEISTALDVLHSVFGYQTFRKGQQEVI QAALSGRDSLVVMATGNGKSLCYQIPALCFAGLTLVISPLISLMKDQVDQ LLANGIAADFLNSTQSLEQQQQVQNKAISGELKLLYLSPEKVMTNSFFQF ISLCNVSFIAIDEAHCISQWGHDFRPEYTQLGGLKGCFPHAPIMALTATA DSTTRQDILQNLSLNEPHLYVGSFDRPNIRYTLVEKFKPMEQLCNFVAAQ KGKSGIVYCNSRSKVERIAEALKKRGISAAAYHAGMESSQREAVQQAFQR DNIQVVVATIAFGMGINKSNVRFVAHFDLSRSIEAYYQETGRAGRDDLPA EAVLFYEPADYAWLHKILLEEPESPQRDIKRHKLEAIGEFAESQTCRRLV LLNYFGENRQTPCNNCDICLDPPKKYDGLLDAQKILSTIYRTGQRFGTQY VIGVMRGLQNQKIKENQHDELKVYGIGKDKSKEYWQSVIRQLIHLGFVQQ IISDFGMGTRLQLTESTRPVLRGEVSLELATPRLSSITMVQAPQRNAVTN YDKDLFARLRFLRKQIADKENIPPYIVFSDATLQEMSLYQPTSKVEMLQI NGVGAIKWQRFGQPFMAIIKEHQALRKAGKNPLELQS >MS1506 recR, RecR protein 
MQTSPLLENLIESLRCLPGVGPKSAQRMAYHLLQRDRSGGMNLARALTEA MSKIGHCEHCRTFTEEDICSICDNPRRQNSRLLCVVEMPADIQAIEQTGQ FSGRYFVLMGHLSPLDGIGPREIGLDLLQRRLQQEQFNEVILATNPTVEG DATANYIAELCNQQNIKVSRIAHGIPVGGELETVDGTTLTHSFLGRRTIG >MS1262 rfaE, RfaE protein MAQYSAQFNQAKVLVLGDVMLDRYWFGATNRISPEAPVPVVRVQENEERA GGAANVAMNIASLNVPVQLLGLTGLDETGKALTTLLQTQKIDCDFVRLAT HPTITKLRILSRHQQLLRLDFEEDFKNVQSTELLTKLENAVKNFGALILS DYGKGTLNDVQQMIQIARKAKVPVLIDPKGTDFERYRGATLLTPNMSEFE AVVGKCDTEEDIIEKGLKLIADIELSALLVTRSEKGMTLLRPGQEAYHLP TEAKEVFDVTGAGDTVISVLATALADGRSFEEACFLSNVAAGIVVGKLGT STVSTIELENAIHGRSSTGFGIMNEDELKVAVQLAKARGEKIVMTNGCFD ILHPGHVSYLENARKLGDRLIVAVNTDDSVKRLKGETRPINDLPSRMAVL AGLSSVDWLVAFDEDTPQRLIGEVLPDLLVKGGDYKPEDIAGSKEVWANG GDVKVLNFENGCSTSNVIQKIRELKD >MS0445 rfaF, RfaF protein MNLKNILRNLRLSLGKLLIDKKIQGDVNVFPPQRILFLRQDGKIGDYIVS SFVFRELKKANPNIHIGVICTQKDAYLFEQNPHIDQLYYVKKRDILDYIT CGLRLAKLQYDVVIDPTISLKNRDLLLLRLINARNYIGYKKSNYQLFNIN LEGEFHFSELYRLALEKIGIQVQDMSYDVPYNSQSAIEISQFLEINQLKN YIAVNFFGGYRHKVMNKQNIEKYISYLTSQSDKPLVLLSYPEVIPMLKDA AKSYTNIFIHDTTTIFHTIELIRHCALLISTDTSTVHIASGFNKPMIAIY KEDPIAFIHWKPISQAKTHILYYKDNINELSPEAIKPEWLL >MS2260 rfaF, RfaF protein MNILVIGPSWVGDMMMSHSLYQTLKKQYPDCAIDVLAPNWCKPLLARMPE VRRALTMPIGHGAFNLTERFRIGKSLRNQYDMAIVLPNSLKSAFIPLFAK IAVRRGWKGESRYFFLNDLRSNKNDYPMMVQRYAALGYEKSAVPDAQNLP IPTPYLTVEKSAVEKTKEKFSAQLALAENRPAIGFCPGAEFGPAKRYPHY HYAKLAEMLIEKGYSVRLFGSAKDNEVCQQIRGGLPESLQAYCVDLSGQT ELNQAVDLISDCVAMVTNDSGLMHIAAALKRPLVALYGPTSPQYTPPLSD KAVIIRLIEGGLIKIRKGADSAEGYHQSLIDIQPEMVMEKLAQLLN >MS2259 rfaF, RfaF protein MVKMKVLIVKTSSMGDVLHTLPALTDAQKALPDVRFDWVVEENFAEIPHW HCAVDKVIPVAIRRWRKQPFSAETKNQWKNYRTFLQTEQYDAVIDAQGLL KSALLVTRPARGEKYGYSFSCAREGLAAFFYDHKFNIPYQQHAVERIRRL FAQSLSYALPDEKGDYGIASHFRHAAKAIDNPYIMFIHATTRADKFWINA EWTKLARLLAAKGYHIHLPWGNEREYEQAQQIAQNIPQVKILPKLSLSAL AEEIAQSSAVVSVDTGLAHLTAALDIPNITLYGATDPKLIGTYGKNQHHL TKETMRAISAEEVFERFFGVV >MS1962 rfaF, RfaF protein MALFNQAPKSLCVLRLSAIGDVCHALAAVQQIQKYWPETEISWIVGKTEA QLLAGIPNAELIVYDKKSGWKGVLALWRQLKHRRFDALLNMQTAFRASVL 
SFGIKARYKIGFGKQRAREGQWLFTNRKVRDPQNPHVLDGFMAFVEYLGV PVEAPHWQLAVSEQDKEAVKPYIDPARKNLIISPCSSKAEKDWLIERYAQ VANIAHQHNVNVILCGSSAKREVEILQKITALCDFQPVNLSGKTNLKQLV ALISMADLVISPDSGPAHMATTQGTPVIGLYAYHNPLRTGPYNNLANVVS VYEKNVRKEYGKPSDQLPWATKLTGKNLMSQIQVEDVVEQMKKLAVI >MS0439 rfaF, RfaF protein MQKLLVIRNDKLGDFMLIFPALALLKKSYPQLKISALVPAYTAPIAEICP YIDEVIIDAKNKKNPAEFDRTLQLIRAQKFDAVISFFSTWYNAKLVWKAG IKYRLAPATKLFQFLYNHRLTQRRSRSEKAEYEYNMDLARQFLRDHNIPI IEPSAPYLQFAKSAVENQKILLSEQLNIKQDKKWLFVHSGSGGSANNLSL QQYADLIQGILREFDCYIILTAGPNEEEKARQLANLVSRPNVVVYAKNNG LVDFARSLACADLYISGSTGPLHLSGALNIPTIGFYPSRLSAIPRRWRPI NAPDYHIAFCPPFKKEVEKNLTVISIAECLKDIIPFIRTHWH >MS1494 rfaG, RfaG protein MQDKKHILIVAPYVTFPDEMGMNRFIYLAKLLSAEFDVTLLTSKYCHFLK EHREQTPVLDNVNVVLLDEPGYRKNVSIQRLISHHRFCRNFEDFLKNYRR KIDLVYSAYPLIKTNYILGKYKQSKNFKLIIDVQDVWPEAISGPIAFFST SIGKMLMKPITRYANKTYGYADALVAVSDTYLNRADVNHLPDELKSAVYI GGDFLFTKSTDKKVTDKLTATYLGTMAGSYDLETVVRSAPLCSENVEIRF IGTGPHEASLQALNHQLGGHVKFLGVHPYSEAMRMLADSDVALNAIKASS EGSITNKLSDYICCALPIVSCQKHPEVEKLLAKGGGIQYTAGDYRELAET LNKLAEDRTVLDKMSQVNLSLAKEKFLRERSYKEIEKLIKNII >MS1497 rfaG, RfaG protein MSQAKLRVLLVSDMGHIGGTEIATLIAATELNPLTESVTVFGKTGPLFDR INKLGIRQINADCHTKNPLKLLNYVCQLVKTVNDNQIDVIHAQMARPLLF IWLAKKFFKNKQVKIFWTSRGLDHETYQKVVPFFAKMGVRGLGNCKLEQQ KLIRYGYPETQTSYIYNAYRLTPTVKPMKSLDKRQFVIGTLSALREGRSV ELFLDLAKYCLTQYSERQFQFVIGGDGPHRATLEKISADLGIDQQVKFVG NVSDVSTFMDGVDVFVSPLVVDGDSGAGLSNSIVEAMIMKVPVCAFRAAG IEEIVINGSTGHLIEPRNIPAMAEAVCWTVDNKATTESYVNNAYNLIIRE CDPKKYAQKLLKLYKEL >MS1496 rfaG, RfaG protein MHVLILPSWYPLHKDDLNGSFFREQAYALARSGIKVGVIAPQFRSLRLGK NAVLGRYDQEFWQDGDINTYFQHGVFWFPKVPYLDLKRWVKAGLALFESY VKEQGMPDILHVHSLLHAGPLALEIHRKYRIPYCVTEHSSVFGRGLVKDW EWDHLKRSEASASKLLAVSKSLADLLKQKLNGKEWTVFPNLLDDLFVESP VDSSVRKYQLCAVAFLYAKKGFDVLIKAFAKVVEEYPQLKLMIGGDGPER AKLEALIKSLKLENNVSLLGALSRQEVCQLMKESLCFVLSSYIETFGVVV IEALSQGTPVVSTLCGGPESILTEGDGLFVKTGDEKELAKGILEFLANQE KFDNQQIRRRCIDTYSEKPFVNRLTAIYQDILDKT >MS0447 rfaJ, RfaJ protein MNIIFNCDENYAPYLSVVIKSILDNTTLSTQFYILDFNISEESKSCIKNL 
IQNINKKNSFQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYLNE LNKAIYLDIDIIVISDLSRLWHIDLADNLVGACLDPYIEYENQDYKRKIG LQDSQPYINAGVLLLNLKALREFNLYQKAIDWNKDYPNIQFQDQDILNGV LKGKVLFLDSRYNFTVNHRNRIKLAHKGKLLLSSLEKATKPICILHYVGS HKPWLPTTTMVKSCLFDQIYNSIRNKPPHWNKKYQSVPLKFQLKRILREI EDKLVYKII >MS0243 rfaL, RfaL protein MLKLNKGFLINGVVGLFFIICLSVRSGYSISPILLALVGLGYLIYDLIKK RKWQISPDEKWLIYSYGLYFSLFVLSLLIHQGRLKELDSPVKIVFLLTLL LLFSRFPIKFSTLIYAIPMGSMIAGITALIDRFYLHSQMAYAPRIMHIQG GDIAMSLGMFSLVCCIYFFIKQQKKWMLFCLLATLSGMLGSILSTARGGW IGVPFVLCFIFWAYRKYLSRTFFVSVISILVVAVGVAVSIPNTKIMKRIN AAQHDITSYVDNKKGSDSTSVGARFEMWKSAFLMIKEKPVFGWGIEGVNQ MRKEHQKQGIISKYASQFTHAHNQYLDDFSKRGVLGFIALLAVFLVPMRF FRKNLSDRLEVKVVAIFGMVHVISVMFYCFSQGFFSHNSGNIFYFFPVIL FYAVILNLTAKKSAESTKS >MS1593 rfbB, RfbB protein MKILVTGGAGFIGSALIRYIINQRQDEVINLDKLTYAANLDSLETVSLNP RYSFERADICDRAALDRIFADHQPDAVMHLAAESHVDRSIDGAGIFIQTN IVGTYTLLEAARHYWNRLDTERKKTFRFHHISTDEVYGDLADKNALFTEE TPYSPSSPYSASKASADHLVRAWHRTYGLPTIVSNCSNNYGPFQFPEKLI PLMILNALEGKPLPVYGNGLQIRDWLFVEDHVRALYKILTEGRVGETYNI GGNNEKSNIEVVKTLCTLLEELVPNKPAGVMKYEDLICYVTDRPGHDLRY AIDSSKINRELDWRAEESFESGMRKTLQWYLTNKSWWRRILNGSYHLERL GLNN >MS0657 rfbX, RfbX protein MVKSTKRVFNFIMNKINTEHKKRLFSNFFSLTVLQIVNYALPLLTLPYLV RVLDVETYGLVMFAQSFILFFNILVDFGFNLSATKEVSIHRDDKNKLIEI YSSVMVIKFLLILSSFIILSIIIFSFERFSLNKGVYFLSFLWVIGQALFP VWYFQGIEKMKYITIVNIIAKFLFTGCIFLFVKENADYLLIPLFNGLGIL IAALVALWIVHVSLKQKVTWQPLSKLWIYFKESSTFFLSRASLTMYTSAN AFVLGIFSNNTIVGYYSIADQLYKALQAFYTPLSQVLYPYIAKERNIVLF KKIFNMAVFLNCMGIAILYFITVDVFALLFTQKIGIESINVFNIFLIASL IVVPSILLGYPFLGALGFAKEANLSVIYASIIHILGLVILILFNKISLYS VAYMVLVTELFVFMYRISKIRGRRLWRKQL >MS1495 rfbX, RfbX protein MVRLISTVFVRQILVGILQVITLIVIARGLGTGQMGQYTLAILLPTLFSQ IITFGLQSINIYAIGRKMINENQALYANLIFLSGLSVLTSLILGVVVYYF GQYFFNEVPVNLLYLALASLLPQTFFTVLPSLIQAVQNFKWFNIVCVAQP LVIFVVSMVAILLSDNVSSILTAYVLSHWISFFILLGIILKLIKVETCSL KRFFSDFIGYGLKSHLSNIITLLNYRSSLLILGYFTTPVIVGIYSVGMQL AEKLWLPSQAVSTVLLPRLSNKLGEGGDEKEVAKLTLDSARLTFIVTLII GIAFACLSSIVVRILFGVEYDKAVYVILLLLPGILAWTPSRILANDLAAR 
GFAELNLKNSYWVFGINTALSLCLVPLWGLIGASVATSIAYSMDLVLRLI AFNQVTQSRAFLHIIPRISDFGTVINFIKGLRNAR >MS1670 rfe, Rfe protein MLVWFAKFLEQYYSGFNVVSYLTFRSVLALLTALLLSLWIGPKMIRRLQI FKFGQEVRNDGPESHFQKRGTPTMGGLMILATITVSTLLWGDLSNPYIWF SLFVLLGYGAIGFVDDYRKIKYKNTDGLIARWKYFWLSLVSLIAIFGMYA LGKDTDATRLVVPFFKEIMPQLGLFYVVLAYFVIVGTSNAVNLTDGLDGL AIMPTVFVAGAFAIIAWATGNVEISKYLYIPYIKYTSELVIFCTAIVGAG LGFLWFNTYPAQVFMGDVGSLALGGALGTIAVLVRQEFLLVIMGGVFVME TVSVILQVGSYKLRKKRIFRMAPIHHHYELKGWPEPRVIIRFWIISLMLV LFGLVTLKLR >MS0911 rfe, Rfe protein MLWLSLILTFIVSFLTILVIKPVALKVGLVDKPNYRKRHQGAIPLLGGIS LFMGNLCFYLLQWDSTRLPYLYLFCVLVLLVIGLLDDRFDISAALRAGIQ AVLAVLMIYVGHLSLEHLGQIIGPFQVTLGPIGFVITVFATIAVINAFNM IDGIDGLLGGLSSVSFAAIGILMLMDNQFDLAVWCFALIIALIPYVMFNL TIFGKERKIFMGDSGSTLIGFTMIWILLLSTQGKGHPMNPVTALWVIAIP IIDMIAVIYRRLRKGKSPFKPDRLHVHHLMVRAGLTSRQALLVITFFAAL CAGFGILGEVYYINEWIMFILFIVLFFLYLYSITKAWKVTRLIRRMKRRA QRRKHS >MS0535 rhaT, RhaT protein MLQKYRGEIILFIVSLIAASGWFFSKFSMAEFPALGFIGLRFFLAAIFFF PLAYPQLKRLDKPQLIKSALVGLCYAVYIMLWMLGLINSAHFGEGAFLVS LSMLIAPLLSWLIFGHLPYKSFWLALPAAFTGLYLLSSGKGGLHFSFGSL IFLISSLVAALYFVLNNQYARDIPVLSFTTIQLFIVGTCCGTLSILFEQW PTSISMTAWGWFLCSLVIATNLRMLLQTYGQKYCHVATAAIIMILEPVWT LFFSILILGERLTLHKAFGCLSILAAIMIYRLPAILRNQASANKE >MS1825 rhaT, RhaT protein MLMPHFTQSKGYGYFCLILATFFWGGNYMFGRILSHVIPPIILNYLRWLP AAIILLLLFAKYLPQQRHIIRKNWQILTALALLGVLIFPVFLYQGLQTTT ALNASIYLAVVPIVVMFLNRICFKDTIRFPVFIGALISFIGVLWLLSHGE LSRLLTFNVNRGDLWAIGSAVSWSVYCSIIRLRPKEIGNSVMLTAQVGIA MIIFTPVFLSQLNTENLQIISELTYGQWMIILYLIIGPSILSYGFWNYGM TIVGGTKGAAFTNATPLFAAALGILVLGEQLHGYHLISSLLIVIGLTLCN KK >MS1753 rhaT, RhaT protein MLFQIIATLIWASAFIAAKYTYEMMDPVLMVQCRFFIASIIMLPGFFAAY KRVPKERLKIMWLLALINFPLMFLLQFIGLYFTSAASAVTMLGMIPLLTV LIGFLFFKRRINKIDLLLSLVALAGIILTVVGGGEDNLINPWGCLLVLGS AVSFCFCLYLSKDVMQEMAPKDYTNVLVILGSILCLPFTCVLVRDWSIVP SVKGMISLFYLGIGCTWLAVVLWFKGVQKTPTYISSILTTLEPIFGVILA ILILDERLSTVSAMGILLTLGAAAVSVLIPVLMKKSP >MS1754 rhaT, RhaT protein MFYLIAAVLIWASAFIAAKFSYTMFDPALTVMLRLILSALLVLPTFFRSY RKIPKQYRLQLWGLGLLNFPVVLLLQFTGVHYTSVASAVTMLGTEPLVVT 
LLGHIFFHKPARLLDWLLGIVALTGIVFVVYGSESGGEVTLLGCTLVLLG SIAFSFSIHLAQSVMKAVEAKAYTDVIIMTGAISCVPFSLLLVQDWQIHL NIEGISAILYLSVGCTWLAYRLWSKGLRVSSANTASILTTLEPVFGVLLA ILLLGEHLTLTTLFGICLVISAAGISVLSSMLINYIKNKVTIL >MS1595 rhaT, RhaT protein MNQQPVLGFIFALITAMAWGSLPIALQHVLTVMGAESIVWYRFFVASLAL FLLLAWKKKLPALSQFTSRYWKLSLIGVLGLAGNFFLFNSSLNYIEPAIT QIFIHLSSFMMLICGVFVFKEKLGAHQKAGLLILILGLGLFFNDKFDMLF GLNMYSTGILLSVSAAVVWVAYGMAQKLMSRQFTAQQILLIMYTGCVIVL CPFAQFSQIQGLSGFALGCFIYCCLNTLIGYGAYAEALNRWDVSKVSVVV TLVPLFTILFSRILHGLDPAHFAMPHLNTVSYIGAFVVVLGAIISAVGYK LFKYKR >MS1597 rhaT, RhaT protein MKQQPLLGFLFGLIAACMWSSLPLFVQQVVKVMDIQTSVWYRFVLSAVGV LLLLCFSGKFFTFKRISPKNTLLLLLAIAGLSVNFYLYNLALKYIPPTTS QVLSPLSSFMMLFAGVLIFKEKMARHQKIGLAVLSLGLILFFNERLDDFL QLNTYFKGVVMVIASSFVWVIYAIVQKVLLSHLSSQQILLMIYIGCTLVF FPNADIKQIYQLDGFQLVCLVFSGVNTIIAYGCYAEALDRWEVSKVSAIL TQIPIFTLLFFHLAVMIAPNYFVAVELNWISYLGAFCVVSGAMLSALGHK LKMLKERD >MS0885 rhaT, RhaT protein MLRPSCREKIIMVNNYNLALIKVHFTAVLFGLTGVLGVIISADSDVIVLG RVIIAFLALSVYFLIKREKLTALSTKDVANQSLSGALLTAHWVTFYVAVK VGGVAVATLGFAGFPGFVALFERLFFQEKLKRRELILLIAVTIGLILVTP QFEFGNQSTQGLLWGIFSGAIYGILAILNRKNINKLSGTQASWWQYLIGS ILLFPFAAHKLPAVSVTDWFWIACLGLLCTSLAYTLFVSSLNIINARTAA MIISLEPVYAILIAWIWLGEQPGLRMIIGGLIILLSVGVVNFRR >MS2326 rhaT, RhaT protein MGGSILLGIFWHFVGATSAACFYAPMKKVTNWSWETMWAIAGIFSWILLP WGISYWLLPDFGSYYSFFGSDILLPVFLFGAMWGIGNIGYGLTMRYLGMS MGIGIAIGITLIVGTLMTPIIQGRFGELLASTGGQMTLIGVVIAVVGVAV VSYAGLLKEKAIGVTAEEFNLKKGLALAVMCGIFSAGMSFAMSAATPMHE EAARLGVDPLYVALPSYVVIMGGGAIINLGFCIIRLITRPELSFKADMSV VKGLLISNILFSALGGIMWYFQFFFYAWGHANIPANYGFMSWMLHMSFYV LCGGIVGLLLHEWKDTGKKPTRVLCIGCLIIVLAANIVGLGMAN >MS1837 rho, Rho protein MVTLAHSKLLPTIKQTSQNFIKSNQKDSQQIIMHLTELKNTPVSELVALG EGQMGLENLARLRKQDIVFAILKQHAKSGEDIFGGGILEILPDGFGFLRS ADSSYLAGPDDIYVSPSQIRRFNLQTGDKIEGKIRPPKEGERYFALLKVD QVNDDKPEVSRSKILFENLTPLHANSRLRMERGNGSTEDLTARILDLASP IGKGQRGLIVAPPKAGKTMLLQNIAQSITHNYPECELIVLLIDERPEEVT EMQRSVKGEVIASTFDEPASRHVQVAEMVIEKAKRSVEHKKDVVILLDSI TRLARAYNTVTPASGKILSGGVDANALHRPKRFFGAARNVEEGGSLTIIA 
TALVDTGSKMDEVIFEEFKGTGNMELHLSRKIAEKRVFPAIDFNRSGTRK EDLLTTPDELQKMWILRKILNPMGEVEAMEFLIDKLMVAKTNEEFFEIMK RS >MS1681 rhtB, RhtB protein MEFWHGFLIITGIHILAAMSPGPDFIYVSQQTLSRGRAAGIICALGVAFG LGVHILYSVLGLAVVIASAAWILTTIKIIGGIYLIYLGYKGLKARAKNQV QIIEKVEVQQENRLKTLWKGFLCNVLNPKAPVYFVSVFTVVLSPNMPVWQ LAIYGVWMMFLQFVWFASVAFLLSIPKVNKQFQKAGHWIDRILGFVMVGL GIKVISS >MS0972 rhtB, RhtB protein MLNLIIVHFFGLVTPGPDFFYVSRMAASNSRRNVICGIIGITLGVAFWAA SAMLGLAILFTTMPVLHGVIMLLGGGYLAYLGLLMVRSRTNATFAPLSAE ELNKTTTVKKEIMKGLFVNLSNAKAIIYFASVMSLILVHITQVWQMLLAF AIILVETFIYFYLISVLFSRPFAKKFYSRYSRYIDNVAGIIFLIFGMILA YTGVMEMMG >MS1533 ribA, RibA protein MSKIQRVAEANLPTEFGLFRIVGFEFPDTKKEHVALVMGDISNSEENPVL ARIHSECLTGDALHSLKCDCGFQLSTALRQISEAGRGVLIYHREEGRGIG LINKIRAYSLQDNGMDTIEANLALGFAADERNFKVCADIFELLGICKVRL LTNNPAKIDTMKKAGINVVERIPLNVGENRYNTGYLDTKAKKMGHYIVHD NEKHYLDCPYCQSEIPNKK >MS0172 ribB, RibB protein MNQSLLASFGSSEERVIAALDTFKQGNGVLVLDDENRENEGDLIFPAETI TTEQMAKLIRYGSGIVCLCITDELCQKLELPPMVAANTSVNKTAFTVTIE AAEGVSTGVSAADRVTTVKVAVADNAKPSDLHHPGHVFPLRAAENGVLAR PGHTEAAVDLARLCGYKPAGVICEITNDDGTMARTPELVAFAQKFGYAVV TIEDLIAYRTKYNK >MS1313 ribC, RibC protein MEKIMFTGIVQGTAQIQSIISKRDFRTHIIKMPQKLLADLEIGASVAHNG VCLTVTDIKGDLVSFDLMTETLRITNLGDVKEGDFVNIERAMKMGSEIGG HLLSGHVYCTAEVVEIIPSENNLQLWFKLPTNEVMKYILTKGFIAVDGIS LTIGEVKDNEFCVNLIPETIHRTLIGQRKIGSKVNIEIDPQTQAIVDTVE RYLASKV >MS1374 ribD, RibD protein MSEFTEQDRAFMQLAIELAEKGQFTTTPNPSVGCVLVKQGEIVGKGFHFK AGEPHAEVMAMREAGKNAKGATAYVTLEPCSHFGRTPPCAKGLIEAGVVK VIAAMEDPNPSVAGQGLKMLQQAGIETAVGLLQEQAERLNRGFLKRMRTG LPYVQLKMAMTIDGRTATSTGESQWITGESARIDVQQERAKASAILSASG TVLADNPSLNVRWEQFPEQLKADYKKETVRQPVRVIIDSKQQISSQLNLF KIDSPVWLAGIQPRDLTDFPANCEIICLMPEKEQSLLQALMIELGKRQIN SVWVEAGAKLAGALIEQNLVDELVLYIAPKLLGDEAKGLCHLPHLTKLAD APLWRLQSMEKVGDDIKMIYMRK >MS1752 ribF, RibF protein MQLIRGLHNLRRDFAGCALTIGNFDGVHLGHQAILQHLREKANQLKLPMV VMLFEPQPREYFVSADAKQQAPARLMRLRDKLHYLQQQGVDYVICVKFDR TFAKQDPNLFIETYLVNRLHVKFLSIGDDFRFGANRRGDFSLLESAGKKY GFSVEDNRTFSLDKLRISSTAIRHALAHDDLKKAEELLGRAYSIFGKVVH 
GQKLGRTIGFPTANIRLQRQVNPLQGVYAVRIQCPCGRAFQGVANIGQRP TVNGVEQRLEVHLFDFDENLYGQNIAVTFCHKIRNEMKFPSLNALKQQIA RDVLVAQQYFRQNS >MS0976 ribH, RibH protein MKVLEGGLAAPNAKVAVVVARFNSFINDSLVEGAVDALKRIGQVKDENIT LVRVPGAYELPLAVRRLADSKKYDAIVALGTVIRGGTAHFEYVAGGASNG IGHVSLESNVPVAFGVLTTENIEQAIDRAGTKSGNKGAEAALVALEMVNL LAQINA >MS0300 rimI, RimI protein MQFKIKPMLPEHYQQVYRLWTSIEGMDMSDADDNFEAISAFLAFNPDLNY IAEINGKVVGVIMCGFDGRRATLYHAAVDPDYQKQGIGFALAEHLESALK TKGISKGRLLAFKSNESATLFWQKAGWTLQQKLNYFSKKFI >MS1590 rimI, RimI protein MTEISPIQAEDFDRLFEIEQAAHLVPWSMGTLQNNQGERYLNLKSSVQNH IAGFAICQTVLDEATLFNIAIDPVCQGQGIGKALLSELIKRLREKNVATL WLEVRESNQTAKRLYDRLGFNEVDIRKNYYPTPDGGRENAIVMALYL >MS0757 rimK, RimK protein MKLLMLCREPNLYSCRRLKMTAENAGHKMDILDPNRFLLKIQENRPHFAL YYQPNQGTPYLLPDYDAVIPRFGTQSTKMGCSVLTHLAAKNIPCLNNPAS FALARDKWLSLQALAAANIAVPVTVFAGQDFQAGSAVEKVSSPTILKTLN GSQGIGVILADRSQSAVSIMETLTLSHIPVLLQDFIGEAGASDIRCFVIG DKVVAAMQRSGQKGEFRANCHRGGITQQITLSDDEKLIAVRAAQALGLDV AGVDLIQSKKGLLVLEVNASPGLEMIEKTSGIDVATQMIAYLEKKIAGL >MS0441 rimM, RimM protein MNKMDRQRIETVGKLGSTYGIRGWLRIYSSTENAESIFDYQPWFLKIKDQ WQAIELETWKHHNHELIVKLKNINDRETAQTLANVEIGVDLSVFPALEEG DFYWHDLIGCQVVNLQGYAMGTVSEMMETGSNDVLVVRANAKDAFGKQER LIPFLYEQVVKRVDLTTKTIEVDWDAGF >MS1830 rlpA, RlpA protein MKLKFLLTVLLAVMTTACSASSNTAQVNNTKKHYGIAGPKLEHKGAAKSS NTYVVNGRKYTTQTSRNAKNYSKEGKASYYHNKFHGRRTASGEKYSNQQY TAAHKTLPLGSYALVTNLRNNKKVIVRINDRGPFSKTRIMDLSHAAANEL GLIRAGVGNVRVEALHVDRSGQISGAGASTLVKNARTDEARDRIK >MS0337 rlpB, RlpB protein MLKQLKKFTFVTAIAALTACGFHFQNGQLIPQELQTLTLESSDQYSDMAM AMRKQLQLNNINLVEASPDVPVLRLNKTSTDDEVVSVFKQGREAEKMLML EVSASVKMPNRAAYPISAKVNRTFFDNSRAALAKSSEKEIIWNDMREQAA RQLISKMVALQHQIKNDKQ >MS0231 rluA, RluA protein MALIEYNPPTEPWLDIVYHDNHILVVNKPSGLLSVPGNQPRYYDSAMSRV KDKYGFCEPAHRLDMATSGILLFAMSKAADSELKRQFRERETKKYYEALV WGHLEQDSGEVNLPLVCDWENRPRQTICFERGKSAVTKYEVLQRLPNNTT RVKLTPITGRSHQLRLHMLALGHPILGDKFYAHPQAKTMAPRLCLHAESL TIKHPISGEEMTFFRLADF >MS1821 rluA, RluA protein MIFMAQITMSAEVQPHQMGQRLDQTLAELFPDYSRSRLKTWIEDELVLLN GKVANIPREKVYGGEQVEITVEIEDENRFEPQNIPLNIVYEDDDILVINK 
PKDFVVHPGAGNPNGTVLNALLYHYPQITEVPRAGIVHRLDKDTTGLMVV AKTIPAQTQLVRALQKRKITREYEAIAFGIMTKGGTVDEPMSRHPTKRTL MAVHPMGKPAVTHYRVMEHFRNYTRLRLRLETGRTHQIRVHMAHIAHPLL GDQTYGGRPRPPKNASEEFMSVLRNFQRQALHAIMLRLEHPITGELMEWH APLPEDFVELVNALKADYQLHKDELDY >MS1072 rluA, RluA protein MLEILYQDEHIVAVNKPAGMLVHRSWLDRHETQFVMQTLRDQIGRLVYPI HRLDRPTSGVLLFALSSETANLLCQQFETKQVEKSYLAVVRGYLTGSERI DYPLKIQLDKIADKFAQEDKAPQPAVTDYEGLKTVEKPYATPRYATSRYA LVRLVPHTGRKHQLRRHMKHIFHPILGDTQYGDLHQNRTLTEHTEVSRLM LHAEKLSFIHPIKRQRTEITAGLDEQWKKLMALFEW >MS1170 rluA, RluA protein MQEFKIIHKHRDFIIIDKPNGVSVHRDEAEIGLTGLLAKQLSVAQVWLVH RLDKVTSGLLILALNERSAAEFSLLFARHEISKTYLALSKQKPKKKQGLI IGDMEKARRGAWKLCQTKQNPAVTRFESVSCEPNLRLFILKPQTGKTHQL RVAMKSLGSPILGDELYGGNSEKNDRTYLHAFRLEFTYQGEPFRVQSLPQ TGEYFLRESVREKIDSV >MS1624 rluA, RluA protein MTTNPLNEKIINATVKMLQISEDESGQRIDNYLLAKLKGVPKSLIYRIVR KGEVRVNKGRIKPEYKLQTGDTVRIPPVRVAEKEQAPISNKLNKVAALEK QIIFEDDCLLVLNKPSGIAVHGGSGLSFGVIEALRSLRPEARFLELVHRI DRDTSGILLVAKKRSALRNLHEQLRIKTVKKDYLALVRGQWQSHVKVIRA PLLKNELSGGERIVRVNEQGKPSETRFAIEERYPTATLVRASPVTGRTHQ IRVHTQYAGHPIALDDKYGDKEFDQQMQKLGLNRLFLHAYSIKFEHPKSG EELRLTAPLDENMKGILKKLRENKA >MS0368 rnc, Rnc protein MNHLDRLQRQISYEFKDITLLKQALTHRSAATKHNERLEFLGDAILNYTI ADALYHQFPKCNEGELSRMRATLVREPTLAILARQFKLGEYMALGHGELK SGGFRRESILADCVEAIIGAISLDSSLVSATQITLHWYEKLLREIKPGEN QKDPKTRLQEYLQGHRLALPTYDVKDIKGEAHCQTFTIECHVPNLDRTFI GVGSSRRKAEQAAAEQILTALEIK >MS1357 rnd, Rnd protein MTYQIKEIQNPPHFMLITDDEALADVCARASTKSAIALDTEFVRIRSYYP KLGLIQLYDGEQVSLIDPQEIQDFSPFKQLLADPKILKVLHACHEDLEVF QHYYQQLPAPMLDTQIMANFLGFQNSMGLASLIKHYFNLEIDKGASRTDW LARPLSNRQLAYAAADVWYLLPLYCKMQNALEQTRWQSAVEFDCNLLLEK HRIVKNIDKAYLSISGAWKLNSEELMRLKLLASWRQEEAVKRDLALNFVV RGENLWLLAQNNPKHTSEMLKLGLNPQEVRIHGKKMLQILERAERIDAEY YPPEISRLADDMRYKQGLKNLQQKLKTIAPPDLNAEVIAGKRSLESLMKW VWLKHKDPNKLPDLMRDWRAEFGSELAKLL >MS1571 rnhA, RnhA protein MYQIMRKQIEIFTDGSCLGNPGAGGIGVVLRYKQHEKTLSQGYFKTTNNR MELRAVIEALNLLKEPCAVTLHSDSQYMKNGITQWIFNWKKKNWKASNGK PVKNQDLWMALDNAVQAHTIDWRWVKGHSGHRENELCDQLAKQGAENPTL EDIGYQPD >MS0423 rnhB, RnhB protein 
MAEFEYPQGFELIAGVDEVGRGPLVGAVVTAAVILDPNNPIDGLTDSKKL SEKKREKLAEEIKQKALAWALGRAEPEEIDALNILQATMLAMQRAIKNLK IQPHFVLIDGNRIPQLAIPAQAVVKGDSLVAEISAASIIAKVSRDHEMEV LDKQYPQYEFAKHKGYPTKVHLAKLAEFGVLPQHRRSFSPVRKLLENE >MS0483 rnpA, RnpA protein MIKLNFSRELRLLTPAQFKYVFEQPLRASTPEITILARRNDLQYPRLGLT VAKKHLKRAHERNRVKRLCRESFRLLQHELPNYDFVIVAKHGIGKLDNPT FTAILSKLWQRHIRLAKKSLSN >MS2330 rpe, Rpe protein MKSYLIAPSILSADLARLGEDVENVLKAGADVIHFDVMDNHYVPNLTFGP AVCKALRDYGITAPIDVHLMVKPVDRLIPDFAKAGADYITFHPEASEHID RSLQLIRNSGCKAGLVFNPATSLSYLDYVMDKIDVILLMSVNPGFGGQSF IPATMQKLQEARRRIDESGFDIRLEVDGGVKINNIAEIAAAGADMFVAGS AIFDQPDYRKVINEMRQELAKVQK >MS0252 rph, Rph protein MRPNNRAVNEPRPIKITRHYTKHAEGSVLVEFGDTKVICTATVEDSVPRF LKGQGQGWVTAEYGMLPRSTHSRMLREAAKGKQGGRTMEIQRLIARSLRA MVDLTALGERSITLDCDVIQADGGTRTASITGACVALTDAINALVENGTL KTSPLKGLVAAVSVGIVNGEAVCDLEYVEDSAAETDMNVVMMEDGRMIEV QGTAEGEPFSHEELLTLLNLAKQGCNMIFDAQRRALAADC >MS1744 rpiA, RpiA protein MNQLEMKKVAAKAALQFVKPDMIVGVGSGSTVNCFIEELGAFRDQIKGAV AASKASEELLRKQGIEVFSANDVSSLDIYVDGADEINPQKMMIKGGGAAL TREKIVSSLAKNFICIVDSSKQVDILGSTFPLPVEVIPMARSQVARKLVA LGGSPEWREGVITDNGNVILDVHNFIIMNPIEMEKELNNVAGVVTNGIFA LNAAHTVIVGTPDGAKIIE >MS0383 rpiB, RpiB protein MKIAIGCDEAAYRLKVEIMKHLDELGIEYDDFGAGEGDVVLYPDVAEAVA VAVAEGKYQRAILTCGTGIGMCITANKVPGIRAAVCYDVFSTERSRKSND AQIMCLGERVIGVELAKSLIDVWFKCEFAGGGSAPKVKRINEIDAKYNKR >MS0564 rpiB, RpiB protein MVSTLIELKNLLFNKEINMKIALMMENSQAAKNAVVLKELKGVVDPKGYS VFNVGMSDENDHHLTYIHLGIMASILLNSKAVDFVVTGCGTGQGAMMSLN LHPGVVCGYCLDPADAFLFCQINNGNALALAFAKGFGWGAELNVRYMFEK AFTGVRGEGYPIERKEPQVRNAGILNEVKKAVAKDNYLDTLRAIDPELVK TAVSGERFQQCFFENCQDKEIEAFVREVLAK >MS0198 rpiR, RpiR protein MAQIDPKSIGAHIRTRKQQLTPLERKVLDCILAKSDFDEKTSLKEIATEN QVSEAIVVKIAKKLDFSGYREFRSGLAYYKQLEVANLHNDISADDTATQV IKKVFETSIQALQETMSILDISEFERCVKILVEADHIDLFGIGGSAQIAK DMAHKFLRIGIKASVYDDSHMMLMAGAVSHPGNVVLAISHSGTTIDVIEP LQLARQNGAKTIAITNYAISPIAECADVVLTSTSQGSLLLGENAAARIAQ LNILDALYVAVAKQNLDISEDNLRKTRYAVKHKRTK >MS0208 rplA, RplA protein MAKLTKRMKAIKAGVDSTKAYEINEAIAVLKQFATAKFDESVDVAVNLGI 
DPRKSDQNVRGATVLPNGTGRSVRVAVFTQGANADAAKEAGADLVGMEDL AEQIKKGEMNFDVVIASPDAMRVVGQLGQVLGPRGLMPNPKVGTVTPNVA DAVKNAKSGQVRYRNDKNGIIHTTIGKASFSAEALTQNLQALLAALVKAK PTTAKGIFIKKVSISTTMGAGVAVDQNSL >MS2045 rplB, RplB protein MAIVKCKPTSAGRRHVVKIVNPELHKGKPYAPLLDTKSKTGGRNNLGRIT TRHIGGGHKQHYRLIDFKRNKLDIPAVVERLEYDPNRSANIALVLYKDGE RRYILAPKGLSAGDQIQAGVNAPIKVGNALPMRNIPVGSTVHNVELKPGK GGQIARSAGSYVQIIAREGNYVTLRLRSGEMRKVLAECTATIGEVGNSEH MLRVLGKAGANRWRGVRPTVRGTAMNPVDHPHGGGEGRNFGKHPVSPWGV QTKGKKTRHNKRTDKYIVRRRGK >MS2048 rplC, RplC protein MIGLVGRKVGMTRIFTEEGVSIPVTVIEIEANRVTQVKTLENDGYSAVQV TTGSKKASRVTKPEAGHFVKAGVEAGRGLWEFRTEGEEFTLGQEINVDIF ADVKKVDVTGTSKGKGFQGGVKRWNFRTQDATHGNSLSHRVLGSIGQNQT PGRVFKGKKMAGHLGAERVTVQSLEVVRVDAERKLLLVKGAVPGATNSDV IVKPAVKA >MS2047 rplD, RplD protein MELQVVGANALAVSETTFGREFNEALIHQVVVAYAAGARQGSRAQKTRAE VSGSGKKPWRQKGTGRARSGDIKSPIWRSGGITFAAKPQDHSQKVNKKMY RGAIKSILSELVRQDRLVVVDKFEIDAPKTKVLVQKLKDLALEDVLIITA SLDENLFLAARNLYKVDVRDVQGIDPVSLIAFNKVVVTVDAVKQIEEMLA >MS2036 rplE, RplE protein MAKLHDYYRDQVVNELKTKFNYASVMQVPRIEKITLNMGVGEALTDKKLL DNAVADLTAISGQKPLITKARKSVAGFKIRQGYPIGCKVTLRGERMWEFL ERLITIAVPRIRDFRGLSAKSFDGRGNYSMGVREQIIFPEIDYDKVDRVR GLDITITTTAKSDEEGQALLAAFNFPFRK >MS2033 rplF, RplF protein MSRVAKAPVSVPAGVEVKLDGQLLTVKGKNGELTRTIHNFVEVKQDNNEL TFSPRNDGAEANAQAGTTRALVNAMVIGVTEGFTKKLQLVGVGYRAQVKG NVVNLSLGFSHPVEHTLPAGITAECPSQTEIVLKGADKQLIGQVAADIRA YRSPEPYKGKGVRYSDEVVRTKEAKKK >MS0472 rplI, RplI protein MQVILLDKIVHLGNVGDQVNVKSGFARNFLIPQGKAVMATKANIEHFEAR RAEIEAKVAAELAAAQARAAQIAALEAVTISSKAGDEGRLFGSITTREIA EAVTAAGVEVAKSEVRLSTGPIRTLGDHEVKFQLHGEVFTALNVIVVAE >MS0209 rplJ, RplJ protein MALNLQDKQAIVAEVNEAAKGALSAVIADSRGVTVDKMTELRKTAREAGV SMRVVRNTLLRRAVEGTEFECLTDTFTGPTLIAFSNEHPGAAARLFKEFA KANDKFEIKGAAFEGKIQDVDFLATLPTYDEAIARLMGTIKEAAAGKLVR TFAALRDKLQEAA >MS0207 rplK, RplK protein MAKKVQAYVKLQVAAGMANPSPPVGPALGQQGVNIMEFCKAFNARTESLE KGLPIPVVITVYADRSFTFVTKTPPAAVLLKKAAGVKSGSGKPNKEKVGK VTLDQVRQIAETKAADMTGATIETKMKSIAGTARSMGLVVEE >MS0210 rplL, RplL protein 
MIVMSLTNEQIIEAIASKSVSEIVELISAMEEKFGVSAAAVAAAAPAAGA AAAEEKTEFDVVLKAAGANKVAVIKAVRGATGLGLKEAKDLVESAPANLK EGVSKEEAESLKKELEAAGAEVEIK >MS1283 rplM, RplM protein MVIKLMKTFVAKPETVKRDWYVVDATGKTLGRLATELASRLRGKHKAEYT PHVDTGDYIIVINADKVAVTGRKETDKVYYWHTGYVGGIKQATFKEMIAR RPEAVIEIAVKGMLPKGPLGRAMFRKLKVYAGSQHEHAAQQPQVLDI >MS2029 rplO, RplO protein MYLNTLAPAEGAKHSAKRLGRGIGSGLGKTGGRGHKGQKSRTGGGVRRGF EGGQMPLYRRLPKFGFTSMKAAVTAEIRLNDLTKVENNVVTLESLKAANI ITKDIQFAKVVLAGEVKGAVTVRGLRVTKGAKAAIEAAGGSVEE >MS2041 rplP, RplP protein MLQPKRTKFRKVHKGRNRGIAAGTDVSFGTYGLKAIGRGRLTARQIEAAR RAMTRAVKRQGKIWIRVFPDKPITEKPLEVRMGKGKGNVEYWVALIQPGK VLYEMDGVSEEIAREAFALAAAKLPIKTTFVTKTVM >MS2022 rplQ, RplQ protein MRHRKSGRQLNRNSSHRQAMFRNMASSLVSHEIIKTTLPKAKELRRVVEP LITLAKEDSVANRRLAFARTRNIETVAKLFNELGPRFAQRAGGYTRILKC GFRAGDNAPMAYIELVDRPETAAAVEE >MS2032 rplR, RplR protein MDKKSARIRRAARARHMMRENGVTRLVIHRTPRHIYAQVIAPNGSEVLAA ASTVEKAISEQVKYTGNKDAAAVVGKLVAERALAKGIKDVAFDRSGFKYH GRVQSLADAAREAGLQF >MS0443 rplS, RplS protein MSNIIKQLEQEQLKQNIPSFRPGDTLEVKVWVVEGAKKRLQAFEGVVIAI RNRGLHSAFTLRKVSNGTGVERVFQTHSPAIDSISVKRKGAVRKAKLYYL RERSGKSARIKERLGA >MS1056 rplT, RplT protein MARVKRGVIARARHKKVLKAAKGYYGARSRVYRVAFQAVIKAGQYAYRDR RQRKRQFRQLWIARINAAARQNGLSYSKFINGLKKASVEIDRKILADIAV FDKVAFAALVEKAKSAL >MS1599 rplU, RplU protein MRVKCRDIISARYSGVFMYAVFQSGGKQHRVSEGQVVRLEKLEKATGETV EFDSVLMVVNGEDVKIGAPVVTGAKVVAEVVAQGRGDKIKIVKFRRRKHS RKQQGHRQWFTEVKITGIQA >MS2043 rplV, RplV protein METIAKHRYARTSAQKARLVADLIRGKKVAAALEILTYTNKKAAALVKKV LESAIANAEHNDGADIDDLKVTKIFVDEGPSMKRVMPRAKGRADRILKRT SHITVVVSDR >MS2046 rplW, RplW protein MSQQERLLKVLKAPHISEKATNNAEKSNTIVFKVALDANKVEIANAVAQL FEVKVDSVRTVVVKGKTKRHGAKTGRRSDWKKAYVTLAEGQELDFVEGAA E >MS2037 rplX, RplX protein MAAKIRQNDEVIVLTGKDKGKRGKVTQVLPNGKVIVEGVKIITKHEKPVP ALGKEGGLVKKEAAIDVSNVAIFNPKTNKADRVGFRFEDGKKVRFFKSNN EII >MS1116 rplY, RplY protein MAFKFNAEVRSAQGKGASRRLRHNGQIPAIVYGGNEDAVSIVLNHDELNN AQAHDSFYSEVITLVINGKEVAVKVQAMQRHPFKPKLVHIDFKRA >MS1598 rpmA, RpmA protein MATKKAGGSTRNGRDSEAKRLGVKRFGGESVLAGSIIVRQRGTKFHAGSN 
VGMGRDHTLFATADGKVKFEVKGEKSRKYVSIVTE >MS1942 rpmB, RpmB protein MEIIMSRVCQVTGKRPAVGNNRSHALNATRRRFLPNLHTHRFWVESENRF VTLRLTAKGMRIIDKKGIDAVLADIRARGEKI >MS2040 rpmC, RpmC protein MKAQELRAKTVEELNAELANLAGEQFKLRMQAATGQLQQTHQLKQVRRNI AQVKTVLTEKAGE >MS2030 rpmD, RpmD protein MTMAKTIKVTQVRSSIARLPKHKATLRGLGLRHMHHTVELIDTPAVRGMI NQVSYMVKVEE >MS0448 rpmE, RpmE protein MRFSMKQGIHPEYKEITVTCSCGNVIKTRSTAGHDINLDVCGNCHPFYTG KQRVVDTGGRVERFNKRFSIPGSKK >MS1869 rpmF, RpmF protein MAVQQNKKSRSRRDMRRSHDALTTAAVSVDKASGETHLRHHVTADGYYRG RKVINK >MS1943 rpmG, RpmG protein MAAKGAREKIRLVSSAETGHFYTTDKNKRNMPEKMEIKKFDPVVRKHVIY KEAKIK >MS0484 rpmH, RpmH protein MKRTFQPSVLKRSRTHGFRARMATKNGRQVLARRRAKGRKSLSA >MS1055 rpmI, RpmI protein MYQTKQCGVFLTMPKIKTVRGAAKRFKKTASGGFKRKQSHLRHILTKKTT KRKRHLRHKSMVAKADQVLVVACLPYV >MS2027 rpmJ, RpmJ protein MKVRASVKKICRNCKIVKREGVVRVLCSDPKHKQRQG >MS2023 rpoA, RpoA protein MQGSVTEFLKPHLVDIEQVSPTHAKVILEPLERGFGHTLGNALRRILLSS MPGCAVTEVEIDGVLHEYSSKEGVQEDILEVLLNLKGLAVKVQNKDDVFL TLNKSGIGPVVAADITHDGDVEIVNPEHVICHLTDENASINMRIRVQRGR GYVPASARVHAQDEERPIGRLLVDACYSPVDRIAYNVEAARVEQRTDLDK LVIELETNGAIDPEEAIRRAATILAEQLDAFVDLRDVRQPEVKEEKPEFD PILLRPVDDLELTVRSANCLKAETIHYIGDLVQRTEVELLKTPNLGKKSL TEIKDVLASRGLSLGMRLENWPPASIAED >MS0212 rpoB, RpoB protein MGYSYTEKKRIRKDFGKRPQVLNVPYLLTIQLDSFEKFIQRDPEGQQGLE AAFRSVFPIVSNNGSTELQYVSYKLGEPVFDVRECQIRGTTFAAPLRVNL RLVSYDRDAAPGTIKDIKEQDVYMGEIPLMTDNGTFVINGTERVIVSQLH RSPGVFFDSDKGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFARIDR RRKLPATIILRALGYSTEEILDLFFEKIQFEIQDNKLLMALVPERLRGET ASFDIEANGKVYVERGRRITARHIRTLEKDNVTKIDVPTEYIVGKVSAKD YIDLESGELVCPANMEISLDILAKLAQAGYKSIETLFTNDLDFGPYISET LRVDPSSDRLSALVEIYRMMRPGEPPTKEAAEALFDNLFFSAERYDLSAV GRMKFNRSLGLAEGVGNGVLSKEDIVGVMKKLIDIRNGRGEVDDIDHLGN RRIRSVGEMAENQFRIGLVRVERAVKERLSLGDLDAVTPQDLINAKPVSA AVKEFFGSSQLSQFMDQNNPLSEVTHKRRISALGPGGLTRERAGFEVRDV HPTHYGRVCPIETPEGPNIGLINSLSVYARTNNYGFLETPYRKVVDGQVT EEIEYLSAIEEGNYVIAQANASLDEDFRFTDAFVTCRGEHGESGLYRPEE IQYMDVSPQQVVSVAAALIPFLEHDDANRALMGANMQRQAVPTLRADKPL 
VGTGMEKPIALDSGVAVVAKRGGIIQYVDASRIVVKVNEDETIPGEAGID IYNLIKYTRSNQNTCINQIPCVNLGEPIGRGEVLADGPSTDLGELALGQN IRVAFMPWNGYNFEDSMLVSERVVQQDRFTTIHIQELSCVARDTKLGAEE ITADIPNVGETALSKLDESGIVYVGAEVKGGDILVGKVTPKGETQLTPEE KLLRAIFGEKASDVKDSSLRVPNSVSGTVIDVQVFTRDGVEKDKRALEIE EMQLKEAKKDIAEELEILEAGLFSRVRNLLIDGGVDAKELDRLDRTKWLE QTLNDEAKQNQLEQLAEQYEELRKDFEHKLEVKRGKIIQGDDLAPGVLKV VKVYLAVKRRIQPGDKMAGRHGNKGVISKINPVEDMPYDENGQPVEIVLN PLGVPSRMNIGQILETHLGLAAKGIGEQINRMLKEKQEIEKLRGYIQKAY DLGGGSQKVDLNTFTDEEVMRLAQNLRKGMPLATPVFDGAEEKEIKDLLE LGGLPTSGQITLYDGRTGEKFERPVTVGYMYMLKLNHLVDDKMHARSTGS YSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVNG RTKMYKNIVSGTHQMDPGTPESFNVIMKEIRSLGINIDLDEE >MS0213 rpoC, RpoC protein MKNFHRTLNKFNSDRSKSVKDLVKFLKAQSKTSEDFDVIKIGLASPDMIR SWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKHR GVICEKCGVEVTQTKVRRERMGHIELASPVAHIWFLKSLPSRIGLLLDMP LRDIERVLYFESYIVIEPGMTDLEKGQLLTEEQFMDAEDRWADEFDAKMG AEAIQALLRDMDLEHECETLREELQETNSETKRKKITKRLKLLEAFMQSG NKPEWMVMTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKR LLDLVAPDIIVRNEKRMLQESVDALLDNGRRGRAITGSNKRPLKSLADMI KGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFRPFI YAKLESRGFASTIKAAKKMVEREDAIVWDILAEVIREHPILLNRAPTLHR LGIQAFEPILIEGKAMQLHPLVCAAFNADFDGDQMAVHVPLTLEAQLEAR ALMMSTNNVLSPANGDPIIVPSQDVVLGIYYMTREKVNAKGEGMLLQDPR EAEKAYRTGRAELHSRVKIRITEYVKNAEGEFEPQTTLTDTTIGRAILWM IAPKGMPYSLFNQTLGKKAISKLINECYRRLGVKASVMFADQIMYTGFAY AARSGSSVGIDDMVIPEKKYEIISAAEAEVAEIQEQFQSGLVTAGERYNK VIDIWATANERVAKAMMENLSTEEVVNREGNLEKQSSFNSIFMMADSGAR GSAAQIRQLAGMRGLMARPDGSIIETPITANFREGLNVLQYFISTHGARK GLADTALKTANSGYLTRRLVDVAQDLVIVEDDCGTHEGIVMTPLIEGGDE KVSLRELVLGRVAAEDILKPGTEEVLFPRNTLLDEKVCDILDENSVDSVK VRSVVTCDTDFGVCAKCYGRDLARGHLINQGEAVGVIAAQSIGEPGTQLT MRTFHIGGAASAAAKESSVQVKNSGSIRLTNVKSVTNNEGKLVVTSRNTE LTIIDAFGRTKEHYKVPYGAVLNKGDGEAVTAGETVANWDPHTMPVVSEV AGFVKFVDIVDGLTVTRQTDELTGLSSIVVQDVGERATAGKDLRPAIKVV DAQGNDIFIPGVDVLAQYFLPGKAIVTLDDGAEVQVGEPLARIPQESVGT KDITGGLPRVADLFEARKPKEPAILAEITGIVSFGKETKGKRRLVITPVE GEAYEEMIPKWRQLNVFEGEMVERGDVISDGAETPHDILRLRGVHAVTEY 
IVNEVQEVYRLQGVKINDKHIEVIVRQMLRKGIITKAYDSEFLEGEQVEV ARVKIVNRKREAEGKPPVEFERELLGITKASLATESFISAASFQETTRVL TEAAVAGKRDELRGLKENVIVGRLIPAGTGFAYHQNRIKNRGQANVVEEQ EVKFSAADEAEIEAEFNMIAEDPAASLAEMLNMADDAE >MS1760 rpoD, RpoD protein MNQRRTSYMDHNPQSQLKLLIAQGKEQGYLTYAEVNDHLPEELVDTDQIE DIIQMINDMGIQVLESAPDADDLMLSETIADEDAVEEATQVLSSVEAELG RTTDPVRMYMREMGSVELLTREGEIDIAKRIEDGINEVQSAVAEYPEALD YLLKQYEQVEEGSVRLADLITGFVDLNAEEASEEISDLEEVLDDEDGDIP ADALNDEEEDEESDEGDTSTDDSDNSIDPEVAREKFSALKDQCVKTLEFI EKYGRTDNKVKEQIQVLSDIFTQFRLVPRQFDTLVLSMRSMMKQVRAEER QIQRLAVDYAKVPKDDFQKAFIGNETSEQWLESLLQSKKTYVEKLQQRAP EISKSIVRLQQVETDTKLTVQQIRDIGERIAQGELKARRAKKEMVEANLR LVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWI RQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREASPEELAE RMGMPEDKIRKVLKIAKEPISMETPIGDDDDSHLGDFIEDSTLELPLDSA TAQSLKVATHEVLEGLTPREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRE RIRQIEAKALRKLRHPSRSETLRSFLDE >MS0025 rpoD, RpoD protein MTKETQTMMLVPQGSIEAYIRAANEYPMLSAEEEKELAERLYYQEDLEAA KKLILSHLRFVIHVARGYSGYGLPQADLIQEGNIGLMKAVKRFNPEVGVR LVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFS DNELDLVANELGVTKEDVIEMESRMTGADVGFDLPTDDSEEETFAPSMYL EDKSSNFAAELESENFETQAIDQLSNAMENLDERSKDIIQARWLDDTKAT LHELAAKYNISAERVRQLETNALKKLKSAVSF >MS2228 rpoE, RpoE protein MLLTRGYMAEQLTDQALVERVQQGDKKAFNLLVSRYQNKVAGLLTRYVSR NDIPDVVQESFIKAYRSIESFRGESAFYTWLYRIAVNTAKNYLTAQGRRP PNEDILAEEAETYDVGGNLRDVDTPEHEMLSAELKKVIFDTIDGLQEELK TAITLREMEGLSYEEIADIMDCPVGTVRSRIFRAREIIESKIRPLIQR >MS1737 rpoZ, RpoZ protein MARVTVQDAVEKIGNRFDLILTAARRARQLQLHVREPLVPEDNDKPTVIA LREIEKGLIDNNIMNAQERQEALEQEKVELNAVSLLSE >MS1476 rpsA, RpsA protein MSESFAQLFEESLKELETRQGSIVSGTVVAIQKGFVLVDAGLKSESAIPV EEFQNAQGELEVQVGDVVNVALDAVEDGFGETKLSREKAVRHESWIELEK AYEEQATVTGLINGKVKGGFTVELNGVRAFLPGSLVDTRPVRDTLHLEGK ELEFKVIKLDQKRNNVVVSRRAVIESENSQDREEILANLAEGAEVKGTVK NLTDYGAFVDLGGVDGLLHITDMAWKRVKHPSEIVNVGDEITVKVLKFDK DRTRVSLGLKQLGQDPWAAIAQNHPVGSKLTGKVTNLTDYGCFVEILDGV EGLVHVSEMDWTNKNIHPSKVVSLGDTVEVMVLEIDEERRRISLGLKQCK ANPWLQFAETHNKGDKVEGKIKSITDFGIFIGLEGGIDGLVHLSDISWNV 
AGEEAVRNYKKGDEVAAVVLQVDSAKERISLGIKQLEEDPFNNFVAVNKK GAVVSATVVEADSKGAKVELNGGVEGYIRAADLTDEVNAGDVVEAKYTGV DRKARIVHLSVRAKDQAEEAAAVASVNNKQEDVAIPNAMAEAFKAAKGE >MS1933 rpsB, RpsB protein MICGGKTPIKKENIMAQVSMRDMLQAGVHFGHQTRYWNPKMKPFIYGPRN GVHIINLEKTVPMFNGALAELTRIASNNGKILFVGTKRAATEAVQAAALD CQQYYVNHRWLGGMLTNWKTVRQSIKRLKDLETQSQDGTFDKLTKKEALV RTREMEKLELSLGGIKDMAGLPDAIFVIGADYEHIAIKEANNLGIPVFAV VDTNSNPDGIDFVIPGNDDATRAIQLYVTAAAAAVKEGRSQQTATEEKFA EEVAAE >MS2042 rpsC, RpsC protein MCQIVNRRGIAMGQKVNPHGIRLGIVKPWSSTWFANTQDFASNLDGDFKV RKFLNKELANASVSRITIERPAKSIRVTIHTARPGIVIGKKGEDVEKLRN AVAKIAGVPAQINIAEVKKPELDAKLVADSIASQLERRVMFRRAMKRAVQ NAMRLGAKGIKVEVSGRLGGAEIARSEWYREGRVPLHTLRADIDYNTSEA HTTYGVIGVKVWIFKGEILGGMAAIAQQPEQQPAAPKKAPRGKGRK >MS2024 rpsD, RpsD protein MARYLGPKLKLSRREGTDLFLKSGVRAIDSKCKIDTAPGQHGARKPRLSD YGSQLREKQKVRRIYGILERQFRNYYKEANRLKGNTGENLLVLLEGRLDN VVYRMGFAATRAEARQLVSHKAIVVNGRVVNIPSFQVSVDDVVAVREKSK KQARIKASLELAEQREKPTWLEVDAAKMEGVFKRVPERSDLSADINEHLI VELYSK >MS2031 rpsE, RpsE protein MANIEKQAGELQEKLIAVNRVSKTVKGGRIMSFTALTVVGDGNGRVGFGY GKAREVPAAIQKAMEKARRNMINVALHEGTLQHPVKGIHTGSRVFMQPAS EGTGIIAGGAMRAVLEVAGVRNVLSKAYGSTNPINVVRATIDALANMKSP EMVAAKRGKTVDEILG >MS0469 rpsF, RpsF protein MRHYEIVFMVHPDQSEQVPGMIERYTGSVKEAGGQIHRLEDWGRRQLAYP INKLHKAHYVLMNVEAPQEVIDELETTFRYNDAVLRNVIIRTKHAVTEAS PMVKAKDERRASAEVENNDFEDAEE >MS0163 rpsG, RpsG protein MGILKIKNGEIAMPRRRSIEPRKILPDPKFGSELLAKFINVLMVDGKKSI AESIVYNALDTLAQRTNKDALVAFEEALENVRPTVEVKSRRVGGSTYQVP VEVRPARRNALGMRWIVEAARKRGDKSMALRLANELSDASENKGSAVKKR EDVHRMAEANKAFAHYRW >MS2034 rpsH, RpsH protein MSMQDPIADMLTRIRNGQAASKVAISMPSSKLKVAIANVLAAEGYIESVK VLEGAKPELEITLKYFQGKPVVESIQRVSRPGLRIYKRKDELPKVMGGLG VAVVSTSKGVMTDRAARQAGLGGEIICYVA >MS1282 rpsI, RpsI protein MTAANQNYGTGRRKSSSARVFIKPGSGKITINQRELDVYFGRETSRMIVR QPLELVEMTEKLDLYITVKGGGISGQAGAIRHGITRALMEYDESLRPVLR AAGFVTRDARRVERKKVGLRKARRRPQFSKR >MS2049 rpsJ, RpsJ protein MKGDGVSFLLAKNYWSSGLMQNQRIRIRLKAFDHRLIDQSTAEIVETAKR TGAQVRGPIPLPTRKERFTVLISPHVNKDARDQYEIRTHKRLVDIVEPTE KTVDALMRLDLAAGVDVQISLG >MS2025 rpsK, RpsK 
protein MAKTPVRARKRVKKQVVDGVAHIHASFNNTIVTITDRQGNALAWATAGGS GFRGSRKSTPFAAQVAAERCAEAVKEFGLKNLEVMVKGPGPGRESTIRAL NAAGFRITNITDVTPIPHNGCRPPKKRRV >MS0162 rpsL, RpsL protein MATINQLVRKPRVKKVVKSNVPALQACPQKRGVCTRVYTTTPKKPNSALR KVCRIRLTNGFEVTSYIGGEGHNLQEHSVVLIRGGRVKDLPGVRYHTVRG ALDCAGVKDRKQGRSKYGVKRPKA >MS2026 rpsM, RpsM protein MARIAGINIPDHKHTVIALTAIYGIGKTRSQAICAAAGIAENVKISELSE EQIDKLRDEVGKFTVEGDLRREVTLNIKRLLDLGCYRGLRHRRGLPVRGQ RTKTNARTRKGPRKPIKK >MS2035 rpsN, RpsN protein MAKQSMKARDVKRVKLAEKFYAQRVELKRIISDVNSSDEERWDAVLKLQT LPRDSSPSRQRNRCSQTGRPHGVLRKFGLSRIKVREAAMRGEIPGLKKAS W >MS0699 rpsO, RpsO protein MSLSVEKKAAIVAEFGRDAKDTGSSEVQIALLTAQINHLQAHFAEHKKDH HGRRGLLRMVSRRRKLLDYLKRTDLAKYSETIARLGLRR >MS0440 rpsP, RpsP protein MVTIRLSRGGAKKRPFYQIVVADSRCPRDGRFIERVGFFNPLAAGNAERL RIQLDRVNAWLEKGASLSDRVAALVKEAQKAA >MS2039 rpsQ, RpsQ protein MTDKIRTVQGRVISDKMDKSFTIAIERKVKHPLLGKFIRRTTKLHVHDEN NEARIGDTVEIKECRPVSKTKSWTLVRVVEKAVEA >MS0471 rpsR, RpsR protein MARYFRRRKFCRFTAENVVEIDYKDIATLKNYITESGKIVPSRITGTRAK YQRQLARAIKRARYLALLPYTDNHQ >MS2044 rpsS, RpsS protein MPRSLKKGPFLDLHLLKKVEKAVESGDKKPIKTWSRRSMIIPSMIGLTIA VHNGRQHVPVYVSDEMIGHKLGEFAPTRTYRGHAADKKAKK >MS1756 rpsT, RpsT protein MTLANIKSAKKRAVQSEKSRQHNASQRSMMRTYIKKVYAAVAAGEKAAAQ AAFVEMQKVVDRMASKGLIHANKAANHKSKLVAQIKKLA >MS1762 rpsU, RpsU protein MPVIKVRENESFDVALRRFKRSCEKAGILAEVRSREFYEKPTTIRKRENA TRAKRHAKRVARENARNTRLY >MS2229 rseA, RseA protein MEFTMQKELLSAYIDGEQVGNDITLELCNDAELQQSWSNYHVIRSVMRDE SEVFLGADFTAKMATLIDQEDAITLSQPTPDEVENLPFMQKLKALFAPMV QVGVAAGVCLVAVLGVQSFNANNNAQTTADTPVLQTLPFSNNVQEVSYNA PTKDAVTQEQLEQKNKRIGAMLQSYELQRRVYADSVQNQQH >MS2230 rseB, RseB protein MVNLFPVDEGTSKMFKKLTALFFILPFSFSVFGQDNLSPKQLLTEMLAAQ NKLNYEISYVQIAGAEIDTYRYRHVYNEGKSYAQLATLEGGKQEIIQRDN LISYFHSNYSPFSIRGSQIIDNLPNIVNADFSRIEKHYDFINMGRNRIAD RLVQTVRILPKDNFRYQYVVFIEEKTHLLLGSDMLDQDGNLLERFRVVNF YIDDQMTQPLTDALSKLPTPPVLDKATPPKNKLSWQAGWLPQGFAVLNNY LTKTDEDTIESRLYSDGLFSFTIYVSNNILPENQENVWKQGSFTIYSESM KDKEVTIIGQIPLTTAKRIVQEIKSN >MS2231 rseC, RseC protein 
MLTESAVVIDYRDGIAKVKCQSKTACGSCAAKNACGSAALSELTGEPGEH ILIISTITPLKIGQQVEIGLQEQSLLFSAFLAYVIPLMTLLIGTFVATAI FSNELISAAFIFISTALSFLAVRFYAKKLNKKSAFEPILLRVLN >MS1588 rsmC, RsmC protein MISLESQVLERHLPLFADKSVLLAGGVNDDFPQKIQSQCRSVKIWSWYFD YVNQIQGKSAVDFSVIFTGRADLIVYYWTKNKQEVQFQLMQLLANAPVGQ EVLIIGENRSGVRSAEKMLAHFGDIGKIDSARRCGLYHFTLQKQPNFELE NFWKTYRSPQLGELIVYSLPGVFSANELDVGTQLLLSTVKDNIRGDVLDL GCGAGVIGSMIKLKNPPAKVTMTDIHAMALASAERTLLENKLSGQVLASD VFSHVEGKFDLIISNPPFHDGIDTAYRAVRELISNAKWHLVPGGELRIVA NAFLPYPDLLDEYFGGHKVLAQTNKFKVYSVIG >MS0145 rssA, RssA protein MKVGLVLEGGAMRGMFTAGVLDIFLDENIHIDGAVTVSAGALFGINLPSK QRGRVLRYNKKYLNDKRYMGLHSLLTTGNIVNRDFAFYELPYTLDPFDQQ TFAQSDMDFWVTLTNVETGEAEYFKIQDAFEQMEVLRATSAMPFVSKMVE INGKKYLDGGIADSIPLQKCFDLGYDKVIVVLTRPLEYRKTPSSKTLFKL FYPNYPQLAARWAQRYADYNQTVERIIKLNDEQKIFVIRPSESLNISRLE KDPEMIQRMYELGLKDGKAAIAGLREYLAK >MS0391 rsuA, RsuA protein MRKIAALAAFYYFRIFMRLDKFLAENTGLTRSQAAKILRQGNVQVNGQVV KSGSLKITPQDEVLFEGESLEWLEDGVYIMLNKPQGYVCSHDDGEYPTVY QFFDYPLAGKLHTAGRLDADTSGLVLLTDDGKWSHRVTSPKYHCQKTYLV TLADPVESNYRQACEQGILLRGEKEPTKPAILEIIDDYNVNLTISEGRYH QVKRMFAALGNKVVGLHRWKIGNIELDEDLPEGEFRVLTAEEIQYF >MS2342 rsuA, RsuA protein MNLNNPIKSGFKSDLKKSTGKSAFYNKKRKSAVSFHQNSVTKRQKSKQLA FSETQVILFNKPFDVLTQFTDENGRATLKDFIHIPDVYAAGRLDRDSEGL LILTNNGEIQHRLADPKFKMEKTYFVQVEGEPTETDLAKLRQGVELKDGM TRPAKVRLIAQPDFIWQRTPPIRERKSIPTSWLEIKISEGKNRQVRRMTA NIGFPTLRLIRYAVSSFTLDNLANGSYRLLTDNELERLYKTLKLTKE >MS1038 rsuA, RsuA protein MKATRKLMKFQTEKLQNRNFRQKPSENRPQFERGGRPERKERGAFESRFR ADDSRQASSFKNDRRDERRNARGDERRSSFSGDESSNRRNERKPAERKPL PMRKPKPAHPVEKKATVEGEKLQKVLARAGQGSRREIESIIEQGRVSVDG KIATLGDRVTVHDGLKIRIDGHLVNLTAAQREVCRVLMYYKPEGELCTRH DPEGRATVFDRLPRLTGSRWIAVGRLDINTSGLLLFTTDGELANRLMHPS QEVEREYSVRVFGQVDDAMIHRLRKGVQLEDGPANFKAIKAVGGTGLNQW FDVTLMEGRNREVRRLWESQGIQVSRLIRIRYGNIQLMKTLPRGGWEEMD LAKVNYLRELVGLPPETETKLDVTNLRRRAKTGQIRKAVKRYSEMNKRYK KS >MS0712 ruvA, RuvA protein MANFIVNVKVCMIGRLQGILLEKQPPEILLDVHGIGYELLLPMTSFYNLP EIGQETVLFTHLVVREDAHLLFGFSAKTDRTLFRELIKTNGVGPKLALAI 
LSAMSVNEFAYAIEHEELSKLVKIPGVGKKTAERLLVELKGKFKGIKQPD FFVESSHVGAVDPVTTSPEVPAEEAVAALMALGYKASDAEKMVKRIAKPH LTSEQLIREALKAAL >MS0713 ruvB, RuvB protein MIEADRIISSNAQLGDEYIDRAIRPKLLTDYVGQPQVREQMGIFIQAAKL RQDALDHLLIFGPPGLGKTTLANIVANEMGVNIRTTSGPVLEKAGDLAAM LTNLEPHDVLFIDEIHRLSPAIEEVLYPAMEDYQLDIMIGEGPAARSIKL DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYSVEDLTSIVARSAGCL NLEMSDGASHEIARRSRGTPRIANRLLRRVRDFADVKNAGIISEDIAKSA LSMLDIDQAGFDYLDRKLLSAVIERFDGGPVGLDNLAAAIGEERDTIEDV LEPYLIQQGFLQRTPRGRIATSRTYRHFGLDKLTE >MS0711 ruvC, RuvC protein MFALFIWSFMAIILGIDPGSRVTGYGIIRQTGRTLEYLGSGAIRTQVEDL PTRLKRIYAGVTEIITQFRPDMFAIEEVFLAKNPNSALKLGQARGTAIVA AVNQNLPVFEYAARLVKQTVTGSGSADKVQVQDMVTRILRLSDKPQADAA DALAIAITHAHTIQHSLQVATSAKSTENHEKTTALLRTRYSRGRFRLKI >MS1964 sPS1, SPS1 protein MRKDMLQVQHENHFFLFNFDENRPNQEHFFESYFWQKQNRIIGSAKGRGT TWFIQSQDLFGVNTALRHYYRGGLWGKINKDRYAFSSLEETRSFAEFNLL NRLYQAGLPVPKPIGAHVEKLAFNHYRADLLSERIENTQDLTALLPNTEL TAEQWQQIGKLIRRLHDLQICHTDLNAHNILIRQQNNDTKFWLIDFDKCG EKPGNLWKQENLQRLHRSFLKEVKRMRIQFSEKNWADLLNGYQN >MS0132 sUA5, SUA5 protein MELAQIVERLKKNEVVAYPTEAVFGLGCNPNSKSAVEKLLILKQRPVEKG LILVAHKLDLLLPFIDESRLKQSHWQLLTQQYDCPTTWVVPAKLSVPKFI TGQFDSVAVRLCTHPAVAQLCEQTGFALTSTSANLSGLPPCKTAQQVRSQ FGEFFPVLDMAVGNAVNPSEIRDIFSRQIFRRG >MS1037 sUA5, SUA5 protein MSQFFYIHPENPQVRLINQAVDILRNGGVIVYPTDSGYALGCMIGDKRAM DRIVQIRHLPEGHNFTLVCSDLSELSTYSLVTNTAYRLIKNNTPGRYTFI LTATKELPRRLMTSKRKTIGIRVPDNQIALDLLRTLGEPILSCSLMLPNE EHITQSDPEEIRDRLEHQVDLIIHGGYLGQEPTTVVDLTEETPVILREGS GPLDPFI >MS1471 sUI1, SUI1 protein MNDIVYSTETGRIKPEKTKQERPKGDGIVRIHRQTSGRKGAGISLIVGLD LPDDELKKLAAELKKRCGCGGSIKDGNIEIQGEKRDLLKQLLEQKGFKVK LAGG >MS0497 sUL1, SUL1 protein MQVRLFFYRCLSFCSIYIRISMLKKWFITKNVFLAVRPFSALKDSFREGY TTQKLVKDIIAGLTVGVIAIPLSMALAIASGVPPQHGLYTAIVAGIIIAL AGGSRFNISGPTAAFVVILYPVTQQFGLSGLLMATLLSGIILVIMALFRL GRLIEYIPLPVTLGFTCGIGITIGTLQIKDFFGLTIDKMPEHYIGKVQAI ITALPTINWADAMVGIVTLLVLINWHKLRLPVPGHLPAVIIGTLLSLVLI HFGYHVASIGSAFEYTLPDGSTGHGIPSVLPQFALPWNIPNAQGEVIEWN FAIIQNILPAAFSMAVLGAIESLLCAVVLDNMTDTKHHSNNELLAQGLGN 
IAAPFLGGITATAAIARSAANVKAGGQSPIASIVHALLVLFALLFFAGAL SYLPLSSMAALLLMVAWNMANVPQIIHLARRSGRNEIAVLTTCLVLTVIF DMVIAISVGVLLASLLFIRTIAEMTKSFEIAHPEDLDDVLVYRISGPLFF AAADNLFADLHEKTVHTDHEIRHIVLQCSAVTVLDAGGIHALTRFVQHML PHQELYMCNMQFQPLRMIVKSNMLTEIQKINFSTDLKETYNKIRLVEAED KKSEE >MS0909 sacC, SacC protein MRSFLPHFSLFYFHQGIMMIIFNNGKYKSILAAEQGELERIKSEVEKDRD FRPYYHLAPSTGLLNDPNGLVFDGEKFHLFYQWFPFDAIHGMKHWKHFTT EDFHIYTEADPLIPCELFESHGCYSGGALPVGDKIAAFYTGNTRRAADNQ RVPFQNLAIFDRTGKLLSKRPLIENAPKGYTEHVRDPKPYFTKEGKIRFI CGAQREDLTGTAIIFEMDNLDDEPRLLGELSLPAFDNQKVFMWECPDLLK VGDNDIFIWSPQGKRREARRFQNNFHAVYAVGKLDDRTFNAAHIAELDQG FDFYAPQTFAGLENQKHAVMFGWCGMPDLTYPTDKYKWHSMLTLPREITL QGNRLVQRPIKEIYQNLTALSQISLQQQAEIQDLDRAYIKFDAENTAFNI RFFANEQGQTLSLSYDGELVCLDRSQTEETEWMKKFASQRYCEIKNLRQV EIFFDRSIIEIFLNDGEKALTSRFFIANRQNSVKTDRTLRLNVGYPKEIE YK >MS0923 sanA, SanA protein MRCKMQGCKKIQSKIVSFLAKSSLKRLVQTCAILAGIAIVSLATLDQMIG YSVRNDIYTDITKVPHRPYGVLLGTAKYFARNTPNLFYVNRLNAAEALFK SAKIDYLLLSGDNRTLQYNEPRTMFKDLRKKGIGEEFLYMDFAGFRTLDS IIRAKEIFNATPMVIITQRFHCERALFIAKFHHIDAICFAAEYPKDYPFV RFREVFARLLMLWELFIEKEPHFLGTPEPLPPALPNY >MS0003 sapB, SapB protein MQSAQFMILLKISIAMCLGAFIGLERELKHKPVGVKTCVIISITTCILTI VSIQSAEYYAEVSNNIRTDPMRLAAQVISGIGFLGAGVILRKNNDAISGL TTAAIIWAAAGIGIATGAGFFFDAIIATLMILVAIRLSPYVMKLAHHRRQ EKDIEVSFTFHLASIQAIGNITELFMSHQCKIEDIAIKDLYNGEVNLTFQ CDIEDHNMLRDVYLHSKVLPDVLAVHLETA >MS1081 sbcB, SbcB protein MTDFSFFIYDFESFGVNPADDRPAQFAGIRTDKDFNIISDPVMFYCKQTN DYLPAPEAVMVTGITPQECNEKGISEPEFAARILAEFSQPNTCIMGFNNI RYDDEMTRYTFYRNFIDPYEYSWKNGNSRWDLLDLVRACYALRPEGINWP LDEEGMPSFRLEKLTKANGIEHENAHDAMADVYATIAMAKLIKEKQPKLF QFFFENRGKKEIEKWIDTAEMTPLVHVSGMLGNYRGNCTWIAPLAWHPIN QNAVIACDLAQNIDDLLNKSAVELRENLYTQKTELENDGVLPVPLKLVHI NKCPIIAPAKTLLPENAQRLGIDRQFCLDNLKKLQKSLDIRDKVIEVFNE ERKFDDSDNVETELYSGFFSKADKNNMTILRTLEPEKLADSGLQFEDKRI PDLLFHYRARHFYKTLNRGEQIKWQKYRRQKLEKSAVQFMESLQHLGEEN SNHPDKLKLLQQIYDYGIKLLA >MS0290 sbmA, SbmA protein MLKNIRLNFNLSIGSGMNYSQELLTSLLWIFKAIGITAVLFSLTVYVLVK TTRWGRQFWMLAAGYISPKRSKKPIGYFVIIVFFNLLSVRLDILFSEWYK 
AMYNALQESHEKMFWIQMVVFSVLATIHIANVLLTYYLTQRFTIQWRTWL NNEMVNRWTENQAYYKAQYVYNKLDNPDQRIQQDVLSFVSNSIEFATGVI SSVVSIVAFTVILWGLAGPMTVVGITIPHAMVYLVFIYVLITSIFAFRIG RPLINLNFTNERLNANYRYSLIRLKEYAESIAFFRGEKMEKNVLFKQFNQ VIGNVWKMVHMTLKLSGFNLAVSQVSVIFPFIIQASRYFSKQIQLGDLIQ TAQSFGRVQTALSFFRNSYDSFTGYRAVLDRLTGFYSAVNQANSASHISI EDSESAVVFDKLTVKKPTGEALIKDLSLNLPQGASLLIKGPSGAGKTTLL RTIAGLWSYSEGIVRCPQHHALFLSQKPYLPQGRLIDALFYPELAPENLD LAQAAEIMRKVQLGHLTDRLEQENDWTRVLSLGEQQRLSFARVLICRPLV AFLDEATASMDEGLEESMYRLLKTELPDTTIISVGHRSTLQIHHTQHLVI NPQDQSWALS >MS1316 sbmA, SbmA protein MNWQTELNNSFSWLITTLIWVSLAFTFFALLLRKTDFGEKFWLVTKPCIE QSNKFKTIGLILFLFLLILLEVRISVLNSFFYNGLYSALQDKKADAFWFF ATINAMLVGFKIIHSIINYLIRQIFEIRWLEKFNDDMLSRWLDHKNYYRL KYEKDLPDNIDQRIEQDAREFITGTVDLVDGILGAIVSIIEFTIILWGLS GLLVLFDISIPKGVVFFIYTFIIIATALSVWIGYPLIKLNFNKEKLNGDY RYSLIRIRDNAESIAFYDGEQKERQYLNERFKAIIKNRWAIVRQMLGLDG FNTGVTQIAMILPLMLQAPRFFAGQATLGDMHQTVQAFNRLMRALSFFRL FYEQFTLYQARLNRLYGFIGKLNELDTHLIPNPIECSQLVALENFGLKDA KGNVLFEGINLELSAGDALLIQGASGTGKTTLLKAIAGIYPFETVGRSKR PCNGKILFLPQRPYMPQGSLREAICYPNIDPHHPELESYMLKCHLDKYIF ALDQENDWQAILSPGELQRVAFIRIFLTKPDVVFLDETTSALDEPTEHSL YSKIRQALPGMIILSVGHRCTLQQFHTKHLVIGLDKSSRTI >MS1255 sbp, Sbp protein MLNVAYDVIRDFYKEYNLEFRQAYKAQQGQDLMISQSHGGASKQTLSVAS GLPADVVTLTQSSDVDILVKKGLVDSHWQQALPNHSVPFGSVMVFLVKKG NPKNIHDWHDLIRDDVSVIFANPKTSANGRFAYLSAYAYAKAQGDEQQAQ AFMKKILARVPVLESGARGASISFTQRNLGDVLIAPENEAALAAKALGEN SFSVIYPSYTAYTPVYVAEVNANTKINGTHEQAQAYLRNLWSEAAQELAV KHHFRPTNEKILQKSTALFPPVNSFDVNQVFGDWAIINQTHFADNALFDQ LYIAAQHKDK >MS1311 sdaA, SdaA protein MISVFDMFKVGIGPSSSHTVGPMKAGKQFIDDLITQGNIGKITRIHADVY GSLSMTGLGHNTDITIIMGLAGYLPHNVDIDSIADFISRVKQTALLPVAG GSYTVDFDFKQDMQFHDSFLSLHENGMTLTAFMNDEIAYRQTYYSIGGGF IVGEAHFNQAQNEEVPVPYPYNNAADILRHCHDTGLPISTVVFRNEVALH GKESVEHHLSLIWQTMQDCIKHGLKTEGLLPGPLKVSRRAPALHRLLQAN SNLNNDPMQVIDWINMFALAVNEENAAGGRVVTAPTNGACGIVPAVLSYY EKFISPLNAETVERYLLVCSVIGSLYKMNASISGAEVGCQGEVGVACSMA AAGLTEILGGNPEQVCIAAEIAMEHNLGLTCDPVGGQVQVPCIERNAIAS VKAINAARMALRRSTNPRVTLDKVIETMYETGKDMNAKYRETSKGGLAIK VVCS >MS0977 sdaC, 
SdaC protein MLHIILTLNRKFKMKNKTFGSALLVAGTTIGAGMLAMPLTSAEMGFTYTM ALLFLLWILLSYSALLFVEVYQTVQRKDAGIATLAEQYFGMVGRVLATLS LVIFMYAILSAYVTGGGSLLAGVLPFLGEHAAPISIIAFTVILGIFIVIS TGAVDGLTRLLFMIKLVAFVLVLTMMLPLVQGENLMAMPLKEFLIISASP VFFTSFGFHVIIPSINNYLDGNIKRLRAAIIGGTALPLVAYILWQMATHG VFPQAKFVEIINNDPTLNGLVDATYHVTGSNLISGSVRLFSTLALVTSFL GVSLSLFDCLDDLLKRINIKAGRLALGVLTFLPPLAFALFYPEGFIAALG YAGQMFTFYGLVLPVGLAWRARKLHPNLPYRVIGGNLTLLIALLLGLLIM NVPFLIEGGYLPKVIG >MS1895 sdaC, SdaC protein MYYYNSSKSYLTWKTFMEKSMKNKKQPSLLGGAMIIAGGTIGAGMLANPI STAGVWFLGSLLILIYTWFCMMSSGLMLLEANLHYPTGSSFDTIVKDLLG KGWNILNGLSLAFVLYILTYAYITSGGGITEGFLNQLLSSEQSAVEIGRS SGSLIFTFVLAVFVWFSTKAVDRFSTILIGGMVISFFLSVSGLISSANAD VLLNSATSQDTQYLPYALVALPVCLVSFGFHQNVPSLVKYYNRDAGKVSK SVFVGTFIALIIYILWQLAIQGNLPRAEFVPVIEKGGDIAALLAALSKYI QTDYIALALNFFAYMAIASSFLGVTLGLFDYIADLCGFDDSKAGRTKTAL ATFLPPLLLSLQFPYGFVIAIGYAGLAATIWAAIVPALLAKASRKKFNKP SYSCFGGNLMVYFIIIFGVLNILSQLAMQFGWLPEFKG >MS1652 sdhA, SdhA protein MQTVNVDIAIVGAGGGGLRAAIAAAEANPNLKIALVSKVYPMRSHTVAAE GGAAAVIKEEDSYDKHFQDTVAGGDWLCEQDVVEYFVQHSPVEMTQMERW GCPWSRKQDGDVNVRRFGGMKIERTWFAADKTGFHLLHTLFQTSIQFPQI QRFDEHFVLDVLVDDGHARGVVAMDMMEGKLVQINANAVVIATGGGCRSF KFNTNGGIVTGDGLSMAYRHGVPLRDMEFVQYHPTGLPNTGILMTEGCRG EGGILVNKDGYRYLQDYGLGPETPVGKPENKYMELGPRDKVSQAFWQEWK KGRTLKTAKGVDVVHLDLRHLGEKYLHERLPFICELSQAYEGVNPVNEPI PVRPVVHYTMGGIEVDFNSETRIKGLFAVGECASSGLHGANRLGSNSLAE LLVLGRVAGEYAAQRAVEATAANQTAVDAQAQDVVRRLEDLFNQEGTENW ADIREEMGTAMEEGCGFYRDQASMQTAVDKIAELKERCKRIRIQDRSSVF NTNVLYTVELGYILDVAQAIANSALERKESRGAHQRLDYVERDDTNYLKH TLAFYNENGAPRIDYSPVKITKSQPAKRVYGAEADAAEAAAKAKENANG >MS0327 secA, SecA protein MLKTIATKIFGSRNDRVLRKLNKVVKKINGLEPAFSALTDDELKAKTAEF RARLEKGESLESLMPEAFATVREASRRVLGMRHFDVQLIGGMVLTNRNIA EMRTGEGKTLTATLPCYLNALTGKGVHVVTVNDYLANRDAETNRPLFEFL GMTVGVNIPGLPPEVKRAAYQADITYATNSELGFDYLRDNLAHSKEERFQ RQLHYALVDEVDSILIDEARTPLIISGPAEDSSELYIAIDKLIPLLVKQD KEDTEEYQGDGDFTLDLKTKQAHLTERGQEKCENWLIENGFMTENESLYS PAKIGLVHHIYAALRAHTLFERDVDYIVKDGEIVIVDEHTGRTMAGRRWS DGLHQAIEAKEHVKIQGENQTVASITYQNYFRLYEKLAGMTGTADTEAFE 
FQQIYGLETIVIPTNRPMIRDDRTDVMFESEAYKFQAIIEDIKECVARSQ PVLVGTASIEKSELLSNELDKAGIPHNVLNAKFHAQEAEIIANAGYPGAV TIATNMAGRGTDIVLGGNWRAEAAKLENPTEEQLEALKAAWQERHDVVMK AGGLHIIGTERHESRRIDNQLRGRSGRQGDPGSSRFYLSLDDSLMRIYLN EGKLNMMRKAFSTPGEAMESKLLAKVIASAQAKVEAHNFDGRKNLLQFDD VANDQRHAIYAQRNDLLDHEDISETIKAIREDVYNEVIDQYIPPQSLEEQ WNIAELEKRLKQDFALDLPIQQWLEEDNQLHEDNLRERIIASAVEEYQHK EEIVGAETMRNFEKGVMLQTLDELWKEHLAAMDQLRKGIHLRGYAQKDPK QEYKKESFQMFTEMLDALKLTVIRTLSRVQVRTQEEAQAEAAQQAAAESK DYADDSASGERSVAQTTQRIGRNDPCPCGSGKKYKHCHGNRAAHEA >MS2214 secB, SecB protein MAEENQTPATATEEQQAVLQIQRIYVKDISFEAPNLPHVFQQEWKPKLNF DLSTEAKQLGEDLYEVVLNISVETTLEDSGDLAFLCEVKQAGVFTISGLE DMQMAHCLTSQCPNMLFPYARELVSNLVNRGTFPALNLSPVNFDALFMEF LQRQEQESQNAESSTEVQH >MS1563 secD, SecD protein MLNRYPLWKNLMVILVIAIGVLYALPNIYGEDPAVQISGTRGQTATETTL TDVQTLLTSNQLEPKSIKLEEGSILARFHNTDDQLLAKDKITEKLGQSYS VALNLAPSTPAWLSSIGGNPMKWGLDLRGGVRFLMEVDMNTALSKRQEQL QDSLRTELRKEKIQYSAIKNTDNFGTGVTLVKPEQLSDAGRFLRKQHPNL NITESADNTLNLALSEQALTEARENAVEQNLGILRKRVEELGVSEAVIQR QGAERIVVELPGIQDTARAKEILGATATLEFRLVNQNVSPEAMVRNIVPT DTEVKFMRDGQPVALFKRAVLGGEHITNASSGLDQQTSRPQVSVTLDSEG GEIMADTTRLNIKKPMATLYVEYKDSGKKDENGKVILEKHEEVINVATIQ GRFGSQFQITGIDSPAEAQNLSVLLRSGALTAPIQIVEERTVGPSLGAQN VAQGLNAGLWGLAIVIVFCLVFYKVFGLVASLALCANMVLVVGLMSLLPG ATLTMPGIAGIILSVGMSVDANVLIFERIKEELRNGRPIQQAINEGYNGA WTSIFDANLTTILTSIVLYAVGTGPVKGFAVTLALGVAISMFTAITGTRM LINWIYGGKRVEKLSI >MS0204 secE, SecE protein MALAIDKKKKNAPEEVEQKSKGLNTFLWVLVAVVIAVAAFGNVYYAEQFS TAVRVVAVVVLLAVALGIAAVTNQGKVALAFFGESRTELRRIVWPTRPEA MQTTLIVIGVTVLTSLILWGFDSIIVSIINFLTDLRF >MS1564 secF, SecF protein MTVTTNTKQKHEYKGIGLPFSLVHFMKYRKFGYLFSIIVIALSLFSIFTK GFNWGLDFTGGVIIDTHFSQPADLEQVRSTLKTGGIESALVQTTGSANDV AIRLPASASDANIGNNIKNMMTSLDKDIQIRSVEFVGPNVGEELTQGAIY ATLATLILLLAYVGMRFEWRLGLGGILGLAHDVIVTIGLFSFLQIEIDLT FVAAILTVVGYSLNDSIVVFDRVRENFRKIRRLSSEEVINISLTQTLSRT LMTSVTTLFVVFSLLFFGGPSIYSFSLALLIGIGFGTYSSIFVAIALAFD FGLDREHMVVKVVEKEDFQEGL >MS0729 secG, SecG protein MYQVLLIVYLLVSIALIGFILVQQGKGADAGASFGGGASGTVFGSAGSAN 
FLSRTTAILATIFFVISLVIGNINSHKNNVQQGTFDDLSQAAEQIQQKTV PAPVENKNADIPQ >MS2028 secY, SecY protein MAKQPGYQSRSTQSGSSELKSRLLFVLGALIVFRVGSFIPVPGIDAAVLA QLIEQQKGTIIDMFNMFSGGALSRASIFALGIMPYISASIIIQLLATVYP ALNELRKEGESGRRKISKYTRYATLGLATLQAIGISTGLPNMLPGLVPNL GFGFYFTAVISLVTGTMFLMWLGEQITERGIGNGISLIIFAGIVAGLPSA IGQTIEQARQGQMHLLVLLLIAVIVFAVTYFVVFFERGQRRIKVEYAKRQ QGRQILGGHSTHLPLKVNMAGVIPAIFASSIILFPATLTQWFGEGSSLEW LTDLSMLLHPGQPLYLIVYAIAIIFFSFFYTAMQYNPRDTADNLKKSGAF IPGIRPGEQTSRYIDKIMTRLTLIGGLYITFVCLVPYIMMSAWNVQFYFG GTSLLIVVVVIMDFIVQIQSHLMSTKYESALKKANLRGFGQ >MS2338 selA, SelA protein MTALFQQLPSVDKILKTPQGEQLVTEFGHSAVVNCCRHLLAQAREKIKIE KKLPHFFTDFNHTIAEVNRYLANQQQVKIKSVHNLTGTVLHTNLGRALWA QSAQQAALTAMRQNVALEYDLEAGKRSHRDNYVSELLHELTGAQAACVVN NNAAAVLLMLATFAQGKEVIISRGELIEIGGAFRIPDIMAQAGCKLVEVG TTNRTHLNDYRRAINENTALLMKVHSSNYQICGFTCEVSEQELVELGKEF NIPVVTDLGSGALTDLSRYDLPKEPTVQEKLVQGADLISFSGDKLLGGPQ AGIIVGKKELIQQLQSHPLKRVLRCDKVILAAMEATLRLYLQPEKLTEKL TSLRLLTQPLEQLRQQAEQLKAKLENLLKDDFLLQIESSLAQIGSGSQPM AKIPSIAVTIAEKNSEKLTALLARFKKLSTPIIARVENDKIRLDLRSVTA IETLLITLEELNQDQ >MS2339 selB, SelB protein MIIVTSGHVDHGKTALLQALTGMNTAHLPEEKKRGMTIDLGYAYLPVGDK ILGFIDVPGHEKFLANMLAGLGGIHYAMLIVAADEGIQAQTKEHLAILRL LQIEKIMVVISKADRASSAKIDELKTKILTDYPFLAESPFFVTSAVNGRG IAELREFLTALPNPADKDKPFRYAIDRIFTVKGAGTVVTGTAFSGKVKID DELYLSNGGKVRVKNIHAQNRQNTEGLAGQRLALNINADLDRTQIERGDW FFSQAPFEPTERFTIQLTAETSLTENQPVHVYHAASRTTGKLALLIEKAI YPGQQTFAELILDNPLFLAYGDRIILRSGDAKQLVGGGKVLEINSPKRHK RSEQRLQWLVQLQQAHSADERIALYLQDKAVEARAITWIEQLTELQLNEI INKNNDIRFQHWCFNQNYQHRQNHKILTALSFYHDQHEDQLGLGKARLYR IAALNQPEKLIYHFIDELLEQGKLQQTRGWLHLAEHKIQFSTEERGLWQL VLDEFEKQKGQPLWVRDLAQNLGFDETLMRNFLYKAGKLGFLTAVVKDRF FLTEHIYGYARLIKQMIEQNGAVSVNQLRDELQIGRKLAVQLMEYFDRSG YLRRKGNIHILRDTEAFDL >MS1241 selD, SelD protein MLGTILHSQLEQFVDPHLLVGNDTNDDAAVYDIGNGTCIISTTDFFMPIV DDPFDFGRIAATNAISDIFAMGGKPIMAIAILGFPINVLPAEVAQKIVDG GRFACREAGIALAGGHSIDAPEPIFGLAVTGIVPTEKVKRNASAEAGSKL YLTKPLGIGILTTAEKRGKLKPEHKGLATEVMCQMNLIGSQFSQLESVTA MTDVTGFGLLGHLAEICEGSNLVADVHFNKIKMLDGVPYYIEQGCLAGGV 
TRNYESYGIKIGAITEFQKAVLCDPQTSGGLLVAVKPEGETQLLELAAQA GIELIEVGELRRRVDNSDPVIIRILD >MS0863 seqA, SeqA protein MLREDRMKIIEVDEELYQYIASQTKSIGESASDILRRLLNLPVSGVNLTA VDLTQSTMNSTNEEKGTQLPAEKNVVAETPKPSSEQEIRTPARKQSTQSI QHIVTKVKNLLQSEAFQEESKMVVRFLNILSVLYRTNPESFAQATEQETS QGRTRTYYARDEATLLAAGNHTKPRQIPDTPYWVITNTNSGRKMLMLERT MQFMELPEELIDEVRPYFAVV >MS1743 serA, SerA protein MTNKVSLDKSKIKFLLLEGVHQNALDVLHAAGYTNIEYHKKALEPDELKE AIKEAHFIGLRSRTNLTADILEHANKLIAIGCFCIGTNQVALEAAEEKGI PVFNAPFSNTRSVAELVLGEILLLMRNIPAANAQVHRGEWNKSAAGSHEV RGKKLGIVGYGHIGSQLSIIAESLGMNVFFYDVETKLPLGNAQQVSTLEE LLSSCDIISLHVPELPSTKNLMSAERIAQLKPGSILINAARGTVVDIDAL AEALEQGKIHGAAIDVFPKEPASAAEAFESPLRKFDNVILTPHIGGSTAE AQENIGTEVASKFVKYSDNGSTLSAVNFPEVSLPEHRTAKRILHIHHNRP GILNKINQVFVDENINIAAQYLQTDAKIGYVVIDVETDDSTDLLQKLKSI EGTIRARVLF >MS0068 serA, SerA protein MIMKVVISHRLHDNGMKVLEDANAQVAITNDGNPKIMLPELLDAEGLIIR IGSIDRETMLQAKNLKVIGRPGVGVDDVDVKTATELGIPVVIAPGSNTRS VAEHAFALMFACAKDIVRSDNEMRKGNFAIRSSYKAYELNHKTLALIGYG RIGSILAQMSKAIGMNVKVYDPFVKQGTIEQEGYIYCTELDDVIRDSHVI SIHVPLTNETRNLIGEHEFSLMNEHTILINCARGEVIDEPVLTKVLQEGK IHSAGLDVFACEPVDINSPLFQLDNVIVSPHMAGQTKEAASGVATMAAEG VVAVINGEKWPYVCNPEAYNHPRWNK >MS1758 serB, SerB protein MQTSEFINLTLKDIKQHYSPFPNKLINNQPQTEGRDYFILFGTNLEPAKL QAFQQKCGENFQIFDCWNNLHNIVVLLKGHWQKSYETHAHDLTLDAAKID FNANLAEQGLLVMDMDSTAIQIECIDEIAKLAGTGEEVSAITAAAMRGEL DFEQSLRRRVSTLKDAPETILQEVRLQLPLMPGLKETVRILQQHNWRVAI ASGGFTYFADYLKELLNLDAAVSNQFDIENGKLTGRVKGDIVHAQYKADT LKRLAREFNIPLENTVAIGDGANDLLMLKQANLGAAFHAKPKVQQQAQVV VNFADLTALLCLLSAGEKIKHLS >MS1573 serC, SerC protein MSNVFNFSAGPAMMPPAVLKKAQEELLNWQGQGTSVMEVSHRGKYFMELI TQADKDFRELYNIPENYKILFLQGGARGQFAAIPMNLANNKGKALYLNTG HWSATAAKEARNFTEVDELNITEQIDGLTRVNRLDFSDIAEQYDYVHYCP NETITGVEINEIPNVGNAVLVADMSSNIMARKLDISKFGIIYAGAQKNLG PAGIVIVIVREDLIGHARKATPSIWNYEVQANADSMINTPPTFAWYLCSL VFKDLLANGGIDTVEKRNAQKAALLYDYLDQTVFYHNTIAKENRSVMNVT FTTGDDQLNAKFVAQATEAGLQALKGHKVFGGMRASIYNAMPVEGVEALI AFMKKFEAENA >MS1450 serS, SerS protein MIDPNLLRNNLAEVAATLKLKRNFILDTKELAELEEQRKALQVETETLQA 
KRNARSKAVGAAKARGENIAPLLAEMDDMGHELATVKAELDEILAELNTI ALTIPNLPADEVPLGKDDSENKEISRWGTPRQFDFEIKDHVTLGENLAGG IDFAAGAKLSGARFAVMKGQVAKMHRALAQFMLDLHTEQHGYTETYVPYL VNHTTLYGTGQLPKFGEDLFHTTPLEGEVPYALIPTAEVPVTNLVRDEIL NTEDLPIRMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVDPDKS MEALEELTAHAEKVLQLLGLPYRKMLLCTGDMGFGSCKTYDLEVWVPAQD TYREISSCSNMWDFQARRMQARCRSKTDKKTRLVHTLNGSGLAVGRTLVA VLENYQNEDGSVTVPEVLRPYMGGLEVIGK >MS0390 sfcA, SfcA protein MDEQLRQAALDFHEFPVPGKIEVTPTKSLATQRDLALAYSPGVAMPCLEI QEDPAKAYNYTAKGNLVAVISNGTAVLGLGNIGALAGKPVMEGKGVLFKK FAGVDVFDIEINEKDPEKLVEIIAALEPTFGGINLEDIKAPECFYIEQKL RERMNIPVFHDDQHGTAIISSAAVLNGLRIINKKIEDVRLVASGAGAASI ACLNLLVSLGMKRENITVCDSKGVIYKGRDENMDATKKLYAIDDNGTRSL ADAIPNADIFLGCSAAGALTQEMVKTMGPNPLILALANPNPEITPPEAKA VRPDAIVCTGRSDFPNQVNNVLCFPFIFRGALDVGATTINEEMKMAAVRA IADLALAEQSDVVSSAYTDESEVTFGPEYVIPKPFDPRLIIRIAPAVAKA AMDSGVATRPIQNFDAYIEKLTQFVYKTNLFMKPVFNQAKADKKRVLLTD GEETRILHAVQEISTLGIAYPVLVGRLDVIEAQIKRLGLKIQAGVDFEVL NTDNEEIYQQCWSLYHNKLKRHGVTEAMAKRRMLTNSTAIGSALLELGYA DAMLCGLVGTYSSSLSLLKEVIGIKENVDIPATVNGLVLPSGNLFIADTF VNLAPTAEELAEITLMAAEEVRRFGIEPQVALISHSNFGTSEDQSAVKMR EVLQLVKTQAPDLIIDGEMHANVALNENLRREVMPDSPLKGAANLLIMPD MESARISLNLLQGTATPITIGPILMGMKKPAHILTSVSSVRRIINMVAIA AVKAQQN >MS1695 sfp, Sfp protein MTTFIAFGNIQQHYPLRQIPPEFLTEELGRKPTENIRVKRRHRSRWIAHF LLWELCKKAQIPTALLADIQRSVSGRPYFTPPHIDFNISHSGDWVAVILS VNTPQSIVGIDIEHPKKMRNYTALLAHFASQREQNWFAGEADADSAFYRC WCLREAILKSQGVGIVQLSEVFHDPQNLRLQSAYCPSGRLIFTDELPFYF ACFAANSQLEQAQYFCWENNGFSPVSLNNAIKYSVNKT >MS1230 sfsA, SfsA protein MRLPPLQAAKFIRRYKRFMADVELANGNILTIHCANTGAMTGCAEKGDTV WYSDSKSTTRKYPCSWELTELSNGNLVCINTHRSNQLVQEALQNKVIKEL AGYSEIYPEVKYGEENSRIDFLLKGEGLPDCYVEVKSITLVKNNIGMFPD AVTTRGQKHVRELLAMKKQGYRAVVLFAGLHNGFDCFKTAEYIDPDYDKL LRQAMKEGVEVYAYAGKFDKIQEIPTALSLAEVVPLCFN >MS0150 sgaB, SgaB protein MKIMAVCGHGLGSSFMMEMNIKKALKTLGKEAEVSHQDLASVTANEADLF VMGADIANSSGLPADKVVVVKNIVSVKEFEEKLAEYFNQ >MS0022 sgaT, SgaT protein MKNKEVYMETLYNLFLGFNAQVLSKAPFLLGIVACFGYILLKKDTTTIIK GTIKTIVGFMMVQVGSGVLTTSFKPIIEKLSEFHHLAGAVIDPYTSMQST 
IETMGENYGWVGYAVLLALALNILLVVCRRITGIRTIMLTGHIMFQQAGL VAVFYMIIGASMWETVIYTAVLMALYWGISSNIMYKPTQAVTGGAGFSIG HQQQIASWVATKLAPKLGDRNDSVDNMKLPKWLHIFHDSISATALVMTVF FGIILLSFGLDNLQTMAGKTHWFMYILETGLKFAVAIQVIVTGVRMFVAE LSEAFKGISERVIPNSVLAIDCAAIYAFSPNAMVFGFMWGAIGQFFAVGV LLMVGAPVLIIPGFIPMFFSNATIGVFSNQFGGWKAVMKICFVMGIIEVL GSAWVIQLLASQGTTFNGWMGMADWALFFPPVLQGIVSIPGFFFIILALA MVYMYFASKKLRADEAAAAAAGKTLEQMDGYGLDDIDEPEAETVEQSDNE TVTQSAVKPVRILAVCGSGQGSSMMMKMKIKGYLDKRGIPNIMDSCAVTD HKGKLDSTDIIVCSKHLADEISANDKISVLGVQNMLNPNSFGDELLALIK KYQN >MS0151 sgaT, SgaT protein MDSILFFILDILKVPSVLVGLIALVGLVAQKKAFPDIIKGTVKTILGFLV LGGGATVLLSSLTPLGSMFEHAFNVQGIIPNNEAIVSMALEKYGTATALI MAFGMVANIIVARFTRLKFIFLTGHHTFYMACMIGVILTVAGFEGVQLVF VGALTLGLIMAFFPAIAHFYMKKITGSNDVGFGHFGTIGYVLSGAIGQAV GKGSPSTEEMDLPKNLSFLRDSSISISLTMMIIYFVLAIASGNEYVSTHF SNGQHYLVYATIQAITFAAGVYVILQGVRLILAEIVPAFTGFSEKLVPDA KPALDCPIVFPYAPNAVLVGFLSSFMGGIIGLVLLGQLNWVLILPGVVPH FFCGATAGVFGNATGGRRGAILGAFAHGLLITFLPVFLLPVLGSLGFANT TFSDTDFGGVGIVLGNMAQFMSKDMIMIVIVAIFLLLVGYNYLAKKPVKT EE >MS0047 sgaU, SgaU protein MRKHKLGIYEKALPKGISWQDRLSIAKACGFDFVEISIDETDERLARLDW TPEQRIELVSAIIKTGVTIPSMCLSGHRRFPFGSHDEATRQKAYEIMEKA IKLAVDLGIRTIQLAGYDVYYEEQDEGTLQRFREGMEWATELAASNEVTL AMEIMDTKFMSSISRWKKWDEIIKSPWFTVYPDVGNLSAWNDNVEEELTL GMDKISKIHLKDTYKVTESCKGQFRDVPFGEGCVDFVNVFRILDKLNYRG AFLIEMWTEKSDEPIAEIINARRWIEQKMKEGGFQC >MS0056 sgbH, SgbH protein MSKPLLQIALDSTTLEKAVADAKQAESSVDIIECGTILACAEGMKAVSVL RALHPQHILVCDLKTTDAGTVLAKMAFEAGADWLTVSAAAHPATKAGCKK VADDFNAANPDLNVKKEIQIELYGNWTLEDAESWLESGIKQVIYHRSRDA ELAGKGWTEEDLELMKRLSALGMEISITGGIVPEDIHLFKEIKNAKAFIA GRALVGEKGRQTAAAIRDKINEYWN >MS0020 sgbH, SgbH protein MSKPLLQIALDSLSLEKAVADAKKAENSVDIIEIGTILACAEGMKAVSTL RALHPNHILVCDLKTTDGGAILAKMAFEAGADWLTVSAAAHPATKAACKK VADEFNAAHPELKVKKEIQIEIYGNWTLEDAKQWVELGVTQAIYHRSRDA ELAGKGWMPEDIEKMKQLESLGLELSITGGIVPEEIHLFKDIKKAKVFIA GRALVGEKGQQTAAAIRSEIDKYWV >MS0173 sirA, SirA protein MKYNTMNEIISNHTLDALGLRCPEPVMMVRKQIRHMQDGEVLLIIADDPA TTRDIPSFCQFMDHTLLNSETESLPFKYWVKKGL >MS1561 sirA, SirA protein 
MQYRLDLTGYICPLPLLMARQVLDKLEKGAILTLFLNHTSAVTDFVSLCE QQGYQLISTENSADKFILTIKK >MS1198 sixA, SixA protein MYLLGVRMKIFIMRHGEAEMLAKSDKARHLTENGKNQALQQGLWLKSNNI NLDLVIVSPYARAIETLDQINQAYDNNLTDKTEIWDGLTPYGDAEMISDY LATIAEEQPEMSVLLVSHLPLVGEIVAELCGKNPISYHAATIAQIEWDTE KGVIEQIKYYGR >MS1359 slp, Slp protein MIYQIISEDKMKKWLIFPLIVLLSACVPAPEGLERDEFTIQSLRQIEDSD YVCQCRKVRLGGKIISAEALKNQTKLEILSLPITTYSAKPVIESATDGRF IAYLDGFADPASLKDQYITVAGILKEQWRGKIDEADYLYPIIKVTAYKQW RLAKEYYYEYDDWHDYRFRRFGHFRHWGWDPFWRPELKLRYRLY >MS2199 slpA, SlpA protein MKVAKNIVVGIAYQVRTEDGVLVDEAPTNQPLEYLQGHNNLVIGLENALE GKAVGDKFEVRVKPEEGYGEYNENMVQRVPKDVFVGVDELAVGMRFIADT DMGPLPVVITEVSENDVVVDGNHMLAGQELLFTVEVVSAREATPEEIAHG HIHSGDHDHGHGGCGCGGHGHEHDHDHSHGGCGCGGHGHHHDHGHDHHHG EGCCGGHGKKEGHGHGNCGCGGHGH >MS1326 slyB, SlyB protein MKKMTLALAVLVSLGLTGCANTDVYSGDVYTGAQSKEARSISYGTIVSAR PVKIQANNQGVIGTVGGGVLGGITGSTIGGGSGRAVASAVGAIAGAVAGS KIEEKVSQVDALELVIKKDDGKEIVVVQKADASLKAGARVRIVGGSTLNV SAI >MS0156 slyX, SlyX protein MAIVQHLFGITMQNSANLEQRIAELEMKITFQEGIIEELNQALIEQQFVI DKMQLQMRHVANKLKDLQPANIATQAEETPPPHY >MS0041 smf, Smf protein MAQYSAEQLSEFDAAEWRKIGWNDQQIQTWLNPNMRYLEPALRWNEQPEQ HILHYRQENYPELLKQIHSAPPLLFIKGNPELLTQPQIAIVGSRNCSDYG EYWAKHFASELSATGFVITSGLALGIDGFCHQATVEQQGQTIAVLGSGLQ HIYPARHKKLARRIIETNGALVSEFFPTHPPIAENFPRRNRIISGLSLAT LIVEATERSGSLITARYALEQNREVFAIPGNIQNQYSQGCHTLIKQGAML VERISDILENLPHFSINYRPPAKVRSQVQTAQLAAPEVQVSYPELYKHIS SLPISIDDLINATGLNVNELLVQLLELELQNLICQQNGLYQRN >MS1912 smpA, SmpA protein MQIKPIITALLLALSVTSCSTVSKVVYRVDVPQGNYLESAAVSQLQVGMT REQVQYILGTPVLNDPFSTNTWYYVYLQQRSYETPEQHTLTVNFNQQGTV ESFDLDKPLPDQEKQVVNNANITMPESQSTSWWQFWK >MS1145 smpA, SmpA protein MKLTHLLLSAVAATSLIACGNLSDVTDEGTSENLVWPKIDESRFNHDGSQ FGSWPNWDSVRMIERGMNKDQIRNLIGSPHFSEGLYGIREFDYAFNYREN GVHKICQYKILFDKNMNAQSFFWHPNGCNANSSFSLSADFLFDFDKDSLT ERGKKVVDSVAEQLKASKAKTVKVAGFTDRLGSEAYNLELSQRRANQVKE WLIARDVKADIDAIGYGSAQQIKPCTGLKAGKALRDCLRPNRRVEISSSG TVLKKFEENSKKSVGPAVFYQK >MS1505 smpB, SmpB protein MIYMTKKKVKPGSNTIALNKRARHDYFIEEELEAGLSLQGWEVKSMRAGK 
ANISDSYIIFRDGEAYLFGATIQPLTVASTHVVCDPTRTRKLLLNQRELA SLFGKANRDGYTIVALSLYWKNAWAKIKIGLAKGKQQHDKRNDIKDREWK MQKERIMKNANRG >MS2306 sms, Sms protein MAKAPKTAYVCNDCGAEYSRWQGQCLACKAWNTISEVRLVSAKQPANRND RFSGYAGETQAKIQTLAEINLQETPRFSSGFKELDRVLGGGIVPGSAILI GGHPGAGKSTLLLQVMCGLARNMTALYVTGEESLQQVAMRANRLGLPTDR LQMLSETSVEQICSLADQLKPQILVIDSIQVMHLADIQSSPGSVAQVREC ASFLTRYAKTRQVAIIMVGHVTKDGTLAGPKVLEHAIDCSLLLEGESDSR FRTLRSHKNRFGAVNELGVFGMTEQGLREVKNPSAIFLSRGEEQTPGSSV MVLWEGTRPLLVEIQALVDHSMLANPRRVAVGLEQNRLALLLAVLHRHGG LQMADQDVFVNVVGGVKVTETSADLALLLALISSFRNRALPQDLVVFGEV GLAGEIRPVPSGQERISEAAKHGFKRAIVPYGNKPKSAVENMQVFTVKKL ADALDILDSLDY >MS0776 smtA, SmtA protein MIDFRPFYQQIAVSELSSWLETLPSQLARWQKQTHGEYAKWAKIVDFLPH LKTARIDLKTAVKSEPVSPLSQGEQQRIIYHLKQLMPWRKGPYHLHGIHV DCEWRSDFKWDRVLPHLAPLQDRLILDVGCGSGYHMWRMVGEGAKMVVGI DPTELFLCQFEAVRKLLNNDRRANLIPLGIEEMQPLGVFDTVFSMGVLYH RKSPLDHLSQLKNQLRKGGELVLETLVTDGDEHHVLVPAERYAKMKNVYF IPSVPCLINWLEKSGFSNVRCVDVEVTSLEEQRKTEWLENESLIDFLDPN DHSKTIEGYPAPKRAVILANK >MS1338 smtA, SmtA protein MKSELICYKKMPVWNKNSLPKMFQEKHNTKAGTWGKLTVLQGKLKFYTLN EDGSIVNEHIFSANTDTPFVEPQQWHKVEALSDDLECYLEFYCTKEDYFG KKYNMTATHSDVLKTAKIITPCKVLDLGCGHGRNSLYLALKGYDVTSWDH NAASIAFLADSAAKENLQIQTAVYDINNANIQENYDLILSTVVFMFLDRE AVPAIIDNMQKHTNAGGYNLIVAAMSTEDMPCPIPFAFTFGENELKNYYQ GWEFVEYNENIGELHKTDKNGNRYKMKFVTMLAKKVK >MS0203 smtA, SmtA protein MTTFMNNKTKSAGFTFKQFHVSHDKCAMKVGTDGILLGAWASLQGNRYLD LGTGSGLIALMLAQRTQTDCHITGVEIDPSAYRQATENVRQSPWADKIQL EQQNIVDFTRTCTKKFDTVLSNPPYFEQGVDCRDKQRDTARYTQTLSHSD WLNLAADCLTNTGRIHLILPYAAGKNLQKQTALFCARCCEVITKSGKIPQ RLLLTFSKQPCTTEQSRLVVYNEQNQYTEQFIALTRDFYLNF >MS0467 smtA, SmtA protein MSLNLNQVSLLQNVTRYWNNRAEGYSRHNQQELQSIKRLKWQQLLLAHAP KKQNLKVLDIGTGPGFFAIIMAQAGAQVTAIDATSNMLEQAKYNAAQAMV DIRFVRGDVHHLPFADESFDLIISRNVTWNLSEPEQAYKEWHRVLKCGGN LLNFDANWYLFLYDEQRRRAFEQDRASTIRLNIPDHYADTDTSAMEAIAR KLPLSRQLRPHWDMNALLNIGFSQLMADTRIGEFLWDDEEKVNYRSTPMF MIVAQK >MS1894 smtA, SmtA protein MKESVYDSEGFFELYQKLRANPGSLNEIVEKPTMLSLLPDITGKTLLDMG CGTGGHLQMYLRLGAKRVVGIDLSASMLKQAEIDLGKLCENRLQFSSGSF 
SLHHLPMEQLDQLPEAQFDVITSSFAFHYVENFPALLTKIANKLTARGSL VFSQEHPVVTAYQGGERWEKDENKQQIAYRLNFYRDEGKRERSWFKQPFL TYHRTISTIVNNLIQVGFTIEKMAEPMLADQAEWQTEFKDLQHRPVLLFI RAKKS >MS2368 smtA, SmtA protein MNIQLICETENSQNFTALCKEKGLTHDPASVLALVQTETDGEVRLELRKL DEPKLGAVYVDFVAGTMAHRRKFGGGRGEAIAKAVGVKGNELPSVIDATA GLGRDAFVLASIGCRVRLVERHPVVYLLLQDGLRRAYADPEIGEMMQKNM QLLPVHHITELNPFEDFADVVYLDPMYPHKQKSALVKKEMRVFQYLVGAD SDSNLLLEPALKLAKKRVVVKRPDYAEFLAEKAPQFSRETKNHRFDIYSV NV >MS0706 smtA, SmtA protein MSKDTIFSTPIEKLGDFTFDENVAEVFPDMIQRSVPGYSNIITAIGMLAE RFVTADSNVYDLGCSRGAATLSARRNIKQANVKIIGVDNSQPMAERARQH IHAYHSEIPVEILCDDIRNIAIENASMVILNFTLQFLPPEDRRALLEKIY RGLNQGGLLVLSEKFRFEDETINNLLIDLHHTFKRANGYSELEVSQKRAA LENVMRIDSINTHKVRLKNVGFSHVELWFQCFNFGSMIAIK >MS0945 smtA, SmtA protein MWHAKHATELKLPTSWQQIPNGTLYCNALNRYFSHWLSNILGDQILKLGG LSAEIGLDLPMRHQLVISPEIPQNLTALCLHPCTSVVRSKVTELPLIEES IDACLLANNLNFCADPHRLLREITRVTTESGLLFISLFNPLSILAFKRQF HQTPYEKFPFRQYPTWLIIDWLELLNFDILQCENLALQHRQHFSLFSPLT VIIAQKRTCSLSSQAQKIQFHQEDVFSPEAAFKRINE >MS0389 sodA, SodA protein MAYTLPELGYAYDALEPHFDALTMEIHHSKHHQTYVNNANAAVEAAVKNV PALAEYLDACPGKILKNLDKVAAENRTAVRNNVGGHANHSLFWKALKTGT TLQGALKDAIIRDFGSVEAFQAEFEKAAATRFGSGWAWLVVQEGGKLAVV STANQDSPIMGKEIAGCEGFPLFCLDVWEHAYYLKFQNRRPDYIKEFWNV VNWDFAAERFEKKLAECGCAK >MS1704 sodC, SodC protein MFIFHTMIVSLWHLPTFYLFTGAYMKKTVILLGLFTLSGAAIAEEAKNVQ TEIKSKVIEVSLLDPVKGDKAIGQVVVTESPYGLVFTPELNGLTAGLHGF HLHQNPSCAAGEKDGKKVAGLGAGGHWDPKEAKRHGFPWEDNAHLGDLPA LAVNADGTASNPVLAPRLKSLDEIADKSIMIHVGGDNHSDHPAALGGGGA RMACGVIK >MS0886 soxR, SoxR protein MNINEIVKKTNLTAKSIRFYEEKGLITAPQRALNGYRQYNQKHVEELNLL HQARLVGFSLPECKELLELYKDPHRRSADVKAKTLARIAEIDNQIGKLQQ MRQQLQTLANQCPGDGSEHCPIIEGLSKPNCCDHHAEKK >MS0468 soxR, SoxR protein MRIGQLAKAVGCTIETIRYYENQGLLAKPQRSANNFRYYTNDHLQQLSFI CYCRSLDMSLHEIKMLLNLDRSSGQRAEEINLLLDKHIRDVAKRLHELAH LRMELIKLKQKCSEMTGENLMQNIFSGGNIRFRKIK >MS1385 soxR, SoxR protein MNSQKKFYTISQLAEKLAITTHTLRFYEKEGLLPSVQRDQNGNRLFIQAD VEWLELLICLKNTGMPLKEIKRFVEWLNYGDSTIEQRLQLFQAQVTKVEQ QIAELQRHLEILKYKRQFYQCAKELGSVQAVLDTQLQQQFAEQNILLPVS 
PLSMAENE >MS1433 soxR, SoxR protein MMMKINELSKKSGINLETIRYYEKTGLLPEPKRAANGYRVYDQQSLSQLN FIKSCRWLGFSIDEIKQLNELKNTPKHHCVADEMILSHLKQVEEKIARLL EIQTFLQNLVNHEEHSVEECRAISGLSQER >MS0179 soxR, SoxR protein MEQTLKQGIFMHIKEFSTKIGLSIDTLRYYEKEGLLNPARNKSGYRNYGK QDLEWIAFILKLKAMGVPLTQIKEYARLRYLGDTTIPERYAILQAHNQKL VEQEKEIKKYQQFLAHKLSIYEKVMKKQN >MS0241 spoT, SpoT protein MVAVRVSHLLNPKDFIIEDWCAGLGLTPDVEKNIVRAWYYAQEKAQQLFQ NSHWYLRDGVEMVEILHGLNMDADSLLTAMLFPIVNAKIVNQEQIKEDFG PHIWKLLKGVIEMNNIRQLNTTDSNAQVDNIRRMLLAMVDDFRCVIIKLA ERITYLRDAEKRYSKQDKVAAAKECSNIYAPLANRLGIGQLKWELEDYCF RNLQPEQYRIIAIKLNERRLDREQYIADFVQRVSQYLDESVTGAEIYGRP KHIYSIWRKMQKKHLDFSQLYDIRAVRIIVPALQDCYTALGIVHTHFKHL PDQFDDYIANPKPNGYQSIHTVVLGEGDKPIEVQIRTKKMHDDAELGVAA HWKYKEGNTGSLSAYEEKIIWLRKLLAWQHDISNSGEVVPELRTQVFDDR VYVFTPKGEVVDLPAGSTPLDFAYAIHSDVGHRCIGAKVGGRIVPFTYQL QMGDQIDIITQKNPNPSRDWLNPSLGFTHTAKARSKIQAWFKKLDREKNI PIGKEQLENELNRLAITLKQVEPIALPRYNLKSIDDLYSGIGSGDIRLNH LINFLQAKLIKPTAQEADEEVLRQVTKTANSAANQQKNEKNKGYVIVEGV GNLMHHIARCCQPIPGDDIEGYITLGRGISIHRTDCEQLAELKAAHPERV VESIWGENYNSASGFNLSIRVIANDRNGLLRDITTVLANDKISVANVTTR LDSKRQLATMDLEIQLKNVQILGKVITRLTKLDDVIEVKRL >MS1736 spoT, SpoT protein MYLFEPLNKIIQGYLPSEHIDLIKRAFVIARDAHEGQFRSSGEPYITHPV AVASIIAEMRLDHEAIMAALLHDVIEDTPYTEEQLTTEFGKSVAEIVEGV SKLDKLKFRTRQEAQAESFRKMILAMTKDIRVVLIKLADRTHNMRTLGSL RSDKRRRIAKETLEIYSPLAHRLGIEKVKNELEDLCFQAMHPQRYAVLNK VIQVARNTRQELVHPILVTIQQRLEEVGINAQVFSEEKPLFYIYQNMRLR NQQFRSIMDISNFRIIVDSIDNCYRVLGQMHQLFKPRPGQIKDYIAVPKA NGYQALHTSTIGPHGVAVEIQIRTEEMNLIAELGVTAHWVYKPGGKNDTT TAQIKAQHWLQSIIELQQSAGNSFEFIESVKSDLFSDEIYVFTPKGRIIE LPAGATPIDFAYAVHTSIGSTCVGAKVDRETYPLSQALRSGQTVEVITSP NATPNANWLNFVVTGRARAKIRQTLKTLRLEEAINLGRYQLLHALAGKHL EDLDPAIVHHVLTELNLDTMDDLLAEVGLGNQLSTVIARRLQGESLAIYT DIEEVNNQERLPIKGMDGLLVNFAKCCHPIPGDSIVAYANPGKGLVVHHE NCRNLKKRTTQSVPFIKVEWEQCDHSAEFEAELHINMVAQQGALANLTAA ISAAQSNIHSIWTEESEGRICHVTLTLSAKDTKHLANIMRKIKSLSGVQS VERNINE >MS0474 spoU, SpoU protein MSENIYGIHAVNSFLATAPERLIEVYVLKGREDKRLQPLLKELHQLGISV QFLNRQTLDNKANGEVHQGIIARVQPAKELNENDLERILSNNKDPLLLVL 
DGVTDPHNLGACLRTADAAGVCAVIVPKDKSAQLTSIARKVACGAAEVVP LIRVTNLARTLRDLQQSHNIWVVGTAGEATETLYQTKLIGPLALVMGAEG DGMRRLTREHCDQLISIPMAGSVSSLNVSVATGVCLFEIVRQRLS >MS0248 spoU, SpoU protein MNDKSNKAAFQPAKFQRPSNQKRFHERTVGERQEKFGQSRAQFSRSQDND RFAPQDRRSNKDFDRKERQNPAKNDRTFERRNERPIPQETKITETKLGNV KVVMKRSGVSEQPRVKKTGSLSPRAPEKIKKNRAEEMKVYGESACLALFA ERPESIVRVWATVEMAHKIGDMFSYLAANKKVYHVVERAELELVSGTEHH GGICMLVKKARPFTLTGYLDIPRQQDALIILDNVRNPQNIGGIIRTCAFY GVKGVIVDNAELLNSAAAMRVAEGGMEYIHQLQTESPDDALAKLRKAGYQ VVHTTTNKQAKGVHKLQLAKKVVFVLTESENPALVQSGDEVINLSFANPL KTGLNVAVAAGVLLAKLDK >MS1107 sppA, SppA protein MNIVFLFFVLLLAAIVSLTTMVKEKPNLTGDQGALLVNLNGYLADEREDG LNWRNALKKLNDEQVASQYSTFDVVYAIENAANDERIKGLVLDLNYLDGG DLPALDYVGKAIRDFQKSGKKVIAYADNYSQSQYFLASYADEIYLNPIGE VGIEGLSAQNLYFKSMLEKLEITPHVFRVGTYKSAVEPLLRDDMSPEAKA NTEQWLGTMWSNYQERIAENRNIAKNSVLPEAGVYVDELKALNGDITAYA KKHKFVTQVASRLKLSQNLTALFGENEQNEPKTVDFDTYLAALPDRLKGD SSDFVQAKNKIAVINIEGTIVDGETNEQGVGGDSIAQLLRKAYKDKNVKA VVLRVNSPGGSAFASEVIRQEAENLQTAGKPVVVSMGAMAASGGYWISST ADYIVADKNTLTGSIGIFAVLPTLENTIKKAGISADGVTTSALVSPSGFS PLTAELKDSLQLQIEHGYERFLSVVSKGRSLTKQQVDNVAQGRVWLGEDA YKMKLVDELGDFDTAVRKAQELANGKLAESEKTDTFSVEWITDENTGLLG GLMKNITQSSQNVIQNAVLKTMGLPKEVKQLQKQLGILTQFNDPKGQYLY CLNCSEVK >MS1146 sppA, SppA protein MWSEILVGYGIFILEILTILLVIAGIVAAIMTLKQQKNPQTGELKLTDLS EQYQDNVKKLKDFRLTDEELKQAEKARKKADKQKAKENKAKNKKGEKTEE SLKPCVYVMDFKGDIRASETAALREEISAILNVANPATDEVLLRLESPGG VVHGYGLAASQLARLKQKGIKLTVAVDKVAASGGYMMACVADKIVAAPFA VIGSIGVVAQVPNIHRLLKKHDVDVDVMTAGEYKRTVTFVGENTEKGKQK FQQELEETHDLFKQFVTANRPLVDIDKIATGEHWFGQQALALNLVDEIAT SDDLILDAMQDKSVIGVKYAVKKSLIQKLGKQAEESSDKLLLKWLKQGNK TLM >MS1436 sppA, SppA protein MSDSTSNFTIDDYDHYFYVGPINMVGYYRLCKEISKHKNKDKVLLCLVTY GGDPDAGFRIGRALQHHYNGEVTIYIPNVCKSAGTLTTIAAKHIIMDNKG ELGPLDVQLRKTDELGASNSGLDIFKTLDTLEDRANTAFNKYLRAVRFGQ GLSTKMSVEIATRLVDTIIKPIAEQIDPMKIGEHQRATDIAIEYGNRLNQ TSKCLKDDMQSLDKLIRGYPSHGFVIDRKEARTLFSCVTAANEDIIHRYQ TIHNATESNPNIIGSDLYVEYLEDEQNETSNANNESSSNSTPNDKSGKST RSSTKS >MS1088 spr, Spr protein 
MRTKIKTFCLTISVAILSACSSNISSITYKGRIDDPIMAIVLLSEQQREW AGAPYVLGGVSRSGVDCSGFVQTTFMDRFNIALPRTTAAQSGYGQKISLS DIQTGDLVFFKTGRGPNGYHVGIYVKNDKFLHASTKGGVIYSSMNSPYWK NAYWQTRRI >MS1333 spr, Spr protein MMIKKFLLVTAALVMTACSNSSRLDAAVYPSADETNDTQLTELIGSLKTN KPQYDVRSNSIHSTKNAQINNKKLMQVYSAWAGTRYRLGGTTTRGIDCSA FMQEAFSTAFGIDLPRSTSEQRSVGKKIQKSELKQGDLVFFRGNRHVGVY LGGNRFMHSSTKEGVTISSLDDGYWSRTYTQSRRVL >MS1698 sprT, SprT protein MFIYKKSAEFNRTFKSDFKYRKIATQLEGLFIFYKLDLNRLQKINSSCIS GKNTPRDHAESGYGGERKVDESGIGIGK >MS1699 sprT, SprT protein MENISELTGFRHLKMQVQRRLTNCLTLAETHFHRSFPMPTVTYQVRGMKA GVAYLQQNEIRLNRTLLLENSAEFIGQVVPHELAHLLVYQVFGRVKPHGV EWQTVMNNVFDLPANVYHRFDVKSVQGETFTYQCQCRTHQLSVRRHTRIQ RDHAVYFCRKCRSCLSFVSG >MS1950 srmB, SrmB protein MRYNFPQFYNLSHLRIFMPQPQFEDFDLSPELLKALAQKGYARPTAIQSE AIPAAMDERDVLGSAPTGTGKTAAFLLPAIQHLLDYPRRKPGAPRVLVLT PTRELAMQVAQQAEELAQFTKLSIATITGGVAYQNHGEIFNKNQDIVVAT PGRLLQYIKEENFDCRAVEILIFDEADRMLQMGFGQDAEKISAETRWRKQ TFLFSATLEGELLVDFAERILTDPVKIDAEPSRRERKKINQWYYHADSYE HKVKLLARFIADEQVSKGIVFVRRREDVRELSEILRKRGIRSTYLEGEMA QTQRNNAIDKLKNGIVTLLVATDVAARGIDIEDISHVMNFDLPYNADTYL HRIGRTARAGKKGTAVSFVEGHDYKYLGKIKRYTEELLKPRIIEGLEPRT KAPKDGEIKTVSKKQKAYIRQKREEKRKTTQKKAKLRRQDTKNIGKRRTP KAVSEAQAKEIR >MS1836 srmB, SrmB protein MSLDHLSQQRFADLPLNAKVLEALESNGFEYCTPIQALSLPISLAGKDVA GQAQTGTGKTMAFLTATFHHLLEHPVKTNHPRALIMAPTRELAVQIAHDA ERMVKTTGLKTALAYGGDGYDKQLKAIEAGADIIIGTTGRIIDYVKQNII ALSHIQVVVLDEADRMFDLGFIKDIRYLMRKCPSPKQRLTLLFSATLSYK VRELAFEDMNDPEYVEVEPLQKTGHRIKEELFYPSNEDKMPLLITLLEEE WPERCIIFANTKHQCEKIWGYLAADGHRVGLLTGDVAQKKRLSLLKQFTD GALDILVATDVAARGLHIPDVTHVFNYDLPDDREDYVHRIGRTGRAGESG VSISFACEEYAMNLPAIEEYIGHHIAVSQYDSDSLIRDLAKPYRLKPSLP ASNRHNRNGAKPFKKRF >MS0495 srmB, SrmB protein MTETKITFGDLGLPEFILSAVSDMGFETPSPIQQACIPHLLNGRDVLGMA QTGSGKTAAFSLPLLAQIDIEEKHPQMLVMAPTRELAIQVAEACELFTKN AKGVHIATLYGGQRYDIQLRALRQGAQVVVGTPGRILDHIRRGTLNLSEL KFIVLDEADEMLRMGFIDDVETVMAELPAQHQTALFSATMPEPIRRITKR FMTDPQEVKIQSTQRTNPDIAQSCWYVRGYRKNEALLRFLEVEDFDGAII FTRTKTGTLDVTELLEKHGFRAAALNGDMTQQLREQTLDRLRNGSLDILV 
ATDVAARGLDVERISLVVNYDIPLDAESYVHRIGRTGRAGRSGSAILFVE PRERRLLSNIERLMKKPIEEVDVPNHEALQARRREKFKAKITKQLEHHDL EQYRLLLEGLFTPDQDQEDIAAAMLMLLQGKQKLILPPEPPMEKRGRRER DDRRGERGDRRERRPEERRGYGNPQPMDLYRIEVGRADGVDVRHIVGAIA NEGDINSRNIGHIKLYDEYSTVELPQGMPKELLQVFGKARVLNKQMRMTF VSEAGETVGRERHEGRRNDRRDNGFRREERRFNDRGNRSFNERAPRREFR ERNDRRDRRDRRS >MS0694 srmR, SrmR protein MSTYLLDAKLAQKIVQRTMDIIDCNINIMDAKGKIIASGDVNRIGEIHDG ALLVLSQGRVVDINEAVIHSLHGVRPGINLPLRVDGEIVGVIGLTGEPTT LKEFGKLVCMTAEMMLEQARLFNILAQDTRLKEELVLNLINTDKITPSIV EWANRLGVDLSIPRVACIIEVDSGQLGIENARSELQNLQTLLKIPERDNL VAVLSLTELVVLKPALNSFGRWEVDDHLERINQLLSRMNEKAKLNVRISL GNYFTTEDSISLSYHTAKTTLTIGKARYPKQRIYNYQDLILPVLLDQLRD GWQKEELERPIKKLKLMDNNGVLLKTLLAWFENNMQTIATAKALYVHRNT LEYRLNKIADLTGLDLNSTDNRFLLYMALHVAV >MS0585 ssb, Ssb protein MAGINKVIIVGHLGNDPEIRTMPNGEAVANISVATSESWTDKNTGERREV TEWHRIVFYRRQAEVAGEYLRKGSQVYVEGRLRTRKWQDQNGQDRYTTEI QGDVLQMLGGRGQTADAGFAAPQPNQSFSRPQASAARQQPATRPAPAAEP AMDNFDDDIPF >MS1141 sseA, SseA protein MAGMVGDLDDYDRLLANISLSETDTMPHLRIFMKYTALFAIFSLFLTACN DNKVQPIDTAELLQNLNNPQYVIIDSRNDSLYNGFKDKHASRGGHIKGSI QFTCSWFDSIEAGKFDSFAESKGITKNKTLVIYDSNPDNLACISAEFAAK GYKVRTFSDFISYVNAGYPLESLLNFQYSVSPEWVYSVLQGEKPESYTND DFMLFEVSWGALENAKAYTQHIVGAYHFDTDWVEGEAPVHNLLEPATIER NLLKNGITKDKTIILYSDNPLAAYRIFWALKWAGVEDVRVLNGNLSTWMD SGFPTETKVNIPQPVNNFGGHIPTNPQLSIAQPQQAYARQQQGLKLISSR AWEEYIGEVSGDDAIQATGEPQGAIWGFSGSAPSNVADFYDPDDTLRNPK EIEALWQELGIVQGDQLAFYCGTGWRASVPWFMTQLLGWRNTAVYDGGWN AWQMTELPVQKGAPTGLLKPDAKNDSGRMLKKTNSCRG >MS1140 sseA, SseA protein MNIKLTLIATAVALTLSACDDKSVKEIKTDELLKNLDNPEFVIIDGRSDS LYNGFKDGDAKRGGHIKGAVQFSCNWLAHIADDKFEKFAKDKGLTKDKTL VFYDSNSEQLNCLSDKFAEKGYKVRVFKDYLSYANSDNPLEAFPNFEYSV SPEWVNAVIKGEKPESYQNDDFIVFHVGWGPVEKSEEYKQHIPGAFHFNT DWVENDPVWNLSDPKIIEQNLLNAGINKDKTIILYSDNQLAAYRIFWALK WAGVKDVRVLNGNLTTWTKAGFATETAVNTPTPVSQFGAEIPLNPQINIS MPQEAIARQKQGLKLISNRAWDEYTGKISGYSYIPGKGEPQGAIWGFAGT DASNMADYYDVDGTLRNPKEIFALWKEQNINQGDPIAFYCGTGWRAGVSW FMTQLAGWDNAYVYDGGWNAWQMDSVFPVQKGAPNNMAKPDSKNDFGQK >MS1908 ssnA, SsnA protein 
MKNHVRSFKTYIRDEIIKKGGWVNAHAHADRAFTMTPEKIHIYHNSNLQQ KWDLVDEVKRTSSVEYYYARFCQSIELMISQGVTAFGTFVDIDPICEDRA IIAAHKARDVYKNDIILKFANQTLKGVIEPTARKWFDIGSEMVDMIGGLP YRDELDYGRGLEAMDILLDKAKSLGIMCHVHVDQFNTPKEKETEQLCDKT IEHGMQGRVVAIHGISIGAHSREYRYELYKKMREAQMMIIACPMAWIDSN RKEELMPFHNALTPADEMIPEGITVALGTDNICDYMVPLCEGDMWQELSL LAAGCRFPNLDEMVNIASINGRKVLGLDR >MS1280 sspB, SspB protein MKNKMEYKSSPKRPYLLRAYYDWLVDNEFTPYLVVDATYYGVDVPQEYVR DGQIVLNLSSGAVANLQLTNDAVMFNARFQGVPREIYIPLGAALAIYARE NGDRSDVRT >MS0832 sstT, SstT protein MNISRLFSFLFHGNLVKRISIGLLLGIIFALVSPSLESALGFHLAEKMGL LGQIFVRSLRSVAPILVFVLVIAAIANKKVGSKSNMKDIIYLYLIGTFLS ALTAVFASFMFPTTIALATNEAELSPPGKITEVLTALIFNVVDNPITALF NANFIGILAWAIGLGITLRYASETTKNVMNDFAEAVSKIVHFIISFAPIG VFGLVASTLADKGLSALLDYVQLLAVLVGSMLFVAFVINPIIVFWKIRRN PYPLVWECIRVSGVTAFFTRSSAANIPVNMELAKRLNLDEETYSVSIPLG ATINMGGAAITITVLTLAAVFTLGIEVSIPTAILLSLVASICACGASGVA GGSLLLIPLACSLFGISNDIAAQVIGVGFIIGVLQDSTETALNSSTDVLF TAAACMSEERKNS >MS1355 sucA, SucA protein MQKNSPLEEWLASTALGGANQSYIEDLYEDYLRDPAGIEESWRKTFDSLP KSTAVEQPHSQIRSYFQQLARDSNSQSGASVIDPNVSKRLVKVLQWVNAH RNRGHLHANLDPLNLWQRLDAPTLDYKFFGFTDNDLDEVFDIGNYVYNRD KITLRDLAYALKNTYCSTIGLEFMHVNDLEARTWLQRKVENLLNKPLFSK DEQVKFLEELTAADGLERYLGAKFPGAKRFSLEGSDAFIPLMKEIIRHGA RNGVKEIVMGMAHRGRLNMLVNVLGKKPAELFDEFAGKHQGNGTGDVKYH QGYSSDFMTDAGLVHLALAFNPSHLEIVSPVVSGSVRARQKRIGDDHFTK VLPITVHGDSAVIGQGVVQETLNMSSTRGYTVGGTIRIVINNQIGFTTSN TRDTRSTEYCTDIAKMIEAPVIHVNGDDPEAVAYAARMAVEYRTLFKRDI FIDLVSYRRHGHNEADEPLATQPVMYKLIKQHPTPRKVYADRLVAEGVIT ESKATELMNNYRDGLDRGDCVVPEWRPLDTQKMDWTSFLTQEWTPYNGKF DPQRFKDLARKVCEYPENHPIHPRVQKIYADRLLMANGEKLFDWGMAETM AYATLLDDGHHIRISGEDSGRGTFFHRLAVLHNMNERKAYIPLMNLHQGQ GHFEVWDSVLSEEAVLAFEYGYATAAPKTLTIWEAQFGDFANGAQVVIDQ FISSGEQKWGRMCGLVMLLPHGYEGKGPEHSSARLERYLQLCAQQNMQVC VPSTPAQIYHLLRRQMLRTVRRPLVVISPKSLLRHPLAVSTTEELVNGTF QNVIPEIDDLDPKQVKRVVFCAGKVYYDLLEQRRKNNQTDIAIIRIEQLY PYPHEEMRDILTAYSHVKDYVWCQEEPLNQGAWYCSQHNFVANLPADGKL RYVGRPASASPAVGYLALHNEQQKALVAEALAI >MS1352 sucC, SucC protein MNLHEYQAKQIFAQYGLPVSEGCACQSLEEAIQAVKKLGGGQWVAKCQVH 
AGGRGKAGGVKLVKSEEEVRSFFEKFLGQRLVTFQTDAKGQPVNAIYMEA CANVKKELYLGAVLDRSSQRIVFMVSTEGGVNIEEVAEKTPHLLHKMPID PLVGAMPYQGRELAFKLGLQGKQIQQFAQIFCQLGKMFVEKDLSLLEINP LVILDNDQLHCLDAKIVVDGNALYRQPELNAMRDPSQEDAREAAAEQWHL NYVALEGNIGCMVNGAGLAMGTMDIVKLHGGQPANFLDVGGGTTKERVAE AFKIILSDQSVKAILVNIFGGIVRCDLIAEGIVAAVNEVGVSVPVVVRLE GNNAPLGREILAQSGLNIIAATSLTDAAVQVVNAAEGK >MS1351 sucD, SucD protein MAILINKETKVICQGFTGGQGTFHSEQALAYGTKLVGGVSPGKGGSIHLG LPVFNTVKEAVEQTGATATVIYVPAPFCKDAILEAIAAGLKLIVCITEGI PTLDMLQVKQKLDESGVVMIGPNCPGIIVPDECKIGIMPGYIHKKGRVGI VSRSGTLTYEAVKQTTDEGFGQSACVGIGGDPIPGSNFIDILKLYQADPK TEAIVMIGEIGGSAEEEAAEFIKAHVTKPVVSYIAGITAPKGKRMGHAGA IISGGKGTADDKIAALQSAGVICVKSLADIGAALKTVLK >MS0511 sufI, SufI protein MENYSRRRLFKKTLIATALVATPAPLLAASRQPLVIPPLLESRRGRPVIL STESSQTALVDGKLVEVWGFNGRYLGPTVRVKQGDFVKLNYRNNLTQLVA LNIQGLQTSGELLGSIGHSLKPGEGWAPIVPITQSAGTCYYHSCTLASSA YQNYRGLVGMWIIDDDESRKANLPNKYGVNDIPLILQDQRINSAGTQLFQ QNEPHFYGERLFVNGQEAPYLNIPRGWVRLRILNASLSRGYDLRMDDERD VLIIAQDQGFLPQSKTVKQFFVGPGERVEILVDLNEGENVSLIVGAKRGL LDKAKLLFNSNGELADNTVLELRPEGLLSVFNGKPSFQFSEAAVLPTQIK QERSFHLDATNGMINQKRFDPRRIDVNAKQGSVERWTISSSIPTGFRIQG ARFVIESIDDKATDAAELVWKDTVWINGKVRILVKFDNLSSNTQPFIFGS SDLMQADKGAIGLIVVQ >MS0876 sufI, SufI protein MPKPVNRTTPANVVVKLEAADRMMEIMPGVKFKYWTFNGSTPAPFIRVRE GDTVEVHLSNPINSGLPHSLDFHASAAPDGTAMVSSTKPGRTTVYRFKTL SSGLYVYHCASIPGAGTHIGKGMFGLMLVEPKEGFPPADKEFYIMQNEFY TNGSFGEQGLQVFSTEKAAYELPDYVVFNGHYGSMQGEKALKAKVGEKIR FYVGNAGPNKASSFHLIGKTFDTVYVEGGTLQNHNVQTTLIPSGGAMISE VTIPVPGQYSFIDHSIFRADKGARGTLMIEGDENPEIFSGKLRDEPYDKR NPDSDIDTGFKH >MS1729 suhB, SuhB protein MEIIMNPMLNIAIRAARKAGNVIAKSYERRDDIQTTLKSANDYVTNVDKA AEQAIIDVIRTSYPDHTIITEESGALEGKDSDIQWVIDPLDGTTNFVKGL PHFSVSIAIRVKGRTEVGVVYDPIRNELFTAVRGEGAKLNDLRLRVDAKR ELEGAILATGFPFKQARHMPLHFAVMNSLIESCADFRRTGSAALDLCYVA ANRVDGFFEIGLKPWDCAAGDLIVREAGGLVTDYNGGHSYLTSGHIVAAS ARVVKEILNRLQPLLGDEFKK >MS2203 sun, Sun protein MKNSAKTTALLSPKKAFKRPQTPAHKKIQSVRALSARIILQVLDQGKSLS ALIPELQSQVKAQDLPLLQEICFGVCRVLPRLEQIIKKLVDKPLKGKTRI VHCLLLVGLYQILYTRIPAHAAVDEVVNATSALKSENFRGLVNGVLRRFL 
REQQEILAVVDKNWQTLHPEWFVNKLKKAYPNWREIIEANNQKPPMWIRV NQQLCTTENYRTLLLTEQELDSFKDENPNALRLAQPTAVQNLPHFTEGWV TVQDVHAQWSALLLEAKNGDLILDACAAPGGKTTHILELAPQAKVIALDV EQSRLNRVAENLARLNQQAVLICGDATKPDDWLPNAAERLIGKKTDLQFD RILLDAPCSATGVIRRHPDIKWLRQEQDIGQLAALQQQILTALWKKLKPN GILLYATCSLLPQENSEQIRTFLANTPDAKLEPLPFATQENEIGKQFIPS EFSGDGFYYAKLRKTDKKGG >MS1843 surA, SurA protein MVMEKMHNASNSIFSKIIFALISVAFVVSGMAGYMVATADTSAVKINGEE ISQQAFQQQYNDEYQRLSQQLGAQFSAVADTPEFSEGLRKSVLNRLIDQE LLRQYVTDLKLVASDASVKQEIVTTPAFQADGKFDNNAYQQTLRANNMTA DMYAEYVREALRLDQLQSGLAGTVLMLPAQQEEFAKLFFQKRTFRLAKLP LTAEMAKQTVTDQEVADYYNANKSAFMVPELVKVQYLDITRAAAEKAVKV TDVEIQQYYQDNKAQFVSKAQDRLAHIQFAKETDALDAYQALQNGADFAA LAKEKSLDKPSAVNGGELGWLNAGDLPKAFEEAAAALQIGQYSQPVKVDN QYHIIKLEDRKEPKAQSLEEVKDLIASQIRQDLLNNQFYSLEKQVAEKAF EDQSSLEAAAKVAGVEIKETDYFSRKDIPAALNFPSVASAIFDGDISQGN QNSEPMNVADQHSIVVRVVDHKAEGTKSLEEAKAEITAYLKRQKAETVML EQANKTVEELNLGKQPALNFAAAETWVYAENKDPALNNAIFSMAKPEQDK TTYAAAKADNGDVVIVALTAVENGEVSAEQGAQFAAQVMQAEQTDLQANL LKSLRNKAKIEVNEEFMKQSQD >MS0629 surA, SurA protein MTVATKWRLFQNKLLSLNFLLKSVSNERNLLNFNNGIMMKPGNLKSLLLV MIGMFAVSVNVQAVEQVVATVDGTPILESQLKRALGKQANNATNRAKALD KIIDDMLVQKAVKEANVHISEGQLDKIVENIAAQNNMTYGQLLDALDYQG IGITKFRNNIRNQLMMAEVRNRSIGKNIDVTREQVETLSKQMLEQAKTQG KKAQVTGTEYQVRHILLKLNPLLNDAQAKAQLNQICSDIQSGKTTFAAAA KDYSKDYLSGANGGDLGYAFPEIYDPAFGQVIKATKKGVISAPFKTQFGW HILEVTDTRQGDMTEAAYRQKAYETLVNQQLQDDAKDWVKALRKGAEIKY LVK >MS2272 surE, SurE protein MLFFMGFMQNIPHKILIIIKEKNRKIMNILLSNDDGYHAEGIQILARELR KFADVTIVAPDRNRSAASGSLTLVEPLRPRHLDDGDYCVNGTPADCVHLA LNGFLSGRMDLVVSGINAGVNLGDDVIYSGTVAAALEGRHLGLPSIAVSL DGRRYYETAARVVCDLIPKLHTRLLNPREIININVPDIPYDQIKGIKVCR LGHRAASAEVIKQQDPRGESIYWIGPAALPEDDEEGTDFHAVNNGYVAIT PIQVDMTSYNSMSALQDWLESE >MS2356 tag, Tag protein MKKRCPWAEGSQLYRDYHDNEWGKAEFDSRKLFEKICLEGQQAGLSWITV LKKRENYRRAFHQFCPEKIVRMTDQDIDKLMLDKGLIRHRAKLMAIVKNA KAYLLMEKCGENFSNFVWSFVNNQPQINDCPDLTAVPAKTECSKALSKAL KKRGFVFVGETTCYAFMQSMGLVDDHINDCFCKHK >MS0659 tagB, TagB protein MTRFVKLSIAILTAMVGWILYFISGFVTRDTKKIVFGTHTGTFSGNVKAL 
YLDESYKKDAIKIFIYRNENIKASLEALDGKPLYFSYLSFKGIYHTLTAG TFVYSSYASDINYWLSKNAKLFNVWHGTPLKKIERDVTTGFYSIRNKYEF LFKYIYPHLFVRPNQLLVCSEYEKACFKTAFDVSNEAFVEAFPPRLNTLK EDYINEINKNFIIYAPTWRDDSSFQFYKNCDLNALNEVMEQKGLTFLIKP HPSDKMKHLDKKYSHIKLAPLSEDFYVLVKKAKLAVTDYSSVMFDCMYCN IPVVLFCPDLKSYMLNSRDFYCDIKELPFPLMESGNDFIRILNNNFTACN SDKFLPYKNTLT >MS0658 tagD, TagD protein MAKTIITYGTFDLFHIGHLRLLQRLKKLGDKLIVAVSTDEFNEGKGKKTV IPYEQRAEIVANIKCVDLVIPETAWEQKITDVQKYDVDVFAIGNDWEGKF DFLKEYCDVVYLERTKDISSTQLKQTLKSFSISKDEILKAFDILEQLKRD LE >MS2093 tas, Tas protein MKMRKLGSQGLLVSEMGLGCMGMDHGYGKPVDRQAMITLIHKAIELGCNL FDTAPIYGFDNEELLGNALKDHRENVVIATKFGVLDMELVDGQPVPVLDS SPASIREQLDGSLQRLQTDYIDLFYQHRVDPKVEPEVVAQIMKALIAEGK IKYWGISNAPADYIRRAHAVCPVSAIEDQYSMMWRKPEQDLFPMCDELGI GFMAYSPLGNGFLSGKVAQNTEYQEGDFRGQMGRFKPEVMAQNQALLDLI AEIAERKNATSAQVVLAWELAQKPYIVPIPGTTKLHRLEENFKGAEIELS AEELADLNTALSKLDINETFF >MS1418 tas, Tas protein MKKHKESTMKNLNLPKIMLGTWSWGAGMYGGDQVFGNSIEAKDVKEVFDL AVKNGLNAFDTATAYGLGASEEILGELMSAYQREELIISTKFTPQLAEMY DNSVQKMFDASAKRFNTDYIDIYWIHNPTDVERWTSGLIPLLKAGKVKAV GISNHNLAQIKRVNEILGAEGYKLDAVQNHFSLLYRASEEAGILDYCKQN GITFFAYMTLEQGALSGKYSPENPMPAGSGRGETYNKVLPQLVKLTDKMR EIGEKQGASVAQIAVAYAIAKGTLPILGATKPHHITDAAKAMTIALSADE VTELEELAKATGVDTKGAWEEPMA >MS1422 tas, Tas protein MKYTKLGNSDLNVSRICLGCMGFGDAATGQHSWTIGEPDTREIIQYALEN GINFFDTAIAYQLGSSERFVGKALRDMTKREDVVVATKFLPRTQQQLADG VSGEQAILSSLDQSLQNLGMDYIDLYIYHIWDYNTPIEEVLQTLHKAKQS GKVREIGIANVYAWQLAKANALAEREGWSKFISVQNHYNLIMREDERELF GLCAEDNIALTPYSALASGRLSRLPNETSKRAVEDTYAKGKYDATAEQDS VIINRVAELAEKYGVSMTEISLAWLLTKVDAPIAGATKKSHIDGAVGAVN LTLSAEDLVYLEACYQPHNLVGIMAQNSYKTKDVKQVWSR >MS0500 tatA, TatA protein MGGISIWQLLIIVAIIVLLFGTKKLRTLGTDLGESVKGFKKAMNEDEPKD AEFKSLNKDESATAGSEKVKDKEQA >MS0501 tatA, TatA protein MFDIGFSELLLIFIVGLVVLGPKRLPVAIRTVMGWVRTIRGLAANVQNEL AQELKLQELQESIKKAENLNLKNLSPDLAKTVEELKASAEKMKADLDKAA AETNTTIDEQIQILREENAQTQSNDVATSDTVEKSIADEFSIKNDENPTA LSSVVSSVDSIQNGQSDLELDAQAEVDRQLAAMMDKYAPPDDVAENPIST EKTS >MS0502 tatC, TatC protein 
MSSVEESQPLITHLVELRNRLLRSIMFVLIVFCGLVYFSNDIYHLIATPL LEQMPQGSTMIATNVAAPFFTPIKLTGIAAVFLSVPYILYQVWAFVAPAL YQHEKRLIYPLLFSSTVLFYTGVAFAYYVVFPIVFGFLTKTAPDGVAIAT DISSYLDFVLTLFLAFGICFEVPVAIILLCWTGVTTPEDLKEKRPYIIVA AFIIGMLLTPPDIFSQTLLAVPMCLLFEIGLLFARFYRPKEDETDNGSSE LTKHKE >MS0571 tatD, TatD protein MFIVDSHCHLDSLDYEKLHSNVDEVIEKAKARGVKHLLSIGVALNRFQAM KTLLAHRDEVSFSCGVHPLDLAGETFDRQRLERYAKDEKVIAIGEIGLDY YYDQDRKNEQLDAFSQQIEVANQLNKPVIVHTRDAREDTIRLLRENHAEK CGGVIHCFTENLEFAKQALDLGFYISCSGIVTFKNAEEIRDVVRYVPADR LLVETDSPYLAPVPYRGKQNQPAYTREVCEYVAALKGVSAEEFALITTQN FERLFKINVL >MS0625 tatD, TatD protein MAFFDTHTHLDYLQRTTNTPLSALMENALNADVQKILIAAVMARDFENIL NMTELFPRHLYCGLGLHPLFIKNHQKSHLDELETYLQKNPQNLTALSEIG LERSVSELISDELWRRQCDFLEAQLYLAKQYKLPVNLHSRKSHDQLFTFL KRIRLPKCGVLHGFSGSYQQAKNFVDLGYKIGVGGVISYLRANKTRQAIA KLPLDSLLLETDTPDMPVFGFQGEANRPERLVQTFRYLCELRSEPPAQIQ QQIWRNSCEMFAVK >MS1370 tbpA, TbpA protein MQNTTCEKVAQAFSKKYNVKTQFVRNSTGTVLGKIKTEKDNPQGDVWYGG TLEPHLQAADLGLLEKYRSPNQKDILPIFKDLTEKRGEYTSVIYLMELSM GINSKKLASLNIEPPKCFADLLDPRFKNQIQYADPRVSGTGYSFLIALVS LWGEEKAFDFLAKLNKNIAQYTKSGLATSNLASGEVAVDISFFHTYVREK EKGAPVEGVYPCEGTAYTLGATSIIKGARNLDNAKLFTDWALTPEAQEVH WREADSYQLPANIHAQYYPGMHVPANPKIIDIDFIRFGSNEQSKRLIERW VNGILSNQPQ >MS1526 tbpA, TbpA protein MIKGFFMSKLKTFLFLTALFISDYGLSAVQPLHIYAEEYFAADWGPAPEV KAQFERAYPQCQVTIQSFDSRTTMLNRLRLEGKNTKADIVLGMDNHQLEA AEKTGLFAPAKVDFSRLSLPVEWKNTTFVPYEFSKYAFIYDKSKLTNPPE SLKELVEREDLKVIYQDPRTSSIGRGLLVWMNKIYPPEQVEKAWQQLQKH TLTVGKGWSETYGTFLKGEGDLVLSNNTSPIYHLLTDQKENYAATEFSEG ETLQIDFAGKIAGKHNICADEFLAFLLKPEIQNIIITKGVMLPVVEGDLE PHYAALKNAVMQGKTIDTLSVSAEQIKHWIEVWQRALSK >MS1583 tbpA, TbpA protein MKIKKISLAISTALLGAGLMFSAQANAKGRLVVYCSATNEMCEAVTKSFE KQYDVKTAFIRNGSGSTFAKIEAEKNNPQADVWYGGTFDPQAQAAEMGLL TPYRSKNIDDVMPRFQDPAKIKGNYASAIYMGILGFGVNTERLKKLGINE VPKCWKDLADPRLKGEIQIADPQSSGTAYTAIATFVQLWGEDQTFEFFKK LHPNISQYTKSGITPSNSTARGEATVGIGFLHDYAVQKAAGAPIEMIVPC EGTGYELGGLSIIKGARNMDNAKLFVDYVLSKEGQEVAWRKGNSHQTLTN IKAEQSPTAFDPTKLNLINYDFEKYGASDERKRLIEKWVQEVKLAK >MS1585 tbpA, TbpA protein 
MKISKIALSLSTVLLGSLMFSQNVAADTGRLVVYCSAQNTMCEQETLAFE KKTGIKTSFIRGGTGSILAKIDAEKANPQGDVWYGGTLDPHSQAGEMGLL VPYKSPNLQYIPDELKDPAKVKGNYSSAIYLGVLGFGVNTERLAKLKIPV PKCWKDLTDPRLKNEIQAADPQSSGTAYTALATFIQLWGEEQAYVYLKEL HKNVSQYTKSGNTATRNTARGEASIGIGFLHEHSLEKEKGAPVELIVPCE GTGYEIGGVSIIKGARNLENAKKFVDWALSKEAQELSWQKGETHQILTNS QAKQSPYALDFKSINLINYDFDKYGSSDLRKRLITKWVDDVKLAK >MS2189 tdcF, TdcF protein MILKETIMATTIHTENAPAAIGPYVQAVDLGNLVLTSGQIPVNPATGEVP ADISAQARQSLENVKAIIEQAGLTVADIVKTTVFVKDLNDFATVNAEYER FFKENDHPNFPARSCVEVARLPKDVGLEIEAIAVRK >MS0525 tdh, Tdh protein MMRSLVCKEPFHLILEERAKPQPKDEEVQLKVAAIGICGTDIHAYAGNQP FFEYPRVLGHEASGVITELGKNVDKFKVGQRVALIPYVSCGKCGACLSGK TNCCENISVIGVHQDGAFSEYLTAPAKNILPIADSVDFTTAALIEPFAIS AHAVRRAQITKGDDVLIVGAGPIGLGAAAIAHADGANVVIADTSEERRKH IQANIPVPTVNPINEKVEDYFNGRLPQIVIDATGNQKAMNNAVNLIRHGG RIVFVGLHKGTIEFSDPDFHKKETTLMGSRNATLEDFEKVQHLMSERKIS ANMMLTHTFKYDELAEIYEEKITKNQSLIKSVVLY >MS0956 tdh, Tdh protein MMEIKTLSCVVRGPKDVGVMEQSINYDESSKEQTLVKITRGGICGSDLHY YQYGKVGNYEIKHPMILGHEVIGTVVKTNAPDLYVGQKVAINPSKPCLTC KYCLSGDTNQCETMRFFGSAMYNPHVDGGFTQYKVVDNSQCIDYPQDVSD DIMAFAEPLAVTIHAAKQAGDLAGKRVFVSGVGPIGCLAVAAIKASGAKE IVVSDLSRRCLDLALEMGATKALNAKDDFSEYMAHKGEFDVSFEASGHPS SIERCLAVTKARGTIIQIGMGGAIPEFPIMTLIAKEICLKGSFRFIEEFN TSVEWLSSGKVNPLPLLSATFPYTELEKALIIAGDKDNISKVQLSFE >MS1764 tdk, Tdk protein MAKLYFYYSTMNAGKSTTLLQSAYNYNERNMNTLVYTAAIDDRFGAGRVT SRIGISQQAKLFNRESNLFEEIKRHLASEKLHCILIDEAQFLTKQQVYQL SDVVDLLNIPVLCYGLRTDFQAELFEGSQYLLAWADQLEELKTICHCGRK ANFVLRLNERGDVVKDGEQIQIGGNERYLSVCRFHFKQKTGKLHN >MS0026 tehA, TehA protein MSDTKPFPIPVNYFSMVLGLAGLGLAWRYAAIILPLPAIIGESVLTVATA IWLALIVIYIYKWKAYPEQAKAELYHPILGCFVSLIPITTILVAMGALPY SRETAVVLTASGIIGQLGFAMFRSAGLWRSGHPQEAATPVLYLPTVATNF VSANALGSLGYSEIGALFLGGGMLAWFFLEPAVQQRLRNLTALPENIRPI IGIQLAPAFVCCSAYLAINDGEIDLLAKGLVGYGLLQLFFLLRLMPWIAT KGFTMPFWAFSFGLASMAGVGLHTAHSSSSPYLELLGLAMFGFASCCITL LTLGTLSLIRKEKFLIKN >MS0634 terC, TerC protein MFEWIADPEAWIALLTLTALEIVLGIDNIIFISILVGRLPESQRQMGRIL GLGLAMCTRILLLLSLAWVMRLTTPLFTLFEQQISGRDLILLFGGLFLVA 
KSTHEIHATMHPDEGEDETKGKKISFLGTLMMIAILDIIFSLDSVITAVG MANNVEVMIIAIIIAVGVMMLAAKPIGDFVENNPTLKVLALSFLILIGVT LIAESLDFHIPKGYIYFAMAFSVTVEMINLRTRKHLTIKE >MS0948 tesB, TesB protein MSEVLDNLIHLLKLEQLDDALFRGGCQDLGFRQVFGGQVVAQALSAAMQV APKDRLLHSCHAYFLAPGDSQRPIIYDVETLREGRNFTALRVKAIQYGHP ICHVTASFQVEESGFEHQVKMPEIGSPEEFMSESDALKKAAQYIPESVRD KFTAERPFDIRAKYINNPFLGTELPPEQYIWVRANGKSPLDQNIQKCLLA YFSDFHCILTALHPHAKGFLQKGMKVATIDHSIWFHRSFDLNDWLLYAIE SNNAFAARGLARGQIFDKAGRLIATTQQEGLIRYVE >MS2301 tfoX, TfoX protein MNRTNKDTQWIRTILNSFLENEVTAKHLFVGYGLFYRKVMFGIVIDDNFF LKAENQLVEYVEKLGAVSWDIFNKNTNLAISSYYRLPRALVDNEEEFKTL VILSIKQQQRKILDLNIAKKERIKELPNLSIKHERLLAKIGINNVKEFKS AGISNCFVKLKVHGFSVNVELFWLFQAALKNKHVSLLTKSEKKSALLVLN RKLVEAGFREIKHECLI >MS1559 tgt, Tgt protein MKFKLKTTSGAARRGELTFNRPQGEFSVQTPAFMPVGTYGTVKGMTPEEV RATGAEILLGNTFHLWLRPGQEVMRKHGDLHDFMQWHRPILTDSGGFQVF SLGKLRKITEEGVKFQNPINGERIFLSPEKSMEIQYDLGSDIVMIFDECT PYPATFDYAKKSMEMSLRWAKRSRDRFDELGNKNALFGIVQGGTFEELRK VSAENLINIGFDGYAVGGLAVGEPKEEMHRILEFTTPLLPADKPRYLMGV GKPEDLVEGVRRGIDMFDCVMPTRNARNGHLFVTDGIVKIRNAKYKDDTS PLDPHCDCYTCQHYTKSYLYHLDKCGEILGARLNTIHNLRYYQRLMEQIR TAIEQDRFDDFVQEFYARMDKPVPPLQKA >MS0480 thdF, ThdF protein MMTKETIVAQATPIGRGGVGILRVSGPLATEVAKAVVDKELKPRMANYLP FKDEDGTILDQGIALYFKSPNSFTGEDVVEFQGHGGQVVLDLLLKRILQV KGVRLARPGEFSEQAFLNDKLDLAQAEAIADLINASSEQAARSALKSLQG EFSKKINQLVDSVIYLRTYVEAAIDFPDEEIDFLADGKIEGHLNDLIGQL DKVRSEAKQGSILREGMKVVIAGRPNAGKSSLLNALAGREAAIVTDIAGT TRDVLREHIHIDGMPLHIIDTAGLRDATDEVERIGITRAWNEIEQADRVI LMLDSTDPDSKDLDQAKAEFLSKLPGNIPVTIVRNKSDLSGEKESIEEQE GFTVIRLSAQTQQGVSLLREHLKQSMGYQTGTEGGFLARRRHLEALEHAA EHLQIGRVQLTQFHAGELLAEELRIVQDYLGEITGKFTSDDLLGNIFSSF CIGK >MS0677 thiD, ThiD protein MVIPQVLTIAGSDSGGGAGIQADLKTFQMRGVFGTSVITAVTAQNTLGVF DIHPIPLASIQAQLRAVAKDFSISAVKIGMLGNTEIIQCVADCLEQYQFS HIVLDPVMIAKGGATLLEQSAVAALKNLILPKACLITPNIPEAERITGTQ IKNEADIFNAAQIFHELGANTVVIKGGHHNNSQSKLCKDWVFTQKGYFTL EAPRFATPHTHGTGCTFSACLTAELAKGKPVEQAVRTAKNYITAAIGHPL NIGHGHGPTNHWAYQNEQD >MS0676 thiE, ThiE protein MNKIKSMLSVYFIAGSQDCRHLPGEPTENLLTILQRALEAGITCFQFREK 
GEQSLACDLQLKRRLALKCLQLCRQFQVPFIVNDDVELALSIQADGIHVG QKDTAVETILRNTRNKPIIGLSINTLAQALANKDRQDIDYFGVGPIFPTN SKADHSPLVGMNFIRQIRQLGIDKPCVAIGGIKEESAAILRRLGADGVAV ISAISHSVNIANTVKTLAQK >MS1004 thiF, ThiF protein MSELTHAEELRYNRQIVLKSVDFDGQETLKDSKMLIVGLGGLGCAASQYL AAAGIGHLTLLDFDTVSLSNLQRQVLHNDERLDMPKVESAKIALQAINPH IEINTINGLLSEEKLAEIIPHFDLVLDCTDNVAARNQLDLCCRQAKVPLI SGAAIRMEGQVSVFTYEPGTPTYGHLSRLFGENALSCVEAGVLSPIVGIV GSIQALEAIKVRLKIGKNLCGRLLMIDGLNMSIREIKIPSI >MS1062 thiI, ThiI protein MKFVIKLFPEIMIKSESVRKRFVKILTGNIRNVLNKYDDGVAVVKHWDYI EVRSKNEENRAILIDVLGRIPGIHHFLEVDEKPFTDMHDIFEQTLADVGA SLENKTFCVRVKRKGKHEFSSLDVERYVGGGLNQAIETAKVKLSNPDVTV RIDIENDHMMLIKARHEGIGGYPIGTQEDVLSLISGGFDSGVSSYMFIRR GSRVHYCFFNLGGAAHEIGVKQMAYHIWNRYSGSHKVRFVAINFENVVGE ILEKIDNGQMGVVLKRMMVRAASKVAERFGIQAIVTGEALGQVSSQTLTN LRLIDEAASALVLRPLITHDKEAIIAMAKQIGTDDIAKSMPEFCGVISKN PTVKAIKEKIEKEELNFNFDVLESAVQNAQYLDIRQIAEQTEKDVVKVDT VSVLSANDVILDIRSPEEHDERPFELAGHEVKHLPFYKLSSQFGDLDQSK NYVLYCERGVMSKLQALYLKENGFANVRVFAHGNIN >MS1423 thiJ, ThiJ protein MTTSIKPVLCVVTSAPIKGKSGIPTGFYLAELTHALDEIEKAGLKTVIAS VRGGQPPIDGFDLTDPVNAKYWNEGDLYERLANTPALSELNGADYSAVFF AGGHGTMWDFAQSAEVHRIVSEVYTSGGVVSAVCHGPAALVGAKLPNGEF VVNGKNIAAFTNAEEVEVEGDKLVPYMLQTELEKQGAIHHAAPNWAENVI VDGQLVTGQNPASAKGVGAALAKVLLEK >MS0974 thiL, ThiL protein MAEGEFDLINRYFALSKQIPRQDVILSIGDDCAITQLKADQRLAVTTDTM VENSHFYPTIPPRALGYKAVATNLSDLAAMGATPTWVSLALTLPKTDSHW LNEFSKGMFEILNRYDVTLIGGDTTKGEIPTVTITAQGLLGDYALCRHQA KIGDWIYVSGTLGDALAGFYLNRDIYEGKKSAVGFDEDFFIQRNLYPIPR IEFGKTLAKYRLANAALDISDGFMGDLMHILERSQVSAVVNLEDLPLSPQ LSAYYGREKAELMALQGGEDYELCFTVSDENKTKLDQYLAPLNVPYTCVG QICAAEDNPVIRLQRYKQEVNLSIHSFDHFK >MS1586 thiP, ThiP protein MRPIITLKIARGCMNSAKIPFIQTANFWILLSALAFLILPSQALDYGLLE STADEFYDAMGWSSLNLTMLWFLPLLGFLLTPRLGLSATAQAKAELGLVA FTTLFAFVSATVYKVSMGYSVIILLASLTALATFALAKLKIMQGDKFIIA SLLAIILLIFFFIVYPTVAILISMFYDGDTFTPQQVLRILSQSYIIRVIT NSLALSGFVGVVSTIFGLAFALYTTRIAKRTAFIGKIFSILPIVTPPFVV GLGVTLMLGRSGYVTEFLSDNFGFTNQNWLYGFNGIAIAQILAFAPISFM ILDGALKSIHPSIEEASYTLRANRYQTFYNIIFPLLRPALANSFLIVFVQ 
SLADFSNPLVLGGSFDVIATQIYFYIAGSQLDYASASTLGSMLLIFSLLI FIVQYMWIGNRSYVTVSGKSYRGDVQDLPGGLKWTIIGMLAFWVIFNLAL YGSIFYGSFTVNWGVDYTLTTRNYQLMFGQGFSDGAWPSLLNTMLYAGIA APLTALFGLLIAYIVVRKDFQGKKTLEFLTMLCFAVPGTVAGVSYILAFN NAPMYITGTGVIIIISMVMRDLPIGMRAAIAGLGQLDKSLDEASLSLKGS SFKTIWYIVFPLLKPALLSALVTSFVRAMTTVSAIIFLVTADTRVATAYI LNRVEDGEYGVAIAYGSVLIIVMMAIILFFDWIVGDTRISRSKAKKMN >MS1525 thiP, ThiP protein MNKWFSRRHIMRPSQYAAGLSVLLLLVCVYGGALQAVFNTGEPYPWRNLW QDDYLHRVLLFSFGQASLSAFLSVFIGGIFARAFFYQSFKGKDFLLKIFS LTFVLPSLVAIFGLLGIYGSSGWAVKLLQAVHIQWRPDIYGLSGILIAHL FFNIPLAVRMFLQGLHNIPNQQRQLAAQLNLRGWQFIRLIELPYLRQQLL PVFMLIFTLCFTSFAIVLTLGGGPKYTTLEVAIYQAIIFDFDLAKAALFA LLQFVFCFTLFGLSTLFSAAPETNLSYKELWIAKQSSAVKIWQILVLILV GLFILLPLVNIVAAAFSAEEFISAWQDPQLWRAMGFSFTIAPLSACLSLL MSMGLLLLSRRLVWLHLTKIANVIMNLGMLILAVPGLILAVGLFLLLQKM EFGTAHLFVVMVMCNAFSAMPFVIRILAPAMNNNMQYYEKLCQSLGIRGW QRFRLIEQHKLAQPIKYAFALACTLSLGDFTAIALFGSQQFSSLPYLLYQ QLGSYRGDQAAVTALVLLLMCLLIFILVEGAKNSDKNENANDKT >MS1702 thrB, ThrB protein MLRIYAPASSANISVGFDTLGAAISPIDGSLLGDVVQIEDIPAGFELESA GYFVRKLPKEPQKNIVYQAYVLFSERLKLRNGHVKPLRLTLEKNMPIGSG LGSSACSIVAALVALNMFHNEPFSKMELLEMMGELEGRISGSIHYDNVAP CYLGGVQLMVQSLGNICQQLPFFDSWYWVLAYPGIEVSTAEARAILPKSY TRQDVIAHGRHLGSFVHACHTQQDVLAALMMKDVIAEPYRESLLPNFAEV KQASRDLGALATGISGSGPTIFSIAPDLAVATKLANYLENHYLQNNEGFV HICKVDNQGTRALG >MS1701 thrC, ThrC protein MNLYNIKHPEEQVNFAQAVRQGLGKDQGLFFPEVIPALDNIDELLALPLV ERSQKILGALIGEEIPAEKLNTMVKNAFTFPAPVAKVEEGVYALELFHGP TLAFKDFGGRFMAQALATVRGDGKITILTATSGDTGAAVAHAFYGLENID VVILYPQGKISPLQEKLFCTLGGNIRTVAINADFDACQALVKQAFDDEEL RRAIGLNSANSINISRLLAQVCYYFEAAAQLSPSERSNLVVSVPSGNFGN LTAGLIAKTLGLPIKRFIASTNANDTVPRYLAKGKWEPNATVATLSNAMD VSRPNNWPRVEELFKRNGWALSELHSGAVSDAQTEETLRDMNAKGYLCEP HGAIAYRVLKQDLQAGETGLFLCTAHPAKFKESVERILNTQLPLPQALAK HAELPLLSDVMENDFAALRAYLLKK >MS1053 thrS, ThrS protein MPIITLPDGSQRQFDNPVSVLEVAQSIGAGLAKATIAGRVNGERRDACDM ITEDSTLEIITAKDEDGLEIIRHSCAHLLGHAIKQLFPDVKMAIGPTIDN GFYYDVDLEHSLSQEDLDALEKRMLELAKTNYDVVKRRVSWQEARDTFEK RGEPYKMAILDENIAKDDHPALYHHEEYIDMCRGPHVPNMRFCHHFKLQK 
VAGAYWRGDSKNKMLQRIYGTAWADKKQLNEYLTRLEEAAKRDHRKIGKA LDLYHMQEEAPGMVFWHNDGWTIFRELETFVRTKLKEYDYQEVKGPFMMD RVLWEKTGHWQNYGDLMFTTQSENREYAIKPMNCPGHVQIFNQGLKSYRD LPIRMAEFGSCHRNEPSGSLHGLMRVRGFTQDDAHIFCTEDQIESEVTSC IRMVYDIYSTFGFTNIAVKLSTRPENRIGDDAMWDRAEQGLANALAHNGL QYEIQEGEGAFYGPKIEFALRDCLDREWQCGTVQLDFALPGRLNASYVAE DNDRRTPVMIHRAILGSIERFIGIITEEYAGFFPSWLAPVQAVVMNITDS QAEYVQKVTKTLSDAGLRVKSDLRNEKVGFKVREHTLRRVPYMLVCGDKE ISEGKVSVRTRRGADLGTYSVEEFVEILKNQVRSRELKLLGEE >MS0405 thyA, ThyA protein MRQYLDLCQRIVNEGCWIENKRTGKRCLTVINADLTYDVANNRFPIITTR KSYWKAAIAEFLGYIRGYDNAADFRKLGAKTWDANANENQVWLNNPHRKG TDDMGRVYGVQGRAWRKPNGETVDQLRKIVNNLSRGIDDRGEILTFLNPG EFDLGCLRPCMYNHTFSLLGDTLYLTSYQRSCDVPLGLNFNQIQVFTFLA LMAQITGKKAGQAYHKIVNAHIYEDQLELMRDVQLKREPFPSPKLEINPD IKTLEDLETWVTMDDFNVVGYQCHEPIKYPFSV >MS1849 tig, Tig protein MFGVKPSPSAHSIRENSIRGKQRMTTIETTQGLERRVSITVPAETVTTAV RDELKRVAKNARVDGFRKGKVPAQIIEKRFGASVRQDVLNDLLPRHFFDL AFKEKVNLAGRPTFAVENYEEGKDLQFTATFEVYPEIQLQGLENIKVEKP VVEITDADVDNMVEVLRKQQATWAETDNAATKDDRVTIDFVGSIDGEEFQ GGKANDFVLAMGQGRMIPGFEDGILGHKAGEQFDIEVTFPEDYHVENLKA KPAKFAITVKKVEVMVLPELTADFIAKFGPNTKTVDDLRAEIRKNMQREL KNALTARVKNQVIDGLIEQNQIDVPFAAVDQEIEVLRNQAAQRFGGNGEQ AAQLPRELFEEQAKRRVQVGLLLAEVISSNELKADEEKAKAMIEDIASAY EQPAEVVEYYSKNNELMNNIRNVVLEEQAIDAVLAKAQVTEKASSFDEVM NPQA >MS0057 tktA, TktA protein MTDHKKLANAIRFLSMDAVQKAKSGHPGAPMGMADIAEVLWRGFMKHNPT NPKWADRDRFVLSNGHGSMLIYSLLHLTGYDLSIEDLKNFRQLHSKTPGH PEYGYAPGVETTTGPLGQGITNAVGMAIAEKTLAAQFNREGHDIVDHHTY VFLGDGCLMEGISHEACSLAGTLGLGKLIAFYDDNNISIDGHVDGWFSDN TKGRFEAYGWQVIDNIDGHNPEQIAAAVKLARAETEKPSIIICKTIIGYG SPNKSASHDCHGAPLGDEEIALTRKALNWEYAPFEIPADIYAAWDAKSAG QSAEAVWNEKFAAYEKAYPELAKEFKRRVNGELPANWAAESQAFIEKLQA NPASIASRKASQNAIEAYAHILPEFLGGSADLASSNLTLWSGSKPIRAKQ NVDGNYINYGVREFGMSAIMNGIALHGGFIPYGATFLMFYEYAHNAVRMA ALMKQRSLFVYTHDSIGLGEDGPTHQPVEQTASLRLIPNLETWRPADQVE SAIAWKAAVERKDGPSALIFTRQNLAQQDRTSEQLANVARGGYILKDCAG TPELILIATGSEVELAVKAAEALTAEGKAVRVVSMPSTNVFDKQDEAYRE SVLPSSVTKRVAIEAQIADFWYKYVGLEGRIVGMNRFGESAPADQLFKLF GFTVENVVAKAKEIL >MS0682 tldD, TldD protein 
MEISQNQTALLKQQEQALRDAVSYAVEIAQKAGASAEVAVTKVNGLSVST RLKEVENVEFNNDGALGISVYLGQQKGNASTSDLSKDAIKNAVEAALAIA KYTSPDECAGLADKELMAFEAPSLALYNPAEVDVDQAIELALQAETAALN YDKRIVNSNGASFNSHNGVRVYGNSYGMLQSYLSSRYSISCSVLSGIDDE LENDYEYTVSRDLNALESPVWVGENAAKKAVARLQPRKITTQEAPVIFLN DVATGLIGSLAGAISGGSLYRKASFLLDHLGRQILPDWFHISERPHLTGR LASTPFDSEGVKTQSREIVEQGILRTYLLTSYSGRKLGMQSTGHAGGIHN WLVRPNANGDLDSLLRQMGRGLLVTDLMGQGVNMVTGDYSRGAAGFWVEN GEIQYPVAEITIAGRLKDMLRDIVAVGDDIEQRSNIQTGSILLESMKISG N >MS0780 tldD, TldD protein MLNKVVESLLTPSNLSVKDLPNIFDQLAHRHLDYSDLYFQLSQDESWVLE DGIIKEGGFHIDRGVGVRAISGEKTGFAYSDQINLTSLQQCANAVKGIAP AEQGRIITPTGFNRVNPILRYAAVNPLDTLTKEQKIELLYLVDKTARGMS PYVSRVSASLSSIYEEVLVAATDGTLAADIRPLVRLSVSVLVEKEGKRER GSAGAGGRFGLNWFLESFEGEVRAVSFTKEAVRQALVNLEAIPAPAGLMP VVLGAGWPGVLLHEAVGHGLEGDFNRKESSLFSGKIGELVTSPLCTIVDD GTLENRRGSLTIDDEGTPSQRNVLIENGILKGYMQDKMNARLMGVAPTGN GRRESYANLPMPRMTNTYMLSGDSKFEDLIGSIDRGIFASHFGGGQVDIT SGKFTFSTTEAYLIEKGKITRPVKGATLIGSGIEVMQQVSMVADNMEIDH GIGVCGKEGQSVPVGVGQPALKIERITVGGTN >MS0569 tmk, Tmk protein MNKNMNGKFIVLEGLEGAGKTTARQAVVEQLNALGITDLLFTREPGGTPL AEKLRNLIKYETEEPVTDKAELLMLYAARIQLVENVIKPALAQGKWVIGD RHDLSSQAYQGGGRQIDSHLLETLKKTVLGDFEPDFTLYLDLSPAIGLAR ARGRGELDRIEQQNLAFFDRTRTRYLELVKDNPKAVIINAEQSIERVTAD IKTAVKNWVNSISL >MS0722 tolA, TolA protein MKNNRQNKERSAFLTSLVLHILLFGLLILSSFYHTVEVMGGGEGDGEVIG AVMVDTGAAAKEWGRIQQDKKGQTDKQKQKKVPDQVDGEQHSPEPEQKIE EQVEKQKQQELEKQKQLEQQKQAEQQRQQELKAQKEAAEKAKAEAEAKAK REAEAAKLKADAEAKRLAAAAKQAEEDAKAKAKAEAEAKAKAEAKEKAAE EAKLKAQAEAKAKAEAEEKAKAEAKAKADAEAKAKAEAKAKADAEAKAKA DAKAKSDAKAKQAALDDFLNGGDVGGGSAMKGGNANKAGSQGSGAGLGAG DGGKVGDQYGAVIKREIKRRFLKDPSFAGKVCSIRIQLARDGTITGYQKV SGPDDICTAALSAVARTKKVPAAPSDDVYNKYKNPIIDFDLK >MS0723 tolB, TolB protein MMKFLTRMLSAFAVLFFAISTAQADDEVRIVIDEGVDGARPIAVVPFKTN GSVPADIAEIVTADLRNSGKFNPIPVSQMPQQPASASEVTPDAWAALGVD AIVVGQVTATGNGYNIAYQLVDTVGASGGAGAVLAQNSVTVGAKWIRYGA HTVSDEVFEKLTAIKGAFRTRIAYVVQKNGGSKPYEIRVADYDGFNQFIV NRSSQPIMSPAWSPDGKRLAYVSFENRKSQLVVQDLGSGARKVVASFQGH NGAPAFSPDGSRLAFASNKEGQLNIYVMGANGGQPTQLTSGSGNNTEPSW 
SPDGSSILFTSDRGGSPQVYRMSSSGGAASPVGGRGSAQISSDGKTLVMI NGNNNVVKQDVTSGASEVLSTSFLGESPSLSPNGIMIIYSSTQGLGKVLQ LVSADGRFKARLPGTDGQVKFPAWSPYLDKN >MS1131 tolC, TolC protein MLKINKLALAVVLSTALAGCANLDDSYQAAQDDFKQYEEVTKQFNVKDNW WSLYNDAQLNRVVEQALVNNKDLAKAAISVNSALYQANLLGADLVPSFNG STSSSASRPIDRHDNSTISHGGSLNVSYTLDLWRRLADAASAGEWSYQAT QQDMEATRLSLINSVVVTYYQIAYLNDAINATNDTINYYSQINGIMQNRL AQGVEDRASTDQAQQAVLTARNSLISYQTAKKTAEQTLRNLLNLKPSEPL NINFPNILNVQTAGVNLNVPVSTIGNRPDVRGYLYRLNSAFKDAKATQKS WFPEITLGAGLSSSGTRVNNAFNNPVAAGTIGISLPFLDWNHVKWNVKIS EAAYDTARTNYEQSITTALNEIDTNYFAYTQAQQNFTNLKQKYDYDKRIA QYYKNRYDAGVSDLKDWLSAINTERASQVSILNAKYSVIQNENAIYSSMA GYYSPKF >MS1304 tolC, TolC protein MKSMHKLSLIAVLVTLAACSSTNVDLNSQIEMPARFEQTAQATGTSEIAQ WWRNWNYPQLTALIEQGLQQNLEVAMARSRLAEAQANAAYTDADLGPSVS ASGSASGSRARVDNPLTGGSSTSSGSYQYAAVTASWELDFFGKKRSDRDA AEAQALSAQDQVYAAQMLVAGQIAESYFNIAALQQRQAVLQQYADVLGKL KTYVQGRFNAGQANANDVLQTESRLSSIQANLATFDSQIDSNRRAIAILT GKPAQGFRLSPAVKNPLINLPAAPAGVLPGEVLARRPDLHSYRNQVQAAA AKLASAKADLYPRFDIQFMGGTGRIDVNSDISELKGWAGLVSGGISLPIF TNGRIQANIDVADARLKTALLQYDKALIQALADVDNSYQAQFALNRQIRL LQTAAAQTQKSAVNAEKLFQYGEKTLDNTLSERINALNAQEQLIQARLTH AKNLVSLYKALGGGWVK >MS0515 tolQ, TolQ protein MTTALFDFLKNYSDYIILGLLGLMSFIMIWFVIERFIFLSRVKVHAYSNI HSLDIDLNRHLTIISTIGANAPYIGLLGTVVGILLTFYDLGNSGGDIDAS AIMLHLSLALKATAVGILVAIPSMVFYSALGRKVEVNRLKWKALNSQKNI GA >MS0720 tolQ, TolQ protein MSGDLNFFELFIKASIVVQIVILILIAFSVISWTIIIQRSRVLTTALKDS SAFEDRFWSGEDLTKLYEGLSNRRDGLTGSEQIFYVGFKEFSRLKQANPE APESIIEGSSRAMNLTMNREIEELEGRIPFLATVASISPYIGLFGTVWGI MHAFMGLSGAKQATLQMVAPGIAEALIATAIGLFAAIPAVMAYNRLSLRL NKLEQNYGNFIDEFTTILHRQVFGSKTH >MS0513 tonB, TonB protein MRRPSIAGFLGSLLFHGGIAATLFFSFKENDNANGMAAQIIDTNISMEMM MATMVEETQPTAEPEPQTKEEIVQKEAVEDPTLKKEKPKEKPKEKPKEKP REKPKEKAKKVPPPVQQGIKADKIVLSNANANSKATGLSNTINSDNPNLA GQGTGSSEVDAYKIALRREIERHKRYSQRAKMMRKQGTVIVAFNIGNDGS LSNARVVKSSGTEDLDNSALEAVKNAKSIGQKPAGMANAISVPIAFTIR >MS0131 topA, TopA protein MSEPLFQHTKTEECCPQCGSPLQIKQGKKGKFLGCSAYPACDYLKPLSNQ SESRIIKQLDECCPQCGHPLLIRQGNFGMFIGCGNYPQCHFIVHEDEQPP 
AEESVACPECGKGELISRRGRQGKYFYACNRYPHCKFTLPGKPYLQDCPQ CGGHICLLKKENETYRTFLCVNKSCRHQFDRKKEKT >MS1096 topA, TopA protein MSKSLVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTAGASTGEKAKP VSTKGLTAEEKQALKTEKEKNALVKRMGIDPYHGWKANYQILPGKEKVVA DLKSLAKKADHIYLATDLDREGEAIAWHLREVIGGDDNRFSRVVFNEITK NAIKQAFEKPEHLNLDRVNAQQTRRFLDRVVGFMVSPLLWKKVARGLSAG RVQSVAVKLVVEREREIKAFQPQEYWEVAVVTKTADNQKITLDVAEYKGK RFDPKNETEAQSAVDFLAKSDYIVSALETKPTTSRPRAPFITSTLQQTAS TRLNFSVKKTMMLAQRLYEAGYITYMRTDSTNLSRDALNMARSYIERNFG EKYLPEKPNFYSSKENAQEAHEAIRPSDVNISMNDLQGMEKDAVRLYDLI WRQFVACQMPAAQYDSTTLTVKAGDYELKAKGRILRFDGWTKVLPQLGKS AEDQELPALNVHNKLALDEIQPSQHFTKPPARFTEAALVKELEKRGIGRP STYAAIISTIQERGYVRTENRRFYAEKMGEIVTDRLNQSFAHLMSYDFTA SMEDMLDQIATGKKDWKTELNQFFKDFSGQLTTAELDELEGGMKPNSLVL TDIQCPTCGRPMAIRTASTGVFLGCSGYALAPKDRCKTTINLIPEAELLN VLDDASETKALMERKRCPKCDTAMDSYIIDPHRKIHICGNNPNCEGYLIE QGTFKIKGYDGPIVECDKCGSDMHLKLGRFGKYMACTACDNTRRILANGE VAPPKEEPIAFPELKCEKADAYFVLRNSAVGVFMSAHNFPRVRESRPAKV AELAQYRERLPEKLQYLADAPQQDPEGNPAIISFSRKEKHQYVTSEKNGK KTKWIVDYIDGNWIERKK >MS0730 topA, TopA protein MRLFVAEKPSLARAIADVLPKPHQRGDGFIKCGKNDCVTWCVGHLLEQAE PDAYNPMFKQWRLEHLPIVPKKWRLIPRKEVAKQLKTVENLIHQADQLVN AGDPDREGQLLVDEVFNYANLSTDKRNAIQRCLVSDLNPAAVEKAVKKLQ PNTNFIPLATSALARARADWLYGINMTRAYTIRGRQAGYNGVLSVGRVQT PVLGLIVRRDLEIENFQPKDFFEVLAHIQTEDETPQKFTALWQPSKACED YQDDDGRVLSLGLAENVVKRITGQPAEVTEYTDKREKETAPLPYSLSALQ IDAAKRFAMSAQDVLDTCQRLYETHKLITYPRSDCRYLPNEHFAERMPVL NAISTHCKEYQPLPEVLNTEQKNRCWNDKKVEAHHAIIPTAKNRPVNLNS QELNIYTLIARQYLMQFCPDAEYRKSKISLKIAGGNFVAQARNLQIAGWK ELLGKEDENEQLEPSLPIVKKGQQLFCEKGEVISKKTQPPKPFTDVTLLS AMTGIARFVQDKELKKILRETDGLGTEATRAGIIELLFKRGFLYKKGRNI HSSEAGRILIQALPDMATQPDMTAQWEAQLDGISRKQASYQQFMATLTEL LPELVQFVNFSALRKLSAVANNPKPKNFKKKAKIAQSTETKKEV >MS0587 torC, TorC protein MSKRKKISMWAAAVLLVIGALLLLGSQYVMKATSSTEFCVSCHSMEYPAE EWKASGHFSNTKGIRAECADCHIPHDGIDYVKAKVIALKDVWFTLTNKIP DRATFEEQRGELAQRVWDEMKANDSATCRSCHNEDAMIVSEQSDSAQKMH KLAKETNQTCIDCHKGLVHFMPETHAVASVQENVPPQAVQIVDNQPLYAS NVSTATLIDGGEARLLPYAELANWKEEDNNFIGTIEGWQQTGAESLIYKE 
LGKRINVAVLNEEAKTHVNVVNTVHDEVTDSDWKKVNINVSVPKSAVTSN LESLNQYGHNLNQTHCSGCHAAIGADHYTANQWIGVVNSMKDRTSMTANE VRALTIYLQRHAKDMH >MS2277 torC, TorC protein MIKKFWNWFRSPSKIAIGAVVLLSALGGILAWGGFNAGLEYTNTEEFCSS CHMNDVVPEYRQTIHYSNRSGVKAICADCHLPHEFIPKWTRKIQASREVF AHFTGKVDTKEKFEAHRLEMAEREWARMKANNSQECRNCHNFEDMDFTQQ KTVAQEMHAAAEQQGKTCIDCHKGIAHNLPHMEKVQKTFIPEDMIKPQEP AVQ >MS0837 torD, TorD protein MSETIINNFSLISRLFGNLFYRSPTDSILDGVFGWLQQKGLEQVWPLDTD EDVRQALDSVQMTIAKEVLAQEYERLFAGEQPKIDSRISAYGLNVDEFIN FRQTRRMPEVESADNFSLLLLTASWIEDNLDSISAQQELFESFLLPCASK FLTHVETYALLPFYRSLALLTREILAAMADELEENE >MS2335 torD, TorD protein MVKNTALLSLKQQKSAMNFKEILMDNALLQWISTGGRLLGAVFYYEPKDK RVQPVLDFFRQPDWTKDWATLANPALINALIEKSAQQDLSQAYQYLFIGP NELPAPPWGSVYLDKESVIFGDSLLALRDFLTVHQIEFIQTQNEPEDHLG LMLMLAAYLAENKPELLEEFLTKHLFSWVYRCLDLIFAQTDYPFYQAMAL LARQTLKGWQQQLDLQVDQPQLYR >MS0324 tpiA, TpiA protein MARRPLVMGNWKLNGSKAFTKELIEGLKAELAGVEGCDVAIAPPVMYLAE AEAALAGSKIVLGSQNVDVNVKGAFTGDISTEMLKDFGAKYIIIGHSERR TYHKESDEFVAQKFGALKEAGLTPVLCIGESEAENEAGKTEEVCARQIDA VINALGVEAFNGAVIAYEPIWAIGTGKSATPAQAQAVHAFIRGHIAAKSQ AVADQVIIQYGGSVNDANAAELFTQPDIDGALVGGASLKAPAFAVIVKAA AKAKA >MS0379 tpiA, TpiA protein MKKYYFGTNLKMYKGIADTTRFLAQLSELTHDIRANNDIELFVIPSYTAI QSAIQTTLATSHDNPIIIGAQNMNPNDNGQYTGDISPLMLKEIGTQLVMI GHSERRHKFGETDRQENEKVLSALKHDLTTLLCVGETLEQKNYNISDEVL RTQLKIGLSGINTNQLAKLRIAYEPVWAIGESGIPATADYANEKHAVIKQ CLIELFGDAGKDIPVFYGGSVNAENSNELFGQQYIDGLFIGRSAWDAENF FKIIERIVNK >MS0978 tra5, Tra5 protein MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRGSAGKRTIQSHLKEKG IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI LAKTTYRRLLN >MS1602 tra5, Tra5 protein MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI LAKTTYRRLLN 
>MS1577 tra5, Tra5 protein MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI LAKTTYRRLLN >MS2299 tra5, Tra5 protein MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI LAKTTYRRLLN >MS1804 tra5, Tra5 protein MSESAYYAHLRTAKKPAKHTALAVEIKAIFDASRSSAGKRTIQSHLKEKG IFVGLYLIRKLMNKQGLFSKQPQKWRNPSKGNSQVFENILSREFTPDSQT TVLCGDTTYIKINGIWCYLAVVINLLNRQVVGWKLSRYHDSELVKDALNH AMLNIERTERMLFHSDQGSIYGSEIFTDSVKKHGLTQSMSRRGNCWDNAP MERWFRSFKYEWMLKGGYSDFESAVNDVREYVMYYNHIRPHSYNQGLSPI LAKTTYRRLLN >MS2195 trkA, TrkA protein MKIIILGAGQVGTTLAENLVSEDNDITLVDNELLRLEDLQDKHDLRVVAG SASSPRVLREAGAPDADLLVAVTSSDEVNMVACQMAYTLFHTPTKIARIR NSEYLREKDKLFHNDMIPIDHIISPENLVTEEIIRLIDYPGALQVAHFAD RRISLVVLKAYYGGPLVGYAISMLKEHLPYIDYRIVSILRHDKLIRPQGS TIIEAGDEITFISATVHIKAIMAEIQRLDKPYKRIMIVGGGNIGAGVAKQ LEAGCSVKLIERNAEKAKSLAEKLSNTLVFHGDASDQSLLFEEHIENIDV FISLTSDDEANIMSALLAKRLGAKKAMVLIQRMAYINLIQGGTIDIAVSP QQATISALLTHVRKGDVKNVVSLRHGLAEALEVVVHGDAATSNVVGRKVS ELKLPQGVILGAVLRNEEVIIAKKQVVIEENDHVVIYLSDKKNISEIEKL FQPSAFFI >MS0175 trkG, TrkG protein MHILSIVRIVGILVMCFSVAMLAPAFVALIYGDGGGKAFMQSFVISLIVG TTLWWSCHSHKQELRSREGFIIVVAFWVVLGSLASIPFMLFEYPDLTVAS SFFEAFSGLTTTGATTIVGLDDLPKAILFYRQLLQWMGGMGIIVLAVAII PLLGIGGMSLYRAEMSGPMKEQKMRPRIAETAKILWFIYASLTILCALAY YLAGMSPFDAISHSFSTVSIGGFSTHDASIGYFNDSWINLITVVFLWISA CNFALHFRAFSEINKGGFFKIYRNDPEFRFFVSIQVILILICSAVMLSHS YFETTWENIEQVIFQSVSISTTTGYTTSDFSAWPSFVPMLLIIASFIGGC AGSVGGGVRVARILVLYLQGKRELKLFVHPNLVYPIKWGKRILDERVIGS IWAFFSAYLLVFIICLLGVIACGVDVFNAFNAVLACINNLGPAMGIVNSN MVEIPDSAKCILTIAMVCGRLEIFTLLALFSPTFWKA >MS0395 trmA, TrmA protein MTTKCPHFQTQQCQSCQWINRPYDEQLNEKQIHLKQQIAPLDQSQLRWSA 
PFQSRQSGFRNKAKMVVSGAVERPVLGILKDQNAPQSAVDLTDCLLYSAG FKPIFPVLKDFIGRAGLVPYNVAKQKGELKYILLTESGYQGDIMLRFVLR SENKIPLIRRELAKLREKLPQLKVISANIQPQHAAILEGEKEIFFTERQV LEERFNRIPLFIRPQGFFQTNPQVAEGLYGTAQQWVKDLPVNKLWDLFCG VGGFGLHCAKALQEKNPDIELTGIEIAPSAIYCAGLSAQKCGLKKVNFQS LDAANFALNQDENKPDLVIVNPPRRGIGKPLAQFLNQMQPQFILYSSCNA ISMTKDLLELTHYQLQKIQLFDMFPHTSHYEVLTLLIKR >MS0240 trmA, TrmA protein MVLLYTPPQKINKLQREMEVEILDLDYQGLGVAKIQGKTWFVENALPGEK VRIKIKEEKRQFGLATTKKILEASAQRQTPKCQYASRCGGCQNQHIPVEM QREAKQKALFRRLLKLQPEGIEFMPMIVGEAFGYRRRVRLSMLFDGKLKR LEIGFRQKNSAQIVHIEQCEVIEPALNKILSKLTALLSRFSQPKNLGHIE LVAADNGVAMLLRYSGKLTENDRTLLLDFAVREELMLFLQDDEKTEQIYG QPPFYQLADNLQLQFDIRDFIQVNSLLNQRMITAALDWLDVQKQDHVLDL FCGMGNFTLPLSRRVKSAVGIEGISAMVEKAKANAERNQCQNVQFYRADL DQNFADEVWATEPFNKILLDPPRTGAAFALNALCRLKAEKILYVSCNPAT LVRDAEILLNSDYRVKKVAMIDMFPHTGHLESITLFEKQS >MS2367 trmA, TrmA protein MQQLPIEKYSELLTKKQQKLTALLAPFNAPELSVFVSPVQNYRMRAEFRV WHDKGDLYHIMFNQQTKQRYRVDCFPIASLLINRMMENLIPLLKEQEILT KKLFQIDYLSTLSNKIIVSLLYHKTLTEEWQAAAQALKVRLEKLDFDVQI VGRATKQKICLERDYADEVLPVNGRNYVYRQIENSFTQPNAAVNCKMLEW AIGCTKNSSGDLLELYCGNGNFSIALAQNFRKVLATEIAKPSVAAAQFNI AENGIDNLQIIRMSAEEFTQAMNGVREFNRLKGIDLKSYECNTIFVDPPR AGLDPDTVKLVQNYDRILYISCNPNTLCNNLTELTKTHRIEKAALFDQFP YTDHMESGVWLIRK >MS0442 trmD, TrmD protein MWIGIISLFPEMFKAITEFGVTGRAVKQNLLQVSCWNPRDFTHDKHKTVD DRPYGGGPGMLMMVQPLRDAIHAAKAEAGDGVKVIYLSPQGRKLDQTGVT ELAANEKLILVCGRYEGIDERLIQTEIDEEWSIGDYVLTGGELPAMTLID AVARFVPGVLGKQASAEEDSFAEGLLDCPHYTRPEVLDGYVVPPVLMSGN HEEIRKWRLKQSLERTWLRRPELLEKLALTDEQKKLLKDIIEAYHIRQSK TSDNG >MS0301 trmU, TrmU protein MLRLNRSYGRTMQNLTNLSSRTYDQHFPKLSAEQLAENAKKKVIVGMSGG VDSSVSAFILQQQGYQVEGLFMKNWEEDDDTDYCTAAADLADAQAVADKL GMKLHKINFAAEYWDNVFEHFLAEYKAGRTPNPDILCNKEIKFKAFLEYA AEDLGADYIATGHYVRRRGDDENARLLRGLDSNKDQSYFLYTLSHKQVGQ SLFPVGDIEKPIVRAIAEDLGLITAKKKDSTGICFIGERKFKDFLARFLP AQPGEIRTVDGKVIGRHDGLMYYTLGQRKGLGIGGIKGMDENPFYVAEKD LVNNVLIVAQGHDNSALLSSGLIARQLHWVDRQPIRENLRCTVKTRYRQT DIPCEIQPIDDETIRVIFDEPQIAVTPGQSAVFYQGEVCLGGGVIETQIK >MS1154 trpA, TrpA protein 
MARFETLFAQLNAKKQGGFVPFVTLCDPDLERSFDIICTLVDNGADALEL GFPFSDPLLDGPVIQAANNRALNAGCSTAESFKLLEKVRSKYPEIPIGLL LCANLIYAQTLDGFYRRCAEIGIDAVLVADIPLLAAEPYIQAAKKHGIQP VFICPPNADENTVKGVAEHSEGYTYLVSRAGVTSAENQSHAANLDSLVEQ LKAHNAPPILQGFGIAKPQQVKEALNMGVAGAISGSATVKIIEANLDNHE KCLADLAEFVKNMKAATL >MS1153 trpB, TrpB protein MTDTILDPYFGEFGGMYVPEILIPVLKQLEKAFVEAQQDPAFQTEFLDLL KNYAGRPTALTLCRNLTKGTKTKLYLKREDLLHGGAHKTNQVLGQILLAK RMGKTRIIAETGAGQHGVATALACAMLGMPCRIYMGAKDVERQSPNVFRM RLMGAEVFPVTKGSSTLKDACCEAMRDWAANYENTHYLIGTAAGPHPFPT IVREFQKMIGEETKAQILQREGRLPDAVIACVGGGSNAIGMFTDFINETS VRLIGVEPAGKGIETGEHGAPLGHGKPGIYFGMKSPIMQTEDGQIEESYS ISAGLDFPSVGPQHAYLNSIGRAEYPSITDDEALEAFKELAQHEGIIPAL ESSHALAYALKMARQNPMREQLLVVNLSGRGDKDIFTVDKIFSERGML >MS1152 trpC, TrpC protein MNLNDKPTILQKIVADKIQWIKAKEQVFPLASFKEKITKSDRSFYQSLGK GTHQNPVFILECKKASPSKGLIRNEFNPADIAQVYKNYASAVSVLTDEKY FQGDFSYIKQVRDIVTCPVLCKDFMISEYQVYLARYYQADAILLMLSVLD DETYKKLAALAHELGMGVLTETSNQQELERGIALGAKVMGINNRNLHDLT VDLARTPPLAQQIPADRIIVSESGIYSHQQVQQLKPYVNAFLIGSSLMGS DDLNNAVRSVIFGENKVCGLTRPQDVQEVYRQGALYGGLIFAENSKRCVS LRQAQELVTVAPLRFVGVFQNQQIDFIVKIATQLNLYAVQLHGAENEEFI AALRIQLPHQIQIWQAVSIDVAQQSAVKIDRISAVDRYVLDSKTANRQGG TGVAFDWSKIPAEIKNKSLLAGGITPENIELALAQHCLGIDLNSGVESAA GIKNPEKLTAVFNKIHRF >MS1151 trpD, TrpD protein MRIKTRNFIMQTQQILTQLFDNQPLSQEQAAFIFGNIVKGELSNEQLAGA LIALKIRGETIDEITGAVTALLAAAEPFPAPDYPFADIVGTGGDNADTIN ISTASAIVAASMGLKIAKHGNRSVSSKTGASDVLTALGVNIRMSTEQARK ALDEIGIAFIFAQQYHLGFKYAGPVRQALKTRTIFNILGPLINPANPKRQ LLGVYSPELLKPYAETNLRLNHEHSIIVHGCGLDEVAIHGLTQVAELRDG KIEYYNLSPKDFGFEPQPLESLRGGAPEENAKILTALLQGKGSEQQAQAV AMNTALLMKLFGHEDIKQNAQQVLEQLTTGKAFETLTKLTTY >MS1149 trpE, TrpE protein MPNAYIQTLSNPVQYQQDLTAVFATVGKTNSLLLESAEISSKNSLQSLLI INAALKVSCLGQIVTFTALTANGSHVLPLIKEKLQGKTKSLSVQQNKLIA EFFPIDQNLDEDSKLQSLTVFDGLRVINQLYQHSKQPVFLGGLFAYDLVA NFIPMNNITLQDDGLSCPDYVFYLAEQLLRLDHPSQQATLQTFCFNDSEL QNLQQSAVEIDKDLRNLKPLSAIQQGSTDISTNHEDEKFKQIITALKHHI YIGDVFQIVPSRRFILQCPNTLATYRQLKENNPSPYMFFMQDEEFTLFGA SPESALKYSADNRQLEIYPIAGSRPRGFDAKGKIDPELDARLELEMRLDH 
KEQAEHLMLVDLARNDVARVCESGTRHVKELMQVDRYSHIMHLVSRVVGK LRPELDALHAYQACMNMGTLTGAPKIKAMQLIYQFEKQKRHSYGGAVGYL SSDGNLDTCIVIRSAFVQNGIAYVQAGCGEVLDSDPQMEADETRHKAQAV IKAILQTNAQAN >MS2193 trpE, TrpE protein MNFASFIRQANRLGRQKTAFFFLIDFERQKPLISPLESAVENGIIFSVEG NTNFYRPVELPRQKIRFSSEPVSFERYAAGFALVQQELQKGNSYLLNLTY PSKINTNYNLAQIFQATKAPYKLLLQDQFVCFSPESFIRIRQNQIFTYPM KGTIDAALPQAEQQLMQSEKEGREHYTIVDLMRNDLAMVAENIRVRRFRY IDKISTNRGEILQTSSEITGNLTADWQNRIGSILAALLPAGSISGAPKEK TVSIIRQAEGGKRGYYSGIFGIFNGEELNSAVAIRYIEQKDGQLYFRSGG GITSQSRLQEEYEEYCQKVYLPIHCVE >MS1566 trpR, TrpR protein MYISRNMEQWTKFIETLRIAFNDGKEQDLLTLLLTPDERDAIGLRLQIVA QLLDKKIPQREIQQNLNTSAATITRGSNMLKLMSPDFMEWVKKHTNETEN T >MS2332 trpS, TrpS protein MTKPVVLSGVQPSGELTIGNYLGALRQWVKMQDDYECLFCIVDLHAITVR QDPEQLRKATLDVLALYLACGIDPEKSTIFIQSHVPEHTQLAWVLNCYTY FGEMNRMTQFKDKSARYAENINVGLFTYPVLMAADILLYQAAQVPVGEDQ RQHLEITRDIAQRFNAIYGENQFTVPQAFIPKAGAKVMALQEPTKKMSKS DDNRNNVITLLEDPKSVAKKIKRAMTDGDEPPLVKYDVQNKAGVSNLLEI LSVITDKPIPQLEKEFEGKMYGHLKTTVADEVVAMLTQLQDRFAHYRNNE ELLNKIAAEGAKKARARAKETLEKVYNAIGFVAAK >MS1175 truA, TruA protein MRTDYKIMKIALGIEYNGKNYFGWQRQEKVHSVQAELEKALSFVANEKIE VFCAGRTDSGVHGTGQVVHFETNAIRPEKAWAFGTNANLPDDIAVRWAKE VPDDFHARFSATARRYRYVLYCNKLRSAILPYGITHTHLDLDEHKMQEAG RFLLGENDFSSFRAAQCQSNTPWRNIHHLNVIRRGNFVIVDIKANAFVHH MVRNIVGSLMEVGCGNQPPEWIEWLLAQKNRKLAAPTAKAEGLYLVQVTY PEHFELPQMPLGPLFLADEL >MS1441 truB, TruB protein MSRPRKRGRDIHGVFLLDKPQGMSSNDILQKVKRIYQANKAGHTGALDPL ATGMLPICLGEATKFSQFLLDADKRYQVIAKLGERTDTSDAEGQVVETRS VNVTEQKILDSLPHFRGDIMQVPTMFSALKHKGKPLYEYARAGIVVEREA RPISIFELNFISYEAPYLTLEVHCSKGTYIRTLVDDLGEYLGCGAHVSML RRTAVSDYPADKMLTWEQLQQFAQDEDLAALDARLLPVDSAVSKLPVLSL SEEQTKAVGFGQRVKFDNLQQLQGQVRLFSPQNVFLGVAEIGKDNVIRPS RMVNL >MS1813 trxA, TrxA protein MKRKVFLFLPLIVLLVICIFLIMGLKQDPKKIASALIGKPVPEFFQADLL DNNRIISNKHLPKQPYLLNVWGSWCYYCQQEHPLLMELAEQRIPIVGLNY RDKKQGALEMLTKKGNPFALVIDDSRGELAMKLGVDGAPETYVIDENGVI RYRYSGAVDKTILQKEILPEFNKLRN >MS1626 trxA, TrxA protein MSEVLHATDASFEADVLRSDVPVLVDLWAPWCGPCRMVAPILDDLAAELA 
GKVKIVKINIDENQGTPAQFGVRSIPTLLMFKDGQLVGTQVGALPKNQLA AFVEKNL >MS1299 trxA, TrxA protein MRNFLLFLILSLTTFVAHSGLFTGKPQFLKPHEAFILSANKQDAQINLHW KIADNYYLYKKELRITGENSKIGEIIYPQADKHQDEFFGETEIFRHELFL AVPVNEQNAASRLEVTYQGCTKGFCYPPETTVLELASLPVGQESQTLTAQ DSLSQNLLKSKYAVFGFFLLGIGLAFTPCVLPMLPLLSAIVIGQGKRAST GRSLLLSFVYVQGMALTYTLLGLIVAAIGLPFQVALQSPYVLVTLSAVFV LLALSMFGLFNLQLPSSLQTKLALFSQKQQSGALGGVFIMGMIAGLIASP CTSAPLSGALLYVAQTGDLFFGAITLYLLALGMGMPLVLITVFGNRILPK SGAWMEKVKTAFGFVLLALPVFLLARVLPGIWENLLWSLLTVSFFAWLSF SMPKGKLGRSLRILFLILAMIAVRPLQNFIWGDYSAPAGNPQSAVEKSEI SSSTRFKQINNYAQLKQALTSNPKAIAMLDLYADWCVACKEFEKYTFSHP DVKHKFEQVLLLQVDMTKNSPENAELMEKLSVLGLPTIIFFDRQGNEIVN SRITGFLNAKQFLSLIEKYL >MS0607 trxA, TrxA protein MNKKLYLPLLIFLILVGAFFIQLRQNASGGDPKLLESALVGKPVPEKVMQ GLFDNKDYTSEVFKQGKPILLNVWATWCPTCYAEHQYLNELAKQGITIIG VDYKDDSAKAVKWLKDLDNPYQLVLKDEKGSLALDLGVYGAPETFIIDGK GVIHYRLAGDVNEKVWKNTLLPIYNQLFEDGK >MS2059 trxA, TrxA protein MALSNNFRFILRIYRMKKLFLACFAALATVTFQVQAADLTEGKQYEVLAL EHSAQPEVVEFFSFYCPHCYSFEMQYKIPEKIKQAIPANASFKQYHVNFL GSQGENLTRAWALAMAIGAEDKIRAPLFKAAQANSLRSMDDIRQIFIDNG VTAEQFDGSINSFAVTALVNKQTNLAEQFKVRGVPDFYVNNKFHINMEGL SHDNFVQDYVDTVNELLSK >MS1538 trxA, TrxA protein MFSMKIKFCCYLFLVCLAPAVFAQKNTAGTHAAQAIASPGNRAAKNEFQD GQDYFSYSTPIHTENRRDGKILIQSFFDYDCRVCVNTLDILELYSKINPN KVVVEEYPIATKETTFSAQVYYSLKRMNHEDIAELLLFETTDIERYRELT KFENLLAYLKQQNVDEKLFTDIYQSAEIRRQVSEAIYRTEKYGVFTYPFV VIGGKYVLTNSTLYNDDYTFAVLDFLVHELSAVSTATHSK >MS0951 trxB, TrxB protein MSDIKHSKLLILGSGPAGYTAAIYAARANLNPVLVTGLEQGGQLTTTTEI ENWPGDFAETTGPELMQRMLQHAEKFDTEIVFDHINRVDFSSRPFKLYGD VQTFSCDALIIATGASARYLGLPSETEYKGRGVSACATCDGFFYRNKPVA VIGGGNTAVEEALYLANIASEVHLVHRRDAFRAEKILIDRLYKKVEEGKI ILHTNRNLDEVLGDNMGVTGVRLKDTQSENTEEIKIDGLFVAIGHAPNTA IFADQLELNNGYIVVKSGLNGNATATSVEGIFAAGDVMDHNYRQAITSAG TGCMAALDAERYLDALEA >MS1932 tsf, Tsf protein MAEITASLVKELRERTGAGMMECKKALVEANGDIELAIDNMRKSGQAKAA KKAGRVAAEGVILARIAEGHGVLVEMNCETDFVAKDAGFLSLANAVADYA VANKGVTIEALQAQFEEQRAALVAKIGENMTIRRVAEIEGKVIAQYLHGA KIGVLVAGEGSADELKKVAMHVAASKPEFVNPEDVSADVVEHERQIQIDI 
AINSGKPKEIAEKMVEGRMKKFTGEVSLTGQAFVMDPSQTVGSYLKSVNT SVANFIRLEVGEGIEKVEADFAAEVAAMQKV >MS0165 tufB, TufB protein MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLSKHYGGAARAFDQI DNAPEEKARGITINTSHVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMD GAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELV EMEVRELLSQYDFPGDDTPIIRGSALKALEGEAQWEEKILELANALDTYI PEPERAIDQPFLLPIEDVFSISGRGTVVTGRVERGIIRTGDEVEIVGIKE TAKTTVTGVEMFRKLLDEGRAGENIGALLRGTKREEIERGQVLAKPGSIT PHTDFESEVYVLSKEEGGRHTPFFKGYRPQFYFRTTDVTGTIELPEGVEM VMPGDNIKMTVSLIHPIAMDQGLRFAIREGGRTVGAGVVAKIIK >MS2187 tufB, TufB protein MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLSKHYGGAARAFDQI DNAPEEKARGITINTSHVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMD GAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELV EMEVRELLSQYDFPGDDTPIIRGSALKALEGEAQWEEKILELANALDTYI PEPERAIDQPFLLPIEDVFSISGRGTVVTGRVERGIIRTGDEVEIVGIKE TAKTTVTGVEMFRKLLDEGRAGENIGALLRGTKREEIERGQVLAKPGSIT PHTDFESEVYVLSKEEGGRHTPFFKGYRPQFYFRTTDVTGTIELPEGVEM VMPGDNIKMTVSLIHPIAMDQGLRFAIREGGRTVGAGVVAKIIK >MS0263 typA, TypA protein MCGVPKIFFLSLKITDIANLFETSASPILFVKQLRTFQMAELDIHKLRNI AIIAHVDHGKTTLVDKLLQQSGTLETARNGDSDERVMDSNDLEKERGITI LAKNTAINWNDYRINIVDTPGHADFGGEVERVLSMVDSVLLVVDAFDGPM PQTRFVTQKAFAHGLKPIVVINKVDRPGARPDWVVDQVFDLFVNLGATDE QLDFPIIYASALNGVAGLEHEELAEDMTPLFEAIVQHVEPPQVELNAPFQ MQISQLDYNNYVGVIGIGRIKRGTVKPNQSVTIIDSFGKTRNGKIGQVLG HLGLQRYEEDLAQAGDIVAITGLGELNISDTVCDINAVEALPALSVDEPT VTMFFCVNTSPFCGQEGKFVTSRQILERLNKELVHNVALRVEETPNPDEF RVSGRGELHLSVLIENMRREGYELAVSRPKVIYKEENGHKQEPFEQVTID IEEQHQGAVMEALGIRKGEVKDMSPDGKGRTRLEYVIPSRGLIGFRNEFM TMTSGTGLLYSSFSHYDDVKPGEIGQRKNGVLISNATGKALAYALWGLQE RGKLMAEHGQEVYEGQIIGIHSRTNDLTVNCLQGKKLTNMRASGKDDAIQ LTTPIKLTLEQAIEFIDDDELVEVTPQSIRIRKKLLTEMDRKRANRTTTS TSTH >MS1102 tyrA, TyrA protein MEALKEIRAEIDQLDRELLEVFAKRLALVKKVGEIKHQQGLPIYVPEREA DMLAARRSEAEKMGIPADLIEDVLRRVMRESYANEHEHGFKTVNPAIKKI VIVGGKGKLGGLFGRFLTASGYFVEALGSKDWDNAKAILAGANAVIVCVP IVKTLETIERLKPYLTEDMLLTDLTSVKRRPLEKMLEIHQGAVVGLHPMF GPDIASMAKQVVVRCDGRYPERYQWLLEQIQMWGARIYQADAAEHDHSMT YIQALRHFATFANGLHLSRQPVKLANLLALSSPIYRLELAMIGRLFAQDG 
SLYADIIMDKPENLEVIESLKQSYEDSLKFFENGDREGFIKTFNKVREWF GDYSEQFMKESRQLLQQANDYRHNSL >MS1031 tyrB, TyrB protein MQITILVSIKEKLISKHNISKESPMFKNITPAPADPILGLGEAFKAETRE NKINLGIGVYKDADGVTPIMTAVKKAEGQLFENEKDKNYLPIEGVAEYNA YAKELLFGKDSEIIASNRACTVQTLGGTGALRIAAEFVRRQTKAQNVWIS KPTWPNHNAIFNAVGVTIREYRWYNPETKALDWDNLLADLNNANPGDVVL LHGCCHNPTGIDPTPEQWKALAEMSAKNGWLPLFDFAYQGLANGLEEDAV GLRTFAETHRELLVASSFSKNFGLYSERVGAFTLVADNADVAAVALTQIK SIIRTLYSNPSAHGARTVATVLANPELRKEWEDELTSMRDRIKQMRKQLV ELLKEFGAQEDFSYIIDQKGMFSFSGLTAEQVDRLKEEFAIYAVRSGRIN VAGITEANIRYLAESIVKVL >MS0762 tyrR, TyrR protein MFTVKGYDEGNYFIRSIVGKTMSKNTAKRSAHFTVNQYENFTDVVALSPK MAALVEKAKKFALLDAPLLIQGETGTGKDLIAKACHNLSARKDQKFIAVN CAGLPDTDAESEMFGRADGDKTSTGFFEYANGGTVLLDGVAELSLNLQAK LLRFLNDGTFRRVGEEQEHYANVRVICTSQISLQHYVDEGKVRSDLFHRL NVLSLQIPPLRERKEDLAVLTENFVRQISRRLGVRTPEFDGQFLQYLKDY QWPGNVRELYNALYRACSLAEHNKLTIDGLNLSENETVPLTLEQFGNESL EEIMNNFEASVLRKFYEQYPSTRKLASRLGVSHTAIANKLKQYGIGK >MS1232 tyrS, TyrS protein MSDINVVLAELKRGVDEVLSEADLIEKLKENRPLKIKLGADPTAPDIHLG HTVVLNKLRQFQNFGHEVIFLIGDFTGMVGDPSGKNKTRPPLSREDVLRN AETYKQQIYKILDPQKTRIVFNSDWLGKLGTEGMIRLASNYTVARMLERD DFKKRFTEKQPIAIHEFIYPLLQGHDSVALEADVELGGTDQKFNLLVGRE LQKSAGQKPQVAMTLPLLVGLDGEKKMSKSLGNYIGVTDAPNDMFGKIMS ISDDLMWDWYDLLSFRPLTEIAQFKEEVKNGRNPRDVKILLAKEIIARFH SEADADTAEQEFINRFQKGAMPDEMPEFTFEGEIGLANLLKEAGLVASTS EANRMVQQDGVKIDGEKVEDAKTTISASTHVYQVGKRKFARVTVR >MS1582 udk, Udk protein MSDSANSSCIIIAIAGASASGKSLIASTVHRELRDQVGSDDISIISEDCY YKDQSHLDFATRTQTNYDHPNSMDRDLLLEHLRALKAGKSVDIPQYSYVE HTRMKEVTHFTPKKVIILEGILLLTDERVRNELSLSLFVDAPLDICFIRR LKRDMEERGRSLESVIEQYRKTVRPMFLQFIEPSKQYADIIIPRGGKNRI AINMLKAQILHLLGRK >MS2318 udp, Udp protein MSEVFHLGLTKAMLKGAKVAIVPGDPARSERIAKEMENAEYLNSTREFTS WLGYMDGEPIVVCSTGIGGPSVSICVEELAQLGVRTFLRIGTTGAIQPHI NVGDILITTGAVRLDGASQHFAPLEYPAVADFACTNALYNAALSQGIRPY VGITASSDTFYPGQERYDTFSGKVYPKFQGTLKQWQDLNVMNYEMESATL FTMCAALGLKAGMVAGVIVNRTQQEIPNEATIKSTEQKAVAVAVEAAGKM AKA >MS1992 ugpQ, UgpQ protein MKHKLKTLAIGLAFATLAACSSQTAQQTTPMNNQEKLVIAHRGASGYLPE 
HTLESKALAFAQQADYLEQDLAMTKDNHLIVIHDHFLDGLTDVAKKFPNR HRKDGRYYVADFTLKEIKSLEMTENFKTENGKQVQVYPNRFPMWKSHFTI HTFEEELEFIQGLEKSTGKKVGIYPEIKAPWLHHQEGKDIAVATLKVLQK YGYTKKTDPVYLQTFDFNELKRIKTDLLPKMGMDVKLVQLVAYTDWHETE EKNAQGKWVNYDYDWMFKDGAMAEVAKYADGVGPGWYMLIDDKNSKAGDI KYTPMVADIAKTKMELHPYTVRKDALPAFFTDVNQMYDALYNHAGATGLF TDFPDLAVKFLGKDKNKQ >MS1991 uhpC, UhpC protein MENFMFGPFKPAPAIAELPADKIDSTYRRLRWQVFMGIFFGYAAFYFVRA NFDLAQKGLIEAGMYTKTELGIIGTGAGLAYGLSKFVMAGMSDRSNPKVF LPFGLLLSGLCMTLMGLIPWATSGILIMFVLIFLNGWFQGMGWPPCGRTM VHWWSKSERGTIVSIWNCAHNVGGMVPGMMVLLASAVYFSNTGVQATAKD VWQQALYYPGIAAMIAAIPVYFVMKDTPQSCGLPPIEKWRNDYPDDYNEK TYEHDLTTKEIFVNYVLKNKLLWYIAIANVFVYLIRYGVLKWSPVYLGEV KHFNIKGTAWAYTIYELAAIPGTLLCGWVSDKIFKGKRGLTGFIFMILTT IAVFALWKNPATPEAELAQYAGLPFYKNPYQLMDFILMTTVGFLIYGPVM LIGLHALELAPKKAAGTSAGFTGLFGYLGGTVSASAVVGWAADKFGWDGG FYVMITGGILAVILMFIVMVAEGKHKAKLSDHYGK >MS2284 uhpC, UhpC protein MLSFLNEVRKPTLDLPVEERRKMWFKPFMQSYLVVFFGYMAMYLVRKNFN IAQNDMIETYGLTKTQLGMIGLGFSITYGLGKTIVSYYADGKNTKQFVPF MLILSALCMLGFSASMGGSSIALFLMVAFYALSGFFQSTGGSSSYSTITK WTPRKKRGTFLGFWNLSHNVGGAAAAGVALFGAHVFFNGHVIGMFIFPSI IALIIGFIGLRYGSDSPEAYGLGKAEELFGEEISEEDKDAEQNQLTKKQI FVQYVLKNKVIWLLCFANIFLYIVRIGIDQWSPVYAYQELGFSKDAAISG FALFEVGALVGTFLWGYLSDLANGRRGLLACVALVLIVFTLEFYQFANNE TMYLVALFALGFLVFGPQLLIGVAAVGFVPKKAIAVADGVKGTFAYLIGD SFAKLGLGMIADGTPIFGLTGWDGTFAALNSSALICIGLLAFVAIAEEKK IRRLKKAEA >MS2287 uhpC, UhpC protein MNTTFSPEINRTYRYWRIHLMIAMYIGYAGFYLTRKSFNFAVPEMINNLG IDKNDIGMMATLFYITYGVSKFFSGIFSDKSNPRHFMAVGLIMTGVANIF FGLSSSVLIFTAVWIINAWFQGWGWPACSKLLTTWYSRNERGRWWSIWNT AHNAGGALIPLLIGYVTIHYSWRYGFAIAGIVAISIGLFLFWRLRNTPES LGLPSIGHWRNDELELAQEAEAPNLSWRETLNRYVFLNKYIWLLALSYTL VYIVRTAINDWGNIYLTEKYHYDLVSANSALAVFEIGGFFGSLVAGWGSD RLFSSNRGPMALIFAIGIFFSITALWLLPTENYILQTALFFVVGFFVFGP QMLIGMAAAECSHKKTPGSATGFVGLFAYLGAAIAGYPLALIMQHFHWTG FFVFIACSASGIALLLLPFLKAQN >MS0373 ung, Ung protein MQTWKDVIGTEKTQPYFQHILQQVHAARDAGKTIYPPQHDVFNAFKLTEF DQVKVVILGQDPYHGPNQAHGLAFSVLPGIVPPPSLLNIYKELENDIAGF QIPRHGYLVKWAEQGVLLLNTVLTVERGLAHSHANFGWETFTDRVIAALN 
RHRENLVFLLWGSHAQKKGQFIDRDRHCVLTAPHPSPLSAHRGFLGCHHF SKANNYLQEHKITEIDWQLDTQLS >MS1880 upp, Upp protein MKLVEVKHPLVKHKLGLMRAADVSTKHFRELATEVGSLLTYEATADLETE IVTIEGWCGPVEVQRIKGKKVTVVPILRAGLGMMDGVLEHIPSARISVVG MYRDEETLEPVPYFQKLASDIEERLAIVVDPMLATGGSMIATIDLLKQKG CKHIKVLVLVAAPEGIKALESAHPDIELYTASIDDHLNQDGYIIPGLGDA GDKIFGTK >MS1927 uppS, UppS protein MKELDLNNIPKHIAIIMDGNGRWAKQQGKMRIFGHKSGVRAVRRSVSYAC QIGVQALTLYAFSSENWNRPEQEVNALMTLFMQALDLEVKKLHKNNIKLK VLGDISGFSPKLQEKIARAETLTANNDSLTLNIAANYGGCWDIVQATRQI AQQVKDGSLTISEITEELFQRNLVTKEQPPVDLLIRTSGEQRISNFLLWQ IAYAELYFSHVLWPDFNEQEFNRAIYVYQQRERRFGTS >MS0575 uraA, UraA protein MNNNLLYSVEDKPPFGLSLLLAAQHLLAALGGIIAVPLVIGNVLKLPTED TITLVNAALLISGVVTIIQCRGIGPIGIRLPSVMGTSFTFVAAALAIGFS EYGVAGIMGASLVGSLVMIIGSFFMPYIRKLFPPVVTGTVVMMIGLSLIP VAVDWFAGGQVGDENYATPENLLMATFVLVIVVTLVQWGKGIFSAAAIVI GMMTGYVVALCLGWVSFDGVNNAQTFAVPQPLHFGLAFPISGIIGMSIAY LVTIVESSGNFLALGNATQTEITGKHLRGGVLCDGLGSALAAIMSTTPFS SFSQNIGVISLTGVASRHVVALTGVLLALAGLFPVFGALIVSIPLPVLGG AGLMMFAMIIAAGIQMLDNIPRSKRNGLIIAISIGCGLAVTTRPELLDKL PHFFKEVLGSGITVGSLLALILNLILPEDKIPENH >MS1879 uraA, UraA protein MTNQTNAPIEVQSKAKQAFVGLQMLFVAFGALVLVPLITGLNANTALLTA GIGTLLFQLCTGKQVPIFLASSFAFIAPMQYGIQTWGIAVTMGGLAFAGL VYVALSALVKMRGAGALQRIFPPVVVGPVIIIIGMGLAPTAVDMALGKNS AYSYNDAVLVSMVTLLTTLCVAVFSKGMMKLIPIMFGIAVGYILCLFLGL IDFQPVLNAPWFSLPEITTPEFKLEAILYLLPIAIAPAVEHVGGIMAISS VTGKDFIRKPGLHRTLLGDGVATTAASLLGGPPNTTYAEVTGAVMLTRNF NPNIMTWAAVWAIGISFCGKVGAFLSTIPTIVMGGIMMLVFGSIAVVGMS TLIRDKVDVTEARNLCIISVVMTFGIGGMFVNVGELSLKGISLCAVVAIV LNLLLPKAKNQME >MS1658 ushA, UshA protein MNRPFRFLKLTAALSLFSAAAMSYQADKTYQFTLLHFNDLHGHYWHDKNG QYGLAAQKTAVDRIRNEVEAKGGSVITLFAGDLNTGVPESDLQNAHPDID GLNAIGYDAMVLGNHEFDNPLQLLDMQEKWAKFPFLAANIYHKNTDKTLV KPYTMLKRSGLNIAIVGLTTEDTAKLGNPEYMKDLRFDNPISTAKKVVAE IDKQENPDVKIALTHMGYYYDGNYGSNAPGDVTMARRLEKGTFDVIVGGH SHDTVCVDAKGVFIRDYQPTQACKPDYQNGTWIMSAGEWGKFLGRADFEF KNGEVKLVRYELIPINLKKKVETAAGKTEYQLYGEQIPQDEKLLATLKTY QDKGDQLLSVKIGDVAGKLVGDRNIVRFHQTNLGRLVAEAQRRAAGADVG IMNSGGIRDSIQSGVITYRDILKVQPFGNIVSYFELSGAELIDYLNIVAL 
KEVDSGAYPQFSGISMIIDRTAKQVKEVKIQGEPINLSKTYRISLPNYNA LGGDGYPVMDKNPTYVNTYKVDAEVLKAFIAENSPIDANKFEPKGEITYR >MS0064 ushA, UshA protein MERRRFIQLGASAMLVLGTSRYVWALGDNKAQLRIIATTDVHSFLTDFDY YKDAPTEKFGFTRAASLIEQARKEVSNSILVDNGDLIQGNPIADYQAAVG AKQGKPHPAIQVYNAMKYDMGTLGNHEFNYGLDYLNEVIKQADYPIINAN VVKIGTNEPMFRPYVIQEKDILDQAGNKQKIKIAYIGFTPPQVTVWDKAN LAGKAESRDIIKTAQKYIPMLKGKGADIVIALAHTGPSDEPYHEGMENAA FHLADVKGIDAVIFGHSHRLFPNKEFEKSANTDIAKGTVKNVPESMAGYW ANNISVIDLALVEKNGKWMVVDGSAALRPIYDVTAKKATVENHEKITALL QPVHEATRKFVAQPIGQANDNMYSYLALVQDDPTIQIVNQAQKAYTENVV KNLPELAGLPVLSAGAPFKAGGRKNDPTGFTEVDKGRLTFRNASDLYLYP NTLVVLKVNGAELKEWLECSAGMFKQIDINSDKPQFLLDWEGFRTYNYDV IDGVSYQFDITQPARYDGECKLINKNANRVVNLTFNGKPVDPKAEFLIAT NNYRAYGNKFPGTGDAHIVFASPDENRQILANYISAESKSKGEVTPTADK NWRIAPIHSKVKLDIRFETSPTEKAAAFIKQNAQYPMQLVGKDEIGFAVY QIDLSK >MS0349 uspA, UspA protein MYKHILVAVDLSDESSVILKKAADIAKRHEAKLSIIHVDVNFSDLYTGLI DVNMSSMQDRISTETQQALLELSEQAGYPITEKLSGSGDLGQVLSDAIDQ YDVDLLVTGHHQDFWSKLMSSTRQLMNNIKIDMLVVPLRDED >MS0329 uspA, UspA protein MYKNILIAIDLSNLDSAKYVVDTCLKLTEDNPQAIFRVVTIIEPMDDSFI SAFLPKNFDKSVLEEANKALHEFTEKAFPKGAKVQHIVSYGTIYEEINHL ADEKNVDLIVMLASSQPNAKGLSANTVKVARNTDKPVLILR >MS1076 uspA, UspA protein MEKMEVVMKFNNILVILNPENDKQYALARAIRLVKEQKSDKPVKVTLFLP VYDLSYEMSALLSSEEREEMHKGVIEQRYQQDVLPYIEKYQDATMIEFSS KVVWNSNEAEALVAELDENTYDLVVKYTKEEESLTSILFTPIDWQLLRKC PAPILMVRDGDWKHQRRILVAVNVSGDADYHEAFNQQLVELSMDLADNLE RGNVHLVGAYPPTPINMAIDLPEFHTSEYTSGVRGQHLINMKALRQRFGI DEDHTHVLEGFPEEVIPEVADKIGAELVVLGTVGRTGLSAALLGNTAEHV ISKLKCNLLAIKPNKIED >MS1240 uup, Uup protein MGFYMSSQFVFTMHRVGKVVPPKRHILKDISLSFFPGAKIGVLGLNGAGK STLLRIMAGVDKEFEGEARPQPGIKIGYLPQEPKLDPQQTVREAIEEAVS EVKSALTRLDEVYALYADPDADFDKLAAEQAKLEAVIQAHDGHNLDNQLE RAADALRLPEWEAKIENLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDA ESVAWLERFLHDYEGTVVAITHDRYFLDNVAGWILELDRGEGIPWEGNYS SWLEQKEKRLAQEQAQESARQKSIEKELEWVRQNPKGRQAKSKARMARFE ELNSGEYQKRNETNELFIPPGPRLGDKVLEVEHLTKSYGERTLIDDLSFS IPKGAIVGIIGPNGAGKSTLFRMLSGKEQPDSGSITLGETVVLASVDQFR DAMDDKKTVWEEVSNGQDILTIGNFEIPSRAYVGRFNFKGVDQQKRVGEL 
SGGERGRLHLAKLLQRGGNVLLLDEPTNDLDVETLRALENAILEFPGCAM VISHDRWFLDRIATHILDYGDEGKVTFYEGNFSDYEEWKKKTFGAESTQP HRMKYKRIAK >MS0840 uup, Uup protein MALISLTNGYLSFSDAPLLDHADLHIEPRERVCLVGRNGAGKSTLLKIIA GDVVMDDGKIQYERDLIVSRLEQDPPSHAQGNVFDYVAEGIGHLADLLKE YHHISTLLESDYNDNLLSKLAQVQSRLEHENGWQFENKINEVLGKLELNP NTLLSELSGGWLRKAALARALVCNPDVLLLDEPTNHLDVDAIEWLETFLL DFAGSIVFISHDRSFIRKMATRIVDLDRGKLVSYPGDYDLYLTTKEENLR VEALQNELFDKRLAQEEVWIRQGIKARRTRNEGRVRALKMLREERRQRRE VLGSAKLQLDTSSRSGKIVFEVEDASYAIAGKQLLSHFSTTILRGDKIAL VGPNGCGKTTFIKLLLGELQPTSGHIRCGTKLDIAYFDQYRADLDPEKTV MDNVADGKQDIEVNGVKRHVLGYLQDFLFPPKRAMTPVKALSGGERNRLL LAKLLLKPNNLLILDEPTNDLDIETLELLEDILADYQGTLLIVSHDRQFI DNVATECYMFEGNGQLSKYVGGFFDAKQQQENALTSKMASEQAKPKKMQP ESAVEKSEISTANNNQKTIKLSYKEQRELERLPQLLEELEKMIENLQNEV GNPDFFQQSHEYTSAKLQELADKEAELENAFIRWEELEEKKKGNLS >MS0137 uup, Uup protein MIFFSNLTLKRGLNLLLEEANATINPKQKVGLVGKNGCGKSSLFSLLKKE NQPEGGEINYPADWAVSWVNQETPALNISALDYVIEGDRTYCRLQKELKL ANEHNDGNAIARIHGQLDIIDAWTVQSRASALLHGLGFSQEELGRPVKSF SGGWRMRLNLAQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLV LISHDRDFLDPIVNKIIHIEDKKLNEYTGDYSSFELQRAEKLAQQNALFR QQQDKIAHLQKYIDRFKAKATKAKQAQSRMKALERMERIAPAHVDNPFTF EFREPLSLPNPLVMIDKASAGYGEGESAVEILQKIKLNLVPGSRIGLLGK NGAGKSTLIKLLAGELTARSGVLQLAKGVQLGYFAQHQLDTLRADESALW HLQKLAPQQTEQELRNYLGGFAFHGDKVKDPVKQFSGGEKARLVLALIVW QRPNLLLLDEPTNHLDLDMRQALTEALVDYQGSLVVVSHDRHLLRNTVEE FYLVHDKQVEEFNGDLEDYAKWLNDLNVQEKSAVKNTEVSKESNNENSGQ NRKEQKRREAELRQQTAPIRKQIAKFETEMDKLTAQLTEIEVRLADSGLY QTENKEKLTALLTQQVQTRKALEEAEAHWLTAQEELETLLAE >MS0586 uvrA, UvrA protein MDVIDIRGARTHNLKNINLIIPRDKLIVITGLSGSGKSSLAFDTLYAEGQ RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT ITEIHDYLRLLFARVGEPRCPTHNLALTAQTISQMVDKVLTLPEGRKMML LAPVVKARKGEHVKILEHIAAQGYIRARIDGEICDLSDPPKLELQKKHTI EVVVDRFKVRADLATRLAESFETALELSGGTAVVADMEDAKAEELVFSAN FACPHCGYSVPELEPRLFSFNNPAGACPTCDGLGVQQYFDEKRVVQNPAV SLAGGAVKGWDRRNFYYYQMLTSLAEHYHFDIEAPYEELQKNIQQVIMNG SGKEEIEFKYMNDRGDVVVRRHPFEGILNNMARRYKETESMSVREELAKN ISNRPCSDCGGSRLRPEARHVYIGQTNLPDISEMSIGEAYSFFEKLALAG 
QKAQIAEKILKEIKERLSFLVNVGLNYLSLSRSAETLSGGEAQRIRLASQ IGAGLVGVMYVLDEPSIGLHQRDNERLLNTLIHLRNLGNTVIVVEHDEDA IRLADHIIDIGPGAGVHGGNVIAEGTAEQIMQNPNSITGKFLSGEEEIEI PQKRTAVDKKKFLHLNGAAGNNLKNVNLALPVGLFTCITGVSGSGKSTLI NDTLFPIAQNVLNRADNIEYAPYKSIEGLEFFDKVINIDQSPIGRTPRSN PATYTGLFTPIRELFAGVPESRARGYNPGRFSFNVRGGRCEACQGDGVLK VEMHFLPDVYVPCDQCKGKRYNRETLEIRYKGKTIHQVLDMTVEEAREFF DVVPMIARKLQTLIDVGLSYIRLGQSSTTLSGGEAQRVKLATELSKRDTG KTLYILDEPTTGLHFADIKQLLEVLHRLRNQGNTIVVIEHNLDVIKTADW IVDLGPEGGSGGGEIIATGTPEEVAQNPLSHTGRFLKPILAKK >MS1371 uvrB, UvrB protein MSHKINSKPFILHSEFKPSGDQPQAIEILAENLNDGLAHQTLLGVTGSGK TFTIANVIAKLNRPAMLLAPNKTLAAQLYAEMKAFFPENAVEYFVSYYDY YQPEAYVPSSDTFIEKDASINDQIEQMRLSATKSFLERRDTIVVASVSAI YGLGDPDSYLKMMLHLQTGAIIDQRQILVRLAELQYTRNDQAFQRGTFRV RGEIIDIFPAESDDRAVRIELFDDEIERLSLFDPLTGTGFGAVPRFTVYP KTHYVTPREQILDAIEKIKSELADRREYFIKENKLLEEQRITQRTQFDIE MMNELGYCSGIENYSRYLSGRNEGEPPPTLFDYMPSDALLVIDESHVTVP QIGGMYRGDRSRKETLVEYGFRLPSALDNRPLRFEEFERLAPQTIYVSAT PGPYELEKSGTEIIDQVVRPTGLLDPEIEIRPVSIQVDDLLSEARQRADR NERVLVTTLTKRMAEDLTDYLDEHGIRVRYLHSDIDTVERVEIIRDLRLG EFDVLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARN LKGKAILYADRITNSMEKAITETNRRREKQMKYNEEHGITPQGLNKKVGE LLDIGQGGSNKSRNKPRSQKAAEPATTYAIPMTAKEYQQQIKKLEQQMYK FAQDLEFEKAAAIRDQLHKLREQFVENG >MS0937 uvrC, UvrC protein MFDSKKFLANVTHDPGVYRMFDDKDTVIYVGKAKDLKKRLSSYFRANLSS KKTEALVASICRIETTITTSETEALLLEHNYIKTFQPRYNVLLRDDKSYP YILLTKERHPRITSHRGSKKVTGEYFGPYPHAGAVRETLSLLQKLFPIRQ CENSVYANRSRPCLQYQIGRCLAPCVSGYVSDEEYNQQVGYARLFLQGKD QQVLDHLIGKMERASRALNFEEAARYRDQIQAVRSVIEKQFVSNERLDDM DIIAIAYKLGIACVHVLFIRQGKILGNRSYFPKVPENTSLSELTETFVGQ FYLQAHQGRTIPNSIIVDRKLEEKAELESLLTDQAGRKVSIQDNIKGNKS KYLHLAQMNAQAALALQLKQSSLIHERYKELQQLLGIEKIHRMECFDISH TMGQQTIASCVVFNEEGPLKSDYRRFNIEGITGGDDYAAMEQALKKRYDK DLELEKIPDIIFIDGGKGQLNRALKVFHELQVKWDKNRPHLIGVAKGVDR KVGLETLIISKQEREINLPADSLALHLIQHIRDESHNHAISGHRKKRQKA FTQSGLETIEGVGAKRRQALLKYLGGMQGVKNATQDEIASVPGISVALAE KIFEALHH >MS0413 uvrD, UvrD protein MKLNPQQQQAVEYTSGPCLVLAGAGSGKTRVIINKIAYLIEKCGYLPKQI AAVTFTNKAAREMKERVAHSIGKELSKGLIVSTFHTLGFDIIKREYKHLG 
FKANMTLFDEHDQMALLKELTEDYLQQDKDLLRELISVISNWKNDLIMPA QAAKIARDEKQQTFAKCYERYANQIRAYNALDFDDLIMLPTLLFKTNEQV RSKWQEKIRYLLVDEYQDTNTSQYELIKLLVGSRAKFTVVGDDDQSIYSW RGARPQNMVRLRDDFPNLQVIKLEQNYRSTQRILHCANILIDNNQHVFDK KLFSTIGEGEKLQIIEAKNEEHEAERVVGELIGHRFTNKTKYKDYAILYR GNHQSRLLEKVLMQNRIPYKISGGTSFFSRLEIKDMMAYLRLLVNQDDDA AFLRIVNTPKREIGAVTLEKLGSLANEKHISLFEAIFDFELIQRVTPKAY NALQTFGRWIVELSDELVRSEPERAVRSMLAQIHYEEYLYEQAVSPKAAE MQSKNVATLFDWVNDMLGGDEFNEPMTLNQVVTRLTLRDMLERGEEDDES DQVQLMTLHASKGLEFPHVFLIGMEEGILPHQTSIDEDNVEEERRLAYVG ITRAQRTLRFTLCKERRQFGELLKPEPSRFLLELPQDDLQWERDKPPMTE EQKQEKAVANIANLRAMLKRN >MS1368 uvrD, UvrD protein MMDISELLDGLNDKQREAVAAPLGNYLVLAGAGSGKTRVLTHRIAWLIAV EGISEGSIMAVTFTNKAAAEMRQRIESTLSQHSSRRLFGMWVGTFHSIAH RLLRAHYLDANLPQDFQILDSEDQLRLLKRLLKLHNYDEKMFPAKQACWY INNKKDDGLRPHQIDDNNDKQEREWINIYRIYQDTCDRAGLVDFAEILLR AYELFLKKPVILQRYRQRFQQILVDEFQDTNKIQYAWIRLLAGETGNVMI VGDDDQSIYGWRGAQVENIQRFLDDFHKAKTIRLEQNYRSTGNILQSANQ LISNNSNRLGKDLWTEGDKGEPVGIYAAFNELDEALFVSSQIKIWWEDGG ELNDCAVLYRSNSQSRVIEEALIRAQIPYRIYGGMRFFERQEIKDALAYL RLIANRQDDAAFERVINTPTRGIGDRTLDVLRNLTREREITLWQATQLAI GENKLAGRSATALLRFCELINSLAQETEEMPLFAQTDFVIKHSGLYEMYK QEKGEKGEVRIENLEELVSATREFIKPDDAEDMSDLSAFLTHASLEAGEE QASPHQSCVQMMTLHSAKGLEFPRVFMVGVEEGLFPSFMSLEEPGRLEEE RRLAYVGITRAKQKLTICYAESRRLYGKEERHIPSRFINELPQECIQAVR LRGTVTRAYNQSAVGSVKISPLNDSGWKTGQKVKHGKFGTGTVINVEGSD NNTRLQIAFQGQGIKWLIAHLANLEKL >MS0530 uxaA, UxaA protein MKQFIKIHRQDNVAVALQDLASCTLLDVDGQQIELKENIGRGHKFALTAV AKNQDVVKYGYPIGHALRHIEPGEHIHTHNVKTNLKDINDYKYLPESTAL SEQMPDREVQIYRRKNGDIGIRNELWIVPTVGCVVGIANLIKKRFMQLND LNDIDGVFTFNHSYGCSQLGDDHENTKTMLQNMVKHPNAGAVLVIGLGCE NNQVGAFKESLGEYDENRVKFMISQHYDDEVEQGVELLQQLYREMRQDRR ETGKLSEVKFGLECGGSDGFSGITANPMLGHFSDYLISHGGTTVLTEVPE MFGAERILMSHCKDEETFQKTVDMVNDFKKYFIEHNQPIYENPSPGNKAG GITTLEDKSLGCTQKAGHSQVVDVLKYGERLTIQGLNLLSAPGNDAVATS ALAGAGCHMVLFSTGRGTPYGGFVPTMKIATNSELALKKKHWIDFDAGRL VYDMSMQELLKDFINLVVEIVNGKPTKNEINEFRELAIFKSGVTL >MS0695 uxaA, UxaA protein MGENYMNTQQKALYIKVNPTDNVAIVVNSNGLPAGSQFEDGVTLIEHIPQ 
GHKVALVDIPKDSEIIRYGEIIGYAVKDIKQGSWIDESLVTLPKAPPLET LPLATRKAPKLEPLEGYTFEGYRNKDGSVGTKNMLGITTSVHCVAGVVDY VVNIIEKELLPQYPNVDGVVGLNHLYGCGVAINAPAAIVPIRTIHNIALN PNFGGEIMVIGLGCEKLQPQRLLEGTVDTYPIELKDATIMSLQDERHVGF EAMIKEILETAKQHLEKLNRRKRETCPVSDLVVGGQCGGSDAFSGVTANP AVGFAADLLVRAGATFMFSEVTEVRDAIHLLTPRAETVEVGKRLLEEMKW YDDYLDMGQTDRSANPSPGNKKGGLANVVEKALGSIAKSGSSNIVEVLSP GQRPTKKGLIYAATPASDFVCGTQQLASGITVQLFTTGRGTPYGLKAVPV IKLATRTDLANRWFDLIDIDTGTIATGKETIEQVGWRIFHEILDVASGRK QTWSDKWGLYNQLSVFNPAPVT >MS0544 uxaC, UxaC protein MKNTPALLMVSYCRLTAVFPLTAAFKEDLPMKQFMDEDFLLSNDVARTLY YDYAKDQPIFDYHCHLPPKEIAENRQFKDLTEIWLAGDHYKWRAMRSAGV DENLITGNASNYEKYQAWAKTVPLCIGNPIYHWTHLELRRPFGITNTLFN PQSADKIWQECNELLQQPEFSARGIMRQMNVKFSGTTDDPIDSLEYHKAI AEDRDFDIEVAPSWRPDKAVKIELPQFNDYIKQLEQVSDTEINGFDSLKK ALSKRLDHFDKRGCKSADQGMEIVRFAPVPDEKELDRILQLRRNEQPLTE LQISQFSTALLVWLGAEYCKRNWVMQMHIGALRNNNTRMFKLLGADSGFD SIADRTFAEQLSRLLDAMDQNNQLPKTILYCLNPRDNEMIATMIGNFQTG GIAGKIQFGSGWWFNDQKDGMERQLQQLSQLGLLSQFVGMLTDSRSFLSY TRHEYFRRILCEMIGRWVVNGEAPNDMNLLGNMVKNICFDNAKAYFK >MS0537 uxuA, UxuA protein MEQTWRWYGPNDPVSLADIRQAGATGIVNALHHIPNGQVWSVEEIEKRKA IIEAAGLTWSVVESVPVHEEIKTQTGNYKTWIENYKQTLRNLAQCGIDTV CYNFMPVLDWTRTDLAYELPDGSKALRFDQIAFAAFELHILKRPGAEQTY TAEEQKQAKAYFDKMSDADIKQLTSNIIAGLPGAEEGYTLEEFQGQLDRY KDISPEKFRTHLAYFLNEIIPVAQEVGIKMAVHPDDPPRPILGLPRIVST IEDMQWYVDTCDLPANGFTMCTGSYGVRADNDLVKMTEKFGDRIYFAHLR STCREDNPLTFHEAAHLQGDVDMFNVVKALLTEEYRRKANGETRLIPMRP DHGHQMLDDLKKKTNPGYSAIGRLKGLAEFRGLEMALKKVFFEK >MS1468 vacB, VacB protein MFQNNPLLSQLKQQLHDSKPHVEGVVKGTDKAYGFLETEKETFFIAPPAM KKVMHGDKIKAAIETIGDKKQAEPEELIEPMLTRFIAKVRFNKDKKLQVL VDHPNINQPIGAAQAKTVKQELKEGDWVVATLKTHPLRDDRFFYAQIAEF ICSAEDEFAPWWVTLARHEQSRYPVQGQEVYSMLDTETRRDLTALHFVTI DSENTQDMDDALYIEPVTAPNDEQTGWKLAVAIADPTAYIALDSQIEKDA RKRCFTNYLPGFNIPMLPRELSDELCSLMENETRAALVCRLETDMQGEIV GEPEFILAQVQSKAKLAYNNVSDYLEQVENAWQPENESTQQQINWLHQFA LVRINWRKKHGLLFKEKPDYSFVLADNGHVREIKAEYRRIANQIVEESMI IANICCAHYLAKNAQTGIFNTHVGFDKKFLPNAHNFLMANLSNEENQQEL AERYSVENLATLAGYCRMRHDIEPIEGDYLEFRLRRFLTFAEFKSELAPH 
FGLGLTGYATWTSPIRKYSDMVNHRLIKACLANRECVKPSDETLARLQEA RKQNRMVERDIADWLYCRYLADKVESNPEFRAEVQDCMRGGLRVQLLENG ASVFVPASSIHPNKDEIQVNTDELALYINGERRYKIGDIVNIRLTEVKEE TRSLIGNLV >MS0473 vacB, VacB protein MARKTTKKTTALLDPNYQQELEKYGNPVPSRDFILQVIREHNTPMSREEI LKVFAIQDDERVEGVRRRLRAMENDGQLVFTKRNCYVLPEKLDLLRGTVI GHRDGYGFLQVEGVKEDLFIPNTQMKRVMHGDYVLAQREGLDRKGRREVR IVRVLEGRKKQIVGRFFLEEGIGYVVPDDSRINRDILIPNENRLGARMGQ VVVVELKPRTASFSQPVGIITEILGDNMAKGMEVEIALRNHDIPHTFPPE VEKQIKKFTEEVPEEAKSGRVDLRSLPLVTIDGEDARDFDDAVHCRREQD GWHLWVAIADVSYYVRLRSALDTEARNRGNSVYFPNRVVPMLPEILSNGL CSLNPQVDRLCMVCEIKLSDKGVMKDYQFYEAVMNSHARLTYTKVARILE GDEELIERYQELVPHLQELHNMYNKLLEARHQRGAIDFETIESKFIFNEM GRIESIEQVVRNDAHKIIEECMIMANIAAANFMERHQEPALYRIHAGPSE EKLISFRSFLAECGLSLEGGMKPSTKDYAKLLEQVKERPDAELIQTMLLR SLSQAVYNADNIGHFGLALEEYAHFTSPIRRYPDLTLHRGIKYLLAKAQG VKRKTTDTGGYHYSLDEMDVLGDHCSMTERRADDATRDVADWLKCEYMQD HVGDEFEGIISSVTGFGFFVRLKDLFIDGLVHISTLDNDYYRFDAAGQRL IGENSGAVYRIGDIVKVRVEAVSLEQRQIDFALVSSERKPRREGKTAKDN AKKTMRYAESFAKQRKKAAATSKGKKKSAVKKSKNSVNKKANKKRTY >MS2239 vacJ, VacJ protein MKKITLIATALFAGSILTGCATIDPATGERQDPLEGFNRTMWSFNYDVLD PYVLKPAAKGWQALPSPLTTGLSNVAKNLEEPVSFVNRLLEGEVKKAFVH FDRFFINSTFGLGGLIDWASYSDPLKIENDRTFGDTLGSYGVEPGAYVML PAYGASSPRELTGTAVDTAYTYPFWHWVGGAWSLVPTVVKAVDKRAKAMD KEELLNQAQDPYITFREAYYQNLEYRATDGNVKAKDSGLSQDDLNSID >MS1556 valS, ValS protein MTQKLQMADRFDASAVEQALYNHWEQKGYFKPSYDAGRPSYSIAIPPPNV TGSLHMGHAFQQTLMDTLIRYHRMQGDNTLWQAGTDHAGIATQMVVERKI AAEENKTRHDYGREAFIEKIWDWKAYSGGTISQQMRRLGNSIDWERERFT MDEGLSEAVKEVFVRLHEEGLIYRGKRLVNWDPKLHTAISDLEVENKESK GSLWHFRYPLAKGAKTAEGLDYLVVATTRPETVLGDTAVAVHPEDERYQS LIGKTVVLPLANREIPIVADEYVDREFGTGVVKITPAHDFNDYEVGKRHN LPMVNVMTFNADIREEAEIIGTDGQPLTTYEAEIPQDYRGLERFAARKKV VADFDSLGLLEKIQPHDLKVPYGDRGGVPIEPMLTDQWYVSVKPLAETAI KAVEEGEIQFVPKQYENLYYSWMRDIQDWCISRQLWWGHRIPAWYDEQGN VYVGRSEEEVRSKNGLNSSVALRQDEDVLDTWFSSALWTFSTLGWPQQTK ELAMFHPTNVLITGFDIIFFWVARMIMMTMHFIKDENGKPQVPFKTVYVT GLIRDEQGQKMSKSKGNVIDPLDMIDGIDLESLLAKRTGNMMQPQLAEKI AKATKKEFPEGIQPHGTDALRFTLSALASTGRDINWDMKRLEGYRNFCNK 
LWNASRFVLTNDKLDLSTGERELSLADKWIQAEFNKTVQNFRNALDQYRF DLAATELYEFTWNQFCDWYLELTKPVFANGTDAQIRAASFTLVNVLEKLL RLAHPLIPFITEEIWQKVKDFAGVEGETIMTQPFPAFDEALVNDEAVAQI SWIKEVITAVRNIRAESNIAPSKGLDLLLRNLPDTEQKTLENNRTLMQIM AKLDSVKVLAQDEEAPLSVAKLVGSAELLVPMAGFINKDTELARLNKEIE KLIGEVKRIEGKLGNEAFVAKAPEAVIAKEREKMQDYQEGLEKLRAQYLS IENL >MS0675 vanY, VanY protein MGFGAENLTGKSRSHLLNLPCPLSNNHFLQPQALKAFQALQKSAVKNGFN LQPASTFRDFARQQLIWNGKFNGERKVHDDQGNPLDLTALSCWEKAQAIL RWSALPGASRHHWGTEIDFFDPDLLPQHQQLQLEPWEYEQDGYFFELSRF LQQNLPQFDFVLPFMQTPKGKEIGREPWHISYLPLAEKLEKQFTPEILLN AWENEDIAGRQTLIAHLPEIFERFIY >MS1094 vapI, VapI protein MLSPRKISEIATGKRPITADVAVRLALFFGTDAESWLNLQSHYDIKKSEE EIKTDIESILDSSIDGYLNI >MS0417 wbbJ, WbbJ protein MATEKEKMLAGLAHLPMEEHLSALRLQTKELLFDFNMLRPSNKLEKTHLL RKILGKAGKNIHVNSPFHCDYGCNIEVGDNFFANYHCVILDNGGVKIGND VMFAPNVSLYTVGHPLDAELRNQGWEQAKPIIIGNNVWIGGNVVILPGVV IGDNVVIGAGSVVTKDIPANSLALGNPCKVLRQITAADREYYQQTFMQNN >MS1499 wbbJ, WbbJ protein MTYYQHPSAIIDEGAEIGEGSRVWHFAHICGGAKIGKGVSLGQNVFVGNK VRIGDHCKVQNNVSVYDNVYLEEGVFCGPSMVFTNVYNPRSLIERKSEYK DTLVKKGATLGANSTIVCGVTVGAYAFVGAGAVINRDVPDYALMVGVPAK QIGWMSEYGEQLELPLSGQAETKCPHTGAIYRLEGHELKKL >MS2128 wbbJ, WbbJ protein MVGRNAHPTSKGNMMDYTLNLPLNQLIAQNSELFSKIHQVVDKNAPLVAE LNSGFRTQNEIRAILNEMTGTEIDASFHVNLPLYTDFSAHIRIGKRVFIN TAVMLTDLGGITLEDDVLIGPRVNIITVDHPIDPAQRRGVIVKPVVIKKN AWIGAGATILAGVTVGENAIVAAGAVVNKDVPANTIVGGIPAKLIKEI >MS0664 wcaA, WcaA protein MKVSLAVPVFNEEDTIPLFYQKVRNYEELQAYDVEIVFINDGSSDKTEEI ITALSAQDPLVQAVQFSRNFGKEPALFAGLEYSTGDVVIPIDVDLQDPIE VIPELIKEHQKGFDVVLAKRVDRQTDSWFKRKTALWFYKLHNQISKPKIE ENVGDFRLMSRRVVEAIKQLPERQLFMKGILSWVGFDTAVVEYNRAERVA GTTKFNGWKLWNFALEGITSFSTFPLRLWTYIGLFISACSFLYGSILILG KLIWGNTVPGYPSLMVAILFLGGVQLIGVGVLGEYIGRIYSESKQRPRYI VKTQKGNNNE >MS0902 wcaA, WcaA protein MKFSVLMSLYIKEKPEFLRASLQSLAEQTLPADEVVLVLDGEITPELEKV LDEFKEKLPFTFVPLVQNMGLGKALNEGIKVARNEWLFRMDTDDICYPER FAKQAEYIERHPDVVLFSTQIAEFDNDPAQIISVRRVPVGYEEIVRFNKM RSPFNHMTVAYKKSVLQEVGGYQHHLFLEDYNLWNRIIATGYQVGNLPDI LLYARTNGDAMIGRRRGLTYAKSEWKLYKLKRQLHIQGAVSGFLTFLMRT LPRLMPVSLLKNLYKLMRK 
>MS0903 wcaA, WcaA protein MFSIIVPSYNRNTEVNALLASLENQTVKNFEVIVVDDCSQNFIKIDRTFS FPVTLIRNETNSGAAQSRNVGANTAKNDWLLFLDDDDRFADNKCEVLAKT IVENPQANFVYHPAKCNMVNEGFSYVTSPLPPAQLTLDNMLLANKIGGMP MLGIKKDFFFELGGLSTELKSLEDYDFVLKLVSNPNLKAILVDQPLSICS FHTKRASVSTNTANTEKAIEIIRANYVKTVRQTHNFSLNALYMLAYPNAM NLSRKAATYYFEMFKKSHSLKHLIIAVVTFISPSLAINLKRFV >MS0438 wcaA, WcaA protein MPTISVAMIVKNEAQDLAKCLDTVKDWVDEIVILDSGSTDETREIALSYG AKFYTNTDWQGFGKQRQLAQQYVTCDYVLWLDADERVTPELRHSIQSAVE KNEDSTLYQIPRLSEVFGRKIRHSGWYPDYVLRLYKTHVAQYGDELVHEK VHYPVNVKVEKLTGDLEHYTYKDVYHYLIKSAGYGKAWAEQKAAAGKSTS LFNAVTHALGCFVKMYILRAGFLDGKQGLLLAILSANSTFNKYADLWVRT KTK >MS1480 wcaG, WcaG protein MRSVSIVGLGWLGLSLARHLKNLGWDVKGSKRTHEGVEQMRLMRFETYFL ELTPEINADPDDLTNLLSVDTLIINIPPSEYFFDPKLYVEGIENLVNEAL LCNISHIVFISSTSVFPNVSANFDEESVPQPDSEIGRALLEVEQRLFELK DIDVDIIRFAGLVGYDRHPVYSLVRKESAISGGNTPINLVHFDDCARAIQ LLLEMPGYQRLYHLAAPKHPSKVEYYTKMATKLGLNPPQFLCDEKDPQRI IKADKICRELDFVYQYPDPDEFI >MS0146 wcaG, WcaG protein MIIVTGGAGMIGANIVKALNDMGRKDILVVDNLKDGTKFINLVDLDIADY CDKEDFISSVIAGDDLGDIDAVFHEGACSATTEWDGKYLMHNNYEYSKEL LHYCLDREIPFFYASSAATYGDKTDFIEEREFEGPLNAYGYSKFLFDQYV RAILPEANSPVCGFKYFNVYGPREQHKGSMASVAFHLNNQILKGENPKLF AGSEHFLRDFVYVGDVAEVNLWAWENGVSGIFNLGTGNAESFKAVAEAVV KFHGKGEIETIPFPDHLKSRYQEYTQANLTKLRAAGCDFKFKNVAEGVAE YMAWLNRK >MS1492 wcaJ, WcaJ protein MIKRLFDIVVALIALILFSPLYLFVAYKVKQNLGSPVLFKQTRPGLHGKP FEMIKFRTMKDGVDENGNILPDAERLTPFGKMLRATSLDELPELWNVLKG DMSLVGPRPLLMEYLPLYNERQAKRHEVKPGITGYAQVNGRNAISWEQKF ELDAWYVEHQSLWLDLKIIAKTIQKVIAKDDINAADDATMPKFEGNKKS >MS0662 wcaJ, WcaJ protein MCECTLQMICKGTISMKKILFCKFTLALTDFLSLSLSIVLAFYSLELWTG ELSRYVPTEQIEERFYIHIILSFIGMGWYWIRLRHYTYRKPFWFELKEVL RTLFILGIIELAIVAFSKLYFSRYLWVLTWLIALIIVPTCRVVMKKILIK SGWYLRDAIIIGSGQNAVDAYNALMSESYLGFKIKYFITSHENKAIETLD VPSINENAQELWKAATNKSDQFIIALEEDEADSRDAWLRYFSSHKYRSVS VIPTLRGLPLYSTDMSFIFSHEVMLLRVHNNLAKRSSRFLKRTMDILGSL TIIILLSPILLYLYFSVKKDGGNAIYGHPRIGRNGKTFKCLKFRSMVVNS KEVLEELLERDPEARAEWEKDFKLKNDPRITKIGAFIRKTSLDELPQLFN VLKGEMSLVGPRPIVKEELERYQDDVDYYLMAKPGMTGLWQVSGRNDVDY 
DTRVYFDAWYVKNWSLWNDIAILFKTVNVVLNRDGAY >MS1501 wecC, WecC protein MGYIFMNLNSYKVGVIGLGYVGLPLAVEFGKHRFTVGFDISPNRVKELSE GKDRTLEVSSEALKSVTHLSFTSDLEQLKQCNFFIVTVPTPIDDVNRPDL TPLQKASESIGKVLKQGDIVVYESTVYPGATEEVCIPVLEKVSGLKFNQD FFAGYSPERINPGDKVNTLTKIKKITSGSTPEVADIVDQMYASIIEAGTH KASSIKVAEAAKVIENTQRDLNIALVNELSIIFERVGIDTLDVLEAAGSK WNFLPFRPGLVGGHCIGVDPYYLTHKAEEVGYNPQVILAGRRINDNMSQY VAQETIKLMLNNNIDVAHAKVGILGVTFKENCPDIRNSKVVDVVTELKNW GVEVVVADPWADAQEVKHEYGLDLSSVDENNPVDALIVAVGHKEFRDLSA ETLRSYLRTSKPVLADVKSLFDRDALAQQGITVFRL >MS0323 wecD, WecD protein MKIFKAEQWNLEVLLPLFEEYRLSHGMVENPERTFTFLNNRIRFSESIIF IATNERQQAIGFIQLYPRLSSLQLQRYWQLTDIFVQDVANQNEIYAGLIE KAKEFVCFTHSTRLVVEQDQQHQGIWEKEGFKLNTKKALFELKL >MS2102 wecD, WecD protein MMQTIDQFIAQYIPAAYALNLRVVESSPQRVVIKAPFECNSNHHHTIFGG SQALLATLSAWSLVYLNFPEANGNIVIRSSQIRYLKPAPSDIIAVSICPD SLAMNLAKQMLTQKGKAKITIQCQLYCDDIIVSEWTGEFVLSHTPF >MS1489 wecE, WecE protein MVGITDVFRFLTLYIKRFIMLNTAFEPWPSFTQEEADAVSRVILSNRVNY WTGTECREFEKEFAAYVGTKYAVSLTNGTVALDLALKALNIGAGDDVIVT SRTFLASASSIVTAGANPVFADVELDSQVISRRTIEAVLTPNTKAIICVH LAGWMCDMDPIMDLAKEKGIYVIEDCAQAHGAMYKGKSAGSIGHIAAWSF CQDKIMTTGGEGGMVTTNDKGLWNKMWSYKDHGKDFDTVYNKQHPPGFRW LHNSFGTNWRMMEVQAVIGRIQLTRMADWTAKRINNMERILNAFDGSPYF SVYRPNQDYVHAAYKCYVQVNPQALPAGWSRDRIMQVINEQGVPCYSGSC SEVYLEKAFDNTPWRPAKPLQNAKSLGETSLMFLVHPTLSEDSLVKTCAA ITQVIHALSENK >MS1498 wecE, WecE protein MEFIDLKAQQQRIKAQIDAGIQKVLAHGKYILGPEVAELEEKLAAYVGAK YCITCANGTDALQIAQMALGIGAGDEVITPGFTYIATAETVALLGAKPVY VDVDPKTYNIDAEKLEAAITPRTKAIIPVSLYGQCADFDAVNAVAKKYNL PVIEDAAQSFGASYKNRKSCNLTTISCTSFFPSKPLGCYGDGGAIFTNDD ALANVIRQVARHGQDRRYHHIRVGVNSRLDTLQAAILLPKLAILDDEIAA RQRVAENYTRLFNQAGVNTTPFIEAHNQSAWAQYTIQVDNRADVQEKLKT LGIPTAVHYPIPLNKQPAVADSRIHLPVGDLIAERVMSLPMHPYLTPEEQ QKIVQSLV >MS1487 wza, Wza protein MHRQVITLSVSLRMLFGPNSIRMINFYYRLNMNKLIKSLLLTGLGLSLSS CSVFLPDSHKSPISQRPAQVVGKNVDLAKAIDAYLITPQLLKTLTPVNAS AQSNLSLDNELKNYQYRVGVGDVLNVTVWDHPELTTPAGSYRSSAEAGNQ VHANGTMFYPYAGNIKVAGLTVGQIRSRLTKALSNYIAEPQVEVNVASFQ SQKAYVTGEVKSPGQQFITNVPLTLLDAINKAGGLADNANWHNVTLTRNG 
RDEVISVEALIQRGDLTQNRLLKSGDIVHIPRNDTMKIFVVGEVVQSQLL TIGRNGMTLTEALSASGGIDKLSSDATGIFVIRGQRGKQEFVQDNNGEKI EKVANIYQLDVTNPTAYILGTEFYLQPHDVVYVTTAPVSRWNRVISQVVP TISGFNDLTEGVLRIRTWP >MS0746 xerC, XerC protein MMKDSALIELFLNELWLGKGLSDNTVQSYRLDLTALSQWLQGQGKSLETL DSSDLQAFLGERVDQGYKATSTARMLSAMRKLFQYLYQESYRTDDPSAIL SSPKLPGRLPKYLTEQQVGDLLNAPSTDIPLELRDKAMLELLYATGLRVT ELVTLSTDNINLEQGVVRVIGKGNKERIVPMGEEASYWVGQFILYGRPML LNGQSSDVIFPSKRALQMTRQTFWHRIKHYAILADIDTDSLSPHVLRHAF ATHLVNHGADLRVVQMLLGHSDLSTTQIYTHVAKERLKRLHEKYHPRG >MS0523 xerC, XerC protein MQTYLQKYWNYLRNERQVSSYTLTNYQRQMDAVMKILQENDIQNWRQVSP SVVRFILAQSKKSGLHEKSLALRLSALRQFLAFLVLQGELKVNPAIGISA PKQGKHLPKNINAEQLNKLLDNNSKEPIDLRDKAMLELMYSSGLRLSELQ GLNLTSLNFRSREIRVLGKGNKERILPFGRHASHSVQEWLKVRLLFNPKD DALFVSSLGNRMSNRSIQKRMEIWGVRQGLNSHLNPHKLRHSFATQMLEA SSDLRAVQELLGHSNLSTTQIYTHLNFQHLAEVYDQAHPRAKRRK >MS1850 xkdP, XkdP protein MLKKEGKMGLFDFVGNIGKKIFNREDEASKAVTEHIAEDNPGVENVNVTV ENGVAKLEGSAKSASALEKAILMAGNIAGITSVKADGVNILNGEVLAGDD EFYVIQKGDTLWAIAEKHYGNGIKYKAIVEANKEVIKDENKIFPGQKIRL PKSL >MS0560 xseA, XseA protein MMNENIYSVSQLNYSVRQLLEGQLGLVWLTGEISNFSQPVSGHWYLTLKD ENAQVRCAMFRMKNMRVAFRPQNGMQVLVRANVSLYEPRGDYQLIIESMH PAGEGFLQQQFEALKIKLAAEGLFAQNLKKNLPHFAKTVGIVTSPTGAAL QDILNILQRRDPSLKIIIYPTAVQGKDAANEIVQMIELANLRNEADVLIV GRGGGSLEDLWCFNEETVARAIFRSSIPVISAVGHETDVTIADFVADVRA PTPSAAAELVSRNQQELFQQLQYKRQRLEMALDRLFNEKQQHLQRFLLRL QNRHPSARLLAQRQQTGQLEHRLNSAIRRLLDKNHYKLTALCERLEKNPL PYLVRQQNYHIVQLATNLDFALKRLIVSKQTSLSALCGKLDGLSPLKVLA RGYSIAETEQGETISSVNQVETGDKIKTRLRDGVIVSKVI >MS1061 xseB, XseB protein MARKPKESSTVDFETTLNQLETIVTRLEAGDLPLEEALKEFENGIKLAKL GQERLQQAEQRIQILLQKSDTAELTDYQPTDE >MS1048 xthA, XthA protein MYFINRNNMKIISFNINGLRARPHQLDKIVEQYQPDIIGLQEIKVADEMF PHELVDHLGYHVYHHGQKGHYGVALLCKQAPKAVHKGFSTDTEDAQKRLI MADFETAFGALTVVNGYFPQGESRDHETKFPAKEKFYADLLNYVKNEHNP ESNIIIMGDMNISPTDLDIGIGEDSRKRWLRTGKCSFLPEEREWYQRLYE CGLEDTFRKLNPWTNDKFSWFDYRSKGFAENRGLRIDHILANSKLAERCV DTGIALDIRAMEKPSDHAPIWATFK >MS2373 xylA, XylA protein MTNYFDKIEKVKYEGADSTNPFAYKHYNANEVILGKTMAEHLRLAVCYWH 
TFCWNGNDMFGVGSLDRSWQKMSDPLAAAKQKADIAFEFLTKLGVPYYCF HDVDIAPEGNSYQEYVRNFNTIVDILEQKQAESGVKLLWGTANCFSNPRY MSGAATNPNPEIFTRAAAQVFNAMNATKRLGGENYVLWGGREGYETLLNT DLRREREQIGRFMQMVVEHKHKIGFSGTLLIEPKPQEPTKHQYDYDVATV YGFLKQFGLEKEIKVNIEANHATLAGHTFQHEIATAAALDILGSIDANRG DPQLGWDTDQFPNSVEENTLAIYEILKAGGLTTGGFNFDAKIRRQSINPY DLFHGHIGAIDVLALSLKRAAKMVEDHTLQNIVDQRYAGWNGELGQQILA GKSSLEALAQAAQNLDPNPVSGQQEYIENLVNGYIYR >MS2329 xylB, XylB protein MTILNIAAVDLGASSGRVMLASYSTENHKISLEEIHRFKNQFVSQNGHEC WDLAYLENEIVNGLRKISNSGRTLHSIGIDTWGVDYVLLDQNGEVVGPTY AYRDHRTDGVMQKVQAELGKEVIYRKTGIQFLTFNTLYQLKAMTDENPAW LSQVKDFVMIPDYLNYRLTGVINREYTNATTTQLVNVNIDSWDTALLDYL GLPASWFGRIRHPGHQVGLWENRVPVMSVASHDTASAVISAPLSDENAAY LCSGTWSLMGLDTTTPCTDECAMNANITNEGGIDGHYRVLKNIMGLWLFN RLCTERDVTDIPALVKQAEAELPFQSLINPNAECFLNPSSMVEAIQQYCR EHNQVIPKTTAQLARCIFDSLAMLYRKVALELAGLQGKPISALHIVGGGS QNAFLNQLCADLCGIDVFAGPVEASVLGNVGCQLMALDQIHNAAEFRQLV VKNFPLKQFKKRPHFMPASDFEEKWCEFCALN >MS1609 xylB, XylB protein MNYYLGIDCGGTFIKAALFDENGNIRACERENVAVISEQSGYAERDMPEL WQACAEVIRRTVKSSEIPPHLIKSVGISAQGKGAFLLDKDKRPLGRAILS SDQRSLPIVKRWQAEGLEQQIYPISRQSLWTGHPVSILRWLKENDVPRYD QIRHLLMSHDYLRFCLTGELYCEESNISESNLYNMATGQYEPQLAHLLGI EEIMDKLPPIVKANQIAGFVTEQAAQACGLTAGTPVVGGVFDVTAMTLCL ADNQPHKLNVVLGTWSIVTGISNEIDDKQALPFAHGRYVEAGKFLIQEAS PTSAGNLEWFVKQWKLDYQQINQMVAALPPAQSAVIFLPFLYGTNAKLGM TAGFYGMQAHHSQAHLLQAVYEGVLFSLMIHLERMCQRFPQVQTLRVTGG PAKSAVWLQMLADLTGKTLEIPEVEEIGCLGAALMAAEGVNGDSLALKQH QALRVIQPNPANFDAYQHKYRQYRKLTKLLEQMA >MS0048 xylB, XylB protein MNYYLGIDCGGTFVKAALFDETGNLQGIARENVPVISDKAGYAERDMPQL WQVCAEVVRKTIAESKISPDLIKGVGISAQGKGAFLLDQNQQPLGRAILS SDQRSLDIVKQWQKEGIPEKLYPLTRQTLWTGHPVSILRWVKEHEPERYA RIGSVLMSHDYLRFCLTGELHCEETNISESNLYNMEKGEYDPVLADLLGL KGIIEKLPPVIQSNRIAGYVTEQAAKVSGLAVGTPVVGGLFDVVSTALCA GLEDETKLNTVFGTWCVVSGITDHIDPNQSLPFVYGRYAEENKFIVHEAS PTSAGNLEWFVKQWNLDYQRINEEIASLPPAASSVLFVPFLYGSNAGLGM QAGFYGIQSHHCQAHLLQAIYEGVLFSLMYHLERMLKRFPATKVLRVTGG SAKSEIWMQMLADFTGMTLEIPEIEETGCLGAAVIAMQALNNNLTVTEIL NKGIKVVRPNPDNFDLYQKKYQRYVTLTAALKAML >MS2372 xylB, XylB protein 
MYIGIDLGTSGVKAVLLDESQKIIATTQHPLPISRPHPLWSEQNPKDWWY ATNLAMLALAQQQNLSAVKAIGLTGQMHGATLLDKQNNVLRPAILWNDGR SSAECEELEKLVPRSRKITGNLMMPGFTAPKLKWVDKHESAIAEKISKVL LPKDYLRLMMTGEYASDMSDASGTMWLDVAKRDWDKSLLNACGLDENNMP KLFEGNQITGYLHADLAKNWKMNAVPVVAGGGDNAAGAIGIGLYQTGQAM LSLGTSGVYFVVTDKFTANPQKAVHSFCHALPDRWHLMSVILSAASAVDW VKKATGIADIQTLFQKAEKSAVNSEAIFLPYLSGERTPHNDAYAKGVFWG LSHNDDQTTMAKAVIEGVSFALADGIDVLHETGVTADNIALIGGGAKSAY WRQLLADISGRTMEYRTGGDVGPALGAAKLAQIALNPHEDIADFCQPLPL EAVYHPNAERTAYYAEKRAKYAELYQRVKGL >MS1411 xylB, XylB protein MHTNVKKLIESGQACLGIELGSTRIKSVLIDTSGNILAKGGFEWENHFVD GIWTYPLDEVWAGIAQSYRDLCQDVQAKYAVKLNRLAAMGVSAMMHGYLP FDAQDNQLAEFRTWRNNTTATAADQLTELLQYNIPQRWSVAHLYQAVLNQ EPHTKEVAYITTLAGYVHWQLTGQKVLGIGDASGMFPIDSTTKDYDAAMV ETFDRLIADKNLNWKLSGLLPKVLVAGENAGVLTEKGAKLLDPSGNLQAG CVLCPPEGDAGTGMMATNSIKVKTGNVSAGTSAFAMVVLEKPLSKVYRDL DMVTTPAGDPVAMAHSQNCSSDLNAWFQLFGEVLRSFDVSFTQDQLYGKL FDKALDAAPDAGGLLSYCFYSGEHGVDLTTGCPLFLHPANAEFSLANFVL VQLYTSFGAMKLGMDTLTQKEHVKIEKIFAHGGLFKSKAVAQHVLAAALN VPVAVLETASEGGAWGIALLAAYTRQAPLSLTEYLDLVVFADSADNVVEP EPARVKGYASFIRRYQDGLSIEREAADFANKK >MS0059 xylB, XylB protein MNMQDAKNLIASGGASVGIEFGSTRIKAVLISTDGTILASGGFTWENHFI DGIWTYPQSEIWQGLQQAYRDLANQVQEQYGITLTRAKAIGISGMMHGYI PFDKQGNQLVAFRTWRNNITAKSSQKLTALFNYNIPQRWSISHLYQAILN QEEHVGEIDYLTTLAGYVHWQLTGEKVLGVGEASGMFPIDPQTGSYYQIM LNQFDGLIAAQSYPWKIANILPQVLTAGQPAGHLTEQGAKLLDPTGRLQA GIPFCPPEGDAGTGMVATNSIKTNTGNISAGTSAFAMIVLEKELSKVYEQ LDMVTTPTGKLVAMAHANNCSSDINAWIRLFGETLKAFGAEVETDKLYET LFRKALEGDADCGGLLAYGFYSGEHSVGLAEGCPTFMHPANSRFTLANFI RTHLYSAFGAMKLGVDILIQQEKVNIAQILGHGGIFKTPNVASKILASAI NVPIAVMKTANEGGAWGIALLANYLDAHKNGQSLDDYLEQCIFSQAEVSV SYPDSETSKGYEEFIEYYKRGISVVQAAVNTFNE >MS1562 yajC, YajC protein MDAQQGSPMSMLIIFAIFGLIFYFMIYRPQAKRNKEHRQLMSQLAKGTEV LTSGGLVGKITKITADSDMVVIALNETNEVTIKRDFIVAVLPKGSIKSL >MS0481 yidC, YidC protein MDSRRSLLVLALLFISFLVYEQWQMDYNTPKPVATEQAQAVSSNAEMPAS TSSTEGTVDNVAQGKIISIQNDVFTLKVDTLGGDVVESSLTNYAAELNSD ARFILLQNKPNEVYVAQSGLIGKNGIDTKAGRAAYQVEAEQFTLADGQNE LRVPLTLEKDGVIYRKVFVIKAGSYDIEVNYEIQNQTNEAIEVQPYGQLK 
HTLVQSSGSMAMPTYTGGAYSSADTNYKKYSFDEMKDKNLSIDTKAGWVA VLQHYFVSAWIPNQDADNQLYTSTANGLGFIGYRGPVVNVPAGGSETIKS ALWTGPKLQNEMGAVANHLDLTVDYGWAWFIAKPLFWLLNVIQSIVSNWG LAIIGVTIVVKGILYPLTKAQYTSMAKMRMLQPKLQEMRERFGDDRQRMS QEMMKLYKEEKVNPLGGCLPLLIQMPIFIALYWTFMEAVELRHAPFFGWI QDLSAQDPYYILPILMGASMFLLQKMSPTPVADPMQQKIMNFMPLIFMVF FLWFPAGLVLYWLVSNIITIVQQQLIYRGLEKKGLHSRKK >MS1768 zipA, ZipA protein MDLNTILIILGILALVALVAHGLWSNRREKSQYFENANTFGRANRQDEPI STPESYKQARNVAPAFTQPKQAVIHQEPPVQQPLNTEPEPITQETPVRAE PQSVDQIKITLPNVEPAPAESAPIYEMRPSRRNTEPQYYQQPSEPYYQQP VQQNLARQTIADIEATVDPNEGVNSSSEYLRTQLQEASQEGNQIFTQSPL SRAPLQQPIEFDQPAQQEKESDNNEDEDVSFVMLYVAAAENRQFQGTVLV QALEDLGFSLGEDNLYHRHLDLTVASPVLFSAANITQPGTFNPYTLHEFF TDGVAIFMRLPSPGNDRTNLKIMIRSAKTLAQQLGGFVLTEQQELFTDAA EEEYLAKIK >MS0888 zntA, ZntA protein MQQTVQIDDRAEQTTLMLEGMSCAACVRKVEKALLAVPEVAAAQVNLAEN TALVYGNGDVELMLQAIENAGYHAEVVEDENSRREKQNVQAEHEINQRKW QSIVALIVGFGLFFWGIFGGTTVATAENHWNWVMVGLIVLVTMLCTGWHF FERAWKNLLKGNATMDTLVALGTGVAWLFSMFISLTPDFFPDGSRHLYFE SGVMIIGLINVGKMLEAKAKQRSSKALERLLDLTPKTTRIIDELGEREIP LKDVKTGMRIRLQTGDRVSVDGIVLQGSGWIDESMLTGEPIPVQKQEGDK ISAGTLVTDGALQFRAEQVGNKTMLANIIRLVRQAQSSKPEIGQLADKIA GIFVPVVICIAVFAALIWYLVGPEPQISYALVVLTTVLIIACPCALGLAT PMSIIGGVGRAAELGILVRNADALQKASNVDTIVFDKTGTLTKGEPKVTA LLTFNHFDESVALEYAASIEQGANHPIAKAVLSLAQEYNLNLEHEPEDFR TLKGLGVSAKVDQQNILLGNYTLLQQHNIDASAANRFFQTESEKGATVIF LAVNNVLAGVFAIRDPLREDSVEAIQRLHKQGYHLIMLTGDQEKTAQAIA KEAGIDQVIAGILPEGKANVIRQLQEQGGKVVMVGDGINDAPALAQADVS IAIGSGSDIAIETSELTLMRHSIHAVADALALAKGTLHNMKQNLFFAFVY NVICIPVAAGVFYPLFGFLLNPMFGGAAMACSSITVATNANRLLKFQPKD >MS0887 zntA, ZntA protein MAIFLALSELSCGHCIKSVTKAIEAISGENSAEVTLNYAKINSDKDPQLF IDAITAADFKAQLATPSFELELDGLNCGHCIKSVSKALSAVENMEVFDVD LKKARIYGNAKPEDVVKAIVDAGFNARLARLV >MS1457 znuA, ZnuA protein MRFLELSTSYLLNNNFIFIKRGIFMANRLILKQVLLAAILSSAACAVNAK VVASIKPLGFIASSIADGVTETDIVVPTGASPHDYSLKPTDVKKLKSADL LLWIGEDVDSFLAKSIGALDNRKVITIAKLDSVAPLLGKATHHHKEHDHE HHHADHEDEHQHDHDEHDKTGLSTNWHIWYSPEISKMVAAELAEKLTERF PAQKELIAENLLDFNRTLNERSDKIKLQLAAVKDKGFYVFHDAYGYFNDA 
YGLKQTGYFTINPLVSPGAKTLATIKEEIAEHKVSCLFAEPQFTPKVIES LSKGTGVHVGRLDPMGDKIQLGKRSYADFLQFTADSYGECLAK >MS0789 znuB, ZnuB protein MFEIIFPAWLTGILLSFITAPLGAFVVWRKMAYFGDTLSHSALLGVALGI CLDINPYLSILILTLILAVAMVWLESNTQFSVDTLLGIIAHSCLSLGVVT VGLLQNVRVDLMGYLFGDLLAVSYEDLIYIGIGVLIVLISLIYFWKPLIS TTVSPELAQVEGINIRRMRFILMILTALTIALSMKFVGALIITSLLIIPA ATARRFARTPESMAIIAVGLSIVAVSLGLTLSAFYNTAAGPSVVICSSFI FLISLLKKEKI >MS0790 znuC, ZnuC protein MFSTFFKVRDISSMQINSIKIPLIELQNIKVVFGAKTALQNINLSIYPNT VITIVGPNGGGKSTLLKVLLKLLSPTDGKVIHHRDLRIGYVPQKIHLEQS LPITVEKFLSLKKGISKAEIQDAIELLSIKHLIHSSMQKLSGGEMQRVLL ARALLNKPNLLVLDEPMQGVDLSGQIELYQLIHQTREKLNCAILMVSHDL HIVMADTNEVVCINRHICCAGSPEKVSNDPTFIHLFGDQFSQNVAFYTHH HNHQHNMHGDVCCIGNKHSVQCINNGR >MS0016 zwf, Zwf protein MKAENNCIVIFGASGDLTHRKLIPALYNLYKIGRLSENFSVLGVARTELS NEKFREKMRSALIEHEKADGDELNNFCSHLYYQAVNTSDAADYVKLIPRL DELHDKYKTCGNTLYYLSTPPSLYGVIPECLAAHGLNTEEFGWKRIIVEK PFGYDMKTAKELDVQIHRFFDEHQIYRIDHYLGKETVQNLLVLRFSNGLF EPLWNRNYIDYIEITGAEELGVEQRGGYYDGSGAMRDMFQNHLLQVLAMV AMEPPAIINADSMRDEVAKVLYCLHPLEPNDLQHNLVLGQYAATQLNGER VKGYLEEKGVPPDSNTETYMALRCKIDNWRWAGVPFYVRTGKRLAARVTE VVIHFKKTPHPVFSQEAPENKLILRIQPDEGISMRFGLKKPGAGFEAKEV SMDFRYSDLNGASLLTAYERLLLDAMKGDATLFARTDAVHACWKFVQPIL DYKANGGRVYEYEAGSWGPVEADKLIAQHGRVWRKPSGQMKKKV