Gene list
Applied filters:
COG category: Amino acid transport and metabolism
Organism: Mannheimia succiniciproducens MBEL55E, MBEL55E
Gene type: CDS
Number of genes found: 219
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Mannheimia succiniciproducens MBEL55E, MBEL55E >MS2300 unknown MVLMDSLSKKVVYHKIVNAERVIYYRKAINELREKDYKIQSITCDGRRGL LKDILNTPIQMCQFHQVAIVIRRITRKPKSEAGKELKILIKTLKTSSKNK FYINLHHWYLKHKNFLNERSSIPDKAGKYPFKHRNLRSAYSSLKRHEEFL FTFEKYPELKIEKTTNRLEGLFSELKRKLALHNGLSKKNKIMFIKDFLNE KS >MS0296 unknown MSHLIVKEQKTIIRNAFFTFLYFTLAIGLAIGILYTDIFYLQNMIEEESL VEYTQSLSLTILTLMFSRHAYRSPQWRGGFVLITGFFLCMLIRESDALFD NLIRHGSWAYFAIITALVCIIYAFTHRQSTIDGLAQFAKQKEFHSFIIGL LTVLLASRLIGYGGLWRFILYNDYPHIVKNIIEETTELFGYLIMLFSCLS LTRHFK >MS1082 unknown MIAFGYITALSLSYFLLAPDFKGLSFTEYFIQSEAKPIFLTLGLLLPIGF IVMSKAVEYGGIVRTDAAQRLALFLQIIAAVILFGETLNNMRVGGVIVAF FALFCLLTKPTKSIENALKAVFALAAVWLIWGVTGILFKKIALMGGAFPT TLFVTFSIAAVLMFTYLLIKRTFWNASSLVGGIILGCLNFGNILFYIYAH QYFKENPTIVFATMDIGVICLGMIVGALVFKEKISKINMLGIVLGITAIL LLRV >MS0071 unknown MFRVVGEINMKKYISDKNFLAGFIFFFVSAFYLISAFQIETKNLVSVEAD FMPIIYGSLLLTTSIVLMITSFFKIRNTVVNKENKETDWKRIFSVIGLVF VYVLLMQYIGFIVTSIPFLFCLSVLLTPLYIKKNYIVYSIFSIVLPILAY FLFSYYLNLTMPSGFLF >MS1348 hypothetical protein MERHPMVERVDYPGLASSKDYELKQKYTPNGLCGVLSFELKGDKQTAMKW LDSLQIISREVHVADIRSCALHPATSTHRQLSDEEMRAANITPGFIRLSI GIENPEDLLADLQNAFDQIK >MS0343 unknown MKKLILATALSSVAAFTQAQIVPNANSATHTYEFTQSYDLQVPKGSSGET KLWVPLPFSNDYQDVKSVEFDGNYQQAYITENNQYGAKTLFALWDKDAQK RDLKVKLVVTTKDREPMKQGLLENYQAPENIEYSVDVQQYLKPTQHIKTD GIVKQFADKIVGKESNPLKKAEMIHQWIVNNMERDNSVLGCGDGDVEKIL TTGVLKGKCTDINSVFVALARASGIPAREIFGIRLGQAVKMGEYSKGAFG SAKDKVANENGGQHCRAEFYLAGFGWVPVDSADVAKYRLTENKSVEDKDT QAVSQYLFGNWEANWMGFNHARDFNLYPMPELAPLNNFGYPYAEVGGDPL NSYDAKKFGYEFTSKEL >MS1642 unknown MEPISLPEYAGSTLRGAFGRALRKIACMTKQADCKGCPLYRSCPYTNIFE TPAPTSHELQKFSQVPNGYIIEPPEWGEKIYLTGTELRFNLALFGRLIEQ LPLIAFAFKRAFEYNVGRGKAHLVDIAKFSQNMTACQSILKEGNIIEHEK QIILPESLPNYLTIQIETPLRIQENGKPLRENQINADRFFIGLAKRISLL SEFHHQPLNLDFELIKNDLQAVKYEKNLTWLDWTRYSSRQDQKMKLGGVV GSWQFENLSPELIQLLYFGQWLHCGKNATFGLGKYRITNL >MS2122 unknown MSRALNFVMISPHFPTNFETFAVRMREKGINTLGIADTPYEQLSETLRNN LTEYYRVDNMEDYEQVYRAVGYFAHKYGRIDRVESHNEYWLELDAKLRTD FNVFGYKNDDMLAIKTKAQMKEVFRKSGLKVAKGRVFKDDEDARKLAKQL KFPVIVKPNSGVGASDTYKIKSAVELEDFFGYKNPNVEYIMEEFIDGDIV TFDGLTDHDGKIVFYSSLEYSEAVLDTVEKDGDMFYYVPREISPKLVKLG EQCVEAFNVRERFFHFEFFRVKKSGELLPLEINCRPPGGLTIDMWNYAND FDVFREYANVVTENKFYSDITHPWNVVYISRKANQNYVNSIDDVCQKFGD NIISVQTVPGVFAKVMGEHGILVRTKTIEQMREIVQFAQAKQ >MS2266 unknown MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS >MS0665 unknown MNKYLKSDFIFSLFLSIAIMFICLYFEKSFFFVDDAQNEFLPFTRQIGNV WLNGEIPFILKNTFIGSNTMIDIHRAIFLPQNIFLSILSVKITSLKIISI IAAFINLFVMSFSALKLSEAFSLTKAAGIVLAFLFCINPIFLYFYLESWW NAAAGQAWFVASLASVAWLMRAFSIKRLLLNVITVLSIFASGWPHSVLVY GFLALIFSIFLYLNKRHNDLILFVLISFSIILIAIPLYSEYVISGDLINR QSFKFNNVGNFLSTTLNQLLLTFNVTYYHFMHRYGGYSITHIPMGYSSIY ILLLICFGSLKNIARNPNSLFLLVLCTVFFILTQTPTEIGPFRYPFRFTP YFSEVLTMLSIFSLEKLGIVKTRARVFLVVLLLSISLLLSIFSLEENFGK YAILQFLFFAVTTWYVVRYNSISLKSGLPYTAFIFLLMLLAKDSVIGYLS FPDLKNSINMENNYSQGGYILSLTNGKRPKNNLEDLNSTHFMLYGLKSIN GASPVGNKYISKTISTRSSQAFFNAKETILGLSKTYKDKCYFDLFGIDTV ILNKKDNSSLISQKLSDCGFSERKVKSHDVIYFLRNDFNAKGSVSYHSDT LSINQQISLKNNSEFYQLSGLKGDELIFNRVYWYGYRAYINDKEIPLLNY DGLLRIILDHDYQNGVLRLEYFPKSWKYALLIALSGFLLLLFSVGYMQRM RKWVSLN >MS1235 unknown MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS >MS1347 hypothetical protein MTAPPASWGSEILPIPIALTDLFHIQQRKITMKFETQCLHAGYSPKNGEP RVQPIVQSTTYTYDSAESIGKLFDLQEAGFFYTRLANPTTNAAEEKLAAL EGGVAALCTASGQAATFYALMNLVESGDHFISTTNIYGGTYNLFAHTFRK MGVEVTFVNQDDNLDELRKAIRPNTKAVFGETISNPTLRVLDIEKFAALA QAANAPLIIDNTFATPYFCRPFKYGANIVVHSTSKYLDGHAVALGGAIID GGNFNWEQEKFRQFSQPDITYHGLVYTRTFGKAAYAVKARVQLMRDLGAT PAPQNSFLLNLGMETLPLRMKQHYANAQAVAE >MS1493 unknown MNINVISIFKLLLLFALGLVILSPALSTQIGVPRLDSALCFLFFFLAVIT PFLRDMETDFFKLQFPVYVLFFFGFLSVLNAFSTEKLVDLFFFGIVMFLF HYSFLTFNRGDGEAGIRHLLLGISLIVLAGFFIEALLGFQLVSGNEELTV TDKAFKGFFFNTNDQSVIMISLAVAVGFFYIIRENNWKIKLIGYALIFIM GLAIVISASRSVLLSYLIMLMLILFLNASAYFKAVYLFFACVIALFIFNL SWLQEVFILLAKIDWLERPIERFSLVIFSMGDDKSVGYRTEIYTTFLDNF KILWLGYGPRDYIQYFDQIKLSFPLGYTNPHSFFIELYLAFGIFAFLAFI YFLLNSIIYVMNTRLLAWKERIFILFVFINFCWIVWVPSSILRLPLVWYP LFLVLVYTVLVKNGTFVSPKLVGRRRSS >MS0001 unknown MKGGDMSAIKDRLKDIDCALSDLERERKEILLDAGAPEIIGLKDDINALT VSLEYIDDEILPLLQQLSIDPDAYKYLSEDIKLSLLRDLPESVSAIKAII NKLTPVKHCIESFNHQNDIGF >MS0353 alsT, AlsT protein MSLETILSSIDSFIWGPPLLILLSGTGLYLTLRLGFLQIRHLPRAFAYMF KKEEGNHQRGDVSAFQALCTALSATIGTGNIVGVATAIQAGGPGAMFWMW LVALLGMSTKYAECLLAVKYRVRDKNGFMAGGPMYYIERGLGIKWLAKLF AVFGVLVAFFGIGTFPQINAITHAMNDTFSVPVTISAAIITILVAAIILG GVKRIAAVSSYIVPFMAVLYVTTSLIILLINADKVPSALALIIESAFNPE AALGGALGFTVMKAIQSGVARGIFSNESGLGSAPIAAAAAHTKEPVRQGL ISMTGTFLDTIIVCSMTGLVLVITGAWQSSDMAGAAVTNYAFSQGLGTNI GATIVTVGLLFFAFTTILGWCYYGERCFVYLVGIKGIKLYRTAFIILVAC GAFIKLDLIWILADIVNGLMAFPNLIALIGLRKVIVSETKDYFMRLKTNN YSLDDNEEQIVNS >MS0767 alsT, AlsT protein MNAKRYFGVLNDFVIMVEQGIHWLVDNVEGPLWDATIVILLGVGLFFTIT TGFVQIRLFPHSLREMWFGREVQGDSLTPFQAFATGLASRVGVGNISGVA TAIALGGPGAVFWMWLTALIGMSSAFAESSLAQLFKIKEADGSFRGGPAY YITQGIGSRWLAAAFAIALIFTFGFAFNAVQSNSIVEATRNAWLWDEHYV GMGLVLLTALIIFGGIKRIGKFSARIVPVMALVYLLIAVSILLIHYDRIP SVISLIIRSAFDFSAMAGGVFGAMLSKAMLLGIKRGLFSNEAGMGSAPNV AATADVKHPASQGLIQMLGVFVDTMVVCTCTAIIILLSDNYGGEQLQSIS LTQNALKYHMGEFGLHFLAFILLLFAFSSIIGNYAYAESNIRFIKNNPVV VNLFRAMVLFFVYFGAVNSGGIVWAFADTVMAVMAMINLVSLIILSPIVW LLLKDYHRQAKQGIVPVLDIMLHPRLLKLRLDQRLWNRR >MS2050 ansB, AnsB protein MKLTKLALTMSLGLGVSFANAAELPNITILATGGTIAGSGATSVSSSYKA GQLTVQTLIEAVPEMKDLANITGEQVVNIGSQDMSDEVWLKLAKTINAKC NETDGFVITHGTDTMEETAYFLDMTVKCEKPVVLVGAMRPATEKSADGPL NLYNAVVVATDKKSAGRGVLVAMNDKVLGARDVTKTSTTAVETFNSPNFG SLGYIHNSKVDYERSPESKHTTATPFNVDNLTALPKVGIVYAYSNMPTEP LKALLDAGYEGIVTAGVGNGNVNQANSAILEKAAKDGVAVVRSSRVPTGY TTRNGEVDDNALGFAASGTLNPQKARVLLQLALTQTKDINKSNNILMISK SGRST >MS0754 argB, ArgB protein MRSTELVQWFRQSTPYVNMHRGKTFVIMLDGNTIASSNFINIINDISLLH SLGIKLIIVYGARVQINSLLAQNNVTSVYHKNIRVTDPRTLELVKQAVGQ LSYDITARLSVRLPHSPVLNVVSSNFILAQPIGVDDGVDYMLSGKIRRIE IDNIKHHLDNNAIVLLGPIAPSVTGETFNLPFEEIATQVAIKLKAEKLIG FSSTQGILDPQGISIPDLLPQDAAKYLNQYIQQGEYHCSQARFLQAAIEV CKAGVKRSHLLSYEEDGSLLQELFTRDGVGTQLSVDNSEDIRIATVQDIP GLIELIHPLEQQGILVKRSREQLEMDIANYTIIDRDGVIIACAALNQYPE ENMAEMACVAVHPDYRSSSRGDILLEAIQKRARQLGIEKLFVLTTRTVHW FQERGFRLANVEDLPKEKRDHYNYQRRSKILIQPLNEEE >MS0236 argB, ArgB protein MKPLVIKLGGVLLDTPAAMENLFTALADYQQNFARPLLIVHGGGCLVDDL MKRLNLPVQKKNGLRVTPADQIDIIVGALAGIANKTLVAQAAKFKLNPVG LCLADGNLTQATQFDPELGHVAMVVAKNPALLNNLLGDAFLPIISSIAVD DNGLLMNVNADQAATAIAALINADLVMLSDVDGVLDANKQRLTELNSAQI EQLIEDKVITDGMIVKVNAALDAAKILNCGVDIANWKYPEKLTALFAGEI IGTRINP >MS0235 argC, ArgC protein MAQKAIVIGASGYTGAELARILTHHPEFELAGLYVSTNSADANKSISTLY PQLKTICDLPLQPLPEDLTEIAQNADLAFFGTAHEVSANLAPVFLQNNCK VFDLSGAYRVNSESFYQEFYGFEHKHPELLKQAVYGLAEWNADKIKTTDL VAVAGCYPTVSQLSLKPLIEEGLLDVNQLPVINAVSGVSGAGRKASLTSS FCEVSLNAYGVFNHRHQPEIATHLGTDVIFTPHLGNFKRGILATITAKLK AGVSDEQIKRAYAKYYANKPLVRVYEQGLPSIKAVEFSPYCDIGFATKNN HIIIVGAEDNLLKGAAAQAVQCANIRYGYNEVLGLI >MS0829 argD, ArgD protein MTITTPVKAVLASNQYFLDRQNAMESNVRSYPRKLPFAYAKAQGCWVTDV EGNEYLDFLAGAGTLALGHNHPVLIQSIKDVLDSGLPLHTLDLTTPLKDA FTEELLSFFPKDQYILQFTGPSGADANEAAIKLAKTYTGRGNVIAFSGGF HGMTHGALSLTGNLGAKNAVQNLMPGVQFMPYPHEYRCPFGIGGEAGAKA VERYFENFIEDVESGVVKPAAVILEAIQGEGGVVPAPVSFLQKVREVTQK HGILMIVDEVQAGFCRSGKMFAFEHAGIEPDIVVMSKAVGGSLPLAVLAI KKEFDAWQPAGHTGTFRGNQLAMATGYASLKIMREENLAQNAQQRGEYLT QALRELSKEFPCIGNVRGRGLMMGIDIVDERKPQDAAGAYPQDGELAATI QKFCFKNKLLLERGGRNGNVVRVLCAININQAECEEFIKRFKQSVTDAIK AVRG >MS0782 argD, ArgD protein MSQYTRKTFDEVMIQNYVPADFIPVKGKGCKVWDQQGRDYIDFTSGIAVN ALGHCPDEIVDVLKKQGETLWHSSNWFTSEPTLELASKLVEHTFAERVMF ANSGGEANEAALKLARRYAVDNYGYQKDTIISFKKSFHGRTLFTVSVGGQ AKYSDGFGPKPAGIVHLPFNDLDAVKAMIDDHTCAVIVEPIQGESGIIPA TKEFLQGLRRLCDENNALLIFDEVQTGVGRTGYLYAYESYDVVPDILTSS KALANGFPISAMLTTTKIAASFKPGVHGTTFGGNPLACAVGAKVIETIAN PAFLENVQKTSALFISELNKLNEKYHLFNEVRGQGLLIGGGIN >MS0783 argD, ArgD protein MILVAGPNVLRFAPALNISQQEVAEGFKRLDQALQKFA >MS0233 argE, ArgE protein MKRLPKFLDMYSQLIALPTISALEPEFDQSNKALIELLADWLATLGFKTE IIPVENSRAKYNLLATYGEGEGGLLLAGHTDTVPCNEELWTTNPFKLTER DGKFFGLGTADMKGFFAFVIDAVRQIDLTKLTKPLRILATADEETTMLGT RTFIRHTHIRPDCALIGEPTSLRAVRAHKGHVGKAVRIIGKSGHSSDPAK GINAIELMHEATGYLMQMRNELRDKYHHDAFEIPYPTMNFGAIHGGDAVN RICGCCELHFDIRPLPKMRLEDLDEMLQQKLAPMFEKWGDRISIEALHEP TPGYECEHSAQVVQVVEKLLGEKCEVVNYCTEAPFIQELCPTLVLGPGSI EQAHQPDEFLSAEFIEPTRDLLTKMIMHFC >MS0674 argE, ArgE protein MKNTIINLAQDLIRRPSISPDDQGCQQVIAERLTKLGFNIEWMSFNDTIN LWAKHGTTSPVVAFAGHTDVVPTGDENQWNYPPFSAQIVDDMLYGRGAAD MKGSLAAMIVAAEEYVKANPNHAGTIALLITSDEEAAAKDGTVKVVESLM ARGENIDYCLVGEPSSAKQLGDVVKNGRRGSITGDLYIQGIQGHVAYPHL AENPVHKATKFLTELTTYEWDNGNEFFPPTSLQIANIHAGTGSNNVIPGE LYVQFNLRYCTEVTDEFIKNKVAEMLQKHDLTYRIDWNLSGKPFLTKPGK LLNAVVESLESVAGIKPKLDTGGGTSDGRFIALMGAEVVELGPLNATIHK VNECVSCRDLATLGEVYRQMLVNLLGK >MS1555 argE, ArgE protein MSVNMKRIQTIIEKLASISSVPGELTRLAFSAEDEAAHNYLIELCKPYDL SIRRDQVGNLFIRKSGIEDHLPAVTFGSHIDTVVNAGKFDGPLGSVGGLE ILFQLCEQGVQTRYPLELIIFTCEESSRFNYATLGSKLMCGIANRESLSR LRDKQGNSLEEAMATIGLDFTEVDQVKRNAEEFKCFFELHIEQGPRLANE RKTIGVVTGIAAPIRCIVKIQGQADHSGATAMHYRRDALLGGAELALAIE RAAIDAGHSTVATVGNLNAKPGVMNVVPGYCELLVDIRGIHSEARESVFT VLQQQIEQVTAKRGLSIELQLISKDQPILLPDQMVQQISRAAQDLGYAYE IMPSGAGHDAMHMATFCPTGMIFVPSKNGISHNPLEFTSWEEIEAGIKVL QLVVLEQAEKV >MS1073 argF, ArgF protein MPFNLKNRHLLSLVNHSPREIKYLLDLARDLKRAKYAGTEQPRLKGKNIA LIFEKTSTRTRCSFEIAAYDQGANVTYIDPTSSQIGHKESMKDTARVLGR LYDAIEYRGYKQETVEELAKFSGVPVFNGLTDEFHPTQMLADVLTMIEHS TKPLNEIKYVYIGDARNNMGNSLLLIGAKLGMDVRICGPKSLLPEENFVS ICEEISKETGARLTVTDDIDLAVKDADFVHTDVWVSMGEPIEAWGERINL LMPYQVNTDLMKRTGNPNVKFMHCLPAFHNCETKVGREIAAAYPNLANGI EVTEDVFESPMNIAFEQAENRMHTIKAVMVASLA >MS1479 argG, ArgG protein MSNTILQNLPLGQKVGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPD EDDYNAIPKKAMAYGAENARLIDCRKQLAQEGIAAIQCGAFHISTGGVTY FNTTPLGRAVTGTMLVAAMKEDDVNIWGDGSTFKGNDIERFYRYGLLTNP NLKIYKPWLDDQFIDELGGRFEMSQFLIANGFDYKMSVEKAYSTDSNMLG ATHEAKDLEDLSTGIKIVKPIMGVAFWDESVEIKPEVVTVRFEEGVPVEL NGKRFDDVVELFMEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMAL FHIAYERLVTGIHNEDTIEQYRINGLRLGRLLYQGRWFDPQALMLRESSQ RWVAKAITGEVKLELRRGNDYSILDTVSPNLTYEAERLSMEKVEDAPFDP IDRIGQLTMRNLDVTDTRNKLGIYSEAGLLTAGKDAVVPQLGSK >MS0237 argH, ArgH protein MALWGGRFTQAADQRFKDFNDSLRFDYRLAEQDIEGSVGWSKALVSVGVL TTDEQQQLERALNELLIEVRSNPQAILQDDAEDIHSWVESKLIDKVGNLG KKLHTGRSRNDQVALDIKMWCKAQVTELQYAVRDLQAKLVETAENNQHAV MPGYTHLQRAQPISFAHWCMAYVEMLERDYSRLADAYNRMDSCPLGSGAL AGTAYPVDREQLAKDLGFAFATRNSLDSVSDRDHIIELLSTASLSMVHLS RFAEDMIIFNSGEADFVELSDRVTSGSSLMPQKKNPDACELIRGKAGRVI GSLTGMMVTVKGLPLAYNKDMQEDKEGIFDALDTWHDCLTMAAFVLEDIR VNVERTREAALKGYSNATELADYLVAKGVPFRDSHHIVGETVVYAIKVHK GLEDLSIEEFRQFSDVVGEDVYPILSLQSCLDKRSAKGGVSPLRVAEAIA DAKARIAAKK >MS1575 aroA, AroA protein MEKLTLTPISHVEGTVNLPGSKSLSNRALLLAALAKGTTRVTNLLDSDDV RHMLNALKQLGVNYSLSEDKSVCEVQGLGKAFAWQNGLALFLGNAGTAMR PLTAALCLANADSVPAEIILTGEPRMKERPIKHLVDALLQAGADVQYLEQ EGYPPLAIRNTGLKGGKVKIDGSVSSQFLTALLMAAPMAERDTEIEIIGE LVSKPYIDITLNMMKIFAVDVDNQNYQRFVVKGNQQYQSPNIFLVEGDAS SASYFLAAGAIKGKVRVTGVGKNSIQGDRLFAEVLEKMGAKITWGEDYIE AERGELNGIDMDMNHIPDAAMTIATTALFAQGETVIRNIYNWRVKETDRL SAMATELRKVGAEVEEGEDFIRIQPPASDQFKHAEIETYNDHRMAMCFAL VALSNTAVTICDPKCTAKTFPTFFDEFSAIATV >MS1968 aroB, AroB protein MVCVNVELKERRYPIYIGENLLTDTGVYPVKMGDKVMIVSNPTVAQYYLT PVTETLEKLGCQVSHVLLPDGEKYKTLDSLNMIFTALLKENHGRDTTLIA LGGGVIGDVTGYAAASYQRGIRFIQIPTTLLAQVDSSVGGKTAVNHELGK NMIGAFYQPCTVIIDTRTLVTLPKREVNAGLAEVIKYGAILDLPFFEWLE AHIDNLVALNQQDLQYCIARCCQIKADVVARDETEKGDRALLNLGHTFGH AIETHLGYGNWLHGEAVAAGSMMAAVLSEKLGDLSYSEVARLEKLLARAN LPTVSPDTMQAEDYLPHMMRDKKVLAGKLRLVLLKTLGQAYVASDTDKSL VLDAIRVCSQNN >MS0866 aroC, AroC protein MAGNSIGQLFRVTTFGESHGIALGCIVDGVPPNMALSEADIQPDLDRRKP GTSRYTTPRREDDEVQILSGVFEGKTTGTSIGMIIKNGDQRSKDYGDIMD KFRPGHADYTYQQKYGIRDYRGGGRSSARETAMRVAAGAIAKKYLREQFG VEVRGFLSQIGDVKIAPQNISEIDWAQVNDNPFFCPDQSAVEKFDELIRQ LKKDGDSIGAKLTVVAENVPVGLGEPVFDRLDADLAHALMGINAVKAVEI GDGFAVVEQRGTQHRDEMTPQGFLSNHAGGILGGISTGQPIIATIALKPT SSITVPGRTVNLNNEPVELITKGRHDPCVGIRAVPIAEAMTAIVLLDHLL RHRAQCGLK >MS0133 aroE, AroE protein MQLKSSLIVNQHLRLEQITEDDAEPVFRLICRQRDYLSRWLPGVGLTSNV SSTLKFIRSLKPLEQVFTIRRDDEIIGLVSFNKADYSNLKLEIGYWLSQS EQKQGIMTQCVQTMIDYAFNQLYFNRIQIKCAIGNTASKGIPQRLGFQLE GIERQGLLLLSGEFADFEIYSMLAQDWKNKQDKQIMDTYAVWGNPIAQSK SPAIHKIFAEQTGQNMKYIAMLGDEQHFERQLQEFFAQGAKGCNITAPFK ERAYRLADEYSERALTAGACNTLKKLENGKLYADNTDGAGLVSDLQRLGW LKPNQQILILGAGGATKGVLLPLLQAQQKILIANRTLAKAEELAEKFSPY GEIRAVELKTIPPYRYDVVINATSLGLTGKTADIQPEILQQAGAVYDMQY AKETDTPFIALAKSLGVNNVSDGFGMLVGQAAHSFRLWRGIMPDIEVLLN RGI >MS2315 aroE, AroE protein MINKDTQLCISLSGRPSNFGTRFHNYMYEKLGLNFVYKAFTTNDIEHAVK GVRALGIRGCAVSMPFKESCMPFLDEISPSAKAIESVNTIVNTDGYLKAY NTDYIAISKLIAKYQLKPTACVIIQGSGGMAKAVAAAFKNAGFDNLKIYA RNATTGGYLAKLYGYQYIDSLYGQNADILVNATPIGMKGGGKEESIISFP EAMIDQASVAFDVVAMPAETPLIKYARQQGKTVISGAEVAVLQAVEQFEL YTGQRPGDELIAEAASFARANS >MS1104 aroG, AroG protein MKDSIHNVHIIDEKVLITPAELKQKLPLPIALRTQIETHRREIADIVHKK DDRLLVVIGPCSVHDTKAAIDYAKRLKALSDELKDQLYIVMRVYFEKPRT TVGWKGLINDPRIDGTFNVEEGLHIGRKLLLDLAEMGLPLATEALDPMTP QYLADLFSWSAIGARTTESQTHRELASGLSMAVGFKNGTDGSLATAINAM KAASMGHSFIGINQQGQVNLLHTEGNPDGHVILRGGKKPNYQQEFVNQCE EELAKAGLETAIMIDCSHGNSNKDYKRQPSVAKDAVNQIVAGNKSIIGLM IESNINAGNQSSEQKVSEMKYGVSITDACIDWETTDNLLRKIAAALKNRA E >MS1184 aroG, AroG protein MISFKVRLNFSIFRMIYRELIMPTKNKNNIRVANDDTRIANIEQLLPPVA LLEKYPASNVAVKTVRNARNKAHQIIHGEDDRLLVIIGPCSIHDPKAALE YANRMAKMREKYKDTLEIIMRVYFEKPRTTVGWKGLINDPYLNDTYALND GLRIARKLLSDINDLGLPTAGEFLDMITPQYVADFMSWGAIGARTTESQV HRELASGLSCAVGFKNGTNGGVKIALDAIGAAEASHHFLSVTKFGHSAIV STKGNLDCHIILRGGDKGTNYDAENIAKVCANIEKSGRIGHVMIDFSHAN SSKQFKKQVEVCHDVAKQIAQGSNQIFGVMVESHLVEGRQDLVNGKAETY GQSITDACIGWDDTEIVLQELSDAVAARRKVNGK >MS1969 aroK, AroK protein MRLRILLLFIENFKKNNTMAEKRNIFLVGPMGAGKSTIGRQLAQLLNMEF IDSDNEIEQRAGADISWIFDIEGEDGFRKREERIINELTQKQGIVLSTGG GAILSKETRNHLSARGIVIYLQTTVDKQFERTQRDKKRPLLQGVEDVRKV LEDLAQVRNPLYEEVADITLPTDEQSAKLMASHIVELIDNFNS >MS1790 aroQ, AroQ protein MSQLSRILLLNGPNLNMLGAREPKHYGTLSLAAIEANVQALAAKNNIELE CFQANSEEKLIDKIHQSFKKVDFILINPAAFTHTSVALRDALLAVAIPFV EIHLSNIHKREPFRHHSYFSDVAEGVICGLGAKGYECAFEFAVEFLAKKA >MS1277 artI, ArtI protein MFKKLVLLATGMFAVATTTQAVAADSLLDRINNKGTITVGTEGTYAPFTY HDASGKLTGYDVEVTRAVADKLGVKVEFKETAWDSMMAGLKAGRFDIVAN QVALTTPERQATFDKSEPYSWSGAMMAVRADDDSIKTLDDIKDRKAAQSL TSNYGELAREKQAKIVPVDGLAQSLLVVQQKRADFTLNDSLAILDYLKKN PNSGLKSAWEAPAEEKLGSGLIVNKGNDEALAKISAAVIELQKDGTLKKL GEQFFGKDISVK >MS0704 artI, ArtI protein MKKLLLSTLLITTAFAVSAKDISFAMEPTYPPFEFTNEKGEIIGFDVDIA NALCKEMQANCTFKSQAFDALIQGLKQKRFDASISGMGITEARKKQVLFT EPYFSSSAAFIAKKGTDFTKVKTIGVQNGTTYQNYIIKEKPEYEVKAYAS FQDALLDIQNGRIDAIFGDIPVLVDMIKKTPELAFAGEKIDNKTYFGNGL GIAANKANQELIDEFNQALIKIRQNGEYQKIYDKWMTAK >MS1684 artI, ArtI protein MKIFKKTTALLAAALLATGLTACDNKDSGAASADNNAVSAIERIKKADKV RIGVFSDKPPFGYVDKDGKVQGFDVEIAKAVTKDLLGDENKAEFVLVEAA NRAEYLLSNKVDITMANFTVTPERKEVVNFAKPYMKVALGVVSKQDAPIT DVAQLADKTLLLNKGTTADAYFTKNFPKNKSLKFEQNTETFQALLDGRGD ALSHDNTLLFAWAKENPGYVVAIKNLGDLDYIAPAVKKEDTDLLQWLDGE IEKLAKDGTLNKAYQKTLQPIYGDEIKEADVLVEYQ >MS0900 artI, ArtI protein MKKATLATLIAAMFVTATAQAQTSPDTLTKVLETKELVVCSPGDYKPFSF DNNGKFEGVDNDLMDKLAQSMGAKVTIVKTTWKTLMDDFTANKCDIAVGG ISITLERQQKALFTEPYFINGKTPIVRCENVDKYQTVEQINRPEVRIIAN PGGSNEKYARNELSNANLTMNAENLTIFQQVIDKKVDVFVSEAAEAIVKA HEHKGVLCAVNPDKPLKPAQNGWLIHNGDYRFKSYVDQFLHLEKMSGNLD KTINKWLPRD >MS0220 artI, ArtI protein MQVVLKRRKQNNSDNIYLTINQGSYMKKLLLAAALAGTTFAAQARDITFA MEPSYPPFELTNAQGEIIGFDVDVAKAICKEIEANCNFKSQSFDALIPSL KAKRFDAAISAIDITETRAKQVLFSDAYYDSSASFIAVKGKADLNSAKNI GVQNGTTFQQYTVAEAKQYSPKAYTSLQDAILDLKNGRIDIIFGDTAVLA DMLAKEPELTFVGDKVTNKKYFGNGLGIAVNKSDKALVENLNKGLAAIKA NGEYQKIYDKWMTAK >MS1687 artM, ArtM protein MSIIMNWQYIWNALPRFVDATILTLELSFWAILFSVIIGVICAVVMSYRV RGLQTIVKAYIELSRNTPLLIQIFFLYFGLSKIGVKLEGFTCAVIGLAFL GGSYMAEAVRAGIESVSKGQVESALSIGLTPMQTFRYVVFPQAFAVATPA IGANCLFLMKETSVVSAIAIAELMFMAKEIIGMDYKTNEALFLLVVFYLI ILLPVSVFIGYLERRLRRAKYGA >MS0221 artM, ArtM protein MFFEYLPLMSTATLMTLGLAVCSLIAGLVLAIFFVVLETNKFVCVRKPTA IFVTLLRGLPEILVVLLIYFGSTELVEKLTGEYIEFSPFLCGVIALAIIF AAYASQTLRGAIQAIPLGQWESGAALGLSRGYTFVNIILPQVWRHALPGL SNQWLVLLKDTALVSLIGVDDLMRQASLVNTNTHQPFTWYSFAALLYLII TLVSQFFMRKLEMRFTRFERGVK >MS1686 artM, ArtM protein MGLTLLFEGNNLQRLLAGLGITAEIAFVSVFFACILGIVMGVVMTSRNIF VRGFCRLYLEIVRIIPLLAILFIVYFGVAKWFNVHLSGVTVCILVFIFWG TAEMGDLVRGALTSIEKHQTEAAYALGLSKIQTFIYILLPQSLKRVTPGA INLFTRMIKTSSLAMLIGVLEVIKVGQQIIETSLFRDPTSALWIYGVIFA LYFAICYPLSLFSKYLEKRWEN >MS1276 artM, ArtM protein MLNNLLLSIPFMTESRVDLVISAFWPMVEAAVLVSIPLAVSSFIIGMIIA VAVALVRVTPVNGVIHRLFLVIVKVYISIIRGTPMLVQISVVFYGLPALG IFIDPIPAAIIGFSLNIGAYASETVRAAISSVPKGQWEAGYTIGMSYMQT FRRIIAPQAFRVAVPPLSNTFIGLFKDTSLASVVTVTEMFRVAQQMANMS YDFLPIYIEAGLIYWCFCWVLFVIQAKVEKRMERYVAR >MS0222 artM, ArtM protein MFREYFMEIARGIPTSLLLTAVALAVAFVLALFLTFLLSMENKPVKRVIN IFLTLFTGTPLLVQFFLIYSGPGQFQWIVNSALWPLLSNAWFCAMFALAL NSAAYSTQLFHGAVKAIPKGQWESCAALGLSRLQTLKILIPYALKRALPS YSNEIILVFKGTSLASTITIMDIMGYARQLYGTEYDAITIYGIAGVIYLV ITGLMTLLLRKLEHKVLAFERLEVEKA >MS1101 asd, Asd protein MAILFLLFHDFLPPYFLLQLKICRIERLFIAERIMSTSLNIAIAANFDLC EKIASYLEESLLEVEKLSIVEIYPFSEEQGIRFNGKAVAQLPVDEVEWSD FNYLFFAGDLAHIPLLAKASEAGCLTIEMNGVCSALADVPVVIPGVNEEQ LRDLRQRNIVSLPDAQVTQFALSVRSLLNNASNAQIVVSSLLPASYYDAD GVHKLVGQTAKLLNGIPPDEEEMRFAFDVFPAKSLNLNAQLQRVFPQLEN VVFHQIHVPVFYGLAQMVTVKAEFEPEQDSILAEWSTNDLIRYHQDKVMT PVLNGEAENNEDEVHLQISALESVEGGIQYWLVADNQRFSQAFLAVKLLE SIYRQGY >MS0006 asd, Asd protein MKNVGFIGWRGMVGSVLMDRMQQEQDFANLNPVFFTTSQAGQKAPVFGGK EAGNLKDAFDIEELKKLDIIVTCQGGDYTNEVYPKLKATGWDGYWVDAAS ALRMEKDAIIVLDPVNQHVIADGLKNGIKTFVGGNCTVSLMLMALGGLFE RDLVEWISVATYQAASGAGAKNMRELVSQMGLLEKSVSEELANPASSILD IERKVTAEMRADSFPTDNFGAALAGSLIPWIDKLLPSGQTKEEWKGYAET NKILGLSDNPIPVDGLCVRIGALRCHSQAFTIKLKKDVPLEEIEQILASH NEWVKVIPNDKETTLRELTPAKVTGTLSVPVGRLRKLAMGPEYLAAFTVG DQLLWGAAEPVRRILKQLVA >MS0036 asnA, AsnA protein MKKSFILQQQEISFTKNTFTEKLAEHLGLVEVQGPILSQVGNGIQDNLSG TEKAVQVNVKMITDAAFEVVHSLAKWKRHTLARFGFAEGEGLFVHMKALR PDEDSLDQTHSVYVDQWDWEKVIPEGRRNLDYLKETVREIYAAILETEAA VDKKYGLKSFLPKEITFIHSEDLVKDYPGMTDKERENELCKKYGAVFLIG IGGVLPDGKPHDGRAPDYDDWTTTSEGEYKGLNGDILVWNPILNRAFEVS SMGIRVDETALRKQLSITGDEDRLKFDWHQDLINGRMPLSIGGGIGQSRL AMLLLQKRHIGEVQSSVWPKAVMEQYENIL >MS1984 aspA, AspA protein MAATRKEVDLLGEREVPADAYWGIHTLRAVENFNISKVTISDVPEFVKGM VMVKKATALANGELGAIPADIAKAIVAACDEILTTGKCLDQFPSDVYQGG AGTSVNMNTNEVVANLALEKIGHQKGEYNVINPMDHVNASQSTNDAYPTG FRIAVYNSILKLMDKIQYLHDGFDNKAKEFANILKMGRTQLQDAVPMTVG QEFKAFAVLLEEEVRNLKHAADLLLEVNLGATAIGTGLNTPAGYSELAVK RLAEVTGLPCVKASNLIEATSDCGSYVMVHGALKRTAVKLSKICNDLRLL SSGPRAGLNEINLPELQAGSSIMPAKVNPVVPEVVNQVCFKVMGNDTTVT FAAEAGQLQLNVMEPVIGQAMFESIDILANACVNLRDKCIDGITVNKEIC ENYVLNSIGIVTYLNPFIGHHNGDIVGKICAQTGRSVRDVVLEKGLLTEA ELDDILSVENLMNPTYKAKLSK >MS1248 avtA, AvtA protein MRYDKMSPFIVMDIVREAAKYPNAIHFEIGQPDLAPSEKVKKALQSAVEN NKFSYTESLGLLALREKICQYYDRTYHVKITPNRVLLTPGTSGAFLIAYA LTLAQDDKLGLTDPSYPCYKNFAYMMDIQPEFMPVDKHNCYQLEVGQLKG RNIKALQISSPANPTGNIYTAESLKSLNDYCMENHIDFISDELYHGLVYD QNAATALQFNPRAYVINGFSKYYCMPGMRLGWIIVPEDKVREAEIIAQNI FISAPTLSQYAALEAFEEEFLTATKQVFQQRRDFLYDALKDLFTIEFKPQ GAFYLWADVSKYTDDSYQFAKKMLHEIQVAATPGIDFGENGTKHYLRFAY TRDIEHLREGVERMKQWLKNK >MS1797 avtA, AvtA protein MELFPKSNKLEHVCYDIRGPVHKAALRLEEEGHKILKLNIGNPAPFGFEA PDEILIDVIRNLPTAQGYCDSKGLYSARKAIVQYYQSKGIHGATVNDVYI GNGASELITMAMQALLNDGDEVLVPMPDYPLWTAAVTLAGGKAVHYLCDE EQDWFPAIDDIKSKITSRTKAIVIINPNNPTGAVYSKELLLEIAEIARQN GLLIFSDEIYDKILYDGAVHHHIAGLAPDLLTITMNGLSKAYRICGFRQG WMILNGPKDKARGYIEGLDMIASMRLCANVPMQHAIQTALGGYQSINELI VPGGRLYEQRNRAYELLNQIPGVSCVKPMGALYMFPKIDIKKFNIYDDEK LVLDLLAQEKVLLVHGRGFNWHAPDHFRIVTLPYVHQIEEALNKFARFME NYHQ >MS0764 azlC, AzlC protein MSEIVSKTPVRDAAKAAFPYSAPMIAGFIFLGIAYGLYMKQLGFGVLFPV FMALLIYAGSVEFIVAAALVAPFSPLNVFLICLMVSGRQIFYGISMLEKY GGHLGKKRWYLITSLVDEAFSLNYMAKIPSYIDKGWYMFFVSLYLQIYWV MGAGIGNLFGAMLPFDLKGIEFAMTALFIIIFAENWLKEKSHESSLLGLG ITLTSLIIVGKEQFLIPSLLGIWIMLTLSRPKLSSKLKRIE >MS0765 azlD, AzlD protein MTLTEQIITVGMGILGVHICRVLPFLIFPPNRPIPEYIRYLGKVLPAAMF GMLVIYCYKNVDIFSGFHGFPEFLAGLITLALHLWKKNMFLSMAVGTGLY MFLVQAVFVN >MS0488 brnQ, BrnQ protein MNKNTFIVGFTLFAIFFGAGNLIFPPKLGLESGSEFWSAITGFILSGVGL PLLGIIVSAFYEGGYKTATTKISPWFSVIFLMAVYLSIGPFFAIPRTAAT SYEMAILPFIGKSSSLSMLIFTLFYFAISLWFALNPSKTVSRIGAILTPI LLFAILALVVKAFFILIDNDPSEVIFTLRESNNSFLFTGIIDGYLTMDTL ASIAYSVIVIAAIQSKGIKHGKELTKQTLLAGIVAAIALAAIYLAIGWIG NRVHISAETISLLQERNQDIGTYILNKITAQAFGNFGRSLLGVIVSLACL TTAIGLIVSVSEYFNEIYHKISYKTYVIIFTLIGFIIANQGLSAVISKSV PILLVLYPISMTIILLLSVNIFVKVPLVAQRLSIALTTLVSIGSVAGLEQ ANNLPLKDYSMEWIPFAVTGALLGCLIHVFYKSES >MS2237 carA, CarA protein MSEPAILVLADGSIFRGTSIGAAGHTIGEVVFNTSMTGYQEILTDPSYFK QIVTLTYPHIGNTGTNSEDLESNGVYAAGLIIRDLPMIHSNFRANQSLSD YLKDNNVVAIADIDTRRLTRLLRDKGAMAGCIMSGEVDEQKALELALSFG SMAGKDLAQEVTAQQSYRWTQGEWVLGKGYAEQQNASFNVVAYDFGVKHN ILRMLAERGCKLTVVPAKTSAEEVLALNPDGIFLSNGPGDPEPCDYAISA IQTLLATKKPIFGICLGHQLLGLASGGKTKKMAFGHHGANHPVQDLDTQK VMITSQNHGFEVDEHSLPANVRVTHRSLFDNSVQGIELTDQPAFSFQGHP EASPGPHDVAYLFDKFIDAMKQAKA >MS1491 carB, CarB protein MNILVTSAGQRVSLVQAFKKELSQLVSDGKVLTVDLNPELAPACYVADGH FQVPRVTDAGYIPTLLKICEENNVKLIIPTIDTELLILSEHLQRFKEKGI FISVSDTEFVRKCRDKRLTNQLFIEHNIAVPKQFEKGQFEYPVFVKPYNG SLSKGIFVAEKPEDISPEQLENPELMFMQYISPAEYDEYTVDCYFDKNSE LKSAVPRKRIFVRAGEINKGVTRKNAIVTQLSEKLSRLPGARGCLTIQVF YKESTAEILGIEINPRFGGGYPLSYLAGANYPRWLIQEYLFNQPIPAFDD WEADLLMLRYDAEVLAHHYEK >MS2236 carB, CarB protein MAMSTKPSGASKFVYKTANNFLKVLSRENNMPKRNDINTILIIGAGPIVI GQACEFDYSGAQACKALREEGYKVVLVNSNPATIMTDPNMADVTYIEPIH WQTVEKIIEKERPDAILPTMGGQTALNCALDLSKNGVLKKYGVELIGATE DAIDKAEDRGRFKEAMAKIGLNTPKSFVCHSFDEAWKAQEEVGFPTLIRP SFTMGGSGGGIAYNRDEFQAICERGFEASPTHELLIEQSVLGWKEYEMEV VRDKADNCIIVCSIENFDPMGVHTGDSITVAPAQTLTDKEYQIMRNASLA VLREIGVDTGGSNVQFAINPENGEMIVIEMNPRVSRSSALASKATGFPIA KVAAKLAVGYTLNELRNDITGGLIPASFEPSIDYVVTKVPRFAFEKFPKA DDRLTTQMKSVGEVMAMGRTFQESIQKALRGLETGICGFNLKTEDMEKLR HEISNPGPERLLYVADAFGIGWSIEDVHHYSKIDPWFLIQIQDLVLEELA LEKKTLADLNKDEIYRLKRKGFSDKRIAQLVKSDETSVRSLRNAFNIHPV YKRVDTCAGEFKSDTAYLYSTYEEECEAAPSDRKKVMILGGGPNRIGQGI EFDYCCVHAALALRESGFETIMVNCNPETVSTDFDTSDRLYFEPLTLEDV LEIIHVEKPWGVIVHYGGQTPLKLANALHANGVNIIGTSADSIDAAEDRE RFQKILHDLNLKQPANRTARNTQEAVGLANEVGYPLVVRPSYVLGGRAMQ IVYNDEELNRYMREAVSVSNDSPILLDHFLNNAIEVDVDCICDGEQVIIG GIMQHIEQAGIHSGDSACSLPPYSLSMEIQDEIRRQTAAMARALNVVGLM NVQFAVQNDVIYVLEVNPRASRTVPFVSKATGQPLAKIAARVMAGISLKE QGIQGEVVPQDFYAVKEAVFPFIKFPGVDTILGPEMRSTGEVMGVGATFA EAFLKAQIGAGERIPRTGKVFVSVDNNDKPRLLPIVKRLQEQGYGLCATF GTAKFLRENGIAVQTVNKVREGRPHIVDAIKNDEIALIINTAGGMAESVA DSASIRASALKQRVPLYTTIAGADAISLSVANLDIHDVYSVQGLHAGLTK >MS1273 csdB, CsdB protein MFDTTGFRSHFPYFQHPDRVIYLDNAATTLKPQSLIDATVKFYQSAGSVH RSQYDEEQTALYEQARSQVRQLINAESDKAIIWTSGTTQAINTVANGLIP YIQSDDEIIISEADHHANFVTWSMIAQKCGAKLRILPIQDNWLIDENALL EALNKRTKVVVLNFVSNVTGTEQPVEHLIRLIRKHSSALVSVDAAQAISH VKIDLRKLDADFLSFSAHKIYGPNGLGVLSGKLTALELLQPLIYGGKMVD RVSKQQISFAELPYRLEAGTPNIAGVIGFNAVLSWLNQWDFEQAEHHAVQ LAEQTKVRLKNYEFCQLFNSPKPSSVISFVFKNIAGSDLATLLAEQNIAL RTGVHCAQPYLSRLGQHSTLRLSFAPYNTQQEVDAFFTALDKSLALLEE >MS2212 cysE, CysE protein MLREVWNNIRNEAKELVEHEPVLASFFHSTILKHKNLGGALSYILANKLA TSTMPAITLREIIEETYQDDPRIIDSAACDIHAVRQRDPAVGLWATPLLY LKGFHAIQSYRITHHLWQQNRKSLAIYLQNQISVAFDVDIHPAARVGCGI MFDHATGIVVGETAVIENDVSILQGVTLGGTGKESGDRHPKIREGVMIGA GAKILGNIEVGKYAKIGANSVVLQPVPEYATAAGVPAKIISKDRSAKPAF DMNQYFIDDAEALNI >MS1252 cysH, CysH protein MTTQNQIENGHLDWLEAESIYIIREVVAECSHPALLFSGGKDSVVLLALA RKAFQLEGRDLVLPFPLVHIDTGHNYPEVIQFRDEQVKKLNARLVVGHVE DSIAKGTVVLRKETDSRNAAQAVTLLETIEANGFDALMGGARRDEEKARA KERIFSFRDEFGQWDPKAQRPELWSLYNGKLHKGENMRVFPISNWTELDI WQYIEREKLELPPIYYAHQREVVERNGLLVPVTPITPKQPGDESKVVSVR FRTVGDISCTCPVASTAATPAEIIKETAVTEISERSATRMDDRTSEAAME QRKKQGYF >MS1253 cysH, CysH protein MIIKPNFWQIPQPTATDFAALAEKEQLLAQRIHEIANRHQHAKFASSLAV EDMVITDVIAKSKAKITVFTLETGRLNPETLALADTVKKTYPDLDFRLFR PNPIAAEKYDREKGRFAFYESVELRRECCFIRKIEPLNRALADADAWLTG QRREQSVTRTELEFHEWDQSRGIDKYNPIFDWHEMDVWAYILKYDIPYNE LYKQGYPSIGCEPCTKQVKAGEDIRAGRWWWENKDSKECGLHK >MS1770 cysK, CysK protein MTIFADNSYSIGNTPLVRLHNFGHNGNLVVKIESRNPSFSVKCRIGANMV WQAEKDGVLTKDKEIVDATSGNTGIALAYVAAARGYKITLTMPETMSLER KRLLRGLGVNLVLTEGAKGMKGAIAKAEEIVASDPNRYIMLKQFENPANP AIHQQTTGVEIWQATEGKVDVVVAGVGTGGTITGISRAIKLDQGKQITSV AVEPAESPVITQILAGEEIKPGPHKIQGIGAGFIPKNLDLSLIDRVETVD SDTAIKTARRLMAEEGILAGISSGAAVAAADRLAKLPEFQDKLIVAILPS ASERYLSTALFEGIEG >MS1769 cysZ, CysZ protein MLFPTALCMLALIRFLIYIFLGSFMKKEKEIKSGFHYFVMGWHLIGQQGL RRFVVMPVLLNIILLSGLFWLFVSKISDMIEGVISFIPDWLSWLSGILLA LSILMILLVFYFIFNTLSGFIAAPFNGLLAEKAEAMLTGESGENMTTMEF IKDTPRMLAREWQKLLYSLPKYIGLFLLSFIPLIGQSLIPVLTFLFTAWM MAIQYCDYPFDNHKISFPTMKFKLNENRIQNVTFGTFVTLCTFVPFINFV IIPVAVCGATAMWVDTYRKQLYLDKNLQKSTAVSTASTEKPGSDIARHSN NIRNR >MS2290 dadA, DadA protein MLKFSYQEHIKTYYYDTRNQDFTQPTLTGGQSADVCVVGAGFGGLSAALE LAERGKSVIVLEGARIGFGASGRNGGQAINGFEDGMDAYIDDMGLEKARK LWEMSLEAIDIIEQRIAKYNIQCDWRKGYATLALNHRRMDDLVTIEQTSR EIFAYDYMQLWNKAELKQYLGSDIYVGGLYDGNSGHLHPLNYCLGLAKAC LDLGVRIFEQSPVIDLDVGKSKVIAETAEGSVTAENVVLATNAYVTSLPK RIQRGTARKILPIDSFIIATEPLDQETANAVINNGMSVCDNNLLLDYYRL SADNRLLFGSDSSSNKDMVQVMRNNMLHVFPQLENVKIDYGWAGPIDMTI NAKPCLGRIASNIFYAHGYSGHGVALTGLAGRLIAEAIEGDDERFAIFES LKSPSVYGGRIVKNLATKIGVKYYKWLDKYR >MS1592 dadA, DadA protein MLKVTTAHIHFNRDNTPVSEQFDDIYFSTADGLEESRYVFQEGNNLWRRW LQFGENHFVIAETGFGTGLNFLAVTALFREFRTQYPDSPLKRLFFISFEK YPMSCADLRSAHQAYPQFNSLAEQLRQNWLQPIVGCYRFHFEETVLDLWF GDIADNLPQLGDYMVNKIDAWFLDGFAPSKNPEMWNENLYKQMFRYTKPA GTFATFTAASAVKKGLESAGFSLQKRKGFGKKRECLQGFKPLNAEQNPAV HTPWLLSRSATLSENTDIAIIGGGISSLFSAISLLQRGANVTLYCEDEQP ALNASGNKQGAFYPQLSDDDIHNIRFYIHAFAYGQQQLRWAIQQGIEFEH EFCGVALCAYDEKSAVKLAKISDYDWDTSLYQPLNQQELSEKAGLPLPCG GGFIPQGAWLAPRQFVQNGFAFAQKCGLKLKTFEKITALSQSEKGWILHN DKNEQFHHETVIIANGHKLKQFTQTARIPVYSVRGQVSQIPTSSQLLKLK SVLCYDGYLTPADQAKQFHCIGASHVRDCEDRDFSLQEQQENQAKIQLNI AEDWTKEVNTADNLARTGIRCAVRDRIPLVGNVPDFERQADEYRNIFNLR RRKQFIPQAAVFENLYLVGALGSRGLTSAPLLGEILASMIYGEPIPLSED ILHCLNPNRSWMRKLLKGTPVK >MS0282 dapA, DapA protein MKKINLEKTMSIQGIIPVMLTPFMENNEIDYDGLRKLTDWYIDNGSDALF AACQSSEILFLSLEERVKITKTVMDQVQGRIPVVASGHISDSFEQQVEEL TAIYNTGVDAVILITNRLDPNNEGTTVLKSNFEKLLAALPKDIVLGLYEC PVPYRRLLTDGEISYFAGFENMVVLKDVSCNLETVKRRIQLTKNSNLKIV NANAAIAFEAMKAGSEGFSGVFNNIHPDLYAYLYKNKNSSDPMVQELANF LAICGAAESFGYPNFAKLMHTKIGTFKHYNSRVIKDDIKVKYWAVEELLD HIMQGSERYRNKLNLR >MS0067 dapA, DapA protein MFKPQGIIAPVLTALDDNEKFNPEVYKNYINYLIKAGIHGIFPLGTNGEF YGFNEAEKLEIIKTAIEAADGCVPVYAGTGCVTTKETVEFSKKVVDLGVD VLSIVSPYYIAVTQDDLYRHYATIAENVTAPILMYNIPARTGNNIDYKTI KKLAQYENIIGVKDSSGNFDNTLKYIENTDSRLSIMAGSDSLILWTLLAG GTGAISGCSNVFPELMVSIYEYWKQGDFEKANEAQKKIRDFRNVMQMGNP NSVVKRAAQLRGLGTGPAKEPSNCANNPVIDKALQDVFKLYD >MS0265 dapA, DapA protein MSSTRPLFYGSIVALITPMDGHGEVNYDELKKLVEYHIASGTHAIVSVGT TGESATLSIDENVKTIQKTVEFAAGRIPVIAGTGANATSEAITMTKLLNN SGVAGCLSVVPYYNKPTQEGMYQHFKAIAECTDLPQILYNVPGRTGSDMK PETVGRLSKIENIVAIKEATGDVSRVKQIKELAGEDFIFLSGDDATGLES IKLGGQGVISVTNNLAAADMAKMCELALAGNFDEAEAINQRLMGLHHDLF IEGNPIPVKWAAYKLGLIKEPVLRLPLTTLSEAAQPKVLEALKQAGLI >MS0971 dapB, DapB protein MTLKLAIVGAGGRMGRQLIQAVQAAEGVELGAAFERKGSSLIGADAGELA GLGELGIKVAEDLAAEKDKFDIIIDFTRPEGSLEHIKFCVANNKKLILGT TGFDDAGKQAIGKAAEKTAIVFASNYSVGVNLVFKLLEKAAKVMGDYSDI EIIEAHHRHKVDAPSGTALSMGEHIAKTLGRDLKVNGVFSREGITGERKR TDIGFSTIRAADVVGEHTVWFADIGERVEISHKASSRMTFANGAVRAAKW LANKQIGLFDMTDVLDLNNL >MS1177 dapD, DapD protein MSNLQSIIEAAFERRAEITPKTVDAQTKAAIEEVIAGLDCGKYRVAEKID GDWVTHQWLKKAVLLSFRINDNQLIDGAETKYYDKVALKFADYTEERFQQ EGFRVVPSATVRKGAYIAKNTVLMPSYVNIGAFVDEGTMVDTWVTVGSCA QIGKNVHLSGGVGIGGVLEPLQANPTIIGDNCFIGARSEIVEGVIVEDGC VISMGVFIGQSTKIYDRETGEVHYGRVPAGSVVVSGSLPSKDGSHSLYCA VIVKKVDAKTLGKVGLNELLRTIEE >MS1784 dapF, DapF protein MQFSKMHGLGNDFVVVDAVTQNVYFPEEVIKKLADRHRGIGFDQMLIVEP PYDPELDFHYRIFNADGSEVAQCGNGARCFARFVTLKGLTDKKDIAVSTT NGKMILTVQDDGMIRVNMGEPVWEPAKIPFIANKFEKNYILRTDIQTVLC GAVSMGNPHCTLVVDDVETANVTELGPLLENHERFPERVNVGFMQVINPN HIKLRVYERGAGETQACGSGACAAAAIGIMQGLLENKVQVDLPGGSLWIE WQGEGHPLYMTGDATHVYDGVIKL >MS1199 dcp, Dcp protein MSNPLLENTPLPQFSKIKPEHIQPAIEQLIQDCRITTENLLKQPQLSWDN FCQPLSEVNDRLSKAWSPVSHLNSVKNSNELRDAYQACLPMLSEYGTWVG QHQGLYNAYVQLKNSPEFAGYSPAQKKAVENSLRDFKLSGISLAPEQQKR YGEIVSRLSELSSQFSNNVLDATMGWDKVITDEEQLKGLPESALQAAKQS AQNKGVEGYRFTLEFPSYIPVMTYCENRELREEMYRAFVTRASDQGPNAG KWDNSAIMEEILTLRVELAKLLGFNSYTELSLATKMAETPAQVLSFLDDL AMRSKPQGEKELADLYAFCEKEFAITELEPWDISYYSEKEKQALYAINDE ELRPYFPEQRVISGLFELIKRIFNIRAVERQGVDCWHKDVRFFDLIDETD EVRGSFYLDLYAREHKRGGAWMDDCIGRKIKADGALQKPVAYLTCNFNAP VGDKPALFTHDEVTTLFHEFGHGIHHMLTKVDIGDVSGINGVPWDAVELP SQFMENWCWEEEALAFISGHYQTGEPLPKEKLTQLLKAKNFHAAMFVLRQ LEFGIFDFRLHDNYKPGKANQILDTLNAVKDQVSVVKAVDWARTPHSFGH IFSGGYAAGYYSYLWAEVLSADAFSRFEEEGIFNAVTGKSYLDEILTKGG SEEPMVLFERFRGRKPTLDALLRHKGIAN >MS0465 dppB, DppB protein MQHYFIRRLIMMIPLMLLISFVAFSLMNLVPSDPAETMLRINNITVTDEA VKEARQALGLDKPFLLRYALWLYALLQGDLGKSFLSNQNVWDEITQAFPA TFYLAVTAFAVIFLLSLTLSLLCMLMLNSLWDKIIRGILFFFTALPNYWL ALLFIWLFSVRLNWLPSNGLEQKSGIILPALTLSLGYIGVYVRLLRGAML NQLQQPYVFYARTRGLSEKQILFKHILQNSLHTSYIAMGMSIPKLLAGSV IIENIFALPGLGRLCIQAIFGRDYPVIQAYILLMAMLFLVGNFVIDWLQH RRDPRIKRGY >MS1367 dppB, DppB protein MFKFILKRILMVIPTFLAITLVTFALVHFIPGDPVEIRMGERGVDPIVHA QMMEQMGLNDPLPEQYLNYIKGVVQGDFGRSFRNNEPVLKEFFTLFPATV ELAFFALLWSLIAGIFLGVIAAVKKDSWISHTVTALSLTGYSMPIFWWGL ILILYVSNFLGLPAGGRLPDEYWIDFDTGFMLIDTWNSGEPGAFVAAIKS LILPAVVLGTIPLAVVTRMTRSSMLEVLGEDYIRTAKAKGLSTTRIVIVH ALRNALIPVITVVGLIVGQLLSGAVLTENIFSWPGIGKWIIDAINARDYP VLQGSVLIISTIIIVVNLLVDVIYGVVNPRIRHN >MS0464 dppC, DppC protein MSGFIKQLRSDIFAQCCLFILTMIGLAGIFAPWICTFDPATIDMQAKLLP VSAQHWLGTDHLGRDIFSRLIWGVRSTVFYGLFAMLLTMMLGILIGMTAA IGGKKTDEFIMRLCDVLLSFPGEIMILALVGMLGPGIEHILVAVILVKWA WYARMIRGTVMQYTHKNYVHYSQAIGVSPWRIIRRHLLPVATAELIILAS ADMGAVILLISGLSFLGLGVQPPTPEWGAMLSDAKNIMLLYPQQMLPAGL AITLTVTAFNGFGDFLRDVLDPDNPLKGTNNE >MS1366 dppC, DppC protein MTTEITSSTPQTPLQEFWYYFRQNKGAVIGLTFIAAVFFICICAPFVSPY DPIVQHRDALLLPPAWMENGSLSYFLGTDDIGRDILSRIIYGARLSVFIG LLIVILSCIFGVILGLLAGYYGGLLDVIVMRLMDIMMAIPSLLLTIALVT ILGPSLFNAAIAIAIVSVPSYVRLTRASVLNEKNRDYVVASRVAGAGVLR LMFIVILPNCLAPLIVQMTMGISNAILELAALGFLGIGAQPPTPELGTML AEARSFMQAASWLVTIPGVAILLLVLAFNLMGDGLRDALDPKLKQ >MS1365 dppD, DppD protein MSLLNVNQLSVHFGDGKAPFKAVDRISYSVNKGEVLGIVGESGSGKSVSS LAIMGLIDYPGRVSAEALSFDGVDLLSLNEKQKRKIVGADVSMIFQDPMT SLNPCYTVGYQIMEALKAHQGGSKKERRERTVELLKLVGIPAPESRLDVY PHQLSGGMSQRVMIAMAIACKPRLLIADEPTTALDVTIQAQIVDLLLTLQ KQENMALILITHDLALVAEAAHRIIVMYAGQVVEEGRAEEIFKRPKHPYT QALLRSLPEFAEGKSRLQSLQGVVPGKYDRPQGCLLNPRCPYATEHCRRV EPDLIQLGEGKVKCHTPLNAQGEPSNV >MS0463 dppD, DppD protein MNKPIIRFDNFSIENPDSDRPLIAPLNLTLPPYRTLALVGESGSGKTLLG RSILGLLPEQLNTTGNIYFQDKKIISVTGTPTVDDKQKTNEIATLEIRGK AVSFIMQNAINAFDPLFSLQDQFCETLQKHTALSYRQALIKAQQSVSKVK LSSALLKRLPSQLSGGQLQRMMLALTFALEPELVIADEPTSALDSLTQFE LLPLFKQMAKERSMIFITHDLALVQELADDIAVLKRGEIVEFRAKSILFS HPQHPYTQYLLAMRAKLNQPFARLVRKKQ >MS0827 gadB, GadB protein MGRRPPYGTNMADISKHRQSLFCSDPQSIADYETAMSNAVKAVSNWLKNE KMYTGGSIRELRKTIGSFNPSKQGVGVNQSLDHLVDIFLNPSLKVHHPHS LAHLHCPTMVASQIAEVLINATNQSMDSWDQSPAGSIMEEQLIDWLRQKA GYGQGTSGVFTSGGTQSNLMGILLARDWAVANHWKNEDGSEWSVQENGLP AEALKKLKVVCSENAHFSVQKNMAMMGMGFQSVVTVPTNANAQMDVAELE KTLATLKAEGKIVACIVATAGTTDAGAIDDLKAIRKLADAYQAWLHVDAA WGGALLLSKDFRHLLDGIELTDSITLDFHKHFFQSISCGAFLLRDERNYR FIDYKADYLNSEYDEEHGVPNLVSKSLQTTRRFDALKLWFTLEALGEDLY ASMIDHGVKLTKQVEEYIRTTEGLEMLVPTQFAAVLFRVAPEGYPAEFID ALNQNVADELFARGEANIGVTKVGNKQSLKMTTLSPIATLENVKALLALV LAEAERIKDAIANGTYVPPID >MS0196 gdhA, GdhA protein MQTLTILLIRGKLMSSTVSSLEDFLSLVAQRDGNQPEFLQAVREVFTSIW PFLEANPQYRSQALLERLVEPERAFQFRVAWTDDKGQVQVNRAFRVQFSS AIGPYKGGMRFHPSVNLSILKFLGFEQIFKNALTTLPMGGGKGGSDFDPK GKSDAEVMRFCQALVAELYRHIGPDTDVPAGDIGVGGREVGYLAGYMKKL SNQAACVFTGRGLSFGGSLIRPEATGYGLVYFAQAMLAEKGDSFQGKTVS VSGSGNVAQYAIEKALQLGAKVVTCSDSAGYVYDEAGFTTEKLAALLDIK NVKRGRVKDYAEQFGLQYFPGERPWGVKVDIALPCATQNELELTDAQKLI ANGVQLVAEGANMPTTIEATEALQAAGVLFAPGKAANAGGVATSGLEMAQ SSQRLFWSAEEVDQKLHNIMLDIHANCKKYGTDANGNINYVAGANIAGFV KVADAMLAQGVY >MS0262 glnA, GlnA protein MANPNAIQRVAKLIEDNDVKFVLLRFTDIKGKEHGVSLPVNLVADELEDF FEEGKMFDGSSVEGWKAINKADMLLMPMPETAVIDPFAQITTLSIRCSVY EPNTMQSYDRDPRSIATRAENYLKSTGIADQALFGPEPEFFLFDDVRFST EMNNVSYKIDDIEAAWNTNRKFEDGNNAYRPLKKGGYCAVAPIDNAHDIR SEMCLILEEMGLVIEAHHHEVATAGQNEIASKFNTLTLKADETQIYKYVV QNVALEYGKTACFMAKPFAGDNGSGMHCNMSLSKDGKNVFQGDKYAGLSE TALYYIGGIIKHAKALNAFTNPTTNSYKRLVPGFEAPVLLAYSASNRSAS IRIPAVTSPKAIRVEARFPDPLANPYLAFAALLMAGIDGIINKIHPGDAM DKNLYDLPPEELKEIPAVCSSLEEALDSLQADHEFLIQGGVFSKEFIDAF VAIKRKEVERVNMTPHPVEFEMYYA >MS0426 glnK, GlnK protein MKKIEAIIKPFKLDDVRESLSDIGITGMTVTEVRGFGRQKGHTELYRGAE YMVDFLPKVKLEIIIPDELLDQCIEAIMETAQTGKIGDGKIFVYNVERVI RIRTGEENEDAL >MS0219 glnQ, GlnQ protein MTISVKNLNFFYGSSQALFDINLTAEDGDTVVLLGPSGAGKSTLIRTFNL LEVPKSGDLTVADNHFDLSQNTDAKKMRQLRQDVGMVFQQYNLWPHFTVM ENLIEAPMKILGLTESEAQKEAMELLTRLRLEEHAHRFPLQLSGGQQQRV AIARALMMKPKVLLFDEPTAALDPEITAQIVSIIQELQETGITQVIVTHE VGVARKVATKVVYMEKGRIVETGDASCFEAPQTEQFRQYLSHD >MS1685 glnQ, GlnQ protein MALLEIKELVKNYGEVTALNGVNLSVEKGEVVVILGPSGCGKSTFLRCIN GLEEIKSGSLKLADVGELGKDISWVKARQHIGMVFQSYELFAHMTVIDNI LLGPLKVQKRARAEVEKQADALLKRVGLYERKNAYPRELSGGQKQRIAIV RSLCMNPDIMLFDEVTAALDPEMVREVLDVVLGLAKDGMTMIIVTHEMQF ARQVADRIVFMDNGNIIEESEPEQFFTSPKTERAKTFLNILDYYI >MS1275 glnQ, GlnQ protein MIKVKNIHKAFGENVILRGIDLDITKGEVVVILGPSGSGKTTFLRCLNAL EMPEQGTIEFDNAAPLKIDFAAKPSKKDILALRRKAGMVFQNYNLFPHKT ALENVMEGPVRVQSKKVAQAREEALALLTKVGLADKADLYPFQLSGGQQQ RVGIARALALQPELMLFDEPTSALDPELVQDVLDTMKSLAKEGWTMVVVT HEIKFALDVADLVIVMDDGVIVEQGSPKQLFDNPQHERTKAFLQRLRSH >MS0611 gloA, GloA protein MISLFTGFHHIAIIVSDYEKSKYFYTQILGAEVIEETYRASRHSYKLDLK FADGSQIELFSFPSSPSRLTMPEACGLRHLAFKVKDIEEAVQYLKTQQIE CEDIRIDELTGKKFTFFKDPDNLPLELYEFNSFKGG >MS0703 gloA, GloA protein MMRILHTMLRVGDLDRSVKFYQDVLGMRLLRTSENPEYKYSLAFLGYDDE DKTAVIELTYNWGVTEYELGSAFGHIAIGVDDIHATCEAVKAHGGKVTRE PGPVKGGSTVIAFVEDPDGYKIEFIENKNAKAALGN >MS0597 gloA, GloA protein MKLEHVAIYVQDLEKAKAFFMKYFNAQPNEKYHNPRTNLMTYFLTFSGGA RLEIMTRPEIIELDKNIFRTGLIHLSMQVGGEEKVRELTERLRTDGYQVI SEPRKTGDGYYESCVLDGEGNQIEIVA >MS1994 glpB, GlpB protein MNFDVVIIGAGIAGLTCGLTLQEKGVRCAIINNGQAALDFSSGSMDLLSR LPNGSTVDSFAQSYAALAQQSPNHPYVILGKDVVLDKIQQFETLAKSLNL SLVGSSDKNHKRVTALGGLRGTWLSPNSVPTVSLEGKFPHDNIVLLGIEG YHDFQPQLLADNLKQNPQFAHCEITTNFLHIPELDHLRQNSREFRSVNIA QVLEYKLSFNNLVDEIKQAVGNAKAAFLPACFGLDDQSFFESLKQATGIE LYELPTLPPSLLGIRQHRQLRHRFEKLGGVMFNGDRALRSEFEGNKVARI FTQLHLENAVTAKYFVLASGGFFSNGLVSEFEEIYEPLFRSDIVKTERFN ATDRFSWISKRFADPQPYQSAGVVINAECQVQKDGNNVENLFAIGAVIGG YNGIELGCGSGVAVTTALKVADNIIAKESSN >MS0731 gltD, GltD protein MAKFFLAPADNYDVKIGELVDKFVNKVRSFPPGTCPLVVQYASLRSSMSQ TCGKCVPCRDGIPHLSFLLRDILAGEGDDSTMRQIRELAEMIRDGSDCAI GYQPAIEILDSIEEFKEEYESHIHNKSCQKVIGQRIPCINMCPAHVDIPG YIAHIGDGNYAEAINLIRKDNPLPTACGLVCEHPCEERCRRRLIDDAINI RGLKKYAVDQVAADVVKVPQALPDTGKKVAVIGGGPAGLTCAYFLAQMGH RVTIYERQKALGGMLRYGIPNYRFPKDRLDQDLNAILSAGRIEVKYGVMV GDDIAIEDIYNSHDAMFVGIGAQKGKTLRIKGSEANNVFSAVEMLDDIGN GKIPDYTDKVVVVIGGGNVAMDAARSAVRCKAKDVRIVYRRRQDDMTALH AEIEAAIMEGIELITLAAPVAIEKDEQGNCTGLTVQPQMTGPYDHGGRPS PVAVKKPPFTIGCDVILIAVGQDIISLPFEEFGMPANRGIFQADLTTAVP DMDGVFVGGDCATGPATAIKAIAAGKVAAHNIDEYLGYHHEFPCETKAPP PKENVRIQVGRANTTERPAYIRKCDFEHVENPYTYEEAMQEAERCLRCDH FGCGVLQGGRDL >MS0030 gltS, GltS protein MTFDTYETLALACLVLLLGYFLVKRVKLLSNFNIPEPVVGGFIVAIVLTV VHEIWGLSFSFDSNLQRTMMLVFFSSIGLSANFARLIKGGKPLVMFLVVA AMLIAIQDTVGIFGSMALGLDPAYGLIAGSVTLTGGHGTGAAWAETLTND FGISGAMELAMACATFGLVFGGIIGGPVARFLLTRLHKEEVPEDENVDDV QEVFEKPVYRRKVNSRAIIETISMMAVCLLVGQFLDELAKGTAFQLPTFV WCLFTGVILRNTLTLVFKFTAPDQTIDVLGTVGLSIFLAIALMSLKLWEL AGLALPVFVILTLQVVVMATFAILVTYRVMGSDYDAVVLSAGHCGFGLGA TPTAVANMQAVTAHFGHSHKAFLIVPMVGAFFIDLLNASLLKFFVEVAAY FH >MS1295 glyA, GlyA protein MLQNHSIAEFDPVLWDAIQNENRRQEEHIELIASENYVTKAVMEAQGSQL TNKYAEGYPGKRYYGGCEYVDIVEQLAIDRAKELFGADYANVQPHSGSQA NAAVYGALLNAGDTILGMDLAHGGHLTHGAKVSFSGKIYNSVLYGITAEG LIDYEDVRVKALESKPKMIVAGFSAYSQVVDWAKMREIADEVGAYLFVDM AHVAGLIAAGLYPNPLPHAHVVTTTTHKTLAGPRGGLILSACGDEEIYKK LNSSVFPANQGGPLMHVIAAKAVCFKEALQPEFKAYQAQVLKNAKAMVEV FKQRGFEVVSKGTENHLFLVSFVKQGLTGKAADAALGEANITVNKNSVPN DPQKPFITSGIRVGSPSITRRGFNEADASTLAGWMCDVLESIGKDNYDQV IAETRAKVLEICKRLPVYGD >MS0953 gntT, GntT protein MLDTVLNTLAVAKVIDGSQLWVETLRLLGKTPIALLITLIVSIVLLKNQR SYEQIEKICDSSLGPICAIVLVTGAGGMFGGVLRASGIGEVLASTLGHTG MPVIVAAFIISSALRVAQGSATVALTTTAALISPMVAADPSLSQMDLCFI VISIASGATVLAHVNDSGFWLVSRFLEIDTKTMLKTWTVQETLIGIVGFI IAYVGSIIF >MS0686 gntT, GntT protein MSGISLIISFIIAIIIMIWMISKLKVHPFLSLMTISLALALVAGIELNKI PGMIGDGFSSTFKSIGIVIIFGAIIGTILEKTGAALKLADMVVKLVGQKH PELAMLIMGAIVGIPVFCDSGFVVLNPIREALYKKIAANPVATAVALSGG LYASHVFIPPTPGPIAAAGALGLESNLLLVIIMGVVVSIPVLTAVYFFAG YIGKRVTLDEEAQADAAIVKNYEQLLKQYGILPGKFLSLAPILMPIVFMA LGSIAKIAEIGGNTGIIIQFLGTPIIALAIGVIFSVFLLLQTKKITEFND LTNETLKIVGPILFITAAGGVLGKVITEAGFVDYIKQNAHIISTTGIFFP FIISAVLKTAQGSSTVAIITTASIMGMYSAGDSLMSVLGLTSEIAAALCV MAIAAGAMCVSHANDSYFWVVTNFGKMTAQQGYKTQTLMTFIMGIVGIIT VYILSLLLL >MS1977 gntT, GntT protein MSLKIAAILLALLYQEYCMSNEMLILIGIVSVIALLLIMIKGKVHPFVAL SLVSIAVALSSGIPMGKVVPTLISGMGGTLGSVALIVGLGAMLGKIIEKS NGADVLASWLLDKFGEKRAPFALAMTGFIFGIPVFVDVGFIVLIPIIFSV ARRIGGNMLVYALPIGLSMLTVHVLMPPHPGVVAGAQVLNADIGLVLGLG FIAALPAVLIGQTFIPLFTKNNFVAIPASSDLLEYQKQVSKNVDGLPKFA TVLAMIVFPLLLIMSGTVSATVLPKESIVREFFSMVGASPFALLLAVCVS SYILGIRRGWRKEQLEEILNSALAPIAGIILITGAGGMFGKVLNESGVGN ALADVLSSTGLPILALSFILAAMLRAAQGSATVAVITTATILAPAVTSAG YSDIQTALVTAAIGAGSMTLSHVNDSLFWVWTKFFGITITQGLRTWSILS TIYGSLAFLIVTLMWMFA >MS0954 gntT, GntT protein MLIFIMIASVALLLLLIMKFKVHAFVALTIVSLLTALATGIPINKILPTL LNGFGNTLASVALLVGLGAMIGRLLEITGGAKVLADTLINKFGEQKAPLA LGIASLLFGFPIFFDAGLVVMLPIIFSVAKQFGGSLIRYAFPAAGAFAVM HAFSVPHPGPVAAGDLLGANIGLLTIIGLICAIPTWYIATYLFGLHLGKK YHLDLPKAFLNAMPINETAVLTPPSFKKVILILLLPLGINYAGYGVKYFS RCKSN >MS0335 gntT, GntT protein MIMSITVAFIIGVAVLLFLALKLKVSAFLSLLATALTIGILSGMGTTEII KDIVAGFSKSVGSIGLVIIFGTMLGNYLEQSRAAHKMALDAVRLVGTKNS SIAMSISGYLISIPVFSDVGFLILSPLIKAISKKSKIPLAALAVALSAGL LATHVYVPPTPGPLAAAGLLGIDIGRAIIWGAFAAVVMTLFGWMYAHFYL MKKSPDYYTFVETVVEEKEVDETNLPGSLASLMPLLLPIVLILLNTTCAA IFPKDSPVLSVTKFIGDSNIALVIGALTAIALLGKRIGKEKVLKIMDSSL KDAGSIIFITAAGGALGQILKTSGAGDSLAQAVVSSGLPFILIPFVISAI LKIVQGSGVVAVITSATLAAPIATQLGIDPILIFLASGAGARAYCHVNDS YFWVYTNCCGFDMKTGLKTLSNASIFMSLGGLLATFIASLII >MS0688 gntT, GntT protein METAASMSQMLIGLAIGIALLLILAMKTRIHVFVALILASLTTGLIGGLP FAEVISSVTKGFGSTLGSTGIIIGLGVMMGAILEKSGAAEQMAFSIIKLI GKAKEEWALALTGYVVAIPVFADSGLIILTPLARSLSRMTGKSVIGLGLA MATGLQLAHVFIPPTPGPLAVAGILDIDMGMMIIWGMILTVPTLVMSTLY AKWLGKKIYQIPNEDGTDFERKEFKEEYIKSIENVEQIYKDKNLPGAGLS FSPIVIPLILILGNTTVNFLKIENGFADLLKIVGHPIIALIIGLLIALYG LGRRLSKAETNKAIEDGVKSTGMILFITGAGGALGYVVRDAGIGNALGEA VLTVGIPGILIPFVIAALMRIALGSATVALITAATLAAPLVPQLGLNPTL VAMSTCAGAVSFSYFNDSGFWVFNGLYGLKEVKDQFMAKTMVSFIGAFSC LALVLIFNIFM >MS0671 gsp, Gsp protein MSEISPNIPTHDAFGSLLGYAPGGIAIYSSDYETADKNEYPDDAAFRSYL GREYMGYKWQCVEFARRYLYLNHGMVFTDVGMAYEIFSLRFLRQVVNDAL VPLQAYANGSKKSPEPGALLIWQEGGEFQETGHVAIITEVFNDKIRIAEQ NVIHYRLPSGQQWTRELPMSVTEQGYILHDTFDDTEILGWMIQTDDSTYS LPQPTAAPESLEIHAEHIENKGQFDGKWLNESDPFEKLYVTAMNGHQVSR TDQYRYFTISETAKHELIRATNELHLMYLHATNKVLNDDNLLKYFNIPKL LWPRLRLSWENRRYQTVSGRLDFCLDERGLKVYEYNADSASCHAEAGAIL GRWAKVAGLDNGEDPGAHLRNALADCWKHRDNTPLVHIMQDNDSEEDYHS MFMQSALLQAGCRTKIIHGTEGLHWDKRGRLLDDEDNQILSVWKTWAWET MLEQLREDATGREVAPPIRTGYPEDKVRLIDVLLRPEVLVYEPLWTAIPS NKAILPVLWSLFPNHRYLLESGFELTQNLIKNGYAKKPIAGRRGDNVTLF ADQHSRLDVTHGRFGKQEHIYQQLWCLPKVEEQYVQICTFTVGGHYGGSC LRSDPSRIIVGDSDMQPLRVLNDKDFLAK >MS1883 hisA, HisA protein MKKSIIIPALDLIDGNVVRLHQGDYAKQTTYSDNPIEQFASYLAQGAEQL HLVDLTGAKDPAKRQTALIGKIIAATHCKIQVGGGIRTEKDVADLLAVGA NRVVIGSTAVKERAMVKEWFNKYGAEKFVLALDVNIDASGQKIIAISGWQ EASGVSLEELIEDFQSVGLQHVLCTDISRDGTLAGSNVDLYKEICAKYPA VNFQSSGGIGSLEDIKALKGTGVAGVIVGRALLEGKFNVAEAIECWQNG >MS0435 hisB, HisB protein MNKERMVKKAIFLDRDGTINIDHGYVHKIDDFHFIEGSIEALEELKNMGY LLVLVTNQSGIARGYFSEDEFLQLTEWMDWSLADRNVDLDGIYYCPHHPE GLGEYRQDCDCRKPKPGMLLQAIEELNIDPAQSFMVGDKVEDLKAAVSAN VKYKVLVKTGKTVTQAGEQLADYVLDSIADLPRIIKRLKK >MS1890 hisB, HisB protein MTQQPTLFIDRDGTLIDEPKTDFQIDSLEKLKFERNVIPALLKLKNRYRF VMVSNQDGLGTDSFPQEDFDKPHNAMLAVFRSQGIEFDDILICPHKPEDN CDCRKPKIKLLKKYIDKKLFDPADSFVIGDRPTDVQLAENLGIRALQYHP ENLDWDMIAEKLLREPVADPKGLGQPRHAVVARKTKETDIKVEVWLDEAG VNQINTGIGFFDHMLDQIATHGGFRMNVSCKGDLHIDDHHTIEDVALALG AALKEAIGNKRGIQRFGFVLPMDECKAECALDLSGRPYFKFKAKFNRDKV GDFSTEMTEHFFQSIAYTLLATLHLSVKGDNAHHQIEALFKAFGRTLRQA IKIEGNEMPSSKGVL >MS1574 hisC, HisC protein MVVRPNFKTTIPNNHKSAVENMTFLQQANTGVQALSPYQAGKPIEELERE LGISNIIKLASNENPFGFPESAKKAIQNQLDNLTRYPDSNGFSLKAAIAE KFNLQPEQITLGNGSNDLIELIAHTFATEGDEIIFSQYAFIVYPLITKAI NAKAREIPAKNWGHDLEAFLAAINEKTKLIFIANPNNPTGNFLTEAEIDS FLAKVPPHIVVALDEAYTEFTAKEERVNSLALLKKYPNLVVSRSLSKAYG LAGLRIGFAVSNPEIAGLFNRVRQPFNVNSLALAAAEAVLNDDDFVEKAA ENNRRELKRYEEFCQKYGLQYIPSKGNFITIDFQQPAAPVYDALLHEGVI VRPIAGYGMPNHLRISIGLPEENQRLFDALIKILNLK >MS1891 hisC, HisC protein MSISQLSRKNVQALTPYQSARRLGGNGDVWLNANEYPTSPDFNLSERIFN RYPEPQPEAVIKGYAAYADVKPENVIVTRGGDESIELLIKGFCEPEDKVL YCPPTYGMYAVSAETLGIATKTVPLTEDFQLDLPEIEKNLAGVKVIFVCS PNNPTGNVLNQADLIRLLDITAGSAIVVVDEAYIEFSPETSMIKQLGNYP HLAIIRTLSKAFALAGLRCGFTLANPELIGVLQKVIAPYPLPVPVSDIAA QALQPQGVAQMKMRVADVLANRAWLIGELKQIPSVVKIFATEANYVLVKF QDGEKVFNALWEKGIILRDQHKAFGLKNCIRISIGTRAELEKTVVALKLA >MS1892 hisD, HisD protein MQTLIWKDLTEQEKKQALTRPAISAAGNIKDAVDAIRENVVANGDKALFE LSEKFDRVKLNSLEVSEQQIEEAAQRLPEELKQAIQNAKKNIEAFHLAQV PVEADVETQSGVRCQVLTRPINRVGLYIPGGSAPLFSTVLMLAIPAKIAG CKKIVLCSPPPIADAILYAANLCGVETIYQVGGAQAVVAMAFGTETVAKV DKIFGPGNAFVTEAKRQVSQAVNGAAIDMQAGPSEVLVLADENADPDFVA SDLLSQAEHGADSQVILVTPSERLALETELAVERQLTTLPRSEIAQKALA HSRIFIAENLQQCVEISNEYAPEHLVVQVQNARDLLSNIDNAGSIFLGAY SPESMGDYASGTNHVLPTYGYTRTSSSLGLADFSKRMTVQELSPQGFKDL AKTVEVMAAAERLDAHKQAVSIRLAKIK >MS1882 hisF, HisF protein MLAKRIIPCLDVRNGQVVKGVQFRNHEIIGDIVPLAARYAEEGADELVFY DITASSDGRTVDKSWVERVAEVIDIPFCVAGGIKTIADAEQIFTFGADKI SINSPALADPDLISRLADRFGVQAIVVGIDSWFEQETGKYWVNQYTGDES RTRQTNWQLLDWVKEVQKRGAGEIVLNMMNQDGVRNGYDLTQLKLVRDVC KVPLIASGGAGEMVHFRDAFIEANVDGALAASVFHKQIINIGELKEYLAR EGVEVRR >MS1893 hisG, HisG protein MSTNKRLRIAMQKKGRLSDESQELLKQCGVKINLQGQKLIAYAENLPIDI LRVRDDDIPGLVFDGVVDLGIIGENVLEEEELTRTAAGDKVEYKMLRRLE FGGCRLSLAVDSDVEFDGPESLSDCRIATSYPQLLKRYMAEQGVPFKSIL LNGSVEVAPRAGLADAICDLVSSGATLEANGLKEVEVIYRSKACLIQRKE PLSEEKQALVDKILTRIQGVQQADESKYIMLHAPKDKLEEITALLPGVEN PTILPLAHDDTKVAVHVVSQENLFWETMEQLKEKGASSVLVLPIEKMLA >MS1885 hisH, HisH protein MIIIDTGCANLSSVKFAFDRLNIKAEISRDIATIKSADKLLLPGVGTAMA AMKILQDRNLIETIQNATQPMLGICLGMQLMTEYSSEGNVPTLSLMSGHT DLIPNTGLPLPHMGWNKVRYEQDHPLFAGIEQDSHFYFVHSYAVLPNEHT IATSDYGVPFSAALGCKNFYGVQFHPERSGKNGAQLLKNFVENL >MS1881 hisI, HisI protein MQNKINWQKVDNLLPVIIQHFQTCEVLMLGYMNQEALAKTCDEKVVTFFS RTKQRLWTKGETSGNFLNAVDMSLDCDNDTLLILADPIGPTCHTGEESCF HQFATQSEGDWTWFAKLERVLAERKFADPESSYTATLYAKGTKKIAQKVG EEGVETALAALSKDKGEIVSETADLIYHLTVMLHEQNLEWGDVIDKLKER HQGIGLHPEGSNK >MS2218 ilvA, IlvA protein MVNNLSNAPTGAEYLRAILISKVYEAAKVTPLQLMPKLSERLGNRIYVKR EDHQPVHSFKLRGAYAMISGLTQAQKEAGVITASAGNHAQGVALSAKNAG IRALIVMPQNTPSIKVDAVRGHGGEVLLHGANFDEAKAKAIELSQTEQMT FIPPFDHPAVIAGQGSIGMELLQQNGHINRIFVPVGGGGLLAGVAVLIKQ LMPEIKVIGVEAKDSACLYYALKAGRPVDLERVGLFADGVAVKRIGDETF RICQQYVDDVILVDGDEICAAMKDMFENVRAVPEPSGALSLAGLKKYAKQ HNLQGETLVNLLSGANLNFHTLRYVSERCEIGEKHEALFAVTIPEQRGSF LKFCQILGQNAVTEFNYRYADEKQACIFVGVRITGEQEKQVIIQQLKQGG YDVQDLSDDDIAKTHIRYMVGGRSSSDLNERLYSFEFPEQKGALLKFLET LGTTDANISLFHYRGHGADYGDVLAGFQINDADLPAFKQHLEKLGYAYQD VTDSPSYRYFLG >MS1319 ilvB, IlvB protein MKMKKLSGAEMVVQSLRDQGVKYLFGYPGGSVLDIYDAIHTLGGIEHVLV RHEQAAVHMADGYARSTGEVGCVLVTSGPGSTNAVTGILTAYTDSVPLVI ITGQVRSNLIGTDAFQECDTIGLTRPVVKHSFMVKHAEDIPETIKKAFYI ASSGRPGPVVIDIPKDVVNPANKYTYEYPKEVSLRSYNPNVQGHKGQIKK ALKALLVAKKPVLFIGGGVIIGNSSEKLTQFAQLLNLPVTSSLMGLGGYP GTDKQFLGMLGMHGTYQANMAMHNADLILGIGVRFDDRTTNNVEKYCPHA KVIHVDIDPTSISKNIAADIPIVGSVDNVLTEFLSLLEDDNLSKSQSDLT EWWKQIDEWKAKKCLEFDRTSQAIKPQAVVEAIYRLTKGEAYIASDVGQH QMFAALHYPFDKPRHWINSGGAGTMGFGLPAAIGTKFAHPDSRVVCITGD GSIQMNIQELSTAKQYGTPIVIVSLNNRFLGMVKQWQDLIYSGRHSQVYM NSLPDFAKLAEAYGHVGIQINTADELEEKLTQAFAVKDKLVFVDVLVDAT ENVYPMQITGGGMNEMLLGKPAEK >MS2223 ilvB, IlvB protein MNGANLVTECLKAHNVDTVFGYPGGAIMPVYDALYDCGINHLLCRNEQGA AMAAIGYARSTGKTGVCIATSGPGATNLITGLGDALMDSIPLVAITGQVA APLIGTDAFQEADVLGLSLACTKHSFIVQNIEELPEIFAKAFKIAQSGRP GPVLIDIPKDVQFAETLLQPIVYSVEKPTALSAKSLEKAVELLKNAKRPV AYIGGGVGMAKAVPALHEFLTATRIPTICTLKGLGAVPADNPYYMGMIGM HGTKAANYATQEADLLLVLGARFDDRVTGKLSSFATEAKVIHADIDVAEI NKLRRADVALCGDLEQALKALSFALDIEPWRADVQRLKRDFDWDYGENEG EGDINPLFLLNRVSRLKAENAIVVTDVGQHQMWAAQHMSFGKPENFITSA GFGTMGFGLPVAIGAQKARPRDQVILVTGDGSIMMNIQELGSIKRAKTPI KILLLDNQRLGMVRQWQSLFFHGRHSSTILDDNPDFVTLASAFGIRGERI EKAGEVNEALDRFFASQEAYLLHVCVHEDENVWPLVPPGACNVEMIEEMS >MS0045 ilvC, IlvC protein MSNYFNTLNLRQKLDQLGRCRFMERSEFADGCNFLKGKKIVIVGCGAQGL NQGLNMRDSGLDISYALRPEAITEKRASFQRATENGFKVGTYQELIPTAD LVVNLTPDKQHSKVVADVMPLMKQGASFGYSHGFNIVEVGEQIREDITVV MVAPKCPGTEVREEYKRGFGVPTLIAVHPANDPKGEGMAIAKAWASATGG DRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSIVCYDKLVADGKDPAYA GKLIQYGWETITEALKQGGITLMMDRLSNSAKIRAFELAEEIKEHLNFLY LKHMDDIISGEFSATMMADWANGDKDLFAWREATGKTAFENAPKADGIKI SEQEYFDNGVVMVAMVKAGVEMAFDAMVASGIYEESAYYESLHELPLIAN TIARKRLYEMNVVISDTAEYGNYLFSNVATPILAKEIVSQLKRGDLGEPT PAAEIDNVYLRDINDTIRNHPVELIGQELRGYMTDMKRISSQG >MS2219 ilvD, IlvD protein MEIFMPKLRSATSTQGRNMAGARSLWRATGMKEGDFGKPIIAVVNSFTQF VPGHVHLHDIGQMVVKQIEAAGGVAKEFNTIAVDDGIAMGHGGMLYSLPS RDLIADSVEYMVNAHCADAMVCISNCDKITPGMLMAAMRLNIPTIFVSGG PMEAGKTKLSDQLIKLDLIDAMIQSADKNVSDSDVDAIERSACPTCGSCS GMFTANSMNCLTEALGLSLPGNGSCLATHADRKQLFLDAATQIVELCKRH YEQDDYSVLPRSIATKAAFENAMSLDIAMGGSTNTVLHLLAVAQEAEVDF TMADIDRLSRIVPCLSKVAPNTNKYHMEDVHRAGGVMAILGELDRANLLH HDTKTVLGLTFAEQLAKYDIKLTRDEAVKTFYRSGPAGIRTTEAFSQDCR WETLDDDRENGCIRDKAHAYSQDGGLAMLSGNIALDGCIVKTAGVDESIL KFTGEAIVFESQEDAVDGILGGKVKAGHVVVIRYEGPKGGPGMQEMLYPT SYLKSMGLGKACALLTDGRFSGGTSGLSIGHCSPEAASGGTIGLVRNGDI IAIDIPNRSIQLQVSDEELATRRAEQDVKGWKPANRAREVSFALKVFGHF ATSADKGAVRDKTKL >MS0896 ilvE, IlvE protein MKDLDWKNLGFGYTKTDYRYIAYWKNGEWQKGELTKDNTLHISEGSPALH YGQQCFEGLKAYRTKDGSIQLFRPDQNALRMQQSADRLLMPRVPVDMFID ACKQVVKANEEWVGPYGSGATLYLRPFLIGVGDNVGVHPAKEYIFSIFVC PVGAYFKGGLAPSKFLISTHFDRAAPHGTGAAKVGGNYAASLYPGKYAKE HGFADCIYLDPATHTKIEEVGSANFFGITKDNKFITPISPSILPSITKYS LLYLAKERLGLEVEEGDVYVKDLDQFAEAGACGTAAVITPISGVQIDDKY HVFYSETEIGPITQKLYDELTGIQFGDKPAPEGWIVKVE >MS2192 ilvE, IlvE protein MCRIGIFMDYPLFETVAVERGEILNLDYHQTRYEQALHQYYGRKVLPFNL QEILQKSTALLTLKRSEPLIRCRIDYNDQDYRLQCFAYQRKVFRSFQPVI CDHIDYGLKFSDRRIFAELLRQKGKHDEIIIIKQGLVTDCTIGNLLFRKN QQWFTPEAPLLNGTQRAKLLAEKRIQTLNIKRQDIAQFDEIRLINAMNPF SESL >MS1318 ilvH, IlvH protein MRRILSVLLENESGALSRVVALFSQRAFNIESLTVAPTDDPTLSRMTIEA SGDEAILEQIEKQLHKLVDVFKVINLSDCEHVEREVMLLKLRATGSTRDE IKRLTDIFRGQIVDVTTKSYTIQLAGTKDKLNAFVSAVKEETTIIEIVRS GLISLSRGEKNCL >MS0507 kamA, KamA protein MRILTQNNPVREENWLEILANSISDPEVLLKTLSLPIDKFEKDIHARKLF AMRVPLPFVRKMELGNAQDPLFLQAMSSADEFLTADGFSKDPLEEQQVVA PNILHKYKNRLLLMVKGGCAINCRYCFRRHFPYADNQGNKANWQKALDYI SANPQIEEVIFSGGDPLMAKDHELDWLIKKLEKIPHLQRLRIHTRLPVVI PQRITGAFCKILTESRLNTVLVTHINHGNEIDEQLTRALNKLKNAGVVLL NQSVLLKNINDNAQTLKNLSDKLFRAGILPYYLHLLDKVEGASHFYVPDQ RAVEIYRELQSLTSGYLVPKLAREIAHEPNKTLYGG >MS0599 leuA, LeuA protein MNVHNKRIRTMANNRVIIFDTTLRDGEQALKASLTVKEKLQIALALERLG VDVMEVGFPVSSAGDFESVQTIAVHVKNSVVCGLSRAVNKDIDAAAEALK VAERFRIHTFIATSALHVEAKLKRSFDDVVEMAVAAVKRARRYTDDVEFS CEDAGRTGIDNICRVVEAAINAGATTVNIPDTVGFCLPTEYGNIIHQVMN RVPNIDKAVVSVHCHNDLGMATANSLTAVLNGARQIECTINGIGERAGNT ALEEVVMSIKTRQDLFGVDTRINTQEIHRVSQMVSQICNMPIQPNKAIVG ENAFSHSSGIHQDGMLKNKNTYEIMSPETIGLKKEKLNLTARSGRAAVKG HMADMGYTEQDYDLDKLYEAFLKLADKKGQVFDYDLEALAFIDMQQGDED RLKLDVITSQTISTLPASAFVQVELDGKRINKTSNGGNGPVDAVYNAIMQ IVGMDLKMSHYNLTAKGEGAEALGQVDIVVEYQGRKFHGVGLATDIVESS ALALVHAINAIYRSQKVADLKKDLKHIHTV >MS1105 leuB, LeuB protein MVIMTHKIAVIPGDGIGIEVINEGVKVLNCVSQLDPKIQFEFTHFPWGCE FYSKTGRMMDDDGIERLSKFDGIFLGAVGYPGVPDHISLWGLLLRIRKSF DQYVNVRPVKLLKGAPCPLKEKSPKDINMIFIRENSEGEYAGSGSWLYRD KPNEVVIQDGVFSRVGCERIIRYAFELARTEKKSLTSISKGNALNYSMVF WDQIFQQLSQEYPDVETHSYLVDAAAMLMITKPERFEIVVTSNLFGDILT DLGAAIAGGMGLAAGANLNPEGNFPSMFEPIHGSAPDIAGKQLANPLATV WSASQLLEFFGYKEWAARLIDAIEYLLVEQKTLTPDLGGTAKTADVGDAV VAYLQKHFA >MS0598 leuB, LeuB protein MSTYNVAVLPGDGIGPEVMAEAIKVLDKVQAKFGFKLNFTQYLVGGAAID AKGEPLPAETLQGCDNADAILFGSVGGPKWTHLPPDQQPERGALLPLRKH FKLFCNLRPATLYKGLEKFCPLRADIAAKGFDMVVVRELTGGIYFGQPKG RDGEGSDTRAFDTEVYYKYEIERIARAAFDAAMKRRKQVTSVDKANVLQS SILWRETVAEIAKEYPEVQVENMYIDNATMQLIKAPESFDVLLCSNIFGD IISDEAAMITGSMGMLPSASLNEEGFGLYEPAGGSAPDIAGKGIANPIAQ ILSAAMMLRYSFNLNEAATAIENAVQKVLADGHRTGDLADNSTPVSTAEM GTLIANAI >MS0333 leuC, LeuC protein MENAMSKTLYDKHIDSHTIKELDNEGNVLLYIDRTILNEYTSPQAFSGLR EENRDVWNKKSILLNVDHVNPTRPVRDANMTDPGGTLQVNYFRENSKLFD IELFDVTDPRQGIEHVVAHEQGLALPGMVIAAGDSHTTTYGAFGAFGFGI GTSEIEHLLATQTLVYKKLKNMRVTLTGKLPFGTTAKDVIMALVAKIGAD GATNYAIEFCGEVIDELSVEGRMTICNMAVECGARGAFMAPDEKVYEYIK GTPRAPKGEMWDLAIAEWRKLKSDNDAVFDKEIHMDCSDLEPFVTWGISP DQADVISGEVPDPNLLPEGQKRKDYQAALEYMGLEPGMKFEEIKISHAFI GSCTNGRIEDLREVAKVLKGRKIAQGVRGMIIPGSTQVRARAEAEGLAKI FIDAGFEWRQSGCSMCLAMNEDVLSPGDRCASGTNRNFAGRQGAGSRTHL MSPAMVAAAAVAGHLVDVRKFVEGD >MS0596 leuC, LeuC protein MAKTLYQKLFDAHVVYEAEGETPILYINRHLIHEVTSPQAFDGLRVAGRQ VRQVSKTFGTMDHSISTQVRDVNKLEGQAKIQVLELDKNCKATGISLFDM NTKEQGIVHVMGPEQGLTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVE HVLATQTLKQARAKSMKVEVRGKVNPGITAKDIVLAIIGKTTMAGGTGHV VEFCGEAIRDLSMEGRMTVCNMAIEFGAKAGLVAPDETTFEYLKGRPHAP KGKDWDDAVAYWKTLKSDEDAQFDTVVVLEAKDIAPQVTWGTNPGQVIGI DQLVPNPAEMTDPVTKASAEKALAYIGLEPNTDLKNVPVDQVFIGSCTNS RIEDLRAAAAVMKGRKKADNVKRVLVVPGSGLVKEQAEKEGLDKIFLAAG AEWRNPGCSMCLGMNDDRLGEWERCASTSNRNFEGRQGRNGRTHLVSPAM AAAAAVFGKFVDIRNVSLN >MS0334 leuD, LeuD protein MDKFTLITAKAAPMMAANTDTDVIMPKQFLKGIDRKGLDRGVFFDLRFNL DGTPNEKFILNQADWQGSQFLVVGPNFGCGSSREHAVWGLKQLGIRALIG TSFAGIFNDNCLRNGVLTICVSDQEIEQIATTVSNPATNTISVDLEGQKV LTENGEIAFDVDPLKKEMLIKGLDAVGFTLSMKDDILAFEQSYFKANPWL KL >MS0595 leuD, LeuD protein MTRIDKMAGLKQHSGLVVPLDAANVDTDAIIPKQFLQAITRVGFGKHLFH EWRYLDAEETQPNPEFVLNFPQYQGASILLARKNLGCGSSREHAPWALAD YGFKVMIAPSFADIFYNNSLNNHMLPIKLSEQEVEEIFQWVWANPGKKID VDLEAKTVTVGEKVYHFDLDEFRRHCLLEGLDNIGLTLQHEDAIAAYESK IPAFLR >MS2084 lysA, LysA protein MDFFQYKNNKLYAEDLLVSELAEQFGTPLYIYSRATLERHWKAFDSALGD HPHLVCFAVKSNPNIAILQVMAKLGAGFDIVSQGELERVIAAGGDPHKVV FSGVAKNEKEIARALELDIRCFNVESLAELQRINEVAGKSGKIAPISLRV NPDVDAHTHPYISTGLKENKFGVSVDEAREVYRLASRLPNIKVTGMDCHI GSQLTEIQPFLDATDRLILLLEQLREDGIELEHLDLGGGLGVTYSDETPP HPSEYATALLNKLKQYTNLEIIMEPGRAISANSGILVTKVEYLKSNETHN FAIVDAGMNDMIRPALYQAYMNIIEADRTLNRESKIYDVVGPICETSDFL GKQRRLAIAPGDYLVQRSAGAYGASMSSTYNSRPLTAEVMVDGSQAHLIR RRAELTELWALESLLP >MS1613 lysC, LysC protein MANLSVAKFGGTSVANYAAMTACAKIVIADPNTRVVVLSASAGVTNLLVA LANGCEATQRAKLLAEVRQIQENILNELKDAGTVRLEIEELLTNIEYLAE AASLATSSALTDELISHGEMMSTKIFVQVLRELNAQATWVDVRTVVATNS NFGKAAPDDEQTQKNSDNVLKPLIDRGELVITQGFIGRDPNGKTTTLGRG GSDYSAALIAEVLNAKDVLIWTDVAGIYSTDPRIVPNAQRIDTMSFAEAA EMATFGAKVLHPATLLPAVRSNIPVYVGSSKAPEQGGTWVTRDPQPRPTF RAIALRRDQTLLTLSSLNMLHAQGFLANVFNILAKHKISVDTITTSEVSV ALTLDKTGSASSGAELLSSDLLNELSEVCTVKVDTGLALVALIGNDLHLS AGIAKRIFGTIEEYNIRMISYGASTNNICTLVHSAHADDVVRALHKELFE >MS1703 lysC, LysC protein MRVLKFGGTSLANPERFLQAARLIEKAHLEEQAAAVLSAPAKITNHLVAL SEKASLNQPTETNFNEALDIFYNIINGLHEKNNNFDLKGTSQLIESEFNQ LAELLEQIRQAGKVEDAVKATIDCRGEKLSIAMMKAWFEACGYEVTVINP VEKLLAYGNYLESSVDIEESAKRVDVASIPKNNVVLMAGFTAGNEKGELV LLGRNGSDYSAACLAACLNASACEIWTDVDGVFTCDPRLVPDARLLPSLS YREAMELSYFGAKVIHPRTIGPLVRSNIPCLIKNTGNPTAPGSIIDGNEP QSGELQVKGITNLDNVAMFNVSGPGMQGMVGMAARVFSTMSKAGVSVILI TQSSSEYSISFCVPSKLAAKAKDALNTEFAKELLDKDLEPVEVIEDLSII SVVGDGMKQAKGIAARFFSALSQANISIVAIAQGSSERSISAVVAQNKAI EAVKSTHQALFNNKKSVDMFLVGVGGVGGELIEQIKQQKEYLAKKDIEIR VCALANSNKMLLNENGLSLDNWKEDLSNATQPSDFDVLLSFIKLHHVVNP VFVDCTTAESVSGLYARALSEGFHVVTPNKKANTREMAYYNLVRENARKN QRKFLYDTNVGAGLPVIENLQNLLAAGDEVERFNGILSGSLSFIFGKLEE GLTLSQATALAREKGFTEPDPRDDLSGQDVARKLLILARESGLELELSDV EVESVLPKGFSEGKSAVEFMEILPQLDAEFAARVEKAGAQNKVLRYVGQI NDGKCKVSIVEVDADDPLYKVKNGENALAFYTRYYQPIPLLLRGYGAGNA VTAAGIFADILRTLRN >MS0924 mET2, MET2 protein MDSSMSAQQVTLFTEQPLDLIFGGRLGQIDVAYQTYGTLNEDKSNAVLIC HALTGDAEPYLSPVENQAGGWWQSFMGEGLALDTSRYFFICSNVLGGCKG TTGPASINPKTNKPYGSQFPKVTVQDIVRLQKALISHLNIPHLHAVIGGS FGGMQATQWAIYYPDFVDKVVNLCSSLTFSAEAIGFNHVMRQAIINDPNF NNGDYYEGEPPENGLSIARMLGMLTYRTDLQLAKAFGRATKNEGHYWGDY FQVESYLSYQGQKFLGRFDANSYLHLLRALDIYDPSIGFDNIKEALSRIK AHYTLVAVTNDQLFKLTDLHKSKTLLEQAGVPLDYYEFPSDYGHDAFLVD YDTFEPKIRSGLE >MS0553 mHT1, MHT1 protein MPITILDGGMSRELMRRNAPFRQPEWSACALYEEPSAVQAVHEDFIAHGA EVITTDSYAVVPFHIGEQRFHTDGKTLADLAGRLAKSAVKNSGVLTTKIA GSLPPMFGSYRADLIQPERFAEIAQPLIDGLSPYVDIWLCETQSAIIEPV SIKALLPKDDRPFWVSFTLTDDELTCEPQLRSGETVKSAVEKMVDLGVDA ILFNCCQPEVIGEALAVTTATLTALNATHIQTGAYANAFAPQPKDATAND GLDEVRKDLDPPAYLAWAKKWTAQGASIIGGCCGIGVEYIETLAKNLK >MS0812 malK, MalK protein MENIVQSKPIIELRSLKKSYNENTIIDNFNLTINNGEFLTILGPSGCGKT TVLRLIAGFEEANGGQIILDGEDVTDLPAEHRPVNTVFQSYALFPHMTIF ENVAFGLRMQKVPNEEIKPRVLEALRMVQLEEMADRKPTQLSGGQQQRIA IARAVVNKPKVLLLDESLSALDYKLRKQMQNELKALQRKLGITFIFVTHD QEEALTMSDRIIVLRKGNIEQDGSPREIYEEPSNLFVAKFIGEINIFDAQ VLNRVDEKRVRANVEGRVCDIYTDLAVKEGQKLKVLLRPEDVQLEELDEN EQSSAIIGHIRERNYKGMTLESTVELEHNNKLVLVSEFFNEDDPNIDHSL DQRVGVTWIEKWEVVLNDENDNA >MS0584 malK, MalK protein MEQTDMAKLEIKNITKKFGDFYAANNISFTAEEGEFVTLLGPSGCGKTSL LKLIAGFHIADEGEILIGGKNVNEIPPEKRNTAMCFQSYALFPHLNVSHN ICYGLKQRKIDINEQKQRLDLAIKQMDLEIHRLKLPNELSGGQQQRVALA RAMVTRPDVILFDEPLSNLDAKLRESVRFEIKQLSKQYNLTSIYVTHDQA EALSMSDKIIVLNKGKIEQIGSPQEIYHHPINRFVADFIGIANITEAHVK EMENNLYEVNSIYGNFTVYSEIKPQSDHIYICFRPEDIEIVPASENKENM LTVDVTHTAFMGNITEIQALIRKDDKEQKLRLQLTKFPQLTENYQLSFCV PRDAIKFLESVK >MS1587 malK, MalK protein MLSHHKNKNGGAYPTLYRQYNIMTNQNDNFLVLKNINKTFGKSVVIDDLD LVIKRGTMVTLLGPSGCGKTTILRLVAGLENPTSGQIFIDGEDVTKSSIQ NRDICIVFQSYALFPHMSIGDNVGYGLRMQNIAKEERKQRIREALELVDL AGFEDRFVDQISGGQQQRVALARALVLKPKVLLFDEPLSNLDANLRRSMR EKIRELQQSLSITSLYVTHDQTEAFAVSDEVIVMNKGKIVQKAPAKELYQ QPNSLFLANFMGESSIFNGQLQGNQVTLNGYQFTLPNAQQFNLPNGDCLV GIRPEAVTLKETGEPSQQCSIKTAVYMGNHWEIVADWAGQDLLINANPEV FNPEQKQAYVHLSSHGVFLLKKE >MS1520 metC, MetC protein MSNKYSLATTLVHAGRSKRVSQGSVNPVVQRASSLVFDSIADKRQATVNR AKQALFYGRRGTLTHFALQDLMCEMEGGAGCYLYPCGAAAVTNAILAFVQ SGDNILMTGAAYEPTQDFCNKILSKMNVSTTYYDPMDGEKIAELVQPNTK VLFLESPSSLTFEVPDVPNIVKAVRKINPEIVIMIDNTWAGGILFKALEH DIDISIQAGTKYLVGHSDVMIGTAVSNARCWDQLRENSYLMGQMVDADTA YTTARGIRSLAVRFKQHTESSIKVAQWLAEQPEVKAVFHPALPSCPGHEF FKRDFTGSAGLFSFELKEQLSREKLERFMDNFKLFSMAYSWGGFESLILY NQPADIAAIRPNIKRKLTGTLIRIHIGFEDVNELIEDLKAGFERLK >MS1627 metC, MetC protein MTQNYSIETILAQAGNKSDARTGAVSTPIFLSTAYGHRGIGESTGFDYTR TKNPTRLVLEETIAKLENGDQGFAFSSGMAAIQVLMTLFTAPDEWIVSSD VYGGTYRLLDFAYKNNNSVKPVYVNTASVEAIETAITPNTKAIFVETPSN PLMEECNVTEIAKIAKKYNLLLIVDNTFLTPVFSRPLDLGADIVIHSATK YLAGHNDTLAGLVVAKGQALCERIFYIQNGAGAVLSPFDSWLTIRGLKTL ALRMERHQANAAAIAEFLKAQPQVKDVLYPNKGGMLSFRLQDENWVNPFL KAINLITFAESLGGTESFITYPTTQTHMDIPAEERIARGVTNDLLRFSVG LENVEDIKADLAQAFAQFK >MS0941 metE, MetE protein MTKLFPNATVRTSAPYRFDIVGSFLRSDAIKSARAACACGDISCADLTRA EDAEIAKLVERQKSVGLHAVTDGEFRRTFWHLDFLAGLDGVEEVDAEKFS VQFKHHNVRPKTLKIVAKVDFSENHPFVEHFRSVNELAKGTEVKFTIPSP SMLHLITNVRATNYQPIPRYENNNQQLLDDIADAYIKAMNIFYKLGCRNL QLDDTSWGEFCAEDKRAAYQERGFDLDQIAKDYVYMLNKIVDAKPAQDIA ITMHICRGNFRSTWFSAGGYEPVAEILFGSCRVDGFFLEYDSDRAGDFKP LRFIKNQQVVLGLVTSKDGTLENREDIINRIKEAAQYVDINQLCLSPQCG FASTEEGNILTEEQQWAKLNFIREIAEEVWGK >MS0787 metF, MetF protein MSYAKDIDTLNQHVADLNGQINVSFEFFPPKNEKMEETLWSSIHRLKTLN PKFVSVTYGANSGERERTHSVVKNIKQKTGLEAAPHLTGIDATPEQLKEI AQDYWNNGIRRIVALRGDIPAGYTKTPFYASDLVALLRSVADFDISVAAY PEVHPEAKSAQADLINLKRKIDAGANHVITQFFFDIDNYLRFRDRCASIG IDAEIVPGILPVTNFKQLQRMAALTNVKIPNWLAVNYEGLDEDQTTRNLV AASVALDMVRVLSREGVKDFHFYTLNRSELTYAICHILGVRPK >MS1009 metH, MetH protein MHNKIDILKASLAQRILILDGAMGTMIQQYKLSEQQFRGERFKQSSVDLR GNNDLLSLTQPLLIQAIHEKYLQAGADIIETNTFSSTSIAQADYDLQAIA YELNFAGAKLARIAADKYSSADKPRFVAGVLGPTNRTASISPDVNDPGFR NITFMQLAEAYGEATRGLIAGGADIIMLETIFDTLNAKAAVFAIEQVFEE LGVRLPVMISGTITDASGRTLSGQTTEAFYNSLRHAKPLSFGLNCALGPK ELRQYVEQLSKISECYVSAHPNAGLPNAFGGYDLGAEEMAAQLKEWAESG FLNIVGGCCGTTPEHIKAFAEAMQGVKPRPLPQIKTAMRLSGLEPLSIDD DSLFVNVGERNNVTGSAKFKRLIKEEKFGEAIEIAIDQVENGAQVIDVNM DEALLDSQKCMTRFLNIMATEPDAAKVPVMIDSSKWEVIEAGLQSIQGKG IVNSISLKEGEEKFIRQAKLIRRYGAAAVVMAFDEKGQADTEARKVEICT RAYDILVNQAGFPPEDIIFDPNIFAIGTGIEEHNNYGVDFINATGRIKQT LPYAKVSGGVSNVSFSFRGNNPMREAIHAVFLYHAIKQGMDMGIVNAGQL AIYDDLDPELREVVEDAVLNRRPDATDRLLEIAEKYRNQDSTGEDNGVAE WRSWSVEERLKHALVKGITHFIIEDTEEARQKFSLPLEVIEGPLMAGMDV VGDLFGDGKMFLPQVVKSARVMKQSVAYLEPFINATKQKGSSNGKVVIAT VKGDVHDIGKNIVSVVLQCNNFEVIDLGVMVPADKIIETAIAEKADIIGL SGLITPSLDEMEYFLGEMNRLNLNIPVLIGGATTSKEHTAIKLYPKYKYE VIYTTNASRAVTVCAALMNPESKAELWARTRKEYEKIQQSFAERKPLRSS LSLEQARANGFNPFAGEWANYQVPQPKQPGISEFKDVPIAMLRKFIDWSP FFRVWGLMGGYPDAFDYPEGGEEARKVWHDAQIMLDEFENNGKLTPSGVL GIFPAERAGDDIKIYQNSDRTLLAGVARHLRQQSERGKNSKIPYNLCLSD FIAEGSNGQQDWLGMFAVCAGTQEHALVDSFKAKGDDYNAILLQAVGDRL AEAMAEYLHFELRTRLWGYSDETFDNQALIDEKYIGIRPAPGYPSCPEHT EKQLIWDLLEVEQRIGMKLTESYAMWPAASVCGWYFSHPASSYFTLGRID EDQAADYAKRKGWDEREMRKWLGVSMK >MS1966 metJ, MetJ protein MGFFSLKYRQILRLLIGNFMADWDGKYISPYAEHGKKSEQVKKITVSIPI KVLEILTNERTRRQLKNLRHATNSELLCEAFLHAFTGQPLPTDEDLLKER HDEIPEQAKLIMRELGINPDEWEY >MS1726 nifS, NifS protein MKFPIYLDYAATCPADDRVAEKMMQYLTRDGIFGNPASRSHKFGWQAEEA VDIARNHIADLIGADSREIVFTSGATESDNLAIKGAAHFYQTKGKHIITC KTEHKAVLDTCRQLEREGFEVTYLAPKSDGLVDLDEFRAAIRPDTILASI MHVNNEIGVIQDIEAIGKICREHKVIFHVDATQSVGKLPINLAELPVDLM SMSGHKLYGPKGIGALYVRRKPRVRLEAIIHGGGHERGMRSGTLAVHQIV GMGEAYRICKEEMAEEMAHVTKLRDRLYNGLKDIEETYVNGSMEHRVGSN LNISFNFVEGESLMMALRDIAVSSGSACTSASLEPSYVLRALGLNDELAH SSIRFSLGRYTTEEEIDYTIDLVKSAVKKLRDLSPLWDMFKEGIDMSKIE WSAH >MS0856 oppA, OppA protein MFIRKVTFIGFLLFSAMLPFFSWAAPRVPEILTQNGLIYCTHSSGFSFNP QTADAGTSMNVITEQIYNKLFEIKNNSSRLEPSLAQSYKISEDGKTITVY LRKGVEFHHTPWFTPSRNFNADDVVYSLNRVLGHNTSLPEFNASEQQKGM KRQYNIFHELAKKTRFPYFDSIKLNQKIESVTALDPYTVQINLFAPDASI LSHLASQYAIIFSHEYALQLNADDNLAQLDLLPVGTGPYQVKNYFRNQYV RLIRHENYWKKEAEIKNIIIDLSPDRTGRLAKFFNNECQIAAFPDVSQLG LLQENGERFQTTLSDGMNLAFLAFNFKRPLMQDAEIRRGIAQAINRHRII KDIYYNTASVANKIIPSVSWAGSDSNNHSFAYDYDPAQAKKVLQDRQLSL DMWVLKEEQLYNPSPIKMAELIKHDLTKAGIEVKVRLISRNFLMEQLRNN SENYDLILGGWLAVSLDPDSFMRPILSCGTTSEITNLSNWCSQSFEEILD RALISNSTNERAVNYHLAEQEVLSELPILPIASVKRILISNSNVQGVEMS PFGSISFEKLSFKKGEK >MS1325 oppA, OppA protein MSNKMRSSLFSGKFSLVAKSAVIFCCFLSSVGCDRIKNLFSDTKQSVSEQ PAESMTSTKQIQTETVPEQHILSRGVYSDLVLNIRDVKSSEQADFMRDLF EGLVIFDIHGNIQPAVAESWETKDNKTWIFTLRQDAKWSNGEAVTAEDFA QAWKLLALSSSPLRQYLAFIHIDQAQEILEGKSDISQLGIKAQDEYHLQI SLDKPISYLPEMLAHIALLPAYSGGNSNKGELISNGAYKLAGQKADTISL VKNEFYWNAEKVSFPQVHYQKLADNTDVKKVDLVTDFRQIKMENVVNFPK LCTYFYEFNLKDQNLAKTAVRNALNSMISSHNIVRDSGLSGFAVSYFVPR NMEFESDESWQATVVEQILQQADFSEKNPLQFKLTYEQEGIHPNIANRLV RSWSQSDLISVKMEPVNWSQLQEKRAKGDFQIIRSGWCADYNDPSAFLNL LYSKNPDNKTGFSQERVDKLLEKAQQTISEPERNELYRQVLLISRQEHLF LPIFQYAKAVYLNPTLQGFDIHNPTEVIYSKDLSRKPMRQKN >MS2053 oppA, OppA protein MKLTTKFTLAALVLSAIGFVQAAETTFINCTSRAPTGFSPALVMDGISYN ASSQQVYNRLVEFKRGSVDIEPGLAESWDISDDGLTYTFHLRKGVKFHAN KEFTPTREFNADDVIFSFQRQLDSNHPYHKVSNGTYPYFNSMKFPSLLKS VEKLNDHQVRITLTRKDATFLASLGMDFLSIYSAEYADKMMRAGKPETID NQPIGTGPFVFAGYQVDKAVRFVANKDYWKGKAAIDRLVFSITPDAGTRY AKLQQGACDLAEFPNTADIERMKADKRIQMPSQESLNVAYIAFNTEKAPF DNVKVRQALNYAVDKNTILNAVYQGAGIAAKNPLPPTIWGYNDQVQPYEY NPEKAKQLLAEAGFPNGFETELWVQPVVRNSNPNPRRMSELVQSDWEKVG VKAKLVSYEWGDYIKRAKAGELTAGTFGWSGDNGDPDNFLSPLLAGVNAG NSNYARWKNAEFDALLDKAIGLTDKAQRAALYKQAQVIAHDQAPWIPMAH AVTYAPLSARVRDFKQSPFGYTSFYGVRVEDKK >MS0466 oppA, OppA protein MKKICTILTALFTATCVYADSTNNRLDYASTKDIRDINPHLYAGEMAAQN MVFEPLVINTNQGIRPFLAKSWRISEDGKSYLFHLRKDVKFTDGEPFNAF VAKMNIEAVLANFNRHAWLELVRQIDSVRAPDEFTLELTLKNPYYPTLTE LALTRPFRFLSPKCFNQGKTSQGVMCYAGTGPWILKKHKKNALADFSRNE NYWGELPKLNGVTWHVIPERQTMLLALLKGDIQLIFGADGDMLDMDSFKQ ISESGQFISAMSEANASRAIVLNSARTITSDQKVRQALQYAVDKAAIAKG VFNDTESIAETLMAKNVPYADVDVQTYPFNLLKAAQLLEEAGWNLSVGKN IREKAGKPLSLLLSYNINNAAEKEIAQLLQADFRKIGVDLQILGEEKQAY LDRQKNGDFDLQYSLSWGSPYDPASFVSSFRIPAHADYQGQKGLPNKTEI DEMIGELLITPNEQTRIKLYQKLFKTLAEQAVYVPLTYSKTKAIYSAQLE GVGFNPSQYEIPFEKMSFKK >MS1364 oppF, OppF protein MSESIKQATPLLEAVNLKKYYPVKKGLFAKPQLVKALDGVSFCLEKGQTL AVVGESGCGKSTLGRLLTMIETPTDGELYYNGQNFLENDKTTQKLRRQKI QIVFQNPYGSLNPRKKIGSILEEPLVINTDLTAAQRKARVLEIMAKVGLR AEFYHRYPHMFSGGQRQRIAIARGLMLQPDIVVADEPVSALDVSVRAQVL NLMMDLQKEMGLSYVFISHDLSVVEHIADQVMVMYLGRCVEQGRVEAIFK NPRHPYTQALLSATPRLSPKLSSERIKLEGELPSPLNPPKGCAFHTRCRL ATERCKQEQPLLKDYSDGTRIACFMVE >MS0462 oppF, OppF protein MSLLKVENLTKSYRTFNSLFSHLSHPALQNVSFQLEKGESVGLIGENGSG KSTLARIISGIEKADSGHVWLNGTDIYQRKNRRQQISVVFQDYFSSVNPT MTVLQAICEPLLEQKQAAAKSLEPLVVQFLKKVNLSTDCLHKYIYQLSGG QAQRVCLCRALINNPSLIILDEALSSLDIVTQVQLLELLIELKNEFQLSY FFISHNIQMICYLCERVLFFKQGQIITQSDIENLAEIKSDYAQKLIRSVI >MS1150 pabA, PabA protein MATILFLDNFDSFTYNLVDQFRGLGHQVKIYRNDCDLALLESIALQPDTI LALSPGPGTPAEAGNMLALIQRVKSAVPIIGICLGHQALIEAFGGKVVHA GEVLHGKVSKINHDEQAMFLNLQNPMPVARYHSLKGSNLPEELVVNATYN DIIMAFRHKNLPICGFQFHPESILTVQGAKLLENSVNWLLNK >MS2194 pabA, PabA protein MSKRLLIVNNHDSFTYNLVDLIRRLSVPMRVIEVEKLDLDEVEQFSHILL SPGPDVPEAYPEMFALLTRYYRHKAILGVCLGHQTLCRFFGGRLYNLRQV RHGVCGRLKVRSKSAIFSGLPEEFDIGLYHSWAVDSQNFPAELTITAECH EEVVMAFEHKTLPIYGVQFHPESYISEYGEQMLINWLNS >MS1550 pepB, PepB protein MKYSVKQTALEQENKSLFIAIFENQELSPAALKLDLKLKGEITEAVKNGE VSGKIGRILVLRHGAQRIILVGCGKQNEVTERQYKQIIQKAVKTAKETIA TTIINALTEVKIKDRDLYWNVRFAVETIEEDNYIFEQFKSKKSENNSKLA EIIFYTEENHEQAELAIRHATAISSGVKAAKDIANCPPNICNPAYLAEQA NQLAGRSSLIETTVIGEKEMRKLGMNAYLAVSCGSKNEAKLSVMEYRNHE NPNAKPIVLAGKGLTFDAGGISLKPAADMDEMKYDMCGAASVYGVMNAIA ELQLPLNVIGVMAGCENLPDGNAYRPGDILTTMSGLTVEVLNTDAEGRLV LCDTLTYVERFEPELVIDVATLTGACVVALGQHNSGLVSTDDNLAQDLER AAKLANDKAWRLPLSEEYQEQLKSKFADLANLGGRWGGAITAGAFLSNFT KNYPWAHLDIAGTAWLQGQNKGATGRPVSLLVQFLLNQVK >MS0667 pepB, PepB protein MQIQLSNLPAPKSWGKNPLLSFSDNQATIHLENSEKSDRTLIQKAARKLR GQGIDDVELVGNDWSLENCWAFYQGFYTAKQDWAVEFPELGDDHEELLAR IQCGDFVREIINLPSSVITPLELAQRSARFIAGLAEEYAGKSAVDFHIIS GEELKAQNYLGIWNVGKGAENPPAMLQLDFNPTGNPESPVLACLVGKGIT FDSGGYSIKPSNFMDSMRTDMGGAALVTGALGLAIARGLNRRVKLFLCCA ENLVSGNAFKLGDIITYRNGVKAEILNTDAEGRLVLADGLIDASSENAQF ILDAATLTGAAKVALGNDYHCVLSMDEELTTDLFNAAKEEQEPFWRLPFE ELHRSQISSSFADISNTSSAALAAGASTATAFLSHFVKDYQQNWLHLDCS ATYRKTPSDLWATGATGLGVQTIANLLLTKATQL >MS0815 pepD, PepD protein MQYNEQLLERFFNYVSLDTQSKPGAKTSPSTQGQLKLAKILEQELYSLGL DEIEVSKHGIVTALLPGNIENSPTIGLIAHLDTSPQCSGKNVKPEVIENY RGGDIALGLGDEFISPVTFTFLHKLVGKTLIVTDGTTLLGADNKAGIAEI MTALSQLKESSVPRCHIRVAFTPDEEIGLGMKFFPIEKFSCDWAYTIDGG AVGELEYENFNAAGATVTIFGRAIHPGSAKDKMVNALTLACEFQQGFPTD EVPEKTEEKQGFFHLNSFHGDIEKVELHYLIRDFDKQAFTQRKAFLEKWV DEFNCRKQLKEPVKVTITDNYYNMYDTVSKVPQSIELADSAMKACGIVPI HQPIRGGTDGAWLAEKGLACPNIFTGGYNFHSKHELITLEGMCSAVDVIM KIAQLAVK >MS2118 pepD, PepD protein MSEIQSLQPQLLWKWFDQICSIPHPSHHEEQLAEFIVNWAKGKGFYAERD EAGNLLIRKPATKGMEHCQSVALQAHLDMVPQANEETDHDFTSDPIQPYI DGEWVKAKGTTLGADNGIGMASALAVLDSENLAHPALEVLLTMTEEVGMD GALGLRKNWLQSEIMINTDTEDNGEIYIGCAGGENADLTVPVQWQENNYE HCYQISLKGLRGGHSGCDIHTGRASAIKTLARFLANLQQNQPHFEFSLSE IRGGSVRNAIPREAFATLCFNGEPANFTQGVKSFESLLKTELAIAEPDLQ LTAQPAEKATKVFAPNTKNNVVNLLNALPNGVIRNSDVVENVVESSLSIG VLKTTEDAVKGTILVRSLIESGTNYINGLLISLTELCGASVQFSGRYPGW EPHAETPILTLTKEIYGELLGYEPAIKVIHAGLECGLLKKIYPALDVVSI GPTIVNAHSPDEKVHIPAVRTYWELLTKVLAGIPAKK >MS1554 pepE, PepE protein MQAVLSPLNMEIISGKMLRHNGESREEHLAEFLIVNPTALVYAHPESTAL HIEGRQATILE >MS1034 pepN, PepN protein MHAKAKYRKDYKKPDFTVTDIHLDFQLDPQKTVVTAHSQYQRLNPAATVL RLDGHSFQFASIKVNGKDFATYQQDGESLTLDLSDIDAERFELEVITRLV PAKNTSLQGLYQSGEGICTQCEAEGFRQITYMLDRPDVLARYTTKITADK SKYPYLLSNGNRIAGGDLEDGRHWVEWNDPFPKPSYLFALVAGDFDLLED SFTTKSGREVKLELFVDRGNLNRASWAMESLKKAMKWDEERFDLEYDLDI YMIVAVDFFNMGAMENKGLNVFNSKYVLANPETATDEDYLNIESVIGHEY FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDVGSRAVNRIKNVKFLR TAQFAEDASPMSHPIRPEKVLEMNNFYTMTVYEKGAEVIRMMHTLLGEKK FQQGMKLYIAENDGKAATCEDFVAAMEQASGVDLTQFRRWYSQSGTPELT VTDSYDEKKRSYKLYVSQMTAPTADQMDKVNLHIPLKIALYDMNGMPFSL IKDDEAVNDVLDILLEDQVFEFHNITSKPVPALLCDFSAPVKLDYDYSTA QLIALLKFAHNEFVRWDAMQMLFAQELRRNLSAYQQGEQLTFSAEILSAL QQVLENYQSNVELTTLILTLPKETEFAELFKTIDPEGIAVVCDFMQHAIA EGLQDLWLKTYHQINLEEYCIDMRDIALRGLRNLCLQYLAFTDYGNALVN KHYLYADNMTDKLAALAAATKAQLTCRDKVMKDFEEKWQHDGLVMDKWFN LQATRPDGNVLTLVKQLMDHPSFNFNNPNRLRALVGSFESQNLRAFHAVD GSGYRFLTDVLLRLNESNPQVAARLVEPLIRFSRYDSQRQTLMKRALERL REVENLSNDLFEKIEKALQ >MS0479 pepP, PepP protein MDLAYMAELPADEFVLRRQKLAAQLTDNSVFIVFSEVEKRRNNDCTYPFR QDSYFWYLTGFNEPNSALVIQKKGKLVETTIFVRPSNPLMEIWNGRRLGV ERAAEKLHLDQAFSIDDFARIFGKICQNSTALYHYQGLQPWADQLLAETF ISPPDYINWAPMLDEMRLFKSANEVRLMQQAGQITALGHMKAMRQTRPNR FEYEIESEILHEFNRFGARYPAYTTIVAGGENACILHYTENDQPLKDGDL VLIDAGCEFAMYAGDITRTFPVNGKFTQAQREIYQIVLNAQKRAIELLVA GNSIQRANDEVVRIKVKGLLDLGIMRGDIDELIANNAHREFYMHGLGHWL GLDVHDVGSYSKEGQNGDRNSKVRDRPLEIGMVLTVEPGLYISPKSDVPE QYKGIGVRIEDNILITEYGNKVLTAAAPKEIGDIEALMATER >MS1657 pheA, PheA protein MALDLSEIRQQITQIDRSLLKLLSERHRLAFDVVRSKEITQKPLRDEKRE QQLLQELINFSENENYQLEPQYITQIFQKIIEDSVLTQQVYLQKKLNEQR EQSIHIAFLGKRGSYSHLAARSYATRYQEQLIELSCSSFEQIFEKVSSGE ADYGVLPLENTTSGSINEVYDLLQHTDLSLVGELTYPIKHCVLVNGQDDL SKIDTLYSHPQVIQQCSQFIRSLNKVHIEFCESSSHAMQLVSSLNKPNIA ALGNEDGGHLYGLTVLRSNIANQENNITRFIVIARKAITVSPQIHTKTLL LMTTGQEAGSLVDALTVFKKYQIKMTKLESRPIYGKPWEEMFYLEIEANT NHPDTQAALEELRQYSTYLKVLGCYPSEIVKPVDVR >MS0583 potB, PotB protein MKGIKKDLKAWLLLCSGLGTILFLMGSTFYIVVTQSLGLYNISGEDSRFT LQYWHDVLTNSVFQSSYIYSVKVSLLGAILSIIVSYPIAMWLRNELPAKV TIITILRAPMLVPGLVAAFLFVNMISYHGILNETMVFLGIWHEPKTLQND EFGWGVVILQMWKNIPFALILIGGAVNSLKTDLLDAAANLGSTSWQRFRY VIFPLTLTAVQVSFILIFIGALGDFAFYSIAGPRSTYSLARLMQMSAYEF EEWNQSAVMAMMIMLTSAFFTILVSIIIKPLAVKRGDIK >MS0811 potB, PotB protein MKMTTRKFQNSTVAVIFAWLIFFMFVPNFLVLIVSFLSKDSSNFYALPFT FENYARLFEPLYGTVVWNSLYMSGIATVICLLIGYPFAFFMAKLNPKYRP ILLFLLVLPFWTNSLIRIYGMKVFLGVKGILNEFLLFTGIIDEPIRILNT EVAVIIGLVYLLLPFMILPLYSAIEKLDLRLLEAAKDLGANGIQRFIKII IPLTMPGIVSGCLLVLLPAMGMFYVADLLGGAKVLLVGNVIKSEFLISRN WPFGSAISIGLTILMALLIFVYYKANKLLNKKVELE >MS0581 potC, PotC protein MSSAKITTKNSKIIARISLTFFVLVNFIWLVLPFLMAGLWSLVDPKQPWS YPDILPPSLSLERWQMVWENTSLPEAMFNSYTIAPTVSLITISLSIPTAY AFGRMEFRGKKIAELLTLIPLVIPGMIIALFFSRMLLDLNISNPFVGIVI GHVVLTLPYAIRILSAGFSSVPQDLIEASRDLGASKFTVFKDVYMPMLKP SFLASIIFCLVKSIEEFAISFVIGSPDFITVPTILYSFLGYSFIRPNAAV VSIILLVPNIILMMIIEKLLKGNYLSQSTGKA >MS0810 potC, PotC protein MSRVLRNIFMLVVYAYLYIPIIILVGNSFNADRYGLSWKGFSFAWYERLA NNDTLIQAAVHSVTIAFFAATFATIIGSMTAIALYRYRFRGKQAVSGMLF VTMMSPDIVMAVSLLALFMIIGISLGFWSLLLAHITFCLPYVVVSVFSRL KGFDLRMLEAARDLGASEVTILRKIIFPLALPAIISGWLLSFTISMDDVV VSSFVSGVSYEILPLKIFSLVKTGVTPEVNALATIMIVLSLLLVLLGQII GKKDKS >MS2292 potD, PotD protein MLVTATAFFSTASFAAPKQLYIYNWTDYIPSDLISKFTRETGIKVNYSTF ESNEEMFSKLKLTINKPGYDLVFPSSYYISKMVKENMLTPINHSKLTNLK QIPSNLLNKDFDPANKFSLPYVYGLTGIGINTSFVNPDEVTGWGDLWKEK FKGKVLLTADSREVFHIALLLDGKSPNTQNEEEIRNAYQRLTKILPNVAA FNSDTPELPYIQGEVELGMIWNGSAYMAEKENPAIKFIYPKEGAIFWMDN YAIPKNARNIEGAHKFIDFMLRPEHAKIIIERMGFSMPNEGVKVLLKPED RVNPLLFPPEDEVKKGVFQADVGDATDIYEKYWNKLKTN >MS0809 potD, PotD protein MSWNYQGQIFYSLSTGANSMKKLAGLFAAGLIAVAVTGCNDKESKSADAN APETAKDNGTVYLYTWSEYVPDGLLDDFTKETGIKVIVSSLESNETLYAK LKTQGADGGYDIIAPSNYFVSKMAREGMLKELDHSKLPVIQELDPDWLNK SYDPNNKYSLPQLLGAPGIAYNTQTYKGSDFTSWGDLWKPEFAGKVQLLD DAREVFNIALLKLGKNPNTQDPAEIKAAYEELLKLRPNVLSFNSDNPANA FISGEVEVGQLWNGSVRIAKKEQPGSIDMIFPKEGPVLWVDNLAIPATSK NPDGAHKLINYLLGAKAAEKLTLAIGYPTANVEAKKVLPKEITEDPAIYP TAELLRTANWQEDVGEAVELYEKYYQELKAAK >MS0552 potE, PotE protein MSNKKIGLLSLTALVLSSMIGSGIFSLPQNMAAVAGAEAISIGWLITGIG IIFLGLSFFFISRLRPELDGGIYTYAREGFGDLMGFMSAWGYWLCATIGI VGYLTVAFEGLGVFTDSENTVIFGQGNTVASFIGSSIIVWLVHALIAGGI KEAASVNLVATFVKVAPLVLFILLGFWFFDTDIFNSDVKASALNNNIGDQ VKDTMLITLWVFTGVEGASVLSAHAKKRTDVGLATVLGILIALALYVAIT ILALGILPRETIAEMPNPSMGPLLDAMMGPTGKVIITACLIVSVLASYIS WTMYSAEIPYRGAQKGAFPKILDKLNENSTPINSLWFTGFIVQFCLILVF VFEQSYNTLLLISTSMILIPYFLIGAYLFKLAIQTNSAWYIKLTGFMASI YGLWIVYAAGLQYLLLSVVLYVPGILLFLYSHRKFHGKFKLKGFEQTILA MIFILFCYAVYRLPELLAA >MS1604 proA, ProA protein MTDLIQMGKQAKQAAFALSQLSQQEKNHALALIAERLEAQQERILAENAK DIQAARENGLSESIIDRLLLTKERLTGIADDVRHVISLADPVGKIIDGGV LDSGLKLERIRTPLGVIGTIYEARPNVTIDVASLCLKTGNAVILRGGKET QHSNKILVEVIQNALQQAGLPEMAVQAITDPDRALVMELLHLDKYVDMII PRGGAGLQALCRDNSSIPVIVGGIGVCHIFVEQSADQDRSLAVIENAKTQ RPSTCNTVETLLVQESIAEEFLPKLARRLKTKEVKFHADSTALSILQGVS ADVKPVTEQQLRNEWLTYDLNVVIVKGIEEAVEHIREYGSEHSESILTES QKLANQFVAQVDAAAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMG LEALTTYKWVCVGDYTSRA >MS1862 proB, ProB protein MKFGTSTLTQGTPKLNRAHMIEIVRQLAQLHQEGYRLVIVTSGAMAAGRH YLNHPKLPPTIASKQLLAAVGQSQLIQTWEQLFAIYDIHIGQLLLTRADI EDRERFLNARDTLHALLDNRIIPVVNENDAVATAEIKVGDNDNLSALVAI LVQAEQLYLLTDQQGLFDSDPRKNPQAKLIPVVNEITDHIRSIAGGSGTT LGTGGMSTKITAADIATRSGIETIIAPGNRENVIADLAHGEAIGTKFTVQ TDKLESRKQWLFAAPSAGILTIDQGAENAILEQHKSLLPAGIVNIEGRFS RGEVVKIRTQQGKDIALGMPRYNSDALYLIQGKKSQNIEKILGYEYGSVA IHRDDMIVLNK >MS1799 proC, ProC protein MKNKLLTFIGGGNMAQAIVFGLLNKGYSAAKLIVCDRNEAKRNLFAQKGV EVNLTNVEAAEKAEVVVLAVKPQAMAETCGPLSAVDFSGKLVISIAAAVS VSRLSALLPTAKNIVRVMPNTPALVSEGMAGLFASAGLNGEYQDFAEDLL NAVGKTCWLQKEEDMHAVTAGSGSSPAYFFLFMEAMEKTLSSMGISPENA RTLVQQSALGAAKMVENNPQLPLSTLRENVTSKGGTTAAALAVFNQYQLD KIVQQAMEACVARSQEMEKLF >MS1178 proP, ProP protein MPNKAETSPAKLRLKAFLKRIKIMNTTENSKQKPVNVVAFAFLLTAFLTG IASSFQTPTLSLFLAQEIQVSPFMVGMFYTSNAVLGIVLSQILAKYSDSQ DDRRKIIIFCSLLAIGGCITFAYNRNYYVLMFFATFLLSLGSSANPQAFA LAREYADYTKREAIMFTTIMRTQISLAWIVGPPLSFSIALGWGFEYMYMV AASAFLLCAIIAKALLPYVPRKAVVPLTKPDEVAGLPAKNKKQSDKQSIR LLFITCFLMWSCNGMYLISMPLHVINELHLSERLAGILMGTAAGLEIPVM LIAGYLTKYLTKKSLILTALFMGLFFYIGMLFAEQTWQLVALQAFNAIFI GIIATLGMVYFQDLMPGKMGSATTLFSNAAKSSWIVAGPFVGIIAQIWNY SSVFYISIVLVAVSLFSMSKVKSV >MS2054 proP, ProP protein MASGEANYRSLAWIAASALFMQSLDATILNTALPTIAADLHHSPLEMQLA VISYALTVALFIPISGWVADKYGTLRVFRFAVGMFALGSLACAMSSSLIM LIFSRVLQGFGGALMMPVARLSIIRSVPKQELLPVWNLMATAGLTGPILG PILGGWIVTYTSWHWIFLINIPMSLLGIWLANRYMPNVTGSLQKLDWAGF FFLGGGLVGVTLGFDLISEEFIAKWQATVIVILGVILIITYCFHAQKRER LALLPLSLFKIRTFRVGIMANMLIRLCASGIPFLLPLMYQVVFHYSADKA GMLIAPIALSSMLVKPLCGRILTKLGYRTALISASIVLTLSIAVMSFLHI DSPVWILIVNVALYGGCISIVFTAVNTLTISELSDQDASAGSTFLSVVQQ VGIGLGIAVSALILSLYRYFIGESAVQLQQAFGYTYLTSASFGVLLVLVL SGLKKEDGAHLHK >MS2374 proP, ProP protein MSGEKTSRYVLGVTLVATLGGLLFGYDTAVISGTVSSLDTVFIQPKGLPE ISANSLLGFCVASALIGCIIGGACGGYLSSKYGRKKALLIAALLFLISAF GSAYPEFGLKTINETNNIPYYLSNFLIQFVIYRIIGGIGVGIASMVSPMY IAEITPARIRGKMVSFNQFAIIAGQLIVYFVNYFIALNGDNTWLNMLGWR YMFLSEMVPAALFLILLFFVPESPRWLVLQNKFSQAEITLLKLLGERSGK TELQNIVSSLEHRVVKGAPLFSFGLGVIVIGIALSVFQQFVGINVALYYA PEIFKSLGASTNNALLQTIIMGTINLSCTTIAIFTVDKYGRKPLQIIGAL GMAMGMFVLGMAFYANLSGTIALTGMLFYVAAFAISWGPVCWVLLAEIFP NAIRSQALAIAVAAQWIANYIVSWTFPMMDKSSYLVERFNHGFAYWVYGL MAILAALFMWKFVPETKGKTLEELELLWNKK >MS0392 proP, ProP protein MSTAKKRNFIFIATLGILSMLPPLGVDMYLPSFLNIARDLQVDPERVQYT LTFFTFGMAAGQLFWGPVGDSYGRKPIILLGVIIGAVAAFFLTGVNSIEN FTALRFIQGFFGSAPVVLVGALLRDLFDKNELSKTMSMITLVFMIAPLVA PIIGGYLVLFFHWHSIFYVICAMGILSAILVFFIIPETHHQDNRIPLRLN VVVRNFVTLWRRKEVLGYMFSSGLGFGGLFAFLTAGSIVYIGLYGVPVDQ FGYFFMLNIGVMTLGSVINGRVVHRVGAERMLQIGLTVQLIAGIWLLIVA CFDLGFWPMALGIAVFVGQNSLISSNAMASILEKFPTMAGTANSVAGSVR FGLGATVGSLVALMKMDSAAPMLFTMGICVIVAVCCYYFLTYRSL >MS0191 proP, ProP protein MSNKVNSYGWKALMGSAVGYAMDGFDLLILGFMLSAISADLSLSPTQAGS LVTWTLIGAVAGGIIFGALSDKYGRVRVLTWTIVLFAVFTGLCAFAQGYW DLLIYRTIAGIGLGGEFGIGMALAAEAWPARHRAKASSYVALGWQVGVLA AALLTPLLLPIIGWRGMFLVGIFPAFVAWYLRAKLHEPEVFVQKQAEVAT GKRQSPFKLLIKDVATAKVSLGVVVLTSVQNFGYYGIMIWLPNFLSKQLG FSLTKSGVWTAVTVCGMMAGIWIFGRLADRIGRKPSFLLFQIGAVISIIA YSQLTDPAIMLFAGAALGMFVNGMMGGYGALMSEAYPTEARATAQNVLFN LGRAVGGFGPVIVGAVVSAYSFKIAIALLAVIYVIDMIATVFLIPELKGK ALK >MS1798 proP, ProP protein MNVRPFTWLALSYFGYYCAYGVLVPFLPVWLKSQNYGTELIGAVIASSYL FRFLGGIFFPSRVKRANQILPALRLLAWANVFVITAMAFVSESFWLIFIA IAVFSMVNAAGMPLTDSMATTWQRQIRLDYGKARLIGSAAFVVGVTVFGS LIGAIGEQYIISILIGLFGLYAVLQMVPPQPKPADEDKNSAKSAVGFGEL LKNPTHLRLIIAAMLIQGSHAGYYVYSVIYWTNRGIAVETTSLLWGLGVI AEILLFFFSGRLFRNWSVNAIFYLSAAAAALRWGAFSYTDALWQIALLQC LHSLTFAALHYAMVRYIGMQPQNAMVRLQSLYSGLASCASVALLTALAGI IYPISSHWVFLVMMICALIALFVIPRKPTNA >MS0785 proP, ProP protein MQNKFAVYLAAIGHLVTDMAQGALPALLPLFIKNYGLTYQEAGGLIFANT VLASIAQPFFGYLADKRSMPWLIPLGMMLSGCCIAAMGFVHSYPGLFFFA MIAGIGSALFHPEGARLVNRMSGGEKGKAMGIFAVGGNAGFAIGPMFAGL AYLFGAQTLSIFALINTIIALIIFLQLPKLTVENVVNKAKNTASTTLQND WRSFAKLSVIIFVRATNFTVLNAFIPIYWIHILHQQETDANFALTIFLSM GVAITFIGGLLSDRLGYVRIIRYAYLIFLPTILIFTQSENLWLSFILLIP LGLGVFTQYSPIVVLGQTYLAKSVGFAAGITLGLGITMGGIFSPIVGWIA DHYGLQIALQTLSVLSLLGLIFSYRLKITDTEKPEKK >MS1407 proP, ProP protein MMTSSRPNLTLLLILGALMACTSLSTDIYLPAMPTMAKELQGNTELTITG FLIGFAIAQLIWGPISDRIGRKIPLFIGMALFAVGSVGCALSQSMAEIVF WRVFQAVGACVGPMLSRAMIRDLYDRSQAAQMLSTLTIIMAAAPIIGPLL GGLLLKISSWQAIFWLLVVIGILLFLSIIKLPETLPPAKRAAGSFWSAFG NYRILLKNRAFMRYTLCVTFFYVAAYAFITGSPFVYIDYFKVDPQYYGFL FGVNIVGVALLSAVNRRLVRHYPLESLLRVSTMIALCAVLILVVLVFMDL DGIAGILSVAVPIFIMFSMNGIIAACTNAMALDSVQPEIAGSAAALLGSL QYGSGILSSLLLAYFSDGTPHTMAWIIALFVGLCAVIGWGQRPRSA >MS0499 proP, ProP protein MNLREHIDNNPMSAYQWTVVIIAAIMNLLDGFDVLALAFTATAIRGDLGL SGAELGYLFSAGLLGMAAGSLFLAPLADKIGRRPLLLISVTLSALGMLGS AYSASYGALGFWRLITGLGVGGILVGTNVLTSEYSSRKWRSLAISIYASG FGIGAVLGGMFAVVLQEEYSWHAVFLAGFILTAVCLIVLLIWLPESIDFL MTQQPRNAQIRLNKITKKMGLKGQWTLPEKVLASASKLPLTQLFNKNYRK STALIWIAFFAIMFCYYFVSSWTPALLKEAGMTTEQSVSVGMMVSLGGTC GSLLYGLLASRWKAKQMLVQFTVLSAFSVIIFILSSSILWLAMLFGILVG GFMNGCISGLYTLNPSIYAANIRSTGVGWSIGVGRIGAILAPLAAGVLLD YGWDKQSLYIGVGFVLLIAAIALSLLRIKTTLVKC >MS1530 proP, ProP protein MNTETKQPALIVPRLSLMMFMEFFIWGSWSVTLGIVMTKYDLSTLIGDAF SMGPIASIISPFILGMLVDRFFPSEKVLAVLHLIGAAILWFIPEFITGQQ GGTLVFALLAYMLCYMPTVALTNNIAFHSLADSEKSFPVIRVFGTIGWIV AGLFIGQADLSASPAIFQVAAICSLILGLYSFTLPNTPPPAKGKPFSMRD LMCADAIALFKIPHFLVFAICATLISIPLGTYYAYAAPFLDAVGFEKIGS LMSMGQMSEIVFMLLIPFFFKRLGVKYMLLAGMLAWFLRYAFFALGVSEE IRWAVYLGILLHGICYDFFFVVGFMYTDKVADEKIRGQAQSLVVLFTYGL GMLLGSQISGGLYNNMFADNTDVSTWSTFWWIPAISAVVISVIFFIFFNY KEDKREA >MS0807 proP, ProP protein MLMTSQNKINAVPSNQNFYLNNRNYWIFSGYFFVYFFIMATCYPFLGIWL GDINGLSGEDRGTVFAMMSFFALCFQPVFGYVSDKLGLKKHLLWVLGISL LIYAPFFIYIFAPLLKVNVWLGSLVGGAYIGFVFQAGAPASEAYIERVSR RSKFEYGRVRMFGMFGWAICASIAGVLYATNPNLVFWLGSIASLILLLLI ALAKPEQTSTVQIAEKLGANKNPVNLRQAFALLKLPKFWALLAYVMGIAC VYDIFDQQFGNFFNTFFESHEQGIKMFGYVTTAGELLNALIMFFVPLIIN RIGAKNALLIAGTIMSVRIIGSSYAIEAWHVVVLKTLHMFEVPFYLVGLF KYIANVFEVHFSATIYLVACHFAKQIGNMLVSPLVGAWYDTYGFQDTYLI LGCIAAGFTLLSVFTLTGKSLSSQS >MS0797 proP, ProP protein MSQNHFFSHIFNRNMLICIFTGFSSGLPLYILTSLIPTWLRSTEIDLKTI GFFTLTSLPFIWKFLWSPFLDRFVPPFLGRRRGWMLIFQLLLLISLGLFG FIDPHTNQGLSLLIGLATMVSFFSASQDIVLDAYRREILSDQELGMGNSI HVSAYRIAGLVPGSLSLILSDHFSWQAVFIITALFMLPGLLMTLFISHEP QIELKSNRTLAENIVEPFKEFFQRKGLWGAIGILTFIFLYKFGDSMATAL ISAFYLDMGFTKTQIGLVVKNASLWPMIIAGIIGGMITLKIGINKALWLF GLVQIVTILGFAWLAQLGPFEKVDSFAIFALTVVVMAEYVGIGLGTSAFV AFMARATNPVYTATQLALFTSLSALPRAVFNSFSGVLIENMGYYHYFWLC FFLAIPGMLCLIWVAPWKEK >MS0549 proV, ProV protein MTTSVKISVKNLTKIFGSHPKSAFKLLQNGKTKEQIFAETGSTVAVNNVS LDIMAGEIFVIMGLSGSGKSTLIRLLNRLIEPSAGHVFIGDDDIAEMSEK ALRAVRRKRISMVFQSFALMPHMTILENVAFGLELSGVNSKNRRRMALET LARVGLEAYADVYPGELSGGMQQRVGLARALANDPEILLMDEAFSALDPL IRTEMQDELLRLQENSERTIVFISHDLDEAMRIGNRIAIMQDGQVIQVGR PDEILQNPANDYIRSFIQGVNVSNVLSAKDIASKRHLLNIVQKSEDETPH VAFKLLEQHERDFAVVLDRYGYYKGMVSVDSLQQARSNRQSLSQSFIEIT PLSPEQSISDIINDVATTREPLPVVDDKGHYYGVVTKVKVLQTLDRGTEA >MS0550 proW, ProW protein MTTENIRTADPWEATLQAAQQDNAYAWLQGSEQSQDFNWMYPFDHTLVPF GDWVESLINWLVTHLRSFFQFISAPIDYILSLFQTSLNVLPPTVMIILFT LLVWQFTHFRLALATLLSITLIGAVGAWNEMMITLALVLTSVSFCLLIGL PLGIWMARSTRASAIVKPVLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVT IIFALPPIVRLTILGIQQVPEALIEAAQAFGASKKQLLYKVQLPLAMPSI MAGVNQTLMLSLSMVVIASMIAVGGLGQMVLRGIGRLDMGEAATGGLGIV LMAIVLDRLTQKIAENMHSQHKVRWYERGITGLFIRKK >MS0551 proX, ProX protein MAYPMKLTILFSLALFASNAVRADDKAIQPLQSPLAEETFQTLIVVKALE ELGYRVNPPKEVDYNVAFTSIANGDATFMAVHWLPLQADKYANAGGDRKL YRQGTFVEGAVQGYMIDKKTADTYNITNLAQLKDPKLAKLFDTNGNGKAD LIGCSPGWSCEYTVSQHIDGYGLSRTVEVTQGNYSALIANTIAQYQNGKS ILYYTWTPYWVSGVLVPGKDVVWLQVPNRPDPGKTVADTNLANGKNYGFT VSSMHIVANKTFTDAHPDAARLFAVMRLPAGDISAQNMAMRNGQNSSQDI ERHAEAWIKFHRVQFDEWIKQAKSAKN >MS1536 prsA, PrsA protein MPDIKLFAGNATPELAKRISERLYISLGDATVGRFSDGEIQVQINENVRG SDVFIIQSTCAPTNDNLMELIVMVDALRRASAGRITAVIPYFGYARQDRR VRSARVPITAKVVADFLSSVGVDRVLTCDLHAEQIQGFFDVPVDNVFGSP VLINDILKKTDLENPIVVSPDIGGVVRARAVAKLLNDTDMAIIDKRRPKA NVSQVMHIIGDVSDRDCILVDDMIDTGGTLVKAAEALKERGAKRVFAYAT HAVFSGSAAQNIANPALDEVVVTDTIPLSAEIKALGKVRSLTLSAMLAEA IRRISNEESISAMFDA >MS0777 putP, PutP protein MFGLDPTLITFTIYILGMLAIGVLAYYYTNNISDYILGGRRLGSFVTAMS AGASDMSGWLLMGLPGAVYVSGLIEGWIAIGLTIGAYLNWLFVAGRLRVH TEFNNNALTLPEYFHSRFGTSHNLLKIISASIILVFFTIYCSSGVVAGAK LFQNLFGIPYATALWYGALATIAYTFIGGFLAVSWTDTIQATLMLFALIL TPVVIVVSLGGIDGFSASMQSAEIDMQKDFTDLFTGTSTLGLFSLAAWGL GYFGQPHILARFMAAYSAKSLHKARRISITWMIICLIGAISIGFFGIAFF HANPQIAEVVTKEPEQVFIELAKLLFNPWVAGILLSAILAAVMSTLSCQL LLASSAITEDFYKGFIRPKAGEKELVWLGRIMVLIIAALAIWIAQDENNK VLKLVEFAWAGFGSSFGPVVLLSLFWKRMTSSGAIAGMLTGAIVVFSWKS VIPATSEWSGVYEMIPAFSLASLMIILVSLLSPAPNKEIVETFEKANLAY KNAE >MS1741 putP, PutP protein MNVDYLVMAGYFALIIAISLLFKKMASNSTSDYFRGGGKMLWWMVGGTAF MTQFSAWTFTGAAGKAFNDGLSVIAVFVGNMVAYACAYWYFARRFRQMRV DTPTEAIGRRFGTSNEQFFTWVIIPLSVINAGVWLNGLSVFASAVFDADI TMTIYVTGISVLIISLLSGAWGVVASDFVQMLVVAVISVACAVVGLVVIG GPGEIIDRFPGGFVSGPDMNYPLILICTFLFFIVKQLQSINNMQDSYRFL NAKDSKNASKAAIFALLLMLVGTIIWFIPPWVTAIIYPEAASLYPQLGKK ASDAVYLVFAKNVMPAGTIGLLMAGLFAATMSSMDSALNRNSGVFVRSFY APIIRKGKADDKELLRAGQIVCVINGILVILMAQFFNSLKHLSLFDLMMQ VATLLQSPILVPLFLAIIIRKTPKWAPWATVLFGMFVSWSVVKVFTPEYV ASWFGVEDLTKREISELKVIITIAAHLIFTAGFFCLTTLFYNEAKDTNNE RRIAFFKDVDTECVAEEGQDEIDRLQRKKLSTLVMLMAAGLLLMILIPNP LWGRALFACCSLAIFAVGYGLKRSAEV >MS0535 rhaT, RhaT protein MLQKYRGEIILFIVSLIAASGWFFSKFSMAEFPALGFIGLRFFLAAIFFF PLAYPQLKRLDKPQLIKSALVGLCYAVYIMLWMLGLINSAHFGEGAFLVS LSMLIAPLLSWLIFGHLPYKSFWLALPAAFTGLYLLSSGKGGLHFSFGSL IFLISSLVAALYFVLNNQYARDIPVLSFTTIQLFIVGTCCGTLSILFEQW PTSISMTAWGWFLCSLVIATNLRMLLQTYGQKYCHVATAAIIMILEPVWT LFFSILILGERLTLHKAFGCLSILAAIMIYRLPAILRNQASANKE >MS0885 rhaT, RhaT protein MLRPSCREKIIMVNNYNLALIKVHFTAVLFGLTGVLGVIISADSDVIVLG RVIIAFLALSVYFLIKREKLTALSTKDVANQSLSGALLTAHWVTFYVAVK VGGVAVATLGFAGFPGFVALFERLFFQEKLKRRELILLIAVTIGLILVTP QFEFGNQSTQGLLWGIFSGAIYGILAILNRKNINKLSGTQASWWQYLIGS ILLFPFAAHKLPAVSVTDWFWIACLGLLCTSLAYTLFVSSLNIINARTAA MIISLEPVYAILIAWIWLGEQPGLRMIIGGLIILLSVGVVNFRR >MS1754 rhaT, RhaT protein MFYLIAAVLIWASAFIAAKFSYTMFDPALTVMLRLILSALLVLPTFFRSY RKIPKQYRLQLWGLGLLNFPVVLLLQFTGVHYTSVASAVTMLGTEPLVVT LLGHIFFHKPARLLDWLLGIVALTGIVFVVYGSESGGEVTLLGCTLVLLG SIAFSFSIHLAQSVMKAVEAKAYTDVIIMTGAISCVPFSLLLVQDWQIHL NIEGISAILYLSVGCTWLAYRLWSKGLRVSSANTASILTTLEPVFGVLLA ILLLGEHLTLTTLFGICLVISAAGISVLSSMLINYIKNKVTIL >MS1595 rhaT, RhaT protein MNQQPVLGFIFALITAMAWGSLPIALQHVLTVMGAESIVWYRFFVASLAL FLLLAWKKKLPALSQFTSRYWKLSLIGVLGLAGNFFLFNSSLNYIEPAIT QIFIHLSSFMMLICGVFVFKEKLGAHQKAGLLILILGLGLFFNDKFDMLF GLNMYSTGILLSVSAAVVWVAYGMAQKLMSRQFTAQQILLIMYTGCVIVL CPFAQFSQIQGLSGFALGCFIYCCLNTLIGYGAYAEALNRWDVSKVSVVV TLVPLFTILFSRILHGLDPAHFAMPHLNTVSYIGAFVVVLGAIISAVGYK LFKYKR >MS1753 rhaT, RhaT protein MLFQIIATLIWASAFIAAKYTYEMMDPVLMVQCRFFIASIIMLPGFFAAY KRVPKERLKIMWLLALINFPLMFLLQFIGLYFTSAASAVTMLGMIPLLTV LIGFLFFKRRINKIDLLLSLVALAGIILTVVGGGEDNLINPWGCLLVLGS AVSFCFCLYLSKDVMQEMAPKDYTNVLVILGSILCLPFTCVLVRDWSIVP SVKGMISLFYLGIGCTWLAVVLWFKGVQKTPTYISSILTTLEPIFGVILA ILILDERLSTVSAMGILLTLGAAAVSVLIPVLMKKSP >MS1825 rhaT, RhaT protein MLMPHFTQSKGYGYFCLILATFFWGGNYMFGRILSHVIPPIILNYLRWLP AAIILLLLFAKYLPQQRHIIRKNWQILTALALLGVLIFPVFLYQGLQTTT ALNASIYLAVVPIVVMFLNRICFKDTIRFPVFIGALISFIGVLWLLSHGE LSRLLTFNVNRGDLWAIGSAVSWSVYCSIIRLRPKEIGNSVMLTAQVGIA MIIFTPVFLSQLNTENLQIISELTYGQWMIILYLIIGPSILSYGFWNYGM TIVGGTKGAAFTNATPLFAAALGILVLGEQLHGYHLISSLLIVIGLTLCN KK >MS1597 rhaT, RhaT protein MKQQPLLGFLFGLIAACMWSSLPLFVQQVVKVMDIQTSVWYRFVLSAVGV LLLLCFSGKFFTFKRISPKNTLLLLLAIAGLSVNFYLYNLALKYIPPTTS QVLSPLSSFMMLFAGVLIFKEKMARHQKIGLAVLSLGLILFFNERLDDFL QLNTYFKGVVMVIASSFVWVIYAIVQKVLLSHLSSQQILLMIYIGCTLVF FPNADIKQIYQLDGFQLVCLVFSGVNTIIAYGCYAEALDRWEVSKVSAIL TQIPIFTLLFFHLAVMIAPNYFVAVELNWISYLGAFCVVSGAMLSALGHK LKMLKERD >MS0972 rhtB, RhtB protein MLNLIIVHFFGLVTPGPDFFYVSRMAASNSRRNVICGIIGITLGVAFWAA SAMLGLAILFTTMPVLHGVIMLLGGGYLAYLGLLMVRSRTNATFAPLSAE ELNKTTTVKKEIMKGLFVNLSNAKAIIYFASVMSLILVHITQVWQMLLAF AIILVETFIYFYLISVLFSRPFAKKFYSRYSRYIDNVAGIIFLIFGMILA YTGVMEMMG >MS1681 rhtB, RhtB protein MEFWHGFLIITGIHILAAMSPGPDFIYVSQQTLSRGRAAGIICALGVAFG LGVHILYSVLGLAVVIASAAWILTTIKIIGGIYLIYLGYKGLKARAKNQV QIIEKVEVQQENRLKTLWKGFLCNVLNPKAPVYFVSVFTVVLSPNMPVWQ LAIYGVWMMFLQFVWFASVAFLLSIPKVNKQFQKAGHWIDRILGFVMVGL GIKVISS >MS1311 sdaA, SdaA protein MISVFDMFKVGIGPSSSHTVGPMKAGKQFIDDLITQGNIGKITRIHADVY GSLSMTGLGHNTDITIIMGLAGYLPHNVDIDSIADFISRVKQTALLPVAG GSYTVDFDFKQDMQFHDSFLSLHENGMTLTAFMNDEIAYRQTYYSIGGGF IVGEAHFNQAQNEEVPVPYPYNNAADILRHCHDTGLPISTVVFRNEVALH GKESVEHHLSLIWQTMQDCIKHGLKTEGLLPGPLKVSRRAPALHRLLQAN SNLNNDPMQVIDWINMFALAVNEENAAGGRVVTAPTNGACGIVPAVLSYY EKFISPLNAETVERYLLVCSVIGSLYKMNASISGAEVGCQGEVGVACSMA AAGLTEILGGNPEQVCIAAEIAMEHNLGLTCDPVGGQVQVPCIERNAIAS VKAINAARMALRRSTNPRVTLDKVIETMYETGKDMNAKYRETSKGGLAIK VVCS >MS0977 sdaC, SdaC protein MLHIILTLNRKFKMKNKTFGSALLVAGTTIGAGMLAMPLTSAEMGFTYTM ALLFLLWILLSYSALLFVEVYQTVQRKDAGIATLAEQYFGMVGRVLATLS LVIFMYAILSAYVTGGGSLLAGVLPFLGEHAAPISIIAFTVILGIFIVIS TGAVDGLTRLLFMIKLVAFVLVLTMMLPLVQGENLMAMPLKEFLIISASP VFFTSFGFHVIIPSINNYLDGNIKRLRAAIIGGTALPLVAYILWQMATHG VFPQAKFVEIINNDPTLNGLVDATYHVTGSNLISGSVRLFSTLALVTSFL GVSLSLFDCLDDLLKRINIKAGRLALGVLTFLPPLAFALFYPEGFIAALG YAGQMFTFYGLVLPVGLAWRARKLHPNLPYRVIGGNLTLLIALLLGLLIM NVPFLIEGGYLPKVIG >MS1895 sdaC, SdaC protein MYYYNSSKSYLTWKTFMEKSMKNKKQPSLLGGAMIIAGGTIGAGMLANPI STAGVWFLGSLLILIYTWFCMMSSGLMLLEANLHYPTGSSFDTIVKDLLG KGWNILNGLSLAFVLYILTYAYITSGGGITEGFLNQLLSSEQSAVEIGRS SGSLIFTFVLAVFVWFSTKAVDRFSTILIGGMVISFFLSVSGLISSANAD VLLNSATSQDTQYLPYALVALPVCLVSFGFHQNVPSLVKYYNRDAGKVSK SVFVGTFIALIIYILWQLAIQGNLPRAEFVPVIEKGGDIAALLAALSKYI QTDYIALALNFFAYMAIASSFLGVTLGLFDYIADLCGFDDSKAGRTKTAL ATFLPPLLLSLQFPYGFVIAIGYAGLAATIWAAIVPALLAKASRKKFNKP SYSCFGGNLMVYFIIIFGVLNILSQLAMQFGWLPEFKG >MS2338 selA, SelA protein MTALFQQLPSVDKILKTPQGEQLVTEFGHSAVVNCCRHLLAQAREKIKIE KKLPHFFTDFNHTIAEVNRYLANQQQVKIKSVHNLTGTVLHTNLGRALWA QSAQQAALTAMRQNVALEYDLEAGKRSHRDNYVSELLHELTGAQAACVVN NNAAAVLLMLATFAQGKEVIISRGELIEIGGAFRIPDIMAQAGCKLVEVG TTNRTHLNDYRRAINENTALLMKVHSSNYQICGFTCEVSEQELVELGKEF NIPVVTDLGSGALTDLSRYDLPKEPTVQEKLVQGADLISFSGDKLLGGPQ AGIIVGKKELIQQLQSHPLKRVLRCDKVILAAMEATLRLYLQPEKLTEKL TSLRLLTQPLEQLRQQAEQLKAKLENLLKDDFLLQIESSLAQIGSGSQPM AKIPSIAVTIAEKNSEKLTALLARFKKLSTPIIARVENDKIRLDLRSVTA IETLLITLEELNQDQ >MS1241 selD, SelD protein MLGTILHSQLEQFVDPHLLVGNDTNDDAAVYDIGNGTCIISTTDFFMPIV DDPFDFGRIAATNAISDIFAMGGKPIMAIAILGFPINVLPAEVAQKIVDG GRFACREAGIALAGGHSIDAPEPIFGLAVTGIVPTEKVKRNASAEAGSKL YLTKPLGIGILTTAEKRGKLKPEHKGLATEVMCQMNLIGSQFSQLESVTA MTDVTGFGLLGHLAEICEGSNLVADVHFNKIKMLDGVPYYIEQGCLAGGV TRNYESYGIKIGAITEFQKAVLCDPQTSGGLLVAVKPEGETQLLELAAQA GIELIEVGELRRRVDNSDPVIIRILD >MS1743 serA, SerA protein MTNKVSLDKSKIKFLLLEGVHQNALDVLHAAGYTNIEYHKKALEPDELKE AIKEAHFIGLRSRTNLTADILEHANKLIAIGCFCIGTNQVALEAAEEKGI PVFNAPFSNTRSVAELVLGEILLLMRNIPAANAQVHRGEWNKSAAGSHEV RGKKLGIVGYGHIGSQLSIIAESLGMNVFFYDVETKLPLGNAQQVSTLEE LLSSCDIISLHVPELPSTKNLMSAERIAQLKPGSILINAARGTVVDIDAL AEALEQGKIHGAAIDVFPKEPASAAEAFESPLRKFDNVILTPHIGGSTAE AQENIGTEVASKFVKYSDNGSTLSAVNFPEVSLPEHRTAKRILHIHHNRP GILNKINQVFVDENINIAAQYLQTDAKIGYVVIDVETDDSTDLLQKLKSI EGTIRARVLF >MS0068 serA, SerA protein MIMKVVISHRLHDNGMKVLEDANAQVAITNDGNPKIMLPELLDAEGLIIR IGSIDRETMLQAKNLKVIGRPGVGVDDVDVKTATELGIPVVIAPGSNTRS VAEHAFALMFACAKDIVRSDNEMRKGNFAIRSSYKAYELNHKTLALIGYG RIGSILAQMSKAIGMNVKVYDPFVKQGTIEQEGYIYCTELDDVIRDSHVI SIHVPLTNETRNLIGEHEFSLMNEHTILINCARGEVIDEPVLTKVLQEGK IHSAGLDVFACEPVDINSPLFQLDNVIVSPHMAGQTKEAASGVATMAAEG VVAVINGEKWPYVCNPEAYNHPRWNK >MS1758 serB, SerB protein MQTSEFINLTLKDIKQHYSPFPNKLINNQPQTEGRDYFILFGTNLEPAKL QAFQQKCGENFQIFDCWNNLHNIVVLLKGHWQKSYETHAHDLTLDAAKID FNANLAEQGLLVMDMDSTAIQIECIDEIAKLAGTGEEVSAITAAAMRGEL DFEQSLRRRVSTLKDAPETILQEVRLQLPLMPGLKETVRILQQHNWRVAI ASGGFTYFADYLKELLNLDAAVSNQFDIENGKLTGRVKGDIVHAQYKADT LKRLAREFNIPLENTVAIGDGANDLLMLKQANLGAAFHAKPKVQQQAQVV VNFADLTALLCLLSAGEKIKHLS >MS1573 serC, SerC protein MSNVFNFSAGPAMMPPAVLKKAQEELLNWQGQGTSVMEVSHRGKYFMELI TQADKDFRELYNIPENYKILFLQGGARGQFAAIPMNLANNKGKALYLNTG HWSATAAKEARNFTEVDELNITEQIDGLTRVNRLDFSDIAEQYDYVHYCP NETITGVEINEIPNVGNAVLVADMSSNIMARKLDISKFGIIYAGAQKNLG PAGIVIVIVREDLIGHARKATPSIWNYEVQANADSMINTPPTFAWYLCSL VFKDLLANGGIDTVEKRNAQKAALLYDYLDQTVFYHNTIAKENRSVMNVT FTTGDDQLNAKFVAQATEAGLQALKGHKVFGGMRASIYNAMPVEGVEALI AFMKKFEAENA >MS0832 sstT, SstT protein MNISRLFSFLFHGNLVKRISIGLLLGIIFALVSPSLESALGFHLAEKMGL LGQIFVRSLRSVAPILVFVLVIAAIANKKVGSKSNMKDIIYLYLIGTFLS ALTAVFASFMFPTTIALATNEAELSPPGKITEVLTALIFNVVDNPITALF NANFIGILAWAIGLGITLRYASETTKNVMNDFAEAVSKIVHFIISFAPIG VFGLVASTLADKGLSALLDYVQLLAVLVGSMLFVAFVINPIIVFWKIRRN PYPLVWECIRVSGVTAFFTRSSAANIPVNMELAKRLNLDEETYSVSIPLG ATINMGGAAITITVLTLAAVFTLGIEVSIPTAILLSLVASICACGASGVA GGSLLLIPLACSLFGISNDIAAQVIGVGFIIGVLQDSTETALNSSTDVLF TAAACMSEERKNS >MS0956 tdh, Tdh protein MMEIKTLSCVVRGPKDVGVMEQSINYDESSKEQTLVKITRGGICGSDLHY YQYGKVGNYEIKHPMILGHEVIGTVVKTNAPDLYVGQKVAINPSKPCLTC KYCLSGDTNQCETMRFFGSAMYNPHVDGGFTQYKVVDNSQCIDYPQDVSD DIMAFAEPLAVTIHAAKQAGDLAGKRVFVSGVGPIGCLAVAAIKASGAKE IVVSDLSRRCLDLALEMGATKALNAKDDFSEYMAHKGEFDVSFEASGHPS SIERCLAVTKARGTIIQIGMGGAIPEFPIMTLIAKEICLKGSFRFIEEFN TSVEWLSSGKVNPLPLLSATFPYTELEKALIIAGDKDNISKVQLSFE >MS0525 tdh, Tdh protein MMRSLVCKEPFHLILEERAKPQPKDEEVQLKVAAIGICGTDIHAYAGNQP FFEYPRVLGHEASGVITELGKNVDKFKVGQRVALIPYVSCGKCGACLSGK TNCCENISVIGVHQDGAFSEYLTAPAKNILPIADSVDFTTAALIEPFAIS AHAVRRAQITKGDDVLIVGAGPIGLGAAAIAHADGANVVIADTSEERRKH IQANIPVPTVNPINEKVEDYFNGRLPQIVIDATGNQKAMNNAVNLIRHGG RIVFVGLHKGTIEFSDPDFHKKETTLMGSRNATLEDFEKVQHLMSERKIS ANMMLTHTFKYDELAEIYEEKITKNQSLIKSVVLY >MS1702 thrB, ThrB protein MLRIYAPASSANISVGFDTLGAAISPIDGSLLGDVVQIEDIPAGFELESA GYFVRKLPKEPQKNIVYQAYVLFSERLKLRNGHVKPLRLTLEKNMPIGSG LGSSACSIVAALVALNMFHNEPFSKMELLEMMGELEGRISGSIHYDNVAP CYLGGVQLMVQSLGNICQQLPFFDSWYWVLAYPGIEVSTAEARAILPKSY TRQDVIAHGRHLGSFVHACHTQQDVLAALMMKDVIAEPYRESLLPNFAEV KQASRDLGALATGISGSGPTIFSIAPDLAVATKLANYLENHYLQNNEGFV HICKVDNQGTRALG >MS1701 thrC, ThrC protein MNLYNIKHPEEQVNFAQAVRQGLGKDQGLFFPEVIPALDNIDELLALPLV ERSQKILGALIGEEIPAEKLNTMVKNAFTFPAPVAKVEEGVYALELFHGP TLAFKDFGGRFMAQALATVRGDGKITILTATSGDTGAAVAHAFYGLENID VVILYPQGKISPLQEKLFCTLGGNIRTVAINADFDACQALVKQAFDDEEL RRAIGLNSANSINISRLLAQVCYYFEAAAQLSPSERSNLVVSVPSGNFGN LTAGLIAKTLGLPIKRFIASTNANDTVPRYLAKGKWEPNATVATLSNAMD VSRPNNWPRVEELFKRNGWALSELHSGAVSDAQTEETLRDMNAKGYLCEP HGAIAYRVLKQDLQAGETGLFLCTAHPAKFKESVERILNTQLPLPQALAK HAELPLLSDVMENDFAALRAYLLKK >MS1154 trpA, TrpA protein MARFETLFAQLNAKKQGGFVPFVTLCDPDLERSFDIICTLVDNGADALEL GFPFSDPLLDGPVIQAANNRALNAGCSTAESFKLLEKVRSKYPEIPIGLL LCANLIYAQTLDGFYRRCAEIGIDAVLVADIPLLAAEPYIQAAKKHGIQP VFICPPNADENTVKGVAEHSEGYTYLVSRAGVTSAENQSHAANLDSLVEQ LKAHNAPPILQGFGIAKPQQVKEALNMGVAGAISGSATVKIIEANLDNHE KCLADLAEFVKNMKAATL >MS1153 trpB, TrpB protein MTDTILDPYFGEFGGMYVPEILIPVLKQLEKAFVEAQQDPAFQTEFLDLL KNYAGRPTALTLCRNLTKGTKTKLYLKREDLLHGGAHKTNQVLGQILLAK RMGKTRIIAETGAGQHGVATALACAMLGMPCRIYMGAKDVERQSPNVFRM RLMGAEVFPVTKGSSTLKDACCEAMRDWAANYENTHYLIGTAAGPHPFPT IVREFQKMIGEETKAQILQREGRLPDAVIACVGGGSNAIGMFTDFINETS VRLIGVEPAGKGIETGEHGAPLGHGKPGIYFGMKSPIMQTEDGQIEESYS ISAGLDFPSVGPQHAYLNSIGRAEYPSITDDEALEAFKELAQHEGIIPAL ESSHALAYALKMARQNPMREQLLVVNLSGRGDKDIFTVDKIFSERGML >MS1152 trpC, TrpC protein MNLNDKPTILQKIVADKIQWIKAKEQVFPLASFKEKITKSDRSFYQSLGK GTHQNPVFILECKKASPSKGLIRNEFNPADIAQVYKNYASAVSVLTDEKY FQGDFSYIKQVRDIVTCPVLCKDFMISEYQVYLARYYQADAILLMLSVLD DETYKKLAALAHELGMGVLTETSNQQELERGIALGAKVMGINNRNLHDLT VDLARTPPLAQQIPADRIIVSESGIYSHQQVQQLKPYVNAFLIGSSLMGS DDLNNAVRSVIFGENKVCGLTRPQDVQEVYRQGALYGGLIFAENSKRCVS LRQAQELVTVAPLRFVGVFQNQQIDFIVKIATQLNLYAVQLHGAENEEFI AALRIQLPHQIQIWQAVSIDVAQQSAVKIDRISAVDRYVLDSKTANRQGG TGVAFDWSKIPAEIKNKSLLAGGITPENIELALAQHCLGIDLNSGVESAA GIKNPEKLTAVFNKIHRF >MS1151 trpD, TrpD protein MRIKTRNFIMQTQQILTQLFDNQPLSQEQAAFIFGNIVKGELSNEQLAGA LIALKIRGETIDEITGAVTALLAAAEPFPAPDYPFADIVGTGGDNADTIN ISTASAIVAASMGLKIAKHGNRSVSSKTGASDVLTALGVNIRMSTEQARK ALDEIGIAFIFAQQYHLGFKYAGPVRQALKTRTIFNILGPLINPANPKRQ LLGVYSPELLKPYAETNLRLNHEHSIIVHGCGLDEVAIHGLTQVAELRDG KIEYYNLSPKDFGFEPQPLESLRGGAPEENAKILTALLQGKGSEQQAQAV AMNTALLMKLFGHEDIKQNAQQVLEQLTTGKAFETLTKLTTY >MS2193 trpE, TrpE protein MNFASFIRQANRLGRQKTAFFFLIDFERQKPLISPLESAVENGIIFSVEG NTNFYRPVELPRQKIRFSSEPVSFERYAAGFALVQQELQKGNSYLLNLTY PSKINTNYNLAQIFQATKAPYKLLLQDQFVCFSPESFIRIRQNQIFTYPM KGTIDAALPQAEQQLMQSEKEGREHYTIVDLMRNDLAMVAENIRVRRFRY IDKISTNRGEILQTSSEITGNLTADWQNRIGSILAALLPAGSISGAPKEK TVSIIRQAEGGKRGYYSGIFGIFNGEELNSAVAIRYIEQKDGQLYFRSGG GITSQSRLQEEYEEYCQKVYLPIHCVE >MS1149 trpE, TrpE protein MPNAYIQTLSNPVQYQQDLTAVFATVGKTNSLLLESAEISSKNSLQSLLI INAALKVSCLGQIVTFTALTANGSHVLPLIKEKLQGKTKSLSVQQNKLIA EFFPIDQNLDEDSKLQSLTVFDGLRVINQLYQHSKQPVFLGGLFAYDLVA NFIPMNNITLQDDGLSCPDYVFYLAEQLLRLDHPSQQATLQTFCFNDSEL QNLQQSAVEIDKDLRNLKPLSAIQQGSTDISTNHEDEKFKQIITALKHHI YIGDVFQIVPSRRFILQCPNTLATYRQLKENNPSPYMFFMQDEEFTLFGA SPESALKYSADNRQLEIYPIAGSRPRGFDAKGKIDPELDARLELEMRLDH KEQAEHLMLVDLARNDVARVCESGTRHVKELMQVDRYSHIMHLVSRVVGK LRPELDALHAYQACMNMGTLTGAPKIKAMQLIYQFEKQKRHSYGGAVGYL SSDGNLDTCIVIRSAFVQNGIAYVQAGCGEVLDSDPQMEADETRHKAQAV IKAILQTNAQAN >MS1102 tyrA, TyrA protein MEALKEIRAEIDQLDRELLEVFAKRLALVKKVGEIKHQQGLPIYVPEREA DMLAARRSEAEKMGIPADLIEDVLRRVMRESYANEHEHGFKTVNPAIKKI VIVGGKGKLGGLFGRFLTASGYFVEALGSKDWDNAKAILAGANAVIVCVP IVKTLETIERLKPYLTEDMLLTDLTSVKRRPLEKMLEIHQGAVVGLHPMF GPDIASMAKQVVVRCDGRYPERYQWLLEQIQMWGARIYQADAAEHDHSMT YIQALRHFATFANGLHLSRQPVKLANLLALSSPIYRLELAMIGRLFAQDG SLYADIIMDKPENLEVIESLKQSYEDSLKFFENGDREGFIKTFNKVREWF GDYSEQFMKESRQLLQQANDYRHNSL >MS1031 tyrB, TyrB protein MQITILVSIKEKLISKHNISKESPMFKNITPAPADPILGLGEAFKAETRE NKINLGIGVYKDADGVTPIMTAVKKAEGQLFENEKDKNYLPIEGVAEYNA YAKELLFGKDSEIIASNRACTVQTLGGTGALRIAAEFVRRQTKAQNVWIS KPTWPNHNAIFNAVGVTIREYRWYNPETKALDWDNLLADLNNANPGDVVL LHGCCHNPTGIDPTPEQWKALAEMSAKNGWLPLFDFAYQGLANGLEEDAV GLRTFAETHRELLVASSFSKNFGLYSERVGAFTLVADNADVAAVALTQIK SIIRTLYSNPSAHGARTVATVLANPELRKEWEDELTSMRDRIKQMRKQLV ELLKEFGAQEDFSYIIDQKGMFSFSGLTAEQVDRLKEEFAIYAVRSGRIN VAGITEANIRYLAESIVKVL >MS0762 tyrR, TyrR protein MFTVKGYDEGNYFIRSIVGKTMSKNTAKRSAHFTVNQYENFTDVVALSPK MAALVEKAKKFALLDAPLLIQGETGTGKDLIAKACHNLSARKDQKFIAVN CAGLPDTDAESEMFGRADGDKTSTGFFEYANGGTVLLDGVAELSLNLQAK LLRFLNDGTFRRVGEEQEHYANVRVICTSQISLQHYVDEGKVRSDLFHRL NVLSLQIPPLRERKEDLAVLTENFVRQISRRLGVRTPEFDGQFLQYLKDY QWPGNVRELYNALYRACSLAEHNKLTIDGLNLSENETVPLTLEQFGNESL EEIMNNFEASVLRKFYEQYPSTRKLASRLGVSHTAIANKLKQYGIGK