TitleGenColors Logo

Gene list

Applied filters:

COG category: Amino acid transport and metabolism
Organism: Mannheimia succiniciproducens MBEL55E, MBEL55E
Gene type: CDS

Number of genes found: 219

Free access
Sort by:

 



# Mannheimia succiniciproducens MBEL55E, MBEL55E

>MS2300 unknown
MVLMDSLSKKVVYHKIVNAERVIYYRKAINELREKDYKIQSITCDGRRGL
LKDILNTPIQMCQFHQVAIVIRRITRKPKSEAGKELKILIKTLKTSSKNK
FYINLHHWYLKHKNFLNERSSIPDKAGKYPFKHRNLRSAYSSLKRHEEFL
FTFEKYPELKIEKTTNRLEGLFSELKRKLALHNGLSKKNKIMFIKDFLNE
KS
>MS0296 unknown
MSHLIVKEQKTIIRNAFFTFLYFTLAIGLAIGILYTDIFYLQNMIEEESL
VEYTQSLSLTILTLMFSRHAYRSPQWRGGFVLITGFFLCMLIRESDALFD
NLIRHGSWAYFAIITALVCIIYAFTHRQSTIDGLAQFAKQKEFHSFIIGL
LTVLLASRLIGYGGLWRFILYNDYPHIVKNIIEETTELFGYLIMLFSCLS
LTRHFK
>MS1082 unknown
MIAFGYITALSLSYFLLAPDFKGLSFTEYFIQSEAKPIFLTLGLLLPIGF
IVMSKAVEYGGIVRTDAAQRLALFLQIIAAVILFGETLNNMRVGGVIVAF
FALFCLLTKPTKSIENALKAVFALAAVWLIWGVTGILFKKIALMGGAFPT
TLFVTFSIAAVLMFTYLLIKRTFWNASSLVGGIILGCLNFGNILFYIYAH
QYFKENPTIVFATMDIGVICLGMIVGALVFKEKISKINMLGIVLGITAIL
LLRV
>MS0071 unknown
MFRVVGEINMKKYISDKNFLAGFIFFFVSAFYLISAFQIETKNLVSVEAD
FMPIIYGSLLLTTSIVLMITSFFKIRNTVVNKENKETDWKRIFSVIGLVF
VYVLLMQYIGFIVTSIPFLFCLSVLLTPLYIKKNYIVYSIFSIVLPILAY
FLFSYYLNLTMPSGFLF
>MS1348 hypothetical protein
MERHPMVERVDYPGLASSKDYELKQKYTPNGLCGVLSFELKGDKQTAMKW
LDSLQIISREVHVADIRSCALHPATSTHRQLSDEEMRAANITPGFIRLSI
GIENPEDLLADLQNAFDQIK
>MS0343 unknown
MKKLILATALSSVAAFTQAQIVPNANSATHTYEFTQSYDLQVPKGSSGET
KLWVPLPFSNDYQDVKSVEFDGNYQQAYITENNQYGAKTLFALWDKDAQK
RDLKVKLVVTTKDREPMKQGLLENYQAPENIEYSVDVQQYLKPTQHIKTD
GIVKQFADKIVGKESNPLKKAEMIHQWIVNNMERDNSVLGCGDGDVEKIL
TTGVLKGKCTDINSVFVALARASGIPAREIFGIRLGQAVKMGEYSKGAFG
SAKDKVANENGGQHCRAEFYLAGFGWVPVDSADVAKYRLTENKSVEDKDT
QAVSQYLFGNWEANWMGFNHARDFNLYPMPELAPLNNFGYPYAEVGGDPL
NSYDAKKFGYEFTSKEL
>MS1642 unknown
MEPISLPEYAGSTLRGAFGRALRKIACMTKQADCKGCPLYRSCPYTNIFE
TPAPTSHELQKFSQVPNGYIIEPPEWGEKIYLTGTELRFNLALFGRLIEQ
LPLIAFAFKRAFEYNVGRGKAHLVDIAKFSQNMTACQSILKEGNIIEHEK
QIILPESLPNYLTIQIETPLRIQENGKPLRENQINADRFFIGLAKRISLL
SEFHHQPLNLDFELIKNDLQAVKYEKNLTWLDWTRYSSRQDQKMKLGGVV
GSWQFENLSPELIQLLYFGQWLHCGKNATFGLGKYRITNL
>MS2122 unknown
MSRALNFVMISPHFPTNFETFAVRMREKGINTLGIADTPYEQLSETLRNN
LTEYYRVDNMEDYEQVYRAVGYFAHKYGRIDRVESHNEYWLELDAKLRTD
FNVFGYKNDDMLAIKTKAQMKEVFRKSGLKVAKGRVFKDDEDARKLAKQL
KFPVIVKPNSGVGASDTYKIKSAVELEDFFGYKNPNVEYIMEEFIDGDIV
TFDGLTDHDGKIVFYSSLEYSEAVLDTVEKDGDMFYYVPREISPKLVKLG
EQCVEAFNVRERFFHFEFFRVKKSGELLPLEINCRPPGGLTIDMWNYAND
FDVFREYANVVTENKFYSDITHPWNVVYISRKANQNYVNSIDDVCQKFGD
NIISVQTVPGVFAKVMGEHGILVRTKTIEQMREIVQFAQAKQ
>MS2266 unknown
MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA
ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI
VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER
SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE
GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS
>MS0665 unknown
MNKYLKSDFIFSLFLSIAIMFICLYFEKSFFFVDDAQNEFLPFTRQIGNV
WLNGEIPFILKNTFIGSNTMIDIHRAIFLPQNIFLSILSVKITSLKIISI
IAAFINLFVMSFSALKLSEAFSLTKAAGIVLAFLFCINPIFLYFYLESWW
NAAAGQAWFVASLASVAWLMRAFSIKRLLLNVITVLSIFASGWPHSVLVY
GFLALIFSIFLYLNKRHNDLILFVLISFSIILIAIPLYSEYVISGDLINR
QSFKFNNVGNFLSTTLNQLLLTFNVTYYHFMHRYGGYSITHIPMGYSSIY
ILLLICFGSLKNIARNPNSLFLLVLCTVFFILTQTPTEIGPFRYPFRFTP
YFSEVLTMLSIFSLEKLGIVKTRARVFLVVLLLSISLLLSIFSLEENFGK
YAILQFLFFAVTTWYVVRYNSISLKSGLPYTAFIFLLMLLAKDSVIGYLS
FPDLKNSINMENNYSQGGYILSLTNGKRPKNNLEDLNSTHFMLYGLKSIN
GASPVGNKYISKTISTRSSQAFFNAKETILGLSKTYKDKCYFDLFGIDTV
ILNKKDNSSLISQKLSDCGFSERKVKSHDVIYFLRNDFNAKGSVSYHSDT
LSINQQISLKNNSEFYQLSGLKGDELIFNRVYWYGYRAYINDKEIPLLNY
DGLLRIILDHDYQNGVLRLEYFPKSWKYALLIALSGFLLLLFSVGYMQRM
RKWVSLN
>MS1235 unknown
MKAPKTPLNLPQNEILNIVMDTTFFGNEFGVLVLMDSLSKKVVYHKIVNA
ERVIYYRKAINELREKDYKIQSITCDGRRGLLKDILNTPIQMCQFHQVAI
VIRRITRKPKSEAGKELKILIKTLKTSSKNKFYINLHHWYLKHKNFLNER
SSIPDKAGKYPFKHRNLRSAYSSLKRHEEFLFTFEKYPELKIEKTTNRLE
GLFSELKRKLALHNGLSKKNKIMFIKDFLNEKS
>MS1347 hypothetical protein
MTAPPASWGSEILPIPIALTDLFHIQQRKITMKFETQCLHAGYSPKNGEP
RVQPIVQSTTYTYDSAESIGKLFDLQEAGFFYTRLANPTTNAAEEKLAAL
EGGVAALCTASGQAATFYALMNLVESGDHFISTTNIYGGTYNLFAHTFRK
MGVEVTFVNQDDNLDELRKAIRPNTKAVFGETISNPTLRVLDIEKFAALA
QAANAPLIIDNTFATPYFCRPFKYGANIVVHSTSKYLDGHAVALGGAIID
GGNFNWEQEKFRQFSQPDITYHGLVYTRTFGKAAYAVKARVQLMRDLGAT
PAPQNSFLLNLGMETLPLRMKQHYANAQAVAE
>MS1493 unknown
MNINVISIFKLLLLFALGLVILSPALSTQIGVPRLDSALCFLFFFLAVIT
PFLRDMETDFFKLQFPVYVLFFFGFLSVLNAFSTEKLVDLFFFGIVMFLF
HYSFLTFNRGDGEAGIRHLLLGISLIVLAGFFIEALLGFQLVSGNEELTV
TDKAFKGFFFNTNDQSVIMISLAVAVGFFYIIRENNWKIKLIGYALIFIM
GLAIVISASRSVLLSYLIMLMLILFLNASAYFKAVYLFFACVIALFIFNL
SWLQEVFILLAKIDWLERPIERFSLVIFSMGDDKSVGYRTEIYTTFLDNF
KILWLGYGPRDYIQYFDQIKLSFPLGYTNPHSFFIELYLAFGIFAFLAFI
YFLLNSIIYVMNTRLLAWKERIFILFVFINFCWIVWVPSSILRLPLVWYP
LFLVLVYTVLVKNGTFVSPKLVGRRRSS
>MS0001 unknown
MKGGDMSAIKDRLKDIDCALSDLERERKEILLDAGAPEIIGLKDDINALT
VSLEYIDDEILPLLQQLSIDPDAYKYLSEDIKLSLLRDLPESVSAIKAII
NKLTPVKHCIESFNHQNDIGF
>MS0353 alsT, AlsT protein
MSLETILSSIDSFIWGPPLLILLSGTGLYLTLRLGFLQIRHLPRAFAYMF
KKEEGNHQRGDVSAFQALCTALSATIGTGNIVGVATAIQAGGPGAMFWMW
LVALLGMSTKYAECLLAVKYRVRDKNGFMAGGPMYYIERGLGIKWLAKLF
AVFGVLVAFFGIGTFPQINAITHAMNDTFSVPVTISAAIITILVAAIILG
GVKRIAAVSSYIVPFMAVLYVTTSLIILLINADKVPSALALIIESAFNPE
AALGGALGFTVMKAIQSGVARGIFSNESGLGSAPIAAAAAHTKEPVRQGL
ISMTGTFLDTIIVCSMTGLVLVITGAWQSSDMAGAAVTNYAFSQGLGTNI
GATIVTVGLLFFAFTTILGWCYYGERCFVYLVGIKGIKLYRTAFIILVAC
GAFIKLDLIWILADIVNGLMAFPNLIALIGLRKVIVSETKDYFMRLKTNN
YSLDDNEEQIVNS
>MS0767 alsT, AlsT protein
MNAKRYFGVLNDFVIMVEQGIHWLVDNVEGPLWDATIVILLGVGLFFTIT
TGFVQIRLFPHSLREMWFGREVQGDSLTPFQAFATGLASRVGVGNISGVA
TAIALGGPGAVFWMWLTALIGMSSAFAESSLAQLFKIKEADGSFRGGPAY
YITQGIGSRWLAAAFAIALIFTFGFAFNAVQSNSIVEATRNAWLWDEHYV
GMGLVLLTALIIFGGIKRIGKFSARIVPVMALVYLLIAVSILLIHYDRIP
SVISLIIRSAFDFSAMAGGVFGAMLSKAMLLGIKRGLFSNEAGMGSAPNV
AATADVKHPASQGLIQMLGVFVDTMVVCTCTAIIILLSDNYGGEQLQSIS
LTQNALKYHMGEFGLHFLAFILLLFAFSSIIGNYAYAESNIRFIKNNPVV
VNLFRAMVLFFVYFGAVNSGGIVWAFADTVMAVMAMINLVSLIILSPIVW
LLLKDYHRQAKQGIVPVLDIMLHPRLLKLRLDQRLWNRR
>MS2050 ansB, AnsB protein
MKLTKLALTMSLGLGVSFANAAELPNITILATGGTIAGSGATSVSSSYKA
GQLTVQTLIEAVPEMKDLANITGEQVVNIGSQDMSDEVWLKLAKTINAKC
NETDGFVITHGTDTMEETAYFLDMTVKCEKPVVLVGAMRPATEKSADGPL
NLYNAVVVATDKKSAGRGVLVAMNDKVLGARDVTKTSTTAVETFNSPNFG
SLGYIHNSKVDYERSPESKHTTATPFNVDNLTALPKVGIVYAYSNMPTEP
LKALLDAGYEGIVTAGVGNGNVNQANSAILEKAAKDGVAVVRSSRVPTGY
TTRNGEVDDNALGFAASGTLNPQKARVLLQLALTQTKDINKSNNILMISK
SGRST
>MS0754 argB, ArgB protein
MRSTELVQWFRQSTPYVNMHRGKTFVIMLDGNTIASSNFINIINDISLLH
SLGIKLIIVYGARVQINSLLAQNNVTSVYHKNIRVTDPRTLELVKQAVGQ
LSYDITARLSVRLPHSPVLNVVSSNFILAQPIGVDDGVDYMLSGKIRRIE
IDNIKHHLDNNAIVLLGPIAPSVTGETFNLPFEEIATQVAIKLKAEKLIG
FSSTQGILDPQGISIPDLLPQDAAKYLNQYIQQGEYHCSQARFLQAAIEV
CKAGVKRSHLLSYEEDGSLLQELFTRDGVGTQLSVDNSEDIRIATVQDIP
GLIELIHPLEQQGILVKRSREQLEMDIANYTIIDRDGVIIACAALNQYPE
ENMAEMACVAVHPDYRSSSRGDILLEAIQKRARQLGIEKLFVLTTRTVHW
FQERGFRLANVEDLPKEKRDHYNYQRRSKILIQPLNEEE
>MS0236 argB, ArgB protein
MKPLVIKLGGVLLDTPAAMENLFTALADYQQNFARPLLIVHGGGCLVDDL
MKRLNLPVQKKNGLRVTPADQIDIIVGALAGIANKTLVAQAAKFKLNPVG
LCLADGNLTQATQFDPELGHVAMVVAKNPALLNNLLGDAFLPIISSIAVD
DNGLLMNVNADQAATAIAALINADLVMLSDVDGVLDANKQRLTELNSAQI
EQLIEDKVITDGMIVKVNAALDAAKILNCGVDIANWKYPEKLTALFAGEI
IGTRINP
>MS0235 argC, ArgC protein
MAQKAIVIGASGYTGAELARILTHHPEFELAGLYVSTNSADANKSISTLY
PQLKTICDLPLQPLPEDLTEIAQNADLAFFGTAHEVSANLAPVFLQNNCK
VFDLSGAYRVNSESFYQEFYGFEHKHPELLKQAVYGLAEWNADKIKTTDL
VAVAGCYPTVSQLSLKPLIEEGLLDVNQLPVINAVSGVSGAGRKASLTSS
FCEVSLNAYGVFNHRHQPEIATHLGTDVIFTPHLGNFKRGILATITAKLK
AGVSDEQIKRAYAKYYANKPLVRVYEQGLPSIKAVEFSPYCDIGFATKNN
HIIIVGAEDNLLKGAAAQAVQCANIRYGYNEVLGLI
>MS0829 argD, ArgD protein
MTITTPVKAVLASNQYFLDRQNAMESNVRSYPRKLPFAYAKAQGCWVTDV
EGNEYLDFLAGAGTLALGHNHPVLIQSIKDVLDSGLPLHTLDLTTPLKDA
FTEELLSFFPKDQYILQFTGPSGADANEAAIKLAKTYTGRGNVIAFSGGF
HGMTHGALSLTGNLGAKNAVQNLMPGVQFMPYPHEYRCPFGIGGEAGAKA
VERYFENFIEDVESGVVKPAAVILEAIQGEGGVVPAPVSFLQKVREVTQK
HGILMIVDEVQAGFCRSGKMFAFEHAGIEPDIVVMSKAVGGSLPLAVLAI
KKEFDAWQPAGHTGTFRGNQLAMATGYASLKIMREENLAQNAQQRGEYLT
QALRELSKEFPCIGNVRGRGLMMGIDIVDERKPQDAAGAYPQDGELAATI
QKFCFKNKLLLERGGRNGNVVRVLCAININQAECEEFIKRFKQSVTDAIK
AVRG
>MS0782 argD, ArgD protein
MSQYTRKTFDEVMIQNYVPADFIPVKGKGCKVWDQQGRDYIDFTSGIAVN
ALGHCPDEIVDVLKKQGETLWHSSNWFTSEPTLELASKLVEHTFAERVMF
ANSGGEANEAALKLARRYAVDNYGYQKDTIISFKKSFHGRTLFTVSVGGQ
AKYSDGFGPKPAGIVHLPFNDLDAVKAMIDDHTCAVIVEPIQGESGIIPA
TKEFLQGLRRLCDENNALLIFDEVQTGVGRTGYLYAYESYDVVPDILTSS
KALANGFPISAMLTTTKIAASFKPGVHGTTFGGNPLACAVGAKVIETIAN
PAFLENVQKTSALFISELNKLNEKYHLFNEVRGQGLLIGGGIN
>MS0783 argD, ArgD protein
MILVAGPNVLRFAPALNISQQEVAEGFKRLDQALQKFA
>MS0233 argE, ArgE protein
MKRLPKFLDMYSQLIALPTISALEPEFDQSNKALIELLADWLATLGFKTE
IIPVENSRAKYNLLATYGEGEGGLLLAGHTDTVPCNEELWTTNPFKLTER
DGKFFGLGTADMKGFFAFVIDAVRQIDLTKLTKPLRILATADEETTMLGT
RTFIRHTHIRPDCALIGEPTSLRAVRAHKGHVGKAVRIIGKSGHSSDPAK
GINAIELMHEATGYLMQMRNELRDKYHHDAFEIPYPTMNFGAIHGGDAVN
RICGCCELHFDIRPLPKMRLEDLDEMLQQKLAPMFEKWGDRISIEALHEP
TPGYECEHSAQVVQVVEKLLGEKCEVVNYCTEAPFIQELCPTLVLGPGSI
EQAHQPDEFLSAEFIEPTRDLLTKMIMHFC
>MS0674 argE, ArgE protein
MKNTIINLAQDLIRRPSISPDDQGCQQVIAERLTKLGFNIEWMSFNDTIN
LWAKHGTTSPVVAFAGHTDVVPTGDENQWNYPPFSAQIVDDMLYGRGAAD
MKGSLAAMIVAAEEYVKANPNHAGTIALLITSDEEAAAKDGTVKVVESLM
ARGENIDYCLVGEPSSAKQLGDVVKNGRRGSITGDLYIQGIQGHVAYPHL
AENPVHKATKFLTELTTYEWDNGNEFFPPTSLQIANIHAGTGSNNVIPGE
LYVQFNLRYCTEVTDEFIKNKVAEMLQKHDLTYRIDWNLSGKPFLTKPGK
LLNAVVESLESVAGIKPKLDTGGGTSDGRFIALMGAEVVELGPLNATIHK
VNECVSCRDLATLGEVYRQMLVNLLGK
>MS1555 argE, ArgE protein
MSVNMKRIQTIIEKLASISSVPGELTRLAFSAEDEAAHNYLIELCKPYDL
SIRRDQVGNLFIRKSGIEDHLPAVTFGSHIDTVVNAGKFDGPLGSVGGLE
ILFQLCEQGVQTRYPLELIIFTCEESSRFNYATLGSKLMCGIANRESLSR
LRDKQGNSLEEAMATIGLDFTEVDQVKRNAEEFKCFFELHIEQGPRLANE
RKTIGVVTGIAAPIRCIVKIQGQADHSGATAMHYRRDALLGGAELALAIE
RAAIDAGHSTVATVGNLNAKPGVMNVVPGYCELLVDIRGIHSEARESVFT
VLQQQIEQVTAKRGLSIELQLISKDQPILLPDQMVQQISRAAQDLGYAYE
IMPSGAGHDAMHMATFCPTGMIFVPSKNGISHNPLEFTSWEEIEAGIKVL
QLVVLEQAEKV
>MS1073 argF, ArgF protein
MPFNLKNRHLLSLVNHSPREIKYLLDLARDLKRAKYAGTEQPRLKGKNIA
LIFEKTSTRTRCSFEIAAYDQGANVTYIDPTSSQIGHKESMKDTARVLGR
LYDAIEYRGYKQETVEELAKFSGVPVFNGLTDEFHPTQMLADVLTMIEHS
TKPLNEIKYVYIGDARNNMGNSLLLIGAKLGMDVRICGPKSLLPEENFVS
ICEEISKETGARLTVTDDIDLAVKDADFVHTDVWVSMGEPIEAWGERINL
LMPYQVNTDLMKRTGNPNVKFMHCLPAFHNCETKVGREIAAAYPNLANGI
EVTEDVFESPMNIAFEQAENRMHTIKAVMVASLA
>MS1479 argG, ArgG protein
MSNTILQNLPLGQKVGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPD
EDDYNAIPKKAMAYGAENARLIDCRKQLAQEGIAAIQCGAFHISTGGVTY
FNTTPLGRAVTGTMLVAAMKEDDVNIWGDGSTFKGNDIERFYRYGLLTNP
NLKIYKPWLDDQFIDELGGRFEMSQFLIANGFDYKMSVEKAYSTDSNMLG
ATHEAKDLEDLSTGIKIVKPIMGVAFWDESVEIKPEVVTVRFEEGVPVEL
NGKRFDDVVELFMEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMAL
FHIAYERLVTGIHNEDTIEQYRINGLRLGRLLYQGRWFDPQALMLRESSQ
RWVAKAITGEVKLELRRGNDYSILDTVSPNLTYEAERLSMEKVEDAPFDP
IDRIGQLTMRNLDVTDTRNKLGIYSEAGLLTAGKDAVVPQLGSK
>MS0237 argH, ArgH protein
MALWGGRFTQAADQRFKDFNDSLRFDYRLAEQDIEGSVGWSKALVSVGVL
TTDEQQQLERALNELLIEVRSNPQAILQDDAEDIHSWVESKLIDKVGNLG
KKLHTGRSRNDQVALDIKMWCKAQVTELQYAVRDLQAKLVETAENNQHAV
MPGYTHLQRAQPISFAHWCMAYVEMLERDYSRLADAYNRMDSCPLGSGAL
AGTAYPVDREQLAKDLGFAFATRNSLDSVSDRDHIIELLSTASLSMVHLS
RFAEDMIIFNSGEADFVELSDRVTSGSSLMPQKKNPDACELIRGKAGRVI
GSLTGMMVTVKGLPLAYNKDMQEDKEGIFDALDTWHDCLTMAAFVLEDIR
VNVERTREAALKGYSNATELADYLVAKGVPFRDSHHIVGETVVYAIKVHK
GLEDLSIEEFRQFSDVVGEDVYPILSLQSCLDKRSAKGGVSPLRVAEAIA
DAKARIAAKK
>MS1575 aroA, AroA protein
MEKLTLTPISHVEGTVNLPGSKSLSNRALLLAALAKGTTRVTNLLDSDDV
RHMLNALKQLGVNYSLSEDKSVCEVQGLGKAFAWQNGLALFLGNAGTAMR
PLTAALCLANADSVPAEIILTGEPRMKERPIKHLVDALLQAGADVQYLEQ
EGYPPLAIRNTGLKGGKVKIDGSVSSQFLTALLMAAPMAERDTEIEIIGE
LVSKPYIDITLNMMKIFAVDVDNQNYQRFVVKGNQQYQSPNIFLVEGDAS
SASYFLAAGAIKGKVRVTGVGKNSIQGDRLFAEVLEKMGAKITWGEDYIE
AERGELNGIDMDMNHIPDAAMTIATTALFAQGETVIRNIYNWRVKETDRL
SAMATELRKVGAEVEEGEDFIRIQPPASDQFKHAEIETYNDHRMAMCFAL
VALSNTAVTICDPKCTAKTFPTFFDEFSAIATV
>MS1968 aroB, AroB protein
MVCVNVELKERRYPIYIGENLLTDTGVYPVKMGDKVMIVSNPTVAQYYLT
PVTETLEKLGCQVSHVLLPDGEKYKTLDSLNMIFTALLKENHGRDTTLIA
LGGGVIGDVTGYAAASYQRGIRFIQIPTTLLAQVDSSVGGKTAVNHELGK
NMIGAFYQPCTVIIDTRTLVTLPKREVNAGLAEVIKYGAILDLPFFEWLE
AHIDNLVALNQQDLQYCIARCCQIKADVVARDETEKGDRALLNLGHTFGH
AIETHLGYGNWLHGEAVAAGSMMAAVLSEKLGDLSYSEVARLEKLLARAN
LPTVSPDTMQAEDYLPHMMRDKKVLAGKLRLVLLKTLGQAYVASDTDKSL
VLDAIRVCSQNN
>MS0866 aroC, AroC protein
MAGNSIGQLFRVTTFGESHGIALGCIVDGVPPNMALSEADIQPDLDRRKP
GTSRYTTPRREDDEVQILSGVFEGKTTGTSIGMIIKNGDQRSKDYGDIMD
KFRPGHADYTYQQKYGIRDYRGGGRSSARETAMRVAAGAIAKKYLREQFG
VEVRGFLSQIGDVKIAPQNISEIDWAQVNDNPFFCPDQSAVEKFDELIRQ
LKKDGDSIGAKLTVVAENVPVGLGEPVFDRLDADLAHALMGINAVKAVEI
GDGFAVVEQRGTQHRDEMTPQGFLSNHAGGILGGISTGQPIIATIALKPT
SSITVPGRTVNLNNEPVELITKGRHDPCVGIRAVPIAEAMTAIVLLDHLL
RHRAQCGLK
>MS0133 aroE, AroE protein
MQLKSSLIVNQHLRLEQITEDDAEPVFRLICRQRDYLSRWLPGVGLTSNV
SSTLKFIRSLKPLEQVFTIRRDDEIIGLVSFNKADYSNLKLEIGYWLSQS
EQKQGIMTQCVQTMIDYAFNQLYFNRIQIKCAIGNTASKGIPQRLGFQLE
GIERQGLLLLSGEFADFEIYSMLAQDWKNKQDKQIMDTYAVWGNPIAQSK
SPAIHKIFAEQTGQNMKYIAMLGDEQHFERQLQEFFAQGAKGCNITAPFK
ERAYRLADEYSERALTAGACNTLKKLENGKLYADNTDGAGLVSDLQRLGW
LKPNQQILILGAGGATKGVLLPLLQAQQKILIANRTLAKAEELAEKFSPY
GEIRAVELKTIPPYRYDVVINATSLGLTGKTADIQPEILQQAGAVYDMQY
AKETDTPFIALAKSLGVNNVSDGFGMLVGQAAHSFRLWRGIMPDIEVLLN
RGI
>MS2315 aroE, AroE protein
MINKDTQLCISLSGRPSNFGTRFHNYMYEKLGLNFVYKAFTTNDIEHAVK
GVRALGIRGCAVSMPFKESCMPFLDEISPSAKAIESVNTIVNTDGYLKAY
NTDYIAISKLIAKYQLKPTACVIIQGSGGMAKAVAAAFKNAGFDNLKIYA
RNATTGGYLAKLYGYQYIDSLYGQNADILVNATPIGMKGGGKEESIISFP
EAMIDQASVAFDVVAMPAETPLIKYARQQGKTVISGAEVAVLQAVEQFEL
YTGQRPGDELIAEAASFARANS
>MS1104 aroG, AroG protein
MKDSIHNVHIIDEKVLITPAELKQKLPLPIALRTQIETHRREIADIVHKK
DDRLLVVIGPCSVHDTKAAIDYAKRLKALSDELKDQLYIVMRVYFEKPRT
TVGWKGLINDPRIDGTFNVEEGLHIGRKLLLDLAEMGLPLATEALDPMTP
QYLADLFSWSAIGARTTESQTHRELASGLSMAVGFKNGTDGSLATAINAM
KAASMGHSFIGINQQGQVNLLHTEGNPDGHVILRGGKKPNYQQEFVNQCE
EELAKAGLETAIMIDCSHGNSNKDYKRQPSVAKDAVNQIVAGNKSIIGLM
IESNINAGNQSSEQKVSEMKYGVSITDACIDWETTDNLLRKIAAALKNRA
E
>MS1184 aroG, AroG protein
MISFKVRLNFSIFRMIYRELIMPTKNKNNIRVANDDTRIANIEQLLPPVA
LLEKYPASNVAVKTVRNARNKAHQIIHGEDDRLLVIIGPCSIHDPKAALE
YANRMAKMREKYKDTLEIIMRVYFEKPRTTVGWKGLINDPYLNDTYALND
GLRIARKLLSDINDLGLPTAGEFLDMITPQYVADFMSWGAIGARTTESQV
HRELASGLSCAVGFKNGTNGGVKIALDAIGAAEASHHFLSVTKFGHSAIV
STKGNLDCHIILRGGDKGTNYDAENIAKVCANIEKSGRIGHVMIDFSHAN
SSKQFKKQVEVCHDVAKQIAQGSNQIFGVMVESHLVEGRQDLVNGKAETY
GQSITDACIGWDDTEIVLQELSDAVAARRKVNGK
>MS1969 aroK, AroK protein
MRLRILLLFIENFKKNNTMAEKRNIFLVGPMGAGKSTIGRQLAQLLNMEF
IDSDNEIEQRAGADISWIFDIEGEDGFRKREERIINELTQKQGIVLSTGG
GAILSKETRNHLSARGIVIYLQTTVDKQFERTQRDKKRPLLQGVEDVRKV
LEDLAQVRNPLYEEVADITLPTDEQSAKLMASHIVELIDNFNS
>MS1790 aroQ, AroQ protein
MSQLSRILLLNGPNLNMLGAREPKHYGTLSLAAIEANVQALAAKNNIELE
CFQANSEEKLIDKIHQSFKKVDFILINPAAFTHTSVALRDALLAVAIPFV
EIHLSNIHKREPFRHHSYFSDVAEGVICGLGAKGYECAFEFAVEFLAKKA
>MS1277 artI, ArtI protein
MFKKLVLLATGMFAVATTTQAVAADSLLDRINNKGTITVGTEGTYAPFTY
HDASGKLTGYDVEVTRAVADKLGVKVEFKETAWDSMMAGLKAGRFDIVAN
QVALTTPERQATFDKSEPYSWSGAMMAVRADDDSIKTLDDIKDRKAAQSL
TSNYGELAREKQAKIVPVDGLAQSLLVVQQKRADFTLNDSLAILDYLKKN
PNSGLKSAWEAPAEEKLGSGLIVNKGNDEALAKISAAVIELQKDGTLKKL
GEQFFGKDISVK
>MS0704 artI, ArtI protein
MKKLLLSTLLITTAFAVSAKDISFAMEPTYPPFEFTNEKGEIIGFDVDIA
NALCKEMQANCTFKSQAFDALIQGLKQKRFDASISGMGITEARKKQVLFT
EPYFSSSAAFIAKKGTDFTKVKTIGVQNGTTYQNYIIKEKPEYEVKAYAS
FQDALLDIQNGRIDAIFGDIPVLVDMIKKTPELAFAGEKIDNKTYFGNGL
GIAANKANQELIDEFNQALIKIRQNGEYQKIYDKWMTAK
>MS1684 artI, ArtI protein
MKIFKKTTALLAAALLATGLTACDNKDSGAASADNNAVSAIERIKKADKV
RIGVFSDKPPFGYVDKDGKVQGFDVEIAKAVTKDLLGDENKAEFVLVEAA
NRAEYLLSNKVDITMANFTVTPERKEVVNFAKPYMKVALGVVSKQDAPIT
DVAQLADKTLLLNKGTTADAYFTKNFPKNKSLKFEQNTETFQALLDGRGD
ALSHDNTLLFAWAKENPGYVVAIKNLGDLDYIAPAVKKEDTDLLQWLDGE
IEKLAKDGTLNKAYQKTLQPIYGDEIKEADVLVEYQ
>MS0900 artI, ArtI protein
MKKATLATLIAAMFVTATAQAQTSPDTLTKVLETKELVVCSPGDYKPFSF
DNNGKFEGVDNDLMDKLAQSMGAKVTIVKTTWKTLMDDFTANKCDIAVGG
ISITLERQQKALFTEPYFINGKTPIVRCENVDKYQTVEQINRPEVRIIAN
PGGSNEKYARNELSNANLTMNAENLTIFQQVIDKKVDVFVSEAAEAIVKA
HEHKGVLCAVNPDKPLKPAQNGWLIHNGDYRFKSYVDQFLHLEKMSGNLD
KTINKWLPRD
>MS0220 artI, ArtI protein
MQVVLKRRKQNNSDNIYLTINQGSYMKKLLLAAALAGTTFAAQARDITFA
MEPSYPPFELTNAQGEIIGFDVDVAKAICKEIEANCNFKSQSFDALIPSL
KAKRFDAAISAIDITETRAKQVLFSDAYYDSSASFIAVKGKADLNSAKNI
GVQNGTTFQQYTVAEAKQYSPKAYTSLQDAILDLKNGRIDIIFGDTAVLA
DMLAKEPELTFVGDKVTNKKYFGNGLGIAVNKSDKALVENLNKGLAAIKA
NGEYQKIYDKWMTAK
>MS1687 artM, ArtM protein
MSIIMNWQYIWNALPRFVDATILTLELSFWAILFSVIIGVICAVVMSYRV
RGLQTIVKAYIELSRNTPLLIQIFFLYFGLSKIGVKLEGFTCAVIGLAFL
GGSYMAEAVRAGIESVSKGQVESALSIGLTPMQTFRYVVFPQAFAVATPA
IGANCLFLMKETSVVSAIAIAELMFMAKEIIGMDYKTNEALFLLVVFYLI
ILLPVSVFIGYLERRLRRAKYGA
>MS0221 artM, ArtM protein
MFFEYLPLMSTATLMTLGLAVCSLIAGLVLAIFFVVLETNKFVCVRKPTA
IFVTLLRGLPEILVVLLIYFGSTELVEKLTGEYIEFSPFLCGVIALAIIF
AAYASQTLRGAIQAIPLGQWESGAALGLSRGYTFVNIILPQVWRHALPGL
SNQWLVLLKDTALVSLIGVDDLMRQASLVNTNTHQPFTWYSFAALLYLII
TLVSQFFMRKLEMRFTRFERGVK
>MS1686 artM, ArtM protein
MGLTLLFEGNNLQRLLAGLGITAEIAFVSVFFACILGIVMGVVMTSRNIF
VRGFCRLYLEIVRIIPLLAILFIVYFGVAKWFNVHLSGVTVCILVFIFWG
TAEMGDLVRGALTSIEKHQTEAAYALGLSKIQTFIYILLPQSLKRVTPGA
INLFTRMIKTSSLAMLIGVLEVIKVGQQIIETSLFRDPTSALWIYGVIFA
LYFAICYPLSLFSKYLEKRWEN
>MS1276 artM, ArtM protein
MLNNLLLSIPFMTESRVDLVISAFWPMVEAAVLVSIPLAVSSFIIGMIIA
VAVALVRVTPVNGVIHRLFLVIVKVYISIIRGTPMLVQISVVFYGLPALG
IFIDPIPAAIIGFSLNIGAYASETVRAAISSVPKGQWEAGYTIGMSYMQT
FRRIIAPQAFRVAVPPLSNTFIGLFKDTSLASVVTVTEMFRVAQQMANMS
YDFLPIYIEAGLIYWCFCWVLFVIQAKVEKRMERYVAR
>MS0222 artM, ArtM protein
MFREYFMEIARGIPTSLLLTAVALAVAFVLALFLTFLLSMENKPVKRVIN
IFLTLFTGTPLLVQFFLIYSGPGQFQWIVNSALWPLLSNAWFCAMFALAL
NSAAYSTQLFHGAVKAIPKGQWESCAALGLSRLQTLKILIPYALKRALPS
YSNEIILVFKGTSLASTITIMDIMGYARQLYGTEYDAITIYGIAGVIYLV
ITGLMTLLLRKLEHKVLAFERLEVEKA
>MS1101 asd, Asd protein
MAILFLLFHDFLPPYFLLQLKICRIERLFIAERIMSTSLNIAIAANFDLC
EKIASYLEESLLEVEKLSIVEIYPFSEEQGIRFNGKAVAQLPVDEVEWSD
FNYLFFAGDLAHIPLLAKASEAGCLTIEMNGVCSALADVPVVIPGVNEEQ
LRDLRQRNIVSLPDAQVTQFALSVRSLLNNASNAQIVVSSLLPASYYDAD
GVHKLVGQTAKLLNGIPPDEEEMRFAFDVFPAKSLNLNAQLQRVFPQLEN
VVFHQIHVPVFYGLAQMVTVKAEFEPEQDSILAEWSTNDLIRYHQDKVMT
PVLNGEAENNEDEVHLQISALESVEGGIQYWLVADNQRFSQAFLAVKLLE
SIYRQGY
>MS0006 asd, Asd protein
MKNVGFIGWRGMVGSVLMDRMQQEQDFANLNPVFFTTSQAGQKAPVFGGK
EAGNLKDAFDIEELKKLDIIVTCQGGDYTNEVYPKLKATGWDGYWVDAAS
ALRMEKDAIIVLDPVNQHVIADGLKNGIKTFVGGNCTVSLMLMALGGLFE
RDLVEWISVATYQAASGAGAKNMRELVSQMGLLEKSVSEELANPASSILD
IERKVTAEMRADSFPTDNFGAALAGSLIPWIDKLLPSGQTKEEWKGYAET
NKILGLSDNPIPVDGLCVRIGALRCHSQAFTIKLKKDVPLEEIEQILASH
NEWVKVIPNDKETTLRELTPAKVTGTLSVPVGRLRKLAMGPEYLAAFTVG
DQLLWGAAEPVRRILKQLVA
>MS0036 asnA, AsnA protein
MKKSFILQQQEISFTKNTFTEKLAEHLGLVEVQGPILSQVGNGIQDNLSG
TEKAVQVNVKMITDAAFEVVHSLAKWKRHTLARFGFAEGEGLFVHMKALR
PDEDSLDQTHSVYVDQWDWEKVIPEGRRNLDYLKETVREIYAAILETEAA
VDKKYGLKSFLPKEITFIHSEDLVKDYPGMTDKERENELCKKYGAVFLIG
IGGVLPDGKPHDGRAPDYDDWTTTSEGEYKGLNGDILVWNPILNRAFEVS
SMGIRVDETALRKQLSITGDEDRLKFDWHQDLINGRMPLSIGGGIGQSRL
AMLLLQKRHIGEVQSSVWPKAVMEQYENIL
>MS1984 aspA, AspA protein
MAATRKEVDLLGEREVPADAYWGIHTLRAVENFNISKVTISDVPEFVKGM
VMVKKATALANGELGAIPADIAKAIVAACDEILTTGKCLDQFPSDVYQGG
AGTSVNMNTNEVVANLALEKIGHQKGEYNVINPMDHVNASQSTNDAYPTG
FRIAVYNSILKLMDKIQYLHDGFDNKAKEFANILKMGRTQLQDAVPMTVG
QEFKAFAVLLEEEVRNLKHAADLLLEVNLGATAIGTGLNTPAGYSELAVK
RLAEVTGLPCVKASNLIEATSDCGSYVMVHGALKRTAVKLSKICNDLRLL
SSGPRAGLNEINLPELQAGSSIMPAKVNPVVPEVVNQVCFKVMGNDTTVT
FAAEAGQLQLNVMEPVIGQAMFESIDILANACVNLRDKCIDGITVNKEIC
ENYVLNSIGIVTYLNPFIGHHNGDIVGKICAQTGRSVRDVVLEKGLLTEA
ELDDILSVENLMNPTYKAKLSK
>MS1248 avtA, AvtA protein
MRYDKMSPFIVMDIVREAAKYPNAIHFEIGQPDLAPSEKVKKALQSAVEN
NKFSYTESLGLLALREKICQYYDRTYHVKITPNRVLLTPGTSGAFLIAYA
LTLAQDDKLGLTDPSYPCYKNFAYMMDIQPEFMPVDKHNCYQLEVGQLKG
RNIKALQISSPANPTGNIYTAESLKSLNDYCMENHIDFISDELYHGLVYD
QNAATALQFNPRAYVINGFSKYYCMPGMRLGWIIVPEDKVREAEIIAQNI
FISAPTLSQYAALEAFEEEFLTATKQVFQQRRDFLYDALKDLFTIEFKPQ
GAFYLWADVSKYTDDSYQFAKKMLHEIQVAATPGIDFGENGTKHYLRFAY
TRDIEHLREGVERMKQWLKNK
>MS1797 avtA, AvtA protein
MELFPKSNKLEHVCYDIRGPVHKAALRLEEEGHKILKLNIGNPAPFGFEA
PDEILIDVIRNLPTAQGYCDSKGLYSARKAIVQYYQSKGIHGATVNDVYI
GNGASELITMAMQALLNDGDEVLVPMPDYPLWTAAVTLAGGKAVHYLCDE
EQDWFPAIDDIKSKITSRTKAIVIINPNNPTGAVYSKELLLEIAEIARQN
GLLIFSDEIYDKILYDGAVHHHIAGLAPDLLTITMNGLSKAYRICGFRQG
WMILNGPKDKARGYIEGLDMIASMRLCANVPMQHAIQTALGGYQSINELI
VPGGRLYEQRNRAYELLNQIPGVSCVKPMGALYMFPKIDIKKFNIYDDEK
LVLDLLAQEKVLLVHGRGFNWHAPDHFRIVTLPYVHQIEEALNKFARFME
NYHQ
>MS0764 azlC, AzlC protein
MSEIVSKTPVRDAAKAAFPYSAPMIAGFIFLGIAYGLYMKQLGFGVLFPV
FMALLIYAGSVEFIVAAALVAPFSPLNVFLICLMVSGRQIFYGISMLEKY
GGHLGKKRWYLITSLVDEAFSLNYMAKIPSYIDKGWYMFFVSLYLQIYWV
MGAGIGNLFGAMLPFDLKGIEFAMTALFIIIFAENWLKEKSHESSLLGLG
ITLTSLIIVGKEQFLIPSLLGIWIMLTLSRPKLSSKLKRIE
>MS0765 azlD, AzlD protein
MTLTEQIITVGMGILGVHICRVLPFLIFPPNRPIPEYIRYLGKVLPAAMF
GMLVIYCYKNVDIFSGFHGFPEFLAGLITLALHLWKKNMFLSMAVGTGLY
MFLVQAVFVN
>MS0488 brnQ, BrnQ protein
MNKNTFIVGFTLFAIFFGAGNLIFPPKLGLESGSEFWSAITGFILSGVGL
PLLGIIVSAFYEGGYKTATTKISPWFSVIFLMAVYLSIGPFFAIPRTAAT
SYEMAILPFIGKSSSLSMLIFTLFYFAISLWFALNPSKTVSRIGAILTPI
LLFAILALVVKAFFILIDNDPSEVIFTLRESNNSFLFTGIIDGYLTMDTL
ASIAYSVIVIAAIQSKGIKHGKELTKQTLLAGIVAAIALAAIYLAIGWIG
NRVHISAETISLLQERNQDIGTYILNKITAQAFGNFGRSLLGVIVSLACL
TTAIGLIVSVSEYFNEIYHKISYKTYVIIFTLIGFIIANQGLSAVISKSV
PILLVLYPISMTIILLLSVNIFVKVPLVAQRLSIALTTLVSIGSVAGLEQ
ANNLPLKDYSMEWIPFAVTGALLGCLIHVFYKSES
>MS2237 carA, CarA protein
MSEPAILVLADGSIFRGTSIGAAGHTIGEVVFNTSMTGYQEILTDPSYFK
QIVTLTYPHIGNTGTNSEDLESNGVYAAGLIIRDLPMIHSNFRANQSLSD
YLKDNNVVAIADIDTRRLTRLLRDKGAMAGCIMSGEVDEQKALELALSFG
SMAGKDLAQEVTAQQSYRWTQGEWVLGKGYAEQQNASFNVVAYDFGVKHN
ILRMLAERGCKLTVVPAKTSAEEVLALNPDGIFLSNGPGDPEPCDYAISA
IQTLLATKKPIFGICLGHQLLGLASGGKTKKMAFGHHGANHPVQDLDTQK
VMITSQNHGFEVDEHSLPANVRVTHRSLFDNSVQGIELTDQPAFSFQGHP
EASPGPHDVAYLFDKFIDAMKQAKA
>MS1491 carB, CarB protein
MNILVTSAGQRVSLVQAFKKELSQLVSDGKVLTVDLNPELAPACYVADGH
FQVPRVTDAGYIPTLLKICEENNVKLIIPTIDTELLILSEHLQRFKEKGI
FISVSDTEFVRKCRDKRLTNQLFIEHNIAVPKQFEKGQFEYPVFVKPYNG
SLSKGIFVAEKPEDISPEQLENPELMFMQYISPAEYDEYTVDCYFDKNSE
LKSAVPRKRIFVRAGEINKGVTRKNAIVTQLSEKLSRLPGARGCLTIQVF
YKESTAEILGIEINPRFGGGYPLSYLAGANYPRWLIQEYLFNQPIPAFDD
WEADLLMLRYDAEVLAHHYEK
>MS2236 carB, CarB protein
MAMSTKPSGASKFVYKTANNFLKVLSRENNMPKRNDINTILIIGAGPIVI
GQACEFDYSGAQACKALREEGYKVVLVNSNPATIMTDPNMADVTYIEPIH
WQTVEKIIEKERPDAILPTMGGQTALNCALDLSKNGVLKKYGVELIGATE
DAIDKAEDRGRFKEAMAKIGLNTPKSFVCHSFDEAWKAQEEVGFPTLIRP
SFTMGGSGGGIAYNRDEFQAICERGFEASPTHELLIEQSVLGWKEYEMEV
VRDKADNCIIVCSIENFDPMGVHTGDSITVAPAQTLTDKEYQIMRNASLA
VLREIGVDTGGSNVQFAINPENGEMIVIEMNPRVSRSSALASKATGFPIA
KVAAKLAVGYTLNELRNDITGGLIPASFEPSIDYVVTKVPRFAFEKFPKA
DDRLTTQMKSVGEVMAMGRTFQESIQKALRGLETGICGFNLKTEDMEKLR
HEISNPGPERLLYVADAFGIGWSIEDVHHYSKIDPWFLIQIQDLVLEELA
LEKKTLADLNKDEIYRLKRKGFSDKRIAQLVKSDETSVRSLRNAFNIHPV
YKRVDTCAGEFKSDTAYLYSTYEEECEAAPSDRKKVMILGGGPNRIGQGI
EFDYCCVHAALALRESGFETIMVNCNPETVSTDFDTSDRLYFEPLTLEDV
LEIIHVEKPWGVIVHYGGQTPLKLANALHANGVNIIGTSADSIDAAEDRE
RFQKILHDLNLKQPANRTARNTQEAVGLANEVGYPLVVRPSYVLGGRAMQ
IVYNDEELNRYMREAVSVSNDSPILLDHFLNNAIEVDVDCICDGEQVIIG
GIMQHIEQAGIHSGDSACSLPPYSLSMEIQDEIRRQTAAMARALNVVGLM
NVQFAVQNDVIYVLEVNPRASRTVPFVSKATGQPLAKIAARVMAGISLKE
QGIQGEVVPQDFYAVKEAVFPFIKFPGVDTILGPEMRSTGEVMGVGATFA
EAFLKAQIGAGERIPRTGKVFVSVDNNDKPRLLPIVKRLQEQGYGLCATF
GTAKFLRENGIAVQTVNKVREGRPHIVDAIKNDEIALIINTAGGMAESVA
DSASIRASALKQRVPLYTTIAGADAISLSVANLDIHDVYSVQGLHAGLTK
>MS1273 csdB, CsdB protein
MFDTTGFRSHFPYFQHPDRVIYLDNAATTLKPQSLIDATVKFYQSAGSVH
RSQYDEEQTALYEQARSQVRQLINAESDKAIIWTSGTTQAINTVANGLIP
YIQSDDEIIISEADHHANFVTWSMIAQKCGAKLRILPIQDNWLIDENALL
EALNKRTKVVVLNFVSNVTGTEQPVEHLIRLIRKHSSALVSVDAAQAISH
VKIDLRKLDADFLSFSAHKIYGPNGLGVLSGKLTALELLQPLIYGGKMVD
RVSKQQISFAELPYRLEAGTPNIAGVIGFNAVLSWLNQWDFEQAEHHAVQ
LAEQTKVRLKNYEFCQLFNSPKPSSVISFVFKNIAGSDLATLLAEQNIAL
RTGVHCAQPYLSRLGQHSTLRLSFAPYNTQQEVDAFFTALDKSLALLEE
>MS2212 cysE, CysE protein
MLREVWNNIRNEAKELVEHEPVLASFFHSTILKHKNLGGALSYILANKLA
TSTMPAITLREIIEETYQDDPRIIDSAACDIHAVRQRDPAVGLWATPLLY
LKGFHAIQSYRITHHLWQQNRKSLAIYLQNQISVAFDVDIHPAARVGCGI
MFDHATGIVVGETAVIENDVSILQGVTLGGTGKESGDRHPKIREGVMIGA
GAKILGNIEVGKYAKIGANSVVLQPVPEYATAAGVPAKIISKDRSAKPAF
DMNQYFIDDAEALNI
>MS1252 cysH, CysH protein
MTTQNQIENGHLDWLEAESIYIIREVVAECSHPALLFSGGKDSVVLLALA
RKAFQLEGRDLVLPFPLVHIDTGHNYPEVIQFRDEQVKKLNARLVVGHVE
DSIAKGTVVLRKETDSRNAAQAVTLLETIEANGFDALMGGARRDEEKARA
KERIFSFRDEFGQWDPKAQRPELWSLYNGKLHKGENMRVFPISNWTELDI
WQYIEREKLELPPIYYAHQREVVERNGLLVPVTPITPKQPGDESKVVSVR
FRTVGDISCTCPVASTAATPAEIIKETAVTEISERSATRMDDRTSEAAME
QRKKQGYF
>MS1253 cysH, CysH protein
MIIKPNFWQIPQPTATDFAALAEKEQLLAQRIHEIANRHQHAKFASSLAV
EDMVITDVIAKSKAKITVFTLETGRLNPETLALADTVKKTYPDLDFRLFR
PNPIAAEKYDREKGRFAFYESVELRRECCFIRKIEPLNRALADADAWLTG
QRREQSVTRTELEFHEWDQSRGIDKYNPIFDWHEMDVWAYILKYDIPYNE
LYKQGYPSIGCEPCTKQVKAGEDIRAGRWWWENKDSKECGLHK
>MS1770 cysK, CysK protein
MTIFADNSYSIGNTPLVRLHNFGHNGNLVVKIESRNPSFSVKCRIGANMV
WQAEKDGVLTKDKEIVDATSGNTGIALAYVAAARGYKITLTMPETMSLER
KRLLRGLGVNLVLTEGAKGMKGAIAKAEEIVASDPNRYIMLKQFENPANP
AIHQQTTGVEIWQATEGKVDVVVAGVGTGGTITGISRAIKLDQGKQITSV
AVEPAESPVITQILAGEEIKPGPHKIQGIGAGFIPKNLDLSLIDRVETVD
SDTAIKTARRLMAEEGILAGISSGAAVAAADRLAKLPEFQDKLIVAILPS
ASERYLSTALFEGIEG
>MS1769 cysZ, CysZ protein
MLFPTALCMLALIRFLIYIFLGSFMKKEKEIKSGFHYFVMGWHLIGQQGL
RRFVVMPVLLNIILLSGLFWLFVSKISDMIEGVISFIPDWLSWLSGILLA
LSILMILLVFYFIFNTLSGFIAAPFNGLLAEKAEAMLTGESGENMTTMEF
IKDTPRMLAREWQKLLYSLPKYIGLFLLSFIPLIGQSLIPVLTFLFTAWM
MAIQYCDYPFDNHKISFPTMKFKLNENRIQNVTFGTFVTLCTFVPFINFV
IIPVAVCGATAMWVDTYRKQLYLDKNLQKSTAVSTASTEKPGSDIARHSN
NIRNR
>MS2290 dadA, DadA protein
MLKFSYQEHIKTYYYDTRNQDFTQPTLTGGQSADVCVVGAGFGGLSAALE
LAERGKSVIVLEGARIGFGASGRNGGQAINGFEDGMDAYIDDMGLEKARK
LWEMSLEAIDIIEQRIAKYNIQCDWRKGYATLALNHRRMDDLVTIEQTSR
EIFAYDYMQLWNKAELKQYLGSDIYVGGLYDGNSGHLHPLNYCLGLAKAC
LDLGVRIFEQSPVIDLDVGKSKVIAETAEGSVTAENVVLATNAYVTSLPK
RIQRGTARKILPIDSFIIATEPLDQETANAVINNGMSVCDNNLLLDYYRL
SADNRLLFGSDSSSNKDMVQVMRNNMLHVFPQLENVKIDYGWAGPIDMTI
NAKPCLGRIASNIFYAHGYSGHGVALTGLAGRLIAEAIEGDDERFAIFES
LKSPSVYGGRIVKNLATKIGVKYYKWLDKYR
>MS1592 dadA, DadA protein
MLKVTTAHIHFNRDNTPVSEQFDDIYFSTADGLEESRYVFQEGNNLWRRW
LQFGENHFVIAETGFGTGLNFLAVTALFREFRTQYPDSPLKRLFFISFEK
YPMSCADLRSAHQAYPQFNSLAEQLRQNWLQPIVGCYRFHFEETVLDLWF
GDIADNLPQLGDYMVNKIDAWFLDGFAPSKNPEMWNENLYKQMFRYTKPA
GTFATFTAASAVKKGLESAGFSLQKRKGFGKKRECLQGFKPLNAEQNPAV
HTPWLLSRSATLSENTDIAIIGGGISSLFSAISLLQRGANVTLYCEDEQP
ALNASGNKQGAFYPQLSDDDIHNIRFYIHAFAYGQQQLRWAIQQGIEFEH
EFCGVALCAYDEKSAVKLAKISDYDWDTSLYQPLNQQELSEKAGLPLPCG
GGFIPQGAWLAPRQFVQNGFAFAQKCGLKLKTFEKITALSQSEKGWILHN
DKNEQFHHETVIIANGHKLKQFTQTARIPVYSVRGQVSQIPTSSQLLKLK
SVLCYDGYLTPADQAKQFHCIGASHVRDCEDRDFSLQEQQENQAKIQLNI
AEDWTKEVNTADNLARTGIRCAVRDRIPLVGNVPDFERQADEYRNIFNLR
RRKQFIPQAAVFENLYLVGALGSRGLTSAPLLGEILASMIYGEPIPLSED
ILHCLNPNRSWMRKLLKGTPVK
>MS0282 dapA, DapA protein
MKKINLEKTMSIQGIIPVMLTPFMENNEIDYDGLRKLTDWYIDNGSDALF
AACQSSEILFLSLEERVKITKTVMDQVQGRIPVVASGHISDSFEQQVEEL
TAIYNTGVDAVILITNRLDPNNEGTTVLKSNFEKLLAALPKDIVLGLYEC
PVPYRRLLTDGEISYFAGFENMVVLKDVSCNLETVKRRIQLTKNSNLKIV
NANAAIAFEAMKAGSEGFSGVFNNIHPDLYAYLYKNKNSSDPMVQELANF
LAICGAAESFGYPNFAKLMHTKIGTFKHYNSRVIKDDIKVKYWAVEELLD
HIMQGSERYRNKLNLR
>MS0067 dapA, DapA protein
MFKPQGIIAPVLTALDDNEKFNPEVYKNYINYLIKAGIHGIFPLGTNGEF
YGFNEAEKLEIIKTAIEAADGCVPVYAGTGCVTTKETVEFSKKVVDLGVD
VLSIVSPYYIAVTQDDLYRHYATIAENVTAPILMYNIPARTGNNIDYKTI
KKLAQYENIIGVKDSSGNFDNTLKYIENTDSRLSIMAGSDSLILWTLLAG
GTGAISGCSNVFPELMVSIYEYWKQGDFEKANEAQKKIRDFRNVMQMGNP
NSVVKRAAQLRGLGTGPAKEPSNCANNPVIDKALQDVFKLYD
>MS0265 dapA, DapA protein
MSSTRPLFYGSIVALITPMDGHGEVNYDELKKLVEYHIASGTHAIVSVGT
TGESATLSIDENVKTIQKTVEFAAGRIPVIAGTGANATSEAITMTKLLNN
SGVAGCLSVVPYYNKPTQEGMYQHFKAIAECTDLPQILYNVPGRTGSDMK
PETVGRLSKIENIVAIKEATGDVSRVKQIKELAGEDFIFLSGDDATGLES
IKLGGQGVISVTNNLAAADMAKMCELALAGNFDEAEAINQRLMGLHHDLF
IEGNPIPVKWAAYKLGLIKEPVLRLPLTTLSEAAQPKVLEALKQAGLI
>MS0971 dapB, DapB protein
MTLKLAIVGAGGRMGRQLIQAVQAAEGVELGAAFERKGSSLIGADAGELA
GLGELGIKVAEDLAAEKDKFDIIIDFTRPEGSLEHIKFCVANNKKLILGT
TGFDDAGKQAIGKAAEKTAIVFASNYSVGVNLVFKLLEKAAKVMGDYSDI
EIIEAHHRHKVDAPSGTALSMGEHIAKTLGRDLKVNGVFSREGITGERKR
TDIGFSTIRAADVVGEHTVWFADIGERVEISHKASSRMTFANGAVRAAKW
LANKQIGLFDMTDVLDLNNL
>MS1177 dapD, DapD protein
MSNLQSIIEAAFERRAEITPKTVDAQTKAAIEEVIAGLDCGKYRVAEKID
GDWVTHQWLKKAVLLSFRINDNQLIDGAETKYYDKVALKFADYTEERFQQ
EGFRVVPSATVRKGAYIAKNTVLMPSYVNIGAFVDEGTMVDTWVTVGSCA
QIGKNVHLSGGVGIGGVLEPLQANPTIIGDNCFIGARSEIVEGVIVEDGC
VISMGVFIGQSTKIYDRETGEVHYGRVPAGSVVVSGSLPSKDGSHSLYCA
VIVKKVDAKTLGKVGLNELLRTIEE
>MS1784 dapF, DapF protein
MQFSKMHGLGNDFVVVDAVTQNVYFPEEVIKKLADRHRGIGFDQMLIVEP
PYDPELDFHYRIFNADGSEVAQCGNGARCFARFVTLKGLTDKKDIAVSTT
NGKMILTVQDDGMIRVNMGEPVWEPAKIPFIANKFEKNYILRTDIQTVLC
GAVSMGNPHCTLVVDDVETANVTELGPLLENHERFPERVNVGFMQVINPN
HIKLRVYERGAGETQACGSGACAAAAIGIMQGLLENKVQVDLPGGSLWIE
WQGEGHPLYMTGDATHVYDGVIKL
>MS1199 dcp, Dcp protein
MSNPLLENTPLPQFSKIKPEHIQPAIEQLIQDCRITTENLLKQPQLSWDN
FCQPLSEVNDRLSKAWSPVSHLNSVKNSNELRDAYQACLPMLSEYGTWVG
QHQGLYNAYVQLKNSPEFAGYSPAQKKAVENSLRDFKLSGISLAPEQQKR
YGEIVSRLSELSSQFSNNVLDATMGWDKVITDEEQLKGLPESALQAAKQS
AQNKGVEGYRFTLEFPSYIPVMTYCENRELREEMYRAFVTRASDQGPNAG
KWDNSAIMEEILTLRVELAKLLGFNSYTELSLATKMAETPAQVLSFLDDL
AMRSKPQGEKELADLYAFCEKEFAITELEPWDISYYSEKEKQALYAINDE
ELRPYFPEQRVISGLFELIKRIFNIRAVERQGVDCWHKDVRFFDLIDETD
EVRGSFYLDLYAREHKRGGAWMDDCIGRKIKADGALQKPVAYLTCNFNAP
VGDKPALFTHDEVTTLFHEFGHGIHHMLTKVDIGDVSGINGVPWDAVELP
SQFMENWCWEEEALAFISGHYQTGEPLPKEKLTQLLKAKNFHAAMFVLRQ
LEFGIFDFRLHDNYKPGKANQILDTLNAVKDQVSVVKAVDWARTPHSFGH
IFSGGYAAGYYSYLWAEVLSADAFSRFEEEGIFNAVTGKSYLDEILTKGG
SEEPMVLFERFRGRKPTLDALLRHKGIAN
>MS0465 dppB, DppB protein
MQHYFIRRLIMMIPLMLLISFVAFSLMNLVPSDPAETMLRINNITVTDEA
VKEARQALGLDKPFLLRYALWLYALLQGDLGKSFLSNQNVWDEITQAFPA
TFYLAVTAFAVIFLLSLTLSLLCMLMLNSLWDKIIRGILFFFTALPNYWL
ALLFIWLFSVRLNWLPSNGLEQKSGIILPALTLSLGYIGVYVRLLRGAML
NQLQQPYVFYARTRGLSEKQILFKHILQNSLHTSYIAMGMSIPKLLAGSV
IIENIFALPGLGRLCIQAIFGRDYPVIQAYILLMAMLFLVGNFVIDWLQH
RRDPRIKRGY
>MS1367 dppB, DppB protein
MFKFILKRILMVIPTFLAITLVTFALVHFIPGDPVEIRMGERGVDPIVHA
QMMEQMGLNDPLPEQYLNYIKGVVQGDFGRSFRNNEPVLKEFFTLFPATV
ELAFFALLWSLIAGIFLGVIAAVKKDSWISHTVTALSLTGYSMPIFWWGL
ILILYVSNFLGLPAGGRLPDEYWIDFDTGFMLIDTWNSGEPGAFVAAIKS
LILPAVVLGTIPLAVVTRMTRSSMLEVLGEDYIRTAKAKGLSTTRIVIVH
ALRNALIPVITVVGLIVGQLLSGAVLTENIFSWPGIGKWIIDAINARDYP
VLQGSVLIISTIIIVVNLLVDVIYGVVNPRIRHN
>MS0464 dppC, DppC protein
MSGFIKQLRSDIFAQCCLFILTMIGLAGIFAPWICTFDPATIDMQAKLLP
VSAQHWLGTDHLGRDIFSRLIWGVRSTVFYGLFAMLLTMMLGILIGMTAA
IGGKKTDEFIMRLCDVLLSFPGEIMILALVGMLGPGIEHILVAVILVKWA
WYARMIRGTVMQYTHKNYVHYSQAIGVSPWRIIRRHLLPVATAELIILAS
ADMGAVILLISGLSFLGLGVQPPTPEWGAMLSDAKNIMLLYPQQMLPAGL
AITLTVTAFNGFGDFLRDVLDPDNPLKGTNNE
>MS1366 dppC, DppC protein
MTTEITSSTPQTPLQEFWYYFRQNKGAVIGLTFIAAVFFICICAPFVSPY
DPIVQHRDALLLPPAWMENGSLSYFLGTDDIGRDILSRIIYGARLSVFIG
LLIVILSCIFGVILGLLAGYYGGLLDVIVMRLMDIMMAIPSLLLTIALVT
ILGPSLFNAAIAIAIVSVPSYVRLTRASVLNEKNRDYVVASRVAGAGVLR
LMFIVILPNCLAPLIVQMTMGISNAILELAALGFLGIGAQPPTPELGTML
AEARSFMQAASWLVTIPGVAILLLVLAFNLMGDGLRDALDPKLKQ
>MS1365 dppD, DppD protein
MSLLNVNQLSVHFGDGKAPFKAVDRISYSVNKGEVLGIVGESGSGKSVSS
LAIMGLIDYPGRVSAEALSFDGVDLLSLNEKQKRKIVGADVSMIFQDPMT
SLNPCYTVGYQIMEALKAHQGGSKKERRERTVELLKLVGIPAPESRLDVY
PHQLSGGMSQRVMIAMAIACKPRLLIADEPTTALDVTIQAQIVDLLLTLQ
KQENMALILITHDLALVAEAAHRIIVMYAGQVVEEGRAEEIFKRPKHPYT
QALLRSLPEFAEGKSRLQSLQGVVPGKYDRPQGCLLNPRCPYATEHCRRV
EPDLIQLGEGKVKCHTPLNAQGEPSNV
>MS0463 dppD, DppD protein
MNKPIIRFDNFSIENPDSDRPLIAPLNLTLPPYRTLALVGESGSGKTLLG
RSILGLLPEQLNTTGNIYFQDKKIISVTGTPTVDDKQKTNEIATLEIRGK
AVSFIMQNAINAFDPLFSLQDQFCETLQKHTALSYRQALIKAQQSVSKVK
LSSALLKRLPSQLSGGQLQRMMLALTFALEPELVIADEPTSALDSLTQFE
LLPLFKQMAKERSMIFITHDLALVQELADDIAVLKRGEIVEFRAKSILFS
HPQHPYTQYLLAMRAKLNQPFARLVRKKQ
>MS0827 gadB, GadB protein
MGRRPPYGTNMADISKHRQSLFCSDPQSIADYETAMSNAVKAVSNWLKNE
KMYTGGSIRELRKTIGSFNPSKQGVGVNQSLDHLVDIFLNPSLKVHHPHS
LAHLHCPTMVASQIAEVLINATNQSMDSWDQSPAGSIMEEQLIDWLRQKA
GYGQGTSGVFTSGGTQSNLMGILLARDWAVANHWKNEDGSEWSVQENGLP
AEALKKLKVVCSENAHFSVQKNMAMMGMGFQSVVTVPTNANAQMDVAELE
KTLATLKAEGKIVACIVATAGTTDAGAIDDLKAIRKLADAYQAWLHVDAA
WGGALLLSKDFRHLLDGIELTDSITLDFHKHFFQSISCGAFLLRDERNYR
FIDYKADYLNSEYDEEHGVPNLVSKSLQTTRRFDALKLWFTLEALGEDLY
ASMIDHGVKLTKQVEEYIRTTEGLEMLVPTQFAAVLFRVAPEGYPAEFID
ALNQNVADELFARGEANIGVTKVGNKQSLKMTTLSPIATLENVKALLALV
LAEAERIKDAIANGTYVPPID
>MS0196 gdhA, GdhA protein
MQTLTILLIRGKLMSSTVSSLEDFLSLVAQRDGNQPEFLQAVREVFTSIW
PFLEANPQYRSQALLERLVEPERAFQFRVAWTDDKGQVQVNRAFRVQFSS
AIGPYKGGMRFHPSVNLSILKFLGFEQIFKNALTTLPMGGGKGGSDFDPK
GKSDAEVMRFCQALVAELYRHIGPDTDVPAGDIGVGGREVGYLAGYMKKL
SNQAACVFTGRGLSFGGSLIRPEATGYGLVYFAQAMLAEKGDSFQGKTVS
VSGSGNVAQYAIEKALQLGAKVVTCSDSAGYVYDEAGFTTEKLAALLDIK
NVKRGRVKDYAEQFGLQYFPGERPWGVKVDIALPCATQNELELTDAQKLI
ANGVQLVAEGANMPTTIEATEALQAAGVLFAPGKAANAGGVATSGLEMAQ
SSQRLFWSAEEVDQKLHNIMLDIHANCKKYGTDANGNINYVAGANIAGFV
KVADAMLAQGVY
>MS0262 glnA, GlnA protein
MANPNAIQRVAKLIEDNDVKFVLLRFTDIKGKEHGVSLPVNLVADELEDF
FEEGKMFDGSSVEGWKAINKADMLLMPMPETAVIDPFAQITTLSIRCSVY
EPNTMQSYDRDPRSIATRAENYLKSTGIADQALFGPEPEFFLFDDVRFST
EMNNVSYKIDDIEAAWNTNRKFEDGNNAYRPLKKGGYCAVAPIDNAHDIR
SEMCLILEEMGLVIEAHHHEVATAGQNEIASKFNTLTLKADETQIYKYVV
QNVALEYGKTACFMAKPFAGDNGSGMHCNMSLSKDGKNVFQGDKYAGLSE
TALYYIGGIIKHAKALNAFTNPTTNSYKRLVPGFEAPVLLAYSASNRSAS
IRIPAVTSPKAIRVEARFPDPLANPYLAFAALLMAGIDGIINKIHPGDAM
DKNLYDLPPEELKEIPAVCSSLEEALDSLQADHEFLIQGGVFSKEFIDAF
VAIKRKEVERVNMTPHPVEFEMYYA
>MS0426 glnK, GlnK protein
MKKIEAIIKPFKLDDVRESLSDIGITGMTVTEVRGFGRQKGHTELYRGAE
YMVDFLPKVKLEIIIPDELLDQCIEAIMETAQTGKIGDGKIFVYNVERVI
RIRTGEENEDAL
>MS0219 glnQ, GlnQ protein
MTISVKNLNFFYGSSQALFDINLTAEDGDTVVLLGPSGAGKSTLIRTFNL
LEVPKSGDLTVADNHFDLSQNTDAKKMRQLRQDVGMVFQQYNLWPHFTVM
ENLIEAPMKILGLTESEAQKEAMELLTRLRLEEHAHRFPLQLSGGQQQRV
AIARALMMKPKVLLFDEPTAALDPEITAQIVSIIQELQETGITQVIVTHE
VGVARKVATKVVYMEKGRIVETGDASCFEAPQTEQFRQYLSHD
>MS1685 glnQ, GlnQ protein
MALLEIKELVKNYGEVTALNGVNLSVEKGEVVVILGPSGCGKSTFLRCIN
GLEEIKSGSLKLADVGELGKDISWVKARQHIGMVFQSYELFAHMTVIDNI
LLGPLKVQKRARAEVEKQADALLKRVGLYERKNAYPRELSGGQKQRIAIV
RSLCMNPDIMLFDEVTAALDPEMVREVLDVVLGLAKDGMTMIIVTHEMQF
ARQVADRIVFMDNGNIIEESEPEQFFTSPKTERAKTFLNILDYYI
>MS1275 glnQ, GlnQ protein
MIKVKNIHKAFGENVILRGIDLDITKGEVVVILGPSGSGKTTFLRCLNAL
EMPEQGTIEFDNAAPLKIDFAAKPSKKDILALRRKAGMVFQNYNLFPHKT
ALENVMEGPVRVQSKKVAQAREEALALLTKVGLADKADLYPFQLSGGQQQ
RVGIARALALQPELMLFDEPTSALDPELVQDVLDTMKSLAKEGWTMVVVT
HEIKFALDVADLVIVMDDGVIVEQGSPKQLFDNPQHERTKAFLQRLRSH
>MS0611 gloA, GloA protein
MISLFTGFHHIAIIVSDYEKSKYFYTQILGAEVIEETYRASRHSYKLDLK
FADGSQIELFSFPSSPSRLTMPEACGLRHLAFKVKDIEEAVQYLKTQQIE
CEDIRIDELTGKKFTFFKDPDNLPLELYEFNSFKGG
>MS0703 gloA, GloA protein
MMRILHTMLRVGDLDRSVKFYQDVLGMRLLRTSENPEYKYSLAFLGYDDE
DKTAVIELTYNWGVTEYELGSAFGHIAIGVDDIHATCEAVKAHGGKVTRE
PGPVKGGSTVIAFVEDPDGYKIEFIENKNAKAALGN
>MS0597 gloA, GloA protein
MKLEHVAIYVQDLEKAKAFFMKYFNAQPNEKYHNPRTNLMTYFLTFSGGA
RLEIMTRPEIIELDKNIFRTGLIHLSMQVGGEEKVRELTERLRTDGYQVI
SEPRKTGDGYYESCVLDGEGNQIEIVA
>MS1994 glpB, GlpB protein
MNFDVVIIGAGIAGLTCGLTLQEKGVRCAIINNGQAALDFSSGSMDLLSR
LPNGSTVDSFAQSYAALAQQSPNHPYVILGKDVVLDKIQQFETLAKSLNL
SLVGSSDKNHKRVTALGGLRGTWLSPNSVPTVSLEGKFPHDNIVLLGIEG
YHDFQPQLLADNLKQNPQFAHCEITTNFLHIPELDHLRQNSREFRSVNIA
QVLEYKLSFNNLVDEIKQAVGNAKAAFLPACFGLDDQSFFESLKQATGIE
LYELPTLPPSLLGIRQHRQLRHRFEKLGGVMFNGDRALRSEFEGNKVARI
FTQLHLENAVTAKYFVLASGGFFSNGLVSEFEEIYEPLFRSDIVKTERFN
ATDRFSWISKRFADPQPYQSAGVVINAECQVQKDGNNVENLFAIGAVIGG
YNGIELGCGSGVAVTTALKVADNIIAKESSN
>MS0731 gltD, GltD protein
MAKFFLAPADNYDVKIGELVDKFVNKVRSFPPGTCPLVVQYASLRSSMSQ
TCGKCVPCRDGIPHLSFLLRDILAGEGDDSTMRQIRELAEMIRDGSDCAI
GYQPAIEILDSIEEFKEEYESHIHNKSCQKVIGQRIPCINMCPAHVDIPG
YIAHIGDGNYAEAINLIRKDNPLPTACGLVCEHPCEERCRRRLIDDAINI
RGLKKYAVDQVAADVVKVPQALPDTGKKVAVIGGGPAGLTCAYFLAQMGH
RVTIYERQKALGGMLRYGIPNYRFPKDRLDQDLNAILSAGRIEVKYGVMV
GDDIAIEDIYNSHDAMFVGIGAQKGKTLRIKGSEANNVFSAVEMLDDIGN
GKIPDYTDKVVVVIGGGNVAMDAARSAVRCKAKDVRIVYRRRQDDMTALH
AEIEAAIMEGIELITLAAPVAIEKDEQGNCTGLTVQPQMTGPYDHGGRPS
PVAVKKPPFTIGCDVILIAVGQDIISLPFEEFGMPANRGIFQADLTTAVP
DMDGVFVGGDCATGPATAIKAIAAGKVAAHNIDEYLGYHHEFPCETKAPP
PKENVRIQVGRANTTERPAYIRKCDFEHVENPYTYEEAMQEAERCLRCDH
FGCGVLQGGRDL
>MS0030 gltS, GltS protein
MTFDTYETLALACLVLLLGYFLVKRVKLLSNFNIPEPVVGGFIVAIVLTV
VHEIWGLSFSFDSNLQRTMMLVFFSSIGLSANFARLIKGGKPLVMFLVVA
AMLIAIQDTVGIFGSMALGLDPAYGLIAGSVTLTGGHGTGAAWAETLTND
FGISGAMELAMACATFGLVFGGIIGGPVARFLLTRLHKEEVPEDENVDDV
QEVFEKPVYRRKVNSRAIIETISMMAVCLLVGQFLDELAKGTAFQLPTFV
WCLFTGVILRNTLTLVFKFTAPDQTIDVLGTVGLSIFLAIALMSLKLWEL
AGLALPVFVILTLQVVVMATFAILVTYRVMGSDYDAVVLSAGHCGFGLGA
TPTAVANMQAVTAHFGHSHKAFLIVPMVGAFFIDLLNASLLKFFVEVAAY
FH
>MS1295 glyA, GlyA protein
MLQNHSIAEFDPVLWDAIQNENRRQEEHIELIASENYVTKAVMEAQGSQL
TNKYAEGYPGKRYYGGCEYVDIVEQLAIDRAKELFGADYANVQPHSGSQA
NAAVYGALLNAGDTILGMDLAHGGHLTHGAKVSFSGKIYNSVLYGITAEG
LIDYEDVRVKALESKPKMIVAGFSAYSQVVDWAKMREIADEVGAYLFVDM
AHVAGLIAAGLYPNPLPHAHVVTTTTHKTLAGPRGGLILSACGDEEIYKK
LNSSVFPANQGGPLMHVIAAKAVCFKEALQPEFKAYQAQVLKNAKAMVEV
FKQRGFEVVSKGTENHLFLVSFVKQGLTGKAADAALGEANITVNKNSVPN
DPQKPFITSGIRVGSPSITRRGFNEADASTLAGWMCDVLESIGKDNYDQV
IAETRAKVLEICKRLPVYGD
>MS0953 gntT, GntT protein
MLDTVLNTLAVAKVIDGSQLWVETLRLLGKTPIALLITLIVSIVLLKNQR
SYEQIEKICDSSLGPICAIVLVTGAGGMFGGVLRASGIGEVLASTLGHTG
MPVIVAAFIISSALRVAQGSATVALTTTAALISPMVAADPSLSQMDLCFI
VISIASGATVLAHVNDSGFWLVSRFLEIDTKTMLKTWTVQETLIGIVGFI
IAYVGSIIF
>MS0686 gntT, GntT protein
MSGISLIISFIIAIIIMIWMISKLKVHPFLSLMTISLALALVAGIELNKI
PGMIGDGFSSTFKSIGIVIIFGAIIGTILEKTGAALKLADMVVKLVGQKH
PELAMLIMGAIVGIPVFCDSGFVVLNPIREALYKKIAANPVATAVALSGG
LYASHVFIPPTPGPIAAAGALGLESNLLLVIIMGVVVSIPVLTAVYFFAG
YIGKRVTLDEEAQADAAIVKNYEQLLKQYGILPGKFLSLAPILMPIVFMA
LGSIAKIAEIGGNTGIIIQFLGTPIIALAIGVIFSVFLLLQTKKITEFND
LTNETLKIVGPILFITAAGGVLGKVITEAGFVDYIKQNAHIISTTGIFFP
FIISAVLKTAQGSSTVAIITTASIMGMYSAGDSLMSVLGLTSEIAAALCV
MAIAAGAMCVSHANDSYFWVVTNFGKMTAQQGYKTQTLMTFIMGIVGIIT
VYILSLLLL
>MS1977 gntT, GntT protein
MSLKIAAILLALLYQEYCMSNEMLILIGIVSVIALLLIMIKGKVHPFVAL
SLVSIAVALSSGIPMGKVVPTLISGMGGTLGSVALIVGLGAMLGKIIEKS
NGADVLASWLLDKFGEKRAPFALAMTGFIFGIPVFVDVGFIVLIPIIFSV
ARRIGGNMLVYALPIGLSMLTVHVLMPPHPGVVAGAQVLNADIGLVLGLG
FIAALPAVLIGQTFIPLFTKNNFVAIPASSDLLEYQKQVSKNVDGLPKFA
TVLAMIVFPLLLIMSGTVSATVLPKESIVREFFSMVGASPFALLLAVCVS
SYILGIRRGWRKEQLEEILNSALAPIAGIILITGAGGMFGKVLNESGVGN
ALADVLSSTGLPILALSFILAAMLRAAQGSATVAVITTATILAPAVTSAG
YSDIQTALVTAAIGAGSMTLSHVNDSLFWVWTKFFGITITQGLRTWSILS
TIYGSLAFLIVTLMWMFA
>MS0954 gntT, GntT protein
MLIFIMIASVALLLLLIMKFKVHAFVALTIVSLLTALATGIPINKILPTL
LNGFGNTLASVALLVGLGAMIGRLLEITGGAKVLADTLINKFGEQKAPLA
LGIASLLFGFPIFFDAGLVVMLPIIFSVAKQFGGSLIRYAFPAAGAFAVM
HAFSVPHPGPVAAGDLLGANIGLLTIIGLICAIPTWYIATYLFGLHLGKK
YHLDLPKAFLNAMPINETAVLTPPSFKKVILILLLPLGINYAGYGVKYFS
RCKSN
>MS0335 gntT, GntT protein
MIMSITVAFIIGVAVLLFLALKLKVSAFLSLLATALTIGILSGMGTTEII
KDIVAGFSKSVGSIGLVIIFGTMLGNYLEQSRAAHKMALDAVRLVGTKNS
SIAMSISGYLISIPVFSDVGFLILSPLIKAISKKSKIPLAALAVALSAGL
LATHVYVPPTPGPLAAAGLLGIDIGRAIIWGAFAAVVMTLFGWMYAHFYL
MKKSPDYYTFVETVVEEKEVDETNLPGSLASLMPLLLPIVLILLNTTCAA
IFPKDSPVLSVTKFIGDSNIALVIGALTAIALLGKRIGKEKVLKIMDSSL
KDAGSIIFITAAGGALGQILKTSGAGDSLAQAVVSSGLPFILIPFVISAI
LKIVQGSGVVAVITSATLAAPIATQLGIDPILIFLASGAGARAYCHVNDS
YFWVYTNCCGFDMKTGLKTLSNASIFMSLGGLLATFIASLII
>MS0688 gntT, GntT protein
METAASMSQMLIGLAIGIALLLILAMKTRIHVFVALILASLTTGLIGGLP
FAEVISSVTKGFGSTLGSTGIIIGLGVMMGAILEKSGAAEQMAFSIIKLI
GKAKEEWALALTGYVVAIPVFADSGLIILTPLARSLSRMTGKSVIGLGLA
MATGLQLAHVFIPPTPGPLAVAGILDIDMGMMIIWGMILTVPTLVMSTLY
AKWLGKKIYQIPNEDGTDFERKEFKEEYIKSIENVEQIYKDKNLPGAGLS
FSPIVIPLILILGNTTVNFLKIENGFADLLKIVGHPIIALIIGLLIALYG
LGRRLSKAETNKAIEDGVKSTGMILFITGAGGALGYVVRDAGIGNALGEA
VLTVGIPGILIPFVIAALMRIALGSATVALITAATLAAPLVPQLGLNPTL
VAMSTCAGAVSFSYFNDSGFWVFNGLYGLKEVKDQFMAKTMVSFIGAFSC
LALVLIFNIFM
>MS0671 gsp, Gsp protein
MSEISPNIPTHDAFGSLLGYAPGGIAIYSSDYETADKNEYPDDAAFRSYL
GREYMGYKWQCVEFARRYLYLNHGMVFTDVGMAYEIFSLRFLRQVVNDAL
VPLQAYANGSKKSPEPGALLIWQEGGEFQETGHVAIITEVFNDKIRIAEQ
NVIHYRLPSGQQWTRELPMSVTEQGYILHDTFDDTEILGWMIQTDDSTYS
LPQPTAAPESLEIHAEHIENKGQFDGKWLNESDPFEKLYVTAMNGHQVSR
TDQYRYFTISETAKHELIRATNELHLMYLHATNKVLNDDNLLKYFNIPKL
LWPRLRLSWENRRYQTVSGRLDFCLDERGLKVYEYNADSASCHAEAGAIL
GRWAKVAGLDNGEDPGAHLRNALADCWKHRDNTPLVHIMQDNDSEEDYHS
MFMQSALLQAGCRTKIIHGTEGLHWDKRGRLLDDEDNQILSVWKTWAWET
MLEQLREDATGREVAPPIRTGYPEDKVRLIDVLLRPEVLVYEPLWTAIPS
NKAILPVLWSLFPNHRYLLESGFELTQNLIKNGYAKKPIAGRRGDNVTLF
ADQHSRLDVTHGRFGKQEHIYQQLWCLPKVEEQYVQICTFTVGGHYGGSC
LRSDPSRIIVGDSDMQPLRVLNDKDFLAK
>MS1883 hisA, HisA protein
MKKSIIIPALDLIDGNVVRLHQGDYAKQTTYSDNPIEQFASYLAQGAEQL
HLVDLTGAKDPAKRQTALIGKIIAATHCKIQVGGGIRTEKDVADLLAVGA
NRVVIGSTAVKERAMVKEWFNKYGAEKFVLALDVNIDASGQKIIAISGWQ
EASGVSLEELIEDFQSVGLQHVLCTDISRDGTLAGSNVDLYKEICAKYPA
VNFQSSGGIGSLEDIKALKGTGVAGVIVGRALLEGKFNVAEAIECWQNG
>MS0435 hisB, HisB protein
MNKERMVKKAIFLDRDGTINIDHGYVHKIDDFHFIEGSIEALEELKNMGY
LLVLVTNQSGIARGYFSEDEFLQLTEWMDWSLADRNVDLDGIYYCPHHPE
GLGEYRQDCDCRKPKPGMLLQAIEELNIDPAQSFMVGDKVEDLKAAVSAN
VKYKVLVKTGKTVTQAGEQLADYVLDSIADLPRIIKRLKK
>MS1890 hisB, HisB protein
MTQQPTLFIDRDGTLIDEPKTDFQIDSLEKLKFERNVIPALLKLKNRYRF
VMVSNQDGLGTDSFPQEDFDKPHNAMLAVFRSQGIEFDDILICPHKPEDN
CDCRKPKIKLLKKYIDKKLFDPADSFVIGDRPTDVQLAENLGIRALQYHP
ENLDWDMIAEKLLREPVADPKGLGQPRHAVVARKTKETDIKVEVWLDEAG
VNQINTGIGFFDHMLDQIATHGGFRMNVSCKGDLHIDDHHTIEDVALALG
AALKEAIGNKRGIQRFGFVLPMDECKAECALDLSGRPYFKFKAKFNRDKV
GDFSTEMTEHFFQSIAYTLLATLHLSVKGDNAHHQIEALFKAFGRTLRQA
IKIEGNEMPSSKGVL
>MS1574 hisC, HisC protein
MVVRPNFKTTIPNNHKSAVENMTFLQQANTGVQALSPYQAGKPIEELERE
LGISNIIKLASNENPFGFPESAKKAIQNQLDNLTRYPDSNGFSLKAAIAE
KFNLQPEQITLGNGSNDLIELIAHTFATEGDEIIFSQYAFIVYPLITKAI
NAKAREIPAKNWGHDLEAFLAAINEKTKLIFIANPNNPTGNFLTEAEIDS
FLAKVPPHIVVALDEAYTEFTAKEERVNSLALLKKYPNLVVSRSLSKAYG
LAGLRIGFAVSNPEIAGLFNRVRQPFNVNSLALAAAEAVLNDDDFVEKAA
ENNRRELKRYEEFCQKYGLQYIPSKGNFITIDFQQPAAPVYDALLHEGVI
VRPIAGYGMPNHLRISIGLPEENQRLFDALIKILNLK
>MS1891 hisC, HisC protein
MSISQLSRKNVQALTPYQSARRLGGNGDVWLNANEYPTSPDFNLSERIFN
RYPEPQPEAVIKGYAAYADVKPENVIVTRGGDESIELLIKGFCEPEDKVL
YCPPTYGMYAVSAETLGIATKTVPLTEDFQLDLPEIEKNLAGVKVIFVCS
PNNPTGNVLNQADLIRLLDITAGSAIVVVDEAYIEFSPETSMIKQLGNYP
HLAIIRTLSKAFALAGLRCGFTLANPELIGVLQKVIAPYPLPVPVSDIAA
QALQPQGVAQMKMRVADVLANRAWLIGELKQIPSVVKIFATEANYVLVKF
QDGEKVFNALWEKGIILRDQHKAFGLKNCIRISIGTRAELEKTVVALKLA
>MS1892 hisD, HisD protein
MQTLIWKDLTEQEKKQALTRPAISAAGNIKDAVDAIRENVVANGDKALFE
LSEKFDRVKLNSLEVSEQQIEEAAQRLPEELKQAIQNAKKNIEAFHLAQV
PVEADVETQSGVRCQVLTRPINRVGLYIPGGSAPLFSTVLMLAIPAKIAG
CKKIVLCSPPPIADAILYAANLCGVETIYQVGGAQAVVAMAFGTETVAKV
DKIFGPGNAFVTEAKRQVSQAVNGAAIDMQAGPSEVLVLADENADPDFVA
SDLLSQAEHGADSQVILVTPSERLALETELAVERQLTTLPRSEIAQKALA
HSRIFIAENLQQCVEISNEYAPEHLVVQVQNARDLLSNIDNAGSIFLGAY
SPESMGDYASGTNHVLPTYGYTRTSSSLGLADFSKRMTVQELSPQGFKDL
AKTVEVMAAAERLDAHKQAVSIRLAKIK
>MS1882 hisF, HisF protein
MLAKRIIPCLDVRNGQVVKGVQFRNHEIIGDIVPLAARYAEEGADELVFY
DITASSDGRTVDKSWVERVAEVIDIPFCVAGGIKTIADAEQIFTFGADKI
SINSPALADPDLISRLADRFGVQAIVVGIDSWFEQETGKYWVNQYTGDES
RTRQTNWQLLDWVKEVQKRGAGEIVLNMMNQDGVRNGYDLTQLKLVRDVC
KVPLIASGGAGEMVHFRDAFIEANVDGALAASVFHKQIINIGELKEYLAR
EGVEVRR
>MS1893 hisG, HisG protein
MSTNKRLRIAMQKKGRLSDESQELLKQCGVKINLQGQKLIAYAENLPIDI
LRVRDDDIPGLVFDGVVDLGIIGENVLEEEELTRTAAGDKVEYKMLRRLE
FGGCRLSLAVDSDVEFDGPESLSDCRIATSYPQLLKRYMAEQGVPFKSIL
LNGSVEVAPRAGLADAICDLVSSGATLEANGLKEVEVIYRSKACLIQRKE
PLSEEKQALVDKILTRIQGVQQADESKYIMLHAPKDKLEEITALLPGVEN
PTILPLAHDDTKVAVHVVSQENLFWETMEQLKEKGASSVLVLPIEKMLA
>MS1885 hisH, HisH protein
MIIIDTGCANLSSVKFAFDRLNIKAEISRDIATIKSADKLLLPGVGTAMA
AMKILQDRNLIETIQNATQPMLGICLGMQLMTEYSSEGNVPTLSLMSGHT
DLIPNTGLPLPHMGWNKVRYEQDHPLFAGIEQDSHFYFVHSYAVLPNEHT
IATSDYGVPFSAALGCKNFYGVQFHPERSGKNGAQLLKNFVENL
>MS1881 hisI, HisI protein
MQNKINWQKVDNLLPVIIQHFQTCEVLMLGYMNQEALAKTCDEKVVTFFS
RTKQRLWTKGETSGNFLNAVDMSLDCDNDTLLILADPIGPTCHTGEESCF
HQFATQSEGDWTWFAKLERVLAERKFADPESSYTATLYAKGTKKIAQKVG
EEGVETALAALSKDKGEIVSETADLIYHLTVMLHEQNLEWGDVIDKLKER
HQGIGLHPEGSNK
>MS2218 ilvA, IlvA protein
MVNNLSNAPTGAEYLRAILISKVYEAAKVTPLQLMPKLSERLGNRIYVKR
EDHQPVHSFKLRGAYAMISGLTQAQKEAGVITASAGNHAQGVALSAKNAG
IRALIVMPQNTPSIKVDAVRGHGGEVLLHGANFDEAKAKAIELSQTEQMT
FIPPFDHPAVIAGQGSIGMELLQQNGHINRIFVPVGGGGLLAGVAVLIKQ
LMPEIKVIGVEAKDSACLYYALKAGRPVDLERVGLFADGVAVKRIGDETF
RICQQYVDDVILVDGDEICAAMKDMFENVRAVPEPSGALSLAGLKKYAKQ
HNLQGETLVNLLSGANLNFHTLRYVSERCEIGEKHEALFAVTIPEQRGSF
LKFCQILGQNAVTEFNYRYADEKQACIFVGVRITGEQEKQVIIQQLKQGG
YDVQDLSDDDIAKTHIRYMVGGRSSSDLNERLYSFEFPEQKGALLKFLET
LGTTDANISLFHYRGHGADYGDVLAGFQINDADLPAFKQHLEKLGYAYQD
VTDSPSYRYFLG
>MS1319 ilvB, IlvB protein
MKMKKLSGAEMVVQSLRDQGVKYLFGYPGGSVLDIYDAIHTLGGIEHVLV
RHEQAAVHMADGYARSTGEVGCVLVTSGPGSTNAVTGILTAYTDSVPLVI
ITGQVRSNLIGTDAFQECDTIGLTRPVVKHSFMVKHAEDIPETIKKAFYI
ASSGRPGPVVIDIPKDVVNPANKYTYEYPKEVSLRSYNPNVQGHKGQIKK
ALKALLVAKKPVLFIGGGVIIGNSSEKLTQFAQLLNLPVTSSLMGLGGYP
GTDKQFLGMLGMHGTYQANMAMHNADLILGIGVRFDDRTTNNVEKYCPHA
KVIHVDIDPTSISKNIAADIPIVGSVDNVLTEFLSLLEDDNLSKSQSDLT
EWWKQIDEWKAKKCLEFDRTSQAIKPQAVVEAIYRLTKGEAYIASDVGQH
QMFAALHYPFDKPRHWINSGGAGTMGFGLPAAIGTKFAHPDSRVVCITGD
GSIQMNIQELSTAKQYGTPIVIVSLNNRFLGMVKQWQDLIYSGRHSQVYM
NSLPDFAKLAEAYGHVGIQINTADELEEKLTQAFAVKDKLVFVDVLVDAT
ENVYPMQITGGGMNEMLLGKPAEK
>MS2223 ilvB, IlvB protein
MNGANLVTECLKAHNVDTVFGYPGGAIMPVYDALYDCGINHLLCRNEQGA
AMAAIGYARSTGKTGVCIATSGPGATNLITGLGDALMDSIPLVAITGQVA
APLIGTDAFQEADVLGLSLACTKHSFIVQNIEELPEIFAKAFKIAQSGRP
GPVLIDIPKDVQFAETLLQPIVYSVEKPTALSAKSLEKAVELLKNAKRPV
AYIGGGVGMAKAVPALHEFLTATRIPTICTLKGLGAVPADNPYYMGMIGM
HGTKAANYATQEADLLLVLGARFDDRVTGKLSSFATEAKVIHADIDVAEI
NKLRRADVALCGDLEQALKALSFALDIEPWRADVQRLKRDFDWDYGENEG
EGDINPLFLLNRVSRLKAENAIVVTDVGQHQMWAAQHMSFGKPENFITSA
GFGTMGFGLPVAIGAQKARPRDQVILVTGDGSIMMNIQELGSIKRAKTPI
KILLLDNQRLGMVRQWQSLFFHGRHSSTILDDNPDFVTLASAFGIRGERI
EKAGEVNEALDRFFASQEAYLLHVCVHEDENVWPLVPPGACNVEMIEEMS
>MS0045 ilvC, IlvC protein
MSNYFNTLNLRQKLDQLGRCRFMERSEFADGCNFLKGKKIVIVGCGAQGL
NQGLNMRDSGLDISYALRPEAITEKRASFQRATENGFKVGTYQELIPTAD
LVVNLTPDKQHSKVVADVMPLMKQGASFGYSHGFNIVEVGEQIREDITVV
MVAPKCPGTEVREEYKRGFGVPTLIAVHPANDPKGEGMAIAKAWASATGG
DRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSIVCYDKLVADGKDPAYA
GKLIQYGWETITEALKQGGITLMMDRLSNSAKIRAFELAEEIKEHLNFLY
LKHMDDIISGEFSATMMADWANGDKDLFAWREATGKTAFENAPKADGIKI
SEQEYFDNGVVMVAMVKAGVEMAFDAMVASGIYEESAYYESLHELPLIAN
TIARKRLYEMNVVISDTAEYGNYLFSNVATPILAKEIVSQLKRGDLGEPT
PAAEIDNVYLRDINDTIRNHPVELIGQELRGYMTDMKRISSQG
>MS2219 ilvD, IlvD protein
MEIFMPKLRSATSTQGRNMAGARSLWRATGMKEGDFGKPIIAVVNSFTQF
VPGHVHLHDIGQMVVKQIEAAGGVAKEFNTIAVDDGIAMGHGGMLYSLPS
RDLIADSVEYMVNAHCADAMVCISNCDKITPGMLMAAMRLNIPTIFVSGG
PMEAGKTKLSDQLIKLDLIDAMIQSADKNVSDSDVDAIERSACPTCGSCS
GMFTANSMNCLTEALGLSLPGNGSCLATHADRKQLFLDAATQIVELCKRH
YEQDDYSVLPRSIATKAAFENAMSLDIAMGGSTNTVLHLLAVAQEAEVDF
TMADIDRLSRIVPCLSKVAPNTNKYHMEDVHRAGGVMAILGELDRANLLH
HDTKTVLGLTFAEQLAKYDIKLTRDEAVKTFYRSGPAGIRTTEAFSQDCR
WETLDDDRENGCIRDKAHAYSQDGGLAMLSGNIALDGCIVKTAGVDESIL
KFTGEAIVFESQEDAVDGILGGKVKAGHVVVIRYEGPKGGPGMQEMLYPT
SYLKSMGLGKACALLTDGRFSGGTSGLSIGHCSPEAASGGTIGLVRNGDI
IAIDIPNRSIQLQVSDEELATRRAEQDVKGWKPANRAREVSFALKVFGHF
ATSADKGAVRDKTKL
>MS0896 ilvE, IlvE protein
MKDLDWKNLGFGYTKTDYRYIAYWKNGEWQKGELTKDNTLHISEGSPALH
YGQQCFEGLKAYRTKDGSIQLFRPDQNALRMQQSADRLLMPRVPVDMFID
ACKQVVKANEEWVGPYGSGATLYLRPFLIGVGDNVGVHPAKEYIFSIFVC
PVGAYFKGGLAPSKFLISTHFDRAAPHGTGAAKVGGNYAASLYPGKYAKE
HGFADCIYLDPATHTKIEEVGSANFFGITKDNKFITPISPSILPSITKYS
LLYLAKERLGLEVEEGDVYVKDLDQFAEAGACGTAAVITPISGVQIDDKY
HVFYSETEIGPITQKLYDELTGIQFGDKPAPEGWIVKVE
>MS2192 ilvE, IlvE protein
MCRIGIFMDYPLFETVAVERGEILNLDYHQTRYEQALHQYYGRKVLPFNL
QEILQKSTALLTLKRSEPLIRCRIDYNDQDYRLQCFAYQRKVFRSFQPVI
CDHIDYGLKFSDRRIFAELLRQKGKHDEIIIIKQGLVTDCTIGNLLFRKN
QQWFTPEAPLLNGTQRAKLLAEKRIQTLNIKRQDIAQFDEIRLINAMNPF
SESL
>MS1318 ilvH, IlvH protein
MRRILSVLLENESGALSRVVALFSQRAFNIESLTVAPTDDPTLSRMTIEA
SGDEAILEQIEKQLHKLVDVFKVINLSDCEHVEREVMLLKLRATGSTRDE
IKRLTDIFRGQIVDVTTKSYTIQLAGTKDKLNAFVSAVKEETTIIEIVRS
GLISLSRGEKNCL
>MS0507 kamA, KamA protein
MRILTQNNPVREENWLEILANSISDPEVLLKTLSLPIDKFEKDIHARKLF
AMRVPLPFVRKMELGNAQDPLFLQAMSSADEFLTADGFSKDPLEEQQVVA
PNILHKYKNRLLLMVKGGCAINCRYCFRRHFPYADNQGNKANWQKALDYI
SANPQIEEVIFSGGDPLMAKDHELDWLIKKLEKIPHLQRLRIHTRLPVVI
PQRITGAFCKILTESRLNTVLVTHINHGNEIDEQLTRALNKLKNAGVVLL
NQSVLLKNINDNAQTLKNLSDKLFRAGILPYYLHLLDKVEGASHFYVPDQ
RAVEIYRELQSLTSGYLVPKLAREIAHEPNKTLYGG
>MS0599 leuA, LeuA protein
MNVHNKRIRTMANNRVIIFDTTLRDGEQALKASLTVKEKLQIALALERLG
VDVMEVGFPVSSAGDFESVQTIAVHVKNSVVCGLSRAVNKDIDAAAEALK
VAERFRIHTFIATSALHVEAKLKRSFDDVVEMAVAAVKRARRYTDDVEFS
CEDAGRTGIDNICRVVEAAINAGATTVNIPDTVGFCLPTEYGNIIHQVMN
RVPNIDKAVVSVHCHNDLGMATANSLTAVLNGARQIECTINGIGERAGNT
ALEEVVMSIKTRQDLFGVDTRINTQEIHRVSQMVSQICNMPIQPNKAIVG
ENAFSHSSGIHQDGMLKNKNTYEIMSPETIGLKKEKLNLTARSGRAAVKG
HMADMGYTEQDYDLDKLYEAFLKLADKKGQVFDYDLEALAFIDMQQGDED
RLKLDVITSQTISTLPASAFVQVELDGKRINKTSNGGNGPVDAVYNAIMQ
IVGMDLKMSHYNLTAKGEGAEALGQVDIVVEYQGRKFHGVGLATDIVESS
ALALVHAINAIYRSQKVADLKKDLKHIHTV
>MS1105 leuB, LeuB protein
MVIMTHKIAVIPGDGIGIEVINEGVKVLNCVSQLDPKIQFEFTHFPWGCE
FYSKTGRMMDDDGIERLSKFDGIFLGAVGYPGVPDHISLWGLLLRIRKSF
DQYVNVRPVKLLKGAPCPLKEKSPKDINMIFIRENSEGEYAGSGSWLYRD
KPNEVVIQDGVFSRVGCERIIRYAFELARTEKKSLTSISKGNALNYSMVF
WDQIFQQLSQEYPDVETHSYLVDAAAMLMITKPERFEIVVTSNLFGDILT
DLGAAIAGGMGLAAGANLNPEGNFPSMFEPIHGSAPDIAGKQLANPLATV
WSASQLLEFFGYKEWAARLIDAIEYLLVEQKTLTPDLGGTAKTADVGDAV
VAYLQKHFA
>MS0598 leuB, LeuB protein
MSTYNVAVLPGDGIGPEVMAEAIKVLDKVQAKFGFKLNFTQYLVGGAAID
AKGEPLPAETLQGCDNADAILFGSVGGPKWTHLPPDQQPERGALLPLRKH
FKLFCNLRPATLYKGLEKFCPLRADIAAKGFDMVVVRELTGGIYFGQPKG
RDGEGSDTRAFDTEVYYKYEIERIARAAFDAAMKRRKQVTSVDKANVLQS
SILWRETVAEIAKEYPEVQVENMYIDNATMQLIKAPESFDVLLCSNIFGD
IISDEAAMITGSMGMLPSASLNEEGFGLYEPAGGSAPDIAGKGIANPIAQ
ILSAAMMLRYSFNLNEAATAIENAVQKVLADGHRTGDLADNSTPVSTAEM
GTLIANAI
>MS0333 leuC, LeuC protein
MENAMSKTLYDKHIDSHTIKELDNEGNVLLYIDRTILNEYTSPQAFSGLR
EENRDVWNKKSILLNVDHVNPTRPVRDANMTDPGGTLQVNYFRENSKLFD
IELFDVTDPRQGIEHVVAHEQGLALPGMVIAAGDSHTTTYGAFGAFGFGI
GTSEIEHLLATQTLVYKKLKNMRVTLTGKLPFGTTAKDVIMALVAKIGAD
GATNYAIEFCGEVIDELSVEGRMTICNMAVECGARGAFMAPDEKVYEYIK
GTPRAPKGEMWDLAIAEWRKLKSDNDAVFDKEIHMDCSDLEPFVTWGISP
DQADVISGEVPDPNLLPEGQKRKDYQAALEYMGLEPGMKFEEIKISHAFI
GSCTNGRIEDLREVAKVLKGRKIAQGVRGMIIPGSTQVRARAEAEGLAKI
FIDAGFEWRQSGCSMCLAMNEDVLSPGDRCASGTNRNFAGRQGAGSRTHL
MSPAMVAAAAVAGHLVDVRKFVEGD
>MS0596 leuC, LeuC protein
MAKTLYQKLFDAHVVYEAEGETPILYINRHLIHEVTSPQAFDGLRVAGRQ
VRQVSKTFGTMDHSISTQVRDVNKLEGQAKIQVLELDKNCKATGISLFDM
NTKEQGIVHVMGPEQGLTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVE
HVLATQTLKQARAKSMKVEVRGKVNPGITAKDIVLAIIGKTTMAGGTGHV
VEFCGEAIRDLSMEGRMTVCNMAIEFGAKAGLVAPDETTFEYLKGRPHAP
KGKDWDDAVAYWKTLKSDEDAQFDTVVVLEAKDIAPQVTWGTNPGQVIGI
DQLVPNPAEMTDPVTKASAEKALAYIGLEPNTDLKNVPVDQVFIGSCTNS
RIEDLRAAAAVMKGRKKADNVKRVLVVPGSGLVKEQAEKEGLDKIFLAAG
AEWRNPGCSMCLGMNDDRLGEWERCASTSNRNFEGRQGRNGRTHLVSPAM
AAAAAVFGKFVDIRNVSLN
>MS0334 leuD, LeuD protein
MDKFTLITAKAAPMMAANTDTDVIMPKQFLKGIDRKGLDRGVFFDLRFNL
DGTPNEKFILNQADWQGSQFLVVGPNFGCGSSREHAVWGLKQLGIRALIG
TSFAGIFNDNCLRNGVLTICVSDQEIEQIATTVSNPATNTISVDLEGQKV
LTENGEIAFDVDPLKKEMLIKGLDAVGFTLSMKDDILAFEQSYFKANPWL
KL
>MS0595 leuD, LeuD protein
MTRIDKMAGLKQHSGLVVPLDAANVDTDAIIPKQFLQAITRVGFGKHLFH
EWRYLDAEETQPNPEFVLNFPQYQGASILLARKNLGCGSSREHAPWALAD
YGFKVMIAPSFADIFYNNSLNNHMLPIKLSEQEVEEIFQWVWANPGKKID
VDLEAKTVTVGEKVYHFDLDEFRRHCLLEGLDNIGLTLQHEDAIAAYESK
IPAFLR
>MS2084 lysA, LysA protein
MDFFQYKNNKLYAEDLLVSELAEQFGTPLYIYSRATLERHWKAFDSALGD
HPHLVCFAVKSNPNIAILQVMAKLGAGFDIVSQGELERVIAAGGDPHKVV
FSGVAKNEKEIARALELDIRCFNVESLAELQRINEVAGKSGKIAPISLRV
NPDVDAHTHPYISTGLKENKFGVSVDEAREVYRLASRLPNIKVTGMDCHI
GSQLTEIQPFLDATDRLILLLEQLREDGIELEHLDLGGGLGVTYSDETPP
HPSEYATALLNKLKQYTNLEIIMEPGRAISANSGILVTKVEYLKSNETHN
FAIVDAGMNDMIRPALYQAYMNIIEADRTLNRESKIYDVVGPICETSDFL
GKQRRLAIAPGDYLVQRSAGAYGASMSSTYNSRPLTAEVMVDGSQAHLIR
RRAELTELWALESLLP
>MS1613 lysC, LysC protein
MANLSVAKFGGTSVANYAAMTACAKIVIADPNTRVVVLSASAGVTNLLVA
LANGCEATQRAKLLAEVRQIQENILNELKDAGTVRLEIEELLTNIEYLAE
AASLATSSALTDELISHGEMMSTKIFVQVLRELNAQATWVDVRTVVATNS
NFGKAAPDDEQTQKNSDNVLKPLIDRGELVITQGFIGRDPNGKTTTLGRG
GSDYSAALIAEVLNAKDVLIWTDVAGIYSTDPRIVPNAQRIDTMSFAEAA
EMATFGAKVLHPATLLPAVRSNIPVYVGSSKAPEQGGTWVTRDPQPRPTF
RAIALRRDQTLLTLSSLNMLHAQGFLANVFNILAKHKISVDTITTSEVSV
ALTLDKTGSASSGAELLSSDLLNELSEVCTVKVDTGLALVALIGNDLHLS
AGIAKRIFGTIEEYNIRMISYGASTNNICTLVHSAHADDVVRALHKELFE
>MS1703 lysC, LysC protein
MRVLKFGGTSLANPERFLQAARLIEKAHLEEQAAAVLSAPAKITNHLVAL
SEKASLNQPTETNFNEALDIFYNIINGLHEKNNNFDLKGTSQLIESEFNQ
LAELLEQIRQAGKVEDAVKATIDCRGEKLSIAMMKAWFEACGYEVTVINP
VEKLLAYGNYLESSVDIEESAKRVDVASIPKNNVVLMAGFTAGNEKGELV
LLGRNGSDYSAACLAACLNASACEIWTDVDGVFTCDPRLVPDARLLPSLS
YREAMELSYFGAKVIHPRTIGPLVRSNIPCLIKNTGNPTAPGSIIDGNEP
QSGELQVKGITNLDNVAMFNVSGPGMQGMVGMAARVFSTMSKAGVSVILI
TQSSSEYSISFCVPSKLAAKAKDALNTEFAKELLDKDLEPVEVIEDLSII
SVVGDGMKQAKGIAARFFSALSQANISIVAIAQGSSERSISAVVAQNKAI
EAVKSTHQALFNNKKSVDMFLVGVGGVGGELIEQIKQQKEYLAKKDIEIR
VCALANSNKMLLNENGLSLDNWKEDLSNATQPSDFDVLLSFIKLHHVVNP
VFVDCTTAESVSGLYARALSEGFHVVTPNKKANTREMAYYNLVRENARKN
QRKFLYDTNVGAGLPVIENLQNLLAAGDEVERFNGILSGSLSFIFGKLEE
GLTLSQATALAREKGFTEPDPRDDLSGQDVARKLLILARESGLELELSDV
EVESVLPKGFSEGKSAVEFMEILPQLDAEFAARVEKAGAQNKVLRYVGQI
NDGKCKVSIVEVDADDPLYKVKNGENALAFYTRYYQPIPLLLRGYGAGNA
VTAAGIFADILRTLRN
>MS0924 mET2, MET2 protein
MDSSMSAQQVTLFTEQPLDLIFGGRLGQIDVAYQTYGTLNEDKSNAVLIC
HALTGDAEPYLSPVENQAGGWWQSFMGEGLALDTSRYFFICSNVLGGCKG
TTGPASINPKTNKPYGSQFPKVTVQDIVRLQKALISHLNIPHLHAVIGGS
FGGMQATQWAIYYPDFVDKVVNLCSSLTFSAEAIGFNHVMRQAIINDPNF
NNGDYYEGEPPENGLSIARMLGMLTYRTDLQLAKAFGRATKNEGHYWGDY
FQVESYLSYQGQKFLGRFDANSYLHLLRALDIYDPSIGFDNIKEALSRIK
AHYTLVAVTNDQLFKLTDLHKSKTLLEQAGVPLDYYEFPSDYGHDAFLVD
YDTFEPKIRSGLE
>MS0553 mHT1, MHT1 protein
MPITILDGGMSRELMRRNAPFRQPEWSACALYEEPSAVQAVHEDFIAHGA
EVITTDSYAVVPFHIGEQRFHTDGKTLADLAGRLAKSAVKNSGVLTTKIA
GSLPPMFGSYRADLIQPERFAEIAQPLIDGLSPYVDIWLCETQSAIIEPV
SIKALLPKDDRPFWVSFTLTDDELTCEPQLRSGETVKSAVEKMVDLGVDA
ILFNCCQPEVIGEALAVTTATLTALNATHIQTGAYANAFAPQPKDATAND
GLDEVRKDLDPPAYLAWAKKWTAQGASIIGGCCGIGVEYIETLAKNLK
>MS0812 malK, MalK protein
MENIVQSKPIIELRSLKKSYNENTIIDNFNLTINNGEFLTILGPSGCGKT
TVLRLIAGFEEANGGQIILDGEDVTDLPAEHRPVNTVFQSYALFPHMTIF
ENVAFGLRMQKVPNEEIKPRVLEALRMVQLEEMADRKPTQLSGGQQQRIA
IARAVVNKPKVLLLDESLSALDYKLRKQMQNELKALQRKLGITFIFVTHD
QEEALTMSDRIIVLRKGNIEQDGSPREIYEEPSNLFVAKFIGEINIFDAQ
VLNRVDEKRVRANVEGRVCDIYTDLAVKEGQKLKVLLRPEDVQLEELDEN
EQSSAIIGHIRERNYKGMTLESTVELEHNNKLVLVSEFFNEDDPNIDHSL
DQRVGVTWIEKWEVVLNDENDNA
>MS0584 malK, MalK protein
MEQTDMAKLEIKNITKKFGDFYAANNISFTAEEGEFVTLLGPSGCGKTSL
LKLIAGFHIADEGEILIGGKNVNEIPPEKRNTAMCFQSYALFPHLNVSHN
ICYGLKQRKIDINEQKQRLDLAIKQMDLEIHRLKLPNELSGGQQQRVALA
RAMVTRPDVILFDEPLSNLDAKLRESVRFEIKQLSKQYNLTSIYVTHDQA
EALSMSDKIIVLNKGKIEQIGSPQEIYHHPINRFVADFIGIANITEAHVK
EMENNLYEVNSIYGNFTVYSEIKPQSDHIYICFRPEDIEIVPASENKENM
LTVDVTHTAFMGNITEIQALIRKDDKEQKLRLQLTKFPQLTENYQLSFCV
PRDAIKFLESVK
>MS1587 malK, MalK protein
MLSHHKNKNGGAYPTLYRQYNIMTNQNDNFLVLKNINKTFGKSVVIDDLD
LVIKRGTMVTLLGPSGCGKTTILRLVAGLENPTSGQIFIDGEDVTKSSIQ
NRDICIVFQSYALFPHMSIGDNVGYGLRMQNIAKEERKQRIREALELVDL
AGFEDRFVDQISGGQQQRVALARALVLKPKVLLFDEPLSNLDANLRRSMR
EKIRELQQSLSITSLYVTHDQTEAFAVSDEVIVMNKGKIVQKAPAKELYQ
QPNSLFLANFMGESSIFNGQLQGNQVTLNGYQFTLPNAQQFNLPNGDCLV
GIRPEAVTLKETGEPSQQCSIKTAVYMGNHWEIVADWAGQDLLINANPEV
FNPEQKQAYVHLSSHGVFLLKKE
>MS1520 metC, MetC protein
MSNKYSLATTLVHAGRSKRVSQGSVNPVVQRASSLVFDSIADKRQATVNR
AKQALFYGRRGTLTHFALQDLMCEMEGGAGCYLYPCGAAAVTNAILAFVQ
SGDNILMTGAAYEPTQDFCNKILSKMNVSTTYYDPMDGEKIAELVQPNTK
VLFLESPSSLTFEVPDVPNIVKAVRKINPEIVIMIDNTWAGGILFKALEH
DIDISIQAGTKYLVGHSDVMIGTAVSNARCWDQLRENSYLMGQMVDADTA
YTTARGIRSLAVRFKQHTESSIKVAQWLAEQPEVKAVFHPALPSCPGHEF
FKRDFTGSAGLFSFELKEQLSREKLERFMDNFKLFSMAYSWGGFESLILY
NQPADIAAIRPNIKRKLTGTLIRIHIGFEDVNELIEDLKAGFERLK
>MS1627 metC, MetC protein
MTQNYSIETILAQAGNKSDARTGAVSTPIFLSTAYGHRGIGESTGFDYTR
TKNPTRLVLEETIAKLENGDQGFAFSSGMAAIQVLMTLFTAPDEWIVSSD
VYGGTYRLLDFAYKNNNSVKPVYVNTASVEAIETAITPNTKAIFVETPSN
PLMEECNVTEIAKIAKKYNLLLIVDNTFLTPVFSRPLDLGADIVIHSATK
YLAGHNDTLAGLVVAKGQALCERIFYIQNGAGAVLSPFDSWLTIRGLKTL
ALRMERHQANAAAIAEFLKAQPQVKDVLYPNKGGMLSFRLQDENWVNPFL
KAINLITFAESLGGTESFITYPTTQTHMDIPAEERIARGVTNDLLRFSVG
LENVEDIKADLAQAFAQFK
>MS0941 metE, MetE protein
MTKLFPNATVRTSAPYRFDIVGSFLRSDAIKSARAACACGDISCADLTRA
EDAEIAKLVERQKSVGLHAVTDGEFRRTFWHLDFLAGLDGVEEVDAEKFS
VQFKHHNVRPKTLKIVAKVDFSENHPFVEHFRSVNELAKGTEVKFTIPSP
SMLHLITNVRATNYQPIPRYENNNQQLLDDIADAYIKAMNIFYKLGCRNL
QLDDTSWGEFCAEDKRAAYQERGFDLDQIAKDYVYMLNKIVDAKPAQDIA
ITMHICRGNFRSTWFSAGGYEPVAEILFGSCRVDGFFLEYDSDRAGDFKP
LRFIKNQQVVLGLVTSKDGTLENREDIINRIKEAAQYVDINQLCLSPQCG
FASTEEGNILTEEQQWAKLNFIREIAEEVWGK
>MS0787 metF, MetF protein
MSYAKDIDTLNQHVADLNGQINVSFEFFPPKNEKMEETLWSSIHRLKTLN
PKFVSVTYGANSGERERTHSVVKNIKQKTGLEAAPHLTGIDATPEQLKEI
AQDYWNNGIRRIVALRGDIPAGYTKTPFYASDLVALLRSVADFDISVAAY
PEVHPEAKSAQADLINLKRKIDAGANHVITQFFFDIDNYLRFRDRCASIG
IDAEIVPGILPVTNFKQLQRMAALTNVKIPNWLAVNYEGLDEDQTTRNLV
AASVALDMVRVLSREGVKDFHFYTLNRSELTYAICHILGVRPK
>MS1009 metH, MetH protein
MHNKIDILKASLAQRILILDGAMGTMIQQYKLSEQQFRGERFKQSSVDLR
GNNDLLSLTQPLLIQAIHEKYLQAGADIIETNTFSSTSIAQADYDLQAIA
YELNFAGAKLARIAADKYSSADKPRFVAGVLGPTNRTASISPDVNDPGFR
NITFMQLAEAYGEATRGLIAGGADIIMLETIFDTLNAKAAVFAIEQVFEE
LGVRLPVMISGTITDASGRTLSGQTTEAFYNSLRHAKPLSFGLNCALGPK
ELRQYVEQLSKISECYVSAHPNAGLPNAFGGYDLGAEEMAAQLKEWAESG
FLNIVGGCCGTTPEHIKAFAEAMQGVKPRPLPQIKTAMRLSGLEPLSIDD
DSLFVNVGERNNVTGSAKFKRLIKEEKFGEAIEIAIDQVENGAQVIDVNM
DEALLDSQKCMTRFLNIMATEPDAAKVPVMIDSSKWEVIEAGLQSIQGKG
IVNSISLKEGEEKFIRQAKLIRRYGAAAVVMAFDEKGQADTEARKVEICT
RAYDILVNQAGFPPEDIIFDPNIFAIGTGIEEHNNYGVDFINATGRIKQT
LPYAKVSGGVSNVSFSFRGNNPMREAIHAVFLYHAIKQGMDMGIVNAGQL
AIYDDLDPELREVVEDAVLNRRPDATDRLLEIAEKYRNQDSTGEDNGVAE
WRSWSVEERLKHALVKGITHFIIEDTEEARQKFSLPLEVIEGPLMAGMDV
VGDLFGDGKMFLPQVVKSARVMKQSVAYLEPFINATKQKGSSNGKVVIAT
VKGDVHDIGKNIVSVVLQCNNFEVIDLGVMVPADKIIETAIAEKADIIGL
SGLITPSLDEMEYFLGEMNRLNLNIPVLIGGATTSKEHTAIKLYPKYKYE
VIYTTNASRAVTVCAALMNPESKAELWARTRKEYEKIQQSFAERKPLRSS
LSLEQARANGFNPFAGEWANYQVPQPKQPGISEFKDVPIAMLRKFIDWSP
FFRVWGLMGGYPDAFDYPEGGEEARKVWHDAQIMLDEFENNGKLTPSGVL
GIFPAERAGDDIKIYQNSDRTLLAGVARHLRQQSERGKNSKIPYNLCLSD
FIAEGSNGQQDWLGMFAVCAGTQEHALVDSFKAKGDDYNAILLQAVGDRL
AEAMAEYLHFELRTRLWGYSDETFDNQALIDEKYIGIRPAPGYPSCPEHT
EKQLIWDLLEVEQRIGMKLTESYAMWPAASVCGWYFSHPASSYFTLGRID
EDQAADYAKRKGWDEREMRKWLGVSMK
>MS1966 metJ, MetJ protein
MGFFSLKYRQILRLLIGNFMADWDGKYISPYAEHGKKSEQVKKITVSIPI
KVLEILTNERTRRQLKNLRHATNSELLCEAFLHAFTGQPLPTDEDLLKER
HDEIPEQAKLIMRELGINPDEWEY
>MS1726 nifS, NifS protein
MKFPIYLDYAATCPADDRVAEKMMQYLTRDGIFGNPASRSHKFGWQAEEA
VDIARNHIADLIGADSREIVFTSGATESDNLAIKGAAHFYQTKGKHIITC
KTEHKAVLDTCRQLEREGFEVTYLAPKSDGLVDLDEFRAAIRPDTILASI
MHVNNEIGVIQDIEAIGKICREHKVIFHVDATQSVGKLPINLAELPVDLM
SMSGHKLYGPKGIGALYVRRKPRVRLEAIIHGGGHERGMRSGTLAVHQIV
GMGEAYRICKEEMAEEMAHVTKLRDRLYNGLKDIEETYVNGSMEHRVGSN
LNISFNFVEGESLMMALRDIAVSSGSACTSASLEPSYVLRALGLNDELAH
SSIRFSLGRYTTEEEIDYTIDLVKSAVKKLRDLSPLWDMFKEGIDMSKIE
WSAH
>MS0856 oppA, OppA protein
MFIRKVTFIGFLLFSAMLPFFSWAAPRVPEILTQNGLIYCTHSSGFSFNP
QTADAGTSMNVITEQIYNKLFEIKNNSSRLEPSLAQSYKISEDGKTITVY
LRKGVEFHHTPWFTPSRNFNADDVVYSLNRVLGHNTSLPEFNASEQQKGM
KRQYNIFHELAKKTRFPYFDSIKLNQKIESVTALDPYTVQINLFAPDASI
LSHLASQYAIIFSHEYALQLNADDNLAQLDLLPVGTGPYQVKNYFRNQYV
RLIRHENYWKKEAEIKNIIIDLSPDRTGRLAKFFNNECQIAAFPDVSQLG
LLQENGERFQTTLSDGMNLAFLAFNFKRPLMQDAEIRRGIAQAINRHRII
KDIYYNTASVANKIIPSVSWAGSDSNNHSFAYDYDPAQAKKVLQDRQLSL
DMWVLKEEQLYNPSPIKMAELIKHDLTKAGIEVKVRLISRNFLMEQLRNN
SENYDLILGGWLAVSLDPDSFMRPILSCGTTSEITNLSNWCSQSFEEILD
RALISNSTNERAVNYHLAEQEVLSELPILPIASVKRILISNSNVQGVEMS
PFGSISFEKLSFKKGEK
>MS1325 oppA, OppA protein
MSNKMRSSLFSGKFSLVAKSAVIFCCFLSSVGCDRIKNLFSDTKQSVSEQ
PAESMTSTKQIQTETVPEQHILSRGVYSDLVLNIRDVKSSEQADFMRDLF
EGLVIFDIHGNIQPAVAESWETKDNKTWIFTLRQDAKWSNGEAVTAEDFA
QAWKLLALSSSPLRQYLAFIHIDQAQEILEGKSDISQLGIKAQDEYHLQI
SLDKPISYLPEMLAHIALLPAYSGGNSNKGELISNGAYKLAGQKADTISL
VKNEFYWNAEKVSFPQVHYQKLADNTDVKKVDLVTDFRQIKMENVVNFPK
LCTYFYEFNLKDQNLAKTAVRNALNSMISSHNIVRDSGLSGFAVSYFVPR
NMEFESDESWQATVVEQILQQADFSEKNPLQFKLTYEQEGIHPNIANRLV
RSWSQSDLISVKMEPVNWSQLQEKRAKGDFQIIRSGWCADYNDPSAFLNL
LYSKNPDNKTGFSQERVDKLLEKAQQTISEPERNELYRQVLLISRQEHLF
LPIFQYAKAVYLNPTLQGFDIHNPTEVIYSKDLSRKPMRQKN
>MS2053 oppA, OppA protein
MKLTTKFTLAALVLSAIGFVQAAETTFINCTSRAPTGFSPALVMDGISYN
ASSQQVYNRLVEFKRGSVDIEPGLAESWDISDDGLTYTFHLRKGVKFHAN
KEFTPTREFNADDVIFSFQRQLDSNHPYHKVSNGTYPYFNSMKFPSLLKS
VEKLNDHQVRITLTRKDATFLASLGMDFLSIYSAEYADKMMRAGKPETID
NQPIGTGPFVFAGYQVDKAVRFVANKDYWKGKAAIDRLVFSITPDAGTRY
AKLQQGACDLAEFPNTADIERMKADKRIQMPSQESLNVAYIAFNTEKAPF
DNVKVRQALNYAVDKNTILNAVYQGAGIAAKNPLPPTIWGYNDQVQPYEY
NPEKAKQLLAEAGFPNGFETELWVQPVVRNSNPNPRRMSELVQSDWEKVG
VKAKLVSYEWGDYIKRAKAGELTAGTFGWSGDNGDPDNFLSPLLAGVNAG
NSNYARWKNAEFDALLDKAIGLTDKAQRAALYKQAQVIAHDQAPWIPMAH
AVTYAPLSARVRDFKQSPFGYTSFYGVRVEDKK
>MS0466 oppA, OppA protein
MKKICTILTALFTATCVYADSTNNRLDYASTKDIRDINPHLYAGEMAAQN
MVFEPLVINTNQGIRPFLAKSWRISEDGKSYLFHLRKDVKFTDGEPFNAF
VAKMNIEAVLANFNRHAWLELVRQIDSVRAPDEFTLELTLKNPYYPTLTE
LALTRPFRFLSPKCFNQGKTSQGVMCYAGTGPWILKKHKKNALADFSRNE
NYWGELPKLNGVTWHVIPERQTMLLALLKGDIQLIFGADGDMLDMDSFKQ
ISESGQFISAMSEANASRAIVLNSARTITSDQKVRQALQYAVDKAAIAKG
VFNDTESIAETLMAKNVPYADVDVQTYPFNLLKAAQLLEEAGWNLSVGKN
IREKAGKPLSLLLSYNINNAAEKEIAQLLQADFRKIGVDLQILGEEKQAY
LDRQKNGDFDLQYSLSWGSPYDPASFVSSFRIPAHADYQGQKGLPNKTEI
DEMIGELLITPNEQTRIKLYQKLFKTLAEQAVYVPLTYSKTKAIYSAQLE
GVGFNPSQYEIPFEKMSFKK
>MS1364 oppF, OppF protein
MSESIKQATPLLEAVNLKKYYPVKKGLFAKPQLVKALDGVSFCLEKGQTL
AVVGESGCGKSTLGRLLTMIETPTDGELYYNGQNFLENDKTTQKLRRQKI
QIVFQNPYGSLNPRKKIGSILEEPLVINTDLTAAQRKARVLEIMAKVGLR
AEFYHRYPHMFSGGQRQRIAIARGLMLQPDIVVADEPVSALDVSVRAQVL
NLMMDLQKEMGLSYVFISHDLSVVEHIADQVMVMYLGRCVEQGRVEAIFK
NPRHPYTQALLSATPRLSPKLSSERIKLEGELPSPLNPPKGCAFHTRCRL
ATERCKQEQPLLKDYSDGTRIACFMVE
>MS0462 oppF, OppF protein
MSLLKVENLTKSYRTFNSLFSHLSHPALQNVSFQLEKGESVGLIGENGSG
KSTLARIISGIEKADSGHVWLNGTDIYQRKNRRQQISVVFQDYFSSVNPT
MTVLQAICEPLLEQKQAAAKSLEPLVVQFLKKVNLSTDCLHKYIYQLSGG
QAQRVCLCRALINNPSLIILDEALSSLDIVTQVQLLELLIELKNEFQLSY
FFISHNIQMICYLCERVLFFKQGQIITQSDIENLAEIKSDYAQKLIRSVI
>MS1150 pabA, PabA protein
MATILFLDNFDSFTYNLVDQFRGLGHQVKIYRNDCDLALLESIALQPDTI
LALSPGPGTPAEAGNMLALIQRVKSAVPIIGICLGHQALIEAFGGKVVHA
GEVLHGKVSKINHDEQAMFLNLQNPMPVARYHSLKGSNLPEELVVNATYN
DIIMAFRHKNLPICGFQFHPESILTVQGAKLLENSVNWLLNK
>MS2194 pabA, PabA protein
MSKRLLIVNNHDSFTYNLVDLIRRLSVPMRVIEVEKLDLDEVEQFSHILL
SPGPDVPEAYPEMFALLTRYYRHKAILGVCLGHQTLCRFFGGRLYNLRQV
RHGVCGRLKVRSKSAIFSGLPEEFDIGLYHSWAVDSQNFPAELTITAECH
EEVVMAFEHKTLPIYGVQFHPESYISEYGEQMLINWLNS
>MS1550 pepB, PepB protein
MKYSVKQTALEQENKSLFIAIFENQELSPAALKLDLKLKGEITEAVKNGE
VSGKIGRILVLRHGAQRIILVGCGKQNEVTERQYKQIIQKAVKTAKETIA
TTIINALTEVKIKDRDLYWNVRFAVETIEEDNYIFEQFKSKKSENNSKLA
EIIFYTEENHEQAELAIRHATAISSGVKAAKDIANCPPNICNPAYLAEQA
NQLAGRSSLIETTVIGEKEMRKLGMNAYLAVSCGSKNEAKLSVMEYRNHE
NPNAKPIVLAGKGLTFDAGGISLKPAADMDEMKYDMCGAASVYGVMNAIA
ELQLPLNVIGVMAGCENLPDGNAYRPGDILTTMSGLTVEVLNTDAEGRLV
LCDTLTYVERFEPELVIDVATLTGACVVALGQHNSGLVSTDDNLAQDLER
AAKLANDKAWRLPLSEEYQEQLKSKFADLANLGGRWGGAITAGAFLSNFT
KNYPWAHLDIAGTAWLQGQNKGATGRPVSLLVQFLLNQVK
>MS0667 pepB, PepB protein
MQIQLSNLPAPKSWGKNPLLSFSDNQATIHLENSEKSDRTLIQKAARKLR
GQGIDDVELVGNDWSLENCWAFYQGFYTAKQDWAVEFPELGDDHEELLAR
IQCGDFVREIINLPSSVITPLELAQRSARFIAGLAEEYAGKSAVDFHIIS
GEELKAQNYLGIWNVGKGAENPPAMLQLDFNPTGNPESPVLACLVGKGIT
FDSGGYSIKPSNFMDSMRTDMGGAALVTGALGLAIARGLNRRVKLFLCCA
ENLVSGNAFKLGDIITYRNGVKAEILNTDAEGRLVLADGLIDASSENAQF
ILDAATLTGAAKVALGNDYHCVLSMDEELTTDLFNAAKEEQEPFWRLPFE
ELHRSQISSSFADISNTSSAALAAGASTATAFLSHFVKDYQQNWLHLDCS
ATYRKTPSDLWATGATGLGVQTIANLLLTKATQL
>MS0815 pepD, PepD protein
MQYNEQLLERFFNYVSLDTQSKPGAKTSPSTQGQLKLAKILEQELYSLGL
DEIEVSKHGIVTALLPGNIENSPTIGLIAHLDTSPQCSGKNVKPEVIENY
RGGDIALGLGDEFISPVTFTFLHKLVGKTLIVTDGTTLLGADNKAGIAEI
MTALSQLKESSVPRCHIRVAFTPDEEIGLGMKFFPIEKFSCDWAYTIDGG
AVGELEYENFNAAGATVTIFGRAIHPGSAKDKMVNALTLACEFQQGFPTD
EVPEKTEEKQGFFHLNSFHGDIEKVELHYLIRDFDKQAFTQRKAFLEKWV
DEFNCRKQLKEPVKVTITDNYYNMYDTVSKVPQSIELADSAMKACGIVPI
HQPIRGGTDGAWLAEKGLACPNIFTGGYNFHSKHELITLEGMCSAVDVIM
KIAQLAVK
>MS2118 pepD, PepD protein
MSEIQSLQPQLLWKWFDQICSIPHPSHHEEQLAEFIVNWAKGKGFYAERD
EAGNLLIRKPATKGMEHCQSVALQAHLDMVPQANEETDHDFTSDPIQPYI
DGEWVKAKGTTLGADNGIGMASALAVLDSENLAHPALEVLLTMTEEVGMD
GALGLRKNWLQSEIMINTDTEDNGEIYIGCAGGENADLTVPVQWQENNYE
HCYQISLKGLRGGHSGCDIHTGRASAIKTLARFLANLQQNQPHFEFSLSE
IRGGSVRNAIPREAFATLCFNGEPANFTQGVKSFESLLKTELAIAEPDLQ
LTAQPAEKATKVFAPNTKNNVVNLLNALPNGVIRNSDVVENVVESSLSIG
VLKTTEDAVKGTILVRSLIESGTNYINGLLISLTELCGASVQFSGRYPGW
EPHAETPILTLTKEIYGELLGYEPAIKVIHAGLECGLLKKIYPALDVVSI
GPTIVNAHSPDEKVHIPAVRTYWELLTKVLAGIPAKK
>MS1554 pepE, PepE protein
MQAVLSPLNMEIISGKMLRHNGESREEHLAEFLIVNPTALVYAHPESTAL
HIEGRQATILE
>MS1034 pepN, PepN protein
MHAKAKYRKDYKKPDFTVTDIHLDFQLDPQKTVVTAHSQYQRLNPAATVL
RLDGHSFQFASIKVNGKDFATYQQDGESLTLDLSDIDAERFELEVITRLV
PAKNTSLQGLYQSGEGICTQCEAEGFRQITYMLDRPDVLARYTTKITADK
SKYPYLLSNGNRIAGGDLEDGRHWVEWNDPFPKPSYLFALVAGDFDLLED
SFTTKSGREVKLELFVDRGNLNRASWAMESLKKAMKWDEERFDLEYDLDI
YMIVAVDFFNMGAMENKGLNVFNSKYVLANPETATDEDYLNIESVIGHEY
FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDVGSRAVNRIKNVKFLR
TAQFAEDASPMSHPIRPEKVLEMNNFYTMTVYEKGAEVIRMMHTLLGEKK
FQQGMKLYIAENDGKAATCEDFVAAMEQASGVDLTQFRRWYSQSGTPELT
VTDSYDEKKRSYKLYVSQMTAPTADQMDKVNLHIPLKIALYDMNGMPFSL
IKDDEAVNDVLDILLEDQVFEFHNITSKPVPALLCDFSAPVKLDYDYSTA
QLIALLKFAHNEFVRWDAMQMLFAQELRRNLSAYQQGEQLTFSAEILSAL
QQVLENYQSNVELTTLILTLPKETEFAELFKTIDPEGIAVVCDFMQHAIA
EGLQDLWLKTYHQINLEEYCIDMRDIALRGLRNLCLQYLAFTDYGNALVN
KHYLYADNMTDKLAALAAATKAQLTCRDKVMKDFEEKWQHDGLVMDKWFN
LQATRPDGNVLTLVKQLMDHPSFNFNNPNRLRALVGSFESQNLRAFHAVD
GSGYRFLTDVLLRLNESNPQVAARLVEPLIRFSRYDSQRQTLMKRALERL
REVENLSNDLFEKIEKALQ
>MS0479 pepP, PepP protein
MDLAYMAELPADEFVLRRQKLAAQLTDNSVFIVFSEVEKRRNNDCTYPFR
QDSYFWYLTGFNEPNSALVIQKKGKLVETTIFVRPSNPLMEIWNGRRLGV
ERAAEKLHLDQAFSIDDFARIFGKICQNSTALYHYQGLQPWADQLLAETF
ISPPDYINWAPMLDEMRLFKSANEVRLMQQAGQITALGHMKAMRQTRPNR
FEYEIESEILHEFNRFGARYPAYTTIVAGGENACILHYTENDQPLKDGDL
VLIDAGCEFAMYAGDITRTFPVNGKFTQAQREIYQIVLNAQKRAIELLVA
GNSIQRANDEVVRIKVKGLLDLGIMRGDIDELIANNAHREFYMHGLGHWL
GLDVHDVGSYSKEGQNGDRNSKVRDRPLEIGMVLTVEPGLYISPKSDVPE
QYKGIGVRIEDNILITEYGNKVLTAAAPKEIGDIEALMATER
>MS1657 pheA, PheA protein
MALDLSEIRQQITQIDRSLLKLLSERHRLAFDVVRSKEITQKPLRDEKRE
QQLLQELINFSENENYQLEPQYITQIFQKIIEDSVLTQQVYLQKKLNEQR
EQSIHIAFLGKRGSYSHLAARSYATRYQEQLIELSCSSFEQIFEKVSSGE
ADYGVLPLENTTSGSINEVYDLLQHTDLSLVGELTYPIKHCVLVNGQDDL
SKIDTLYSHPQVIQQCSQFIRSLNKVHIEFCESSSHAMQLVSSLNKPNIA
ALGNEDGGHLYGLTVLRSNIANQENNITRFIVIARKAITVSPQIHTKTLL
LMTTGQEAGSLVDALTVFKKYQIKMTKLESRPIYGKPWEEMFYLEIEANT
NHPDTQAALEELRQYSTYLKVLGCYPSEIVKPVDVR
>MS0583 potB, PotB protein
MKGIKKDLKAWLLLCSGLGTILFLMGSTFYIVVTQSLGLYNISGEDSRFT
LQYWHDVLTNSVFQSSYIYSVKVSLLGAILSIIVSYPIAMWLRNELPAKV
TIITILRAPMLVPGLVAAFLFVNMISYHGILNETMVFLGIWHEPKTLQND
EFGWGVVILQMWKNIPFALILIGGAVNSLKTDLLDAAANLGSTSWQRFRY
VIFPLTLTAVQVSFILIFIGALGDFAFYSIAGPRSTYSLARLMQMSAYEF
EEWNQSAVMAMMIMLTSAFFTILVSIIIKPLAVKRGDIK
>MS0811 potB, PotB protein
MKMTTRKFQNSTVAVIFAWLIFFMFVPNFLVLIVSFLSKDSSNFYALPFT
FENYARLFEPLYGTVVWNSLYMSGIATVICLLIGYPFAFFMAKLNPKYRP
ILLFLLVLPFWTNSLIRIYGMKVFLGVKGILNEFLLFTGIIDEPIRILNT
EVAVIIGLVYLLLPFMILPLYSAIEKLDLRLLEAAKDLGANGIQRFIKII
IPLTMPGIVSGCLLVLLPAMGMFYVADLLGGAKVLLVGNVIKSEFLISRN
WPFGSAISIGLTILMALLIFVYYKANKLLNKKVELE
>MS0581 potC, PotC protein
MSSAKITTKNSKIIARISLTFFVLVNFIWLVLPFLMAGLWSLVDPKQPWS
YPDILPPSLSLERWQMVWENTSLPEAMFNSYTIAPTVSLITISLSIPTAY
AFGRMEFRGKKIAELLTLIPLVIPGMIIALFFSRMLLDLNISNPFVGIVI
GHVVLTLPYAIRILSAGFSSVPQDLIEASRDLGASKFTVFKDVYMPMLKP
SFLASIIFCLVKSIEEFAISFVIGSPDFITVPTILYSFLGYSFIRPNAAV
VSIILLVPNIILMMIIEKLLKGNYLSQSTGKA
>MS0810 potC, PotC protein
MSRVLRNIFMLVVYAYLYIPIIILVGNSFNADRYGLSWKGFSFAWYERLA
NNDTLIQAAVHSVTIAFFAATFATIIGSMTAIALYRYRFRGKQAVSGMLF
VTMMSPDIVMAVSLLALFMIIGISLGFWSLLLAHITFCLPYVVVSVFSRL
KGFDLRMLEAARDLGASEVTILRKIIFPLALPAIISGWLLSFTISMDDVV
VSSFVSGVSYEILPLKIFSLVKTGVTPEVNALATIMIVLSLLLVLLGQII
GKKDKS
>MS2292 potD, PotD protein
MLVTATAFFSTASFAAPKQLYIYNWTDYIPSDLISKFTRETGIKVNYSTF
ESNEEMFSKLKLTINKPGYDLVFPSSYYISKMVKENMLTPINHSKLTNLK
QIPSNLLNKDFDPANKFSLPYVYGLTGIGINTSFVNPDEVTGWGDLWKEK
FKGKVLLTADSREVFHIALLLDGKSPNTQNEEEIRNAYQRLTKILPNVAA
FNSDTPELPYIQGEVELGMIWNGSAYMAEKENPAIKFIYPKEGAIFWMDN
YAIPKNARNIEGAHKFIDFMLRPEHAKIIIERMGFSMPNEGVKVLLKPED
RVNPLLFPPEDEVKKGVFQADVGDATDIYEKYWNKLKTN
>MS0809 potD, PotD protein
MSWNYQGQIFYSLSTGANSMKKLAGLFAAGLIAVAVTGCNDKESKSADAN
APETAKDNGTVYLYTWSEYVPDGLLDDFTKETGIKVIVSSLESNETLYAK
LKTQGADGGYDIIAPSNYFVSKMAREGMLKELDHSKLPVIQELDPDWLNK
SYDPNNKYSLPQLLGAPGIAYNTQTYKGSDFTSWGDLWKPEFAGKVQLLD
DAREVFNIALLKLGKNPNTQDPAEIKAAYEELLKLRPNVLSFNSDNPANA
FISGEVEVGQLWNGSVRIAKKEQPGSIDMIFPKEGPVLWVDNLAIPATSK
NPDGAHKLINYLLGAKAAEKLTLAIGYPTANVEAKKVLPKEITEDPAIYP
TAELLRTANWQEDVGEAVELYEKYYQELKAAK
>MS0552 potE, PotE protein
MSNKKIGLLSLTALVLSSMIGSGIFSLPQNMAAVAGAEAISIGWLITGIG
IIFLGLSFFFISRLRPELDGGIYTYAREGFGDLMGFMSAWGYWLCATIGI
VGYLTVAFEGLGVFTDSENTVIFGQGNTVASFIGSSIIVWLVHALIAGGI
KEAASVNLVATFVKVAPLVLFILLGFWFFDTDIFNSDVKASALNNNIGDQ
VKDTMLITLWVFTGVEGASVLSAHAKKRTDVGLATVLGILIALALYVAIT
ILALGILPRETIAEMPNPSMGPLLDAMMGPTGKVIITACLIVSVLASYIS
WTMYSAEIPYRGAQKGAFPKILDKLNENSTPINSLWFTGFIVQFCLILVF
VFEQSYNTLLLISTSMILIPYFLIGAYLFKLAIQTNSAWYIKLTGFMASI
YGLWIVYAAGLQYLLLSVVLYVPGILLFLYSHRKFHGKFKLKGFEQTILA
MIFILFCYAVYRLPELLAA
>MS1604 proA, ProA protein
MTDLIQMGKQAKQAAFALSQLSQQEKNHALALIAERLEAQQERILAENAK
DIQAARENGLSESIIDRLLLTKERLTGIADDVRHVISLADPVGKIIDGGV
LDSGLKLERIRTPLGVIGTIYEARPNVTIDVASLCLKTGNAVILRGGKET
QHSNKILVEVIQNALQQAGLPEMAVQAITDPDRALVMELLHLDKYVDMII
PRGGAGLQALCRDNSSIPVIVGGIGVCHIFVEQSADQDRSLAVIENAKTQ
RPSTCNTVETLLVQESIAEEFLPKLARRLKTKEVKFHADSTALSILQGVS
ADVKPVTEQQLRNEWLTYDLNVVIVKGIEEAVEHIREYGSEHSESILTES
QKLANQFVAQVDAAAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMG
LEALTTYKWVCVGDYTSRA
>MS1862 proB, ProB protein
MKFGTSTLTQGTPKLNRAHMIEIVRQLAQLHQEGYRLVIVTSGAMAAGRH
YLNHPKLPPTIASKQLLAAVGQSQLIQTWEQLFAIYDIHIGQLLLTRADI
EDRERFLNARDTLHALLDNRIIPVVNENDAVATAEIKVGDNDNLSALVAI
LVQAEQLYLLTDQQGLFDSDPRKNPQAKLIPVVNEITDHIRSIAGGSGTT
LGTGGMSTKITAADIATRSGIETIIAPGNRENVIADLAHGEAIGTKFTVQ
TDKLESRKQWLFAAPSAGILTIDQGAENAILEQHKSLLPAGIVNIEGRFS
RGEVVKIRTQQGKDIALGMPRYNSDALYLIQGKKSQNIEKILGYEYGSVA
IHRDDMIVLNK
>MS1799 proC, ProC protein
MKNKLLTFIGGGNMAQAIVFGLLNKGYSAAKLIVCDRNEAKRNLFAQKGV
EVNLTNVEAAEKAEVVVLAVKPQAMAETCGPLSAVDFSGKLVISIAAAVS
VSRLSALLPTAKNIVRVMPNTPALVSEGMAGLFASAGLNGEYQDFAEDLL
NAVGKTCWLQKEEDMHAVTAGSGSSPAYFFLFMEAMEKTLSSMGISPENA
RTLVQQSALGAAKMVENNPQLPLSTLRENVTSKGGTTAAALAVFNQYQLD
KIVQQAMEACVARSQEMEKLF
>MS1178 proP, ProP protein
MPNKAETSPAKLRLKAFLKRIKIMNTTENSKQKPVNVVAFAFLLTAFLTG
IASSFQTPTLSLFLAQEIQVSPFMVGMFYTSNAVLGIVLSQILAKYSDSQ
DDRRKIIIFCSLLAIGGCITFAYNRNYYVLMFFATFLLSLGSSANPQAFA
LAREYADYTKREAIMFTTIMRTQISLAWIVGPPLSFSIALGWGFEYMYMV
AASAFLLCAIIAKALLPYVPRKAVVPLTKPDEVAGLPAKNKKQSDKQSIR
LLFITCFLMWSCNGMYLISMPLHVINELHLSERLAGILMGTAAGLEIPVM
LIAGYLTKYLTKKSLILTALFMGLFFYIGMLFAEQTWQLVALQAFNAIFI
GIIATLGMVYFQDLMPGKMGSATTLFSNAAKSSWIVAGPFVGIIAQIWNY
SSVFYISIVLVAVSLFSMSKVKSV
>MS2054 proP, ProP protein
MASGEANYRSLAWIAASALFMQSLDATILNTALPTIAADLHHSPLEMQLA
VISYALTVALFIPISGWVADKYGTLRVFRFAVGMFALGSLACAMSSSLIM
LIFSRVLQGFGGALMMPVARLSIIRSVPKQELLPVWNLMATAGLTGPILG
PILGGWIVTYTSWHWIFLINIPMSLLGIWLANRYMPNVTGSLQKLDWAGF
FFLGGGLVGVTLGFDLISEEFIAKWQATVIVILGVILIITYCFHAQKRER
LALLPLSLFKIRTFRVGIMANMLIRLCASGIPFLLPLMYQVVFHYSADKA
GMLIAPIALSSMLVKPLCGRILTKLGYRTALISASIVLTLSIAVMSFLHI
DSPVWILIVNVALYGGCISIVFTAVNTLTISELSDQDASAGSTFLSVVQQ
VGIGLGIAVSALILSLYRYFIGESAVQLQQAFGYTYLTSASFGVLLVLVL
SGLKKEDGAHLHK
>MS2374 proP, ProP protein
MSGEKTSRYVLGVTLVATLGGLLFGYDTAVISGTVSSLDTVFIQPKGLPE
ISANSLLGFCVASALIGCIIGGACGGYLSSKYGRKKALLIAALLFLISAF
GSAYPEFGLKTINETNNIPYYLSNFLIQFVIYRIIGGIGVGIASMVSPMY
IAEITPARIRGKMVSFNQFAIIAGQLIVYFVNYFIALNGDNTWLNMLGWR
YMFLSEMVPAALFLILLFFVPESPRWLVLQNKFSQAEITLLKLLGERSGK
TELQNIVSSLEHRVVKGAPLFSFGLGVIVIGIALSVFQQFVGINVALYYA
PEIFKSLGASTNNALLQTIIMGTINLSCTTIAIFTVDKYGRKPLQIIGAL
GMAMGMFVLGMAFYANLSGTIALTGMLFYVAAFAISWGPVCWVLLAEIFP
NAIRSQALAIAVAAQWIANYIVSWTFPMMDKSSYLVERFNHGFAYWVYGL
MAILAALFMWKFVPETKGKTLEELELLWNKK
>MS0392 proP, ProP protein
MSTAKKRNFIFIATLGILSMLPPLGVDMYLPSFLNIARDLQVDPERVQYT
LTFFTFGMAAGQLFWGPVGDSYGRKPIILLGVIIGAVAAFFLTGVNSIEN
FTALRFIQGFFGSAPVVLVGALLRDLFDKNELSKTMSMITLVFMIAPLVA
PIIGGYLVLFFHWHSIFYVICAMGILSAILVFFIIPETHHQDNRIPLRLN
VVVRNFVTLWRRKEVLGYMFSSGLGFGGLFAFLTAGSIVYIGLYGVPVDQ
FGYFFMLNIGVMTLGSVINGRVVHRVGAERMLQIGLTVQLIAGIWLLIVA
CFDLGFWPMALGIAVFVGQNSLISSNAMASILEKFPTMAGTANSVAGSVR
FGLGATVGSLVALMKMDSAAPMLFTMGICVIVAVCCYYFLTYRSL
>MS0191 proP, ProP protein
MSNKVNSYGWKALMGSAVGYAMDGFDLLILGFMLSAISADLSLSPTQAGS
LVTWTLIGAVAGGIIFGALSDKYGRVRVLTWTIVLFAVFTGLCAFAQGYW
DLLIYRTIAGIGLGGEFGIGMALAAEAWPARHRAKASSYVALGWQVGVLA
AALLTPLLLPIIGWRGMFLVGIFPAFVAWYLRAKLHEPEVFVQKQAEVAT
GKRQSPFKLLIKDVATAKVSLGVVVLTSVQNFGYYGIMIWLPNFLSKQLG
FSLTKSGVWTAVTVCGMMAGIWIFGRLADRIGRKPSFLLFQIGAVISIIA
YSQLTDPAIMLFAGAALGMFVNGMMGGYGALMSEAYPTEARATAQNVLFN
LGRAVGGFGPVIVGAVVSAYSFKIAIALLAVIYVIDMIATVFLIPELKGK
ALK
>MS1798 proP, ProP protein
MNVRPFTWLALSYFGYYCAYGVLVPFLPVWLKSQNYGTELIGAVIASSYL
FRFLGGIFFPSRVKRANQILPALRLLAWANVFVITAMAFVSESFWLIFIA
IAVFSMVNAAGMPLTDSMATTWQRQIRLDYGKARLIGSAAFVVGVTVFGS
LIGAIGEQYIISILIGLFGLYAVLQMVPPQPKPADEDKNSAKSAVGFGEL
LKNPTHLRLIIAAMLIQGSHAGYYVYSVIYWTNRGIAVETTSLLWGLGVI
AEILLFFFSGRLFRNWSVNAIFYLSAAAAALRWGAFSYTDALWQIALLQC
LHSLTFAALHYAMVRYIGMQPQNAMVRLQSLYSGLASCASVALLTALAGI
IYPISSHWVFLVMMICALIALFVIPRKPTNA
>MS0785 proP, ProP protein
MQNKFAVYLAAIGHLVTDMAQGALPALLPLFIKNYGLTYQEAGGLIFANT
VLASIAQPFFGYLADKRSMPWLIPLGMMLSGCCIAAMGFVHSYPGLFFFA
MIAGIGSALFHPEGARLVNRMSGGEKGKAMGIFAVGGNAGFAIGPMFAGL
AYLFGAQTLSIFALINTIIALIIFLQLPKLTVENVVNKAKNTASTTLQND
WRSFAKLSVIIFVRATNFTVLNAFIPIYWIHILHQQETDANFALTIFLSM
GVAITFIGGLLSDRLGYVRIIRYAYLIFLPTILIFTQSENLWLSFILLIP
LGLGVFTQYSPIVVLGQTYLAKSVGFAAGITLGLGITMGGIFSPIVGWIA
DHYGLQIALQTLSVLSLLGLIFSYRLKITDTEKPEKK
>MS1407 proP, ProP protein
MMTSSRPNLTLLLILGALMACTSLSTDIYLPAMPTMAKELQGNTELTITG
FLIGFAIAQLIWGPISDRIGRKIPLFIGMALFAVGSVGCALSQSMAEIVF
WRVFQAVGACVGPMLSRAMIRDLYDRSQAAQMLSTLTIIMAAAPIIGPLL
GGLLLKISSWQAIFWLLVVIGILLFLSIIKLPETLPPAKRAAGSFWSAFG
NYRILLKNRAFMRYTLCVTFFYVAAYAFITGSPFVYIDYFKVDPQYYGFL
FGVNIVGVALLSAVNRRLVRHYPLESLLRVSTMIALCAVLILVVLVFMDL
DGIAGILSVAVPIFIMFSMNGIIAACTNAMALDSVQPEIAGSAAALLGSL
QYGSGILSSLLLAYFSDGTPHTMAWIIALFVGLCAVIGWGQRPRSA
>MS0499 proP, ProP protein
MNLREHIDNNPMSAYQWTVVIIAAIMNLLDGFDVLALAFTATAIRGDLGL
SGAELGYLFSAGLLGMAAGSLFLAPLADKIGRRPLLLISVTLSALGMLGS
AYSASYGALGFWRLITGLGVGGILVGTNVLTSEYSSRKWRSLAISIYASG
FGIGAVLGGMFAVVLQEEYSWHAVFLAGFILTAVCLIVLLIWLPESIDFL
MTQQPRNAQIRLNKITKKMGLKGQWTLPEKVLASASKLPLTQLFNKNYRK
STALIWIAFFAIMFCYYFVSSWTPALLKEAGMTTEQSVSVGMMVSLGGTC
GSLLYGLLASRWKAKQMLVQFTVLSAFSVIIFILSSSILWLAMLFGILVG
GFMNGCISGLYTLNPSIYAANIRSTGVGWSIGVGRIGAILAPLAAGVLLD
YGWDKQSLYIGVGFVLLIAAIALSLLRIKTTLVKC
>MS1530 proP, ProP protein
MNTETKQPALIVPRLSLMMFMEFFIWGSWSVTLGIVMTKYDLSTLIGDAF
SMGPIASIISPFILGMLVDRFFPSEKVLAVLHLIGAAILWFIPEFITGQQ
GGTLVFALLAYMLCYMPTVALTNNIAFHSLADSEKSFPVIRVFGTIGWIV
AGLFIGQADLSASPAIFQVAAICSLILGLYSFTLPNTPPPAKGKPFSMRD
LMCADAIALFKIPHFLVFAICATLISIPLGTYYAYAAPFLDAVGFEKIGS
LMSMGQMSEIVFMLLIPFFFKRLGVKYMLLAGMLAWFLRYAFFALGVSEE
IRWAVYLGILLHGICYDFFFVVGFMYTDKVADEKIRGQAQSLVVLFTYGL
GMLLGSQISGGLYNNMFADNTDVSTWSTFWWIPAISAVVISVIFFIFFNY
KEDKREA
>MS0807 proP, ProP protein
MLMTSQNKINAVPSNQNFYLNNRNYWIFSGYFFVYFFIMATCYPFLGIWL
GDINGLSGEDRGTVFAMMSFFALCFQPVFGYVSDKLGLKKHLLWVLGISL
LIYAPFFIYIFAPLLKVNVWLGSLVGGAYIGFVFQAGAPASEAYIERVSR
RSKFEYGRVRMFGMFGWAICASIAGVLYATNPNLVFWLGSIASLILLLLI
ALAKPEQTSTVQIAEKLGANKNPVNLRQAFALLKLPKFWALLAYVMGIAC
VYDIFDQQFGNFFNTFFESHEQGIKMFGYVTTAGELLNALIMFFVPLIIN
RIGAKNALLIAGTIMSVRIIGSSYAIEAWHVVVLKTLHMFEVPFYLVGLF
KYIANVFEVHFSATIYLVACHFAKQIGNMLVSPLVGAWYDTYGFQDTYLI
LGCIAAGFTLLSVFTLTGKSLSSQS
>MS0797 proP, ProP protein
MSQNHFFSHIFNRNMLICIFTGFSSGLPLYILTSLIPTWLRSTEIDLKTI
GFFTLTSLPFIWKFLWSPFLDRFVPPFLGRRRGWMLIFQLLLLISLGLFG
FIDPHTNQGLSLLIGLATMVSFFSASQDIVLDAYRREILSDQELGMGNSI
HVSAYRIAGLVPGSLSLILSDHFSWQAVFIITALFMLPGLLMTLFISHEP
QIELKSNRTLAENIVEPFKEFFQRKGLWGAIGILTFIFLYKFGDSMATAL
ISAFYLDMGFTKTQIGLVVKNASLWPMIIAGIIGGMITLKIGINKALWLF
GLVQIVTILGFAWLAQLGPFEKVDSFAIFALTVVVMAEYVGIGLGTSAFV
AFMARATNPVYTATQLALFTSLSALPRAVFNSFSGVLIENMGYYHYFWLC
FFLAIPGMLCLIWVAPWKEK
>MS0549 proV, ProV protein
MTTSVKISVKNLTKIFGSHPKSAFKLLQNGKTKEQIFAETGSTVAVNNVS
LDIMAGEIFVIMGLSGSGKSTLIRLLNRLIEPSAGHVFIGDDDIAEMSEK
ALRAVRRKRISMVFQSFALMPHMTILENVAFGLELSGVNSKNRRRMALET
LARVGLEAYADVYPGELSGGMQQRVGLARALANDPEILLMDEAFSALDPL
IRTEMQDELLRLQENSERTIVFISHDLDEAMRIGNRIAIMQDGQVIQVGR
PDEILQNPANDYIRSFIQGVNVSNVLSAKDIASKRHLLNIVQKSEDETPH
VAFKLLEQHERDFAVVLDRYGYYKGMVSVDSLQQARSNRQSLSQSFIEIT
PLSPEQSISDIINDVATTREPLPVVDDKGHYYGVVTKVKVLQTLDRGTEA
>MS0550 proW, ProW protein
MTTENIRTADPWEATLQAAQQDNAYAWLQGSEQSQDFNWMYPFDHTLVPF
GDWVESLINWLVTHLRSFFQFISAPIDYILSLFQTSLNVLPPTVMIILFT
LLVWQFTHFRLALATLLSITLIGAVGAWNEMMITLALVLTSVSFCLLIGL
PLGIWMARSTRASAIVKPVLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVT
IIFALPPIVRLTILGIQQVPEALIEAAQAFGASKKQLLYKVQLPLAMPSI
MAGVNQTLMLSLSMVVIASMIAVGGLGQMVLRGIGRLDMGEAATGGLGIV
LMAIVLDRLTQKIAENMHSQHKVRWYERGITGLFIRKK
>MS0551 proX, ProX protein
MAYPMKLTILFSLALFASNAVRADDKAIQPLQSPLAEETFQTLIVVKALE
ELGYRVNPPKEVDYNVAFTSIANGDATFMAVHWLPLQADKYANAGGDRKL
YRQGTFVEGAVQGYMIDKKTADTYNITNLAQLKDPKLAKLFDTNGNGKAD
LIGCSPGWSCEYTVSQHIDGYGLSRTVEVTQGNYSALIANTIAQYQNGKS
ILYYTWTPYWVSGVLVPGKDVVWLQVPNRPDPGKTVADTNLANGKNYGFT
VSSMHIVANKTFTDAHPDAARLFAVMRLPAGDISAQNMAMRNGQNSSQDI
ERHAEAWIKFHRVQFDEWIKQAKSAKN
>MS1536 prsA, PrsA protein
MPDIKLFAGNATPELAKRISERLYISLGDATVGRFSDGEIQVQINENVRG
SDVFIIQSTCAPTNDNLMELIVMVDALRRASAGRITAVIPYFGYARQDRR
VRSARVPITAKVVADFLSSVGVDRVLTCDLHAEQIQGFFDVPVDNVFGSP
VLINDILKKTDLENPIVVSPDIGGVVRARAVAKLLNDTDMAIIDKRRPKA
NVSQVMHIIGDVSDRDCILVDDMIDTGGTLVKAAEALKERGAKRVFAYAT
HAVFSGSAAQNIANPALDEVVVTDTIPLSAEIKALGKVRSLTLSAMLAEA
IRRISNEESISAMFDA
>MS0777 putP, PutP protein
MFGLDPTLITFTIYILGMLAIGVLAYYYTNNISDYILGGRRLGSFVTAMS
AGASDMSGWLLMGLPGAVYVSGLIEGWIAIGLTIGAYLNWLFVAGRLRVH
TEFNNNALTLPEYFHSRFGTSHNLLKIISASIILVFFTIYCSSGVVAGAK
LFQNLFGIPYATALWYGALATIAYTFIGGFLAVSWTDTIQATLMLFALIL
TPVVIVVSLGGIDGFSASMQSAEIDMQKDFTDLFTGTSTLGLFSLAAWGL
GYFGQPHILARFMAAYSAKSLHKARRISITWMIICLIGAISIGFFGIAFF
HANPQIAEVVTKEPEQVFIELAKLLFNPWVAGILLSAILAAVMSTLSCQL
LLASSAITEDFYKGFIRPKAGEKELVWLGRIMVLIIAALAIWIAQDENNK
VLKLVEFAWAGFGSSFGPVVLLSLFWKRMTSSGAIAGMLTGAIVVFSWKS
VIPATSEWSGVYEMIPAFSLASLMIILVSLLSPAPNKEIVETFEKANLAY
KNAE
>MS1741 putP, PutP protein
MNVDYLVMAGYFALIIAISLLFKKMASNSTSDYFRGGGKMLWWMVGGTAF
MTQFSAWTFTGAAGKAFNDGLSVIAVFVGNMVAYACAYWYFARRFRQMRV
DTPTEAIGRRFGTSNEQFFTWVIIPLSVINAGVWLNGLSVFASAVFDADI
TMTIYVTGISVLIISLLSGAWGVVASDFVQMLVVAVISVACAVVGLVVIG
GPGEIIDRFPGGFVSGPDMNYPLILICTFLFFIVKQLQSINNMQDSYRFL
NAKDSKNASKAAIFALLLMLVGTIIWFIPPWVTAIIYPEAASLYPQLGKK
ASDAVYLVFAKNVMPAGTIGLLMAGLFAATMSSMDSALNRNSGVFVRSFY
APIIRKGKADDKELLRAGQIVCVINGILVILMAQFFNSLKHLSLFDLMMQ
VATLLQSPILVPLFLAIIIRKTPKWAPWATVLFGMFVSWSVVKVFTPEYV
ASWFGVEDLTKREISELKVIITIAAHLIFTAGFFCLTTLFYNEAKDTNNE
RRIAFFKDVDTECVAEEGQDEIDRLQRKKLSTLVMLMAAGLLLMILIPNP
LWGRALFACCSLAIFAVGYGLKRSAEV
>MS0535 rhaT, RhaT protein
MLQKYRGEIILFIVSLIAASGWFFSKFSMAEFPALGFIGLRFFLAAIFFF
PLAYPQLKRLDKPQLIKSALVGLCYAVYIMLWMLGLINSAHFGEGAFLVS
LSMLIAPLLSWLIFGHLPYKSFWLALPAAFTGLYLLSSGKGGLHFSFGSL
IFLISSLVAALYFVLNNQYARDIPVLSFTTIQLFIVGTCCGTLSILFEQW
PTSISMTAWGWFLCSLVIATNLRMLLQTYGQKYCHVATAAIIMILEPVWT
LFFSILILGERLTLHKAFGCLSILAAIMIYRLPAILRNQASANKE
>MS0885 rhaT, RhaT protein
MLRPSCREKIIMVNNYNLALIKVHFTAVLFGLTGVLGVIISADSDVIVLG
RVIIAFLALSVYFLIKREKLTALSTKDVANQSLSGALLTAHWVTFYVAVK
VGGVAVATLGFAGFPGFVALFERLFFQEKLKRRELILLIAVTIGLILVTP
QFEFGNQSTQGLLWGIFSGAIYGILAILNRKNINKLSGTQASWWQYLIGS
ILLFPFAAHKLPAVSVTDWFWIACLGLLCTSLAYTLFVSSLNIINARTAA
MIISLEPVYAILIAWIWLGEQPGLRMIIGGLIILLSVGVVNFRR
>MS1754 rhaT, RhaT protein
MFYLIAAVLIWASAFIAAKFSYTMFDPALTVMLRLILSALLVLPTFFRSY
RKIPKQYRLQLWGLGLLNFPVVLLLQFTGVHYTSVASAVTMLGTEPLVVT
LLGHIFFHKPARLLDWLLGIVALTGIVFVVYGSESGGEVTLLGCTLVLLG
SIAFSFSIHLAQSVMKAVEAKAYTDVIIMTGAISCVPFSLLLVQDWQIHL
NIEGISAILYLSVGCTWLAYRLWSKGLRVSSANTASILTTLEPVFGVLLA
ILLLGEHLTLTTLFGICLVISAAGISVLSSMLINYIKNKVTIL
>MS1595 rhaT, RhaT protein
MNQQPVLGFIFALITAMAWGSLPIALQHVLTVMGAESIVWYRFFVASLAL
FLLLAWKKKLPALSQFTSRYWKLSLIGVLGLAGNFFLFNSSLNYIEPAIT
QIFIHLSSFMMLICGVFVFKEKLGAHQKAGLLILILGLGLFFNDKFDMLF
GLNMYSTGILLSVSAAVVWVAYGMAQKLMSRQFTAQQILLIMYTGCVIVL
CPFAQFSQIQGLSGFALGCFIYCCLNTLIGYGAYAEALNRWDVSKVSVVV
TLVPLFTILFSRILHGLDPAHFAMPHLNTVSYIGAFVVVLGAIISAVGYK
LFKYKR
>MS1753 rhaT, RhaT protein
MLFQIIATLIWASAFIAAKYTYEMMDPVLMVQCRFFIASIIMLPGFFAAY
KRVPKERLKIMWLLALINFPLMFLLQFIGLYFTSAASAVTMLGMIPLLTV
LIGFLFFKRRINKIDLLLSLVALAGIILTVVGGGEDNLINPWGCLLVLGS
AVSFCFCLYLSKDVMQEMAPKDYTNVLVILGSILCLPFTCVLVRDWSIVP
SVKGMISLFYLGIGCTWLAVVLWFKGVQKTPTYISSILTTLEPIFGVILA
ILILDERLSTVSAMGILLTLGAAAVSVLIPVLMKKSP
>MS1825 rhaT, RhaT protein
MLMPHFTQSKGYGYFCLILATFFWGGNYMFGRILSHVIPPIILNYLRWLP
AAIILLLLFAKYLPQQRHIIRKNWQILTALALLGVLIFPVFLYQGLQTTT
ALNASIYLAVVPIVVMFLNRICFKDTIRFPVFIGALISFIGVLWLLSHGE
LSRLLTFNVNRGDLWAIGSAVSWSVYCSIIRLRPKEIGNSVMLTAQVGIA
MIIFTPVFLSQLNTENLQIISELTYGQWMIILYLIIGPSILSYGFWNYGM
TIVGGTKGAAFTNATPLFAAALGILVLGEQLHGYHLISSLLIVIGLTLCN
KK
>MS1597 rhaT, RhaT protein
MKQQPLLGFLFGLIAACMWSSLPLFVQQVVKVMDIQTSVWYRFVLSAVGV
LLLLCFSGKFFTFKRISPKNTLLLLLAIAGLSVNFYLYNLALKYIPPTTS
QVLSPLSSFMMLFAGVLIFKEKMARHQKIGLAVLSLGLILFFNERLDDFL
QLNTYFKGVVMVIASSFVWVIYAIVQKVLLSHLSSQQILLMIYIGCTLVF
FPNADIKQIYQLDGFQLVCLVFSGVNTIIAYGCYAEALDRWEVSKVSAIL
TQIPIFTLLFFHLAVMIAPNYFVAVELNWISYLGAFCVVSGAMLSALGHK
LKMLKERD
>MS0972 rhtB, RhtB protein
MLNLIIVHFFGLVTPGPDFFYVSRMAASNSRRNVICGIIGITLGVAFWAA
SAMLGLAILFTTMPVLHGVIMLLGGGYLAYLGLLMVRSRTNATFAPLSAE
ELNKTTTVKKEIMKGLFVNLSNAKAIIYFASVMSLILVHITQVWQMLLAF
AIILVETFIYFYLISVLFSRPFAKKFYSRYSRYIDNVAGIIFLIFGMILA
YTGVMEMMG
>MS1681 rhtB, RhtB protein
MEFWHGFLIITGIHILAAMSPGPDFIYVSQQTLSRGRAAGIICALGVAFG
LGVHILYSVLGLAVVIASAAWILTTIKIIGGIYLIYLGYKGLKARAKNQV
QIIEKVEVQQENRLKTLWKGFLCNVLNPKAPVYFVSVFTVVLSPNMPVWQ
LAIYGVWMMFLQFVWFASVAFLLSIPKVNKQFQKAGHWIDRILGFVMVGL
GIKVISS
>MS1311 sdaA, SdaA protein
MISVFDMFKVGIGPSSSHTVGPMKAGKQFIDDLITQGNIGKITRIHADVY
GSLSMTGLGHNTDITIIMGLAGYLPHNVDIDSIADFISRVKQTALLPVAG
GSYTVDFDFKQDMQFHDSFLSLHENGMTLTAFMNDEIAYRQTYYSIGGGF
IVGEAHFNQAQNEEVPVPYPYNNAADILRHCHDTGLPISTVVFRNEVALH
GKESVEHHLSLIWQTMQDCIKHGLKTEGLLPGPLKVSRRAPALHRLLQAN
SNLNNDPMQVIDWINMFALAVNEENAAGGRVVTAPTNGACGIVPAVLSYY
EKFISPLNAETVERYLLVCSVIGSLYKMNASISGAEVGCQGEVGVACSMA
AAGLTEILGGNPEQVCIAAEIAMEHNLGLTCDPVGGQVQVPCIERNAIAS
VKAINAARMALRRSTNPRVTLDKVIETMYETGKDMNAKYRETSKGGLAIK
VVCS
>MS0977 sdaC, SdaC protein
MLHIILTLNRKFKMKNKTFGSALLVAGTTIGAGMLAMPLTSAEMGFTYTM
ALLFLLWILLSYSALLFVEVYQTVQRKDAGIATLAEQYFGMVGRVLATLS
LVIFMYAILSAYVTGGGSLLAGVLPFLGEHAAPISIIAFTVILGIFIVIS
TGAVDGLTRLLFMIKLVAFVLVLTMMLPLVQGENLMAMPLKEFLIISASP
VFFTSFGFHVIIPSINNYLDGNIKRLRAAIIGGTALPLVAYILWQMATHG
VFPQAKFVEIINNDPTLNGLVDATYHVTGSNLISGSVRLFSTLALVTSFL
GVSLSLFDCLDDLLKRINIKAGRLALGVLTFLPPLAFALFYPEGFIAALG
YAGQMFTFYGLVLPVGLAWRARKLHPNLPYRVIGGNLTLLIALLLGLLIM
NVPFLIEGGYLPKVIG
>MS1895 sdaC, SdaC protein
MYYYNSSKSYLTWKTFMEKSMKNKKQPSLLGGAMIIAGGTIGAGMLANPI
STAGVWFLGSLLILIYTWFCMMSSGLMLLEANLHYPTGSSFDTIVKDLLG
KGWNILNGLSLAFVLYILTYAYITSGGGITEGFLNQLLSSEQSAVEIGRS
SGSLIFTFVLAVFVWFSTKAVDRFSTILIGGMVISFFLSVSGLISSANAD
VLLNSATSQDTQYLPYALVALPVCLVSFGFHQNVPSLVKYYNRDAGKVSK
SVFVGTFIALIIYILWQLAIQGNLPRAEFVPVIEKGGDIAALLAALSKYI
QTDYIALALNFFAYMAIASSFLGVTLGLFDYIADLCGFDDSKAGRTKTAL
ATFLPPLLLSLQFPYGFVIAIGYAGLAATIWAAIVPALLAKASRKKFNKP
SYSCFGGNLMVYFIIIFGVLNILSQLAMQFGWLPEFKG
>MS2338 selA, SelA protein
MTALFQQLPSVDKILKTPQGEQLVTEFGHSAVVNCCRHLLAQAREKIKIE
KKLPHFFTDFNHTIAEVNRYLANQQQVKIKSVHNLTGTVLHTNLGRALWA
QSAQQAALTAMRQNVALEYDLEAGKRSHRDNYVSELLHELTGAQAACVVN
NNAAAVLLMLATFAQGKEVIISRGELIEIGGAFRIPDIMAQAGCKLVEVG
TTNRTHLNDYRRAINENTALLMKVHSSNYQICGFTCEVSEQELVELGKEF
NIPVVTDLGSGALTDLSRYDLPKEPTVQEKLVQGADLISFSGDKLLGGPQ
AGIIVGKKELIQQLQSHPLKRVLRCDKVILAAMEATLRLYLQPEKLTEKL
TSLRLLTQPLEQLRQQAEQLKAKLENLLKDDFLLQIESSLAQIGSGSQPM
AKIPSIAVTIAEKNSEKLTALLARFKKLSTPIIARVENDKIRLDLRSVTA
IETLLITLEELNQDQ
>MS1241 selD, SelD protein
MLGTILHSQLEQFVDPHLLVGNDTNDDAAVYDIGNGTCIISTTDFFMPIV
DDPFDFGRIAATNAISDIFAMGGKPIMAIAILGFPINVLPAEVAQKIVDG
GRFACREAGIALAGGHSIDAPEPIFGLAVTGIVPTEKVKRNASAEAGSKL
YLTKPLGIGILTTAEKRGKLKPEHKGLATEVMCQMNLIGSQFSQLESVTA
MTDVTGFGLLGHLAEICEGSNLVADVHFNKIKMLDGVPYYIEQGCLAGGV
TRNYESYGIKIGAITEFQKAVLCDPQTSGGLLVAVKPEGETQLLELAAQA
GIELIEVGELRRRVDNSDPVIIRILD
>MS1743 serA, SerA protein
MTNKVSLDKSKIKFLLLEGVHQNALDVLHAAGYTNIEYHKKALEPDELKE
AIKEAHFIGLRSRTNLTADILEHANKLIAIGCFCIGTNQVALEAAEEKGI
PVFNAPFSNTRSVAELVLGEILLLMRNIPAANAQVHRGEWNKSAAGSHEV
RGKKLGIVGYGHIGSQLSIIAESLGMNVFFYDVETKLPLGNAQQVSTLEE
LLSSCDIISLHVPELPSTKNLMSAERIAQLKPGSILINAARGTVVDIDAL
AEALEQGKIHGAAIDVFPKEPASAAEAFESPLRKFDNVILTPHIGGSTAE
AQENIGTEVASKFVKYSDNGSTLSAVNFPEVSLPEHRTAKRILHIHHNRP
GILNKINQVFVDENINIAAQYLQTDAKIGYVVIDVETDDSTDLLQKLKSI
EGTIRARVLF
>MS0068 serA, SerA protein
MIMKVVISHRLHDNGMKVLEDANAQVAITNDGNPKIMLPELLDAEGLIIR
IGSIDRETMLQAKNLKVIGRPGVGVDDVDVKTATELGIPVVIAPGSNTRS
VAEHAFALMFACAKDIVRSDNEMRKGNFAIRSSYKAYELNHKTLALIGYG
RIGSILAQMSKAIGMNVKVYDPFVKQGTIEQEGYIYCTELDDVIRDSHVI
SIHVPLTNETRNLIGEHEFSLMNEHTILINCARGEVIDEPVLTKVLQEGK
IHSAGLDVFACEPVDINSPLFQLDNVIVSPHMAGQTKEAASGVATMAAEG
VVAVINGEKWPYVCNPEAYNHPRWNK
>MS1758 serB, SerB protein
MQTSEFINLTLKDIKQHYSPFPNKLINNQPQTEGRDYFILFGTNLEPAKL
QAFQQKCGENFQIFDCWNNLHNIVVLLKGHWQKSYETHAHDLTLDAAKID
FNANLAEQGLLVMDMDSTAIQIECIDEIAKLAGTGEEVSAITAAAMRGEL
DFEQSLRRRVSTLKDAPETILQEVRLQLPLMPGLKETVRILQQHNWRVAI
ASGGFTYFADYLKELLNLDAAVSNQFDIENGKLTGRVKGDIVHAQYKADT
LKRLAREFNIPLENTVAIGDGANDLLMLKQANLGAAFHAKPKVQQQAQVV
VNFADLTALLCLLSAGEKIKHLS
>MS1573 serC, SerC protein
MSNVFNFSAGPAMMPPAVLKKAQEELLNWQGQGTSVMEVSHRGKYFMELI
TQADKDFRELYNIPENYKILFLQGGARGQFAAIPMNLANNKGKALYLNTG
HWSATAAKEARNFTEVDELNITEQIDGLTRVNRLDFSDIAEQYDYVHYCP
NETITGVEINEIPNVGNAVLVADMSSNIMARKLDISKFGIIYAGAQKNLG
PAGIVIVIVREDLIGHARKATPSIWNYEVQANADSMINTPPTFAWYLCSL
VFKDLLANGGIDTVEKRNAQKAALLYDYLDQTVFYHNTIAKENRSVMNVT
FTTGDDQLNAKFVAQATEAGLQALKGHKVFGGMRASIYNAMPVEGVEALI
AFMKKFEAENA
>MS0832 sstT, SstT protein
MNISRLFSFLFHGNLVKRISIGLLLGIIFALVSPSLESALGFHLAEKMGL
LGQIFVRSLRSVAPILVFVLVIAAIANKKVGSKSNMKDIIYLYLIGTFLS
ALTAVFASFMFPTTIALATNEAELSPPGKITEVLTALIFNVVDNPITALF
NANFIGILAWAIGLGITLRYASETTKNVMNDFAEAVSKIVHFIISFAPIG
VFGLVASTLADKGLSALLDYVQLLAVLVGSMLFVAFVINPIIVFWKIRRN
PYPLVWECIRVSGVTAFFTRSSAANIPVNMELAKRLNLDEETYSVSIPLG
ATINMGGAAITITVLTLAAVFTLGIEVSIPTAILLSLVASICACGASGVA
GGSLLLIPLACSLFGISNDIAAQVIGVGFIIGVLQDSTETALNSSTDVLF
TAAACMSEERKNS
>MS0956 tdh, Tdh protein
MMEIKTLSCVVRGPKDVGVMEQSINYDESSKEQTLVKITRGGICGSDLHY
YQYGKVGNYEIKHPMILGHEVIGTVVKTNAPDLYVGQKVAINPSKPCLTC
KYCLSGDTNQCETMRFFGSAMYNPHVDGGFTQYKVVDNSQCIDYPQDVSD
DIMAFAEPLAVTIHAAKQAGDLAGKRVFVSGVGPIGCLAVAAIKASGAKE
IVVSDLSRRCLDLALEMGATKALNAKDDFSEYMAHKGEFDVSFEASGHPS
SIERCLAVTKARGTIIQIGMGGAIPEFPIMTLIAKEICLKGSFRFIEEFN
TSVEWLSSGKVNPLPLLSATFPYTELEKALIIAGDKDNISKVQLSFE
>MS0525 tdh, Tdh protein
MMRSLVCKEPFHLILEERAKPQPKDEEVQLKVAAIGICGTDIHAYAGNQP
FFEYPRVLGHEASGVITELGKNVDKFKVGQRVALIPYVSCGKCGACLSGK
TNCCENISVIGVHQDGAFSEYLTAPAKNILPIADSVDFTTAALIEPFAIS
AHAVRRAQITKGDDVLIVGAGPIGLGAAAIAHADGANVVIADTSEERRKH
IQANIPVPTVNPINEKVEDYFNGRLPQIVIDATGNQKAMNNAVNLIRHGG
RIVFVGLHKGTIEFSDPDFHKKETTLMGSRNATLEDFEKVQHLMSERKIS
ANMMLTHTFKYDELAEIYEEKITKNQSLIKSVVLY
>MS1702 thrB, ThrB protein
MLRIYAPASSANISVGFDTLGAAISPIDGSLLGDVVQIEDIPAGFELESA
GYFVRKLPKEPQKNIVYQAYVLFSERLKLRNGHVKPLRLTLEKNMPIGSG
LGSSACSIVAALVALNMFHNEPFSKMELLEMMGELEGRISGSIHYDNVAP
CYLGGVQLMVQSLGNICQQLPFFDSWYWVLAYPGIEVSTAEARAILPKSY
TRQDVIAHGRHLGSFVHACHTQQDVLAALMMKDVIAEPYRESLLPNFAEV
KQASRDLGALATGISGSGPTIFSIAPDLAVATKLANYLENHYLQNNEGFV
HICKVDNQGTRALG
>MS1701 thrC, ThrC protein
MNLYNIKHPEEQVNFAQAVRQGLGKDQGLFFPEVIPALDNIDELLALPLV
ERSQKILGALIGEEIPAEKLNTMVKNAFTFPAPVAKVEEGVYALELFHGP
TLAFKDFGGRFMAQALATVRGDGKITILTATSGDTGAAVAHAFYGLENID
VVILYPQGKISPLQEKLFCTLGGNIRTVAINADFDACQALVKQAFDDEEL
RRAIGLNSANSINISRLLAQVCYYFEAAAQLSPSERSNLVVSVPSGNFGN
LTAGLIAKTLGLPIKRFIASTNANDTVPRYLAKGKWEPNATVATLSNAMD
VSRPNNWPRVEELFKRNGWALSELHSGAVSDAQTEETLRDMNAKGYLCEP
HGAIAYRVLKQDLQAGETGLFLCTAHPAKFKESVERILNTQLPLPQALAK
HAELPLLSDVMENDFAALRAYLLKK
>MS1154 trpA, TrpA protein
MARFETLFAQLNAKKQGGFVPFVTLCDPDLERSFDIICTLVDNGADALEL
GFPFSDPLLDGPVIQAANNRALNAGCSTAESFKLLEKVRSKYPEIPIGLL
LCANLIYAQTLDGFYRRCAEIGIDAVLVADIPLLAAEPYIQAAKKHGIQP
VFICPPNADENTVKGVAEHSEGYTYLVSRAGVTSAENQSHAANLDSLVEQ
LKAHNAPPILQGFGIAKPQQVKEALNMGVAGAISGSATVKIIEANLDNHE
KCLADLAEFVKNMKAATL
>MS1153 trpB, TrpB protein
MTDTILDPYFGEFGGMYVPEILIPVLKQLEKAFVEAQQDPAFQTEFLDLL
KNYAGRPTALTLCRNLTKGTKTKLYLKREDLLHGGAHKTNQVLGQILLAK
RMGKTRIIAETGAGQHGVATALACAMLGMPCRIYMGAKDVERQSPNVFRM
RLMGAEVFPVTKGSSTLKDACCEAMRDWAANYENTHYLIGTAAGPHPFPT
IVREFQKMIGEETKAQILQREGRLPDAVIACVGGGSNAIGMFTDFINETS
VRLIGVEPAGKGIETGEHGAPLGHGKPGIYFGMKSPIMQTEDGQIEESYS
ISAGLDFPSVGPQHAYLNSIGRAEYPSITDDEALEAFKELAQHEGIIPAL
ESSHALAYALKMARQNPMREQLLVVNLSGRGDKDIFTVDKIFSERGML
>MS1152 trpC, TrpC protein
MNLNDKPTILQKIVADKIQWIKAKEQVFPLASFKEKITKSDRSFYQSLGK
GTHQNPVFILECKKASPSKGLIRNEFNPADIAQVYKNYASAVSVLTDEKY
FQGDFSYIKQVRDIVTCPVLCKDFMISEYQVYLARYYQADAILLMLSVLD
DETYKKLAALAHELGMGVLTETSNQQELERGIALGAKVMGINNRNLHDLT
VDLARTPPLAQQIPADRIIVSESGIYSHQQVQQLKPYVNAFLIGSSLMGS
DDLNNAVRSVIFGENKVCGLTRPQDVQEVYRQGALYGGLIFAENSKRCVS
LRQAQELVTVAPLRFVGVFQNQQIDFIVKIATQLNLYAVQLHGAENEEFI
AALRIQLPHQIQIWQAVSIDVAQQSAVKIDRISAVDRYVLDSKTANRQGG
TGVAFDWSKIPAEIKNKSLLAGGITPENIELALAQHCLGIDLNSGVESAA
GIKNPEKLTAVFNKIHRF
>MS1151 trpD, TrpD protein
MRIKTRNFIMQTQQILTQLFDNQPLSQEQAAFIFGNIVKGELSNEQLAGA
LIALKIRGETIDEITGAVTALLAAAEPFPAPDYPFADIVGTGGDNADTIN
ISTASAIVAASMGLKIAKHGNRSVSSKTGASDVLTALGVNIRMSTEQARK
ALDEIGIAFIFAQQYHLGFKYAGPVRQALKTRTIFNILGPLINPANPKRQ
LLGVYSPELLKPYAETNLRLNHEHSIIVHGCGLDEVAIHGLTQVAELRDG
KIEYYNLSPKDFGFEPQPLESLRGGAPEENAKILTALLQGKGSEQQAQAV
AMNTALLMKLFGHEDIKQNAQQVLEQLTTGKAFETLTKLTTY
>MS2193 trpE, TrpE protein
MNFASFIRQANRLGRQKTAFFFLIDFERQKPLISPLESAVENGIIFSVEG
NTNFYRPVELPRQKIRFSSEPVSFERYAAGFALVQQELQKGNSYLLNLTY
PSKINTNYNLAQIFQATKAPYKLLLQDQFVCFSPESFIRIRQNQIFTYPM
KGTIDAALPQAEQQLMQSEKEGREHYTIVDLMRNDLAMVAENIRVRRFRY
IDKISTNRGEILQTSSEITGNLTADWQNRIGSILAALLPAGSISGAPKEK
TVSIIRQAEGGKRGYYSGIFGIFNGEELNSAVAIRYIEQKDGQLYFRSGG
GITSQSRLQEEYEEYCQKVYLPIHCVE
>MS1149 trpE, TrpE protein
MPNAYIQTLSNPVQYQQDLTAVFATVGKTNSLLLESAEISSKNSLQSLLI
INAALKVSCLGQIVTFTALTANGSHVLPLIKEKLQGKTKSLSVQQNKLIA
EFFPIDQNLDEDSKLQSLTVFDGLRVINQLYQHSKQPVFLGGLFAYDLVA
NFIPMNNITLQDDGLSCPDYVFYLAEQLLRLDHPSQQATLQTFCFNDSEL
QNLQQSAVEIDKDLRNLKPLSAIQQGSTDISTNHEDEKFKQIITALKHHI
YIGDVFQIVPSRRFILQCPNTLATYRQLKENNPSPYMFFMQDEEFTLFGA
SPESALKYSADNRQLEIYPIAGSRPRGFDAKGKIDPELDARLELEMRLDH
KEQAEHLMLVDLARNDVARVCESGTRHVKELMQVDRYSHIMHLVSRVVGK
LRPELDALHAYQACMNMGTLTGAPKIKAMQLIYQFEKQKRHSYGGAVGYL
SSDGNLDTCIVIRSAFVQNGIAYVQAGCGEVLDSDPQMEADETRHKAQAV
IKAILQTNAQAN
>MS1102 tyrA, TyrA protein
MEALKEIRAEIDQLDRELLEVFAKRLALVKKVGEIKHQQGLPIYVPEREA
DMLAARRSEAEKMGIPADLIEDVLRRVMRESYANEHEHGFKTVNPAIKKI
VIVGGKGKLGGLFGRFLTASGYFVEALGSKDWDNAKAILAGANAVIVCVP
IVKTLETIERLKPYLTEDMLLTDLTSVKRRPLEKMLEIHQGAVVGLHPMF
GPDIASMAKQVVVRCDGRYPERYQWLLEQIQMWGARIYQADAAEHDHSMT
YIQALRHFATFANGLHLSRQPVKLANLLALSSPIYRLELAMIGRLFAQDG
SLYADIIMDKPENLEVIESLKQSYEDSLKFFENGDREGFIKTFNKVREWF
GDYSEQFMKESRQLLQQANDYRHNSL
>MS1031 tyrB, TyrB protein
MQITILVSIKEKLISKHNISKESPMFKNITPAPADPILGLGEAFKAETRE
NKINLGIGVYKDADGVTPIMTAVKKAEGQLFENEKDKNYLPIEGVAEYNA
YAKELLFGKDSEIIASNRACTVQTLGGTGALRIAAEFVRRQTKAQNVWIS
KPTWPNHNAIFNAVGVTIREYRWYNPETKALDWDNLLADLNNANPGDVVL
LHGCCHNPTGIDPTPEQWKALAEMSAKNGWLPLFDFAYQGLANGLEEDAV
GLRTFAETHRELLVASSFSKNFGLYSERVGAFTLVADNADVAAVALTQIK
SIIRTLYSNPSAHGARTVATVLANPELRKEWEDELTSMRDRIKQMRKQLV
ELLKEFGAQEDFSYIIDQKGMFSFSGLTAEQVDRLKEEFAIYAVRSGRIN
VAGITEANIRYLAESIVKVL
>MS0762 tyrR, TyrR protein
MFTVKGYDEGNYFIRSIVGKTMSKNTAKRSAHFTVNQYENFTDVVALSPK
MAALVEKAKKFALLDAPLLIQGETGTGKDLIAKACHNLSARKDQKFIAVN
CAGLPDTDAESEMFGRADGDKTSTGFFEYANGGTVLLDGVAELSLNLQAK
LLRFLNDGTFRRVGEEQEHYANVRVICTSQISLQHYVDEGKVRSDLFHRL
NVLSLQIPPLRERKEDLAVLTENFVRQISRRLGVRTPEFDGQFLQYLKDY
QWPGNVRELYNALYRACSLAEHNKLTIDGLNLSENETVPLTLEQFGNESL
EEIMNNFEASVLRKFYEQYPSTRKLASRLGVSHTAIANKLKQYGIGK