Gene list
Applied filters:
COG category: General function prediction only
Gene type: CDS
Genomic element: chromosome
Number of genes found: 272
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Mannheimia succiniciproducens MBEL55E, MBEL55E >MS1428 unknown MNMNKKIVMILKILLAVIVLLTGAVWAFMTYHPVWGGTPDEGSMARIRAS KAYNATLGKFENQEPTQLLTTDEKPSITTWITRLMAADEGKNPSEPLPSA AFDKNVLKDGEMVWFGHSTVLFKLGGLNVITDPVFHNASPIPYIGISPFK TEHSYSVESLPELDIVLLSHDHYDHLDYRAIQELDSKTKHFIVPLGVKAH LQRWGVADDKITEMDWDEQTKIGTLAITLVPARHFSGRTLNIKDPTLWGG YIIQSPELKYYYSADSGYGKHYRETIAKHAPFDFVMIENGAYDKKWALIH ETPEEALQALKDIGATKVLPIHWGKFDMANHVWTDPINRLMKDVASQPEI SVATPKIGQIFHTQGDLPAEQWWQGVR >MS0521 unknown MRALKKISQLLAKNTALVIILTALFTFIVPEAFTWVKGDAQVLVLGIIML SMGMTLGAKDYQILAKRPLDILIGTVAQYTIMPFVAISIAQAFNLSPGLT LGLVLVGTCPGGVASNIMSFLCKGDVAFSVGMTTVSTIIAPVMTPLLLNY LVGETIDMDGWGMFKFMLLVTILPVGLGSLFNMGCHKQKWFNDVRSVMPG VAVIAFACIVGGVVAFQGERFLESGLIMLMAIGCHNITGYILGFAAGRVF GMNTAKKRTLSIEVGVQNAGLATGLSAKFFPTNAESVVACAVACVWHSVS GSVLANIYQWWDKKHGEPVTEIHEIKKPVTESV >MS0214 unknown MLKINYERRLVMKEVSSKSLNDDELALLDNLLLEYANEESDEGIFTLSEL DGYLTAIISSPMLIQPSTWIPAIWDNDLPEWENEQEMAMFFDLLFRHYNS IIMMLQTGLEYYSPCFEYSNFTDGDYPIVDDWCFGYMRGVKLADWQNLPT KLQPYLKLIEDQTHLHSSLDDYVSPSLQEQNELADRLIEAAVKIYRYFR >MS1705 unknown MEIANNLKQIHKNIVSICQNAGLPSNSVKLLAVSKTKPVEDLEQAYQAGQ RAFGENYVQEGVEKIEFFQAKHPDMEWHFIGPLQSNKTRLVAEYFDWMQT VDREKIAIRLNEQRPANKSPLNVLIQINISDEESKSGIKPADMMALAEII ENLPHLRLRGLMAIPAATHDVAIQAQSFSAMHKLFVELQQSLPNQRIDTL SMGMTDDMTAAIKCGSTMVRIGTAIFGSRN >MS0034 unknown MLNYKPLTEKSGFFCILRTLMTQYIIAQTNKGVQLGITAKMANRHGLIAG ATGTGKTVTLRKLAEAFSDDGVPVFLVDVKGDLSGLTVKGTLQGKIAERV EQFNLGGENYLSGYPVSFWDVFGETGIPLRTTISEMGPMLLSRLLNLNAT QEGLLNLVFRVADDKGLLLIDLKDLRAMLKFVAENAKEFQVEYGNVSAAS VGAVQRALLTLENEGATNLFGEPALNLEDWLQTRDGRGVINILNSEKLIN SPRMYSAFLLWLMAELFERLPEVGDPEKPKFVMFFDEAHLLFDGVPSALV DKVEQVVRLIRSKGVGIYFVTQNPLDLPDTVLGQLGNRVQHALRAFTPRD QKAVKSAAETFRANPQVDVVETISTLGVGQALVSFLDEKGMPTPVEIVGI FPPKSQLTPLTNEQRTDWVKDDELYPHYRDLVDNESAYEILNDQSVQAQV QQQVQDEENSDFFSGMISSIFGTKKKSRQTVAEQMVSSVAHQVGRNLRNQ VTKQILRGILGAITKK >MS0102 unknown MSKQQTVELDLNPIMQALSRTPMVLLGYQKRWCEDTNPVKVVEKSRRIGL TWGEAADCALLAASNSGMDVWYVGYNKDMALEFIRDCANWAKFYGLAAGE IEETEEVFKEGDEKESILAFTIRFASGWRITALSSSPSNLRGKQGLVIID EAAFHPCLSELLKAAMALLMWGGRVHIISTHDGVDNPFNELIQEIREGKK PYSLHTITFEDAMKDGLYERICLRTNRAYSKEGEQQWEAEIRASYGEDAA EELDCIPKNSGGKWLSRALIESQMHSHTPLVRKEMARDFELIDEPVRAKE IAQWLQEEIQPLLDDLDKNRPHFLGEDFARKGDLTSLVIAAQQPNLTNEI QFIVELGNMPYAQQEQIVLYILKALPLFSGAAFDGGGNGGSLAEKARDAF GESLIHIIQLSEKWYKENTAPFKAALEDGTLTKLPKNADVLADLRAFEIV RGVPRIPDKRAKSVDGGKNKRHGDTAIALLLLHFATRQDVRLPVVAVTRR ARRSQTISEGY >MS1050 unknown MTIKAIIFDMDGVLIDSEPVWKQAGIDIFNAEGIPVTYDDMLALTGIPSL GIVKAVYEKYQRSPVPVAEMAQRLNDHAISLILAQKPLIDGVQETLQKLT ALGYKLAVASASPRILLEEITQSCGIDQYFSYLSSATELSHNKPHPAVWL HAAEMLGVEATECIGIEDSVVGMVSVKAASMKCIVVPGVLGSDDPRWALA DIKLATLREIDETVIGKLDSI >MS1709 unknown MQAEKHLKWAAEQNDDRIAFRLDGELSRDTLLPLWNEFQKREQRSSFLSE RQIADKNISWDLSQVSRIDSAGFALLCDLLHYCQAKKNADKTLLLENVPP QLLTLADLVGLADWIKPYLK >MS1493 unknown MNINVISIFKLLLLFALGLVILSPALSTQIGVPRLDSALCFLFFFLAVIT PFLRDMETDFFKLQFPVYVLFFFGFLSVLNAFSTEKLVDLFFFGIVMFLF HYSFLTFNRGDGEAGIRHLLLGISLIVLAGFFIEALLGFQLVSGNEELTV TDKAFKGFFFNTNDQSVIMISLAVAVGFFYIIRENNWKIKLIGYALIFIM GLAIVISASRSVLLSYLIMLMLILFLNASAYFKAVYLFFACVIALFIFNL SWLQEVFILLAKIDWLERPIERFSLVIFSMGDDKSVGYRTEIYTTFLDNF KILWLGYGPRDYIQYFDQIKLSFPLGYTNPHSFFIELYLAFGIFAFLAFI YFLLNSIIYVMNTRLLAWKERIFILFVFINFCWIVWVPSSILRLPLVWYP LFLVLVYTVLVKNGTFVSPKLVGRRRSS >MS2129 unknown MKFKLKALTATLFLGSSLLGANAMAQLPQNATAIEVPAQSIQLTQEWDKI FPKSDKVEHRKVTFKNRYGITLVGDLYVPKGATGKLPAIAVSGPFGAVKE QSSGLYAQHLAERGFVTVAFDGSFTGESSGLPRNTASPEINTDDFVSAVD FLGSLDNVDREKIGVLGICGWGGFALNSAISDPRIKAVATSTMYDMTQVM ADGYEIKMEPNPKVPYERTSPMTTEARYKMKQDLANARWEAAANGYSLNG KAEDHLTPQDKITAETPRFVREYSNFYKTKRGFHPRSVNSTTGWNTAMTP SFINMPILQRAGELKAPALVVHGEFAHSRYFGEDAYKALGSKNKELYIVP GANHTDLYDDVNGKIPYDKFEQFFKANLK >MS2095 unknown MKDLTAREFGYGHPTPLFMIGTYDEDGRVNFMNSHWGALNHGGYINLNIN TNKKTHLNIEKMKAFTVTLATEKLMPYADFFGTYSGFQYPDKFEKSGLTA HKAKYVNAPIIDGSTLVIECELVEILYQEHIHTIIGRVKNVSVDESVLDA QGKVDASKLGMIFFDSFSRGYFTLGERVGDAWSIGQSILNS >MS0023 unknown MIVLDFLLCRFLKDKEQNMSKVSEITRESWILSTFPEWGTWLNEEIELEQ VPANNFAMWWLGCVGLWVKTPQSANICIDLWCGRGKATKQVKDMVRGHQM ANMAGVRKLQPNLRNSVGVLDPFAINEVDAIVATHYHNDHIDVNVAAAVV NNPKLDHVKFIGPQYCVDMWTKWGVPAERCVVVKPGDTVKIKDLELVALD SFDRTCLVTLPARGAEDNGGELNGICPSDEEMGLKAVNYLIKTPGGNIYH SGDSHYSIYYAKHGKDYDIDVALGSYGENPLGIQDKMTSIDILRMAECLR AKVVIPVHHDIWTNFMASTNEILELYRMRKDRLQYQFHPFIWEVGGKYVY PRDKDLIEYHHPRGFDDCFEQEPNVPFKSIL >MS0991 unknown MTNTKKVYYAHSEKDLPHEQWQTFSSHAENVAKLAAQFAEIFDAYQLAYN TGLLHDLGKYTPAFDKRLHGGPSVDHATAGAKIAIERWGFPLGKILAFCI ASHHTGLVNGDGEGDNRSTLKQRLSVPFGKGNLPELDPIWQSELPLPEKL TFPALKPDPYYQPFALAFFIRMLYSCLVDADFLDTEAFYANLKQQDIDRG NAPSLDQLHQQFNRFISDFRERKKALQPQTEEEQRNAKLNRLRSQILDHA IAQAQQEPGLFSLTVPTGGGKTFTSMAFALEHAKKYGMLRIIYVIPFTSI IEQNAQEFRKAFGEFGEAAVLEHHSTFDDEKLLDKDTKDKLKLASENWDM PIVVTTAVQFFESLFADKSSRCRKLHNIANSVIILDEAQMLPLNLLLPIM QSIKELARNYHSSIVMCTATQPAIQTQHGFYRGFENVREIAPNPTALFAD LRRTSVQHIGMQSDKDLIDKLTENQQILIIVNNRRHARSLYEQAKQLDGT FHLTTLMCAKHRSQMLEQIRQHLQAGRSCRVIATSLIEAGVDVDFPLVMR AEAGLDSVAQAAGRCNREGKKLAEQSFVWVFQPEQQWKAPTELGLLSAAM RSTVRCYGDNLLSVEAISHYFSAVYEQKGKDLDNKQILAKCHAAGKTLDF PFQTIAKEFCMIESHMLPLIIPFDKEAEKRIEELRHAEKVGGLLRKLQPY TVQIPQKSLEALFKAGRIEAINEQQFGNQFYSLIGLDLYDEVAGLDWGDL GFITIENSVF >MS0665 unknown MNKYLKSDFIFSLFLSIAIMFICLYFEKSFFFVDDAQNEFLPFTRQIGNV WLNGEIPFILKNTFIGSNTMIDIHRAIFLPQNIFLSILSVKITSLKIISI IAAFINLFVMSFSALKLSEAFSLTKAAGIVLAFLFCINPIFLYFYLESWW NAAAGQAWFVASLASVAWLMRAFSIKRLLLNVITVLSIFASGWPHSVLVY GFLALIFSIFLYLNKRHNDLILFVLISFSIILIAIPLYSEYVISGDLINR QSFKFNNVGNFLSTTLNQLLLTFNVTYYHFMHRYGGYSITHIPMGYSSIY ILLLICFGSLKNIARNPNSLFLLVLCTVFFILTQTPTEIGPFRYPFRFTP YFSEVLTMLSIFSLEKLGIVKTRARVFLVVLLLSISLLLSIFSLEENFGK YAILQFLFFAVTTWYVVRYNSISLKSGLPYTAFIFLLMLLAKDSVIGYLS FPDLKNSINMENNYSQGGYILSLTNGKRPKNNLEDLNSTHFMLYGLKSIN GASPVGNKYISKTISTRSSQAFFNAKETILGLSKTYKDKCYFDLFGIDTV ILNKKDNSSLISQKLSDCGFSERKVKSHDVIYFLRNDFNAKGSVSYHSDT LSINQQISLKNNSEFYQLSGLKGDELIFNRVYWYGYRAYINDKEIPLLNY DGLLRIILDHDYQNGVLRLEYFPKSWKYALLIALSGFLLLLFSVGYMQRM RKWVSLN >MS1549 unknown MILTRYLTKEVFKSQVAILFILLLIFFSQQLVRVLGSAANGNVPADLVLS LLGLGMPAMAQLMLPLCLFIAILLTFGRLYAESEISVMRACGVGQRILVK VALGLSVLTAALAAYNVLWVSPWAIQKQGQIVEDARANPNMSALSAGQFM TSNDSDFVLFIDNIKDNKISNIYLFQTKEKGNSKPSVIVAENGELQSLPN GDQILSLQNSQRVEGSAALPDFRITNFTEYQAYLGHRNVDSDENETTELP LAELLALKTPAAKAELNWRISLILAVPLMALLAVPLSKVNPRQGRFAKIL PALLLYLIYFLLQSSLKSAGGAGKLDAGLLMPLVNLFFLLLGIMLNSWNS AFMYKIRHLFSKKSAI >MS1082 unknown MIAFGYITALSLSYFLLAPDFKGLSFTEYFIQSEAKPIFLTLGLLLPIGF IVMSKAVEYGGIVRTDAAQRLALFLQIIAAVILFGETLNNMRVGGVIVAF FALFCLLTKPTKSIENALKAVFALAAVWLIWGVTGILFKKIALMGGAFPT TLFVTFSIAAVLMFTYLLIKRTFWNASSLVGGIILGCLNFGNILFYIYAH QYFKENPTIVFATMDIGVICLGMIVGALVFKEKISKINMLGIVLGITAIL LLRV >MS2255 unknown MEKLFDINEQGLSVRCKLFYEKDVHSIENIVLILHGFGSSKEVKSNAKFG ERLITKYKNYGAIAFDLPCHGADARKKLSVAECLTYIQLVVNYAKEKLNA QNLYAYATSFGGYLTLKYIAERENPFRKIALRAPAIQMFHTLTANMTDDE RHKVAKGKEIMLGFERKMKIGKEFLDELEQGDIQQYDYLDYADDMLILHG TADEIVDIATSQTFAENNVIELIAVEGADHPFSNPQLMDLAIGRIVEFFH >MS0879 unknown MCYFKCLFKIKELFNMQTFLKFTNFMSKTFALWVLVFAFLAFQFPAQFAI FAPYIPYLLGLVMFGMGITLTFNDFGEVFKHPKSVFIGVAGQFVIMPAIA FCLAKIFNLPADLAVGVILVGSCPGGTSSNVMTYLSRGNTALSVACTTIS TLLAPFLTPAIFYILASQWLDINAGAMFMSVLKMVLFPIFLGLIVRAIFK KSISEISRTMPLVSVISIVLILSAVVAVSKDKIVESGLLIFGVVVLHNCL GYLVGFFGARLFKLNIADSKAVSIEVGMQNSGLGAALAAAHFNPIAAVPS AVFSFWHNVSGPILANIFANIKNDDKK >MS2191 unknown MMLKNIFAGLTVLLLSACTLVTYQPVDTISHVNAKQGYRMRNAIQQPDGN LIILMFSGGGSRAASLGYGVLEEFKNAAVRPTAKGTTLIDNVDLVYGVSG GSVLATYYSLYGRDAVPKFEENFLKKNFQREIISQVFSLSNLPRITSPQF GRGDLLQEQLDQTLYKGATFGDLERKRKGPFVVVSATDMNLGQKITFTQE FFDGLCIDLSKMEISRAVAASSSVPLLFSPLTLNNNGGNCHFDIPELIQI SQNISNDAQKSKNLEELKNTLSLYQNSKERPFIHLVDGGLTDNLGLSGLI DIYDVAGQEGMYREAVKNQLKNIIVINVNAQNEVSSEIDKTANVPGTRDV INTIINVPIDRNSQVSLRRFREFTDEWNKSMANKPPKQRINMHFVNLSLK DLPESQLKKEVLNISTSFYLLHSDVNKLKRSAKILLQQSKEYQDVLRALQ >MS0331 unknown MYMFVLFGLNSEHSQHINYFLYTGELSMKKLLKLSLVAGLAMTALAVQAE ERFITIGTGGQTGVYYVVGQSICQLVNRDTAKTQIKCNAPSTGASIANLN AIADKQMDMGIAQSDWQYHAYNGTSAFEGKKNEKLRAVFSLHAEPFTLMA RDDSGIKTFDDLKGKRVNVGDPGSGTRATINVIMAEKGWTDKNFKVAAEL KPAEMASAMCDNNLDAITYNVGHPNGALKEAAASCDSHLVPVTGPEIDKL VSEHSYYAKAVIPGGLYKGTDNPVETFGSYATLVSSTDVDADKVYAVVKA VFDNFDRFKRLHPAFANLKEEDMIKNALSAPLHEGAERYYKERGWLK >MS1036 unknown MTDKIYDLHCHSTASDGILSPSEIVQRAHEQGVQSLALTDHDTISGLTEA RRQAELLGVEFINGVEISTSWENKVIHIVGLNFDENSPEMTALLAKQAQL RLNRALTIGEKLAKAGVANAFEGASALAKGEVTRAHYARYLVQIGKVANE NQAFKRYLSQGKSCYVKAEWCDIPAAISIIKQAGGIPIIAHPLRYTMTAR WIKRLIADFKNWGGEGIEVSGCGQTADQRQLIARWANEFELLASVGSDFH FPCGWVELGKSLWLPENVTPVWSQFGDKPKYLQNTCKS >MS0758 unknown MIIYLHGFSSSRPDDYENVMQLKMIDPDVRVISYSTVHPRHDMTYILNET HKLVSETQDDKPMICGVGLGGYWAERVGFLCGVKQIILNPNLFPEENMEG KIDRPEEYLDIKTKCIEDFREKNQSRCLVFLSKNDKVVDPKRSEALLSHY YEVIWDDTDAHQFKHIAPYIQRLKEFKAA >MS0522 unknown MKFYRTLEDFKVISFDLDDTLYDNSQVILDAERHSVDFLREISQIPQLDG GYWRYWKNKTALDFPLLAEDVTQWRIKTIVELLRAHQKSAVEIERISHAA MEDFFEWRHKMQVPQQSFEVLNKLKRQYKLAALTNGNVTPSRAGFDQFEL VLTGGVQGRAKPHQDLFRQTAGYFNVRPHEILHVGDNLVTDVQGAIQAGC QAVWINLSDKKIQHFSEATLVPTFEITDLNELLFFRNL >MS1651 unknown MSMQIKSIQLAFSVLYHYFDECVKKALSNLPIQRVDIPDSLLAQAESLVL GKLPSEKKKDLPLTSIFEGISQQNNSQQYLYDFKPLSPDSIFPDLQRNEG HQPFELWQHLAKAVEEIPTSHRENINLWLDHFDTALQCYTSQITCPYDQS ISFYDFTKAVAAFVVASMDKSADKNRPFLLIQGDFFGVQDFIFSGGSQSN KQAAKLLRGRSFQVSLFTELAALKVLNACDLPATSQMMNAAGKFLIIAPN TPEIHKKLDDVQKELNEWCIKNTYGLIGLGIAKMSAGKVDFEQKNYEKLI KLLFENLETQKLKRLDLTDTTQSVQEESYPNGVCEMNSFFPALPNSNRSI MTEDQVKIGELLAKKQRIIVCDVGTEINNSYRTQTLKLDMFGYNVIFTDS RKDTKDFGHPVKLYQIHRFWDFSLAKNTKDELWNGYARRYINAYVPFDEQ EQIKTFDEIAQADEGINALMTLKGDVDNLGTIFQKGIQPANIAKMAALSR QMNQFFSLWLPAYCAEYSPNMYTVFAGGDDFFLIGPWHSTQKVAFEMQQA FKRYVAENPEIHFSVGMVMSKVGLPVPRLGDLAEMALEKAKSIDSGKNAV TIFNRTVKWTDWQQLCDLEDEIHRLAKDYNISTSYLYSLIRLCEQANDKN NIESTMWRSHFYYRTARYVIDKLNQEKRDKALNEITISLGENGISQYKIN FAIPLTNYFYQKR >MS1440 unknown MEIAFLLAGKIIELTIIVLLGYALVKSKLLKSQDSYPLSIIGLYLISPSV MINAFQIDYSPQILNGLLLSLTMAVFLHIILIITGVILKRLLNLDPIEHA ASIYSNSGNLIIPLVVSMFGQQWVIYATCFIVVQTFLFWTHCRSIICGKG SISILKMFKNINILSIFLGVFLFAFQIKLPPLISGTLSSLGQFIGPNAML IAGMLIASIPLRNIITSKRIYLVTFLRLILIPIFLLIIIKLCGFDNWVEN GETIAMISFLATMSPAAATVTQMALIYGKNANKASAIYGVTTMLCVFSMP LIIALYQLI >MS1680 unknown MMKTVTIDLQIASEDQSNLPTLEQFTLWATNAVRAEHFEPEITIRIVDEA ESHELNFTYRGKDRPTNVLSFPFECPEEVELPLLGDLVICRQVVEREAQE QGKPLTAHWAHMVVHGSLHLLGYDHIEDDEAVEMESLETEIMTGLGFEDP YSYDEE >MS1976 unknown MLSYRHSFHAGNHADVVKHIVEMLIIENLTQKEKGFYYLDTHSGVGRYRL FSQESEKTAEFEEGIARLWQRDDLPEEVQRYVDLIKKLNYGGKELRYYAG SPLIAAQMLRPQDRGLLVELHPTDFPLLRNNFKEFKNISVKRDDGFQQVK ATLPPKERRGLVLMDPPYEMKEDYDLVVNTIVEGYKRFATGVYAVWYPVV LRQQSKRIVKGLEASGIRKILQIELAVRPDSDQRGMTASGMIVINPPWQL EAQMKKILPYLTNVLVPEGTGSWSVNWIAPE >MS1279 unknown MVTGVMFEPEPIYDELDKKPAELTHDQPLGFTDVVDNAKKEAKTTKTANK KTKDKKSASHLRIVK >MS1854 unknown MRWQGRRESTNVEDRRSERSGISMGGKKTGVLGFIILLVGAYYGVDLSGL VGTSSNIGEVGSSLSQNEEETLEKLSRVVLADTESTWQDYFARSGQKYSA PTMVLYNGATPSACGTGQSAMGPFYCPNDHKVYLDLSFYNDMKNQLGAGG EAAFAYVIAHEVGHHVQNLTGILPRISRLQQSNPAQANQLSVNLELQADC FAGVFGYQAVKNNMFEASDLEVAFAAAEAVGDDRLQKRSQGYAVPDSFTH GTSQQRLTWFRKGLQTGDPTQCNTFTN >MS2381 unknown MRKQLYITHGYTANSQSHWFQWLKNQLIPHQIHTNIFDMPDSSKPNPQIW LAHHQTYINQCDENTVFIGHSLGCIATLRYLQRQKKKIKGLILVAGFDEP LDNLPELTSFTLQRIYYPELIANIPQRIVIGSSNDEVVAPKYTQKLAANL QASYLTVENAGHFLARQGFTEFPLLLKECLNIFNG >MS0857 unknown MFTSIQREVNQFINRGLDRTLRIAVTGLSQSGKTAFITSLINQLINIDNV TNGHLPLFEAARQQRIVGVKRIPQINLNIPRFDYEANLNSLMASPPQWPQ STRGVSETRLAVRYHNSGLFSHIKEKSTLYLDIFDYPGEWLLDLPLLNLN YQQWSLEQQNLRQGLRAELAQTWLEKTKKLDLTAMADEDILAQIAKDYTA YLQACKEQGLHFIQPGRFVLPAELEGAPVLQFFPLLHLAEKDWKKLKEEA KPNSYFAILNQRYDYYKNKVVKGFYENYFVHFDRQLILADCLTPINHSRQ AFQDMQEGLQQLFKNFHYGKRRLINRLFYPRIDKLMFIATKADHITSDQI PNLVSLMRQLVQDGGRHVAFEGIETGFTAIAAIRATKQVLVEQEGKTFKA LQGIRSKDKRQVTVYPGSVPSRLPSIDFWQQQKFDFDQFEPQPLESGEII PHLRMDSVLQFLLGDKLA >MS0254 unknown MTKLIHLTQYKLIELTGVDSEKFLQGQLTCDVTKLKTGDSTLTAHCDPKG KVSSVFRLIRVAQEQFYLLFRTDLLPAGLDQLKKYAVFSKVAFAEPEVQL AGVIGENCGQFSASFVVNSGNAAILINPAERLEFNASAEAWDCVEIQRGY PILSAKTQNEFIPQALNLQCIEQAVSFQKGCYIGQETVARAKYRGTNKRA MFIFKARSQIIPEIGGEIEMRLENGWRKTGVILSAVNFGEVLWLQVVLNN RLEDGQQFRLPADETALELYPLPYELV >MS1093 unknown MIMHFACPDTKRFFNGERFVRFISCERLAIRKLQQLNAATSLEFLTKLPN NKLETTLYNHVSYYNLKINEQWSLLFLWDHNSPTDVKLVDMKEV >MS2124 unknown MGFNKYYLKMEEHFLYVPYYNHQRRIRVLLPKDYYKEDWQSYPVLYMHDG QNIFYSKESYSGYSWKIIPTIKYHKEFPKIIIVGIDNATVDRLDEYAPWR TDVGNTAEARNTGGKGAEYGQWVVETVKPFIDGHYRTKPQRENTLLAGSS MGAIITAYMGAAYPHIFGHLGVFSLASWFSENEFLRFMHEHPIDRASRVF IQVGTKEGDDADAQYISNMNQAYIDSTLYYYQALIRTGHPLDNIRLKIMA NEIHHEKYWASHFVDFLRFSLMGK >MS1018 unknown MEKIMENQSFLQNFFKLNQHKTSTKTEIIAGITTFFTMVYIVFVNPSVLG DAGMDKQVVFVTTCLIAGFGTMAMGLFSNLPIALAPAMGLNAFFAYVVVG KLGYSWEVGMGAIFWGSVGLLILTLLQVRYWLMASIPLALRVGIGAGIGF FIALIGFKNMGLVVANPATLVALGELHDPKVLMGILGFFIIVVLAARNIF SGVLVSIVVVTALALQFDENVIYRGLVSMPPSLDAVVGKVDIAGALDIAL LGIIFSFLLVNLFDSSGTLLGVTDKAGICDERGRFPKMRQALYVDSVSAV VGSSIGTSAISTYVESGAGVSVGGRTGLTAVVVGVLFLLTIFFSPLAGLV PAYATAGALVYVGILMASSLIKVQWEDLTEAAPAFITAAMMPFTYSITEG IAFGFISYCVMKVGTGRWKEVNAPVWVVSVLFLIKFIWIG >MS0714 unknown MISKIKVSLVALCAGLFFVSVNTSAAETQTQVPQQCQKLFSATERLIEEA EKQPGTHTQVSKIKNKLNQSKKQILEMELATQIKSCDHGLARLNRLNQQD QITN >MS0847 unknown MADSRIVLDAREQSTSLLSTHKVLRNTYFLLGMTLAFSAFVAYISISLGL PHPGIIVTLVGFYGLLFLTNSLANSGWGILSAFAFTGFLGYTLGPILNVY IGAGLSETVVLALSGTAAVFFACSAYVLTTRKDMSFLSGMIFSLFIVLLL GMVASIFFQTPALHLAISGLFVIFSSAAILFETSNIIHGGETNYIRATVS LFVSIYNLFLSLLQLLGIFGGDD >MS0295 unknown MAIVSVPVEKSYRLLNIGATTLVSAKAEDIENVMSVAWSCALDYGPLSKV TTVLDKQAFTRGLVEKSGLFAIQIPVANQAELVVKLGTTSRHNNPHKIDD VEIFYPDGFDVPLVKGCAGWIICQLIRDENNQQNHDLFIGKVLAAYADDR VFKDAHWIFEQAPNELRTLHYVAGGQFYLIGESLEVK >MS1916 unknown MTEKINLMNLTRQQMREFFKELGEKPFRADQLVKWIYHFGEDNFDNMTNI NKKLRDKLKAVAEIKAPEIAVEQRSADGTIKWAMQVGDQQIETVYIPEAD RATLCVSSQVGCALACTFCSTAQQGFNRNLTVSEIIGQVWRASKVIGEFG VTGIRPITNVVMMGMGEPLLNVANVVPAMELMLDDFAYGLSKRRVTLSTS GVVPALDNLSGMIDVALAISLHAPNDELRDEIVPINKKYNIKMLIDSVNR YLSVSNANHGKVTIEYVMLDHVNDSIEHAHQLAEVLKNTPCKINLIPWNP FPQAPYGKSSNTRVDKFQKTLMEYGFTVIVRKTRGDDIDAACGQLAGDVI DRTKRTAAKRQFGQNIDVQLQ >MS1514 unknown MTELTHYNQYIADENAMIAFGQQLIQAINKLDNNKPVVIYLNGDLGAGKT TLSRGMIQGLGHQGNVKSPTYTLVEEYHLQNKHIYHFDLYRLSDPEELEF MGIRDYFGTDTICLIEWAEKGIGLLAEPDLIVNIRYADNARDIDLIAQNA QGEQIITLLAAK >MS1718 unknown MRQFMELIIISGRSGAGKSVALRALEDMGYYCVDNLPINLLPELADILST SQQSAAVSLDIRNLPHSPETLDTLLQQLADAQHQVRIIFLEADRSTLIRR YSDSRRLHPLSMQDLSLEAAIEAEAGYLEPLLQNAELVINTSEISTHELA QRLREFLKGKPDKELKIVVESFGFKYGLPLDADYVFDVRFLPNPHWNPDL RPMTGLDQPVIDFLGKYSEVNNFIYSTRNYLETWLPMLEQNNRSYLTIAI GCTGGKHRSVYIAQQLGEYFQAKGKKVKIQHKSLEKHHKKNSA >MS0914 unknown MNNNGNNMTTKTERQTWSSKITYIMTVAGATVGFGATWRFPYLVGENGGG AYVLLFCLAMILIGIPMILVENVIGRRLRVNSIDAFGDKLQDENISGGWK IIGYMGLLGAFGIMAYYMVLGGWVMNYIISLISGILDISTPITKETAKEF YDFSIGNSPLHIALYTFIFVIINYIILAKGIIGGIERAVKFLMPLLFVFL IGMVIRNVTLPGAMDGIIYYLKPDFSKITPKLFIMVLGQVFFALSLGFGV LITLSSYLSKEENLIQTAVITGFTNTIIAILAGFMIFPSLFSFGIEPNAG PTLVFQSLPIVFSHLWSGTFFAIVFFSLLLIAALTTSITIYEVIITALQE KLKMRRSKAILLTLGGIFLLGNIPSILGDNLWKDFRPFDKSIFDAFDFIS GNILFLLTALGCAVFVGFVLKDKAKAELSPTPDSLFTTVWFNYVKFVVPL IIIVIFVSNII >MS2167 unknown MNVPILYFDVVHSIQIHDWIIEKSGGLAGLYPDGTGKLESVLEHIQNDLY YPNFEDKLVHLIYSINKLHAFLDGNKRSSIVLGSYFLELNGYDYCVKEFT IKMENIVVWLAESKISKELLLKLVCSILNNEEQYSDELKYELICATSDDF GN >MS2014 unknown MGTTNAGLPKNYRIKLGVIKTPLYSNESISSWLIRAALDCGTEPITFTGF YWNKWRLWTYDLDRGFEPIAQHIYADITELSLNQQVNLVNHSLYSVLRPI NGKNTLIKGQAKWVLSRGSRNRSFRVGQSYCPCCLEETPYLRNEWRFAWH FGCLKHKVLFGSKCSCCGGLYQPHLLSAEKRQLNYCHQCGEKLQVITTPL NEVEIATMETLDKVFTTNSGECFGKRVDAQVYFAVLRYFINLVRRTAVAK STHAFARFVEECGISQAEICQTRTALAFEQLPVEERKNLLVNAIKILNLS SKDFIQATQQSAITQKAFAFENYPMELDTLFKYASKGKTVSRKTVTNKPK TDSVLSMNRQWERLKRQLKIAA >MS1274 unknown MNLQEQLKNAKNWEERYRLIIQAGKNITKPTEQELAEMQPLSGCEAQVWF KISQNSDRTLHFQAYSDARIINGLLWILSLAVNGKPTEQCRRFDLTSYYA ELGIAQRLTSTRLNGLKQIEGCIHQAGN >MS2174 unknown MIMFKKLLIATALCASFSAMADDSFTLKVKGVENGKFQNKHLLSAEYGFG CAGENISPEIEWKNAPKGTKSFVLTVYDKDAPTGLGWVHWEVVNIPANVS KLPAGIDAKDNNLPKGALQTRTDFGVPGYGGACPPENEKHRYEFTLTALK VEQLPNVTADSTPALVGFFTNANAIAKAQVTVETAR >MS0732 unknown MINLKIDGFDVRVDEGTTILEAAKSVGINIPTLCYLKDVSDIGSCRVCVV EVEGFEKLPTSCNTLAQEGMVIRTQTDKVVKSRRMALDLILSHHNLICFS CPSNGACELQNVAHQCGISESSFPNFRLPGIEVPHVEDNPFLGYRPDLCI HCQRCINTCANVSGCSSIKLASRGIFRAIETPFGKDWKETTCESCGNCAE ACPTGAIYKKEAKSYRSWEIQRVRTTCPHCAVGCQYDLLVKDNKLVGAEG VDGPSNGGRLCVKGRFGSYKFVMSGDRLTDPLIKDRATGKFRKASWDEAL DLVASKFMTLKRQYGGDSLAGFACSRSPNEDIYMVQKMVRTCFGTNNTDN CARVCHSASVEGLARTLGSGAMTNPIYDITHDVDAILLVGSNPEEAHPVI GMQIREAVRNGTKLIVVDPRDIGLTKQADIHLKLRPGTNIAFANGMCHIF IKEGLIDEKFIAEHTEGFKELKKIVKDYTPEYVAEICGIDADDLRAAARI YATAKKAPIIYCLGVTEHSTGTEGVMSMSNMAMMVGKIGREGCGVNPLRG QNNVQGACDMGASPNQYPGYQSVKDPEIRAKFEKAWGVKLPAHIGLHATD VFPAAIKGKIKGLYICGEDPVVTDPDTNHVINALKSLDFLVVQELFMTET ALLADVVLPGRSYAEKDGTFSNTERRVQRVRKAITLPGNSRLDTDIICEL MRRMGYNQPNLTASEIFDEMASVTPSFRGISYERLEKEPTQSLQWPCTDQ YHPGTPIMHVGKFARGLGLFYPTVYTPAKELPDAQYPMMLTTGRILYHYN TRAMTGRTEGLMEIAGHSFIEINSADAKRLNIENGERVRVTSRRGTITTE ARVSDKTNEGETWMPFHFADGNCNWLTNAALDQFARIPEYKVCACRIEKL PEDEAFNMKGKYITQKMVAAQWRKKMDKSIAKLVR >MS0084 unknown MSNVPKPDFSLCYEKTNITADIEPRLVQFTYTDHLEGQSDELTVEFEDIS GKWVRQWFPTQGDKLRAAIGYKDSLLVDIGEFEIDEVEYRYKPSTINLKA LSTGISKANRTLKPKAYENTTLAQVVAKVADSLKLKLVGKIKAIPIKRIT QYQERDVEFLARLAREYHHSFKIVGSQLIFTDKTELGKSEPVLILEERDT ISLSLRDRIKDTAKAVDISGFDASGKKVVKKRKKATALRPNLKQVKASSE DTLKVVTRGETQEQIDARGEAALAEQNDNQTAGNITLIGNPELVAGATIL LKNLGVFSGKYLIKSSRHSFGRNSGYTTEIEVRMLEFIADDLITLGMEKT NANA >MS0747 unknown MKELFATTARGFEELLKLELSSLGATECQVAQGGVHFMADDETQYRALLW SRLSSRILLPIVKTKIYSDLDLYSAVVRQNWLAYFDERVRFLVDFNGTNR EIRHTQFGAMRVKDGIVDYFERNGKARPNVDKDYPDIRIHAYLNRDDLVL SLDLSGEALHLRGYREDSGAAPLRETLAAAIVLRSGWKQGTPLVDPMCGS GTLLIEAAQMEAKIAPQLHRMHWGFDFWRGHNQAAWEKVKREAVAMAEAE FNKNPNPHFYGFDLDHRVLQKAQRNAQNAGVAHLIKWKQGDVAALKNPTP EDKGTVICNPPYGERLGTTPALIALYSVFGQRLKEQFPGWNASIFSSEQG LLDCLRMRSHRQFKAKNGPLDCIQKNYQISDRTLSPENKSAVENAGEFKP NANVATDFANRLQKNIKKIEKWAKQEGIEAYRLYDADLPDYNLAVDHYGD HIVVQEYAAPKNIDENKARQRLLDAVTATLAVTGVETNKLILKVRQKQKG ANQYEKLANKGEYFYVNEYGAKLWVNLTDYLDTGLFLDHRLTRRMVGQMA KGKDFLNLFAYTGSATVHAALGGAKSTTTVDMSNTYLNWAEQNLILNEAD GKQHKLIQADCLQWLANCAQQFDLIFVDPPTFSNSKRMEDSWDVQRDHIK LMGNLKRILRPNGTIVFSNNKRGFKMDFEGLTRLGLKAEEISAKTLPLDF ERNKQIHNCWIVEFV >MS1987 unknown MTEQNLLSSLAHMISEQRNPNSMNLDSLSPLELVTLINNEDKQVPLAIEK VLPQIAQAVEKIVRTFQQGGRLVYIGAGTSGRLGVLDASECPPTYGVKPE MVVGLIAGGERALRHPIEGAEDNAEQGKADLQQINFSKKDILVGIAASGR TPYVIGALNYAKSLGAITISIASNPDSAMASIADIAIDTLVGAEVLTGSS RMKSGTAQKLVLNMLTTASMVLMGKCYQNLMVDVQASNEKLRARAIRIVM QATDCEKEVAERFLKAADNNAKLAIMMVLTNLDKQQASVLLQRHQGKLSR ALSQ >MS0350 unknown MIIILSLVMQKKFALDHVLQHFLWVGELYGLCYDRQKLMKKLLNKKKDDM SRNIQELKNIVAKLRDPDGGCPLGSETIL >MS1068 unknown MADWILILSVLILRRFRKIYAWQDNKNIGDIMSTSHYVSPKGSMDQLSHM EIDLLTKRAQSDLYKLFRNSSLAVLNSGAINDDSRALLNKYPNFEISIIC KERGVTLKLDNSPESAFVDDKIIRNIQYNLFAVLRDILFVNALMQRFGLD AERGNSFITNQVFSILRNAKALSLNEDPNLVVCWGGHSINQTEYAYCRAV GLELGLRELNIVTGCGPGVMEAPMKGAAIGHANQRYKQSRFIGITEPSII ASEPPNPIVNELIIMPDIEKRLEAFVRMGHGIVIFPGGPGTFEEFMFILG IKLNPENRAQKLPLILTGPKESADYFATIDRFVLDTLGEEAQSLYTIIID DAVAVARHMKAEMVEIRDFRCKISDSFSFNWSLKIEHQFQQPFLPTHENM ANLNLHLNQSTVDLAANLRCVFSGIVAGNIKPATQDQIAEKGKFQLYGEP RLMEKVDNLLQDFIVQHRMKLPTDEAYEPCYEICK >MS1363 unknown MKTDFLSSLIFSVGVTLPTILLLILGMLIRKKKMIDDRFCEQSTKVVFNI TLPVLLFFSVYGKHVDYISQMAVLSVGIIGTISLFLLAELFAARFIAEKR ERGTFVQAIYRGNSGILGLAFCISAFGDSAAVPASIYSAAVIFLYNILAV ITLTRSLSTGSVSVVSIMKGVIKNPLIIAILFALIANSISLQLPAPLLST GNYLANMTLPLALICTGATIDLSVFSNKTSNVVLMGSLGRLVVTPVFMIL IGKVFGLDGMLLGVVALMNTTPVASAAYAMVRAMGGNSVTVANIIGITTV GSMITSSLMLLILSQAGWI >MS0298 unknown MATNYYDITLALAGVCQSAKLVQQFALEGKADEEAFNTSLYTLLQTTPKD ILSVYGGHERNLKLGLETLLEQLNGSTEDITRYWLSLLALSGKLEKNAQA KSELARRIQYLPTQLEHYDLLDEQMLANLASMYVDIISPLGNKIQVKGSI EVLQQTSMHHRIRACLLAGIRSALLWRQVGGSKWQLLFSRRKIFNMAKQI YSSL >MS1427 unknown MQVISGHWGEMLPFYLQRLDDSIPQAATGLKRSITQTFKEQVFVTPSGML TLPHFNFIYELVGADRILYSIDYPYQTLDGARAFIENLPISQAEKELIAY KNAEKLFGLG >MS0894 unknown MNKLALYCRIGFEKETAAEITEKAAEKGVFGFARVNNDSGYVIFECYQEG EADRLAREIPFNQLIFARQMIVISDLLENLPPTDRITPIIEEYNRIGSLV NLHRTTELFVETADTNEAKELSVFCRKFTVPLRQALKKQGYLAFKEVKKS GLTLHIFFVKPNCCYVGYSYNNNHSPNFMGILRLKFPPQAPSRSTLKLHE AILTFLSPEEERKCMNESMYGVDLGACPGGWTYQLVKRGLFVYAVDHGKM AASLHDTGRIDHCPEDGFKFQPPKRSKIDWLVCDMVEQPIRIAALIAKWL VNEWCRESIFNLKLPMKKRYAEVQNCLQLITNELDKAGFKYHIQAKHLYH DREEITVHISVKK >MS2252 unknown METFIHGFLVCGGLIIAIGAQNAFVLKQGLLKNHILAVILTCFICDIVLI SLGVLGLGSLISESREATVALGIVGALFLTVYGARAFRSAYLGNSSLEIQ SQRQDNTSSAWKAVLATLAITLLNPHVYLDCFAIIGGIAGTLTPDQKILF LCGALCTSFLWFFSLGYGARLLIPLFKRPITWRILDFVIGSVMWLIAFGL AKYAYQLA >MS1997 unknown MTEFKLNYHKTHFMTSAANIHQLPKDEGMEIAFAGRSNAGKSTALNALTN QKNLARTSKTPGRTQLINLFEVEPQYKLVDLPGYGYAAVPEQMKLQWQKS LGEYLQHRECLKGVVILMDIRHPLKDLDQQMIEWAVSSDLPVLLLLTKAD KLSQSARSKQVKTVREAILPFQGDVQVEAFSAQNKIGIDKLAAKLDSWFS SLLTE >MS0416 unknown MLDVGLISYFSLVDLSIAFQHSKRNKIKKCIDHKFIGKYKKIVRGRKMYD LITMNQYDALIFDMDGTIIDTMPSHAKAWEKVGEVLGYPIKGDVMYEFGG ATTKIIAQETMRRYGVPAELLEQVVTMKRQFGQEMVLQNATLLPTMQVLE HFLGKKPMALGTGSHKAMVDMLLQRFDLNDYFSAVVMAEDVQKHKPDPET FLRCAELMKVDPVRCLVFEDADFGVTAAHAGGMDVFDVRINQIMKVS >MS1448 unknown MTTKKQAVFSRLVNELVQKNQGKRIFSFDFENQTYWVKQPEKLTGVWKIL KPHPKQSFREELHILKNLYERGAPVPQVILSGEDFFVLKDVGPTLNHWIE NAGLNLTPAEKNQILVDAIKALTSLHKKGVTHGRPAIRDIAWRQGKVTFM DFESHSRSLNLQWHKIRDVLVFIHSLCRSKHLSGEQIQYLINKYEEYCES DLWQDVLNLVAKFRFLYYILLVFKPVARMDLIAIYRLFQYLLPLTEENK >MS0108 unknown MSAPLTFQQVFDRVVGHEGGYVNDPHDPGGETNWGITKYTARENGYTGSM KAMTREQAYKIYEKAFWQRYHCEKLPEAVAFQFFDAAVNHGVGNASRMLQ RAVNVADDGIIGKVTLSAVEKMPISDLLLRFNAERIRFYTKLKNFPRYGK GWMNRIAGNLAYAAIDNEV >MS0407 unknown MLTFFIITLIVGSIVGFLAGIFGIGGGLVIVPTLLYLLPMVGVPDEKLMA TALGTSFATIIITSLASAYRHNKLGNVVWEAVKYLAPTLVIATFISGLFI GKLPKDISSKLFACLVVYLAAKMVLSIRNKKSKTPAKPLTPQSTILGGIL IGIASSAAGIGGGSFIVPFLNSRGIEMRKSVGSSSFCGAFLGLAGMLSFM IGGWSVEGMPDWSLGYIYLPAVLGITLTSFFTSKFGAEMANKLPVASLKR YFAIFLILMAIKMLIG >MS1070 unknown MYLVEVFFKNINEDNLPQQIPLINQLIDQWRYNGQIIGREIPVFVANQEN ERGLATRVICPEQQSLLPEYNNAEVNRCLANIENCGLILHSFQIVGEDLN SDITYEDKKPDWQILYTTYLQVCSPLHSGDRLAPIPLYKQLKDVPHLSMD VIKWQENWQACDQLQMNAVALESQALREISDINSRIFKHGYSLTKEIEEH TGVPTYYYLYRVGGKNLASESARHCPICHGDWKLAQPLFDQFHFKCDHCR LVSNISWNFL >MS0318 unknown MRNNMSEQKITFADQKRKTVETAEFTEDGRYKRKVRSFVLRTGRLSEFQR NMMNDNWADFGLEHQNNYFDFAEIYGNTNPVILEIGFGMGKSLVEMAEQN PERNYLGIEVHTPGVGACIAYAVEKQVKNLRVICHDATEILQDCIADDSL GGLQLFFPDPWHKSKHHKRRIVQPNFVDNVMQKLQQSGFIHMATDWENYA EQMLDVLSQSKALTNTSKTNDFIPRPDFRPLTKFEQRGHRLGHGVWDLYF VKN >MS0091 unknown MSIAINQIVNANVYIDGNSQIGKAQQIKIPDIEFEMVDHKGLGLFGTIKL PSGAKAIEGGVNWDSYYPEVRAKLYNPFKNFQLQCRSNLQVFNAQGLAAE EPMVTIMNVSSVKIGGTDVESKENAKFDDTFAVHSIKQTVAGKEILFIDV FANIFRVNGEDVLSKYRTNVGQ >MS0906 unknown MNSNLIRLIFITLLSLGLTLISSFVLARLLSVQDRGLHQLFITAVSYVVT FATGGSGFALALSMRKKQYAGWQNYFIAFLALSVLAAIIAIYCFDFTAFH VLFVLNVVLTAILTMTLEKSKIDANLRVYRQLTLQQPVLLVAVYGICYLL LGEQPLEIAIELFTLFSAMQALACLYYLKKINADFKRKNEIQPIQKRFFL KTWFKQNLLQIFGATTASLDKFLIVYFLGNYTLGLYTVCIAFDSLITKFI NMLADYFYSGLLNNINRIKSVLILILLMAVGAVILVPLLAEPIIIFFFSA KYAEVAPVLILFIINAIIGGLSWVLSQNMLLLGKQVLLFTRQIIAIAVFV LLFYLFKDYQLYGVAYAFIGASLTRLIISVIYYLKYPITDVKPEKSAV >MS1909 unknown MQFIKNGRQYREATSQKISWGHWFALFNIIWAILFGSRYAFIIDWPSTLW GKLYFFISILGHFSFVVFAGYLLIIFPLSFIIKNERTFRGLSVIVTTICL TLLLIDTEVFSRFNLHLSSVVWNLLVNPEDGELSRDWQIFFAPMPLILLV QMLYSRWSWNKLRSLERQKWMRKVGIFFVTMFVATHLIYAWADAYIYRPI TMQKSNFPLSYPMTARTFLEKNGLLDKTEYAQTLEQEGRPEAFNIDYPKH KLAYMPIERKPNILLINISGMRYDSVIESKMPNLTEFAKQSAQFMNHYST GNNSNLGLTGLFYGLNASYTDSILHNKTESELFKKLQAEHYQMGLFSANN FKDSLFRQALFQKVNLPRIKAGNQSAVKNWLIWLNKAHLDQAWFSYLDLD VLTAVQNADPKSKEEETEIYDNQLGNVDVQLQIVFEQLQERGLLDKTIVI ITADHGHAFQLSDKEHIDYFGLDEIQVPMIIRWNALLNEQQSKLTSHVDL VPTLMQNVFKVENPITDYAQGESLINISRKADWILVGNYRWNVIISPNGN QYHIDRKGQYQKYNVDYEKESSLRPPLGLFLEVFTQSRSFMAK >MS2096 unknown MTIKSVEISKAYRLVQLGSTTMLSAKHDGDADVMAAAWVGLGGPNKIIAY IGTQAYTRKLVEQNGYFVVHIPTVQQMETVLYVGEHSKHTMPNKLDNLPL FYQEGVDIPMVEGSAGYLLCQVIPNPQQEQNYDSFMGEIVAAWADDRVFD GRHWTFDTAPDELRTVHYVAGGQFYAMGKGTKFDHGPGQD >MS1868 unknown MQKVKLPLTVDPVKDAQRRLDYVGYYAADQLVRLNESVVKVLSDAQVTLS FFIDPQKLVVMKGQAQVEVELECQRCGQTFNQTLECTFCYSPVANLSKID ELPEIYEPIEFNEFGEIDLIGTIEDEFILNLPIVPMHSSEHCEVSAQEQV FGELPEELAKKPNPFAVLANLKQK >MS0568 unknown MNLRVFLLMMKKCIRFIFLLLLMFAAAGFWGYNYIQKLVNEPVNIKAEQL LTLERGTTGKKLFALLEKENIIADNILFPLLLKLQPQFNNVKAGTYSLEG VKTLGDLLTLLNSGKEAQFALRFTDGETWKQVKKSLENAPHLKHELKDKT DVEVFHQFKEMLPEFEVQNAYKTLDGWIYPDTYNYTPNSTDVALVKRSVE RMVKTLEKAWAERDEDLPLNNPYEMLILASIVEKESGISAERGKIASVFV NRLKAKMKLQTDPTVIYGMGESYQGNIRKKDLESPTPYNTYVIDGLPPTP IANPSEDALNAVAHPERTDFLYFVADGSGGHKFSRSLIEHNKAVQEYLLW LRRNKNK >MS2009 unknown MPVFNAHVAQGKLTKEQKQGLADAFVLAIHDALNAPMEDQFVIINEHPQD NIFIHPTFPNMQRTDKRMVVTVDVSTTRTLEEKRKLTELVTKYAVEKAGI GQDDISLLIYALPLENMSFGRGILMPDDAEAMVKRTRS >MS1931 unknown MMLSPSEILKKTTALFAATICLYFACKLILMGTGFYPQPKLTDILLFAIL IVIFNSSKNLFYFLLLPFIIAHALYAPVGITFGAPSYQYIASVFATDLME SREFLSQLSIKNYLMPVGIIGLTLAFRWITQKYDLKLHKNKMFLASITAF MLLANSPFKFIDEISTSGTQVISELQRLNNMTIESEWGDSQLINSNYDDY VLIVGESARKDYHHAYGYPVKNTPFMSKANGVLIDGMTAGGTNTIASLKL MFTQPNTQTKEGNYSLNFVDLIKSAGIKTYWISNQGYLGEFDTPISAIAN KSDEKIFLKSGDSLNSNTSDFELLPKFTQVLERPSTGKRFIVVHLYGSHP ITCDRLNDYPKLFDDDKIAKKYFNVNCYISSIKKTDEVIKRIYDALAENK AKTDRTFSMIYFSDHGLAHQITEDNIVIHNSSGKSKRHYDIPLFKISSDD TKRHEYRVFKSGLNFTAGLAYWVGISNAKLAVREDLFSNEPDKDDYGLKA EIDKIDVPEDKAVVIPGTH >MS2154 unknown MQKLKIETQSGTLLDGVLFSQTPSKTVIIAITGIHGNFYSNPFYYNIGHT LSQSGIDFIYAQTRNAFGKTDFVNPKTGQPESIGSWNEDFAKTIEDLTAY VDFAEQKGYQHIVLAGHSLGANKVIHYLAETQDKRVAKFILLSPANVTHL TNAISEQQRAYIRHQVEKGNSQRLLPFELFGWLPCIADTAFQWLYSPLLN NVHVEPNSDFSQVAKIQHTGALLIGTLDRFTYGDPPGFLRNINNHFQSAD KNTLIFIENTGHTYQQKEQEVADKLLDLVKDWGY >MS1757 unknown MHIIKEKLAKSLMFVVIIALCITVMSIILFGINQFKIGSQLASVNQVSNL SHLLVRQQANLFSMLLVNNAGNEQLTDNLENLTKDKFVLDASIYGKNGEL LAQTRNTLDLREQLGLNEESSKHHVVNRQQIVEPIYSPNGIEGFLRVTFD SKYGQTTQNKINQIFHRLYGELIIVFLAGVILASSVHYFLSHYRRARRSQ ITEQINTVKEIKNSSALVFHRRRRRYR >MS1511 unknown MTKRKLTQNQKRRIHSNNVKALDRHHRRAKKEIDWQEEMLGDTQDGVVVT RYSMHADVENSQGEIFRCNLRRTLANVVVGDHVVWRRGHEKLQGISGVIE AIKPRENEIARPDYYDGLKVMASNIDRIIIVSSVLPALSLNIIDRYLVIC ENANIPAVILLNKVDLLTDEQWREAEEQLEIYRKIGYETLMISAISGKNM EKLTALLADGTSIFVGQSGVGKSSLINYILPEVNAQTGEISETSGLGQHT TTSSRLYHLPQGGNLIDSPGIREFGLWHLEPAQITNGYREFQYFLGTCKF RDCKHIDDPGCALREAVELGKIHPVRFDNYHRLISSREENKSQRHFMEQD IR >MS1906 unknown MRSKLIKIICLRSRICVSETIHTLEKQAKLAIITNGFTALQHLRLQRTGL AQYFQFITISQELGIAKPDARIFEHSLQQADIEDKSQVLMVGDNLHSDIL GGKNAGLDTCWLSYDKANDSDIAPTYSIKKFNELLDVVAA >MS1387 unknown MKQLENVRIFGGEQQVWQHQSATLNCTMNFAIFLPKQAKTEKLPVLYWLS GLTCTEQNFITKAGAQRYAAQHKVIIVAPDTSPRGDDVADNESYDLGKGA GFYLNATQQPWAKHYQMYDYIVNELPALIAEHFPVNGKQAISGHSMGGHG ALTIALKNPQRYSSVSAFAPIVAPTQVPWGQKAFQHYLGDNQTQWTQYDA TALVNAETRLPIRIDQGDKDSFLTEQLRPELFLDACRAHHVACEYYLRQG YDHSYYFIATFIGEHIAFHAKALYQDSEALPL >MS1268 unknown MKILITGATGLVGKALTRQLLKQSHQITALTRAVNTAQKLFPEVDWVSSL STYKNLDQFDAVVNLAGEPIFDKKWTDEQKLRLKNSRILLTQQLTQLINR GKRPPVFISGSASGFYGNAGSQLLTESALPATSFTAELCQAWEAAAQQAD TRVCVIRTGMVMSPRGGALARMLPLYRFGLAGKLGSGQQFMPWIALKDMV RGIIFLINNPNAVGAFNFSSPNPVTNKEFNRLLGSRLKRPHFFSVPACIL RLFLGERACLLLDSQNVYPKKLLDLGYTFQFEHLETYFSKTLKQKRKK >MS0512 unknown MIIGPFINAGAIVFGGLIGAALGGRVPERLRTNLTMLFGLCSMCMGIVMI AKVAQMPAMILALLLGTILGELILLEQGINKLASKTKTIVEKILPNNQKK GVSHEEFLQKFVGIVILFSFSGTGIFGSMNEGLTGDSSILIVKAFLDFFT AIIFGTTLGSTIATAAIPQTVLQIALAYSAVLIIPLITPEMRADFAAAGG MLMVATGFRICGILHFQVANMLPALFIIMPISAIWLQMMG >MS0081 unknown MTEAIKIINDDVKIVLAETIADYEKRTGKTLRPAHIERSIIQSYAYREQL VRQGINHAFLQTFPQFATGLALDLCGEPMGCYRLSDLPAEVTLRFSVEGD HDAVVIPEGTLVAATDNVVFATDTEVRISSTESYVDVVGICQITGAVGNG WQLGQVKTLKSTLDAKVTVSNIDVSDNGIDTESDDDYRKRILLAPEAFTT CGSVAAYEYHTRSVSQYIADVDIATPVGGTVQVTILTKQGLPSSILLNKV KDHISGEKLRPLCDTVVVSSPERVAYSVVANLDLLETVAESDVKVQAEAA LRAFISSRTQLLGADIVPLDIQAALKVAGVYNVTLASPTLTKLTKQQWAE CESITININGERQDG >MS0011 unknown MTKQIAVLIGSGSTTSFSKLVVSHLQKMAPASIQLNIVEIADLPLYDRDL DENSPAQYTRVREQIANADGVILVSPEHNGAISAMLKNAIDVVSRPMGQS KWFGKPAGIVTVAAGMAGGVRVADQLRTIASGSFIGMPVYQQNACVGGLF NGVFDQNGEITIDAVKQMLQQFIDGYAEFVAKF >MS1905 unknown MKYQWIFFDADETLFSFNAFAGLQKLFADNGLKFNEQDFTQYEKVNKPLW VKYQNAEISAEQIQTIRFEPWEQKLGKSAVEINQDYMLALADLCKRNHSH PGKTGKTGNYYKRLYRLATSSSAKNRFSTIFPVYYYFARTRHSQTGRPNL RA >MS1438 unknown MKNLKLSIATIAVASLLSACTSQYATEKHEQLKLQNQAALGIVWMQQSGE YQALAHQAFNTAKTAFDQAKKTKGKKKAVVVDLDETMMDNSAYAGWQVKN GEDFTQETWTKWVNARQTAAIPGAVEFANYVNNHGGTMFYVSNRLENGER QGTIDDMARLGFPGVSEKTLILKDGKSAKSARYKTITDQGYDIVVYVGDN LNDFGDATYRKPNAERRDFVAQNAKQFGTKYIVLPNPNYGDWEGGLDSNY YKGDVKNKVDIRLNSIKAWDGK >MS0296 unknown MSHLIVKEQKTIIRNAFFTFLYFTLAIGLAIGILYTDIFYLQNMIEEESL VEYTQSLSLTILTLMFSRHAYRSPQWRGGFVLITGFFLCMLIRESDALFD NLIRHGSWAYFAIITALVCIIYAFTHRQSTIDGLAQFAKQKEFHSFIIGL LTVLLASRLIGYGGLWRFILYNDYPHIVKNIIEETTELFGYLIMLFSCLS LTRHFK >MS0289 unknown MSDIAITISILSLAAVLGLWIGQWKIKGVGLGIGGVLFGGIIVSHFSEQN GLQLDAHTLHFVQEFGLILFVYTIGIQVGPGFFASLRKSGLRLNALATLI VALGSLIVVIINKAFDVPLDIILGIYSGGVTNTPSLGAGQQILTELGMQN ITQSMGMAYAVAYPFGICGILASMWLVRLIFRVKVDDEAKKFTQESGQQT ESLQKINIRVANPNLDGLCLRDIPGFDERGVVCTRLKREENISVPKADTT IFLNDVLHLVGDSHSLQRMCLIVGEKIELEPSKLVGNIPFRTGCGYQ >MS0092 unknown MAFHHGSETKRVNGGSVAVSTVDGAIIGIVGTAPMGAVNELTVCLTKKDF SQFGTILDQGFTLPDAFDILARYASGQVYVVNVLDPAKHRTTVTDEVLTQ DSDTLVATTAKKGLISVTNVKLGGSLLTEGETYSVNLESGEITLTVAAGE QDLTASYVYADPEKVTEDDIKGGVDSLTGKRQGFELLRDGFNLYGADAKI LICPEYDKTASCAAALATLADQMHAKAYVQLPKGTSLSKAIQGRGSLGTI NASASNENVRHFFPYALGSSNNLESLATHAAGLRMKVDVDEGYWFSTSNH ELSGVIGMEIPLTARVDDIQSETNRLNAVGITTIFNSFGTGFRLWGNRSS NYPTETHISCFEVASRTGDIIDESIRQAELQFIDKPIDDALIDSFIETID TFLRSQKSLVGYSVGLDYDYDLVDAFSQGQIPLIYDYTPKIPGERISNKS VMTRTYLANLVSQR >MS0397 unknown MIMKKFFLFATALLLASCSAQKPNLVSTQKPILNIAANLAQSIEANAGAH SAWVKNKSQQPIAFNYNLYWYDENGITQLFSTQQEKYQGALLLQPQQKAE INLTKPTAESVNYRLYLFSGNN >MS0357 unknown MKIQHTEDQQQGEFFILSETGEKVAKLTYFYQSPRVINANHTYVSDSLRG QGIADKLYQALIQLIKEKRLELIPSCSYIAKKWRRDHQKS >MS2113 unknown MKKDLIYRKRYLERVRPFIGKSLIKVFTGQRRVGKSYLLFQIMQEVQASD SQAHIIYINKEDLAFSHIKTAQDLAEFVLIEKKSGKKNYVFIDEIQEISE FETALRSLLLDDELDLYCTGSNAHLLSRDIAGSLSGRAIEINVHSLSYFE FLEFMRLEDSDKTMSQFLKYGGLPYLKDLPLQDNIVFEYLRNIYSTIAVR DIINRYALRNVQFLEQLTQFFASNIGNLFSAKKISDFLKSQRISANTVQV QNYAEYLANAFLIHKVPRYDIEGKRIFEIGEKYYFEDLGLRNALIGYRVQ DRGKLLENTIFNHLQIAGYDVKIGGLGTQEIDFVAEKDGERIYVQATLTI NEEKTLEREFGNLLKIQDNYPKYVVTMDEFDGNTFEGVECLSLREFLMLL MDSND >MS1426 unknown MMTRRTFLTASGLMASGLFLPKICKSETLLQRRRPMKIIAVEEHVLDADL GKASMPAALAQAPYLPDWGKTVQDGYNLDRSRPQIEQNALINPKGFDMGE GRLKEMDLAGIDMQVLSYGGFPQFALKEQSAALNRAANDRLAEAVAKHPD RFAGFATLPWGQPQEAVKELKRAVNELGLKGALLNGRPSEHFIDHSDYEP LLAAFHELNVPLYLHPGVPVQAVQQAYYGGFSPEI >MS1855 unknown MQQHKQIGRICTLLIRGSKTSHAKKLRKTMNITNPNGDRKAVVIFSGGQD STTCLLKAIADYGVENVEAVTFQYGQRHAIELEKAKWIAQDLGIKQTLID TSVIKTITANAMMDNIKITKDEAGMPNTFVDGRNALFLLYTAIYAKGQGI RDIITGVCETDFSGYPDCRDVFIKSMNVTLNLAMDYQFNIHTPLMYLTKA QTWQLADELGALNYVREHTHTCYLGVEGGCGSCPSCILRENGLQQYLASK Q >MS2172 unknown MKLKALTSALILATTLSGGIAMAKTQSATVAEMPAQTIQLTQEWDKVFPK SDKVEHRKVTFKNRYGITLVGDLYLPKNAQGKLQAIAVSGPFGAVKEQVS GLYAQTLAERGFVTIAFDGSYTGESAGLPRDLASPEINTEDFSAAADFLG SLENVDREKIGVLGVCGWGGFALNAAVGDPRIKVVATSTMYDMTRVMANG YNDSVDNDARYQMKQDLNNARWEAMSHDYANTGAPVLPSEKELNADTPKF VADYVNFYKTKRGFHPRSVGSNGSWTTTTPIAFINMPILQRAGELRAPAL IVHGENAHSRYFSEDAFKTLGSKDKELHIVKGASHTDLYDNQANKIPYDK FEQFFKANLK >MS0874 unknown MKLKYKLCIALFAWVSAFHVAAAPQTHAEVSNVTTELNDIQIRLKAQQSA DKGDWKTVYTLLLPLAQRGDSQAQVNLGILFSSGRGVEKNLEKAYWWFNE SAEQGNAKAVTYIGLMYLEGVGVKQDTKHAIRILEKAGRVDYPRAMLALG NAYYMEKNLQKSFLWFERAAMKGVSEAQFKLGMMYEKGEGTHKDEEQAVY WYQTSLKANDDIAEFAKERLSALGRLR >MS0573 unknown MGLFEAIFILFLLIVISAIISSSEISLAGARKIKLQSLANEGDTRAEKVL KLQEHPGRFITVVQIGLNMVAIFGGMIGESALRPYIQQTIHQYTNAPWVD GAASCASFVVVTAAFILLADLMPKRIAITYPEQVALRTVGVMSFCIVIFK PLVLLFDSVANGLFRLLKISTVRHDSMTSEDIVAVVDAGAEAGVLKAQEH YLIENIFDMQERTVTSTMTTRENIVFLNRTFDRQKVMETLTKDSHSKVLI CDNGLDRILGYVESHTLLTLYLREEQVSLTDQRILRKPLFIPDTLSLYEV LELFKSSGEDFAVIVNEYALVVGICTLNDVMSIVMGELVSSEEEQIVRRD EDSWLIDGATPLEDVMRALNIESFPDWENYETISGFMMYMLRKIPKKTDF VLYDKYKFEIIDTENFKIDQLMVSIRKDLNEQN >MS1716 unknown MSILYAENLAKSYKGRQVVSDVSFTVKSNEIVGLLGPNGAGKTTSFYMVV GLVRHDQGKIRIDDEDISLLPMHNRAQKGVGYLPQEASIFRRLSVYDNLM AVLEIRKDLTKEQRHARAEELIDEFNIGHIRDNLGQSLSGGERRRVEIAR ALAANPKFILLDEPFAGVDPISVIDIKKIIKDLRDRGLGVLITDHNVRET LDVCERAYIVSAGKMIATGTPTDILNDEHVKRVYLGEEFKL >MS0098 unknown MIKITLDDTQAVKKLQSVAAQLKAPRRLYALLGEELKKIHDDRFKTEKDP NGKPWTPLAAKTLARKRKRGKSLKILRQDGNLANKTAYNILDDGVEFGSP EVYAALHQFGGKAGKGRQVTIPARPWLGVNKENEYYLLKKAVSHLQKSLG KIK >MS1271 unknown MRQKIFLFVRSLIILYLILFIGEGIAKLIPIGIPGSIFGLLILFIGLTTQ IIKVDWVFFGASLLIRYMAVLFVPVSVGVMKYSDLLVSHASSLLIPNIVS TCVTLLVIGFLGDYLFSLNSFTRLRKKAIKKRDINNVNNKGEAS >MS1143 unknown MLIGLFIGLLFGFFLQRGQFCFVSGFRIIYTQRNFRFLTALLIAVSIQSI GFFSLSGLDLITIPNTPMPLLATLIGGLLFGIGMVLANCCASGGWFRTRE GAVGSWIALICFALTMAATQTGALKQWINPLLLETTTLDNIYNTFNLSPW ILVTVLVLITVVMIVYHIKHPRYQFPQEPTTALIPHRIFTKHWHPFTAAV WIGLLGVLAWLVSEQYGRSYGYGVAVPTANVVQYIVIGQQRYLNWGSYFV LGILLGSFIAAKLSGEFEIRLPEPKAILQRMLGGVIMGIGASLAGGCTIT NALVSTAYFSWQGWLATLMIMIGCWLTSVLVKPTQCRI >MS0372 unknown MIKGIQITKAANDNLLNSFWLLDSDKGEARCLAAKAEFAEDQIVAINELG QIEYRELAVDVAPTIKVEGGQHLNVNVLRRETLEDAVNNPDKYPQLTIRV SGYAVRFNSLTPEQQRDVITRTFTESL >MS1287 unknown MNNSYGTLYIVATPIGNLQDITQRALDIFTQVDLIAAEDTRHSGLLLSHY GIKKPFFALHDHNEQQKADALVEKLRQGTNIALISDAGTPLISDPGFHLV RKCRQTGLKVVPLPGACAAITALCASGIASDRFCFEGFLPAKSKARKDKL QNIAEEDRTLIFYESTHRILDTLEDIEAILGAERYIVLAREITKTWETIT GDTVANLRKWLAEDPNRTKGEMVLVIEGKAKSDDAEEISPQAIKALALLA KELPLKKAAAIVAELYGYKKNALYQYGLEYLD >MS2159 unknown MKKILVLTGSPHPNGASSRLADEFVKGAKEAGNDVFRFDAGLQPLGELHF LQLDASERTIADNDIVSREVLPKLIEADVVVFVSSLYYFGMNAQLKAVID RFYSINHELKDDKQSAVIMAGYGEGDDLKPMKDHFNIIQKYMRWQNIGTI VAEDSWNAAKLAKHLQEAYALGKSISA >MS0868 unknown MAGLTDKGTFMEVTIEITVILFTVAVIAGFIDSIAGGGGLITIPALLMTG MPPALALGTNKLQACGGSFSASWYFIRRRAVDLSAVWLILLMTFIGAVIG TILIQLVDASLIKKVIPFLVLAIGLYFLFTPKLGEQDARQRLSYGVYAFT AGVSIGFYDGFFGPGTGSILSLACVTLLGFNLAKATAHAKVFNFTSNFAS LIFFLIGGHILWSVGLVMLVGQFIGAHFGAKMVLSGGKKIIRPMVVIMSF IMTVKMAYDQGWFS >MS0387 unknown MIRFPRFNLRSSTLIAIVALYFTLVLNFAFYGKVLTQHPFTGKPEDYFLL TVPFFVFFTLNAVFQILAVPLLHKIIMPLLLIISAAIAYSQVFLDVYFTT DMLENVLQTTSAESTRMITWQYVLWIIGFGIIPAFLYLSVKINYHTWFKE LGIRLGAILVSAVVIFSISKFFYQDYAAFVRNNKPTVNLILPSNFITAGV NEIKRIHDANRPYEKIGLDAQQEKPDPYRHFTVIVVGETTRAQNWGLNGY QRQTTPKLAARGDDVINFNHVTSCGTATAVSVPCMFSYLTKDQYNGSKAE KMDNLLDVLQRAGVNIFWLDNNSDCKGVCLRVPNETVNMTLKDYCTEGEC LDEVLLRDFDKILNETTKDTVLILHTIGNHGPTYYERYTPEYKKFVPTCD TNQIQTCSNEQLVNTYDNSILYIDNFIDSVISKLENRDDLESAVYYVSDH GESLGENGMYLHGAPYAIAPEQQTRVPMVFWFSKTWKKNEGVDLNCVREK AKTREFSHDNLFSTVIGMMDMNLKTSVYQPEFDILASCKRH >MS1397 unknown MMKKSLFLTALSLAILTGCQNVGSQALQIEKQGSFTVGGSYVTHKGTFKQ ENFIAPEGQRAYGDFAYVKYQTPTNAKKYPLVFQHGGAQSSRTWESTVDG REGFDTLFLRKGYSTYLVDQPRSGKSNLSTKAITPDTPWASNPMYADKTF WILSRMGHYDSHNQPVANAQFPAGEAAYQAFQQAWTIGSGPLDNDLNADV LTQLVDQTKGAILVTHSMGGTIGWRTALRTDNVKAIVAWEPGGTPFIFPE NEMPKITKARFEALSGAAMGVPMNEFLKLTKIPIVLYYGDYIQVGSDNVG EDKWGTELAMAKQFVATINKHGGDATLVHLPEIGIKGNSHFLMGEKNNQQ LADLMADWLKQKELDK >MS0621 unknown MCQMLAMNCNTPTDIVFSFEGFRRRAGMTDSHSDGFGIAFFEGKGVRVFR DDQPGAVSPIADCVKQYHIKSLNVIAHIRKATQGVVNIENTHPFIREIWG ENWVFAHNGNLNALPDLSSCYCTPIGDTDSEAAFCYIAAKLKERFCRKPT ENEIFDTIKELAAELAQHGTFNFILSNGQWMIAHCSTNLHYLTRQAPFGV AQRIDDDGIIDFSNYAKDTDKVTIITTFPLTKDEIWAKMEHGGMVMFKDG VKIREAIGTPKEAVDDGTLGCTKIAA >MS1679 unknown MLAERRLELYFAENPPHFFDEMAKSAVILSPENFHNAKNLLGREFDQILF DGRTSLNLDALAIAAGTLRAGGRLLLWLDKNPHVDPDSLRWSGAEQAVET PNFYAHFNRLLQVYGCDNGIQAQNNQSVSTQKTNIASTATAEQQQIIRQI LQADSDIFILTAKRGRGKSALAGLLAKELRNSAQYHKKPFNVYLTAPNKS AVETLQLFAGEKITFIAPDELCRRIGQNARQFSQDWLLIDEAAMIPLELL FQLTSTFKHILCCTTIHSYEGTGRGFLLKFLPNLHRSFQQFELIRPLRWA ENDKLEKFIEELLMLEAEDRLIQPPYSIKSAVKIRQISQNELVEHITDFY GLLTLAHYRTSPLDLRRLFDAVKQHFLIAEWECYLLAGVWALEEGGFSDK ALIRAICRGERRPKGNLVAQSLAFNCNLPEACALKSLRISRIAVQPDWQG RGLGLQLVEKLAQTAQADFLSVSFGYNEELAHFWQKCGFILVNIGEYKEA TSGCYSAIALRPLTAAGEDLVKRAQQYFRRNLAFTFHPLHDKLSVEKSSA EKITQLNGQDFGILENFADYHRTFYSSQGAIYRLFIRLGADTSPHVG >MS1810 unknown MKNYSETIIIGAGAAGLFCAGQIGKAGKSVTVFDNGKKAGRKILMSGGGF CNFTNLEVLPSHYLSHNPHFVKSALARFTQWDFIAMVAAQGIAYHEKESG QLFCDNGAEDIVKMLEARCTENRVSIQLRQRIDLVEAVHNDENARFKIQS GGQTWYCKNLVIATGGLSMPALGASPFGYQIAEQFGLNVLSPRASLVPFT YRENDKFLTALSGISLPVRVTAQNGKSFSNNLLFTHRGVSGPAILQISNY WQPNESVEIDLLPTDSIEEYLSQLKASSPKLQLKTALSRILPKKLVELWF ERQLLQDETLANLSKVRLKNLENLIHHWQFQPNGTEGYRTAEVTMGGIDT KEISSKTMESQKVKGLYFIGEVLDVTGWLGGYNFQWAWSSAYACAVGITQ TE >MS2108 unknown MGYRVNSVLGTKFRIWATARLKDYLTKGYAINQQHLSQNAHELEQALALI QKTAKSSGLTLESVWWTLSAVIRKHFYCLQAAEKR >MS0083 unknown MQTHNFGATYQEGIVTEVDAAKHKVRCKIPALEDLETAWLPFLTPNAGGN QFYCLPDKDELVALLLDARGEGGCVLGAIYNDQDPTPVANAEIWCHKFKN GTEISHNRKTGDVVVNTKGHVTVTAGAGATINADTVVNGKLHATGKITSG EEVSAPKVKQGTVELGTHTHGSSPQPNK >MS1490 unknown MTRKYWLIIMKNKALVLDLDDTLYAEIDFLYSAYKHIASRLAPERSETLF NRLVELYHRGENAFQYLVEQYDVDLSTLLDWYRFHVPQIRLFPHVADQLN RLKEDFRFALITDGRSVTQRNKVKALGIEPLLDFIVISEEVGSEKPSLNN YRLVQDALHCRDYIYIGDNPKKDFVTPNKLGWKTICLKDRGTNIHRQDFE ILEEFRPHFYMSDWSELPTFLDF >MS1630 unknown MRYFIGSFTFLTANGIIFLTIFRIQPLFRLSALILILLLFSAFISVGLAL TYKLLKSFINSSILNRTLRAVYPIGMLILVGLSIYNAYTPKVIHYQIELD KPLKAMRIAVASDFHLGKLFGSEQIDKLARIIEREKADLVLLPGDIMDDN LNAYLAEQMSSHLAKLKAPLGVYATLGNHDFFGQQQAIADEINKTGIKVL WDEAVTINNEFVIVGRNDDLNKARPTTKRLLQNVDTNLPVFLMDHRPTEV TEHSALPIDVQVSGHTHNGQIFPANLIIKAMYRLGYGYEKIADGHFFVTS GYGFWGIPMRLGSQSEIFIIDVKGKN >MS0080 unknown MGNGKMAKLQYPAIIETDKKFTALADLGKRLNSLDKSQIMTSFTYLVPTA FLELLAEKWSVTGYDGWLLAESEDAKRKLIKRAVELHRYKGTPWAIREII RQLGFGEVEFLEGLFDKRRDGSFVRDGAYFHGDRSKWAHYRVILKTAITN EQAALLRKTLRVFAPARCVLASLDYRTVALQHNGKATRNGQYNRGTA >MS0995 unknown MNSQVKNMNRKLENIKFVITDVDGVLTDGQLHYDANGEAIKSFHVRDGLG VKMLMESGIPVAVLSGRDSAILRKRIADLGIKLAFLGKLEKESACYELMK EVGVTPEETAYIGDDSVDLPAFNVCGVAFAVADAPDYVKDCADYVLDLRG GKGAFREMSDMILKAQGKTDVYSSAKGFLKIVTNMAQ >MS0734 unknown MTTPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGHANISG YDFIVIDTGGIDGTEEGVEEKMAEQSLLAIEEADVVLFLVDARAGLTPAD IGIAQYLRQRQNKITVVVANKTDGIDADSHCAEFYQLGLGEIAQIAASQG RGVTQLMEDVLAPLAEKMKTDESAVENDENSEQEKDEWEHEFDFNSEEDA ELLDEALAEENEEPENKNIKIAIVGRPNVGKSTLTNRILGEDRVVVYDLP GTTRDSVYIPMERDGQQYTIIDTAGVRKRGKVHLSVEKFSVIKTLQAIQD ANVVLLTVDAREGISDQDLSLLGFILNAGRSLVIVVNKWDGLSQYTKDQV KSELDRRLDFIDFARVHFISALHGSGVGNLFDSVQEAYACATKKMTTSML TRLLQMATDEHQPPMINGRRIKLKFAHPGGYNPPIIVIHGNKIDKLPDSY KRYLSNYYRRSLKIVGSPIRLQFQEGSNPFAGKRNKLTPNQLRKRKRLMK FIKKSKR >MS0802 unknown MSLEKRFELIERGSTVRQEIIAGLTTFLAMVYSIIVVPGMLSKAGFPAES VFIATCLVSGLGSILIGFWANAPMAIGCAISLTAFTAFSLVLGQQVSIPV ALGAVFLMGAVFTLISATGIRAWILRNLPASIAQGAGIGIGLFLLLIAAN GVGAVVSNQAGLPVKFGEFTSFPVMMSLIGLAFIIGLEKLQIKGAILWVI IAITIVGLIFDPNVTFGGEVFKMPSFGEQSLFAALDIQGALQPAILPVVF ALVMTAVFDATGTIRAVAGQANLLDKDGQIINGGKALTADSVSSLFSGLF GTAPAAVYIESAAGTAAGGKTGITAIVVGVLFLLMLFFQPLATLVPGYAT APALMYVGLLMLSNVSKLDFDDFVGAMSGLICAVFIVLTANIVTGIMLGF AALVIGRIVSGEMKKLNVGTVLIALALVAFYAFGWAI >MS0122 unknown MAKKAVRIKAETHEINLQTQDDVALAIKEIGDLEREQVRLSTLQADEKAA IDEKYTAELTALKDKVKPLQKAVQAYCESRRNELTNGGKQKTAYFTTGEV QWRAKPPAVIARGIDVILESLRNSGLFRFIRTKEELNKEAMLAEPDIARS IDGVTIREGVEEFVIKPNDEEVRT >MS0330 unknown MSEKIQSVDYDDLRDLVASNDEGGRNPAGFPKKLIVGTAILWSVFQLYYT SPFPFWLQEVLTQNNIDLNVVVDDTKARSVHLAFALFLAYLSFPALATSP KHRIPIIDWICATAGAFLGAYYLFFYQSLVTRFGAPNLQDIIAGCIGIVL LLEATRRSLGLPLAVIAVIFLLYNFFGQYLPTSWIISHRSGSLSQIINQQ WITTEGVFGVALGVSTKYVFLFVLFGALLDKAGAGNYFIKTAFAYLGHLS GGPAKAAVVSSALTGLVSGSSIANVVTTGTFTIPMMKRVGFTQEKAGAVE VASSVNGQLMPPVMGAAAFLMIEYINMPYNELILHAFLPALISYIALVYI VHLEACKMGLKGLPRTDPAKPFLVTLIRAIGTFLTLCIIYFVLELTLGWL KTAVPNEAFLIVCLLLLIVYILLIRRVASFPDLEPDDPNAKIVVLPATKP TVNAGLHYLLPVVVLMWCLMIERMSPGLSAFWGILALSAIIITQRPLLSL FRKENTDKFIQLKEGVQELIKGLETGARNMIGIGIATATAGIIVGVVSLT GFGVQLSGIIEILSMGNVLLMLILVAIFSLILGMGLPTTANYIVVSSLMA LVIVEVGKQNGLIVPMIAVHLFVFYFGIMADVTPPVGLASFAAAAISGGS PIKTGATAFYYSLRTAILPFLFIFNTDLLLLDVGWAKGILVFITATIGVM AFTAATMGYFFTKNKKWEGFALILAAFMLFRPGFFMEYVSPTERHIEPAQ LVQEIENAAAGQNLTIKVAGLNPYGKEIEFYSKLSIPAGENGEEKLKAMG LTLLNTGEKIQINGNETDKILIDNVEIDSPAAKAGLNWDQTIIDVEVPKN SLPKELMFIPALLLVSALAWNQRRRRNS >MS0701 unknown MLTNEVVISILVLLILSLLRINVVIALVISALTAGLVGGLGITKTIETFT GGLGGGAEVAMNYAILGAFAVAISKSGITDLLAYKVIKRLGNRPTGSSIA GFKYFILAVLVAFSISSQNLLPVHIAFIPIVVPPLLSIFNKLKLDRRAVA CVLTFGLTATYMLLPVGFGKIFIESILVKNINEVGAALGLQTSVAQVSMA MSIPVLGMILGLCTAIFISYRKPREYIVKIAEPTTAEIEQHIANIKPFHV MASIVAVLVTFGLQLFTSSTIIGGLAGLIIFAVCGIFKLKESNDIFQQGL RLMAMIGFVMIAASGFANVINSTGGVTELVNSFSQSVGADNKGIAAFLML VIGLFITMGIGSSFSTVPIITSIYVPLCLTLGFSPLATVAIVGVAAALGD AGSPASDSTLGPTSGLNMDGRHDHIWDSVVPTFLHFNIPLLVFGWFAAMT L >MS2120 unknown MQTKPFGKHPEGQRLARIEQSVHYKAGKFVNHLPTEVQTSDKPLWKIWYD FLFQQIDHLTPNRPLPVVKTDLQQLSREKNFIVWFGHSSYLIQLDGKRFL VDPVLVSGSPLSFANKMFQGTNLYQPQDMPDFDYLVITHDHWDHLDYEAV IQLKNKMKEKVITSLGVGAHLEYWGYPAERIIEMDWNEKTELENHFKITA LPARHFSGRGVVRNKTLWSSFMLEVPGETIYLGGDSGYDPIYQEIGQRFN ISLALMENGQYNKDWANIHIQPEQLTLAVKALRPKRLMTVHNAKFALARH DWRAPLEQIYRNAQKENFNLFTPKIGDVFYFSEQGEADSPNFREPWWQSV E >MS1548 unknown MMNTLDRYIGKSILGAIFATLLTLVGLSGIIKFVEQFRSVGKGSYDSMQA FLYTVLTMPKDIETFFPMAALLGALIALGNLASRSELVVMQSAGFSRMKI GFAVMKTALPLVLLTMVIGEWGIPQTEQFARDMRSKAISGGSMLSVKNGI WAKDGNDFIYIKRATEDANLNNIYIYSFNDNRQLQRVSHANKASYENGSW VLKQVNESQISADEIKTKNYLNRPWKTSLTPDKLGIFTVKPTSLSISGLS SYISFLKETGQDSKKFELTYWRKLFQPISVGVMMMLALSFIFGPLRSVTA GARIVTGICFGFVFYVINEIFGPLSLVYNVAPIIGALMPSLLFLVITWWL LSRKRD >MS1376 unknown MKVTSSAIKNGAFEDKYGKRGSQFTPNGMPSYSIPFEITGAPEGTKSFAV VLEDKDAVTASGFVWIHWLIANLERTSVLENESQTATDFIQGANSWSSVL AKLDITEASAYGGMAPPNCLHRYELFVYALDTKLDLQPGFKFNELHFAMQ GHILAKAEIMGTYDV >MS2382 unknown MRMTKSNSTRETFSGRRAFIFAAIGSAVGLGNIWRFPYTTYENGGGAFII PYLIALLTAGIPLLFLDYAIGHRHRGGAPLSYRRFSKHFEAFGWWQVMVN VIIGLYYAVVLGWAATYTYFSFTMAWGDKPIDFFIGEFLKMGDITQGVSL EFVGMVVGPLIAVWLVALGVLALGVQKGIARTSSILMPVLVIMFLILVIS SLFLPGAAKGLDALFTPDWSKLSNPSVWIAAYGQIFFSLSICFGIMITYA SYLKKEFDLTGSGLVVGFANSSFELLAGIGVFAALGFMAAASGHEVSEVA KGGIGLAFFAFPTIINEAPFGQILGVLFFGSLTFAALTSFISVIEVIISA VQDKLRIRRAKVTFIVGVPMMIVSTLLFGTTTGLPVLDVMDKFVNYFGIV AVAFVSLIAIVANEKLGLLGDHLNETSSFKVGFIWRLCIVITTGILAFML FSEGAKVFAEGYEGYPSWFVNSFGWGMAVMLVIVAVLLSRLKWKNEVQVS GE >MS1215 unknown MDRFSRMWRKLQKSAVCFLLIFVTVNVWAADFPGSPNPFRYVNDYTNTLS ENDKNYLENKLINFSRETSSQIAVVMVKTTGEYAISDYAFTLGDNWGIGR KQLNNGVLLLVAKEDRKVFIATGQGLEGALPDAFLSQIIRRVILPNFRQE QYASGINGALDYIIAASKGEYDAAAEQNDEGFEQYIPFLMVLVFVLFVLF GELNGRRKPYISPTTNHQLEQVILQSARRRRGNSGGFGSGGFGGFGGGGS SGGGFGGGGFGGGGAGGSW >MS0288 unknown MSVKKSNLSRPNWLAISRSERVVGTNEKVLGKRIRTLGIHQRYGIMISRL NRAGVELVPTADSILQFGDVLHMVGNVETMDAAISIIGNAKQKLQQVQML PVFIGICLGVLLGSLPIHIPGFPVALKLGLAGGPLVVALILARIGSIGKL YWFMPPSANLALREIGIVLFLTVVGLKSGGNFVNTLTQGDGVTWMGYGVL ITFVPLMAVGIIARIYAKMNYLSICGLLAGSMTDPPALAFANAIKEENGA AALSYATVYPLVMFLRIISPQLLAILLWVA >MS0082 unknown MGAMNTQIQSTHWQLAPETDGVSVVSGVDDIHLCIANILSTQKGTDILRP EFGSDHFKFIDYPEDVAVPNFVREITQALQKWENRIVIDEVLVDGEAPHF TFTVSWSLTDDVYREIYRTQVQQ >MS1911 unknown MSITVNQIVLHQIIKPASANIPANNNNETENGETATQNTQLETVLRQELL PITAEAEQFMLELHQAYQNKTKGYGVFQEQSRFAQSLNRLLERETDFLPF SYEAAKLLSSELAKYAFAESGTFVLCRYNFLATDYLFIALLDSKASVLVD EKLEIHRTQYLNINQFDIAARINLTDLRVNANSNRYLTFIKGRVGRKIGD FFMDFLGADEGLNPQVQNQCLLQAVSDYCQKGELSAEQSQAVKKQVFDYC KGQINAGDEIELTELSETIPTLNQQPFADFAAEQDYGLENNIPPVRSALK SLTKFSGSGKGVTISFDAELLDKRIYWDDMQDTLTIHGLPANLKDQLQRL LKNHN >MS1402 unknown MRYNSSFQITLSRQMNKPNITIQPIQASHYADYVALIGKQLGEGYFKQAD FEALANNPQAICFEAVDEQNQVVGVITSVTLDRESALALLKIQAQNTPDY VLQSDRIGIFKTIAIDENRKGCGIGSALVRKLLESFKQAGLNSIACVAWQ YGETENIRGIMQAFDFTCYEKIANYWLDDPEPFICPACGEPPCRCQANIY FRQI >MS2135 unknown MLQLRKSNERGHANHGWLDSYHTFSFADYFDRNHMHFSDLRVINEDFIQP TMGFGTHPHKDMEILTYVLQGAIAHKDSMGNVKTFTAGEFQIMSAGTGIY HSEFNPSESELLHLLQIWIMPNELGVSPRYDQKQFADKEGATLILSPDAE GESFKVYQDMKLWRHQYKAHQKVELGLNSRRNYWLQVVKGNLTVNDIALA TSDALGISAEELATIETSDEVEFLLFDLR >MS1114 unknown MNKVSLLTLLIGGALAVQYANGSPIDERRENIIKYSRLGDGQLVEGTKQL IDLYNKTKDKKVRDDLITLLVRQNRDAEALSISETYKLTDFSSNELEYLA RAARNERQFSKSLAFYNQLNNLDTKNPNGLLGLALVSTDMAKFEQSKLYL SRYKHRFGTDEQYNQANAYFLDSSEPLITRFHRWNSELDTNPNDIELVKK LYRLAAQLNISPVQEQLIAKYPEVFTDNDKSWLLHDQAVRISKNSPNKQQ LNTAYSMLDKVYIKVPEDNSLKQQSLQDMVVVGSKLKNDDSNRAKNSYEL LTESNQPIPNYVKEAYADYLVASGSPFAALSLYKEVEQSHLAEGGEVPFT LGIKIVQALNDAAKYPEARDYLENNIGEPSLMVLDFTRSRKIENPDYGNY FSTKVSSLVAQGDLSSAMQLIDERLSVTPGDGWIMLTKAELEAARARTDD AADWVHKAQAFLPEDTAWAEVAQANLALSVNDWRTASRLVNTWTTEEKDN ANWFMEQYDQAKSARLVASGGISHRTSPAGENESNQEYYLYSPKTDDGHD VYIHYLTTKSPDDGLPFEQQRVGAGVEANFYPFMVNAEAGKGIKLNDKAY FAATIQYQLNQHWQFSLNGGLNSANTPIKAIYQDTYAKDLGFSVNYKYSD RFEAGAGITAMKFDDENLRKNLSFWSNFNLFKHNRWNLNGSLYGSYERNK AIPGAYYYNPLKSRSLEDNFDLSYYQPFDHSITLTHHFKAGGGYYWQDSF ASSKTWSVAYGQEWRLGKKLNISYDVGRKRSIYDGSPEFNNFINLTLSVS F >MS1552 abgB, AbgB protein MELTQQQLVQWRREFHRFPETGWAEFWTTSRIADYLEQMGFEILLGNQII NRDFVRGRQQAVVEKGLANAVAYGAKQKWLEKMDGYTGCVAVLDSGKPGK TLALRFDIDCVNVMETKAPEHIPNKEDFASLNDGFMHACGHDGHITIGLG TALWLSQNKDKLSGKVKIVFQPAEEGVRGAAAIAASGVIDDADYFSASHI GFCADSGTVISNPKNFLSTTKIDIRYQGKPAHAGAAPHLGRNALLAAAHA VTQLHGISRHGEGMTRINVGVLKAGEGRNVIPSKAEIQLEVRGENKAVNQ YMVDQVMRIANGIAVSFDVEYETEIMGEAVDMINDTELVGLVEEIVLAHP KVHSANANYAFNASEDATVLGRRVQEQGGKAIYFVLGADRTAGHHEAEFD FDEDQLMNGVNIYTALVQRLLG >MS2075 ara1, ARA1 protein MLTFVKQGLELGVDTLDHAACYGAFTSEAEFGRALALDKSLRAQLTLVTK CGILYPNEELPDIKSHHYDNSYRHIMWSAQRSIEKLQCDYLDVLLIHRLS PCADPEQIARAFDELYQTGKVRYFGVSNYTPAKFAMLQSYVNQPLITNQI EISPLHRQAFDDGTLDFLLEKRIQPMAWSPLAGGRLFNQDENSRAVQKTL LEIGETKGETRLDTLAYAWLLAHPAKIMPVMGSGKIERVKSAADALRISF TEEEWIKVYVAAQGRDIP >MS1209 ara1, ARA1 protein MQWLKYKHRCKYSLGGNKMQTFKLNNGVEIPVLGFGVFQIPPEETEQAVI SAIHAGYRHIDTAQAYMNETETGAGIRNSGVVREEIFVTSKVWIENYGYE AAKASLDRTLARLDIGYIDLMLLHQPFNDVYGAWRALEEYLAAGKIRAIG LSNFTADRVLDVGLYNKVMPAVNQIEINPFHQQQAQVEGLLSEGIVPEAW GPFAEGKFGIFENPVLAKIGQKYGKSIAQVVTRWLVQRGVVVLAKSTRPE RMAENLNVFDFELDADDFAQIAALDVGKSQIISHTDLAMVRQFKEWVFNV >MS0687 ara1, ARA1 protein MKKITLKNGDKLTLLGMGTWFIGDNAHYRQEEIAALRYGIEHGINLIDTA EMYGNGRAERLIGEAIAPYDRNSLYLISKVLPNNANKRKMEQACNNSLKA LNTDYLDMYLYHWRGTTPLAETVECLEALKNKGKIKAWGVSNFDLEDMQE LLALPNGNQCQLNEVLFHLGSRGIEYALKPYQDKLAIPTVAYCPLAQAGS LQRNLLRHPEVTTIAEELNCTPYQLLLLFVLAQPNMIAIPKAGQVRHMKE NIACLDMQLTQQQLARLNNAFPSPTHRIHLDIV >MS0518 ccmC, CcmC protein MVTGLRIMSFALFSALFYIISILFIAPMLAKAQSGEQIQRPNKNWFILTA LFAVICHFISLFPFFSNLFSGENFTLMEIGSLISVLIAILATVAIALKIK TFWFLLPIIYCFATINVTLAAFAPSHVIQNLAQDLGLLLHILLAMFAYAV CFIAMLQSIQLAWLDRKLKTKQMVISPLLPPLMMVERHFFRVMLSGEILL TLTLLTGAVYLADFFGNENIQKAIFSFLAWIVYAVLLIGHWKYRWRGKKM IIYTISGMILLTIAYFGSRAMLGMN >MS2235 cof, Cof protein MTIPNLRDKIKIVFFDIDETLIMKFEDILPDSVLPVIRKLKQNGIIPAIA TGRSRCSLPTKIKALIAEEPIELFVTMNGQFSVFQNKVIEKHPIPTEKVQ HLVDFFDAQQIDYAFVSDNNVAVSKITAKQKSALDPILTDYIVDKDYFKH NEVFQLLPFYDQSQDELVKNANILDGLRVVRWDKDSVDLFDAEGSKARGI ASAIKRLGFEMENVMAFGDGLNDLEMLSTVGVGVAMGNARDELKKVADFV TDRIEDHGIYNFLVKAGLIED >MS2344 cof, Cof protein MQYKAIFSDIDGTLLNSRHQISSKTESVIKLAVSKGIPFIPVSARPPYAI TPYTEQLQTNQGIICYSGALILDKNLRELYSVQIDQADLAALNQILADYP YLSINHYAALDWFSNDLDNYWTKQEADITGLFPKQTPSNLTKVHKILVMG EADKIKPLEQKLKQKLPHLSIHLSKPEYIEIMNKAATKAKAIGFMERHLH VSADEVIAFGDNFNDLDMLEYAGLSVAMGNAPDEIKQVAKKVTASNDEDG IALVLNEIFNL >MS2225 cof, Cof protein MKQLPFRAIVSDMDGTLLNANHVVGDFTINTLEKLAQKGVDIVMATGRGY TDVASTLSKMKIKNAAMITSNGAQIHDLQGNRLYSNYLPEDVAFEVMQLP FDADRVCMNTYQNNDWFINIDLPQLRKYHQTSGFMYEVVDFKKHHGRDTE KVFFIGKKPADLMEIEQELTTRFGNYATITYSTPVCLEVMNKNVSKATAL AHLIEQREYSLSDCIAFGDGMNDIEMLTEVGKGCIMQNADPRLLQLLPDN ERIGLNKDESVASYVRAVFGIY >MS0842 cof, Cof protein MAYQVLAFDLDGTLLNSQGIILPSSKKAIEAARAKGMQVILVTGRHHTAV KPYYYELNLETPIVCCNGTYLYQPQTDEVLRSNPFSKTQALQLIDIAERQ KIHILMYSRNAMNYMELNPHMEKFQKWVQSCPQNVRPDVRQVSSFRDIVN NEDIIWKFVMSAPNRELMQQTVNMLPQDQFSCEWSWIDRVDISNKGNTKG SRLLEYLRSVNMNPEQVVAFGDNQNDLSMLTSVGLGVAMGNADEIVKQQA KCIIGTNNENSIADFIEGLK >MS0931 comEC, ComEC protein MMKLDLFLFCFIVNTLCLLVLPESFLLDFPLFLHFLFPLVIAAFIYWFKY RRLWRGFYYLFCGLIAVFYIHFQALSLFRAADGVKYLPAKVQTDFVIDEI LYQRDYRNIIVKAQLAPEFKPQRIYVNWQADQAVKTGEKWRGELHLRAVS SRLNYGGFDKQKWYYAQGITAWAKVKSAVKISEDLSLRQQLFNHYLAQTE RLRQQGLLMALAFGERAWLQEDVWQIYRKTNTAHLIAISGLHIGLAMLLG MGVARLIQFCLPTRYISPYFPMLSGLVFAAVYAGLAGFAIPTLRALIALV IVSLLKLLRGYCNVWQLFLRVIGVLFIFDPLMVLSNSFWLSVCAVFSLIL WYQIFPLNLLEWKGKSVTDGKFAWLFGLIHLQLGLFCLFSPMQLMTFQGI SLAGFWANLIIVPLFSFLLVPVILFALFSNGAWESWRIADWLAQWFTHLL SYFQDYWIGVSNQTSWLICCLLCLLLLTVVHFIYPLKKQIPEKNELLTQF KTKKISLKSDRTLSPVLRKYLVSVATLFLASGAMLWLYQQWRQPDWRFET LDVGQGLANLIVKDGRAVLYDTGAGWKNGSMAQSEIIPYLQRQGLILEKV ILSHDDNDHSGGIADILQAYPSINILQPSMVNYEKTEQNSFNFDRTFCKQ GLNWQWHGLNFQVLAPAKIAERANNTDSCVLLIDDGQYKLLLTGDADLAA EQQFVAHLGKVNVLQVGHHGSRTSTGEALIKQIKPDFALISAGRWNQWGF PHPVVTQRLKRHKSAVYNTAFSGQISFEFYPNKIEVKTARSNYQPWFRQI VGGERD >MS2234 comFC, ComFC protein MNWFAFRCIYCQRKLAIGSHGLCCSCNKQIRRFNYCGVCGSELAENTLGC GNCLQNRPAWHRMVIIGAYKMPLSSLIHRFKFQNSFYFDRTLARLLYLAI RDARRTHGLMLPEVIIPVPLHHFRHWRRGYNQADLLAGQLAKWLNIPCNN RLIKRVKHTRTQRGLSAAARRVNLQKAFRFADKKQACPYKSVALVDDVIT TGSTLNALAGLFVQQGVEQIQVWGLARA >MS1002 cvpA, CvpA protein MIDYIIIGIIVFSIVVSLLRGFVREVMSLASWVVAFVIASQFYPYLANFL TQIESEYLRNGTAIGILFILTLIVGAIVNYVIGQLVDKTGLSGTDRVLGA CFGFLRGVLIVSALLFFVDTFTNFDQNDMWKESKLIPHFGFVVEWFFEQL QANSSFLNSTLNK >MS2216 dcuB, DcuB protein MSAMFLIQFAIVLLCILMGARAGGIGLGVFGGLGLAILSFGFGLKPAGLP IDVMFMIMAVVSAAAAMQAAGGLDYMIKIATNILRRNPKYITFMAPAVTW LFTFLAGTGHVAYSVLPVIAEVARHNGVRPERPLSMAVIASQFAIVASPI AAAVVAVVAYLEPQGITLANVLSVTIPATLLGIFLACVFVNKIGVELKDD PEYQRRLQDPEYVKANHADVNMDEIQLKPTAKLSVGLFLLGALLVVVMGA LPELRPSFDGKPMGMAHTIEIVMLTIGALIIFTCKPDGTEITRGSVFHAG MRAVIAIFGIAWLGDTLMQAHMDEVKGMVSGLVETAPWAFALALFILSIL VNSQGATVATLFPLGIALGIPAPILIGVFVAVNGYFFIPNYGPIIASIDF DTTGTTRIGKFIFNHSFMLPGLLSMAFSLGFGLLFANMFL >MS1877 dcuB, DcuB protein MLYLEFLFLLLMLYTGSRFGGIGLGVISGIGLVIEVFILRMPLGKAPIDV MLVILAVVTCASILEAAGGLKYMLQIAERVLRSNPKRVTILAPMVTYVMT FMLGTGHSVYSVMPIIGDIALKNKIRPERPMAVSSVASQLAITSSPLSAA IAYYLTQITKMPGYEHITLLNIISVTVPATFVGTMAMALYSLRRGKELED DPEYQRRLKDPTWRDRILNTTATSLDAELPRSAKMAVWLFVLSLVTVVVI AMLPEIRTVGVPVDGKPVKAISMSFIIQMMMLCFGGIILIATKTNPQSVP NGVVFKSGMVACIAIYGIAWMSDTYFSYAMPEFKAAVTTMVESYPWTFAF ALFAVSVVINSQAATAVMMLPVGISLGLPAPVLVGLIPATYAYFFIPNYP SDIATVNFDVTGTTKIGKYYFNHSFMIPGLIGVTTACLVGYALAHMIIV >MS0689 dgoA, DgoA protein MSTPVITEMQVIPVAGHDSMLLNLSGAHSPYFTRNIVILKDNSGNTGIGE VPGGEKIRQTLEDAKPLVIGKTLGEYKNVMNTVRQTFNDRDAGGRGLQTF DLRTTIHVVTAVEAAMLDLLGQHLGVTVASLLGDGQQRDAVEMLGYLFFV GDRKKTNLAYQSQENDLCDWYRVRHEEAMTPESVVRLAEAAYEKYGFNDF KLKGGVLDGFEEAEAVTALAKRFPQARITLDPNGAWSLDEAIKIGKQLKG VLAYAEDPCGAEQGYSGREIMAEFRRATGLPTATNMIATDWRQMGHTISL QSVDIPLADPHFWTMQGSVRVAQMCNEWGLTWGSHSNNHFDISLAMFTHV AAAAPGDITAIDTHWIWQEGNQRLTKEPLQIKGGLVEVPKKPGLGIEIDM DQVMKANELYKSMGLGARDDAMAMQFLIPGWTFDNKRPCLVR >MS1568 dltE, DltE protein MAILITGASAGFGKAACITLVKAGYKVIGAARRLEKLTELKQQLGENFYP LQMDVSQTAEIDSALASLPADWAEIELLVNNAGLALGLEPAYKVNFDDWL TMINTNIIGLTYLTRQILPQMVERNKGHIINLGSIAGTYPYPGGNVYGAT KAFVKQFSLNLRADLAGTAVRVSNIEPGLCGGTEFSNVRFKGDDEKAANV YKNTLSIQPEDIANTILWIYQQPAHVNINRIEIMPISQSSGALNVVRE >MS2336 dmsC, DmsC protein MATIIVIYQGFGLSQIHSSAQQAVALVPDFAVNQVIRLCLLAAAGMVLLK SKQPLLLSIAVILALFAEMIGRELFYSLHMTVGMA >MS0266 elaA, ElaA protein MLFMRIHPMNWQCKTFNQLSNIELYQILQLRSDVFVIEQQCIYRDMDNKD LLASHLFLSKDNQIVAYCRLLPKGVSVADAAIGRVIIHEKYRGRHLAHKM MGKAIDIIIHEWHENKIYVQAQEYLQGFYQSLGFKATSDVYLEDEIPHLD MYWES >MS0367 era, Era protein MTETKPENIVQHNETTAAEQETYCGFVAIVGRPNVGKSTLLNKILGQKIS ITSRKAQTTRHRIVGIHTEGPYQAIYVDTPGLHIEEKRAINRLMNRAASS AISDVDLIIFVVDGIHWNADDEMVLNKLRASKAPVVLAINKIDNIKNKDE LLPFITELSGKFNFKEIIPISAQRGNNVHNLQKVVRQSLRKGVHHFPEDY VTDRSQRFMASEIIREKLMRFMGEELPYSVTVEIEQFKMNERGTYEINGL ILVEREGQKKMVIGQGGQKIKTVGIEARADMERLFDNKVHLELWVKVKSG WADDERALRSLGYMEEY >MS1874 fabG, FabG protein MQGKIALVTGATRGIGRAIAEELATKGAFVIGTATLEKGAESISAYLGEK GKGFVLNVADQESIESVLEQIKKEFGDIDILVNNAGITRDNLLMRMKDDE WFDIIQTNLTSVYRLSKAMLRTMMKKRFGRIITIGSVVGSSGNPGQSNYC AAKAGLIGFSKGLAKEVASRGITVNVVAPGFIATDMTEVLTEEQKAGILA NVPAGHLGEPKDIAKAVAFLASEDAGYITGTTLHVNGGLYMA >MS0543 fabG, FabG protein MITMPFNFWYMEKDKMTLAKKHNFKDKVVVITGAGGVLCAYFAKEIAKTG AKVALLDINLESAQKFADEINAQGYIAKAYKTNVLELDSIKQTRDAIAAD FGTCDILINGAGGNNPKATTDNEFHELDLPPTTKSFFDLDKSGIEFVFNL NYLGTLLPTQVFAKDMVGKKGANIINISSMNAYTPLTKIPAYSGAKAAIS NFTQWLAVHFSHVGIRCNAIAPGFLVSNQNRALLFDEQDNPTARAHKILT NTPMGRFGEAKELMGGILFLMDEEYASFINGVVLPIDGGFSAYSGV >MS2145 fabG, FabG protein MQRFEQKTALVTGAGTGIGQAIAVRLAQEGAKVLVVGRTEKTLQETTALH PNIAYAVADIEKDDDVQKIVQQLNQKYGGLDILINNAGWAPVTPISQVKI EEYDKVFGINVRALVNLTLQCLPMLKARKGNIINMSSAICRNHLPNMSMY AGTKAAVEIFTKIWAKELGADGVRVNSISVGPIETPIYDKTDLSNDGIQD HIDRIRKTIPLGAFGKSEDVANVTAFLASDEARFITGSDYSVDGGFGA >MS2144 fabG, FabG protein MNNMKKLLILVGAGKGLGNAIAKEFASHDFRVALIARNAENLTAYRQEFQ ALGYEVMTQVADALYPETLTKAINAIQAEWGTCDALVYNVGITELDNDRP ITNELLMQRYQIDAASAYHCAMLVATPEFAAKQGAIIFTGGGFAKTFQPI LALKPLCIDKAALNAMNIVLHHLLAPQGIFVGSVLVSNVIQPNDPKYAPD VIAKAYWKMYCERDEFELLY >MS0563 fabG, FabG protein MVNTLLIHFLHRRIYMNLFDLTGKVALVTGCNTGLGQGMALGLAQAGCDI VGVNLVEPLDTKEKIEALGRKFVNIEANLMKQEGLTDVVEKAVSVFGKID ILVNNAGIIRREDAIDFSEQNWDDVININLKTVFFLSQLVAKQFIAQGHG GKIINVASMLSFQGGIRVPSYTASKSAIMGITRAMANEWAKYNINVNAVA PGYMATDNTAALRADEARSKEILDRIPAGRWGTPNDLVGPCVFLASAAGD YVNGYTVAVDGGWLAR >MS0955 fabG, FabG protein MIKIIFLKCNFHLNEEQKMSELFSLKNKRILITGSTRGIGNLLANGLAEH GAEIIIHGTRLETAEKIAADFNTKGFKAYAVAFDVTDSKAAQDTIDYIEK EIGPIDVLINNAGIQRRYPFCEFPEKDYDDVISVNQKAVFIISQAVARYM VKRQRGKIINIGSMQSELGRDTITPYAASKGAVKMLTRGMCVELARYNIQ VNGIAPGYFATELTKPLVENQEFTSWLCKRTPAGRWGDPKELIGAAVFLS SKASDFVNGHLLFVDGGMLAAV >MS1412 fabG, FabG protein MSILEKMKLTGKTAFVTGGARGIGKSVAIAFAQAGANVVIADFDIAEAEK TAAEIAKEEGVKSIAVQTDVTDQASVNHLMDVIKQQFGKLDIAFCNAGIC INVPAEEMSYEQWLKVINVNLNGVFLTAQAAGKLMIEQGTGGSIINTASM SAHIVNVPQPQCAYNASKAGVIQLTKSLAIEWAKHNIRVNSLSPGYIGTE LTLNSKDLQPLIKEWNAMAPLHRLGKPEELQSICVYLAGDTSSFTTGADF IVDGAFTCF >MS2175 fabG, FabG protein MFKKIILTLFSGLIFTEVTMAQTKYGVGSYNTEEVAAEMEYIEKHIRPLN PKPTKRIFITGSSAGIGELTAKMLLAKGYEVVAHARDAKRAADVKRDLPE IKHVVIGDLAKPDEVDKIADQVNALGRFDVIIHNAGVYRGENIFQINLLA PYVLTAKITQPQTLIYVSSNMHNGGELRLDAFNAGNVGYSDSKLQLLTLA KSLAVRWSKVRVNAMHPGWVGTKMSGGSAPDPLRQAYETLVWLAEGTDPA AQTSGGYFFNKQPDSHYRRDSEDSAQQAVLWQALEKITGVKLPE >MS1421 fabG, FabG protein MKLQNKVALVTGGGTGIGRAIAKQMAEAGATVIIIGRREAQLQESARQHA NIHYIVADVLNSDDITRTLNEIQQRFGKLDVVVNNAGIAPVTPIENVNLA DFDRTFALNVRAVIDVTSQAIPYLKSTQGNIINITSGLVNNPMPMNSIYT ASKAAVLSMTRTWAKELAPYGIRVNSVAAGATKTPLYDGLGLSETEAKDY EATVEHIVPLGRFAEPDEIAPAVVFLASDDARYATGAHYGVDGGFGI >MS2163 fabG, FabG protein MNNIQGKVVIITGASSGIGEATAYKLAEQGAKIVLAARREAQLKAIADNI KAKGGEAVYRVTDVVKPEDNQALVELAKSAFGKVDAIFLNAGLMPSAPLS ALETDNWNRMIDVNIKGVLNGIAAVLPTFEAQKSGHVLATSSVAGLKVYP GGTVYCGTKWAVKAIMEGLRMESAQAGTNIRTATIYPAAVQSELVAGITD ETTSQGYRQLYDTYEIPAERVANVVAFALSQPDDTNVSEFTIGPTTQPW >MS1406 fabG, FabG protein MWELQRSKKMKQKEVIVAIGSGSIAQAIARRVSIGKQVLLADIKLENAEA AAKTLREAGFEVSTTVVDVSSRASVQALVQTAVDLGAVKGVIHTAGLSPS QASPEAILKVDLYGTAVVFEEFGKVIAAGGSAVVIGSQSSHRLAIDEISQ AQADELATLEPEKLLELPLVQEINDSLRAYQISKRGNALRVQAEAVKWGK RGARINCISAGIIYTPLAYDELTSSERGEFYRNMLAKSPAGRGGTPDEIG ALAEFLFNSSYISGSDILIDGGVTASYKYGELKPA >MS0719 fcbC, FcbC protein MNNTFQFPVRVYYEDTDAGGVVYHARYLHFFERARTEFLRTLNFSQNQLL HEQNIAFVVKSMTIDYRFPACLDDALIVESEVVEVKGATILFSQILKRDE LVLTTATVKVACVDLGKMKPAALPAEVKAAISK >MS0457 fxsA, FxsA protein MPIIFIITLIAFLFIYGELSLLIAIGSAIGAFGVIMLLLLSVFIGGVILK SKGLFGLNFRRQIAQGEIPADSVVKSLLWMIAGILFIIPGFITDLLACLL LLLPSGLFEKWISQKFTVINSGFTAQGFGRHSHRYRYYKDQNTEVFEAEY EKEVDEKKRIK >MS0946 gloB, GloB protein MLVPIPALNDNYIWLYGRENLPVIAIDVAECKNLSAYLTQHHLQLEAVLL THYHDDHTGGVEELKRYYPDIPVYGPAETADKGATHIVNEGNIQTAHYRI EVVPSGGHTANHVSYLIDNHLFCGDTLFSAGCGRVFTGDYGQMFESITRL KQLPDKTVICPAHEYTLSNLVFAEAFAPNEKVKSAVKNQRISVESLRAQN KPSLPTTLALEKNINPFLQAENLADFIYLRKAKDNF >MS0824 gloB, GloB protein MNIDIIPVTSFQQNCSLIWDDRKNAAIIDPGGEPKKLIEKIEENGLDLKM ILLTHGHLDHIGAAPALKAHFGVDIIGPHEDDVFWFENLPQQSAQFGLFE ANAFLPDMWLNRENEVLEVGSLKLEVLHLPGHTPGHVGFFEHQNIVAFTG DVLFRNSIGRTDFPGGSYDDLISSIKEKLFPLGDDWIIIPGHGPYTTIGA EKKTNPYLK >MS2011 gloB, GloB protein MKKLVLTTLISATLGLSAIAAHAHPTYAPAKNAVKMQKTQVPGYFRQMVG DYEVTALYDGVGNLDMSLMAPFTQFSKAELDAMLDDEFAQRSELGGLEGT IIGFLVNTGDNLILIDAGKGEAEAPIFLDKQGRLIDSLKAAGYQPEQVDI ILPTHMHADHINGITEKGKRVFKNATVYLPLQEKAFWLDTPMDKLPSEIH PFIEAARYAVAPYLKADKVKFYNAGDEVFAGVKTVPLFGHTPGHSGFEFT SKGEKILFWGDVMHNGAVQMAHPEVAIEFDADAEAARTNRQTILTKIAAD KTLIAAAHLPFPGLGHIKTEKDGKGYRWYPVQYRPFDKH >MS2185 glpG, GlpG protein MQLLFRSEIPSFAWQFRDYIRKKYQIELILQQEKTDMRQNVIAVYLSGNS EQTAAILQDLAEFHRNPFDERYERASWETGDVSSGSHSLKELAENSSQGI KQQLLKTGPVTLLITLICIIVYGFEISGMAEQIMQFAHFPYEFGENQQIW RYFTHSLVHLSSMHITFNLVWWWIFGGAIERYFGSTKLIIIYVLAAFATG VTQNFASGPHFFGLSGVVYAVLGYVFVADKFSPNNRFNLPSGFFNVLIIG IALGFVTPLIGIKMGNTAHITGLLVGLILAFLQEKIGKKSK >MS0731 gltD, GltD protein MAKFFLAPADNYDVKIGELVDKFVNKVRSFPPGTCPLVVQYASLRSSMSQ TCGKCVPCRDGIPHLSFLLRDILAGEGDDSTMRQIRELAEMIRDGSDCAI GYQPAIEILDSIEEFKEEYESHIHNKSCQKVIGQRIPCINMCPAHVDIPG YIAHIGDGNYAEAINLIRKDNPLPTACGLVCEHPCEERCRRRLIDDAINI RGLKKYAVDQVAADVVKVPQALPDTGKKVAVIGGGPAGLTCAYFLAQMGH RVTIYERQKALGGMLRYGIPNYRFPKDRLDQDLNAILSAGRIEVKYGVMV GDDIAIEDIYNSHDAMFVGIGAQKGKTLRIKGSEANNVFSAVEMLDDIGN GKIPDYTDKVVVVIGGGNVAMDAARSAVRCKAKDVRIVYRRRQDDMTALH AEIEAAIMEGIELITLAAPVAIEKDEQGNCTGLTVQPQMTGPYDHGGRPS PVAVKKPPFTIGCDVILIAVGQDIISLPFEEFGMPANRGIFQADLTTAVP DMDGVFVGGDCATGPATAIKAIAAGKVAAHNIDEYLGYHHEFPCETKAPP PKENVRIQVGRANTTERPAYIRKCDFEHVENPYTYEEAMQEAERCLRCDH FGCGVLQGGRDL >MS2331 gph, Gph protein MNSQFKLIGFDLDGTLVNSLPDLALSVNSALAEFELPQAPEELVLTWIGN GADILIGRALDWAKEQSGKSLTDEQTAQLKERFSFYYAENLCNVSRLYPN VKETLETLKEQGFILAVVTNKPTRHVQPVLKAFAIDHLFSETLGGQSLPA IKPHPAPLYYLCGKFGLYPHQILFVGDSRNDILAAHSAGCTAVGLTYGYN YNMPIADSHPDWIFEDFADLLKIV >MS0774 guaB, GuaB protein MLRIKQEALTFDDVLLVPAHSTVLPNTANLSTQLTKEIRLNIPMLSAAMD TVTETKLAISLAQEGGIGFIHKNMSIERQADRVRKVKKFESGVVSEPVTV FPELSLGELAQLVKKNGFAGYPVIDQNDNLVGIITARDTRFVKDLNKTVA EVMTPKEKLVTVKEGAKREDIIALMHSHRVEKVLVVDDNFKLKGMITVKD FQKAEQKPNACKDELGRLRVGAAVGAGPGNEERIDALVKAGVDVLLIDSS HGHSEGVLQRVRETRAKYPNLPIVAGNIATAEGAIALADAGASAVKVGIG PGSICTTRIVTGVGVPQITAISDAAAALEGRGIPVIADGGIRFSGDIAKA IAAGASCVMVGSMFAGTEEAPGEIELYQGRSYKSYRGMGSLSAMSQGSSD RYFQSDNAADKLVPEGIEGRIAYKGLLKDIIHQQMGGLRSCMGLTGSATI EDLRTKSQFVRISGAGIKESHVHDVTITKEAPNYRLG >MS0996 gutQ, GutQ protein MDYLQNARETLATEKDALTLLSRNLDQSFNNVIDLILNCGGRLVIGGIGK SGLIGRKMVATFASTGTPSFFLHPTEAFHGDLGMLKPIDIVMLISYSGES DDVNKLIPSLKNFGNTIIALTGNKHSTLAKHADYVLDISVEREACPNNLA PTTSALVTLALGDALAVALINARHFQPMDFAKFHPGGSLGRRLLCRVKDQ MQTNLPVTALNTSFTDCLTIMNEGRMGVALVMENDDLKGIITDGDIRRAL AANGADTLNKVARELMTSNPKVINQDTYIGQAEDYMKEHRIHSLIVVDND NKVVGLVEFSS >MS1519 hflX, HflX protein MNNDVNISKSAVNFTALSSISAPRSDQSDNAIVVHVFFSQDKNPEDLDEF QQLAQSANVNILQVITAARSTPQAKYFVGQGKAEEIAQAVETHNADVVLV NHSLTPAQARNLESLCQCRVVDRTGLILDIFAQRARSHEGKLQVELAQLK HLATRLVRRKTGLDQQKGAVGLRGPGETQLETDRRLIKVRIAQLQNRLAK VEKQRNQNRQTRQKADIPTISLVGYTNAGKSTLFNRITQANVYAADQLFA TLDPTLRRLQIQDVGTTILADTVGFIRDLPHDLVSAFKSTLQETTEAGLL LHIIDAADPRKLENIEAVNAVLEEIKAADLPTLLVYNKIDTLENLEPHIE YDDQHIPVAVYLSAISAEGIDLLFAAIREKLKNEILHLQLNLSPNEGKIR HQLYLLDCIRREEISDQGEFLLEIQIDKIQWLKLAKKFPQLEKCGKNL >MS1518 hfq, Hfq protein MAKGQSLQDPYLNALRRERIPVSIYLVNGIKLQGQIESFDQFVILLKNTV NQMVYKHAISTVVPARSVSHHNNPQQQQQHSQQTESAAPAAEPQAE >MS0399 hit, Hit protein MWIYSFGLRDKLLFKLISDKKCGRFFPKIRKHKMAEETIFSKIIRKEIPA DIIYQDDLVTAFRDIAPQAKTHILIIPNKLIPTVNDVTAEDEAVLGRLFI TAAKIAKLEGIAEDGYRLIVNCNKHGGQEVFHIHMHLLGGEKLGPLNAK >MS1322 hns, Hns protein MNEVIKTLNNLRRLRSMAKELSIEQLENIIEKFQLVIEEKKAEELEIKRL EEERKNRLEKYRELLKEDGITADELAQILAGKNNTAKAKRAPLSAKYKYI NENGEQKTWTGQGRMPKAIQLQLNAGKSLSDFAI >MS1545 hybF, HybF protein MEIVEEQCHRNNVNKVTDIWLEIGPLSCVEPDAIEFCFEVCRKNTVMENC KLHFVPVPALAYCWHCEKTVEIKSHHDACPQCGGIHLQKQGGDDLRIKEI AVE >MS1693 icc, Icc protein MISNTYIYEADSDVIRFVQITDPHLFKDEQGELLGVNTQQSLTQVLTELK ENQFNYDFVLATGDIVQDSSEEAYLRFCKSVQQLDKMVFWIPGNHDFQPK MFDILVQEHGNLSPKKHLLLGDKWQILMLDSQVFGVPHGQLGQYQLEWLD SKLKDNPDRYSLVVLHHHILPTHSSWLDQHNLRNAHELAQVLAQYDNVRG ILYGHIHQAMDGTWKDYQIMATPSTCIQFKPDSNVFALDTLQPGWREVEL HSDGSIITRVNRIQKASFLPNMQEDGY >MS2079 ldhA, LdhA protein MTKSVCLNKELTMKVAVYSTKNYDRKHLDLANKKFNFELHFFDFLLDEQT AKMAEGADAVCIFVNDDASRPVLTKLAQIGVKIIALRCAGFNNVDLEAAK ELGLKVVRVPAYSPEAVAEHAIGLMLTLNRRIHKAYQRTRDANFSLEGLV GFNMFGKTAGVIGTGKIGLAAIRILKGFGMDVLAFDPFKNPTAEALGAKY VGLDELYAKSHVITLHCPATADNYHLLNEAAFNKMRDGVMIINTSRGVLI DSRAAIEALKRQKIGALGMDVYENERDLFFEDKSNDVITDDVFRRLSSCH NVLFTGHQAFLTEEALNNIADVTLSNIQAVSKNATCENSVEG >MS1188 ldhA, LdhA protein MMKSAVVFTALFLYAISHIKDELYLPKQGAFMKIVFLDSTALPPHLPIPR PDFDHEWIDYPYTGAEQTVERAKDADIVVTSKVIFSREVMEQLPKLKLIA LTATGTNNIDLIAAKELGIRVKNVAGYSSVTVPEHVLGLIFSLKHSLAGW YRDQLEGKWGESKQFCYFDYPITDIRGSVLGVVGKGCLGTEVGRLATALG MKVLYAEHRDAQSCREGYTPFDEVLKQADIVTLHCPLTEHTTNLINKETL SLFKKGAFLINTGRGPLVDEQALLDALKSGHLAGAAIDVMIKEPPEKDNP LIVAAKTMPNLLITPHIAWASDSAVTTLVNKVRDNIEEFVATGK >MS1288 lppC, LppC protein MTILLQRAKFKKRLMPILFPLMLAGCTNLFGSNFQDVLRNDANASSEFYM NKIEQTREVEDQQTYKLLAARVLVTENKTAQAEALLAELTKLTPEQQLDK SILDALIAAVKRDNDSASALLKTIPLAQLSQSQTSRYYEVQARIAENKTD IIEAVKARIQMDMALTDVQRKQDNIDKIWALLRSGNKTLINTTQPEGNVA LAGWLDLTKAYNDNLSQPSQLAQALQNWKTTYPNHSAAYLFPTELKSLSN FTQTQVNKIALLLPLSGNASILGSTIKSGFDDSRGADKSVQVDVIDTMAM PVTDAIALAKQNGDGMIVGPLLKDNVDVILSNPTAVQGMNVLALNSTPNA RAIDKMCYYGLAPEDEAEAAANRMWNDGVRQPIVAVPQSDLGQRTASAFN VRWQQLAASDADVRYYNQPDDAAYNLTADPAQNQAIYIVVTDSEQLMSIK GALDNSGVKAKIYTNSRNNSSNNAVEYRLAMEGVTFSDIPFFKDLDGEQY KKIEAATGGDYSLMRLYAMGADSWLLAHSFNELRQVPGFSLSGLTGKLTA GPNCNVERDLTWYSYQGGNIVPLN >MS0351 mazG, MazG protein MIPCLIEESYEVVEAIQQKNTADLREELGDLLMQVVFLSQLAAEENKFTF DDVVNDIAEKLIYRHPHVFGDKEAADEHAALRNWNEMKAREAKNQAHTSI LDNVPFSFPALLRAEKLQKKCAKAGFDWQQVAPVIAKVEEELEEVTQEIN CPAPQQAKLEEEIGDLLFAVVNLSRHLKCQAEESLRKANHKFERRFRAVE DKLRQQNKTATESSLMEMDMLWDEVKHEEKVSSD >MS1417 mdaB, MdaB protein MKNVLIVSGHPNLKTSIANQVILDETAKALPNAEIRKLDELFHNGTFDIA AEQAAVLKADVLVFQFPFSWFSLPGVMKIWLDEVFEHGFAHGSTAQLAGK KIIFSTTTGAPAEVYQKDGFFKYTMEEFAAQFEIMAQLCNLDYQGLIYTN GIGYTSRENEEKINAQKAEAKKHAQRLVALIEKA >MS2162 mdaB, MdaB protein MVIFLTVCYHTSGRFLSIFCKCRYSTSGEEYMNRRNLLKAGVALAAVAAM PFGRAQAKTPSKKTLVIVSHPYPESSTFIKGLQQAAETVEGVTVRNLETI YGFDTRAVKGDEERRIMRAHDRVVFIFPTHWFNITPMMKAYLNETWGSVG PGLWQGKEMLVVSTAAGGSETYGKNGRVGVELADVFLPMKASALHCGMTY LPPLVFQGVRSSELANYQQQLIERLMQ >MS0836 mdaB, MdaB protein MKHLVIFAHPNTKNSFNKAILERVLQASQKMNVDTTVRDLYGMNFNPVVS WEELTGSFKEIIPAAIRHEQQLISEADLITLIYPLWWMGFPAILKGYFDR VFTHGFAYKTDETGTVGLIQGKKMQQFITMGNNEERYQQMGFARSLNDTL VNGLFNYVGIIDIDHRLLGDIHIISSEERQALLNEVEQKTKENLTALLEG KA >MS2139 mdaB, MdaB protein MSNILIISGHPNLANSVVNTIILDEFAKTLPQAEIRKLDQLHTNYEFDVA AEQAAIEKADVILWQFPFYWYAMPALMKKWLDDVFVHGFAHGSTAKIAGK KLLISLTTGAPLEAYQREGFFKHKMDDFFAAFETTAILCGLDFQGVQFLN GVSYVGRNEEKIAQQQAEAKVYAQTVIEKVKRL >MS2094 mdaB, MdaB protein MKTTVLVVHPNIKQSRVNAALAKGAADVAGVKVRYLYDLYPDGKIDATAE QAVLEKADRIVLQFPMYWYSSPALLKQWLDDVLAYGWAYGDKQALKGKEL MLAVTTGGGEEFYQKDGLAGHTVAEFLVAYETIASYLGMNYGKMFVTGNC LNISDDEIAAQVPRYQAVLSA >MS1631 metG, MetG protein MSNQHRQILVTCALPYANGPIHLGHMLEHIQADIWVRFQRMRGNEIHFVC ADDAHGTPIMLKADQMGITPEQLIADVKEKHYADFCGFNISFDNYHSTHS EENRELSELIYSRLKENGFIKSRTISQLFDPEKSMFLPDRFVKGTCPKCK AEDQYGDNCEVCSATYSPTELINPRSAVSGATPVIKESEHFFFDLPSFES MLKEWNRSGALQSEVANKMQEWFDAGLQQWDISRDAPYFGFKIPGTENKY FYVWLDAPIGYMASFKNLCKRENLDFDRFWNKDSNTELYHFIGKDIMYFH SLFWPAMLDGANYRKPTNIFVHGYVTVNGEKMSKSRGTFIQAATYLKHLD PECLRYYYAAKLSNRIDDLDLNLDDFVQRVNTDLVNKLVNLASRNAGFIQ KRFDGKLADKLEDESLFAEFIAQSEQIAAYYENREFGKAIREIMALTDKA NKYVDDKAPWVIAKEEGREAELQAVCSMGIQLFRVLMGYLKPVLPKLAER SEAFLQAELTWDNLAQPLLNHGIAPFKALFSRLDVKQIDAMIEASKAENA AVNATVKKEEKNSKKSTALLTDFEPIEPEISIDDFAKIDLRVAKVIKCEE VPESKKLLKFQLDLGFEQRQVLSGIKGAYNNPEELEGRFVIVVANLAPRK MKFGVSEGMILSAGTGGEDLYLLDVDAGVKAGSRVM >MS1793 mhpC, MhpC protein MNTMTLVFLHGLLGTKSDWRKIIENLPHFRCVSLDLPFHGEHKFTEANNF EQCADFISHQIKSAVGNQPYFLVGYSLGGRIALYYALQSQCEKGNLQGLI LEGANLGLTCDEARKVRWKNDEFWAQRFITESAESVLNDWYQQPVFAHLN AQQRADLIEKRVTNCGKNIGKMLEATSLAKQPYLGDKVRESTLPVYYLAG EKDQKFRQMAVQEKLNLQLIANAGHNAHLENPVEFSQKLTALLRNHKIKK TDNL >MS0862 mhpC, MhpC protein MKLLNYQFHQLKQPSNQATMVFIHGLFGDMNNLGIIARAFSDAYNILRLD LRNHGQSFHADEMNYSLMAQDIIHLLETLQLTKVILIGHSMGGKAAMKTA ALRPDLVEKLICIDIGPIAYAHRWHDDVFAGLFAVKNAQASSRQEAKPIL ASYIKDEGVIQFMLKSFDGNAAEKFRFNLSALFNNYGQIMGWEEVFFDKP TLFIKGGNSDYLQSGYGTRILAQFPQASSFTINGSGHWVHAEKPEFVVRA IQRFLESN >MS0882 mhpC, MhpC protein MMLYETKGNGEPIIFLPGLFAGGWIWNSVVRNIQDKGFKTFTFTDPIPVA FEGSQQKALTELDTITENCSTPVYLVGNSLGALIALHYAFQRKDRVKGVI MSGAPGQLEMEAGVSLDELKTGKDKYTTLLGSRIFYDQSKIPPHGIEEVK YLFGTEKIFRNIVRWLYFSRKYDVPDVLQKISIPIDFIWGQYDLITPIEP WIDIAKNFPQTSMTIIKDSGHSPMVEQPELFTEALLRKISSGRTHIK >MS2156 mhpC, MhpC protein MIMTISALDFFKRDVTLPNQLDGLPHKLSDVTGLQIGSFKTNDGVSLNYW KAGSGEPLVFVPGWSSNGAEYINLIHLLKDKFTVYVLDQRNHGLSDKVKF GNRISRFAMDLHEFFNAENIEKAHLCGWSMGCSVIWGYVDLLGTSRVEKF VFIDEAPSIYCHSNWTEEERINAGAFTTSAEMMIDMYYGRGTCNMLQVNT DLFNFYNTIDALAFENSMALCDQVCPHDKDALEQVLFDHILNDWRDVLIN KIDKPTLVVSGEHSNWVESQRWIAQTVPNSEDLIYGKHEHGDHFLHLKMP QKFAGELTEFLNRMS >MS1344 modE, ModE protein MDNTEILLTIKLHQRLFVDPKRIRLLKEIAHCGSINQAAKNAKVSYKSAW DHLEAMNAISPKPLLERNIGGKNGGGTQLTNYARRLLQLYDLLEKTQEKA FQILQDESIPLNNPLSATARFSLQSSARNQFFGKVTKLELKNGHCMVSIQ IEGLNRPLVASITEKSAVRLGLVPGKEVMLMIKAPWIKTQLEEPVDKENQ FLAEVRSVSDKGGEKEIILSIGENPEFCATIEKTVDVAVNQKRWLYIDPE QIVLASL >MS0019 mutT, MutT protein MNLLQKPEILGISVAAKSRIFEIQAVELKFSNGELRTYERFKPSSRCAVM VLPIDGEDLLMVREYAVGTERYELGFTKGLMEAGETPEQSANREMQEEIG LGAKQFMLLRTVNSSPSFMNNPMHILIAQDFYPSKLPGDEPEPLQLVRVP LANINELIEDPGFSEARNLVALYTLRDYLRKLK >MS0709 mutT, MutT protein MNYKNPNSVLVVIYAKNSGRVLMLQRQDDPEFWQSVTGSLAEKEMPFLTA LREVKEETGIDIKRENLTLVDCHQSVEFEIFPHFRYKYAPNVTHCKEHWF LLELPDERVPVLTEHLAYQWLEPAKAAELTKSPNNAQVIRKYLINKSA >MS0328 mutT, MutT protein MDKKTVQVAAGIIRNEFGQIYLTQRLEGQDFAQSLEFPGGKVDVNETPEQ ALKRELEEEVGIVALNPVMFEQFVFEYPNKIIHFYFYLISEWIGEPFGRE GQEGFWIEQLDLDESQFPPANSKLIQRLLAEMNC >MS0408 mutT, MutT protein MIDFDGYRPNVGIVICNRKGQVLWAKRYGQNSWQYPQGGINDGETPEQAM YRELYEEVGLTRRDVRIVYASKQWLRYKLPKRLLRYDSKPMCIGQKQRWF LVQLMSDEKNINMNCSKSPEFDGWRWVSFWYPVRQVVSFKRDVYRKAMKE FACFLFDANKTVNPLSTNNNDEKKANYSAKKPYSPYRNQDKKRKTRV >MS2341 mutT, MutT protein MLKPHVTMACIVHCKGKFLFVEEIEYGKRTLNQPAGHLEENETILEGASR ELYEETGIRAKMQHLVKIYQWHAPRSQKDYLRFVFALELDDWAEITPHDS DITQGFWLTLEEFNYYIRQENQCARNPLVTEALEDYLAGSRYPLDILTLF NN >MS1694 mutT, MutT protein MLIFCEQVQKNYKKNLKIFNFELSLPIVFAGGSVMSELQQFSQQDIEVLN EETLYSGFFKMKKVRFRHKLFAGGMSEVVTRELLYKGAASVVIAYDPVRD EVVLVEQVRIGAYDPNLSSSPWLMELIAGMIEEGESPEEVAMRESEEEAG VTIDNLEYALSVWDSPGGTVERLYLFAGRVDSSKAKGLHGLACEHEDIKV HVVSRETAYQWVNQGKIDNSSAVIGIQWLQLNYRRLQKNWC >MS1528 mviM, MviM protein MKKINVGIIGTGFIGAAHIEAIRRLGFVDVIALAENNQQLAEQKAKELNI PLAYDCVDKLLANPDIQVVHNCTPNHLHFAINKKVILAGKHVFSEKPLCL TSQEADELTSLAEQQGVTTAVGFVYRNFAMVQQAADMVRDQQIGRVFAVN GHYLQDWMLLETDYNWRVDPKVGGKSRTVADIGSHWCDTVQFVTGKKIKE VFADMSIVYSTRKASKQVESFVTVNADSSYELKPVETEDYASVLVRFEDG SKGSFTVSQVSAGHKNDLTFDISGSEKSLHWEQETPQYLKIGYRQQANQI LCDDPSLVNPAVRAYNHFPGGHIEGWPDAFKNMMLAFYAFIAEGKDPQQD TAKFAMFKDGAQIVHIVDTIIESAQQGKWISVK >MS1500 mviM, MviM protein MKKFALIGAGGYIAPRHLRAIKDTGNTLVVAMDVNDSVGIMDSHFPDAEF FTEFEQFEAFVEDQKLKGEKLDYVAICSPNYLHAPHMKFALKNGINVICE KPLVLNSTDLNMLSEYEQKYGAKVNSILQLRLHPSIIALRDKVEAAPADK VFDVDLTYLTSRGKWYLKSWKGVDQKSGGVATNIGVHFYDMLHFIFGDVV KNEVHYRDEKTVSGYLEYKRARVRWFLSIDANNLPENAVQGEKLTYRSIT IENEELEFSGGFTDLHTQSYQRILEGKGYGLEENRTAIETVEVIRHAPII ENPANPHPFLAKVLNK >MS1414 mviM, MviM protein MDMKLGIVGTGMIVADLMQTLHKVTLEKLAIWGRDQVKTTQFASENGISQ VFADYEAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTG EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ YSSRYDRFKAGETLPAFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCAYA ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP ANAMNRFTYTPNQGEAQHFEFGDAHRMLPEFERFVEIIDRKDFAQAEKML DISAAVSEVLEQARKGAGIKFAGE >MS1388 mviM, MviM protein MGMKLGIVGTGMIVRDLMQTLHKVRLEQLAIWGRDQAKTAQFAAEQGILQ VFSDYAAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTD EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ YSSRYDRFKAGETLPVFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCTYA ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP ANAMNRFTYTPNQGEAQHFEFGDVHRMLPEFERFVDIVDRKDFAQAEKML DISAAVSEVIEQARKGAGIRFAGE >MS1755 mviN, MviN protein MSKRLLKSGIIVSTMTLLSRVLGLVRDVVIANIIGAGATADVFLFANRIP NFLRRLFAEGAFSQAFVPVLAEYQRSGELSKTQEFIGKVSGTLGGLVSIV TLLAMVGSPVVAAIFGTGWFIDWINDGPNAEKFTSASLLLKITFPYLWFI TFVALSGAILNSLGKFGVMSFSPVLLNIAMITTALLLAPQMESPDVALAI GIFIGGLLQFLFQLPFLKKAGLLVRPRWAWNDEGVKKIRTLMIPALFGVS VSQINLLLDTFIASFLMTGSISWLYYSDRLLEFPLGLFGIAISTVILPTL SRQHVNRADDVQKSAADFRATMDWGVRMILLLGVPATIGIAVLAQPMLLV LFMRGQFSLTDVQATSYALWSINVGLLSFMLIKILANGYYARQDTKTPVK IGIIAMISNMVFNLLAIPFSYVGLAMASAMSATLNAYLLYRGLAKADVYC FTKQSAVFFLKVLAAALVMGTVVWYFSPQLVIWNEMAFLTKVIRLAELIL IAASSYLLMLVILGIRKRHLLAR >MS0494 nrfG, NrfG protein MIQLKKLFNFVVFLPGLFFAFALSGCVNGADDVFVSKNKIILGEQYPNVH FDQEVMIVRISQMLIIGQLSKNERADLYFERGVLYDSLGLWGLARYDFTQ ALALQPRSPAIYNYLGLYLLLDEDYDSALEAFNAVLELDPNYDYTYLNRG LDFYYMERYNLAQQDLLKFYEAKKDDPYRALWLYINELKFKPNEATQNLA RRAKDLSTEYWGTYIVQYYLNEISVKDLLDKAKVFVDPQSSQYAEILTET YFYLAKQKLNAGHAEEAETLFKLAMANQVYNFVEYRFALFELAKLKTNSE QTEQAVVQRVKTTQAPNSKELDAE >MS1820 nrfG, NrfG protein MRKFKSLTLIALSVLVIASCSSSEKPVEQASEQELFSTGANYLQEGNYTQ ATRYLEAVDSRFPGSSYSEQAELNLIFSTYKSQDYTKTLTTADRFLQQFP QSQHLDYVLYMAALTNSALGDNLFQDFFGVDRSTRETTSMKTAFNNFQTL VQNFPNSPYTPDALARMAYIKDRLARHELEIAKFYAKRSAWVATSNRITG MLRSYPDTQATLEALPLLQESYEKMGLTQLASQAATLVKANEGRVIKEAE KPKEPFLSLPSWLSFGSSDSSDKEKVATKSDDSFFSWPSWLSFGSKD >MS1594 obg, Obg protein MKFIDEALIRIEAGDGGNGCVSFRREKYIPKGGPDGGDGGDGGDVYLIAD ENLNTLIDYRFEKRFAAERGENGRSSNCTGHRGKDITLRVPVGTRAIDND TKEIIGDLTKNGAKLLVAKGGYHGLGNTRFKSSVNRAPRQKTMGTPGEKR DLQLELMLLADVGMLGLPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGV ARVDANRSFVVADIPGLIEGASEGAGLGIRFLKHLERCRVLIHLVDIAPI DESDPADNIGIIESELFQYSEKLADKPRWLVFNKIDTISDEEAAKRAKDI TERLGWEEDYYLISAATGKNIPQLIRDIMDFIEANPREVEEEEKAAEEVK FKWDDYHNEQLSERGFDDEEDWDDDWSEEDDEGVEFIYKP >MS2242 oraA, OraA protein MTTLAFSYAVNLLSRREYSEFEIRCKMQEKAFSEQEIEDTLAQLQQKNWQ SDKRFTENYLRARAQRGYGVNRIKQELRQLKGILPETVDEALMECDIDWS EIALNVLAKKFPDYRARQDAKNKQKIWRYMLSHGFFAEDFADFIGNGTED EFY >MS1291 osmY, OsmY protein MNMHKLKKLTFIIGSALLLQGCVAALVGGGAVATKVGTDPRTTGTQLDDE TLKFQVYNAVNKDEQIKQEGRIVVSSYSGRVLLLGQVPTESLKSVATSLA KGVDGVGDVYNEIRVGSPITVTQKTKDSWITSKIKSDMLLNSSVKTTDIK VITENGEVFLMGNVTQEQANAAAEVARNIAVLKKS >MS1958 perM, PerM protein MIEMLKNWYLRRFSDPQAMGLAAILFFGFVAIYFFSDLIAPLLIALVLAY LLEMPISFLSDKLKLPRFLSILLILGGFIAVTILMIFGLIPTLINQTVNL FSDLPNMLNLSHQWVMSLPESYPELVDYQMIDSLFITIREKTLAFGESAV KFSLSSLMNLVTIGIYAFLVPLMVFFMVKDQDELIAGFSRFLPKNRTLAS KVWQEMQLQIANYIRGKLFEILIVAVVSYIIFLFFGLRYPLLLAVAVGLS VLIPYIGAVLVTIPVALVAIFQFGATPTFGYLMTAYIVSQLLDGNLLVPY LFSEAVNLHPLTIIIAVLIFGGLWGFWGVFFAIPLATLVKAVVNAWPSNE DEAIS >MS0428 perM, PerM protein MNKSVSVNQFLIGFAALVIILAGIKMAGEIVVPFLMSLFIAIICSPIIKF MTNRKIPHWLAISILFLFIVLVFFFLLGLVNSSIREFSQSIPQYRVLMSE RLNEITALIQKWNLPLNLEKETILEHFDPSSIMNFVSRLLLSFSNVLSNA FVLILVVIFMLLEAPTAKRKVALALSGNEKDASKEEKHLERILQGVISYL GVKTAVSLLTGLCAWVLLETCGVQYAVLWATLTFLFNYIPNIGSIIAAIP IVLQALLLNGFSTGFAVMTGIIAINMLIGNFLEPKLMGRTLGLSTLVVFL SLLFWGWLLGTVGMLLSVPLTMALKIMLEASPNTTKYAALLGDVEESN >MS1478 pfoR, PfoR protein MKNRLKNFLIRQNIKFSLRRYAIDAMNFMALGLFGSLIIGLILKNTGDWL DILWLNELGALAQSSMGAAIGVGVAYALKAPPLVLLSSTTTGIAGATLGG PIGCFIAAAIGAEFGKLVNKTTPIDILITPAVTLLSGIATAQFMGPFLAS LMRETGAMIMWAVELHPIPMSILVSVLMGMILTLPISSAAIAVTLSLSGL AAGAATIGCCAQMIGFAVIGFKENRWGGLLSLGLGTSMLQIPNIVKNPKI WVPPTLSGAIIAPFATVIFQMQNIPSGAGMGTSGLVGQIGTINAMGNSPY IWLVILVLHFILPAILSLLITYLMRRKGWIKPGDLKLAV >MS1090 pheT, PheT protein MKFSEQWVREWVNPAVNTEQLCDQITMLGLEVDGVEAVAGEFNGVVVGEV VECAQHPDADKLRVTKVNVGGERLLDIVCGAPNCRQGLKVACAIEGAVLP GDFKIKKTKLRGQPSEGMLCSYRELGMSEDHSGIIELPADAPVGKDFREY LILDDKEIEISLTPNRADCLSIAGVAREIGVVNQLAVTEPAINPVPVTSD EKVAINVLAPEACPRYLLRSVKNVNVNAETPVWMKEKLRRCGIRSIDPIV DITNFVLLELGQPMHAFDAAKLAQPVQVRFAADGEELVLLDGTTAKLQSN TLVIADQTGPLAMAGIFGGQASGVNAQTKDVILEAAFFAPLAITGRARQY GLHTDSSHRFERGVDFELQHKAMERATSLLVEICGGEVGEICEVVSETHL PKLNKVQLRRSKLDALLGHHIETETVTEIFHRLGLPVSYENEVWTVTSAS WRFDIEIEEDLIEEIARIYGYNSIPNNAPLAHLSMREHHESDLELSRIKL ALVGNDFHEAITYSFVDPKLQSILHPEQAVWILPNPISSEMSAMRVSLLT GLLGAVVYNQNRQQNRVRLFETGLRFIPDESAEFGIRQELVFAAVMTGSR LSEHWASKAEPADFFDLKGYIENLLSLTKAGPYIKFVAKEFPAFHPGQSA AIVLDGEEIGYIGQLHPMAAQKLGINGKAFACELIVDKVAERNVANAKEI SKFPANKRDLALVVAENIAASDILDACREVAGSKLTQVNLFDVYQGQGVP EGHKSLAISLTIQDTEKTLEEDDINAVISVVLSELKDRFNAYLRD >MS0580 potD, PotD protein MRNILRKALSLTITALAVANFAQAENLTDKSWPDIEAQAKKEGKLTVSVW YLQPQFRVFVKEFEKQYGIQVKVPEGTLDGNINKLIAEKNLEKGKMDVVV LSADRVSNVTNNGVLANIKQLPNFGKLNHFLQGVDLGETAVGYWGNQTGF AYDPLRITEDQLPQSWQDVENYIQQNPKKFGYSDPNGGSSGNAFIQRALV YVNGEYDYMTPTVDAAQVANWKKTWEWFNARKNVMIRTASNADSLTRLND GELVLVSAWQDHLFSLQKQGAITTRLKFYVPQFGMPGGGNVATIAKNAPN PAASLVFIHWLTSPEVQQKLSQEFGVRPLDSESGKRDTLFFSTPWRKAEM EAFTKEVVSR >MS0817 pqiB, PqiB protein MASPFSLRCFQPHSLIPVYFGIFMTNNQANNRVKINAENNVQAAKIKQDK RISPFWLLPIIALCIGALLFFQIIKEQGETIRITFTTGDGLVANKTQVRY QGLQIGIVKKVNFTDDLKKVEVQASIYPEAKNVLRENTKFWLVQPSASLA GISGLDTLISGNYISLQPGDGNYKDDFIAEETGPIAQVSDGDLLIHLLAD DLGSISEGASVYYKKMPVGKIYDYRFTPDQKKVEIQVVIDKAYANLIKQD TRFWNISGINANVGPSGITVNMDSLNAIVQGAITFDSPDNSPKAKQDQQF TLYPTLQAAQRGIEVKITLQNQAGLKAGKTEVFYNNLQVGTLAKLDNEDI THAKISGTLLLDPNISNELRTNTNIILRTPKMNLATLEKLPDMLRGQFFE IIPGSGEPQREFQVYKESDLLLKQADTLVFTLTAPETYGIAEGQQIFYNN LPIGEIVKQTLNEQGVEYQAAIAGKYRHLIYGDSQFVAASNLDISLGIDG LRVEAASPDKWLQGGIRLIANKNKGSALSSYPIYKDLSSAEAGITSSTLT PTITLNAQNLPNIGKGSLVLYRQYEVGKVLDIRPLKNSFDVDVAIYPKYR HLLTKNSLFWVESASQVDITARGISIQTSPLGRVLKGAISFDNSGGNNNK TLYANELRAKSAGQVITLTADNATNLTKGMALRYMGLEVGQLESINLDQN KNQVVVKALMNPNYMNLVAKEGSEFRIISPQISAGGIENLDSLLQPYIDI DAGKGKYKTTFAIKNNNNTDNKYNNGFPIILEASDALNITTGSPIYYRGV EVGKINRMELNELGDRVLIHLLIANKYRHLVRKNSEFWISSGYSAGVGWS GIEVNTGTVQQLLKGGISFSTPSGTVIQPQAAANQRFLLQIKKPVEAKTW NSAVLPEQN >MS0807 proP, ProP protein MLMTSQNKINAVPSNQNFYLNNRNYWIFSGYFFVYFFIMATCYPFLGIWL GDINGLSGEDRGTVFAMMSFFALCFQPVFGYVSDKLGLKKHLLWVLGISL LIYAPFFIYIFAPLLKVNVWLGSLVGGAYIGFVFQAGAPASEAYIERVSR RSKFEYGRVRMFGMFGWAICASIAGVLYATNPNLVFWLGSIASLILLLLI ALAKPEQTSTVQIAEKLGANKNPVNLRQAFALLKLPKFWALLAYVMGIAC VYDIFDQQFGNFFNTFFESHEQGIKMFGYVTTAGELLNALIMFFVPLIIN RIGAKNALLIAGTIMSVRIIGSSYAIEAWHVVVLKTLHMFEVPFYLVGLF KYIANVFEVHFSATIYLVACHFAKQIGNMLVSPLVGAWYDTYGFQDTYLI LGCIAAGFTLLSVFTLTGKSLSSQS >MS0191 proP, ProP protein MSNKVNSYGWKALMGSAVGYAMDGFDLLILGFMLSAISADLSLSPTQAGS LVTWTLIGAVAGGIIFGALSDKYGRVRVLTWTIVLFAVFTGLCAFAQGYW DLLIYRTIAGIGLGGEFGIGMALAAEAWPARHRAKASSYVALGWQVGVLA AALLTPLLLPIIGWRGMFLVGIFPAFVAWYLRAKLHEPEVFVQKQAEVAT GKRQSPFKLLIKDVATAKVSLGVVVLTSVQNFGYYGIMIWLPNFLSKQLG FSLTKSGVWTAVTVCGMMAGIWIFGRLADRIGRKPSFLLFQIGAVISIIA YSQLTDPAIMLFAGAALGMFVNGMMGGYGALMSEAYPTEARATAQNVLFN LGRAVGGFGPVIVGAVVSAYSFKIAIALLAVIYVIDMIATVFLIPELKGK ALK >MS2054 proP, ProP protein MASGEANYRSLAWIAASALFMQSLDATILNTALPTIAADLHHSPLEMQLA VISYALTVALFIPISGWVADKYGTLRVFRFAVGMFALGSLACAMSSSLIM LIFSRVLQGFGGALMMPVARLSIIRSVPKQELLPVWNLMATAGLTGPILG PILGGWIVTYTSWHWIFLINIPMSLLGIWLANRYMPNVTGSLQKLDWAGF FFLGGGLVGVTLGFDLISEEFIAKWQATVIVILGVILIITYCFHAQKRER LALLPLSLFKIRTFRVGIMANMLIRLCASGIPFLLPLMYQVVFHYSADKA GMLIAPIALSSMLVKPLCGRILTKLGYRTALISASIVLTLSIAVMSFLHI DSPVWILIVNVALYGGCISIVFTAVNTLTISELSDQDASAGSTFLSVVQQ VGIGLGIAVSALILSLYRYFIGESAVQLQQAFGYTYLTSASFGVLLVLVL SGLKKEDGAHLHK >MS1530 proP, ProP protein MNTETKQPALIVPRLSLMMFMEFFIWGSWSVTLGIVMTKYDLSTLIGDAF SMGPIASIISPFILGMLVDRFFPSEKVLAVLHLIGAAILWFIPEFITGQQ GGTLVFALLAYMLCYMPTVALTNNIAFHSLADSEKSFPVIRVFGTIGWIV AGLFIGQADLSASPAIFQVAAICSLILGLYSFTLPNTPPPAKGKPFSMRD LMCADAIALFKIPHFLVFAICATLISIPLGTYYAYAAPFLDAVGFEKIGS LMSMGQMSEIVFMLLIPFFFKRLGVKYMLLAGMLAWFLRYAFFALGVSEE IRWAVYLGILLHGICYDFFFVVGFMYTDKVADEKIRGQAQSLVVLFTYGL GMLLGSQISGGLYNNMFADNTDVSTWSTFWWIPAISAVVISVIFFIFFNY KEDKREA >MS0785 proP, ProP protein MQNKFAVYLAAIGHLVTDMAQGALPALLPLFIKNYGLTYQEAGGLIFANT VLASIAQPFFGYLADKRSMPWLIPLGMMLSGCCIAAMGFVHSYPGLFFFA MIAGIGSALFHPEGARLVNRMSGGEKGKAMGIFAVGGNAGFAIGPMFAGL AYLFGAQTLSIFALINTIIALIIFLQLPKLTVENVVNKAKNTASTTLQND WRSFAKLSVIIFVRATNFTVLNAFIPIYWIHILHQQETDANFALTIFLSM GVAITFIGGLLSDRLGYVRIIRYAYLIFLPTILIFTQSENLWLSFILLIP LGLGVFTQYSPIVVLGQTYLAKSVGFAAGITLGLGITMGGIFSPIVGWIA DHYGLQIALQTLSVLSLLGLIFSYRLKITDTEKPEKK >MS0499 proP, ProP protein MNLREHIDNNPMSAYQWTVVIIAAIMNLLDGFDVLALAFTATAIRGDLGL SGAELGYLFSAGLLGMAAGSLFLAPLADKIGRRPLLLISVTLSALGMLGS AYSASYGALGFWRLITGLGVGGILVGTNVLTSEYSSRKWRSLAISIYASG FGIGAVLGGMFAVVLQEEYSWHAVFLAGFILTAVCLIVLLIWLPESIDFL MTQQPRNAQIRLNKITKKMGLKGQWTLPEKVLASASKLPLTQLFNKNYRK STALIWIAFFAIMFCYYFVSSWTPALLKEAGMTTEQSVSVGMMVSLGGTC GSLLYGLLASRWKAKQMLVQFTVLSAFSVIIFILSSSILWLAMLFGILVG GFMNGCISGLYTLNPSIYAANIRSTGVGWSIGVGRIGAILAPLAAGVLLD YGWDKQSLYIGVGFVLLIAAIALSLLRIKTTLVKC >MS0797 proP, ProP protein MSQNHFFSHIFNRNMLICIFTGFSSGLPLYILTSLIPTWLRSTEIDLKTI GFFTLTSLPFIWKFLWSPFLDRFVPPFLGRRRGWMLIFQLLLLISLGLFG FIDPHTNQGLSLLIGLATMVSFFSASQDIVLDAYRREILSDQELGMGNSI HVSAYRIAGLVPGSLSLILSDHFSWQAVFIITALFMLPGLLMTLFISHEP QIELKSNRTLAENIVEPFKEFFQRKGLWGAIGILTFIFLYKFGDSMATAL ISAFYLDMGFTKTQIGLVVKNASLWPMIIAGIIGGMITLKIGINKALWLF GLVQIVTILGFAWLAQLGPFEKVDSFAIFALTVVVMAEYVGIGLGTSAFV AFMARATNPVYTATQLALFTSLSALPRAVFNSFSGVLIENMGYYHYFWLC FFLAIPGMLCLIWVAPWKEK >MS2374 proP, ProP protein MSGEKTSRYVLGVTLVATLGGLLFGYDTAVISGTVSSLDTVFIQPKGLPE ISANSLLGFCVASALIGCIIGGACGGYLSSKYGRKKALLIAALLFLISAF GSAYPEFGLKTINETNNIPYYLSNFLIQFVIYRIIGGIGVGIASMVSPMY IAEITPARIRGKMVSFNQFAIIAGQLIVYFVNYFIALNGDNTWLNMLGWR YMFLSEMVPAALFLILLFFVPESPRWLVLQNKFSQAEITLLKLLGERSGK TELQNIVSSLEHRVVKGAPLFSFGLGVIVIGIALSVFQQFVGINVALYYA PEIFKSLGASTNNALLQTIIMGTINLSCTTIAIFTVDKYGRKPLQIIGAL GMAMGMFVLGMAFYANLSGTIALTGMLFYVAAFAISWGPVCWVLLAEIFP NAIRSQALAIAVAAQWIANYIVSWTFPMMDKSSYLVERFNHGFAYWVYGL MAILAALFMWKFVPETKGKTLEELELLWNKK >MS0392 proP, ProP protein MSTAKKRNFIFIATLGILSMLPPLGVDMYLPSFLNIARDLQVDPERVQYT LTFFTFGMAAGQLFWGPVGDSYGRKPIILLGVIIGAVAAFFLTGVNSIEN FTALRFIQGFFGSAPVVLVGALLRDLFDKNELSKTMSMITLVFMIAPLVA PIIGGYLVLFFHWHSIFYVICAMGILSAILVFFIIPETHHQDNRIPLRLN VVVRNFVTLWRRKEVLGYMFSSGLGFGGLFAFLTAGSIVYIGLYGVPVDQ FGYFFMLNIGVMTLGSVINGRVVHRVGAERMLQIGLTVQLIAGIWLLIVA CFDLGFWPMALGIAVFVGQNSLISSNAMASILEKFPTMAGTANSVAGSVR FGLGATVGSLVALMKMDSAAPMLFTMGICVIVAVCCYYFLTYRSL >MS1798 proP, ProP protein MNVRPFTWLALSYFGYYCAYGVLVPFLPVWLKSQNYGTELIGAVIASSYL FRFLGGIFFPSRVKRANQILPALRLLAWANVFVITAMAFVSESFWLIFIA IAVFSMVNAAGMPLTDSMATTWQRQIRLDYGKARLIGSAAFVVGVTVFGS LIGAIGEQYIISILIGLFGLYAVLQMVPPQPKPADEDKNSAKSAVGFGEL LKNPTHLRLIIAAMLIQGSHAGYYVYSVIYWTNRGIAVETTSLLWGLGVI AEILLFFFSGRLFRNWSVNAIFYLSAAAAALRWGAFSYTDALWQIALLQC LHSLTFAALHYAMVRYIGMQPQNAMVRLQSLYSGLASCASVALLTALAGI IYPISSHWVFLVMMICALIALFVIPRKPTNA >MS1178 proP, ProP protein MPNKAETSPAKLRLKAFLKRIKIMNTTENSKQKPVNVVAFAFLLTAFLTG IASSFQTPTLSLFLAQEIQVSPFMVGMFYTSNAVLGIVLSQILAKYSDSQ DDRRKIIIFCSLLAIGGCITFAYNRNYYVLMFFATFLLSLGSSANPQAFA LAREYADYTKREAIMFTTIMRTQISLAWIVGPPLSFSIALGWGFEYMYMV AASAFLLCAIIAKALLPYVPRKAVVPLTKPDEVAGLPAKNKKQSDKQSIR LLFITCFLMWSCNGMYLISMPLHVINELHLSERLAGILMGTAAGLEIPVM LIAGYLTKYLTKKSLILTALFMGLFFYIGMLFAEQTWQLVALQAFNAIFI GIIATLGMVYFQDLMPGKMGSATTLFSNAAKSSWIVAGPFVGIIAQIWNY SSVFYISIVLVAVSLFSMSKVKSV >MS1407 proP, ProP protein MMTSSRPNLTLLLILGALMACTSLSTDIYLPAMPTMAKELQGNTELTITG FLIGFAIAQLIWGPISDRIGRKIPLFIGMALFAVGSVGCALSQSMAEIVF WRVFQAVGACVGPMLSRAMIRDLYDRSQAAQMLSTLTIIMAAAPIIGPLL GGLLLKISSWQAIFWLLVVIGILLFLSIIKLPETLPPAKRAAGSFWSAFG NYRILLKNRAFMRYTLCVTFFYVAAYAFITGSPFVYIDYFKVDPQYYGFL FGVNIVGVALLSAVNRRLVRHYPLESLLRVSTMIALCAVLILVVLVFMDL DGIAGILSVAVPIFIMFSMNGIIAACTNAMALDSVQPEIAGSAAALLGSL QYGSGILSSLLLAYFSDGTPHTMAWIIALFVGLCAVIGWGQRPRSA >MS0998 pta, Pta protein MSRTIILIPISAGVGLTSVSLGLIRALEQKGTKIGFMKPISQPRSGEDML DRTTSIVRTSTTIETTEPVMLSEAENLIGQNQTDVLLEKIVAQHQQISKD NDIVIVEGLIPSRKNSYANSVNYDIAQALDAEIILVSAPATETPAQLKER VEAAAASFGGKSNPNLLGVVINKFNAPVDESGRTRPDLTEIFDSFQHSHN NIKEIYKLFENSPIKVLACIPWSADLIATRAIDLVKHLGASILNEGDMNR RIRSITFCARTLPNMIEHFKAGSLLVVSADRPEILTAAALAATTGIELGG ILLTGGYKIDCEIKKLCNPTFENTKLPVFRIEGNTWQTALSLQSFNLEVP VDDKERIENIKQYTSGQFDADFIHSLASASVRARRLSPPAFRYQLTELAR AAKKRIVLPEGDEPRTIKAAVLCAERGIAECVLLAKPEDVKRVADSQGVK LGNGITVIDPASVRENYVARLVELRKAKGMTEMAAREQLEDTVVLGTMML EAGEVDGLVSGAVHTTANTIRPPMQIIKTAPGSSIISSIFFMLLPDQVLV YGDCAVNPDPTAEQLAEIAIQSAESAKSFGIDPRVAMISYSTGTSGSGAD VEKVKEATRIAQEKRPDLLIDGPLQYDAAVMEDVARSKAPNSKVAGKATV FVFPDLNTGNTTYKAVQRSADLVSIGPMLQGMRKPVNDLSRGALVDDIVY TIALTAIQATQC >MS0777 putP, PutP protein MFGLDPTLITFTIYILGMLAIGVLAYYYTNNISDYILGGRRLGSFVTAMS AGASDMSGWLLMGLPGAVYVSGLIEGWIAIGLTIGAYLNWLFVAGRLRVH TEFNNNALTLPEYFHSRFGTSHNLLKIISASIILVFFTIYCSSGVVAGAK LFQNLFGIPYATALWYGALATIAYTFIGGFLAVSWTDTIQATLMLFALIL TPVVIVVSLGGIDGFSASMQSAEIDMQKDFTDLFTGTSTLGLFSLAAWGL GYFGQPHILARFMAAYSAKSLHKARRISITWMIICLIGAISIGFFGIAFF HANPQIAEVVTKEPEQVFIELAKLLFNPWVAGILLSAILAAVMSTLSCQL LLASSAITEDFYKGFIRPKAGEKELVWLGRIMVLIIAALAIWIAQDENNK VLKLVEFAWAGFGSSFGPVVLLSLFWKRMTSSGAIAGMLTGAIVVFSWKS VIPATSEWSGVYEMIPAFSLASLMIILVSLLSPAPNKEIVETFEKANLAY KNAE >MS1741 putP, PutP protein MNVDYLVMAGYFALIIAISLLFKKMASNSTSDYFRGGGKMLWWMVGGTAF MTQFSAWTFTGAAGKAFNDGLSVIAVFVGNMVAYACAYWYFARRFRQMRV DTPTEAIGRRFGTSNEQFFTWVIIPLSVINAGVWLNGLSVFASAVFDADI TMTIYVTGISVLIISLLSGAWGVVASDFVQMLVVAVISVACAVVGLVVIG GPGEIIDRFPGGFVSGPDMNYPLILICTFLFFIVKQLQSINNMQDSYRFL NAKDSKNASKAAIFALLLMLVGTIIWFIPPWVTAIIYPEAASLYPQLGKK ASDAVYLVFAKNVMPAGTIGLLMAGLFAATMSSMDSALNRNSGVFVRSFY APIIRKGKADDKELLRAGQIVCVINGILVILMAQFFNSLKHLSLFDLMMQ VATLLQSPILVPLFLAIIIRKTPKWAPWATVLFGMFVSWSVVKVFTPEYV ASWFGVEDLTKREISELKVIITIAAHLIFTAGFFCLTTLFYNEAKDTNNE RRIAFFKDVDTECVAEEGQDEIDRLQRKKLSTLVMLMAAGLLLMILIPNP LWGRALFACCSLAIFAVGYGLKRSAEV >MS0578 rarD, RarD protein MFMSISAKLKGWHYAFACYAIWGTFPIYWYPLNSSAMPADQILAQRIVWS VVFAVFLLIIFKQSRAVLRAFTKPKILAIFFLSSFLIALNWLVYLWAITN HHVLDASLGYFINPLFNVFLGRLVFKERLNKPQLLALCFATAGILWLAIP AGQIPWVALLLAGSFGFYALIRKLAPMEALAGLALETLLLSPFALAYLFF CYTQNTLVFSELNSLQLGVLLGSGAATTIPLLWFAMGARQISMSLLGMLQ YISPTLQFLCGSLLFGEALSITRLIGYSLVWIGVAIFLLAMRKKMQNK >MS1495 rfbX, RfbX protein MVRLISTVFVRQILVGILQVITLIVIARGLGTGQMGQYTLAILLPTLFSQ IITFGLQSINIYAIGRKMINENQALYANLIFLSGLSVLTSLILGVVVYYF GQYFFNEVPVNLLYLALASLLPQTFFTVLPSLIQAVQNFKWFNIVCVAQP LVIFVVSMVAILLSDNVSSILTAYVLSHWISFFILLGIILKLIKVETCSL KRFFSDFIGYGLKSHLSNIITLLNYRSSLLILGYFTTPVIVGIYSVGMQL AEKLWLPSQAVSTVLLPRLSNKLGEGGDEKEVAKLTLDSARLTFIVTLII GIAFACLSSIVVRILFGVEYDKAVYVILLLLPGILAWTPSRILANDLAAR GFAELNLKNSYWVFGINTALSLCLVPLWGLIGASVATSIAYSMDLVLRLI AFNQVTQSRAFLHIIPRISDFGTVINFIKGLRNAR >MS0657 rfbX, RfbX protein MVKSTKRVFNFIMNKINTEHKKRLFSNFFSLTVLQIVNYALPLLTLPYLV RVLDVETYGLVMFAQSFILFFNILVDFGFNLSATKEVSIHRDDKNKLIEI YSSVMVIKFLLILSSFIILSIIIFSFERFSLNKGVYFLSFLWVIGQALFP VWYFQGIEKMKYITIVNIIAKFLFTGCIFLFVKENADYLLIPLFNGLGIL IAALVALWIVHVSLKQKVTWQPLSKLWIYFKESSTFFLSRASLTMYTSAN AFVLGIFSNNTIVGYYSIADQLYKALQAFYTPLSQVLYPYIAKERNIVLF KKIFNMAVFLNCMGIAILYFITVDVFALLFTQKIGIESINVFNIFLIASL IVVPSILLGYPFLGALGFAKEANLSVIYASIIHILGLVILILFNKISLYS VAYMVLVTELFVFMYRISKIRGRRLWRKQL >MS1825 rhaT, RhaT protein MLMPHFTQSKGYGYFCLILATFFWGGNYMFGRILSHVIPPIILNYLRWLP AAIILLLLFAKYLPQQRHIIRKNWQILTALALLGVLIFPVFLYQGLQTTT ALNASIYLAVVPIVVMFLNRICFKDTIRFPVFIGALISFIGVLWLLSHGE LSRLLTFNVNRGDLWAIGSAVSWSVYCSIIRLRPKEIGNSVMLTAQVGIA MIIFTPVFLSQLNTENLQIISELTYGQWMIILYLIIGPSILSYGFWNYGM TIVGGTKGAAFTNATPLFAAALGILVLGEQLHGYHLISSLLIVIGLTLCN KK >MS1754 rhaT, RhaT protein MFYLIAAVLIWASAFIAAKFSYTMFDPALTVMLRLILSALLVLPTFFRSY RKIPKQYRLQLWGLGLLNFPVVLLLQFTGVHYTSVASAVTMLGTEPLVVT LLGHIFFHKPARLLDWLLGIVALTGIVFVVYGSESGGEVTLLGCTLVLLG SIAFSFSIHLAQSVMKAVEAKAYTDVIIMTGAISCVPFSLLLVQDWQIHL NIEGISAILYLSVGCTWLAYRLWSKGLRVSSANTASILTTLEPVFGVLLA ILLLGEHLTLTTLFGICLVISAAGISVLSSMLINYIKNKVTIL >MS1595 rhaT, RhaT protein MNQQPVLGFIFALITAMAWGSLPIALQHVLTVMGAESIVWYRFFVASLAL FLLLAWKKKLPALSQFTSRYWKLSLIGVLGLAGNFFLFNSSLNYIEPAIT QIFIHLSSFMMLICGVFVFKEKLGAHQKAGLLILILGLGLFFNDKFDMLF GLNMYSTGILLSVSAAVVWVAYGMAQKLMSRQFTAQQILLIMYTGCVIVL CPFAQFSQIQGLSGFALGCFIYCCLNTLIGYGAYAEALNRWDVSKVSVVV TLVPLFTILFSRILHGLDPAHFAMPHLNTVSYIGAFVVVLGAIISAVGYK LFKYKR >MS1753 rhaT, RhaT protein MLFQIIATLIWASAFIAAKYTYEMMDPVLMVQCRFFIASIIMLPGFFAAY KRVPKERLKIMWLLALINFPLMFLLQFIGLYFTSAASAVTMLGMIPLLTV LIGFLFFKRRINKIDLLLSLVALAGIILTVVGGGEDNLINPWGCLLVLGS AVSFCFCLYLSKDVMQEMAPKDYTNVLVILGSILCLPFTCVLVRDWSIVP SVKGMISLFYLGIGCTWLAVVLWFKGVQKTPTYISSILTTLEPIFGVILA ILILDERLSTVSAMGILLTLGAAAVSVLIPVLMKKSP >MS1597 rhaT, RhaT protein MKQQPLLGFLFGLIAACMWSSLPLFVQQVVKVMDIQTSVWYRFVLSAVGV LLLLCFSGKFFTFKRISPKNTLLLLLAIAGLSVNFYLYNLALKYIPPTTS QVLSPLSSFMMLFAGVLIFKEKMARHQKIGLAVLSLGLILFFNERLDDFL QLNTYFKGVVMVIASSFVWVIYAIVQKVLLSHLSSQQILLMIYIGCTLVF FPNADIKQIYQLDGFQLVCLVFSGVNTIIAYGCYAEALDRWEVSKVSAIL TQIPIFTLLFFHLAVMIAPNYFVAVELNWISYLGAFCVVSGAMLSALGHK LKMLKERD >MS0885 rhaT, RhaT protein MLRPSCREKIIMVNNYNLALIKVHFTAVLFGLTGVLGVIISADSDVIVLG RVIIAFLALSVYFLIKREKLTALSTKDVANQSLSGALLTAHWVTFYVAVK VGGVAVATLGFAGFPGFVALFERLFFQEKLKRRELILLIAVTIGLILVTP QFEFGNQSTQGLLWGIFSGAIYGILAILNRKNINKLSGTQASWWQYLIGS ILLFPFAAHKLPAVSVTDWFWIACLGLLCTSLAYTLFVSSLNIINARTAA MIISLEPVYAILIAWIWLGEQPGLRMIIGGLIILLSVGVVNFRR >MS0535 rhaT, RhaT protein MLQKYRGEIILFIVSLIAASGWFFSKFSMAEFPALGFIGLRFFLAAIFFF PLAYPQLKRLDKPQLIKSALVGLCYAVYIMLWMLGLINSAHFGEGAFLVS LSMLIAPLLSWLIFGHLPYKSFWLALPAAFTGLYLLSSGKGGLHFSFGSL IFLISSLVAALYFVLNNQYARDIPVLSFTTIQLFIVGTCCGTLSILFEQW PTSISMTAWGWFLCSLVIATNLRMLLQTYGQKYCHVATAAIIMILEPVWT LFFSILILGERLTLHKAFGCLSILAAIMIYRLPAILRNQASANKE >MS0300 rimI, RimI protein MQFKIKPMLPEHYQQVYRLWTSIEGMDMSDADDNFEAISAFLAFNPDLNY IAEINGKVVGVIMCGFDGRRATLYHAAVDPDYQKQGIGFALAEHLESALK TKGISKGRLLAFKSNESATLFWQKAGWTLQQKLNYFSKKFI >MS1590 rimI, RimI protein MTEISPIQAEDFDRLFEIEQAAHLVPWSMGTLQNNQGERYLNLKSSVQNH IAGFAICQTVLDEATLFNIAIDPVCQGQGIGKALLSELIKRLREKNVATL WLEVRESNQTAKRLYDRLGFNEVDIRKNYYPTPDGGRENAIVMALYL >MS0145 rssA, RssA protein MKVGLVLEGGAMRGMFTAGVLDIFLDENIHIDGAVTVSAGALFGINLPSK QRGRVLRYNKKYLNDKRYMGLHSLLTTGNIVNRDFAFYELPYTLDPFDQQ TFAQSDMDFWVTLTNVETGEAEYFKIQDAFEQMEVLRATSAMPFVSKMVE INGKKYLDGGIADSIPLQKCFDLGYDKVIVVLTRPLEYRKTPSSKTLFKL FYPNYPQLAARWAQRYADYNQTVERIIKLNDEQKIFVIRPSESLNISRLE KDPEMIQRMYELGLKDGKAAIAGLREYLAK >MS1964 sPS1, SPS1 protein MRKDMLQVQHENHFFLFNFDENRPNQEHFFESYFWQKQNRIIGSAKGRGT TWFIQSQDLFGVNTALRHYYRGGLWGKINKDRYAFSSLEETRSFAEFNLL NRLYQAGLPVPKPIGAHVEKLAFNHYRADLLSERIENTQDLTALLPNTEL TAEQWQQIGKLIRRLHDLQICHTDLNAHNILIRQQNNDTKFWLIDFDKCG EKPGNLWKQENLQRLHRSFLKEVKRMRIQFSEKNWADLLNGYQN >MS0290 sbmA, SbmA protein MLKNIRLNFNLSIGSGMNYSQELLTSLLWIFKAIGITAVLFSLTVYVLVK TTRWGRQFWMLAAGYISPKRSKKPIGYFVIIVFFNLLSVRLDILFSEWYK AMYNALQESHEKMFWIQMVVFSVLATIHIANVLLTYYLTQRFTIQWRTWL NNEMVNRWTENQAYYKAQYVYNKLDNPDQRIQQDVLSFVSNSIEFATGVI SSVVSIVAFTVILWGLAGPMTVVGITIPHAMVYLVFIYVLITSIFAFRIG RPLINLNFTNERLNANYRYSLIRLKEYAESIAFFRGEKMEKNVLFKQFNQ VIGNVWKMVHMTLKLSGFNLAVSQVSVIFPFIIQASRYFSKQIQLGDLIQ TAQSFGRVQTALSFFRNSYDSFTGYRAVLDRLTGFYSAVNQANSASHISI EDSESAVVFDKLTVKKPTGEALIKDLSLNLPQGASLLIKGPSGAGKTTLL RTIAGLWSYSEGIVRCPQHHALFLSQKPYLPQGRLIDALFYPELAPENLD LAQAAEIMRKVQLGHLTDRLEQENDWTRVLSLGEQQRLSFARVLICRPLV AFLDEATASMDEGLEESMYRLLKTELPDTTIISVGHRSTLQIHHTQHLVI NPQDQSWALS >MS1316 sbmA, SbmA protein MNWQTELNNSFSWLITTLIWVSLAFTFFALLLRKTDFGEKFWLVTKPCIE QSNKFKTIGLILFLFLLILLEVRISVLNSFFYNGLYSALQDKKADAFWFF ATINAMLVGFKIIHSIINYLIRQIFEIRWLEKFNDDMLSRWLDHKNYYRL KYEKDLPDNIDQRIEQDAREFITGTVDLVDGILGAIVSIIEFTIILWGLS GLLVLFDISIPKGVVFFIYTFIIIATALSVWIGYPLIKLNFNKEKLNGDY RYSLIRIRDNAESIAFYDGEQKERQYLNERFKAIIKNRWAIVRQMLGLDG FNTGVTQIAMILPLMLQAPRFFAGQATLGDMHQTVQAFNRLMRALSFFRL FYEQFTLYQARLNRLYGFIGKLNELDTHLIPNPIECSQLVALENFGLKDA KGNVLFEGINLELSAGDALLIQGASGTGKTTLLKAIAGIYPFETVGRSKR PCNGKILFLPQRPYMPQGSLREAICYPNIDPHHPELESYMLKCHLDKYIF ALDQENDWQAILSPGELQRVAFIRIFLTKPDVVFLDETTSALDEPTEHSL YSKIRQALPGMIILSVGHRCTLQQFHTKHLVIGLDKSSRTI >MS1230 sfsA, SfsA protein MRLPPLQAAKFIRRYKRFMADVELANGNILTIHCANTGAMTGCAEKGDTV WYSDSKSTTRKYPCSWELTELSNGNLVCINTHRSNQLVQEALQNKVIKEL AGYSEIYPEVKYGEENSRIDFLLKGEGLPDCYVEVKSITLVKNNIGMFPD AVTTRGQKHVRELLAMKKQGYRAVVLFAGLHNGFDCFKTAEYIDPDYDKL LRQAMKEGVEVYAYAGKFDKIQEIPTALSLAEVVPLCFN >MS0467 smtA, SmtA protein MSLNLNQVSLLQNVTRYWNNRAEGYSRHNQQELQSIKRLKWQQLLLAHAP KKQNLKVLDIGTGPGFFAIIMAQAGAQVTAIDATSNMLEQAKYNAAQAMV DIRFVRGDVHHLPFADESFDLIISRNVTWNLSEPEQAYKEWHRVLKCGGN LLNFDANWYLFLYDEQRRRAFEQDRASTIRLNIPDHYADTDTSAMEAIAR KLPLSRQLRPHWDMNALLNIGFSQLMADTRIGEFLWDDEEKVNYRSTPMF MIVAQK >MS0945 smtA, SmtA protein MWHAKHATELKLPTSWQQIPNGTLYCNALNRYFSHWLSNILGDQILKLGG LSAEIGLDLPMRHQLVISPEIPQNLTALCLHPCTSVVRSKVTELPLIEES IDACLLANNLNFCADPHRLLREITRVTTESGLLFISLFNPLSILAFKRQF HQTPYEKFPFRQYPTWLIIDWLELLNFDILQCENLALQHRQHFSLFSPLT VIIAQKRTCSLSSQAQKIQFHQEDVFSPEAAFKRINE >MS0706 smtA, SmtA protein MSKDTIFSTPIEKLGDFTFDENVAEVFPDMIQRSVPGYSNIITAIGMLAE RFVTADSNVYDLGCSRGAATLSARRNIKQANVKIIGVDNSQPMAERARQH IHAYHSEIPVEILCDDIRNIAIENASMVILNFTLQFLPPEDRRALLEKIY RGLNQGGLLVLSEKFRFEDETINNLLIDLHHTFKRANGYSELEVSQKRAA LENVMRIDSINTHKVRLKNVGFSHVELWFQCFNFGSMIAIK >MS0203 smtA, SmtA protein MTTFMNNKTKSAGFTFKQFHVSHDKCAMKVGTDGILLGAWASLQGNRYLD LGTGSGLIALMLAQRTQTDCHITGVEIDPSAYRQATENVRQSPWADKIQL EQQNIVDFTRTCTKKFDTVLSNPPYFEQGVDCRDKQRDTARYTQTLSHSD WLNLAADCLTNTGRIHLILPYAAGKNLQKQTALFCARCCEVITKSGKIPQ RLLLTFSKQPCTTEQSRLVVYNEQNQYTEQFIALTRDFYLNF >MS1894 smtA, SmtA protein MKESVYDSEGFFELYQKLRANPGSLNEIVEKPTMLSLLPDITGKTLLDMG CGTGGHLQMYLRLGAKRVVGIDLSASMLKQAEIDLGKLCENRLQFSSGSF SLHHLPMEQLDQLPEAQFDVITSSFAFHYVENFPALLTKIANKLTARGSL VFSQEHPVVTAYQGGERWEKDENKQQIAYRLNFYRDEGKRERSWFKQPFL TYHRTISTIVNNLIQVGFTIEKMAEPMLADQAEWQTEFKDLQHRPVLLFI RAKKS >MS2368 smtA, SmtA protein MNIQLICETENSQNFTALCKEKGLTHDPASVLALVQTETDGEVRLELRKL DEPKLGAVYVDFVAGTMAHRRKFGGGRGEAIAKAVGVKGNELPSVIDATA GLGRDAFVLASIGCRVRLVERHPVVYLLLQDGLRRAYADPEIGEMMQKNM QLLPVHHITELNPFEDFADVVYLDPMYPHKQKSALVKKEMRVFQYLVGAD SDSNLLLEPALKLAKKRVVVKRPDYAEFLAEKAPQFSRETKNHRFDIYSV NV >MS1338 smtA, SmtA protein MKSELICYKKMPVWNKNSLPKMFQEKHNTKAGTWGKLTVLQGKLKFYTLN EDGSIVNEHIFSANTDTPFVEPQQWHKVEALSDDLECYLEFYCTKEDYFG KKYNMTATHSDVLKTAKIITPCKVLDLGCGHGRNSLYLALKGYDVTSWDH NAASIAFLADSAAKENLQIQTAVYDINNANIQENYDLILSTVVFMFLDRE AVPAIIDNMQKHTNAGGYNLIVAAMSTEDMPCPIPFAFTFGENELKNYYQ GWEFVEYNENIGELHKTDKNGNRYKMKFVTMLAKKVK >MS0776 smtA, SmtA protein MIDFRPFYQQIAVSELSSWLETLPSQLARWQKQTHGEYAKWAKIVDFLPH LKTARIDLKTAVKSEPVSPLSQGEQQRIIYHLKQLMPWRKGPYHLHGIHV DCEWRSDFKWDRVLPHLAPLQDRLILDVGCGSGYHMWRMVGEGAKMVVGI DPTELFLCQFEAVRKLLNNDRRANLIPLGIEEMQPLGVFDTVFSMGVLYH RKSPLDHLSQLKNQLRKGGELVLETLVTDGDEHHVLVPAERYAKMKNVYF IPSVPCLINWLEKSGFSNVRCVDVEVTSLEEQRKTEWLENESLIDFLDPN DHSKTIEGYPAPKRAVILANK >MS1908 ssnA, SsnA protein MKNHVRSFKTYIRDEIIKKGGWVNAHAHADRAFTMTPEKIHIYHNSNLQQ KWDLVDEVKRTSSVEYYYARFCQSIELMISQGVTAFGTFVDIDPICEDRA IIAAHKARDVYKNDIILKFANQTLKGVIEPTARKWFDIGSEMVDMIGGLP YRDELDYGRGLEAMDILLDKAKSLGIMCHVHVDQFNTPKEKETEQLCDKT IEHGMQGRVVAIHGISIGAHSREYRYELYKKMREAQMMIIACPMAWIDSN RKEELMPFHNALTPADEMIPEGITVALGTDNICDYMVPLCEGDMWQELSL LAAGCRFPNLDEMVNIASINGRKVLGLDR >MS1280 sspB, SspB protein MKNKMEYKSSPKRPYLLRAYYDWLVDNEFTPYLVVDATYYGVDVPQEYVR DGQIVLNLSSGAVANLQLTNDAVMFNARFQGVPREIYIPLGAALAIYARE NGDRSDVRT >MS2272 surE, SurE protein MLFFMGFMQNIPHKILIIIKEKNRKIMNILLSNDDGYHAEGIQILARELR KFADVTIVAPDRNRSAASGSLTLVEPLRPRHLDDGDYCVNGTPADCVHLA LNGFLSGRMDLVVSGINAGVNLGDDVIYSGTVAAALEGRHLGLPSIAVSL DGRRYYETAARVVCDLIPKLHTRLLNPREIININVPDIPYDQIKGIKVCR LGHRAASAEVIKQQDPRGESIYWIGPAALPEDDEEGTDFHAVNNGYVAIT PIQVDMTSYNSMSALQDWLESE >MS0956 tdh, Tdh protein MMEIKTLSCVVRGPKDVGVMEQSINYDESSKEQTLVKITRGGICGSDLHY YQYGKVGNYEIKHPMILGHEVIGTVVKTNAPDLYVGQKVAINPSKPCLTC KYCLSGDTNQCETMRFFGSAMYNPHVDGGFTQYKVVDNSQCIDYPQDVSD DIMAFAEPLAVTIHAAKQAGDLAGKRVFVSGVGPIGCLAVAAIKASGAKE IVVSDLSRRCLDLALEMGATKALNAKDDFSEYMAHKGEFDVSFEASGHPS SIERCLAVTKARGTIIQIGMGGAIPEFPIMTLIAKEICLKGSFRFIEEFN TSVEWLSSGKVNPLPLLSATFPYTELEKALIIAGDKDNISKVQLSFE >MS0525 tdh, Tdh protein MMRSLVCKEPFHLILEERAKPQPKDEEVQLKVAAIGICGTDIHAYAGNQP FFEYPRVLGHEASGVITELGKNVDKFKVGQRVALIPYVSCGKCGACLSGK TNCCENISVIGVHQDGAFSEYLTAPAKNILPIADSVDFTTAALIEPFAIS AHAVRRAQITKGDDVLIVGAGPIGLGAAAIAHADGANVVIADTSEERRKH IQANIPVPTVNPINEKVEDYFNGRLPQIVIDATGNQKAMNNAVNLIRHGG RIVFVGLHKGTIEFSDPDFHKKETTLMGSRNATLEDFEKVQHLMSERKIS ANMMLTHTFKYDELAEIYEEKITKNQSLIKSVVLY >MS0480 thdF, ThdF protein MMTKETIVAQATPIGRGGVGILRVSGPLATEVAKAVVDKELKPRMANYLP FKDEDGTILDQGIALYFKSPNSFTGEDVVEFQGHGGQVVLDLLLKRILQV KGVRLARPGEFSEQAFLNDKLDLAQAEAIADLINASSEQAARSALKSLQG EFSKKINQLVDSVIYLRTYVEAAIDFPDEEIDFLADGKIEGHLNDLIGQL DKVRSEAKQGSILREGMKVVIAGRPNAGKSSLLNALAGREAAIVTDIAGT TRDVLREHIHIDGMPLHIIDTAGLRDATDEVERIGITRAWNEIEQADRVI LMLDSTDPDSKDLDQAKAEFLSKLPGNIPVTIVRNKSDLSGEKESIEEQE GFTVIRLSAQTQQGVSLLREHLKQSMGYQTGTEGGFLARRRHLEALEHAA EHLQIGRVQLTQFHAGELLAEELRIVQDYLGEITGKFTSDDLLGNIFSSF CIGK >MS1423 thiJ, ThiJ protein MTTSIKPVLCVVTSAPIKGKSGIPTGFYLAELTHALDEIEKAGLKTVIAS VRGGQPPIDGFDLTDPVNAKYWNEGDLYERLANTPALSELNGADYSAVFF AGGHGTMWDFAQSAEVHRIVSEVYTSGGVVSAVCHGPAALVGAKLPNGEF VVNGKNIAAFTNAEEVEVEGDKLVPYMLQTELEKQGAIHHAAPNWAENVI VDGQLVTGQNPASAKGVGAALAKVLLEK >MS0780 tldD, TldD protein MLNKVVESLLTPSNLSVKDLPNIFDQLAHRHLDYSDLYFQLSQDESWVLE DGIIKEGGFHIDRGVGVRAISGEKTGFAYSDQINLTSLQQCANAVKGIAP AEQGRIITPTGFNRVNPILRYAAVNPLDTLTKEQKIELLYLVDKTARGMS PYVSRVSASLSSIYEEVLVAATDGTLAADIRPLVRLSVSVLVEKEGKRER GSAGAGGRFGLNWFLESFEGEVRAVSFTKEAVRQALVNLEAIPAPAGLMP VVLGAGWPGVLLHEAVGHGLEGDFNRKESSLFSGKIGELVTSPLCTIVDD GTLENRRGSLTIDDEGTPSQRNVLIENGILKGYMQDKMNARLMGVAPTGN GRRESYANLPMPRMTNTYMLSGDSKFEDLIGSIDRGIFASHFGGGQVDIT SGKFTFSTTEAYLIEKGKITRPVKGATLIGSGIEVMQQVSMVADNMEIDH GIGVCGKEGQSVPVGVGQPALKIERITVGGTN >MS0682 tldD, TldD protein MEISQNQTALLKQQEQALRDAVSYAVEIAQKAGASAEVAVTKVNGLSVST RLKEVENVEFNNDGALGISVYLGQQKGNASTSDLSKDAIKNAVEAALAIA KYTSPDECAGLADKELMAFEAPSLALYNPAEVDVDQAIELALQAETAALN YDKRIVNSNGASFNSHNGVRVYGNSYGMLQSYLSSRYSISCSVLSGIDDE LENDYEYTVSRDLNALESPVWVGENAAKKAVARLQPRKITTQEAPVIFLN DVATGLIGSLAGAISGGSLYRKASFLLDHLGRQILPDWFHISERPHLTGR LASTPFDSEGVKTQSREIVEQGILRTYLLTSYSGRKLGMQSTGHAGGIHN WLVRPNANGDLDSLLRQMGRGLLVTDLMGQGVNMVTGDYSRGAAGFWVEN GEIQYPVAEITIAGRLKDMLRDIVAVGDDIEQRSNIQTGSILLESMKISG N >MS2335 torD, TorD protein MVKNTALLSLKQQKSAMNFKEILMDNALLQWISTGGRLLGAVFYYEPKDK RVQPVLDFFRQPDWTKDWATLANPALINALIEKSAQQDLSQAYQYLFIGP NELPAPPWGSVYLDKESVIFGDSLLALRDFLTVHQIEFIQTQNEPEDHLG LMLMLAAYLAENKPELLEEFLTKHLFSWVYRCLDLIFAQTDYPFYQAMAL LARQTLKGWQQQLDLQVDQPQLYR >MS0837 torD, TorD protein MSETIINNFSLISRLFGNLFYRSPTDSILDGVFGWLQQKGLEQVWPLDTD EDVRQALDSVQMTIAKEVLAQEYERLFAGEQPKIDSRISAYGLNVDEFIN FRQTRRMPEVESADNFSLLLLTASWIEDNLDSISAQQELFESFLLPCASK FLTHVETYALLPFYRSLALLTREILAAMADELEENE >MS0137 uup, Uup protein MIFFSNLTLKRGLNLLLEEANATINPKQKVGLVGKNGCGKSSLFSLLKKE NQPEGGEINYPADWAVSWVNQETPALNISALDYVIEGDRTYCRLQKELKL ANEHNDGNAIARIHGQLDIIDAWTVQSRASALLHGLGFSQEELGRPVKSF SGGWRMRLNLAQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLV LISHDRDFLDPIVNKIIHIEDKKLNEYTGDYSSFELQRAEKLAQQNALFR QQQDKIAHLQKYIDRFKAKATKAKQAQSRMKALERMERIAPAHVDNPFTF EFREPLSLPNPLVMIDKASAGYGEGESAVEILQKIKLNLVPGSRIGLLGK NGAGKSTLIKLLAGELTARSGVLQLAKGVQLGYFAQHQLDTLRADESALW HLQKLAPQQTEQELRNYLGGFAFHGDKVKDPVKQFSGGEKARLVLALIVW QRPNLLLLDEPTNHLDLDMRQALTEALVDYQGSLVVVSHDRHLLRNTVEE FYLVHDKQVEEFNGDLEDYAKWLNDLNVQEKSAVKNTEVSKESNNENSGQ NRKEQKRREAELRQQTAPIRKQIAKFETEMDKLTAQLTEIEVRLADSGLY QTENKEKLTALLTQQVQTRKALEEAEAHWLTAQEELETLLAE >MS1240 uup, Uup protein MGFYMSSQFVFTMHRVGKVVPPKRHILKDISLSFFPGAKIGVLGLNGAGK STLLRIMAGVDKEFEGEARPQPGIKIGYLPQEPKLDPQQTVREAIEEAVS EVKSALTRLDEVYALYADPDADFDKLAAEQAKLEAVIQAHDGHNLDNQLE RAADALRLPEWEAKIENLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDA ESVAWLERFLHDYEGTVVAITHDRYFLDNVAGWILELDRGEGIPWEGNYS SWLEQKEKRLAQEQAQESARQKSIEKELEWVRQNPKGRQAKSKARMARFE ELNSGEYQKRNETNELFIPPGPRLGDKVLEVEHLTKSYGERTLIDDLSFS IPKGAIVGIIGPNGAGKSTLFRMLSGKEQPDSGSITLGETVVLASVDQFR DAMDDKKTVWEEVSNGQDILTIGNFEIPSRAYVGRFNFKGVDQQKRVGEL SGGERGRLHLAKLLQRGGNVLLLDEPTNDLDVETLRALENAILEFPGCAM VISHDRWFLDRIATHILDYGDEGKVTFYEGNFSDYEEWKKKTFGAESTQP HRMKYKRIAK >MS0840 uup, Uup protein MALISLTNGYLSFSDAPLLDHADLHIEPRERVCLVGRNGAGKSTLLKIIA GDVVMDDGKIQYERDLIVSRLEQDPPSHAQGNVFDYVAEGIGHLADLLKE YHHISTLLESDYNDNLLSKLAQVQSRLEHENGWQFENKINEVLGKLELNP NTLLSELSGGWLRKAALARALVCNPDVLLLDEPTNHLDVDAIEWLETFLL DFAGSIVFISHDRSFIRKMATRIVDLDRGKLVSYPGDYDLYLTTKEENLR VEALQNELFDKRLAQEEVWIRQGIKARRTRNEGRVRALKMLREERRQRRE VLGSAKLQLDTSSRSGKIVFEVEDASYAIAGKQLLSHFSTTILRGDKIAL VGPNGCGKTTFIKLLLGELQPTSGHIRCGTKLDIAYFDQYRADLDPEKTV MDNVADGKQDIEVNGVKRHVLGYLQDFLFPPKRAMTPVKALSGGERNRLL LAKLLLKPNNLLILDEPTNDLDIETLELLEDILADYQGTLLIVSHDRQFI DNVATECYMFEGNGQLSKYVGGFFDAKQQQENALTSKMASEQAKPKKMQP ESAVEKSEISTANNNQKTIKLSYKEQRELERLPQLLEELEKMIENLQNEV GNPDFFQQSHEYTSAKLQELADKEAELENAFIRWEELEEKKKGNLS >MS1094 vapI, VapI protein MLSPRKISEIATGKRPITADVAVRLALFFGTDAESWLNLQSHYDIKKSEE EIKTDIESILDSSIDGYLNI >MS1499 wbbJ, WbbJ protein MTYYQHPSAIIDEGAEIGEGSRVWHFAHICGGAKIGKGVSLGQNVFVGNK VRIGDHCKVQNNVSVYDNVYLEEGVFCGPSMVFTNVYNPRSLIERKSEYK DTLVKKGATLGANSTIVCGVTVGAYAFVGAGAVINRDVPDYALMVGVPAK QIGWMSEYGEQLELPLSGQAETKCPHTGAIYRLEGHELKKL >MS0417 wbbJ, WbbJ protein MATEKEKMLAGLAHLPMEEHLSALRLQTKELLFDFNMLRPSNKLEKTHLL RKILGKAGKNIHVNSPFHCDYGCNIEVGDNFFANYHCVILDNGGVKIGND VMFAPNVSLYTVGHPLDAELRNQGWEQAKPIIIGNNVWIGGNVVILPGVV IGDNVVIGAGSVVTKDIPANSLALGNPCKVLRQITAADREYYQQTFMQNN >MS2128 wbbJ, WbbJ protein MVGRNAHPTSKGNMMDYTLNLPLNQLIAQNSELFSKIHQVVDKNAPLVAE LNSGFRTQNEIRAILNEMTGTEIDASFHVNLPLYTDFSAHIRIGKRVFIN TAVMLTDLGGITLEDDVLIGPRVNIITVDHPIDPAQRRGVIVKPVVIKKN AWIGAGATILAGVTVGENAIVAAGAVVNKDVPANTIVGGIPAKLIKEI >MS0323 wecD, WecD protein MKIFKAEQWNLEVLLPLFEEYRLSHGMVENPERTFTFLNNRIRFSESIIF IATNERQQAIGFIQLYPRLSSLQLQRYWQLTDIFVQDVANQNEIYAGLIE KAKEFVCFTHSTRLVVEQDQQHQGIWEKEGFKLNTKKALFELKL >MS2102 wecD, WecD protein MMQTIDQFIAQYIPAAYALNLRVVESSPQRVVIKAPFECNSNHHHTIFGG SQALLATLSAWSLVYLNFPEANGNIVIRSSQIRYLKPAPSDIIAVSICPD SLAMNLAKQMLTQKGKAKITIQCQLYCDDIIVSEWTGEFVLSHTPF