TitleGenColors Logo

Gene list

Applied filters:

COG category: General function prediction only
Gene type: CDS
Genomic element: chromosome

Number of genes found: 272

Free access
Sort by:

 



# Mannheimia succiniciproducens MBEL55E, MBEL55E

>MS1428 unknown
MNMNKKIVMILKILLAVIVLLTGAVWAFMTYHPVWGGTPDEGSMARIRAS
KAYNATLGKFENQEPTQLLTTDEKPSITTWITRLMAADEGKNPSEPLPSA
AFDKNVLKDGEMVWFGHSTVLFKLGGLNVITDPVFHNASPIPYIGISPFK
TEHSYSVESLPELDIVLLSHDHYDHLDYRAIQELDSKTKHFIVPLGVKAH
LQRWGVADDKITEMDWDEQTKIGTLAITLVPARHFSGRTLNIKDPTLWGG
YIIQSPELKYYYSADSGYGKHYRETIAKHAPFDFVMIENGAYDKKWALIH
ETPEEALQALKDIGATKVLPIHWGKFDMANHVWTDPINRLMKDVASQPEI
SVATPKIGQIFHTQGDLPAEQWWQGVR
>MS0521 unknown
MRALKKISQLLAKNTALVIILTALFTFIVPEAFTWVKGDAQVLVLGIIML
SMGMTLGAKDYQILAKRPLDILIGTVAQYTIMPFVAISIAQAFNLSPGLT
LGLVLVGTCPGGVASNIMSFLCKGDVAFSVGMTTVSTIIAPVMTPLLLNY
LVGETIDMDGWGMFKFMLLVTILPVGLGSLFNMGCHKQKWFNDVRSVMPG
VAVIAFACIVGGVVAFQGERFLESGLIMLMAIGCHNITGYILGFAAGRVF
GMNTAKKRTLSIEVGVQNAGLATGLSAKFFPTNAESVVACAVACVWHSVS
GSVLANIYQWWDKKHGEPVTEIHEIKKPVTESV
>MS0214 unknown
MLKINYERRLVMKEVSSKSLNDDELALLDNLLLEYANEESDEGIFTLSEL
DGYLTAIISSPMLIQPSTWIPAIWDNDLPEWENEQEMAMFFDLLFRHYNS
IIMMLQTGLEYYSPCFEYSNFTDGDYPIVDDWCFGYMRGVKLADWQNLPT
KLQPYLKLIEDQTHLHSSLDDYVSPSLQEQNELADRLIEAAVKIYRYFR
>MS1705 unknown
MEIANNLKQIHKNIVSICQNAGLPSNSVKLLAVSKTKPVEDLEQAYQAGQ
RAFGENYVQEGVEKIEFFQAKHPDMEWHFIGPLQSNKTRLVAEYFDWMQT
VDREKIAIRLNEQRPANKSPLNVLIQINISDEESKSGIKPADMMALAEII
ENLPHLRLRGLMAIPAATHDVAIQAQSFSAMHKLFVELQQSLPNQRIDTL
SMGMTDDMTAAIKCGSTMVRIGTAIFGSRN
>MS0034 unknown
MLNYKPLTEKSGFFCILRTLMTQYIIAQTNKGVQLGITAKMANRHGLIAG
ATGTGKTVTLRKLAEAFSDDGVPVFLVDVKGDLSGLTVKGTLQGKIAERV
EQFNLGGENYLSGYPVSFWDVFGETGIPLRTTISEMGPMLLSRLLNLNAT
QEGLLNLVFRVADDKGLLLIDLKDLRAMLKFVAENAKEFQVEYGNVSAAS
VGAVQRALLTLENEGATNLFGEPALNLEDWLQTRDGRGVINILNSEKLIN
SPRMYSAFLLWLMAELFERLPEVGDPEKPKFVMFFDEAHLLFDGVPSALV
DKVEQVVRLIRSKGVGIYFVTQNPLDLPDTVLGQLGNRVQHALRAFTPRD
QKAVKSAAETFRANPQVDVVETISTLGVGQALVSFLDEKGMPTPVEIVGI
FPPKSQLTPLTNEQRTDWVKDDELYPHYRDLVDNESAYEILNDQSVQAQV
QQQVQDEENSDFFSGMISSIFGTKKKSRQTVAEQMVSSVAHQVGRNLRNQ
VTKQILRGILGAITKK
>MS0102 unknown
MSKQQTVELDLNPIMQALSRTPMVLLGYQKRWCEDTNPVKVVEKSRRIGL
TWGEAADCALLAASNSGMDVWYVGYNKDMALEFIRDCANWAKFYGLAAGE
IEETEEVFKEGDEKESILAFTIRFASGWRITALSSSPSNLRGKQGLVIID
EAAFHPCLSELLKAAMALLMWGGRVHIISTHDGVDNPFNELIQEIREGKK
PYSLHTITFEDAMKDGLYERICLRTNRAYSKEGEQQWEAEIRASYGEDAA
EELDCIPKNSGGKWLSRALIESQMHSHTPLVRKEMARDFELIDEPVRAKE
IAQWLQEEIQPLLDDLDKNRPHFLGEDFARKGDLTSLVIAAQQPNLTNEI
QFIVELGNMPYAQQEQIVLYILKALPLFSGAAFDGGGNGGSLAEKARDAF
GESLIHIIQLSEKWYKENTAPFKAALEDGTLTKLPKNADVLADLRAFEIV
RGVPRIPDKRAKSVDGGKNKRHGDTAIALLLLHFATRQDVRLPVVAVTRR
ARRSQTISEGY
>MS1050 unknown
MTIKAIIFDMDGVLIDSEPVWKQAGIDIFNAEGIPVTYDDMLALTGIPSL
GIVKAVYEKYQRSPVPVAEMAQRLNDHAISLILAQKPLIDGVQETLQKLT
ALGYKLAVASASPRILLEEITQSCGIDQYFSYLSSATELSHNKPHPAVWL
HAAEMLGVEATECIGIEDSVVGMVSVKAASMKCIVVPGVLGSDDPRWALA
DIKLATLREIDETVIGKLDSI
>MS1709 unknown
MQAEKHLKWAAEQNDDRIAFRLDGELSRDTLLPLWNEFQKREQRSSFLSE
RQIADKNISWDLSQVSRIDSAGFALLCDLLHYCQAKKNADKTLLLENVPP
QLLTLADLVGLADWIKPYLK
>MS1493 unknown
MNINVISIFKLLLLFALGLVILSPALSTQIGVPRLDSALCFLFFFLAVIT
PFLRDMETDFFKLQFPVYVLFFFGFLSVLNAFSTEKLVDLFFFGIVMFLF
HYSFLTFNRGDGEAGIRHLLLGISLIVLAGFFIEALLGFQLVSGNEELTV
TDKAFKGFFFNTNDQSVIMISLAVAVGFFYIIRENNWKIKLIGYALIFIM
GLAIVISASRSVLLSYLIMLMLILFLNASAYFKAVYLFFACVIALFIFNL
SWLQEVFILLAKIDWLERPIERFSLVIFSMGDDKSVGYRTEIYTTFLDNF
KILWLGYGPRDYIQYFDQIKLSFPLGYTNPHSFFIELYLAFGIFAFLAFI
YFLLNSIIYVMNTRLLAWKERIFILFVFINFCWIVWVPSSILRLPLVWYP
LFLVLVYTVLVKNGTFVSPKLVGRRRSS
>MS2129 unknown
MKFKLKALTATLFLGSSLLGANAMAQLPQNATAIEVPAQSIQLTQEWDKI
FPKSDKVEHRKVTFKNRYGITLVGDLYVPKGATGKLPAIAVSGPFGAVKE
QSSGLYAQHLAERGFVTVAFDGSFTGESSGLPRNTASPEINTDDFVSAVD
FLGSLDNVDREKIGVLGICGWGGFALNSAISDPRIKAVATSTMYDMTQVM
ADGYEIKMEPNPKVPYERTSPMTTEARYKMKQDLANARWEAAANGYSLNG
KAEDHLTPQDKITAETPRFVREYSNFYKTKRGFHPRSVNSTTGWNTAMTP
SFINMPILQRAGELKAPALVVHGEFAHSRYFGEDAYKALGSKNKELYIVP
GANHTDLYDDVNGKIPYDKFEQFFKANLK
>MS2095 unknown
MKDLTAREFGYGHPTPLFMIGTYDEDGRVNFMNSHWGALNHGGYINLNIN
TNKKTHLNIEKMKAFTVTLATEKLMPYADFFGTYSGFQYPDKFEKSGLTA
HKAKYVNAPIIDGSTLVIECELVEILYQEHIHTIIGRVKNVSVDESVLDA
QGKVDASKLGMIFFDSFSRGYFTLGERVGDAWSIGQSILNS
>MS0023 unknown
MIVLDFLLCRFLKDKEQNMSKVSEITRESWILSTFPEWGTWLNEEIELEQ
VPANNFAMWWLGCVGLWVKTPQSANICIDLWCGRGKATKQVKDMVRGHQM
ANMAGVRKLQPNLRNSVGVLDPFAINEVDAIVATHYHNDHIDVNVAAAVV
NNPKLDHVKFIGPQYCVDMWTKWGVPAERCVVVKPGDTVKIKDLELVALD
SFDRTCLVTLPARGAEDNGGELNGICPSDEEMGLKAVNYLIKTPGGNIYH
SGDSHYSIYYAKHGKDYDIDVALGSYGENPLGIQDKMTSIDILRMAECLR
AKVVIPVHHDIWTNFMASTNEILELYRMRKDRLQYQFHPFIWEVGGKYVY
PRDKDLIEYHHPRGFDDCFEQEPNVPFKSIL
>MS0991 unknown
MTNTKKVYYAHSEKDLPHEQWQTFSSHAENVAKLAAQFAEIFDAYQLAYN
TGLLHDLGKYTPAFDKRLHGGPSVDHATAGAKIAIERWGFPLGKILAFCI
ASHHTGLVNGDGEGDNRSTLKQRLSVPFGKGNLPELDPIWQSELPLPEKL
TFPALKPDPYYQPFALAFFIRMLYSCLVDADFLDTEAFYANLKQQDIDRG
NAPSLDQLHQQFNRFISDFRERKKALQPQTEEEQRNAKLNRLRSQILDHA
IAQAQQEPGLFSLTVPTGGGKTFTSMAFALEHAKKYGMLRIIYVIPFTSI
IEQNAQEFRKAFGEFGEAAVLEHHSTFDDEKLLDKDTKDKLKLASENWDM
PIVVTTAVQFFESLFADKSSRCRKLHNIANSVIILDEAQMLPLNLLLPIM
QSIKELARNYHSSIVMCTATQPAIQTQHGFYRGFENVREIAPNPTALFAD
LRRTSVQHIGMQSDKDLIDKLTENQQILIIVNNRRHARSLYEQAKQLDGT
FHLTTLMCAKHRSQMLEQIRQHLQAGRSCRVIATSLIEAGVDVDFPLVMR
AEAGLDSVAQAAGRCNREGKKLAEQSFVWVFQPEQQWKAPTELGLLSAAM
RSTVRCYGDNLLSVEAISHYFSAVYEQKGKDLDNKQILAKCHAAGKTLDF
PFQTIAKEFCMIESHMLPLIIPFDKEAEKRIEELRHAEKVGGLLRKLQPY
TVQIPQKSLEALFKAGRIEAINEQQFGNQFYSLIGLDLYDEVAGLDWGDL
GFITIENSVF
>MS0665 unknown
MNKYLKSDFIFSLFLSIAIMFICLYFEKSFFFVDDAQNEFLPFTRQIGNV
WLNGEIPFILKNTFIGSNTMIDIHRAIFLPQNIFLSILSVKITSLKIISI
IAAFINLFVMSFSALKLSEAFSLTKAAGIVLAFLFCINPIFLYFYLESWW
NAAAGQAWFVASLASVAWLMRAFSIKRLLLNVITVLSIFASGWPHSVLVY
GFLALIFSIFLYLNKRHNDLILFVLISFSIILIAIPLYSEYVISGDLINR
QSFKFNNVGNFLSTTLNQLLLTFNVTYYHFMHRYGGYSITHIPMGYSSIY
ILLLICFGSLKNIARNPNSLFLLVLCTVFFILTQTPTEIGPFRYPFRFTP
YFSEVLTMLSIFSLEKLGIVKTRARVFLVVLLLSISLLLSIFSLEENFGK
YAILQFLFFAVTTWYVVRYNSISLKSGLPYTAFIFLLMLLAKDSVIGYLS
FPDLKNSINMENNYSQGGYILSLTNGKRPKNNLEDLNSTHFMLYGLKSIN
GASPVGNKYISKTISTRSSQAFFNAKETILGLSKTYKDKCYFDLFGIDTV
ILNKKDNSSLISQKLSDCGFSERKVKSHDVIYFLRNDFNAKGSVSYHSDT
LSINQQISLKNNSEFYQLSGLKGDELIFNRVYWYGYRAYINDKEIPLLNY
DGLLRIILDHDYQNGVLRLEYFPKSWKYALLIALSGFLLLLFSVGYMQRM
RKWVSLN
>MS1549 unknown
MILTRYLTKEVFKSQVAILFILLLIFFSQQLVRVLGSAANGNVPADLVLS
LLGLGMPAMAQLMLPLCLFIAILLTFGRLYAESEISVMRACGVGQRILVK
VALGLSVLTAALAAYNVLWVSPWAIQKQGQIVEDARANPNMSALSAGQFM
TSNDSDFVLFIDNIKDNKISNIYLFQTKEKGNSKPSVIVAENGELQSLPN
GDQILSLQNSQRVEGSAALPDFRITNFTEYQAYLGHRNVDSDENETTELP
LAELLALKTPAAKAELNWRISLILAVPLMALLAVPLSKVNPRQGRFAKIL
PALLLYLIYFLLQSSLKSAGGAGKLDAGLLMPLVNLFFLLLGIMLNSWNS
AFMYKIRHLFSKKSAI
>MS1082 unknown
MIAFGYITALSLSYFLLAPDFKGLSFTEYFIQSEAKPIFLTLGLLLPIGF
IVMSKAVEYGGIVRTDAAQRLALFLQIIAAVILFGETLNNMRVGGVIVAF
FALFCLLTKPTKSIENALKAVFALAAVWLIWGVTGILFKKIALMGGAFPT
TLFVTFSIAAVLMFTYLLIKRTFWNASSLVGGIILGCLNFGNILFYIYAH
QYFKENPTIVFATMDIGVICLGMIVGALVFKEKISKINMLGIVLGITAIL
LLRV
>MS2255 unknown
MEKLFDINEQGLSVRCKLFYEKDVHSIENIVLILHGFGSSKEVKSNAKFG
ERLITKYKNYGAIAFDLPCHGADARKKLSVAECLTYIQLVVNYAKEKLNA
QNLYAYATSFGGYLTLKYIAERENPFRKIALRAPAIQMFHTLTANMTDDE
RHKVAKGKEIMLGFERKMKIGKEFLDELEQGDIQQYDYLDYADDMLILHG
TADEIVDIATSQTFAENNVIELIAVEGADHPFSNPQLMDLAIGRIVEFFH
>MS0879 unknown
MCYFKCLFKIKELFNMQTFLKFTNFMSKTFALWVLVFAFLAFQFPAQFAI
FAPYIPYLLGLVMFGMGITLTFNDFGEVFKHPKSVFIGVAGQFVIMPAIA
FCLAKIFNLPADLAVGVILVGSCPGGTSSNVMTYLSRGNTALSVACTTIS
TLLAPFLTPAIFYILASQWLDINAGAMFMSVLKMVLFPIFLGLIVRAIFK
KSISEISRTMPLVSVISIVLILSAVVAVSKDKIVESGLLIFGVVVLHNCL
GYLVGFFGARLFKLNIADSKAVSIEVGMQNSGLGAALAAAHFNPIAAVPS
AVFSFWHNVSGPILANIFANIKNDDKK
>MS2191 unknown
MMLKNIFAGLTVLLLSACTLVTYQPVDTISHVNAKQGYRMRNAIQQPDGN
LIILMFSGGGSRAASLGYGVLEEFKNAAVRPTAKGTTLIDNVDLVYGVSG
GSVLATYYSLYGRDAVPKFEENFLKKNFQREIISQVFSLSNLPRITSPQF
GRGDLLQEQLDQTLYKGATFGDLERKRKGPFVVVSATDMNLGQKITFTQE
FFDGLCIDLSKMEISRAVAASSSVPLLFSPLTLNNNGGNCHFDIPELIQI
SQNISNDAQKSKNLEELKNTLSLYQNSKERPFIHLVDGGLTDNLGLSGLI
DIYDVAGQEGMYREAVKNQLKNIIVINVNAQNEVSSEIDKTANVPGTRDV
INTIINVPIDRNSQVSLRRFREFTDEWNKSMANKPPKQRINMHFVNLSLK
DLPESQLKKEVLNISTSFYLLHSDVNKLKRSAKILLQQSKEYQDVLRALQ
>MS0331 unknown
MYMFVLFGLNSEHSQHINYFLYTGELSMKKLLKLSLVAGLAMTALAVQAE
ERFITIGTGGQTGVYYVVGQSICQLVNRDTAKTQIKCNAPSTGASIANLN
AIADKQMDMGIAQSDWQYHAYNGTSAFEGKKNEKLRAVFSLHAEPFTLMA
RDDSGIKTFDDLKGKRVNVGDPGSGTRATINVIMAEKGWTDKNFKVAAEL
KPAEMASAMCDNNLDAITYNVGHPNGALKEAAASCDSHLVPVTGPEIDKL
VSEHSYYAKAVIPGGLYKGTDNPVETFGSYATLVSSTDVDADKVYAVVKA
VFDNFDRFKRLHPAFANLKEEDMIKNALSAPLHEGAERYYKERGWLK
>MS1036 unknown
MTDKIYDLHCHSTASDGILSPSEIVQRAHEQGVQSLALTDHDTISGLTEA
RRQAELLGVEFINGVEISTSWENKVIHIVGLNFDENSPEMTALLAKQAQL
RLNRALTIGEKLAKAGVANAFEGASALAKGEVTRAHYARYLVQIGKVANE
NQAFKRYLSQGKSCYVKAEWCDIPAAISIIKQAGGIPIIAHPLRYTMTAR
WIKRLIADFKNWGGEGIEVSGCGQTADQRQLIARWANEFELLASVGSDFH
FPCGWVELGKSLWLPENVTPVWSQFGDKPKYLQNTCKS
>MS0758 unknown
MIIYLHGFSSSRPDDYENVMQLKMIDPDVRVISYSTVHPRHDMTYILNET
HKLVSETQDDKPMICGVGLGGYWAERVGFLCGVKQIILNPNLFPEENMEG
KIDRPEEYLDIKTKCIEDFREKNQSRCLVFLSKNDKVVDPKRSEALLSHY
YEVIWDDTDAHQFKHIAPYIQRLKEFKAA
>MS0522 unknown
MKFYRTLEDFKVISFDLDDTLYDNSQVILDAERHSVDFLREISQIPQLDG
GYWRYWKNKTALDFPLLAEDVTQWRIKTIVELLRAHQKSAVEIERISHAA
MEDFFEWRHKMQVPQQSFEVLNKLKRQYKLAALTNGNVTPSRAGFDQFEL
VLTGGVQGRAKPHQDLFRQTAGYFNVRPHEILHVGDNLVTDVQGAIQAGC
QAVWINLSDKKIQHFSEATLVPTFEITDLNELLFFRNL
>MS1651 unknown
MSMQIKSIQLAFSVLYHYFDECVKKALSNLPIQRVDIPDSLLAQAESLVL
GKLPSEKKKDLPLTSIFEGISQQNNSQQYLYDFKPLSPDSIFPDLQRNEG
HQPFELWQHLAKAVEEIPTSHRENINLWLDHFDTALQCYTSQITCPYDQS
ISFYDFTKAVAAFVVASMDKSADKNRPFLLIQGDFFGVQDFIFSGGSQSN
KQAAKLLRGRSFQVSLFTELAALKVLNACDLPATSQMMNAAGKFLIIAPN
TPEIHKKLDDVQKELNEWCIKNTYGLIGLGIAKMSAGKVDFEQKNYEKLI
KLLFENLETQKLKRLDLTDTTQSVQEESYPNGVCEMNSFFPALPNSNRSI
MTEDQVKIGELLAKKQRIIVCDVGTEINNSYRTQTLKLDMFGYNVIFTDS
RKDTKDFGHPVKLYQIHRFWDFSLAKNTKDELWNGYARRYINAYVPFDEQ
EQIKTFDEIAQADEGINALMTLKGDVDNLGTIFQKGIQPANIAKMAALSR
QMNQFFSLWLPAYCAEYSPNMYTVFAGGDDFFLIGPWHSTQKVAFEMQQA
FKRYVAENPEIHFSVGMVMSKVGLPVPRLGDLAEMALEKAKSIDSGKNAV
TIFNRTVKWTDWQQLCDLEDEIHRLAKDYNISTSYLYSLIRLCEQANDKN
NIESTMWRSHFYYRTARYVIDKLNQEKRDKALNEITISLGENGISQYKIN
FAIPLTNYFYQKR
>MS1440 unknown
MEIAFLLAGKIIELTIIVLLGYALVKSKLLKSQDSYPLSIIGLYLISPSV
MINAFQIDYSPQILNGLLLSLTMAVFLHIILIITGVILKRLLNLDPIEHA
ASIYSNSGNLIIPLVVSMFGQQWVIYATCFIVVQTFLFWTHCRSIICGKG
SISILKMFKNINILSIFLGVFLFAFQIKLPPLISGTLSSLGQFIGPNAML
IAGMLIASIPLRNIITSKRIYLVTFLRLILIPIFLLIIIKLCGFDNWVEN
GETIAMISFLATMSPAAATVTQMALIYGKNANKASAIYGVTTMLCVFSMP
LIIALYQLI
>MS1680 unknown
MMKTVTIDLQIASEDQSNLPTLEQFTLWATNAVRAEHFEPEITIRIVDEA
ESHELNFTYRGKDRPTNVLSFPFECPEEVELPLLGDLVICRQVVEREAQE
QGKPLTAHWAHMVVHGSLHLLGYDHIEDDEAVEMESLETEIMTGLGFEDP
YSYDEE
>MS1976 unknown
MLSYRHSFHAGNHADVVKHIVEMLIIENLTQKEKGFYYLDTHSGVGRYRL
FSQESEKTAEFEEGIARLWQRDDLPEEVQRYVDLIKKLNYGGKELRYYAG
SPLIAAQMLRPQDRGLLVELHPTDFPLLRNNFKEFKNISVKRDDGFQQVK
ATLPPKERRGLVLMDPPYEMKEDYDLVVNTIVEGYKRFATGVYAVWYPVV
LRQQSKRIVKGLEASGIRKILQIELAVRPDSDQRGMTASGMIVINPPWQL
EAQMKKILPYLTNVLVPEGTGSWSVNWIAPE
>MS1279 unknown
MVTGVMFEPEPIYDELDKKPAELTHDQPLGFTDVVDNAKKEAKTTKTANK
KTKDKKSASHLRIVK
>MS1854 unknown
MRWQGRRESTNVEDRRSERSGISMGGKKTGVLGFIILLVGAYYGVDLSGL
VGTSSNIGEVGSSLSQNEEETLEKLSRVVLADTESTWQDYFARSGQKYSA
PTMVLYNGATPSACGTGQSAMGPFYCPNDHKVYLDLSFYNDMKNQLGAGG
EAAFAYVIAHEVGHHVQNLTGILPRISRLQQSNPAQANQLSVNLELQADC
FAGVFGYQAVKNNMFEASDLEVAFAAAEAVGDDRLQKRSQGYAVPDSFTH
GTSQQRLTWFRKGLQTGDPTQCNTFTN
>MS2381 unknown
MRKQLYITHGYTANSQSHWFQWLKNQLIPHQIHTNIFDMPDSSKPNPQIW
LAHHQTYINQCDENTVFIGHSLGCIATLRYLQRQKKKIKGLILVAGFDEP
LDNLPELTSFTLQRIYYPELIANIPQRIVIGSSNDEVVAPKYTQKLAANL
QASYLTVENAGHFLARQGFTEFPLLLKECLNIFNG
>MS0857 unknown
MFTSIQREVNQFINRGLDRTLRIAVTGLSQSGKTAFITSLINQLINIDNV
TNGHLPLFEAARQQRIVGVKRIPQINLNIPRFDYEANLNSLMASPPQWPQ
STRGVSETRLAVRYHNSGLFSHIKEKSTLYLDIFDYPGEWLLDLPLLNLN
YQQWSLEQQNLRQGLRAELAQTWLEKTKKLDLTAMADEDILAQIAKDYTA
YLQACKEQGLHFIQPGRFVLPAELEGAPVLQFFPLLHLAEKDWKKLKEEA
KPNSYFAILNQRYDYYKNKVVKGFYENYFVHFDRQLILADCLTPINHSRQ
AFQDMQEGLQQLFKNFHYGKRRLINRLFYPRIDKLMFIATKADHITSDQI
PNLVSLMRQLVQDGGRHVAFEGIETGFTAIAAIRATKQVLVEQEGKTFKA
LQGIRSKDKRQVTVYPGSVPSRLPSIDFWQQQKFDFDQFEPQPLESGEII
PHLRMDSVLQFLLGDKLA
>MS0254 unknown
MTKLIHLTQYKLIELTGVDSEKFLQGQLTCDVTKLKTGDSTLTAHCDPKG
KVSSVFRLIRVAQEQFYLLFRTDLLPAGLDQLKKYAVFSKVAFAEPEVQL
AGVIGENCGQFSASFVVNSGNAAILINPAERLEFNASAEAWDCVEIQRGY
PILSAKTQNEFIPQALNLQCIEQAVSFQKGCYIGQETVARAKYRGTNKRA
MFIFKARSQIIPEIGGEIEMRLENGWRKTGVILSAVNFGEVLWLQVVLNN
RLEDGQQFRLPADETALELYPLPYELV
>MS1093 unknown
MIMHFACPDTKRFFNGERFVRFISCERLAIRKLQQLNAATSLEFLTKLPN
NKLETTLYNHVSYYNLKINEQWSLLFLWDHNSPTDVKLVDMKEV
>MS2124 unknown
MGFNKYYLKMEEHFLYVPYYNHQRRIRVLLPKDYYKEDWQSYPVLYMHDG
QNIFYSKESYSGYSWKIIPTIKYHKEFPKIIIVGIDNATVDRLDEYAPWR
TDVGNTAEARNTGGKGAEYGQWVVETVKPFIDGHYRTKPQRENTLLAGSS
MGAIITAYMGAAYPHIFGHLGVFSLASWFSENEFLRFMHEHPIDRASRVF
IQVGTKEGDDADAQYISNMNQAYIDSTLYYYQALIRTGHPLDNIRLKIMA
NEIHHEKYWASHFVDFLRFSLMGK
>MS1018 unknown
MEKIMENQSFLQNFFKLNQHKTSTKTEIIAGITTFFTMVYIVFVNPSVLG
DAGMDKQVVFVTTCLIAGFGTMAMGLFSNLPIALAPAMGLNAFFAYVVVG
KLGYSWEVGMGAIFWGSVGLLILTLLQVRYWLMASIPLALRVGIGAGIGF
FIALIGFKNMGLVVANPATLVALGELHDPKVLMGILGFFIIVVLAARNIF
SGVLVSIVVVTALALQFDENVIYRGLVSMPPSLDAVVGKVDIAGALDIAL
LGIIFSFLLVNLFDSSGTLLGVTDKAGICDERGRFPKMRQALYVDSVSAV
VGSSIGTSAISTYVESGAGVSVGGRTGLTAVVVGVLFLLTIFFSPLAGLV
PAYATAGALVYVGILMASSLIKVQWEDLTEAAPAFITAAMMPFTYSITEG
IAFGFISYCVMKVGTGRWKEVNAPVWVVSVLFLIKFIWIG
>MS0714 unknown
MISKIKVSLVALCAGLFFVSVNTSAAETQTQVPQQCQKLFSATERLIEEA
EKQPGTHTQVSKIKNKLNQSKKQILEMELATQIKSCDHGLARLNRLNQQD
QITN
>MS0847 unknown
MADSRIVLDAREQSTSLLSTHKVLRNTYFLLGMTLAFSAFVAYISISLGL
PHPGIIVTLVGFYGLLFLTNSLANSGWGILSAFAFTGFLGYTLGPILNVY
IGAGLSETVVLALSGTAAVFFACSAYVLTTRKDMSFLSGMIFSLFIVLLL
GMVASIFFQTPALHLAISGLFVIFSSAAILFETSNIIHGGETNYIRATVS
LFVSIYNLFLSLLQLLGIFGGDD
>MS0295 unknown
MAIVSVPVEKSYRLLNIGATTLVSAKAEDIENVMSVAWSCALDYGPLSKV
TTVLDKQAFTRGLVEKSGLFAIQIPVANQAELVVKLGTTSRHNNPHKIDD
VEIFYPDGFDVPLVKGCAGWIICQLIRDENNQQNHDLFIGKVLAAYADDR
VFKDAHWIFEQAPNELRTLHYVAGGQFYLIGESLEVK
>MS1916 unknown
MTEKINLMNLTRQQMREFFKELGEKPFRADQLVKWIYHFGEDNFDNMTNI
NKKLRDKLKAVAEIKAPEIAVEQRSADGTIKWAMQVGDQQIETVYIPEAD
RATLCVSSQVGCALACTFCSTAQQGFNRNLTVSEIIGQVWRASKVIGEFG
VTGIRPITNVVMMGMGEPLLNVANVVPAMELMLDDFAYGLSKRRVTLSTS
GVVPALDNLSGMIDVALAISLHAPNDELRDEIVPINKKYNIKMLIDSVNR
YLSVSNANHGKVTIEYVMLDHVNDSIEHAHQLAEVLKNTPCKINLIPWNP
FPQAPYGKSSNTRVDKFQKTLMEYGFTVIVRKTRGDDIDAACGQLAGDVI
DRTKRTAAKRQFGQNIDVQLQ
>MS1514 unknown
MTELTHYNQYIADENAMIAFGQQLIQAINKLDNNKPVVIYLNGDLGAGKT
TLSRGMIQGLGHQGNVKSPTYTLVEEYHLQNKHIYHFDLYRLSDPEELEF
MGIRDYFGTDTICLIEWAEKGIGLLAEPDLIVNIRYADNARDIDLIAQNA
QGEQIITLLAAK
>MS1718 unknown
MRQFMELIIISGRSGAGKSVALRALEDMGYYCVDNLPINLLPELADILST
SQQSAAVSLDIRNLPHSPETLDTLLQQLADAQHQVRIIFLEADRSTLIRR
YSDSRRLHPLSMQDLSLEAAIEAEAGYLEPLLQNAELVINTSEISTHELA
QRLREFLKGKPDKELKIVVESFGFKYGLPLDADYVFDVRFLPNPHWNPDL
RPMTGLDQPVIDFLGKYSEVNNFIYSTRNYLETWLPMLEQNNRSYLTIAI
GCTGGKHRSVYIAQQLGEYFQAKGKKVKIQHKSLEKHHKKNSA
>MS0914 unknown
MNNNGNNMTTKTERQTWSSKITYIMTVAGATVGFGATWRFPYLVGENGGG
AYVLLFCLAMILIGIPMILVENVIGRRLRVNSIDAFGDKLQDENISGGWK
IIGYMGLLGAFGIMAYYMVLGGWVMNYIISLISGILDISTPITKETAKEF
YDFSIGNSPLHIALYTFIFVIINYIILAKGIIGGIERAVKFLMPLLFVFL
IGMVIRNVTLPGAMDGIIYYLKPDFSKITPKLFIMVLGQVFFALSLGFGV
LITLSSYLSKEENLIQTAVITGFTNTIIAILAGFMIFPSLFSFGIEPNAG
PTLVFQSLPIVFSHLWSGTFFAIVFFSLLLIAALTTSITIYEVIITALQE
KLKMRRSKAILLTLGGIFLLGNIPSILGDNLWKDFRPFDKSIFDAFDFIS
GNILFLLTALGCAVFVGFVLKDKAKAELSPTPDSLFTTVWFNYVKFVVPL
IIIVIFVSNII
>MS2167 unknown
MNVPILYFDVVHSIQIHDWIIEKSGGLAGLYPDGTGKLESVLEHIQNDLY
YPNFEDKLVHLIYSINKLHAFLDGNKRSSIVLGSYFLELNGYDYCVKEFT
IKMENIVVWLAESKISKELLLKLVCSILNNEEQYSDELKYELICATSDDF
GN
>MS2014 unknown
MGTTNAGLPKNYRIKLGVIKTPLYSNESISSWLIRAALDCGTEPITFTGF
YWNKWRLWTYDLDRGFEPIAQHIYADITELSLNQQVNLVNHSLYSVLRPI
NGKNTLIKGQAKWVLSRGSRNRSFRVGQSYCPCCLEETPYLRNEWRFAWH
FGCLKHKVLFGSKCSCCGGLYQPHLLSAEKRQLNYCHQCGEKLQVITTPL
NEVEIATMETLDKVFTTNSGECFGKRVDAQVYFAVLRYFINLVRRTAVAK
STHAFARFVEECGISQAEICQTRTALAFEQLPVEERKNLLVNAIKILNLS
SKDFIQATQQSAITQKAFAFENYPMELDTLFKYASKGKTVSRKTVTNKPK
TDSVLSMNRQWERLKRQLKIAA
>MS1274 unknown
MNLQEQLKNAKNWEERYRLIIQAGKNITKPTEQELAEMQPLSGCEAQVWF
KISQNSDRTLHFQAYSDARIINGLLWILSLAVNGKPTEQCRRFDLTSYYA
ELGIAQRLTSTRLNGLKQIEGCIHQAGN
>MS2174 unknown
MIMFKKLLIATALCASFSAMADDSFTLKVKGVENGKFQNKHLLSAEYGFG
CAGENISPEIEWKNAPKGTKSFVLTVYDKDAPTGLGWVHWEVVNIPANVS
KLPAGIDAKDNNLPKGALQTRTDFGVPGYGGACPPENEKHRYEFTLTALK
VEQLPNVTADSTPALVGFFTNANAIAKAQVTVETAR
>MS0732 unknown
MINLKIDGFDVRVDEGTTILEAAKSVGINIPTLCYLKDVSDIGSCRVCVV
EVEGFEKLPTSCNTLAQEGMVIRTQTDKVVKSRRMALDLILSHHNLICFS
CPSNGACELQNVAHQCGISESSFPNFRLPGIEVPHVEDNPFLGYRPDLCI
HCQRCINTCANVSGCSSIKLASRGIFRAIETPFGKDWKETTCESCGNCAE
ACPTGAIYKKEAKSYRSWEIQRVRTTCPHCAVGCQYDLLVKDNKLVGAEG
VDGPSNGGRLCVKGRFGSYKFVMSGDRLTDPLIKDRATGKFRKASWDEAL
DLVASKFMTLKRQYGGDSLAGFACSRSPNEDIYMVQKMVRTCFGTNNTDN
CARVCHSASVEGLARTLGSGAMTNPIYDITHDVDAILLVGSNPEEAHPVI
GMQIREAVRNGTKLIVVDPRDIGLTKQADIHLKLRPGTNIAFANGMCHIF
IKEGLIDEKFIAEHTEGFKELKKIVKDYTPEYVAEICGIDADDLRAAARI
YATAKKAPIIYCLGVTEHSTGTEGVMSMSNMAMMVGKIGREGCGVNPLRG
QNNVQGACDMGASPNQYPGYQSVKDPEIRAKFEKAWGVKLPAHIGLHATD
VFPAAIKGKIKGLYICGEDPVVTDPDTNHVINALKSLDFLVVQELFMTET
ALLADVVLPGRSYAEKDGTFSNTERRVQRVRKAITLPGNSRLDTDIICEL
MRRMGYNQPNLTASEIFDEMASVTPSFRGISYERLEKEPTQSLQWPCTDQ
YHPGTPIMHVGKFARGLGLFYPTVYTPAKELPDAQYPMMLTTGRILYHYN
TRAMTGRTEGLMEIAGHSFIEINSADAKRLNIENGERVRVTSRRGTITTE
ARVSDKTNEGETWMPFHFADGNCNWLTNAALDQFARIPEYKVCACRIEKL
PEDEAFNMKGKYITQKMVAAQWRKKMDKSIAKLVR
>MS0084 unknown
MSNVPKPDFSLCYEKTNITADIEPRLVQFTYTDHLEGQSDELTVEFEDIS
GKWVRQWFPTQGDKLRAAIGYKDSLLVDIGEFEIDEVEYRYKPSTINLKA
LSTGISKANRTLKPKAYENTTLAQVVAKVADSLKLKLVGKIKAIPIKRIT
QYQERDVEFLARLAREYHHSFKIVGSQLIFTDKTELGKSEPVLILEERDT
ISLSLRDRIKDTAKAVDISGFDASGKKVVKKRKKATALRPNLKQVKASSE
DTLKVVTRGETQEQIDARGEAALAEQNDNQTAGNITLIGNPELVAGATIL
LKNLGVFSGKYLIKSSRHSFGRNSGYTTEIEVRMLEFIADDLITLGMEKT
NANA
>MS0747 unknown
MKELFATTARGFEELLKLELSSLGATECQVAQGGVHFMADDETQYRALLW
SRLSSRILLPIVKTKIYSDLDLYSAVVRQNWLAYFDERVRFLVDFNGTNR
EIRHTQFGAMRVKDGIVDYFERNGKARPNVDKDYPDIRIHAYLNRDDLVL
SLDLSGEALHLRGYREDSGAAPLRETLAAAIVLRSGWKQGTPLVDPMCGS
GTLLIEAAQMEAKIAPQLHRMHWGFDFWRGHNQAAWEKVKREAVAMAEAE
FNKNPNPHFYGFDLDHRVLQKAQRNAQNAGVAHLIKWKQGDVAALKNPTP
EDKGTVICNPPYGERLGTTPALIALYSVFGQRLKEQFPGWNASIFSSEQG
LLDCLRMRSHRQFKAKNGPLDCIQKNYQISDRTLSPENKSAVENAGEFKP
NANVATDFANRLQKNIKKIEKWAKQEGIEAYRLYDADLPDYNLAVDHYGD
HIVVQEYAAPKNIDENKARQRLLDAVTATLAVTGVETNKLILKVRQKQKG
ANQYEKLANKGEYFYVNEYGAKLWVNLTDYLDTGLFLDHRLTRRMVGQMA
KGKDFLNLFAYTGSATVHAALGGAKSTTTVDMSNTYLNWAEQNLILNEAD
GKQHKLIQADCLQWLANCAQQFDLIFVDPPTFSNSKRMEDSWDVQRDHIK
LMGNLKRILRPNGTIVFSNNKRGFKMDFEGLTRLGLKAEEISAKTLPLDF
ERNKQIHNCWIVEFV
>MS1987 unknown
MTEQNLLSSLAHMISEQRNPNSMNLDSLSPLELVTLINNEDKQVPLAIEK
VLPQIAQAVEKIVRTFQQGGRLVYIGAGTSGRLGVLDASECPPTYGVKPE
MVVGLIAGGERALRHPIEGAEDNAEQGKADLQQINFSKKDILVGIAASGR
TPYVIGALNYAKSLGAITISIASNPDSAMASIADIAIDTLVGAEVLTGSS
RMKSGTAQKLVLNMLTTASMVLMGKCYQNLMVDVQASNEKLRARAIRIVM
QATDCEKEVAERFLKAADNNAKLAIMMVLTNLDKQQASVLLQRHQGKLSR
ALSQ
>MS0350 unknown
MIIILSLVMQKKFALDHVLQHFLWVGELYGLCYDRQKLMKKLLNKKKDDM
SRNIQELKNIVAKLRDPDGGCPLGSETIL
>MS1068 unknown
MADWILILSVLILRRFRKIYAWQDNKNIGDIMSTSHYVSPKGSMDQLSHM
EIDLLTKRAQSDLYKLFRNSSLAVLNSGAINDDSRALLNKYPNFEISIIC
KERGVTLKLDNSPESAFVDDKIIRNIQYNLFAVLRDILFVNALMQRFGLD
AERGNSFITNQVFSILRNAKALSLNEDPNLVVCWGGHSINQTEYAYCRAV
GLELGLRELNIVTGCGPGVMEAPMKGAAIGHANQRYKQSRFIGITEPSII
ASEPPNPIVNELIIMPDIEKRLEAFVRMGHGIVIFPGGPGTFEEFMFILG
IKLNPENRAQKLPLILTGPKESADYFATIDRFVLDTLGEEAQSLYTIIID
DAVAVARHMKAEMVEIRDFRCKISDSFSFNWSLKIEHQFQQPFLPTHENM
ANLNLHLNQSTVDLAANLRCVFSGIVAGNIKPATQDQIAEKGKFQLYGEP
RLMEKVDNLLQDFIVQHRMKLPTDEAYEPCYEICK
>MS1363 unknown
MKTDFLSSLIFSVGVTLPTILLLILGMLIRKKKMIDDRFCEQSTKVVFNI
TLPVLLFFSVYGKHVDYISQMAVLSVGIIGTISLFLLAELFAARFIAEKR
ERGTFVQAIYRGNSGILGLAFCISAFGDSAAVPASIYSAAVIFLYNILAV
ITLTRSLSTGSVSVVSIMKGVIKNPLIIAILFALIANSISLQLPAPLLST
GNYLANMTLPLALICTGATIDLSVFSNKTSNVVLMGSLGRLVVTPVFMIL
IGKVFGLDGMLLGVVALMNTTPVASAAYAMVRAMGGNSVTVANIIGITTV
GSMITSSLMLLILSQAGWI
>MS0298 unknown
MATNYYDITLALAGVCQSAKLVQQFALEGKADEEAFNTSLYTLLQTTPKD
ILSVYGGHERNLKLGLETLLEQLNGSTEDITRYWLSLLALSGKLEKNAQA
KSELARRIQYLPTQLEHYDLLDEQMLANLASMYVDIISPLGNKIQVKGSI
EVLQQTSMHHRIRACLLAGIRSALLWRQVGGSKWQLLFSRRKIFNMAKQI
YSSL
>MS1427 unknown
MQVISGHWGEMLPFYLQRLDDSIPQAATGLKRSITQTFKEQVFVTPSGML
TLPHFNFIYELVGADRILYSIDYPYQTLDGARAFIENLPISQAEKELIAY
KNAEKLFGLG
>MS0894 unknown
MNKLALYCRIGFEKETAAEITEKAAEKGVFGFARVNNDSGYVIFECYQEG
EADRLAREIPFNQLIFARQMIVISDLLENLPPTDRITPIIEEYNRIGSLV
NLHRTTELFVETADTNEAKELSVFCRKFTVPLRQALKKQGYLAFKEVKKS
GLTLHIFFVKPNCCYVGYSYNNNHSPNFMGILRLKFPPQAPSRSTLKLHE
AILTFLSPEEERKCMNESMYGVDLGACPGGWTYQLVKRGLFVYAVDHGKM
AASLHDTGRIDHCPEDGFKFQPPKRSKIDWLVCDMVEQPIRIAALIAKWL
VNEWCRESIFNLKLPMKKRYAEVQNCLQLITNELDKAGFKYHIQAKHLYH
DREEITVHISVKK
>MS2252 unknown
METFIHGFLVCGGLIIAIGAQNAFVLKQGLLKNHILAVILTCFICDIVLI
SLGVLGLGSLISESREATVALGIVGALFLTVYGARAFRSAYLGNSSLEIQ
SQRQDNTSSAWKAVLATLAITLLNPHVYLDCFAIIGGIAGTLTPDQKILF
LCGALCTSFLWFFSLGYGARLLIPLFKRPITWRILDFVIGSVMWLIAFGL
AKYAYQLA
>MS1997 unknown
MTEFKLNYHKTHFMTSAANIHQLPKDEGMEIAFAGRSNAGKSTALNALTN
QKNLARTSKTPGRTQLINLFEVEPQYKLVDLPGYGYAAVPEQMKLQWQKS
LGEYLQHRECLKGVVILMDIRHPLKDLDQQMIEWAVSSDLPVLLLLTKAD
KLSQSARSKQVKTVREAILPFQGDVQVEAFSAQNKIGIDKLAAKLDSWFS
SLLTE
>MS0416 unknown
MLDVGLISYFSLVDLSIAFQHSKRNKIKKCIDHKFIGKYKKIVRGRKMYD
LITMNQYDALIFDMDGTIIDTMPSHAKAWEKVGEVLGYPIKGDVMYEFGG
ATTKIIAQETMRRYGVPAELLEQVVTMKRQFGQEMVLQNATLLPTMQVLE
HFLGKKPMALGTGSHKAMVDMLLQRFDLNDYFSAVVMAEDVQKHKPDPET
FLRCAELMKVDPVRCLVFEDADFGVTAAHAGGMDVFDVRINQIMKVS
>MS1448 unknown
MTTKKQAVFSRLVNELVQKNQGKRIFSFDFENQTYWVKQPEKLTGVWKIL
KPHPKQSFREELHILKNLYERGAPVPQVILSGEDFFVLKDVGPTLNHWIE
NAGLNLTPAEKNQILVDAIKALTSLHKKGVTHGRPAIRDIAWRQGKVTFM
DFESHSRSLNLQWHKIRDVLVFIHSLCRSKHLSGEQIQYLINKYEEYCES
DLWQDVLNLVAKFRFLYYILLVFKPVARMDLIAIYRLFQYLLPLTEENK
>MS0108 unknown
MSAPLTFQQVFDRVVGHEGGYVNDPHDPGGETNWGITKYTARENGYTGSM
KAMTREQAYKIYEKAFWQRYHCEKLPEAVAFQFFDAAVNHGVGNASRMLQ
RAVNVADDGIIGKVTLSAVEKMPISDLLLRFNAERIRFYTKLKNFPRYGK
GWMNRIAGNLAYAAIDNEV
>MS0407 unknown
MLTFFIITLIVGSIVGFLAGIFGIGGGLVIVPTLLYLLPMVGVPDEKLMA
TALGTSFATIIITSLASAYRHNKLGNVVWEAVKYLAPTLVIATFISGLFI
GKLPKDISSKLFACLVVYLAAKMVLSIRNKKSKTPAKPLTPQSTILGGIL
IGIASSAAGIGGGSFIVPFLNSRGIEMRKSVGSSSFCGAFLGLAGMLSFM
IGGWSVEGMPDWSLGYIYLPAVLGITLTSFFTSKFGAEMANKLPVASLKR
YFAIFLILMAIKMLIG
>MS1070 unknown
MYLVEVFFKNINEDNLPQQIPLINQLIDQWRYNGQIIGREIPVFVANQEN
ERGLATRVICPEQQSLLPEYNNAEVNRCLANIENCGLILHSFQIVGEDLN
SDITYEDKKPDWQILYTTYLQVCSPLHSGDRLAPIPLYKQLKDVPHLSMD
VIKWQENWQACDQLQMNAVALESQALREISDINSRIFKHGYSLTKEIEEH
TGVPTYYYLYRVGGKNLASESARHCPICHGDWKLAQPLFDQFHFKCDHCR
LVSNISWNFL
>MS0318 unknown
MRNNMSEQKITFADQKRKTVETAEFTEDGRYKRKVRSFVLRTGRLSEFQR
NMMNDNWADFGLEHQNNYFDFAEIYGNTNPVILEIGFGMGKSLVEMAEQN
PERNYLGIEVHTPGVGACIAYAVEKQVKNLRVICHDATEILQDCIADDSL
GGLQLFFPDPWHKSKHHKRRIVQPNFVDNVMQKLQQSGFIHMATDWENYA
EQMLDVLSQSKALTNTSKTNDFIPRPDFRPLTKFEQRGHRLGHGVWDLYF
VKN
>MS0091 unknown
MSIAINQIVNANVYIDGNSQIGKAQQIKIPDIEFEMVDHKGLGLFGTIKL
PSGAKAIEGGVNWDSYYPEVRAKLYNPFKNFQLQCRSNLQVFNAQGLAAE
EPMVTIMNVSSVKIGGTDVESKENAKFDDTFAVHSIKQTVAGKEILFIDV
FANIFRVNGEDVLSKYRTNVGQ
>MS0906 unknown
MNSNLIRLIFITLLSLGLTLISSFVLARLLSVQDRGLHQLFITAVSYVVT
FATGGSGFALALSMRKKQYAGWQNYFIAFLALSVLAAIIAIYCFDFTAFH
VLFVLNVVLTAILTMTLEKSKIDANLRVYRQLTLQQPVLLVAVYGICYLL
LGEQPLEIAIELFTLFSAMQALACLYYLKKINADFKRKNEIQPIQKRFFL
KTWFKQNLLQIFGATTASLDKFLIVYFLGNYTLGLYTVCIAFDSLITKFI
NMLADYFYSGLLNNINRIKSVLILILLMAVGAVILVPLLAEPIIIFFFSA
KYAEVAPVLILFIINAIIGGLSWVLSQNMLLLGKQVLLFTRQIIAIAVFV
LLFYLFKDYQLYGVAYAFIGASLTRLIISVIYYLKYPITDVKPEKSAV
>MS1909 unknown
MQFIKNGRQYREATSQKISWGHWFALFNIIWAILFGSRYAFIIDWPSTLW
GKLYFFISILGHFSFVVFAGYLLIIFPLSFIIKNERTFRGLSVIVTTICL
TLLLIDTEVFSRFNLHLSSVVWNLLVNPEDGELSRDWQIFFAPMPLILLV
QMLYSRWSWNKLRSLERQKWMRKVGIFFVTMFVATHLIYAWADAYIYRPI
TMQKSNFPLSYPMTARTFLEKNGLLDKTEYAQTLEQEGRPEAFNIDYPKH
KLAYMPIERKPNILLINISGMRYDSVIESKMPNLTEFAKQSAQFMNHYST
GNNSNLGLTGLFYGLNASYTDSILHNKTESELFKKLQAEHYQMGLFSANN
FKDSLFRQALFQKVNLPRIKAGNQSAVKNWLIWLNKAHLDQAWFSYLDLD
VLTAVQNADPKSKEEETEIYDNQLGNVDVQLQIVFEQLQERGLLDKTIVI
ITADHGHAFQLSDKEHIDYFGLDEIQVPMIIRWNALLNEQQSKLTSHVDL
VPTLMQNVFKVENPITDYAQGESLINISRKADWILVGNYRWNVIISPNGN
QYHIDRKGQYQKYNVDYEKESSLRPPLGLFLEVFTQSRSFMAK
>MS2096 unknown
MTIKSVEISKAYRLVQLGSTTMLSAKHDGDADVMAAAWVGLGGPNKIIAY
IGTQAYTRKLVEQNGYFVVHIPTVQQMETVLYVGEHSKHTMPNKLDNLPL
FYQEGVDIPMVEGSAGYLLCQVIPNPQQEQNYDSFMGEIVAAWADDRVFD
GRHWTFDTAPDELRTVHYVAGGQFYAMGKGTKFDHGPGQD
>MS1868 unknown
MQKVKLPLTVDPVKDAQRRLDYVGYYAADQLVRLNESVVKVLSDAQVTLS
FFIDPQKLVVMKGQAQVEVELECQRCGQTFNQTLECTFCYSPVANLSKID
ELPEIYEPIEFNEFGEIDLIGTIEDEFILNLPIVPMHSSEHCEVSAQEQV
FGELPEELAKKPNPFAVLANLKQK
>MS0568 unknown
MNLRVFLLMMKKCIRFIFLLLLMFAAAGFWGYNYIQKLVNEPVNIKAEQL
LTLERGTTGKKLFALLEKENIIADNILFPLLLKLQPQFNNVKAGTYSLEG
VKTLGDLLTLLNSGKEAQFALRFTDGETWKQVKKSLENAPHLKHELKDKT
DVEVFHQFKEMLPEFEVQNAYKTLDGWIYPDTYNYTPNSTDVALVKRSVE
RMVKTLEKAWAERDEDLPLNNPYEMLILASIVEKESGISAERGKIASVFV
NRLKAKMKLQTDPTVIYGMGESYQGNIRKKDLESPTPYNTYVIDGLPPTP
IANPSEDALNAVAHPERTDFLYFVADGSGGHKFSRSLIEHNKAVQEYLLW
LRRNKNK
>MS2009 unknown
MPVFNAHVAQGKLTKEQKQGLADAFVLAIHDALNAPMEDQFVIINEHPQD
NIFIHPTFPNMQRTDKRMVVTVDVSTTRTLEEKRKLTELVTKYAVEKAGI
GQDDISLLIYALPLENMSFGRGILMPDDAEAMVKRTRS
>MS1931 unknown
MMLSPSEILKKTTALFAATICLYFACKLILMGTGFYPQPKLTDILLFAIL
IVIFNSSKNLFYFLLLPFIIAHALYAPVGITFGAPSYQYIASVFATDLME
SREFLSQLSIKNYLMPVGIIGLTLAFRWITQKYDLKLHKNKMFLASITAF
MLLANSPFKFIDEISTSGTQVISELQRLNNMTIESEWGDSQLINSNYDDY
VLIVGESARKDYHHAYGYPVKNTPFMSKANGVLIDGMTAGGTNTIASLKL
MFTQPNTQTKEGNYSLNFVDLIKSAGIKTYWISNQGYLGEFDTPISAIAN
KSDEKIFLKSGDSLNSNTSDFELLPKFTQVLERPSTGKRFIVVHLYGSHP
ITCDRLNDYPKLFDDDKIAKKYFNVNCYISSIKKTDEVIKRIYDALAENK
AKTDRTFSMIYFSDHGLAHQITEDNIVIHNSSGKSKRHYDIPLFKISSDD
TKRHEYRVFKSGLNFTAGLAYWVGISNAKLAVREDLFSNEPDKDDYGLKA
EIDKIDVPEDKAVVIPGTH
>MS2154 unknown
MQKLKIETQSGTLLDGVLFSQTPSKTVIIAITGIHGNFYSNPFYYNIGHT
LSQSGIDFIYAQTRNAFGKTDFVNPKTGQPESIGSWNEDFAKTIEDLTAY
VDFAEQKGYQHIVLAGHSLGANKVIHYLAETQDKRVAKFILLSPANVTHL
TNAISEQQRAYIRHQVEKGNSQRLLPFELFGWLPCIADTAFQWLYSPLLN
NVHVEPNSDFSQVAKIQHTGALLIGTLDRFTYGDPPGFLRNINNHFQSAD
KNTLIFIENTGHTYQQKEQEVADKLLDLVKDWGY
>MS1757 unknown
MHIIKEKLAKSLMFVVIIALCITVMSIILFGINQFKIGSQLASVNQVSNL
SHLLVRQQANLFSMLLVNNAGNEQLTDNLENLTKDKFVLDASIYGKNGEL
LAQTRNTLDLREQLGLNEESSKHHVVNRQQIVEPIYSPNGIEGFLRVTFD
SKYGQTTQNKINQIFHRLYGELIIVFLAGVILASSVHYFLSHYRRARRSQ
ITEQINTVKEIKNSSALVFHRRRRRYR
>MS1511 unknown
MTKRKLTQNQKRRIHSNNVKALDRHHRRAKKEIDWQEEMLGDTQDGVVVT
RYSMHADVENSQGEIFRCNLRRTLANVVVGDHVVWRRGHEKLQGISGVIE
AIKPRENEIARPDYYDGLKVMASNIDRIIIVSSVLPALSLNIIDRYLVIC
ENANIPAVILLNKVDLLTDEQWREAEEQLEIYRKIGYETLMISAISGKNM
EKLTALLADGTSIFVGQSGVGKSSLINYILPEVNAQTGEISETSGLGQHT
TTSSRLYHLPQGGNLIDSPGIREFGLWHLEPAQITNGYREFQYFLGTCKF
RDCKHIDDPGCALREAVELGKIHPVRFDNYHRLISSREENKSQRHFMEQD
IR
>MS1906 unknown
MRSKLIKIICLRSRICVSETIHTLEKQAKLAIITNGFTALQHLRLQRTGL
AQYFQFITISQELGIAKPDARIFEHSLQQADIEDKSQVLMVGDNLHSDIL
GGKNAGLDTCWLSYDKANDSDIAPTYSIKKFNELLDVVAA
>MS1387 unknown
MKQLENVRIFGGEQQVWQHQSATLNCTMNFAIFLPKQAKTEKLPVLYWLS
GLTCTEQNFITKAGAQRYAAQHKVIIVAPDTSPRGDDVADNESYDLGKGA
GFYLNATQQPWAKHYQMYDYIVNELPALIAEHFPVNGKQAISGHSMGGHG
ALTIALKNPQRYSSVSAFAPIVAPTQVPWGQKAFQHYLGDNQTQWTQYDA
TALVNAETRLPIRIDQGDKDSFLTEQLRPELFLDACRAHHVACEYYLRQG
YDHSYYFIATFIGEHIAFHAKALYQDSEALPL
>MS1268 unknown
MKILITGATGLVGKALTRQLLKQSHQITALTRAVNTAQKLFPEVDWVSSL
STYKNLDQFDAVVNLAGEPIFDKKWTDEQKLRLKNSRILLTQQLTQLINR
GKRPPVFISGSASGFYGNAGSQLLTESALPATSFTAELCQAWEAAAQQAD
TRVCVIRTGMVMSPRGGALARMLPLYRFGLAGKLGSGQQFMPWIALKDMV
RGIIFLINNPNAVGAFNFSSPNPVTNKEFNRLLGSRLKRPHFFSVPACIL
RLFLGERACLLLDSQNVYPKKLLDLGYTFQFEHLETYFSKTLKQKRKK
>MS0512 unknown
MIIGPFINAGAIVFGGLIGAALGGRVPERLRTNLTMLFGLCSMCMGIVMI
AKVAQMPAMILALLLGTILGELILLEQGINKLASKTKTIVEKILPNNQKK
GVSHEEFLQKFVGIVILFSFSGTGIFGSMNEGLTGDSSILIVKAFLDFFT
AIIFGTTLGSTIATAAIPQTVLQIALAYSAVLIIPLITPEMRADFAAAGG
MLMVATGFRICGILHFQVANMLPALFIIMPISAIWLQMMG
>MS0081 unknown
MTEAIKIINDDVKIVLAETIADYEKRTGKTLRPAHIERSIIQSYAYREQL
VRQGINHAFLQTFPQFATGLALDLCGEPMGCYRLSDLPAEVTLRFSVEGD
HDAVVIPEGTLVAATDNVVFATDTEVRISSTESYVDVVGICQITGAVGNG
WQLGQVKTLKSTLDAKVTVSNIDVSDNGIDTESDDDYRKRILLAPEAFTT
CGSVAAYEYHTRSVSQYIADVDIATPVGGTVQVTILTKQGLPSSILLNKV
KDHISGEKLRPLCDTVVVSSPERVAYSVVANLDLLETVAESDVKVQAEAA
LRAFISSRTQLLGADIVPLDIQAALKVAGVYNVTLASPTLTKLTKQQWAE
CESITININGERQDG
>MS0011 unknown
MTKQIAVLIGSGSTTSFSKLVVSHLQKMAPASIQLNIVEIADLPLYDRDL
DENSPAQYTRVREQIANADGVILVSPEHNGAISAMLKNAIDVVSRPMGQS
KWFGKPAGIVTVAAGMAGGVRVADQLRTIASGSFIGMPVYQQNACVGGLF
NGVFDQNGEITIDAVKQMLQQFIDGYAEFVAKF
>MS1905 unknown
MKYQWIFFDADETLFSFNAFAGLQKLFADNGLKFNEQDFTQYEKVNKPLW
VKYQNAEISAEQIQTIRFEPWEQKLGKSAVEINQDYMLALADLCKRNHSH
PGKTGKTGNYYKRLYRLATSSSAKNRFSTIFPVYYYFARTRHSQTGRPNL
RA
>MS1438 unknown
MKNLKLSIATIAVASLLSACTSQYATEKHEQLKLQNQAALGIVWMQQSGE
YQALAHQAFNTAKTAFDQAKKTKGKKKAVVVDLDETMMDNSAYAGWQVKN
GEDFTQETWTKWVNARQTAAIPGAVEFANYVNNHGGTMFYVSNRLENGER
QGTIDDMARLGFPGVSEKTLILKDGKSAKSARYKTITDQGYDIVVYVGDN
LNDFGDATYRKPNAERRDFVAQNAKQFGTKYIVLPNPNYGDWEGGLDSNY
YKGDVKNKVDIRLNSIKAWDGK
>MS0296 unknown
MSHLIVKEQKTIIRNAFFTFLYFTLAIGLAIGILYTDIFYLQNMIEEESL
VEYTQSLSLTILTLMFSRHAYRSPQWRGGFVLITGFFLCMLIRESDALFD
NLIRHGSWAYFAIITALVCIIYAFTHRQSTIDGLAQFAKQKEFHSFIIGL
LTVLLASRLIGYGGLWRFILYNDYPHIVKNIIEETTELFGYLIMLFSCLS
LTRHFK
>MS0289 unknown
MSDIAITISILSLAAVLGLWIGQWKIKGVGLGIGGVLFGGIIVSHFSEQN
GLQLDAHTLHFVQEFGLILFVYTIGIQVGPGFFASLRKSGLRLNALATLI
VALGSLIVVIINKAFDVPLDIILGIYSGGVTNTPSLGAGQQILTELGMQN
ITQSMGMAYAVAYPFGICGILASMWLVRLIFRVKVDDEAKKFTQESGQQT
ESLQKINIRVANPNLDGLCLRDIPGFDERGVVCTRLKREENISVPKADTT
IFLNDVLHLVGDSHSLQRMCLIVGEKIELEPSKLVGNIPFRTGCGYQ
>MS0092 unknown
MAFHHGSETKRVNGGSVAVSTVDGAIIGIVGTAPMGAVNELTVCLTKKDF
SQFGTILDQGFTLPDAFDILARYASGQVYVVNVLDPAKHRTTVTDEVLTQ
DSDTLVATTAKKGLISVTNVKLGGSLLTEGETYSVNLESGEITLTVAAGE
QDLTASYVYADPEKVTEDDIKGGVDSLTGKRQGFELLRDGFNLYGADAKI
LICPEYDKTASCAAALATLADQMHAKAYVQLPKGTSLSKAIQGRGSLGTI
NASASNENVRHFFPYALGSSNNLESLATHAAGLRMKVDVDEGYWFSTSNH
ELSGVIGMEIPLTARVDDIQSETNRLNAVGITTIFNSFGTGFRLWGNRSS
NYPTETHISCFEVASRTGDIIDESIRQAELQFIDKPIDDALIDSFIETID
TFLRSQKSLVGYSVGLDYDYDLVDAFSQGQIPLIYDYTPKIPGERISNKS
VMTRTYLANLVSQR
>MS0397 unknown
MIMKKFFLFATALLLASCSAQKPNLVSTQKPILNIAANLAQSIEANAGAH
SAWVKNKSQQPIAFNYNLYWYDENGITQLFSTQQEKYQGALLLQPQQKAE
INLTKPTAESVNYRLYLFSGNN
>MS0357 unknown
MKIQHTEDQQQGEFFILSETGEKVAKLTYFYQSPRVINANHTYVSDSLRG
QGIADKLYQALIQLIKEKRLELIPSCSYIAKKWRRDHQKS
>MS2113 unknown
MKKDLIYRKRYLERVRPFIGKSLIKVFTGQRRVGKSYLLFQIMQEVQASD
SQAHIIYINKEDLAFSHIKTAQDLAEFVLIEKKSGKKNYVFIDEIQEISE
FETALRSLLLDDELDLYCTGSNAHLLSRDIAGSLSGRAIEINVHSLSYFE
FLEFMRLEDSDKTMSQFLKYGGLPYLKDLPLQDNIVFEYLRNIYSTIAVR
DIINRYALRNVQFLEQLTQFFASNIGNLFSAKKISDFLKSQRISANTVQV
QNYAEYLANAFLIHKVPRYDIEGKRIFEIGEKYYFEDLGLRNALIGYRVQ
DRGKLLENTIFNHLQIAGYDVKIGGLGTQEIDFVAEKDGERIYVQATLTI
NEEKTLEREFGNLLKIQDNYPKYVVTMDEFDGNTFEGVECLSLREFLMLL
MDSND
>MS1426 unknown
MMTRRTFLTASGLMASGLFLPKICKSETLLQRRRPMKIIAVEEHVLDADL
GKASMPAALAQAPYLPDWGKTVQDGYNLDRSRPQIEQNALINPKGFDMGE
GRLKEMDLAGIDMQVLSYGGFPQFALKEQSAALNRAANDRLAEAVAKHPD
RFAGFATLPWGQPQEAVKELKRAVNELGLKGALLNGRPSEHFIDHSDYEP
LLAAFHELNVPLYLHPGVPVQAVQQAYYGGFSPEI
>MS1855 unknown
MQQHKQIGRICTLLIRGSKTSHAKKLRKTMNITNPNGDRKAVVIFSGGQD
STTCLLKAIADYGVENVEAVTFQYGQRHAIELEKAKWIAQDLGIKQTLID
TSVIKTITANAMMDNIKITKDEAGMPNTFVDGRNALFLLYTAIYAKGQGI
RDIITGVCETDFSGYPDCRDVFIKSMNVTLNLAMDYQFNIHTPLMYLTKA
QTWQLADELGALNYVREHTHTCYLGVEGGCGSCPSCILRENGLQQYLASK
Q
>MS2172 unknown
MKLKALTSALILATTLSGGIAMAKTQSATVAEMPAQTIQLTQEWDKVFPK
SDKVEHRKVTFKNRYGITLVGDLYLPKNAQGKLQAIAVSGPFGAVKEQVS
GLYAQTLAERGFVTIAFDGSYTGESAGLPRDLASPEINTEDFSAAADFLG
SLENVDREKIGVLGVCGWGGFALNAAVGDPRIKVVATSTMYDMTRVMANG
YNDSVDNDARYQMKQDLNNARWEAMSHDYANTGAPVLPSEKELNADTPKF
VADYVNFYKTKRGFHPRSVGSNGSWTTTTPIAFINMPILQRAGELRAPAL
IVHGENAHSRYFSEDAFKTLGSKDKELHIVKGASHTDLYDNQANKIPYDK
FEQFFKANLK
>MS0874 unknown
MKLKYKLCIALFAWVSAFHVAAAPQTHAEVSNVTTELNDIQIRLKAQQSA
DKGDWKTVYTLLLPLAQRGDSQAQVNLGILFSSGRGVEKNLEKAYWWFNE
SAEQGNAKAVTYIGLMYLEGVGVKQDTKHAIRILEKAGRVDYPRAMLALG
NAYYMEKNLQKSFLWFERAAMKGVSEAQFKLGMMYEKGEGTHKDEEQAVY
WYQTSLKANDDIAEFAKERLSALGRLR
>MS0573 unknown
MGLFEAIFILFLLIVISAIISSSEISLAGARKIKLQSLANEGDTRAEKVL
KLQEHPGRFITVVQIGLNMVAIFGGMIGESALRPYIQQTIHQYTNAPWVD
GAASCASFVVVTAAFILLADLMPKRIAITYPEQVALRTVGVMSFCIVIFK
PLVLLFDSVANGLFRLLKISTVRHDSMTSEDIVAVVDAGAEAGVLKAQEH
YLIENIFDMQERTVTSTMTTRENIVFLNRTFDRQKVMETLTKDSHSKVLI
CDNGLDRILGYVESHTLLTLYLREEQVSLTDQRILRKPLFIPDTLSLYEV
LELFKSSGEDFAVIVNEYALVVGICTLNDVMSIVMGELVSSEEEQIVRRD
EDSWLIDGATPLEDVMRALNIESFPDWENYETISGFMMYMLRKIPKKTDF
VLYDKYKFEIIDTENFKIDQLMVSIRKDLNEQN
>MS1716 unknown
MSILYAENLAKSYKGRQVVSDVSFTVKSNEIVGLLGPNGAGKTTSFYMVV
GLVRHDQGKIRIDDEDISLLPMHNRAQKGVGYLPQEASIFRRLSVYDNLM
AVLEIRKDLTKEQRHARAEELIDEFNIGHIRDNLGQSLSGGERRRVEIAR
ALAANPKFILLDEPFAGVDPISVIDIKKIIKDLRDRGLGVLITDHNVRET
LDVCERAYIVSAGKMIATGTPTDILNDEHVKRVYLGEEFKL
>MS0098 unknown
MIKITLDDTQAVKKLQSVAAQLKAPRRLYALLGEELKKIHDDRFKTEKDP
NGKPWTPLAAKTLARKRKRGKSLKILRQDGNLANKTAYNILDDGVEFGSP
EVYAALHQFGGKAGKGRQVTIPARPWLGVNKENEYYLLKKAVSHLQKSLG
KIK
>MS1271 unknown
MRQKIFLFVRSLIILYLILFIGEGIAKLIPIGIPGSIFGLLILFIGLTTQ
IIKVDWVFFGASLLIRYMAVLFVPVSVGVMKYSDLLVSHASSLLIPNIVS
TCVTLLVIGFLGDYLFSLNSFTRLRKKAIKKRDINNVNNKGEAS
>MS1143 unknown
MLIGLFIGLLFGFFLQRGQFCFVSGFRIIYTQRNFRFLTALLIAVSIQSI
GFFSLSGLDLITIPNTPMPLLATLIGGLLFGIGMVLANCCASGGWFRTRE
GAVGSWIALICFALTMAATQTGALKQWINPLLLETTTLDNIYNTFNLSPW
ILVTVLVLITVVMIVYHIKHPRYQFPQEPTTALIPHRIFTKHWHPFTAAV
WIGLLGVLAWLVSEQYGRSYGYGVAVPTANVVQYIVIGQQRYLNWGSYFV
LGILLGSFIAAKLSGEFEIRLPEPKAILQRMLGGVIMGIGASLAGGCTIT
NALVSTAYFSWQGWLATLMIMIGCWLTSVLVKPTQCRI
>MS0372 unknown
MIKGIQITKAANDNLLNSFWLLDSDKGEARCLAAKAEFAEDQIVAINELG
QIEYRELAVDVAPTIKVEGGQHLNVNVLRRETLEDAVNNPDKYPQLTIRV
SGYAVRFNSLTPEQQRDVITRTFTESL
>MS1287 unknown
MNNSYGTLYIVATPIGNLQDITQRALDIFTQVDLIAAEDTRHSGLLLSHY
GIKKPFFALHDHNEQQKADALVEKLRQGTNIALISDAGTPLISDPGFHLV
RKCRQTGLKVVPLPGACAAITALCASGIASDRFCFEGFLPAKSKARKDKL
QNIAEEDRTLIFYESTHRILDTLEDIEAILGAERYIVLAREITKTWETIT
GDTVANLRKWLAEDPNRTKGEMVLVIEGKAKSDDAEEISPQAIKALALLA
KELPLKKAAAIVAELYGYKKNALYQYGLEYLD
>MS2159 unknown
MKKILVLTGSPHPNGASSRLADEFVKGAKEAGNDVFRFDAGLQPLGELHF
LQLDASERTIADNDIVSREVLPKLIEADVVVFVSSLYYFGMNAQLKAVID
RFYSINHELKDDKQSAVIMAGYGEGDDLKPMKDHFNIIQKYMRWQNIGTI
VAEDSWNAAKLAKHLQEAYALGKSISA
>MS0868 unknown
MAGLTDKGTFMEVTIEITVILFTVAVIAGFIDSIAGGGGLITIPALLMTG
MPPALALGTNKLQACGGSFSASWYFIRRRAVDLSAVWLILLMTFIGAVIG
TILIQLVDASLIKKVIPFLVLAIGLYFLFTPKLGEQDARQRLSYGVYAFT
AGVSIGFYDGFFGPGTGSILSLACVTLLGFNLAKATAHAKVFNFTSNFAS
LIFFLIGGHILWSVGLVMLVGQFIGAHFGAKMVLSGGKKIIRPMVVIMSF
IMTVKMAYDQGWFS
>MS0387 unknown
MIRFPRFNLRSSTLIAIVALYFTLVLNFAFYGKVLTQHPFTGKPEDYFLL
TVPFFVFFTLNAVFQILAVPLLHKIIMPLLLIISAAIAYSQVFLDVYFTT
DMLENVLQTTSAESTRMITWQYVLWIIGFGIIPAFLYLSVKINYHTWFKE
LGIRLGAILVSAVVIFSISKFFYQDYAAFVRNNKPTVNLILPSNFITAGV
NEIKRIHDANRPYEKIGLDAQQEKPDPYRHFTVIVVGETTRAQNWGLNGY
QRQTTPKLAARGDDVINFNHVTSCGTATAVSVPCMFSYLTKDQYNGSKAE
KMDNLLDVLQRAGVNIFWLDNNSDCKGVCLRVPNETVNMTLKDYCTEGEC
LDEVLLRDFDKILNETTKDTVLILHTIGNHGPTYYERYTPEYKKFVPTCD
TNQIQTCSNEQLVNTYDNSILYIDNFIDSVISKLENRDDLESAVYYVSDH
GESLGENGMYLHGAPYAIAPEQQTRVPMVFWFSKTWKKNEGVDLNCVREK
AKTREFSHDNLFSTVIGMMDMNLKTSVYQPEFDILASCKRH
>MS1397 unknown
MMKKSLFLTALSLAILTGCQNVGSQALQIEKQGSFTVGGSYVTHKGTFKQ
ENFIAPEGQRAYGDFAYVKYQTPTNAKKYPLVFQHGGAQSSRTWESTVDG
REGFDTLFLRKGYSTYLVDQPRSGKSNLSTKAITPDTPWASNPMYADKTF
WILSRMGHYDSHNQPVANAQFPAGEAAYQAFQQAWTIGSGPLDNDLNADV
LTQLVDQTKGAILVTHSMGGTIGWRTALRTDNVKAIVAWEPGGTPFIFPE
NEMPKITKARFEALSGAAMGVPMNEFLKLTKIPIVLYYGDYIQVGSDNVG
EDKWGTELAMAKQFVATINKHGGDATLVHLPEIGIKGNSHFLMGEKNNQQ
LADLMADWLKQKELDK
>MS0621 unknown
MCQMLAMNCNTPTDIVFSFEGFRRRAGMTDSHSDGFGIAFFEGKGVRVFR
DDQPGAVSPIADCVKQYHIKSLNVIAHIRKATQGVVNIENTHPFIREIWG
ENWVFAHNGNLNALPDLSSCYCTPIGDTDSEAAFCYIAAKLKERFCRKPT
ENEIFDTIKELAAELAQHGTFNFILSNGQWMIAHCSTNLHYLTRQAPFGV
AQRIDDDGIIDFSNYAKDTDKVTIITTFPLTKDEIWAKMEHGGMVMFKDG
VKIREAIGTPKEAVDDGTLGCTKIAA
>MS1679 unknown
MLAERRLELYFAENPPHFFDEMAKSAVILSPENFHNAKNLLGREFDQILF
DGRTSLNLDALAIAAGTLRAGGRLLLWLDKNPHVDPDSLRWSGAEQAVET
PNFYAHFNRLLQVYGCDNGIQAQNNQSVSTQKTNIASTATAEQQQIIRQI
LQADSDIFILTAKRGRGKSALAGLLAKELRNSAQYHKKPFNVYLTAPNKS
AVETLQLFAGEKITFIAPDELCRRIGQNARQFSQDWLLIDEAAMIPLELL
FQLTSTFKHILCCTTIHSYEGTGRGFLLKFLPNLHRSFQQFELIRPLRWA
ENDKLEKFIEELLMLEAEDRLIQPPYSIKSAVKIRQISQNELVEHITDFY
GLLTLAHYRTSPLDLRRLFDAVKQHFLIAEWECYLLAGVWALEEGGFSDK
ALIRAICRGERRPKGNLVAQSLAFNCNLPEACALKSLRISRIAVQPDWQG
RGLGLQLVEKLAQTAQADFLSVSFGYNEELAHFWQKCGFILVNIGEYKEA
TSGCYSAIALRPLTAAGEDLVKRAQQYFRRNLAFTFHPLHDKLSVEKSSA
EKITQLNGQDFGILENFADYHRTFYSSQGAIYRLFIRLGADTSPHVG
>MS1810 unknown
MKNYSETIIIGAGAAGLFCAGQIGKAGKSVTVFDNGKKAGRKILMSGGGF
CNFTNLEVLPSHYLSHNPHFVKSALARFTQWDFIAMVAAQGIAYHEKESG
QLFCDNGAEDIVKMLEARCTENRVSIQLRQRIDLVEAVHNDENARFKIQS
GGQTWYCKNLVIATGGLSMPALGASPFGYQIAEQFGLNVLSPRASLVPFT
YRENDKFLTALSGISLPVRVTAQNGKSFSNNLLFTHRGVSGPAILQISNY
WQPNESVEIDLLPTDSIEEYLSQLKASSPKLQLKTALSRILPKKLVELWF
ERQLLQDETLANLSKVRLKNLENLIHHWQFQPNGTEGYRTAEVTMGGIDT
KEISSKTMESQKVKGLYFIGEVLDVTGWLGGYNFQWAWSSAYACAVGITQ
TE
>MS2108 unknown
MGYRVNSVLGTKFRIWATARLKDYLTKGYAINQQHLSQNAHELEQALALI
QKTAKSSGLTLESVWWTLSAVIRKHFYCLQAAEKR
>MS0083 unknown
MQTHNFGATYQEGIVTEVDAAKHKVRCKIPALEDLETAWLPFLTPNAGGN
QFYCLPDKDELVALLLDARGEGGCVLGAIYNDQDPTPVANAEIWCHKFKN
GTEISHNRKTGDVVVNTKGHVTVTAGAGATINADTVVNGKLHATGKITSG
EEVSAPKVKQGTVELGTHTHGSSPQPNK
>MS1490 unknown
MTRKYWLIIMKNKALVLDLDDTLYAEIDFLYSAYKHIASRLAPERSETLF
NRLVELYHRGENAFQYLVEQYDVDLSTLLDWYRFHVPQIRLFPHVADQLN
RLKEDFRFALITDGRSVTQRNKVKALGIEPLLDFIVISEEVGSEKPSLNN
YRLVQDALHCRDYIYIGDNPKKDFVTPNKLGWKTICLKDRGTNIHRQDFE
ILEEFRPHFYMSDWSELPTFLDF
>MS1630 unknown
MRYFIGSFTFLTANGIIFLTIFRIQPLFRLSALILILLLFSAFISVGLAL
TYKLLKSFINSSILNRTLRAVYPIGMLILVGLSIYNAYTPKVIHYQIELD
KPLKAMRIAVASDFHLGKLFGSEQIDKLARIIEREKADLVLLPGDIMDDN
LNAYLAEQMSSHLAKLKAPLGVYATLGNHDFFGQQQAIADEINKTGIKVL
WDEAVTINNEFVIVGRNDDLNKARPTTKRLLQNVDTNLPVFLMDHRPTEV
TEHSALPIDVQVSGHTHNGQIFPANLIIKAMYRLGYGYEKIADGHFFVTS
GYGFWGIPMRLGSQSEIFIIDVKGKN
>MS0080 unknown
MGNGKMAKLQYPAIIETDKKFTALADLGKRLNSLDKSQIMTSFTYLVPTA
FLELLAEKWSVTGYDGWLLAESEDAKRKLIKRAVELHRYKGTPWAIREII
RQLGFGEVEFLEGLFDKRRDGSFVRDGAYFHGDRSKWAHYRVILKTAITN
EQAALLRKTLRVFAPARCVLASLDYRTVALQHNGKATRNGQYNRGTA
>MS0995 unknown
MNSQVKNMNRKLENIKFVITDVDGVLTDGQLHYDANGEAIKSFHVRDGLG
VKMLMESGIPVAVLSGRDSAILRKRIADLGIKLAFLGKLEKESACYELMK
EVGVTPEETAYIGDDSVDLPAFNVCGVAFAVADAPDYVKDCADYVLDLRG
GKGAFREMSDMILKAQGKTDVYSSAKGFLKIVTNMAQ
>MS0734 unknown
MTTPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGHANISG
YDFIVIDTGGIDGTEEGVEEKMAEQSLLAIEEADVVLFLVDARAGLTPAD
IGIAQYLRQRQNKITVVVANKTDGIDADSHCAEFYQLGLGEIAQIAASQG
RGVTQLMEDVLAPLAEKMKTDESAVENDENSEQEKDEWEHEFDFNSEEDA
ELLDEALAEENEEPENKNIKIAIVGRPNVGKSTLTNRILGEDRVVVYDLP
GTTRDSVYIPMERDGQQYTIIDTAGVRKRGKVHLSVEKFSVIKTLQAIQD
ANVVLLTVDAREGISDQDLSLLGFILNAGRSLVIVVNKWDGLSQYTKDQV
KSELDRRLDFIDFARVHFISALHGSGVGNLFDSVQEAYACATKKMTTSML
TRLLQMATDEHQPPMINGRRIKLKFAHPGGYNPPIIVIHGNKIDKLPDSY
KRYLSNYYRRSLKIVGSPIRLQFQEGSNPFAGKRNKLTPNQLRKRKRLMK
FIKKSKR
>MS0802 unknown
MSLEKRFELIERGSTVRQEIIAGLTTFLAMVYSIIVVPGMLSKAGFPAES
VFIATCLVSGLGSILIGFWANAPMAIGCAISLTAFTAFSLVLGQQVSIPV
ALGAVFLMGAVFTLISATGIRAWILRNLPASIAQGAGIGIGLFLLLIAAN
GVGAVVSNQAGLPVKFGEFTSFPVMMSLIGLAFIIGLEKLQIKGAILWVI
IAITIVGLIFDPNVTFGGEVFKMPSFGEQSLFAALDIQGALQPAILPVVF
ALVMTAVFDATGTIRAVAGQANLLDKDGQIINGGKALTADSVSSLFSGLF
GTAPAAVYIESAAGTAAGGKTGITAIVVGVLFLLMLFFQPLATLVPGYAT
APALMYVGLLMLSNVSKLDFDDFVGAMSGLICAVFIVLTANIVTGIMLGF
AALVIGRIVSGEMKKLNVGTVLIALALVAFYAFGWAI
>MS0122 unknown
MAKKAVRIKAETHEINLQTQDDVALAIKEIGDLEREQVRLSTLQADEKAA
IDEKYTAELTALKDKVKPLQKAVQAYCESRRNELTNGGKQKTAYFTTGEV
QWRAKPPAVIARGIDVILESLRNSGLFRFIRTKEELNKEAMLAEPDIARS
IDGVTIREGVEEFVIKPNDEEVRT
>MS0330 unknown
MSEKIQSVDYDDLRDLVASNDEGGRNPAGFPKKLIVGTAILWSVFQLYYT
SPFPFWLQEVLTQNNIDLNVVVDDTKARSVHLAFALFLAYLSFPALATSP
KHRIPIIDWICATAGAFLGAYYLFFYQSLVTRFGAPNLQDIIAGCIGIVL
LLEATRRSLGLPLAVIAVIFLLYNFFGQYLPTSWIISHRSGSLSQIINQQ
WITTEGVFGVALGVSTKYVFLFVLFGALLDKAGAGNYFIKTAFAYLGHLS
GGPAKAAVVSSALTGLVSGSSIANVVTTGTFTIPMMKRVGFTQEKAGAVE
VASSVNGQLMPPVMGAAAFLMIEYINMPYNELILHAFLPALISYIALVYI
VHLEACKMGLKGLPRTDPAKPFLVTLIRAIGTFLTLCIIYFVLELTLGWL
KTAVPNEAFLIVCLLLLIVYILLIRRVASFPDLEPDDPNAKIVVLPATKP
TVNAGLHYLLPVVVLMWCLMIERMSPGLSAFWGILALSAIIITQRPLLSL
FRKENTDKFIQLKEGVQELIKGLETGARNMIGIGIATATAGIIVGVVSLT
GFGVQLSGIIEILSMGNVLLMLILVAIFSLILGMGLPTTANYIVVSSLMA
LVIVEVGKQNGLIVPMIAVHLFVFYFGIMADVTPPVGLASFAAAAISGGS
PIKTGATAFYYSLRTAILPFLFIFNTDLLLLDVGWAKGILVFITATIGVM
AFTAATMGYFFTKNKKWEGFALILAAFMLFRPGFFMEYVSPTERHIEPAQ
LVQEIENAAAGQNLTIKVAGLNPYGKEIEFYSKLSIPAGENGEEKLKAMG
LTLLNTGEKIQINGNETDKILIDNVEIDSPAAKAGLNWDQTIIDVEVPKN
SLPKELMFIPALLLVSALAWNQRRRRNS
>MS0701 unknown
MLTNEVVISILVLLILSLLRINVVIALVISALTAGLVGGLGITKTIETFT
GGLGGGAEVAMNYAILGAFAVAISKSGITDLLAYKVIKRLGNRPTGSSIA
GFKYFILAVLVAFSISSQNLLPVHIAFIPIVVPPLLSIFNKLKLDRRAVA
CVLTFGLTATYMLLPVGFGKIFIESILVKNINEVGAALGLQTSVAQVSMA
MSIPVLGMILGLCTAIFISYRKPREYIVKIAEPTTAEIEQHIANIKPFHV
MASIVAVLVTFGLQLFTSSTIIGGLAGLIIFAVCGIFKLKESNDIFQQGL
RLMAMIGFVMIAASGFANVINSTGGVTELVNSFSQSVGADNKGIAAFLML
VIGLFITMGIGSSFSTVPIITSIYVPLCLTLGFSPLATVAIVGVAAALGD
AGSPASDSTLGPTSGLNMDGRHDHIWDSVVPTFLHFNIPLLVFGWFAAMT
L
>MS2120 unknown
MQTKPFGKHPEGQRLARIEQSVHYKAGKFVNHLPTEVQTSDKPLWKIWYD
FLFQQIDHLTPNRPLPVVKTDLQQLSREKNFIVWFGHSSYLIQLDGKRFL
VDPVLVSGSPLSFANKMFQGTNLYQPQDMPDFDYLVITHDHWDHLDYEAV
IQLKNKMKEKVITSLGVGAHLEYWGYPAERIIEMDWNEKTELENHFKITA
LPARHFSGRGVVRNKTLWSSFMLEVPGETIYLGGDSGYDPIYQEIGQRFN
ISLALMENGQYNKDWANIHIQPEQLTLAVKALRPKRLMTVHNAKFALARH
DWRAPLEQIYRNAQKENFNLFTPKIGDVFYFSEQGEADSPNFREPWWQSV
E
>MS1548 unknown
MMNTLDRYIGKSILGAIFATLLTLVGLSGIIKFVEQFRSVGKGSYDSMQA
FLYTVLTMPKDIETFFPMAALLGALIALGNLASRSELVVMQSAGFSRMKI
GFAVMKTALPLVLLTMVIGEWGIPQTEQFARDMRSKAISGGSMLSVKNGI
WAKDGNDFIYIKRATEDANLNNIYIYSFNDNRQLQRVSHANKASYENGSW
VLKQVNESQISADEIKTKNYLNRPWKTSLTPDKLGIFTVKPTSLSISGLS
SYISFLKETGQDSKKFELTYWRKLFQPISVGVMMMLALSFIFGPLRSVTA
GARIVTGICFGFVFYVINEIFGPLSLVYNVAPIIGALMPSLLFLVITWWL
LSRKRD
>MS1376 unknown
MKVTSSAIKNGAFEDKYGKRGSQFTPNGMPSYSIPFEITGAPEGTKSFAV
VLEDKDAVTASGFVWIHWLIANLERTSVLENESQTATDFIQGANSWSSVL
AKLDITEASAYGGMAPPNCLHRYELFVYALDTKLDLQPGFKFNELHFAMQ
GHILAKAEIMGTYDV
>MS2382 unknown
MRMTKSNSTRETFSGRRAFIFAAIGSAVGLGNIWRFPYTTYENGGGAFII
PYLIALLTAGIPLLFLDYAIGHRHRGGAPLSYRRFSKHFEAFGWWQVMVN
VIIGLYYAVVLGWAATYTYFSFTMAWGDKPIDFFIGEFLKMGDITQGVSL
EFVGMVVGPLIAVWLVALGVLALGVQKGIARTSSILMPVLVIMFLILVIS
SLFLPGAAKGLDALFTPDWSKLSNPSVWIAAYGQIFFSLSICFGIMITYA
SYLKKEFDLTGSGLVVGFANSSFELLAGIGVFAALGFMAAASGHEVSEVA
KGGIGLAFFAFPTIINEAPFGQILGVLFFGSLTFAALTSFISVIEVIISA
VQDKLRIRRAKVTFIVGVPMMIVSTLLFGTTTGLPVLDVMDKFVNYFGIV
AVAFVSLIAIVANEKLGLLGDHLNETSSFKVGFIWRLCIVITTGILAFML
FSEGAKVFAEGYEGYPSWFVNSFGWGMAVMLVIVAVLLSRLKWKNEVQVS
GE
>MS1215 unknown
MDRFSRMWRKLQKSAVCFLLIFVTVNVWAADFPGSPNPFRYVNDYTNTLS
ENDKNYLENKLINFSRETSSQIAVVMVKTTGEYAISDYAFTLGDNWGIGR
KQLNNGVLLLVAKEDRKVFIATGQGLEGALPDAFLSQIIRRVILPNFRQE
QYASGINGALDYIIAASKGEYDAAAEQNDEGFEQYIPFLMVLVFVLFVLF
GELNGRRKPYISPTTNHQLEQVILQSARRRRGNSGGFGSGGFGGFGGGGS
SGGGFGGGGFGGGGAGGSW
>MS0288 unknown
MSVKKSNLSRPNWLAISRSERVVGTNEKVLGKRIRTLGIHQRYGIMISRL
NRAGVELVPTADSILQFGDVLHMVGNVETMDAAISIIGNAKQKLQQVQML
PVFIGICLGVLLGSLPIHIPGFPVALKLGLAGGPLVVALILARIGSIGKL
YWFMPPSANLALREIGIVLFLTVVGLKSGGNFVNTLTQGDGVTWMGYGVL
ITFVPLMAVGIIARIYAKMNYLSICGLLAGSMTDPPALAFANAIKEENGA
AALSYATVYPLVMFLRIISPQLLAILLWVA
>MS0082 unknown
MGAMNTQIQSTHWQLAPETDGVSVVSGVDDIHLCIANILSTQKGTDILRP
EFGSDHFKFIDYPEDVAVPNFVREITQALQKWENRIVIDEVLVDGEAPHF
TFTVSWSLTDDVYREIYRTQVQQ
>MS1911 unknown
MSITVNQIVLHQIIKPASANIPANNNNETENGETATQNTQLETVLRQELL
PITAEAEQFMLELHQAYQNKTKGYGVFQEQSRFAQSLNRLLERETDFLPF
SYEAAKLLSSELAKYAFAESGTFVLCRYNFLATDYLFIALLDSKASVLVD
EKLEIHRTQYLNINQFDIAARINLTDLRVNANSNRYLTFIKGRVGRKIGD
FFMDFLGADEGLNPQVQNQCLLQAVSDYCQKGELSAEQSQAVKKQVFDYC
KGQINAGDEIELTELSETIPTLNQQPFADFAAEQDYGLENNIPPVRSALK
SLTKFSGSGKGVTISFDAELLDKRIYWDDMQDTLTIHGLPANLKDQLQRL
LKNHN
>MS1402 unknown
MRYNSSFQITLSRQMNKPNITIQPIQASHYADYVALIGKQLGEGYFKQAD
FEALANNPQAICFEAVDEQNQVVGVITSVTLDRESALALLKIQAQNTPDY
VLQSDRIGIFKTIAIDENRKGCGIGSALVRKLLESFKQAGLNSIACVAWQ
YGETENIRGIMQAFDFTCYEKIANYWLDDPEPFICPACGEPPCRCQANIY
FRQI
>MS2135 unknown
MLQLRKSNERGHANHGWLDSYHTFSFADYFDRNHMHFSDLRVINEDFIQP
TMGFGTHPHKDMEILTYVLQGAIAHKDSMGNVKTFTAGEFQIMSAGTGIY
HSEFNPSESELLHLLQIWIMPNELGVSPRYDQKQFADKEGATLILSPDAE
GESFKVYQDMKLWRHQYKAHQKVELGLNSRRNYWLQVVKGNLTVNDIALA
TSDALGISAEELATIETSDEVEFLLFDLR
>MS1114 unknown
MNKVSLLTLLIGGALAVQYANGSPIDERRENIIKYSRLGDGQLVEGTKQL
IDLYNKTKDKKVRDDLITLLVRQNRDAEALSISETYKLTDFSSNELEYLA
RAARNERQFSKSLAFYNQLNNLDTKNPNGLLGLALVSTDMAKFEQSKLYL
SRYKHRFGTDEQYNQANAYFLDSSEPLITRFHRWNSELDTNPNDIELVKK
LYRLAAQLNISPVQEQLIAKYPEVFTDNDKSWLLHDQAVRISKNSPNKQQ
LNTAYSMLDKVYIKVPEDNSLKQQSLQDMVVVGSKLKNDDSNRAKNSYEL
LTESNQPIPNYVKEAYADYLVASGSPFAALSLYKEVEQSHLAEGGEVPFT
LGIKIVQALNDAAKYPEARDYLENNIGEPSLMVLDFTRSRKIENPDYGNY
FSTKVSSLVAQGDLSSAMQLIDERLSVTPGDGWIMLTKAELEAARARTDD
AADWVHKAQAFLPEDTAWAEVAQANLALSVNDWRTASRLVNTWTTEEKDN
ANWFMEQYDQAKSARLVASGGISHRTSPAGENESNQEYYLYSPKTDDGHD
VYIHYLTTKSPDDGLPFEQQRVGAGVEANFYPFMVNAEAGKGIKLNDKAY
FAATIQYQLNQHWQFSLNGGLNSANTPIKAIYQDTYAKDLGFSVNYKYSD
RFEAGAGITAMKFDDENLRKNLSFWSNFNLFKHNRWNLNGSLYGSYERNK
AIPGAYYYNPLKSRSLEDNFDLSYYQPFDHSITLTHHFKAGGGYYWQDSF
ASSKTWSVAYGQEWRLGKKLNISYDVGRKRSIYDGSPEFNNFINLTLSVS
F
>MS1552 abgB, AbgB protein
MELTQQQLVQWRREFHRFPETGWAEFWTTSRIADYLEQMGFEILLGNQII
NRDFVRGRQQAVVEKGLANAVAYGAKQKWLEKMDGYTGCVAVLDSGKPGK
TLALRFDIDCVNVMETKAPEHIPNKEDFASLNDGFMHACGHDGHITIGLG
TALWLSQNKDKLSGKVKIVFQPAEEGVRGAAAIAASGVIDDADYFSASHI
GFCADSGTVISNPKNFLSTTKIDIRYQGKPAHAGAAPHLGRNALLAAAHA
VTQLHGISRHGEGMTRINVGVLKAGEGRNVIPSKAEIQLEVRGENKAVNQ
YMVDQVMRIANGIAVSFDVEYETEIMGEAVDMINDTELVGLVEEIVLAHP
KVHSANANYAFNASEDATVLGRRVQEQGGKAIYFVLGADRTAGHHEAEFD
FDEDQLMNGVNIYTALVQRLLG
>MS2075 ara1, ARA1 protein
MLTFVKQGLELGVDTLDHAACYGAFTSEAEFGRALALDKSLRAQLTLVTK
CGILYPNEELPDIKSHHYDNSYRHIMWSAQRSIEKLQCDYLDVLLIHRLS
PCADPEQIARAFDELYQTGKVRYFGVSNYTPAKFAMLQSYVNQPLITNQI
EISPLHRQAFDDGTLDFLLEKRIQPMAWSPLAGGRLFNQDENSRAVQKTL
LEIGETKGETRLDTLAYAWLLAHPAKIMPVMGSGKIERVKSAADALRISF
TEEEWIKVYVAAQGRDIP
>MS1209 ara1, ARA1 protein
MQWLKYKHRCKYSLGGNKMQTFKLNNGVEIPVLGFGVFQIPPEETEQAVI
SAIHAGYRHIDTAQAYMNETETGAGIRNSGVVREEIFVTSKVWIENYGYE
AAKASLDRTLARLDIGYIDLMLLHQPFNDVYGAWRALEEYLAAGKIRAIG
LSNFTADRVLDVGLYNKVMPAVNQIEINPFHQQQAQVEGLLSEGIVPEAW
GPFAEGKFGIFENPVLAKIGQKYGKSIAQVVTRWLVQRGVVVLAKSTRPE
RMAENLNVFDFELDADDFAQIAALDVGKSQIISHTDLAMVRQFKEWVFNV
>MS0687 ara1, ARA1 protein
MKKITLKNGDKLTLLGMGTWFIGDNAHYRQEEIAALRYGIEHGINLIDTA
EMYGNGRAERLIGEAIAPYDRNSLYLISKVLPNNANKRKMEQACNNSLKA
LNTDYLDMYLYHWRGTTPLAETVECLEALKNKGKIKAWGVSNFDLEDMQE
LLALPNGNQCQLNEVLFHLGSRGIEYALKPYQDKLAIPTVAYCPLAQAGS
LQRNLLRHPEVTTIAEELNCTPYQLLLLFVLAQPNMIAIPKAGQVRHMKE
NIACLDMQLTQQQLARLNNAFPSPTHRIHLDIV
>MS0518 ccmC, CcmC protein
MVTGLRIMSFALFSALFYIISILFIAPMLAKAQSGEQIQRPNKNWFILTA
LFAVICHFISLFPFFSNLFSGENFTLMEIGSLISVLIAILATVAIALKIK
TFWFLLPIIYCFATINVTLAAFAPSHVIQNLAQDLGLLLHILLAMFAYAV
CFIAMLQSIQLAWLDRKLKTKQMVISPLLPPLMMVERHFFRVMLSGEILL
TLTLLTGAVYLADFFGNENIQKAIFSFLAWIVYAVLLIGHWKYRWRGKKM
IIYTISGMILLTIAYFGSRAMLGMN
>MS2235 cof, Cof protein
MTIPNLRDKIKIVFFDIDETLIMKFEDILPDSVLPVIRKLKQNGIIPAIA
TGRSRCSLPTKIKALIAEEPIELFVTMNGQFSVFQNKVIEKHPIPTEKVQ
HLVDFFDAQQIDYAFVSDNNVAVSKITAKQKSALDPILTDYIVDKDYFKH
NEVFQLLPFYDQSQDELVKNANILDGLRVVRWDKDSVDLFDAEGSKARGI
ASAIKRLGFEMENVMAFGDGLNDLEMLSTVGVGVAMGNARDELKKVADFV
TDRIEDHGIYNFLVKAGLIED
>MS2344 cof, Cof protein
MQYKAIFSDIDGTLLNSRHQISSKTESVIKLAVSKGIPFIPVSARPPYAI
TPYTEQLQTNQGIICYSGALILDKNLRELYSVQIDQADLAALNQILADYP
YLSINHYAALDWFSNDLDNYWTKQEADITGLFPKQTPSNLTKVHKILVMG
EADKIKPLEQKLKQKLPHLSIHLSKPEYIEIMNKAATKAKAIGFMERHLH
VSADEVIAFGDNFNDLDMLEYAGLSVAMGNAPDEIKQVAKKVTASNDEDG
IALVLNEIFNL
>MS2225 cof, Cof protein
MKQLPFRAIVSDMDGTLLNANHVVGDFTINTLEKLAQKGVDIVMATGRGY
TDVASTLSKMKIKNAAMITSNGAQIHDLQGNRLYSNYLPEDVAFEVMQLP
FDADRVCMNTYQNNDWFINIDLPQLRKYHQTSGFMYEVVDFKKHHGRDTE
KVFFIGKKPADLMEIEQELTTRFGNYATITYSTPVCLEVMNKNVSKATAL
AHLIEQREYSLSDCIAFGDGMNDIEMLTEVGKGCIMQNADPRLLQLLPDN
ERIGLNKDESVASYVRAVFGIY
>MS0842 cof, Cof protein
MAYQVLAFDLDGTLLNSQGIILPSSKKAIEAARAKGMQVILVTGRHHTAV
KPYYYELNLETPIVCCNGTYLYQPQTDEVLRSNPFSKTQALQLIDIAERQ
KIHILMYSRNAMNYMELNPHMEKFQKWVQSCPQNVRPDVRQVSSFRDIVN
NEDIIWKFVMSAPNRELMQQTVNMLPQDQFSCEWSWIDRVDISNKGNTKG
SRLLEYLRSVNMNPEQVVAFGDNQNDLSMLTSVGLGVAMGNADEIVKQQA
KCIIGTNNENSIADFIEGLK
>MS0931 comEC, ComEC protein
MMKLDLFLFCFIVNTLCLLVLPESFLLDFPLFLHFLFPLVIAAFIYWFKY
RRLWRGFYYLFCGLIAVFYIHFQALSLFRAADGVKYLPAKVQTDFVIDEI
LYQRDYRNIIVKAQLAPEFKPQRIYVNWQADQAVKTGEKWRGELHLRAVS
SRLNYGGFDKQKWYYAQGITAWAKVKSAVKISEDLSLRQQLFNHYLAQTE
RLRQQGLLMALAFGERAWLQEDVWQIYRKTNTAHLIAISGLHIGLAMLLG
MGVARLIQFCLPTRYISPYFPMLSGLVFAAVYAGLAGFAIPTLRALIALV
IVSLLKLLRGYCNVWQLFLRVIGVLFIFDPLMVLSNSFWLSVCAVFSLIL
WYQIFPLNLLEWKGKSVTDGKFAWLFGLIHLQLGLFCLFSPMQLMTFQGI
SLAGFWANLIIVPLFSFLLVPVILFALFSNGAWESWRIADWLAQWFTHLL
SYFQDYWIGVSNQTSWLICCLLCLLLLTVVHFIYPLKKQIPEKNELLTQF
KTKKISLKSDRTLSPVLRKYLVSVATLFLASGAMLWLYQQWRQPDWRFET
LDVGQGLANLIVKDGRAVLYDTGAGWKNGSMAQSEIIPYLQRQGLILEKV
ILSHDDNDHSGGIADILQAYPSINILQPSMVNYEKTEQNSFNFDRTFCKQ
GLNWQWHGLNFQVLAPAKIAERANNTDSCVLLIDDGQYKLLLTGDADLAA
EQQFVAHLGKVNVLQVGHHGSRTSTGEALIKQIKPDFALISAGRWNQWGF
PHPVVTQRLKRHKSAVYNTAFSGQISFEFYPNKIEVKTARSNYQPWFRQI
VGGERD
>MS2234 comFC, ComFC protein
MNWFAFRCIYCQRKLAIGSHGLCCSCNKQIRRFNYCGVCGSELAENTLGC
GNCLQNRPAWHRMVIIGAYKMPLSSLIHRFKFQNSFYFDRTLARLLYLAI
RDARRTHGLMLPEVIIPVPLHHFRHWRRGYNQADLLAGQLAKWLNIPCNN
RLIKRVKHTRTQRGLSAAARRVNLQKAFRFADKKQACPYKSVALVDDVIT
TGSTLNALAGLFVQQGVEQIQVWGLARA
>MS1002 cvpA, CvpA protein
MIDYIIIGIIVFSIVVSLLRGFVREVMSLASWVVAFVIASQFYPYLANFL
TQIESEYLRNGTAIGILFILTLIVGAIVNYVIGQLVDKTGLSGTDRVLGA
CFGFLRGVLIVSALLFFVDTFTNFDQNDMWKESKLIPHFGFVVEWFFEQL
QANSSFLNSTLNK
>MS2216 dcuB, DcuB protein
MSAMFLIQFAIVLLCILMGARAGGIGLGVFGGLGLAILSFGFGLKPAGLP
IDVMFMIMAVVSAAAAMQAAGGLDYMIKIATNILRRNPKYITFMAPAVTW
LFTFLAGTGHVAYSVLPVIAEVARHNGVRPERPLSMAVIASQFAIVASPI
AAAVVAVVAYLEPQGITLANVLSVTIPATLLGIFLACVFVNKIGVELKDD
PEYQRRLQDPEYVKANHADVNMDEIQLKPTAKLSVGLFLLGALLVVVMGA
LPELRPSFDGKPMGMAHTIEIVMLTIGALIIFTCKPDGTEITRGSVFHAG
MRAVIAIFGIAWLGDTLMQAHMDEVKGMVSGLVETAPWAFALALFILSIL
VNSQGATVATLFPLGIALGIPAPILIGVFVAVNGYFFIPNYGPIIASIDF
DTTGTTRIGKFIFNHSFMLPGLLSMAFSLGFGLLFANMFL
>MS1877 dcuB, DcuB protein
MLYLEFLFLLLMLYTGSRFGGIGLGVISGIGLVIEVFILRMPLGKAPIDV
MLVILAVVTCASILEAAGGLKYMLQIAERVLRSNPKRVTILAPMVTYVMT
FMLGTGHSVYSVMPIIGDIALKNKIRPERPMAVSSVASQLAITSSPLSAA
IAYYLTQITKMPGYEHITLLNIISVTVPATFVGTMAMALYSLRRGKELED
DPEYQRRLKDPTWRDRILNTTATSLDAELPRSAKMAVWLFVLSLVTVVVI
AMLPEIRTVGVPVDGKPVKAISMSFIIQMMMLCFGGIILIATKTNPQSVP
NGVVFKSGMVACIAIYGIAWMSDTYFSYAMPEFKAAVTTMVESYPWTFAF
ALFAVSVVINSQAATAVMMLPVGISLGLPAPVLVGLIPATYAYFFIPNYP
SDIATVNFDVTGTTKIGKYYFNHSFMIPGLIGVTTACLVGYALAHMIIV
>MS0689 dgoA, DgoA protein
MSTPVITEMQVIPVAGHDSMLLNLSGAHSPYFTRNIVILKDNSGNTGIGE
VPGGEKIRQTLEDAKPLVIGKTLGEYKNVMNTVRQTFNDRDAGGRGLQTF
DLRTTIHVVTAVEAAMLDLLGQHLGVTVASLLGDGQQRDAVEMLGYLFFV
GDRKKTNLAYQSQENDLCDWYRVRHEEAMTPESVVRLAEAAYEKYGFNDF
KLKGGVLDGFEEAEAVTALAKRFPQARITLDPNGAWSLDEAIKIGKQLKG
VLAYAEDPCGAEQGYSGREIMAEFRRATGLPTATNMIATDWRQMGHTISL
QSVDIPLADPHFWTMQGSVRVAQMCNEWGLTWGSHSNNHFDISLAMFTHV
AAAAPGDITAIDTHWIWQEGNQRLTKEPLQIKGGLVEVPKKPGLGIEIDM
DQVMKANELYKSMGLGARDDAMAMQFLIPGWTFDNKRPCLVR
>MS1568 dltE, DltE protein
MAILITGASAGFGKAACITLVKAGYKVIGAARRLEKLTELKQQLGENFYP
LQMDVSQTAEIDSALASLPADWAEIELLVNNAGLALGLEPAYKVNFDDWL
TMINTNIIGLTYLTRQILPQMVERNKGHIINLGSIAGTYPYPGGNVYGAT
KAFVKQFSLNLRADLAGTAVRVSNIEPGLCGGTEFSNVRFKGDDEKAANV
YKNTLSIQPEDIANTILWIYQQPAHVNINRIEIMPISQSSGALNVVRE
>MS2336 dmsC, DmsC protein
MATIIVIYQGFGLSQIHSSAQQAVALVPDFAVNQVIRLCLLAAAGMVLLK
SKQPLLLSIAVILALFAEMIGRELFYSLHMTVGMA
>MS0266 elaA, ElaA protein
MLFMRIHPMNWQCKTFNQLSNIELYQILQLRSDVFVIEQQCIYRDMDNKD
LLASHLFLSKDNQIVAYCRLLPKGVSVADAAIGRVIIHEKYRGRHLAHKM
MGKAIDIIIHEWHENKIYVQAQEYLQGFYQSLGFKATSDVYLEDEIPHLD
MYWES
>MS0367 era, Era protein
MTETKPENIVQHNETTAAEQETYCGFVAIVGRPNVGKSTLLNKILGQKIS
ITSRKAQTTRHRIVGIHTEGPYQAIYVDTPGLHIEEKRAINRLMNRAASS
AISDVDLIIFVVDGIHWNADDEMVLNKLRASKAPVVLAINKIDNIKNKDE
LLPFITELSGKFNFKEIIPISAQRGNNVHNLQKVVRQSLRKGVHHFPEDY
VTDRSQRFMASEIIREKLMRFMGEELPYSVTVEIEQFKMNERGTYEINGL
ILVEREGQKKMVIGQGGQKIKTVGIEARADMERLFDNKVHLELWVKVKSG
WADDERALRSLGYMEEY
>MS1874 fabG, FabG protein
MQGKIALVTGATRGIGRAIAEELATKGAFVIGTATLEKGAESISAYLGEK
GKGFVLNVADQESIESVLEQIKKEFGDIDILVNNAGITRDNLLMRMKDDE
WFDIIQTNLTSVYRLSKAMLRTMMKKRFGRIITIGSVVGSSGNPGQSNYC
AAKAGLIGFSKGLAKEVASRGITVNVVAPGFIATDMTEVLTEEQKAGILA
NVPAGHLGEPKDIAKAVAFLASEDAGYITGTTLHVNGGLYMA
>MS0543 fabG, FabG protein
MITMPFNFWYMEKDKMTLAKKHNFKDKVVVITGAGGVLCAYFAKEIAKTG
AKVALLDINLESAQKFADEINAQGYIAKAYKTNVLELDSIKQTRDAIAAD
FGTCDILINGAGGNNPKATTDNEFHELDLPPTTKSFFDLDKSGIEFVFNL
NYLGTLLPTQVFAKDMVGKKGANIINISSMNAYTPLTKIPAYSGAKAAIS
NFTQWLAVHFSHVGIRCNAIAPGFLVSNQNRALLFDEQDNPTARAHKILT
NTPMGRFGEAKELMGGILFLMDEEYASFINGVVLPIDGGFSAYSGV
>MS2145 fabG, FabG protein
MQRFEQKTALVTGAGTGIGQAIAVRLAQEGAKVLVVGRTEKTLQETTALH
PNIAYAVADIEKDDDVQKIVQQLNQKYGGLDILINNAGWAPVTPISQVKI
EEYDKVFGINVRALVNLTLQCLPMLKARKGNIINMSSAICRNHLPNMSMY
AGTKAAVEIFTKIWAKELGADGVRVNSISVGPIETPIYDKTDLSNDGIQD
HIDRIRKTIPLGAFGKSEDVANVTAFLASDEARFITGSDYSVDGGFGA
>MS2144 fabG, FabG protein
MNNMKKLLILVGAGKGLGNAIAKEFASHDFRVALIARNAENLTAYRQEFQ
ALGYEVMTQVADALYPETLTKAINAIQAEWGTCDALVYNVGITELDNDRP
ITNELLMQRYQIDAASAYHCAMLVATPEFAAKQGAIIFTGGGFAKTFQPI
LALKPLCIDKAALNAMNIVLHHLLAPQGIFVGSVLVSNVIQPNDPKYAPD
VIAKAYWKMYCERDEFELLY
>MS0563 fabG, FabG protein
MVNTLLIHFLHRRIYMNLFDLTGKVALVTGCNTGLGQGMALGLAQAGCDI
VGVNLVEPLDTKEKIEALGRKFVNIEANLMKQEGLTDVVEKAVSVFGKID
ILVNNAGIIRREDAIDFSEQNWDDVININLKTVFFLSQLVAKQFIAQGHG
GKIINVASMLSFQGGIRVPSYTASKSAIMGITRAMANEWAKYNINVNAVA
PGYMATDNTAALRADEARSKEILDRIPAGRWGTPNDLVGPCVFLASAAGD
YVNGYTVAVDGGWLAR
>MS0955 fabG, FabG protein
MIKIIFLKCNFHLNEEQKMSELFSLKNKRILITGSTRGIGNLLANGLAEH
GAEIIIHGTRLETAEKIAADFNTKGFKAYAVAFDVTDSKAAQDTIDYIEK
EIGPIDVLINNAGIQRRYPFCEFPEKDYDDVISVNQKAVFIISQAVARYM
VKRQRGKIINIGSMQSELGRDTITPYAASKGAVKMLTRGMCVELARYNIQ
VNGIAPGYFATELTKPLVENQEFTSWLCKRTPAGRWGDPKELIGAAVFLS
SKASDFVNGHLLFVDGGMLAAV
>MS1412 fabG, FabG protein
MSILEKMKLTGKTAFVTGGARGIGKSVAIAFAQAGANVVIADFDIAEAEK
TAAEIAKEEGVKSIAVQTDVTDQASVNHLMDVIKQQFGKLDIAFCNAGIC
INVPAEEMSYEQWLKVINVNLNGVFLTAQAAGKLMIEQGTGGSIINTASM
SAHIVNVPQPQCAYNASKAGVIQLTKSLAIEWAKHNIRVNSLSPGYIGTE
LTLNSKDLQPLIKEWNAMAPLHRLGKPEELQSICVYLAGDTSSFTTGADF
IVDGAFTCF
>MS2175 fabG, FabG protein
MFKKIILTLFSGLIFTEVTMAQTKYGVGSYNTEEVAAEMEYIEKHIRPLN
PKPTKRIFITGSSAGIGELTAKMLLAKGYEVVAHARDAKRAADVKRDLPE
IKHVVIGDLAKPDEVDKIADQVNALGRFDVIIHNAGVYRGENIFQINLLA
PYVLTAKITQPQTLIYVSSNMHNGGELRLDAFNAGNVGYSDSKLQLLTLA
KSLAVRWSKVRVNAMHPGWVGTKMSGGSAPDPLRQAYETLVWLAEGTDPA
AQTSGGYFFNKQPDSHYRRDSEDSAQQAVLWQALEKITGVKLPE
>MS1421 fabG, FabG protein
MKLQNKVALVTGGGTGIGRAIAKQMAEAGATVIIIGRREAQLQESARQHA
NIHYIVADVLNSDDITRTLNEIQQRFGKLDVVVNNAGIAPVTPIENVNLA
DFDRTFALNVRAVIDVTSQAIPYLKSTQGNIINITSGLVNNPMPMNSIYT
ASKAAVLSMTRTWAKELAPYGIRVNSVAAGATKTPLYDGLGLSETEAKDY
EATVEHIVPLGRFAEPDEIAPAVVFLASDDARYATGAHYGVDGGFGI
>MS2163 fabG, FabG protein
MNNIQGKVVIITGASSGIGEATAYKLAEQGAKIVLAARREAQLKAIADNI
KAKGGEAVYRVTDVVKPEDNQALVELAKSAFGKVDAIFLNAGLMPSAPLS
ALETDNWNRMIDVNIKGVLNGIAAVLPTFEAQKSGHVLATSSVAGLKVYP
GGTVYCGTKWAVKAIMEGLRMESAQAGTNIRTATIYPAAVQSELVAGITD
ETTSQGYRQLYDTYEIPAERVANVVAFALSQPDDTNVSEFTIGPTTQPW
>MS1406 fabG, FabG protein
MWELQRSKKMKQKEVIVAIGSGSIAQAIARRVSIGKQVLLADIKLENAEA
AAKTLREAGFEVSTTVVDVSSRASVQALVQTAVDLGAVKGVIHTAGLSPS
QASPEAILKVDLYGTAVVFEEFGKVIAAGGSAVVIGSQSSHRLAIDEISQ
AQADELATLEPEKLLELPLVQEINDSLRAYQISKRGNALRVQAEAVKWGK
RGARINCISAGIIYTPLAYDELTSSERGEFYRNMLAKSPAGRGGTPDEIG
ALAEFLFNSSYISGSDILIDGGVTASYKYGELKPA
>MS0719 fcbC, FcbC protein
MNNTFQFPVRVYYEDTDAGGVVYHARYLHFFERARTEFLRTLNFSQNQLL
HEQNIAFVVKSMTIDYRFPACLDDALIVESEVVEVKGATILFSQILKRDE
LVLTTATVKVACVDLGKMKPAALPAEVKAAISK
>MS0457 fxsA, FxsA protein
MPIIFIITLIAFLFIYGELSLLIAIGSAIGAFGVIMLLLLSVFIGGVILK
SKGLFGLNFRRQIAQGEIPADSVVKSLLWMIAGILFIIPGFITDLLACLL
LLLPSGLFEKWISQKFTVINSGFTAQGFGRHSHRYRYYKDQNTEVFEAEY
EKEVDEKKRIK
>MS0946 gloB, GloB protein
MLVPIPALNDNYIWLYGRENLPVIAIDVAECKNLSAYLTQHHLQLEAVLL
THYHDDHTGGVEELKRYYPDIPVYGPAETADKGATHIVNEGNIQTAHYRI
EVVPSGGHTANHVSYLIDNHLFCGDTLFSAGCGRVFTGDYGQMFESITRL
KQLPDKTVICPAHEYTLSNLVFAEAFAPNEKVKSAVKNQRISVESLRAQN
KPSLPTTLALEKNINPFLQAENLADFIYLRKAKDNF
>MS0824 gloB, GloB protein
MNIDIIPVTSFQQNCSLIWDDRKNAAIIDPGGEPKKLIEKIEENGLDLKM
ILLTHGHLDHIGAAPALKAHFGVDIIGPHEDDVFWFENLPQQSAQFGLFE
ANAFLPDMWLNRENEVLEVGSLKLEVLHLPGHTPGHVGFFEHQNIVAFTG
DVLFRNSIGRTDFPGGSYDDLISSIKEKLFPLGDDWIIIPGHGPYTTIGA
EKKTNPYLK
>MS2011 gloB, GloB protein
MKKLVLTTLISATLGLSAIAAHAHPTYAPAKNAVKMQKTQVPGYFRQMVG
DYEVTALYDGVGNLDMSLMAPFTQFSKAELDAMLDDEFAQRSELGGLEGT
IIGFLVNTGDNLILIDAGKGEAEAPIFLDKQGRLIDSLKAAGYQPEQVDI
ILPTHMHADHINGITEKGKRVFKNATVYLPLQEKAFWLDTPMDKLPSEIH
PFIEAARYAVAPYLKADKVKFYNAGDEVFAGVKTVPLFGHTPGHSGFEFT
SKGEKILFWGDVMHNGAVQMAHPEVAIEFDADAEAARTNRQTILTKIAAD
KTLIAAAHLPFPGLGHIKTEKDGKGYRWYPVQYRPFDKH
>MS2185 glpG, GlpG protein
MQLLFRSEIPSFAWQFRDYIRKKYQIELILQQEKTDMRQNVIAVYLSGNS
EQTAAILQDLAEFHRNPFDERYERASWETGDVSSGSHSLKELAENSSQGI
KQQLLKTGPVTLLITLICIIVYGFEISGMAEQIMQFAHFPYEFGENQQIW
RYFTHSLVHLSSMHITFNLVWWWIFGGAIERYFGSTKLIIIYVLAAFATG
VTQNFASGPHFFGLSGVVYAVLGYVFVADKFSPNNRFNLPSGFFNVLIIG
IALGFVTPLIGIKMGNTAHITGLLVGLILAFLQEKIGKKSK
>MS0731 gltD, GltD protein
MAKFFLAPADNYDVKIGELVDKFVNKVRSFPPGTCPLVVQYASLRSSMSQ
TCGKCVPCRDGIPHLSFLLRDILAGEGDDSTMRQIRELAEMIRDGSDCAI
GYQPAIEILDSIEEFKEEYESHIHNKSCQKVIGQRIPCINMCPAHVDIPG
YIAHIGDGNYAEAINLIRKDNPLPTACGLVCEHPCEERCRRRLIDDAINI
RGLKKYAVDQVAADVVKVPQALPDTGKKVAVIGGGPAGLTCAYFLAQMGH
RVTIYERQKALGGMLRYGIPNYRFPKDRLDQDLNAILSAGRIEVKYGVMV
GDDIAIEDIYNSHDAMFVGIGAQKGKTLRIKGSEANNVFSAVEMLDDIGN
GKIPDYTDKVVVVIGGGNVAMDAARSAVRCKAKDVRIVYRRRQDDMTALH
AEIEAAIMEGIELITLAAPVAIEKDEQGNCTGLTVQPQMTGPYDHGGRPS
PVAVKKPPFTIGCDVILIAVGQDIISLPFEEFGMPANRGIFQADLTTAVP
DMDGVFVGGDCATGPATAIKAIAAGKVAAHNIDEYLGYHHEFPCETKAPP
PKENVRIQVGRANTTERPAYIRKCDFEHVENPYTYEEAMQEAERCLRCDH
FGCGVLQGGRDL
>MS2331 gph, Gph protein
MNSQFKLIGFDLDGTLVNSLPDLALSVNSALAEFELPQAPEELVLTWIGN
GADILIGRALDWAKEQSGKSLTDEQTAQLKERFSFYYAENLCNVSRLYPN
VKETLETLKEQGFILAVVTNKPTRHVQPVLKAFAIDHLFSETLGGQSLPA
IKPHPAPLYYLCGKFGLYPHQILFVGDSRNDILAAHSAGCTAVGLTYGYN
YNMPIADSHPDWIFEDFADLLKIV
>MS0774 guaB, GuaB protein
MLRIKQEALTFDDVLLVPAHSTVLPNTANLSTQLTKEIRLNIPMLSAAMD
TVTETKLAISLAQEGGIGFIHKNMSIERQADRVRKVKKFESGVVSEPVTV
FPELSLGELAQLVKKNGFAGYPVIDQNDNLVGIITARDTRFVKDLNKTVA
EVMTPKEKLVTVKEGAKREDIIALMHSHRVEKVLVVDDNFKLKGMITVKD
FQKAEQKPNACKDELGRLRVGAAVGAGPGNEERIDALVKAGVDVLLIDSS
HGHSEGVLQRVRETRAKYPNLPIVAGNIATAEGAIALADAGASAVKVGIG
PGSICTTRIVTGVGVPQITAISDAAAALEGRGIPVIADGGIRFSGDIAKA
IAAGASCVMVGSMFAGTEEAPGEIELYQGRSYKSYRGMGSLSAMSQGSSD
RYFQSDNAADKLVPEGIEGRIAYKGLLKDIIHQQMGGLRSCMGLTGSATI
EDLRTKSQFVRISGAGIKESHVHDVTITKEAPNYRLG
>MS0996 gutQ, GutQ protein
MDYLQNARETLATEKDALTLLSRNLDQSFNNVIDLILNCGGRLVIGGIGK
SGLIGRKMVATFASTGTPSFFLHPTEAFHGDLGMLKPIDIVMLISYSGES
DDVNKLIPSLKNFGNTIIALTGNKHSTLAKHADYVLDISVEREACPNNLA
PTTSALVTLALGDALAVALINARHFQPMDFAKFHPGGSLGRRLLCRVKDQ
MQTNLPVTALNTSFTDCLTIMNEGRMGVALVMENDDLKGIITDGDIRRAL
AANGADTLNKVARELMTSNPKVINQDTYIGQAEDYMKEHRIHSLIVVDND
NKVVGLVEFSS
>MS1519 hflX, HflX protein
MNNDVNISKSAVNFTALSSISAPRSDQSDNAIVVHVFFSQDKNPEDLDEF
QQLAQSANVNILQVITAARSTPQAKYFVGQGKAEEIAQAVETHNADVVLV
NHSLTPAQARNLESLCQCRVVDRTGLILDIFAQRARSHEGKLQVELAQLK
HLATRLVRRKTGLDQQKGAVGLRGPGETQLETDRRLIKVRIAQLQNRLAK
VEKQRNQNRQTRQKADIPTISLVGYTNAGKSTLFNRITQANVYAADQLFA
TLDPTLRRLQIQDVGTTILADTVGFIRDLPHDLVSAFKSTLQETTEAGLL
LHIIDAADPRKLENIEAVNAVLEEIKAADLPTLLVYNKIDTLENLEPHIE
YDDQHIPVAVYLSAISAEGIDLLFAAIREKLKNEILHLQLNLSPNEGKIR
HQLYLLDCIRREEISDQGEFLLEIQIDKIQWLKLAKKFPQLEKCGKNL
>MS1518 hfq, Hfq protein
MAKGQSLQDPYLNALRRERIPVSIYLVNGIKLQGQIESFDQFVILLKNTV
NQMVYKHAISTVVPARSVSHHNNPQQQQQHSQQTESAAPAAEPQAE
>MS0399 hit, Hit protein
MWIYSFGLRDKLLFKLISDKKCGRFFPKIRKHKMAEETIFSKIIRKEIPA
DIIYQDDLVTAFRDIAPQAKTHILIIPNKLIPTVNDVTAEDEAVLGRLFI
TAAKIAKLEGIAEDGYRLIVNCNKHGGQEVFHIHMHLLGGEKLGPLNAK
>MS1322 hns, Hns protein
MNEVIKTLNNLRRLRSMAKELSIEQLENIIEKFQLVIEEKKAEELEIKRL
EEERKNRLEKYRELLKEDGITADELAQILAGKNNTAKAKRAPLSAKYKYI
NENGEQKTWTGQGRMPKAIQLQLNAGKSLSDFAI
>MS1545 hybF, HybF protein
MEIVEEQCHRNNVNKVTDIWLEIGPLSCVEPDAIEFCFEVCRKNTVMENC
KLHFVPVPALAYCWHCEKTVEIKSHHDACPQCGGIHLQKQGGDDLRIKEI
AVE
>MS1693 icc, Icc protein
MISNTYIYEADSDVIRFVQITDPHLFKDEQGELLGVNTQQSLTQVLTELK
ENQFNYDFVLATGDIVQDSSEEAYLRFCKSVQQLDKMVFWIPGNHDFQPK
MFDILVQEHGNLSPKKHLLLGDKWQILMLDSQVFGVPHGQLGQYQLEWLD
SKLKDNPDRYSLVVLHHHILPTHSSWLDQHNLRNAHELAQVLAQYDNVRG
ILYGHIHQAMDGTWKDYQIMATPSTCIQFKPDSNVFALDTLQPGWREVEL
HSDGSIITRVNRIQKASFLPNMQEDGY
>MS2079 ldhA, LdhA protein
MTKSVCLNKELTMKVAVYSTKNYDRKHLDLANKKFNFELHFFDFLLDEQT
AKMAEGADAVCIFVNDDASRPVLTKLAQIGVKIIALRCAGFNNVDLEAAK
ELGLKVVRVPAYSPEAVAEHAIGLMLTLNRRIHKAYQRTRDANFSLEGLV
GFNMFGKTAGVIGTGKIGLAAIRILKGFGMDVLAFDPFKNPTAEALGAKY
VGLDELYAKSHVITLHCPATADNYHLLNEAAFNKMRDGVMIINTSRGVLI
DSRAAIEALKRQKIGALGMDVYENERDLFFEDKSNDVITDDVFRRLSSCH
NVLFTGHQAFLTEEALNNIADVTLSNIQAVSKNATCENSVEG
>MS1188 ldhA, LdhA protein
MMKSAVVFTALFLYAISHIKDELYLPKQGAFMKIVFLDSTALPPHLPIPR
PDFDHEWIDYPYTGAEQTVERAKDADIVVTSKVIFSREVMEQLPKLKLIA
LTATGTNNIDLIAAKELGIRVKNVAGYSSVTVPEHVLGLIFSLKHSLAGW
YRDQLEGKWGESKQFCYFDYPITDIRGSVLGVVGKGCLGTEVGRLATALG
MKVLYAEHRDAQSCREGYTPFDEVLKQADIVTLHCPLTEHTTNLINKETL
SLFKKGAFLINTGRGPLVDEQALLDALKSGHLAGAAIDVMIKEPPEKDNP
LIVAAKTMPNLLITPHIAWASDSAVTTLVNKVRDNIEEFVATGK
>MS1288 lppC, LppC protein
MTILLQRAKFKKRLMPILFPLMLAGCTNLFGSNFQDVLRNDANASSEFYM
NKIEQTREVEDQQTYKLLAARVLVTENKTAQAEALLAELTKLTPEQQLDK
SILDALIAAVKRDNDSASALLKTIPLAQLSQSQTSRYYEVQARIAENKTD
IIEAVKARIQMDMALTDVQRKQDNIDKIWALLRSGNKTLINTTQPEGNVA
LAGWLDLTKAYNDNLSQPSQLAQALQNWKTTYPNHSAAYLFPTELKSLSN
FTQTQVNKIALLLPLSGNASILGSTIKSGFDDSRGADKSVQVDVIDTMAM
PVTDAIALAKQNGDGMIVGPLLKDNVDVILSNPTAVQGMNVLALNSTPNA
RAIDKMCYYGLAPEDEAEAAANRMWNDGVRQPIVAVPQSDLGQRTASAFN
VRWQQLAASDADVRYYNQPDDAAYNLTADPAQNQAIYIVVTDSEQLMSIK
GALDNSGVKAKIYTNSRNNSSNNAVEYRLAMEGVTFSDIPFFKDLDGEQY
KKIEAATGGDYSLMRLYAMGADSWLLAHSFNELRQVPGFSLSGLTGKLTA
GPNCNVERDLTWYSYQGGNIVPLN
>MS0351 mazG, MazG protein
MIPCLIEESYEVVEAIQQKNTADLREELGDLLMQVVFLSQLAAEENKFTF
DDVVNDIAEKLIYRHPHVFGDKEAADEHAALRNWNEMKAREAKNQAHTSI
LDNVPFSFPALLRAEKLQKKCAKAGFDWQQVAPVIAKVEEELEEVTQEIN
CPAPQQAKLEEEIGDLLFAVVNLSRHLKCQAEESLRKANHKFERRFRAVE
DKLRQQNKTATESSLMEMDMLWDEVKHEEKVSSD
>MS1417 mdaB, MdaB protein
MKNVLIVSGHPNLKTSIANQVILDETAKALPNAEIRKLDELFHNGTFDIA
AEQAAVLKADVLVFQFPFSWFSLPGVMKIWLDEVFEHGFAHGSTAQLAGK
KIIFSTTTGAPAEVYQKDGFFKYTMEEFAAQFEIMAQLCNLDYQGLIYTN
GIGYTSRENEEKINAQKAEAKKHAQRLVALIEKA
>MS2162 mdaB, MdaB protein
MVIFLTVCYHTSGRFLSIFCKCRYSTSGEEYMNRRNLLKAGVALAAVAAM
PFGRAQAKTPSKKTLVIVSHPYPESSTFIKGLQQAAETVEGVTVRNLETI
YGFDTRAVKGDEERRIMRAHDRVVFIFPTHWFNITPMMKAYLNETWGSVG
PGLWQGKEMLVVSTAAGGSETYGKNGRVGVELADVFLPMKASALHCGMTY
LPPLVFQGVRSSELANYQQQLIERLMQ
>MS0836 mdaB, MdaB protein
MKHLVIFAHPNTKNSFNKAILERVLQASQKMNVDTTVRDLYGMNFNPVVS
WEELTGSFKEIIPAAIRHEQQLISEADLITLIYPLWWMGFPAILKGYFDR
VFTHGFAYKTDETGTVGLIQGKKMQQFITMGNNEERYQQMGFARSLNDTL
VNGLFNYVGIIDIDHRLLGDIHIISSEERQALLNEVEQKTKENLTALLEG
KA
>MS2139 mdaB, MdaB protein
MSNILIISGHPNLANSVVNTIILDEFAKTLPQAEIRKLDQLHTNYEFDVA
AEQAAIEKADVILWQFPFYWYAMPALMKKWLDDVFVHGFAHGSTAKIAGK
KLLISLTTGAPLEAYQREGFFKHKMDDFFAAFETTAILCGLDFQGVQFLN
GVSYVGRNEEKIAQQQAEAKVYAQTVIEKVKRL
>MS2094 mdaB, MdaB protein
MKTTVLVVHPNIKQSRVNAALAKGAADVAGVKVRYLYDLYPDGKIDATAE
QAVLEKADRIVLQFPMYWYSSPALLKQWLDDVLAYGWAYGDKQALKGKEL
MLAVTTGGGEEFYQKDGLAGHTVAEFLVAYETIASYLGMNYGKMFVTGNC
LNISDDEIAAQVPRYQAVLSA
>MS1631 metG, MetG protein
MSNQHRQILVTCALPYANGPIHLGHMLEHIQADIWVRFQRMRGNEIHFVC
ADDAHGTPIMLKADQMGITPEQLIADVKEKHYADFCGFNISFDNYHSTHS
EENRELSELIYSRLKENGFIKSRTISQLFDPEKSMFLPDRFVKGTCPKCK
AEDQYGDNCEVCSATYSPTELINPRSAVSGATPVIKESEHFFFDLPSFES
MLKEWNRSGALQSEVANKMQEWFDAGLQQWDISRDAPYFGFKIPGTENKY
FYVWLDAPIGYMASFKNLCKRENLDFDRFWNKDSNTELYHFIGKDIMYFH
SLFWPAMLDGANYRKPTNIFVHGYVTVNGEKMSKSRGTFIQAATYLKHLD
PECLRYYYAAKLSNRIDDLDLNLDDFVQRVNTDLVNKLVNLASRNAGFIQ
KRFDGKLADKLEDESLFAEFIAQSEQIAAYYENREFGKAIREIMALTDKA
NKYVDDKAPWVIAKEEGREAELQAVCSMGIQLFRVLMGYLKPVLPKLAER
SEAFLQAELTWDNLAQPLLNHGIAPFKALFSRLDVKQIDAMIEASKAENA
AVNATVKKEEKNSKKSTALLTDFEPIEPEISIDDFAKIDLRVAKVIKCEE
VPESKKLLKFQLDLGFEQRQVLSGIKGAYNNPEELEGRFVIVVANLAPRK
MKFGVSEGMILSAGTGGEDLYLLDVDAGVKAGSRVM
>MS1793 mhpC, MhpC protein
MNTMTLVFLHGLLGTKSDWRKIIENLPHFRCVSLDLPFHGEHKFTEANNF
EQCADFISHQIKSAVGNQPYFLVGYSLGGRIALYYALQSQCEKGNLQGLI
LEGANLGLTCDEARKVRWKNDEFWAQRFITESAESVLNDWYQQPVFAHLN
AQQRADLIEKRVTNCGKNIGKMLEATSLAKQPYLGDKVRESTLPVYYLAG
EKDQKFRQMAVQEKLNLQLIANAGHNAHLENPVEFSQKLTALLRNHKIKK
TDNL
>MS0862 mhpC, MhpC protein
MKLLNYQFHQLKQPSNQATMVFIHGLFGDMNNLGIIARAFSDAYNILRLD
LRNHGQSFHADEMNYSLMAQDIIHLLETLQLTKVILIGHSMGGKAAMKTA
ALRPDLVEKLICIDIGPIAYAHRWHDDVFAGLFAVKNAQASSRQEAKPIL
ASYIKDEGVIQFMLKSFDGNAAEKFRFNLSALFNNYGQIMGWEEVFFDKP
TLFIKGGNSDYLQSGYGTRILAQFPQASSFTINGSGHWVHAEKPEFVVRA
IQRFLESN
>MS0882 mhpC, MhpC protein
MMLYETKGNGEPIIFLPGLFAGGWIWNSVVRNIQDKGFKTFTFTDPIPVA
FEGSQQKALTELDTITENCSTPVYLVGNSLGALIALHYAFQRKDRVKGVI
MSGAPGQLEMEAGVSLDELKTGKDKYTTLLGSRIFYDQSKIPPHGIEEVK
YLFGTEKIFRNIVRWLYFSRKYDVPDVLQKISIPIDFIWGQYDLITPIEP
WIDIAKNFPQTSMTIIKDSGHSPMVEQPELFTEALLRKISSGRTHIK
>MS2156 mhpC, MhpC protein
MIMTISALDFFKRDVTLPNQLDGLPHKLSDVTGLQIGSFKTNDGVSLNYW
KAGSGEPLVFVPGWSSNGAEYINLIHLLKDKFTVYVLDQRNHGLSDKVKF
GNRISRFAMDLHEFFNAENIEKAHLCGWSMGCSVIWGYVDLLGTSRVEKF
VFIDEAPSIYCHSNWTEEERINAGAFTTSAEMMIDMYYGRGTCNMLQVNT
DLFNFYNTIDALAFENSMALCDQVCPHDKDALEQVLFDHILNDWRDVLIN
KIDKPTLVVSGEHSNWVESQRWIAQTVPNSEDLIYGKHEHGDHFLHLKMP
QKFAGELTEFLNRMS
>MS1344 modE, ModE protein
MDNTEILLTIKLHQRLFVDPKRIRLLKEIAHCGSINQAAKNAKVSYKSAW
DHLEAMNAISPKPLLERNIGGKNGGGTQLTNYARRLLQLYDLLEKTQEKA
FQILQDESIPLNNPLSATARFSLQSSARNQFFGKVTKLELKNGHCMVSIQ
IEGLNRPLVASITEKSAVRLGLVPGKEVMLMIKAPWIKTQLEEPVDKENQ
FLAEVRSVSDKGGEKEIILSIGENPEFCATIEKTVDVAVNQKRWLYIDPE
QIVLASL
>MS0019 mutT, MutT protein
MNLLQKPEILGISVAAKSRIFEIQAVELKFSNGELRTYERFKPSSRCAVM
VLPIDGEDLLMVREYAVGTERYELGFTKGLMEAGETPEQSANREMQEEIG
LGAKQFMLLRTVNSSPSFMNNPMHILIAQDFYPSKLPGDEPEPLQLVRVP
LANINELIEDPGFSEARNLVALYTLRDYLRKLK
>MS0709 mutT, MutT protein
MNYKNPNSVLVVIYAKNSGRVLMLQRQDDPEFWQSVTGSLAEKEMPFLTA
LREVKEETGIDIKRENLTLVDCHQSVEFEIFPHFRYKYAPNVTHCKEHWF
LLELPDERVPVLTEHLAYQWLEPAKAAELTKSPNNAQVIRKYLINKSA
>MS0328 mutT, MutT protein
MDKKTVQVAAGIIRNEFGQIYLTQRLEGQDFAQSLEFPGGKVDVNETPEQ
ALKRELEEEVGIVALNPVMFEQFVFEYPNKIIHFYFYLISEWIGEPFGRE
GQEGFWIEQLDLDESQFPPANSKLIQRLLAEMNC
>MS0408 mutT, MutT protein
MIDFDGYRPNVGIVICNRKGQVLWAKRYGQNSWQYPQGGINDGETPEQAM
YRELYEEVGLTRRDVRIVYASKQWLRYKLPKRLLRYDSKPMCIGQKQRWF
LVQLMSDEKNINMNCSKSPEFDGWRWVSFWYPVRQVVSFKRDVYRKAMKE
FACFLFDANKTVNPLSTNNNDEKKANYSAKKPYSPYRNQDKKRKTRV
>MS2341 mutT, MutT protein
MLKPHVTMACIVHCKGKFLFVEEIEYGKRTLNQPAGHLEENETILEGASR
ELYEETGIRAKMQHLVKIYQWHAPRSQKDYLRFVFALELDDWAEITPHDS
DITQGFWLTLEEFNYYIRQENQCARNPLVTEALEDYLAGSRYPLDILTLF
NN
>MS1694 mutT, MutT protein
MLIFCEQVQKNYKKNLKIFNFELSLPIVFAGGSVMSELQQFSQQDIEVLN
EETLYSGFFKMKKVRFRHKLFAGGMSEVVTRELLYKGAASVVIAYDPVRD
EVVLVEQVRIGAYDPNLSSSPWLMELIAGMIEEGESPEEVAMRESEEEAG
VTIDNLEYALSVWDSPGGTVERLYLFAGRVDSSKAKGLHGLACEHEDIKV
HVVSRETAYQWVNQGKIDNSSAVIGIQWLQLNYRRLQKNWC
>MS1528 mviM, MviM protein
MKKINVGIIGTGFIGAAHIEAIRRLGFVDVIALAENNQQLAEQKAKELNI
PLAYDCVDKLLANPDIQVVHNCTPNHLHFAINKKVILAGKHVFSEKPLCL
TSQEADELTSLAEQQGVTTAVGFVYRNFAMVQQAADMVRDQQIGRVFAVN
GHYLQDWMLLETDYNWRVDPKVGGKSRTVADIGSHWCDTVQFVTGKKIKE
VFADMSIVYSTRKASKQVESFVTVNADSSYELKPVETEDYASVLVRFEDG
SKGSFTVSQVSAGHKNDLTFDISGSEKSLHWEQETPQYLKIGYRQQANQI
LCDDPSLVNPAVRAYNHFPGGHIEGWPDAFKNMMLAFYAFIAEGKDPQQD
TAKFAMFKDGAQIVHIVDTIIESAQQGKWISVK
>MS1500 mviM, MviM protein
MKKFALIGAGGYIAPRHLRAIKDTGNTLVVAMDVNDSVGIMDSHFPDAEF
FTEFEQFEAFVEDQKLKGEKLDYVAICSPNYLHAPHMKFALKNGINVICE
KPLVLNSTDLNMLSEYEQKYGAKVNSILQLRLHPSIIALRDKVEAAPADK
VFDVDLTYLTSRGKWYLKSWKGVDQKSGGVATNIGVHFYDMLHFIFGDVV
KNEVHYRDEKTVSGYLEYKRARVRWFLSIDANNLPENAVQGEKLTYRSIT
IENEELEFSGGFTDLHTQSYQRILEGKGYGLEENRTAIETVEVIRHAPII
ENPANPHPFLAKVLNK
>MS1414 mviM, MviM protein
MDMKLGIVGTGMIVADLMQTLHKVTLEKLAIWGRDQVKTTQFASENGISQ
VFADYEAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTG
EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ
YSSRYDRFKAGETLPAFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCAYA
ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP
ANAMNRFTYTPNQGEAQHFEFGDAHRMLPEFERFVEIIDRKDFAQAEKML
DISAAVSEVLEQARKGAGIKFAGE
>MS1388 mviM, MviM protein
MGMKLGIVGTGMIVRDLMQTLHKVRLEQLAIWGRDQAKTAQFAAEQGILQ
VFSDYAAMLNSDLDTIYIALPNHLHFSFAKQALEAGKNVIMEKPITSNTD
EFNQLRQLAQTQGVILIEAVTVHYLPAYLAIREKVAELGEIKIVSLNYSQ
YSSRYDRFKAGETLPVFDPQKSGGALMDLNVYNVHFAVGLFGKPQSCTYA
ANIQRGIDTSGILLLDYPQFKAVCIGAKDCAAPVMLSIQGDKGNITVPMP
ANAMNRFTYTPNQGEAQHFEFGDVHRMLPEFERFVDIVDRKDFAQAEKML
DISAAVSEVIEQARKGAGIRFAGE
>MS1755 mviN, MviN protein
MSKRLLKSGIIVSTMTLLSRVLGLVRDVVIANIIGAGATADVFLFANRIP
NFLRRLFAEGAFSQAFVPVLAEYQRSGELSKTQEFIGKVSGTLGGLVSIV
TLLAMVGSPVVAAIFGTGWFIDWINDGPNAEKFTSASLLLKITFPYLWFI
TFVALSGAILNSLGKFGVMSFSPVLLNIAMITTALLLAPQMESPDVALAI
GIFIGGLLQFLFQLPFLKKAGLLVRPRWAWNDEGVKKIRTLMIPALFGVS
VSQINLLLDTFIASFLMTGSISWLYYSDRLLEFPLGLFGIAISTVILPTL
SRQHVNRADDVQKSAADFRATMDWGVRMILLLGVPATIGIAVLAQPMLLV
LFMRGQFSLTDVQATSYALWSINVGLLSFMLIKILANGYYARQDTKTPVK
IGIIAMISNMVFNLLAIPFSYVGLAMASAMSATLNAYLLYRGLAKADVYC
FTKQSAVFFLKVLAAALVMGTVVWYFSPQLVIWNEMAFLTKVIRLAELIL
IAASSYLLMLVILGIRKRHLLAR
>MS0494 nrfG, NrfG protein
MIQLKKLFNFVVFLPGLFFAFALSGCVNGADDVFVSKNKIILGEQYPNVH
FDQEVMIVRISQMLIIGQLSKNERADLYFERGVLYDSLGLWGLARYDFTQ
ALALQPRSPAIYNYLGLYLLLDEDYDSALEAFNAVLELDPNYDYTYLNRG
LDFYYMERYNLAQQDLLKFYEAKKDDPYRALWLYINELKFKPNEATQNLA
RRAKDLSTEYWGTYIVQYYLNEISVKDLLDKAKVFVDPQSSQYAEILTET
YFYLAKQKLNAGHAEEAETLFKLAMANQVYNFVEYRFALFELAKLKTNSE
QTEQAVVQRVKTTQAPNSKELDAE
>MS1820 nrfG, NrfG protein
MRKFKSLTLIALSVLVIASCSSSEKPVEQASEQELFSTGANYLQEGNYTQ
ATRYLEAVDSRFPGSSYSEQAELNLIFSTYKSQDYTKTLTTADRFLQQFP
QSQHLDYVLYMAALTNSALGDNLFQDFFGVDRSTRETTSMKTAFNNFQTL
VQNFPNSPYTPDALARMAYIKDRLARHELEIAKFYAKRSAWVATSNRITG
MLRSYPDTQATLEALPLLQESYEKMGLTQLASQAATLVKANEGRVIKEAE
KPKEPFLSLPSWLSFGSSDSSDKEKVATKSDDSFFSWPSWLSFGSKD
>MS1594 obg, Obg protein
MKFIDEALIRIEAGDGGNGCVSFRREKYIPKGGPDGGDGGDGGDVYLIAD
ENLNTLIDYRFEKRFAAERGENGRSSNCTGHRGKDITLRVPVGTRAIDND
TKEIIGDLTKNGAKLLVAKGGYHGLGNTRFKSSVNRAPRQKTMGTPGEKR
DLQLELMLLADVGMLGLPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGV
ARVDANRSFVVADIPGLIEGASEGAGLGIRFLKHLERCRVLIHLVDIAPI
DESDPADNIGIIESELFQYSEKLADKPRWLVFNKIDTISDEEAAKRAKDI
TERLGWEEDYYLISAATGKNIPQLIRDIMDFIEANPREVEEEEKAAEEVK
FKWDDYHNEQLSERGFDDEEDWDDDWSEEDDEGVEFIYKP
>MS2242 oraA, OraA protein
MTTLAFSYAVNLLSRREYSEFEIRCKMQEKAFSEQEIEDTLAQLQQKNWQ
SDKRFTENYLRARAQRGYGVNRIKQELRQLKGILPETVDEALMECDIDWS
EIALNVLAKKFPDYRARQDAKNKQKIWRYMLSHGFFAEDFADFIGNGTED
EFY
>MS1291 osmY, OsmY protein
MNMHKLKKLTFIIGSALLLQGCVAALVGGGAVATKVGTDPRTTGTQLDDE
TLKFQVYNAVNKDEQIKQEGRIVVSSYSGRVLLLGQVPTESLKSVATSLA
KGVDGVGDVYNEIRVGSPITVTQKTKDSWITSKIKSDMLLNSSVKTTDIK
VITENGEVFLMGNVTQEQANAAAEVARNIAVLKKS
>MS1958 perM, PerM protein
MIEMLKNWYLRRFSDPQAMGLAAILFFGFVAIYFFSDLIAPLLIALVLAY
LLEMPISFLSDKLKLPRFLSILLILGGFIAVTILMIFGLIPTLINQTVNL
FSDLPNMLNLSHQWVMSLPESYPELVDYQMIDSLFITIREKTLAFGESAV
KFSLSSLMNLVTIGIYAFLVPLMVFFMVKDQDELIAGFSRFLPKNRTLAS
KVWQEMQLQIANYIRGKLFEILIVAVVSYIIFLFFGLRYPLLLAVAVGLS
VLIPYIGAVLVTIPVALVAIFQFGATPTFGYLMTAYIVSQLLDGNLLVPY
LFSEAVNLHPLTIIIAVLIFGGLWGFWGVFFAIPLATLVKAVVNAWPSNE
DEAIS
>MS0428 perM, PerM protein
MNKSVSVNQFLIGFAALVIILAGIKMAGEIVVPFLMSLFIAIICSPIIKF
MTNRKIPHWLAISILFLFIVLVFFFLLGLVNSSIREFSQSIPQYRVLMSE
RLNEITALIQKWNLPLNLEKETILEHFDPSSIMNFVSRLLLSFSNVLSNA
FVLILVVIFMLLEAPTAKRKVALALSGNEKDASKEEKHLERILQGVISYL
GVKTAVSLLTGLCAWVLLETCGVQYAVLWATLTFLFNYIPNIGSIIAAIP
IVLQALLLNGFSTGFAVMTGIIAINMLIGNFLEPKLMGRTLGLSTLVVFL
SLLFWGWLLGTVGMLLSVPLTMALKIMLEASPNTTKYAALLGDVEESN
>MS1478 pfoR, PfoR protein
MKNRLKNFLIRQNIKFSLRRYAIDAMNFMALGLFGSLIIGLILKNTGDWL
DILWLNELGALAQSSMGAAIGVGVAYALKAPPLVLLSSTTTGIAGATLGG
PIGCFIAAAIGAEFGKLVNKTTPIDILITPAVTLLSGIATAQFMGPFLAS
LMRETGAMIMWAVELHPIPMSILVSVLMGMILTLPISSAAIAVTLSLSGL
AAGAATIGCCAQMIGFAVIGFKENRWGGLLSLGLGTSMLQIPNIVKNPKI
WVPPTLSGAIIAPFATVIFQMQNIPSGAGMGTSGLVGQIGTINAMGNSPY
IWLVILVLHFILPAILSLLITYLMRRKGWIKPGDLKLAV
>MS1090 pheT, PheT protein
MKFSEQWVREWVNPAVNTEQLCDQITMLGLEVDGVEAVAGEFNGVVVGEV
VECAQHPDADKLRVTKVNVGGERLLDIVCGAPNCRQGLKVACAIEGAVLP
GDFKIKKTKLRGQPSEGMLCSYRELGMSEDHSGIIELPADAPVGKDFREY
LILDDKEIEISLTPNRADCLSIAGVAREIGVVNQLAVTEPAINPVPVTSD
EKVAINVLAPEACPRYLLRSVKNVNVNAETPVWMKEKLRRCGIRSIDPIV
DITNFVLLELGQPMHAFDAAKLAQPVQVRFAADGEELVLLDGTTAKLQSN
TLVIADQTGPLAMAGIFGGQASGVNAQTKDVILEAAFFAPLAITGRARQY
GLHTDSSHRFERGVDFELQHKAMERATSLLVEICGGEVGEICEVVSETHL
PKLNKVQLRRSKLDALLGHHIETETVTEIFHRLGLPVSYENEVWTVTSAS
WRFDIEIEEDLIEEIARIYGYNSIPNNAPLAHLSMREHHESDLELSRIKL
ALVGNDFHEAITYSFVDPKLQSILHPEQAVWILPNPISSEMSAMRVSLLT
GLLGAVVYNQNRQQNRVRLFETGLRFIPDESAEFGIRQELVFAAVMTGSR
LSEHWASKAEPADFFDLKGYIENLLSLTKAGPYIKFVAKEFPAFHPGQSA
AIVLDGEEIGYIGQLHPMAAQKLGINGKAFACELIVDKVAERNVANAKEI
SKFPANKRDLALVVAENIAASDILDACREVAGSKLTQVNLFDVYQGQGVP
EGHKSLAISLTIQDTEKTLEEDDINAVISVVLSELKDRFNAYLRD
>MS0580 potD, PotD protein
MRNILRKALSLTITALAVANFAQAENLTDKSWPDIEAQAKKEGKLTVSVW
YLQPQFRVFVKEFEKQYGIQVKVPEGTLDGNINKLIAEKNLEKGKMDVVV
LSADRVSNVTNNGVLANIKQLPNFGKLNHFLQGVDLGETAVGYWGNQTGF
AYDPLRITEDQLPQSWQDVENYIQQNPKKFGYSDPNGGSSGNAFIQRALV
YVNGEYDYMTPTVDAAQVANWKKTWEWFNARKNVMIRTASNADSLTRLND
GELVLVSAWQDHLFSLQKQGAITTRLKFYVPQFGMPGGGNVATIAKNAPN
PAASLVFIHWLTSPEVQQKLSQEFGVRPLDSESGKRDTLFFSTPWRKAEM
EAFTKEVVSR
>MS0817 pqiB, PqiB protein
MASPFSLRCFQPHSLIPVYFGIFMTNNQANNRVKINAENNVQAAKIKQDK
RISPFWLLPIIALCIGALLFFQIIKEQGETIRITFTTGDGLVANKTQVRY
QGLQIGIVKKVNFTDDLKKVEVQASIYPEAKNVLRENTKFWLVQPSASLA
GISGLDTLISGNYISLQPGDGNYKDDFIAEETGPIAQVSDGDLLIHLLAD
DLGSISEGASVYYKKMPVGKIYDYRFTPDQKKVEIQVVIDKAYANLIKQD
TRFWNISGINANVGPSGITVNMDSLNAIVQGAITFDSPDNSPKAKQDQQF
TLYPTLQAAQRGIEVKITLQNQAGLKAGKTEVFYNNLQVGTLAKLDNEDI
THAKISGTLLLDPNISNELRTNTNIILRTPKMNLATLEKLPDMLRGQFFE
IIPGSGEPQREFQVYKESDLLLKQADTLVFTLTAPETYGIAEGQQIFYNN
LPIGEIVKQTLNEQGVEYQAAIAGKYRHLIYGDSQFVAASNLDISLGIDG
LRVEAASPDKWLQGGIRLIANKNKGSALSSYPIYKDLSSAEAGITSSTLT
PTITLNAQNLPNIGKGSLVLYRQYEVGKVLDIRPLKNSFDVDVAIYPKYR
HLLTKNSLFWVESASQVDITARGISIQTSPLGRVLKGAISFDNSGGNNNK
TLYANELRAKSAGQVITLTADNATNLTKGMALRYMGLEVGQLESINLDQN
KNQVVVKALMNPNYMNLVAKEGSEFRIISPQISAGGIENLDSLLQPYIDI
DAGKGKYKTTFAIKNNNNTDNKYNNGFPIILEASDALNITTGSPIYYRGV
EVGKINRMELNELGDRVLIHLLIANKYRHLVRKNSEFWISSGYSAGVGWS
GIEVNTGTVQQLLKGGISFSTPSGTVIQPQAAANQRFLLQIKKPVEAKTW
NSAVLPEQN
>MS0807 proP, ProP protein
MLMTSQNKINAVPSNQNFYLNNRNYWIFSGYFFVYFFIMATCYPFLGIWL
GDINGLSGEDRGTVFAMMSFFALCFQPVFGYVSDKLGLKKHLLWVLGISL
LIYAPFFIYIFAPLLKVNVWLGSLVGGAYIGFVFQAGAPASEAYIERVSR
RSKFEYGRVRMFGMFGWAICASIAGVLYATNPNLVFWLGSIASLILLLLI
ALAKPEQTSTVQIAEKLGANKNPVNLRQAFALLKLPKFWALLAYVMGIAC
VYDIFDQQFGNFFNTFFESHEQGIKMFGYVTTAGELLNALIMFFVPLIIN
RIGAKNALLIAGTIMSVRIIGSSYAIEAWHVVVLKTLHMFEVPFYLVGLF
KYIANVFEVHFSATIYLVACHFAKQIGNMLVSPLVGAWYDTYGFQDTYLI
LGCIAAGFTLLSVFTLTGKSLSSQS
>MS0191 proP, ProP protein
MSNKVNSYGWKALMGSAVGYAMDGFDLLILGFMLSAISADLSLSPTQAGS
LVTWTLIGAVAGGIIFGALSDKYGRVRVLTWTIVLFAVFTGLCAFAQGYW
DLLIYRTIAGIGLGGEFGIGMALAAEAWPARHRAKASSYVALGWQVGVLA
AALLTPLLLPIIGWRGMFLVGIFPAFVAWYLRAKLHEPEVFVQKQAEVAT
GKRQSPFKLLIKDVATAKVSLGVVVLTSVQNFGYYGIMIWLPNFLSKQLG
FSLTKSGVWTAVTVCGMMAGIWIFGRLADRIGRKPSFLLFQIGAVISIIA
YSQLTDPAIMLFAGAALGMFVNGMMGGYGALMSEAYPTEARATAQNVLFN
LGRAVGGFGPVIVGAVVSAYSFKIAIALLAVIYVIDMIATVFLIPELKGK
ALK
>MS2054 proP, ProP protein
MASGEANYRSLAWIAASALFMQSLDATILNTALPTIAADLHHSPLEMQLA
VISYALTVALFIPISGWVADKYGTLRVFRFAVGMFALGSLACAMSSSLIM
LIFSRVLQGFGGALMMPVARLSIIRSVPKQELLPVWNLMATAGLTGPILG
PILGGWIVTYTSWHWIFLINIPMSLLGIWLANRYMPNVTGSLQKLDWAGF
FFLGGGLVGVTLGFDLISEEFIAKWQATVIVILGVILIITYCFHAQKRER
LALLPLSLFKIRTFRVGIMANMLIRLCASGIPFLLPLMYQVVFHYSADKA
GMLIAPIALSSMLVKPLCGRILTKLGYRTALISASIVLTLSIAVMSFLHI
DSPVWILIVNVALYGGCISIVFTAVNTLTISELSDQDASAGSTFLSVVQQ
VGIGLGIAVSALILSLYRYFIGESAVQLQQAFGYTYLTSASFGVLLVLVL
SGLKKEDGAHLHK
>MS1530 proP, ProP protein
MNTETKQPALIVPRLSLMMFMEFFIWGSWSVTLGIVMTKYDLSTLIGDAF
SMGPIASIISPFILGMLVDRFFPSEKVLAVLHLIGAAILWFIPEFITGQQ
GGTLVFALLAYMLCYMPTVALTNNIAFHSLADSEKSFPVIRVFGTIGWIV
AGLFIGQADLSASPAIFQVAAICSLILGLYSFTLPNTPPPAKGKPFSMRD
LMCADAIALFKIPHFLVFAICATLISIPLGTYYAYAAPFLDAVGFEKIGS
LMSMGQMSEIVFMLLIPFFFKRLGVKYMLLAGMLAWFLRYAFFALGVSEE
IRWAVYLGILLHGICYDFFFVVGFMYTDKVADEKIRGQAQSLVVLFTYGL
GMLLGSQISGGLYNNMFADNTDVSTWSTFWWIPAISAVVISVIFFIFFNY
KEDKREA
>MS0785 proP, ProP protein
MQNKFAVYLAAIGHLVTDMAQGALPALLPLFIKNYGLTYQEAGGLIFANT
VLASIAQPFFGYLADKRSMPWLIPLGMMLSGCCIAAMGFVHSYPGLFFFA
MIAGIGSALFHPEGARLVNRMSGGEKGKAMGIFAVGGNAGFAIGPMFAGL
AYLFGAQTLSIFALINTIIALIIFLQLPKLTVENVVNKAKNTASTTLQND
WRSFAKLSVIIFVRATNFTVLNAFIPIYWIHILHQQETDANFALTIFLSM
GVAITFIGGLLSDRLGYVRIIRYAYLIFLPTILIFTQSENLWLSFILLIP
LGLGVFTQYSPIVVLGQTYLAKSVGFAAGITLGLGITMGGIFSPIVGWIA
DHYGLQIALQTLSVLSLLGLIFSYRLKITDTEKPEKK
>MS0499 proP, ProP protein
MNLREHIDNNPMSAYQWTVVIIAAIMNLLDGFDVLALAFTATAIRGDLGL
SGAELGYLFSAGLLGMAAGSLFLAPLADKIGRRPLLLISVTLSALGMLGS
AYSASYGALGFWRLITGLGVGGILVGTNVLTSEYSSRKWRSLAISIYASG
FGIGAVLGGMFAVVLQEEYSWHAVFLAGFILTAVCLIVLLIWLPESIDFL
MTQQPRNAQIRLNKITKKMGLKGQWTLPEKVLASASKLPLTQLFNKNYRK
STALIWIAFFAIMFCYYFVSSWTPALLKEAGMTTEQSVSVGMMVSLGGTC
GSLLYGLLASRWKAKQMLVQFTVLSAFSVIIFILSSSILWLAMLFGILVG
GFMNGCISGLYTLNPSIYAANIRSTGVGWSIGVGRIGAILAPLAAGVLLD
YGWDKQSLYIGVGFVLLIAAIALSLLRIKTTLVKC
>MS0797 proP, ProP protein
MSQNHFFSHIFNRNMLICIFTGFSSGLPLYILTSLIPTWLRSTEIDLKTI
GFFTLTSLPFIWKFLWSPFLDRFVPPFLGRRRGWMLIFQLLLLISLGLFG
FIDPHTNQGLSLLIGLATMVSFFSASQDIVLDAYRREILSDQELGMGNSI
HVSAYRIAGLVPGSLSLILSDHFSWQAVFIITALFMLPGLLMTLFISHEP
QIELKSNRTLAENIVEPFKEFFQRKGLWGAIGILTFIFLYKFGDSMATAL
ISAFYLDMGFTKTQIGLVVKNASLWPMIIAGIIGGMITLKIGINKALWLF
GLVQIVTILGFAWLAQLGPFEKVDSFAIFALTVVVMAEYVGIGLGTSAFV
AFMARATNPVYTATQLALFTSLSALPRAVFNSFSGVLIENMGYYHYFWLC
FFLAIPGMLCLIWVAPWKEK
>MS2374 proP, ProP protein
MSGEKTSRYVLGVTLVATLGGLLFGYDTAVISGTVSSLDTVFIQPKGLPE
ISANSLLGFCVASALIGCIIGGACGGYLSSKYGRKKALLIAALLFLISAF
GSAYPEFGLKTINETNNIPYYLSNFLIQFVIYRIIGGIGVGIASMVSPMY
IAEITPARIRGKMVSFNQFAIIAGQLIVYFVNYFIALNGDNTWLNMLGWR
YMFLSEMVPAALFLILLFFVPESPRWLVLQNKFSQAEITLLKLLGERSGK
TELQNIVSSLEHRVVKGAPLFSFGLGVIVIGIALSVFQQFVGINVALYYA
PEIFKSLGASTNNALLQTIIMGTINLSCTTIAIFTVDKYGRKPLQIIGAL
GMAMGMFVLGMAFYANLSGTIALTGMLFYVAAFAISWGPVCWVLLAEIFP
NAIRSQALAIAVAAQWIANYIVSWTFPMMDKSSYLVERFNHGFAYWVYGL
MAILAALFMWKFVPETKGKTLEELELLWNKK
>MS0392 proP, ProP protein
MSTAKKRNFIFIATLGILSMLPPLGVDMYLPSFLNIARDLQVDPERVQYT
LTFFTFGMAAGQLFWGPVGDSYGRKPIILLGVIIGAVAAFFLTGVNSIEN
FTALRFIQGFFGSAPVVLVGALLRDLFDKNELSKTMSMITLVFMIAPLVA
PIIGGYLVLFFHWHSIFYVICAMGILSAILVFFIIPETHHQDNRIPLRLN
VVVRNFVTLWRRKEVLGYMFSSGLGFGGLFAFLTAGSIVYIGLYGVPVDQ
FGYFFMLNIGVMTLGSVINGRVVHRVGAERMLQIGLTVQLIAGIWLLIVA
CFDLGFWPMALGIAVFVGQNSLISSNAMASILEKFPTMAGTANSVAGSVR
FGLGATVGSLVALMKMDSAAPMLFTMGICVIVAVCCYYFLTYRSL
>MS1798 proP, ProP protein
MNVRPFTWLALSYFGYYCAYGVLVPFLPVWLKSQNYGTELIGAVIASSYL
FRFLGGIFFPSRVKRANQILPALRLLAWANVFVITAMAFVSESFWLIFIA
IAVFSMVNAAGMPLTDSMATTWQRQIRLDYGKARLIGSAAFVVGVTVFGS
LIGAIGEQYIISILIGLFGLYAVLQMVPPQPKPADEDKNSAKSAVGFGEL
LKNPTHLRLIIAAMLIQGSHAGYYVYSVIYWTNRGIAVETTSLLWGLGVI
AEILLFFFSGRLFRNWSVNAIFYLSAAAAALRWGAFSYTDALWQIALLQC
LHSLTFAALHYAMVRYIGMQPQNAMVRLQSLYSGLASCASVALLTALAGI
IYPISSHWVFLVMMICALIALFVIPRKPTNA
>MS1178 proP, ProP protein
MPNKAETSPAKLRLKAFLKRIKIMNTTENSKQKPVNVVAFAFLLTAFLTG
IASSFQTPTLSLFLAQEIQVSPFMVGMFYTSNAVLGIVLSQILAKYSDSQ
DDRRKIIIFCSLLAIGGCITFAYNRNYYVLMFFATFLLSLGSSANPQAFA
LAREYADYTKREAIMFTTIMRTQISLAWIVGPPLSFSIALGWGFEYMYMV
AASAFLLCAIIAKALLPYVPRKAVVPLTKPDEVAGLPAKNKKQSDKQSIR
LLFITCFLMWSCNGMYLISMPLHVINELHLSERLAGILMGTAAGLEIPVM
LIAGYLTKYLTKKSLILTALFMGLFFYIGMLFAEQTWQLVALQAFNAIFI
GIIATLGMVYFQDLMPGKMGSATTLFSNAAKSSWIVAGPFVGIIAQIWNY
SSVFYISIVLVAVSLFSMSKVKSV
>MS1407 proP, ProP protein
MMTSSRPNLTLLLILGALMACTSLSTDIYLPAMPTMAKELQGNTELTITG
FLIGFAIAQLIWGPISDRIGRKIPLFIGMALFAVGSVGCALSQSMAEIVF
WRVFQAVGACVGPMLSRAMIRDLYDRSQAAQMLSTLTIIMAAAPIIGPLL
GGLLLKISSWQAIFWLLVVIGILLFLSIIKLPETLPPAKRAAGSFWSAFG
NYRILLKNRAFMRYTLCVTFFYVAAYAFITGSPFVYIDYFKVDPQYYGFL
FGVNIVGVALLSAVNRRLVRHYPLESLLRVSTMIALCAVLILVVLVFMDL
DGIAGILSVAVPIFIMFSMNGIIAACTNAMALDSVQPEIAGSAAALLGSL
QYGSGILSSLLLAYFSDGTPHTMAWIIALFVGLCAVIGWGQRPRSA
>MS0998 pta, Pta protein
MSRTIILIPISAGVGLTSVSLGLIRALEQKGTKIGFMKPISQPRSGEDML
DRTTSIVRTSTTIETTEPVMLSEAENLIGQNQTDVLLEKIVAQHQQISKD
NDIVIVEGLIPSRKNSYANSVNYDIAQALDAEIILVSAPATETPAQLKER
VEAAAASFGGKSNPNLLGVVINKFNAPVDESGRTRPDLTEIFDSFQHSHN
NIKEIYKLFENSPIKVLACIPWSADLIATRAIDLVKHLGASILNEGDMNR
RIRSITFCARTLPNMIEHFKAGSLLVVSADRPEILTAAALAATTGIELGG
ILLTGGYKIDCEIKKLCNPTFENTKLPVFRIEGNTWQTALSLQSFNLEVP
VDDKERIENIKQYTSGQFDADFIHSLASASVRARRLSPPAFRYQLTELAR
AAKKRIVLPEGDEPRTIKAAVLCAERGIAECVLLAKPEDVKRVADSQGVK
LGNGITVIDPASVRENYVARLVELRKAKGMTEMAAREQLEDTVVLGTMML
EAGEVDGLVSGAVHTTANTIRPPMQIIKTAPGSSIISSIFFMLLPDQVLV
YGDCAVNPDPTAEQLAEIAIQSAESAKSFGIDPRVAMISYSTGTSGSGAD
VEKVKEATRIAQEKRPDLLIDGPLQYDAAVMEDVARSKAPNSKVAGKATV
FVFPDLNTGNTTYKAVQRSADLVSIGPMLQGMRKPVNDLSRGALVDDIVY
TIALTAIQATQC
>MS0777 putP, PutP protein
MFGLDPTLITFTIYILGMLAIGVLAYYYTNNISDYILGGRRLGSFVTAMS
AGASDMSGWLLMGLPGAVYVSGLIEGWIAIGLTIGAYLNWLFVAGRLRVH
TEFNNNALTLPEYFHSRFGTSHNLLKIISASIILVFFTIYCSSGVVAGAK
LFQNLFGIPYATALWYGALATIAYTFIGGFLAVSWTDTIQATLMLFALIL
TPVVIVVSLGGIDGFSASMQSAEIDMQKDFTDLFTGTSTLGLFSLAAWGL
GYFGQPHILARFMAAYSAKSLHKARRISITWMIICLIGAISIGFFGIAFF
HANPQIAEVVTKEPEQVFIELAKLLFNPWVAGILLSAILAAVMSTLSCQL
LLASSAITEDFYKGFIRPKAGEKELVWLGRIMVLIIAALAIWIAQDENNK
VLKLVEFAWAGFGSSFGPVVLLSLFWKRMTSSGAIAGMLTGAIVVFSWKS
VIPATSEWSGVYEMIPAFSLASLMIILVSLLSPAPNKEIVETFEKANLAY
KNAE
>MS1741 putP, PutP protein
MNVDYLVMAGYFALIIAISLLFKKMASNSTSDYFRGGGKMLWWMVGGTAF
MTQFSAWTFTGAAGKAFNDGLSVIAVFVGNMVAYACAYWYFARRFRQMRV
DTPTEAIGRRFGTSNEQFFTWVIIPLSVINAGVWLNGLSVFASAVFDADI
TMTIYVTGISVLIISLLSGAWGVVASDFVQMLVVAVISVACAVVGLVVIG
GPGEIIDRFPGGFVSGPDMNYPLILICTFLFFIVKQLQSINNMQDSYRFL
NAKDSKNASKAAIFALLLMLVGTIIWFIPPWVTAIIYPEAASLYPQLGKK
ASDAVYLVFAKNVMPAGTIGLLMAGLFAATMSSMDSALNRNSGVFVRSFY
APIIRKGKADDKELLRAGQIVCVINGILVILMAQFFNSLKHLSLFDLMMQ
VATLLQSPILVPLFLAIIIRKTPKWAPWATVLFGMFVSWSVVKVFTPEYV
ASWFGVEDLTKREISELKVIITIAAHLIFTAGFFCLTTLFYNEAKDTNNE
RRIAFFKDVDTECVAEEGQDEIDRLQRKKLSTLVMLMAAGLLLMILIPNP
LWGRALFACCSLAIFAVGYGLKRSAEV
>MS0578 rarD, RarD protein
MFMSISAKLKGWHYAFACYAIWGTFPIYWYPLNSSAMPADQILAQRIVWS
VVFAVFLLIIFKQSRAVLRAFTKPKILAIFFLSSFLIALNWLVYLWAITN
HHVLDASLGYFINPLFNVFLGRLVFKERLNKPQLLALCFATAGILWLAIP
AGQIPWVALLLAGSFGFYALIRKLAPMEALAGLALETLLLSPFALAYLFF
CYTQNTLVFSELNSLQLGVLLGSGAATTIPLLWFAMGARQISMSLLGMLQ
YISPTLQFLCGSLLFGEALSITRLIGYSLVWIGVAIFLLAMRKKMQNK
>MS1495 rfbX, RfbX protein
MVRLISTVFVRQILVGILQVITLIVIARGLGTGQMGQYTLAILLPTLFSQ
IITFGLQSINIYAIGRKMINENQALYANLIFLSGLSVLTSLILGVVVYYF
GQYFFNEVPVNLLYLALASLLPQTFFTVLPSLIQAVQNFKWFNIVCVAQP
LVIFVVSMVAILLSDNVSSILTAYVLSHWISFFILLGIILKLIKVETCSL
KRFFSDFIGYGLKSHLSNIITLLNYRSSLLILGYFTTPVIVGIYSVGMQL
AEKLWLPSQAVSTVLLPRLSNKLGEGGDEKEVAKLTLDSARLTFIVTLII
GIAFACLSSIVVRILFGVEYDKAVYVILLLLPGILAWTPSRILANDLAAR
GFAELNLKNSYWVFGINTALSLCLVPLWGLIGASVATSIAYSMDLVLRLI
AFNQVTQSRAFLHIIPRISDFGTVINFIKGLRNAR
>MS0657 rfbX, RfbX protein
MVKSTKRVFNFIMNKINTEHKKRLFSNFFSLTVLQIVNYALPLLTLPYLV
RVLDVETYGLVMFAQSFILFFNILVDFGFNLSATKEVSIHRDDKNKLIEI
YSSVMVIKFLLILSSFIILSIIIFSFERFSLNKGVYFLSFLWVIGQALFP
VWYFQGIEKMKYITIVNIIAKFLFTGCIFLFVKENADYLLIPLFNGLGIL
IAALVALWIVHVSLKQKVTWQPLSKLWIYFKESSTFFLSRASLTMYTSAN
AFVLGIFSNNTIVGYYSIADQLYKALQAFYTPLSQVLYPYIAKERNIVLF
KKIFNMAVFLNCMGIAILYFITVDVFALLFTQKIGIESINVFNIFLIASL
IVVPSILLGYPFLGALGFAKEANLSVIYASIIHILGLVILILFNKISLYS
VAYMVLVTELFVFMYRISKIRGRRLWRKQL
>MS1825 rhaT, RhaT protein
MLMPHFTQSKGYGYFCLILATFFWGGNYMFGRILSHVIPPIILNYLRWLP
AAIILLLLFAKYLPQQRHIIRKNWQILTALALLGVLIFPVFLYQGLQTTT
ALNASIYLAVVPIVVMFLNRICFKDTIRFPVFIGALISFIGVLWLLSHGE
LSRLLTFNVNRGDLWAIGSAVSWSVYCSIIRLRPKEIGNSVMLTAQVGIA
MIIFTPVFLSQLNTENLQIISELTYGQWMIILYLIIGPSILSYGFWNYGM
TIVGGTKGAAFTNATPLFAAALGILVLGEQLHGYHLISSLLIVIGLTLCN
KK
>MS1754 rhaT, RhaT protein
MFYLIAAVLIWASAFIAAKFSYTMFDPALTVMLRLILSALLVLPTFFRSY
RKIPKQYRLQLWGLGLLNFPVVLLLQFTGVHYTSVASAVTMLGTEPLVVT
LLGHIFFHKPARLLDWLLGIVALTGIVFVVYGSESGGEVTLLGCTLVLLG
SIAFSFSIHLAQSVMKAVEAKAYTDVIIMTGAISCVPFSLLLVQDWQIHL
NIEGISAILYLSVGCTWLAYRLWSKGLRVSSANTASILTTLEPVFGVLLA
ILLLGEHLTLTTLFGICLVISAAGISVLSSMLINYIKNKVTIL
>MS1595 rhaT, RhaT protein
MNQQPVLGFIFALITAMAWGSLPIALQHVLTVMGAESIVWYRFFVASLAL
FLLLAWKKKLPALSQFTSRYWKLSLIGVLGLAGNFFLFNSSLNYIEPAIT
QIFIHLSSFMMLICGVFVFKEKLGAHQKAGLLILILGLGLFFNDKFDMLF
GLNMYSTGILLSVSAAVVWVAYGMAQKLMSRQFTAQQILLIMYTGCVIVL
CPFAQFSQIQGLSGFALGCFIYCCLNTLIGYGAYAEALNRWDVSKVSVVV
TLVPLFTILFSRILHGLDPAHFAMPHLNTVSYIGAFVVVLGAIISAVGYK
LFKYKR
>MS1753 rhaT, RhaT protein
MLFQIIATLIWASAFIAAKYTYEMMDPVLMVQCRFFIASIIMLPGFFAAY
KRVPKERLKIMWLLALINFPLMFLLQFIGLYFTSAASAVTMLGMIPLLTV
LIGFLFFKRRINKIDLLLSLVALAGIILTVVGGGEDNLINPWGCLLVLGS
AVSFCFCLYLSKDVMQEMAPKDYTNVLVILGSILCLPFTCVLVRDWSIVP
SVKGMISLFYLGIGCTWLAVVLWFKGVQKTPTYISSILTTLEPIFGVILA
ILILDERLSTVSAMGILLTLGAAAVSVLIPVLMKKSP
>MS1597 rhaT, RhaT protein
MKQQPLLGFLFGLIAACMWSSLPLFVQQVVKVMDIQTSVWYRFVLSAVGV
LLLLCFSGKFFTFKRISPKNTLLLLLAIAGLSVNFYLYNLALKYIPPTTS
QVLSPLSSFMMLFAGVLIFKEKMARHQKIGLAVLSLGLILFFNERLDDFL
QLNTYFKGVVMVIASSFVWVIYAIVQKVLLSHLSSQQILLMIYIGCTLVF
FPNADIKQIYQLDGFQLVCLVFSGVNTIIAYGCYAEALDRWEVSKVSAIL
TQIPIFTLLFFHLAVMIAPNYFVAVELNWISYLGAFCVVSGAMLSALGHK
LKMLKERD
>MS0885 rhaT, RhaT protein
MLRPSCREKIIMVNNYNLALIKVHFTAVLFGLTGVLGVIISADSDVIVLG
RVIIAFLALSVYFLIKREKLTALSTKDVANQSLSGALLTAHWVTFYVAVK
VGGVAVATLGFAGFPGFVALFERLFFQEKLKRRELILLIAVTIGLILVTP
QFEFGNQSTQGLLWGIFSGAIYGILAILNRKNINKLSGTQASWWQYLIGS
ILLFPFAAHKLPAVSVTDWFWIACLGLLCTSLAYTLFVSSLNIINARTAA
MIISLEPVYAILIAWIWLGEQPGLRMIIGGLIILLSVGVVNFRR
>MS0535 rhaT, RhaT protein
MLQKYRGEIILFIVSLIAASGWFFSKFSMAEFPALGFIGLRFFLAAIFFF
PLAYPQLKRLDKPQLIKSALVGLCYAVYIMLWMLGLINSAHFGEGAFLVS
LSMLIAPLLSWLIFGHLPYKSFWLALPAAFTGLYLLSSGKGGLHFSFGSL
IFLISSLVAALYFVLNNQYARDIPVLSFTTIQLFIVGTCCGTLSILFEQW
PTSISMTAWGWFLCSLVIATNLRMLLQTYGQKYCHVATAAIIMILEPVWT
LFFSILILGERLTLHKAFGCLSILAAIMIYRLPAILRNQASANKE
>MS0300 rimI, RimI protein
MQFKIKPMLPEHYQQVYRLWTSIEGMDMSDADDNFEAISAFLAFNPDLNY
IAEINGKVVGVIMCGFDGRRATLYHAAVDPDYQKQGIGFALAEHLESALK
TKGISKGRLLAFKSNESATLFWQKAGWTLQQKLNYFSKKFI
>MS1590 rimI, RimI protein
MTEISPIQAEDFDRLFEIEQAAHLVPWSMGTLQNNQGERYLNLKSSVQNH
IAGFAICQTVLDEATLFNIAIDPVCQGQGIGKALLSELIKRLREKNVATL
WLEVRESNQTAKRLYDRLGFNEVDIRKNYYPTPDGGRENAIVMALYL
>MS0145 rssA, RssA protein
MKVGLVLEGGAMRGMFTAGVLDIFLDENIHIDGAVTVSAGALFGINLPSK
QRGRVLRYNKKYLNDKRYMGLHSLLTTGNIVNRDFAFYELPYTLDPFDQQ
TFAQSDMDFWVTLTNVETGEAEYFKIQDAFEQMEVLRATSAMPFVSKMVE
INGKKYLDGGIADSIPLQKCFDLGYDKVIVVLTRPLEYRKTPSSKTLFKL
FYPNYPQLAARWAQRYADYNQTVERIIKLNDEQKIFVIRPSESLNISRLE
KDPEMIQRMYELGLKDGKAAIAGLREYLAK
>MS1964 sPS1, SPS1 protein
MRKDMLQVQHENHFFLFNFDENRPNQEHFFESYFWQKQNRIIGSAKGRGT
TWFIQSQDLFGVNTALRHYYRGGLWGKINKDRYAFSSLEETRSFAEFNLL
NRLYQAGLPVPKPIGAHVEKLAFNHYRADLLSERIENTQDLTALLPNTEL
TAEQWQQIGKLIRRLHDLQICHTDLNAHNILIRQQNNDTKFWLIDFDKCG
EKPGNLWKQENLQRLHRSFLKEVKRMRIQFSEKNWADLLNGYQN
>MS0290 sbmA, SbmA protein
MLKNIRLNFNLSIGSGMNYSQELLTSLLWIFKAIGITAVLFSLTVYVLVK
TTRWGRQFWMLAAGYISPKRSKKPIGYFVIIVFFNLLSVRLDILFSEWYK
AMYNALQESHEKMFWIQMVVFSVLATIHIANVLLTYYLTQRFTIQWRTWL
NNEMVNRWTENQAYYKAQYVYNKLDNPDQRIQQDVLSFVSNSIEFATGVI
SSVVSIVAFTVILWGLAGPMTVVGITIPHAMVYLVFIYVLITSIFAFRIG
RPLINLNFTNERLNANYRYSLIRLKEYAESIAFFRGEKMEKNVLFKQFNQ
VIGNVWKMVHMTLKLSGFNLAVSQVSVIFPFIIQASRYFSKQIQLGDLIQ
TAQSFGRVQTALSFFRNSYDSFTGYRAVLDRLTGFYSAVNQANSASHISI
EDSESAVVFDKLTVKKPTGEALIKDLSLNLPQGASLLIKGPSGAGKTTLL
RTIAGLWSYSEGIVRCPQHHALFLSQKPYLPQGRLIDALFYPELAPENLD
LAQAAEIMRKVQLGHLTDRLEQENDWTRVLSLGEQQRLSFARVLICRPLV
AFLDEATASMDEGLEESMYRLLKTELPDTTIISVGHRSTLQIHHTQHLVI
NPQDQSWALS
>MS1316 sbmA, SbmA protein
MNWQTELNNSFSWLITTLIWVSLAFTFFALLLRKTDFGEKFWLVTKPCIE
QSNKFKTIGLILFLFLLILLEVRISVLNSFFYNGLYSALQDKKADAFWFF
ATINAMLVGFKIIHSIINYLIRQIFEIRWLEKFNDDMLSRWLDHKNYYRL
KYEKDLPDNIDQRIEQDAREFITGTVDLVDGILGAIVSIIEFTIILWGLS
GLLVLFDISIPKGVVFFIYTFIIIATALSVWIGYPLIKLNFNKEKLNGDY
RYSLIRIRDNAESIAFYDGEQKERQYLNERFKAIIKNRWAIVRQMLGLDG
FNTGVTQIAMILPLMLQAPRFFAGQATLGDMHQTVQAFNRLMRALSFFRL
FYEQFTLYQARLNRLYGFIGKLNELDTHLIPNPIECSQLVALENFGLKDA
KGNVLFEGINLELSAGDALLIQGASGTGKTTLLKAIAGIYPFETVGRSKR
PCNGKILFLPQRPYMPQGSLREAICYPNIDPHHPELESYMLKCHLDKYIF
ALDQENDWQAILSPGELQRVAFIRIFLTKPDVVFLDETTSALDEPTEHSL
YSKIRQALPGMIILSVGHRCTLQQFHTKHLVIGLDKSSRTI
>MS1230 sfsA, SfsA protein
MRLPPLQAAKFIRRYKRFMADVELANGNILTIHCANTGAMTGCAEKGDTV
WYSDSKSTTRKYPCSWELTELSNGNLVCINTHRSNQLVQEALQNKVIKEL
AGYSEIYPEVKYGEENSRIDFLLKGEGLPDCYVEVKSITLVKNNIGMFPD
AVTTRGQKHVRELLAMKKQGYRAVVLFAGLHNGFDCFKTAEYIDPDYDKL
LRQAMKEGVEVYAYAGKFDKIQEIPTALSLAEVVPLCFN
>MS0467 smtA, SmtA protein
MSLNLNQVSLLQNVTRYWNNRAEGYSRHNQQELQSIKRLKWQQLLLAHAP
KKQNLKVLDIGTGPGFFAIIMAQAGAQVTAIDATSNMLEQAKYNAAQAMV
DIRFVRGDVHHLPFADESFDLIISRNVTWNLSEPEQAYKEWHRVLKCGGN
LLNFDANWYLFLYDEQRRRAFEQDRASTIRLNIPDHYADTDTSAMEAIAR
KLPLSRQLRPHWDMNALLNIGFSQLMADTRIGEFLWDDEEKVNYRSTPMF
MIVAQK
>MS0945 smtA, SmtA protein
MWHAKHATELKLPTSWQQIPNGTLYCNALNRYFSHWLSNILGDQILKLGG
LSAEIGLDLPMRHQLVISPEIPQNLTALCLHPCTSVVRSKVTELPLIEES
IDACLLANNLNFCADPHRLLREITRVTTESGLLFISLFNPLSILAFKRQF
HQTPYEKFPFRQYPTWLIIDWLELLNFDILQCENLALQHRQHFSLFSPLT
VIIAQKRTCSLSSQAQKIQFHQEDVFSPEAAFKRINE
>MS0706 smtA, SmtA protein
MSKDTIFSTPIEKLGDFTFDENVAEVFPDMIQRSVPGYSNIITAIGMLAE
RFVTADSNVYDLGCSRGAATLSARRNIKQANVKIIGVDNSQPMAERARQH
IHAYHSEIPVEILCDDIRNIAIENASMVILNFTLQFLPPEDRRALLEKIY
RGLNQGGLLVLSEKFRFEDETINNLLIDLHHTFKRANGYSELEVSQKRAA
LENVMRIDSINTHKVRLKNVGFSHVELWFQCFNFGSMIAIK
>MS0203 smtA, SmtA protein
MTTFMNNKTKSAGFTFKQFHVSHDKCAMKVGTDGILLGAWASLQGNRYLD
LGTGSGLIALMLAQRTQTDCHITGVEIDPSAYRQATENVRQSPWADKIQL
EQQNIVDFTRTCTKKFDTVLSNPPYFEQGVDCRDKQRDTARYTQTLSHSD
WLNLAADCLTNTGRIHLILPYAAGKNLQKQTALFCARCCEVITKSGKIPQ
RLLLTFSKQPCTTEQSRLVVYNEQNQYTEQFIALTRDFYLNF
>MS1894 smtA, SmtA protein
MKESVYDSEGFFELYQKLRANPGSLNEIVEKPTMLSLLPDITGKTLLDMG
CGTGGHLQMYLRLGAKRVVGIDLSASMLKQAEIDLGKLCENRLQFSSGSF
SLHHLPMEQLDQLPEAQFDVITSSFAFHYVENFPALLTKIANKLTARGSL
VFSQEHPVVTAYQGGERWEKDENKQQIAYRLNFYRDEGKRERSWFKQPFL
TYHRTISTIVNNLIQVGFTIEKMAEPMLADQAEWQTEFKDLQHRPVLLFI
RAKKS
>MS2368 smtA, SmtA protein
MNIQLICETENSQNFTALCKEKGLTHDPASVLALVQTETDGEVRLELRKL
DEPKLGAVYVDFVAGTMAHRRKFGGGRGEAIAKAVGVKGNELPSVIDATA
GLGRDAFVLASIGCRVRLVERHPVVYLLLQDGLRRAYADPEIGEMMQKNM
QLLPVHHITELNPFEDFADVVYLDPMYPHKQKSALVKKEMRVFQYLVGAD
SDSNLLLEPALKLAKKRVVVKRPDYAEFLAEKAPQFSRETKNHRFDIYSV
NV
>MS1338 smtA, SmtA protein
MKSELICYKKMPVWNKNSLPKMFQEKHNTKAGTWGKLTVLQGKLKFYTLN
EDGSIVNEHIFSANTDTPFVEPQQWHKVEALSDDLECYLEFYCTKEDYFG
KKYNMTATHSDVLKTAKIITPCKVLDLGCGHGRNSLYLALKGYDVTSWDH
NAASIAFLADSAAKENLQIQTAVYDINNANIQENYDLILSTVVFMFLDRE
AVPAIIDNMQKHTNAGGYNLIVAAMSTEDMPCPIPFAFTFGENELKNYYQ
GWEFVEYNENIGELHKTDKNGNRYKMKFVTMLAKKVK
>MS0776 smtA, SmtA protein
MIDFRPFYQQIAVSELSSWLETLPSQLARWQKQTHGEYAKWAKIVDFLPH
LKTARIDLKTAVKSEPVSPLSQGEQQRIIYHLKQLMPWRKGPYHLHGIHV
DCEWRSDFKWDRVLPHLAPLQDRLILDVGCGSGYHMWRMVGEGAKMVVGI
DPTELFLCQFEAVRKLLNNDRRANLIPLGIEEMQPLGVFDTVFSMGVLYH
RKSPLDHLSQLKNQLRKGGELVLETLVTDGDEHHVLVPAERYAKMKNVYF
IPSVPCLINWLEKSGFSNVRCVDVEVTSLEEQRKTEWLENESLIDFLDPN
DHSKTIEGYPAPKRAVILANK
>MS1908 ssnA, SsnA protein
MKNHVRSFKTYIRDEIIKKGGWVNAHAHADRAFTMTPEKIHIYHNSNLQQ
KWDLVDEVKRTSSVEYYYARFCQSIELMISQGVTAFGTFVDIDPICEDRA
IIAAHKARDVYKNDIILKFANQTLKGVIEPTARKWFDIGSEMVDMIGGLP
YRDELDYGRGLEAMDILLDKAKSLGIMCHVHVDQFNTPKEKETEQLCDKT
IEHGMQGRVVAIHGISIGAHSREYRYELYKKMREAQMMIIACPMAWIDSN
RKEELMPFHNALTPADEMIPEGITVALGTDNICDYMVPLCEGDMWQELSL
LAAGCRFPNLDEMVNIASINGRKVLGLDR
>MS1280 sspB, SspB protein
MKNKMEYKSSPKRPYLLRAYYDWLVDNEFTPYLVVDATYYGVDVPQEYVR
DGQIVLNLSSGAVANLQLTNDAVMFNARFQGVPREIYIPLGAALAIYARE
NGDRSDVRT
>MS2272 surE, SurE protein
MLFFMGFMQNIPHKILIIIKEKNRKIMNILLSNDDGYHAEGIQILARELR
KFADVTIVAPDRNRSAASGSLTLVEPLRPRHLDDGDYCVNGTPADCVHLA
LNGFLSGRMDLVVSGINAGVNLGDDVIYSGTVAAALEGRHLGLPSIAVSL
DGRRYYETAARVVCDLIPKLHTRLLNPREIININVPDIPYDQIKGIKVCR
LGHRAASAEVIKQQDPRGESIYWIGPAALPEDDEEGTDFHAVNNGYVAIT
PIQVDMTSYNSMSALQDWLESE
>MS0956 tdh, Tdh protein
MMEIKTLSCVVRGPKDVGVMEQSINYDESSKEQTLVKITRGGICGSDLHY
YQYGKVGNYEIKHPMILGHEVIGTVVKTNAPDLYVGQKVAINPSKPCLTC
KYCLSGDTNQCETMRFFGSAMYNPHVDGGFTQYKVVDNSQCIDYPQDVSD
DIMAFAEPLAVTIHAAKQAGDLAGKRVFVSGVGPIGCLAVAAIKASGAKE
IVVSDLSRRCLDLALEMGATKALNAKDDFSEYMAHKGEFDVSFEASGHPS
SIERCLAVTKARGTIIQIGMGGAIPEFPIMTLIAKEICLKGSFRFIEEFN
TSVEWLSSGKVNPLPLLSATFPYTELEKALIIAGDKDNISKVQLSFE
>MS0525 tdh, Tdh protein
MMRSLVCKEPFHLILEERAKPQPKDEEVQLKVAAIGICGTDIHAYAGNQP
FFEYPRVLGHEASGVITELGKNVDKFKVGQRVALIPYVSCGKCGACLSGK
TNCCENISVIGVHQDGAFSEYLTAPAKNILPIADSVDFTTAALIEPFAIS
AHAVRRAQITKGDDVLIVGAGPIGLGAAAIAHADGANVVIADTSEERRKH
IQANIPVPTVNPINEKVEDYFNGRLPQIVIDATGNQKAMNNAVNLIRHGG
RIVFVGLHKGTIEFSDPDFHKKETTLMGSRNATLEDFEKVQHLMSERKIS
ANMMLTHTFKYDELAEIYEEKITKNQSLIKSVVLY
>MS0480 thdF, ThdF protein
MMTKETIVAQATPIGRGGVGILRVSGPLATEVAKAVVDKELKPRMANYLP
FKDEDGTILDQGIALYFKSPNSFTGEDVVEFQGHGGQVVLDLLLKRILQV
KGVRLARPGEFSEQAFLNDKLDLAQAEAIADLINASSEQAARSALKSLQG
EFSKKINQLVDSVIYLRTYVEAAIDFPDEEIDFLADGKIEGHLNDLIGQL
DKVRSEAKQGSILREGMKVVIAGRPNAGKSSLLNALAGREAAIVTDIAGT
TRDVLREHIHIDGMPLHIIDTAGLRDATDEVERIGITRAWNEIEQADRVI
LMLDSTDPDSKDLDQAKAEFLSKLPGNIPVTIVRNKSDLSGEKESIEEQE
GFTVIRLSAQTQQGVSLLREHLKQSMGYQTGTEGGFLARRRHLEALEHAA
EHLQIGRVQLTQFHAGELLAEELRIVQDYLGEITGKFTSDDLLGNIFSSF
CIGK
>MS1423 thiJ, ThiJ protein
MTTSIKPVLCVVTSAPIKGKSGIPTGFYLAELTHALDEIEKAGLKTVIAS
VRGGQPPIDGFDLTDPVNAKYWNEGDLYERLANTPALSELNGADYSAVFF
AGGHGTMWDFAQSAEVHRIVSEVYTSGGVVSAVCHGPAALVGAKLPNGEF
VVNGKNIAAFTNAEEVEVEGDKLVPYMLQTELEKQGAIHHAAPNWAENVI
VDGQLVTGQNPASAKGVGAALAKVLLEK
>MS0780 tldD, TldD protein
MLNKVVESLLTPSNLSVKDLPNIFDQLAHRHLDYSDLYFQLSQDESWVLE
DGIIKEGGFHIDRGVGVRAISGEKTGFAYSDQINLTSLQQCANAVKGIAP
AEQGRIITPTGFNRVNPILRYAAVNPLDTLTKEQKIELLYLVDKTARGMS
PYVSRVSASLSSIYEEVLVAATDGTLAADIRPLVRLSVSVLVEKEGKRER
GSAGAGGRFGLNWFLESFEGEVRAVSFTKEAVRQALVNLEAIPAPAGLMP
VVLGAGWPGVLLHEAVGHGLEGDFNRKESSLFSGKIGELVTSPLCTIVDD
GTLENRRGSLTIDDEGTPSQRNVLIENGILKGYMQDKMNARLMGVAPTGN
GRRESYANLPMPRMTNTYMLSGDSKFEDLIGSIDRGIFASHFGGGQVDIT
SGKFTFSTTEAYLIEKGKITRPVKGATLIGSGIEVMQQVSMVADNMEIDH
GIGVCGKEGQSVPVGVGQPALKIERITVGGTN
>MS0682 tldD, TldD protein
MEISQNQTALLKQQEQALRDAVSYAVEIAQKAGASAEVAVTKVNGLSVST
RLKEVENVEFNNDGALGISVYLGQQKGNASTSDLSKDAIKNAVEAALAIA
KYTSPDECAGLADKELMAFEAPSLALYNPAEVDVDQAIELALQAETAALN
YDKRIVNSNGASFNSHNGVRVYGNSYGMLQSYLSSRYSISCSVLSGIDDE
LENDYEYTVSRDLNALESPVWVGENAAKKAVARLQPRKITTQEAPVIFLN
DVATGLIGSLAGAISGGSLYRKASFLLDHLGRQILPDWFHISERPHLTGR
LASTPFDSEGVKTQSREIVEQGILRTYLLTSYSGRKLGMQSTGHAGGIHN
WLVRPNANGDLDSLLRQMGRGLLVTDLMGQGVNMVTGDYSRGAAGFWVEN
GEIQYPVAEITIAGRLKDMLRDIVAVGDDIEQRSNIQTGSILLESMKISG
N
>MS2335 torD, TorD protein
MVKNTALLSLKQQKSAMNFKEILMDNALLQWISTGGRLLGAVFYYEPKDK
RVQPVLDFFRQPDWTKDWATLANPALINALIEKSAQQDLSQAYQYLFIGP
NELPAPPWGSVYLDKESVIFGDSLLALRDFLTVHQIEFIQTQNEPEDHLG
LMLMLAAYLAENKPELLEEFLTKHLFSWVYRCLDLIFAQTDYPFYQAMAL
LARQTLKGWQQQLDLQVDQPQLYR
>MS0837 torD, TorD protein
MSETIINNFSLISRLFGNLFYRSPTDSILDGVFGWLQQKGLEQVWPLDTD
EDVRQALDSVQMTIAKEVLAQEYERLFAGEQPKIDSRISAYGLNVDEFIN
FRQTRRMPEVESADNFSLLLLTASWIEDNLDSISAQQELFESFLLPCASK
FLTHVETYALLPFYRSLALLTREILAAMADELEENE
>MS0137 uup, Uup protein
MIFFSNLTLKRGLNLLLEEANATINPKQKVGLVGKNGCGKSSLFSLLKKE
NQPEGGEINYPADWAVSWVNQETPALNISALDYVIEGDRTYCRLQKELKL
ANEHNDGNAIARIHGQLDIIDAWTVQSRASALLHGLGFSQEELGRPVKSF
SGGWRMRLNLAQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLV
LISHDRDFLDPIVNKIIHIEDKKLNEYTGDYSSFELQRAEKLAQQNALFR
QQQDKIAHLQKYIDRFKAKATKAKQAQSRMKALERMERIAPAHVDNPFTF
EFREPLSLPNPLVMIDKASAGYGEGESAVEILQKIKLNLVPGSRIGLLGK
NGAGKSTLIKLLAGELTARSGVLQLAKGVQLGYFAQHQLDTLRADESALW
HLQKLAPQQTEQELRNYLGGFAFHGDKVKDPVKQFSGGEKARLVLALIVW
QRPNLLLLDEPTNHLDLDMRQALTEALVDYQGSLVVVSHDRHLLRNTVEE
FYLVHDKQVEEFNGDLEDYAKWLNDLNVQEKSAVKNTEVSKESNNENSGQ
NRKEQKRREAELRQQTAPIRKQIAKFETEMDKLTAQLTEIEVRLADSGLY
QTENKEKLTALLTQQVQTRKALEEAEAHWLTAQEELETLLAE
>MS1240 uup, Uup protein
MGFYMSSQFVFTMHRVGKVVPPKRHILKDISLSFFPGAKIGVLGLNGAGK
STLLRIMAGVDKEFEGEARPQPGIKIGYLPQEPKLDPQQTVREAIEEAVS
EVKSALTRLDEVYALYADPDADFDKLAAEQAKLEAVIQAHDGHNLDNQLE
RAADALRLPEWEAKIENLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDA
ESVAWLERFLHDYEGTVVAITHDRYFLDNVAGWILELDRGEGIPWEGNYS
SWLEQKEKRLAQEQAQESARQKSIEKELEWVRQNPKGRQAKSKARMARFE
ELNSGEYQKRNETNELFIPPGPRLGDKVLEVEHLTKSYGERTLIDDLSFS
IPKGAIVGIIGPNGAGKSTLFRMLSGKEQPDSGSITLGETVVLASVDQFR
DAMDDKKTVWEEVSNGQDILTIGNFEIPSRAYVGRFNFKGVDQQKRVGEL
SGGERGRLHLAKLLQRGGNVLLLDEPTNDLDVETLRALENAILEFPGCAM
VISHDRWFLDRIATHILDYGDEGKVTFYEGNFSDYEEWKKKTFGAESTQP
HRMKYKRIAK
>MS0840 uup, Uup protein
MALISLTNGYLSFSDAPLLDHADLHIEPRERVCLVGRNGAGKSTLLKIIA
GDVVMDDGKIQYERDLIVSRLEQDPPSHAQGNVFDYVAEGIGHLADLLKE
YHHISTLLESDYNDNLLSKLAQVQSRLEHENGWQFENKINEVLGKLELNP
NTLLSELSGGWLRKAALARALVCNPDVLLLDEPTNHLDVDAIEWLETFLL
DFAGSIVFISHDRSFIRKMATRIVDLDRGKLVSYPGDYDLYLTTKEENLR
VEALQNELFDKRLAQEEVWIRQGIKARRTRNEGRVRALKMLREERRQRRE
VLGSAKLQLDTSSRSGKIVFEVEDASYAIAGKQLLSHFSTTILRGDKIAL
VGPNGCGKTTFIKLLLGELQPTSGHIRCGTKLDIAYFDQYRADLDPEKTV
MDNVADGKQDIEVNGVKRHVLGYLQDFLFPPKRAMTPVKALSGGERNRLL
LAKLLLKPNNLLILDEPTNDLDIETLELLEDILADYQGTLLIVSHDRQFI
DNVATECYMFEGNGQLSKYVGGFFDAKQQQENALTSKMASEQAKPKKMQP
ESAVEKSEISTANNNQKTIKLSYKEQRELERLPQLLEELEKMIENLQNEV
GNPDFFQQSHEYTSAKLQELADKEAELENAFIRWEELEEKKKGNLS
>MS1094 vapI, VapI protein
MLSPRKISEIATGKRPITADVAVRLALFFGTDAESWLNLQSHYDIKKSEE
EIKTDIESILDSSIDGYLNI
>MS1499 wbbJ, WbbJ protein
MTYYQHPSAIIDEGAEIGEGSRVWHFAHICGGAKIGKGVSLGQNVFVGNK
VRIGDHCKVQNNVSVYDNVYLEEGVFCGPSMVFTNVYNPRSLIERKSEYK
DTLVKKGATLGANSTIVCGVTVGAYAFVGAGAVINRDVPDYALMVGVPAK
QIGWMSEYGEQLELPLSGQAETKCPHTGAIYRLEGHELKKL
>MS0417 wbbJ, WbbJ protein
MATEKEKMLAGLAHLPMEEHLSALRLQTKELLFDFNMLRPSNKLEKTHLL
RKILGKAGKNIHVNSPFHCDYGCNIEVGDNFFANYHCVILDNGGVKIGND
VMFAPNVSLYTVGHPLDAELRNQGWEQAKPIIIGNNVWIGGNVVILPGVV
IGDNVVIGAGSVVTKDIPANSLALGNPCKVLRQITAADREYYQQTFMQNN
>MS2128 wbbJ, WbbJ protein
MVGRNAHPTSKGNMMDYTLNLPLNQLIAQNSELFSKIHQVVDKNAPLVAE
LNSGFRTQNEIRAILNEMTGTEIDASFHVNLPLYTDFSAHIRIGKRVFIN
TAVMLTDLGGITLEDDVLIGPRVNIITVDHPIDPAQRRGVIVKPVVIKKN
AWIGAGATILAGVTVGENAIVAAGAVVNKDVPANTIVGGIPAKLIKEI
>MS0323 wecD, WecD protein
MKIFKAEQWNLEVLLPLFEEYRLSHGMVENPERTFTFLNNRIRFSESIIF
IATNERQQAIGFIQLYPRLSSLQLQRYWQLTDIFVQDVANQNEIYAGLIE
KAKEFVCFTHSTRLVVEQDQQHQGIWEKEGFKLNTKKALFELKL
>MS2102 wecD, WecD protein
MMQTIDQFIAQYIPAAYALNLRVVESSPQRVVIKAPFECNSNHHHTIFGG
SQALLATLSAWSLVYLNFPEANGNIVIRSSQIRYLKPAPSDIIAVSICPD
SLAMNLAKQMLTQKGKAKITIQCQLYCDDIIVSEWTGEFVLSHTPF