Gene list
Applied filters:
COG category: Transcription
Gene type: CDS
Genomic element: chromosome
Number of genes found: 133
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Mannheimia succiniciproducens MBEL55E, MBEL55E >MS1402 unknown MRYNSSFQITLSRQMNKPNITIQPIQASHYADYVALIGKQLGEGYFKQAD FEALANNPQAICFEAVDEQNQVVGVITSVTLDRESALALLKIQAQNTPDY VLQSDRIGIFKTIAIDENRKGCGIGSALVRKLLESFKQAGLNSIACVAWQ YGETENIRGIMQAFDFTCYEKIANYWLDDPEPFICPACGEPPCRCQANIY FRQI >MS0260 unknown MLGVLNMTRQNIFIILAFSNEINKMELQDKLLIAMPNLQDSYFSQSVIYI CEHNEQGAMGLVLNQVTDLSIAELVAKLNFMMADGRHYPETYVFAGGPVS MDRGFILHTATERTFEHSYRVTDNLQLTTSEDVIETFGTPEAPEKYLVAL GCATWTSGQLEKEIADNDWLVVPANNHILFDVPWAECWTAAQQLLGFQPA NLVAEAGYC >MS0143 unknown MIFKEIKMNNFFERGNVLAAACPSRQILQHLTSRWGGLVLIALRSGTKRF SELRKTIDGVSERMLTQTLQQLEEDGMLVRKSYNTVPPHVDYTLTEFGAQ ASEKMFELVDWLESNLNDILTHKVSKQ >MS0313 unknown MKKISLFLTALLAASSALAANNQAAPQQENAKTEFMFGAKAANDPVGIWQ KDGRHFSKKDLSKQFCWTLTNFRSDSGNVNITITLTSPKNTNFNLGEHIS KNTTTHIFNFTYPITQTYYNCWAFEESDPEGKYTLTVKANNTTFPTQVFT LTK >MS1430 unknown MNPTVVVSNIKTLLLAVLSGRIFAPVMQHDCQPYLDSGELEIVFPNLESQ MWGIYLYRPYQTITPKRVLVVFEILERLLKMHSEQ >MS0118 unknown MGGRRRNKMSEKINSTQRALRILKALKGRTLTGLSNKELADRLNESPVNI TRSLQALIAEGLVVKLEETGRFALSIQMLQIAVTHQRDTEKMQARMAEMD QRVNAGAF >MS0409 unknown MTSEDPIYEKLNETTSIRGFITACVAIFDESVDQLINRVFRKTDFAVKSV VDSLFINSGPLFDLSIRLKVLLGLGIISHETFMDINAFIQLKEALNNDGK EYEFFDPIIISFIQGLNVRQDKSFLNLDTKIDGTKDSLLYQVKVLRREKL IRSYLILSVTDLYDQLQVESPL >MS1431 unknown MSEPIVATPALIKRMGKPKDVFDLAANFPVGYILNPQTGKVWDWMLGR >MS0113 unknown MYSTVKHIVPQGEKRNMTNTVKTANQIKLEFHQQGKTISSWAKENGYSRT DVSRVINGLAKGQRGKTLEIAVKLGMVIL >MS0831 unknown MMNYVAHAKDQALTAHHDLFSYHPMPFYEDTEQTRSRFHKKLDLNLYCIK RPQQTCFIRVQNPDLMAWGIEQGDMLVVEKNDSLSIGDLIVIEVNQKLEI FEFIAYDKNEFVFLSLSSKLNNIRTANWSTLPIIGTVTNTIHQMKPKNTI SFAA >MS2207 unknown MLNQQISQVIAAELSVQPKQILAAVQLLDEGNTIPFIARYRKEVTGGLDD TQLRHFETRLIYLRELEDRRQTILKSIDEQGKLSDELRAKINATLSKNEL EDLYLPYKPKRRTKGQIAIEAGLEPLADLLWSEPEHEPESAALAYVDANK GVPDTKAALDGARYILMERFAEDAQLLAKVRQYLQHNAVLVSKVIEGKET DGAKFQDYFDHQELLKNVPSHRALAMFRGRNEGFLQLSLNADPDAEEGSR SSYCEEIIREHLAVRLTGLPADKWRAQVIAWTWKIKVSLHLETDLMGSLR EKAEDEAIDVFARNLTALLMAAPAGAKTTIGLDPGLRTGVKVAVVDSTGK LLATDTIYPHTGRMNEAMVSLYQLGKKYHAELIAIGNGTASRETERFAKE VIKQSTDWSAQTVVVSEAGASVYSASEFAAAEFPELDVSLRGAVSIARRL QDPLAELVKIEPKAIGVGQYQHDVNQSQLARKLDAVVEDCVNAVGVDLNT ASAPLLTRVAGMTKVLAQNIVAYRDENGCFESRQQLLSVPRLGPKAFEQC AGFMRILNGKNPLDASGVHPETYAVVENILQVTEQSIRDLMGNSNALRRL DATQFTNEKFGLPTVQDIFKELEKPGRDPRGEFKTATFMEGVEEITDLKA GMILEGTITNVTNFGAFVDIGVHQDGLVHISSLSDKFVEDPHQVVKTGDV VKVKVLEVDVARRRIALTMRLDESAVKNSEKSDRTLSTKSGQDRNRRDNR QPQRNQFANNVFADALKGWKK >MS0116 unknown MPYFTTTQQGEEMNHTDFQPLPYPQTPESARAYFNLHGINRSEWARYFGI DQQAISDHLRGRLKGTWGKSHKVAVLLGLKPNPETKVTA >MS1375 unknown MHCPFCSTEETKVIDSRLVSDGYQVRRRRECTKCHERFTTFETAELVVPK IIKNNGMREPFNEDKLRRGIQHALEKRPVSADDVEKAISHITHQLRATGE REVPSKLVGSLVMEELKKLDKVAYIRFASVYLSFENINEFSNEIEKLKD >MS1448 unknown MTTKKQAVFSRLVNELVQKNQGKRIFSFDFENQTYWVKQPEKLTGVWKIL KPHPKQSFREELHILKNLYERGAPVPQVILSGEDFFVLKDVGPTLNHWIE NAGLNLTPAEKNQILVDAIKALTSLHKKGVTHGRPAIRDIAWRQGKVTFM DFESHSRSLNLQWHKIRDVLVFIHSLCRSKHLSGEQIQYLINKYEEYCES DLWQDVLNLVAKFRFLYYILLVFKPVARMDLIAIYRLFQYLLPLTEENK >MS1727 unknown MKLTSKGRYAVTAILDIAINAEDGPVTLSDISERQNISLSYLEQLFAKLR RHGLVKSVRGPGGGYQLGQPSGQISIGMIIAAVNENISVTKCLGQGNCQG GKVCLTHHLWAELSDRIENFLNEITLEELVSKQHSQKTHTDFDNLLVVDN >MS2211 acrR, AcrR protein MKKNLNFVVKESITEALLRLMAKKNFDEINITAITELAGVSRISFYRNFD SKEDVLIKYMYVRAKELYKPFESQDVSVRDKLIGMFKSIEGMEDIINLLY AQNLSHIFLQYFNFVRGAKPEQENLDAYQNSIVVGVCFGALDEWIKRGRQ ETPEQMVDLLQNVIWGFVKE >MS2295 acrR, AcrR protein MEQKLSPKQKGRPRTFDREKALESALFVFWNQGYTNTSIADLCNAININP PSLYAAFGNKSQFFIEILDYYRRVYWDVIYAKMDVEKDIHRAIHIFFRDS VNVVTVANTPGGCLSAVATLNLSAEETKIQQNMRQLKSDILKRFENRLKR AIVDKQLPSQTDIPALALALQTYLYGIAIQAQAGTSKDDLLKVASKAGLL LPKLI >MS1936 acrR, AcrR protein MAEQLTLDSIEPEPEKQSAKIEKRSIKERRQQVLTVLTHLLHSEKGMERM TTARLAKEVGVSEAALYRYFPSKTKMFEALIENIESSLFSRISYSIKMET NTLNRVHDILQMIFDFARKNPGLTRVLTGHALMFEEAKLQARVALFFDRL ELQFVNILQMRKLREGKTFPIDERTIATYLVTFCEGQFMRLVRTNFRHMP NQGFEQQWRFIEPLFE >MS1300 acrR, AcrR protein MKQDIRITKTLGLIRHVFLELLEEKGFEHIVVQDILDRAQINRSTFYKHF QNKHAVALMLVDEIKQLLTENFENRFSIPTTEFAQKMVPIFWQHRDLIHL IGKIENPRIHLYKDLALVIKEEYIKQAVREQPQSSEELDFQGYLFAIVSL GTIRYFVEKGELPDPSVIVGDIESVFNLLIIK >MS0453 acrR, AcrR protein MRQSETDMAEQIFAATERLMAKDGLHHLSMHKIAKEARISAGTIYIYFKS KEELLEQFAWRVFSLFQTALEKDYDETLSYFEQYKKMWLNVWYFLQDNPN IVMNMQQYQSLPGFFDICKEMDYNSRWATFCQKAQQAGAVCELSVSILFS LSMESAMNLAFKKLYINEFLADEELMTIIERTWRSIQK >MS0153 acrR, AcrR protein MINMAGVRAIQKEKTRRALIDAAFNQLNAEKSFSNLSLREVAREAGIAPT SFYRHFKDMDELGLTMVDEAGLTLRQLMRQARKRIEKGGSVIVISVETFF EFIAHSPNVFRLLLRESSGTSQAFRTAAAREIKHFVDELAEYLANKNNYS EYVAYVQSEGMVTIVFTAGANALDMNNKERELLKERLILQLRMLAKGAHH HMMERERHNTHLPATGKS >MS2131 araC, AraC protein MLNWLIRQTLKLKSGEKGTMGIETPVPELFVFHSETDLRDVSQLQESGIC LILQGRKDVRVGDQHYRYQAGEFVCYTVDLPIMTEYLTDDGGYLDLRLFF DLPLMREIIDELNRQNFSFAPASQQKIVSTASPELIRAFEMLICLTENSQ DLPIMLPLIKKAIYFYLLTGEQGGTLRQIALQNSNSQRIVETVGWLKEHY NESFDIEQLAAASSMSISGFYAQFRRLTGMSPLQYQKNLRLTKANALLKL GQKNISEIAFEIGYDSLPQFSREYKRYFGHSPRSDLSRAG >MS2322 araC, AraC protein MIQKLLARDFFNNKEQPIILEPRAPQEIFPEHTHDFDELVIVKHGSGRHI LNGYPHDLYPGVVLYIQAQDHHSYENLQDLCLTNILIQSNNNFKYLNNID ILLNGLKPENSSYQLINKKTAEYIDSLLEKINAIDESYNLQNECLFFQVL SSIQAHQFNDSGYGNTEEKGRQMIRWLENNFEKEIDWEELAEKFALPIRT LHRYIKSQTGHTPQNYVTKLRLAQAYYQLKYTEKNIINIAYDCGFNDSSY FSTCFKNEYSIAPRELRI >MS2323 araC, AraC protein MTDILQLSHHSYFISEESPITVERRHYQPPFPLHRHDFNEIVIISAGNGI HFWNDEIHPITTGNVLYIESGDKHKYGEVDKLKLDNILYRPEKLSLFPIM KDYIPHNNEKKSLRINQETLVQLQSLISQLEIESKKTNKSSMHLSEAIFL QILILICRTQQQENKAYSDISKLESLFSALNQSISQEFYLADFCRQHQLA VSSVRRIFKQQTNMTIAQYLQKLRLCRAATLLRNTSESVANIAIRCGYSD SNYFSSVFGKTFSCTPTEYRSRFIKK >MS2173 araC, AraC protein MPKPLILSRKNLANLGSVIQQRKLLYTRMAVDEPTLLYIQVGQKTLRWRG QELTIQAGEMVLLAAGQTFDVLNNPDAKLGFYQAGWIALEQRVVDEFADL FGVETYVQELAKIQPLAPLKAHFDVVRQALENDEAPELVLKLKLFELLAW LKAEHLSFVPHEKHNLLRQIRKMIASNTAFEWTAETIARQLHLSETSLRR ALQKSDTTFREVLTDVRMSRALTLLQITKWQVARIANEVGYDSPSRFTVR FKQRFGFLPSDIRENLSQPVQNEQQKLVRIGVKK >MS1229 araC, AraC protein MRFCWYTSDNNKSVVNQINMKTSHLAKQTSTELADKSGSEIISPLSLSLD ARPFNVEIQQPPGNMPAYHWHGHIEINIPFDDDVEYSFNEHSTLINAGHI SIFWASIPHRLTDKHNCRTMAVFNIPVYQFLSWQLSQNLINHITHGIIIQ SKNPRLVSLFEVQRWEQELKLEDPNRHKLVYDEIQLMIKRVSLDGWLLLL EPPKKNNHQLSGSKHAQNYVRTMLDYIANHYNAPLTVQSVANAVGLNTNY AMGLFQSAMQLTIKQYIIMMRINHAKALLSDTNRSVLDISLTTGFSSMSR FYDNFLKYTGVSPNKYRKQIRADDNWSAQGLIPTTQAIKGASTGEKLIMT GEHFNQSEEF >MS1400 araC, AraC protein MANIRQNQSISELHYQPHKHHPYGIELFTVASLRARSAEVVMEKNYLYQC DMIIVVTQGSGTLWQDFEPVACMQGSVLWIKQGQACSFGNDKHWDGWVLM IKNKPLLSEFDYQINTLWLSENELENVEQSLKQLKQDSEKPYSIVHKQLI HHQFYAFLWRLISLTPNQTILYSPRLRSRFDSFQSLLESYFHEWHHVHQY ATALACSEKTLSRACLEITQQPAKTVINNRLLLEAKRLLVQSNQSIASIS LQLGFNEATHFVKFFKREAGITPQKFRELG >MS2105 araC, AraC protein MSGILFLLITCLFIQIIMFSEQSFARLLDVIPHNQTYHSPIKGLIIHHSD HPFSYDNVIQEPSICIVIRGEREVQLGNQCYLFDNRHFMFCPVNVPMCGK VLQATAEEPFVVMSMKIDLQAVNKILLEQTALLAKNSENPTAFGQWHLDA ELENAFERLLLLHENTKDITFLAPLIQQEIYYRLLTGEQGDKLKQMVSFG SNTQKIAKATEYLKAHYIETITVESLAELCGMSLSGFHNHFKKHTTLSPL QYQKSLRLMEANRLISQENLPISTAAFQVGYESPSQFSREYKRYFGKAPS VR >MS0060 araC, AraC protein MKYQREVQQETNPLLPGYQFGSYLVAGCTPIEKGNEVDFAIRRPNGMKGY IINLTTKGEGTVFEGDRAFTCCKGDLLLFPPNAEHLYYRSQSSESWHHQW IYFRPRSFWANWLQWSHISDHVGRLTITDPTTYEEILALFKKIEREYNAK DIFSEAMSMCLLEQLLIKCIKLDPVNSQRMLDPRILETCHFISANLHINH KITEIAEHIHMSPSRLTHLFAQQTGSSIIKWREEQRMIKAQHLLHTSGAP IYAIARQLGYDDQLYFSRLFKRYSGLSPSDYRNSR >MS1267 argR, ArgR protein MAIEKTDNLLTVFKDLLSQERFGSQSEIVSALQDLGFSNINQSKVSRMLT KFGAIRTRNTRMEMVYCLPNELSVPNTSSPLKNLVLDIDHNDFLIVIKTS PGAAQLIARLLDSVGKTEGILGTIAGDDTIFITPTKGTGIKELINTIQQL FENSL >MS1708 bolA, BolA protein MEITMETQEIERILKQALNLDEVYVQGENAHYGVIVVSEEIAKLSRLKQQ QTIYAPLMDHFSSGEIHALTIKTFSPEKWKLERMLNVVN >MS2285 citB, CitB protein MRKFYRTFFQNATIRRSKAYPGEITMIKVALIDDHVIVRSGFAQLLSLEE DIEVVGEFGSAKETRQNLPRIKADVCIIDISMPDESGLDLLKSIPSGIHC IMLSVNDSEMIVKKALELGAKGYLSKRCSPEELVQAVRTVYTGGVYLMPE LTVKLVTNKNNNPIQQLTKRELEICELLIRGLGAKEIGEQLGLSFKTVHA HRANAMSKLDVKNNVELANLFHQYS >MS0278 citB, CitB protein MHGIAWGYCFIIVKVRNMSEKTKVIVIDDHPLMRRGIKQLIELEEQFEVV GDAGSGNEGVELAIKTSPDLIILDLNMKGLSGLDTLKVLRQEGVDARIVI LTVSDSKADIYALIDAGADGYLLKDTEPDTLLAQIKQIAQGEIILSDSIK NLLVERHPAHEPIHALTDREMDVLQLIATGLSNKQIAAQLFISEETVKVH IRNLLRKLNVHSRVAATVLYLEYKGS >MS1144 cspC, CspC protein MSKLNGLVKWFNSDKGFGFITPADGSKDLFVHFSSILGNNYRSLNEGDRV EYNVENTQRGPAAVEVAVIK >MS1095 cspC, CspC protein MEVGVVKWFNNAKGFGFINAEGSDADIFAHYSVIEMDGYRSLKAGQKVNF EVVHGEKGSHATKIIPILE >MS1361 dinG, DinG protein MANIDQIKAAFSERGQLSSNIKDFRPRSEQLEMAEAVGKAIENKGVLVVE AGTGTGKTFAYLTPALLSKKKTIVSTGSKNLQDQLFKRDLPTIQKALNYS GKIALLKGRANYLCLERLDQVIAQGVLGDKSVLVDLSKVRKWNNATKTGD LSECVELAEDSPILPQLTSTTESCLGSDCPNYGDCYVAAARKRALAADLV VVNHHLFCADMAVKENGFGELIPNAEVIIFDEAHQLPDIASQYFGQSITS RQLFDLCKDINIVYRTEIKDMPQLGVASDHLLKMVQDFRLLLGEGNNRGN WREWLVKPDVQKGFKVLQEKLDFIADVVKLALGRSQTLDSIFERISALKA QLVRLSDTSVTGYCYWFETFNRQFGLHITPLTVSDKFGEQMNNHESAWIF TSATLEVGGSFNHFRQRLGIRATDEKVLQSPFNYPEQALLCVPRYLPGSN QNHTMTKLAEMLLPVIEANKGRCFVLCTSYFMMKGFAEYFREHSGLSILL QGEISKTKLLEQFVSEEHSVLVATSSFWEGIDVRGDALSLVIIDKLPFTS PDEPLLKARVEDCQLQGGNPFNDIQIPEAVIALKQGGGRLIRDVTDSGAV IICDSRLVTRPYGETFLKSLPNAKRTRDLNKVVEFLKSIQQNRT >MS0431 fadR, FadR protein MFTQKSANSSPSVLKARSPAALAEEYIVKSIWSNFYPPGTDLPAERELAE KIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNDVWQTSGLNILDVLVRL DSTMSPTLIANMLSARTNIAIIYIPRAFKVSYEKALASFDGLENLPETAE SYTAFDYEILHKLAFISLNPIYGMVLNSLKGLYTRVGSYYFAIPEARALA KKFYIELRELGKAHRLDEIPSLFRQYGRESSLIFEAAQDGLAQYLIEN >MS0538 fadR, FadR protein MTDNAELRSYKKIGSILKQELIDGLYQIGERLPPERDLAEKMNVSRTVVR EAIIMLELENLVEVRKGSGVYVINMPLTSEENQDDTYEDVGPFELLQARQ LLESGIAEFAAIQATRSDILRLKEILNKERMTLAEDDKDYTADEEFHSAI AEITQNEILIKLQKELWKYRTKSSMWQGLHAHITDQEYRKSWLQDHQNIL NGIQRKNPALAKKAMWQHLENVKQKLFELSDIEDPDFDGFLFSVNPVVVG L >MS0531 fis, Fis protein MLEQQRSPSDALTVSVLNSQSQVTNKPLRDSVKQALRNYLSQLDGQDVND LYELVLAEVEHPMLDMIMQYTRGNQTRAATMLGINRGTLRKKLKKYGMG >MS0380 glpR, GlpR protein MKLNEKEQLIIDSLKRKDVITNIELSEILQCSTVTIRSLIRSLEKKGLII RTHGGAKLCNDYLDIHIPAGNIFKEREAKLRIAEKAYQYIAERDTIILDD SSNSYYLAQVIKKYSDKYLIIITNSLPVIAELSTCSAVEIISIGGVLRGN KNAFVGDFAIEMLKNFKATKAFIGVHGIDPEFGITSIGNEQMMIKKQIFK IAQYVYVLTCSEKFGTGYLLVSAPLSQVHKIITDKNIDKNILNVIKSSVD IDLV >MS2316 glpR, GlpR protein MREKKVKPRERQSAIVEFLQINGKTAVEQLAQIFKTTGTTIRKDLTALEA EKKVLRAYGSVVLVNKDEIDLPEANKTNTNLEVKRRIGQKATEFIGDGDS LLMDSGTTVLQMVPYLAKYRDLTIMTNSLHIMNALTGLERDYELLITGGT YRQKSASFHGILAESTVEKFTFDKLFIGTNSFDLDYGLTTFNEVHGVSKS MCKAAREIIVLADSSKFQRRSPNVVCPLEKINTIVTDKNLDPAIHQALIE KNINVILV >MS2186 glpR, GlpR protein MKQSIRHQKIVELVKLQGYISTDELVTLLNVSPQTIRRDLNELAENNLIR RHHGGAASPSSAENSDYSERKLFFSLEKNHIAQAVSRLIPNGSSLFIDIG TTSEAVANALLGHQNLRIVTNNLNAAHILMKNDTFKITVAGGSLRQDGGI IGEATVNFISQFRLDYGILGISSIDLDGSLLDYDYHEVQVKRAIMESSRE TVLVTDHSKFSRQAIVKLASVTDVDYLFTDQEPPKSIMELIHNSSVELRV CK >MS0024 glpR, GlpR protein MVRSNIMNEQIRHNKLLTLLGENGFLSVQEIMTALNISPATARRDITKLN EQGRLKKLRNGAEAVIQSTFQPQKKQNEIKNLDEKQRIAALAASLCQNDS SAILTCGSTMLLLGNALCNRNVQIITNYLPLANQLIENDHERVVIMGGQY NKSQAITLSLSEHNEAFAADIMFTSGKGLTAQGLYKTDMVIASSEQRLLK RAQKLIVLVDSSKLDKTVGMLFTELKNIDLIITGQEADPDFIRTLREKGV DVMLA >MS1983 glpR, GlpR protein MIPAERQKMLLNLISQQDIVSISQLVETLGVSHMTVRRDIQKLEEEGKVV SVSGGVKMLEHLSIEPTHNDKSLLSPSQKSQIGIKASEIIPEKTTIYLDA GTTTLEIAHHIVDREDLLVITNDFVIANFLMKAGKCELIHTGGSVNKSNY SSVGELAAQFLRQISIDIAFISTSSWNLKGLTTPDENKLPVKRAILQSSN KRILVSDSSKYGKVATFQICPLSEFDVIICDSDLLENAKDAINEMRIELL LV >MS0074 glpR, GlpR protein MSVDRQNAIKLFLRSHNMATVEQLVKITNSSPATIRRDLIKLDDAGIINR THGGVSLRDSFPYQPTTNEKQYQHVTEKENIADYVVSLISPGDSVLLDAG TTTLCIAKKLVNIPLRVITSDLHIALLLSEYKQIDIVMTGGAIDKSSQSC IGQHGLDLLQNINPDFAFVSCNSWSIERGITAPTEDKANLKKCLLQNSRR KVLVADSSKYGKCSLFKVIELNRLTDIITDHNLPQSAQKALNELDLSVAF A >MS0187 glpR, GlpR protein MKRNFQQRNTQQRRHGIMQLLQQKGEVSVEQLVQLFETSEVTIRKDLTAL ESNGFLLRRYGGAILMPQDLMDESQDENLSKQKLSIAKAAAERIRDHHRI IIDSGSTTAALIKQLNSKQGLVVMTNSLSVASELRSLENEPTLLMTGGTW DTRSESFQGKVAEQVLRSYDFDQLFIGADGIDLARGTTTFNELVELSRVM AEVSREVIVMVESQKIGRKMPNLELNWQQIDVLVTDDLLSEKDKAVIERH NIEVIIAK >MS0524 gntR, GntR protein MFFIKNKDNMSRDLNLRQDIINQMIDDISSDLLTSPLPSLSALATLYNVS RTTIRHAITYLTEQKIINRIDAQLIITKKPSADDKITYIKIKKPGNNQIK KLEKYFSSAVQQKIIKPGDDFTELELAKNANVDIFTVREYLIQFSRFNLI SHISAGKWRLTKLTQHYADKLFELREMLECHALNCFMNLPKNDIRWKQMK LLLQEHRILRNNIVEKYVDFSLLDQQLHSLILSAADNPFINDFINLISVI FHFHYQWDNSNLRTRNILAVEEHLAILVKIVSQDDLGAITELKRHLQTAK NGLMNSIRLMNN >MS2208 greA, GreA protein MAKSNYITRAGWNVLDQELKYLWKDERPKVTQAVSDAAAMGDRSENAEYI YGKRRLREIDRRVRFLSKRLEVLQIVDYNPKQEGKVFFGAWIELENESGE IKQYRIVGCDEFDPAKNWISIDSPVARALIGKQIDDEVRVETPAGKVLLY VNNIWYEK >MS0961 greA, GreA protein MKQIPMTVRGAELLKQELDFLKTTRRPEIIKAIAEAREHGDLKENAEYHA AREQQGFCEGRIQEIESKLSNCQIIDVTKLPNNGKVIFGATVVLVNTEND DEVTYQIVGDDEADIKSGLISVNSPIARGLIGKEVDETVSIVVPGGKVEF DIIEVNYI >MS0195 hepA, HepA protein MSFAVGQRWISESENDLGLGVIVGMDNRTVTILFPASDEQRVYALAAAPL TRVEFQKGDTVVHHEGWKAQIIDVTENNGVLIYLTIRLDTQEEAVLREMD LAHKISFSKPQERLFGAQIDRSDRFTLRYHALQQQQAQFQSPLRGLRGIR AGLIPHQLHIANEVGRRVNPRVLLADEVGLGKTIEAGMILQQQLFAGKVE RVLIIVPETLQHQWLVEMLRRFNLHFSLFDEERAADFAANEYDEERNPFE SENLIICSLDWIVAQPKRAQQILQAEFDMLIVDEAHHLVWSERQPSMAYQ VVEQLSRRIPAILLLTATPEQLGQESHFARLALLDPDRFYNYDAFVAEQK NYQPVAEAVQTLLNEKPLNTAEQNAIADLLEEQDVEPLFKVINSMAEESE RLQARQELIDNLVDRHGTSRILFRNTRQGVKGFPHRIYNQVTVEMPKQYV NAVKVMNLLGEEIGDGLFYPEQIFQKMNPEAKWWEFDPRLEWLITFLKNH REEKVLVICRHANTAIQLEQALREKEAIRSAVFHENMSIVERDRASAYFA LQEEGAQVLLSSSIGSEGRNFQFACHLVLFNLPDNPDLLEQCIGRLDRIG QTRDIRIHTPCFADTPQVVLARWYHEGLNAFEETCPMGMTIFTECGEKLK NFVKNPTQLDGFEEFVAQTRKRQQVLKQELENGRDRLLELNSNGGERAQK LAEHIADEDNSTALVNFVLNLFDVIGIEQEDLGEKSIAIIPASTMLVPDF PGLKEEGVTVTFDRRLSLAREELEFLTWDHPIVTNGIDLITSGDIGKTAV SLLINKSLPPGTLLLELIYVVESQSPKGLQLTRFLPPTPVRLLLDAKGNN LAAQVSFQALEKQLRPVKRNMANKMAKMIRPNIERLIAGGDKHIAEQARE IIQSAKQKADQTLSAELDRLNALKAVNKNIRQDEIDILAQIREQSLTQLD QANWRLDSLRVIVSNKE >MS0112 hipB, HipB protein MNLSSLFSVRLKNERNRLGLTQAEIAKKCGVSREMWGKYERGVALAGSEV LFSLAAIGVDMDYILLGTRKEVFEEITTEALKDMPKADFSDKTGLLVQLF MQCDDNGRAAILSVAQTMAGMANKTGHQNSDSTGGQSFAGDVHGGQFSTG TINNYGEKK >MS1463 hypB, HypB protein MCTTCGCGHPEQVRIGELQHTHSHSEHQSAVKMPDFSQSVFHSMKPSIHE HAGEQDNTQKRLLKIEQDVLGKNNRIADSNRNLFNYLNLTVFNLVSSPGS GKTSLLTATLNSLKNDRNCYVIEGDQQTENDADRIRATGVPAIQVNTGKG CHLDAQMISDAMMKLRPQENGLLFIENVGNLVCPSEFDLGEKAKVVILSV TEGEDKPLKYPHMFAASKLMILNKVDLLPYLKFDVEKCIENAKRVNPQIE VIQLSAATGEGLQDWLNWLQQ >MS0562 iclR, IclR protein MEKENQPEAVSSVLKVFGIIEALAEQKEIGITELAQRLMMSKSTTYRFLQ TMKTLGFVSQEGETEKYTLTLKLFEVGAKALEYADIIGLANHEMSYISRQ TNETLHLGTLDGTEIIYLHKIDSGYNLRMYSRIGRRNPIYSTAIGKVLLS GLTNKEIRELLADLTFVKHTSKTLENIDQLIEEIEKVRKQHYAEDNEEQE PGLRCVAAPIYNRFGRIIAGLSISIPTIRFEEEKLPQLVNLLQVAGKNIS EQIGYHDYPEILAP >MS0055 iclR, IclR protein MFFIVRRLKEMEKNSGNQSLIRGLRLIEILSRFPNGCPLVQLANISELNK STVHRLLQGLQQEGFVQPAITVGSYRLTSKCLSIGHKIFSSLNIINIISP HLENLNLDLGETINFSMRENDHAIMIYKLEPTTGMMRTRAYIGQHLQLYC SAMGKLYLAYDRPAYLKEYWQTNNDNIQTLTCNTITELPVMEKELDEIKK QGFAVDKEENEIGISCIACPIFNFQNKVEYAMSVSISTSKLNQYGIEHLL EKIKLTAEAISLELGWLPESVQN >MS0744 lexA, LexA protein MSAFCTKKQGIYMKPIKALTARQQEVFNFLKHHIETTGMPPTRAEISREL GFRSPNAAEEYLKALARKGVVEILSGTSRGIRLLVDTEESANDEDAGLPL IGRVAAGEPILAEQHIEGTYKVDADMFKPQADFLLKVYGQSMKDIGILDG DLLAVHSTKDVRNGQVIVARIEDEVTVKRLERKGDVVYLHAENEEFKPIV VNLKEQPNFEIEGIAVGIIRNNAWM >MS1455 lrp, Lrp protein MVNFMEKKLPKALDSIDIKILNELQRNGKISNIDLSKKVGLSPTPCLERV KRLEKQGVIMGYKALLNPELLNSPLLVIVEITLIRGKPDVFEEFNAAVQE LDEIQECHLVSGDFDYLLKTRVADMAAYRKLLGTTLLRLPGVNDTRTYVV MEEVKQTNFLQLK >MS0035 lrp, Lrp protein MYAIDSLDQQILRVLTKDARTPYAEMAKNFGVSPGTIHVRVEKMRQSGII EGTKVRIDERKLGYDVCCFIGIILKSAKDYDKVIKQLEGFDEVVEAYYTT GNYSIFIKVMTHTIAELHSVLATKIQLIEEIQSTETLISMQNPILRDIKP >MS1689 lysR, LysR protein MQSSIYGYLTYFHEIVIEGSIAGAARKLEVAPPAVSNALKLLERHLGLPL FTRTTRKMELTEAGQRLFESTKDMLRGLDSVMESVRDLTEKPSGLVRITT SIISYLLVIRPHFAEFCERYPDIRLEISVNDGIVDIVKEGFDVGMRFGDR LEQNVVAKKLLDPVRLGLYASESYLRKYGKPETLEDLSQHKLLGYRFVTA NRTYPLTFNQDGREISIDMPYSVLTNNLTVELDTVRQGVALGQLFEPVVN ALHDRKNFIPVLDAHWTQYPALYLFYMQHSQKAGKVRALIDFLEEKIKG >MS0763 lysR, LysR protein MKPIFLELRHLKTLLALKETGSVSLAAKRVYLTQSALSHQIKLLEEQYGL PLFERKSNPLRFTAAGDRLLQLANDILPKVVAAERDLSRVKQGEAGELRI AVECHTCFDWLMPAMDSFRQHWPLVELDIVSGFHTDTVGLLLTHRADWAV VSEVEETDGIVHKPLFSYEMVGLCAKDHPLAHKEIWEAEDFADQTWITYP VPDDMLDLLRQVLKPAGINPVRRTSELTIAIIQLVASKRGVAALPFWAAK PYLDRGYVVARKITQNGLYSNLYAAYREEDANSAYLEDFYETVKSQSFST LPGLSVLE >MS2006 lysR, LysR protein MNLDWNDLHYFVLLVEKETLTAAANALDVEHGTVSRRIERLEKQLGLHLF NRINKRYLLTDDGRDLYAEAKKLQLNIKQFAQTAQDKCQSMGEVTVSAPP FVANSLITPLLAHFYRRFRHIRLILNSDSGLSNLHRSQADIALRIAQPKQ DDLVAHRLMNVEYRWFAHRDYLACTPESERQFLSLNLTGTHQQWLQTQLT GKSVRFACNDFNIMKSAVLQQLGVGLLPVCYIDSPDLAAVKNMEYFRAPL YLVMHEDVRQSQKVRMAADFLIENLRD >MS1097 lysR, LysR protein MDKLNAISVFCRIIESQSFTQAAALENISVAMASKLVAQLEEHLKTRLLQ RTTRKIVPTEAGLVYYQRCQPILLELKEADSSISDLSTSLQGNLVVSVPM DFGLKFITPTLPAFISANPNLHVEMEFSDRRVDLMAEGYDLALRIGSLQD STLVAKKLATTSMHFAASAEYLRRYGTPRKPEDLQYHQCLLYKAIGNQIY WEFANKGKIQRVKMRSKMVCNNGLTLVQLAKADLGIINSPRFLVEEELAS GELIEVLPEFKQQLLDIHAVYPHRRHLAAKVKAFVEFLSGLNLGSET >MS2143 lysR, LysR protein MPEMKKTDRFNHLISFTHAARFGSFSAAAEALDLTPAAVSKNVALLEQAL NVRLFNRTTRSLSLTEEGQVFYAESKKALALLEEAVNQITLAESQEIAGN VRISMPNVVGRNLVFPLLKSFNEDYPKIHLELDFDNKAIDFVKAGFDFVL RVGESSEGSLVARHIGMIQTCLVASPAYLKSQGVPKNMADLPQHQLLMTR LPNGKLQPWTFNEQGDNVHFLHAQPHLVLTDAEMQTQAAVQGFGITQLPV YLALPYLQNGELVTILNDSYQPLKLSLNILFPHRTLLAQRVRTTMDYLLE QLKQHEGLRMTQEELKAFSFK >MS1403 lysR, LysR protein MRELRNLDLNLLKAFDVLMDEKSVSKAAQRLSVTQPAMSGILQRLRDSFN DPLFVRVQRGIVPTNRALELRQPIKQLLQSAEQLLQPKIFDPQTAELTLT IACTDYALRAVISPFLAVLKQRAPKIKVAILAINEQNLQSQLEQGVVDFG LVTPDFSAPDIHSKDLYQEQYVCALRKDHPVAQQGSISLEQFCRLEQALV SYQGGSFSGATDKALAKLGLTRNVTVSVQNFIVMPEFLANSDLLAVVPKR LVENLANIHYFEPPLQIDGFTKTLVWHERTHRDPAYRWLRELMAEVC >MS0884 lysR, LysR protein MMLDKVEAVRYFCIAAETLHFRETANRLAISPQVVTRMIAELERELGEPL FKRNTRNISLTDFGQAFLADAQQWLKATETLFQTDFKESMSGTVRITLPR LPNNDVILTELLTALSPYPDLHIDWRPDTALYNSITRQIDIGIRISLEME PHFIAKKITHIKERIVASTALLNRLGQPRDLDDLQNRFPLCAEINPQTGK AWHWFNTAEQSFVAKKPYFMSSESYSNLAVILKGLAIGVLPDYYYLPHVQ TGKLKILFPDLPIPEWKMFLYRPYQENTPLRVIHVFGLLEKILVKHYHTT G >MS2092 lysR, LysR protein MNKLDALKFFITAAETLNFREAAVKLAISPSVVTRTIAELENQLGEPLFK RSTRSITLTSFGELFLPKAKRLLEDSDTLFQTAKDDNEMKGVVRITLFRL PNHEQILFELLTALRPYPELFIDWRLDMMRLDTVEHRIDIGIRVGREPNP NFIIKPIAKVQHIFVAAPDLLERLGAPKDFEDLRQRYPFSGLINPETGKV WEFMLDGVNTFLPRHLEFFSTDPDTQIQAALAGRAVVQASDLACKEYLAN GRLVKVLPQIQQEKWQLYLYRPYQTITPKRVMKVFEVLEGVLRKYLG >MS2008 lysR, LysR protein MHITHCLRTLKALKLQNNVHLCTIKPKEQRRETMNLDWSDIHYFVLMVEK QTLKATAEALQVEHSTVSRRIERLEKQLNVHLFDRINKRYLLTADGQRLY TEAKKLQFNVRQFVQAAQDSLQEMTNVLVSMPPMIAHALVSPHLAAFQQR FPAIRLVLSSNTAISSLHQRQADIALRLVVPQQNDLVVRRLRDMQYGWFA HADYVKNTPESQWQYIDFGVTGPHTPWLNKQLADKSIGFVCNDFAVMQSA VMQKLGIGWLPFEYGNSSEFIQVHTSEIFIGQLHLVMHEDVRHAQKVRDV ADFLIEILRE >MS0336 lysR, LysR protein MLKDKKTWPLIEDLNVFLTIIRKNSFSGAAKELGQSNSYITKRINILEDH LHTSLFYRNTRNIKLTAAGEYVQNQAIAIIDKMDSLMTNIVEDKKSMFGH LHICSSFGFGRTHLAKPISLFAKQHPNLSLDLTLTDHKLDLIKENIDLEI AVGNDLNDRYFAKKLANNRRILCASPDYLQSYGLPKKVEQLSKHNCLFLK EKNSSFGVWKLFNGKILKSITVNGGLTTNNGEVILQWALEGHGIIYRSLW DAEKYLISGELVHILPEYYEDAPIWVVYPNKLSESLKTEIFVNFLTEYFA KKELTKSHDE >MS2151 lysR, LysR protein MNSTEYGQLLIFQAIAKEGSISACARALRISVPAVSKALRQLENRLGVPL FQRSTRKIQLTETGVQLLEQTVQAVDTLSQAFENAKTLAKTPTGTVRITV SQVAFSLILQPVYAEFRERYPHIVLDISINNATVNLIDEQFDLGIRFGNH LEEGIVARRLTGEIREGLFISPQYAQKFGTPKTLADLAHHQLIGYRFITA NRFHPLTLMENGQPHTIEMPMSLILNDSEMAIDAIRQGFGIGRIFEAQYE RLESKIDLLPVLKKHWQTLQPMYLYYQPKSQKVKRVQVLIEFLQEKMEVL GW >MS0044 lysR, LysR protein MRKKPMEFNELKLFLHLAESQNFSRSAAQNHMSTSTLSRQIQRMEDELGE PLFLRDNRRVQLTECGEKFKIFAQQSWNQWQHFKQQIHHNENELNGELKV FCSVTAAYSHLPQVLEKFRLRYPKVEIKLMTGDPALALHQVQSQQVDLSL SGRPLHLPNSIKFHYIDDISLSLIAPRIACPATQLLQHSPIDWQRIPFIL PVEGPARQRIDQWFRQQKIKHPKIYATVAGHEGIVSMVALGCGLALLPDV VIKNSPMNSQVSSLTLDIPVYPFELGVCVQKKSLELPLIKAFWDSLQTEN AG >MS1395 lysR, LysR protein MKENLNDLRAFLVVARTGSFTKAGAQMGVSQSALSHSIRGIEERLNIKLF HRTTRSISTTEAGEQLYQRLSPLFDDIDNELNELSEFRNAVTGTLRINGN EHAFYYALGDKFVRFSQKYPEVNLELVAENRFIDIVAERFDAGIRLGSDV AKDMIAVRLTDKLPMCCVASPEYLANYGTPKTPYDLTEHQCLLHRLSNGG VMNWEFIDPKSKGRILKVQPQGTISANGGRVLENYARSGLGILWCPLDMV EEDIRSGKLIRILQQWDMDYDGYHLYYPNRRQNSPLFKALVEELRLVK >MS2130 lysR, LysR protein MLNKFDALRYFCVAAETLNFRETANRLSVSPSVITRVVNELEAELGEQLF KRHTRSIKLTSFGEQFLLRAQHLLAESETLFKMGKNQADDLAGIVRITVP SWRNNDEIIRQLLITLESYPEIIIDWREDMGKLDMVEDRIDMGLRIGLEP DQDFVVRKITEIGDVLVASPALVKKLGQPTDLTDFERRYPMAIPINSNTG KPWTLFLNEDITLNPKNPAFYSVDNYSALQAVLLGKCAGLINDFMVKPYL EFGELIQLFPEIQIDKWQLFLYRPYQTVTPARVLKVFDLLTEILRKTYY >MS1039 lysR, LysR protein MKIQQLRYIVEIVNQNLNVTEAANALYTSQPGISKQVRLLEDELGLEIFE RNGKHIKTVTPAGKKIVAIARELLVKTQAIKAVANEFTQPNHGVLRIATS NTQARYMLPAVIERFSKQYPNVSLHVHQGSPNQLYDALLSSEVDLAITTE AQYLFDDVVLLPCYMWNRSIIVKADHPLAKLSHVTIEDLGKYPLITYTFG FTGVSDLDQAFNSAGILPNIVFTATDADVIKTYVRLGLGVGIIASMAHTD ADTDLIRIDASHLFKSSMTQIAFKHSTFLRNYMYDFINYFSPHLTRAKVE KAERARDNTAVQKLFEGIDLEVR >MS0154 lysR, LysR protein MNIRDLEYLAALAEYKHFRRAADACHVSQPTLSGQIRKLEDELGITLLER TSRKVLFTQSGLILVEQAKKVLREVKLLKEMASNQGKEMTGPLHLGVIPT VGPYLLPYIMPALKEAFPDLELYLYEAQTSHLLDQLESGRLDCAILATVP ETEPFIEVPIFNERMLLAVSEQHPWAKEKSIKMHALQGHEVLMLDDGHCL RDQALGYCFTAGARENSHFQATSLETLRNMIAANAGMTLMPELAMLNEGT RAGVKYIPCTDPEPKRTIALVYRPGSPLRSRYERVANAVGDAVKAILHTE GD >MS2152 lysR, LysR protein MNDKFSGIEEFLMTVEMGSFSAAAERLNLTGSAVGKSISRLEQRLNTQLF HRSTRKITLTREGEVWLASCRRMMEELEQAKLLLSSQSQQIIGEIRIDLP TTYGRSHILPKLLAIQADYPKLYLNISFQDRKVDMIAEHIDIAVRFGELA DLTDIIAKQIDCFQNQLCATPAFVSKWGKLNHPDDLTHFPCIVGNQISWR LMNEQGKSTGFPLNVQHQINDGDARLQAVLADCGIAFLPDWLIQPAVEAG KLVQLLPEFTPPPEPIYVLWQKKLHLQPKVKAIVNSLV >MS0895 lysR, LysR protein MERKMFKRLPPLNSLKAFESAARFLSFTKAADELCVTQAAVSHQIKLLED FLNIRLFIRKNRSLELTELGKNYFQEISPILQKLADVTEKLKSTDNPHLT ISVLQSFGINWLVPRLNRFNQLYPNIEVRIKSAEQDEGILGNDIDVAIYY GYGNWDNLKTEKLSEDNLLILASPKLLANNPVNSKDDLKHHTLIHVHTRD NWQNMATELGISDLNIHIGPLFSHTFMALQAAVHGQGIVLANSILAQQEI DNGNLQVVLPYELKDPKSFYVVSDTNRTNDQNISAFRQWIMQEMKYN >MS1210 lysR, LysR protein MMNYAAMLHNLPNLNELYFFVQIANAGSFTKAAERLGVTTSALSQNMRSL EKHLDVRLFNRTTRSISTTEAGEKLLAEIAPHFLAIADAVRHLDEIRDEP QGTIRINTSEIAANLIIYPKLQPFLLANPHIKVELVIDNRWVDIVAQGFD MGVRLGYAVFNDMIAVQISEPMKMVLVASPGYLKDKPLPKKINDLTNYHL IGSRFSSEHSQLEWEFMDKGQKVGFQPMPQFSINNDLRTQAALDGFGIAW LPEIRVHEELKNGNLVEILPQYAYTYDPFYIYYPNRKGNSKAFQMVVELL KFKK >MS2176 lysR, LysR protein MQFYSITVPKIAFSYLFHKMKTAKSFIPENEIMNRLDALKYFIVAAETLS FKSTASRFSVSPQVITRVISELEGELGEQLFKRNTRAIRITDFGSRFLAD AIAFLQQEERLFGGVKTAEESLSGLVRITLPPSDYADKILLRLLTALAPY PDIQIDWRTDFDTLKAVDDQIDIGIRISRTPEDHWVAKKITDLQEPIVAA PSLIAKTGLPKDVFDLAANFPVGYILNPKTGKVWDWMMGEQPIILTKPTV ITSDIKSLLPAVLSGRIFAPIMYHDCKSYLDSGELQVVFSNEETLIWGIY LYRPYQTITPKRVLLVFELLEKILEEGF >MS2134 lysR, LysR protein MQKWKDNMKEISLDDMRLFVSVVQSGSLSHAGELTGIPVSRLSRRLTQLE QALGTQLLNRGKKGVSLNELGERFFEHSQQMLQQAELAIESVQKSLENPS GLLRISVAADIFYLFIQPYLATYLNENPQVNLEINLSNQKINMIQDGVDL AIRTGVIDNENVVARLWKKMEFGVFASQAYLAKYSEPQSPNDLYQHHIIS QMYTLPWRFQQGNQEVAVFPHSRLTCNDFAIVEQQLKQHSGIGILPITKN HNRSDLIRILADWQLQSVPVSLIYYRNRGAIATVRSFVEFLQRLV >MS2116 lysR, LysR protein MNTKNTSVYALKLFLQVLELGSLSEVARRENLSASMLSRLIKQLEDDWGA ALFYRNTRAITPTETGLLLAEYARQIVSQFQAAEQAITAQTAEIAGTVRI NAPVFFGQLHIIPHLAELQARYPNLIVNLVQTDDYIDPFTDSTDIIFRLA PLNDSSLKVRILAQQHFCLAASPSYLQKYGTPKIPADLAKHHALLYKGKT GTLRWLLQEGENWQACSPKIALTSNNGNAIATACVQGMGIALLANWAASD LLKEGKVVRLLPEYNFSTQTVPVYVAMLYPQTAFISPSVRAVLDYFREIF QDKSW >MS1415 lysR, LysR protein MHSSIYGYLTVFHTIAAEGSIAGAARKLQMASPSISQSLKLLEQHIGLPL FNRTTRKMELTEAGHHLLASTQDAIAQLSVAVESVQDLSGVPKGVVRMTV PHVGYWLIIEPHLAEFCERYPDIQLEISINDGTVDILKEGFDLGIRFGDK VDEQMVAKKLTAPFRLGLYASSAYQQQFGLPKKIAELKNHRLVGFRFATS NRIFPLSLNDKGEEVSVEMPTPIVANSLIVAKDVIKSGIALGRFFEPLMS KQADRAAFIPVLEKHWKTFGALYLYYMQHSQKAGRVRAVIEFFTEKAQVE KK >MS2072 malT, MalT protein MLIPSKLVCSFRLQNSVPRTRLIQELDKSAFYPVVLINAPAGYGKTTLVS QWIEDKKNVGWYGLDEGDNNSDRFAVYFSAALHSAINEEVDVLLEENRKA NLLALFNQLLIKASGFPQHFYLVIDDYHLIENDEIHEALKYWIRHQPANM TLILISRSVPPLSVASLRVQEQLLEIDINQLMFDHQESVAFFQARLGSEL KQQDIIELCNEVEGWPTALQLISLFAKNKSQTLQVPLQDIAKRLAKSNNF HINEYLADEVLNKVDKSTRLFILRCSVLHSMNETLVEAVTGEPNSRKKLE SLEKQGLFLQQMANSKWQTVDDSWWKFHPLFASFLNFCCQHELYDELSQL HRRAAQAWLKLGYVTEALHHAMQLSDTCLLLEILDEHAWTVFHQGELQLL EESLNSLDYAHLTEHTNLVLLKAWLVQSQHRHVEVSGILAEFSRALNENK VELSKTAQAEFNVLRAQVAINSGDENTALQLASDALKDLSENAYYAHIVA TSIIGEAHHCHGNLAEALSMLQKAERMARQHHTYHNILWSLLQQSEILLA QGFSQAAYDMLDKASEFVKENHLQKVPMYEFLLRLKGKILWEWYNLDKAE SMAVAGMNALQKFEDKLQCLALLTKISLVRGNLDNTSRLLNEVEQLERSH AYHHDWTASADQVRMFYWQMTNDVAAARNWLIQNPAPISDKNHFTQIQWR NIARARILLGQYDKAQEILDNLIETAEKFSLTSDLNRALIVRNRLYFLQG AKELAQQDLIAALKLTRQTNFISAFVVEGDVMAQQIRNLLQLNVLDELVL HKAQFILRNINQFYRHKFAHFDETFVSQLLKNPKVPELLKISPLTQREWQ VLGLIYSGYSNEQISDELQVAATTIKTHIRNLYQKIGVTNRNEAISYTKE LLALMGYN >MS2146 marR, MarR protein MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE >MS2091 marR, MarR protein MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE >MS1966 metJ, MetJ protein MGFFSLKYRQILRLLIGNFMADWDGKYISPYAEHGKKSEQVKKITVSIPI KVLEILTNERTRRQLKNLRHATNSELLCEAFLHAFTGQPLPTDEDLLKER HDEIPEQAKLIMRELGINPDEWEY >MS0216 mfd, Mfd protein MTTHYFNLDIPTQAGDHKIVANVLTGSDGLAICEMAEQFQGLTVVVANDT KSAVRLEKILQESGKLEVRYFPDWETLPYDSFSPHQDIISSRLSALFYLQ NTRKGILILSVSTLMQRICPPQYLQHNVLLIKKGDRLVIEKLRLQLENAG YRAVEQVMEHGEFAVRGALLDLFPMGSPLPFRLDFFDDEIDSIRTFDADT QRTLEEIRQINLLPAHEFPTDDKSIEFFRAQFRETFGEIRRDPEHIYQQV SKGTLVSGIEYWQPLFFENMATLFDYLPANTLFVDMEQYQIQAERFYQDA VQRFESRKIDPMRPLLAPERLWLRIDEVNRALRNYPRISLKAEKVRTSVR QKNLPLKALPELQIQPQQKEPLQNLRHFIEKFKGHIVFSVETEGRRETLL DLLSPIKLRPKQVNSLFEAQSQTYSLQISSLDNGFIIEQENGEPIAIICE TELLGERVQQRGRDKRKSVNPDTLIRNLAELKIGQPVVHLDHGVGRYGGL VTLENAGIKAEYLLLTYANDAKLYVPVANLHLISRYVGGSEETAPLHKLG SDSWAKARRKAAEKIRDVAAELLDVYAQREAQKGFAFHYNREEFMQFSAT FPFEETHDQEAAINAVISDMCQPKAMDRLVCGDVGFGKTEVAMRAAFLAV MNHKQVAVLVPTTLLAQQHYENFRDRFANLPVNVEMVSRFRTAKEQKKIL EDLSAGKVDILIGTHKLIQSDVKFNDLGLLIIDEEHRFGVRQKEKIKQLR ANVDILTLTATPIPRTLNMAMNGIRDLSIISTPPARRLTIKTFVRQADDL LIREAILREILRGGQVYYLHNDVASIENCAEKLTALVPEARIIIGHGQMH ERELERVMTDFYHQRFNVLVCSTIIETGIDIPTANTIIIERADHFGLAQL HQLRGRVGRSHHQAYAYLLTPPPKLMTKDAVKRLEALESLDNLGAGFILA THDLEIRGAGELLGSEQSGQIESIGFSLYMELLEAAVQAMKQGREPSLDE LTQQQVEIDLRIPALLPEDYLGDVNMRLSFYKRIAGAENKPALDELKVEL IDRFGLLPEATKNLMQITELRLMAKQLDIIRIDGSQNGGFIEFSPTADID PMKFINLIKQQPAVFKFDGPTKFRFSCALEQAQKRLDFIFNLLQSLMD >MS1527 nagC, NagC protein MKNGITWKNSLFLRMIMLYGFDIGGTKIELAVFNDKLERQYTERVETPKD SYEQWLDVIVNLVEKADQKFACKGSVGLGLPGFVNHETGIAEITNIRVAD NKPIIKDLSERLGREVRAENDANCFALSEAWDEENQQYPFVLGLILGTGF GGGLIFNGKVHSGQIGMAGELGHLQLNYHALKLLGWDKAPIYDCGCGNRA CLDTYLSGRGFEMLYRDLKGEALSAKEIIERFYAADKTAVDFVGLFIELC AISLGNIITALDPHVIVLGGGLSNFDYLYEALPKALPKHLMRSAKVPVIK KAKYGDSGGVRGAAALFLTK >MS1413 nagC, NagC protein MTRNEEALDIKHTNYRNIYRLFFQYNGLSKPQIVKLLNLSLPTVSNNIGE LEAEGKIREGGFFQPQGGRPAIAYQLVENAFISIGVEIQKKNVRCLALNL QGNILAQKDTALYFENEPQYIESLCNIIHTFIRSLGCLYTQILGIGFSIQ GIVSKDGQSMLYSRVLPGEHFDVKELQPYFDVPVKLFHDVKCAALTELWF SEQIDNAVYISISEHLGGAIIINNQIDLGKKGYSGALEHLQIHSEGNLCY CGQRGCLETYCSLSALLSPNETIEAFFKALRNKDELVLMRWDAFLEHLAK GLNTVYLLLERDIILGGEIAFYLIPEDLKILQEKILKLSTFPLEGDFIRI ATQQKYTSAIGAALPFLIEYLP >MS1445 nusA, NusA protein MSKEILLAAEAVSNEKLLPREKIFEALESAIALSTKKKYEQEIDVRVAIN QKTGEFDTFRRWLVVDEVVNPTKEITLEAAQFEDPDIQLGDYVEDQIDSV AFDRITMQTARQVISTKIREAERNKVVEQFRSEEGKIVTGTVKKVTRDSI ILDLTGNKEDPAKAEAVITREDMLPRENFRPGDRVRGVLYKVNPESKGAQ LFVTRAKPVMLEELFRLEVPEIGEELIEIKGASRDAGLRAKIAVKSNDKR IDPVGACVGMRGSRVQAITNELGGERVDIVLWDDNPAQFVINAMAPADVN SIVVDEDNHSMDIAVEQENLAQAIGRNGQNVRLATQLTGWTLNVMTTEEL QQKHQAEDNKVLNLFMTSLELDEDFAQLLIDEGFSSLEELAYVPVSELTA IDGLEDEDLVEELQNRAKDALTAKAVAEEEALKQAEVEDRLLNLEGMERH IAFRLAEKNIKTLEELAEQGVDDLADIEELSAEKAADLIMAARNICWFGD E >MS0975 nusB, NusB protein MTEQVKKRPSPRRRARECAVQALYSFQISQNPVETVELSFVTDQDMKGVD MPYFRKLFRQTVENIPSVDSTMAPYLDRSANELDPIEKAILRLAVYELKY ELDVPYKVVINEAIEVAKTFGAEDSHKYINGVLDKIAPALARK >MS0205 nusG, NusG protein MTETAVKKRWYVLQAFSGFEGRVATTLREYIKLNHMEDQFGEVLVPTEEV VENVAGKRRKSERKFFPGYVLVEMEMNDDTWHLVRSVPRVMGFIGGTPDR PLPISKREADLILNRVEENADKPRPKNTFQPGEEVRVTEGPFADFNGTVE EVDYEKGRLKVSVSIFGRATPVELEFSQVEKANG >MS1246 ompR, OmpR protein MSSMMMRILLIEDDALIGNGIKVGLTKSGFSVDWFTDGKTGLQAIKSAPY DAVVLDLTLPGMDGMDILQQWRNEKIDTPVLILTARDTLNDRVTGLQRGA DDYLCKPFALAEVIARLQALIRRRYGQANPIVEHSLVKFDPNSRKVSLQG KDIPLTTREYNLLELFMMNKERVLSRSFIEEKLYNWDDEVSSNALEVHIH NLRQKLGKQFIRTVHGVGYALGKNEE >MS1913 ompR, OmpR protein MAKILLVDDDTELTELLSELLSLEGFEVQIACNGEEALAKIDESYDIVLL DIMMPVLNGIETLKRLRQNFTTPVLMLTARGDEIDRVLGLELGADDYLPK PFNDRELVARIKAILRRSVLNKSASSEEETPFEERKAIEFAGLTLYPGRQ QVMYQGQDLELTGTEFALLCVLIKHPGEVLSRELLSLEALGKNLTSFDRS IDMHMSNLRKKLPTRPDDFPWFKTLRGRGYILLTD >MS1504 ompR, OmpR protein MLSPQILIVEDETVTRNTLKSIFEAEGYEVFEATDGNQMHQIIETQEINL VVMDINLPGKNGLMLARELREKTNTALMFLTGRDNEVDKILGLEIGADDY ITKPFNPRELAIRARNLLHRTMAENEKNSNTHVDAYRFNGWTLDINKRAL IDPESVEYKLPRSEFRAMLHFCENPGKIQTREDLLKKMTGRELKPQDRTV DVTIRRIRKHFEDHPDTPEIIATIHGEGYRFCGEIE >MS1063 purR, PurR protein MATIKDVAKMAGVSTTTVSHVINKTRHVADETKQTVLDAIKALNYSPSAV ARSLKVNTTKSIGMVVTTSETPYFAEIIHAVEEQCYRQGYSLFLCNTQND PDKLKNHLEMLAKKRVDGVLVMCSEYKDDSRDLLKSFSYLPIVIMDWGPV NPDTDLILDNSFEGGYLAGKHLVDNGHKKIGYLSAELTKVTAKQRYQGFI KALSEANVEMKSEWLFEGSFEPEDGYECMNRLLALEDRPTAVFCCNDIMA LGAISAITEKGYRVPDDFSVIGYDNVHSSRFFAPPLTTIHQSKARLGERA LRLLFERIAHKDAKRETIEIHPELVIRKSVKKIA >MS0644 purR, PurR protein MITIRDVAKQAGVSVATVSRVLNNASSSEKARKAVQSAVEKLGYSPNANA QALALPTTDTIGVVVTDVTDAFFAILVKAVDQVASSYNKTILIGIGYHNA EKERNAIDTLLRKRCSCLVVHSKALSDEELANYLEQVPGMVIINRSIQGY EHRCVSLDNQRGTFLATETLIRLGHKRIGYIGSNHHINDEEERRQGYIQA LQHHRLPQIDDAIIQSSPDFEGGEEAMIKLLSYHSDLTAVVAYNDSMAAG ALSVLNENNINVPRQFSIIGFDDMPISRYLIPKLTTIRYPIDLMANYAAR LALSLVNEGIETPLHAQFNPTVVRRFSTENCNNP >MS0284 purR, PurR protein MATMKDIARLANVSTSTVSHVINNDRFVSEKIREKVMAVVKELNYQPSGL ARSFKTKETKTIGMLVTASDNPFFAEVVHAVERYCCQQNYNLILSNTEGS PQHLQHNLQMLINKQVDGLLLMCSETHTQDNMPINLPIPAVIMDWWPSEL TADKIFENSELGAYLATKHLIHHQHKRIAIVNGDLRKPIAQNRLIGYKKA LTEANLPIDETLIFEGKFDFQTGFDALERLLKTDCPPSAIFACCDAIALG IYQAAWRHNLIIPRHLSVIGYDDTILSQYIAPPLSTIHQPKTELGKLAVQ TLLERIKNPQKTYRTFVLDPVLVERESVATRKES >MS1238 purR, PurR protein MKYTINEIAKLCNVGKSTVSRVLNKDPKVRSETREKVQRVIDRLGFQPNR SARAMRAGQEPVVGVIVSKLDSGSESQTLRAILQALQAEHITPLIVESRF EAEQVRHHFQLFRERQVNAVILFGFFPLPLEIVREWQGSLVVIARTYPNI SSVYYDDEQAITRLMTELYRQGHRRIAYLGIQDSDETTGKLRTQSYLQFC RSHNIRPNSVSVELSAESAYLHCAELFTRPVDALVCATGRLALGAFKFSQ QSGRVFPIAYVGYNELLQYMMPNALSLDFGYCQAGLKAVELLMRQLRGKS STEHYLVSTHQP >MS1317 purR, PurR protein MKLEELAKLAGVSRTTASYVVNGKAKQYRVSDKTIEKVQALIKEYDFKPN AMAAGLRAGKSNTIGLIIPDFENLSYAKIANQLEKSCRENGYQLLITCSN DNVANELECAKHLFQRQVDALFVSTVLPADNHYYQQNNAIPIIGFDRHID SEGVDNVLTDDKHDAYELAVSLFDKADYQRILFLGALPELPMSKAREEGF KQALGKKQVQVDYLYASQFRKENAEQLVSEWIEKNGIVPDAIFSTSLTLL QGLLMSFIKRNEAFPKDLVIATFGWHEMLELLENKIVCSVQDHSKVVQAL LDLALHKMRIKKLKQPHPVIQRRLAYHNWQ >MS0808 purR, PurR protein MLMVSLKDVAKEAGVSLMTVSRALKSPDKLSPKTYKVVKEVIDRLGYVPN LAAQHIRGVAANTIGVLSLGTATTPFSVEILLGIEQTVRQHGWNSFVINT FENDSQAMEDAVEQMLSHRPSAIIIARNGLKNVSIPEKLRSFPLVLANCQ TQDMAVAAYIPDDYQGQRVVVDRIVAKGYQRPLFLHIPKNYIATAKRRQA FEDAWANHSGQKPVQFFMRRDGEDYFEGAQPLIDYLEKPDPLPFDVIICG NDRIALVAYQLLLAKGYRIPEDVAVCAYDNMVGIAQLFIPPLTTVELPHY QMGQEAALHLIEGRKDRDIHQLPCPLIEGESC >MS2375 purR, PurR protein MKSGLKHHRIALLFNANKVYDREVIEGVGQYIQASQCLWNIFIEDDFVYR KESLHNLDIDGIIADFDDPETVAMLEHTEIPVIAVGGSYQNPAFYPHYPY VATDNYALVETAFLHLKQKGINQFAFYGLPNETPKHWSEERKNAFMQLMA DYGHQTYIYLGEQAHSDNWLEVQSKLCDWISRLPPHTGIIAVTDARARHL LQACEYLNIAVPDELCIIGIDNEELIQYLSRVSLSSVVQGTNQIGYQAAK LLDQLLKGRPVSQTPILVPPLRVEQRRSTDYRSLHDPLVIQAMHYIRHYA TQGIKTEQVLDHLRISRSNLEQHFKAEMNKTIHQVIHEEKLDRAKNMLKF TDVPIQEISDICGYPSLQYFYAVFKKEYGQTPKEFRER >MS0148 purR, PurR protein MSLANNSNKNRRSTGKVTLADVAKEVGVGTMTVSRALRTPKMVSENLRQK IHEAVQKLGYVPNSAARELASVSSRNIVIVTSSLVSVENNLILNSLQKEL QPLDLQIIILVANKKGWLRELINNSPLAVILLNLQCPSTEAQWIRNSGLI CLEIGSKQANPLGINVCVDSKSAVQKVISFLVAKGYRDIGLLCAQQEQAI FQQYLACWHSALHANHLNSHQILHCSEPVSFSAGAKLFNEAISTWGCIDA FVFLSDELACGALFEAQRQHIGIPYDVAIIGLGDLEISQTTYPALTTLNI PYAKLGETAGKKLAELLQTEKDPQTECIQLISTLRERESG >MS1531 purR, PurR protein MSVQKIAKLAGVSVATVSRVLNDSPSVKAVNKEKVLAAIKALNYQPNLLA RQLRTSRTGMILAMVSNIANPFCAAVVKGIEREAEKNGYRILLCNTESDL ERSRSCLQLLSGKMVDGVITMDAISELPELQNIIGDAPWVQCAEYDPDSS VSSVSIDDISATEFVIDQLVKTGKKRIALINHDLSYQYAQHRELGYLDGL KRHGLAYCEIIYADELDYLSGKEAVLSLLKNAQRPDAILAISDVLAAGVI NGLNELNVAIPEDIAVVGFDGIDISQITTPSLSTIQQPCKEIGEMAFSLL LQQIDSTSSVKRVHHLLPWTFIKRQSS >MS1242 purR, PurR protein MTKHKRPTLQDIANHLGITKMTISRYLRNPASVAEETGKRIAKAIEEFGY IPNRAPDILSNAKSRAIGVLVPSLTNQVFADVIKGIEEITDEAGYQTMLA HYGYSEKKEEQRIESLLSYNVDGIILSENSHSERTKKMLQVANIPVIEIM DTSEIGIQQVIGFDNIAAAQAMVETMIKRGYKKIVYFSARLDKRTQLKMQ GYQQAMKKYQLSPRIIATKEHSSFTHGAELLHQALKQYPDIDGIFCTNDD LAIGALFECQRLGIKVPKQIAIAGFHGHDVGQSITPQLATVITPRLQIGR IAAQELLARLQNIPAQSSIINLGYQIHLGESI >MS1735 recG, RecG protein MTTQLLDAIPLTSLSGVGAAVSAKLSKIGINNLQDLLFHLPIRYEDRTRI TPISDLRPEQYATIEGIVQTCEIQFGRRPILTVSLSDGTSKIMLRFFNFN AGMRNGFQPGARVKAFGEVKRGRFMAEIHHPEYQIIRDKQPLQLEENLTP IYSATEGLKQNSLRKLTDQALELLDKIQIAEILPDQFNPYPFSLKEAIRF LHRPPPDVSVESLEKGTHPAQVRLIFEELLAHNLAMQKVRLGTQQFQALP LHFQTDLKQRFLATLPFEPTNAQVRVTQDIERDLAKDYPMMRLVQGDVGS GKTLVAALAALTAIDNGKQVALMAPTEILAEQHAENFRRWFEPFGIEVGW LAGKVKGKARQSELERIKNAEVQMVVGTHALFQEEVAFSDLALVIIDEQH RFGVHQRLLLREKGEKAGNYPHQLIMTATPIPRTLAMTVYADLDTSIIDE LPPGRTPIKTIVVSEERRAEIVARVHNACTNENRQVYWVCTLIDESEVLE AQAAEATAEDLHRALPHLRIGLVHGRMKPAEKQAIMASFKAAELDLLVAT TVIEVGVDVPNASLMIIENAERLGLSQLHQLRGRVGRGSTASFCVLMYKP PLGKISQKRLQVLRESQDGFVISEKDLEIRGPGEVLGTKQTGIAEFKVAN LMRDRKMIPTVQHYARRLIVEYPDVADTLIKRWLNNREIYSNA >MS1837 rho, Rho protein MVTLAHSKLLPTIKQTSQNFIKSNQKDSQQIIMHLTELKNTPVSELVALG EGQMGLENLARLRKQDIVFAILKQHAKSGEDIFGGGILEILPDGFGFLRS ADSSYLAGPDDIYVSPSQIRRFNLQTGDKIEGKIRPPKEGERYFALLKVD QVNDDKPEVSRSKILFENLTPLHANSRLRMERGNGSTEDLTARILDLASP IGKGQRGLIVAPPKAGKTMLLQNIAQSITHNYPECELIVLLIDERPEEVT EMQRSVKGEVIASTFDEPASRHVQVAEMVIEKAKRSVEHKKDVVILLDSI TRLARAYNTVTPASGKILSGGVDANALHRPKRFFGAARNVEEGGSLTIIA TALVDTGSKMDEVIFEEFKGTGNMELHLSRKIAEKRVFPAIDFNRSGTRK EDLLTTPDELQKMWILRKILNPMGEVEAMEFLIDKLMVAKTNEEFFEIMK RS >MS0368 rnc, Rnc protein MNHLDRLQRQISYEFKDITLLKQALTHRSAATKHNERLEFLGDAILNYTI ADALYHQFPKCNEGELSRMRATLVREPTLAILARQFKLGEYMALGHGELK SGGFRRESILADCVEAIIGAISLDSSLVSATQITLHWYEKLLREIKPGEN QKDPKTRLQEYLQGHRLALPTYDVKDIKGEAHCQTFTIECHVPNLDRTFI GVGSSRRKAEQAAAEQILTALEIK >MS0198 rpiR, RpiR protein MAQIDPKSIGAHIRTRKQQLTPLERKVLDCILAKSDFDEKTSLKEIATEN QVSEAIVVKIAKKLDFSGYREFRSGLAYYKQLEVANLHNDISADDTATQV IKKVFETSIQALQETMSILDISEFERCVKILVEADHIDLFGIGGSAQIAK DMAHKFLRIGIKASVYDDSHMMLMAGAVSHPGNVVLAISHSGTTIDVIEP LQLARQNGAKTIAITNYAISPIAECADVVLTSTSQGSLLLGENAAARIAQ LNILDALYVAVAKQNLDISEDNLRKTRYAVKHKRTK >MS2023 rpoA, RpoA protein MQGSVTEFLKPHLVDIEQVSPTHAKVILEPLERGFGHTLGNALRRILLSS MPGCAVTEVEIDGVLHEYSSKEGVQEDILEVLLNLKGLAVKVQNKDDVFL TLNKSGIGPVVAADITHDGDVEIVNPEHVICHLTDENASINMRIRVQRGR GYVPASARVHAQDEERPIGRLLVDACYSPVDRIAYNVEAARVEQRTDLDK LVIELETNGAIDPEEAIRRAATILAEQLDAFVDLRDVRQPEVKEEKPEFD PILLRPVDDLELTVRSANCLKAETIHYIGDLVQRTEVELLKTPNLGKKSL TEIKDVLASRGLSLGMRLENWPPASIAED >MS0212 rpoB, RpoB protein MGYSYTEKKRIRKDFGKRPQVLNVPYLLTIQLDSFEKFIQRDPEGQQGLE AAFRSVFPIVSNNGSTELQYVSYKLGEPVFDVRECQIRGTTFAAPLRVNL RLVSYDRDAAPGTIKDIKEQDVYMGEIPLMTDNGTFVINGTERVIVSQLH RSPGVFFDSDKGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFARIDR RRKLPATIILRALGYSTEEILDLFFEKIQFEIQDNKLLMALVPERLRGET ASFDIEANGKVYVERGRRITARHIRTLEKDNVTKIDVPTEYIVGKVSAKD YIDLESGELVCPANMEISLDILAKLAQAGYKSIETLFTNDLDFGPYISET LRVDPSSDRLSALVEIYRMMRPGEPPTKEAAEALFDNLFFSAERYDLSAV GRMKFNRSLGLAEGVGNGVLSKEDIVGVMKKLIDIRNGRGEVDDIDHLGN RRIRSVGEMAENQFRIGLVRVERAVKERLSLGDLDAVTPQDLINAKPVSA AVKEFFGSSQLSQFMDQNNPLSEVTHKRRISALGPGGLTRERAGFEVRDV HPTHYGRVCPIETPEGPNIGLINSLSVYARTNNYGFLETPYRKVVDGQVT EEIEYLSAIEEGNYVIAQANASLDEDFRFTDAFVTCRGEHGESGLYRPEE IQYMDVSPQQVVSVAAALIPFLEHDDANRALMGANMQRQAVPTLRADKPL VGTGMEKPIALDSGVAVVAKRGGIIQYVDASRIVVKVNEDETIPGEAGID IYNLIKYTRSNQNTCINQIPCVNLGEPIGRGEVLADGPSTDLGELALGQN IRVAFMPWNGYNFEDSMLVSERVVQQDRFTTIHIQELSCVARDTKLGAEE ITADIPNVGETALSKLDESGIVYVGAEVKGGDILVGKVTPKGETQLTPEE KLLRAIFGEKASDVKDSSLRVPNSVSGTVIDVQVFTRDGVEKDKRALEIE EMQLKEAKKDIAEELEILEAGLFSRVRNLLIDGGVDAKELDRLDRTKWLE QTLNDEAKQNQLEQLAEQYEELRKDFEHKLEVKRGKIIQGDDLAPGVLKV VKVYLAVKRRIQPGDKMAGRHGNKGVISKINPVEDMPYDENGQPVEIVLN PLGVPSRMNIGQILETHLGLAAKGIGEQINRMLKEKQEIEKLRGYIQKAY DLGGGSQKVDLNTFTDEEVMRLAQNLRKGMPLATPVFDGAEEKEIKDLLE LGGLPTSGQITLYDGRTGEKFERPVTVGYMYMLKLNHLVDDKMHARSTGS YSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVNG RTKMYKNIVSGTHQMDPGTPESFNVIMKEIRSLGINIDLDEE >MS0213 rpoC, RpoC protein MKNFHRTLNKFNSDRSKSVKDLVKFLKAQSKTSEDFDVIKIGLASPDMIR SWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKHR GVICEKCGVEVTQTKVRRERMGHIELASPVAHIWFLKSLPSRIGLLLDMP LRDIERVLYFESYIVIEPGMTDLEKGQLLTEEQFMDAEDRWADEFDAKMG AEAIQALLRDMDLEHECETLREELQETNSETKRKKITKRLKLLEAFMQSG NKPEWMVMTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKR LLDLVAPDIIVRNEKRMLQESVDALLDNGRRGRAITGSNKRPLKSLADMI KGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFRPFI YAKLESRGFASTIKAAKKMVEREDAIVWDILAEVIREHPILLNRAPTLHR LGIQAFEPILIEGKAMQLHPLVCAAFNADFDGDQMAVHVPLTLEAQLEAR ALMMSTNNVLSPANGDPIIVPSQDVVLGIYYMTREKVNAKGEGMLLQDPR EAEKAYRTGRAELHSRVKIRITEYVKNAEGEFEPQTTLTDTTIGRAILWM IAPKGMPYSLFNQTLGKKAISKLINECYRRLGVKASVMFADQIMYTGFAY AARSGSSVGIDDMVIPEKKYEIISAAEAEVAEIQEQFQSGLVTAGERYNK VIDIWATANERVAKAMMENLSTEEVVNREGNLEKQSSFNSIFMMADSGAR GSAAQIRQLAGMRGLMARPDGSIIETPITANFREGLNVLQYFISTHGARK GLADTALKTANSGYLTRRLVDVAQDLVIVEDDCGTHEGIVMTPLIEGGDE KVSLRELVLGRVAAEDILKPGTEEVLFPRNTLLDEKVCDILDENSVDSVK VRSVVTCDTDFGVCAKCYGRDLARGHLINQGEAVGVIAAQSIGEPGTQLT MRTFHIGGAASAAAKESSVQVKNSGSIRLTNVKSVTNNEGKLVVTSRNTE LTIIDAFGRTKEHYKVPYGAVLNKGDGEAVTAGETVANWDPHTMPVVSEV AGFVKFVDIVDGLTVTRQTDELTGLSSIVVQDVGERATAGKDLRPAIKVV DAQGNDIFIPGVDVLAQYFLPGKAIVTLDDGAEVQVGEPLARIPQESVGT KDITGGLPRVADLFEARKPKEPAILAEITGIVSFGKETKGKRRLVITPVE GEAYEEMIPKWRQLNVFEGEMVERGDVISDGAETPHDILRLRGVHAVTEY IVNEVQEVYRLQGVKINDKHIEVIVRQMLRKGIITKAYDSEFLEGEQVEV ARVKIVNRKREAEGKPPVEFERELLGITKASLATESFISAASFQETTRVL TEAAVAGKRDELRGLKENVIVGRLIPAGTGFAYHQNRIKNRGQANVVEEQ EVKFSAADEAEIEAEFNMIAEDPAASLAEMLNMADDAE >MS1760 rpoD, RpoD protein MNQRRTSYMDHNPQSQLKLLIAQGKEQGYLTYAEVNDHLPEELVDTDQIE DIIQMINDMGIQVLESAPDADDLMLSETIADEDAVEEATQVLSSVEAELG RTTDPVRMYMREMGSVELLTREGEIDIAKRIEDGINEVQSAVAEYPEALD YLLKQYEQVEEGSVRLADLITGFVDLNAEEASEEISDLEEVLDDEDGDIP ADALNDEEEDEESDEGDTSTDDSDNSIDPEVAREKFSALKDQCVKTLEFI EKYGRTDNKVKEQIQVLSDIFTQFRLVPRQFDTLVLSMRSMMKQVRAEER QIQRLAVDYAKVPKDDFQKAFIGNETSEQWLESLLQSKKTYVEKLQQRAP EISKSIVRLQQVETDTKLTVQQIRDIGERIAQGELKARRAKKEMVEANLR LVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWI RQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREASPEELAE RMGMPEDKIRKVLKIAKEPISMETPIGDDDDSHLGDFIEDSTLELPLDSA TAQSLKVATHEVLEGLTPREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRE RIRQIEAKALRKLRHPSRSETLRSFLDE >MS0025 rpoD, RpoD protein MTKETQTMMLVPQGSIEAYIRAANEYPMLSAEEEKELAERLYYQEDLEAA KKLILSHLRFVIHVARGYSGYGLPQADLIQEGNIGLMKAVKRFNPEVGVR LVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFS DNELDLVANELGVTKEDVIEMESRMTGADVGFDLPTDDSEEETFAPSMYL EDKSSNFAAELESENFETQAIDQLSNAMENLDERSKDIIQARWLDDTKAT LHELAAKYNISAERVRQLETNALKKLKSAVSF >MS2228 rpoE, RpoE protein MLLTRGYMAEQLTDQALVERVQQGDKKAFNLLVSRYQNKVAGLLTRYVSR NDIPDVVQESFIKAYRSIESFRGESAFYTWLYRIAVNTAKNYLTAQGRRP PNEDILAEEAETYDVGGNLRDVDTPEHEMLSAELKKVIFDTIDGLQEELK TAITLREMEGLSYEEIADIMDCPVGTVRSRIFRAREIIESKIRPLIQR >MS1737 rpoZ, RpoZ protein MARVTVQDAVEKIGNRFDLILTAARRARQLQLHVREPLVPEDNDKPTVIA LREIEKGLIDNNIMNAQERQEALEQEKVELNAVSLLSE >MS1964 sPS1, SPS1 protein MRKDMLQVQHENHFFLFNFDENRPNQEHFFESYFWQKQNRIIGSAKGRGT TWFIQSQDLFGVNTALRHYYRGGLWGKINKDRYAFSSLEETRSFAEFNLL NRLYQAGLPVPKPIGAHVEKLAFNHYRADLLSERIENTQDLTALLPNTEL TAEQWQQIGKLIRRLHDLQICHTDLNAHNILIRQQNNDTKFWLIDFDKCG EKPGNLWKQENLQRLHRSFLKEVKRMRIQFSEKNWADLLNGYQN >MS0179 soxR, SoxR protein MEQTLKQGIFMHIKEFSTKIGLSIDTLRYYEKEGLLNPARNKSGYRNYGK QDLEWIAFILKLKAMGVPLTQIKEYARLRYLGDTTIPERYAILQAHNQKL VEQEKEIKKYQQFLAHKLSIYEKVMKKQN >MS0886 soxR, SoxR protein MNINEIVKKTNLTAKSIRFYEEKGLITAPQRALNGYRQYNQKHVEELNLL HQARLVGFSLPECKELLELYKDPHRRSADVKAKTLARIAEIDNQIGKLQQ MRQQLQTLANQCPGDGSEHCPIIEGLSKPNCCDHHAEKK >MS1385 soxR, SoxR protein MNSQKKFYTISQLAEKLAITTHTLRFYEKEGLLPSVQRDQNGNRLFIQAD VEWLELLICLKNTGMPLKEIKRFVEWLNYGDSTIEQRLQLFQAQVTKVEQ QIAELQRHLEILKYKRQFYQCAKELGSVQAVLDTQLQQQFAEQNILLPVS PLSMAENE >MS1433 soxR, SoxR protein MMMKINELSKKSGINLETIRYYEKTGLLPEPKRAANGYRVYDQQSLSQLN FIKSCRWLGFSIDEIKQLNELKNTPKHHCVADEMILSHLKQVEEKIARLL EIQTFLQNLVNHEEHSVEECRAISGLSQER >MS0468 soxR, SoxR protein MRIGQLAKAVGCTIETIRYYENQGLLAKPQRSANNFRYYTNDHLQQLSFI CYCRSLDMSLHEIKMLLNLDRSSGQRAEEINLLLDKHIRDVAKRLHELAH LRMELIKLKQKCSEMTGENLMQNIFSGGNIRFRKIK >MS1736 spoT, SpoT protein MYLFEPLNKIIQGYLPSEHIDLIKRAFVIARDAHEGQFRSSGEPYITHPV AVASIIAEMRLDHEAIMAALLHDVIEDTPYTEEQLTTEFGKSVAEIVEGV SKLDKLKFRTRQEAQAESFRKMILAMTKDIRVVLIKLADRTHNMRTLGSL RSDKRRRIAKETLEIYSPLAHRLGIEKVKNELEDLCFQAMHPQRYAVLNK VIQVARNTRQELVHPILVTIQQRLEEVGINAQVFSEEKPLFYIYQNMRLR NQQFRSIMDISNFRIIVDSIDNCYRVLGQMHQLFKPRPGQIKDYIAVPKA NGYQALHTSTIGPHGVAVEIQIRTEEMNLIAELGVTAHWVYKPGGKNDTT TAQIKAQHWLQSIIELQQSAGNSFEFIESVKSDLFSDEIYVFTPKGRIIE LPAGATPIDFAYAVHTSIGSTCVGAKVDRETYPLSQALRSGQTVEVITSP NATPNANWLNFVVTGRARAKIRQTLKTLRLEEAINLGRYQLLHALAGKHL EDLDPAIVHHVLTELNLDTMDDLLAEVGLGNQLSTVIARRLQGESLAIYT DIEEVNNQERLPIKGMDGLLVNFAKCCHPIPGDSIVAYANPGKGLVVHHE NCRNLKKRTTQSVPFIKVEWEQCDHSAEFEAELHINMVAQQGALANLTAA ISAAQSNIHSIWTEESEGRICHVTLTLSAKDTKHLANIMRKIKSLSGVQS VERNINE >MS0241 spoT, SpoT protein MVAVRVSHLLNPKDFIIEDWCAGLGLTPDVEKNIVRAWYYAQEKAQQLFQ NSHWYLRDGVEMVEILHGLNMDADSLLTAMLFPIVNAKIVNQEQIKEDFG PHIWKLLKGVIEMNNIRQLNTTDSNAQVDNIRRMLLAMVDDFRCVIIKLA ERITYLRDAEKRYSKQDKVAAAKECSNIYAPLANRLGIGQLKWELEDYCF RNLQPEQYRIIAIKLNERRLDREQYIADFVQRVSQYLDESVTGAEIYGRP KHIYSIWRKMQKKHLDFSQLYDIRAVRIIVPALQDCYTALGIVHTHFKHL PDQFDDYIANPKPNGYQSIHTVVLGEGDKPIEVQIRTKKMHDDAELGVAA HWKYKEGNTGSLSAYEEKIIWLRKLLAWQHDISNSGEVVPELRTQVFDDR VYVFTPKGEVVDLPAGSTPLDFAYAIHSDVGHRCIGAKVGGRIVPFTYQL QMGDQIDIITQKNPNPSRDWLNPSLGFTHTAKARSKIQAWFKKLDREKNI PIGKEQLENELNRLAITLKQVEPIALPRYNLKSIDDLYSGIGSGDIRLNH LINFLQAKLIKPTAQEADEEVLRQVTKTANSAANQQKNEKNKGYVIVEGV GNLMHHIARCCQPIPGDDIEGYITLGRGISIHRTDCEQLAELKAAHPERV VESIWGENYNSASGFNLSIRVIANDRNGLLRDITTVLANDKISVANVTTR LDSKRQLATMDLEIQLKNVQILGKVITRLTKLDDVIEVKRL >MS1836 srmB, SrmB protein MSLDHLSQQRFADLPLNAKVLEALESNGFEYCTPIQALSLPISLAGKDVA GQAQTGTGKTMAFLTATFHHLLEHPVKTNHPRALIMAPTRELAVQIAHDA ERMVKTTGLKTALAYGGDGYDKQLKAIEAGADIIIGTTGRIIDYVKQNII ALSHIQVVVLDEADRMFDLGFIKDIRYLMRKCPSPKQRLTLLFSATLSYK VRELAFEDMNDPEYVEVEPLQKTGHRIKEELFYPSNEDKMPLLITLLEEE WPERCIIFANTKHQCEKIWGYLAADGHRVGLLTGDVAQKKRLSLLKQFTD GALDILVATDVAARGLHIPDVTHVFNYDLPDDREDYVHRIGRTGRAGESG VSISFACEEYAMNLPAIEEYIGHHIAVSQYDSDSLIRDLAKPYRLKPSLP ASNRHNRNGAKPFKKRF >MS0495 srmB, SrmB protein MTETKITFGDLGLPEFILSAVSDMGFETPSPIQQACIPHLLNGRDVLGMA QTGSGKTAAFSLPLLAQIDIEEKHPQMLVMAPTRELAIQVAEACELFTKN AKGVHIATLYGGQRYDIQLRALRQGAQVVVGTPGRILDHIRRGTLNLSEL KFIVLDEADEMLRMGFIDDVETVMAELPAQHQTALFSATMPEPIRRITKR FMTDPQEVKIQSTQRTNPDIAQSCWYVRGYRKNEALLRFLEVEDFDGAII FTRTKTGTLDVTELLEKHGFRAAALNGDMTQQLREQTLDRLRNGSLDILV ATDVAARGLDVERISLVVNYDIPLDAESYVHRIGRTGRAGRSGSAILFVE PRERRLLSNIERLMKKPIEEVDVPNHEALQARRREKFKAKITKQLEHHDL EQYRLLLEGLFTPDQDQEDIAAAMLMLLQGKQKLILPPEPPMEKRGRRER DDRRGERGDRRERRPEERRGYGNPQPMDLYRIEVGRADGVDVRHIVGAIA NEGDINSRNIGHIKLYDEYSTVELPQGMPKELLQVFGKARVLNKQMRMTF VSEAGETVGRERHEGRRNDRRDNGFRREERRFNDRGNRSFNERAPRREFR ERNDRRDRRDRRS >MS1950 srmB, SrmB protein MRYNFPQFYNLSHLRIFMPQPQFEDFDLSPELLKALAQKGYARPTAIQSE AIPAAMDERDVLGSAPTGTGKTAAFLLPAIQHLLDYPRRKPGAPRVLVLT PTRELAMQVAQQAEELAQFTKLSIATITGGVAYQNHGEIFNKNQDIVVAT PGRLLQYIKEENFDCRAVEILIFDEADRMLQMGFGQDAEKISAETRWRKQ TFLFSATLEGELLVDFAERILTDPVKIDAEPSRRERKKINQWYYHADSYE HKVKLLARFIADEQVSKGIVFVRRREDVRELSEILRKRGIRSTYLEGEMA QTQRNNAIDKLKNGIVTLLVATDVAARGIDIEDISHVMNFDLPYNADTYL HRIGRTARAGKKGTAVSFVEGHDYKYLGKIKRYTEELLKPRIIEGLEPRT KAPKDGEIKTVSKKQKAYIRQKREEKRKTTQKKAKLRRQDTKNIGKRRTP KAVSEAQAKEIR >MS0694 srmR, SrmR protein MSTYLLDAKLAQKIVQRTMDIIDCNINIMDAKGKIIASGDVNRIGEIHDG ALLVLSQGRVVDINEAVIHSLHGVRPGINLPLRVDGEIVGVIGLTGEPTT LKEFGKLVCMTAEMMLEQARLFNILAQDTRLKEELVLNLINTDKITPSIV EWANRLGVDLSIPRVACIIEVDSGQLGIENARSELQNLQTLLKIPERDNL VAVLSLTELVVLKPALNSFGRWEVDDHLERINQLLSRMNEKAKLNVRISL GNYFTTEDSISLSYHTAKTTLTIGKARYPKQRIYNYQDLILPVLLDQLRD GWQKEELERPIKKLKLMDNNGVLLKTLLAWFENNMQTIATAKALYVHRNT LEYRLNKIADLTGLDLNSTDNRFLLYMALHVAV >MS2301 tfoX, TfoX protein MNRTNKDTQWIRTILNSFLENEVTAKHLFVGYGLFYRKVMFGIVIDDNFF LKAENQLVEYVEKLGAVSWDIFNKNTNLAISSYYRLPRALVDNEEEFKTL VILSIKQQQRKILDLNIAKKERIKELPNLSIKHERLLAKIGINNVKEFKS AGISNCFVKLKVHGFSVNVELFWLFQAALKNKHVSLLTKSEKKSALLVLN RKLVEAGFREIKHECLI >MS1566 trpR, TrpR protein MYISRNMEQWTKFIETLRIAFNDGKEQDLLTLLLTPDERDAIGLRLQIVA QLLDKKIPQREIQQNLNTSAATITRGSNMLKLMSPDFMEWVKKHTNETEN T >MS0762 tyrR, TyrR protein MFTVKGYDEGNYFIRSIVGKTMSKNTAKRSAHFTVNQYENFTDVVALSPK MAALVEKAKKFALLDAPLLIQGETGTGKDLIAKACHNLSARKDQKFIAVN CAGLPDTDAESEMFGRADGDKTSTGFFEYANGGTVLLDGVAELSLNLQAK LLRFLNDGTFRRVGEEQEHYANVRVICTSQISLQHYVDEGKVRSDLFHRL NVLSLQIPPLRERKEDLAVLTENFVRQISRRLGVRTPEFDGQFLQYLKDY QWPGNVRELYNALYRACSLAEHNKLTIDGLNLSENETVPLTLEQFGNESL EEIMNNFEASVLRKFYEQYPSTRKLASRLGVSHTAIANKLKQYGIGK >MS1468 vacB, VacB protein MFQNNPLLSQLKQQLHDSKPHVEGVVKGTDKAYGFLETEKETFFIAPPAM KKVMHGDKIKAAIETIGDKKQAEPEELIEPMLTRFIAKVRFNKDKKLQVL VDHPNINQPIGAAQAKTVKQELKEGDWVVATLKTHPLRDDRFFYAQIAEF ICSAEDEFAPWWVTLARHEQSRYPVQGQEVYSMLDTETRRDLTALHFVTI DSENTQDMDDALYIEPVTAPNDEQTGWKLAVAIADPTAYIALDSQIEKDA RKRCFTNYLPGFNIPMLPRELSDELCSLMENETRAALVCRLETDMQGEIV GEPEFILAQVQSKAKLAYNNVSDYLEQVENAWQPENESTQQQINWLHQFA LVRINWRKKHGLLFKEKPDYSFVLADNGHVREIKAEYRRIANQIVEESMI IANICCAHYLAKNAQTGIFNTHVGFDKKFLPNAHNFLMANLSNEENQQEL AERYSVENLATLAGYCRMRHDIEPIEGDYLEFRLRRFLTFAEFKSELAPH FGLGLTGYATWTSPIRKYSDMVNHRLIKACLANRECVKPSDETLARLQEA RKQNRMVERDIADWLYCRYLADKVESNPEFRAEVQDCMRGGLRVQLLENG ASVFVPASSIHPNKDEIQVNTDELALYINGERRYKIGDIVNIRLTEVKEE TRSLIGNLV >MS0473 vacB, VacB protein MARKTTKKTTALLDPNYQQELEKYGNPVPSRDFILQVIREHNTPMSREEI LKVFAIQDDERVEGVRRRLRAMENDGQLVFTKRNCYVLPEKLDLLRGTVI GHRDGYGFLQVEGVKEDLFIPNTQMKRVMHGDYVLAQREGLDRKGRREVR IVRVLEGRKKQIVGRFFLEEGIGYVVPDDSRINRDILIPNENRLGARMGQ VVVVELKPRTASFSQPVGIITEILGDNMAKGMEVEIALRNHDIPHTFPPE VEKQIKKFTEEVPEEAKSGRVDLRSLPLVTIDGEDARDFDDAVHCRREQD GWHLWVAIADVSYYVRLRSALDTEARNRGNSVYFPNRVVPMLPEILSNGL CSLNPQVDRLCMVCEIKLSDKGVMKDYQFYEAVMNSHARLTYTKVARILE GDEELIERYQELVPHLQELHNMYNKLLEARHQRGAIDFETIESKFIFNEM GRIESIEQVVRNDAHKIIEECMIMANIAAANFMERHQEPALYRIHAGPSE EKLISFRSFLAECGLSLEGGMKPSTKDYAKLLEQVKERPDAELIQTMLLR SLSQAVYNADNIGHFGLALEEYAHFTSPIRRYPDLTLHRGIKYLLAKAQG VKRKTTDTGGYHYSLDEMDVLGDHCSMTERRADDATRDVADWLKCEYMQD HVGDEFEGIISSVTGFGFFVRLKDLFIDGLVHISTLDNDYYRFDAAGQRL IGENSGAVYRIGDIVKVRVEAVSLEQRQIDFALVSSERKPRREGKTAKDN AKKTMRYAESFAKQRKKAAATSKGKKKSAVKKSKNSVNKKANKKRTY >MS0323 wecD, WecD protein MKIFKAEQWNLEVLLPLFEEYRLSHGMVENPERTFTFLNNRIRFSESIIF IATNERQQAIGFIQLYPRLSSLQLQRYWQLTDIFVQDVANQNEIYAGLIE KAKEFVCFTHSTRLVVEQDQQHQGIWEKEGFKLNTKKALFELKL >MS2102 wecD, WecD protein MMQTIDQFIAQYIPAAYALNLRVVESSPQRVVIKAPFECNSNHHHTIFGG SQALLATLSAWSLVYLNFPEANGNIVIRSSQIRYLKPAPSDIIAVSICPD SLAMNLAKQMLTQKGKAKITIQCQLYCDDIIVSEWTGEFVLSHTPF