TitleGenColors Logo

Gene list

Applied filters:

COG category: Transcription
Gene type: CDS
Genomic element: chromosome

Number of genes found: 133

Free access
Sort by:

 



# Mannheimia succiniciproducens MBEL55E, MBEL55E

>MS1402 unknown
MRYNSSFQITLSRQMNKPNITIQPIQASHYADYVALIGKQLGEGYFKQAD
FEALANNPQAICFEAVDEQNQVVGVITSVTLDRESALALLKIQAQNTPDY
VLQSDRIGIFKTIAIDENRKGCGIGSALVRKLLESFKQAGLNSIACVAWQ
YGETENIRGIMQAFDFTCYEKIANYWLDDPEPFICPACGEPPCRCQANIY
FRQI
>MS0260 unknown
MLGVLNMTRQNIFIILAFSNEINKMELQDKLLIAMPNLQDSYFSQSVIYI
CEHNEQGAMGLVLNQVTDLSIAELVAKLNFMMADGRHYPETYVFAGGPVS
MDRGFILHTATERTFEHSYRVTDNLQLTTSEDVIETFGTPEAPEKYLVAL
GCATWTSGQLEKEIADNDWLVVPANNHILFDVPWAECWTAAQQLLGFQPA
NLVAEAGYC
>MS0143 unknown
MIFKEIKMNNFFERGNVLAAACPSRQILQHLTSRWGGLVLIALRSGTKRF
SELRKTIDGVSERMLTQTLQQLEEDGMLVRKSYNTVPPHVDYTLTEFGAQ
ASEKMFELVDWLESNLNDILTHKVSKQ
>MS0313 unknown
MKKISLFLTALLAASSALAANNQAAPQQENAKTEFMFGAKAANDPVGIWQ
KDGRHFSKKDLSKQFCWTLTNFRSDSGNVNITITLTSPKNTNFNLGEHIS
KNTTTHIFNFTYPITQTYYNCWAFEESDPEGKYTLTVKANNTTFPTQVFT
LTK
>MS1430 unknown
MNPTVVVSNIKTLLLAVLSGRIFAPVMQHDCQPYLDSGELEIVFPNLESQ
MWGIYLYRPYQTITPKRVLVVFEILERLLKMHSEQ
>MS0118 unknown
MGGRRRNKMSEKINSTQRALRILKALKGRTLTGLSNKELADRLNESPVNI
TRSLQALIAEGLVVKLEETGRFALSIQMLQIAVTHQRDTEKMQARMAEMD
QRVNAGAF
>MS0409 unknown
MTSEDPIYEKLNETTSIRGFITACVAIFDESVDQLINRVFRKTDFAVKSV
VDSLFINSGPLFDLSIRLKVLLGLGIISHETFMDINAFIQLKEALNNDGK
EYEFFDPIIISFIQGLNVRQDKSFLNLDTKIDGTKDSLLYQVKVLRREKL
IRSYLILSVTDLYDQLQVESPL
>MS1431 unknown
MSEPIVATPALIKRMGKPKDVFDLAANFPVGYILNPQTGKVWDWMLGR
>MS0113 unknown
MYSTVKHIVPQGEKRNMTNTVKTANQIKLEFHQQGKTISSWAKENGYSRT
DVSRVINGLAKGQRGKTLEIAVKLGMVIL
>MS0831 unknown
MMNYVAHAKDQALTAHHDLFSYHPMPFYEDTEQTRSRFHKKLDLNLYCIK
RPQQTCFIRVQNPDLMAWGIEQGDMLVVEKNDSLSIGDLIVIEVNQKLEI
FEFIAYDKNEFVFLSLSSKLNNIRTANWSTLPIIGTVTNTIHQMKPKNTI
SFAA
>MS2207 unknown
MLNQQISQVIAAELSVQPKQILAAVQLLDEGNTIPFIARYRKEVTGGLDD
TQLRHFETRLIYLRELEDRRQTILKSIDEQGKLSDELRAKINATLSKNEL
EDLYLPYKPKRRTKGQIAIEAGLEPLADLLWSEPEHEPESAALAYVDANK
GVPDTKAALDGARYILMERFAEDAQLLAKVRQYLQHNAVLVSKVIEGKET
DGAKFQDYFDHQELLKNVPSHRALAMFRGRNEGFLQLSLNADPDAEEGSR
SSYCEEIIREHLAVRLTGLPADKWRAQVIAWTWKIKVSLHLETDLMGSLR
EKAEDEAIDVFARNLTALLMAAPAGAKTTIGLDPGLRTGVKVAVVDSTGK
LLATDTIYPHTGRMNEAMVSLYQLGKKYHAELIAIGNGTASRETERFAKE
VIKQSTDWSAQTVVVSEAGASVYSASEFAAAEFPELDVSLRGAVSIARRL
QDPLAELVKIEPKAIGVGQYQHDVNQSQLARKLDAVVEDCVNAVGVDLNT
ASAPLLTRVAGMTKVLAQNIVAYRDENGCFESRQQLLSVPRLGPKAFEQC
AGFMRILNGKNPLDASGVHPETYAVVENILQVTEQSIRDLMGNSNALRRL
DATQFTNEKFGLPTVQDIFKELEKPGRDPRGEFKTATFMEGVEEITDLKA
GMILEGTITNVTNFGAFVDIGVHQDGLVHISSLSDKFVEDPHQVVKTGDV
VKVKVLEVDVARRRIALTMRLDESAVKNSEKSDRTLSTKSGQDRNRRDNR
QPQRNQFANNVFADALKGWKK
>MS0116 unknown
MPYFTTTQQGEEMNHTDFQPLPYPQTPESARAYFNLHGINRSEWARYFGI
DQQAISDHLRGRLKGTWGKSHKVAVLLGLKPNPETKVTA
>MS1375 unknown
MHCPFCSTEETKVIDSRLVSDGYQVRRRRECTKCHERFTTFETAELVVPK
IIKNNGMREPFNEDKLRRGIQHALEKRPVSADDVEKAISHITHQLRATGE
REVPSKLVGSLVMEELKKLDKVAYIRFASVYLSFENINEFSNEIEKLKD
>MS1448 unknown
MTTKKQAVFSRLVNELVQKNQGKRIFSFDFENQTYWVKQPEKLTGVWKIL
KPHPKQSFREELHILKNLYERGAPVPQVILSGEDFFVLKDVGPTLNHWIE
NAGLNLTPAEKNQILVDAIKALTSLHKKGVTHGRPAIRDIAWRQGKVTFM
DFESHSRSLNLQWHKIRDVLVFIHSLCRSKHLSGEQIQYLINKYEEYCES
DLWQDVLNLVAKFRFLYYILLVFKPVARMDLIAIYRLFQYLLPLTEENK
>MS1727 unknown
MKLTSKGRYAVTAILDIAINAEDGPVTLSDISERQNISLSYLEQLFAKLR
RHGLVKSVRGPGGGYQLGQPSGQISIGMIIAAVNENISVTKCLGQGNCQG
GKVCLTHHLWAELSDRIENFLNEITLEELVSKQHSQKTHTDFDNLLVVDN
>MS2211 acrR, AcrR protein
MKKNLNFVVKESITEALLRLMAKKNFDEINITAITELAGVSRISFYRNFD
SKEDVLIKYMYVRAKELYKPFESQDVSVRDKLIGMFKSIEGMEDIINLLY
AQNLSHIFLQYFNFVRGAKPEQENLDAYQNSIVVGVCFGALDEWIKRGRQ
ETPEQMVDLLQNVIWGFVKE
>MS2295 acrR, AcrR protein
MEQKLSPKQKGRPRTFDREKALESALFVFWNQGYTNTSIADLCNAININP
PSLYAAFGNKSQFFIEILDYYRRVYWDVIYAKMDVEKDIHRAIHIFFRDS
VNVVTVANTPGGCLSAVATLNLSAEETKIQQNMRQLKSDILKRFENRLKR
AIVDKQLPSQTDIPALALALQTYLYGIAIQAQAGTSKDDLLKVASKAGLL
LPKLI
>MS1936 acrR, AcrR protein
MAEQLTLDSIEPEPEKQSAKIEKRSIKERRQQVLTVLTHLLHSEKGMERM
TTARLAKEVGVSEAALYRYFPSKTKMFEALIENIESSLFSRISYSIKMET
NTLNRVHDILQMIFDFARKNPGLTRVLTGHALMFEEAKLQARVALFFDRL
ELQFVNILQMRKLREGKTFPIDERTIATYLVTFCEGQFMRLVRTNFRHMP
NQGFEQQWRFIEPLFE
>MS1300 acrR, AcrR protein
MKQDIRITKTLGLIRHVFLELLEEKGFEHIVVQDILDRAQINRSTFYKHF
QNKHAVALMLVDEIKQLLTENFENRFSIPTTEFAQKMVPIFWQHRDLIHL
IGKIENPRIHLYKDLALVIKEEYIKQAVREQPQSSEELDFQGYLFAIVSL
GTIRYFVEKGELPDPSVIVGDIESVFNLLIIK
>MS0453 acrR, AcrR protein
MRQSETDMAEQIFAATERLMAKDGLHHLSMHKIAKEARISAGTIYIYFKS
KEELLEQFAWRVFSLFQTALEKDYDETLSYFEQYKKMWLNVWYFLQDNPN
IVMNMQQYQSLPGFFDICKEMDYNSRWATFCQKAQQAGAVCELSVSILFS
LSMESAMNLAFKKLYINEFLADEELMTIIERTWRSIQK
>MS0153 acrR, AcrR protein
MINMAGVRAIQKEKTRRALIDAAFNQLNAEKSFSNLSLREVAREAGIAPT
SFYRHFKDMDELGLTMVDEAGLTLRQLMRQARKRIEKGGSVIVISVETFF
EFIAHSPNVFRLLLRESSGTSQAFRTAAAREIKHFVDELAEYLANKNNYS
EYVAYVQSEGMVTIVFTAGANALDMNNKERELLKERLILQLRMLAKGAHH
HMMERERHNTHLPATGKS
>MS2131 araC, AraC protein
MLNWLIRQTLKLKSGEKGTMGIETPVPELFVFHSETDLRDVSQLQESGIC
LILQGRKDVRVGDQHYRYQAGEFVCYTVDLPIMTEYLTDDGGYLDLRLFF
DLPLMREIIDELNRQNFSFAPASQQKIVSTASPELIRAFEMLICLTENSQ
DLPIMLPLIKKAIYFYLLTGEQGGTLRQIALQNSNSQRIVETVGWLKEHY
NESFDIEQLAAASSMSISGFYAQFRRLTGMSPLQYQKNLRLTKANALLKL
GQKNISEIAFEIGYDSLPQFSREYKRYFGHSPRSDLSRAG
>MS2322 araC, AraC protein
MIQKLLARDFFNNKEQPIILEPRAPQEIFPEHTHDFDELVIVKHGSGRHI
LNGYPHDLYPGVVLYIQAQDHHSYENLQDLCLTNILIQSNNNFKYLNNID
ILLNGLKPENSSYQLINKKTAEYIDSLLEKINAIDESYNLQNECLFFQVL
SSIQAHQFNDSGYGNTEEKGRQMIRWLENNFEKEIDWEELAEKFALPIRT
LHRYIKSQTGHTPQNYVTKLRLAQAYYQLKYTEKNIINIAYDCGFNDSSY
FSTCFKNEYSIAPRELRI
>MS2323 araC, AraC protein
MTDILQLSHHSYFISEESPITVERRHYQPPFPLHRHDFNEIVIISAGNGI
HFWNDEIHPITTGNVLYIESGDKHKYGEVDKLKLDNILYRPEKLSLFPIM
KDYIPHNNEKKSLRINQETLVQLQSLISQLEIESKKTNKSSMHLSEAIFL
QILILICRTQQQENKAYSDISKLESLFSALNQSISQEFYLADFCRQHQLA
VSSVRRIFKQQTNMTIAQYLQKLRLCRAATLLRNTSESVANIAIRCGYSD
SNYFSSVFGKTFSCTPTEYRSRFIKK
>MS2173 araC, AraC protein
MPKPLILSRKNLANLGSVIQQRKLLYTRMAVDEPTLLYIQVGQKTLRWRG
QELTIQAGEMVLLAAGQTFDVLNNPDAKLGFYQAGWIALEQRVVDEFADL
FGVETYVQELAKIQPLAPLKAHFDVVRQALENDEAPELVLKLKLFELLAW
LKAEHLSFVPHEKHNLLRQIRKMIASNTAFEWTAETIARQLHLSETSLRR
ALQKSDTTFREVLTDVRMSRALTLLQITKWQVARIANEVGYDSPSRFTVR
FKQRFGFLPSDIRENLSQPVQNEQQKLVRIGVKK
>MS1229 araC, AraC protein
MRFCWYTSDNNKSVVNQINMKTSHLAKQTSTELADKSGSEIISPLSLSLD
ARPFNVEIQQPPGNMPAYHWHGHIEINIPFDDDVEYSFNEHSTLINAGHI
SIFWASIPHRLTDKHNCRTMAVFNIPVYQFLSWQLSQNLINHITHGIIIQ
SKNPRLVSLFEVQRWEQELKLEDPNRHKLVYDEIQLMIKRVSLDGWLLLL
EPPKKNNHQLSGSKHAQNYVRTMLDYIANHYNAPLTVQSVANAVGLNTNY
AMGLFQSAMQLTIKQYIIMMRINHAKALLSDTNRSVLDISLTTGFSSMSR
FYDNFLKYTGVSPNKYRKQIRADDNWSAQGLIPTTQAIKGASTGEKLIMT
GEHFNQSEEF
>MS1400 araC, AraC protein
MANIRQNQSISELHYQPHKHHPYGIELFTVASLRARSAEVVMEKNYLYQC
DMIIVVTQGSGTLWQDFEPVACMQGSVLWIKQGQACSFGNDKHWDGWVLM
IKNKPLLSEFDYQINTLWLSENELENVEQSLKQLKQDSEKPYSIVHKQLI
HHQFYAFLWRLISLTPNQTILYSPRLRSRFDSFQSLLESYFHEWHHVHQY
ATALACSEKTLSRACLEITQQPAKTVINNRLLLEAKRLLVQSNQSIASIS
LQLGFNEATHFVKFFKREAGITPQKFRELG
>MS2105 araC, AraC protein
MSGILFLLITCLFIQIIMFSEQSFARLLDVIPHNQTYHSPIKGLIIHHSD
HPFSYDNVIQEPSICIVIRGEREVQLGNQCYLFDNRHFMFCPVNVPMCGK
VLQATAEEPFVVMSMKIDLQAVNKILLEQTALLAKNSENPTAFGQWHLDA
ELENAFERLLLLHENTKDITFLAPLIQQEIYYRLLTGEQGDKLKQMVSFG
SNTQKIAKATEYLKAHYIETITVESLAELCGMSLSGFHNHFKKHTTLSPL
QYQKSLRLMEANRLISQENLPISTAAFQVGYESPSQFSREYKRYFGKAPS
VR
>MS0060 araC, AraC protein
MKYQREVQQETNPLLPGYQFGSYLVAGCTPIEKGNEVDFAIRRPNGMKGY
IINLTTKGEGTVFEGDRAFTCCKGDLLLFPPNAEHLYYRSQSSESWHHQW
IYFRPRSFWANWLQWSHISDHVGRLTITDPTTYEEILALFKKIEREYNAK
DIFSEAMSMCLLEQLLIKCIKLDPVNSQRMLDPRILETCHFISANLHINH
KITEIAEHIHMSPSRLTHLFAQQTGSSIIKWREEQRMIKAQHLLHTSGAP
IYAIARQLGYDDQLYFSRLFKRYSGLSPSDYRNSR
>MS1267 argR, ArgR protein
MAIEKTDNLLTVFKDLLSQERFGSQSEIVSALQDLGFSNINQSKVSRMLT
KFGAIRTRNTRMEMVYCLPNELSVPNTSSPLKNLVLDIDHNDFLIVIKTS
PGAAQLIARLLDSVGKTEGILGTIAGDDTIFITPTKGTGIKELINTIQQL
FENSL
>MS1708 bolA, BolA protein
MEITMETQEIERILKQALNLDEVYVQGENAHYGVIVVSEEIAKLSRLKQQ
QTIYAPLMDHFSSGEIHALTIKTFSPEKWKLERMLNVVN
>MS2285 citB, CitB protein
MRKFYRTFFQNATIRRSKAYPGEITMIKVALIDDHVIVRSGFAQLLSLEE
DIEVVGEFGSAKETRQNLPRIKADVCIIDISMPDESGLDLLKSIPSGIHC
IMLSVNDSEMIVKKALELGAKGYLSKRCSPEELVQAVRTVYTGGVYLMPE
LTVKLVTNKNNNPIQQLTKRELEICELLIRGLGAKEIGEQLGLSFKTVHA
HRANAMSKLDVKNNVELANLFHQYS
>MS0278 citB, CitB protein
MHGIAWGYCFIIVKVRNMSEKTKVIVIDDHPLMRRGIKQLIELEEQFEVV
GDAGSGNEGVELAIKTSPDLIILDLNMKGLSGLDTLKVLRQEGVDARIVI
LTVSDSKADIYALIDAGADGYLLKDTEPDTLLAQIKQIAQGEIILSDSIK
NLLVERHPAHEPIHALTDREMDVLQLIATGLSNKQIAAQLFISEETVKVH
IRNLLRKLNVHSRVAATVLYLEYKGS
>MS1144 cspC, CspC protein
MSKLNGLVKWFNSDKGFGFITPADGSKDLFVHFSSILGNNYRSLNEGDRV
EYNVENTQRGPAAVEVAVIK
>MS1095 cspC, CspC protein
MEVGVVKWFNNAKGFGFINAEGSDADIFAHYSVIEMDGYRSLKAGQKVNF
EVVHGEKGSHATKIIPILE
>MS1361 dinG, DinG protein
MANIDQIKAAFSERGQLSSNIKDFRPRSEQLEMAEAVGKAIENKGVLVVE
AGTGTGKTFAYLTPALLSKKKTIVSTGSKNLQDQLFKRDLPTIQKALNYS
GKIALLKGRANYLCLERLDQVIAQGVLGDKSVLVDLSKVRKWNNATKTGD
LSECVELAEDSPILPQLTSTTESCLGSDCPNYGDCYVAAARKRALAADLV
VVNHHLFCADMAVKENGFGELIPNAEVIIFDEAHQLPDIASQYFGQSITS
RQLFDLCKDINIVYRTEIKDMPQLGVASDHLLKMVQDFRLLLGEGNNRGN
WREWLVKPDVQKGFKVLQEKLDFIADVVKLALGRSQTLDSIFERISALKA
QLVRLSDTSVTGYCYWFETFNRQFGLHITPLTVSDKFGEQMNNHESAWIF
TSATLEVGGSFNHFRQRLGIRATDEKVLQSPFNYPEQALLCVPRYLPGSN
QNHTMTKLAEMLLPVIEANKGRCFVLCTSYFMMKGFAEYFREHSGLSILL
QGEISKTKLLEQFVSEEHSVLVATSSFWEGIDVRGDALSLVIIDKLPFTS
PDEPLLKARVEDCQLQGGNPFNDIQIPEAVIALKQGGGRLIRDVTDSGAV
IICDSRLVTRPYGETFLKSLPNAKRTRDLNKVVEFLKSIQQNRT
>MS0431 fadR, FadR protein
MFTQKSANSSPSVLKARSPAALAEEYIVKSIWSNFYPPGTDLPAERELAE
KIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNDVWQTSGLNILDVLVRL
DSTMSPTLIANMLSARTNIAIIYIPRAFKVSYEKALASFDGLENLPETAE
SYTAFDYEILHKLAFISLNPIYGMVLNSLKGLYTRVGSYYFAIPEARALA
KKFYIELRELGKAHRLDEIPSLFRQYGRESSLIFEAAQDGLAQYLIEN
>MS0538 fadR, FadR protein
MTDNAELRSYKKIGSILKQELIDGLYQIGERLPPERDLAEKMNVSRTVVR
EAIIMLELENLVEVRKGSGVYVINMPLTSEENQDDTYEDVGPFELLQARQ
LLESGIAEFAAIQATRSDILRLKEILNKERMTLAEDDKDYTADEEFHSAI
AEITQNEILIKLQKELWKYRTKSSMWQGLHAHITDQEYRKSWLQDHQNIL
NGIQRKNPALAKKAMWQHLENVKQKLFELSDIEDPDFDGFLFSVNPVVVG
L
>MS0531 fis, Fis protein
MLEQQRSPSDALTVSVLNSQSQVTNKPLRDSVKQALRNYLSQLDGQDVND
LYELVLAEVEHPMLDMIMQYTRGNQTRAATMLGINRGTLRKKLKKYGMG
>MS0380 glpR, GlpR protein
MKLNEKEQLIIDSLKRKDVITNIELSEILQCSTVTIRSLIRSLEKKGLII
RTHGGAKLCNDYLDIHIPAGNIFKEREAKLRIAEKAYQYIAERDTIILDD
SSNSYYLAQVIKKYSDKYLIIITNSLPVIAELSTCSAVEIISIGGVLRGN
KNAFVGDFAIEMLKNFKATKAFIGVHGIDPEFGITSIGNEQMMIKKQIFK
IAQYVYVLTCSEKFGTGYLLVSAPLSQVHKIITDKNIDKNILNVIKSSVD
IDLV
>MS2316 glpR, GlpR protein
MREKKVKPRERQSAIVEFLQINGKTAVEQLAQIFKTTGTTIRKDLTALEA
EKKVLRAYGSVVLVNKDEIDLPEANKTNTNLEVKRRIGQKATEFIGDGDS
LLMDSGTTVLQMVPYLAKYRDLTIMTNSLHIMNALTGLERDYELLITGGT
YRQKSASFHGILAESTVEKFTFDKLFIGTNSFDLDYGLTTFNEVHGVSKS
MCKAAREIIVLADSSKFQRRSPNVVCPLEKINTIVTDKNLDPAIHQALIE
KNINVILV
>MS2186 glpR, GlpR protein
MKQSIRHQKIVELVKLQGYISTDELVTLLNVSPQTIRRDLNELAENNLIR
RHHGGAASPSSAENSDYSERKLFFSLEKNHIAQAVSRLIPNGSSLFIDIG
TTSEAVANALLGHQNLRIVTNNLNAAHILMKNDTFKITVAGGSLRQDGGI
IGEATVNFISQFRLDYGILGISSIDLDGSLLDYDYHEVQVKRAIMESSRE
TVLVTDHSKFSRQAIVKLASVTDVDYLFTDQEPPKSIMELIHNSSVELRV
CK
>MS0024 glpR, GlpR protein
MVRSNIMNEQIRHNKLLTLLGENGFLSVQEIMTALNISPATARRDITKLN
EQGRLKKLRNGAEAVIQSTFQPQKKQNEIKNLDEKQRIAALAASLCQNDS
SAILTCGSTMLLLGNALCNRNVQIITNYLPLANQLIENDHERVVIMGGQY
NKSQAITLSLSEHNEAFAADIMFTSGKGLTAQGLYKTDMVIASSEQRLLK
RAQKLIVLVDSSKLDKTVGMLFTELKNIDLIITGQEADPDFIRTLREKGV
DVMLA
>MS1983 glpR, GlpR protein
MIPAERQKMLLNLISQQDIVSISQLVETLGVSHMTVRRDIQKLEEEGKVV
SVSGGVKMLEHLSIEPTHNDKSLLSPSQKSQIGIKASEIIPEKTTIYLDA
GTTTLEIAHHIVDREDLLVITNDFVIANFLMKAGKCELIHTGGSVNKSNY
SSVGELAAQFLRQISIDIAFISTSSWNLKGLTTPDENKLPVKRAILQSSN
KRILVSDSSKYGKVATFQICPLSEFDVIICDSDLLENAKDAINEMRIELL
LV
>MS0074 glpR, GlpR protein
MSVDRQNAIKLFLRSHNMATVEQLVKITNSSPATIRRDLIKLDDAGIINR
THGGVSLRDSFPYQPTTNEKQYQHVTEKENIADYVVSLISPGDSVLLDAG
TTTLCIAKKLVNIPLRVITSDLHIALLLSEYKQIDIVMTGGAIDKSSQSC
IGQHGLDLLQNINPDFAFVSCNSWSIERGITAPTEDKANLKKCLLQNSRR
KVLVADSSKYGKCSLFKVIELNRLTDIITDHNLPQSAQKALNELDLSVAF
A
>MS0187 glpR, GlpR protein
MKRNFQQRNTQQRRHGIMQLLQQKGEVSVEQLVQLFETSEVTIRKDLTAL
ESNGFLLRRYGGAILMPQDLMDESQDENLSKQKLSIAKAAAERIRDHHRI
IIDSGSTTAALIKQLNSKQGLVVMTNSLSVASELRSLENEPTLLMTGGTW
DTRSESFQGKVAEQVLRSYDFDQLFIGADGIDLARGTTTFNELVELSRVM
AEVSREVIVMVESQKIGRKMPNLELNWQQIDVLVTDDLLSEKDKAVIERH
NIEVIIAK
>MS0524 gntR, GntR protein
MFFIKNKDNMSRDLNLRQDIINQMIDDISSDLLTSPLPSLSALATLYNVS
RTTIRHAITYLTEQKIINRIDAQLIITKKPSADDKITYIKIKKPGNNQIK
KLEKYFSSAVQQKIIKPGDDFTELELAKNANVDIFTVREYLIQFSRFNLI
SHISAGKWRLTKLTQHYADKLFELREMLECHALNCFMNLPKNDIRWKQMK
LLLQEHRILRNNIVEKYVDFSLLDQQLHSLILSAADNPFINDFINLISVI
FHFHYQWDNSNLRTRNILAVEEHLAILVKIVSQDDLGAITELKRHLQTAK
NGLMNSIRLMNN
>MS2208 greA, GreA protein
MAKSNYITRAGWNVLDQELKYLWKDERPKVTQAVSDAAAMGDRSENAEYI
YGKRRLREIDRRVRFLSKRLEVLQIVDYNPKQEGKVFFGAWIELENESGE
IKQYRIVGCDEFDPAKNWISIDSPVARALIGKQIDDEVRVETPAGKVLLY
VNNIWYEK
>MS0961 greA, GreA protein
MKQIPMTVRGAELLKQELDFLKTTRRPEIIKAIAEAREHGDLKENAEYHA
AREQQGFCEGRIQEIESKLSNCQIIDVTKLPNNGKVIFGATVVLVNTEND
DEVTYQIVGDDEADIKSGLISVNSPIARGLIGKEVDETVSIVVPGGKVEF
DIIEVNYI
>MS0195 hepA, HepA protein
MSFAVGQRWISESENDLGLGVIVGMDNRTVTILFPASDEQRVYALAAAPL
TRVEFQKGDTVVHHEGWKAQIIDVTENNGVLIYLTIRLDTQEEAVLREMD
LAHKISFSKPQERLFGAQIDRSDRFTLRYHALQQQQAQFQSPLRGLRGIR
AGLIPHQLHIANEVGRRVNPRVLLADEVGLGKTIEAGMILQQQLFAGKVE
RVLIIVPETLQHQWLVEMLRRFNLHFSLFDEERAADFAANEYDEERNPFE
SENLIICSLDWIVAQPKRAQQILQAEFDMLIVDEAHHLVWSERQPSMAYQ
VVEQLSRRIPAILLLTATPEQLGQESHFARLALLDPDRFYNYDAFVAEQK
NYQPVAEAVQTLLNEKPLNTAEQNAIADLLEEQDVEPLFKVINSMAEESE
RLQARQELIDNLVDRHGTSRILFRNTRQGVKGFPHRIYNQVTVEMPKQYV
NAVKVMNLLGEEIGDGLFYPEQIFQKMNPEAKWWEFDPRLEWLITFLKNH
REEKVLVICRHANTAIQLEQALREKEAIRSAVFHENMSIVERDRASAYFA
LQEEGAQVLLSSSIGSEGRNFQFACHLVLFNLPDNPDLLEQCIGRLDRIG
QTRDIRIHTPCFADTPQVVLARWYHEGLNAFEETCPMGMTIFTECGEKLK
NFVKNPTQLDGFEEFVAQTRKRQQVLKQELENGRDRLLELNSNGGERAQK
LAEHIADEDNSTALVNFVLNLFDVIGIEQEDLGEKSIAIIPASTMLVPDF
PGLKEEGVTVTFDRRLSLAREELEFLTWDHPIVTNGIDLITSGDIGKTAV
SLLINKSLPPGTLLLELIYVVESQSPKGLQLTRFLPPTPVRLLLDAKGNN
LAAQVSFQALEKQLRPVKRNMANKMAKMIRPNIERLIAGGDKHIAEQARE
IIQSAKQKADQTLSAELDRLNALKAVNKNIRQDEIDILAQIREQSLTQLD
QANWRLDSLRVIVSNKE
>MS0112 hipB, HipB protein
MNLSSLFSVRLKNERNRLGLTQAEIAKKCGVSREMWGKYERGVALAGSEV
LFSLAAIGVDMDYILLGTRKEVFEEITTEALKDMPKADFSDKTGLLVQLF
MQCDDNGRAAILSVAQTMAGMANKTGHQNSDSTGGQSFAGDVHGGQFSTG
TINNYGEKK
>MS1463 hypB, HypB protein
MCTTCGCGHPEQVRIGELQHTHSHSEHQSAVKMPDFSQSVFHSMKPSIHE
HAGEQDNTQKRLLKIEQDVLGKNNRIADSNRNLFNYLNLTVFNLVSSPGS
GKTSLLTATLNSLKNDRNCYVIEGDQQTENDADRIRATGVPAIQVNTGKG
CHLDAQMISDAMMKLRPQENGLLFIENVGNLVCPSEFDLGEKAKVVILSV
TEGEDKPLKYPHMFAASKLMILNKVDLLPYLKFDVEKCIENAKRVNPQIE
VIQLSAATGEGLQDWLNWLQQ
>MS0562 iclR, IclR protein
MEKENQPEAVSSVLKVFGIIEALAEQKEIGITELAQRLMMSKSTTYRFLQ
TMKTLGFVSQEGETEKYTLTLKLFEVGAKALEYADIIGLANHEMSYISRQ
TNETLHLGTLDGTEIIYLHKIDSGYNLRMYSRIGRRNPIYSTAIGKVLLS
GLTNKEIRELLADLTFVKHTSKTLENIDQLIEEIEKVRKQHYAEDNEEQE
PGLRCVAAPIYNRFGRIIAGLSISIPTIRFEEEKLPQLVNLLQVAGKNIS
EQIGYHDYPEILAP
>MS0055 iclR, IclR protein
MFFIVRRLKEMEKNSGNQSLIRGLRLIEILSRFPNGCPLVQLANISELNK
STVHRLLQGLQQEGFVQPAITVGSYRLTSKCLSIGHKIFSSLNIINIISP
HLENLNLDLGETINFSMRENDHAIMIYKLEPTTGMMRTRAYIGQHLQLYC
SAMGKLYLAYDRPAYLKEYWQTNNDNIQTLTCNTITELPVMEKELDEIKK
QGFAVDKEENEIGISCIACPIFNFQNKVEYAMSVSISTSKLNQYGIEHLL
EKIKLTAEAISLELGWLPESVQN
>MS0744 lexA, LexA protein
MSAFCTKKQGIYMKPIKALTARQQEVFNFLKHHIETTGMPPTRAEISREL
GFRSPNAAEEYLKALARKGVVEILSGTSRGIRLLVDTEESANDEDAGLPL
IGRVAAGEPILAEQHIEGTYKVDADMFKPQADFLLKVYGQSMKDIGILDG
DLLAVHSTKDVRNGQVIVARIEDEVTVKRLERKGDVVYLHAENEEFKPIV
VNLKEQPNFEIEGIAVGIIRNNAWM
>MS1455 lrp, Lrp protein
MVNFMEKKLPKALDSIDIKILNELQRNGKISNIDLSKKVGLSPTPCLERV
KRLEKQGVIMGYKALLNPELLNSPLLVIVEITLIRGKPDVFEEFNAAVQE
LDEIQECHLVSGDFDYLLKTRVADMAAYRKLLGTTLLRLPGVNDTRTYVV
MEEVKQTNFLQLK
>MS0035 lrp, Lrp protein
MYAIDSLDQQILRVLTKDARTPYAEMAKNFGVSPGTIHVRVEKMRQSGII
EGTKVRIDERKLGYDVCCFIGIILKSAKDYDKVIKQLEGFDEVVEAYYTT
GNYSIFIKVMTHTIAELHSVLATKIQLIEEIQSTETLISMQNPILRDIKP
>MS1689 lysR, LysR protein
MQSSIYGYLTYFHEIVIEGSIAGAARKLEVAPPAVSNALKLLERHLGLPL
FTRTTRKMELTEAGQRLFESTKDMLRGLDSVMESVRDLTEKPSGLVRITT
SIISYLLVIRPHFAEFCERYPDIRLEISVNDGIVDIVKEGFDVGMRFGDR
LEQNVVAKKLLDPVRLGLYASESYLRKYGKPETLEDLSQHKLLGYRFVTA
NRTYPLTFNQDGREISIDMPYSVLTNNLTVELDTVRQGVALGQLFEPVVN
ALHDRKNFIPVLDAHWTQYPALYLFYMQHSQKAGKVRALIDFLEEKIKG
>MS0763 lysR, LysR protein
MKPIFLELRHLKTLLALKETGSVSLAAKRVYLTQSALSHQIKLLEEQYGL
PLFERKSNPLRFTAAGDRLLQLANDILPKVVAAERDLSRVKQGEAGELRI
AVECHTCFDWLMPAMDSFRQHWPLVELDIVSGFHTDTVGLLLTHRADWAV
VSEVEETDGIVHKPLFSYEMVGLCAKDHPLAHKEIWEAEDFADQTWITYP
VPDDMLDLLRQVLKPAGINPVRRTSELTIAIIQLVASKRGVAALPFWAAK
PYLDRGYVVARKITQNGLYSNLYAAYREEDANSAYLEDFYETVKSQSFST
LPGLSVLE
>MS2006 lysR, LysR protein
MNLDWNDLHYFVLLVEKETLTAAANALDVEHGTVSRRIERLEKQLGLHLF
NRINKRYLLTDDGRDLYAEAKKLQLNIKQFAQTAQDKCQSMGEVTVSAPP
FVANSLITPLLAHFYRRFRHIRLILNSDSGLSNLHRSQADIALRIAQPKQ
DDLVAHRLMNVEYRWFAHRDYLACTPESERQFLSLNLTGTHQQWLQTQLT
GKSVRFACNDFNIMKSAVLQQLGVGLLPVCYIDSPDLAAVKNMEYFRAPL
YLVMHEDVRQSQKVRMAADFLIENLRD
>MS1097 lysR, LysR protein
MDKLNAISVFCRIIESQSFTQAAALENISVAMASKLVAQLEEHLKTRLLQ
RTTRKIVPTEAGLVYYQRCQPILLELKEADSSISDLSTSLQGNLVVSVPM
DFGLKFITPTLPAFISANPNLHVEMEFSDRRVDLMAEGYDLALRIGSLQD
STLVAKKLATTSMHFAASAEYLRRYGTPRKPEDLQYHQCLLYKAIGNQIY
WEFANKGKIQRVKMRSKMVCNNGLTLVQLAKADLGIINSPRFLVEEELAS
GELIEVLPEFKQQLLDIHAVYPHRRHLAAKVKAFVEFLSGLNLGSET
>MS2143 lysR, LysR protein
MPEMKKTDRFNHLISFTHAARFGSFSAAAEALDLTPAAVSKNVALLEQAL
NVRLFNRTTRSLSLTEEGQVFYAESKKALALLEEAVNQITLAESQEIAGN
VRISMPNVVGRNLVFPLLKSFNEDYPKIHLELDFDNKAIDFVKAGFDFVL
RVGESSEGSLVARHIGMIQTCLVASPAYLKSQGVPKNMADLPQHQLLMTR
LPNGKLQPWTFNEQGDNVHFLHAQPHLVLTDAEMQTQAAVQGFGITQLPV
YLALPYLQNGELVTILNDSYQPLKLSLNILFPHRTLLAQRVRTTMDYLLE
QLKQHEGLRMTQEELKAFSFK
>MS1403 lysR, LysR protein
MRELRNLDLNLLKAFDVLMDEKSVSKAAQRLSVTQPAMSGILQRLRDSFN
DPLFVRVQRGIVPTNRALELRQPIKQLLQSAEQLLQPKIFDPQTAELTLT
IACTDYALRAVISPFLAVLKQRAPKIKVAILAINEQNLQSQLEQGVVDFG
LVTPDFSAPDIHSKDLYQEQYVCALRKDHPVAQQGSISLEQFCRLEQALV
SYQGGSFSGATDKALAKLGLTRNVTVSVQNFIVMPEFLANSDLLAVVPKR
LVENLANIHYFEPPLQIDGFTKTLVWHERTHRDPAYRWLRELMAEVC
>MS0884 lysR, LysR protein
MMLDKVEAVRYFCIAAETLHFRETANRLAISPQVVTRMIAELERELGEPL
FKRNTRNISLTDFGQAFLADAQQWLKATETLFQTDFKESMSGTVRITLPR
LPNNDVILTELLTALSPYPDLHIDWRPDTALYNSITRQIDIGIRISLEME
PHFIAKKITHIKERIVASTALLNRLGQPRDLDDLQNRFPLCAEINPQTGK
AWHWFNTAEQSFVAKKPYFMSSESYSNLAVILKGLAIGVLPDYYYLPHVQ
TGKLKILFPDLPIPEWKMFLYRPYQENTPLRVIHVFGLLEKILVKHYHTT
G
>MS2092 lysR, LysR protein
MNKLDALKFFITAAETLNFREAAVKLAISPSVVTRTIAELENQLGEPLFK
RSTRSITLTSFGELFLPKAKRLLEDSDTLFQTAKDDNEMKGVVRITLFRL
PNHEQILFELLTALRPYPELFIDWRLDMMRLDTVEHRIDIGIRVGREPNP
NFIIKPIAKVQHIFVAAPDLLERLGAPKDFEDLRQRYPFSGLINPETGKV
WEFMLDGVNTFLPRHLEFFSTDPDTQIQAALAGRAVVQASDLACKEYLAN
GRLVKVLPQIQQEKWQLYLYRPYQTITPKRVMKVFEVLEGVLRKYLG
>MS2008 lysR, LysR protein
MHITHCLRTLKALKLQNNVHLCTIKPKEQRRETMNLDWSDIHYFVLMVEK
QTLKATAEALQVEHSTVSRRIERLEKQLNVHLFDRINKRYLLTADGQRLY
TEAKKLQFNVRQFVQAAQDSLQEMTNVLVSMPPMIAHALVSPHLAAFQQR
FPAIRLVLSSNTAISSLHQRQADIALRLVVPQQNDLVVRRLRDMQYGWFA
HADYVKNTPESQWQYIDFGVTGPHTPWLNKQLADKSIGFVCNDFAVMQSA
VMQKLGIGWLPFEYGNSSEFIQVHTSEIFIGQLHLVMHEDVRHAQKVRDV
ADFLIEILRE
>MS0336 lysR, LysR protein
MLKDKKTWPLIEDLNVFLTIIRKNSFSGAAKELGQSNSYITKRINILEDH
LHTSLFYRNTRNIKLTAAGEYVQNQAIAIIDKMDSLMTNIVEDKKSMFGH
LHICSSFGFGRTHLAKPISLFAKQHPNLSLDLTLTDHKLDLIKENIDLEI
AVGNDLNDRYFAKKLANNRRILCASPDYLQSYGLPKKVEQLSKHNCLFLK
EKNSSFGVWKLFNGKILKSITVNGGLTTNNGEVILQWALEGHGIIYRSLW
DAEKYLISGELVHILPEYYEDAPIWVVYPNKLSESLKTEIFVNFLTEYFA
KKELTKSHDE
>MS2151 lysR, LysR protein
MNSTEYGQLLIFQAIAKEGSISACARALRISVPAVSKALRQLENRLGVPL
FQRSTRKIQLTETGVQLLEQTVQAVDTLSQAFENAKTLAKTPTGTVRITV
SQVAFSLILQPVYAEFRERYPHIVLDISINNATVNLIDEQFDLGIRFGNH
LEEGIVARRLTGEIREGLFISPQYAQKFGTPKTLADLAHHQLIGYRFITA
NRFHPLTLMENGQPHTIEMPMSLILNDSEMAIDAIRQGFGIGRIFEAQYE
RLESKIDLLPVLKKHWQTLQPMYLYYQPKSQKVKRVQVLIEFLQEKMEVL
GW
>MS0044 lysR, LysR protein
MRKKPMEFNELKLFLHLAESQNFSRSAAQNHMSTSTLSRQIQRMEDELGE
PLFLRDNRRVQLTECGEKFKIFAQQSWNQWQHFKQQIHHNENELNGELKV
FCSVTAAYSHLPQVLEKFRLRYPKVEIKLMTGDPALALHQVQSQQVDLSL
SGRPLHLPNSIKFHYIDDISLSLIAPRIACPATQLLQHSPIDWQRIPFIL
PVEGPARQRIDQWFRQQKIKHPKIYATVAGHEGIVSMVALGCGLALLPDV
VIKNSPMNSQVSSLTLDIPVYPFELGVCVQKKSLELPLIKAFWDSLQTEN
AG
>MS1395 lysR, LysR protein
MKENLNDLRAFLVVARTGSFTKAGAQMGVSQSALSHSIRGIEERLNIKLF
HRTTRSISTTEAGEQLYQRLSPLFDDIDNELNELSEFRNAVTGTLRINGN
EHAFYYALGDKFVRFSQKYPEVNLELVAENRFIDIVAERFDAGIRLGSDV
AKDMIAVRLTDKLPMCCVASPEYLANYGTPKTPYDLTEHQCLLHRLSNGG
VMNWEFIDPKSKGRILKVQPQGTISANGGRVLENYARSGLGILWCPLDMV
EEDIRSGKLIRILQQWDMDYDGYHLYYPNRRQNSPLFKALVEELRLVK
>MS2130 lysR, LysR protein
MLNKFDALRYFCVAAETLNFRETANRLSVSPSVITRVVNELEAELGEQLF
KRHTRSIKLTSFGEQFLLRAQHLLAESETLFKMGKNQADDLAGIVRITVP
SWRNNDEIIRQLLITLESYPEIIIDWREDMGKLDMVEDRIDMGLRIGLEP
DQDFVVRKITEIGDVLVASPALVKKLGQPTDLTDFERRYPMAIPINSNTG
KPWTLFLNEDITLNPKNPAFYSVDNYSALQAVLLGKCAGLINDFMVKPYL
EFGELIQLFPEIQIDKWQLFLYRPYQTVTPARVLKVFDLLTEILRKTYY
>MS1039 lysR, LysR protein
MKIQQLRYIVEIVNQNLNVTEAANALYTSQPGISKQVRLLEDELGLEIFE
RNGKHIKTVTPAGKKIVAIARELLVKTQAIKAVANEFTQPNHGVLRIATS
NTQARYMLPAVIERFSKQYPNVSLHVHQGSPNQLYDALLSSEVDLAITTE
AQYLFDDVVLLPCYMWNRSIIVKADHPLAKLSHVTIEDLGKYPLITYTFG
FTGVSDLDQAFNSAGILPNIVFTATDADVIKTYVRLGLGVGIIASMAHTD
ADTDLIRIDASHLFKSSMTQIAFKHSTFLRNYMYDFINYFSPHLTRAKVE
KAERARDNTAVQKLFEGIDLEVR
>MS0154 lysR, LysR protein
MNIRDLEYLAALAEYKHFRRAADACHVSQPTLSGQIRKLEDELGITLLER
TSRKVLFTQSGLILVEQAKKVLREVKLLKEMASNQGKEMTGPLHLGVIPT
VGPYLLPYIMPALKEAFPDLELYLYEAQTSHLLDQLESGRLDCAILATVP
ETEPFIEVPIFNERMLLAVSEQHPWAKEKSIKMHALQGHEVLMLDDGHCL
RDQALGYCFTAGARENSHFQATSLETLRNMIAANAGMTLMPELAMLNEGT
RAGVKYIPCTDPEPKRTIALVYRPGSPLRSRYERVANAVGDAVKAILHTE
GD
>MS2152 lysR, LysR protein
MNDKFSGIEEFLMTVEMGSFSAAAERLNLTGSAVGKSISRLEQRLNTQLF
HRSTRKITLTREGEVWLASCRRMMEELEQAKLLLSSQSQQIIGEIRIDLP
TTYGRSHILPKLLAIQADYPKLYLNISFQDRKVDMIAEHIDIAVRFGELA
DLTDIIAKQIDCFQNQLCATPAFVSKWGKLNHPDDLTHFPCIVGNQISWR
LMNEQGKSTGFPLNVQHQINDGDARLQAVLADCGIAFLPDWLIQPAVEAG
KLVQLLPEFTPPPEPIYVLWQKKLHLQPKVKAIVNSLV
>MS0895 lysR, LysR protein
MERKMFKRLPPLNSLKAFESAARFLSFTKAADELCVTQAAVSHQIKLLED
FLNIRLFIRKNRSLELTELGKNYFQEISPILQKLADVTEKLKSTDNPHLT
ISVLQSFGINWLVPRLNRFNQLYPNIEVRIKSAEQDEGILGNDIDVAIYY
GYGNWDNLKTEKLSEDNLLILASPKLLANNPVNSKDDLKHHTLIHVHTRD
NWQNMATELGISDLNIHIGPLFSHTFMALQAAVHGQGIVLANSILAQQEI
DNGNLQVVLPYELKDPKSFYVVSDTNRTNDQNISAFRQWIMQEMKYN
>MS1210 lysR, LysR protein
MMNYAAMLHNLPNLNELYFFVQIANAGSFTKAAERLGVTTSALSQNMRSL
EKHLDVRLFNRTTRSISTTEAGEKLLAEIAPHFLAIADAVRHLDEIRDEP
QGTIRINTSEIAANLIIYPKLQPFLLANPHIKVELVIDNRWVDIVAQGFD
MGVRLGYAVFNDMIAVQISEPMKMVLVASPGYLKDKPLPKKINDLTNYHL
IGSRFSSEHSQLEWEFMDKGQKVGFQPMPQFSINNDLRTQAALDGFGIAW
LPEIRVHEELKNGNLVEILPQYAYTYDPFYIYYPNRKGNSKAFQMVVELL
KFKK
>MS2176 lysR, LysR protein
MQFYSITVPKIAFSYLFHKMKTAKSFIPENEIMNRLDALKYFIVAAETLS
FKSTASRFSVSPQVITRVISELEGELGEQLFKRNTRAIRITDFGSRFLAD
AIAFLQQEERLFGGVKTAEESLSGLVRITLPPSDYADKILLRLLTALAPY
PDIQIDWRTDFDTLKAVDDQIDIGIRISRTPEDHWVAKKITDLQEPIVAA
PSLIAKTGLPKDVFDLAANFPVGYILNPKTGKVWDWMMGEQPIILTKPTV
ITSDIKSLLPAVLSGRIFAPIMYHDCKSYLDSGELQVVFSNEETLIWGIY
LYRPYQTITPKRVLLVFELLEKILEEGF
>MS2134 lysR, LysR protein
MQKWKDNMKEISLDDMRLFVSVVQSGSLSHAGELTGIPVSRLSRRLTQLE
QALGTQLLNRGKKGVSLNELGERFFEHSQQMLQQAELAIESVQKSLENPS
GLLRISVAADIFYLFIQPYLATYLNENPQVNLEINLSNQKINMIQDGVDL
AIRTGVIDNENVVARLWKKMEFGVFASQAYLAKYSEPQSPNDLYQHHIIS
QMYTLPWRFQQGNQEVAVFPHSRLTCNDFAIVEQQLKQHSGIGILPITKN
HNRSDLIRILADWQLQSVPVSLIYYRNRGAIATVRSFVEFLQRLV
>MS2116 lysR, LysR protein
MNTKNTSVYALKLFLQVLELGSLSEVARRENLSASMLSRLIKQLEDDWGA
ALFYRNTRAITPTETGLLLAEYARQIVSQFQAAEQAITAQTAEIAGTVRI
NAPVFFGQLHIIPHLAELQARYPNLIVNLVQTDDYIDPFTDSTDIIFRLA
PLNDSSLKVRILAQQHFCLAASPSYLQKYGTPKIPADLAKHHALLYKGKT
GTLRWLLQEGENWQACSPKIALTSNNGNAIATACVQGMGIALLANWAASD
LLKEGKVVRLLPEYNFSTQTVPVYVAMLYPQTAFISPSVRAVLDYFREIF
QDKSW
>MS1415 lysR, LysR protein
MHSSIYGYLTVFHTIAAEGSIAGAARKLQMASPSISQSLKLLEQHIGLPL
FNRTTRKMELTEAGHHLLASTQDAIAQLSVAVESVQDLSGVPKGVVRMTV
PHVGYWLIIEPHLAEFCERYPDIQLEISINDGTVDILKEGFDLGIRFGDK
VDEQMVAKKLTAPFRLGLYASSAYQQQFGLPKKIAELKNHRLVGFRFATS
NRIFPLSLNDKGEEVSVEMPTPIVANSLIVAKDVIKSGIALGRFFEPLMS
KQADRAAFIPVLEKHWKTFGALYLYYMQHSQKAGRVRAVIEFFTEKAQVE
KK
>MS2072 malT, MalT protein
MLIPSKLVCSFRLQNSVPRTRLIQELDKSAFYPVVLINAPAGYGKTTLVS
QWIEDKKNVGWYGLDEGDNNSDRFAVYFSAALHSAINEEVDVLLEENRKA
NLLALFNQLLIKASGFPQHFYLVIDDYHLIENDEIHEALKYWIRHQPANM
TLILISRSVPPLSVASLRVQEQLLEIDINQLMFDHQESVAFFQARLGSEL
KQQDIIELCNEVEGWPTALQLISLFAKNKSQTLQVPLQDIAKRLAKSNNF
HINEYLADEVLNKVDKSTRLFILRCSVLHSMNETLVEAVTGEPNSRKKLE
SLEKQGLFLQQMANSKWQTVDDSWWKFHPLFASFLNFCCQHELYDELSQL
HRRAAQAWLKLGYVTEALHHAMQLSDTCLLLEILDEHAWTVFHQGELQLL
EESLNSLDYAHLTEHTNLVLLKAWLVQSQHRHVEVSGILAEFSRALNENK
VELSKTAQAEFNVLRAQVAINSGDENTALQLASDALKDLSENAYYAHIVA
TSIIGEAHHCHGNLAEALSMLQKAERMARQHHTYHNILWSLLQQSEILLA
QGFSQAAYDMLDKASEFVKENHLQKVPMYEFLLRLKGKILWEWYNLDKAE
SMAVAGMNALQKFEDKLQCLALLTKISLVRGNLDNTSRLLNEVEQLERSH
AYHHDWTASADQVRMFYWQMTNDVAAARNWLIQNPAPISDKNHFTQIQWR
NIARARILLGQYDKAQEILDNLIETAEKFSLTSDLNRALIVRNRLYFLQG
AKELAQQDLIAALKLTRQTNFISAFVVEGDVMAQQIRNLLQLNVLDELVL
HKAQFILRNINQFYRHKFAHFDETFVSQLLKNPKVPELLKISPLTQREWQ
VLGLIYSGYSNEQISDELQVAATTIKTHIRNLYQKIGVTNRNEAISYTKE
LLALMGYN
>MS2146 marR, MarR protein
MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ
KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV
IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE
>MS2091 marR, MarR protein
MQNHITSIDLLAETMMQSLQIYMKYARKMGLAENEYVVLYSVYHHQGCSQ
KDIVADWELPKQTVSFVCKQLVERGWLAFAPDPNDKRGKLMNLTADGLAV
IAPIIEAQTAGERQSAVDFGEEKLAALVQDLIRLNKVLSKNLGVE
>MS1966 metJ, MetJ protein
MGFFSLKYRQILRLLIGNFMADWDGKYISPYAEHGKKSEQVKKITVSIPI
KVLEILTNERTRRQLKNLRHATNSELLCEAFLHAFTGQPLPTDEDLLKER
HDEIPEQAKLIMRELGINPDEWEY
>MS0216 mfd, Mfd protein
MTTHYFNLDIPTQAGDHKIVANVLTGSDGLAICEMAEQFQGLTVVVANDT
KSAVRLEKILQESGKLEVRYFPDWETLPYDSFSPHQDIISSRLSALFYLQ
NTRKGILILSVSTLMQRICPPQYLQHNVLLIKKGDRLVIEKLRLQLENAG
YRAVEQVMEHGEFAVRGALLDLFPMGSPLPFRLDFFDDEIDSIRTFDADT
QRTLEEIRQINLLPAHEFPTDDKSIEFFRAQFRETFGEIRRDPEHIYQQV
SKGTLVSGIEYWQPLFFENMATLFDYLPANTLFVDMEQYQIQAERFYQDA
VQRFESRKIDPMRPLLAPERLWLRIDEVNRALRNYPRISLKAEKVRTSVR
QKNLPLKALPELQIQPQQKEPLQNLRHFIEKFKGHIVFSVETEGRRETLL
DLLSPIKLRPKQVNSLFEAQSQTYSLQISSLDNGFIIEQENGEPIAIICE
TELLGERVQQRGRDKRKSVNPDTLIRNLAELKIGQPVVHLDHGVGRYGGL
VTLENAGIKAEYLLLTYANDAKLYVPVANLHLISRYVGGSEETAPLHKLG
SDSWAKARRKAAEKIRDVAAELLDVYAQREAQKGFAFHYNREEFMQFSAT
FPFEETHDQEAAINAVISDMCQPKAMDRLVCGDVGFGKTEVAMRAAFLAV
MNHKQVAVLVPTTLLAQQHYENFRDRFANLPVNVEMVSRFRTAKEQKKIL
EDLSAGKVDILIGTHKLIQSDVKFNDLGLLIIDEEHRFGVRQKEKIKQLR
ANVDILTLTATPIPRTLNMAMNGIRDLSIISTPPARRLTIKTFVRQADDL
LIREAILREILRGGQVYYLHNDVASIENCAEKLTALVPEARIIIGHGQMH
ERELERVMTDFYHQRFNVLVCSTIIETGIDIPTANTIIIERADHFGLAQL
HQLRGRVGRSHHQAYAYLLTPPPKLMTKDAVKRLEALESLDNLGAGFILA
THDLEIRGAGELLGSEQSGQIESIGFSLYMELLEAAVQAMKQGREPSLDE
LTQQQVEIDLRIPALLPEDYLGDVNMRLSFYKRIAGAENKPALDELKVEL
IDRFGLLPEATKNLMQITELRLMAKQLDIIRIDGSQNGGFIEFSPTADID
PMKFINLIKQQPAVFKFDGPTKFRFSCALEQAQKRLDFIFNLLQSLMD
>MS1527 nagC, NagC protein
MKNGITWKNSLFLRMIMLYGFDIGGTKIELAVFNDKLERQYTERVETPKD
SYEQWLDVIVNLVEKADQKFACKGSVGLGLPGFVNHETGIAEITNIRVAD
NKPIIKDLSERLGREVRAENDANCFALSEAWDEENQQYPFVLGLILGTGF
GGGLIFNGKVHSGQIGMAGELGHLQLNYHALKLLGWDKAPIYDCGCGNRA
CLDTYLSGRGFEMLYRDLKGEALSAKEIIERFYAADKTAVDFVGLFIELC
AISLGNIITALDPHVIVLGGGLSNFDYLYEALPKALPKHLMRSAKVPVIK
KAKYGDSGGVRGAAALFLTK
>MS1413 nagC, NagC protein
MTRNEEALDIKHTNYRNIYRLFFQYNGLSKPQIVKLLNLSLPTVSNNIGE
LEAEGKIREGGFFQPQGGRPAIAYQLVENAFISIGVEIQKKNVRCLALNL
QGNILAQKDTALYFENEPQYIESLCNIIHTFIRSLGCLYTQILGIGFSIQ
GIVSKDGQSMLYSRVLPGEHFDVKELQPYFDVPVKLFHDVKCAALTELWF
SEQIDNAVYISISEHLGGAIIINNQIDLGKKGYSGALEHLQIHSEGNLCY
CGQRGCLETYCSLSALLSPNETIEAFFKALRNKDELVLMRWDAFLEHLAK
GLNTVYLLLERDIILGGEIAFYLIPEDLKILQEKILKLSTFPLEGDFIRI
ATQQKYTSAIGAALPFLIEYLP
>MS1445 nusA, NusA protein
MSKEILLAAEAVSNEKLLPREKIFEALESAIALSTKKKYEQEIDVRVAIN
QKTGEFDTFRRWLVVDEVVNPTKEITLEAAQFEDPDIQLGDYVEDQIDSV
AFDRITMQTARQVISTKIREAERNKVVEQFRSEEGKIVTGTVKKVTRDSI
ILDLTGNKEDPAKAEAVITREDMLPRENFRPGDRVRGVLYKVNPESKGAQ
LFVTRAKPVMLEELFRLEVPEIGEELIEIKGASRDAGLRAKIAVKSNDKR
IDPVGACVGMRGSRVQAITNELGGERVDIVLWDDNPAQFVINAMAPADVN
SIVVDEDNHSMDIAVEQENLAQAIGRNGQNVRLATQLTGWTLNVMTTEEL
QQKHQAEDNKVLNLFMTSLELDEDFAQLLIDEGFSSLEELAYVPVSELTA
IDGLEDEDLVEELQNRAKDALTAKAVAEEEALKQAEVEDRLLNLEGMERH
IAFRLAEKNIKTLEELAEQGVDDLADIEELSAEKAADLIMAARNICWFGD
E
>MS0975 nusB, NusB protein
MTEQVKKRPSPRRRARECAVQALYSFQISQNPVETVELSFVTDQDMKGVD
MPYFRKLFRQTVENIPSVDSTMAPYLDRSANELDPIEKAILRLAVYELKY
ELDVPYKVVINEAIEVAKTFGAEDSHKYINGVLDKIAPALARK
>MS0205 nusG, NusG protein
MTETAVKKRWYVLQAFSGFEGRVATTLREYIKLNHMEDQFGEVLVPTEEV
VENVAGKRRKSERKFFPGYVLVEMEMNDDTWHLVRSVPRVMGFIGGTPDR
PLPISKREADLILNRVEENADKPRPKNTFQPGEEVRVTEGPFADFNGTVE
EVDYEKGRLKVSVSIFGRATPVELEFSQVEKANG
>MS1246 ompR, OmpR protein
MSSMMMRILLIEDDALIGNGIKVGLTKSGFSVDWFTDGKTGLQAIKSAPY
DAVVLDLTLPGMDGMDILQQWRNEKIDTPVLILTARDTLNDRVTGLQRGA
DDYLCKPFALAEVIARLQALIRRRYGQANPIVEHSLVKFDPNSRKVSLQG
KDIPLTTREYNLLELFMMNKERVLSRSFIEEKLYNWDDEVSSNALEVHIH
NLRQKLGKQFIRTVHGVGYALGKNEE
>MS1913 ompR, OmpR protein
MAKILLVDDDTELTELLSELLSLEGFEVQIACNGEEALAKIDESYDIVLL
DIMMPVLNGIETLKRLRQNFTTPVLMLTARGDEIDRVLGLELGADDYLPK
PFNDRELVARIKAILRRSVLNKSASSEEETPFEERKAIEFAGLTLYPGRQ
QVMYQGQDLELTGTEFALLCVLIKHPGEVLSRELLSLEALGKNLTSFDRS
IDMHMSNLRKKLPTRPDDFPWFKTLRGRGYILLTD
>MS1504 ompR, OmpR protein
MLSPQILIVEDETVTRNTLKSIFEAEGYEVFEATDGNQMHQIIETQEINL
VVMDINLPGKNGLMLARELREKTNTALMFLTGRDNEVDKILGLEIGADDY
ITKPFNPRELAIRARNLLHRTMAENEKNSNTHVDAYRFNGWTLDINKRAL
IDPESVEYKLPRSEFRAMLHFCENPGKIQTREDLLKKMTGRELKPQDRTV
DVTIRRIRKHFEDHPDTPEIIATIHGEGYRFCGEIE
>MS1063 purR, PurR protein
MATIKDVAKMAGVSTTTVSHVINKTRHVADETKQTVLDAIKALNYSPSAV
ARSLKVNTTKSIGMVVTTSETPYFAEIIHAVEEQCYRQGYSLFLCNTQND
PDKLKNHLEMLAKKRVDGVLVMCSEYKDDSRDLLKSFSYLPIVIMDWGPV
NPDTDLILDNSFEGGYLAGKHLVDNGHKKIGYLSAELTKVTAKQRYQGFI
KALSEANVEMKSEWLFEGSFEPEDGYECMNRLLALEDRPTAVFCCNDIMA
LGAISAITEKGYRVPDDFSVIGYDNVHSSRFFAPPLTTIHQSKARLGERA
LRLLFERIAHKDAKRETIEIHPELVIRKSVKKIA
>MS0644 purR, PurR protein
MITIRDVAKQAGVSVATVSRVLNNASSSEKARKAVQSAVEKLGYSPNANA
QALALPTTDTIGVVVTDVTDAFFAILVKAVDQVASSYNKTILIGIGYHNA
EKERNAIDTLLRKRCSCLVVHSKALSDEELANYLEQVPGMVIINRSIQGY
EHRCVSLDNQRGTFLATETLIRLGHKRIGYIGSNHHINDEEERRQGYIQA
LQHHRLPQIDDAIIQSSPDFEGGEEAMIKLLSYHSDLTAVVAYNDSMAAG
ALSVLNENNINVPRQFSIIGFDDMPISRYLIPKLTTIRYPIDLMANYAAR
LALSLVNEGIETPLHAQFNPTVVRRFSTENCNNP
>MS0284 purR, PurR protein
MATMKDIARLANVSTSTVSHVINNDRFVSEKIREKVMAVVKELNYQPSGL
ARSFKTKETKTIGMLVTASDNPFFAEVVHAVERYCCQQNYNLILSNTEGS
PQHLQHNLQMLINKQVDGLLLMCSETHTQDNMPINLPIPAVIMDWWPSEL
TADKIFENSELGAYLATKHLIHHQHKRIAIVNGDLRKPIAQNRLIGYKKA
LTEANLPIDETLIFEGKFDFQTGFDALERLLKTDCPPSAIFACCDAIALG
IYQAAWRHNLIIPRHLSVIGYDDTILSQYIAPPLSTIHQPKTELGKLAVQ
TLLERIKNPQKTYRTFVLDPVLVERESVATRKES
>MS1238 purR, PurR protein
MKYTINEIAKLCNVGKSTVSRVLNKDPKVRSETREKVQRVIDRLGFQPNR
SARAMRAGQEPVVGVIVSKLDSGSESQTLRAILQALQAEHITPLIVESRF
EAEQVRHHFQLFRERQVNAVILFGFFPLPLEIVREWQGSLVVIARTYPNI
SSVYYDDEQAITRLMTELYRQGHRRIAYLGIQDSDETTGKLRTQSYLQFC
RSHNIRPNSVSVELSAESAYLHCAELFTRPVDALVCATGRLALGAFKFSQ
QSGRVFPIAYVGYNELLQYMMPNALSLDFGYCQAGLKAVELLMRQLRGKS
STEHYLVSTHQP
>MS1317 purR, PurR protein
MKLEELAKLAGVSRTTASYVVNGKAKQYRVSDKTIEKVQALIKEYDFKPN
AMAAGLRAGKSNTIGLIIPDFENLSYAKIANQLEKSCRENGYQLLITCSN
DNVANELECAKHLFQRQVDALFVSTVLPADNHYYQQNNAIPIIGFDRHID
SEGVDNVLTDDKHDAYELAVSLFDKADYQRILFLGALPELPMSKAREEGF
KQALGKKQVQVDYLYASQFRKENAEQLVSEWIEKNGIVPDAIFSTSLTLL
QGLLMSFIKRNEAFPKDLVIATFGWHEMLELLENKIVCSVQDHSKVVQAL
LDLALHKMRIKKLKQPHPVIQRRLAYHNWQ
>MS0808 purR, PurR protein
MLMVSLKDVAKEAGVSLMTVSRALKSPDKLSPKTYKVVKEVIDRLGYVPN
LAAQHIRGVAANTIGVLSLGTATTPFSVEILLGIEQTVRQHGWNSFVINT
FENDSQAMEDAVEQMLSHRPSAIIIARNGLKNVSIPEKLRSFPLVLANCQ
TQDMAVAAYIPDDYQGQRVVVDRIVAKGYQRPLFLHIPKNYIATAKRRQA
FEDAWANHSGQKPVQFFMRRDGEDYFEGAQPLIDYLEKPDPLPFDVIICG
NDRIALVAYQLLLAKGYRIPEDVAVCAYDNMVGIAQLFIPPLTTVELPHY
QMGQEAALHLIEGRKDRDIHQLPCPLIEGESC
>MS2375 purR, PurR protein
MKSGLKHHRIALLFNANKVYDREVIEGVGQYIQASQCLWNIFIEDDFVYR
KESLHNLDIDGIIADFDDPETVAMLEHTEIPVIAVGGSYQNPAFYPHYPY
VATDNYALVETAFLHLKQKGINQFAFYGLPNETPKHWSEERKNAFMQLMA
DYGHQTYIYLGEQAHSDNWLEVQSKLCDWISRLPPHTGIIAVTDARARHL
LQACEYLNIAVPDELCIIGIDNEELIQYLSRVSLSSVVQGTNQIGYQAAK
LLDQLLKGRPVSQTPILVPPLRVEQRRSTDYRSLHDPLVIQAMHYIRHYA
TQGIKTEQVLDHLRISRSNLEQHFKAEMNKTIHQVIHEEKLDRAKNMLKF
TDVPIQEISDICGYPSLQYFYAVFKKEYGQTPKEFRER
>MS0148 purR, PurR protein
MSLANNSNKNRRSTGKVTLADVAKEVGVGTMTVSRALRTPKMVSENLRQK
IHEAVQKLGYVPNSAARELASVSSRNIVIVTSSLVSVENNLILNSLQKEL
QPLDLQIIILVANKKGWLRELINNSPLAVILLNLQCPSTEAQWIRNSGLI
CLEIGSKQANPLGINVCVDSKSAVQKVISFLVAKGYRDIGLLCAQQEQAI
FQQYLACWHSALHANHLNSHQILHCSEPVSFSAGAKLFNEAISTWGCIDA
FVFLSDELACGALFEAQRQHIGIPYDVAIIGLGDLEISQTTYPALTTLNI
PYAKLGETAGKKLAELLQTEKDPQTECIQLISTLRERESG
>MS1531 purR, PurR protein
MSVQKIAKLAGVSVATVSRVLNDSPSVKAVNKEKVLAAIKALNYQPNLLA
RQLRTSRTGMILAMVSNIANPFCAAVVKGIEREAEKNGYRILLCNTESDL
ERSRSCLQLLSGKMVDGVITMDAISELPELQNIIGDAPWVQCAEYDPDSS
VSSVSIDDISATEFVIDQLVKTGKKRIALINHDLSYQYAQHRELGYLDGL
KRHGLAYCEIIYADELDYLSGKEAVLSLLKNAQRPDAILAISDVLAAGVI
NGLNELNVAIPEDIAVVGFDGIDISQITTPSLSTIQQPCKEIGEMAFSLL
LQQIDSTSSVKRVHHLLPWTFIKRQSS
>MS1242 purR, PurR protein
MTKHKRPTLQDIANHLGITKMTISRYLRNPASVAEETGKRIAKAIEEFGY
IPNRAPDILSNAKSRAIGVLVPSLTNQVFADVIKGIEEITDEAGYQTMLA
HYGYSEKKEEQRIESLLSYNVDGIILSENSHSERTKKMLQVANIPVIEIM
DTSEIGIQQVIGFDNIAAAQAMVETMIKRGYKKIVYFSARLDKRTQLKMQ
GYQQAMKKYQLSPRIIATKEHSSFTHGAELLHQALKQYPDIDGIFCTNDD
LAIGALFECQRLGIKVPKQIAIAGFHGHDVGQSITPQLATVITPRLQIGR
IAAQELLARLQNIPAQSSIINLGYQIHLGESI
>MS1735 recG, RecG protein
MTTQLLDAIPLTSLSGVGAAVSAKLSKIGINNLQDLLFHLPIRYEDRTRI
TPISDLRPEQYATIEGIVQTCEIQFGRRPILTVSLSDGTSKIMLRFFNFN
AGMRNGFQPGARVKAFGEVKRGRFMAEIHHPEYQIIRDKQPLQLEENLTP
IYSATEGLKQNSLRKLTDQALELLDKIQIAEILPDQFNPYPFSLKEAIRF
LHRPPPDVSVESLEKGTHPAQVRLIFEELLAHNLAMQKVRLGTQQFQALP
LHFQTDLKQRFLATLPFEPTNAQVRVTQDIERDLAKDYPMMRLVQGDVGS
GKTLVAALAALTAIDNGKQVALMAPTEILAEQHAENFRRWFEPFGIEVGW
LAGKVKGKARQSELERIKNAEVQMVVGTHALFQEEVAFSDLALVIIDEQH
RFGVHQRLLLREKGEKAGNYPHQLIMTATPIPRTLAMTVYADLDTSIIDE
LPPGRTPIKTIVVSEERRAEIVARVHNACTNENRQVYWVCTLIDESEVLE
AQAAEATAEDLHRALPHLRIGLVHGRMKPAEKQAIMASFKAAELDLLVAT
TVIEVGVDVPNASLMIIENAERLGLSQLHQLRGRVGRGSTASFCVLMYKP
PLGKISQKRLQVLRESQDGFVISEKDLEIRGPGEVLGTKQTGIAEFKVAN
LMRDRKMIPTVQHYARRLIVEYPDVADTLIKRWLNNREIYSNA
>MS1837 rho, Rho protein
MVTLAHSKLLPTIKQTSQNFIKSNQKDSQQIIMHLTELKNTPVSELVALG
EGQMGLENLARLRKQDIVFAILKQHAKSGEDIFGGGILEILPDGFGFLRS
ADSSYLAGPDDIYVSPSQIRRFNLQTGDKIEGKIRPPKEGERYFALLKVD
QVNDDKPEVSRSKILFENLTPLHANSRLRMERGNGSTEDLTARILDLASP
IGKGQRGLIVAPPKAGKTMLLQNIAQSITHNYPECELIVLLIDERPEEVT
EMQRSVKGEVIASTFDEPASRHVQVAEMVIEKAKRSVEHKKDVVILLDSI
TRLARAYNTVTPASGKILSGGVDANALHRPKRFFGAARNVEEGGSLTIIA
TALVDTGSKMDEVIFEEFKGTGNMELHLSRKIAEKRVFPAIDFNRSGTRK
EDLLTTPDELQKMWILRKILNPMGEVEAMEFLIDKLMVAKTNEEFFEIMK
RS
>MS0368 rnc, Rnc protein
MNHLDRLQRQISYEFKDITLLKQALTHRSAATKHNERLEFLGDAILNYTI
ADALYHQFPKCNEGELSRMRATLVREPTLAILARQFKLGEYMALGHGELK
SGGFRRESILADCVEAIIGAISLDSSLVSATQITLHWYEKLLREIKPGEN
QKDPKTRLQEYLQGHRLALPTYDVKDIKGEAHCQTFTIECHVPNLDRTFI
GVGSSRRKAEQAAAEQILTALEIK
>MS0198 rpiR, RpiR protein
MAQIDPKSIGAHIRTRKQQLTPLERKVLDCILAKSDFDEKTSLKEIATEN
QVSEAIVVKIAKKLDFSGYREFRSGLAYYKQLEVANLHNDISADDTATQV
IKKVFETSIQALQETMSILDISEFERCVKILVEADHIDLFGIGGSAQIAK
DMAHKFLRIGIKASVYDDSHMMLMAGAVSHPGNVVLAISHSGTTIDVIEP
LQLARQNGAKTIAITNYAISPIAECADVVLTSTSQGSLLLGENAAARIAQ
LNILDALYVAVAKQNLDISEDNLRKTRYAVKHKRTK
>MS2023 rpoA, RpoA protein
MQGSVTEFLKPHLVDIEQVSPTHAKVILEPLERGFGHTLGNALRRILLSS
MPGCAVTEVEIDGVLHEYSSKEGVQEDILEVLLNLKGLAVKVQNKDDVFL
TLNKSGIGPVVAADITHDGDVEIVNPEHVICHLTDENASINMRIRVQRGR
GYVPASARVHAQDEERPIGRLLVDACYSPVDRIAYNVEAARVEQRTDLDK
LVIELETNGAIDPEEAIRRAATILAEQLDAFVDLRDVRQPEVKEEKPEFD
PILLRPVDDLELTVRSANCLKAETIHYIGDLVQRTEVELLKTPNLGKKSL
TEIKDVLASRGLSLGMRLENWPPASIAED
>MS0212 rpoB, RpoB protein
MGYSYTEKKRIRKDFGKRPQVLNVPYLLTIQLDSFEKFIQRDPEGQQGLE
AAFRSVFPIVSNNGSTELQYVSYKLGEPVFDVRECQIRGTTFAAPLRVNL
RLVSYDRDAAPGTIKDIKEQDVYMGEIPLMTDNGTFVINGTERVIVSQLH
RSPGVFFDSDKGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFARIDR
RRKLPATIILRALGYSTEEILDLFFEKIQFEIQDNKLLMALVPERLRGET
ASFDIEANGKVYVERGRRITARHIRTLEKDNVTKIDVPTEYIVGKVSAKD
YIDLESGELVCPANMEISLDILAKLAQAGYKSIETLFTNDLDFGPYISET
LRVDPSSDRLSALVEIYRMMRPGEPPTKEAAEALFDNLFFSAERYDLSAV
GRMKFNRSLGLAEGVGNGVLSKEDIVGVMKKLIDIRNGRGEVDDIDHLGN
RRIRSVGEMAENQFRIGLVRVERAVKERLSLGDLDAVTPQDLINAKPVSA
AVKEFFGSSQLSQFMDQNNPLSEVTHKRRISALGPGGLTRERAGFEVRDV
HPTHYGRVCPIETPEGPNIGLINSLSVYARTNNYGFLETPYRKVVDGQVT
EEIEYLSAIEEGNYVIAQANASLDEDFRFTDAFVTCRGEHGESGLYRPEE
IQYMDVSPQQVVSVAAALIPFLEHDDANRALMGANMQRQAVPTLRADKPL
VGTGMEKPIALDSGVAVVAKRGGIIQYVDASRIVVKVNEDETIPGEAGID
IYNLIKYTRSNQNTCINQIPCVNLGEPIGRGEVLADGPSTDLGELALGQN
IRVAFMPWNGYNFEDSMLVSERVVQQDRFTTIHIQELSCVARDTKLGAEE
ITADIPNVGETALSKLDESGIVYVGAEVKGGDILVGKVTPKGETQLTPEE
KLLRAIFGEKASDVKDSSLRVPNSVSGTVIDVQVFTRDGVEKDKRALEIE
EMQLKEAKKDIAEELEILEAGLFSRVRNLLIDGGVDAKELDRLDRTKWLE
QTLNDEAKQNQLEQLAEQYEELRKDFEHKLEVKRGKIIQGDDLAPGVLKV
VKVYLAVKRRIQPGDKMAGRHGNKGVISKINPVEDMPYDENGQPVEIVLN
PLGVPSRMNIGQILETHLGLAAKGIGEQINRMLKEKQEIEKLRGYIQKAY
DLGGGSQKVDLNTFTDEEVMRLAQNLRKGMPLATPVFDGAEEKEIKDLLE
LGGLPTSGQITLYDGRTGEKFERPVTVGYMYMLKLNHLVDDKMHARSTGS
YSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVNG
RTKMYKNIVSGTHQMDPGTPESFNVIMKEIRSLGINIDLDEE
>MS0213 rpoC, RpoC protein
MKNFHRTLNKFNSDRSKSVKDLVKFLKAQSKTSEDFDVIKIGLASPDMIR
SWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKHR
GVICEKCGVEVTQTKVRRERMGHIELASPVAHIWFLKSLPSRIGLLLDMP
LRDIERVLYFESYIVIEPGMTDLEKGQLLTEEQFMDAEDRWADEFDAKMG
AEAIQALLRDMDLEHECETLREELQETNSETKRKKITKRLKLLEAFMQSG
NKPEWMVMTVLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKR
LLDLVAPDIIVRNEKRMLQESVDALLDNGRRGRAITGSNKRPLKSLADMI
KGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFRPFI
YAKLESRGFASTIKAAKKMVEREDAIVWDILAEVIREHPILLNRAPTLHR
LGIQAFEPILIEGKAMQLHPLVCAAFNADFDGDQMAVHVPLTLEAQLEAR
ALMMSTNNVLSPANGDPIIVPSQDVVLGIYYMTREKVNAKGEGMLLQDPR
EAEKAYRTGRAELHSRVKIRITEYVKNAEGEFEPQTTLTDTTIGRAILWM
IAPKGMPYSLFNQTLGKKAISKLINECYRRLGVKASVMFADQIMYTGFAY
AARSGSSVGIDDMVIPEKKYEIISAAEAEVAEIQEQFQSGLVTAGERYNK
VIDIWATANERVAKAMMENLSTEEVVNREGNLEKQSSFNSIFMMADSGAR
GSAAQIRQLAGMRGLMARPDGSIIETPITANFREGLNVLQYFISTHGARK
GLADTALKTANSGYLTRRLVDVAQDLVIVEDDCGTHEGIVMTPLIEGGDE
KVSLRELVLGRVAAEDILKPGTEEVLFPRNTLLDEKVCDILDENSVDSVK
VRSVVTCDTDFGVCAKCYGRDLARGHLINQGEAVGVIAAQSIGEPGTQLT
MRTFHIGGAASAAAKESSVQVKNSGSIRLTNVKSVTNNEGKLVVTSRNTE
LTIIDAFGRTKEHYKVPYGAVLNKGDGEAVTAGETVANWDPHTMPVVSEV
AGFVKFVDIVDGLTVTRQTDELTGLSSIVVQDVGERATAGKDLRPAIKVV
DAQGNDIFIPGVDVLAQYFLPGKAIVTLDDGAEVQVGEPLARIPQESVGT
KDITGGLPRVADLFEARKPKEPAILAEITGIVSFGKETKGKRRLVITPVE
GEAYEEMIPKWRQLNVFEGEMVERGDVISDGAETPHDILRLRGVHAVTEY
IVNEVQEVYRLQGVKINDKHIEVIVRQMLRKGIITKAYDSEFLEGEQVEV
ARVKIVNRKREAEGKPPVEFERELLGITKASLATESFISAASFQETTRVL
TEAAVAGKRDELRGLKENVIVGRLIPAGTGFAYHQNRIKNRGQANVVEEQ
EVKFSAADEAEIEAEFNMIAEDPAASLAEMLNMADDAE
>MS1760 rpoD, RpoD protein
MNQRRTSYMDHNPQSQLKLLIAQGKEQGYLTYAEVNDHLPEELVDTDQIE
DIIQMINDMGIQVLESAPDADDLMLSETIADEDAVEEATQVLSSVEAELG
RTTDPVRMYMREMGSVELLTREGEIDIAKRIEDGINEVQSAVAEYPEALD
YLLKQYEQVEEGSVRLADLITGFVDLNAEEASEEISDLEEVLDDEDGDIP
ADALNDEEEDEESDEGDTSTDDSDNSIDPEVAREKFSALKDQCVKTLEFI
EKYGRTDNKVKEQIQVLSDIFTQFRLVPRQFDTLVLSMRSMMKQVRAEER
QIQRLAVDYAKVPKDDFQKAFIGNETSEQWLESLLQSKKTYVEKLQQRAP
EISKSIVRLQQVETDTKLTVQQIRDIGERIAQGELKARRAKKEMVEANLR
LVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWI
RQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREASPEELAE
RMGMPEDKIRKVLKIAKEPISMETPIGDDDDSHLGDFIEDSTLELPLDSA
TAQSLKVATHEVLEGLTPREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRE
RIRQIEAKALRKLRHPSRSETLRSFLDE
>MS0025 rpoD, RpoD protein
MTKETQTMMLVPQGSIEAYIRAANEYPMLSAEEEKELAERLYYQEDLEAA
KKLILSHLRFVIHVARGYSGYGLPQADLIQEGNIGLMKAVKRFNPEVGVR
LVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFS
DNELDLVANELGVTKEDVIEMESRMTGADVGFDLPTDDSEEETFAPSMYL
EDKSSNFAAELESENFETQAIDQLSNAMENLDERSKDIIQARWLDDTKAT
LHELAAKYNISAERVRQLETNALKKLKSAVSF
>MS2228 rpoE, RpoE protein
MLLTRGYMAEQLTDQALVERVQQGDKKAFNLLVSRYQNKVAGLLTRYVSR
NDIPDVVQESFIKAYRSIESFRGESAFYTWLYRIAVNTAKNYLTAQGRRP
PNEDILAEEAETYDVGGNLRDVDTPEHEMLSAELKKVIFDTIDGLQEELK
TAITLREMEGLSYEEIADIMDCPVGTVRSRIFRAREIIESKIRPLIQR
>MS1737 rpoZ, RpoZ protein
MARVTVQDAVEKIGNRFDLILTAARRARQLQLHVREPLVPEDNDKPTVIA
LREIEKGLIDNNIMNAQERQEALEQEKVELNAVSLLSE
>MS1964 sPS1, SPS1 protein
MRKDMLQVQHENHFFLFNFDENRPNQEHFFESYFWQKQNRIIGSAKGRGT
TWFIQSQDLFGVNTALRHYYRGGLWGKINKDRYAFSSLEETRSFAEFNLL
NRLYQAGLPVPKPIGAHVEKLAFNHYRADLLSERIENTQDLTALLPNTEL
TAEQWQQIGKLIRRLHDLQICHTDLNAHNILIRQQNNDTKFWLIDFDKCG
EKPGNLWKQENLQRLHRSFLKEVKRMRIQFSEKNWADLLNGYQN
>MS0179 soxR, SoxR protein
MEQTLKQGIFMHIKEFSTKIGLSIDTLRYYEKEGLLNPARNKSGYRNYGK
QDLEWIAFILKLKAMGVPLTQIKEYARLRYLGDTTIPERYAILQAHNQKL
VEQEKEIKKYQQFLAHKLSIYEKVMKKQN
>MS0886 soxR, SoxR protein
MNINEIVKKTNLTAKSIRFYEEKGLITAPQRALNGYRQYNQKHVEELNLL
HQARLVGFSLPECKELLELYKDPHRRSADVKAKTLARIAEIDNQIGKLQQ
MRQQLQTLANQCPGDGSEHCPIIEGLSKPNCCDHHAEKK
>MS1385 soxR, SoxR protein
MNSQKKFYTISQLAEKLAITTHTLRFYEKEGLLPSVQRDQNGNRLFIQAD
VEWLELLICLKNTGMPLKEIKRFVEWLNYGDSTIEQRLQLFQAQVTKVEQ
QIAELQRHLEILKYKRQFYQCAKELGSVQAVLDTQLQQQFAEQNILLPVS
PLSMAENE
>MS1433 soxR, SoxR protein
MMMKINELSKKSGINLETIRYYEKTGLLPEPKRAANGYRVYDQQSLSQLN
FIKSCRWLGFSIDEIKQLNELKNTPKHHCVADEMILSHLKQVEEKIARLL
EIQTFLQNLVNHEEHSVEECRAISGLSQER
>MS0468 soxR, SoxR protein
MRIGQLAKAVGCTIETIRYYENQGLLAKPQRSANNFRYYTNDHLQQLSFI
CYCRSLDMSLHEIKMLLNLDRSSGQRAEEINLLLDKHIRDVAKRLHELAH
LRMELIKLKQKCSEMTGENLMQNIFSGGNIRFRKIK
>MS1736 spoT, SpoT protein
MYLFEPLNKIIQGYLPSEHIDLIKRAFVIARDAHEGQFRSSGEPYITHPV
AVASIIAEMRLDHEAIMAALLHDVIEDTPYTEEQLTTEFGKSVAEIVEGV
SKLDKLKFRTRQEAQAESFRKMILAMTKDIRVVLIKLADRTHNMRTLGSL
RSDKRRRIAKETLEIYSPLAHRLGIEKVKNELEDLCFQAMHPQRYAVLNK
VIQVARNTRQELVHPILVTIQQRLEEVGINAQVFSEEKPLFYIYQNMRLR
NQQFRSIMDISNFRIIVDSIDNCYRVLGQMHQLFKPRPGQIKDYIAVPKA
NGYQALHTSTIGPHGVAVEIQIRTEEMNLIAELGVTAHWVYKPGGKNDTT
TAQIKAQHWLQSIIELQQSAGNSFEFIESVKSDLFSDEIYVFTPKGRIIE
LPAGATPIDFAYAVHTSIGSTCVGAKVDRETYPLSQALRSGQTVEVITSP
NATPNANWLNFVVTGRARAKIRQTLKTLRLEEAINLGRYQLLHALAGKHL
EDLDPAIVHHVLTELNLDTMDDLLAEVGLGNQLSTVIARRLQGESLAIYT
DIEEVNNQERLPIKGMDGLLVNFAKCCHPIPGDSIVAYANPGKGLVVHHE
NCRNLKKRTTQSVPFIKVEWEQCDHSAEFEAELHINMVAQQGALANLTAA
ISAAQSNIHSIWTEESEGRICHVTLTLSAKDTKHLANIMRKIKSLSGVQS
VERNINE
>MS0241 spoT, SpoT protein
MVAVRVSHLLNPKDFIIEDWCAGLGLTPDVEKNIVRAWYYAQEKAQQLFQ
NSHWYLRDGVEMVEILHGLNMDADSLLTAMLFPIVNAKIVNQEQIKEDFG
PHIWKLLKGVIEMNNIRQLNTTDSNAQVDNIRRMLLAMVDDFRCVIIKLA
ERITYLRDAEKRYSKQDKVAAAKECSNIYAPLANRLGIGQLKWELEDYCF
RNLQPEQYRIIAIKLNERRLDREQYIADFVQRVSQYLDESVTGAEIYGRP
KHIYSIWRKMQKKHLDFSQLYDIRAVRIIVPALQDCYTALGIVHTHFKHL
PDQFDDYIANPKPNGYQSIHTVVLGEGDKPIEVQIRTKKMHDDAELGVAA
HWKYKEGNTGSLSAYEEKIIWLRKLLAWQHDISNSGEVVPELRTQVFDDR
VYVFTPKGEVVDLPAGSTPLDFAYAIHSDVGHRCIGAKVGGRIVPFTYQL
QMGDQIDIITQKNPNPSRDWLNPSLGFTHTAKARSKIQAWFKKLDREKNI
PIGKEQLENELNRLAITLKQVEPIALPRYNLKSIDDLYSGIGSGDIRLNH
LINFLQAKLIKPTAQEADEEVLRQVTKTANSAANQQKNEKNKGYVIVEGV
GNLMHHIARCCQPIPGDDIEGYITLGRGISIHRTDCEQLAELKAAHPERV
VESIWGENYNSASGFNLSIRVIANDRNGLLRDITTVLANDKISVANVTTR
LDSKRQLATMDLEIQLKNVQILGKVITRLTKLDDVIEVKRL
>MS1836 srmB, SrmB protein
MSLDHLSQQRFADLPLNAKVLEALESNGFEYCTPIQALSLPISLAGKDVA
GQAQTGTGKTMAFLTATFHHLLEHPVKTNHPRALIMAPTRELAVQIAHDA
ERMVKTTGLKTALAYGGDGYDKQLKAIEAGADIIIGTTGRIIDYVKQNII
ALSHIQVVVLDEADRMFDLGFIKDIRYLMRKCPSPKQRLTLLFSATLSYK
VRELAFEDMNDPEYVEVEPLQKTGHRIKEELFYPSNEDKMPLLITLLEEE
WPERCIIFANTKHQCEKIWGYLAADGHRVGLLTGDVAQKKRLSLLKQFTD
GALDILVATDVAARGLHIPDVTHVFNYDLPDDREDYVHRIGRTGRAGESG
VSISFACEEYAMNLPAIEEYIGHHIAVSQYDSDSLIRDLAKPYRLKPSLP
ASNRHNRNGAKPFKKRF
>MS0495 srmB, SrmB protein
MTETKITFGDLGLPEFILSAVSDMGFETPSPIQQACIPHLLNGRDVLGMA
QTGSGKTAAFSLPLLAQIDIEEKHPQMLVMAPTRELAIQVAEACELFTKN
AKGVHIATLYGGQRYDIQLRALRQGAQVVVGTPGRILDHIRRGTLNLSEL
KFIVLDEADEMLRMGFIDDVETVMAELPAQHQTALFSATMPEPIRRITKR
FMTDPQEVKIQSTQRTNPDIAQSCWYVRGYRKNEALLRFLEVEDFDGAII
FTRTKTGTLDVTELLEKHGFRAAALNGDMTQQLREQTLDRLRNGSLDILV
ATDVAARGLDVERISLVVNYDIPLDAESYVHRIGRTGRAGRSGSAILFVE
PRERRLLSNIERLMKKPIEEVDVPNHEALQARRREKFKAKITKQLEHHDL
EQYRLLLEGLFTPDQDQEDIAAAMLMLLQGKQKLILPPEPPMEKRGRRER
DDRRGERGDRRERRPEERRGYGNPQPMDLYRIEVGRADGVDVRHIVGAIA
NEGDINSRNIGHIKLYDEYSTVELPQGMPKELLQVFGKARVLNKQMRMTF
VSEAGETVGRERHEGRRNDRRDNGFRREERRFNDRGNRSFNERAPRREFR
ERNDRRDRRDRRS
>MS1950 srmB, SrmB protein
MRYNFPQFYNLSHLRIFMPQPQFEDFDLSPELLKALAQKGYARPTAIQSE
AIPAAMDERDVLGSAPTGTGKTAAFLLPAIQHLLDYPRRKPGAPRVLVLT
PTRELAMQVAQQAEELAQFTKLSIATITGGVAYQNHGEIFNKNQDIVVAT
PGRLLQYIKEENFDCRAVEILIFDEADRMLQMGFGQDAEKISAETRWRKQ
TFLFSATLEGELLVDFAERILTDPVKIDAEPSRRERKKINQWYYHADSYE
HKVKLLARFIADEQVSKGIVFVRRREDVRELSEILRKRGIRSTYLEGEMA
QTQRNNAIDKLKNGIVTLLVATDVAARGIDIEDISHVMNFDLPYNADTYL
HRIGRTARAGKKGTAVSFVEGHDYKYLGKIKRYTEELLKPRIIEGLEPRT
KAPKDGEIKTVSKKQKAYIRQKREEKRKTTQKKAKLRRQDTKNIGKRRTP
KAVSEAQAKEIR
>MS0694 srmR, SrmR protein
MSTYLLDAKLAQKIVQRTMDIIDCNINIMDAKGKIIASGDVNRIGEIHDG
ALLVLSQGRVVDINEAVIHSLHGVRPGINLPLRVDGEIVGVIGLTGEPTT
LKEFGKLVCMTAEMMLEQARLFNILAQDTRLKEELVLNLINTDKITPSIV
EWANRLGVDLSIPRVACIIEVDSGQLGIENARSELQNLQTLLKIPERDNL
VAVLSLTELVVLKPALNSFGRWEVDDHLERINQLLSRMNEKAKLNVRISL
GNYFTTEDSISLSYHTAKTTLTIGKARYPKQRIYNYQDLILPVLLDQLRD
GWQKEELERPIKKLKLMDNNGVLLKTLLAWFENNMQTIATAKALYVHRNT
LEYRLNKIADLTGLDLNSTDNRFLLYMALHVAV
>MS2301 tfoX, TfoX protein
MNRTNKDTQWIRTILNSFLENEVTAKHLFVGYGLFYRKVMFGIVIDDNFF
LKAENQLVEYVEKLGAVSWDIFNKNTNLAISSYYRLPRALVDNEEEFKTL
VILSIKQQQRKILDLNIAKKERIKELPNLSIKHERLLAKIGINNVKEFKS
AGISNCFVKLKVHGFSVNVELFWLFQAALKNKHVSLLTKSEKKSALLVLN
RKLVEAGFREIKHECLI
>MS1566 trpR, TrpR protein
MYISRNMEQWTKFIETLRIAFNDGKEQDLLTLLLTPDERDAIGLRLQIVA
QLLDKKIPQREIQQNLNTSAATITRGSNMLKLMSPDFMEWVKKHTNETEN
T
>MS0762 tyrR, TyrR protein
MFTVKGYDEGNYFIRSIVGKTMSKNTAKRSAHFTVNQYENFTDVVALSPK
MAALVEKAKKFALLDAPLLIQGETGTGKDLIAKACHNLSARKDQKFIAVN
CAGLPDTDAESEMFGRADGDKTSTGFFEYANGGTVLLDGVAELSLNLQAK
LLRFLNDGTFRRVGEEQEHYANVRVICTSQISLQHYVDEGKVRSDLFHRL
NVLSLQIPPLRERKEDLAVLTENFVRQISRRLGVRTPEFDGQFLQYLKDY
QWPGNVRELYNALYRACSLAEHNKLTIDGLNLSENETVPLTLEQFGNESL
EEIMNNFEASVLRKFYEQYPSTRKLASRLGVSHTAIANKLKQYGIGK
>MS1468 vacB, VacB protein
MFQNNPLLSQLKQQLHDSKPHVEGVVKGTDKAYGFLETEKETFFIAPPAM
KKVMHGDKIKAAIETIGDKKQAEPEELIEPMLTRFIAKVRFNKDKKLQVL
VDHPNINQPIGAAQAKTVKQELKEGDWVVATLKTHPLRDDRFFYAQIAEF
ICSAEDEFAPWWVTLARHEQSRYPVQGQEVYSMLDTETRRDLTALHFVTI
DSENTQDMDDALYIEPVTAPNDEQTGWKLAVAIADPTAYIALDSQIEKDA
RKRCFTNYLPGFNIPMLPRELSDELCSLMENETRAALVCRLETDMQGEIV
GEPEFILAQVQSKAKLAYNNVSDYLEQVENAWQPENESTQQQINWLHQFA
LVRINWRKKHGLLFKEKPDYSFVLADNGHVREIKAEYRRIANQIVEESMI
IANICCAHYLAKNAQTGIFNTHVGFDKKFLPNAHNFLMANLSNEENQQEL
AERYSVENLATLAGYCRMRHDIEPIEGDYLEFRLRRFLTFAEFKSELAPH
FGLGLTGYATWTSPIRKYSDMVNHRLIKACLANRECVKPSDETLARLQEA
RKQNRMVERDIADWLYCRYLADKVESNPEFRAEVQDCMRGGLRVQLLENG
ASVFVPASSIHPNKDEIQVNTDELALYINGERRYKIGDIVNIRLTEVKEE
TRSLIGNLV
>MS0473 vacB, VacB protein
MARKTTKKTTALLDPNYQQELEKYGNPVPSRDFILQVIREHNTPMSREEI
LKVFAIQDDERVEGVRRRLRAMENDGQLVFTKRNCYVLPEKLDLLRGTVI
GHRDGYGFLQVEGVKEDLFIPNTQMKRVMHGDYVLAQREGLDRKGRREVR
IVRVLEGRKKQIVGRFFLEEGIGYVVPDDSRINRDILIPNENRLGARMGQ
VVVVELKPRTASFSQPVGIITEILGDNMAKGMEVEIALRNHDIPHTFPPE
VEKQIKKFTEEVPEEAKSGRVDLRSLPLVTIDGEDARDFDDAVHCRREQD
GWHLWVAIADVSYYVRLRSALDTEARNRGNSVYFPNRVVPMLPEILSNGL
CSLNPQVDRLCMVCEIKLSDKGVMKDYQFYEAVMNSHARLTYTKVARILE
GDEELIERYQELVPHLQELHNMYNKLLEARHQRGAIDFETIESKFIFNEM
GRIESIEQVVRNDAHKIIEECMIMANIAAANFMERHQEPALYRIHAGPSE
EKLISFRSFLAECGLSLEGGMKPSTKDYAKLLEQVKERPDAELIQTMLLR
SLSQAVYNADNIGHFGLALEEYAHFTSPIRRYPDLTLHRGIKYLLAKAQG
VKRKTTDTGGYHYSLDEMDVLGDHCSMTERRADDATRDVADWLKCEYMQD
HVGDEFEGIISSVTGFGFFVRLKDLFIDGLVHISTLDNDYYRFDAAGQRL
IGENSGAVYRIGDIVKVRVEAVSLEQRQIDFALVSSERKPRREGKTAKDN
AKKTMRYAESFAKQRKKAAATSKGKKKSAVKKSKNSVNKKANKKRTY
>MS0323 wecD, WecD protein
MKIFKAEQWNLEVLLPLFEEYRLSHGMVENPERTFTFLNNRIRFSESIIF
IATNERQQAIGFIQLYPRLSSLQLQRYWQLTDIFVQDVANQNEIYAGLIE
KAKEFVCFTHSTRLVVEQDQQHQGIWEKEGFKLNTKKALFELKL
>MS2102 wecD, WecD protein
MMQTIDQFIAQYIPAAYALNLRVVESSPQRVVIKAPFECNSNHHHTIFGG
SQALLATLSAWSLVYLNFPEANGNIVIRSSQIRYLKPAPSDIIAVSICPD
SLAMNLAKQMLTQKGKAKITIQCQLYCDDIIVSEWTGEFVLSHTPF