
Gene list

Applied filters:

COG category: Cell motility
Gene type: CDS
Genomic element: chromosome

Number of genes found: 137

# Geobacter sulfurreducens PCA, PCA

>GSU1618 hypothetical protein
MAVKFFGQFLVEKEVVTREVLLQAIELQESVNLSFGATAMAMGLLTEADI
EKVHNAQRCEDLRFGDMAVKLELLTADQMQQVLTRQKNGHLYIGEALVKV
GGLSADDLPRYLDEFKADQAQYATDTVSIPAGLANPNIWEMMVDLSHKML
TRVALLTFRPEPCFMANRLPRKDVYAAMDFSGDVSGCYLMGVSTDAQARI
ARAILKEANVDEEPKEVLDDTVMEFINVVCGNIAAKAAQLGKSIEIAPPR
IVEASAGIVPPPDHLCLCFPACLAEGDHVELAVFIKE
>GSU0412 flagellar assembly protein fliH, putative
MSSSKASRIIKVDQSPNQAIRSYSFGFIAADAPQELPPEADGFVPFALGT
PVPLPGLQSAEEPDPDPVVPFNLEGKVVLAEDELQARVDEVFRNGMDEGR
RQAERGLANVFKSLRDGVAALTGLRSRVMKESEEDLLRLAVMIARKIVQR
EVAQDPQVLAAIVAAAVGGCTERDRVVVRLNPDDYTQVSANRQAFLAGLG
EESAITLAPDESIGPGGCLVETATGTVDARIEAQLDEIYRSLLEERSAPV
EPSASPDTDSRADLAFGGEETIAPFKGQGAWVKGSEEKPRDDV
>GSU0935 methyl-accepting chemotaxis protein, putative
MLGKKLAFKVLAILGICLSLGLFTLGSVAGWLQFRSSLDLQLKNARNLAG
LIIHDIDGYMMKGDSSEVDRFISAVKSKNFIMDLRVFDEQAKEVSPTPSQ
TPNAKIQQAIAAGRTLEFKETLDGKRTLSLVLPFPNEQRCQSCHDAGAAY
LGGLLVTTSIEEGYEGARHLLIMLAVVGTVFFLVLLACMYFFFSKTIIAP
IADVSRQVDELASGGGDLTRVLPVRTQDEIGNLAGGINRLTSTIQGIITR
IAQNAAQLASAASQLNVTSADMARSMEAVAGQATTVATASEQMASTSQEI
AGSCSIAADGAMQATETARDGAEVVERTIAVMASIADRVKDTARTVESLG
SRSDQIGEIIGTIEDIADQTNLLALNAAIEAARAGEQGRGFAVVADEVRA
LAERTTRATKEIGSMIKSIQQETRGAVTSMEEGVHEVTRGTDEASRSGES
LQAILQRVSDVTGQVNQIATAAEEQNATTGEITRNIQDITDTVQSTARGA
QDSAQAAGQLASLARELQELVGKFKIGA
>GSU2674 hypothetical protein
MRTLIRCVLFCLLLIPAAARADDQVAMVQGQIQMAQERLTAYQGELSSTL
SLLQKTPSEATFQKIRDLQDRIRRQGNVVDTLRMALQDARAAQSWSALVS
DMAKNSTILGLGADIGVSVYQSGKWIFGATDDEIYGENWQQVESYNAKLV
AEARAIRDESKAILGALDQVKAQLNAGGSEEQLRSLAETLSARLRELQTR
GEQVKKNIRLLIRIYRAEKENLPASTLFHGVFMGRYTEKMVEEFDASKLV
DLVIDVIKGDTNDALIKLIKPMVKTAVGDYLAKNAGGGHLTPEITDDLVF
KILFGGESGFDALIQDRLGDEAGGAAIKGLVKEVVAAEANHVILKSNQQF
YGAMKRTLSTMPPTADIVEHTLVVDTTDQLVKEHKSLVESGARAKAQRLA
QTGEAIKKVWDLVLKDAVQAYLQKDSFNNAVKAADEACETWRGEWRKVRD
DMILTEDEYVNHRWFKGGQNAAAPPKQVVGGTEDTGKTYAAFNPEEAGKL
LEARKRLVESGVGEGGSAGLGDFDANEDGLVVLACNGGIPLGRVGFVPLR
DVKKLTSAALFAPCDELSIRRSAEIEAGYKACEHLWNPQSVNEYFACKDA
MRVKNNAYAEEINRCIDTNVTQPHQAFAQRRLERHRRLNDCETKRYQETV
RRIQAEIAPAVSWAESVRGDVERLQEAINKLRRSGLMYPEAPPRLPADIG
AAVEEEEYDTASADDKIAMIRANLGITRGIDVAGLESALARADFVVDWAS
VDSGGTPYLCKEFVLNNQPGIEEVECTGGYYAQDEQLLQDMALIATINPS
TVQELVSHGRQLPEVANRPLLEAMTGFSPQESVNLYRQGMSQRNRARAAI
DGVNSRLLDVENVLSGSSVAQISLYYDYQFTPWKVRRIPTLLGQDLDALE
GHIREMRTRFNADRQVVIPDYDALRREAEAASGAVAQYREIYGRYAALLT
KHFPTAFVDEFSPPDLSGLVSAIDAQAFLAERYAEEFKRAATALDEMERE
VQALRRQEEQSLKGLEIYHQARAGAVKKAAGSCAALPADPEGCLKAVEES
KAQLLAITLQGDILTHFPSQKYRDRRDEILAGSRIDYDIYALEAKAKAAI
EQREQDETRKREDRETNAPLRIAEFYQQFRGAYEGRNDSQLISFLGDDWE
SGDGTTLADLQGYLRNSFSVFDEIRYTQSNLQIHPAAQGGYRVTYDLAII
GRIYADNLTHEEKSTVSEELAFDNGGKLRITRTTGGRFWSVQ
>GSU3195 chemotaxis protein methyltransferase CheR, putative
MLSVERRINHHMINRHMNEHRRVSIPPEEPAGGFSDTAFSRIRDILCTRR
NFDIASYKDKYIKRRISIRIRATRAATVDDYCDLLQRSEEELDLLVKGLT
IHVSQFFRNRSTFEKLGTEVFPALFARLRAEGREHLTLWSAGCAGGEEPF
TLALILAERFAKELEEFKVSIVATDIDDGILEAAQQGMYGHERLQEAPAH
VLDRHFCRNGDKYCLSPAIRGMVEFRRGDLFDTGNHVESDLILCRNVLIY
FERREQERILLGLSNALTQGGILILGKAETLVGELRRRFGTICPVERIYT
KNRFSVY
>GSU0750 methyl-accepting chemotaxis protein, putative
MSVSIGRRLTLNMVWGVFVVLVLVIGNWIGMGHLEQLQATSHEAMARSRS
AQETKVIGEKLYRFVLESVANPDMAGSSSKGWLNRKAEGMAKLKQLAEQT
GDDAALGALVASADKAFRGTVTLYETKLIPALERGANHEDIMDIDDEISM
ESDNLSISLLKVAETLEKRAIAASAQYDSFSAKLKTFSLALGGIGIVLLV
VFSSWLSRSIMRPLRQVIAMMEDVAEGEGDLTKRLEHRSNDELGKLCTEF
NSFVGKVHDTISRTSSVARDVTGSVAEISRTAERLAEGAEEVASQAVMAA
TASEEMAATSCEIAGNCQTAAQSSSRARETAARGFAMVENTIAVMNQIAR
RVRVSAESVQGLGARSDQIGEIVMTIQDIADQTNLLALNAAIEAARAGEQ
GRGFAVVADEVRALAERTSRATREIGEMIKGIQGETRTAVLTMEEGVKEV
EAGTREAAKSGEALNEIMQGIEQLNQQMGQIACAAEQQTSTTMEISGSIQ
RIKDVAQETAGGAHDSARTSTRLTDLSHDLDRLVSQFRV
>GSU2372 methyl-accepting chemotaxis protein, putative
MFKDMKLGYRLIGSFAIMAAIVAVTGFIGIRSIGMVGNRVSDLMQTRADQ
QKLALQLQAAERTSRVALLEAMMGHVDTRILAANVESYRKNRDIFRRYSN
ALLKGDPALGIRPDSVDTVMEEHAKALLDTWAEYEKVADRIIAYKSGVLS
GSVSPSVLVETRLISELSGASEFVARDIDDLIETVKGLMQVVGQETRQIR
ASVSITFVIVIIGAAVLAFVFGVVATRNIIRRVNMMVTALNKGAEGDLTV
RVTTDATDELSLLGRDFNIMLEMLGELVRKVNRSLVEVGQVSANIFEASR
RVMAAAEVQAEGVSLTSSAVAEINTSIKEVSRGVDGLSLSASETSSSILE
MAASIEEVAVNVDSLAQAVDEVSSSVMEMAASIKQIANSVVSLQDVTTTT
ASSVAEMDSSIRQVEKNAMETASISEGVRRDAEMGKVSVEATIAGINEIK
RSSRITSEVIETLSVRATDIGAILSVIDEVAEQTNLLALNAAIIAAQAGE
HGKGFAVVADEIKELAERTTSSTREIAQLIKGVQDETARAVEAIELAEKS
IADGEALSQKSGEALAKIVTGVQGATAQVESIARATMEQAKGSQMIRSAM
ERVSDMIAQVAGATREQGKGSDMIMAAAERMKGLTSQVRTSTREQSKVGA
FIARSTENITDMIQQIKRACDEQSRGSDQIIRAVEDIQESTSTNLGSARM
MDDAVSRLSRQLEALERGMSSFKVENR
>GSU0582 methyl-accepting chemotaxis protein
MFRTRLAFKVLAIIGITLFLGFAALGITSIWLEYNAIMDLQTRNTRGLST
LVVRDIGELMMAGDMAVIERYVADVRGKGAVLDLRIYDAAGRPAGKKQDA
PDGEVQAALTSGATAEKRHKVDGRHVLSFIVPLANEVRCQSCHEQGARFN
GAMLLTTSLEEGYAGARNLTLALALVGVCSFFLLLGVMYLFFNRVIIRNI
GEISRRVQEIAQGEGDLTASVPVRSSDELGVLAEGINLLVTKLREIISGL
YHQAGHIAISACRTIKETERLVASTHEQKDLSTSVAVASEEMAATLNDVA
VNTQRAAQLSLSVDRAAHEGMATVTETAESIDRIKDSVMATLDTMDKLQQ
SSGQIGEIVGIIGDIADQTNLLALNAAIEAARAGDSGKGFAVVANEVKVL
SDRTASSTREIGTIIRSIQAEIRAVVASIAEGKDKVEVGVERSTTARRQL
EDILRLAAESTDMINQIATATEEQSATTGEISEKISQVSGTAERVNGQME
QTAGIFRELSETAEQIYGTVGRFKVGTYHDTVKGLASEMRDRVVATLERA
ASDRRVTLDALFSSEYTPIPDTFPQKYRTPSDRLFDEIISPIQEEILGRD
SGMYYAICVDRRGYCPSHNLRYSRPLTGNREADKEHNRTKRIFEDRTGLR
CAGNTGSFLLQTYLRDTGEVMNDLSVPIVIGGRHWGAVRIGYRADD
>GSU1782 conserved domain protein
MLFAQNAIGMEVSQHGVRFVVLGGGKGAPRLVTHGGASFAPGIVRILHRE
PNVVDPKAFVGTVREEYCKLLVRTDLVSVTLPDAVGRVMIMDFDTRFKNR
EEGRDMIRWKLKKSLPFDAGDMHLDYQTLRERDNGSLSVLIAVVARQVVT
QYEDLLLEAGIQPNRIDFNTFNLYRVAAKRIPSEDTSLFVAFHGGVLSML
ALTDGLIDFYRVKEMGREVVDPNRIFMEINSSLLVYRDKNPGREVKSVFC
LAPPDGGESFAGIVAEASGIDPVMFNPQAVMGNNGTATDPALLHDLAAAI
GAATRNL
>GSU3015 flagellin FlaG, putative
MNIEATSGTGPSAVAVVPTIVAKSRQDEEEKRAARPVEQAREPEKESDKG
KSEEERVKEATERINEFIESVSRDLEFSVDKDTNRTVVKVLARESGEVIR
QIPAEEVLKIARMLDELKGLIIREKA
>GSU1041 methyl-accepting chemotaxis protein
MFMRNWKIGTKLATGFGGLLLLLVIFTTVTIISIRFVKTSTSQIRTESLP
YALLAEEMAFEVVQVQQFLTDVGATREPDAYAEADAAAANFRKSLKQFED
MYRRENDTAALQSVEKMEKDFESFYQLGRRMAEAYMAEGTEAGNRLMGDF
DKVSTVLAEDMRTFKEGQVREANHMTASVDETLGGLEKVIIALAAAGIIV
GLFASWFIGKAISAPLGKAVSAIDRIASGDLTIRIPVTGSDETGALAVSV
NRMADDMGTAMAALANASSHLASASVELAVQADQMAKGAEEVAAQTGTVA
AASEEMAATSHEIAMNCSHAAESSRRANDRASAGSDVIRRTVEGMHRIAE
KVQRSSESVAGLGARSDQIGQIVSVIEDIADQTNLLALNAAIEAARAGEQ
GRGFAVVADEVRALAERTGKATREIAQMIRSIQQETEGAVKAMEEGVAEV
SAGKEDAQQSAGALREIVEQIEAMTTQINQIAVASEQQNATTDQITMNLQ
QVSSVIEASSRGSEETANAAHTLSALSEELQSIVGRFRTAA
>GSU2579 methyl-accepting chemotaxis protein
MQWLRNLRVGIKLVAAFSVVAAIAAVVGFIGSVEIRRIQSDDRRLYEKIT
VPMHDLAEMSVAFQRVRINLRDAVEATDPAEQALYLDTIRKLREVITEHQ
DNFEKTILTDEGRTLFNEYKEARKVYGGYIDNIMQLNSAKKVTEAKALLH
GDAKKAALHYQELLSKLVDAKQAQAKLTAERNEHVASTSFTVMAVLSAVG
VILAIGLGLLISRMITTPLSRAVDVANRLADGDLTVVVAATSTDETGRLL
TAMQNMVRSLREMVTQTATISAGIASASSQLHATSEQIATGTEEVASQAG
TVATASEEMSATSQDIATNCHAAAGSAEQVAATTRQGFDVVRHTVDGIRD
RGEKTRQNAQIVASLGDRSEQIGDIVGTIEDIADQTNLLALNAAIEAARA
GEQGRGFAVVADEVRALAERTTRATKEIGEMIRAIQQETKTAIVSMEEGV
RGTERGAIEAAQLETALQQILNQVNEVSMQVGQIATAAEEQTATTGEVTS
NIQQITEVVHQTAQGAEETADAAAQLARQAQDLQALIGRFRLA
>GSU3055 flagellar biosynthetic protein FlhF, putative
MLVKTFEAVDMSEALRMVKAELGPDAMIISSKKERKKGFLGFFSKPVIKV
TAALEVKPRQPAPNPYREAQEQHLSAKEMLENSMLAPLARELKDLRLKVE
QMTRQEAEAKAKAAEAQSREESAAAAEAPAAREFNPRSIPKQDLEEMKKI
LFKTLAAKEAGAEQSAEKADADPAPAKAPQTETEKASGGIKAALRVVVTE
LHRKGLERGAIRAVMDHLKPEARKGGTVEAIRSFLPQAFKSAIKCSGPLT
VKKNGPRIIALVGPTGVGKTTTIAKLAAHYALREGHRAALITIDNFRVGA
VEQLKTYSRIMGVPVEVAATAAELEAAIELHSDKELILIDTAGRSHKDSE
KIEELKGFLESRFAIEIHLCLAATTRDREVLEIVERFGVLSVSRVIFTKL
DESESYGSIVNAHLRTKFPLSYFTTGQRVPEDLEIATPGRLAGLVLGESK
Q
>GSU1035 methyl-accepting chemotaxis protein
MRMPARLAALKLSHKLILALAVLNLFVIAAVAASSYQGQKTAVQHAVDEK
LLACAQGVRLLGDAFHDRLGQSADINQEEYVAMLDNLSAFAEGAGVKYVY
TVVVKDGKVVFTTSSHTREEKEKGDIAALYDPYDDASSALKDAIADGKPR
YDTYSDQWGTFRSLFLPVRSSGGATYVIGVDVSTADVNAVLRSSLITTVV
MGAVLFVAGTLLMLLVIRPVSAAVRMLAEKVNHVADGDLNVTVDYASGDE
LGMLAGDMNRMVEKLRDMVAGVAGAAAEVTTAARQLSSTSEEMAAGVQSA
AAEVVGVSTAGEEMAATSFEISFNCSTVAADARQATESATAGEEVVSATV
CIMANIAALVRDSARTVESLGARSDQIGELAGSIEDIADQTNLLALNAAI
EAARAGEQGRGFAVVADEVRALAERTARATREITAVIRSIQQETQGAVTA
MTAGVVEVERGTAEASRSGEALRGILERIHAVEEQVVQIAAAADQQTATT
TEISGNILRISDVVQSTTRGAQDSADAAAHLQGLAEELHAAVGRFRVAG
>GSU3050 flagella basal body P-ring formation protein flgA, putative
MNGKQLAKGWSVMRAAAMALIACMMCIGVAFATTGTQVVKEAGIRAVVAE
YVRERTAGLGVETEIRKMAPIGDLKLPAGTVSYEVQAPRSWEGWGRANLA
LIVRVDDRVQRNIPVTVDVEALADAVVAAHPLARGDIITSDDVTVQKRDI
SSVNGRVYRSIDEVVGKRVKNAVRANTPLSGASIEKVPLVKSGQLVTIIA
ESPALRLTATGKARGNGAEGDIIKVQNMGSLKEIPARVIDVGTVQVDF
>GSU3026 flagellar protein FlbD, putative
MIRLTRLDGDVFFVNPDLIEAIEETPDTHIVLSNGRRYLVIEKTDAIVAR
IISFKSSVMKRALGGHGRKYLRRKMEGSYLPRCPLKPGHNTD
>GSU3311 hypothetical protein
MLTMQEIKAHYRFTDEDAELLGSLFPLAETNKERLADQFYDYLLGIPETA
EFLKEDLVLQKLKQTHQDWFVSLFAGSYDNRYIHNLQKIGHAHVRVGLNA
HYVNVAMNVVRQFTLSIIQDNFPDPEERRQRREAVEKILDINLDIMSASY
REEEMRKFFVSHRLESKLITATERFTYGLNLILVLALAGVSVSVVGLFIW
DIGHIFQGSVEKGILSALGSLLILWMMIELIENEIKILKGGRFNILFFIG
VIIVALIREILISTLRHDPLTTQVFLAGTLLVLGVVYFLVSKSQQPYTPH
>GSU1781 hypothetical protein
MDIRINLATRYFYNTRKVNTAIAAVILGLLLLLAYNIASLVANVSTERAL
KKDMGILQARFDESAKGITEKQYRDLLKKIAEVNAVIGKKAFDWLLLLNR
LEEVVPEGVALGAIDPSLKDGTLKLSGAARSFGALRSLMENLESSTHFTD
VLLLNQGQLSVGEKQKGISFTVTCKVDFT
>GSU1300 methyl-accepting chemotaxis protein
MRIGDLKIGVRLGIGFGVLLVLLGVVTVVGINGMKEINSKLDRILKVNYG
KIKNANEVSMVVGDLMGKINEIMLKDVNERPAIKQEIEKLRSDYRSALER
LEQLEQTDQGKQLIAGAKSAIDNAKKANNDIIELSLAGKTDQALIVYNKE
GAPLSQKIVTAFKNIVKYQEERMELRYAEALKAYGTSRAVALGVAFCAAI
IGILISLFATRSITRPLDKAVSVSNELAEGNLTVTIEATSKDETGQLLAA
MHNMVEKLKGVVADVKSAADNVAAGSQELSSSSEEMSQGATEQAAAAEEA
SSSMEQMSSNIRQNADNATQTEKIALKSATDAREGGKAVAGTVSAMKEIA
SKISIIEEIARQTNLLALNAAIEAARAGEHGKGFAVVAAEVRKLAERSQK
AAGEISELSASSVQVAEEAGEMLTRIVPDIQRTAELVQEISAACKEQDTG
AEQINKAIQQLDQVIQQNASASEEMASTSEELASQAEQLQATISFFRTDD
RGASSRSAVHRPVAKKKAAIPHLGHGTSNGYHAEPATSRKVAVGGGVNLN
LDSDHLDDQFEKF
>GSU1303 methyl-accepting chemotaxis protein
MGLRNMTIRWKLIAMTAIIVVLTATVITAICLARFKADLMRVATVSQETR
LKTLWELLRQKGANARIDNGRLMVGEYVVNDNFEIPDKVKDLCGGTATIF
MGDLRVSTNVKKEDGSRAIGTKLQGPAYDTIFKQGKPYRGEANILGVPYF
TSYDPLRDASGAIIGVLYVGVKKSEFFESYDRLKLIIVGSAVGTVLLACL
VAGFVSGRLVRPLRDAVATVDRLATGDLSVAVEAKGTDEIGRLLAAMGNM
VSKLREVVTSVKSASDNVAAGARELSVSAEEMSEGATEQAAAAEQASGNM
EEMSGSIRHTADNAVQTEKIAGKSAADAREGGEAVAETVSAMKVIAGKIA
IIEEIARQTNLLALNAAIEAARAGEHGKGFAVVASEVRKLAERSQKAAGE
IGELSASSVRIAEKAGEMLARMIPDIQRTAELVQEISAACKEQDSGADQI
NRAIQQLDNVIQQNASTSEEMASTSEELASQAEQLQATIAFFNI
>GSU1030 methyl-accepting chemotaxis protein
MAIKSFKDWRILPKIIGAALLGVALLAAVVLFYFLPMVEKQEMESRKRAT
RQSVELAFGIVGAYEARIAAGELTVDEAKERAAADIKKLRYAKKEYFWIN
DSSARLVAHPLRPENEGKDMGDFKDADGKLIYREFAKAAGAENGELFVDY
RQIKPNEKTPLPKVSFVKYHKPWDWVIGTGIYVDDVKRDITMLRWKIIGA
IVVAGAVACLLVLFAGVRITRPLKVVVSSLEDIAQGEGDLTRRIDVVTRD
EVGDLGRAFNQFIEKLHNIISQVVQNSMQVASAAAQIHSTSEQTATGAEE
VAAQAGTVATAGEEMASTSSEIARNCMAAAENSRQANDTALKGSHVVKET
LTVMTRIADRVKESAHTVESLGSRSDQIGEIVGTIQDIADQTNLLALNAA
IEAARAGEQGRGFAVVADEVRALAERTTKATKEIGQMIRSIQQETKLAVS
SMEEGVKEVERGTSEAAKSGEALEEILHQIGEVTNQVNQIATAAEQQTAT
TSEISSNIHEITEVITQTTRGAQDSASATSDLARLAEELQRLVGQFRLS
>GSU1141 methyl-accepting chemotaxis protein
MPRVLSLWQRYLDLSVNAKLMLYVACFTVWLILVGAAGLWGMGVLSTGIN
RDGLALRRVLLVSGLKNDLLYLRHDLDRYFLENGNAASRAIHDAEQRLKT
IAGGIEALERHEPDAEQRRLLGVFAQEFALYREAVVRLIDLQKRALETGD
VTVRSAAASYAREDILPLFFGASDAVTDLVEFDRQQALQTVDANGLQYAR
LSRVLPALIVAAVTLGLFFGIVIARSITKPLARILAAVESLATGNLNVDA
PVGARDDLGRLAVGINAMVGRFRDVVTSICRDSEAVAGAASQLSGTACQL
SEAATEQAAAAEDASSSMEQISSAIRANVQNAQTTADVANRSSIDAAAGG
ETVTETVALMKEISRKIMVIEEIARQTNLLALNAAIEAARAGDHGKGFAV
VAGEVRKLAERSQSAAAEIGRLSVTSVEVAERAGTLFGAIIPDIRQTAEL
VQGISSACHEQETGVGQINRAIRQLDAVIQQNASASEQMASTAQELSSQA
DMLLDAVSFFRLGETNRYQESRSELSGIS
>GSU1029 methyl-accepting chemotaxis protein
MSAWRDLKVRTKIFVLVIAGCLGLVVLGSVALYNMRNLSGSVKEANIGME
HVAGLSGMKSDFLEMRLALVYMLALKDAEKIGGKEQDFLKAADRIKKTLD
DLGKQELTDTEKKSLVEFRGGFESYVEKGTRLAELIKDATAKGDEVGRAD
AMTFATQSVAPLYDTPAKIIASMVQENIGEAHKMYEQDMASYRASFIMMV
VIILGVIGVAAAAGLAIAGSISGPLNKVLDVLTRVAAGDLTARADVVSAD
EMGLLAREVNTTAAKINEIIGLVAHNASQVTAAATQLHATSTQMSTGAEE
VAQQAATVATASEEMAATSAEIAHNCSLAAESSRHANDRAENGSDVVQET
LTVMNRIAERVKDSARTVESLGERSDQIGEIIGTIQDIADQTNLLALNAA
IEAARAGEQGRGFAVVADEVRALAERTTKATKEISQMIKAIQGETKGAVT
SMEEGVKEVEKGTSDASKSGEALQAILEQIGGVTMQVSQIATAAEEQTAT
TGEINNNIQQITEVVQLTARGAEESAQAAEQLAKLAEELQDLVYKFKLA
>GSU1140 methyl-accepting chemotaxis protein
MFKNMKVGLRLGIGFGVVVTVFMIAILVTLVLLREVNQESRQVAEESIPF
LMSAYEMDVALAELTENLTDVAATHSPDGFKGAEEAAAIVKREITKFREM
FRKENDTVALKELDDVEAAFTAFHLSGVKMAKVYMEQGIGAGNPLMKEFD
NAHEVLIEKVEKLQKSQVDEALGNSRDNVAAVGKVTMVLIGFGIVAVLIG
VAVAFFITRSITLPLVRAMEASNRLAEGDLTIEIVADREDEAGQLLKSMK
NMIDSLRSLATTAERVAEGDLAVEVVVRSDRDVLARNLHGMLETLKGLRQ
ETDELIGAVRDGRLSVRGNARTFSGGWGELLTGINQLVDAFVQPIQVTAT
ALNRISRGDIPEKITAEYKGDFNEIKINLNSLIDAMNSITALAQELSAGN
LTVEVKERSERDELMKALASMVTKLRDVVADIMIAADNVTSGSQQLSSTS
EEMSQGATEQAASAEEASSSMEQMSSNIRQNADNAAQTERIAIKSAADAI
EGGKAVGNTVSAMKEIASKISIIEEIARQTNLLALNAAIEAARAGEHGKG
FAVVASEVRKLAERSQKAAGEISELSSSSVEVAVRAGELLATIVPDIQRT
SELVQEISAACREQDTGAEQINKAIQQLDQVIQQNASAAEEMSSTAEELS
SQAEQLQDTVAFFSIGGEMKRKIAPKPSRPNAKASIRLPAAPHGTANGYG
RTSASVTGGFALDMAGHDHLDNEFEKF
>GSU0683 methyl-accepting chemotaxis protein, putative
MKWFEELKVSSKLAVSFMVVIVLTTFLGIFSIFELSRVNETGTDMAENWI
PSLNAISAMQLDFASYRRLELQHILEVESAGQKTYEERMAGLVKSIAEHQ
KEYEPLLSTPEEKQMLQEFSTKWQEYLNEGKPVLELSRQNKAQEAAALLN
ANSRKLYNEAGALIDKLKTLNTQEAKDASARGDKLYSSARIWIIGSLIAC
IVLAVVMGLVITRVLLRQLGGEPTAIADIANKLADGDLRIAFDTTGKAET
GVYAAMHNMVEKLKGVVADVKSAADNVAAGSQELSSSSEEMSQGATEQAA
AAEEASSSMEQMSSNIRQNADNATQTEKIALKSASDAKQGGTAVAETVVA
MKEIASKISIIEEIARQTNLLALNAAIEAARAGEHGKGFAVVAAEVRKLA
ERSQKAAGEISELSASSVQVAEDAGEMLTRIVPDIQRTAELVQEISAACK
EQDTGAEQINKAIQQLDQVIQQNASASEEMASTSEELASQAEQLQATISF
FRTDDRGASSRSAARRPVAKKKAAISHLGHGMSNGYHTEPATSRKVAVGG
GVDLNLDTDHLDDQFEKF
>GSU1294 methyl-accepting chemotaxis protein
MLQNMRIGLRLGLGFGLVVVLMVILGGISINRMATLNTDLDMVVKDRWPK
AETTFGISSQINVVARALRNAILLDDPAEVQKEIARINEASVSVSKSMDE
LSKSITSEEGKAKLKAVEASRAAYREDLLKLVEYIRAGNKSAAQKMLFGS
YRERQRSYFDAVDGLTQYQAKLLAVSGKEAEQTFVSSRNIIVALLVLSAL
LACACAWLVTRSITRPIGACMAAAGRIASGDTDVTLDVTARDETGLLQVE
MQKMVEAIRALIADADMLSRAAVEGRLATRADAAKHQGDFRKIVSGVNDT
LDAVITPLNVAADYVDRISRGDIPPRITDTYNGDFNEVKNNLNRCIDALN
GLLSDMNEMSKMHDLGDIDVVMPADNYQGAYRIMAKGVNDMVNGHISVKK
KAMACVAEFGKGNFDAELEKFPGKKAFINDTIEAVRSNIKNFIADMGHMS
QQHDLGDIDVKMPEDQYQGAFQVMAKGVNNMVGGHISVKKKAMACVAEFG
KGNFNADLEKFPGKKAFINETIEGVRTNLKSFEEQLGILITAAADGQLDK
RANADLFVGGWNQLARGVNDTITNIVEPLMVTADYVDRISKGDMPPLITK
EYRGQYNIIKQNLNTLIDATNGIVAAAKEVAGGNLMVELRERSAKDELMQ
ALSAMVKKLSEVVAEVKSAANNVAAGSREMSSGSEQMSQGATEQAAAAEE
ASSSMEEMSSNIRQNADNASQTERIAIKSAQDARDGGKAVAETVTAMKDI
ASKISIIEEIARQTNLLALNAAIEAARAGEHGKGFAVVAAEVRKLAERSQ
KAAGEISDLSASSVEVAEKAGEMLGRIVPDIQKTAELVQEISAASKEQDT
GAEQINRAIQQLDQVIQQNASAAEEMASTAEELSAQSEQLQSIISFFRVD
SSAQSSSAIAAAKPAAKKPALAHAPANGYHKANQAPAKKVAHAGLNLNLE
GGDHLDSEFETF
>GSU1491 type IV pilus biogenesis protein PilB
MQASRLGELLVRNNIITKEQLAKALDEQRTSGGQQRLGSILVKNGLVTEP
DLTTFLSKQYGVPSINLSEFEADMAVVKIIPADVAQKYQIVPVNRAGSTL
IIAMADPSNIFAIDDIKFMTGYNVEVVVASESSIKTAIDKYYDQSASLAD
VMNDLEMDDLEVIGEDEDVDVSSLERATEDAPVVKLVNLILTDAIKKKAS
DIHIEPYERTFRVRYRIDGVLYEVMKPPLKLKNAITSRIKIMADLDIAER
RLPQDGRIKIKMGGGQDMDYRVSVLPTLFGEKVVLRLLDKSNLQLDMTKL
GYEPTALSYFKEAIHKPFGMVLVTGPTGSGKTVSLYSALSELNKTTENIS
TAEDPVEFNFAGINQVQMHEDIGLTFAAALRSFLRQDPDIIMIGEIRDFE
TAEIAIKAALTGHLVLSTLHTNDAPATINRLLNMGVEPFLVASAVNLITA
QRLARRVCSECKAVEEIPIQALIDAGVPPEEAPEYVCFRGTGCAKCNNTG
YKGRVGFYQVMPMLEEIRELILNGANTAEIKRESMRLGIKTMRQSGLTKL
KEGVTSFEEVLRVTVADD
>GSU0989 NHL repeat domain protein
MVHGLDRRLGLAGRIRPDALVADRCGTLIMLAGERFYRLDPMSGRLERIP
CLGGRGDRAGELSGPRAMALGSRNLYVADTDNNRVCVFATVNWQVRRFIG
AENPAGEPAAGTGPGEFDRPLDLAVDPCDNLYVLDAGNRRIQRFDYHGEP
VPHVPPFGADRLKQPVALALGPAPSPSGGGALVHCLDTGLTAIVTFDDQG
RFLGTVGLDDLGFEPAGLAVDADGKWYVSDRERFIYAIRSAGDWSPLEEY
EGKALRLFAGPGGEFYALEDGEVARLTRRRRYPPAGSWTGSGPVTGIYTS
RSFDTGDGRLFWHRVTLDATVPPKTQVRLSYFIYETGRDPELLPADGEWR
SFPSNPADALFERKEGRYLRVRLELISEDRHATPTVVSLRLQFPKQSYLR
YLPAVFQDDERGRDFLERFLSLFESVLYDLEREIFTTRRYADPCAVPAGF
LPWLASWLALPDADQWLEDGGARLRTLIARANELYRCRGTRGGLAELITL
YTGKEPWIVEAFQLDRIRGRSEWRETMRLFGEDPYHFTVLLPPGGAGRTE
TVKRIVARERPAHTCATVVALENLFRLGGHTYLEVNTNLNQPLFALETSS
SLARQTYLADGEKAGQAQVRARQGMDTLFE
>GSU2031 type IV pilus biogenesis protein PilN
MVRINLLPVRSSKKKETARQQMAILLVSVLVVLGIGVGLFGYAQAKIKAT
KNDISGAESELQRLKGKIGELENIKKLKDDVTKKLNVLTQLRKEKTGPVR
RLATLSDATPEKLWLTKYSENGPNVSIGGVAVDEDLIAAFMRNLQQTEDY
TNVELIVSEQTEIGGVKAKRFELTCVIKALKKEEPAPAKKK
>GSU3196 methyl-accepting chemotaxis protein
MRIPLGYKFILGFVVVVAAVAFVPTGIRLLGYAPEITHILTYVVAMTIGL
ILGWIFSRGVARNMGLLTDSAEAISRGDLTRDVALPPGRFPDETVDLGDS
INIMVGSLRELVGHIRTTAEKVAASARTLSDSTVEVNSSSEEVAQAVEQI
ARGAGTQAEMVERSSRIIHEMAISVELVSKRARESAKAAQETSRTARRGQ
KLANDSLERMTSFFGKVEESSAQFLSFNARLQQVGKIADFIAEIARQTNL
LALNASIEAARAGEYGKGFAVVADEVRKLADGTGKSAAHITELIAAVREE
SRRVQQLIEESSRDIGEGKRNVDITAGAFQDILSNALETERRAGSIADLS
HIQTDGAQKMVTAVDEIARVAEDNAAATEEVSAASEQQAIAMHEMTVAAR
DLADLAGVLMGVVERFILPRSEGGNKAP
>GSU1500 hypothetical protein
MVPFTDHMRGRPFVEKLGYLPASDALKVLVADQKQSVGASLIMKVMMYFG
GKVSGGNLKVTTDVDYKAMSRTVHAALKLDPYNMDGYYFAQAILVWDVKQ
YRLANDLLEYGMKYRTWDWYLPFFAGFNYAFFLKDYPNAARMYMRAGDLS
GEPLFKKLAGRYLQQSGQTEIAIAYLTTMEKGARDKAVKESFRIRLTAFR
RVLLIEKARDGFIAEHGRLPSSVEEMLAKGYLKSIPADPYGGKFYLEPTG
DVSTTSKFAFAGVQNN
>GSU3304 LamB porin family protein, putative
MKKKIFAVAAAGALTAATAVPALALENEFHGMFRVNGFISNFADGASGAT
KITAQDPKTRNFLEQRARLLYMAKANDDLKLITHFEIDSTWGKSSYVVGR
SNDGGALGADSVNIETKNVYLDFNIPSTPLNVKVGIQPISDSYKGVFINA
DAAAVLAAAKMGDATVALGFARLDDADTFTIGSAANRSATTPGKATRDLY
IVDGKYNISKDLKVGGSYYALLADQNDTTLNFALHTIGVNAEYKFDPVTI
DGFLLYQRGRTSGTAKVDFGGWAANATAKMKVGPGTLKTSFLYASGEQNA
NPDNSSYMGVTNETSGANGEHSFYESEMMIMFRNKYNIGDRALVYNVQNV
IGGFVGYNANITSKAFAIANVGFVAADKDNTTYGNARVTGEAGHKSKYLG
TELNAEVGYKVFDNLTASVQGAYVILGDYFKDTAGTAANPEDPRNPYLTR
VMLNYAF
>GSU1287 methyl-accepting chemotaxis protein, putative
MEGPERLDTVAFDLFRRFRQDGAEAGCPPGGAPSAVADDSLLRELQVWKS
CLADAGDRLQQVTGSTEEEFLAVGARLHEFYSRAGDIERMTRGVAENVLG
DEFGSDMKSLSAILDRIAAYLGQADSQTEQLTQTLRSVLELIDRVDTPLE
GFRKIIKNLHMLSTAVKIESARLGEGAAGFNTLAEDVERLSVSIKEKSTR
ILGEKDSLGRVITDTLESISRIEASQREDVRRIISETGENLGALSTLHAR
CSDVANNVATLSAEIADSIGEVVTSLQFHDITRQQIEHVKEALEDVVRHL
DNPGDNPQGDVAEAAEVCDLQLAQLLHSRDELVSAVERIILNLRDIVAKE
TRMSEETRGITGSADQTGHSFFARMEDEMASVSRVLTDNVRAKRDTASAM
TLVVSSVNDISAFVTDIEEIGTEIELIALNSQVKAANTGDGGAALGVLAE
SIQHLSVDARTRTGDVSVTLREVTEVTGRLVMEVDADVSSGTDEIERLLV
ELKRLLDSVESINSRLLGLLADMDSAVGSLSTDIEAATSGLSVHNTAGAL
LDEVATALEETLRDMRRAVPAATRRGNGGKLLDLAQRYTMHSERTVHHRV
VGGGTPAVATGATASPGDGLGDNVELF
>GSU3201 chemotaxis protein CheD, putative
MSRIVSVGISEFKIASAPTILMTYGLGSCVGIALHDPVALTGGLAHTLLP
APVRGMDSMVKSAKFTCWAVDLMVEELIKCGCVAERLVAKLAGGATMFEP
QHRTTHSGIGERNVTAAKEALERRGIPLVASDTGDDYGRSLEFNTVTGVI
TVRALQRPIKRM
>GSU0420 flagellar protein FliL
MAADEKAPVEGAPKDKKKLFIIIGAAAVVIIALAVVFLGGGKKEKGKEGE
AEAKVEQKAEGGHGGGKEGAAGAVATAFALEPFIVNIYDGQELRYLRVKV
EFETATPDAKAEIESRQAPLRDAILVLLTTKTLQDIQDLQGKNQLRDEIL
VAANKILPPGKVTKVYFTDFVVQ
>GSU3046 flagellar protein FlgJ-like protein
MRTSMPTETLMSDAETARVQQLASRAGKAEQERMAAKKVAREFEAVFIGM
MLKSMRDTVAKDDLTGGGRGEEIFRSMLDQEYATACAASGGLGLAPLIEQ
QLLPHETQPSVVPAAKPDADKKTGAR
>GSU1497 hypothetical protein
MKKIITIVAMLLAMQGIAIAAGKIPTTTMGGKDFTFKPSTNVSVSYFTTN
GATSTAGTVNTDYAVNTKNSSGNRVFTSTNNTSNIWYIENDAWKGKAVSD
SDVTALGTGDVGKSDFSGTEWKSQ
>GSU1013 chemotaxis MotB protein, putative
MLKRLVILAVLSLSSLMLSGCLVGEGKYLKKVEEADNLSKELTTLQEKYS
ALSSENEGLKGALAKLKDEAAGLAQDKEKLTADNRELQQVLQAKSDSLSQ
NIVELRQKVSQLEAENARLKADIASTQKAKEEQVREVSKTYEDLLDKMKG
EIAQGQVTISELKGKLTVNMVDAILFDSGKAEVKPAGMDLLQKVVEILKD
VKDKAIRIEGHTDNVQIVGNLAKRFPTNWELSSARAINVTRFLQSRGIDP
AVLSAVAYGEYHPVAPNDTDEGKAKNRRIEIILVPKDAP
>GSU0583 methyl-accepting chemotaxis protein
MALNYAPAAGYDRGHAHLLAGMLGCNALLSLFALWMLSARKLVVRVNNLA
SAMDRGAEGDLTVTVTDESSDELGQLTDNFNAMFGRLAGMVTRVKEAVEE
LRAISATVKDAVERGLDTAEVQTGAVQRTTDGIRAIDRSVNEVAQSVESL
SRTASENAAAIVQMSTSIEVVAEHMEGLAREVDEVSSSIIQMAAAEKEIG
RSVRVLMEDASRTASLVAEMDLSIRQVEKSALETAAISEEVLRDAELGRD
SVDRTISGISEIRRSSRSASDTITTLSHRVGDIGTIISVINEIAEQTKLL
ALNASIIAAQAGEHGKGFAVVANEIKELAKRTTSSTGEIAEIISGLREET
VRAVQAIKQAEDRIGEGETLSYRSGEALEKIVDGVKMAVDQVGEIARTTV
EQAQGSENMRRAMERVAEMVEQIMRATQEQAHGTELITEAADRMKSLTGR
VFSSTREQRDTSTHIVRSSEGVTHMISTIRQASQVQAENSQKIVEAVENM
ETTAVNGLDTTRLMEEAVSRLARQTEGLTEAMAGFKVR
>GSU2143 hypothetical protein
MNKGIIAVVAVVGALMASSAVYAGWGWGNGGWCMTGNSQNVSTQKMRSFQ
KESFKARESLMDKQLELQDEYSKDVPDGRKIAALRKEIASLQDQLQATGD
KYGVGNWGTGGGMNYRQSSGYGCGCGYCNW
>GSU0291 CheR methyltransferase, SAM binding domain protein, putative
MSPHHTPLHPCFDPAETSNELSRLLVSGPIVDADLDRRIARLRERFRIYC
GIYPFGMWDSGLVVTREMRALTDLLLPLAEILPAFSRLFRLTLRFPPLLE
ATPLTTACSWLDLLERLDVSAARGNPARLLEQLAGDGARRTSFVFSLFIP
RHYGGGFDRYPDQTAFLKRWIRQRGTPEGGIFSVLDAACGCGEGTYALAR
LLMGMGVAPDRFRVLGSSLEEIELFAARHAFFPHDQRRGEVLRSFAAPLL
AAGAATSIGFVREDIREATGGAWDVILCNGILGGPFIHDRQTLERTIARL
AARLAPGGLIVAADRFHEGWKRRVPAAMLEEMLRGAGLTVLTVGEGVAGV
RP
>GSU1032 methyl-accepting chemotaxis protein
MAGLTVLGLAAVTALSLYGVNRQVAETETIVQVDAAQVDVALKSQVSLAE
AVRAYKNYLVRKDDKHVTGFRESIGTFEKNIAEFEKLANTDGEKAAVGKA
KEELGKYRGCIDELVAARGASDDVAAIDRNLARGIDRPLEAAVKEMEKAA
RQSFGDSRAALAASSKRLLVGQAVFSVIVALLAAGFGFNTARRFVFRLGK
FSEMIARVADSDLTARVVIRADDELGDMGRTFNRMVENFEHMLTSIQNAV
LNLSESARTLSVTSEQIATGAEEMASQTGTVATASEEMAATSQEIAQNCS
TAADVARNASASARSGAAVVQQTIGAMERITERVRDTARTVEALGARSDQ
IGEIVGTIQDIADQTNLLALNAAIEAARAGEQGRGFAVVADEVRALAERT
TRATREIAEMIKSIQQETRGAVASMEEGVVEVTQGSADAARSGDALREIL
DQIEQVTGQVAQIATAAEQQTATTSEITMNIQQITEVVGHTAREAGESAD
AATGLATLADELQTEVRTFKTSGSELFILELAKKDHSGFVTTVEAVLVGR
RRMEAGELSTHHTCRFGKWYEGDGRQLCGHLASYKAIYAPHERIHSLARD
VVAAVNGGDRDRAARLFPELKELSREIITRLDDIRREFEAQRAAA
>GSU0435 MSHA biogenesis protein MshE, putative
MESIVKEGSLGSILFKCQIISEDDIRRALDEQERTGGRFGEALVSLGIVT
QEDIDWALSNQLNIPYVRLKPAMVDRDAVALVPAVMARQHNLIPLIRAGE
ELSIAIADPLNVAAVAAVEKETGCAVSVSVALIREIREMQERFYGPPDTE
ERLGFTSSAFPPQALAAMNHDLTGGKFIDYLLLFVAQQKLSSLSLHPLGD
RVSVIGRRGGTTREVGQLAPSRYPDVVMHVKKLAHIDGARFSARGGLSFA
LKGRSIPFQVATLRGEGGDHLTFRMTVAALFPTSLADLGLTDDQVRQFAD
LAAAGRGMVVTGARDREIRRRLTDLYLQEHEAEGKTVLVVGSGAGTGEQR
FPRIPVPSDADLSAVVSACLEHDPDILVLEDVTDGQAFAAACRATLRGKL
VVAGIGCGDAVGALDQLIAFRDMHVLVPAYLRGVITCTPIRPLCPACRRS
EPFPAAERAALGIGADVTSCWRSAGCESCDQTGHDGRRYLLDVLVLDHDL
RERFEAARNGAEVIEHLRGQGWRGITDERQTLLAEGTISLEEYASSLHG
>GSU2609 type IV pilus assembly protein, putative
MSQKKVGEILIEHRLISEDQLREALELQKVFPDQPVGQLLCKLGFLSESE
LSYILEQTGKRQKLGDILIRERLVDEERLNQARVAAKRDGSTLERALRKL
RLVEEEPLAKTIATQYDLSFVHINTLEIEPDLARCINPNYAQRQRIVPIS
RIGNTITLAMAYPIKLHELKELEQSIKSRIIPVIAMESEIIQAQQRLYKT
AASAAHALTLDEADLEIAPGSIVDILSSGAGEDEPDIDDEVRTITERDSV
IVKLVNKIIFDAHQNRASDIHIEPYPGKNDVIVRMRVDGSCKVYQRIPFK
YKYAIPSRLKIMAELDIAEKRKPQDGKINFKKFGPLDLELRIATMPTAGG
LEDVVIRLLNTGQAYSFDSLSLTDRNMRIFGESITKPYGLVLVVGPTGSG
KTTTLHAAIARINRPEVKIWTAEDPVEITQKGLRQVQVNQRIGLTFAAAL
RSFLRLDPDVIMVGEMRDEETASIAVEASLTGHLVLSTLHTNSAPETVTR
LLEMGLDPFSFSDSLLCVVAQRLARRLCEDCRELYRPDRKELSEIIEEYG
EEQFAATGLLGNEVVLARPVGCTTCNQSGYRGRLGIHEVLEGTDTMKSLV
KKKSDTEIIRRQAMADGMTTLRQDGILKVFQGLTDIHEVRKVCLK
>GSU1063 hypothetical protein
MSLVEMLIALLILVVGFLSVIMVLWMSINSGRFTRDMTMAASLGQDMLER
FTARSYGSLPATGGAFEPYTTANASAVGYVREVKVEDNVPDVGIKTVTVR
VRWNSNGHERSRTFTMLKRDY
>GSU2185 flgM family protein
MKIDTNPPVTTVNQVKGETSQAASGADARKTGAAGGQATDTVDLSRNAER
LVKANATLRTMPDVRVEKVEELKKQIAAGEYNVSARDVAEKMLISMRNGV
TA
>GSU2038 hypothetical protein
MRQSSVLGCARPAGLAALLTLLLAAGGDGAAAATMNDYCIQPPFVSQSVP
PLVMFEVGREHKLYYEAYNDANDLDDDGRLDTTYKHSIDYYGYFDPYKCY
THSGGSGSNDKYTPVSTTADKFCSSGQWSGNILNWLTMSRMDVLKKVLFG
GQRSADSNTATYLERVYVPQDAHSWGKEVTGRLCSNGTNYTDMCQFDSDC
DTGYTCVDKSVNLIGITASDTGTACSFTSSIKWDTTGKILVAKYTHSNFS
CGSDSTDLISSYEPANLVAGFPVYVATFGDAILNPAADHGDQFNYLALAE
FSVSKSDKGNWMFAIDGDDGVELEIINPAGDASTIVASRYGCNSACNCQT
NSGTINLNTTGYWRLIARHSEKSGQDGVKVWYKKPSKTQSSDPWVLFGSS
TLTLRAPTIPAGAECTLKDRSFIETGKPKVGTTPKQHLFCSTTLSDGGTP
ILRFLGNKENRIWEWVSKERPVCDSSLGAPTDYTVQVEVCKSVSPDNRPT
GKKDDLASGRETNCKDYAGTFKPVGLLQKFGEGEGAKVCSRTLAKSCTSD
SDCGAGEGLCIYKSPMYFGMFTDSYTKNLSGGVLRKNIGSILDETNANNG
IFQTSENVQGNIMITLDRLKTIGFRYTDQSYQDASGGSCGWITDRPLNEG
ECRMWGNPIAEMMYESLRYFAGKGAPTTEFTYTTPADSGLSLSKPSWGYS
KGSTTYQLYDIYPPCAKPFMLILSDINPSYDSDQIPGSSFKKTDGTYFSE
DAASPQLGLGVAGADGVSLLNKLADTIGTSEGIIGDSWFVGENGSTTDFV
CSSKSVTKLSLLRGMCPEEPTKRGSFYSAALAYYGLTLMKEKTGKPDVST
FVVALSSPVADLKIKAGNSHVSILPVGKSVSGSHGINASCAQKCTLTADE
DGLHISNCSSTAYCPSNQIVDFYIDSLKYDNDKNVIYAKYRINYEDVEQG
ADHDMDAIVTYEVCTQSAIDQGLGACSGSLGSNIQIKLNSDYAAGSIDQV
MGFVISGTTEDGVYLPVRDRDVSSADSDTPATVAGLPLNWSKTFTISGNP
TGTLKSPLWYAAKWGGFIDANNNKKPDLASEWDKDGDGEPDNYFLVVNPL
KLEQQLQKALTDILNRVSSGTAASILSNNDNNGATLLQAIFYPRKNFAET
ELAWTGELQAFWYYIDPFLNTNSIREDTDQDLRLKLKTDYVLDFRFDTND
NKTKIDRSLDVDGNGSGDSYVNTIEPEQVNALWKAGSLLWSRNLSTSPRT
IYTSYRDAASKDQLTVFTTAGKDLFKANLQAADATEEDKIINYIRGTEQS
GYRNRTVTIGGSTGVWRLGDIISSTPRLQSNARLNGYHLPPPVGYKDSSY
QRYLDSNEYKTRGMGYVGANDGMLHAFNLGVLKAGTTKDVTSFITGSDFG
KEMWTYIPRNALPYLKYLADPEYDHLYYVDASPSLNDVSIEVTEGTGCTD
AAYWLCTKQTVYQAGTDSTTKELDLDKTSWRTVLLGAMGLGGASRNTTDA
CSASTDCVKTPIANVGYSSYFALDVTTPTSPSLMWEFASADLGYSTVGPA
IVRIGGETNGRWFAVLASGPTGPINTQTHQFLGRSTQTLKLFILDLKTGA
LLRTIDTGIQNAFAGSLSGGTLDTDRSAGTTGKYNDDAVYLGYVRKDTTT
GTWTKGGVLRLFTKENIDPAQWWWATLVDDIGPVTSAVAQLQDTTHKNHW
LFFGSGRYYYKAGSDLDDAAGRRALYGIKDPCYDLNNKMKTTCNTPTVLA
TDLVNQTDSIQGMGTAPGWYVLLDEASGSAGAERVITDPVAAPNGAIFFT
SFKPAADVCKFGGDLALWGVNYSTGGYLAPSQLIGEAIIQSSTGSFEQID
LGSSFTQRLNRKTAERQGVPPRNKPTIVTNANIKPQKRIIHIREK
>GSU2059 conserved hypothetical protein
MSAITFEEIMFTIMSATQDTFKNYLSVDIFAGKVERKVDPVDSDVVGIVG
VAGDRVGYIILAAESTAAVTIAKELLMLDEPDEESIRDAIGELTNNIAGV
FKTKYHEQYGNVALGLPLIVSGMLRTVAEGTSQDQSGGSSSMNVQCKGVT
IPFMSLDGKFALRVMVYM
>GSU0314 general secretion protein E N-terminal domain protein
MPIRLGEMLIKAGMITHTQLDEALKGQVIFGGRLGTNLIEMGVIGEEELA
RVLSEKLRVPCVDPDELMAVPDHLLSLVPRDMVERYKIVPLGVDGRRLRL
VMADPSDLPAIDEIAFRTGFVIVPMVAPEIRLFMALEKYYGIRREVRALP
VSETLGGRRCTYGTKRPAEQLMRRDVVDFSTLPDNGKYLPWEGGDIDDNR
IEAAERYTIDALSRTLADCRERDAVADALVDYAGRLFGCAGLLLVMRDMA
AGWEAVAGHERLREFGQLRISLQEASHVRTVVEERSVYLGPPGNMPTDRR
LAEALGGAGAPGLMLVPMVMGRRVVTILCAAGEMVALGARLSEAQTIARK
GVLAFEILILRGKILMT
>GSU1298 methyl-accepting chemotaxis protein
MTWFHNLKIASKLIVGFALVTLIACVIGAIGITKIMQIEKADTEMYELNA
KPMGPILTTAVAFQRIRVNYREIALEQTNEGKVKFSNRIKELQKTIDDNL
PEIEKSLKSEETKKAYADIKAELAKFAPHLDKIVALAMDGKNDQAVAYMR
SDSVAGIARSVDDSIQKLADLKIDLAKKKSDANTAAAKAAVAMTGIILVL
GVALAVGLGIFLSRIISRPLRSAVDVSNRLSEGDLTVTIEATSKDETGQL
LAAMHNMVEKLKGVVADVKSAADNVAAGSQELSSSSEEMSQGATEQAAAA
EEASSSMEQMSSNIRQNADNATQTEKIALKSAADAKQGGTAVAETVVAMK
EIASKISIIEEIARQTNLLALNAAIEAARAGEHGKGFAVVAAEVRKLAER
SQKAAGEISELSASSVQVAEEAGEMLTRIVPDIQRTAELVQEISAACKEQ
DTGAEQINKAIQQLDQVIQQNASASEEMASTSEELASQAEQLQETIAFFK
TGEQVGLVRKAAAVRQFAAKKKAAIPHLGHGTSNGYHAEPATSRKVAVGG
GVDLNLDSDHLDDQFEKF
>GSU0726 chemotaxis protein CheD, putative
MRRHILQACGIRRFCYTVTMRHQRLEGRHIIRIAPGEYHVTTGGGVISTL
LGSCVAACLFDSESGVAGMNHFLLSNHRYSRTMPFCFTEAGRYGIHSMEL
LINSLMRHGARRENLRAKAFGGASILVNRAEVGNFSCVGSVNARFVREFL
ANEGIPLLSADLEGERGRVIYFDTTDFSVFVRKIRLQRSLVVAKRDRNVW
EHGVQAHDVEPDSVDLWLPG
>GSU2652 methyl-accepting chemotaxis protein
MSLLLNLYLHLKIRTRIILLCVCYSFCIIVAVIAGRLFSTSQAIVSTSVF
VILGILFSWLLFWTVNDALTRIMGYLAGITRGDLTETIAPKRNNEISDII
RSIGELQSTMREIISRISQTSQEVALASRQLQANADQIASGTENVASQAN
TVAVASEEMAATSSDIADNCLSAADNSTRASTTARSGSEVVRRTTDCMER
IADKVKGAARTVEGLGSRSDQIGQIIETIQDIADQTNLLALNAAIEAARA
GEQGRGFAVVADEVRALAERTTRATREISQMIKSVQTETKEAISAIDEGV
AEVEKGTEYSGESARSLDQILQQISDVTQQINQIATAAEQQTSTTAEISN
NIQQITAVVDQTAQGATETAGAAATLSRQSEELQRLVGQFKL
>GSU1876 hypothetical protein
MRTILAMAAATLCLTASICMAATGGKGTRQEAKALVERAAAYIKEHGKEK
AFVEFSNPKGKFVKNDLYIFAIGFNGVFLAHGSDPTLIGKNQLQSKDENV
RLVTIGLIDTARKGEGWYDYKWPHPKTKVMQKKSSYVKRVDDTVFIGCGV
YH
>GSU1783 type IV pilus biogenesis protein PilB, putative
MEKQSFKRKTIGQILVQQGSLNPDQIPYLVEKRNASTKRFGEVCVGDGLI
TEENLARALAEQFGLDYVDARGVRLNESLLATLPPDAIYRYQFVPLEEED
GTLVVLIADPTDVLKLDELELLLDRPFIVKLATETAIATILKKGEATSRV
LKEVSEDFMLQLVKENEKGEEILSMEKISADTSPIIKLVNSTVMDALTRR
ASDIHIETALEGVIIKYRIDGVLYRATEPLDIHFQAPIISRLKVMSELDI
SERRIPQDGRFKVRLNDKAIDFRVSIMPSAFGEDAVIRILDKESIASDLK
GLTLETLGMHPREMKRLRRKIREPYGMVLVTGPTGSGKTTTLYAALTEIH
TGEEKIITIEDPVEYVLRGIVQIPVNEKKGLTFARGLRSILRHDPDKIMV
GEIRDPETAQIAVQSALTGHLVFTTVHANNAFDVLGRFIHMGIDPYNFVS
CLNCVMAQRLVRKACPHCKYPVEHSDSVLIESGLDPEECRDVTFYESRGC
EECNGTGYRGRSAIIELLDLNDQMRELIMAKAPAAQLKAAARESGTVFLR
ESAIEKVFAGETTLREINRVTFVE
>GSU0329 general secretion pathway protein D, putative
MKKRFLNLLTAAVLACALLAPLPASAKGVVLNFNDVDIATMVKFISDLTG
KNFVLDERVKGKISIYSPSKLTPDEAFSLFTSVLELKGFTLVQAGKVYKV
VPTAAAKQSGMRLLSDKDRLPVSDAYVARIIPLERISAQEAVAFLQPIVS
KDGYIAAFGASNMLMVVDSALNIQKLTGILTLVDSPQKREGAEIIFLKNA
SADSVSGVIREWLGGKTSRPAGQAGQATATSSAGVLIVPDTRLNALVIFG
SDQDKDDIKKLIAMVDVVPPTTSSKINVYYLENADATDVAKVIDGLIKGT
PTTPGQPGVPAAAPVQSPFEGGKISVTPDKATNSLVIMASPVDYQNILQV
IQKLDKRRRQVFVQALIAEVSLDKLKDVGVQIGALAVGTQGDASGGAVLD
PFNFLSATSGPQFLLVKALEELGKNVSVSAQVKALVSDGAINVLSTPNIL
TSDNKEAEIFVGENVPFLSQTNLTTGGISQQSIERKDTGITLRITPQISE
GEYVKLDIYQEISAVKENKGQANDLVTTKRSAKTAVVVKDKDTVVIGGLI
QDRDTETINKIPLLGDIPLLGWLFKTKSTRREKTNLMIVLTPRIIRGAEE
MNDVSGQQRDKFGEALSLDAPFDLKRDLQLSK
>GSU0416 hypothetical protein
MEIMQLLQIAPQVVPTGGIPSEAPAQPAGDDSLFASLFAGLLMPPITTNA
ATQEVPDASSESLENAARNQDQENPPATDATLASLVPPIPLPLPAEAAPA
PESTASSSGTAAAAPVAPVSSPVTTTVAEPPASTMPHAMAQTAPTEAPVA
AASSVPAGQTAAVADAGAAEKPVEVLNVAVERTPVRQAGADILPEAEQSP
QSVGMTEEATAETRSPVRTMGPTAAYPDRIPVRHAEGIPAGKADTVNPVE
PRSAEVRADATTGSQVKALDVEVTVRDGSGEQHFAEGHEQAFTGKESAAM
KDVVSGHATHEAGKSTTAQPTETRSTSESSHSLRDSIMAQVRDAVTTREP
NGNGRISIRLNPVELGELTINVRVADQQVKVDVVAANGQVRDILLNNLDN
LKENFSRQNLTMTGFDVSTGTGQGFEHQLFREGGQAGQGGTHFSFSRGDD
LDGDIQAAVEDTRRYVTDRRENGLVDVRL
>GSU1784 type IV pilus biogenesis protein PilC, putative
MALYHCKLGSSEGRIITRELEAANPEMLRTSLEEQGFFVFEIKKKPLQFL
WDKGGGRRKVDNKALLTLNQELLVLIKAGLPIIQALDTVLERVERGTLFD
VLAVVREDVKGGMALSDALEKHTKVFPHLYVASVRAGERTGDLQLTIRRY
IAFLKRVEEVRKRFISALVYPAILVTVATLAITFLLVYVVPTFSQVYADA
GSQLPLPTRILIAFSTSLKQLFPLIIAAVIGAVFFFRRWAATESGRYRVD
DIKIRIPFIGDVFSKFAVSSFTRTLATVIGSGIPIVESLKMSVGTLNNRV
LERRMLEAVVKIEEGMSLSGAIESARIMPPLALRMLGVGESTGSLEEMLS
DIAEYFEGEIDARLHLLTTAIEPAIMIVMGLVVGVIIVTMYLPVFKIAGT
VG
>GSU2032 type IV pilus biogenesis protein PilM
MLFSKKKEIVGIDIGSSSVKLVQLKEQKGGWQLVNIGIQPLPPEAIVDNT
LMDSSSVIEAVKGLMKGLSVKVKDVACSISGNTVIIRKIKLPAMTPEELE
DQIQWEAEQYIPFDINDVNIDFQILEPDEDDPSRMNVLLVASKKEIINDY
VNVFAETGLKLVIVDVDSFAVQNAFELNYETDPEEVVALINVGASILNLN
IVRGGSSLFTRDVQVGGNLFTEEIQKQFALSSEEAEQVKITGEYPDKAKL
KDVIARVNETLAVEMRRSLDFYNTTAGEGRIARVYLSGGAAKTAMLAETV
QNKLGVPVEMLDPLTKITCSEKEFDPEYLREIGPLVTVAVGLATRRVGDK
W
>GSU3156 methyl-accepting chemotaxis protein, putative
MKKQLEEKDAELDVLKQMLENVKNIVMLCDATPENTIFYMNKAARELLAK
YRGDLNAGLRGADVAAAMDHSIHQFHKDPNRVRMILGKPGEMPHSAEIPI
GGITLRTTSFPIWDKKNPGRVKCYMACWDDITAEKEVVERNHQELQRKEY
LEERVAQIATAMEEMSMTVTEVARNTSNASDSAVQVAQNAHEGQEIVNRS
VQEMQKVAQIVRDSAAIVDSLGGKSEKIGEIINVINEIADQTNLLALNAA
IEAARAGEQGRGFAVVADEVRNLAVKTMNSTKQINAMVAEIQRETRQAVG
SIENAKQEAEVSESLSLQAESSLVTIVQAIEEIKNVITQIATASEEQAAT
ASVIAGNLEEISRNG
>GSU1704 methyl-accepting chemotaxis protein, putative
MESGAVFKNRSSLYGVLGCALGIGAPIAWTFIRLVFFSDPGQPLLGQVFS
DITRSTYQVALYTYMGMGTALVLAVVGHHIGKTSDELHKRAAELNLLHQE
VASQKEIFENRYRILDNNIKNFHQISSRIQKSIDVDEVLRLCAEGLHDVL
GYERVNILMADTARTSLSFVAAVGTADFNPAGVVLPLDQRGGVITKCFTD
RQVYMIDDVSAYPTDFRLQSPYDAIRALRSKSFVICPIVVKGEAIGVFAV
DNRSSRRSLNDTDVDTIKLFADQASSAIVRINLLKAIGTLTSELETTFAD
LLKNRDHYSRYVVNLKDAVNSVADGTAHIASASESVLSAVDETSSAVSNI
YVAIEQVTRNIDYLSESIDKSVSAMEELNSSIKNVEQSAAISHQVSSTVK
EKADSGRAVVDETIQALDEIQRSVDQSFEAMKRLSENSGKIESIVGVIND
ITKRTNLLALNASIIAAQAGEYGKSFGVVADEIRNLSLQTGQSTGEITGI
IEEIMRESRSAAQNITASKDLVQRGVELGGIMGQSLQVIHESSTRSMDMT
HEIKTATEEQARSVQLVTNSIENVSSMSTQIYKASKEQSDAAMSIVRSVD
TIKEMAQEMVRATVKQVEDGSEIKKSVEAVGEMVTRIFEDMEVRREESGE
VVKELELMKKIAS
>GSU1496 pilin domain protein
MANYPHTPTQAAKRRKETLMLQKLRNRKGFTLIELLIVVAIIGILAAIAI
PQFSAYRVKAYNSAASSDLRNLKTALESAFADDQTYPPES
>GSU0916 methyl-accepting chemotaxis protein
MVSLWNFYLNFSVKARLATLCVCYSACIAATALAAQADSALIKYGSVILF
IVLGGIFGWINIWSINRPIQRAIGYLQTMARGDLSQEITVFRKNEFSKML
LTMRELQGSMRDIISGIQTTAADLSAASDLLRTTSSQIAEGTDHASQESA
SITTAVDEMASVSLAISHNCQKMAEEASGTGHATESGTETISRMTTIMEA
VEQMVSGTMAAVNALGANSERIGDIITAIRDIADQTNLLALNAAIEAARA
GEQGRGFAVVADEVRNLAERTTSSTREIQSIIGALQGDVKNVMGLMEQSS
DSVRNGTRDMHLSRQAIGAIKEHIAPLIDHVSQVAIAAEEQSATTASITE
NIHRIALVIRDAAQGAQQTETAAADLAQSATELQQMVNRFKLSA
>GSU2942 methyl-accepting chemotaxis protein
MRVIRNLHLSTKLVAGFVLVAVIAGLVGLIGTLKIRVMETAGTEMYDLVT
EPLGTMGGVAIAFQKARVNIRGMILDDNPARAQANANSIAKFYKEIDEGL
ADFGKSILSKDIRQEFDALRNTIAEYAPVREEIVTATLDGDRETALALMR
SQGLAFEKKIDESIKKLFDMKIAGAKKRKDMNAAAAQSALTQMMLLALIG
MVVAVVLGLFVSRQITVPLRKVVDFAQAIAQGDLAHRLDMEQNDETGQLA
EAVNTMADRLNRLIAGVAENASQVAAAASQLTSNAEQMATGAEEVAAQTG
TVATASEEMASTSAEIAQNCTAAAEESRRASDTAVQGSEVIRHTVGEMER
IAERVRETARTVESLGARSDQIGEIIGTIEDIADQTNLLALNAAIEAARA
GEQGRGFAVVADEVRALAERTSRATREISTMIKAIQQETKGAVASMEQGV
REVERGTAEASQSGKALEEILEQVGCVTMQINQIATAAEQQTSTTSEISG
NIQQITDVVQQTARGAQETAAAARQLSQLSAELQHLIGQFHLAA
>GSU0766 methyl-accepting chemotaxis protein, putative
MSLSFRNARLRAKHILAFSTMALIVAVTGIFGIWRVDSVVTQVKTMLRGR
ALQEKAVLNMQLSQKSCRVNLVEAAMVRTDPDEFEEHVNNYRAKNELFKK
YSNALLNGDPALGIPPSPAGSPIATYAAAVLASWGEFEKAADQLIAHKQR
LLKGLRSGVVDQAAKDALADETLNRLALETIRDTSENAKLDIDDLADYLE
SQTYAGLKTTETIRQSTRYTFITVIIMAVAAAIGLGLLFTRMIVGRVKRV
AAALQSGARGDLTAAVPVDSRDELGTLGDDFNTMADRLSGMLTSVRKAVG
ELHALSDKIAVAAGKVTEGEEIQAKGVNAASSAISEINASIREVADGVAN
LSLSAADTSSSILEMTASVEEVAINADNLSRLVDEVSSSVIQMAASIKQI
DGSVQSLMEISTTTASSVAQMDTAIGQVEVNARETTALSQDVEREAERGK
RAVEDAIAGIVAIQRSSRITTEVIDVLSRKVEDIGGIISVIDEIAEQTNL
LSLNAAIIAAQAGEHGRGFAVVAGEIKDLSDRTRTSTREIAEVIMGVQSE
TRRAVEAISRADKSIADEERLSANADEALGKIVMRAREASSRVAEIARAT
VEQATGSKIIRDAINRVTDMTSQIASATSEQGAGGDLIMTAVERMRDATA
QVRNSTREQSATGNIIARSTENITDMIGQFRSASEEQFRGSEQIVRSMEE
IQQSATMSLEVSRVMEEAAITLSRQVKVLETEMEGFHIRGQSSSR
>GSU3197 purine-binding chemotaxis protein CheW, putative
MSDRMDRYVIFTVTDRKLALPLDKTAEVLEGAATHPVPAVPPLLGRALNV
HGRIVPALDLATFLHGGAIGRNHAFLALNHHVADLALLVEGPVAIVSVEE
SDRSPSEHPRFEGYLEIAGERVGILATGRLIEEVEEIL
>GSU0296 chemotaxis protein CheA, putative
MAIECEDQELLEGFLTETTELLEKLDDDLVALEKAPSDADLLNGIFRSIH
TVKGASSFLGFELLVKVTHKTEDVLNRLRRGELIVTPEIMDVILEAVDLV
KVLVADIKGGDIVDREIDGTISKLIPLLSENAKEATVLKAPAAQPETSAP
AEQSAEETASASEQSADASAPTETAPPPPPEKAASAPAVRPQNAPAPAKD
DKKADDLADNSTVRVDVKRLDDLMNQVGELVLERNRMMQLNTDFQGDSGD
SSFGEEFAKLSKRISFVTSELQMQVLKMRMIPVEKVFKKFPRIVRNLARD
LGKEVDLTVLGEETELDRSVVDEIGDPLIHLIRNAMDHGLETPDERVAAG
KPRKGTLILSAAHEGNQIVISIKDDGRGVDTDKVARKAKEKGLVTDEQLA
AMGQRELLDLIFLPGFSTKEKATDLSGRGVGMDVVRTNIKKLNGIIDIRS
ELGRGSEFILKLPLTLAIIQSLLVEVEDETYSLPLAAVLETLRVDEKEFH
TIGGQEVLKLRDSVLPLMRLQRIFNIAPGERNRSSCYVVVVGVAEKRVGL
VVSRLLGQQEVAIKSLGKYLANLPGIAGSTILGDGRVTLIIDPAGLIENS
DGSGGGRVAA
>GSU1304 methyl-accepting chemotaxis protein
MSWKNLKLSLKVLAGIGSVLVLLAVIGVWAVNGLSKVVRDGHEVSEGNKL
RAELLQREVDHLNWAKNVSTFLLDGKVRELTVQVDHTKCKFGEWYYGEGR
KQAEAMLPALKDELAAIEEPHRKLHESAGLIKKAYNKEQGEQGRKDAEII
FASQTQPNLQLVQKHLAGLNETSRKNILSDEQMIANANSTKTAVIALSIA
ALVIGVVLALLISRSISIPVLKGVEFALKIADGDLRSTLDIDRKDEVGQL
VAALNDMVAKLRDIVTDVKNSADNVAAGSQELSSSSEVMSQGATEQAAAA
EEASSSMEQMAANIRQNADNASQTEKIALKSATDAREGGKAVAGTVSAMK
EIASKISIIEEIARQTNLLALNAAIEAARAGEHGKGFAVVAAEVRKLAER
SQKAAGEISELSASSVQVAEEAGEMLARMVPDIQRTAELVQEISAACKEQ
DSGAEQINKAIQQLDQVIQQNASASEEMASTSEELAGQAEHLQSTITFFK
TDEQGRAAGRSPAVRPAAVAKKPAALRLGHGNERRTEPVAPRKAVAGKGV
DLKMDGDYLDDQFEKF
>GSU3200 chemotaxis protein, CheC family
MKFDALTEEHLDALKEVSNIGVAHAATALSQLIGKGITLQVPKVHLMKIT
EVPEAFGGAERIVVGIYLQMLGDARGNILIVLPRESALKLLSRLLPREKS
EGSLLTELEISALKEVGNILASAYLNALGALMRKTLIPSVPVLSFDMAGA
VIDYVLIELGEVGDLALMVETEFFGEEEKIGGQFFLLPDPESLRIILDAI
GVKL
>GSU1033 methyl-accepting chemotaxis protein
MKIRTKFVVVNLLIVCCALAAVAAACLVEFNRELRRQAVTSQEIRLKTFW
ELLRQKGDGFTVADGKLMAGSYVINDNYELPDKLKELTGGTATIFMGDTR
VSTNVLKPDGSRAVGTKLQGAAYDAVIKEGKPYRGEADILGVPYFTAYDP
IRDSRGEVIGVLYVGVKKGDFYASYESLKLTVVGIVLVIVLLAAVASKVI
IHRLFTPLNRMHDVLRDVAQGEGDLTQRLDYLAQDEVGDMSRSFNSFMDK
LHGIITHVARTVEQLASSASQVHGSAEQMAAGAGEVASQAGTVATAGEEM
AATSTEIAQNCAMAAEGARRASSTATAGAEVVGNTVTVMDRIAEKVKNSA
RTVERLGERSDQIGEIVGTIEDIADQTNLLALNAAIEAARAGEAGRGFAV
VADEVRALAERTTKATREISGMIRAIQAETLEAVSSMDEGVRDVETGTAE
AARSGEALREILDQITAVSMQVNQIAVAAEQQTSTTREISGNIQQITEVV
EGTAQGADESACAAGGLNRLAEDLQRMVGQFRL
>GSU0756 methyl-accepting chemotaxis protein
MAAWQSLKVKYKIFTLILVCCVGLIAVGLLGFGGMRSMGKSLGELNEEQK
SVATLSAMKNDFLEMRLAIVYMLALTDSAKLAEKERDFGAAAARIKERLA
SLEHHNFAAEEKKKITEFRDGYEAYLAEGTKSAAMAKAAAETGNAAGREE
AVRYAVTTAAPLYNKPAQALAELVEISIKEGGEVYDADMASYRRSVLVMG
VILMVVVAVSAVAGLAIASSISGPLNRILEVLQRVAAGDLTARAEIDSRD
EMGLLGRELNVTAEKIGKIIGQLAQAAGSVASASAQLHATAEQMATASEE
VAAQAETIATAGEEMAATSNDIAHNCVTAAEGSTQANDAAEGGAQVVQAA
IAAMDRIAERVHASAKTVEGLGVRSEEIGEIIGTIEDIADQTNLLALNAA
IEAARAGEQGRGFAVVADEVRALAERTSKATRQISEMIRAIQHDTQSAVH
SMEEGVSDVQAGTAEAARSGQALQMILAKIGDVTNQISQIATAAEEQTAT
TGEISNNMHQISQVVQDTARGAQDTVAAANSLSRLSEDMQGMVQQFRLA
>GSU0404 conserved hypothetical protein
MSLNSDVASSSKLSEEQLAGYIIGATKEVFGTMVMMEPQDQYPLREPVTT
FHCSVTGMVGLAGTYSGILSIHCPLELALKVTSNMLGMEVEEVGDDVNDA
LGEIANMLGGYVKMALSKGGTDLHLSVPTVISGEEYTLNAMADNDCVIIP
FIIDDTRFLVGLKLQKEA
>GSU0401 methyl-accepting chemotaxis protein, putative
MQLTIKQRMGMTVGVTLLGMVVIIVFMVVGFTKVHRQQELMDRLTLINNT
ALRGNIAMLKAREYEAEFFDRKQDKWVPRVKQAVDQVNKELDVILKNTDD
PKIKGWAESARKLATQYVQQFQELASVALGSNFQGAELAETREELRDILN
EFEPLLDNYIPKQVGVAYQAATEEMDRSIAAIRLQIFGAVLLVAVAMLVS
ISSTAIYLLRSLRLINDRLRDIADGDGDLTKRIELQSRDELGTLAVSFNN
FVGKLHDIIAQVSQGTLQVASASYELQANAEQMAHGAEAAATQVNTVASS
SEVLAASTFEISSNCGTVAESSRRANDSAQTGAVVVEKTVDIMARIAERV
KDSARTVESLGARGNQIGEIISTIEDIADQTNLLALNAAIEAARAGEQGR
GFAVVADEVRALAERTSRATREISQMIKGIQGETRGAVLAMEQGVKEVEL
GSEEAARSGEAIRTILEQFRTLDCQVGEISAAAEDQTRVTTEISTNVMQI
TEIIETTAKGAADSAEAAQGLAELSDQLKQIVGRFKLSV
>GSU1777 hypothetical protein
MKSSSVTALILRIDRRFRGDSRGLSLIELVFTVAILGILAMAVVPFTQMA
AKRSKEIELRRNLRVIRTAIDDYKKDYDKAIKDKKIMDVANRSGYPESFE
KLIEGEDFGGLYAYKKKYLRRIPVDPFHPPEVGEPPKWGMRSSVDDPESD
LWGGEDLYDVYSLSDGIAIDGSKYKDW
>GSU0327 general secretion pathway protein F
MPTFRYSAYTAGGRETSGTIEAESLKEAKLHLKRDGLYPRDIGPVSETAG
TVSRRFGGRNAGPAQVALMTRRLATLVGSQVPIYEAVTTLWEQEEPGEIK
KALGRIRERLAEGANLAKALSLEPRLFSESYVAMVAAGEASGALDAVLER
VALFLEEQRAIRSKITASLAYPTLMVLVGSAVMLFLLAFVIPKIVTIFED
NRAALPLITIALIKTSTFLRSFWWACIAAVAGVVLLYRRLMKDDAFRLRR
DRFLLRIPVVGSLLRQLILSRFAKVLGLLLSSGVPVMRALEITAQVVVNR
HYRAALTGVTAGLAEGGTLSGALRTTGLFPPLLVHMVAVGEKGGELEEML
GKAGSAFEREFESSVSGLMALLEPLLVLAMGLAVGLVVVAVLLPIFELNQ
LIR
>GSU1066 hypothetical protein
MSTDNTTLRKFALLAAAMAFAGIVASLALAAVSQIPLFLLNISQPNVMIL
LDNSGSMDIIMQHSAFDPTARYSGGFDNDRTYYQTTSNGYHYLSTGNDYI
RDDKKGNFTKNSVTIKLPLPYDDTRWDGNYLNWLFYHATSSQRSTVSTDA
TLQKTRIQTARGVISNLVKTVSGVRFGLAKLNVDGYDRFDRKQTDGGSIV
RNCGDLTSANVDTSVSGISAETWTPLGEALSEVWQYFKGGTSLYNTGVSY
TSPITSSCQKSFTIVVTDGEPTYDGCYRGDFSSYGCDNAADADSHLADVA
AHMNGSDATSAYGGTQSVTTYTIGMTIDSSLLRTTAENGGGSYYTTTSGM
DLATALQNAVNEILGRQSSASAVAVSTAYLTSNTTLYRARFDSTDWSGYL
EAYGINKANGAVTGYPNSPKWEAGALLNANSARTVYTAGVQSGVYRRVDF
TSTNAATLAPAGFMNFSSASTASMIGYVRGDVEPAGYRHRASKLGDMVQS
APVILGPPDGYYSDNNYATFKRNNATRQSLILAGANDGMLHAFNADTGAE
EWAFIPNILLPKLKLLRATPYTHTNYVNGAITVGDAFITAKGLDGKSETS
SSWRTIAVCGLREGGKGYFALDVTDAANPIPLWEITNTSPSETSGTVVGL
GYSFGTPLIVKLKDSSQSGGFRWVALLANGYEGTTSGRAATLIVADLATG
AVIREIVADASTFSGVSPNGLATPAAIDRDADGFVDYVYAGDLTGHLWKF
DLSSSNSNNWDVVWKRSGTPVALCRAKTAAGSVQPITTAPDVVLRGGYQI
VFFGTGKYYESTDISSTQPQTFYGAYDYNSTTTPTSAQATNGALLTRADL
TAQTVTRIDESGTSWRTSSNNPIGLTKGWYLDLPVAGERVITDPVARSRK
IIFTTFIPNTDACSFGGISWLMELNMDTGGEVVRPVFDVNLDGKVDYSDT
VLGDLKVKPTGTLLGDGLASTPAIVGAGDEHEYKYITKTTGEIIKLLEGG
GHSQIGLRSWRQLK
>GSU0400 methyl-accepting chemotaxis protein
MQIKRYRNWGILPKIMTISGITVVLIAALVLFVLLPLIGEKMMDGKKEKT
KSVVEVAYNLVADLGERAKRGEISEEEARKRAIDHVKQLRYQGKEYFWIN
DLTPRMIMHPIKPELDGTDLSENKDPRGTYLFREFANICREKGEGFVPYL
WPKPGASEPVEKISYVKLYEPWGWVIGSGIYVDDVRADMARLRWVVLGGT
ALFGLFALSLAFSVGLGVVRPLRHAVTSLQDIAEGEGDLTRRIAVEREDE
SGELALAFNRFVEKLQGIVGTVANNALQVAAAAGQVQEASRQMAEAAENV
AGQAATVATASEEMAATSMEIAGNCVSLADGARHASETAESGAAVVQETV
SVMGRIAERVKEAARTVDSLGSRSDQIGEIIGTIEDIADQTNLLALNAAI
EAARAGESGRGFAVVADEVRALAERTTRATREIALMIKAIQNETRGAVAS
MDEGVREVEKGTGEAARSGAALREILEQIGSVSLQISQIATAAEQQTSTT
TEISGSIQTITDTAHETARGAQESAGAAGQLADLAEQLQNVVMTFRLSA
>GSU1144 chemotaxis protein CheD, putative
MTARHPKLPHVYLKPGEFHFATKPTVVTTVLGSCVSVTMFDVFSRTAAIC
HALLPDGPRDDVFRYVDSSIIRMLEMFMSRGITPRQLQVKLFGGSDMLGA
TASRPGVGSRNVDIARQVLAAEGLEVAAADVGGTRGRKLFFYTHTGEVLL
KRLNRTEADS
>GSU1776 pilin domain protein
MLKRFRNRKGFTLIELMIVVTIIGILAAIAVPNYRWGLIRAREAVLQENL
YTMRSAIDQYYADQGKYPDTLDELAEKRYVKSIPNDPFTGKNDTWVTVKP
IEPTGEIHTTEEIKGNVADVHSGSDLVSSKGTPYKDW
>GSU2423 methyl-accepting chemotaxis protein, putative
MKKSGAKPAEMTLHREIQRLAEAMMNGRLDERGDPAQFTGDDAALVLMVN
RMLDTLVTPLRLAAGAIDEIAHGRIPPFVIDDYTGEYNNLKRNLNTLLAT
LYGLHSETQHLVGNIGEGRLQTRGNDWDFEGIWRDLIKGVNGTLDAVIDP
VNEAGTVLRHLAEYDLSARMRGKYHGEHAAIKKAMNSTAESLHSAVTQMT
ETVELVSAVGGQIADSSQLVTQGAREQEAQLTETSNVLDHIAATSQKSAR
STIDARQAAGGSAESIKTAKSVMDQMLQAMGQIRSAADNTVGIVQQIDSI
AKETDQLSSNATSKATLIRSSANGFSVVASEIRNLSKRCEDAVTRLHDFR
RRATLTPNGGSGDTDDALECEYLELIHELKSVASSSGLLGVNAAISAAHV
EGAGNDFQVLTEEIRQLAKRSTDAARQTDTLIKTSVEQARRGEDLSRKID
VHLTEAVTGATTICALTEDISQSSQEQASAIEQISRSVNHITTITRQNAD
SALKSSEVSHKLGQQMTKLTSMVSKFRLDNAAC
>GSU2030 type IV pilus biogenesis protein PilO
MDARIEKLLKLPNKQKLALLAAILVVEGAALYWGLYAPRQKELTALRGKL
EKLQTEVQEKTRIANNLPKLKKEYQQLQKDLENALTELPNQKEIPSLLTG
ITSVGKGAGLDFLLFRPKGEVPKDFYAEVPVDISVSGSFYGVANFFTAVG
NLPRIVNITNVSFTDIKPVGGKTTVKVNCLATTFRFIEKKETKDDKKK
>GSU0132 conserved hypothetical protein
MALSFTVKEATFKEADLVGHIVESVKKIFSTMIFIDDIVDEYPLAKPESH
FIASISGMVGLGGDFSGMVGIHIPIEFAKEATASMLGMELDELEGEDDIH
DAVGEITNMLAGEIKMLFSANNLTVGLSTPSIISGSDYTVEVVSSGAAVV
VPFNRNEHRFLATLQIEA
>GSU3120 conserved hypothetical protein
MKKQLCMIALVGATAFGTTGTSWSLDGPPPPEPPPMGKGQEHFLERMASV
LKLTDAQQAQIEALISTDAEQNAPLHRQLAENERALREATTAASLDEATV
RALAATKGNLMTEMIVSRAKLRNAINAILTAEQRELADRLDPLKYGPPRP
RPDRPGME
>GSU2029 lipoprotein, putative
MTRRNSIPILAVLVALTFVSAGCGKKEQAPSPPPPPQKASPAPKAQPPVQ
GRATSAAIAPVAGLSQYDFANRRDPFKPFLQAKAPEKTRAVRGSSAGLLP
IQSYNVEQFRISGIIVGLKESKALIVDPAGKGYVVKEGMSIGANNGVITK
IAPSYLEVNERYTDDFGKVRKRTVKLSLAKKQ
>GSU3027 chemotaxis MotA protein
MDIATIIGLVMGFGAVFGGALLEGLHLTALIQPTAAIIVLGGTFGAAFVS
FPMKTIIGAAKDIKKVLMPAQNDPEKVIKDLIGYAAKARRNGLISLEQEA
QNVKDPFTKKGISLVVDGIDPQKLRETLEIEVTYYEEHAKQSAEFFEAAG
GYAPTIGIIGAVLGLIHVMGNLSDSSKLGAGIAVAFVATIYGLMTANIIC
LPFGTKIKHGIKEELICKNMIIEGLIAIQNGENPHFIEQKLKAFLQHGAG
DKKG
>GSU1493 type IV pilus biogenesis protein PilC
MPKFNWEARSRTGSVQKGVMEAASAAAVEAQLKKYGFGSISIKEEGKGLS
MEIKLPGFAPKVETKDLVVFTRQFATMIDSGLPLVQCLDILSSQQENKTF
KDVLIRVKESVEGGSTFADALSKHPKVFDQLYVNLVAAGEVGGILDTILN
RLAAYIEKAMKLKKQVKGAMVYPTTIMAIAVIVVGVILIFVIPTFAKMFQ
EFGGELPGPTKFVINLSNFIVKYILLIIGLIFALIVGFKKYYATTGGRKK
IDAFALKAPIAGPIIKKVSVARFTRTLGTLISSGVPIMDGLEIVAKTAGN
KVVEEAVYKVRQAISEGKTMAEPLQECGVFPPMVVQMISVGEATGAMDAM
LSKIADFYDDEVDEAVSAMTALMEPMLMVFLGTTVGGLVIAMYLPIFKLA
GTVGG
>GSU1290 cheA-1, chemotaxis protein CheA
MDAHRQAYREEAYELLAELESSLLELEENPEDLDLIGRVFRAMHTIKGSG
AMFGFEDIATFTHEVETVFDKVRNGQMTVTRELVNLTLRARDLIKGMLDA
SEGGDPVEGREAEEVIAGLRALVPAPEVRDPLPMESVHAPLDTEGKGDAA
VTYRIRFIPTPEITVNGTNPLLLLAELRQLGACRVVAQMERVPTLEECNP
ELCYVYWDVILTSRRGVDAIRDVFIFIEDDCELKIDTIDDGGVLDTDSDY
KKLGIILAERGDLTRQDMEAILARQKRFGELLVEQGILQPEKVESALIEQ
QHVKEVRKERQAQESASSIRVPAEKLDILVNLVGELVTVQARLSQTAAGR
DDALLVTIAEEVERLTNELRDTALNIRMLPIGTTFSKFKRLVRDLSVELG
KDIELTTTGAETELDKTVIEKLNDPLVHLIRNSIDHGIEMPEAREAAGKP
RQGTVHLAAVHSGDSVLITITDDGAGLDKEAIRAKGVERGLITASAELTD
KEIYNLIFAPGFSTAKKVTSVSGRGVGMDVVKKAIDALRGTIDIASERGK
GSTITIKLPLTLAIIESLLVKIGTDCFVMPLSIVEECIELTREDVANAHG
RNLANVRDQIIPYVPLRERFRIQGEPPEIEQIVITSIQGSRIGFVVDDVI
GEHQTVIKSLGKMYKDVKGLSGATILGDGSVALILDVPHLVREVEREQVA
R
>GSU2222 cheA-2, chemotaxis protein CheA
MTNTHDASGRAVDDFLAEAEEIVEKLNTDLVTLSDCADSGECDPDLLNAI
FRGSHSLKGLAGMFGFTEIQSLSHNLENLLDSLRLGKIPLTPETMNVLFD
SMELLSGIIRSIGSGEDHSAAIEDAVSRLNACASAREAQEVSPLRELGMP
EKVLSSLTEYEEHRLLENVKKRRNIFSIHASFSLMSFDQDLGEISDTLKK
EGEVVSTLPSASTSPESFIDFDILFGTDLDEEGLTALLDRDGLTIIHYGS
AAQPEERSALPVAAPASPPPSSPLPSVPVLATPATMPAGIDDQLTAKSMS
RTVRVDIGKLDELMNIVGELVLSHSTIADITTRMRLAGFSSLAIELGKAA
KGLDRKLTELQKGVMEIRMIPVGQLFEKMSRIVRKVSREQGKKVDLKLFG
ADTELDKLIIEDISDPMVHIIRNAIDHGLETPEERIAAGKPERGTIRLSS
YQKGNHVVIEVADDGRGFNIEKVKQKALEKGLIKTLEGVSDRDALDFIFL
PGFSTADKVSELSGRGVGMDVVKSNIAAVSGMVDIESEFGKGSRIIITLP
ITLAIIKALLVLTADRTYAIPITSVLETIIVEEREILTVERKEVYQLRET
TLPLVRLERFFKVKRETPPPGSFYVVVVGVAEKRLGIIVDDLVGQQDIVI
KSLGDMFKGYKGISGAADLGDQRTILVMDVGGIIGEALRSGG
>GSU3199 cheA-3, chemotaxis protein CheA
MDMSQYRDLFVAEAREHLERLGEEVLALEKDPANGERLDSLFRTAHSIKG
MAGSMGYDGIADLSHRMEDLMDRVRKGRIPFGRDIADLLLACADQLGRMV
EDVTGGGNGSLDATDLCARLALVAGQEAAAPAAPADAETSPSPQPSDQPE
PARRDESDGARTVRIRSELLDRFVNITGELVTGKNRIMELAAGLESEPLR
DAAAELSKLVRDLQREVMSARMMPFGTICDRFPRMVRDLARRSGKEATLA
IDGKDQELDRGILEILPDPLLHALRNAVDHGIESPEERSAAGKGAGGRIV
LSVRREKDHLDVTVTDDGRGMDPAALVNAALAKGIITPEEAATLSRQEAL
MLVCRPGFSTARSVTEVSGRGVGMDAVQAAVSRAGGSLSIQSERGRGSRI
TLRLPLSVAIIQVLLVGCGPLTMAVPVNAVRRTVELDRRLQRIEDGRAVF
DLGGETLPLVDLGLLVGTGPTAGGDFSPVLTADVAGRTMGFAVDRFFGQA
EVFTKPLGTPLNRARGLAGGAILGDGRVIFILDLPNLVDGATSRRRVFMH
PDGAHKGGTTA
>GSU0293 cheB-1, protein-glutamate methylesterase
MLQNQTRKLRVLVVDDSSFMRMVIRSVLEKDPAIEVVGIAVDGMEGVEKA
LALKPDLITMDIEMPRLDGISALKQVMAKCPTRVLMVSTLTCEGAKATFD
ALDAGAIDYIPKNVTDSADAQRVFREELLRKVKGAASSIFGRPMTTSAPR
AIIAPPRQVQPAPRPSQALAGKFHYVGIGASTGGPVALQEVLGRIPGNYP
HGIVVAIHMPKAFTGPYAERLNSKCSLQIKEANDGDIIQPGVVLVAPGGR
HMALARQGNSIVVRTLSTAECPQYIYIPSVDHMMTTLADATNGSALGVIL
TGMGSDGFKGMKHLKSKGGITIVQDEATSTIYGMPRACIEGGVADTVLPL
TQIGSEIARLGG
>GSU1145 cheB-2, protein-glutamate methylesterase
MKKIKVLIVDDSAVVRQTMADILASDPQIEVMATAADPFIAAERMKSDVP
DVITLDVEMPRMDGITFLQKIMSQHPIPVVMCSTLTENGSETAMKALEYG
AVEIIQKPKLGTKQFLEESRVRICDAIKAASQARLRKIPARPHAVAPKLS
ADVILEKPASRAMIQTTERVVVVGASTGGTEALRVFLESFPADCPPIVIV
QHMPEGFTRAFAQRLDGICRISVKEAADNDSVIRGRALIAPGNRHMLLKR
SGARYYVEIKDGPLVSRHRPSVDVLFRSAARYAGKNAVGVIMTGMGDDGA
SGMREMKDAGAMTIAQDEASCVVFGMPNEAIKRGGTVKVLPLESIAADVI
RHCS
>GSU2214 cheB-3, protein-glutamate methylesterase
MRKIRVVVIDDSAYSRRAITKMLESMPEVEVIGYATDGEEGIRKIIDLKP
DLVTLDLEMPRMDGFTLLRIVMEYSPTAVIVISSRSEDEKVFRALELGAV
DFVAKPTKGVSEEILTIREDLHRKVRGVIHLNLAGIVRREREQERASVAA
GRRTSGSAPYAKAAVRTESTAPRPAGRLEVVAIGASTGGPPALQRILCAL
PGAFPQAVVVSQHMPAGFTRTFAERLNRLSPLEICEAADGDEVRAGRVLI
APGGHNMVFERQGSEVRARIVKPGTDDRYVPSVDAMLLSCAEVFGPRTLG
VVLTGMGNDGSKGVAAINRAGGQTLAEAEETAVVFGMPKEAIATGVVDKI
VSLDRMSREIIQRCGLLSDVD
>GSU0295 cheR-1, chemotaxis protein methyltransferase CheR
MKLSDKDFEILRDFIYNHCGMYFHASKKYFLESRIARRLEATKCSDIHGY
LGHLKGGVGKAEELTKLLNEITTNETCFFRNPPQLKALENVFLPEIVATK
GKIGFRKIRIWSAGSSSGEEAYTMAMMLLEKRSTILKDWIIEIVGTDISE
SVLAQAREGIYNSYSVRNTPDFYLKKYFREETGGRFLLSPDVKKLVSFSH
LNLYEDSKMVFMKSFDFIFCANVLIYFDLASKTKVVQHFYNNLQPYGYFF
VGQSESLHGVNDKFKTVHFPGGFAYKK
>GSU1143 cheR-2, chemotaxis protein methyltransferase CheR
MDGGGMFSAATDRITTAAMTDREFARFSEFIYDTCGIKMPPVKKTMLEAR
LQKRLRKLGISSFKDYSEYLFSRTGTETELVHLIDVVTTNKTDFFREPAH
FDYLVSQALPELMERTGAGLRKPLSIWSAGCSSGEEPYTLAMVLSEFSEQ
QNISFSILATDICTTVLDKARLAVYDEERIDPVPMSLRRKYLLRGKGEQK
GLVRVVPQLRHRITFRRLNFMDGDFGMREPMDIIFCRNVVIYFDKATQER
LLNKFYRQLIPGGYLFMGHSETLSGLDVPFVQMASTVYRKPL
>GSU2215 cheR-3, chemotaxis protein methyltransferase CheR
MFTFEPEIPMSDEEFRLIRDLIYSHCGLFFDSDSAYLLEKRLAKRVQLHQ
LSGFRDYYHFLRYNRKKDQELSDIMDVLTTNETYFFREAFQLKAFTQEII
PEIRDAKAKAGDRTLRIWSAGCSSGEEPYTIAMLLLEMGGFAGWHVEIIG
TDISQRVIQQARKGVYGKSSFRVTDEGYVRRYFTEQDGMFRVNDRVRELV
TISHLNLLDTNRIALLGRMDVIFCRNVIIYFDQAARKTVIDSFHRVLRDG
GYLLLGHSESLMNISTAFALKHLKNDMVYQKPLPPGGRP
>GSU0879 cheV, chemotaxis protein CheV
MAEPRILLESGTNELEIVEFMIDETGPNGETVHSYFGVNVAKVREIIRKP
QMWKVFNANSAVSGMMKLRDKVITVVNLATVLGKEYSALAPDRVVVLEFN
RMMVGVLVNGVSRIYRISWEQVEPPVRAIESAYVTGVVKMEDRIILILDF
EKIVGELCSEETLRALSEEQLLPGPVLDRSQRRILVADDSAFIRNSICSS
LRGAGYNVDEAENGEDAWNMIQDKLTRCRAAGVNLRSELDLLITDVEMPK
MDGLHLTTLVKKDDVLKDLPVLIFSSLASDDNKRKWKDLGALDIVTKPDL
PNLVKIADSVMH
>GSU1299 cheW, purine-binding chemotaxis protein CheW
MAHADTTETRQYLTFTLAGEVFGVDVAKVREILEWSSITKVPQTPEFMRG
VINLRGSVVPVIDLRQKFGMPETERSINTCIIVVEVETGAETLVLGMLAD
SVQEVFELEGVNIEPAPRIGTKLDTSFLKGMGKRGDAFLMILDIDRVFGG
DDLAGLAAAGERAA
>GSU0297 cheW-1, purine-binding chemotaxis protein CheW
MQNALQTRVDETRNELIQLVSFKLEEEEYGVNVLKVREIIRMPSITRVPN
TPHYVEGVINLRGKVIPIISMRKRFGLPEGENSSQTRIMVMDMEGELMGF
VVDAVSEVIRISESEIQPPPAVVNSAVEQECLSGVINQTERLLFFLDLEK
LITQDERRLFSGMI
>GSU0684 cheW-2, purine-binding chemotaxis protein CheW
MAHADVTETRQYLTFTLAGEVFAVDVAKVREILEWSSITKVPQTPEFMRG
VINLRGSVVPVIDLRQKFGMPETERSINTCIIVVEVETGAETLVLGMLAD
SVQEVFELESGNIEPAPRIGTKLDTSFLKGMGKHGDAFLIILDIDRVFGG
DDLAGLAAAGEAAA
>GSU1142 cheW-3, purine-binding chemotaxis protein CheW
MSVTTITETRQYLTFKLDDEVFAVDVAKVREILELTSITKVPQTPQFMRG
VINLRGSVVPVVDLRLKFGMSETAPTVDTCIIVVEVAHEHETLVLGALAD
SVQEVFEMEPGQVEPAPRIGTKLNTDFILGMGKHDGQFIMILDIDRTFTS
DELATAGSVSGEAA
>GSU1301 cheW-5, purine-binding chemotaxis protein CheW
MAHADATETRQYLTFTLAGEVFGVDVAKVREILEWSSITKVPQTPEFMRG
VINLRGSVVPVIDLRQKFGMPETERSINTCIIVVEVETGAETLVLGMLAD
SVQEVFELEGVNIEPAPRIGTKLDTSFLKGMGKRGDAFLMILDIDRVFGG
DDLAGLTTLEQTAA
>GSU2218 cheW-6, chemotaxis protein CheW
METDIQEIQLACFRLGDATFAADIMRIKEIIRPQKLTKLPKAPAFVEGVI
NLRGMVIPVIDLRKRFELPERVALEEARLLVVGVSRQLVGLVVDDVTEVV
TVQVGDIKPPPHSIDGVSAEYLIGVCLVRDTLVMLVNLDRILTSREASAI
AGLAGTGR
>GSU2220 cheW-7, chemotaxis protein CheW
MEDLVSDTEAEPARETETYLEVLCFRVADETYGIDIMELKEIIKPRETTE
VPHSPPFVAGVLSLRGIIIPVFILRERLGLADAVARGKERIVVVKHGEGL
CGLLVDEVTQVVKIPVATIEHPPAVLDGMDREFVNGIGRHDGGIIILLQL
EKVLDSALM
>GSU2416 cheW-8, chemotaxis protein CheW
MTTLPTLSTGSATASTPLQVVVFSVGSEEYCFEILKVREVIRTVPITAVP
SAPPHVEGIINLRGAVVPIIDFRKRFNITGECTVDESEKVVIVAAAGSTT
VGFTVDALSQVLKVPREAVSTPPTGVSDQGGEAITGVASMGDRLIIVLDI
ERLFSEDELTNLTGVA
>GSU2578 cheW-9, purine-binding chemotaxis protein CheW
MAHALVTNVSVGPSDELIQLVSFTVDHEEYGVDVLKVREIIRMSTITHVP
EAPPYVDGIINLRGRVIPIISMRDKFGLADVASDNRTRIVVMGVGDALLG
FRVDAVSEVIRIAGNRIQPSPALLSSGGEQEYIVGVVDQGEKLLVMLDPD
KMLTPRELGQLGDTSRLC
>GSU0407 flgB, flagellar basal-body rod protein FlgB
MPVDSMFGTTINVLAKAVDLRARNHTMISANLANAETPNYTPKALSFEKE
LGAALKGNKTGSPATTHPRHIPLKGQAASVQSVEGTVIETPAPTPGRDNN
GVELEAEMSRMAENQIMYNASIQILTKKFEGMKYAIKGQ
>GSU0408 flgC, flagellar basal-body rod protein FlgC
MDFFSAMQVSSSALSAERTRMNLISANLANANSTRTAEGGPYKRKDAVFA
ATQVGEGFRSALDRMRKNAPQGVQVTGIVEDPNPPRLQYDPSHPDADAKG
YVALPNVNVVEEMADMIAATRAYEANVTAMQAAKNMALKTLEIGSR
>GSU0417 flgD, flagellar hook assembly protein FlgD
MVYGVTNDTAAAAAAMKKSTGMNKDDFLKLFVTQLQNQDPLNPQDSTEFI
GQLAQLTQVEQAYNTNSNLSNLLNLVNGATSLSAVSFIGKEITASGDLIK
LTSGTQPTLGYRLPATAQKVTIKIMDDTDTVVRTLTLGGTQAGDGSITWD
GKDEKGNALPDGRYTFSVTGTNAEGKDFDGAPLLLGRAEGVMLEGEEPYI
TIGGINVPLGNILSVKGA
>GSU0419 flgE, flagellar hook protein FlgE
MSVTSALYTGISGLNANGEAMSVIGNNISNVNTIGFKQGRMLFSDVLSST
ISGGSQIGRGVQIQTVENQFTQGSFESTESGTDLAIQGDSFFVVQNTSGR
YYTRAGAFSFNKDKTLVNPEGYQVMGYGIIPSSGLSDGVLKPIDLTNFAT
TPPKQTSTVKFVVNLDSTQTTPTLAWDPANPVATSNYSTSLSVYDSQGNA
HTATVYFRKTADNAWDWHVILPDAAAGTPGSTTTPIDGTLTFDATGALTA
QTPLAGAAQNITFAGGVTAPQPIFFDLGVGATTQYASSSVVSSQTQDGYY
QGTLTKVTIDDKGYVNGVYSNGQLQKLYQVALAKFSSTAGLSKAGGTLFE
ETLESGQPLFSDASTPGVGKILANSLEQSNVDMAAQFVKMITTQRGYSAN
SKTITTADEMLQEVLSLKR
>GSU3051 flgG-1, flagellar basal-body rod protein FlgG
MIRALWTAASGMQAQQTNIDVVANNLANVNTAGFKKSRADFQDLMYQNLK
TSGAPSTSSTQVPSGIQIGLGAKLAAVTKLFSEGNINQTGNELDIAIEGD
GFFQIQMPDGTTTYSRAGSFKRDDQGRVVTSDGYPMLPELVVPSNATSIS
VGNDGTVSVTQAGQTSPTNIGNIQLATFSNPSGLTALGRNLFQESDSSGT
PTTGTPGQNGIGTLAQGFLEMSNVSVMEEMVNMIVGQRAYEVNSKAVQAA
DEMLQQANNLRR
>GSU3052 flgG-2, flagellar basal-body rod protein FlgG
MNSGIYSALSGNIAAMKRLDVLSNNLANVNTPGFKKDRMTFESLLQAAGK
VPQAGTTDAPVYSETAFFTDHSRGSVSQTGNTFDLAIDGDGFFVVNTPEG
KAYTRQGNFKLDATGKLVTADGYEVSGGGPIVINGSRVEINARGEILVDG
SPVGTLEVVDFPKPYALQKNGNALFVPTDPQAVPQPVQGEPVRQGYLELS
NVSAIQEMVQLIETQRFFEMCSKAVKAYDDMAGKAANEVGKI
>GSU3048 flgH, flagellar L-ring protein FlgH
MSIICLALAGCAVEKTEVRTPTFDEQLRPAPPSYANGSIWQASTTGLAVD
HKARSRGDIITVLIVEQASASKEATTDTERKAEVSASVPYLMGLEKSSTL
FSKLTNANPNNLLGASTNSKYEGSGATTRKENLLATMTAKITDVLPNGNF
LIEGRRNVKVNNEDQILVLQGTIRPRDVSPDNTISSTMIADARISYTGNG
VISDRQRPGWLMNILDYIWPF
>GSU3047 flgI, flagellar P-ring protein FlgI
MDKPMKRIFVVLVILLVLPQLALAIRIKDIASFDGVRDNQLIGYGLIVGL
NGTGDSDQTKFPVQSLANVLERMGITVNRDDIKVKNVAAVMVTAELPPFS
KQGTRVDVLVSSLGDAKSLAGGTLLMTPLKGADGQVYAVAQGGLLTNSFS
YGGQAATAQKNHPTAGRIPNGALVERELPNVLADRSQLRLNLHQPDFTTA
TRIARAVNEQFKAGVASCNDPGSVVISLPDAYQGRVVEFVADMERLEVRP
DNPAKVVLNERTGTIVIGENVRIDTVAVSHGNLTLLIKETPRVSQPQPLS
RTGETVVVPRTGIKVSEESGGLAVLREGASIGDVVRALNALGVTPRDLIG
ILQAIKAAGAMQAELSVI
>GSU3043 flgK, flagellar hook-associated protein FlgK
MSILSLLDIAKSGITAQRLALEVTSENITNVNTPGYSKQTTVFTTATVSQ
ERGFPLGNGVRVAEIQRAYDDFLQMQLKSENTTKGWSDTVLASMTRAEQL
FNEFTTDGLGKSLQDFFSAWQDLTANPQGQPERQAILARGQQLADQFKRV
NSYLNDIRTEANQSLEGVTADVNDKLRKIASLNEQIKQIEVQGARANELR
DQRDLAVRQLAEKVGITYMEQTDGTLNVSLSLGQPLVLGKDAAVLSLQPD
AANSGFYRIYSTAPGGKTAVDISSIVGGPGNGQGEMGGTLQVRDSLVNGF
LADLDELAYTLATEVNAVHSSGFGLTGSTGLDFFSTPATMAGYSGLGGIS
VAISNSNDIAAANADPSVGGTGNNTNAKDIASLFDKILPLSGGNMTLGGF
YNSLVGKVGVSVQNAERSATLSEGVLKQLDNLRESQSGVSLDEELANLIK
YQKAFEGAAKLINTGTEMMDVILGLVR
>GSU3042 flgL, flagellar hook-associated protein FlgL
MRVTQNTTANLVLNSLQTIRRRTEELEQQAATGVKINAPGDDPVTAQQIL
HMKSLMAAGDQYSRNISNGISWLSMTEAAMDEMGNVLTRAKELTVQMANA
TSDAKARESGMNEIVQLRNQLIQLGNTQLNGRYVFGGFKNDTPPFDSTGA
FNGTDDSITIEIDRGAFVPINYSGGELLRGGTPPGSTGTDIIGVFDNLIT
ALGANDQTAVQAELPNLEDALSQVLSTRTDLGARMNRLEGQKNVIEEMKF
SLTKVLSDKQDVDFMQVISDLTKQQTAFEAAIAASGKISQISLLDYLS
>GSU3056 flhA, flagellar biosynthetic protein FlhA
MANTAVDAVDLQPAKSNSDIYMAVALIGVLALMIIPLPAFLLDLFLAANI
TIALAILLVALYTQQPLDFSVFPSVLLVTTLFRLALNVAGTRLILLHGNE
GVDAAGHVIKAFGQFVVGGNYVVGAVIFLILVIINFVVITKGAGRVAEVA
ARFTLDAMPGKQMAIDADLSSGLINEKEARRRRSRVSREADFYGSMDGAS
KFVRGDAVAGILIMLVNIIGGFIIGVWQNGMPLEAALSNYTLLTIGEGLV
AQIPALIISTAAGIIVTRSADEKNFGHEISGQFLNYPKAFYVSSGVLFAF
GLIPGLPHVAFFLLSGAAYMAGRLAKERAQVVEDDLMTLPAPAETGESGD
QAGAIRPLDMLELEVGYGLVPMVDAAQEGELLERIRSIRRQYAQKMGFVV
PPVHIHDNLQLKPHEYNILIKGAKVGGGEMIGQYLAMDSGAVSMPVEGVR
TTEPVFGLPAIWIRPELKEQAQLAGYTVVDSTTIIATHISEIIRKHSHEM
VGRQELQQLLDNLSSSFPKVVEDLVPNLLNLGTVLRVVRNLLREGVSIRD
LRTVLETLADYGGLTKDPDTLTEFVRQGLGRSIVEQYKRDDDTLCLISLD
RRVEEVVAEAIQPSDQGSYLAIEPNTAQLILSGIRQEMEKFNQIGTQPVL
LASPSIRRHVKKLTERFVPNLVVLSHNEVPSGIKIQSLGVVTLNAG
>GSU0426 flhB, flagellar biosynthetic protein FlhB
MSDDKHSKTEKPTAKKLDEAKKKGVPHSRDLTSTVTLIAAMVALYTTGGF
MFTTLKRTSGELLGSMGTFHLTEASVEHLLIKLFLVFLSVVMPFMLVVVI
SGLATTMVQVGFSMNSERITFKLDKLNPVTNAQKLFNKDSLVEMLKAVLK
IVIVGYMSYKIMRDEMDGLLFLADTDLAGILEVFKHLAFKLVIHTCGVLL
ILGVLDLAFVKWRFIDNLKMTKQEVKDEHKESEGDPKVKGKIRQMQFQQA
QKRLRKVIPTADVVVTNPTHYAVALKYERETMAAPLVLAKGVDHMAQTIK
AIARENNVMLVENRFLARELYAQVKEGQPIPESLYTAVAEVLAYVYSLKG
KI
>GSU3038 fliC, flagellin FliC
MALTVNTNVASINAQRNLNVTQLALGKSLERLSSGYRINRAGDDAAGLAI
SENMRAQVRSMNQAVRNANDGVSLVQTAEGALNEVSNILVRMRELATQAA
TGTVSSEQRGYIDSEFQALKGEIDRIASSSEFNGAKLLDGTSATYSFQVG
SRNTGNDVISVSISAAGASSIGVGTASVTGNNGAAAKAALDSIDSAIANV
SSIRGTLGAVQNRLQSTINNLQVSVENLSAAESRIRDVDVASETAALTRA
QILTQAGTAILAQANQTPQSALSLLR
>GSU3037 fliD, flagellar hook-associated protein 2
MASVSFGGLATGLDSNTLISQLMYLERAPERILESKKSTISSQIDVYTQV
TNLLNSFKTLAAGMNTATGFMGKTTSVGDSTVATATSSSIASPGSFNLTV
NSLAKNERQVVDQGYASADALNFKTGTFTISGVATPITIAEGQNSLQGIA
SAINASGANVTASIINDGSANPYRLVITGKDTNNYTLNFSGLTGDPASGS
AYTTPTITKSGPTYQAGAAASFSVNGIAITKTSNIVTDVIPGVTLTLLKE
GGATTTVTVGNDTSGVTKKINDFVGAYNAAMSQINKQSEYNATTKKGGVL
SGDSTLRSVKTQLQNVLTTPVAGITGKYSTLADIGITTDRSNGTLTVDAT
KLADALGSNFNDVVELFTKNGGVSNLDTEKYGVAEQFRKVIDRFTHAYEG
PSSTANGIISSRVRGLNDTIKSIDDQIDAMEVRMERKEEALKKQFTAMET
LVSSLTTQGNSLISYLYGS
>GSU0409 fliE, flagellar hook-basal body complex protein FliE
MIDGIESGLGIAQAFPSVTGEAKPGNLAADGGKFFGELVSKVSELQAQSD
TAIKGLVSGESKGLHEVMIAMEKSSISFQFLSQVRNKAVEAYQEVMRMQV
>GSU0410 fliF, flagellar M-ring protein FliF
MPEALNKLIQPFMALPPAKRWVVGGVVGLSVIAFTILILVANRTDYRPLF
TNLTSEDAGEIVTKLKEQKVPYRIAADGKAILVPSDKVYDLRLSLASDGL
PQGGGVGFEIFDRKNFGMTDFVQKLNYQRALQGELSRTISQISGVEQARV
HLVIPEKSLFKEDEKPATASVVLKVKGQRQLRENDVQGIVHLVASAIEGM
NPEHVTVLDQKGKLLSKNTPGDAAGKMTASMQEVQRAYERSTEERLQSLL
DKAVGAGKSVARVSAVFDFRQVERYEEKYDPETVVRSEQRSEEKQDGSTV
TGGVPGVQTNLGRTAGQPAGTSGGGSKNDETLNYEVSRATARTIEPVGTL
SKVSVAILVDGKYDAAAAGKDGKEAKPKYTPRSPDELQKIDALVKSSVGF
NVERGDQVTVVNIPFQDTGDVGAGEADKWWNAPIFLSLLKNGLIGFGFLA
LLLFVVRPLLKTLKPEKSTSFEPIPSAEDALNQIAEIHRLQIGNQTVSQM
ELINKIKQEPYQAAQIIQNWLRDKGEE
>GSU0411 fliG, flagellar motor switch protein FliG
MTGTDKAAILLLYLGPEATSKVFEHLDDDEIKKISKSMATLGHVPRNVIQ
DVVTEYSSLTNPDTGIFSQGEEFVRKILEQTLGPQKAEILLKELQSSSFG
DMVDVLANLDAKSIANFLSQEHPQTIAVILAKLRAKQTSEIISMLPQGLQ
AEVVMRIADVDQVSPEILADIDEVIKRELTAMGGVQRYKVGGVEKVVDMF
NHLDRSKEKQILEKLDTLNPPLAEVIRKHLFTFEDIFKLDDRSIQAIMRE
VSNDTLTLAMKTAPDEIKDKIFRNISSRAAEMIKEDLEVMGPVRLSDVEK
GQSEIIKIVRRMEEEGKVVLAGRGGDDVLV
>GSU0413 fliI, flagellum-specific ATP synthase FliI
MSRIDLSRYLSAVDAMKPIRFHGKVTQVVGLVIEGFCPDAAVGTLCLVHP
NDGDPIPAEVVGFRDNKTLLMPLGELRGVGLGSLISVKRKKASLGVGPGL
LGRVIDGLGVPIDDKGPLAIREEYPIYANPVNPMKRRPIRQPLDLGIRAI
NALLTCGEGQRVGIMAGSGVGKSTLLGMIARYTEADVNVIALIGERGREL
REFIEKDLQEEGLKKSVVVVATSDQPPLVRMRGAYIATTIAEYFQAQGKK
VLLMMDSATRFAMAMREVGLAIGEPPTTKGYTPSVFAALPKLLERTGSFL
DGSITGLYTVLVEGDDFNEPISDAMRSILDGHIVLNRELAARAIYPPLDI
LASASRVMNDVTERSQQQFASRFKELLAAYRQAEDLINIGAYKPGSNPTI
DYAIAKMDGMINFIRQGIHDGVSMEQSIAELADIFDEGMAL
>GSU0421 fliM, flagellar motor switch protein FliM
MEKILTKQEIEALLAAVFEGKIEPDRELAKEEGTVHSYDLFNSEAHKGLV
PNLDIIYDGFIRYQRGTLSNRLGRIVEIKKLGAGSYKFDDFIQTLPSPVC
MAIYKADPLKGAALIAFDSTLVFTIVDCILGGTGMTSVQTSANRMFTSIE
LRLVQKIVQDMLVDLEKAWAPLYAAKMSLLRMEMNPRLVNIVPPEYQVVT
MEMQIQIDQLEGKMVFAVPYMTIEPIRDKLKSGAQFDLMAIDPQWSFRLS
KELLEAPLDVSVEVGGAVISLNDLLSLVPGDTIMLDTPCTSDLTVKVGGV
PKFTGMPGIRHGNKALQITNVVGKGEQR
>GSU0422 fliN, flagellar motor switch protein FliN
MSDFTKEETKDGELDRKNLEFILDIPLQLTVELGRTKILVKDVLQLNQGA
VVELTKLAGEPLDVFVNSKLVARGEAVVVNEKFGVRLVDIVSPNERVEKV
L
>GSU0423 fliP, flagellar biosynthetic protein FliP
MDGVPIFKRIPFIALCVILLTASLAAAAEPLALPSVSIGVGKATKPGDVS
VVLQIFFLMTVLSLAPGLLMMTTSFTRIAVVLSFLRHAIGTQQAPPNQII
IALSLFLTFFVMAPVWQQVNTQAIQPYRAAQITQDEALKRAVAPMRKFML
SQTREKDLALFLNLSKLPRPRTADDIPTLTLIPAFMISELRTAFQIGFLI
FIPFLVVDMVVASVLMSMGMMMLPPVMISLPFKILLFVLVDGWGLVIGSL
IKSFG
>GSU0424 fliQ, flagellar biosynthetic protein FliQ
MSPDLVVQLARRSFEVTLMLAAPLLISGLVVGLAVSIFQAVTSIQEATLA
FAPKIIAVMVALVIFFPWMMNYMSDFTREVYALIATMRR
>GSU0425 fliR, flagellar biosynthesis protein FliR
MFPLTTPFPTANDVAFFTLVMGRMAGIFAAIPIFGGRRVPTPIKALLVFA
MTMVCFPIIKEKMPQLPTDVLSLGFLMVQEVLVGVSLGLLSLIIFAAVEF
AGQIVSVQIGLTIVTEFDPSQGGQLSIMSIILEMLATLLFLSLGMHHIFI
GALVQSYDVLPLGAWHMSGALLQFIVTTIGEVFVLAVRLAAPVMVTLLAT
SVMLGIMARSFPQMNVFFVSMPLNIGIGFIVLGLSLPLFLHTVQGHFGLL
DEQLKTMMKLMGKG
>GSU3036 fliS, flagellar protein FliS
MLTPFNQYQNTQVGTASPEKILIMLYDGAINFSKIALERMEKKDLAGKGK
YISKAQAIVSELMNTLNHDVGGGIAQRLEQLYIYVIDEYINANINNSPRA
LENAIRILTVLRDSWVEAIDIWKRERDAVPPSVHQPGYVAGQAR
>GSU0328 gspE, general secretion pathway protein E
MEQIARRLGIPFLAEIGDNEADAALLARLPLAFARGRLVLPLRERDGRLL
VVSGNPADLSAIDEVRGVYGMEVELAAATPDTVLGAVNHLYARLGSSAQE
VVEELEGEDLSVIATELAEPKDLLDLTDEAPVIRLLNSILSEAVKERASD
IHIEPYERELEVRFRIDGILYRKLAPPKVVQEALVSRVKIMAGLNIAEKR
LPQDGRIRVIVAGRDVDIRVSIIPTFFGERVVLRLLDKQKGLISLENIGL
SEGGVRSMERLLGRTSGIILVTGPTGSGKSTTLYAALNRLNSPEKNIITI
EDPIEYQVKGIGQIQVNPKIELTFAQGLRAILRQDPDIVMVGEIRDAETA
EIAMQASLTGHLVLSTLHTNDSATAIARLVDMGIEPFMVASSLSAVLAQR
LVRRICPHCRESYTPERDYAGITLPSTLYRGRGCDACFGLGTLGRVGIYE
LLPVDGEICSMIIRREPAGAIKEYAVGKGMRTLRDDGLAKAAAGITTIEE
VLRVTQEEYADLPV
>GSU0326 gspG, general secretion pathway protein G
MHNTLRNRRGFTLIEIMVVIAILALLAALVGPRIIGRSDDAKVADAKVQI
KNLETALKLYKLDSGTYPSTEQGLMALVAAPTVGTIPKNYRSEGYLESKQ
VPKDPWGNDFVYLSPGEHGDYDLYSFGADGVKGGEGKNADIESWNLQ
>GSU1374 hylB, methyl-accepting chemotaxis protein
MMLQNLKIGTRLYGLIGFMSILLIVIGALGLNTARTANNGLDTVYRDRVL
PLKDLKIIADMYAVNIVDVSHKVRNGNITWTEGRKSVEEAKKTIAEKLQA
YLATNLAEEEKKHLEEAKPLIKVADATLERLASILSAEDAEALTAFTVSE
LYPAIDPVSAKFSSLVDDQLKIAKQEYDHSSGLYRASRTISLVAIIVGVL
IAGTAGLLITRSITGPLAEGVEVANRLAAGDLTVEVRAGGRDETGQLMAA
MGNMVTSLRHLIAEAISISHGIASASNQLHATSEQIATGSEEVASQVGAV
ATASEEMSSTSRDIAQNCTLAAESSRETSVTASNGSAVVQETNSGMVVIA
ERVKQTAGTVDALGRRSEQIGEIIGTIEDIADQTNLLALNAAIEAARAGE
QGRGFAVVADEVRALAERTTKATKEISGMIKAIQNETKAAVQAMEEGVGE
VEKGSVTSHKSGQALAEILDRINDVTMQINQIATAAEEQTATTGEITSNI
QQISDVVQQTARGAEEVSAAAAQLAQQAHQLQNVVGNFRIA
>GSU3028 motB, chemotaxis MotB protein
MAKPKKHEKEPNHERWLVSYADFITLLFAVFVTLYAMGQSDKEKIEQVMQ
SMRESFGFTATGSAPRPAVLDSSDMRIMPSIAPDLTNKGRGQGKNADSKG
RIRATEKDFQAIKSSIEAYLIKQGAQDKVNVGINRRGLVVSLKEAGFFDS
GSAIVKESAYPLLAKVAESLSAYSNPVRVEGHTDTMPISSVQFPSNWELS
TARATNIVHFLTRSYDFDPGAISAAGFGEYRPIADNNTAEGRSKNRRVDI
VLLSGEGERGEPERAQ
>GSU2043 pilD, type 4 prepilin-like proteins leader peptide processing enzyme
MTLPIVFYLFSFVLGAVVGSFLNVCIYRLPTGESVVFPPSRCTSCGTRIR
PWDNIPILSWLILRGACRACRAKISARYPLVELINGLLCLALFLKFGPTL
TFAALFVFCSALVAISFIDLDHQIIPDVISLPGIVLGFVLSFFLPWLGWL
NSLIGIAAGGGSLLLVAWLYERLTGKEGMGGGDIKLLAMMGAFLGWRAVP
FIIFASSLVGSVIGLTLMMLQKKDSKLAIPFGPFLALGALLYIFFGKAII
LWYLSIGAR
>GSU0146 pilT-1, twitching motility protein PilT
MELNDILTVAVRAKASDVHIKTGLPPVVRIDGRLRPIPNAPRLAPDQVRA
MALAIMNDRQKRLFEEHFECDTAYGVPGLGRFRVSVYSQRGTVAMVFRFI
PFGIPSMENLTLPPVIKKLAMEERGLILVTGTTGSGKSTTLAAMIDYINE
HRTCNIITVEDPVEFLHRDKKSILSQREVGFDTVSFATALKGALRQDPDV
ILVGEMRDLETIETAMHAAETGHLVMSTLHTLDATETINRIISVFPPYHQ
RQVRIQLAGVIKGVVSQRLVPRADGKGRVPAVEIMIGTARIKEYIDDKDK
TKLLPEAIAQGYTSYGMQTFDQSLMLLYTQKLITYEEALRQSSNPDDFAL
KVSGISSTSDSTWDDFVHDEAPPAEGEGSVEGIEKF
>GSU0230 pilT-2, twitching motility protein PilT
MDMNLLSQILGIAFEKRVSDLHFEVDNPPFFRAKGQLLRSKLPKLSPQDT
EFIARAVMEQNHRTLPDELRELDASYSLPNGGRFRVSIFRQRGSIGIVMR
VIPPHVGTFEELNLPPVLGEIAKAPNGLVLVTGPTGNGKSTTLASMIRHL
NETCTFNIITIEDPIEFLFTSDKSCIIQREVGIDTVDFSAALRSSLRMDP
DVIMVGEMRDLETIDACIKAAETGHLVFSTLHTQSAVSTINRLIGHFPPD
AQEVLRQRLADILVATVSLRLIKDKSGENILPVVEVMRATTTIQACIREG
RLDEIEKHIENGRSLYQMQTLDQHLLELCEKDVITFDQAKQITRSMDLER
KLAFTE
>GSU0436 pilT-3, twitching motility protein PilT
MARIDALFKLLKEQGASDLHLSSGAPPIFRLHGEMARQNFKVLSHEELTA
ILYEILTDKQKADFEERRDLDFAYAIPGLARFRGNYMMTHRGIAAVFRII
PSKILSADDLSLPDGVRRMTQFKKGLVLVTGPTGSGKSTTLAAMIDLINA
TRKEHILTLEDPLEFIHENKMSLLNQRQIGEHSLSFSAALRAALREDPDV
ILVGEMRDLETIGLAMSAAETGHLVFGTLHTNSAAKTIDRIIDVFPTDQQ
EQTRAMLSESLKGVVCQQLLKTADGKGRVAALEIMLGTPAIANLIREGKT
FQIPSIIQTAKRDGMQLMDQHLLDLFKTKRITAEEAYRCAQDKKQFEQYL
TEKPGQ
>GSU1492 pilT-4, twitching motility protein PilT
MANMHQLLTELVNRGGSDLHITTNSPPQIRVDGQLIPLEMPPLNAVDTKQ
LCYSILTEQQKHKFEEANELDLSFGIKGLSRFRGNVFIQRGAVAGVFRVI
PYKILTFEELGLPVVVKELAEKPRGLILVTGPTGSGKSTTLAAIIDKINT
ERHDHIVTIEDPIEYLHPHKSCVVNQREVGADTKSFKNALKYILRQDPDV
VLVGELRDLETIEAALTLAETGHLCLATLHTNSAVQTINRIVDVFPPYQQ
PQVRAQLSFVLEGVMSQTLLPNVSGKGRVLALEVMVPNPAIRNLIREDKI
HQIYSQMQVGQEKFGMQTMNQSLFSLLQKRRISLDVAMARSSDPDELKQM
LASAQRPPGQRPQMR