TitleGenColors Logo

Gene list

Applied filters:

COG category: Intracellular trafficking and secretion
Gene type: CDS
Genomic element: chromosome

Number of genes found: 100

Free access
Sort by:

 



# Geobacter sulfurreducens PCA, PCA

>GSU1496 pilin domain protein
MANYPHTPTQAAKRRKETLMLQKLRNRKGFTLIELLIVVAIIGILAAIAI
PQFSAYRVKAYNSAASSDLRNLKTALESAFADDQTYPPES
>GSU1500 hypothetical protein
MVPFTDHMRGRPFVEKLGYLPASDALKVLVADQKQSVGASLIMKVMMYFG
GKVSGGNLKVTTDVDYKAMSRTVHAALKLDPYNMDGYYFAQAILVWDVKQ
YRLANDLLEYGMKYRTWDWYLPFFAGFNYAFFLKDYPNAARMYMRAGDLS
GEPLFKKLAGRYLQQSGQTEIAIAYLTTMEKGARDKAVKESFRIRLTAFR
RVLLIEKARDGFIAEHGRLPSSVEEMLAKGYLKSIPADPYGGKFYLEPTG
DVSTTSKFAFAGVQNN
>GSU0814 outer membrane efflux protein, putative
MKALRRTALALAVTLLPLAAAWGADAGAGTKLTLDDCIKKALTAAPELGE
AQADIDATSAKLDEAKAHRYPQIEFLGLIGPVPQARGNQVYSPDGINDTE
RWTWFQRGDATLVQPLYTFGKISENMKAASHGIEVDRAKKDQKGTDVALQ
VKEYYYGILLARELRELVLETRDILDDAKTKARKLIDKGSPNADELDIYK
LDAFRGEVAKYLEEATKGEQLALAALKTRMGLSPAEAVDIDAERLQPAGA
TLGDLTAYVEEARTRRPEFRQLNEGIKAREALVEAAKANYWPDLFLGGYV
SAAYADKRTRIDNPFVPDDFNHVWAGIALGVKWKLDFGITGAKVAGERAQ
YNRLLSTKAYADEFIPLQIRKAWLEAREAEASATAMKEAYTNAKRWVVAS
VANYDFGVGPAQEIFDGLQNYARMRAAYFQAIYNQNMALARLDHAVGQAP
LN
>GSU2185 flgM family protein
MKIDTNPPVTTVNQVKGETSQAASGADARKTGAAGGQATDTVDLSRNAER
LVKANATLRTMPDVRVEKVEELKKQIAAGEYNVSARDVAEKMLISMRNGV
TA
>GSU0327 general secretion pathway protein F
MPTFRYSAYTAGGRETSGTIEAESLKEAKLHLKRDGLYPRDIGPVSETAG
TVSRRFGGRNAGPAQVALMTRRLATLVGSQVPIYEAVTTLWEQEEPGEIK
KALGRIRERLAEGANLAKALSLEPRLFSESYVAMVAAGEASGALDAVLER
VALFLEEQRAIRSKITASLAYPTLMVLVGSAVMLFLLAFVIPKIVTIFED
NRAALPLITIALIKTSTFLRSFWWACIAAVAGVVLLYRRLMKDDAFRLRR
DRFLLRIPVVGSLLRQLILSRFAKVLGLLLSSGVPVMRALEITAQVVVNR
HYRAALTGVTAGLAEGGTLSGALRTTGLFPPLLVHMVAVGEKGGELEEML
GKAGSAFEREFESSVSGLMALLEPLLVLAMGLAVGLVVVAVLLPIFELNQ
LIR
>GSU2695 outer membrane efflux protein
MSMFGITQSSRATEQQSDPAATVCRAASKRSQRVSGFAPKPFSVAQVLYV
SIMTLTLAGCTTMAPKYERPAAPVPVAWPEGPAYKQTASATEKPVADIPW
QEFFVDEKLHKLIALALENNRDLRAAALNIERSRAQYQIRRSDLFPKVDA
SAGATFTRQPEGLSATGRADTIDQYSVGLGVSSYELDLFGRVRSLKDQAL
EQYLATEQARRSVQISLVSEVAVNYLTLAADRERLKLAQETLANQQESYQ
LTKSRFDAGVSSALDLHQAQTSVDAARVDIARFTTLVAQDGNALSLVVGS
PVPAELLPSALSDTLTALKDVAPGLPSDVLLRRPDILQAENLLKGANANI
GAARANFFPRITLVSNVGFGSDDLAKLFSGDSFTWSFAPRITLPIFTAGA
NQATLEVAEADRNIAVAQYEKVIQTAFREVADALAQRGTIDDQLTAQQSL
SDATAESHRLSQARYEKGVDNYLQVLDSQRALYSAQQNLIGVRLVRLLNL
ATLYKVLGGGSE
>GSU1063 hypothetical protein
MSLVEMLIALLILVVGFLSVIMVLWMSINSGRFTRDMTMAASLGQDMLER
FTARSYGSLPATGGAFEPYTTANASAVGYVREVKVEDNVPDVGIKTVTVR
VRWNSNGHERSRTFTMLKRDY
>GSU1493 type IV pilus biogenesis protein PilC
MPKFNWEARSRTGSVQKGVMEAASAAAVEAQLKKYGFGSISIKEEGKGLS
MEIKLPGFAPKVETKDLVVFTRQFATMIDSGLPLVQCLDILSSQQENKTF
KDVLIRVKESVEGGSTFADALSKHPKVFDQLYVNLVAAGEVGGILDTILN
RLAAYIEKAMKLKKQVKGAMVYPTTIMAIAVIVVGVILIFVIPTFAKMFQ
EFGGELPGPTKFVINLSNFIVKYILLIIGLIFALIVGFKKYYATTGGRKK
IDAFALKAPIAGPIIKKVSVARFTRTLGTLISSGVPIMDGLEIVAKTAGN
KVVEEAVYKVRQAISEGKTMAEPLQECGVFPPMVVQMISVGEATGAMDAM
LSKIADFYDDEVDEAVSAMTALMEPMLMVFLGTTVGGLVIAMYLPIFKLA
GTVGG
>GSU2122 TraG family protein
MSLFKKRIINPAGGMTDIGNGVNLDRRTKIVPIAIPDADRKRHTFVFGTT
GVGKTRLCENLIEQDIRKGYSVVYFDPKGDQQIFTKIYEVARDSDRLEEL
MLVTPIFPEYSAVVDPMAFYFMPDELVGHIVSGIMGGREPFYRNIAKEIT
TAVISAYIIQSKRHGNLPILNIDEIRKRIRRESLDSTMKSLRSIGNAEAD
LTAGMIEDILKSPMEYYAKVSSTLRTALMELSSGNIGKIIGQADSNRFIK
NLESGKPVILVVHTGAMITREASATLGKVLLSMIQSFVGRVYLSNRQKVN
PPLSIFIDEAQSLLYQGVEELFAKAGSADVMVTAFAQSVNQVYAVIGEEF
GKSILDNTNTKIFMRCSDAETSDYVVKHFGVQNVLTGIFGSNQVTTREVE
QDILRVQDVLSLKPREFYMMTYSGRFKGTTNDAHEPKMKIIFPEAPAVIT
SKLSQSKPATPPSP
>GSU2882 cytochrome c family protein
MLAVGLADTAAAASTCSDCHGMPPIDAAYRNITTGGFKGSHQTHQPATAP
AGACAVCHTGSGSYAMDHMNGTIEMASNINASPLAATYGKGVFFNQTSNP
VMGTCSNVNCHFEKTTPVWSSGPLTVPAGCSICHGLPPADGSHPAATAGS
GRKHGDYYGTGTGSCVKCHPDHAAAAKPFAHASSAGNRGLILRFTAAPNT
AGTYSKTANLNYPNYLPSQTTAANRNGTCTAMYCHSDGNGGAARATATWG
GTLAADCTGCHGGNAASAAPIITGLHAQHVNAAAVLGTTIECARCHNGPV
SAGNDRAVTGVAAHVDGVKTVAFVGGGTWNATAKTCSATTCHSSGKATAP
QPPTPAWTGAAMGCNGCHGTLNPLGTPDYASGGSGTALANSHAKHVAAAA
DCALCHANTTTTGTAIKAGSTLHTNGAIDVNLDTTNARVGATATWTAGTK
TCATVYCHGATLTGGTTKSPVWGATLTGCGTCHGFPPATSVHTGKVATDC
TGCHPHVNASGTGFTDASKHINGVVEASGGHAVPYYTHKSPAPTTAQCSG
CHANASATAPYPAGTSPNYTAPDCRSCHVVKSPYGVADCSSCHGTAGATG
VNLGRPTGSTFPNIAGQHGKSDHRVACSTCHGTYGTGRTDGTHGPGNHTP
STVSTRATKVNVTLSTWNPTTRTCGTVCSYNHGSKTWY
>GSU2038 hypothetical protein
MRQSSVLGCARPAGLAALLTLLLAAGGDGAAAATMNDYCIQPPFVSQSVP
PLVMFEVGREHKLYYEAYNDANDLDDDGRLDTTYKHSIDYYGYFDPYKCY
THSGGSGSNDKYTPVSTTADKFCSSGQWSGNILNWLTMSRMDVLKKVLFG
GQRSADSNTATYLERVYVPQDAHSWGKEVTGRLCSNGTNYTDMCQFDSDC
DTGYTCVDKSVNLIGITASDTGTACSFTSSIKWDTTGKILVAKYTHSNFS
CGSDSTDLISSYEPANLVAGFPVYVATFGDAILNPAADHGDQFNYLALAE
FSVSKSDKGNWMFAIDGDDGVELEIINPAGDASTIVASRYGCNSACNCQT
NSGTINLNTTGYWRLIARHSEKSGQDGVKVWYKKPSKTQSSDPWVLFGSS
TLTLRAPTIPAGAECTLKDRSFIETGKPKVGTTPKQHLFCSTTLSDGGTP
ILRFLGNKENRIWEWVSKERPVCDSSLGAPTDYTVQVEVCKSVSPDNRPT
GKKDDLASGRETNCKDYAGTFKPVGLLQKFGEGEGAKVCSRTLAKSCTSD
SDCGAGEGLCIYKSPMYFGMFTDSYTKNLSGGVLRKNIGSILDETNANNG
IFQTSENVQGNIMITLDRLKTIGFRYTDQSYQDASGGSCGWITDRPLNEG
ECRMWGNPIAEMMYESLRYFAGKGAPTTEFTYTTPADSGLSLSKPSWGYS
KGSTTYQLYDIYPPCAKPFMLILSDINPSYDSDQIPGSSFKKTDGTYFSE
DAASPQLGLGVAGADGVSLLNKLADTIGTSEGIIGDSWFVGENGSTTDFV
CSSKSVTKLSLLRGMCPEEPTKRGSFYSAALAYYGLTLMKEKTGKPDVST
FVVALSSPVADLKIKAGNSHVSILPVGKSVSGSHGINASCAQKCTLTADE
DGLHISNCSSTAYCPSNQIVDFYIDSLKYDNDKNVIYAKYRINYEDVEQG
ADHDMDAIVTYEVCTQSAIDQGLGACSGSLGSNIQIKLNSDYAAGSIDQV
MGFVISGTTEDGVYLPVRDRDVSSADSDTPATVAGLPLNWSKTFTISGNP
TGTLKSPLWYAAKWGGFIDANNNKKPDLASEWDKDGDGEPDNYFLVVNPL
KLEQQLQKALTDILNRVSSGTAASILSNNDNNGATLLQAIFYPRKNFAET
ELAWTGELQAFWYYIDPFLNTNSIREDTDQDLRLKLKTDYVLDFRFDTND
NKTKIDRSLDVDGNGSGDSYVNTIEPEQVNALWKAGSLLWSRNLSTSPRT
IYTSYRDAASKDQLTVFTTAGKDLFKANLQAADATEEDKIINYIRGTEQS
GYRNRTVTIGGSTGVWRLGDIISSTPRLQSNARLNGYHLPPPVGYKDSSY
QRYLDSNEYKTRGMGYVGANDGMLHAFNLGVLKAGTTKDVTSFITGSDFG
KEMWTYIPRNALPYLKYLADPEYDHLYYVDASPSLNDVSIEVTEGTGCTD
AAYWLCTKQTVYQAGTDSTTKELDLDKTSWRTVLLGAMGLGGASRNTTDA
CSASTDCVKTPIANVGYSSYFALDVTTPTSPSLMWEFASADLGYSTVGPA
IVRIGGETNGRWFAVLASGPTGPINTQTHQFLGRSTQTLKLFILDLKTGA
LLRTIDTGIQNAFAGSLSGGTLDTDRSAGTTGKYNDDAVYLGYVRKDTTT
GTWTKGGVLRLFTKENIDPAQWWWATLVDDIGPVTSAVAQLQDTTHKNHW
LFFGSGRYYYKAGSDLDDAAGRRALYGIKDPCYDLNNKMKTTCNTPTVLA
TDLVNQTDSIQGMGTAPGWYVLLDEASGSAGAERVITDPVAAPNGAIFFT
SFKPAADVCKFGGDLALWGVNYSTGGYLAPSQLIGEAIIQSSTGSFEQID
LGSSFTQRLNRKTAERQGVPPRNKPTIVTNANIKPQKRIIHIREK
>GSU3120 conserved hypothetical protein
MKKQLCMIALVGATAFGTTGTSWSLDGPPPPEPPPMGKGQEHFLERMASV
LKLTDAQQAQIEALISTDAEQNAPLHRQLAENERALREATTAASLDEATV
RALAATKGNLMTEMIVSRAKLRNAINAILTAEQRELADRLDPLKYGPPRP
RPDRPGME
>GSU2221 general secretion pathway protein-related protein
MYREFYGLREKPFSKTPDPRYLFLSRGHREALARLQYAVEERELALLTGD
IGCGKTTISRALMDAMGEACRFCFIFNPRLSPLEFLRVIARSLGIDNPAG
AKDELLKQLTETLYLMHAEGRCPVIVVDEAQLIPDRDVFDEIRLLTNYQL
DDQNLMSVVLMGQPELRQILADPVHEPLRQRIALHYHLQPLSLDETLEYI
DFRLEVAGGTPGLFSPDAVQRIHELSGGVPRKINILATNALLVGYGRDAA
WIDASLVEELRDEANLY
>GSU2030 type IV pilus biogenesis protein PilO
MDARIEKLLKLPNKQKLALLAAILVVEGAALYWGLYAPRQKELTALRGKL
EKLQTEVQEKTRIANNLPKLKKEYQQLQKDLENALTELPNQKEIPSLLTG
ITSVGKGAGLDFLLFRPKGEVPKDFYAEVPVDISVSGSFYGVANFFTAVG
NLPRIVNITNVSFTDIKPVGGKTTVKVNCLATTFRFIEKKETKDDKKK
>GSU3398 metal ion efflux outer membrane protein family protein, putative
MNRAVSLPVAALLFLAVTGASAGNAGADTRIFTLTEAVEHALSNNGELKA
ARSGHEAAQAGTVRAGLYPNPVLELEGATGALTGSSNENSMELAISQEVL
LGGKRAKRLEAAERDVEAVRWWLADRERLTALEVKTAYADLLLAQQRVDL
AKQAAELGSRLLTLTQERFAAGDIPELEVNLARVEAARSDSERMAAEREI
VPVRSRLSTLMGLDPGTEAAVVAPPDEQPVTALVAGLATKAREHRPDLKA
LTAEQARGEVEVSLARSEGVPNLTVGLFVAHERSTDAIGTDEEKTRDTIV
GVRLSLPLPLFDRNQAGIREAGARRGGTEARLIAARTALEREVAAEYARM
AAAEEVLRLYAKDILPRLKENLALVQEAYGLGEIDILAVIEEQKKYVAVH
GSYLAAVHARQTARARLEAAVGAPLDELTTGGTQ
>GSU0330 general secretion pathway protein C, putative
MKKVYLLTALLIALNSLAAARIAAGLISYRLARTAPHTAMGATGAPAAAV
SDDILSFASILDQGLFGRATQGKLTPLSAPVQGAAGAQQPAPTHGDLTLL
GTARGSFRETFALIRRATPPEERVFRLGDTVFGIGPLVGVQKESVEILAN
GRKIRLTTPLAMGIEPTSAPPPPAVAPQTGAVQVGAGSYVIDQRALNAAL
ENLGQVMTDARLLPSVKEGKVEGFRISEVKPAGVFSMIGMRNGDILLRIN
DLPVDSPERAIQSLASLKGQNRIKLDLVRDGQPTTFSYDIR
>GSU0025 tolB protein
MKHVRIFATLLALLVISVTPAVIHSEDVYREVTASGAHNLTLAVDNPRNL
GGADDAALARDVAEVLRFDMTLAGPFSVMAPPAGTSPGGIRPGEFDLDSW
RNAGVDLLVKSGYTITGDSVTMEFRLYNATQGRELAAKRYTGKRSDLRRI
THTFSDDIMQTMTGERGPFTGKIAFVSTESGNKEIYLMDYDGHNVQRLTK
NRSINLNPDFSPNGRELAYTSYRRGNPDLFRREIFTGTEAVVSSHRGINV
TGTWSPDGKQLALAMSRDGNSEIYAISRDGRDPRRLTTHQAIDVSPAWSP
DGKRIAFVSDRLGKPQVFIMNADGSDVRRLTTSGAYNVSPRWSPKGDRLV
YCRQEGGFQIYSIATDGTGDTRLTSEGSNEHPRWSPDGRFLTFSSTRDGG
EAIYVMRSDGSGQTRVYRSKGKASHPTWSPRW
>GSU2899 high-molecular-weight cytochrome c
MINVYMKWHRGCALAGAAVLSALACAAALFALAGTAAAGTITDCSGCHGM
PPVDAPYRNISTGRFMGSHDTHAWLSAGTPNCAICHKMPATYNHRNNNVD
FVTTINNSRLTARYRNLTSFTRTASPSFGTCSNVNCHFEKTTPQNGATPL
YLDGGKTIQQKCATCHSAPPADARHAKHAQYYGDVTTACVKCHPDHAAKP
GKGAFSHATSAGRRGLVIQFTAFPNTGGSFSGDVSYPLYLYNPSRTGACT
NLYCHSPGTKAGSYDPPNQSPDWAGTLGTSCLGCHRGDADSGSPMQTGSH
GAHVGVNAAAQIGCVQCHGATVTNSRQIGDPTQHVNKKADISFEAAIGSA
GTYGGAAGHVAKDVGTAYGTCRNIYCHSDGATTTPPFNDYAVTWGAADFP
TGCTGCHGGQQGSGNVIASNNHRKHVDASYNGGLGTGIGCVECHAPTVSG
NLTIADKARHVNRFKDYSGQRAGTIAAGTCSNVYCHSSGQRSPAYRTMVP
WSDTATTYGCSSCHGASTAGPAGVFVSRFGEPNYNNYSSADRNWFNSHNP
KHVRSAGDCSTCHAGTTQNGVSLVPGTTLHANTQKNVTFNTAVAGGNSPS
YNDLSRRCTNVYCHSNGRQTGRAYATPRWGGSAQNCNACHPIAGLGGAHG
IHVGGMIPTFYAYTGNHPVGAAYRYGCANCHPMDPVHHIDGHIDLSLSKN
DVNAGGLKTKNGATNGSGLNSAGSGLTGTTGASVRCASVYCHSNGYAANL
VYAATPDWYGGSFAGDRCAACHGNSPNSTIAGSSSHYNNRFLGYTGVAGG
HQIGIHAMNIYSSPGKRATAGTTGSSSHGNAATATTISCNICHYETVTTA
RNDDNAVCKTCHYDGNAVGALSGNRAAIADRSKHVNGLVDVAFKPVAVIS
KAQMRPASFAIATYSSVWKRNGGYKVSGANDSAKQALDTATMWDGGTKTC
SNIACHNGQSVKWTDNNGITECVSCHSAM
>GSU0314 general secretion protein E N-terminal domain protein
MPIRLGEMLIKAGMITHTQLDEALKGQVIFGGRLGTNLIEMGVIGEEELA
RVLSEKLRVPCVDPDELMAVPDHLLSLVPRDMVERYKIVPLGVDGRRLRL
VMADPSDLPAIDEIAFRTGFVIVPMVAPEIRLFMALEKYYGIRREVRALP
VSETLGGRRCTYGTKRPAEQLMRRDVVDFSTLPDNGKYLPWEGGDIDDNR
IEAAERYTIDALSRTLADCRERDAVADALVDYAGRLFGCAGLLLVMRDMA
AGWEAVAGHERLREFGQLRISLQEASHVRTVVEERSVYLGPPGNMPTDRR
LAEALGGAGAPGLMLVPMVMGRRVVTILCAAGEMVALGARLSEAQTIARK
GVLAFEILILRGKILMT
>GSU2137 metal ion efflux outer membrane protein family protein, putative
MEVISLGTTVFARGALLLVPGIAVLLSSTLVRADEPSLSLPKVIEYSLQN
NGELKALREEKGVRDASKFRAGLLPNPTLDLEAGTGALTGSSDENNLTIG
VSQEFLLAGKRDKRLTVAERELEAYRWQLADRERTLREEVKAVFYDVMLA
EQRLKLTDRSIDLNRQLLEVTKDRLAAGDIPELEMNLAKVELTRSEGARI
EVERALLQTRSRLFAFMGLPAGEAPAIAGTLDNGFSLNKNLADLKQLALG
LRPDLKALEAEKGRGDADITLAEAEKVPNLTAGLFYTHDRRTDATGTGEE
KVRDNLLGIRLSMPIPVFDKNQAGLQEARAKRSSSESRLTAATRIVERDV
ETAYISALNAEKVLSLYKSNIIPQLEENLKLTQEAYRLGEVGILSVIQEQ
KKFFEVSDGYLTALHDRQLALVKLESAVATDINGGVQ
>GSU0702 cytochrome c family protein
MVVGSEVTVNTRSWKMLMMWCSVMFVGISPVLTGVDEVRAAIQCYQCHGT
AATSDYRPEDSAFRAISTGGFKGNHRTHMASSATANSCTICHGTSGYQAS
HRDGTIAIASNINASPAVGTYSRGTSFAQSASVVPGTCSNVNCHFETVTP
AWGSTPLASTADCGTCHGSAPATGSHPVGGSKHGAYYGTGTSSCGTCHPD
HGAKEGTARFAHATSVGRDLQVSFAAAPNSGSGSYSGPVNDYLPSQSNVF
GTCSGTYCHSPGTKSSAFDAPNQSAAWGGSLACNGCHKSDRSSGTAMTSG
SHRAHVDGFGLGYATVRCVRCHGATVTSAMAIGTYGNHVNKLVNVAFDST
TTAKNGTYGGVSSPMTKAPGSAYGSCTNVYCHSSGQGNSGSWPPTYTTPT
WGSAATGQCGTCHGINGDTHSGYGTPTIMTSGSHTRHLAYSFGITSDETR
CATCHATTLTGFTPTVCSSSVCHNQISQKHANYEVNVGFPDYYGATAAYG
GTPKPGDGYGSCTNIYCHSDGRDTLHYAQTPLTWGGASLGCTGCHGSATA
PQNGSGGTTLSGKHAVHVNNAAIFGTNNSLGCVQCHNRTVSDNSTLLGTT
GTQYHVNKTRDYYGTMAGSYNSGTKVCSNVYCHSSGQATPVYRTVAAWTS
ATGYGCNSCHGADSAFTAAAGEPNYANGGAGTATANSHQRHVASLGITAT
TGCAACHCRTVDATVTNKLRNYSTLHLNQARNVAMKAINGKSGTYDSNAK
TCSATYCHGTSPSPAWGGSTACNSCHSARATDAHWAANSAHKIHWEGSVL
PSSYAMTPGNGTGDAATYRFACSSCHSPSGGSAHAGGALAASQAAQVYFG
YSAAGVRGSYTGTGTAYSNDNGFNWTAGSSGCTATYCHSGGAGQAGRSAV
AWSTSAKTSYNCNLCHGNAAAPNNDWRRGAPLYASGSITYKGQAKGNAHG
AHISAQTSLSAVYMQCAHCHGATTATSTAISDKRKHLNKSYDVSAGGTFR
DGDNTANSTAVTIAYSYNVAGSSCSNVSCHPVGLDVTTTPSTPLTRSTST
AAWNSSYKCTDCHRIPMQDTDTYHHAMYDYGRTYPTTIPDGSATSGTNYT
SRTCTMCHVDHTIFSPDLNVNSAGRSYNLRTAIGSTPTTTANFTNSDYSA
SGGGICISCHSTERTKSTVRRKQETNATKTPPVTLANYSGSAHQYNVPAS
FFSDGSRFQGNCSKCHNALVNETSVFMSYTGSGDSFGNHNSGIRRLQGSL
GATGGEVAEEQICYRCHSLSSDADPGGGPPKAVANKDYYGVAPMSLASQD
IFAANRDFRAANPTYSLTNKLYFKPAAAETPAEAMPNQHNTGDSFSGGTW
VGRVMSPWETTVTYETKSQGTNLEGTSYWRMVTFTSPAVYSTTTVPAGNW
IINLYCRESSTAQNARVRYMVYKWNNPADTMGATIIAKGTYATELSTAGA
PGTVRQIPVSVGTLTLNAGEKIVVDLSLETTTTSTNGYTASFYFGSRAPS
ELTLPGSVAWTYADPGAPGYGHATQHYSGIHLPSRQNETLAYIAQNRHIE
CVDCHNPHATRNGLHGDYGIATGGSSTTLVNSNKNWVPNQWVGDYINIIS
GTGAGQTATISGNNATTVTVAGWTAPASGSVYRIIANNNAVSRSQRGVSG
ASVSYSGAWTNGTYTMVSEAAYEYQICFKCHAATNPTTNTLRYWNVTSPS
LGAARWTNIGLEFNPNNASYHPVIQPLPSTGNRRLQATALTGGWQPGEVM
TCSDCHGRDTSTSATAQGPHGSTVKWLLTGMYQNWPFTSAAANGTSSGGT
LLTGTGTATPPANNFCFNCHTWAAGGNGHVKSSGGHSRPCVNCHIRVPHG
GKVPRLLTGANAPSRYKPDGNGGAFSGSYLTSAILPASGYMGANNVSCSS
ICGARHTDNSIKAYSW
>GSU2031 type IV pilus biogenesis protein PilN
MVRINLLPVRSSKKKETARQQMAILLVSVLVVLGIGVGLFGYAQAKIKAT
KNDISGAESELQRLKGKIGELENIKKLKDDVTKKLNVLTQLRKEKTGPVR
RLATLSDATPEKLWLTKYSENGPNVSIGGVAVDEDLIAAFMRNLQQTEDY
TNVELIVSEQTEIGGVKAKRFELTCVIKALKKEEPAPAKKK
>GSU0797 membrane protein, putative
MSIYANFFVAIWLKFFFLLTPFFVMSVFLSMTQEIAPRDRRRLALKVTVA
VLVACFTLFFFGNHLFSLFGITIDSFRIGAGALLFLSAVQMVQGDAAVSP
SDRQGDISVVPLAIPVTVGPATTGALLVMGAELQGAWQTVIGCAALAAAV
LCIGALLGSAASIEHVIGKRGITILSKLTGLMLAALAAQIVFTGIKSFLA
IH
>GSU2710 hypothetical protein
MAASQEPTQESEPAWAPQQAQAPSFAPADSDAEEAADNLAVSSGIPSGPA
ATDEDFSFDFGGEPTAGSENQAGQSTASEPEVFDFGGFEEESVTEEQSRP
AEPEAASAVGGFGDGIAFGEVSPEAFAAATDAGGREEGSTGDFALEFEEG
VAKEDASPKIEAEEEAEPFDFGELDLGIDETAPEPEPRDRTQQATVAAVP
EQPVTEPAVTTAPLSPLDVPFGEEEAPPLVISSRRKGRSLLPLVAIVVSV
LVIVALAGFGFYFFKEGPAAFNKLGIGFVAEWLGFEAREEGGIGLDKVRG
GYLANAEAGEVFVIRGEAINNYRKPRASIQVKGALLGSGGQAVVQKIAYC
GNSLSDEQIASLPMAKIESAMNNQFGDSLSNLGVQPGKRIPFVIVLANIP
REATDFTVEVTGSTVATQ
>GSU0022 mttA/Hcf106 family protein
MFGIGMPELIVILVIALIVIGPQKLPDIARSLGKGLAEFKRASDDFQRNL
AEEVRTLDEKEKAEKGEAAAEPVKRDLAAEVKAYEDQAASGVHQGEPAPV
AGSPESEKKSA
>GSU1482 outer membrane efflux protein
MKPLPIRSIPFLILALCAATPAMASALSLKECLSLAAAGNYSLAVTASDR
LIAQEAITQARSGFLPRVDFQGGYTLQAKPQAVNFGERGSIETQDGTYGF
FNLSVTQTIYDFGRTSSRRQRASLLHEATAQQYAAAEKDVFLQVVEAYFG
ILEARRLLGTAEEEVAQRTDHLRIATNLFEQGVVTRNDLLQAQVRLAESS
QKRLVAANRLENRWMYLNHLTGRPLDQRDELEEGTESAMPDTAGAEERAF
ANRPELTALARSADAADAEVSEARSGYYPELYAKAGVDYVENSKVREQAI
YAATVGLKINLFEGFATESRHRQSVERLTRSRDALRLAREQVRLELATAL
NDARVAEQRIKSVETAIRQGEENLRINRDRYQAQVGTATDVLDAQTLLTQ
IKTDYHRAVFDFQVAKARVSKAMGEL
>GSU2032 type IV pilus biogenesis protein PilM
MLFSKKKEIVGIDIGSSSVKLVQLKEQKGGWQLVNIGIQPLPPEAIVDNT
LMDSSSVIEAVKGLMKGLSVKVKDVACSISGNTVIIRKIKLPAMTPEELE
DQIQWEAEQYIPFDINDVNIDFQILEPDEDDPSRMNVLLVASKKEIINDY
VNVFAETGLKLVIVDVDSFAVQNAFELNYETDPEEVVALINVGASILNLN
IVRGGSSLFTRDVQVGGNLFTEEIQKQFALSSEEAEQVKITGEYPDKAKL
KDVIARVNETLAVEMRRSLDFYNTTAGEGRIARVYLSGGAAKTAMLAETV
QNKLGVPVEMLDPLTKITCSEKEFDPEYLREIGPLVTVAVGLATRRVGDK
W
>GSU1777 hypothetical protein
MKSSSVTALILRIDRRFRGDSRGLSLIELVFTVAILGILAMAVVPFTQMA
AKRSKEIELRRNLRVIRTAIDDYKKDYDKAIKDKKIMDVANRSGYPESFE
KLIEGEDFGGLYAYKKKYLRRIPVDPFHPPEVGEPPKWGMRSSVDDPESD
LWGGEDLYDVYSLSDGIAIDGSKYKDW
>GSU2028 type IV pilus biogenesis protein PilQ
MRIQTTFSRKLVLMSLLAVFCGCADVHSSVKGDAMVEQAKGGVLREIKVA
ETGEGARVVLSADRPLAYTFYKTADPPKAVIDLARTEPGAIASPMDVNSD
TIRRIETVRYGEGATAMTRVEVYFTRDTEANATIDASDKGMLVLAVARPV
APAASVAVEAAPAAGQAGESSTGAPVAVAAAPAVAPAPVAVEPQAPIAPV
VAPQTGPPVIKAIEAQNGYLVIETSGEIKDFKFFRLAKPDRLVVDISDAR
LGINSKAIPLNALGVGMARIGAYPDKVRIVLDAAGGTLSPLNVTKGAAGL
IVAPADKVVDVPRAAAPPQPSASASQARQPEAVKSAEPAKGLTPAVDAIE
FKVLDGTSRITVAVTGACANDKPVKSADGFSVTLRNCILPKKLQRHLDTG
AFASVVQKVTPYQVKTKGRSDVKIQVQLRQPASYDVRRDGDLLQVSVRNP
EGFEPPVADVPASPTLQDGMDQAAVRQKEPSRETDPLAGVAQSGGTKKAY
TGRRVTLEFSDADIRKIFQLIAEVSNLNFLVGDDVTGTISLKLVNVPWDQ
ALDVILENKGLGMQRDGNIVQIRPKSKIQTLADEEQALKRAKERGMELKT
EVFDINFAAVGDIVSQFNAVKSERGTISQDARTNRVIVKDIEPALAEMRI
LLKNLDLPEKQVLIEARIVEATSTFTRDLGVQWGIHSNDSGADIIRSVDA
GFGGIVTPPPASGFPAATSSGGAVGISFAKMGSLQVDLRLSAAAVAGLVK
IVSTPKVVTLNNKAAKISQGQSIPYQTTSAEGTKTEFVEAALTLEVTPHI
TADGSVSMKIKASNNSAGTGSPPPINKKEATTELLVKNGETTVIGGIYVD
SDTDEDRGVPFLMDIPVLGWLFKSNTKNKTKTELLIFITPKIVS
>GSU1491 type IV pilus biogenesis protein PilB
MQASRLGELLVRNNIITKEQLAKALDEQRTSGGQQRLGSILVKNGLVTEP
DLTTFLSKQYGVPSINLSEFEADMAVVKIIPADVAQKYQIVPVNRAGSTL
IIAMADPSNIFAIDDIKFMTGYNVEVVVASESSIKTAIDKYYDQSASLAD
VMNDLEMDDLEVIGEDEDVDVSSLERATEDAPVVKLVNLILTDAIKKKAS
DIHIEPYERTFRVRYRIDGVLYEVMKPPLKLKNAITSRIKIMADLDIAER
RLPQDGRIKIKMGGGQDMDYRVSVLPTLFGEKVVLRLLDKSNLQLDMTKL
GYEPTALSYFKEAIHKPFGMVLVTGPTGSGKTVSLYSALSELNKTTENIS
TAEDPVEFNFAGINQVQMHEDIGLTFAAALRSFLRQDPDIIMIGEIRDFE
TAEIAIKAALTGHLVLSTLHTNDAPATINRLLNMGVEPFLVASAVNLITA
QRLARRVCSECKAVEEIPIQALIDAGVPPEEAPEYVCFRGTGCAKCNNTG
YKGRVGFYQVMPMLEEIRELILNGANTAEIKRESMRLGIKTMRQSGLTKL
KEGVTSFEEVLRVTVADD
>GSU3190 twin-arginine translocation protein, TatA/E family
MQYSLTTLWFFGGYCMFGFGMPELIVILIIVLVVFGAGRLPEIGGALGKS
IRNFKKASEGKEEIEIKPQKKDEPKKDA
>GSU2029 lipoprotein, putative
MTRRNSIPILAVLVALTFVSAGCGKKEQAPSPPPPPQKASPAPKAQPPVQ
GRATSAAIAPVAGLSQYDFANRRDPFKPFLQAKAPEKTRAVRGSSAGLLP
IQSYNVEQFRISGIIVGLKESKALIVDPAGKGYVVKEGMSIGANNGVITK
IAPSYLEVNERYTDDFGKVRKRTVKLSLAKKQ
>GSU0950 outer membrane efflux protein
MRVMIVTAALLLLSAQRLPATEPLTLREALVTAMENHPRTMAARESLRGA
DARAGQARSGYYPRLDLVADWNRGRSYLTALEGFKETETYTAGLTLRQTL
HDFGRTSGTVAAAEGERGAARESLAAVRQDVALRVREVYNVLLTAERRVE
ALRETVRAREEVLGQARAFYSEGVRPRLDVVRAEADLYAARTLLTAAENE
RDMARVDLAAAMGLDSLPDRPLAEPDGESPSLPDLAEAKRRATAGRVELK
RYDALHAAAQGAATAARGGHLPVIEATAGAGYADRSFPPEGNTWGVGVRL
TVPIFAGFATREKEREVAAALREVEARRRDQQLQVGREVEGAWLGMREAL
ARVESTGRELEAARESRTLAIERYREGVGTIIEVTDAQARELEAETANIR
ARGDTRIALARLDRAVGDDGGTEQ
>GSU2618 preprotein translocase, YajC subunit
MLGIAFAMAAPPGGAQAGGAMGAFQAILPLVFMFAIFYFLLIRPQQKKAK
EHRALLDSLKRGDQVVTAGGMHGKVSGIDGDIVNLEIAPGVVIKITKGYV
ASLKKD
>GSU0828 metal ion efflux outer membrane protein family protein, putative
MSGAVHLFLTAVILAPLGASGEVRTLTLPQAVEYALAHNGELKALRNEKD
VARAGLERAVLLPNPTLELSADSGVMTGSPDETALSIGISQEFLTGGKRA
KRRAVAEREAEAVHFQIADRERQLSLDVKSSFSELILAQKRRELAGRAVE
LNGKLLEITRERLAAGDIPELEVNLARVEVARSEGRKIEAEREFAPLLAR
LRTLLGVPAGEEIGFDGIPEQSPLSISLDDLIRLALENRPDLKAFQATSA
KGEAAVELAEAERIPNVTLGLGFTHERSTDATGSGDEKTRDNLLGVTLSI
PLPLFDRNQAGIREARAVRQGADNRLEFARSSVPREIEGDYARLAAAEKT
LRLYADGILPQLEDNLKLVQEAYRLGEVGILAVIEEQKKYIEVNDGYLVA
LAERQAALARLEASVGTDFQNRTNGGAQ
>GSU3221 cytochrome c family protein
MNIMKNRILTVVCAAALSAGALAGVALAKQTYTPGTGVSNSPHNINNVVT
NGDEYGRVCAYCHTPHHAIVSGAIAEYNPLWSHQVNEETYTPYASRTFDG
GSVDNMQSDPLVGPSRLCMSCHDGVIAVSQHYGTAPAAGNGSAAVGDNWN
EISVGDLAFGEGLTNDHPIGFDYDAVASTDKGNGTLGSGIKAANTQFSVT
LAGTGVNYAGTHDRKISDLMYNNGSKNIMTCASCHDVHNNENPEDYLLIN
KQAGSQICLTCHKK
>GSU1609 outer membrane efflux protein
MKRMTTALLLVMAPLTVHAADGGVRLSLKEAIQSAVQKNLDVRAELYNPA
MAEADIRKSLGIYDTQLTISTDFQYAVTEPVSSFLSGTNTSRTRTLTLNP
GVNQLTPLGGTVGLTFNNAYNYTNSTRSLSEYWNSDLTLSLSQPLLKNFG
KEPTELGIMVARTAKDGSLERFRTLLLDTVARVRTEYNRLYSLREDLEVK
KTSLELARKILTDTQARVKAGVLPAMEILNAEFGVASREKDLIDAEKAVR
DQNDVLHVLLQLPGKEEIIPVDIPTRDQYQAEEDALIRKALDLRPELREQ
KASLRTSELETRVARNRTLPDLNLTASAAVTGLDRHYNRNLEKVGSADYP
VWGVGLVFQYPLGNNAAENEYRRSKLKVEQGRTQIRSLEANVGNEVKTAI
RGVDSGYKQLDVTDRGRAFAEERLRSFIKRSEVGLATIKDVLEVESELAT
AKSNQIKALTGYGDAVTQLLRATGELLDREGITVTEKEGDSLYEQSARD
>GSU0787 twin-arginine translocation protein, TatA/E family
MFGFGMPEMIIILVIALVVVGPSKLPQLGQALGSSIKSFKKGMNEDEVKV
INKTNEA
>GSU1783 type IV pilus biogenesis protein PilB, putative
MEKQSFKRKTIGQILVQQGSLNPDQIPYLVEKRNASTKRFGEVCVGDGLI
TEENLARALAEQFGLDYVDARGVRLNESLLATLPPDAIYRYQFVPLEEED
GTLVVLIADPTDVLKLDELELLLDRPFIVKLATETAIATILKKGEATSRV
LKEVSEDFMLQLVKENEKGEEILSMEKISADTSPIIKLVNSTVMDALTRR
ASDIHIETALEGVIIKYRIDGVLYRATEPLDIHFQAPIISRLKVMSELDI
SERRIPQDGRFKVRLNDKAIDFRVSIMPSAFGEDAVIRILDKESIASDLK
GLTLETLGMHPREMKRLRRKIREPYGMVLVTGPTGSGKTTTLYAALTEIH
TGEEKIITIEDPVEYVLRGIVQIPVNEKKGLTFARGLRSILRHDPDKIMV
GEIRDPETAQIAVQSALTGHLVFTTVHANNAFDVLGRFIHMGIDPYNFVS
CLNCVMAQRLVRKACPHCKYPVEHSDSVLIESGLDPEECRDVTFYESRGC
EECNGTGYRGRSAIIELLDLNDQMRELIMAKAPAAQLKAAARESGTVFLR
ESAIEKVFAGETTLREINRVTFVE
>GSU1251 BNR repeat domain protein
MNVTTLWGGWFKALLAAAFILATLGAPAAGWSAVPHVAAGGNHVLTLRSD
GTLWAAGSNQFGQLGDGTGINRTSPVQVPGTWKTVAAGTTHSLGIKADGS
LWAWGSNLFGQLGQPLVNGQLATNKFSPIRIGTGNDWVAVSAGELTSFGL
KGDGTLWSWGNNLFGELGDGTTVSRPQPVQVVADPLSNGRYVAVAAGGEH
ALALQADGTLWAWGANLTGQLGTGGTGILPNPTPLKIGTDRDWTAISAGE
MHSVALKADGSLWSWGQNLFGELGNGVALPGANVTTPTRVGTGSDWVAIA
AGALHTVALHRNGTVSAWGNNAQGAVGDGTVANRSTPTLIVAPVTLVNNV
AIAAGTGVSFSVLANGTVFGWGGNGAGQLGNGTFGAVTAPTVLSAGASAW
LGVEPGGAFSLGLRSNGTLWAWGGNASGQLGDGTTTPSAIPVPVVGGAGN
WRTTATGTAHSVAVRADGTLWAWGDNSSGQLGIGSTVAASTPQQITVTNP
ASAGNDWTAVAGGGAHSLGLKADGTLWSWGDNTFGQLGDGTGTSNRTTPV
QIVTGNPGNFDRNWVAVAAGGVFSLALQADGTIWTWGDNSLLQLGTDPAM
LTPATQRNVPAQLAVASPPSTAFNSSWVAIAAGQDHGLSLQADGTLWAWG
ANSVGQLGNGVTTATFTPVQVINAEGVPYVSLAAGTSHSLAGRSDGTLWA
WGNNTSGQLGTGPHPGDADPLNPQPHTVPARILTSNPVSAADDWLVATAG
GSHSAALKSNGTLWTWGQNTSGQLGNGTTTEADIPTALLEPRISVSPASL
AFGPAPIGIPPSPTRTLTITNLGSAPLSAAIASNNAAFTVNPAACNNLPP
AGFCEITVTFVPALPVGVKSATLTITSNDPVTPLVSVGATGTAANPFFIT
ASVDPASPVGSGTISPAGVQAVAAGGGATYTITAASGYAIVDVIVNGVSR
GPIGSYTFVNVQQDQSIIALFAASVTISASSGPGGIISPTGTVSLPLGGS
QKFTITPNPGYAIAGVNVIEMVEQLDVNGNGTGIFNPVPRALGPVSSFTF
FSVRANGSSISATFVPRELHEWSWQNPKPQGQAISRIATDGNGTYVAVGE
FGTILTSTDSVNWTVRTFGSVNLNSVAYGGNHTFVAVGSYGRILLSTDDG
ATWTVQSSGVTASLRGVAHSGTVFAAVGVTENNPNFPYEPRTAILTSPDG
VTWTSRFLDQPFNGTLNLFDVAYGGGRFVAVGMEGHVVISEDNGGSWTAL
PSDPVTRPTNLNGVAFGAGTFVAAGDFGQIVTSTDGSNWVNRPLMSVAEL
KGVGFGTISDPLFAGNLLQVFTVVGTDGEILTSEDAGTTWTFRTSGLAAG
GAGPMLQAAMVGLNRGVTPFVPTVIVAGSEGNLLASSDTIDWTNLFTTIT
RYPLRSIAWGNGTFVAVGDGDPGTTLHPPISAPTVLTSTNTGLTWTRRYL
SPGHNIRGIAYGNGVFVAVGASGYDLDQNASRQAIILTSTNGITWTPRNS
GTFLNLNAVTFANGRFVAVSDYNATGDVADEGALILVSTNGITWNTIRKI
TSTAGNLRSVIAGTNGTTPALVAVGDFGTVLRSIDNGQNWTEVTSGTINF
AELNGIAFNNTSNTYAAIGITSEVFTSTDNGATWSMRSLPVDDTLLVRME
GITYAYGSFVAVGNDNHILTSPDGASWTINIGAEYINSSLYGVAAGNGSY
IAVGSDGTILQSNRLLDNPPIIGVDPTSLTFDITPEGTLSAGQVITISNL
GINDLTIGALGFDGSDAGEFTVSEDFCSQAVIPATGSCTVVVTFAPTGAG
VKSAELKIPSNDPDSNDIEVPVSGTAIPRYLLTVTNAGNGTGSIASSPAG
ISITNSAAGANSSALFVVDTAVTLTATPAVGSIFEGWSGAGAGACSGTTT
PCTLTMSQARNVTATFTRKFIITATAGANGSISPAGQVLVNPGANQAFTI
TPAPGYSVFGVLVNGLSVGAVTSHTFTAVNADQTISATFTGVPPVRLTLQ
SGNNVFSTIGQAVAATPVNTAATIDAQAVQTADNGLTLNRGITLTFRGGY
GAGFSGIVGMTTVTGPVTVTNGSLTVSDMVIK
>GSU1778 type II secretion system protein, putative
MRTIANMAIAAVVLTFLAGCTAGLTAFNKAEKLEEEGKLDEAVMKFAEAA
STNPQQTEYRMRLLKASEKAAFEHLKNGDTAYEQQLLDEALREYQSAVSL
NPALARAKQRSDELIRVRNSLTYYREGLEFEKSNKPREALQAYRKALDLN
PGNKEIKEALEKLLQTRRTKLEGFELNLKSTKPITLKFKEAKLKDVFYIL
SQLSGINFIFDEGVKEQNVTVFLENATFNQALDLLTSMNKLGRKVLNEST
ILIFPKTPEKAKQYEDLMVKTFYLNSLDAKKAVNLLRTMLQVKKIYVNEE
LNALVIRDNPELIDVAGKILEANDVPDAEVLLEVEVIQVSKNNSELFGLG
LSRYAVSLNARTPGSGGDFFSDTFDPKVTTTSNDTTTVTTTQVKNLLNLF
NWNGYQGFLTVPSATYNFSKQLSNAEVLSNPKIRVKNREKSKFTVGTRVP
ITTTSSTGSVGGVTVNVQYVDVGVKLNAEPVIQLNNEVTIKLGVEVSSII
GEKTVGSGDAISSVVTIGTRNVDTVLNLKDGETSIIGGLIEDTKSKSKQK
IWLLGDIPLIGSLLSSHNDRYDKSELVLAITPRIVRAVSVPEADVAAFWS
GKEDDPTAVKPFSSFELEPDFAAQPAGAPAKGAPAVPSAKPGSAAPAGQP
ATAADAPAAVVQPAATPAAASAAGVQASVAPVQPAMRGSLNIAAPAGVDL
GGQFKVEVKVTDVKGLAKAPFTLLYDPIFIEYVGAAEGNFLNRDGKPTIF
NALADKAAGRVVITMDRSAAGEGVDGSGTLLSATFKAKNKGPASLGLQNV
KFVDQANRPLDIIPYNTVVEVK
>GSU2911 hypothetical protein
MLEPMSLARIAALSLTLLSALLFGLPAGAIAGVITSDTVWQGNVTVTEDV
VVPEGVTLTVRAGTVVTVAAAESTKTDPEYLSPLTEITIRGRLAVEGTVT
APVRFSGQEKRAGSWAGILIDGGTAAIRECLVADADSGISVAEGTLHLSS
STLRENRYGLVAQGAASRITVEGARIIENDYGVLSLQGARVTAHATEVAR
NRKKDAHTAAVRPHTLPKGPPVNEEAPVSRRYQDMVLLGETLWQGRILVD
GTVRVPEGSRLVILPGTIVEFTRRDTNGDGIGENGIMIQGRLLAKGTPDR
PITFRSAEKDRRMGDWDAVNIMNSSAGQNLVEHCRIEHAYRGLHFHFSTV
AIHNTTLTNNYRGIQFQESVVALHGNTLCGNKSGVQGRDSEVELVDNLLC
GNQVGGNFFRTTLTVRGNRIVANGREGLRLREGAVTVRENLFDGNRFGLM
AADLYHGEVNRNSISGNAETGISLKNVDSVEFAGNAVTANGLSGFNIQDS
GSLITGNLIADNGERGMGILSFAGIISGNNFAGNGLYAIDLDGSGDVSAP
GNWWGGRDPALVINDQRDDPRLGRVDYGRPGAAPTPFVWPLATVAADTVW
RGVITVAVPTTVLPGAALTVAPGTTVQFAAGTGLEIKGKLLASGQRDGTI
RFTSVDRKGPSDWNEILLEYATGSVISNCIVEYATWGIHSHFTDLAVTDC
LIRHNYGGMRFRSGPVRIRRSIFRDNTIGIRSYIGNALIAENLITANETG
IFVREKGGGLTVTGNSLVGNSGYNMRIGDFNDQDVNARGNWWGEGDPGGT
IFDGRQEPGIGMVLYEPWLDKPGPVGPANGGTP
>GSU0412 flagellar assembly protein fliH, putative
MSSSKASRIIKVDQSPNQAIRSYSFGFIAADAPQELPPEADGFVPFALGT
PVPLPGLQSAEEPDPDPVVPFNLEGKVVLAEDELQARVDEVFRNGMDEGR
RQAERGLANVFKSLRDGVAALTGLRSRVMKESEEDLLRLAVMIARKIVQR
EVAQDPQVLAAIVAAAVGGCTERDRVVVRLNPDDYTQVSANRQAFLAGLG
EESAITLAPDESIGPGGCLVETATGTVDARIEAQLDEIYRSLLEERSAPV
EPSASPDTDSRADLAFGGEETIAPFKGQGAWVKGSEEKPRDDV
>GSU1153 outer membrane protein, OMP85 family
MKTTTIFTAALLACLSAPAHAQDLRGSEAGAVMKRESDVRDYYELQQRLK
ESRESRPEEAVTDRTEPAPQPPADGGQRVFISHIVTDPSEILTEDDLRQV
VAPLEGKEVSIRELLAMVDRINDLYRQKGYLTARAVLPPQKVERGTVRIR
LVEGRVGRISVGGNRHTRDWFVTSRLHLREGDLVRLDTLENDLFRFNAIN
DVKLRAEVKPGTATGTTDLILRTQEPDNYRVVAFADNGGGRYIGQERLGL
TLQDLSLLGIRDPLTVGGTVADGTLSAHASYSLPLTPVGTRLGVTYDYTS
IWITSGPFESLDVDGTSSDLGLTLSQPFALSPALSVTTFAGFNWKKSTTD
FGGDTIFENRTRTLTLGGDLLAIDGYGTWFTRHVLTQGFHDFGGDRSFFK
YNGDLVRTFILPDDFSALVRASGQVSGNHLLPSSEQFQLGGIATVRGFYE
GLLIGDDGYFVSAELTLPLFPADASVYDVRLSPLLRGAIFFDHGGAFPYK
GSGESIDHNDFLSSAGFGFILNLAKYLTGRIDFGFPVGERDPDPGTVRVH
FSLSSEIL
>GSU2143 hypothetical protein
MNKGIIAVVAVVGALMASSAVYAGWGWGNGGWCMTGNSQNVSTQKMRSFQ
KESFKARESLMDKQLELQDEYSKDVPDGRKIAALRKEIASLQDQLQATGD
KYGVGNWGTGGGMNYRQSSGYGCGCGYCNW
>GSU1781 hypothetical protein
MDIRINLATRYFYNTRKVNTAIAAVILGLLLLLAYNIASLVANVSTERAL
KKDMGILQARFDESAKGITEKQYRDLLKKIAEVNAVIGKKAFDWLLLLNR
LEEVVPEGVALGAIDPSLKDGTLKLSGAARSFGALRSLMENLESSTHFTD
VLLLNQGQLSVGEKQKGISFTVTCKVDFT
>GSU1537 general secretion pathway protein-related protein
MYRDHFGFTEQPFALTPNPDFLFLSTHHQEGFAHLLYGIDTHAGFIELTG
EVGTGKTTLIRTFLNQLDPATHRTALIFNPTLSSLGLLQGINREFGLPCA
SSERGELLEALNRFLLEESSAGRTVVLVIDEAQNLSAEVLEHIRLISNLE
TERDKLIQIVLVGQPELKRLLALEELRQLNQRITVRYHLEPMGCDDTREY
IRHRIRVAGGGREPVAFTLGAVKKIYRFSKGLPRLINAVCDRALLLAYTR
DSREITASMAAEAIIDVRQEEGRRFPIPRKTLQLLTLAAAISIGVAVFNR
ADKEPPPPAVAGEATPAPAPQPPPLTGDAVRKSLTATAAGDNLVTAANAL
LGAWQAPAIDRAGRSDDIRTLAARRGFTATEIKGSLDDILRFDAPVLLQV
ELPDGTSRFLTLTAANNGSFTVVPAVAGKDSLSRGEIEAFWEGRAWMFWK
NFHGIPLRTRAGSRGKGVKPLQELLKGAGFYDEKPTGDFDAATEEGVRRF
QQSEGLQPDGKAGEKTLALLYRRAGGFFPPGLTGAKGSTQ
>GSU1154 surface repeat protein, putative
MNGKRAKRTLGAFTGKRSTFRKLVSLSAALTLSLPPQAFPAQIVPDGRTG
TSLTIRDNVTDVTTSTVRGANAYNSFQTFDVYRGNVVNLHVPDSAVNLLN
LVHGQASTIDGILNAYKDGRIGGNVFFANPYGFLVGASGSVNVGALSVMT
PTTSFMESFFLAPGVPSESAAAMMLNGTVPINPDGLISIRGQINALGTIR
LSGGSVVNSGTITSTGTYGTAADTGSLFRAMVNADGLESGNNIVARNGGI
EIVAAQSVENYGSIYSKGQSLHLQAGTELIVADGETISTRKIEGDPSDAA
VHLLTDNSSGNSGNLTLEAPTITLGSGARLLTHASGDYLAGNIELLAGQN
ITLNNGARLLAGHASDPAKGGDVLLKVSAINAIGASRTADAGIRAVNSVI
RGRNVTLSSIADTSLIVHLLEQNPTLSLDEAQAYLNSELDDLVSDGPGGE
YLAVTTSATAKTELYGTTIEGTGAVTIEAKAGARAGFKKNAVAEVIIDDL
RDADQATVLAKSYIRGNKVSITSTADTSLTFNVLGSVLKLTDQSWLPDPV
TGELQLLNDQLFDFSEIPLVSLSTATAHTTVGGATFISAGDTLTISSEGI
SAAKPTFSSPLLFSAAWGESTVEAKTLVNGTTELSAANKATVKATTDVEI
NVTADVNSTNKPVDAVFVHAKNTAVTTSLVGNDTTTTAGAVEVNAAATAD
ISANALAKNAGGSGVGIAVAVNESTTTTTATLGGNVTADAGNVTVKATTD
ITKNNTGADAATLGNPNTISARITDFQAGIKRNVTKGIIDATGLLKPETS
ERITGFIFPGIKEGTFNLSGAVTYSKSVNTTTAAIAPDATVQAQGNIDVT
ARIDDRPNASVGSKATSTGTAIGGAAVIADFTNNASASIGTGASVDATGS
LLVDAQTVVPYPWQIDWDSPVTILNHLQDGILDLLLTSYAINSAGGKSGM
GLAAAVSVFNLENNANAWIDEGARINTVFDKDAMTLPNQIVTVHAKNDIS
TVNAVGLLSKKFLGTSGGKAAIGGSGNIIDIRENATATIRGDSVVKSEST
IDVKAENVNHLVTVTEAGGSSDQVGVEGAVSINTITGGAVAAIDDDADVD
AGGNISVEAKGTAKTISVAGGVVATKGQVGIGFAVSLNTIDTDASAYIGN
YDPLQQDDVPALGQVSTDGSLTVKATSSNEIGAYSVTGALATNSTAQTEV
PKDAQETKDGAGSVAGSSGSGKGTFGIAVSGDASVNDITSDTLAYISDGA
TVSQAGNATLSATNTLAVNALAGAVTISTQQEGNGLAGSYSQNTLGGTTA
AYLDDASLTISGDLDMDAQVNGEINTISASVQGTKGKVGVAGSVSVNEIT
NTTQTYLTGSTVRGVEAVDLTARDDSIIKSIAGAVSYGGKAGIGLSFAWN
SLDNLTQAYVDTSALTATGDITVSATTNNAIDTISAALGASTGDMAGEAA
VSVNTLSNETHAWISGQNNGSGVESVGSISLAADDQSRIFAIAGGLAATS
GKAAFGLSFAWSDVSNIVDAGIRTGADVESTSGNVEVAADSTTRVQAFAV
GGSFASKVGIGGSVSVAEGTNSVTATIDGTSGVTADGNVLVTASDDVDIF
SLAGNVAGAGSAAIAVANSTLVTHNLVEATLGAGATVSARGNGTAGRIYT
GDKDASGNRTKEDVTGLAVSAASFENLQTIAAGGAGGGKVGIAGSATVTV
LDEKTYATVGQGAHVNDADDGDAAQNVLIRASDRTGLLGVAGAVAFGGSG
GVGAGADVGVITKDTEASIASSAQTPTTVKAKGNIVVTADSSEDITSVAA
SLSAGGSAGIAGSASVYDLGLNTAATIGNSAVVRADGSVAVSAHDGTEMD
MIAGNGAFGGTAGVGASAAVQVITKTVIAAIGEQADVTGRGTGDGVVVAD
GGFAVSYGADSGDEGEIRAPTTNGSGSDSGALTGQRSAPPSTRTVNGVAV
TATNQDDIESISATGSIAGTAAITLAGNVNAITTTTSATIGTGATVNQDT
AAADAGQSVLVAAGNDYYHMGVAGSGSGAGAVGIGVGADVTVANLTTTAD
IGEGALVSAAKDVEVSALAGEEVLSISASLGVAGTVGVSGSVSVLSIDNT
TSAGTGTDSMVDAGGNVRIAARDDTETDMIAGTVAIGIGGAGVGGAVGVT
SVSKETTATVGSNATVNARGNDTGSMTAYTGDDSDTTGQIRGLSVEAASS
EDIFSVAAAGAGGFYAGVSGAVTVQTVDSATRASIGTNASVNEGVTDGHD
EQDVNVSARNSARTNVITGALGVGALGAAGGVDVGAIRNDTSASIADGAQ
IYANRDVEVNALAKTEIDSVVVSAAGGLGAIAGGVAVYSVGTGLEQEAKD
QLKSDGGDFADVNSYADDQASDNSIGTLLTGSGDSRIRSIAADAQAKRSD
VAVTDQLNNQTPRGTAAFIGGAAVEAGRHVDVSARRTVDADILAGAAAGG
ALGLGAGIGIVNVSGSTQAYILGSGRANAAGNILVSANTDATGTVDAYVG
TGGIVAVNAALAIYNDTASTSAYLGDGGVIDRADQVDINATGLHTVTAHT
FGVSAGAAAGGLSLAKARVGGTIDASVGEDTRIGQDSRNTGDTVGSLSVS
AQTITNGTARSEAAAGGILSGQGSIATADIGPTVTAEIGNRTAARVDTDV
AAMATATVTGTATANGASIGALGVGVSEATARTMPVVRATIGDETVITAG
QDVTVSGTATTTATTHATASAGALIGIAGTTSTATGLPQVNTAIGSGSTI
EADRAVSVTATTTNSASSDASGWIGGLAAFGSNTATAVTAVPLYDAQGRF
ISWFGSTTGALIGGNTAITTDSVNVAATSSNVALADSRAGAGGGIAGVTT
RAETIQVNTTTAAIADSSDDNARKIHATNSIAITADALTTLNAFADSSTA
GLVGVSGARTDNAATSVVEASVGTNNALEAGTDLAVLALNEIVKYSPQRP
ANLTSGAGGVFGGAAGQSSTSLSTFTTATLAGNTISGSDQTISAGGDLTV
AAENAVLATDMAQLSAGGLIAVADVRSAITSDNTATATIGANANVHAGND
LNVLAKTNANVQTTTNTSTWGFAAGGDGTALNTVVADNDVVVGTNAALSA
DNDIDLFAGQGLSDLQNSLISRADARSWVSGAIPVSDVTGWAYLYDFNDI
LIDTGSNLKAGRDINLGAFSGLATVEGYAKAKKKSYLLFGIPITIYSNGS
RRSWFFNHEGTDVSGPSVTVNGTLESGLNRHKTLVIGPDGTVVGGTLTSA
DYEQTTINLRDKVLDKKAKLDAKIAEIDPSGTYPNLPEGDKILYDALKTE
VQILEQKLAEWEGKSNAELTVPLIAVKDLMTGSGDINITTTTLKGTGTLK
VPGTDFMIRIDNNSLAHLELNDLEIPKSASGNVNLNGKAITSHSYGGETL
QIVAGQNLGRRIEIFNNAYLDDFPGALTPSDIVLKGDIINYGGRVSIRNN
SGSVAVGGNIIADDLDMVMQGGFVREWQPGLYQPGHLIAGNNIYISGEIL
DINDTIQSGIPYRYITIPEFDPETLGPDMVIPTIGDELSVAKWDPVNQRI
VIYRVDFGGGKVELFGNIVSTTGTGALKVMDGYGEIVIENLSSRDVVINT
LDVGPRVDGQIKIVDTGRKYTDNGIYVGDNNQLLTLITGNGDALNVSQGY
QLWDAATRKFIYTELDSHKADSRSSSYAPHAGARVATFSDWTITEKDVAA
AWDNWWTTRFTAGNFWLQFALMVDAGMKQAILDQFGKKADNPIGIEFLGN
LDEGRISIFNSGAAGPSDIYLNGSIRNTVGNVSIRNDRGGIYSLNDTYLV
TGRNIALTATEGSIGTLDQGIRTDTVGGSLRATAGGLINVEEVEGDLVID
TVTTTGDVRLVSAGSLRDGSGTSPSITGTNISLTATAGGIGTGDNALVVN
ADGILTAESLHSIYLTEKEGNAHINRIASREGDVVLTVDGGLEDYNFNEG
LDDDTKDKLLTTWDDLKLTDDTKVQQSIDQYKEQKKSQYQAAHRLSDNGT
PFDPSDDQYDATYDPSWQYTLTATEQSEFNEGVWTADELLNAKNLTTIPE
LGKTEVLIEEANVSGRNVTIVTGAGVGSVLADEVISADAISNGTVTPDQR
IMVARAEKDDITIDNGNLIVQLKNDVDVRASQSVTIQSRDHVYLGAETDV
NIDRVDAGNGDIRLKIVGGIINGRTDDEANLIGRDLILEASAGGVGSAAR
PLVTDLSLGGVLTARARDGIFIREAGGDISADSIISQNGAVELTVANGSA
AIGQISAPGHVLLEVSGNIVNGRDDNGVNIIGDDLIIESSAGGAGTSANA
LVTDLSGNGVLTAWVRDDLFLEERNGDLTIDTIASTNGAVELSVAAGSAI
VGGITAPRRIRMTASANIVNGRDDGRENLITDDLSLEAAGGSVGSAEKFI
VSRLRPAGILTGLSQDSFFLEEQGGGLTVDSVVSQTGSVHLTVPDGSVDA
DHISAPGTVSIRANGPLLTVHRVDPTVLDVRNTFSGGTIVVGQADVAESV
MARGDTVLLGEIHHTGSGTLHFDVDGGSKTMADMVRIGTDSNTAIDFDHL
SSDTAVITADVDNLSLFDTRIGNRGDFSNSLYHVIVGNRDKRVRPCHLQL
YATEPFSLTMTADKRFTTTAFAVNYDPHFVVNGFSTENSVVGTTEKMIWT
GKRQNRLYYDPMEPGSRPWQRHMAPSGHDAVDIQPGAVGIDASDSLLEAD
TVKVLTGNTGANR
>GSU2869 preprotein translocase, SecE subunit, putative
MWGPRVIAKTKEFLTEVKAELDKVTWPTRKETVSTTWVVVAIVLLISVYL
GVCDVVLAKLMRIILG
>GSU1330 metal ion efflux outer membrane protein family protein, putative
MPESIGPSLRKLVTGLYFSQNILRCCGIFFTARRSATMRRTDFQPVTAWH
GLCNADLRPYPEVPETEMITCILRTLLASAVCVLLLAGPARSGDAPAGED
LNSLVSRALAVNPELKASEARWEMFRNRVAQAGALADPMLMFKLQNFLLR
DPLDSRRDPMSQRVIGISQELPFWGKRALKTEIADREAEALRWQVEERKL
ELARMVKETWYQLYLVDRELDIVERNIRVMDDFVTLAETRYSVGQGAQQD
VFKGQVERSRMLDMQIALAQQRTSLQATLNTLLFRPAETPVGRVPDLEIR
PISLSAAELRALAEENRPQFRSVRAQLEKGAAGHRLAGLESFPDVTLSLE
YMQRDPSMDERGYDMYSVGLTFNLPVQRERRRAMARESVAETDMARAELN
TLNNAIALGIADSLARLERSEKLAQLYRTGIIPQAEQSLESATIGYRVGK
VDFLSLLDARVTVFNYERQYYEALAEHGMRRAQLEALVGRELE
>GSU1066 hypothetical protein
MSTDNTTLRKFALLAAAMAFAGIVASLALAAVSQIPLFLLNISQPNVMIL
LDNSGSMDIIMQHSAFDPTARYSGGFDNDRTYYQTTSNGYHYLSTGNDYI
RDDKKGNFTKNSVTIKLPLPYDDTRWDGNYLNWLFYHATSSQRSTVSTDA
TLQKTRIQTARGVISNLVKTVSGVRFGLAKLNVDGYDRFDRKQTDGGSIV
RNCGDLTSANVDTSVSGISAETWTPLGEALSEVWQYFKGGTSLYNTGVSY
TSPITSSCQKSFTIVVTDGEPTYDGCYRGDFSSYGCDNAADADSHLADVA
AHMNGSDATSAYGGTQSVTTYTIGMTIDSSLLRTTAENGGGSYYTTTSGM
DLATALQNAVNEILGRQSSASAVAVSTAYLTSNTTLYRARFDSTDWSGYL
EAYGINKANGAVTGYPNSPKWEAGALLNANSARTVYTAGVQSGVYRRVDF
TSTNAATLAPAGFMNFSSASTASMIGYVRGDVEPAGYRHRASKLGDMVQS
APVILGPPDGYYSDNNYATFKRNNATRQSLILAGANDGMLHAFNADTGAE
EWAFIPNILLPKLKLLRATPYTHTNYVNGAITVGDAFITAKGLDGKSETS
SSWRTIAVCGLREGGKGYFALDVTDAANPIPLWEITNTSPSETSGTVVGL
GYSFGTPLIVKLKDSSQSGGFRWVALLANGYEGTTSGRAATLIVADLATG
AVIREIVADASTFSGVSPNGLATPAAIDRDADGFVDYVYAGDLTGHLWKF
DLSSSNSNNWDVVWKRSGTPVALCRAKTAAGSVQPITTAPDVVLRGGYQI
VFFGTGKYYESTDISSTQPQTFYGAYDYNSTTTPTSAQATNGALLTRADL
TAQTVTRIDESGTSWRTSSNNPIGLTKGWYLDLPVAGERVITDPVARSRK
IIFTTFIPNTDACSFGGISWLMELNMDTGGEVVRPVFDVNLDGKVDYSDT
VLGDLKVKPTGTLLGDGLASTPAIVGAGDEHEYKYITKTTGEIIKLLEGG
GHSQIGLRSWRQLK
>GSU0279 cadherin domain/calx-beta domain protein
MKKTSDDQVRAAKGKSIPLSSSGKDFIEGSLDFRPVSPKKQTAVRKPGEE
QEEKAAVAAEQTESDHMTADEAADVSYDHVAAITGEQSFADTLSVADQTK
TGKEEKCGDNNDDDDCDDKGGWLWWAGGAVGVAGGSIGFAVAALNGDDDE
GTHVDTAAVAFAGRVTDGPVHGATIYNDINKNGVYDDGIDKAMTHNSEAI
TSDADGNFTITVGDLIENGITDINKLKLVAYGGIDTVTGEAVTVDFTAPE
GYRYLNPVTSLIAAYMEAYNEANPNAKITAAEAEEAVIQALGLPQIDYAT
TDLALPETAVEAQKVAAILAVAAMLIEESGTDSDGFAFLAAHLAPSETPL
PGTMTYLTDEVTTALQSANDTTAANQFSSTVVAVNDATSLDDINTALNNT
IFADLIVAGNVKVGQTLDGGLGMGSTTGLEVTYQWLESIDGTTWTPIAGE
TGSDYTIRPTDILHHLRLQATYIGTDGQPRTIFYDVGVVPDSSPVFASST
SGAVAENEAVGTVVYRAEATSDLENNPLSYSLGGTDADLFTIDVATGEVT
LKNPADYESKSSYSIDITATDTYGLTSTTSVTIGIDNLDEVAPSITSGPT
AATIAENSGPGQVVYTAAADDSADISGGVTFSLKADGDAALFTIDAATGE
VTLTGNPDYEAKPAYSFTVVATDAAGHSTEQTVTLAIDNLDEVAPSITSG
PTAATIAENSGPGQVVYTAAADDSADISGGVTFSLKADEDAALFTIDAAT
GKVTLTGNPDYEAKPAYSFTVVATDAAGHSTEQTVTLAIDNLDEVAPSIT
SGPTAAAIEENSGPGQVVYTAAADDSADISGGVTFSLKADGDAALFTIDA
ATGEVTLTGNPDYEAKPAYSFTVVATDAAGHSTEQTVTLAIDNLDEVAPS
ITSGPTADIAENTGAGQVIYTAVADDAADISGGVTWSLKAGSDAALTIDA
VTGAVTLADNPDHEAKSGYSFTVVATDAAGHSTEQAVELSVLDNNASAAI
AVDLTTIAEGSEGTSTILTYTVTRTSALNASSVDWAISGVDAADLAAGQA
AVGTVTFAIGETSKTFTVEVVGDRTIESNEDLVVTLGNPGNDIDLGTADS
SATTIADDDGEVSIAATAVSVPEGDTGDSRVVTFTVTRTNTLSASSVDWD
VAGGTVNAADFGGTLPSGTVTFAEGEATKTISITVTGDRIIEPDETLTVR
LSNPGLNLVLGVDEASSTIVNDDVGFSIFGDVMDVVEGGIGEQRAITFHV
VRSDSLTLPMTIDYRLIPRGSTVPDGFDFTGSPDSLGDNAGRPSGTISFG
PDETSKTVTIYVAGDAVPELNETFSIVLANAPPSTIIINGEIEGVIRSDE
TQYSIHAVTAATVEGNGTGGIQQFLITRTGDTSQPGSVGYTLSEYGENPT
EANDFAAGTPLTGTISFAAGETSKILSVNLEGDSVLEGYESFQVALHTLD
SNSIIGTNTAVASIIPDDAAINIAATDSIVKEGTGAVSRSHTFTLTRSSH
VDSEVTVDWHLAGTGANPVDAADFGGTLPSGSVTFAPGETVKTLTITPST
DAAYEPHESYEIVLSTSQLGVVLETDHASGMILNDDSGLTLVATNLDKAE
GNPGTPSQLTFTVQRTGDTTGESTVHWELVSADGSGVSAADFASGILPSG
DLTFSRGVTSRVVTIPLTTDNIIEPDKGFTIRLSSPSEGTELLVSEVGGY
IRNDDAAFTLESVSPVAEGHNGTTTVTFTVVRTGDISGADTVEYVVAPAD
GGAVVDGADFVGGQLPDGLITFNAGEASKTVTLAVAGDNALESDEAFTIT
LVNPGVGSTIASGSTDVVILNDDDALSIVATDADQAEAAAGGTRDFTFTV
NRTGFLDRATTVNWSVAGVGANQVDAADFGGALPSGTLEFAANESSKTIT
ITVNGDYFQEADEGFRVTLSSPSDGTTLTTASADGVIRNDDTGLAITATT
TTLAEGDSGTVTHVFTVTRTGVTTGTTTVDWALAGSGGHPVDAADFGGTL
PSGTLVFAPGETTKTIEVQASGDTDIEPGEGFTITLSGADGNADIMTASA
NGTVVADDISIAISAGTASVMEGATGSSRVLQFTVTRTGDLASPVSIDWS
ASGMDAADFANGTALSGTINFGAGETVKTINLTQIGDNVSESDETLTITL
SNPAGNPAHDRTYITSATATTDVVNDDASLTITADAASQNEHNTGDGEAT
SFTFTVTRTGDTSTETTIDWVLQLPGGAGSAAGNDFVAGQDLLGTNSGLP
SGTISFAADETSKTITVLVATDNQVEQDETFSIQLQGAGANTEVSGNSAS
AVISNDDTGFSIIALAADHTEANGGTVTYTFRVTRAGDISSAATVDWDVA
GSGASPANADDFGGSLPGGTLSFAENEASKEISFTVSGDTVVEQDEEFTV
TISNAQLTDATPQLIQDATVGGIIRNDDQSFSVSAANASVTEGSAGTTQI
AYTITRTGDLSDSVTIDYAVTGAGGAATSDVQGGVLPTGTLTFAAGETSK
SVTFDVIADTLAEGNETFTLTLTNPSAGIIGTASDSTVVVNDDTNFALSA
PAPFAEGESGSATATFTVTRSGDSTGAGSVQWSVAPATGLTTADFTGNQD
LLGTNSGLPSGTITFAAGETSKNITIQVAGDLTLENDETLRVILADPTGG
TIEGTDGDKSTTILTDDDSFSISTLTASRAEGNSDSTITYTVTRTGSLVG
ARDLTWTITGADGFATGNDLAGGQAATGTVSFADGQESATIVVNVKGDSA
VESDETMTVTLSGAPANSVIGTASASTVLTNDDASVSIVTLIADKNEGNV
TIVTPTGEVPGSTASTFTVSAAGTVSGTVESAGDRDWYKVNLVAGHQYQI
DLIGNGSYTAGDVFLSLRNSTGIQLASNDDFIGVNSRITYTAPSNGIYFI
DAGHLGSGTGTYGVTIADLTVPGTDMSAPAYGVGAQPYTFTITRTGDTTH
GSTVEWRVAQGVGVDAVDFGSVGSQDLLGDNSGLPSGTVTFAAGEISKTL
TVNIATDSAKETDEILRVVLSNPSAGTEVITASADGIVRNDDAELNITAG
TFNLLEGDGLHGTGKAMTYTVTRTGNINQTSTVDWSVVHGTTSSADFTNG
VGSNLTPSGTLTFASGVATQTIVVYVYGDTGVGSVEGDETFSIQLSNPNS
GSALGNITSYTSTILEDDTRLVLSAADYSQAEKTAGNNTTYTYNIAREGY
TGGTTNYSWAVGYTDPYTGNPAYMYDNTQSRYETVTANASDFTGSLSGSG
SFSAGQTNASFTVTVTGDDTPEDDEWFAVNLTASSGYDEVTVIYDDPTKG
TGTQLARTYYSPYRTYYDGQQVSSATNGVASNTNYLFSSIERDEAVYYLS
DREVASTSVQTLNPGDGLRTRVEGDTPADGGAGATTVTIEGVEYGYVEHI
FAVQRQVATAGTASVGWRIGTYYNAAVSADDFLTITRDGNGDITAITTAG
ALPSGTVTFADGQEWAYIKFYTKVDDIGEYDEYFSIFLENPSAGSSIYTY
DTVSYPQYNYGIITNDDTRFDASVNDVVEGGTLTYTVTRSGDSRGTDTVD
WSLALPGSEATNESNNSTGTWYKLDPSDIDSVTPSNGTATYNAGTLTWSG
TLTFEDGETTKTITVVTTDDSWTETWREELPIVLSNATNVNAGEGNHDQE
TASTGYTDTARVYDNESDPLIGVSVGSSTTWEGTGANDSATGNSVTFTIT
RTDQGGRDGSLNYPTTVAWRLDGSGINWGSANNSAEILTYGGDAASVNEY
TSNTTYGVVTFAAGETSKNVVVTFTGDRYVESDKTLTFTVLDPDDAEHGP
LYTDFYGPADINNAQASVTTTLKNDDIRLWVGGWDTYSGDANGYYTNVQT
SAYEGNPLTFAVNRYGRLDCDIVVNYTLINGTTTNGDFTTTSGSFTLAAQ
GSAYGEYTYSISLADLLTDDTTVEANETFTLRLSAPGDSAGSSVRFQSYY
ADYTSSYNSPATTLDVRGTVYDDDTTYTLTPASTSLVETDQGASQTFSFD
VTRGGTGYTGAAQLRWRVEAVGGTPADSADFTSTDLLGTNNGLPSGTVSF
ANGELTKTFSVLIRGDLVAENNETFRVVLYEDVLTSSSPTITNSQSVASS
TLTIVTDDTGISIADSTLTESDANQTMTFTITRSGDTSGTSSMNWTLYHG
TTTAGDFSGATTGTVSFAAGETSKTISVTVAGDATPEADETFTILLSNLV
GVDEAIDISATGTIKNDDSSFAIAGDAASSPESGSQTFTITRTNDTAQSQ
TITWSVSAGSAGAADFGGSLPSGSVTFAPGEMSKTITISPSSDATPETDE
SYTVSIALGAGTTGDTITQATATGTIENDDAAIYIAADQTNQQEGHSGTT
PFTFTVTRTGNTTGAASVDWALSSAGASAADFTTADGLGSNGGLPSGTIT
FADGESAKTITIEIVGDEVVEADESFTITLSNAAGGAIITGSAGSTIAND
DSTIAIAADSAVKNEGNSGTTAYTFTITRTGYLGEAETVEYSVAGSGAHP
ADGTDFNGTTGTLTLAAGEATTTLTINVSGDLSGEPDEDFTVTLSNPSSG
VTITTDTATGSILADDIVFDVAAPASQTEGNPGDTTYFDFVVTRSGNLSG
SQTLTWSVAGIGADGTSGSDFDSTTGTVTFDPGETSKTISVPVKGDYLGE
ADENFRLTLTGPDGVVFTHNSADATIIDDEASLRISATDAGRAEGANGVT
SYTFTVTRTGNTALEATVDWSLAAGATDPDDFAGGTLPSGSLSFAAGELS
KTITVDVAGDTAIEGDESFTVSLSNASTGADIVIGSATGTIVSDDVEWTV
SPLSVPAVEGDGASSYVFRVTRTGSLSATTLDWSTAGSGTNPADADDFLG
SFFPSGTLVFAQGQTSQDIVVQIAGDNLLEADKEFSVTLAAPVNGLTHSY
AEQTASATIVNDDDVISIAPLSADHAEGTDSSSPFTFTVTRTGSLTGTST
VGWRIVHGDTSADDFVATTGTVSFADGQDTATLTVLVSGDRNLEGDEGFS
VELYNPGAGSTVDDTATTASGIIHDDDVDLSLAAADANVAEGDSSTAGHA
TFTVTRSGDLSVETSVNWNVVAGTATAADFAGGELPGGTVVFGAGESSKT
ITIDLAGDGAWEGNETYTVQLSGASDHADIVANNVSGQIIDDDDTLTLSA
VSADHAEGNSGATIYTFRIDRAGTATGATSVEWIAAGSGAHPTDQDDLLA
TTGTVTFADGETSKTFTVEVAGDTTGEYDETFSVSLANPAYGSTTVGAPV
TATVRNDDAVLFVRADHVSVAEGADGVETTFTFTVTRSGDTSGAASALWE
VTGSGLRPANAADFGGIFPSGAVAFQPGESTQQISLTVLGDAVGEYDETF
SLVLSDPEGATILEGTAETIIANDDTGISITALDADKAEGNNGLTDFTFR
IERVGLANGAASVSWAVAGTGSYPAGADDFAGGILPSGTVYFADGESVKD
ITIQVAGDETYGQDQTFRVLLSNPAGANLINADATGVIRNDDSQVAITAL
DTAKLEGNAGTTTFSFQVTRTGALDTSATIDWDVIGSGGHQTVAGDFAGN
SFPGGALTFAVGESSKTITVEVAGDTLTEVDEEFSVRLRNPGSGVSIAPN
AGEASATILSDDDGVVLIGLDVDRHEGASGTQTVYTYQVLRSGNIDAPIT
LNYAVSGDVDSADFMSPLTGSFEMGAGENSRLLTLTVNGDDIVEPDEFFQ
VTLSGSGINIDSTPVTGAIRGDDVAGDGNDVIHAAATADTINSGAGDDVI
HLTMDNLLHLQVTDGAHVDGGLGFDTILFDAAGQEFDLVALVANDAMSGI
EKIDLGGEGNTLRLTTAELLHQDQNLFSILANGSEPFHQLMVDGDADDEV
IIADITNWSHGAADTYTDGSVTYDVYTNGTDHTQLLINQAITNVHGVAG
>GSU3466 membrane protein, putative
MEKRALIAVVLSILFFYGYTALFSPPPKETPKPVATATQSQPAQQVTAAP
VPVAVPAQPQPAVAARDVSVDTPAYSVTFSTQGGSIKRLDLKRYHETAGP
GGKNVTLVSEDNPSNYTIGLRAPGFGLDQNAVFVPSADALTVGPGEKKQL
SFTWVSPAGVTVTKTYNFSGDGYGLEIQYQVTNSGSARVSSPVQTVQTYP
LVPKVKESRFETFGPATFAQDKLFEDKVKDLESGAKTHAAPLWSGFADKY
FLSAVLAHEGSMAAATIRKTASGYLENTISSPELSLNPGEGRALTYRLFF
GPKDIDVLKAQGNSLERAINLGWFAMLAKPLLHSLKFFHNYTGNYGIAII
IITVIIKVIFYPLTHSSYKSMKEMQKLQPKMQQLREKYKNDREAMNRAMM
ELYQTHKVNPVGGCLPMLVQIPVFFALYKALMFSIELRHAPFMLWITDLA
AKDPYYVTPIIMGVTMVIQQKMTPSQMDPVQQKMMMALPVVFTFMFLNFP
SGLVLYWLVNNVLTIIQQYYINRSISTAEAK
>GSU1784 type IV pilus biogenesis protein PilC, putative
MALYHCKLGSSEGRIITRELEAANPEMLRTSLEEQGFFVFEIKKKPLQFL
WDKGGGRRKVDNKALLTLNQELLVLIKAGLPIIQALDTVLERVERGTLFD
VLAVVREDVKGGMALSDALEKHTKVFPHLYVASVRAGERTGDLQLTIRRY
IAFLKRVEEVRKRFISALVYPAILVTVATLAITFLLVYVVPTFSQVYADA
GSQLPLPTRILIAFSTSLKQLFPLIIAAVIGAVFFFRRWAATESGRYRVD
DIKIRIPFIGDVFSKFAVSSFTRTLATVIGSGIPIVESLKMSVGTLNNRV
LERRMLEAVVKIEEGMSLSGAIESARIMPPLALRMLGVGESTGSLEEMLS
DIAEYFEGEIDARLHLLTTAIEPAIMIVMGLVVGVIIVTMYLPVFKIAGT
VG
>GSU0278 outer membrane efflux protein
MGSVRTRVLGAVIGALCLAATAWGQEAPMTGDGLDVALNFAAIGHPSIKS
QVAELKALGSDLKSAEYKRYPALTIQAQTMTNNQNQVVAVVQQPLWVGGR
IDGGIEQADVGLRIGRAALLDVRRRVMEETSAAYATLRGALQRLKAAELN
VGEHEKLKGLVSRRVEGGVASNADILLAESRLSQAIAQRIQLKGAVSRAR
SDLLALTQQPVAGNEPVPAHLLELPEPERMAEMIVKTSATVEQRILKVED
ARITSRLAVASMMPALYAKVEQNVYVAERYGETPHETRFGVVLQGSVEGL
GLSGWKKVKASDSRIDAARKEVAAAENDVRRQAEALVTDILSLRQVVRSY
ELLVTSTEETLASFLRQYDAGRKSWVDVLNAQREFSEARISLEQARSSLE
EASLRLAARLGQLDSFTGGNE
>GSU0391 Outer membrane efflux family protein
MLVLSPPVLLAADRSVTLQEALQSALERNHLVNGARFEREAAERGAAASR
SRYFPHIFLEEGFAASDAPDRVFMMKLDQGRFTLDDFQLENLNTPSSYRD
FRTAVTLEQPLFDLGIGYGREMAEKEAERAGFSLAQRREDVGLAVYAAYL
DVQKGRVTLAATEKEVAEARESLRVAQARSREGTALRSDELRARTFLSES
EQRTITALNDLRLARMRLALATGEDAGGSLDIAEELISSPVRLSEDELVR
EALVNRSDLKGGEKDVERAEAAAGAARSAWFPTVYAGASYQMNNRDVPFG
RDNDAWMAGVNLRWELFDGLRRSHDQAKAGAVRQAALQYLEQQRKEVVLR
VREAALRREEAGKRLEVARHALLAADEGMRIVAKRYENGLATMVELLDAQ
SALNRSRTGLVERESEYLLATARVYHAAGLFLKETVK
>GSU2773 conserved domain protein
MAGKSLKKPTTIVTPGVKEDVSEKADEKTYDLRDPSQLVALFTENENEDF
IKISLSKYIESLIQSHNMASYNIVFLYDETRSISRYHSNQIYEAASSATD
KKDILLILHSGGGQIEPAYLISKTCKSLTKKKFVVGIPRRAKSAATLISL
GADEIHMGLMSELGPIDPQIGGYPALGLSNALNTLAGLACRFPDSADMFA
KYLTDNLNLKDLGYFERVSESASQYAERLLAGKKFPAPSTAQSLANHFVN
HYKDHGFVIDSDEATNLLGSNIIKQNTAEYLFANELYKFLDFASFLFSYF
KKKEFYYVGDIRSGLSTRDKRNT
>GSU0329 general secretion pathway protein D, putative
MKKRFLNLLTAAVLACALLAPLPASAKGVVLNFNDVDIATMVKFISDLTG
KNFVLDERVKGKISIYSPSKLTPDEAFSLFTSVLELKGFTLVQAGKVYKV
VPTAAAKQSGMRLLSDKDRLPVSDAYVARIIPLERISAQEAVAFLQPIVS
KDGYIAAFGASNMLMVVDSALNIQKLTGILTLVDSPQKREGAEIIFLKNA
SADSVSGVIREWLGGKTSRPAGQAGQATATSSAGVLIVPDTRLNALVIFG
SDQDKDDIKKLIAMVDVVPPTTSSKINVYYLENADATDVAKVIDGLIKGT
PTTPGQPGVPAAAPVQSPFEGGKISVTPDKATNSLVIMASPVDYQNILQV
IQKLDKRRRQVFVQALIAEVSLDKLKDVGVQIGALAVGTQGDASGGAVLD
PFNFLSATSGPQFLLVKALEELGKNVSVSAQVKALVSDGAINVLSTPNIL
TSDNKEAEIFVGENVPFLSQTNLTTGGISQQSIERKDTGITLRITPQISE
GEYVKLDIYQEISAVKENKGQANDLVTTKRSAKTAVVVKDKDTVVIGGLI
QDRDTETINKIPLLGDIPLLGWLFKTKSTRREKTNLMIVLTPRIIRGAEE
MNDVSGQQRDKFGEALSLDAPFDLKRDLQLSK
>GSU1782 conserved domain protein
MLFAQNAIGMEVSQHGVRFVVLGGGKGAPRLVTHGGASFAPGIVRILHRE
PNVVDPKAFVGTVREEYCKLLVRTDLVSVTLPDAVGRVMIMDFDTRFKNR
EEGRDMIRWKLKKSLPFDAGDMHLDYQTLRERDNGSLSVLIAVVARQVVT
QYEDLLLEAGIQPNRIDFNTFNLYRVAAKRIPSEDTSLFVAFHGGVLSML
ALTDGLIDFYRVKEMGREVVDPNRIFMEINSSLLVYRDKNPGREVKSVFC
LAPPDGGESFAGIVAEASGIDPVMFNPQAVMGNNGTATDPALLHDLAAAI
GAATRNL
>GSU0027 TolR protein
MEVGSRNSGDRGTMSQINVTPLVDVMLVLLIIFMVTAPMMQQGVQVNLPK
AETKAMTAQDEAVVVSIDRAGKVFVNSTEVASGDLTAKLTAMVANRAKKE
VFLKADRDVPYGQVVKTMAEIKGAGIERLGMVTEPAPGK
>GSU1486 MttB family protein
MSEDNQKVLPFLEHLVELRKRLIIIIVAVVVGMGFAWNLSNGLLHFVTKP
ITGETYLTDIKKQVYQEVGKRFPAAYKQFELEKAMNAAPKERKLNYSAPL
EPFFVQCKISVIAGFILALPVVFYQLWLFIAPGLTRKEKRMVLPFVTVST
VSFCVGALFFLVVIWPVIINFSLSYEAEGLNSLFNMSAYINFCLRLILMF
GLIFELPILMLLLSRFGIVTYQFLARNRRYALLASSIVAAFHADLITMFV
IMVPLYMMYEISVWLVLLFGKKKPVEAVAGEGPTGAAS
>GSU2609 type IV pilus assembly protein, putative
MSQKKVGEILIEHRLISEDQLREALELQKVFPDQPVGQLLCKLGFLSESE
LSYILEQTGKRQKLGDILIRERLVDEERLNQARVAAKRDGSTLERALRKL
RLVEEEPLAKTIATQYDLSFVHINTLEIEPDLARCINPNYAQRQRIVPIS
RIGNTITLAMAYPIKLHELKELEQSIKSRIIPVIAMESEIIQAQQRLYKT
AASAAHALTLDEADLEIAPGSIVDILSSGAGEDEPDIDDEVRTITERDSV
IVKLVNKIIFDAHQNRASDIHIEPYPGKNDVIVRMRVDGSCKVYQRIPFK
YKYAIPSRLKIMAELDIAEKRKPQDGKINFKKFGPLDLELRIATMPTAGG
LEDVVIRLLNTGQAYSFDSLSLTDRNMRIFGESITKPYGLVLVVGPTGSG
KTTTLHAAIARINRPEVKIWTAEDPVEITQKGLRQVQVNQRIGLTFAAAL
RSFLRLDPDVIMVGEMRDEETASIAVEASLTGHLVLSTLHTNSAPETVTR
LLEMGLDPFSFSDSLLCVVAQRLARRLCEDCRELYRPDRKELSEIIEEYG
EEQFAATGLLGNEVVLARPVGCTTCNQSGYRGRLGIHEVLEGTDTMKSLV
KKKSDTEIIRRQAMADGMTTLRQDGILKVFQGLTDIHEVRKVCLK
>GSU1618 hypothetical protein
MAVKFFGQFLVEKEVVTREVLLQAIELQESVNLSFGATAMAMGLLTEADI
EKVHNAQRCEDLRFGDMAVKLELLTADQMQQVLTRQKNGHLYIGEALVKV
GGLSADDLPRYLDEFKADQAQYATDTVSIPAGLANPNIWEMMVDLSHKML
TRVALLTFRPEPCFMANRLPRKDVYAAMDFSGDVSGCYLMGVSTDAQARI
ARAILKEANVDEEPKEVLDDTVMEFINVVCGNIAAKAAQLGKSIEIAPPR
IVEASAGIVPPPDHLCLCFPACLAEGDHVELAVFIKE
>GSU0781 twin-arginine translocation protein, TatA/E family
MFGFGMPEMIIILVIALVVFGPGKLPQLGQSLGASIRNFKKASLEEPEKI
TVKEKDEHA
>GSU1646 lipoprotein, putative
MTTTRSLPFLLIVLALAGCGDFEWFPDTETNQGSLGTVADAERNAPVVSK
AFTITGGSAAISIANGEYSIDGAPYTSASGNVQTGQSVTVRHITSNAYSS
SMTTVLTIANESIPFTSVTMSKPPVFVNSSTATDFTFAPKLDVEPNTVIV
SDSRTITGNTSPAPISITDGEYSTNGTDFTSSAGTINAGQSLHVRHTSPT
TYQTIKTTRLTVGGVRTNFSSMTKAAPFVNYSTVSPVNADPTNVISVAQP
LAATKDFTTSTHVSFSIRYFLNNSSNEQKNIGLTIAGADAQNRRIYYGTI
DAAVPANANPYTSTHSFGAALTIAQYNSITQWLVTKIIIYQ
>GSU3317 hypothetical protein
MSRPLTRTVVALALLAASSAAHADGGPAAGGLAEGSVTYSAIAPNSITNS
KIVDGAVTDAKLGFGAVTTPKIADGAVTDGKIRGPIAGWKIGPHGHDATD
IVQGTIDAARLPVGTGPGTVAAGDHTHEFLPKKPATLIVVAPTGGDYVSP
IDALNAITDASAEKPYLVKIMPGVYDLGIATMVMKEYVDVEGSGELATRL
RGSAADAGVVACASRAELRGLSIEAAGAEGNIVGIFNGSSAPRIRNVSVT
VQGGKGTFGIYNLMAEPLLDSVTVTAHGGDAGFGIFNIHSSPVIRNATIS
AGNGVYTTSSGSATVEGSIITATLFSIFNDTATTTRVANTRLAGGRIVNS
GIMKCAGVYDAEFDPVQCR
>GSU0435 MSHA biogenesis protein MshE, putative
MESIVKEGSLGSILFKCQIISEDDIRRALDEQERTGGRFGEALVSLGIVT
QEDIDWALSNQLNIPYVRLKPAMVDRDAVALVPAVMARQHNLIPLIRAGE
ELSIAIADPLNVAAVAAVEKETGCAVSVSVALIREIREMQERFYGPPDTE
ERLGFTSSAFPPQALAAMNHDLTGGKFIDYLLLFVAQQKLSSLSLHPLGD
RVSVIGRRGGTTREVGQLAPSRYPDVVMHVKKLAHIDGARFSARGGLSFA
LKGRSIPFQVATLRGEGGDHLTFRMTVAALFPTSLADLGLTDDQVRQFAD
LAAAGRGMVVTGARDREIRRRLTDLYLQEHEAEGKTVLVVGSGAGTGEQR
FPRIPVPSDADLSAVVSACLEHDPDILVLEDVTDGQAFAAACRATLRGKL
VVAGIGCGDAVGALDQLIAFRDMHVLVPAYLRGVITCTPIRPLCPACRRS
EPFPAAERAALGIGADVTSCWRSAGCESCDQTGHDGRRYLLDVLVLDHDL
RERFEAARNGAEVIEHLRGQGWRGITDERQTLLAEGTISLEEYASSLHG
>GSU1982 general secretion pathway protein-related protein
MKARIMYEAYFNLTTKPFELLPNPDFIFPSKSHKRALMYLDYGISERAGF
ILLTGDIGTGKTTLIRNMIQLKDERTIISRIFNTRVEPEQLLAMICHDFG
IPAEGRGKVALLNELNDYLIEQFALGNRPVLIIDEAQNLSADLLEDIRLL
SNLETSDTKLLQIVLVGQPELREVLALPQLLQLRQRISINCHISPLSREE
TEGYIFHRLDRAGNRNAVAFSSDALDIVYRYSRGIPRLVNIICDFILLSA
FAEQTTEIPGEMVRDIIGDLDFENHYWGGAEVAASERAPEAARLAPGRSD
EATELVATLRAVIDRLESLEGDFARMSRGVLDEMSEKVASLENAFRFHVD
ETDSHISEIRRRIEKVQNLEVGTTGECQPQDSPKGLLKRMFGA
>GSU1792 clpP, ATP-dependent Clp protease, proteolytic subunit ClpP
MLVPIVVEQTGRGERSYDIYSRLLKDRIIFLGGPVDDHVANLVIAQMLFL
EAEDPDKDIHLYINSPGGVVTSGMAIYDTMQYIKAPVSTICVGQAASMGA
LLLSGGEKGKRFSLKHSRIMIHQPLGGFQGQATDIHIHAQEILKLKKRLN
EILAENTGQQLAKVEADTERDYFMSGAEAKDYGIIDNIIERNTPSGGTR
>GSU2550 drpA, DNA processing protein DprA
MYHWFALRSVPLVGNVLYRRLLDRFGSPEAVFRASDAELASVRGVSSAVA
ASIRGHDPRAFAESECERVRRAGVRIVTIRDPDYPPLLLQIADPPPYLYV
KGDPTGLEPAVAVVGSRRATVYGRTVTARLAEDLARRGVAVVSGMARGID
TAAHQGALAGEGRTVGVLGCGIDVVYPPENRALFARVADRGALVSEFPLG
MGPLAENFPRRNRIISGMCRGVLVVEAAERSGSLITAQMALDQGRDVFAI
PGNITSSGSSGTNRLIREGAKLVAGVEDILEELVPRAAAAVAVAPPLPPL
PAGEAALMAMFGADPLHIDEIIAKSALTVGEVSAMLLRLELKGVVTQLPG
KFFHAN
>GSU0642 ffH, signal recognition particle protein
MLENLSDKLDVLFKKLRGQGVMSEENIKEALREVRLVLLEADVNFKVVKD
FVERVRVRAVGTQVLQSLTPGQQVIKIVQEELVALMGGGEDNSLDLAAKP
PVPIMMVGLQGAGKTTSCGKLARLLKGQRRRPLLVPADVYRPAAIEQLKT
LGRQLSVEVFDSRADQDPVDICREALRYATLNGFDVVILDTAGRHQIDEY
LMNELVRIKEAAEPREILFVADAMTGQEAVNVASGFNDRLDITGVVLTKL
DGDAKGGAALSIRAVTGKPVKLVGVGEKLDALEVFHADRLVSRILGMGDI
LTLVEKAQATFDSQEAERLQQKLKKSQFDLEDFRNQLQQIKKMGSIESIL
GMIPGVGKAMKQLQGAQPSERELKRIEAIIGSMTPAERANHAIINGSRRL
RIAKGSGTTVQEVNQLLKRFTEAQKMMKQLQKLGPKGLMRGMKGMGKGMF
PF
>GSU3056 flhA, flagellar biosynthetic protein FlhA
MANTAVDAVDLQPAKSNSDIYMAVALIGVLALMIIPLPAFLLDLFLAANI
TIALAILLVALYTQQPLDFSVFPSVLLVTTLFRLALNVAGTRLILLHGNE
GVDAAGHVIKAFGQFVVGGNYVVGAVIFLILVIINFVVITKGAGRVAEVA
ARFTLDAMPGKQMAIDADLSSGLINEKEARRRRSRVSREADFYGSMDGAS
KFVRGDAVAGILIMLVNIIGGFIIGVWQNGMPLEAALSNYTLLTIGEGLV
AQIPALIISTAAGIIVTRSADEKNFGHEISGQFLNYPKAFYVSSGVLFAF
GLIPGLPHVAFFLLSGAAYMAGRLAKERAQVVEDDLMTLPAPAETGESGD
QAGAIRPLDMLELEVGYGLVPMVDAAQEGELLERIRSIRRQYAQKMGFVV
PPVHIHDNLQLKPHEYNILIKGAKVGGGEMIGQYLAMDSGAVSMPVEGVR
TTEPVFGLPAIWIRPELKEQAQLAGYTVVDSTTIIATHISEIIRKHSHEM
VGRQELQQLLDNLSSSFPKVVEDLVPNLLNLGTVLRVVRNLLREGVSIRD
LRTVLETLADYGGLTKDPDTLTEFVRQGLGRSIVEQYKRDDDTLCLISLD
RRVEEVVAEAIQPSDQGSYLAIEPNTAQLILSGIRQEMEKFNQIGTQPVL
LASPSIRRHVKKLTERFVPNLVVLSHNEVPSGIKIQSLGVVTLNAG
>GSU0426 flhB, flagellar biosynthetic protein FlhB
MSDDKHSKTEKPTAKKLDEAKKKGVPHSRDLTSTVTLIAAMVALYTTGGF
MFTTLKRTSGELLGSMGTFHLTEASVEHLLIKLFLVFLSVVMPFMLVVVI
SGLATTMVQVGFSMNSERITFKLDKLNPVTNAQKLFNKDSLVEMLKAVLK
IVIVGYMSYKIMRDEMDGLLFLADTDLAGILEVFKHLAFKLVIHTCGVLL
ILGVLDLAFVKWRFIDNLKMTKQEVKDEHKESEGDPKVKGKIRQMQFQQA
QKRLRKVIPTADVVVTNPTHYAVALKYERETMAAPLVLAKGVDHMAQTIK
AIARENNVMLVENRFLARELYAQVKEGQPIPESLYTAVAEVLAYVYSLKG
KI
>GSU0409 fliE, flagellar hook-basal body complex protein FliE
MIDGIESGLGIAQAFPSVTGEAKPGNLAADGGKFFGELVSKVSELQAQSD
TAIKGLVSGESKGLHEVMIAMEKSSISFQFLSQVRNKAVEAYQEVMRMQV
>GSU0410 fliF, flagellar M-ring protein FliF
MPEALNKLIQPFMALPPAKRWVVGGVVGLSVIAFTILILVANRTDYRPLF
TNLTSEDAGEIVTKLKEQKVPYRIAADGKAILVPSDKVYDLRLSLASDGL
PQGGGVGFEIFDRKNFGMTDFVQKLNYQRALQGELSRTISQISGVEQARV
HLVIPEKSLFKEDEKPATASVVLKVKGQRQLRENDVQGIVHLVASAIEGM
NPEHVTVLDQKGKLLSKNTPGDAAGKMTASMQEVQRAYERSTEERLQSLL
DKAVGAGKSVARVSAVFDFRQVERYEEKYDPETVVRSEQRSEEKQDGSTV
TGGVPGVQTNLGRTAGQPAGTSGGGSKNDETLNYEVSRATARTIEPVGTL
SKVSVAILVDGKYDAAAAGKDGKEAKPKYTPRSPDELQKIDALVKSSVGF
NVERGDQVTVVNIPFQDTGDVGAGEADKWWNAPIFLSLLKNGLIGFGFLA
LLLFVVRPLLKTLKPEKSTSFEPIPSAEDALNQIAEIHRLQIGNQTVSQM
ELINKIKQEPYQAAQIIQNWLRDKGEE
>GSU0413 fliI, flagellum-specific ATP synthase FliI
MSRIDLSRYLSAVDAMKPIRFHGKVTQVVGLVIEGFCPDAAVGTLCLVHP
NDGDPIPAEVVGFRDNKTLLMPLGELRGVGLGSLISVKRKKASLGVGPGL
LGRVIDGLGVPIDDKGPLAIREEYPIYANPVNPMKRRPIRQPLDLGIRAI
NALLTCGEGQRVGIMAGSGVGKSTLLGMIARYTEADVNVIALIGERGREL
REFIEKDLQEEGLKKSVVVVATSDQPPLVRMRGAYIATTIAEYFQAQGKK
VLLMMDSATRFAMAMREVGLAIGEPPTTKGYTPSVFAALPKLLERTGSFL
DGSITGLYTVLVEGDDFNEPISDAMRSILDGHIVLNRELAARAIYPPLDI
LASASRVMNDVTERSQQQFASRFKELLAAYRQAEDLINIGAYKPGSNPTI
DYAIAKMDGMINFIRQGIHDGVSMEQSIAELADIFDEGMAL
>GSU0422 fliN, flagellar motor switch protein FliN
MSDFTKEETKDGELDRKNLEFILDIPLQLTVELGRTKILVKDVLQLNQGA
VVELTKLAGEPLDVFVNSKLVARGEAVVVNEKFGVRLVDIVSPNERVEKV
L
>GSU0423 fliP, flagellar biosynthetic protein FliP
MDGVPIFKRIPFIALCVILLTASLAAAAEPLALPSVSIGVGKATKPGDVS
VVLQIFFLMTVLSLAPGLLMMTTSFTRIAVVLSFLRHAIGTQQAPPNQII
IALSLFLTFFVMAPVWQQVNTQAIQPYRAAQITQDEALKRAVAPMRKFML
SQTREKDLALFLNLSKLPRPRTADDIPTLTLIPAFMISELRTAFQIGFLI
FIPFLVVDMVVASVLMSMGMMMLPPVMISLPFKILLFVLVDGWGLVIGSL
IKSFG
>GSU0424 fliQ, flagellar biosynthetic protein FliQ
MSPDLVVQLARRSFEVTLMLAAPLLISGLVVGLAVSIFQAVTSIQEATLA
FAPKIIAVMVALVIFFPWMMNYMSDFTREVYALIATMRR
>GSU0425 fliR, flagellar biosynthesis protein FliR
MFPLTTPFPTANDVAFFTLVMGRMAGIFAAIPIFGGRRVPTPIKALLVFA
MTMVCFPIIKEKMPQLPTDVLSLGFLMVQEVLVGVSLGLLSLIIFAAVEF
AGQIVSVQIGLTIVTEFDPSQGGQLSIMSIILEMLATLLFLSLGMHHIFI
GALVQSYDVLPLGAWHMSGALLQFIVTTIGEVFVLAVRLAAPVMVTLLAT
SVMLGIMARSFPQMNVFFVSMPLNIGIGFIVLGLSLPLFLHTVQGHFGLL
DEQLKTMMKLMGKG
>GSU3036 fliS, flagellar protein FliS
MLTPFNQYQNTQVGTASPEKILIMLYDGAINFSKIALERMEKKDLAGKGK
YISKAQAIVSELMNTLNHDVGGGIAQRLEQLYIYVIDEYINANINNSPRA
LENAIRILTVLRDSWVEAIDIWKRERDAVPPSVHQPGYVAGQAR
>GSU1132 ftsY, cell division protein FtsY
MAEERKGFFKGLWGKVTGGDRDQEEAVNETTSAVADVAAPPEQEDRRPGL
FERLKQGLSKTRDSLVGRIDRLVLGKKEIDADTLEELEEILITADLGVQT
TVELIRGLEQRLSRNELKDGEALREALKEDIHGRLARDAHQLDVTGASPF
VIMVIGVNGVGKTTTIGKLAARFTAQGKKVILAAGDTFRAAAAEQLQIWG
ERTGVDVIRHKEGADPSAVVFDSIKAAVARGADILIVDTAGRLHTKVNLM
EELKKVRRIMSREIPGAPHETLLVLDAATGQNALSQAKLFKEAAQVTGIA
LTKLDGTAKGGIVVAICNEFRIPVRYIGVGEGIDDLRDFDPSQFVEALFQ
>GSU0328 gspE, general secretion pathway protein E
MEQIARRLGIPFLAEIGDNEADAALLARLPLAFARGRLVLPLRERDGRLL
VVSGNPADLSAIDEVRGVYGMEVELAAATPDTVLGAVNHLYARLGSSAQE
VVEELEGEDLSVIATELAEPKDLLDLTDEAPVIRLLNSILSEAVKERASD
IHIEPYERELEVRFRIDGILYRKLAPPKVVQEALVSRVKIMAGLNIAEKR
LPQDGRIRVIVAGRDVDIRVSIIPTFFGERVVLRLLDKQKGLISLENIGL
SEGGVRSMERLLGRTSGIILVTGPTGSGKSTTLYAALNRLNSPEKNIITI
EDPIEYQVKGIGQIQVNPKIELTFAQGLRAILRQDPDIVMVGEIRDAETA
EIAMQASLTGHLVLSTLHTNDSATAIARLVDMGIEPFMVASSLSAVLAQR
LVRRICPHCRESYTPERDYAGITLPSTLYRGRGCDACFGLGTLGRVGIYE
LLPVDGEICSMIIRREPAGAIKEYAVGKGMRTLRDDGLAKAAAGITTIEE
VLRVTQEEYADLPV
>GSU0326 gspG, general secretion pathway protein G
MHNTLRNRRGFTLIEIMVVIAILALLAALVGPRIIGRSDDAKVADAKVQI
KNLETALKLYKLDSGTYPSTEQGLMALVAAPTVGTIPKNYRSEGYLESKQ
VPKDPWGNDFVYLSPGEHGDYDLYSFGADGVKGGEGKNADIESWNLQ
>GSU0322 gspK, general secretion pathway protein K
MRRSESERGFALILTLVVTALLIAVTTEFIHGVYVDTSLHRNFVNLQQAS
LMAEGGVTGGISLLRNLRTSGNDQGLQQLLADPVQFEDEKGRVSITIEEE
DGKLNLNAVTLPNGDEHVFYGPAERRLLTALKLPTALHDSLADWLDANDE
PRPDGGESAYYQSLSAPYAPRNAPFATFGELGLVRGVEPAVLERLRPFAT
VFVDGGAINVNTAPLQVLMALDEGISEGIARDIMQRRRIKPFKSVGELSE
IPGMETIAGKLSGFAGARGSTYRLVSRAAVGDVTRLVEAVVNLDGTQPRY
LYWREY
>GSU1267 lepB, signal peptidase I
MDYKETQYGQSTPSEQAAEPVKKKHIVREYAESIIIAVILALIIRTFVVQ
AFKIPSGSMEDTLAIGDHILVSKFIYGTKIPFVDGRYLKIRDPKRGDVIV
FEYPEDPSKDFIKRVIGLPGDTIQVVQKQVFINGKPFSVPQEVHKEKDVI
PAAQNPRDNFGPVTVPENSYFVMGDNRDRSYDSRFWGFVKNSQIKGLAFI
KYWSWDREKFRVRWGSIGDIIK
>GSU3135 lspA, lipoprotein signal peptidase
MKPTYRIFNAVVLGSLVLDQATKVLIDRTMDLYQSIPVIDGLFSITYLRN
RGAAFSFLADFSYRLPFFILVSVVALGVIAVTFRKLRDDQHLAAAALALI
FSGALGNLIDRVRLGEVIDFLDVYWKTYHWPAFNVADSAICVGVALLAVD
MIREERRKAP
>GSU2043 pilD, type 4 prepilin-like proteins leader peptide processing enzyme
MTLPIVFYLFSFVLGAVVGSFLNVCIYRLPTGESVVFPPSRCTSCGTRIR
PWDNIPILSWLILRGACRACRAKISARYPLVELINGLLCLALFLKFGPTL
TFAALFVFCSALVAISFIDLDHQIIPDVISLPGIVLGFVLSFFLPWLGWL
NSLIGIAAGGGSLLLVAWLYERLTGKEGMGGGDIKLLAMMGAFLGWRAVP
FIIFASSLVGSVIGLTLMMLQKKDSKLAIPFGPFLALGALLYIFFGKAII
LWYLSIGAR
>GSU0146 pilT-1, twitching motility protein PilT
MELNDILTVAVRAKASDVHIKTGLPPVVRIDGRLRPIPNAPRLAPDQVRA
MALAIMNDRQKRLFEEHFECDTAYGVPGLGRFRVSVYSQRGTVAMVFRFI
PFGIPSMENLTLPPVIKKLAMEERGLILVTGTTGSGKSTTLAAMIDYINE
HRTCNIITVEDPVEFLHRDKKSILSQREVGFDTVSFATALKGALRQDPDV
ILVGEMRDLETIETAMHAAETGHLVMSTLHTLDATETINRIISVFPPYHQ
RQVRIQLAGVIKGVVSQRLVPRADGKGRVPAVEIMIGTARIKEYIDDKDK
TKLLPEAIAQGYTSYGMQTFDQSLMLLYTQKLITYEEALRQSSNPDDFAL
KVSGISSTSDSTWDDFVHDEAPPAEGEGSVEGIEKF
>GSU0230 pilT-2, twitching motility protein PilT
MDMNLLSQILGIAFEKRVSDLHFEVDNPPFFRAKGQLLRSKLPKLSPQDT
EFIARAVMEQNHRTLPDELRELDASYSLPNGGRFRVSIFRQRGSIGIVMR
VIPPHVGTFEELNLPPVLGEIAKAPNGLVLVTGPTGNGKSTTLASMIRHL
NETCTFNIITIEDPIEFLFTSDKSCIIQREVGIDTVDFSAALRSSLRMDP
DVIMVGEMRDLETIDACIKAAETGHLVFSTLHTQSAVSTINRLIGHFPPD
AQEVLRQRLADILVATVSLRLIKDKSGENILPVVEVMRATTTIQACIREG
RLDEIEKHIENGRSLYQMQTLDQHLLELCEKDVITFDQAKQITRSMDLER
KLAFTE
>GSU0436 pilT-3, twitching motility protein PilT
MARIDALFKLLKEQGASDLHLSSGAPPIFRLHGEMARQNFKVLSHEELTA
ILYEILTDKQKADFEERRDLDFAYAIPGLARFRGNYMMTHRGIAAVFRII
PSKILSADDLSLPDGVRRMTQFKKGLVLVTGPTGSGKSTTLAAMIDLINA
TRKEHILTLEDPLEFIHENKMSLLNQRQIGEHSLSFSAALRAALREDPDV
ILVGEMRDLETIGLAMSAAETGHLVFGTLHTNSAAKTIDRIIDVFPTDQQ
EQTRAMLSESLKGVVCQQLLKTADGKGRVAALEIMLGTPAIANLIREGKT
FQIPSIIQTAKRDGMQLMDQHLLDLFKTKRITAEEAYRCAQDKKQFEQYL
TEKPGQ
>GSU1492 pilT-4, twitching motility protein PilT
MANMHQLLTELVNRGGSDLHITTNSPPQIRVDGQLIPLEMPPLNAVDTKQ
LCYSILTEQQKHKFEEANELDLSFGIKGLSRFRGNVFIQRGAVAGVFRVI
PYKILTFEELGLPVVVKELAEKPRGLILVTGPTGSGKSTTLAAIIDKINT
ERHDHIVTIEDPIEYLHPHKSCVVNQREVGADTKSFKNALKYILRQDPDV
VLVGELRDLETIEAALTLAETGHLCLATLHTNSAVQTINRIVDVFPPYQQ
PQVRAQLSFVLEGVMSQTLLPNVSGKGRVLALEVMVPNPAIRNLIREDKI
HQIYSQMQVGQEKFGMQTMNQSLFSLLQKRRISLDVAMARSSDPDELKQM
LASAQRPPGQRPQMR
>GSU2050 secA, preprotein translocase, SecA subunit
MFGAIIKKIVGSKNERELKRMWPVVEKINGLESQVAGLTDDQLREKTFEF
KERIARGESLESLLPEAFAVCREGGKRALGMRHFDVQLIGGMVLHQGKIA
EMKTGEGKTLVATLPAYLNALTGRGVHVVTVNDYLARRDSEWMGRLYRFL
GLTVGVIVHGIDDDERRAAYAADITYGTNNEFGFDYLRDNMKFALEDYVQ
RPFFFSIVDEVDSILIDEARTPLIISGPTEDSTDKYYIIDRIIPHLKKGE
VKEVEANTLSGKRKVYTGDFTVDEKARSSSLTEEGVAKVEKLLKIDNLYD
PRHMEILHHVNQALRAHALFRRDVDYVVKDGEVIIVDEFTGRLMPGRRWS
DGLHQAIEAKEGVEIENENQTLATITFQNYFRMYEKLSGMTGTADTEAEE
FHKIYKLEVTVIPTNRPLLRPDFPDVIYKTEREKFNAVIEEIKGCHEKGQ
PTLVGTISIEKSEVLAEILRKQGIPHNVLNAKQHEREAEIVAQAGRKGMV
TIATNMAGRGTDILLGGNPEGLAKQWRRANPDAPEEEYEKVLAEYRTLCA
REHDEVVALGGLHIIGTERHESRRIDNQLRGRSGRQGDPGSSRFYLSLED
DLLRIFGSERVSKIMDFLKIEEGEAITHGMITKAIENAQKKVEAHNFEIR
KHLIEYDDVMNKQREVIYTQRREILAGQDIRRHFTQMMDDTIEEISSFAI
EKVSAHEWDWQSIGEGILKTYGFQIDIPPQTMDRLSPESFRTLLKEKVHE
AFDAKVAAFGDELMDHLIKVIMLQTIDAQWKDHLLSIDHLKEGIGLRGYG
QKDPKQEYKKEAYQLFMDMMARIAAETVEKIFWVQIAHEEDVERMEEEQQ
KQARKKMVFNLVDEDETSEPSKSKKLAGRNEPCPCGSGKKYKKCCGK
>GSU2617 secD, protein-export membrane protein SecD
MSKGLLWRFSLIALFITLSLLYLTPTLVSPLPSWWKGLLPKDRIHLGLDL
QGGTHLVMEVETQKAVEGTLDLIATDLEDALSAKTLRYKQIARQGGDRVG
MTFYDRGTADEAQKLLKDKYPTMTLVPPYDEGGFVHLQLRMNEKEAQERK
DRAVAQALETIRNRIDQFGVSEPVIAREGLTNIVVQLPGISDPKRAIELI
GRTARLEFKLVDETVNPAIATPGTIPEDTEILMEKRTDPTTGAVTEIPLA
VKKKAIITGDLLTDAQIRIDSQFNQPYVAIEFNSTGARLFDQVTAANVGK
RFAIVLDNTIYSAPVIRERISGGSAQISGSFTEKEAADLAIVLRAGSLPA
PVKIIQNVTVGPSLGEDSINKGLMAGAIGVALVILFMGIYYKLSGMVANF
GMILNVLFLMGALAALGATLTLPGIAAIVLLIGMSVDANVLIFERIREEL
RLGKTPIAALDSGYDKAFLTIMDSHVTTLITAAVLFQFGTGPVKGFAVSL
SLGVIINLFTALVGTKAIFDFVLNRLRVKRLSV
>GSU2616 secF, protein-export membrane protein SecF
MQIIGKTNFDFMGKKKITFVISSIIALLGLIGVGQIALGTANMGIDFSGG
TAVQLNFSQPVAIDQARHALAKHGFKDANLQEVSGGNKLLVKVGKATHVQ
GPAADAIEDAFRKEFTDNRFVIESSTEIGPAIGDKLRKDTLVAVVISLVG
IVIYIAWRFDFAFGVGALAATLHDVLAMFAVFFVMQKEINLLFITAVLTI
AGYSLTDTVVVFDRIRENLHKNVKDSLTAICNFSINEVLSRTIITALTTF
LATASLFFFGGEVIHDFAFALLVGIIVGVYSSVFVASPIVVIWGSRNKET
KA
>GSU1627 secG, preprotein translocase, SecG subunit
MMIFLTFLHILVCLALIGIVLLQSGKGAEMGASFGAGGSQSVFGASGGTT
FLSKLTTAAAIIFMLTSLTLAYLSGRAETSSIMPAKGVSAPAPKPAAPPA
QPQQTQPAPAQPAVPAPAAPAK
>GSU2837 secY, preprotein translocase, SecY subunit
MIDAFQNIFRIPELKKRVLFSLGMLAVYRVGCHIPTPGIDSNALAHFFAQ
ARGTLLGLFDMFSGGALEKLTVFALGIMPYISSSIIFQLLTVVVPSIEKL
SKEGESGRKKIIQYTRYGTIVLSVVQALGISIGLESMRGPAGELVVPNPG
WGFRLMTVITLTAGTAFIMWLGEQMSEKGIGNGISLIIFAGIVARIPTAL
LNTGRLIKTGQLSLFVILLVVALMFLVIAAIVYVERGQRRLPIHYAKRVV
GLKTYGGQTSHLPLKVNMAGVIPPIFASSIIMFPATVANFINVPWVQTVA
KSLTPGNLAYEIFYVAFIIFFCYFYTAVSFNPVDVAENVKKHGGYIPGIR
PGKETSDYLDRVLTKLTFAGALYISAVCVLPSVLVGKFNLPFYFGGTALL
IAVGVGMDTAAQIESHLITRSYEGFMKGVRVRGGR
>GSU0820 sppA-1, signal peptide peptidase SppA, 36K type
MRMIIAFLLGCLGMFLTTGCAFVSVPLMSAPQPLAEQVLEGEGTKKILIV
DISGAIGDQAKGGGLLSRGTPSTVSLVREVLLKAERDPKVAGLILRINSP
GGTVTASDIIRHDLLAFKERRNLPVSACIMGIGASGGYYVATAADGITAH
PTALTGSIGVLLMTFNVEGLLGKVGVEEKTIKSGGKKDLLSPFRRATPEE
ERLVQGVIDQFHGRFVDVVQARPGNRLSRHDLLTLADGRIFSAADALAAG
LIDRIGYLDDVIASLRDRIGDPDARVVTYFRPGSYQGSIYAESAAEPSMA
DLLGGFDMTGGGQFMYLWRPW
>GSU1234 sppA-2, signal peptide peptidase SppA, 36K type
MTAATIGDGIGYAEVKGPIIDSQETVKQLDDLRKKSSVKAVVLRVESPGG
VIGPSQEIYAAVKRLAATKKVVVSMGSVAASGGYHVAVPAAVIYANPGTI
TGSIGVLMKLSNIEGLMDKVGLKAFTLKSGKFKDSGSPVRKLTEEERAVL
QGVIDNLHDQFVRAVAEGRQLPVEEVRRLADGRVYTGEQALRLKLVDRLG
TLHDAVMEAGRLAGIEGEPTLIIPPKKRKLLRDMLFGEVAEAVRGSVRKE
EGLSFSYELE
>GSU0028 tolQ, tolQ protein
MTLFAGTGLVVKLVLVVLIFFSVVSWAIIFFKLLQINRANGESDRFLDFF
WKTKRFDAISSQLDRFGNSPLSVLFNEGYAELRRLLDKGGEQRDEPGVVS
TDLGGIDNIARALRRATTSEITRLEKYVTFLATTGSTAPFIGLFGTVWGI
MNAFKGIGETGSASLAVVAPGIAEALIATAIGLVAAIPAVMAYNHFQHKI
KVLIASMDNFSTEFLNIVQRTFAGK