Gene list
Applied filters:
COG category: Intracellular trafficking and secretion
Gene type: CDS
Genomic element: chromosome
Number of genes found: 100
![]() | ||||||
Show UniProt / TrEMBL protein name | ![]() |
View in Fasta format (DNA) | ![]() |
View as list | ![]() |
|
![]() |
# Geobacter sulfurreducens PCA, PCA >GSU1496 pilin domain protein MANYPHTPTQAAKRRKETLMLQKLRNRKGFTLIELLIVVAIIGILAAIAI PQFSAYRVKAYNSAASSDLRNLKTALESAFADDQTYPPES >GSU1500 hypothetical protein MVPFTDHMRGRPFVEKLGYLPASDALKVLVADQKQSVGASLIMKVMMYFG GKVSGGNLKVTTDVDYKAMSRTVHAALKLDPYNMDGYYFAQAILVWDVKQ YRLANDLLEYGMKYRTWDWYLPFFAGFNYAFFLKDYPNAARMYMRAGDLS GEPLFKKLAGRYLQQSGQTEIAIAYLTTMEKGARDKAVKESFRIRLTAFR RVLLIEKARDGFIAEHGRLPSSVEEMLAKGYLKSIPADPYGGKFYLEPTG DVSTTSKFAFAGVQNN >GSU0814 outer membrane efflux protein, putative MKALRRTALALAVTLLPLAAAWGADAGAGTKLTLDDCIKKALTAAPELGE AQADIDATSAKLDEAKAHRYPQIEFLGLIGPVPQARGNQVYSPDGINDTE RWTWFQRGDATLVQPLYTFGKISENMKAASHGIEVDRAKKDQKGTDVALQ VKEYYYGILLARELRELVLETRDILDDAKTKARKLIDKGSPNADELDIYK LDAFRGEVAKYLEEATKGEQLALAALKTRMGLSPAEAVDIDAERLQPAGA TLGDLTAYVEEARTRRPEFRQLNEGIKAREALVEAAKANYWPDLFLGGYV SAAYADKRTRIDNPFVPDDFNHVWAGIALGVKWKLDFGITGAKVAGERAQ YNRLLSTKAYADEFIPLQIRKAWLEAREAEASATAMKEAYTNAKRWVVAS VANYDFGVGPAQEIFDGLQNYARMRAAYFQAIYNQNMALARLDHAVGQAP LN >GSU2185 flgM family protein MKIDTNPPVTTVNQVKGETSQAASGADARKTGAAGGQATDTVDLSRNAER LVKANATLRTMPDVRVEKVEELKKQIAAGEYNVSARDVAEKMLISMRNGV TA >GSU0327 general secretion pathway protein F MPTFRYSAYTAGGRETSGTIEAESLKEAKLHLKRDGLYPRDIGPVSETAG TVSRRFGGRNAGPAQVALMTRRLATLVGSQVPIYEAVTTLWEQEEPGEIK KALGRIRERLAEGANLAKALSLEPRLFSESYVAMVAAGEASGALDAVLER VALFLEEQRAIRSKITASLAYPTLMVLVGSAVMLFLLAFVIPKIVTIFED NRAALPLITIALIKTSTFLRSFWWACIAAVAGVVLLYRRLMKDDAFRLRR DRFLLRIPVVGSLLRQLILSRFAKVLGLLLSSGVPVMRALEITAQVVVNR HYRAALTGVTAGLAEGGTLSGALRTTGLFPPLLVHMVAVGEKGGELEEML GKAGSAFEREFESSVSGLMALLEPLLVLAMGLAVGLVVVAVLLPIFELNQ LIR >GSU2695 outer membrane efflux protein MSMFGITQSSRATEQQSDPAATVCRAASKRSQRVSGFAPKPFSVAQVLYV SIMTLTLAGCTTMAPKYERPAAPVPVAWPEGPAYKQTASATEKPVADIPW QEFFVDEKLHKLIALALENNRDLRAAALNIERSRAQYQIRRSDLFPKVDA SAGATFTRQPEGLSATGRADTIDQYSVGLGVSSYELDLFGRVRSLKDQAL EQYLATEQARRSVQISLVSEVAVNYLTLAADRERLKLAQETLANQQESYQ LTKSRFDAGVSSALDLHQAQTSVDAARVDIARFTTLVAQDGNALSLVVGS PVPAELLPSALSDTLTALKDVAPGLPSDVLLRRPDILQAENLLKGANANI GAARANFFPRITLVSNVGFGSDDLAKLFSGDSFTWSFAPRITLPIFTAGA NQATLEVAEADRNIAVAQYEKVIQTAFREVADALAQRGTIDDQLTAQQSL SDATAESHRLSQARYEKGVDNYLQVLDSQRALYSAQQNLIGVRLVRLLNL ATLYKVLGGGSE >GSU1063 hypothetical protein MSLVEMLIALLILVVGFLSVIMVLWMSINSGRFTRDMTMAASLGQDMLER FTARSYGSLPATGGAFEPYTTANASAVGYVREVKVEDNVPDVGIKTVTVR VRWNSNGHERSRTFTMLKRDY >GSU1493 type IV pilus biogenesis protein PilC MPKFNWEARSRTGSVQKGVMEAASAAAVEAQLKKYGFGSISIKEEGKGLS MEIKLPGFAPKVETKDLVVFTRQFATMIDSGLPLVQCLDILSSQQENKTF KDVLIRVKESVEGGSTFADALSKHPKVFDQLYVNLVAAGEVGGILDTILN RLAAYIEKAMKLKKQVKGAMVYPTTIMAIAVIVVGVILIFVIPTFAKMFQ EFGGELPGPTKFVINLSNFIVKYILLIIGLIFALIVGFKKYYATTGGRKK IDAFALKAPIAGPIIKKVSVARFTRTLGTLISSGVPIMDGLEIVAKTAGN KVVEEAVYKVRQAISEGKTMAEPLQECGVFPPMVVQMISVGEATGAMDAM LSKIADFYDDEVDEAVSAMTALMEPMLMVFLGTTVGGLVIAMYLPIFKLA GTVGG >GSU2122 TraG family protein MSLFKKRIINPAGGMTDIGNGVNLDRRTKIVPIAIPDADRKRHTFVFGTT GVGKTRLCENLIEQDIRKGYSVVYFDPKGDQQIFTKIYEVARDSDRLEEL MLVTPIFPEYSAVVDPMAFYFMPDELVGHIVSGIMGGREPFYRNIAKEIT TAVISAYIIQSKRHGNLPILNIDEIRKRIRRESLDSTMKSLRSIGNAEAD LTAGMIEDILKSPMEYYAKVSSTLRTALMELSSGNIGKIIGQADSNRFIK NLESGKPVILVVHTGAMITREASATLGKVLLSMIQSFVGRVYLSNRQKVN PPLSIFIDEAQSLLYQGVEELFAKAGSADVMVTAFAQSVNQVYAVIGEEF GKSILDNTNTKIFMRCSDAETSDYVVKHFGVQNVLTGIFGSNQVTTREVE QDILRVQDVLSLKPREFYMMTYSGRFKGTTNDAHEPKMKIIFPEAPAVIT SKLSQSKPATPPSP >GSU2882 cytochrome c family protein MLAVGLADTAAAASTCSDCHGMPPIDAAYRNITTGGFKGSHQTHQPATAP AGACAVCHTGSGSYAMDHMNGTIEMASNINASPLAATYGKGVFFNQTSNP VMGTCSNVNCHFEKTTPVWSSGPLTVPAGCSICHGLPPADGSHPAATAGS GRKHGDYYGTGTGSCVKCHPDHAAAAKPFAHASSAGNRGLILRFTAAPNT AGTYSKTANLNYPNYLPSQTTAANRNGTCTAMYCHSDGNGGAARATATWG GTLAADCTGCHGGNAASAAPIITGLHAQHVNAAAVLGTTIECARCHNGPV SAGNDRAVTGVAAHVDGVKTVAFVGGGTWNATAKTCSATTCHSSGKATAP QPPTPAWTGAAMGCNGCHGTLNPLGTPDYASGGSGTALANSHAKHVAAAA DCALCHANTTTTGTAIKAGSTLHTNGAIDVNLDTTNARVGATATWTAGTK TCATVYCHGATLTGGTTKSPVWGATLTGCGTCHGFPPATSVHTGKVATDC TGCHPHVNASGTGFTDASKHINGVVEASGGHAVPYYTHKSPAPTTAQCSG CHANASATAPYPAGTSPNYTAPDCRSCHVVKSPYGVADCSSCHGTAGATG VNLGRPTGSTFPNIAGQHGKSDHRVACSTCHGTYGTGRTDGTHGPGNHTP STVSTRATKVNVTLSTWNPTTRTCGTVCSYNHGSKTWY >GSU2038 hypothetical protein MRQSSVLGCARPAGLAALLTLLLAAGGDGAAAATMNDYCIQPPFVSQSVP PLVMFEVGREHKLYYEAYNDANDLDDDGRLDTTYKHSIDYYGYFDPYKCY THSGGSGSNDKYTPVSTTADKFCSSGQWSGNILNWLTMSRMDVLKKVLFG GQRSADSNTATYLERVYVPQDAHSWGKEVTGRLCSNGTNYTDMCQFDSDC DTGYTCVDKSVNLIGITASDTGTACSFTSSIKWDTTGKILVAKYTHSNFS CGSDSTDLISSYEPANLVAGFPVYVATFGDAILNPAADHGDQFNYLALAE FSVSKSDKGNWMFAIDGDDGVELEIINPAGDASTIVASRYGCNSACNCQT NSGTINLNTTGYWRLIARHSEKSGQDGVKVWYKKPSKTQSSDPWVLFGSS TLTLRAPTIPAGAECTLKDRSFIETGKPKVGTTPKQHLFCSTTLSDGGTP ILRFLGNKENRIWEWVSKERPVCDSSLGAPTDYTVQVEVCKSVSPDNRPT GKKDDLASGRETNCKDYAGTFKPVGLLQKFGEGEGAKVCSRTLAKSCTSD SDCGAGEGLCIYKSPMYFGMFTDSYTKNLSGGVLRKNIGSILDETNANNG IFQTSENVQGNIMITLDRLKTIGFRYTDQSYQDASGGSCGWITDRPLNEG ECRMWGNPIAEMMYESLRYFAGKGAPTTEFTYTTPADSGLSLSKPSWGYS KGSTTYQLYDIYPPCAKPFMLILSDINPSYDSDQIPGSSFKKTDGTYFSE DAASPQLGLGVAGADGVSLLNKLADTIGTSEGIIGDSWFVGENGSTTDFV CSSKSVTKLSLLRGMCPEEPTKRGSFYSAALAYYGLTLMKEKTGKPDVST FVVALSSPVADLKIKAGNSHVSILPVGKSVSGSHGINASCAQKCTLTADE DGLHISNCSSTAYCPSNQIVDFYIDSLKYDNDKNVIYAKYRINYEDVEQG ADHDMDAIVTYEVCTQSAIDQGLGACSGSLGSNIQIKLNSDYAAGSIDQV MGFVISGTTEDGVYLPVRDRDVSSADSDTPATVAGLPLNWSKTFTISGNP TGTLKSPLWYAAKWGGFIDANNNKKPDLASEWDKDGDGEPDNYFLVVNPL KLEQQLQKALTDILNRVSSGTAASILSNNDNNGATLLQAIFYPRKNFAET ELAWTGELQAFWYYIDPFLNTNSIREDTDQDLRLKLKTDYVLDFRFDTND NKTKIDRSLDVDGNGSGDSYVNTIEPEQVNALWKAGSLLWSRNLSTSPRT IYTSYRDAASKDQLTVFTTAGKDLFKANLQAADATEEDKIINYIRGTEQS GYRNRTVTIGGSTGVWRLGDIISSTPRLQSNARLNGYHLPPPVGYKDSSY QRYLDSNEYKTRGMGYVGANDGMLHAFNLGVLKAGTTKDVTSFITGSDFG KEMWTYIPRNALPYLKYLADPEYDHLYYVDASPSLNDVSIEVTEGTGCTD AAYWLCTKQTVYQAGTDSTTKELDLDKTSWRTVLLGAMGLGGASRNTTDA CSASTDCVKTPIANVGYSSYFALDVTTPTSPSLMWEFASADLGYSTVGPA IVRIGGETNGRWFAVLASGPTGPINTQTHQFLGRSTQTLKLFILDLKTGA LLRTIDTGIQNAFAGSLSGGTLDTDRSAGTTGKYNDDAVYLGYVRKDTTT GTWTKGGVLRLFTKENIDPAQWWWATLVDDIGPVTSAVAQLQDTTHKNHW LFFGSGRYYYKAGSDLDDAAGRRALYGIKDPCYDLNNKMKTTCNTPTVLA TDLVNQTDSIQGMGTAPGWYVLLDEASGSAGAERVITDPVAAPNGAIFFT SFKPAADVCKFGGDLALWGVNYSTGGYLAPSQLIGEAIIQSSTGSFEQID LGSSFTQRLNRKTAERQGVPPRNKPTIVTNANIKPQKRIIHIREK >GSU3120 conserved hypothetical protein MKKQLCMIALVGATAFGTTGTSWSLDGPPPPEPPPMGKGQEHFLERMASV LKLTDAQQAQIEALISTDAEQNAPLHRQLAENERALREATTAASLDEATV RALAATKGNLMTEMIVSRAKLRNAINAILTAEQRELADRLDPLKYGPPRP RPDRPGME >GSU2221 general secretion pathway protein-related protein MYREFYGLREKPFSKTPDPRYLFLSRGHREALARLQYAVEERELALLTGD IGCGKTTISRALMDAMGEACRFCFIFNPRLSPLEFLRVIARSLGIDNPAG AKDELLKQLTETLYLMHAEGRCPVIVVDEAQLIPDRDVFDEIRLLTNYQL DDQNLMSVVLMGQPELRQILADPVHEPLRQRIALHYHLQPLSLDETLEYI DFRLEVAGGTPGLFSPDAVQRIHELSGGVPRKINILATNALLVGYGRDAA WIDASLVEELRDEANLY >GSU2030 type IV pilus biogenesis protein PilO MDARIEKLLKLPNKQKLALLAAILVVEGAALYWGLYAPRQKELTALRGKL EKLQTEVQEKTRIANNLPKLKKEYQQLQKDLENALTELPNQKEIPSLLTG ITSVGKGAGLDFLLFRPKGEVPKDFYAEVPVDISVSGSFYGVANFFTAVG NLPRIVNITNVSFTDIKPVGGKTTVKVNCLATTFRFIEKKETKDDKKK >GSU3398 metal ion efflux outer membrane protein family protein, putative MNRAVSLPVAALLFLAVTGASAGNAGADTRIFTLTEAVEHALSNNGELKA ARSGHEAAQAGTVRAGLYPNPVLELEGATGALTGSSNENSMELAISQEVL LGGKRAKRLEAAERDVEAVRWWLADRERLTALEVKTAYADLLLAQQRVDL AKQAAELGSRLLTLTQERFAAGDIPELEVNLARVEAARSDSERMAAEREI VPVRSRLSTLMGLDPGTEAAVVAPPDEQPVTALVAGLATKAREHRPDLKA LTAEQARGEVEVSLARSEGVPNLTVGLFVAHERSTDAIGTDEEKTRDTIV GVRLSLPLPLFDRNQAGIREAGARRGGTEARLIAARTALEREVAAEYARM AAAEEVLRLYAKDILPRLKENLALVQEAYGLGEIDILAVIEEQKKYVAVH GSYLAAVHARQTARARLEAAVGAPLDELTTGGTQ >GSU0330 general secretion pathway protein C, putative MKKVYLLTALLIALNSLAAARIAAGLISYRLARTAPHTAMGATGAPAAAV SDDILSFASILDQGLFGRATQGKLTPLSAPVQGAAGAQQPAPTHGDLTLL GTARGSFRETFALIRRATPPEERVFRLGDTVFGIGPLVGVQKESVEILAN GRKIRLTTPLAMGIEPTSAPPPPAVAPQTGAVQVGAGSYVIDQRALNAAL ENLGQVMTDARLLPSVKEGKVEGFRISEVKPAGVFSMIGMRNGDILLRIN DLPVDSPERAIQSLASLKGQNRIKLDLVRDGQPTTFSYDIR >GSU0025 tolB protein MKHVRIFATLLALLVISVTPAVIHSEDVYREVTASGAHNLTLAVDNPRNL GGADDAALARDVAEVLRFDMTLAGPFSVMAPPAGTSPGGIRPGEFDLDSW RNAGVDLLVKSGYTITGDSVTMEFRLYNATQGRELAAKRYTGKRSDLRRI THTFSDDIMQTMTGERGPFTGKIAFVSTESGNKEIYLMDYDGHNVQRLTK NRSINLNPDFSPNGRELAYTSYRRGNPDLFRREIFTGTEAVVSSHRGINV TGTWSPDGKQLALAMSRDGNSEIYAISRDGRDPRRLTTHQAIDVSPAWSP DGKRIAFVSDRLGKPQVFIMNADGSDVRRLTTSGAYNVSPRWSPKGDRLV YCRQEGGFQIYSIATDGTGDTRLTSEGSNEHPRWSPDGRFLTFSSTRDGG EAIYVMRSDGSGQTRVYRSKGKASHPTWSPRW >GSU2899 high-molecular-weight cytochrome c MINVYMKWHRGCALAGAAVLSALACAAALFALAGTAAAGTITDCSGCHGM PPVDAPYRNISTGRFMGSHDTHAWLSAGTPNCAICHKMPATYNHRNNNVD FVTTINNSRLTARYRNLTSFTRTASPSFGTCSNVNCHFEKTTPQNGATPL YLDGGKTIQQKCATCHSAPPADARHAKHAQYYGDVTTACVKCHPDHAAKP GKGAFSHATSAGRRGLVIQFTAFPNTGGSFSGDVSYPLYLYNPSRTGACT NLYCHSPGTKAGSYDPPNQSPDWAGTLGTSCLGCHRGDADSGSPMQTGSH GAHVGVNAAAQIGCVQCHGATVTNSRQIGDPTQHVNKKADISFEAAIGSA GTYGGAAGHVAKDVGTAYGTCRNIYCHSDGATTTPPFNDYAVTWGAADFP TGCTGCHGGQQGSGNVIASNNHRKHVDASYNGGLGTGIGCVECHAPTVSG NLTIADKARHVNRFKDYSGQRAGTIAAGTCSNVYCHSSGQRSPAYRTMVP WSDTATTYGCSSCHGASTAGPAGVFVSRFGEPNYNNYSSADRNWFNSHNP KHVRSAGDCSTCHAGTTQNGVSLVPGTTLHANTQKNVTFNTAVAGGNSPS YNDLSRRCTNVYCHSNGRQTGRAYATPRWGGSAQNCNACHPIAGLGGAHG IHVGGMIPTFYAYTGNHPVGAAYRYGCANCHPMDPVHHIDGHIDLSLSKN DVNAGGLKTKNGATNGSGLNSAGSGLTGTTGASVRCASVYCHSNGYAANL VYAATPDWYGGSFAGDRCAACHGNSPNSTIAGSSSHYNNRFLGYTGVAGG HQIGIHAMNIYSSPGKRATAGTTGSSSHGNAATATTISCNICHYETVTTA RNDDNAVCKTCHYDGNAVGALSGNRAAIADRSKHVNGLVDVAFKPVAVIS KAQMRPASFAIATYSSVWKRNGGYKVSGANDSAKQALDTATMWDGGTKTC SNIACHNGQSVKWTDNNGITECVSCHSAM >GSU0314 general secretion protein E N-terminal domain protein MPIRLGEMLIKAGMITHTQLDEALKGQVIFGGRLGTNLIEMGVIGEEELA RVLSEKLRVPCVDPDELMAVPDHLLSLVPRDMVERYKIVPLGVDGRRLRL VMADPSDLPAIDEIAFRTGFVIVPMVAPEIRLFMALEKYYGIRREVRALP VSETLGGRRCTYGTKRPAEQLMRRDVVDFSTLPDNGKYLPWEGGDIDDNR IEAAERYTIDALSRTLADCRERDAVADALVDYAGRLFGCAGLLLVMRDMA AGWEAVAGHERLREFGQLRISLQEASHVRTVVEERSVYLGPPGNMPTDRR LAEALGGAGAPGLMLVPMVMGRRVVTILCAAGEMVALGARLSEAQTIARK GVLAFEILILRGKILMT >GSU2137 metal ion efflux outer membrane protein family protein, putative MEVISLGTTVFARGALLLVPGIAVLLSSTLVRADEPSLSLPKVIEYSLQN NGELKALREEKGVRDASKFRAGLLPNPTLDLEAGTGALTGSSDENNLTIG VSQEFLLAGKRDKRLTVAERELEAYRWQLADRERTLREEVKAVFYDVMLA EQRLKLTDRSIDLNRQLLEVTKDRLAAGDIPELEMNLAKVELTRSEGARI EVERALLQTRSRLFAFMGLPAGEAPAIAGTLDNGFSLNKNLADLKQLALG LRPDLKALEAEKGRGDADITLAEAEKVPNLTAGLFYTHDRRTDATGTGEE KVRDNLLGIRLSMPIPVFDKNQAGLQEARAKRSSSESRLTAATRIVERDV ETAYISALNAEKVLSLYKSNIIPQLEENLKLTQEAYRLGEVGILSVIQEQ KKFFEVSDGYLTALHDRQLALVKLESAVATDINGGVQ >GSU0702 cytochrome c family protein MVVGSEVTVNTRSWKMLMMWCSVMFVGISPVLTGVDEVRAAIQCYQCHGT AATSDYRPEDSAFRAISTGGFKGNHRTHMASSATANSCTICHGTSGYQAS HRDGTIAIASNINASPAVGTYSRGTSFAQSASVVPGTCSNVNCHFETVTP AWGSTPLASTADCGTCHGSAPATGSHPVGGSKHGAYYGTGTSSCGTCHPD HGAKEGTARFAHATSVGRDLQVSFAAAPNSGSGSYSGPVNDYLPSQSNVF GTCSGTYCHSPGTKSSAFDAPNQSAAWGGSLACNGCHKSDRSSGTAMTSG SHRAHVDGFGLGYATVRCVRCHGATVTSAMAIGTYGNHVNKLVNVAFDST TTAKNGTYGGVSSPMTKAPGSAYGSCTNVYCHSSGQGNSGSWPPTYTTPT WGSAATGQCGTCHGINGDTHSGYGTPTIMTSGSHTRHLAYSFGITSDETR CATCHATTLTGFTPTVCSSSVCHNQISQKHANYEVNVGFPDYYGATAAYG GTPKPGDGYGSCTNIYCHSDGRDTLHYAQTPLTWGGASLGCTGCHGSATA PQNGSGGTTLSGKHAVHVNNAAIFGTNNSLGCVQCHNRTVSDNSTLLGTT GTQYHVNKTRDYYGTMAGSYNSGTKVCSNVYCHSSGQATPVYRTVAAWTS ATGYGCNSCHGADSAFTAAAGEPNYANGGAGTATANSHQRHVASLGITAT TGCAACHCRTVDATVTNKLRNYSTLHLNQARNVAMKAINGKSGTYDSNAK TCSATYCHGTSPSPAWGGSTACNSCHSARATDAHWAANSAHKIHWEGSVL PSSYAMTPGNGTGDAATYRFACSSCHSPSGGSAHAGGALAASQAAQVYFG YSAAGVRGSYTGTGTAYSNDNGFNWTAGSSGCTATYCHSGGAGQAGRSAV AWSTSAKTSYNCNLCHGNAAAPNNDWRRGAPLYASGSITYKGQAKGNAHG AHISAQTSLSAVYMQCAHCHGATTATSTAISDKRKHLNKSYDVSAGGTFR DGDNTANSTAVTIAYSYNVAGSSCSNVSCHPVGLDVTTTPSTPLTRSTST AAWNSSYKCTDCHRIPMQDTDTYHHAMYDYGRTYPTTIPDGSATSGTNYT SRTCTMCHVDHTIFSPDLNVNSAGRSYNLRTAIGSTPTTTANFTNSDYSA SGGGICISCHSTERTKSTVRRKQETNATKTPPVTLANYSGSAHQYNVPAS FFSDGSRFQGNCSKCHNALVNETSVFMSYTGSGDSFGNHNSGIRRLQGSL GATGGEVAEEQICYRCHSLSSDADPGGGPPKAVANKDYYGVAPMSLASQD IFAANRDFRAANPTYSLTNKLYFKPAAAETPAEAMPNQHNTGDSFSGGTW VGRVMSPWETTVTYETKSQGTNLEGTSYWRMVTFTSPAVYSTTTVPAGNW IINLYCRESSTAQNARVRYMVYKWNNPADTMGATIIAKGTYATELSTAGA PGTVRQIPVSVGTLTLNAGEKIVVDLSLETTTTSTNGYTASFYFGSRAPS ELTLPGSVAWTYADPGAPGYGHATQHYSGIHLPSRQNETLAYIAQNRHIE CVDCHNPHATRNGLHGDYGIATGGSSTTLVNSNKNWVPNQWVGDYINIIS GTGAGQTATISGNNATTVTVAGWTAPASGSVYRIIANNNAVSRSQRGVSG ASVSYSGAWTNGTYTMVSEAAYEYQICFKCHAATNPTTNTLRYWNVTSPS LGAARWTNIGLEFNPNNASYHPVIQPLPSTGNRRLQATALTGGWQPGEVM TCSDCHGRDTSTSATAQGPHGSTVKWLLTGMYQNWPFTSAAANGTSSGGT LLTGTGTATPPANNFCFNCHTWAAGGNGHVKSSGGHSRPCVNCHIRVPHG GKVPRLLTGANAPSRYKPDGNGGAFSGSYLTSAILPASGYMGANNVSCSS ICGARHTDNSIKAYSW >GSU2031 type IV pilus biogenesis protein PilN MVRINLLPVRSSKKKETARQQMAILLVSVLVVLGIGVGLFGYAQAKIKAT KNDISGAESELQRLKGKIGELENIKKLKDDVTKKLNVLTQLRKEKTGPVR RLATLSDATPEKLWLTKYSENGPNVSIGGVAVDEDLIAAFMRNLQQTEDY TNVELIVSEQTEIGGVKAKRFELTCVIKALKKEEPAPAKKK >GSU0797 membrane protein, putative MSIYANFFVAIWLKFFFLLTPFFVMSVFLSMTQEIAPRDRRRLALKVTVA VLVACFTLFFFGNHLFSLFGITIDSFRIGAGALLFLSAVQMVQGDAAVSP SDRQGDISVVPLAIPVTVGPATTGALLVMGAELQGAWQTVIGCAALAAAV LCIGALLGSAASIEHVIGKRGITILSKLTGLMLAALAAQIVFTGIKSFLA IH >GSU2710 hypothetical protein MAASQEPTQESEPAWAPQQAQAPSFAPADSDAEEAADNLAVSSGIPSGPA ATDEDFSFDFGGEPTAGSENQAGQSTASEPEVFDFGGFEEESVTEEQSRP AEPEAASAVGGFGDGIAFGEVSPEAFAAATDAGGREEGSTGDFALEFEEG VAKEDASPKIEAEEEAEPFDFGELDLGIDETAPEPEPRDRTQQATVAAVP EQPVTEPAVTTAPLSPLDVPFGEEEAPPLVISSRRKGRSLLPLVAIVVSV LVIVALAGFGFYFFKEGPAAFNKLGIGFVAEWLGFEAREEGGIGLDKVRG GYLANAEAGEVFVIRGEAINNYRKPRASIQVKGALLGSGGQAVVQKIAYC GNSLSDEQIASLPMAKIESAMNNQFGDSLSNLGVQPGKRIPFVIVLANIP REATDFTVEVTGSTVATQ >GSU0022 mttA/Hcf106 family protein MFGIGMPELIVILVIALIVIGPQKLPDIARSLGKGLAEFKRASDDFQRNL AEEVRTLDEKEKAEKGEAAAEPVKRDLAAEVKAYEDQAASGVHQGEPAPV AGSPESEKKSA >GSU1482 outer membrane efflux protein MKPLPIRSIPFLILALCAATPAMASALSLKECLSLAAAGNYSLAVTASDR LIAQEAITQARSGFLPRVDFQGGYTLQAKPQAVNFGERGSIETQDGTYGF FNLSVTQTIYDFGRTSSRRQRASLLHEATAQQYAAAEKDVFLQVVEAYFG ILEARRLLGTAEEEVAQRTDHLRIATNLFEQGVVTRNDLLQAQVRLAESS QKRLVAANRLENRWMYLNHLTGRPLDQRDELEEGTESAMPDTAGAEERAF ANRPELTALARSADAADAEVSEARSGYYPELYAKAGVDYVENSKVREQAI YAATVGLKINLFEGFATESRHRQSVERLTRSRDALRLAREQVRLELATAL NDARVAEQRIKSVETAIRQGEENLRINRDRYQAQVGTATDVLDAQTLLTQ IKTDYHRAVFDFQVAKARVSKAMGEL >GSU2032 type IV pilus biogenesis protein PilM MLFSKKKEIVGIDIGSSSVKLVQLKEQKGGWQLVNIGIQPLPPEAIVDNT LMDSSSVIEAVKGLMKGLSVKVKDVACSISGNTVIIRKIKLPAMTPEELE DQIQWEAEQYIPFDINDVNIDFQILEPDEDDPSRMNVLLVASKKEIINDY VNVFAETGLKLVIVDVDSFAVQNAFELNYETDPEEVVALINVGASILNLN IVRGGSSLFTRDVQVGGNLFTEEIQKQFALSSEEAEQVKITGEYPDKAKL KDVIARVNETLAVEMRRSLDFYNTTAGEGRIARVYLSGGAAKTAMLAETV QNKLGVPVEMLDPLTKITCSEKEFDPEYLREIGPLVTVAVGLATRRVGDK W >GSU1777 hypothetical protein MKSSSVTALILRIDRRFRGDSRGLSLIELVFTVAILGILAMAVVPFTQMA AKRSKEIELRRNLRVIRTAIDDYKKDYDKAIKDKKIMDVANRSGYPESFE KLIEGEDFGGLYAYKKKYLRRIPVDPFHPPEVGEPPKWGMRSSVDDPESD LWGGEDLYDVYSLSDGIAIDGSKYKDW >GSU2028 type IV pilus biogenesis protein PilQ MRIQTTFSRKLVLMSLLAVFCGCADVHSSVKGDAMVEQAKGGVLREIKVA ETGEGARVVLSADRPLAYTFYKTADPPKAVIDLARTEPGAIASPMDVNSD TIRRIETVRYGEGATAMTRVEVYFTRDTEANATIDASDKGMLVLAVARPV APAASVAVEAAPAAGQAGESSTGAPVAVAAAPAVAPAPVAVEPQAPIAPV VAPQTGPPVIKAIEAQNGYLVIETSGEIKDFKFFRLAKPDRLVVDISDAR LGINSKAIPLNALGVGMARIGAYPDKVRIVLDAAGGTLSPLNVTKGAAGL IVAPADKVVDVPRAAAPPQPSASASQARQPEAVKSAEPAKGLTPAVDAIE FKVLDGTSRITVAVTGACANDKPVKSADGFSVTLRNCILPKKLQRHLDTG AFASVVQKVTPYQVKTKGRSDVKIQVQLRQPASYDVRRDGDLLQVSVRNP EGFEPPVADVPASPTLQDGMDQAAVRQKEPSRETDPLAGVAQSGGTKKAY TGRRVTLEFSDADIRKIFQLIAEVSNLNFLVGDDVTGTISLKLVNVPWDQ ALDVILENKGLGMQRDGNIVQIRPKSKIQTLADEEQALKRAKERGMELKT EVFDINFAAVGDIVSQFNAVKSERGTISQDARTNRVIVKDIEPALAEMRI LLKNLDLPEKQVLIEARIVEATSTFTRDLGVQWGIHSNDSGADIIRSVDA GFGGIVTPPPASGFPAATSSGGAVGISFAKMGSLQVDLRLSAAAVAGLVK IVSTPKVVTLNNKAAKISQGQSIPYQTTSAEGTKTEFVEAALTLEVTPHI TADGSVSMKIKASNNSAGTGSPPPINKKEATTELLVKNGETTVIGGIYVD SDTDEDRGVPFLMDIPVLGWLFKSNTKNKTKTELLIFITPKIVS >GSU1491 type IV pilus biogenesis protein PilB MQASRLGELLVRNNIITKEQLAKALDEQRTSGGQQRLGSILVKNGLVTEP DLTTFLSKQYGVPSINLSEFEADMAVVKIIPADVAQKYQIVPVNRAGSTL IIAMADPSNIFAIDDIKFMTGYNVEVVVASESSIKTAIDKYYDQSASLAD VMNDLEMDDLEVIGEDEDVDVSSLERATEDAPVVKLVNLILTDAIKKKAS DIHIEPYERTFRVRYRIDGVLYEVMKPPLKLKNAITSRIKIMADLDIAER RLPQDGRIKIKMGGGQDMDYRVSVLPTLFGEKVVLRLLDKSNLQLDMTKL GYEPTALSYFKEAIHKPFGMVLVTGPTGSGKTVSLYSALSELNKTTENIS TAEDPVEFNFAGINQVQMHEDIGLTFAAALRSFLRQDPDIIMIGEIRDFE TAEIAIKAALTGHLVLSTLHTNDAPATINRLLNMGVEPFLVASAVNLITA QRLARRVCSECKAVEEIPIQALIDAGVPPEEAPEYVCFRGTGCAKCNNTG YKGRVGFYQVMPMLEEIRELILNGANTAEIKRESMRLGIKTMRQSGLTKL KEGVTSFEEVLRVTVADD >GSU3190 twin-arginine translocation protein, TatA/E family MQYSLTTLWFFGGYCMFGFGMPELIVILIIVLVVFGAGRLPEIGGALGKS IRNFKKASEGKEEIEIKPQKKDEPKKDA >GSU2029 lipoprotein, putative MTRRNSIPILAVLVALTFVSAGCGKKEQAPSPPPPPQKASPAPKAQPPVQ GRATSAAIAPVAGLSQYDFANRRDPFKPFLQAKAPEKTRAVRGSSAGLLP IQSYNVEQFRISGIIVGLKESKALIVDPAGKGYVVKEGMSIGANNGVITK IAPSYLEVNERYTDDFGKVRKRTVKLSLAKKQ >GSU0950 outer membrane efflux protein MRVMIVTAALLLLSAQRLPATEPLTLREALVTAMENHPRTMAARESLRGA DARAGQARSGYYPRLDLVADWNRGRSYLTALEGFKETETYTAGLTLRQTL HDFGRTSGTVAAAEGERGAARESLAAVRQDVALRVREVYNVLLTAERRVE ALRETVRAREEVLGQARAFYSEGVRPRLDVVRAEADLYAARTLLTAAENE RDMARVDLAAAMGLDSLPDRPLAEPDGESPSLPDLAEAKRRATAGRVELK RYDALHAAAQGAATAARGGHLPVIEATAGAGYADRSFPPEGNTWGVGVRL TVPIFAGFATREKEREVAAALREVEARRRDQQLQVGREVEGAWLGMREAL ARVESTGRELEAARESRTLAIERYREGVGTIIEVTDAQARELEAETANIR ARGDTRIALARLDRAVGDDGGTEQ >GSU2618 preprotein translocase, YajC subunit MLGIAFAMAAPPGGAQAGGAMGAFQAILPLVFMFAIFYFLLIRPQQKKAK EHRALLDSLKRGDQVVTAGGMHGKVSGIDGDIVNLEIAPGVVIKITKGYV ASLKKD >GSU0828 metal ion efflux outer membrane protein family protein, putative MSGAVHLFLTAVILAPLGASGEVRTLTLPQAVEYALAHNGELKALRNEKD VARAGLERAVLLPNPTLELSADSGVMTGSPDETALSIGISQEFLTGGKRA KRRAVAEREAEAVHFQIADRERQLSLDVKSSFSELILAQKRRELAGRAVE LNGKLLEITRERLAAGDIPELEVNLARVEVARSEGRKIEAEREFAPLLAR LRTLLGVPAGEEIGFDGIPEQSPLSISLDDLIRLALENRPDLKAFQATSA KGEAAVELAEAERIPNVTLGLGFTHERSTDATGSGDEKTRDNLLGVTLSI PLPLFDRNQAGIREARAVRQGADNRLEFARSSVPREIEGDYARLAAAEKT LRLYADGILPQLEDNLKLVQEAYRLGEVGILAVIEEQKKYIEVNDGYLVA LAERQAALARLEASVGTDFQNRTNGGAQ >GSU3221 cytochrome c family protein MNIMKNRILTVVCAAALSAGALAGVALAKQTYTPGTGVSNSPHNINNVVT NGDEYGRVCAYCHTPHHAIVSGAIAEYNPLWSHQVNEETYTPYASRTFDG GSVDNMQSDPLVGPSRLCMSCHDGVIAVSQHYGTAPAAGNGSAAVGDNWN EISVGDLAFGEGLTNDHPIGFDYDAVASTDKGNGTLGSGIKAANTQFSVT LAGTGVNYAGTHDRKISDLMYNNGSKNIMTCASCHDVHNNENPEDYLLIN KQAGSQICLTCHKK >GSU1609 outer membrane efflux protein MKRMTTALLLVMAPLTVHAADGGVRLSLKEAIQSAVQKNLDVRAELYNPA MAEADIRKSLGIYDTQLTISTDFQYAVTEPVSSFLSGTNTSRTRTLTLNP GVNQLTPLGGTVGLTFNNAYNYTNSTRSLSEYWNSDLTLSLSQPLLKNFG KEPTELGIMVARTAKDGSLERFRTLLLDTVARVRTEYNRLYSLREDLEVK KTSLELARKILTDTQARVKAGVLPAMEILNAEFGVASREKDLIDAEKAVR DQNDVLHVLLQLPGKEEIIPVDIPTRDQYQAEEDALIRKALDLRPELREQ KASLRTSELETRVARNRTLPDLNLTASAAVTGLDRHYNRNLEKVGSADYP VWGVGLVFQYPLGNNAAENEYRRSKLKVEQGRTQIRSLEANVGNEVKTAI RGVDSGYKQLDVTDRGRAFAEERLRSFIKRSEVGLATIKDVLEVESELAT AKSNQIKALTGYGDAVTQLLRATGELLDREGITVTEKEGDSLYEQSARD >GSU0787 twin-arginine translocation protein, TatA/E family MFGFGMPEMIIILVIALVVVGPSKLPQLGQALGSSIKSFKKGMNEDEVKV INKTNEA >GSU1783 type IV pilus biogenesis protein PilB, putative MEKQSFKRKTIGQILVQQGSLNPDQIPYLVEKRNASTKRFGEVCVGDGLI TEENLARALAEQFGLDYVDARGVRLNESLLATLPPDAIYRYQFVPLEEED GTLVVLIADPTDVLKLDELELLLDRPFIVKLATETAIATILKKGEATSRV LKEVSEDFMLQLVKENEKGEEILSMEKISADTSPIIKLVNSTVMDALTRR ASDIHIETALEGVIIKYRIDGVLYRATEPLDIHFQAPIISRLKVMSELDI SERRIPQDGRFKVRLNDKAIDFRVSIMPSAFGEDAVIRILDKESIASDLK GLTLETLGMHPREMKRLRRKIREPYGMVLVTGPTGSGKTTTLYAALTEIH TGEEKIITIEDPVEYVLRGIVQIPVNEKKGLTFARGLRSILRHDPDKIMV GEIRDPETAQIAVQSALTGHLVFTTVHANNAFDVLGRFIHMGIDPYNFVS CLNCVMAQRLVRKACPHCKYPVEHSDSVLIESGLDPEECRDVTFYESRGC EECNGTGYRGRSAIIELLDLNDQMRELIMAKAPAAQLKAAARESGTVFLR ESAIEKVFAGETTLREINRVTFVE >GSU1251 BNR repeat domain protein MNVTTLWGGWFKALLAAAFILATLGAPAAGWSAVPHVAAGGNHVLTLRSD GTLWAAGSNQFGQLGDGTGINRTSPVQVPGTWKTVAAGTTHSLGIKADGS LWAWGSNLFGQLGQPLVNGQLATNKFSPIRIGTGNDWVAVSAGELTSFGL KGDGTLWSWGNNLFGELGDGTTVSRPQPVQVVADPLSNGRYVAVAAGGEH ALALQADGTLWAWGANLTGQLGTGGTGILPNPTPLKIGTDRDWTAISAGE MHSVALKADGSLWSWGQNLFGELGNGVALPGANVTTPTRVGTGSDWVAIA AGALHTVALHRNGTVSAWGNNAQGAVGDGTVANRSTPTLIVAPVTLVNNV AIAAGTGVSFSVLANGTVFGWGGNGAGQLGNGTFGAVTAPTVLSAGASAW LGVEPGGAFSLGLRSNGTLWAWGGNASGQLGDGTTTPSAIPVPVVGGAGN WRTTATGTAHSVAVRADGTLWAWGDNSSGQLGIGSTVAASTPQQITVTNP ASAGNDWTAVAGGGAHSLGLKADGTLWSWGDNTFGQLGDGTGTSNRTTPV QIVTGNPGNFDRNWVAVAAGGVFSLALQADGTIWTWGDNSLLQLGTDPAM LTPATQRNVPAQLAVASPPSTAFNSSWVAIAAGQDHGLSLQADGTLWAWG ANSVGQLGNGVTTATFTPVQVINAEGVPYVSLAAGTSHSLAGRSDGTLWA WGNNTSGQLGTGPHPGDADPLNPQPHTVPARILTSNPVSAADDWLVATAG GSHSAALKSNGTLWTWGQNTSGQLGNGTTTEADIPTALLEPRISVSPASL AFGPAPIGIPPSPTRTLTITNLGSAPLSAAIASNNAAFTVNPAACNNLPP AGFCEITVTFVPALPVGVKSATLTITSNDPVTPLVSVGATGTAANPFFIT ASVDPASPVGSGTISPAGVQAVAAGGGATYTITAASGYAIVDVIVNGVSR GPIGSYTFVNVQQDQSIIALFAASVTISASSGPGGIISPTGTVSLPLGGS QKFTITPNPGYAIAGVNVIEMVEQLDVNGNGTGIFNPVPRALGPVSSFTF FSVRANGSSISATFVPRELHEWSWQNPKPQGQAISRIATDGNGTYVAVGE FGTILTSTDSVNWTVRTFGSVNLNSVAYGGNHTFVAVGSYGRILLSTDDG ATWTVQSSGVTASLRGVAHSGTVFAAVGVTENNPNFPYEPRTAILTSPDG VTWTSRFLDQPFNGTLNLFDVAYGGGRFVAVGMEGHVVISEDNGGSWTAL PSDPVTRPTNLNGVAFGAGTFVAAGDFGQIVTSTDGSNWVNRPLMSVAEL KGVGFGTISDPLFAGNLLQVFTVVGTDGEILTSEDAGTTWTFRTSGLAAG GAGPMLQAAMVGLNRGVTPFVPTVIVAGSEGNLLASSDTIDWTNLFTTIT RYPLRSIAWGNGTFVAVGDGDPGTTLHPPISAPTVLTSTNTGLTWTRRYL SPGHNIRGIAYGNGVFVAVGASGYDLDQNASRQAIILTSTNGITWTPRNS GTFLNLNAVTFANGRFVAVSDYNATGDVADEGALILVSTNGITWNTIRKI TSTAGNLRSVIAGTNGTTPALVAVGDFGTVLRSIDNGQNWTEVTSGTINF AELNGIAFNNTSNTYAAIGITSEVFTSTDNGATWSMRSLPVDDTLLVRME GITYAYGSFVAVGNDNHILTSPDGASWTINIGAEYINSSLYGVAAGNGSY IAVGSDGTILQSNRLLDNPPIIGVDPTSLTFDITPEGTLSAGQVITISNL GINDLTIGALGFDGSDAGEFTVSEDFCSQAVIPATGSCTVVVTFAPTGAG VKSAELKIPSNDPDSNDIEVPVSGTAIPRYLLTVTNAGNGTGSIASSPAG ISITNSAAGANSSALFVVDTAVTLTATPAVGSIFEGWSGAGAGACSGTTT PCTLTMSQARNVTATFTRKFIITATAGANGSISPAGQVLVNPGANQAFTI TPAPGYSVFGVLVNGLSVGAVTSHTFTAVNADQTISATFTGVPPVRLTLQ SGNNVFSTIGQAVAATPVNTAATIDAQAVQTADNGLTLNRGITLTFRGGY GAGFSGIVGMTTVTGPVTVTNGSLTVSDMVIK >GSU1778 type II secretion system protein, putative MRTIANMAIAAVVLTFLAGCTAGLTAFNKAEKLEEEGKLDEAVMKFAEAA STNPQQTEYRMRLLKASEKAAFEHLKNGDTAYEQQLLDEALREYQSAVSL NPALARAKQRSDELIRVRNSLTYYREGLEFEKSNKPREALQAYRKALDLN PGNKEIKEALEKLLQTRRTKLEGFELNLKSTKPITLKFKEAKLKDVFYIL SQLSGINFIFDEGVKEQNVTVFLENATFNQALDLLTSMNKLGRKVLNEST ILIFPKTPEKAKQYEDLMVKTFYLNSLDAKKAVNLLRTMLQVKKIYVNEE LNALVIRDNPELIDVAGKILEANDVPDAEVLLEVEVIQVSKNNSELFGLG LSRYAVSLNARTPGSGGDFFSDTFDPKVTTTSNDTTTVTTTQVKNLLNLF NWNGYQGFLTVPSATYNFSKQLSNAEVLSNPKIRVKNREKSKFTVGTRVP ITTTSSTGSVGGVTVNVQYVDVGVKLNAEPVIQLNNEVTIKLGVEVSSII GEKTVGSGDAISSVVTIGTRNVDTVLNLKDGETSIIGGLIEDTKSKSKQK IWLLGDIPLIGSLLSSHNDRYDKSELVLAITPRIVRAVSVPEADVAAFWS GKEDDPTAVKPFSSFELEPDFAAQPAGAPAKGAPAVPSAKPGSAAPAGQP ATAADAPAAVVQPAATPAAASAAGVQASVAPVQPAMRGSLNIAAPAGVDL GGQFKVEVKVTDVKGLAKAPFTLLYDPIFIEYVGAAEGNFLNRDGKPTIF NALADKAAGRVVITMDRSAAGEGVDGSGTLLSATFKAKNKGPASLGLQNV KFVDQANRPLDIIPYNTVVEVK >GSU2911 hypothetical protein MLEPMSLARIAALSLTLLSALLFGLPAGAIAGVITSDTVWQGNVTVTEDV VVPEGVTLTVRAGTVVTVAAAESTKTDPEYLSPLTEITIRGRLAVEGTVT APVRFSGQEKRAGSWAGILIDGGTAAIRECLVADADSGISVAEGTLHLSS STLRENRYGLVAQGAASRITVEGARIIENDYGVLSLQGARVTAHATEVAR NRKKDAHTAAVRPHTLPKGPPVNEEAPVSRRYQDMVLLGETLWQGRILVD GTVRVPEGSRLVILPGTIVEFTRRDTNGDGIGENGIMIQGRLLAKGTPDR PITFRSAEKDRRMGDWDAVNIMNSSAGQNLVEHCRIEHAYRGLHFHFSTV AIHNTTLTNNYRGIQFQESVVALHGNTLCGNKSGVQGRDSEVELVDNLLC GNQVGGNFFRTTLTVRGNRIVANGREGLRLREGAVTVRENLFDGNRFGLM AADLYHGEVNRNSISGNAETGISLKNVDSVEFAGNAVTANGLSGFNIQDS GSLITGNLIADNGERGMGILSFAGIISGNNFAGNGLYAIDLDGSGDVSAP GNWWGGRDPALVINDQRDDPRLGRVDYGRPGAAPTPFVWPLATVAADTVW RGVITVAVPTTVLPGAALTVAPGTTVQFAAGTGLEIKGKLLASGQRDGTI RFTSVDRKGPSDWNEILLEYATGSVISNCIVEYATWGIHSHFTDLAVTDC LIRHNYGGMRFRSGPVRIRRSIFRDNTIGIRSYIGNALIAENLITANETG IFVREKGGGLTVTGNSLVGNSGYNMRIGDFNDQDVNARGNWWGEGDPGGT IFDGRQEPGIGMVLYEPWLDKPGPVGPANGGTP >GSU0412 flagellar assembly protein fliH, putative MSSSKASRIIKVDQSPNQAIRSYSFGFIAADAPQELPPEADGFVPFALGT PVPLPGLQSAEEPDPDPVVPFNLEGKVVLAEDELQARVDEVFRNGMDEGR RQAERGLANVFKSLRDGVAALTGLRSRVMKESEEDLLRLAVMIARKIVQR EVAQDPQVLAAIVAAAVGGCTERDRVVVRLNPDDYTQVSANRQAFLAGLG EESAITLAPDESIGPGGCLVETATGTVDARIEAQLDEIYRSLLEERSAPV EPSASPDTDSRADLAFGGEETIAPFKGQGAWVKGSEEKPRDDV >GSU1153 outer membrane protein, OMP85 family MKTTTIFTAALLACLSAPAHAQDLRGSEAGAVMKRESDVRDYYELQQRLK ESRESRPEEAVTDRTEPAPQPPADGGQRVFISHIVTDPSEILTEDDLRQV VAPLEGKEVSIRELLAMVDRINDLYRQKGYLTARAVLPPQKVERGTVRIR LVEGRVGRISVGGNRHTRDWFVTSRLHLREGDLVRLDTLENDLFRFNAIN DVKLRAEVKPGTATGTTDLILRTQEPDNYRVVAFADNGGGRYIGQERLGL TLQDLSLLGIRDPLTVGGTVADGTLSAHASYSLPLTPVGTRLGVTYDYTS IWITSGPFESLDVDGTSSDLGLTLSQPFALSPALSVTTFAGFNWKKSTTD FGGDTIFENRTRTLTLGGDLLAIDGYGTWFTRHVLTQGFHDFGGDRSFFK YNGDLVRTFILPDDFSALVRASGQVSGNHLLPSSEQFQLGGIATVRGFYE GLLIGDDGYFVSAELTLPLFPADASVYDVRLSPLLRGAIFFDHGGAFPYK GSGESIDHNDFLSSAGFGFILNLAKYLTGRIDFGFPVGERDPDPGTVRVH FSLSSEIL >GSU2143 hypothetical protein MNKGIIAVVAVVGALMASSAVYAGWGWGNGGWCMTGNSQNVSTQKMRSFQ KESFKARESLMDKQLELQDEYSKDVPDGRKIAALRKEIASLQDQLQATGD KYGVGNWGTGGGMNYRQSSGYGCGCGYCNW >GSU1781 hypothetical protein MDIRINLATRYFYNTRKVNTAIAAVILGLLLLLAYNIASLVANVSTERAL KKDMGILQARFDESAKGITEKQYRDLLKKIAEVNAVIGKKAFDWLLLLNR LEEVVPEGVALGAIDPSLKDGTLKLSGAARSFGALRSLMENLESSTHFTD VLLLNQGQLSVGEKQKGISFTVTCKVDFT >GSU1537 general secretion pathway protein-related protein MYRDHFGFTEQPFALTPNPDFLFLSTHHQEGFAHLLYGIDTHAGFIELTG EVGTGKTTLIRTFLNQLDPATHRTALIFNPTLSSLGLLQGINREFGLPCA SSERGELLEALNRFLLEESSAGRTVVLVIDEAQNLSAEVLEHIRLISNLE TERDKLIQIVLVGQPELKRLLALEELRQLNQRITVRYHLEPMGCDDTREY IRHRIRVAGGGREPVAFTLGAVKKIYRFSKGLPRLINAVCDRALLLAYTR DSREITASMAAEAIIDVRQEEGRRFPIPRKTLQLLTLAAAISIGVAVFNR ADKEPPPPAVAGEATPAPAPQPPPLTGDAVRKSLTATAAGDNLVTAANAL LGAWQAPAIDRAGRSDDIRTLAARRGFTATEIKGSLDDILRFDAPVLLQV ELPDGTSRFLTLTAANNGSFTVVPAVAGKDSLSRGEIEAFWEGRAWMFWK NFHGIPLRTRAGSRGKGVKPLQELLKGAGFYDEKPTGDFDAATEEGVRRF QQSEGLQPDGKAGEKTLALLYRRAGGFFPPGLTGAKGSTQ >GSU1154 surface repeat protein, putative MNGKRAKRTLGAFTGKRSTFRKLVSLSAALTLSLPPQAFPAQIVPDGRTG TSLTIRDNVTDVTTSTVRGANAYNSFQTFDVYRGNVVNLHVPDSAVNLLN LVHGQASTIDGILNAYKDGRIGGNVFFANPYGFLVGASGSVNVGALSVMT PTTSFMESFFLAPGVPSESAAAMMLNGTVPINPDGLISIRGQINALGTIR LSGGSVVNSGTITSTGTYGTAADTGSLFRAMVNADGLESGNNIVARNGGI EIVAAQSVENYGSIYSKGQSLHLQAGTELIVADGETISTRKIEGDPSDAA VHLLTDNSSGNSGNLTLEAPTITLGSGARLLTHASGDYLAGNIELLAGQN ITLNNGARLLAGHASDPAKGGDVLLKVSAINAIGASRTADAGIRAVNSVI RGRNVTLSSIADTSLIVHLLEQNPTLSLDEAQAYLNSELDDLVSDGPGGE YLAVTTSATAKTELYGTTIEGTGAVTIEAKAGARAGFKKNAVAEVIIDDL RDADQATVLAKSYIRGNKVSITSTADTSLTFNVLGSVLKLTDQSWLPDPV TGELQLLNDQLFDFSEIPLVSLSTATAHTTVGGATFISAGDTLTISSEGI SAAKPTFSSPLLFSAAWGESTVEAKTLVNGTTELSAANKATVKATTDVEI NVTADVNSTNKPVDAVFVHAKNTAVTTSLVGNDTTTTAGAVEVNAAATAD ISANALAKNAGGSGVGIAVAVNESTTTTTATLGGNVTADAGNVTVKATTD ITKNNTGADAATLGNPNTISARITDFQAGIKRNVTKGIIDATGLLKPETS ERITGFIFPGIKEGTFNLSGAVTYSKSVNTTTAAIAPDATVQAQGNIDVT ARIDDRPNASVGSKATSTGTAIGGAAVIADFTNNASASIGTGASVDATGS LLVDAQTVVPYPWQIDWDSPVTILNHLQDGILDLLLTSYAINSAGGKSGM GLAAAVSVFNLENNANAWIDEGARINTVFDKDAMTLPNQIVTVHAKNDIS TVNAVGLLSKKFLGTSGGKAAIGGSGNIIDIRENATATIRGDSVVKSEST IDVKAENVNHLVTVTEAGGSSDQVGVEGAVSINTITGGAVAAIDDDADVD AGGNISVEAKGTAKTISVAGGVVATKGQVGIGFAVSLNTIDTDASAYIGN YDPLQQDDVPALGQVSTDGSLTVKATSSNEIGAYSVTGALATNSTAQTEV PKDAQETKDGAGSVAGSSGSGKGTFGIAVSGDASVNDITSDTLAYISDGA TVSQAGNATLSATNTLAVNALAGAVTISTQQEGNGLAGSYSQNTLGGTTA AYLDDASLTISGDLDMDAQVNGEINTISASVQGTKGKVGVAGSVSVNEIT NTTQTYLTGSTVRGVEAVDLTARDDSIIKSIAGAVSYGGKAGIGLSFAWN SLDNLTQAYVDTSALTATGDITVSATTNNAIDTISAALGASTGDMAGEAA VSVNTLSNETHAWISGQNNGSGVESVGSISLAADDQSRIFAIAGGLAATS GKAAFGLSFAWSDVSNIVDAGIRTGADVESTSGNVEVAADSTTRVQAFAV GGSFASKVGIGGSVSVAEGTNSVTATIDGTSGVTADGNVLVTASDDVDIF SLAGNVAGAGSAAIAVANSTLVTHNLVEATLGAGATVSARGNGTAGRIYT GDKDASGNRTKEDVTGLAVSAASFENLQTIAAGGAGGGKVGIAGSATVTV LDEKTYATVGQGAHVNDADDGDAAQNVLIRASDRTGLLGVAGAVAFGGSG GVGAGADVGVITKDTEASIASSAQTPTTVKAKGNIVVTADSSEDITSVAA SLSAGGSAGIAGSASVYDLGLNTAATIGNSAVVRADGSVAVSAHDGTEMD MIAGNGAFGGTAGVGASAAVQVITKTVIAAIGEQADVTGRGTGDGVVVAD GGFAVSYGADSGDEGEIRAPTTNGSGSDSGALTGQRSAPPSTRTVNGVAV TATNQDDIESISATGSIAGTAAITLAGNVNAITTTTSATIGTGATVNQDT AAADAGQSVLVAAGNDYYHMGVAGSGSGAGAVGIGVGADVTVANLTTTAD IGEGALVSAAKDVEVSALAGEEVLSISASLGVAGTVGVSGSVSVLSIDNT TSAGTGTDSMVDAGGNVRIAARDDTETDMIAGTVAIGIGGAGVGGAVGVT SVSKETTATVGSNATVNARGNDTGSMTAYTGDDSDTTGQIRGLSVEAASS EDIFSVAAAGAGGFYAGVSGAVTVQTVDSATRASIGTNASVNEGVTDGHD EQDVNVSARNSARTNVITGALGVGALGAAGGVDVGAIRNDTSASIADGAQ IYANRDVEVNALAKTEIDSVVVSAAGGLGAIAGGVAVYSVGTGLEQEAKD QLKSDGGDFADVNSYADDQASDNSIGTLLTGSGDSRIRSIAADAQAKRSD VAVTDQLNNQTPRGTAAFIGGAAVEAGRHVDVSARRTVDADILAGAAAGG ALGLGAGIGIVNVSGSTQAYILGSGRANAAGNILVSANTDATGTVDAYVG TGGIVAVNAALAIYNDTASTSAYLGDGGVIDRADQVDINATGLHTVTAHT FGVSAGAAAGGLSLAKARVGGTIDASVGEDTRIGQDSRNTGDTVGSLSVS AQTITNGTARSEAAAGGILSGQGSIATADIGPTVTAEIGNRTAARVDTDV AAMATATVTGTATANGASIGALGVGVSEATARTMPVVRATIGDETVITAG QDVTVSGTATTTATTHATASAGALIGIAGTTSTATGLPQVNTAIGSGSTI EADRAVSVTATTTNSASSDASGWIGGLAAFGSNTATAVTAVPLYDAQGRF ISWFGSTTGALIGGNTAITTDSVNVAATSSNVALADSRAGAGGGIAGVTT RAETIQVNTTTAAIADSSDDNARKIHATNSIAITADALTTLNAFADSSTA GLVGVSGARTDNAATSVVEASVGTNNALEAGTDLAVLALNEIVKYSPQRP ANLTSGAGGVFGGAAGQSSTSLSTFTTATLAGNTISGSDQTISAGGDLTV AAENAVLATDMAQLSAGGLIAVADVRSAITSDNTATATIGANANVHAGND LNVLAKTNANVQTTTNTSTWGFAAGGDGTALNTVVADNDVVVGTNAALSA DNDIDLFAGQGLSDLQNSLISRADARSWVSGAIPVSDVTGWAYLYDFNDI LIDTGSNLKAGRDINLGAFSGLATVEGYAKAKKKSYLLFGIPITIYSNGS RRSWFFNHEGTDVSGPSVTVNGTLESGLNRHKTLVIGPDGTVVGGTLTSA DYEQTTINLRDKVLDKKAKLDAKIAEIDPSGTYPNLPEGDKILYDALKTE VQILEQKLAEWEGKSNAELTVPLIAVKDLMTGSGDINITTTTLKGTGTLK VPGTDFMIRIDNNSLAHLELNDLEIPKSASGNVNLNGKAITSHSYGGETL QIVAGQNLGRRIEIFNNAYLDDFPGALTPSDIVLKGDIINYGGRVSIRNN SGSVAVGGNIIADDLDMVMQGGFVREWQPGLYQPGHLIAGNNIYISGEIL DINDTIQSGIPYRYITIPEFDPETLGPDMVIPTIGDELSVAKWDPVNQRI VIYRVDFGGGKVELFGNIVSTTGTGALKVMDGYGEIVIENLSSRDVVINT LDVGPRVDGQIKIVDTGRKYTDNGIYVGDNNQLLTLITGNGDALNVSQGY QLWDAATRKFIYTELDSHKADSRSSSYAPHAGARVATFSDWTITEKDVAA AWDNWWTTRFTAGNFWLQFALMVDAGMKQAILDQFGKKADNPIGIEFLGN LDEGRISIFNSGAAGPSDIYLNGSIRNTVGNVSIRNDRGGIYSLNDTYLV TGRNIALTATEGSIGTLDQGIRTDTVGGSLRATAGGLINVEEVEGDLVID TVTTTGDVRLVSAGSLRDGSGTSPSITGTNISLTATAGGIGTGDNALVVN ADGILTAESLHSIYLTEKEGNAHINRIASREGDVVLTVDGGLEDYNFNEG LDDDTKDKLLTTWDDLKLTDDTKVQQSIDQYKEQKKSQYQAAHRLSDNGT PFDPSDDQYDATYDPSWQYTLTATEQSEFNEGVWTADELLNAKNLTTIPE LGKTEVLIEEANVSGRNVTIVTGAGVGSVLADEVISADAISNGTVTPDQR IMVARAEKDDITIDNGNLIVQLKNDVDVRASQSVTIQSRDHVYLGAETDV NIDRVDAGNGDIRLKIVGGIINGRTDDEANLIGRDLILEASAGGVGSAAR PLVTDLSLGGVLTARARDGIFIREAGGDISADSIISQNGAVELTVANGSA AIGQISAPGHVLLEVSGNIVNGRDDNGVNIIGDDLIIESSAGGAGTSANA LVTDLSGNGVLTAWVRDDLFLEERNGDLTIDTIASTNGAVELSVAAGSAI VGGITAPRRIRMTASANIVNGRDDGRENLITDDLSLEAAGGSVGSAEKFI VSRLRPAGILTGLSQDSFFLEEQGGGLTVDSVVSQTGSVHLTVPDGSVDA DHISAPGTVSIRANGPLLTVHRVDPTVLDVRNTFSGGTIVVGQADVAESV MARGDTVLLGEIHHTGSGTLHFDVDGGSKTMADMVRIGTDSNTAIDFDHL SSDTAVITADVDNLSLFDTRIGNRGDFSNSLYHVIVGNRDKRVRPCHLQL YATEPFSLTMTADKRFTTTAFAVNYDPHFVVNGFSTENSVVGTTEKMIWT GKRQNRLYYDPMEPGSRPWQRHMAPSGHDAVDIQPGAVGIDASDSLLEAD TVKVLTGNTGANR >GSU2869 preprotein translocase, SecE subunit, putative MWGPRVIAKTKEFLTEVKAELDKVTWPTRKETVSTTWVVVAIVLLISVYL GVCDVVLAKLMRIILG >GSU1330 metal ion efflux outer membrane protein family protein, putative MPESIGPSLRKLVTGLYFSQNILRCCGIFFTARRSATMRRTDFQPVTAWH GLCNADLRPYPEVPETEMITCILRTLLASAVCVLLLAGPARSGDAPAGED LNSLVSRALAVNPELKASEARWEMFRNRVAQAGALADPMLMFKLQNFLLR DPLDSRRDPMSQRVIGISQELPFWGKRALKTEIADREAEALRWQVEERKL ELARMVKETWYQLYLVDRELDIVERNIRVMDDFVTLAETRYSVGQGAQQD VFKGQVERSRMLDMQIALAQQRTSLQATLNTLLFRPAETPVGRVPDLEIR PISLSAAELRALAEENRPQFRSVRAQLEKGAAGHRLAGLESFPDVTLSLE YMQRDPSMDERGYDMYSVGLTFNLPVQRERRRAMARESVAETDMARAELN TLNNAIALGIADSLARLERSEKLAQLYRTGIIPQAEQSLESATIGYRVGK VDFLSLLDARVTVFNYERQYYEALAEHGMRRAQLEALVGRELE >GSU1066 hypothetical protein MSTDNTTLRKFALLAAAMAFAGIVASLALAAVSQIPLFLLNISQPNVMIL LDNSGSMDIIMQHSAFDPTARYSGGFDNDRTYYQTTSNGYHYLSTGNDYI RDDKKGNFTKNSVTIKLPLPYDDTRWDGNYLNWLFYHATSSQRSTVSTDA TLQKTRIQTARGVISNLVKTVSGVRFGLAKLNVDGYDRFDRKQTDGGSIV RNCGDLTSANVDTSVSGISAETWTPLGEALSEVWQYFKGGTSLYNTGVSY TSPITSSCQKSFTIVVTDGEPTYDGCYRGDFSSYGCDNAADADSHLADVA AHMNGSDATSAYGGTQSVTTYTIGMTIDSSLLRTTAENGGGSYYTTTSGM DLATALQNAVNEILGRQSSASAVAVSTAYLTSNTTLYRARFDSTDWSGYL EAYGINKANGAVTGYPNSPKWEAGALLNANSARTVYTAGVQSGVYRRVDF TSTNAATLAPAGFMNFSSASTASMIGYVRGDVEPAGYRHRASKLGDMVQS APVILGPPDGYYSDNNYATFKRNNATRQSLILAGANDGMLHAFNADTGAE EWAFIPNILLPKLKLLRATPYTHTNYVNGAITVGDAFITAKGLDGKSETS SSWRTIAVCGLREGGKGYFALDVTDAANPIPLWEITNTSPSETSGTVVGL GYSFGTPLIVKLKDSSQSGGFRWVALLANGYEGTTSGRAATLIVADLATG AVIREIVADASTFSGVSPNGLATPAAIDRDADGFVDYVYAGDLTGHLWKF DLSSSNSNNWDVVWKRSGTPVALCRAKTAAGSVQPITTAPDVVLRGGYQI VFFGTGKYYESTDISSTQPQTFYGAYDYNSTTTPTSAQATNGALLTRADL TAQTVTRIDESGTSWRTSSNNPIGLTKGWYLDLPVAGERVITDPVARSRK IIFTTFIPNTDACSFGGISWLMELNMDTGGEVVRPVFDVNLDGKVDYSDT VLGDLKVKPTGTLLGDGLASTPAIVGAGDEHEYKYITKTTGEIIKLLEGG GHSQIGLRSWRQLK >GSU0279 cadherin domain/calx-beta domain protein MKKTSDDQVRAAKGKSIPLSSSGKDFIEGSLDFRPVSPKKQTAVRKPGEE QEEKAAVAAEQTESDHMTADEAADVSYDHVAAITGEQSFADTLSVADQTK TGKEEKCGDNNDDDDCDDKGGWLWWAGGAVGVAGGSIGFAVAALNGDDDE GTHVDTAAVAFAGRVTDGPVHGATIYNDINKNGVYDDGIDKAMTHNSEAI TSDADGNFTITVGDLIENGITDINKLKLVAYGGIDTVTGEAVTVDFTAPE GYRYLNPVTSLIAAYMEAYNEANPNAKITAAEAEEAVIQALGLPQIDYAT TDLALPETAVEAQKVAAILAVAAMLIEESGTDSDGFAFLAAHLAPSETPL PGTMTYLTDEVTTALQSANDTTAANQFSSTVVAVNDATSLDDINTALNNT IFADLIVAGNVKVGQTLDGGLGMGSTTGLEVTYQWLESIDGTTWTPIAGE TGSDYTIRPTDILHHLRLQATYIGTDGQPRTIFYDVGVVPDSSPVFASST SGAVAENEAVGTVVYRAEATSDLENNPLSYSLGGTDADLFTIDVATGEVT LKNPADYESKSSYSIDITATDTYGLTSTTSVTIGIDNLDEVAPSITSGPT AATIAENSGPGQVVYTAAADDSADISGGVTFSLKADGDAALFTIDAATGE VTLTGNPDYEAKPAYSFTVVATDAAGHSTEQTVTLAIDNLDEVAPSITSG PTAATIAENSGPGQVVYTAAADDSADISGGVTFSLKADEDAALFTIDAAT GKVTLTGNPDYEAKPAYSFTVVATDAAGHSTEQTVTLAIDNLDEVAPSIT SGPTAAAIEENSGPGQVVYTAAADDSADISGGVTFSLKADGDAALFTIDA ATGEVTLTGNPDYEAKPAYSFTVVATDAAGHSTEQTVTLAIDNLDEVAPS ITSGPTADIAENTGAGQVIYTAVADDAADISGGVTWSLKAGSDAALTIDA VTGAVTLADNPDHEAKSGYSFTVVATDAAGHSTEQAVELSVLDNNASAAI AVDLTTIAEGSEGTSTILTYTVTRTSALNASSVDWAISGVDAADLAAGQA AVGTVTFAIGETSKTFTVEVVGDRTIESNEDLVVTLGNPGNDIDLGTADS SATTIADDDGEVSIAATAVSVPEGDTGDSRVVTFTVTRTNTLSASSVDWD VAGGTVNAADFGGTLPSGTVTFAEGEATKTISITVTGDRIIEPDETLTVR LSNPGLNLVLGVDEASSTIVNDDVGFSIFGDVMDVVEGGIGEQRAITFHV VRSDSLTLPMTIDYRLIPRGSTVPDGFDFTGSPDSLGDNAGRPSGTISFG PDETSKTVTIYVAGDAVPELNETFSIVLANAPPSTIIINGEIEGVIRSDE TQYSIHAVTAATVEGNGTGGIQQFLITRTGDTSQPGSVGYTLSEYGENPT EANDFAAGTPLTGTISFAAGETSKILSVNLEGDSVLEGYESFQVALHTLD SNSIIGTNTAVASIIPDDAAINIAATDSIVKEGTGAVSRSHTFTLTRSSH VDSEVTVDWHLAGTGANPVDAADFGGTLPSGSVTFAPGETVKTLTITPST DAAYEPHESYEIVLSTSQLGVVLETDHASGMILNDDSGLTLVATNLDKAE GNPGTPSQLTFTVQRTGDTTGESTVHWELVSADGSGVSAADFASGILPSG DLTFSRGVTSRVVTIPLTTDNIIEPDKGFTIRLSSPSEGTELLVSEVGGY IRNDDAAFTLESVSPVAEGHNGTTTVTFTVVRTGDISGADTVEYVVAPAD GGAVVDGADFVGGQLPDGLITFNAGEASKTVTLAVAGDNALESDEAFTIT LVNPGVGSTIASGSTDVVILNDDDALSIVATDADQAEAAAGGTRDFTFTV NRTGFLDRATTVNWSVAGVGANQVDAADFGGALPSGTLEFAANESSKTIT ITVNGDYFQEADEGFRVTLSSPSDGTTLTTASADGVIRNDDTGLAITATT TTLAEGDSGTVTHVFTVTRTGVTTGTTTVDWALAGSGGHPVDAADFGGTL PSGTLVFAPGETTKTIEVQASGDTDIEPGEGFTITLSGADGNADIMTASA NGTVVADDISIAISAGTASVMEGATGSSRVLQFTVTRTGDLASPVSIDWS ASGMDAADFANGTALSGTINFGAGETVKTINLTQIGDNVSESDETLTITL SNPAGNPAHDRTYITSATATTDVVNDDASLTITADAASQNEHNTGDGEAT SFTFTVTRTGDTSTETTIDWVLQLPGGAGSAAGNDFVAGQDLLGTNSGLP SGTISFAADETSKTITVLVATDNQVEQDETFSIQLQGAGANTEVSGNSAS AVISNDDTGFSIIALAADHTEANGGTVTYTFRVTRAGDISSAATVDWDVA GSGASPANADDFGGSLPGGTLSFAENEASKEISFTVSGDTVVEQDEEFTV TISNAQLTDATPQLIQDATVGGIIRNDDQSFSVSAANASVTEGSAGTTQI AYTITRTGDLSDSVTIDYAVTGAGGAATSDVQGGVLPTGTLTFAAGETSK SVTFDVIADTLAEGNETFTLTLTNPSAGIIGTASDSTVVVNDDTNFALSA PAPFAEGESGSATATFTVTRSGDSTGAGSVQWSVAPATGLTTADFTGNQD LLGTNSGLPSGTITFAAGETSKNITIQVAGDLTLENDETLRVILADPTGG TIEGTDGDKSTTILTDDDSFSISTLTASRAEGNSDSTITYTVTRTGSLVG ARDLTWTITGADGFATGNDLAGGQAATGTVSFADGQESATIVVNVKGDSA VESDETMTVTLSGAPANSVIGTASASTVLTNDDASVSIVTLIADKNEGNV TIVTPTGEVPGSTASTFTVSAAGTVSGTVESAGDRDWYKVNLVAGHQYQI DLIGNGSYTAGDVFLSLRNSTGIQLASNDDFIGVNSRITYTAPSNGIYFI DAGHLGSGTGTYGVTIADLTVPGTDMSAPAYGVGAQPYTFTITRTGDTTH GSTVEWRVAQGVGVDAVDFGSVGSQDLLGDNSGLPSGTVTFAAGEISKTL TVNIATDSAKETDEILRVVLSNPSAGTEVITASADGIVRNDDAELNITAG TFNLLEGDGLHGTGKAMTYTVTRTGNINQTSTVDWSVVHGTTSSADFTNG VGSNLTPSGTLTFASGVATQTIVVYVYGDTGVGSVEGDETFSIQLSNPNS GSALGNITSYTSTILEDDTRLVLSAADYSQAEKTAGNNTTYTYNIAREGY TGGTTNYSWAVGYTDPYTGNPAYMYDNTQSRYETVTANASDFTGSLSGSG SFSAGQTNASFTVTVTGDDTPEDDEWFAVNLTASSGYDEVTVIYDDPTKG TGTQLARTYYSPYRTYYDGQQVSSATNGVASNTNYLFSSIERDEAVYYLS DREVASTSVQTLNPGDGLRTRVEGDTPADGGAGATTVTIEGVEYGYVEHI FAVQRQVATAGTASVGWRIGTYYNAAVSADDFLTITRDGNGDITAITTAG ALPSGTVTFADGQEWAYIKFYTKVDDIGEYDEYFSIFLENPSAGSSIYTY DTVSYPQYNYGIITNDDTRFDASVNDVVEGGTLTYTVTRSGDSRGTDTVD WSLALPGSEATNESNNSTGTWYKLDPSDIDSVTPSNGTATYNAGTLTWSG TLTFEDGETTKTITVVTTDDSWTETWREELPIVLSNATNVNAGEGNHDQE TASTGYTDTARVYDNESDPLIGVSVGSSTTWEGTGANDSATGNSVTFTIT RTDQGGRDGSLNYPTTVAWRLDGSGINWGSANNSAEILTYGGDAASVNEY TSNTTYGVVTFAAGETSKNVVVTFTGDRYVESDKTLTFTVLDPDDAEHGP LYTDFYGPADINNAQASVTTTLKNDDIRLWVGGWDTYSGDANGYYTNVQT SAYEGNPLTFAVNRYGRLDCDIVVNYTLINGTTTNGDFTTTSGSFTLAAQ GSAYGEYTYSISLADLLTDDTTVEANETFTLRLSAPGDSAGSSVRFQSYY ADYTSSYNSPATTLDVRGTVYDDDTTYTLTPASTSLVETDQGASQTFSFD VTRGGTGYTGAAQLRWRVEAVGGTPADSADFTSTDLLGTNNGLPSGTVSF ANGELTKTFSVLIRGDLVAENNETFRVVLYEDVLTSSSPTITNSQSVASS TLTIVTDDTGISIADSTLTESDANQTMTFTITRSGDTSGTSSMNWTLYHG TTTAGDFSGATTGTVSFAAGETSKTISVTVAGDATPEADETFTILLSNLV GVDEAIDISATGTIKNDDSSFAIAGDAASSPESGSQTFTITRTNDTAQSQ TITWSVSAGSAGAADFGGSLPSGSVTFAPGEMSKTITISPSSDATPETDE SYTVSIALGAGTTGDTITQATATGTIENDDAAIYIAADQTNQQEGHSGTT PFTFTVTRTGNTTGAASVDWALSSAGASAADFTTADGLGSNGGLPSGTIT FADGESAKTITIEIVGDEVVEADESFTITLSNAAGGAIITGSAGSTIAND DSTIAIAADSAVKNEGNSGTTAYTFTITRTGYLGEAETVEYSVAGSGAHP ADGTDFNGTTGTLTLAAGEATTTLTINVSGDLSGEPDEDFTVTLSNPSSG VTITTDTATGSILADDIVFDVAAPASQTEGNPGDTTYFDFVVTRSGNLSG SQTLTWSVAGIGADGTSGSDFDSTTGTVTFDPGETSKTISVPVKGDYLGE ADENFRLTLTGPDGVVFTHNSADATIIDDEASLRISATDAGRAEGANGVT SYTFTVTRTGNTALEATVDWSLAAGATDPDDFAGGTLPSGSLSFAAGELS KTITVDVAGDTAIEGDESFTVSLSNASTGADIVIGSATGTIVSDDVEWTV SPLSVPAVEGDGASSYVFRVTRTGSLSATTLDWSTAGSGTNPADADDFLG SFFPSGTLVFAQGQTSQDIVVQIAGDNLLEADKEFSVTLAAPVNGLTHSY AEQTASATIVNDDDVISIAPLSADHAEGTDSSSPFTFTVTRTGSLTGTST VGWRIVHGDTSADDFVATTGTVSFADGQDTATLTVLVSGDRNLEGDEGFS VELYNPGAGSTVDDTATTASGIIHDDDVDLSLAAADANVAEGDSSTAGHA TFTVTRSGDLSVETSVNWNVVAGTATAADFAGGELPGGTVVFGAGESSKT ITIDLAGDGAWEGNETYTVQLSGASDHADIVANNVSGQIIDDDDTLTLSA VSADHAEGNSGATIYTFRIDRAGTATGATSVEWIAAGSGAHPTDQDDLLA TTGTVTFADGETSKTFTVEVAGDTTGEYDETFSVSLANPAYGSTTVGAPV TATVRNDDAVLFVRADHVSVAEGADGVETTFTFTVTRSGDTSGAASALWE VTGSGLRPANAADFGGIFPSGAVAFQPGESTQQISLTVLGDAVGEYDETF SLVLSDPEGATILEGTAETIIANDDTGISITALDADKAEGNNGLTDFTFR IERVGLANGAASVSWAVAGTGSYPAGADDFAGGILPSGTVYFADGESVKD ITIQVAGDETYGQDQTFRVLLSNPAGANLINADATGVIRNDDSQVAITAL DTAKLEGNAGTTTFSFQVTRTGALDTSATIDWDVIGSGGHQTVAGDFAGN SFPGGALTFAVGESSKTITVEVAGDTLTEVDEEFSVRLRNPGSGVSIAPN AGEASATILSDDDGVVLIGLDVDRHEGASGTQTVYTYQVLRSGNIDAPIT LNYAVSGDVDSADFMSPLTGSFEMGAGENSRLLTLTVNGDDIVEPDEFFQ VTLSGSGINIDSTPVTGAIRGDDVAGDGNDVIHAAATADTINSGAGDDVI HLTMDNLLHLQVTDGAHVDGGLGFDTILFDAAGQEFDLVALVANDAMSGI EKIDLGGEGNTLRLTTAELLHQDQNLFSILANGSEPFHQLMVDGDADDEV IIADITNWSHGAADTYTDGSVTYDVYTNGTDHTQLLINQAITNVHGVAG >GSU3466 membrane protein, putative MEKRALIAVVLSILFFYGYTALFSPPPKETPKPVATATQSQPAQQVTAAP VPVAVPAQPQPAVAARDVSVDTPAYSVTFSTQGGSIKRLDLKRYHETAGP GGKNVTLVSEDNPSNYTIGLRAPGFGLDQNAVFVPSADALTVGPGEKKQL SFTWVSPAGVTVTKTYNFSGDGYGLEIQYQVTNSGSARVSSPVQTVQTYP LVPKVKESRFETFGPATFAQDKLFEDKVKDLESGAKTHAAPLWSGFADKY FLSAVLAHEGSMAAATIRKTASGYLENTISSPELSLNPGEGRALTYRLFF GPKDIDVLKAQGNSLERAINLGWFAMLAKPLLHSLKFFHNYTGNYGIAII IITVIIKVIFYPLTHSSYKSMKEMQKLQPKMQQLREKYKNDREAMNRAMM ELYQTHKVNPVGGCLPMLVQIPVFFALYKALMFSIELRHAPFMLWITDLA AKDPYYVTPIIMGVTMVIQQKMTPSQMDPVQQKMMMALPVVFTFMFLNFP SGLVLYWLVNNVLTIIQQYYINRSISTAEAK >GSU1784 type IV pilus biogenesis protein PilC, putative MALYHCKLGSSEGRIITRELEAANPEMLRTSLEEQGFFVFEIKKKPLQFL WDKGGGRRKVDNKALLTLNQELLVLIKAGLPIIQALDTVLERVERGTLFD VLAVVREDVKGGMALSDALEKHTKVFPHLYVASVRAGERTGDLQLTIRRY IAFLKRVEEVRKRFISALVYPAILVTVATLAITFLLVYVVPTFSQVYADA GSQLPLPTRILIAFSTSLKQLFPLIIAAVIGAVFFFRRWAATESGRYRVD DIKIRIPFIGDVFSKFAVSSFTRTLATVIGSGIPIVESLKMSVGTLNNRV LERRMLEAVVKIEEGMSLSGAIESARIMPPLALRMLGVGESTGSLEEMLS DIAEYFEGEIDARLHLLTTAIEPAIMIVMGLVVGVIIVTMYLPVFKIAGT VG >GSU0278 outer membrane efflux protein MGSVRTRVLGAVIGALCLAATAWGQEAPMTGDGLDVALNFAAIGHPSIKS QVAELKALGSDLKSAEYKRYPALTIQAQTMTNNQNQVVAVVQQPLWVGGR IDGGIEQADVGLRIGRAALLDVRRRVMEETSAAYATLRGALQRLKAAELN VGEHEKLKGLVSRRVEGGVASNADILLAESRLSQAIAQRIQLKGAVSRAR SDLLALTQQPVAGNEPVPAHLLELPEPERMAEMIVKTSATVEQRILKVED ARITSRLAVASMMPALYAKVEQNVYVAERYGETPHETRFGVVLQGSVEGL GLSGWKKVKASDSRIDAARKEVAAAENDVRRQAEALVTDILSLRQVVRSY ELLVTSTEETLASFLRQYDAGRKSWVDVLNAQREFSEARISLEQARSSLE EASLRLAARLGQLDSFTGGNE >GSU0391 Outer membrane efflux family protein MLVLSPPVLLAADRSVTLQEALQSALERNHLVNGARFEREAAERGAAASR SRYFPHIFLEEGFAASDAPDRVFMMKLDQGRFTLDDFQLENLNTPSSYRD FRTAVTLEQPLFDLGIGYGREMAEKEAERAGFSLAQRREDVGLAVYAAYL DVQKGRVTLAATEKEVAEARESLRVAQARSREGTALRSDELRARTFLSES EQRTITALNDLRLARMRLALATGEDAGGSLDIAEELISSPVRLSEDELVR EALVNRSDLKGGEKDVERAEAAAGAARSAWFPTVYAGASYQMNNRDVPFG RDNDAWMAGVNLRWELFDGLRRSHDQAKAGAVRQAALQYLEQQRKEVVLR VREAALRREEAGKRLEVARHALLAADEGMRIVAKRYENGLATMVELLDAQ SALNRSRTGLVERESEYLLATARVYHAAGLFLKETVK >GSU2773 conserved domain protein MAGKSLKKPTTIVTPGVKEDVSEKADEKTYDLRDPSQLVALFTENENEDF IKISLSKYIESLIQSHNMASYNIVFLYDETRSISRYHSNQIYEAASSATD KKDILLILHSGGGQIEPAYLISKTCKSLTKKKFVVGIPRRAKSAATLISL GADEIHMGLMSELGPIDPQIGGYPALGLSNALNTLAGLACRFPDSADMFA KYLTDNLNLKDLGYFERVSESASQYAERLLAGKKFPAPSTAQSLANHFVN HYKDHGFVIDSDEATNLLGSNIIKQNTAEYLFANELYKFLDFASFLFSYF KKKEFYYVGDIRSGLSTRDKRNT >GSU0329 general secretion pathway protein D, putative MKKRFLNLLTAAVLACALLAPLPASAKGVVLNFNDVDIATMVKFISDLTG KNFVLDERVKGKISIYSPSKLTPDEAFSLFTSVLELKGFTLVQAGKVYKV VPTAAAKQSGMRLLSDKDRLPVSDAYVARIIPLERISAQEAVAFLQPIVS KDGYIAAFGASNMLMVVDSALNIQKLTGILTLVDSPQKREGAEIIFLKNA SADSVSGVIREWLGGKTSRPAGQAGQATATSSAGVLIVPDTRLNALVIFG SDQDKDDIKKLIAMVDVVPPTTSSKINVYYLENADATDVAKVIDGLIKGT PTTPGQPGVPAAAPVQSPFEGGKISVTPDKATNSLVIMASPVDYQNILQV IQKLDKRRRQVFVQALIAEVSLDKLKDVGVQIGALAVGTQGDASGGAVLD PFNFLSATSGPQFLLVKALEELGKNVSVSAQVKALVSDGAINVLSTPNIL TSDNKEAEIFVGENVPFLSQTNLTTGGISQQSIERKDTGITLRITPQISE GEYVKLDIYQEISAVKENKGQANDLVTTKRSAKTAVVVKDKDTVVIGGLI QDRDTETINKIPLLGDIPLLGWLFKTKSTRREKTNLMIVLTPRIIRGAEE MNDVSGQQRDKFGEALSLDAPFDLKRDLQLSK >GSU1782 conserved domain protein MLFAQNAIGMEVSQHGVRFVVLGGGKGAPRLVTHGGASFAPGIVRILHRE PNVVDPKAFVGTVREEYCKLLVRTDLVSVTLPDAVGRVMIMDFDTRFKNR EEGRDMIRWKLKKSLPFDAGDMHLDYQTLRERDNGSLSVLIAVVARQVVT QYEDLLLEAGIQPNRIDFNTFNLYRVAAKRIPSEDTSLFVAFHGGVLSML ALTDGLIDFYRVKEMGREVVDPNRIFMEINSSLLVYRDKNPGREVKSVFC LAPPDGGESFAGIVAEASGIDPVMFNPQAVMGNNGTATDPALLHDLAAAI GAATRNL >GSU0027 TolR protein MEVGSRNSGDRGTMSQINVTPLVDVMLVLLIIFMVTAPMMQQGVQVNLPK AETKAMTAQDEAVVVSIDRAGKVFVNSTEVASGDLTAKLTAMVANRAKKE VFLKADRDVPYGQVVKTMAEIKGAGIERLGMVTEPAPGK >GSU1486 MttB family protein MSEDNQKVLPFLEHLVELRKRLIIIIVAVVVGMGFAWNLSNGLLHFVTKP ITGETYLTDIKKQVYQEVGKRFPAAYKQFELEKAMNAAPKERKLNYSAPL EPFFVQCKISVIAGFILALPVVFYQLWLFIAPGLTRKEKRMVLPFVTVST VSFCVGALFFLVVIWPVIINFSLSYEAEGLNSLFNMSAYINFCLRLILMF GLIFELPILMLLLSRFGIVTYQFLARNRRYALLASSIVAAFHADLITMFV IMVPLYMMYEISVWLVLLFGKKKPVEAVAGEGPTGAAS >GSU2609 type IV pilus assembly protein, putative MSQKKVGEILIEHRLISEDQLREALELQKVFPDQPVGQLLCKLGFLSESE LSYILEQTGKRQKLGDILIRERLVDEERLNQARVAAKRDGSTLERALRKL RLVEEEPLAKTIATQYDLSFVHINTLEIEPDLARCINPNYAQRQRIVPIS RIGNTITLAMAYPIKLHELKELEQSIKSRIIPVIAMESEIIQAQQRLYKT AASAAHALTLDEADLEIAPGSIVDILSSGAGEDEPDIDDEVRTITERDSV IVKLVNKIIFDAHQNRASDIHIEPYPGKNDVIVRMRVDGSCKVYQRIPFK YKYAIPSRLKIMAELDIAEKRKPQDGKINFKKFGPLDLELRIATMPTAGG LEDVVIRLLNTGQAYSFDSLSLTDRNMRIFGESITKPYGLVLVVGPTGSG KTTTLHAAIARINRPEVKIWTAEDPVEITQKGLRQVQVNQRIGLTFAAAL RSFLRLDPDVIMVGEMRDEETASIAVEASLTGHLVLSTLHTNSAPETVTR LLEMGLDPFSFSDSLLCVVAQRLARRLCEDCRELYRPDRKELSEIIEEYG EEQFAATGLLGNEVVLARPVGCTTCNQSGYRGRLGIHEVLEGTDTMKSLV KKKSDTEIIRRQAMADGMTTLRQDGILKVFQGLTDIHEVRKVCLK >GSU1618 hypothetical protein MAVKFFGQFLVEKEVVTREVLLQAIELQESVNLSFGATAMAMGLLTEADI EKVHNAQRCEDLRFGDMAVKLELLTADQMQQVLTRQKNGHLYIGEALVKV GGLSADDLPRYLDEFKADQAQYATDTVSIPAGLANPNIWEMMVDLSHKML TRVALLTFRPEPCFMANRLPRKDVYAAMDFSGDVSGCYLMGVSTDAQARI ARAILKEANVDEEPKEVLDDTVMEFINVVCGNIAAKAAQLGKSIEIAPPR IVEASAGIVPPPDHLCLCFPACLAEGDHVELAVFIKE >GSU0781 twin-arginine translocation protein, TatA/E family MFGFGMPEMIIILVIALVVFGPGKLPQLGQSLGASIRNFKKASLEEPEKI TVKEKDEHA >GSU1646 lipoprotein, putative MTTTRSLPFLLIVLALAGCGDFEWFPDTETNQGSLGTVADAERNAPVVSK AFTITGGSAAISIANGEYSIDGAPYTSASGNVQTGQSVTVRHITSNAYSS SMTTVLTIANESIPFTSVTMSKPPVFVNSSTATDFTFAPKLDVEPNTVIV SDSRTITGNTSPAPISITDGEYSTNGTDFTSSAGTINAGQSLHVRHTSPT TYQTIKTTRLTVGGVRTNFSSMTKAAPFVNYSTVSPVNADPTNVISVAQP LAATKDFTTSTHVSFSIRYFLNNSSNEQKNIGLTIAGADAQNRRIYYGTI DAAVPANANPYTSTHSFGAALTIAQYNSITQWLVTKIIIYQ >GSU3317 hypothetical protein MSRPLTRTVVALALLAASSAAHADGGPAAGGLAEGSVTYSAIAPNSITNS KIVDGAVTDAKLGFGAVTTPKIADGAVTDGKIRGPIAGWKIGPHGHDATD IVQGTIDAARLPVGTGPGTVAAGDHTHEFLPKKPATLIVVAPTGGDYVSP IDALNAITDASAEKPYLVKIMPGVYDLGIATMVMKEYVDVEGSGELATRL RGSAADAGVVACASRAELRGLSIEAAGAEGNIVGIFNGSSAPRIRNVSVT VQGGKGTFGIYNLMAEPLLDSVTVTAHGGDAGFGIFNIHSSPVIRNATIS AGNGVYTTSSGSATVEGSIITATLFSIFNDTATTTRVANTRLAGGRIVNS GIMKCAGVYDAEFDPVQCR >GSU0435 MSHA biogenesis protein MshE, putative MESIVKEGSLGSILFKCQIISEDDIRRALDEQERTGGRFGEALVSLGIVT QEDIDWALSNQLNIPYVRLKPAMVDRDAVALVPAVMARQHNLIPLIRAGE ELSIAIADPLNVAAVAAVEKETGCAVSVSVALIREIREMQERFYGPPDTE ERLGFTSSAFPPQALAAMNHDLTGGKFIDYLLLFVAQQKLSSLSLHPLGD RVSVIGRRGGTTREVGQLAPSRYPDVVMHVKKLAHIDGARFSARGGLSFA LKGRSIPFQVATLRGEGGDHLTFRMTVAALFPTSLADLGLTDDQVRQFAD LAAAGRGMVVTGARDREIRRRLTDLYLQEHEAEGKTVLVVGSGAGTGEQR FPRIPVPSDADLSAVVSACLEHDPDILVLEDVTDGQAFAAACRATLRGKL VVAGIGCGDAVGALDQLIAFRDMHVLVPAYLRGVITCTPIRPLCPACRRS EPFPAAERAALGIGADVTSCWRSAGCESCDQTGHDGRRYLLDVLVLDHDL RERFEAARNGAEVIEHLRGQGWRGITDERQTLLAEGTISLEEYASSLHG >GSU1982 general secretion pathway protein-related protein MKARIMYEAYFNLTTKPFELLPNPDFIFPSKSHKRALMYLDYGISERAGF ILLTGDIGTGKTTLIRNMIQLKDERTIISRIFNTRVEPEQLLAMICHDFG IPAEGRGKVALLNELNDYLIEQFALGNRPVLIIDEAQNLSADLLEDIRLL SNLETSDTKLLQIVLVGQPELREVLALPQLLQLRQRISINCHISPLSREE TEGYIFHRLDRAGNRNAVAFSSDALDIVYRYSRGIPRLVNIICDFILLSA FAEQTTEIPGEMVRDIIGDLDFENHYWGGAEVAASERAPEAARLAPGRSD EATELVATLRAVIDRLESLEGDFARMSRGVLDEMSEKVASLENAFRFHVD ETDSHISEIRRRIEKVQNLEVGTTGECQPQDSPKGLLKRMFGA >GSU1792 clpP, ATP-dependent Clp protease, proteolytic subunit ClpP MLVPIVVEQTGRGERSYDIYSRLLKDRIIFLGGPVDDHVANLVIAQMLFL EAEDPDKDIHLYINSPGGVVTSGMAIYDTMQYIKAPVSTICVGQAASMGA LLLSGGEKGKRFSLKHSRIMIHQPLGGFQGQATDIHIHAQEILKLKKRLN EILAENTGQQLAKVEADTERDYFMSGAEAKDYGIIDNIIERNTPSGGTR >GSU2550 drpA, DNA processing protein DprA MYHWFALRSVPLVGNVLYRRLLDRFGSPEAVFRASDAELASVRGVSSAVA ASIRGHDPRAFAESECERVRRAGVRIVTIRDPDYPPLLLQIADPPPYLYV KGDPTGLEPAVAVVGSRRATVYGRTVTARLAEDLARRGVAVVSGMARGID TAAHQGALAGEGRTVGVLGCGIDVVYPPENRALFARVADRGALVSEFPLG MGPLAENFPRRNRIISGMCRGVLVVEAAERSGSLITAQMALDQGRDVFAI PGNITSSGSSGTNRLIREGAKLVAGVEDILEELVPRAAAAVAVAPPLPPL PAGEAALMAMFGADPLHIDEIIAKSALTVGEVSAMLLRLELKGVVTQLPG KFFHAN >GSU0642 ffH, signal recognition particle protein MLENLSDKLDVLFKKLRGQGVMSEENIKEALREVRLVLLEADVNFKVVKD FVERVRVRAVGTQVLQSLTPGQQVIKIVQEELVALMGGGEDNSLDLAAKP PVPIMMVGLQGAGKTTSCGKLARLLKGQRRRPLLVPADVYRPAAIEQLKT LGRQLSVEVFDSRADQDPVDICREALRYATLNGFDVVILDTAGRHQIDEY LMNELVRIKEAAEPREILFVADAMTGQEAVNVASGFNDRLDITGVVLTKL DGDAKGGAALSIRAVTGKPVKLVGVGEKLDALEVFHADRLVSRILGMGDI LTLVEKAQATFDSQEAERLQQKLKKSQFDLEDFRNQLQQIKKMGSIESIL GMIPGVGKAMKQLQGAQPSERELKRIEAIIGSMTPAERANHAIINGSRRL RIAKGSGTTVQEVNQLLKRFTEAQKMMKQLQKLGPKGLMRGMKGMGKGMF PF >GSU3056 flhA, flagellar biosynthetic protein FlhA MANTAVDAVDLQPAKSNSDIYMAVALIGVLALMIIPLPAFLLDLFLAANI TIALAILLVALYTQQPLDFSVFPSVLLVTTLFRLALNVAGTRLILLHGNE GVDAAGHVIKAFGQFVVGGNYVVGAVIFLILVIINFVVITKGAGRVAEVA ARFTLDAMPGKQMAIDADLSSGLINEKEARRRRSRVSREADFYGSMDGAS KFVRGDAVAGILIMLVNIIGGFIIGVWQNGMPLEAALSNYTLLTIGEGLV AQIPALIISTAAGIIVTRSADEKNFGHEISGQFLNYPKAFYVSSGVLFAF GLIPGLPHVAFFLLSGAAYMAGRLAKERAQVVEDDLMTLPAPAETGESGD QAGAIRPLDMLELEVGYGLVPMVDAAQEGELLERIRSIRRQYAQKMGFVV PPVHIHDNLQLKPHEYNILIKGAKVGGGEMIGQYLAMDSGAVSMPVEGVR TTEPVFGLPAIWIRPELKEQAQLAGYTVVDSTTIIATHISEIIRKHSHEM VGRQELQQLLDNLSSSFPKVVEDLVPNLLNLGTVLRVVRNLLREGVSIRD LRTVLETLADYGGLTKDPDTLTEFVRQGLGRSIVEQYKRDDDTLCLISLD RRVEEVVAEAIQPSDQGSYLAIEPNTAQLILSGIRQEMEKFNQIGTQPVL LASPSIRRHVKKLTERFVPNLVVLSHNEVPSGIKIQSLGVVTLNAG >GSU0426 flhB, flagellar biosynthetic protein FlhB MSDDKHSKTEKPTAKKLDEAKKKGVPHSRDLTSTVTLIAAMVALYTTGGF MFTTLKRTSGELLGSMGTFHLTEASVEHLLIKLFLVFLSVVMPFMLVVVI SGLATTMVQVGFSMNSERITFKLDKLNPVTNAQKLFNKDSLVEMLKAVLK IVIVGYMSYKIMRDEMDGLLFLADTDLAGILEVFKHLAFKLVIHTCGVLL ILGVLDLAFVKWRFIDNLKMTKQEVKDEHKESEGDPKVKGKIRQMQFQQA QKRLRKVIPTADVVVTNPTHYAVALKYERETMAAPLVLAKGVDHMAQTIK AIARENNVMLVENRFLARELYAQVKEGQPIPESLYTAVAEVLAYVYSLKG KI >GSU0409 fliE, flagellar hook-basal body complex protein FliE MIDGIESGLGIAQAFPSVTGEAKPGNLAADGGKFFGELVSKVSELQAQSD TAIKGLVSGESKGLHEVMIAMEKSSISFQFLSQVRNKAVEAYQEVMRMQV >GSU0410 fliF, flagellar M-ring protein FliF MPEALNKLIQPFMALPPAKRWVVGGVVGLSVIAFTILILVANRTDYRPLF TNLTSEDAGEIVTKLKEQKVPYRIAADGKAILVPSDKVYDLRLSLASDGL PQGGGVGFEIFDRKNFGMTDFVQKLNYQRALQGELSRTISQISGVEQARV HLVIPEKSLFKEDEKPATASVVLKVKGQRQLRENDVQGIVHLVASAIEGM NPEHVTVLDQKGKLLSKNTPGDAAGKMTASMQEVQRAYERSTEERLQSLL DKAVGAGKSVARVSAVFDFRQVERYEEKYDPETVVRSEQRSEEKQDGSTV TGGVPGVQTNLGRTAGQPAGTSGGGSKNDETLNYEVSRATARTIEPVGTL SKVSVAILVDGKYDAAAAGKDGKEAKPKYTPRSPDELQKIDALVKSSVGF NVERGDQVTVVNIPFQDTGDVGAGEADKWWNAPIFLSLLKNGLIGFGFLA LLLFVVRPLLKTLKPEKSTSFEPIPSAEDALNQIAEIHRLQIGNQTVSQM ELINKIKQEPYQAAQIIQNWLRDKGEE >GSU0413 fliI, flagellum-specific ATP synthase FliI MSRIDLSRYLSAVDAMKPIRFHGKVTQVVGLVIEGFCPDAAVGTLCLVHP NDGDPIPAEVVGFRDNKTLLMPLGELRGVGLGSLISVKRKKASLGVGPGL LGRVIDGLGVPIDDKGPLAIREEYPIYANPVNPMKRRPIRQPLDLGIRAI NALLTCGEGQRVGIMAGSGVGKSTLLGMIARYTEADVNVIALIGERGREL REFIEKDLQEEGLKKSVVVVATSDQPPLVRMRGAYIATTIAEYFQAQGKK VLLMMDSATRFAMAMREVGLAIGEPPTTKGYTPSVFAALPKLLERTGSFL DGSITGLYTVLVEGDDFNEPISDAMRSILDGHIVLNRELAARAIYPPLDI LASASRVMNDVTERSQQQFASRFKELLAAYRQAEDLINIGAYKPGSNPTI DYAIAKMDGMINFIRQGIHDGVSMEQSIAELADIFDEGMAL >GSU0422 fliN, flagellar motor switch protein FliN MSDFTKEETKDGELDRKNLEFILDIPLQLTVELGRTKILVKDVLQLNQGA VVELTKLAGEPLDVFVNSKLVARGEAVVVNEKFGVRLVDIVSPNERVEKV L >GSU0423 fliP, flagellar biosynthetic protein FliP MDGVPIFKRIPFIALCVILLTASLAAAAEPLALPSVSIGVGKATKPGDVS VVLQIFFLMTVLSLAPGLLMMTTSFTRIAVVLSFLRHAIGTQQAPPNQII IALSLFLTFFVMAPVWQQVNTQAIQPYRAAQITQDEALKRAVAPMRKFML SQTREKDLALFLNLSKLPRPRTADDIPTLTLIPAFMISELRTAFQIGFLI FIPFLVVDMVVASVLMSMGMMMLPPVMISLPFKILLFVLVDGWGLVIGSL IKSFG >GSU0424 fliQ, flagellar biosynthetic protein FliQ MSPDLVVQLARRSFEVTLMLAAPLLISGLVVGLAVSIFQAVTSIQEATLA FAPKIIAVMVALVIFFPWMMNYMSDFTREVYALIATMRR >GSU0425 fliR, flagellar biosynthesis protein FliR MFPLTTPFPTANDVAFFTLVMGRMAGIFAAIPIFGGRRVPTPIKALLVFA MTMVCFPIIKEKMPQLPTDVLSLGFLMVQEVLVGVSLGLLSLIIFAAVEF AGQIVSVQIGLTIVTEFDPSQGGQLSIMSIILEMLATLLFLSLGMHHIFI GALVQSYDVLPLGAWHMSGALLQFIVTTIGEVFVLAVRLAAPVMVTLLAT SVMLGIMARSFPQMNVFFVSMPLNIGIGFIVLGLSLPLFLHTVQGHFGLL DEQLKTMMKLMGKG >GSU3036 fliS, flagellar protein FliS MLTPFNQYQNTQVGTASPEKILIMLYDGAINFSKIALERMEKKDLAGKGK YISKAQAIVSELMNTLNHDVGGGIAQRLEQLYIYVIDEYINANINNSPRA LENAIRILTVLRDSWVEAIDIWKRERDAVPPSVHQPGYVAGQAR >GSU1132 ftsY, cell division protein FtsY MAEERKGFFKGLWGKVTGGDRDQEEAVNETTSAVADVAAPPEQEDRRPGL FERLKQGLSKTRDSLVGRIDRLVLGKKEIDADTLEELEEILITADLGVQT TVELIRGLEQRLSRNELKDGEALREALKEDIHGRLARDAHQLDVTGASPF VIMVIGVNGVGKTTTIGKLAARFTAQGKKVILAAGDTFRAAAAEQLQIWG ERTGVDVIRHKEGADPSAVVFDSIKAAVARGADILIVDTAGRLHTKVNLM EELKKVRRIMSREIPGAPHETLLVLDAATGQNALSQAKLFKEAAQVTGIA LTKLDGTAKGGIVVAICNEFRIPVRYIGVGEGIDDLRDFDPSQFVEALFQ >GSU0328 gspE, general secretion pathway protein E MEQIARRLGIPFLAEIGDNEADAALLARLPLAFARGRLVLPLRERDGRLL VVSGNPADLSAIDEVRGVYGMEVELAAATPDTVLGAVNHLYARLGSSAQE VVEELEGEDLSVIATELAEPKDLLDLTDEAPVIRLLNSILSEAVKERASD IHIEPYERELEVRFRIDGILYRKLAPPKVVQEALVSRVKIMAGLNIAEKR LPQDGRIRVIVAGRDVDIRVSIIPTFFGERVVLRLLDKQKGLISLENIGL SEGGVRSMERLLGRTSGIILVTGPTGSGKSTTLYAALNRLNSPEKNIITI EDPIEYQVKGIGQIQVNPKIELTFAQGLRAILRQDPDIVMVGEIRDAETA EIAMQASLTGHLVLSTLHTNDSATAIARLVDMGIEPFMVASSLSAVLAQR LVRRICPHCRESYTPERDYAGITLPSTLYRGRGCDACFGLGTLGRVGIYE LLPVDGEICSMIIRREPAGAIKEYAVGKGMRTLRDDGLAKAAAGITTIEE VLRVTQEEYADLPV >GSU0326 gspG, general secretion pathway protein G MHNTLRNRRGFTLIEIMVVIAILALLAALVGPRIIGRSDDAKVADAKVQI KNLETALKLYKLDSGTYPSTEQGLMALVAAPTVGTIPKNYRSEGYLESKQ VPKDPWGNDFVYLSPGEHGDYDLYSFGADGVKGGEGKNADIESWNLQ >GSU0322 gspK, general secretion pathway protein K MRRSESERGFALILTLVVTALLIAVTTEFIHGVYVDTSLHRNFVNLQQAS LMAEGGVTGGISLLRNLRTSGNDQGLQQLLADPVQFEDEKGRVSITIEEE DGKLNLNAVTLPNGDEHVFYGPAERRLLTALKLPTALHDSLADWLDANDE PRPDGGESAYYQSLSAPYAPRNAPFATFGELGLVRGVEPAVLERLRPFAT VFVDGGAINVNTAPLQVLMALDEGISEGIARDIMQRRRIKPFKSVGELSE IPGMETIAGKLSGFAGARGSTYRLVSRAAVGDVTRLVEAVVNLDGTQPRY LYWREY >GSU1267 lepB, signal peptidase I MDYKETQYGQSTPSEQAAEPVKKKHIVREYAESIIIAVILALIIRTFVVQ AFKIPSGSMEDTLAIGDHILVSKFIYGTKIPFVDGRYLKIRDPKRGDVIV FEYPEDPSKDFIKRVIGLPGDTIQVVQKQVFINGKPFSVPQEVHKEKDVI PAAQNPRDNFGPVTVPENSYFVMGDNRDRSYDSRFWGFVKNSQIKGLAFI KYWSWDREKFRVRWGSIGDIIK >GSU3135 lspA, lipoprotein signal peptidase MKPTYRIFNAVVLGSLVLDQATKVLIDRTMDLYQSIPVIDGLFSITYLRN RGAAFSFLADFSYRLPFFILVSVVALGVIAVTFRKLRDDQHLAAAALALI FSGALGNLIDRVRLGEVIDFLDVYWKTYHWPAFNVADSAICVGVALLAVD MIREERRKAP >GSU2043 pilD, type 4 prepilin-like proteins leader peptide processing enzyme MTLPIVFYLFSFVLGAVVGSFLNVCIYRLPTGESVVFPPSRCTSCGTRIR PWDNIPILSWLILRGACRACRAKISARYPLVELINGLLCLALFLKFGPTL TFAALFVFCSALVAISFIDLDHQIIPDVISLPGIVLGFVLSFFLPWLGWL NSLIGIAAGGGSLLLVAWLYERLTGKEGMGGGDIKLLAMMGAFLGWRAVP FIIFASSLVGSVIGLTLMMLQKKDSKLAIPFGPFLALGALLYIFFGKAII LWYLSIGAR >GSU0146 pilT-1, twitching motility protein PilT MELNDILTVAVRAKASDVHIKTGLPPVVRIDGRLRPIPNAPRLAPDQVRA MALAIMNDRQKRLFEEHFECDTAYGVPGLGRFRVSVYSQRGTVAMVFRFI PFGIPSMENLTLPPVIKKLAMEERGLILVTGTTGSGKSTTLAAMIDYINE HRTCNIITVEDPVEFLHRDKKSILSQREVGFDTVSFATALKGALRQDPDV ILVGEMRDLETIETAMHAAETGHLVMSTLHTLDATETINRIISVFPPYHQ RQVRIQLAGVIKGVVSQRLVPRADGKGRVPAVEIMIGTARIKEYIDDKDK TKLLPEAIAQGYTSYGMQTFDQSLMLLYTQKLITYEEALRQSSNPDDFAL KVSGISSTSDSTWDDFVHDEAPPAEGEGSVEGIEKF >GSU0230 pilT-2, twitching motility protein PilT MDMNLLSQILGIAFEKRVSDLHFEVDNPPFFRAKGQLLRSKLPKLSPQDT EFIARAVMEQNHRTLPDELRELDASYSLPNGGRFRVSIFRQRGSIGIVMR VIPPHVGTFEELNLPPVLGEIAKAPNGLVLVTGPTGNGKSTTLASMIRHL NETCTFNIITIEDPIEFLFTSDKSCIIQREVGIDTVDFSAALRSSLRMDP DVIMVGEMRDLETIDACIKAAETGHLVFSTLHTQSAVSTINRLIGHFPPD AQEVLRQRLADILVATVSLRLIKDKSGENILPVVEVMRATTTIQACIREG RLDEIEKHIENGRSLYQMQTLDQHLLELCEKDVITFDQAKQITRSMDLER KLAFTE >GSU0436 pilT-3, twitching motility protein PilT MARIDALFKLLKEQGASDLHLSSGAPPIFRLHGEMARQNFKVLSHEELTA ILYEILTDKQKADFEERRDLDFAYAIPGLARFRGNYMMTHRGIAAVFRII PSKILSADDLSLPDGVRRMTQFKKGLVLVTGPTGSGKSTTLAAMIDLINA TRKEHILTLEDPLEFIHENKMSLLNQRQIGEHSLSFSAALRAALREDPDV ILVGEMRDLETIGLAMSAAETGHLVFGTLHTNSAAKTIDRIIDVFPTDQQ EQTRAMLSESLKGVVCQQLLKTADGKGRVAALEIMLGTPAIANLIREGKT FQIPSIIQTAKRDGMQLMDQHLLDLFKTKRITAEEAYRCAQDKKQFEQYL TEKPGQ >GSU1492 pilT-4, twitching motility protein PilT MANMHQLLTELVNRGGSDLHITTNSPPQIRVDGQLIPLEMPPLNAVDTKQ LCYSILTEQQKHKFEEANELDLSFGIKGLSRFRGNVFIQRGAVAGVFRVI PYKILTFEELGLPVVVKELAEKPRGLILVTGPTGSGKSTTLAAIIDKINT ERHDHIVTIEDPIEYLHPHKSCVVNQREVGADTKSFKNALKYILRQDPDV VLVGELRDLETIEAALTLAETGHLCLATLHTNSAVQTINRIVDVFPPYQQ PQVRAQLSFVLEGVMSQTLLPNVSGKGRVLALEVMVPNPAIRNLIREDKI HQIYSQMQVGQEKFGMQTMNQSLFSLLQKRRISLDVAMARSSDPDELKQM LASAQRPPGQRPQMR >GSU2050 secA, preprotein translocase, SecA subunit MFGAIIKKIVGSKNERELKRMWPVVEKINGLESQVAGLTDDQLREKTFEF KERIARGESLESLLPEAFAVCREGGKRALGMRHFDVQLIGGMVLHQGKIA EMKTGEGKTLVATLPAYLNALTGRGVHVVTVNDYLARRDSEWMGRLYRFL GLTVGVIVHGIDDDERRAAYAADITYGTNNEFGFDYLRDNMKFALEDYVQ RPFFFSIVDEVDSILIDEARTPLIISGPTEDSTDKYYIIDRIIPHLKKGE VKEVEANTLSGKRKVYTGDFTVDEKARSSSLTEEGVAKVEKLLKIDNLYD PRHMEILHHVNQALRAHALFRRDVDYVVKDGEVIIVDEFTGRLMPGRRWS DGLHQAIEAKEGVEIENENQTLATITFQNYFRMYEKLSGMTGTADTEAEE FHKIYKLEVTVIPTNRPLLRPDFPDVIYKTEREKFNAVIEEIKGCHEKGQ PTLVGTISIEKSEVLAEILRKQGIPHNVLNAKQHEREAEIVAQAGRKGMV TIATNMAGRGTDILLGGNPEGLAKQWRRANPDAPEEEYEKVLAEYRTLCA REHDEVVALGGLHIIGTERHESRRIDNQLRGRSGRQGDPGSSRFYLSLED DLLRIFGSERVSKIMDFLKIEEGEAITHGMITKAIENAQKKVEAHNFEIR KHLIEYDDVMNKQREVIYTQRREILAGQDIRRHFTQMMDDTIEEISSFAI EKVSAHEWDWQSIGEGILKTYGFQIDIPPQTMDRLSPESFRTLLKEKVHE AFDAKVAAFGDELMDHLIKVIMLQTIDAQWKDHLLSIDHLKEGIGLRGYG QKDPKQEYKKEAYQLFMDMMARIAAETVEKIFWVQIAHEEDVERMEEEQQ KQARKKMVFNLVDEDETSEPSKSKKLAGRNEPCPCGSGKKYKKCCGK >GSU2617 secD, protein-export membrane protein SecD MSKGLLWRFSLIALFITLSLLYLTPTLVSPLPSWWKGLLPKDRIHLGLDL QGGTHLVMEVETQKAVEGTLDLIATDLEDALSAKTLRYKQIARQGGDRVG MTFYDRGTADEAQKLLKDKYPTMTLVPPYDEGGFVHLQLRMNEKEAQERK DRAVAQALETIRNRIDQFGVSEPVIAREGLTNIVVQLPGISDPKRAIELI GRTARLEFKLVDETVNPAIATPGTIPEDTEILMEKRTDPTTGAVTEIPLA VKKKAIITGDLLTDAQIRIDSQFNQPYVAIEFNSTGARLFDQVTAANVGK RFAIVLDNTIYSAPVIRERISGGSAQISGSFTEKEAADLAIVLRAGSLPA PVKIIQNVTVGPSLGEDSINKGLMAGAIGVALVILFMGIYYKLSGMVANF GMILNVLFLMGALAALGATLTLPGIAAIVLLIGMSVDANVLIFERIREEL RLGKTPIAALDSGYDKAFLTIMDSHVTTLITAAVLFQFGTGPVKGFAVSL SLGVIINLFTALVGTKAIFDFVLNRLRVKRLSV >GSU2616 secF, protein-export membrane protein SecF MQIIGKTNFDFMGKKKITFVISSIIALLGLIGVGQIALGTANMGIDFSGG TAVQLNFSQPVAIDQARHALAKHGFKDANLQEVSGGNKLLVKVGKATHVQ GPAADAIEDAFRKEFTDNRFVIESSTEIGPAIGDKLRKDTLVAVVISLVG IVIYIAWRFDFAFGVGALAATLHDVLAMFAVFFVMQKEINLLFITAVLTI AGYSLTDTVVVFDRIRENLHKNVKDSLTAICNFSINEVLSRTIITALTTF LATASLFFFGGEVIHDFAFALLVGIIVGVYSSVFVASPIVVIWGSRNKET KA >GSU1627 secG, preprotein translocase, SecG subunit MMIFLTFLHILVCLALIGIVLLQSGKGAEMGASFGAGGSQSVFGASGGTT FLSKLTTAAAIIFMLTSLTLAYLSGRAETSSIMPAKGVSAPAPKPAAPPA QPQQTQPAPAQPAVPAPAAPAK >GSU2837 secY, preprotein translocase, SecY subunit MIDAFQNIFRIPELKKRVLFSLGMLAVYRVGCHIPTPGIDSNALAHFFAQ ARGTLLGLFDMFSGGALEKLTVFALGIMPYISSSIIFQLLTVVVPSIEKL SKEGESGRKKIIQYTRYGTIVLSVVQALGISIGLESMRGPAGELVVPNPG WGFRLMTVITLTAGTAFIMWLGEQMSEKGIGNGISLIIFAGIVARIPTAL LNTGRLIKTGQLSLFVILLVVALMFLVIAAIVYVERGQRRLPIHYAKRVV GLKTYGGQTSHLPLKVNMAGVIPPIFASSIIMFPATVANFINVPWVQTVA KSLTPGNLAYEIFYVAFIIFFCYFYTAVSFNPVDVAENVKKHGGYIPGIR PGKETSDYLDRVLTKLTFAGALYISAVCVLPSVLVGKFNLPFYFGGTALL IAVGVGMDTAAQIESHLITRSYEGFMKGVRVRGGR >GSU0820 sppA-1, signal peptide peptidase SppA, 36K type MRMIIAFLLGCLGMFLTTGCAFVSVPLMSAPQPLAEQVLEGEGTKKILIV DISGAIGDQAKGGGLLSRGTPSTVSLVREVLLKAERDPKVAGLILRINSP GGTVTASDIIRHDLLAFKERRNLPVSACIMGIGASGGYYVATAADGITAH PTALTGSIGVLLMTFNVEGLLGKVGVEEKTIKSGGKKDLLSPFRRATPEE ERLVQGVIDQFHGRFVDVVQARPGNRLSRHDLLTLADGRIFSAADALAAG LIDRIGYLDDVIASLRDRIGDPDARVVTYFRPGSYQGSIYAESAAEPSMA DLLGGFDMTGGGQFMYLWRPW >GSU1234 sppA-2, signal peptide peptidase SppA, 36K type MTAATIGDGIGYAEVKGPIIDSQETVKQLDDLRKKSSVKAVVLRVESPGG VIGPSQEIYAAVKRLAATKKVVVSMGSVAASGGYHVAVPAAVIYANPGTI TGSIGVLMKLSNIEGLMDKVGLKAFTLKSGKFKDSGSPVRKLTEEERAVL QGVIDNLHDQFVRAVAEGRQLPVEEVRRLADGRVYTGEQALRLKLVDRLG TLHDAVMEAGRLAGIEGEPTLIIPPKKRKLLRDMLFGEVAEAVRGSVRKE EGLSFSYELE >GSU0028 tolQ, tolQ protein MTLFAGTGLVVKLVLVVLIFFSVVSWAIIFFKLLQINRANGESDRFLDFF WKTKRFDAISSQLDRFGNSPLSVLFNEGYAELRRLLDKGGEQRDEPGVVS TDLGGIDNIARALRRATTSEITRLEKYVTFLATTGSTAPFIGLFGTVWGI MNAFKGIGETGSASLAVVAPGIAEALIATAIGLVAAIPAVMAYNHFQHKI KVLIASMDNFSTEFLNIVQRTFAGK