TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 641

Free access
Sort by:

 



# Geobacter sulfurreducens PCA, PCA

>GSU2054 hypothetical protein
MAPSVGKINRFSAVGPFLSIDILDEWGSCIVNRLEFSEIFIYTKAIPPYR
IN
>GSU1226 hypothetical protein
MIPEARSRLNSGVQGRFALTLKKIVAPVGEKRAVPVDILRQQQDGQRAGD
VVDHQGFLVDRLSNDPFQFGGGNHVQVDDGVHHLEEVVADQAIPSQQLSL
VDTQKKIETDEAVDEHDCLGRRSHIYPPQKGEGITVTGGRNDKEKEIEPG
QQHEAEGARVHFQFLIFGHLAAFPR
>GSU0676 lipoprotein, putative
MRKNMLMAVTMLVLALGALTLTGCNEQIFNVNDVASDPAAYTGTITIAGV
MGGTSQADRTVFGIMDLKELACTTPGCNKIFIPIRANGTVPALGDEVRVT
GSFRTEPGGLIFVAEKMKVVKNHKIGG
>GSU3151 conserved hypothetical protein
MNRKTIVIPALYLTLLSAGVTLSAGSARAHCDTMDGPVIQDARIALKKND
VTPVLKWVREKDEPTVRASFKEAVAAAQKGTKAKEAAEHRFFTSLVKVHR
TAEGAPFTGLKPAGTVEPPVAAADEALASGSSTELVKIVTDAVAAGIRER
FDRVVEAKKHKDESVAAGRKYVAAYVEYTHYVERLHQAAEGHAAHHGGPT
GTKAPHGHGDAHVSHGH
>GSU2662 membrane protein, putative
MEMVTVSCPSCGFNKSVPRSAIPAGATHVNCPRCKTRFAWEASSAAASPV
PPAPPRPAAAPSVAPGIPVRKPAPPKPEPAGNEMVGVEELFRLAWDTFKQ
RWGTLLGLFLLTPLMALIPAGIFFGACHLLASALPASRTVLLAVGGIGAV
TMALVAYSWGFGALICATVDDEADLRRALAEGKRLLWPYVWVSTLLGIII
GGGFLLAVIPGIIFSVWFFFAPLILFAEGTGGMDAVLKSREYVRGHWFDV
FVRLLIIWGLSGLLGLIPVVGPIVSILLAPFVMLVQALIYRDLRRVKGDV
PYPCGSRDKAVWVGIGIMGYVVVLGVGGYLVAKSPLLQMLKQGKAGAGLM
VPVPANPGGEAGTISFGDGPSGNRPSSPESVSAALADQELADCMLYVYAL
DYTGTVKVNGEERYVIKGERSMNYNYTGSAGLKRGENTVTVDYQTLPDTP
LREIKIKLYRYDWDAKQEHLIGEWSLNDEGGSRSYTVQVE
>GSU2684 hypothetical protein
MMPIPGGVLGKATGTSTDPALNAATRMARAAEPATLRTGRAVAAAALSAP
LLRSCFHGGVYVALYPMPDEARITGHLRTIGRRARQTRSLESF
>GSU2884 cytochrome c family protein
MNRCDLGNLRPRRQSFVEKLMQAGVVALAATLVLVAAARAIDPPHGAVSG
FTCSTCHSSHLSLGSTGYNNICLSCHRPGVPLGGNRPLTMNDMANPFGTY
TGARRGIIYQNSHAWTGSDTVLPAGALPPLSAGLTAGATTGALSCTRCHD
PHNNTYPPYLRGANDQDQLCLDCHRVRNTSDHTRGTHPVNVNYGAAAAKD
PAGFHPAPVNANPANPTAAMKLANGAVLCSTCHGVHYADSNSATFDNHSG
HGTLTPSAGFLLRTDLRGATAASVNICTNCHRLPNHGAKGQNIQCADCHG
GHVDLADGTVPNVFLVNRYINASSSFGAVRGRMVFLQYTAAARRIYKNPA
GTGICQACHPVPTGGSYPAEHALATGTAADCAACHSHGNGTGAFSAAGAS
CTSCHGNPPRSSATGGPDGAAAGYAVTGVSEAATPHRRHAGGGSDYGYGC
AQCHQGNSHNTGTFQDVFRNTAGLVAAMAGAAPAYNSASRTCSAIYCHSD
GAPRNSALTPVLALKPVPAWTNGSGTITGCASCHAAQPATNAHAAHLAKG
YGCALCHAATVSDNVTISDRSRHADGVKTVSFSATNPLAAGTSWNAATAS
CSASRCHSSGTSGSAGAPVTTPVWTNPATGTCGSCHATSPAIAATSSQTI
ATGLHTTHFSAAYGVKLGTSLTACQKCHDYTTAKHVNGSVDLLASACTGC
HPQGASWTVATRFACTSCHAAVPSIINGVAAPYKARYAVSGHGQPAATYA
ASRACESCHNPDAPHISGVSGDATRLTLPNNNTLCASCHGDAAKVPTAAK
RNLATHVTAKGGAATSLCSSCHDVHGTTNLAMIRTVIGGKAISFSNLSSG
FVKTVSPYDGLCQVCHTATGHFRAGQALDGHPTKNCLSCHSHQGSYAFQP
VGGGACDSCHGYPPVPAGFVTGAGNYAAGKPEDYAGGGGAHVVARHVPKT
ASPVEGWANCTPCHGNGSLSPATHTMVLPVTPSKVTIDPSDRYKFNHALP
LGGQQYSGKLLDSGANATGSCFNVRCHFKPSKKWSTTK
>GSU1241 conserved hypothetical protein
MITKDMIIGDIIRKHPRTLTVFVKYGLDCNECQIADYEELEHGAGVHKVN
IEQLLSELNEHIGSGTE
>GSU2200 hypothetical protein
MKKAVSIVMALALLTMSLVTFSGTASAAEVKMIATIKKIEMKGSAATVTL
KDTKSDKTVTVTVKDELTLDKFKDKRIVEGDEIRLKYDDATGESKLFRKT
AGC
>GSU1259 hypothetical protein
MLIAIVVHAFIVGGRESRLAEQRAMVKRLSLTDLALFTDARYTRHPSMAD
LHSPFQDYPLSFEHFPSGSLMAPPPHVRTYGRH
>GSU1045 membrane protein, putative
MSFVSALCRLAVALWVGGATLFTFVLTPTIFRSESRDAAGRIVGYLFPGY
FRWGLACGAVALVALLVVRGKNWIPAAALMVVMLAVTAFQALHIEPKAAE
LKRQIPSFETTTKDHPLRREFSKLHGISAACNLSVIAGGVVLVILL
>GSU2111 hypothetical protein
MGSLKELLTECILDGGEKCIISGVPATVKDASIVDIASLFSTCHMGKRYC
CRDYHHRMGFIRALNVKVLP
>GSU2403 hypothetical protein
MNIRCFPYKTLNMKKIRKKYQWIDIGEDARRQYIDAQATFTAWEDARKAA
SEVRGGMYWKRQGETEYLIRTSPGNAQKSIGPRSEKTEQIYRNFTARKAE
AEQRLADLTSALERQQRMNRALFVGRAPRILIDILNKLVKVGLSEYFTVI
GTHSLYAYEAAAGVGFGEAAALATQDIDLLLDTRKRLSFITQMTTMGTSM
LKLIQKVDPTFRIRDDQKYTAVNSKGFEVDIIRREPKDGDPHPLRLTDED
DEFYAVPARNAGLLLDGPRFSAMIIATTGHMARMNTISPAAFVRFKRWMA
EQPDRDPMKRQRDILQANMVEELIVEYLPHLQQ
>GSU0968 hypothetical protein
MNGWEVFKVLLSVAMVMAVALPEPARAVPAAVAASVDSTATILSAGFESL
VQDPPAGWSLVDNAGTGALWRFDDPAKRGNNTGGTGGFAIADSDYAGAVN
MDTELRTPLLDLSGYETVTLRFETLFEHEAGETAAVELSTNGGAGPWQPL
WSRSGGNYGPATEELDLTSLAAGKPNIMIRFHYYGANYAWYWKVDDVALT
GIPRAQHQLAVTVNGSGTVASSPAGISCPATTCSALFYDGLPVTLTATPA
ADYVFSSWSGACTGTGPCVVTLGGAAAVTATFVPSTRNLNVTVSGAGQGT
VTSVPAGIACSAGSCNGAFDTGSQVELIATPSTGSVFAGWSGDCTGTGAC
TPTMTADRNVGAAFDPALTVELTIGTATTLHTTLTAAMAAIANGSAAAIK
ARAMEFPESLAISGGKSIVFYGGYAEGFGSVAGYSVLRGTVTVGVGTLTA
SNLAVR
>GSU2182 hypothetical protein
MPLYSLKLSIYDPEIIAAFERLRKSRKQATFTHEAIKQFLATERGKQVLA
NMDGGTSDQQLPALTSMAVNKPADKPPNETMSGEGASGFVQDVDYADVLN
SILE
>GSU1000 conserved hypothetical protein
MWRIKSRAPPARLILRNRPAPAAGRQYPLRIEESIMTTYTPLRDVPAAAG
YGPGDVFVLFGELFGRGYANGIVDEARKAGMTIIGATVGRRDNNGPLRPL
TAEELAEAEANLGGKIINVPLEAGFDLEPAGDGLTPVDRLKGVKPETVAT
TALDMAAIEESRQKGLARFRGNLADFARELDGLVPSGANLLIVHTMAGGI
PRARTLMPLLNRVFKGQGDRFLSSELFWNSDVGRLCSLSFDEVTADTFGA
LVDATASLRARVEVAGARVSYAAYGYHGCEVLIGGEYRWQSYTPYLQGWA
KIRLEEWACRVWQQGVKATVYNSPEIQTNSSALFLGVELSLYPLLAALAR
EGGSSAAAGIRAACQALLREGETMDAVLAQADAYLASPVLASFGKLEEWP
RHNTPDQAALMLTASDALMTMNADPKNIVCAELSKAVFQAVGQLMFDHSW
VPQAPVLWLNHDVIARRLAV
>GSU0801 hypothetical protein
MACATPRPRPPRSNIPISHLVATAVRLRAAPGCAPAKRIIRPGTTEEADG
AVSVKMPRSGRTLMFHAAALALAMAPVAAAAAAGENGGDGKGRRYQLCRF
DEDWSYLRAPSRRDDLWDPVKYLPLTSSGDAYLSLGGDARLRYEYFNNAN
WGRGPQDNDGYLLQRYLVHADLRLTDAARLFAQLQSSLEEGRSGGPRPTD
EDQLDLHQLFVDATPAVGDTRPLTVRVGRQELTFGSSRLVSVRESPNNRR
SFDGVRIIGRVAGMRLDAFATRPVETKPHVFDNRSDDAQAFWGAYGVAPL
PVLPGGSIDLYYLGLYRRGARFDQGRARETRHAGGTRIWGAAGAWDYNAE
FVFQWGSFGNGRIRAWTAASDTGYTLRNAPWQPRLGLRANIASGDSDPTD
RTLGTFNPFYPKGNYFSEAGLMGPANLMNLHPEITVKPARNLSMRLDWDV
IWRHSRHDGIYANSLSVVRSGTTGGSRSVGSQLQCQVDWQVDRHLLLTAN
YSRFFAGPFIRESGPGKDVDYVGTWATYRF
>GSU0958 lipoprotein, putative
MALYDEKAKQTEEYVGLAGGVTLSCRSVAARAAKTKRSLLVSCSVSEKLA
GGVKRKTGKRRI
>GSU0657 hypothetical protein
MPRKAAATPPSLFSSHSVAMANMEIAPDTLAKVTMKRKPEANKLCLSCRR
PCKQAAAVVISSCPRYYPGPKIKRENWKQLEFDLISLH
>GSU2936 hypothetical protein
MFNQEKIYMPVPQQLARGDRMGRGPYEKSGMIEDTGLPPGKKIHERFDIL
FPTEDVVEDGKKVRKTLAHDLEVEVKLWYLPFGSMNSDPFLWHEFTQKVS
ISAKGK
>GSU1948 hypothetical protein
MKTKRTLQALVPSLLIPLLVPLGALAAPSITSVSNSTLSNGQVVTISGSG
FGAKTTVAPQFWDTVENQSAYSGLANGATVPTGSGKPWETQCTWDGGACV
KYSTTASEQRGLSTATYRATSTTQGLLEGRRIAGTNKIYVSWWWKPGTNP
VSGDHSSKFIRLSSSSDVSNKTMSWTQMHNYIYRSDTGYCNNEGWANWNG
NVGQWNFHEVMIDGASRTYSLKINGKYLANNSSWANCAAFNFDYVWMIGW
DAGGNSPPAITSWMDDIYVDNTFSRVMICDNANFSSATKCEMQIPTNTWT
DGQLQVTVNQGSFPTNSTAYLYVVDANGSPSASKQITFGTSSGGGTATIA
PPSGLKVVN
>GSU3237 hypothetical protein
MASKRERKKVLDSLMEFGYFSMLFQKLRKGGEGTCTR
>GSU2582 hypothetical protein
MPLPLLCYCTLLFRPFLVLRIEAEHELVRGRPGGVQLVDRDVVVDAQVGQ
VFDGAGQVQPVGLPGRQHVGRNRLLDAVRGIGKAVGVQAGLAAVGHPGPF
AVTRFRDRIGYHEVAVAVGHQQVVGVDVVVVVIVVGPGHAGQVVPVAEQR
IVVGGDQAGVEPVFRGVEQVAEAILRIEVTGARMIGVGGPGQPLGEQGRQ
DFGGLNGGRGGGEESPFLVGEAEAVTGLGLGEHAVHHEAQLGPEAFPLLH
VAVGQLLHGVVDAGIGVGRHLEIHVDGLAVGPLHFPALAGLERVRAHLVG
EVVGVVAGGHDALETGHVVLVDDLHAGRLDVGLFPGDHAVRGSLGRAGEE
GAGVGHILGIGGRRHRHVAHVPPGDGAGPGGRSGNRQRYSQQCGQLLCSF
ASRHDVLLFFVWLRLLRALPCAGEANIAPPCCFQ
>GSU0973 hypothetical protein
MSDYTVIADIGATLIGLLRESMGDLLSADAIALLSPAELEGQDIRLTLFL
YAVSENPHLKNAPAPPASPGIRRAPPLHLDLSYLLTAHGSRLITDRTERS
LEEHRILGRAMRVLYDNSLLTGSVLKGGLAGSTAQFRVNLQPLALDDLSK
IWTALPNNALKASAGYQVTPVAIDSTRSTGGKPVVERTLEYYQKGAGL
>GSU1824 hypothetical protein
MAEQTACSCGEKHQGHLCVLKSKGLMDEVRHLTSSPTVSCFMCGAEANSA
DNVCEPVPLDK
>GSU1333 hypothetical protein
MKKLAVIIAVAALALAAPLTVMAADHGKGGMKHGDHADHGTAAHEEVVDG
VKATFKIMSMAEHMKAMNMELPKGMKETHHIAVEFKDAKTGKAITQGEVN
LKVQGPDNKAQTKALMAMQGHFGADFDLSKKGKYGIMSKFKVKDGTVRSA
RFWYEVK
>GSU2369 hypothetical protein
MNDPEPLLGVTEPLEESGDPFESRPDAEAPQTVRYVPGSPRRSWLAGIIG
NERVNVH
>GSU2424 hypothetical protein
MIYLTTDQFDQAVYFELRRSEHPRRSGGAMHVLHGLLGNGVSEMPVTIRS
GQDWTDVDFGKGDLFSFVDERTVRRMLGQVAREMELH
>GSU2764 hypothetical protein
MNYTHEFDEASGICTIRVTGVFRRPEDSDELKRFAADFFTSNGCRLFLID
VSQTELIAGTMPTFYAGTPQGELARVLRQVKTAFVRRSLTEDDHFLETVA
VNRGFKLRAFDSIDKAVAWLARGT
>GSU2900 hypothetical protein
MKSQSQTVIVTFEKKKEQHRRDAPQNTTKTLK
>GSU1813 hypothetical protein
MSTTRKQSCAVCAWRANCAKKFCVTDGGARCPDFSRDVSIKDVEEEPNEE
TK
>GSU0861 conserved hypothetical protein
MQERLTEAGREVTGSLVIDEACHMLRVQRDLRAKGEMVRSAEALLVLACG
AGVQSVSASTDKRTVAGLDTLFLGNIRRFGQFEQRCSLCGACRLNETAGI
CPVTLCPKGILNGPCGGMDEGRCEVKPDAECAWHQIFTRLHGQVETGPLA
ETTPPADFSTCRRPADLKLDK
>GSU3169 conserved domain protein
MGATLVVNGQTVVHKESGGVVTTTDVCKTPVGNTVVPIPYTNVARSVDVA
NGSSTVAVDGHPVMLKDSVFSRSSGDEPGSLGGVASGVTGGKAKFVNFSN
NVFFEGRPVCRRLDPMVSNLSGTGNTPPAPLMQENIAVAADDHQGHRIPF
TFSFRHPDLKSGRVEFPSFTARHAMEGTTDPGDDPDGYCDDVHLAGAPGT
HDLRFADFERERKKLPREETTT
>GSU3306 hypothetical protein
MKKLVIFACSMLLLILAAGIASAAQIKVYVSEFAITGAANKDELKTTLQT
LLASRLSGDSLLSVDSPAGADVLVKGSYIAFGKIFSIDAVARDAGGRVIA
RSFQQGESQDELIPAVGKLGQGLATEIAAKAVAAPSAPTQPLRSMPTPVA
RDIEKPEQASGGDIVRAPLASDIVKPQDVTRTASGGWLSQRLTGELIAIT
PGKLMADGSQDFYLAGDKSLRLYRKSDNLKLLAEVTFGAREKVLGIDSAD
LDKDGIPEAYVTIMDGESLASQVWVADGNNLKRVAAKLPYYFRGIALEGG
EQKIFVQQMSNDRDFYGDIFELVKSGNSYEAKNPFKLPRFGYLYNFNRFR
DKTGAENWVVINEDGYLIVYSSSGEELWRSSDKFGGSETYFLREDLANVR
TTGDPNRWIFLEQRITVTPAGDVIVPKNDGMFVIGNNRAYKKSSIYCFAW
NGSSLDEKWHTKQSQNYLPDYFYDHSRRELVNLEIVKKEGLLTGGASMVA
VRKVE
>GSU3335 hypothetical protein
MRLKDEQIARLAERLLEGLTAAGLITLKAERGKILDGIRRAVATDIKGEE
ELEKEAERLLEQTLRSMGSGAGIDRHRMLKMIKDKLAKDRGVVL
>GSU3175 hypothetical protein
MRFCGVCVAIYMLLLGAVSAHADQQVTFSATKHERERKIVKTLLKKEIRE
AKTLSKEPVQIDIALADLNHDGTREILAYVRQSPYFCGTEGCWFVIFRQE
GRTWRNILQLIIQENVSLSSISTSDFVDLIVEDKSVWSWHDYKYDFSHKL
P
>GSU0072 hypothetical protein
MLVKKEDGYCRPLFLSATILMLLAFCPTCSQ
>GSU3033 hypothetical protein
MSADGSEYGRYFEQLQKVNLTVRLGDTGSFDGTAAITSLKGSLAWLELFG
AEQPPPNTLSEGAEVSVSVWTGGALCRCDGRVETLRDDRQFAIRLVGRVR
ELQRREYFRLDVSIPFSYELLAGVSPEEAQERWLHERMDSWRSPAMVQEG
SSWRVVDWNGRDIPSARANLSGGGMRFRVSEDVASGTLMLVSLFLPHPQP
RVICVLAEALRCAEITLTLQAGTHYSLSMRFITISDKDREAVISYLFSEQ
RRELMSKSDRFQIGGRR
>GSU2658 lipoprotein, putative
MRTRTFVMAYDLLSFPFMRPAALGLHASLACSLFHPLREGCATPTTHLNI
GKGKSDCQGMAEYPVTTSLHNIMLKT
>GSU2048 hypothetical protein
MLRLVVLLVALVACGMIVGSPVRAADLKITEMAVTTKISRNNPIDAVRRI
SHRSVKALYCFTRIVNPGAQETVIRHVWYKDGTMASEQELTVKGEKWRTW
SKKPVDAESVGSWRVEAVDSTGKVLKAVEFKVH
>GSU1455 hypothetical protein
MVKCLVRLPSRILYLVPRVLLLALFLTYVTFMLLSSYEWFRPVEVAVQGA
LAKLVQGAAGLWSRL
>GSU2112 hypothetical protein
MAISRNRELGMTAWIEGHLDVTTATMPKMVARQWQRLLMDDEFSFHRLAL
FGFVSRRQRDTGDSGAFPDAEFAHFLGEFRVKIQQILNGRGAVVVLPMFK
RVGLQSIKRAQVAAGITGGVK
>GSU2117 hypothetical protein
MSELGPLYFDAEQMIGLDEFARRMNACANTVRYWIRTGKLVEGRHFIRQG
RKYLFPWSPQLLALIFDDWRPVVPPKRPLLDSRRPNRQPLKLRC
>GSU1787 cytochrome c family protein
MMTRIFWVMFVTALVMGSARLNYGAPGPKFTDSGGGNKHNLSYSNTNVNF
KATNSTDMRARQICIFCHTPHNARPQTTLWNRSDTTQSFGHYSSSSLSLH
LDATARTASDYAAEPNGSSRLCLSCHDGVTALGAVLAGVPIEVNGSMYTK
MTGDHVFDRAKITNSHHPVSFKYTAEVVARLKTLENPVSDYWLPADAPSQ
ESPGAPGTNKARQFVRLDKEKRMQCTTCHDPHQSQFYDTNTPPLTPFWAY
DGRGLSTVNADTVHDEVCYACHSFKTPNP
>GSU0979 conserved hypothetical protein
MPMALDNRKDPYGAFRFLVELEGLVVGGFAEVSGLQAETEVEEYREGGVN
EHPHKLRKITRYPNLVLRRGITDCRVLWEWYRDVIAGTVSRRNGAVILMD
REGNPAWRMEFSGAYPVKWEGPELKADGNATAIEKVELVHNGLKVE
>GSU2754 hypothetical protein
MVFLLPSHRIRFLEGGGGLAEKRVPAKKHRTSPEGKAPETADQPP
>GSU0190 hypothetical protein
MIQWLNPIFSFRKGRKSWNGRKKWRKSSSESSQKKIVCLAASRAPLPAGK
PNDNGNRPVDRLSFTLTPQGFSGVSGERHGQQIATHAPLLFQKTHIVRLL
SSP
>GSU0755 hypothetical protein
MALKCAVGGGRISLQQTDALGLSGIRLLHTPCRKLRDGEWVSRHNGLSVA
VDLSAAFQALAAESCARVFNEATPGSAKPIDCINDIMGTYPRIGQKRTIR
HHRGVPFDAAKKPCRDIGRASFGVRWRDNSMNMDKLLKAFAA
>GSU0980 hypothetical protein
MTLAERITERCSWRRAAGRRGCPGTVSLLTPERLMGRYLRGPMTLTPVPL
VHPGRTGRLDLSLSLRFFWQTNGSLVMGRPTGELPVSSVLRHGGTAMAPA
GPVEQGPGASAAGRGSDTAARFVHSAGGLPRCLDGEILRTILGRFEDVRV
RDLGPGRREDGRNHRQDRDSFSRAAIVGMVTMSTSGWGRILPAHRREQSV
GGNVFPHRAVRMMRRTPAGVAATQPLGEKSHASGDHPAVVSGRSFSAGQR
LSAPRTRLIGTVRPLGGLAAMGKRSGSAGGPARVGTASLPERGTVAGSAA
MDQRRRGMVSPVPADALEKELVLTHAMAGPVSRSALVSRPPAAVPANAVP
FAGPVAAETGRVARAPAAAVDVNRLADEVCRVIERRLVTERERRGR
>GSU1780 hypothetical protein
MTKQQLIQLITSRKRTVVVLLALATVNIILAVFKGVYQEPRIAELRDEWT
SKRSLNTGARRDVSSIYRQGTEDLKVYRERLTPKRGFTALMMEVFETAAD
NGLTVKGVTYKPKQVPDQKLVVYDVVLSLGGTYAGIKSFLSDLQCHRSML
VVESLSLNSGSAVEEAVALKVQLTAYLRTEE
>GSU0978 hypothetical protein
MGMEHRDRRRWCEEISRINRSANADATSIADL
>GSU0315 hypothetical protein
MKKQSAAPCKPRDMRQALAELSVLPGAIGSLVTDRGGKILERVFPPLFDD
TTLSAVSEDLAECVITLGISSVSTETLFLRYTEGIVAIKPLDDGLLLLLC
TRESSGKSITCAIDAAAARLRRVMSPAPPSVQVGPNGPASDRLAIAMSSE
SYPAK
>GSU1094 hypothetical protein
MSRFNLPIPGLGELDRQIAKIRELRDALQRLNTPAAGAVGAASGEA
>GSU2659 hypothetical protein
MNLKNDLGSQAIQDVLTSYPEIGEILQRYDIGCVTCKVGICLLKDVVSIH
GLSKEDEAGIEAEINDYLTSKAA
>GSU0557 conserved hypothetical protein, interruption-C
MDFKTVCQDILSLTKLNYNACIFGDGLPVTLRFANSIGEVLTAGKNIRAE
VLTFKHYI
>GSU1104 hypothetical protein
MREKILGVLMVLFVLCFAAIIIEETFLGGRKRRRLEKEARRRRQSDGE
>GSU0615 cytochrome c family protein
MPDNFSRHPLIASAPAWAPKFVRSAMKALLAGSVCRLLAVSLVMSATSLW
SAPIPGIPDGCATRCHRSKARDGVVHGPVASDDCIACHNPVGAAVHPKQQ
GAFRPVAKGAALCQTCHESMASKRVVHPPVGGGECLSCHDPHQSANPSLL
KARGAALCFGCHDAAAFSVRHGHLPVTTGECLKCHDPHQSDSPRLLRGSG
AALCFRCHDEKMAAGRSIHQPVARGECCDCHNPHGSSFPKLLRNAYPEAL
YLSYEQNDFALCFTCHSRQMADDRRTDTLTGFRNGDNNLHYLHINKPDKG
RSCKTCHDAHAAPQQRLVKERIPGFGSWDIPIRYTKTDTGGTCVVGCHKP
KSYDRLRAVSNP
>GSU2566 hypothetical protein
MMEISLILMVRKGRVMGLPPADASWRAVGIYPVRVK
>GSU1779 hypothetical protein
MNRQRLILAILLVVLGLSLIYSYVRSPRQQTVTSLPSTTSTKAPSRDRKA
APAKPVTDETRVRLDLLECSKGRFAGFRRNIFRPIFHDETKVPPAPPVLP
RPIPPPLPPRPVLPSPPPSVVPPQPPPVVRDMARFTFLGFLKKENRKTIF
LAKDKEIILVRQGDKIAGIYEAVNITDEALSIRSTTDGSEIVIPLVENRP
LSGQIR
>GSU2304 hypothetical protein
MYRHGCLKRAKALLLLIVALHVTGYSLMLRDQAFAPLQEQITFCCSANIE
ADDAQTDIDDFKPPKHSFIDYTTYFCPSRLVPIYQPSESRLVFAEPFRLP
PQVYPEISVPPQNVA
>GSU2774 hypothetical protein
MIETIYGIYKSNLAFFGWASALVNCLLVGWMYFNKQRHERELIQLKAKVE
HEKELEFEKRKKLYGMKAEQFEKYFRMVDEFNKKQNAEIPKRMQPLFSAY
LKSMLEAGNDQNMRNEAILAFSDQVHAILADGMEDYMKIQAETHCLKLVA
GEALGEILTRLEKLYGESFQLSQQFLNEFTAILGDQPKVDAYTNTIRQKG
LEMKEALVELMEQMRQELQQI
>GSU2528 conserved domain protein
MGPPGRELSDSIAKRSGLCQMDGGCVAPRPSIARIVKQPGIPPGCCVPVI
PVSWVAAGGRDRNPSAEHPPGTPEGRLIMGIVSRAAALCAIVILWVASVV
GAAELGTARLSLVAGDVQILTEDTNEWVAAAVNTPLAEGDRLWSPDGSRA
EIQVRGGVQVRVDARTALDIVDLGRESFQLSLAEGRAYLTNRKGGVDRIR
VDTPYASTGIYDNSIVMVDATSGGTAVVAVIKGYAVVETRQGTTRVSAGS
ELRLRADEQAEIAPIGSPDEWETWNRKRDRLLADAAESLRYVPDELDDYA
ADLDANGRWLYAREYGYVWSPRVSAAVDWAPYRLGRWTWVRGSYVWISYE
PWGWAPYHYGRWVFVSRVGWCWVPPSRGAAYWGPGYVGWVYSSDTVAWVP
LAPGEIYYGIGFYGPFSVDITNVAVNPVVVRTYRHIHVRNAVTIIDRDTF
ITGRRGRSIARENPFISARVEVGPPAIRPARETRRPIDKRIAPERQPPER
IRKLKPEEVRRDRRTVPGDAGSVFSPGRQADEMRVKRRETPHRQEAAPAP
SGSRQGGKGAGLPSGRERTPEGKGRNAPPPPVPRAVPGKAPSPPPRPDDA
VRQPRVQQRSPQVKTPAPDTRKQMPEVKKQVPEMKRQAPAPEEQTPQVKP
APPAGAGGQGREISNRPAPRPEQAREPELQRPARERRDEGQGKDRKKYGR
DDRE
>GSU2407 hypothetical protein
MCCNFYPGLRTAHLMLGIMHNVTRYRTRKEHGMHYFIKENKLHRYPVPKR
CGVQLQKEILRDTIPYNVEQCIYCMRRWPEDEKKEC
>GSU0335 hypothetical protein
MTATRIHENNLLTAVVVGSWLLLAIMTAAGYLFGSVQFAGGILAGGILAL
ANYYWLFRIIRRALGLDAHQAARFAQLRYLLRFAILAFVIYLLVVHAGIE
VLGLTLGLSLLVIVITALAIYMFVTKGD
>GSU3412 hypothetical protein
MGAFGRSNSPPPEGKGWLTGVTVDYTENGSPPSTAKNACAASPRTRSRLR
GYNTARIRHHGIRAAHFSST
>GSU1669 hypothetical protein
MKVCLQKGGRFQPAFGHTGREVLIMARFYDPKDETDRSSVETVLRKGGIE
YFLRRERGDTGMLEIYVAEEDLPRAEELLIRGKGGNE
>GSU3416 hypothetical protein
MRQGISEVSMKRYAMILLLVAGQLLSAGAGHGEEPAVRTASGEFAILGGY
GITHRGFGATRTQVETVDAILRYGHFLSGELGTGRWFQGRHELLVEVPLH
LTLDPRVRTMTGGYLLGSWKFTSLEGLAPYVFAGGGILYNDLGLDTQGTR
LNFSYQGGTGLQWFVRPDTALGIEYRYHHVSNAGTAEPNEPLNSSKFLLG
VSVYR
>GSU0616 cytochrome c family protein
MLGKGLWLAVCAVILLGTPATAGTIAFVAPTPNTWVGRSDHLVLKLNNPE
ITAVRINVNGVVGDMLAISSPEYRKAFQDFLIVQPLWDQGRNEVSVEAYA
GKERVETAVASVYYAPGRDGATVPPEFKPFVFHVADTESRCAGCHNMEPS
PAQLLSTQERENPCFGCHRGMLKVAFVHGPAGTYSCVYCHKEKASPKYSV
PKRDSALCVECHEDKSTDFAKRKYVHGPIAGGMCEVCHDPHGSANRAQLR
MPINTLCLSCHETVARRPHILRTPSGEGHPVSGRKDPSASASGRDMSCIS
CHNPHAADVRYFFVNNAEERMALCQMCHNK
>GSU3414 hypothetical protein
MKHKNIAQISAMTLGMLLTAGVAHAVSVNYTTTHVFKKEDIQCGRGIIGS
TCLGNPVVEGSLTYYGIDSVYGWYVKNFDSSALQPKPRDGVYDDGRVADI
LDGSGNVIGVKVQNNETGNWKTGPIGGEWAKGLGALKAKAATEKYVVMDH
LMRDPADPNPLQEGIDYNKRMKDDGKYLYFWGNYNQEPTPVYIYTALPLP
DKWKVAGASYKVTSAKLVVDHDISNSPNDQIRPEDFENENATGILPQYTV
CPTAPALAPAACSGLPAGTWVSAVDSQEGGDLELLPAGTVLKALNPATGL
FEYTNAWYTTLDRDPFGGLNPRFRLKSSKYGQDLPGVEIPQYVVGTPTTT
TIDLLTIKDPVTGLPVLADSKNWNNFLDEHTDIFGNYDPFDGYSMEDCPL
TRDLDLMLYVKGEVGKPTVVRSAQLLVTYEDPDGVAVPTIDLAVTSVAIP
TSASKSSVQTLKVTVDNLQAGVASGSLTLVGKNSSGVVVGNFSQTVATPA
DNSAVTYSFVWTAPSYGTTVKWEAKVTNEKDINGANNLKTASTVVKK
>GSU2938 hypothetical protein
MKKIMMSIAAALVALSMAAFAFAAGSFSGKVVKVDGEKVTVKAEKVPAWA
KKGAHVQASGGMPTIVEVKGDEVVLKFGKAKAAKIKTDSSMTISESAGDA
LQGC
>GSU0176 conserved hypothetical protein
MDILATMAERKIQEAMARGELSNLVGAGKLLAMDEDLSGVPAELRMAYRI
LKNAGFVPPEVELRKEIVSLRELVNSLEESEERRQRRRELDFKLLKLAMM
RNRPMNLDDFPEYRDKVAAKLGGE
>GSU1924 IPT/TIG domain protein
MIVPLMLVAVLVAGTAGGATRPAKQKQIAAAAATKAPEPVPVNILSIIPA
QGEPNTTVTLSGSGFASGTTAYLGNTEIPARLVGTDVLSFEIPRLQPGLY
ALFVKRGDGITSRTYNFSVLPPKPVVESLYPETVDVCSPSGERLVTITGR
NFQAESRVLFNGAAVRSSFGSPQALSFIVPPVQAGLHQIQVKNAEETVTT
PVTLVVRGTPEVFSVVRGESSVNYYNLVIEGRNFQPGATLSIEESGLQLG
LNAGGGKRIRAGATGERDMLIYVDCSKLVYQRYPVDPTDKDLRLQVINPG
GETSSVVQVSAP
>GSU0553 hypothetical protein
MFLLPSTITRNVIFISHATPEDNDFTIWLASRLQAEGFEVWIDKQALVGG
EKFWQEIDQVIRHKAVKFLLVYSENICYQKQRGILKNGVAKEISMAEGIA
TTNGLKDFMILLNVDGSEYNLFIGADTLNQVPFYENWAGGYSQLRKKFKK
EGLEPTGSPVGDFSKWYEDEYVIHHGIENLHEIYYDCWWPIPELPESFFV
YIFPNNVVASQVHQRSKYPTGKASNILASFYDTSEFEVEIDGQLERIPFT
AKHEIKVSDVIARTVRLGFPGTRDVENYFRSLLQRTFHLIMKERHLHWYE
MANMKQAYFFSAKSHPKIKFEYPHRATKAGKSKVIFGKHLSSFWHYALSA
KPIITPQLAFGLKSHIVFTTDGVKLWDDKAKMHTARRRKGRLMFNEEWRD
LLFAFLHALCQGGTTVNIMLNPTFELALKPWTNTYHSDFGYIEPKGQDRH
DILAAEYEYQDREDIDPDEVANV
>GSU3184 hypothetical protein
MAMLQTATSASRGWIEGSGLTLGAIFTALLEPVWAPVRAVSQLFDNSPHN
PDEGILQASPLIDLVAQQKKKTIEVAEGFILEREKVLTNYLMKEALYKDA
PHLIFTSVVMTMDMEYCHIDGYFGLKVYNPLYTNREDIEANKPCAYWTPQ
HGEWVKAPRDNSVYVRRDKQANTSEKHMTPEEFDSYQEEARIEGITASYL
DDKGNAREVKIVAAPVLLPDAETVLYENWIKQLQYTELAVVKYPLKLLPM
YHFDPRRWHLEKNATEVLDKVGDRGIYLGFKMYTAQGYRPWDPRLPILKD
FYAKCCLRGTPIVNHCTPKGAAAYERDEYFGFEHPLDGENDRKQKQDKRS
AYFSRHFVSPQAWKELLDGGAELVDGRMHVSFKNLYLCLAHFGGPTDLGM
EWNRQIIEMIESGEYPNLYTDISSSFADSGFRKDFKAIIRKHPKIKDRIL
FGTDWYMTFVYSAPFVGKNLWDYCTETKKFLDDFDTSLWPAFTQYNPYRF
YRLDQQIGRIAENIIEMRAKDKKILEVLDELEEYQINEIRREVAWIKFAN
HGHVIFKETP
>GSU2166 hypothetical protein
MSPLDRQTDEPTNEERAGRIDTVMQAYCLTLENRDFDGDEDDVKDMLTDL
MHFCERMEIDFEENLRVARNNYKHERLAEQGDTGQLGCPVCGCFLEVTRT
DTLLGIDRELYDCQECDEIFIRELNAPDSPLQRAVKCVGCGNMIPQASAR
ILYQRDDYAHFIGECCWDERLRE
>GSU3003 hypothetical protein
MTILETSWVQGNDREIEELVRLVNEAIALELNASRLYALFQDLFPDDGEF
WQTLSIEEENHANLLRNGRRLFLPEGRFPRELLPESLEPLVEKNRELETL
FDRYEQTPPSREEAFRTALVLEESAGELHFQRAMESRAPSWTLKVFQTLN
NDDRDHANRLRDYMAAKGIAE
>GSU1254 hypothetical protein
MGSRIAAHLLIIGLVLVCWLESAPAAGITIKATGEGIYGIDVAALDDIAV
VDLEVGYDTLNFDAPRIDRGAALPSDVSFITNTAIPGELRVILKRSRAIT
AGGRIAVLRFQPIGETPGRLLSAHARLITTKGVSHALPVGVVNAQPKPSV
GPGSITDKISGGTGSPSPLDTAAGDMTPSPVASGGSGSRNTPGDRYAGRG
GADATVTFADSSPGEAKPAAAPPGSPSPAATPAVPDMASQPAEQAATRAT
ADQARAVPAQTIVYGRVLDRFSEVRGETAAALVALFTPPAGQKVTQQPQV
AISDGRTVVTLRVELPRSFEGTPVFAVTSAKTLSWEQDGEGTWTVRVLPD
ADTLDAVFVTYFGEGLIQYPLVVAPALTRLPGGAPALDEAGFALFLKERA
REGKPRFDLNGDGTRDYRDDYLFTANFIARGRQAAGTPAQSPTRSDN
>GSU2035 hypothetical protein
MEVAGLETLRYDIEHAGYALPWSFQSAITYEEAEDATAAGYNDAPSGVPW
PLRIGDDAGLNGTDYLVVKSAMAARNDASGKSSHMGTGSARPWQSDNPLS
QFASSDSVVVINPRFQADDPSAAGRELVVNGSEFFANYSAAIGNTKHLRH
FSPDNYPMASKSPFFFIYGLDTASSVRMPFNRADFFVERPANIAMRCADD
SKVGVLYKAVVRQDDGELIKMPLLDCVADFQVVFGLDTNGDGSVDLHQND
LESPSELTDVRRQLKEIEVFVLAHEGARDPGFSYPQQKVMVGYIIGTTPF
GREFDLATINNWRNYRWKVYSFIVLPKNLGNY
>GSU0393 hypothetical protein
MLRAPARGGAAAMHDDGRDPEKDETNAVCPSWLLWVLAVWLFIICIVLVL
RHGWTFPEF
>GSU3049 hypothetical protein
MANRFTDVSPHSLSENEIFRPDKQTLNLTPSRRKRLVSP
>GSU1849 hypothetical protein
MLKQQAKQFTILCKLIDIILIYIAFAAAYEIRSKIGNIGDFYHYLWVLLV
IIPVWHLLLSKYGMYASTRTHSIPKLISDLVKVHIIGGVITASLIYFIEP
GGFRRILFGIFILL
>GSU1447 hypothetical protein
MADAREVLEIMKEVAKTRIAMLRDGTTFHQGDKRAYFLQQYEEKLAQIEH
LIRRISIRIVGPDETNRPRPTK
>GSU0155 hypothetical protein
MVQIIDKKVNLEYPLGHHLHCMIAQVPNHLRGAEKGFAIEEPDRQWERVR
SILDLVAAGEGNLKKLHFLMFPETHVPVGRFDEMLTAISGTFRPNTVTMF
GVEHVSLKAYREMLERFREDNAEAIELVDRDIDSGDVLEMPVNWCCIAVK
EATGKLRVFLEAKSHPFHGEEFLDKFHDLYRGRHFYLFRSRPSCFNFMVL
ICLDYLYRDLYSSNIKQIIDHANQLYFSTRQTLDTIFVVQCNPKPEHRAY
RDVLAGFYGEYLEDTPGVRETVTVFGNTSEETRIEDVPGGHAFGTSSVVI
NSSHRLARVQLSEFATDDFDGAPICRLRFGTGTRLYYFNLPLHHEIDPRT
TRVPLKVHTIMRPSRDGGWVKISGDELVAGFEIAQNM
>GSU1625 hypothetical protein
MTGYEYTIAQPKAISIILSKRSEEHSNVLV
>GSU0071 hypothetical protein
MDTTVIDEAIDKYVHERMEKGKTRAAERFLSYAYLKYAGDEVGEFLRKAK
GLTRYYVDFLTLMENPFKGPELAWFASMIVVGIVSCVMLSEEQTRLAGIF
ILSGTLVHAWSLLRMVAKKWREIGVMIAIYREIIEIIERETSSLPSSAR
>GSU2889 hypothetical protein
MNVKNAAVILNGFVHDFAAGIWLAAIAAIVLIHRMHGGQAAEIVAILNRL
EHIFFWASVVAMVVIMATGAGRTFTYVENWYGPDAEKVRRRMLIIKHLVL
FAAFGAGYLCVYGMVFH
>GSU2355 hypothetical protein
MSKRTLLIVTVIISSFFVVLGMRNPDLNRSPAPKQRPRAVIESVSKTSDR
VVAQSVSAVVTLSAVLPAPPRPIPNSQLVPEIYSPPHSPLLASHSSRAPP
LS
>GSU0765 hypothetical protein
MSGILPPLLFSDFCIHNDHHTNGPGEQRLGRCFFLVIPMMNPVLAIAERL
KLFKDGGHMLVTSLVSGS
>GSU2737 polyheme membrane-associated cytochrome c
MSRKVTKYSAVLAVSLFAAALAGCGSENKEGTVGTGPGGVATVGDSACVQ
CHSAVTEALTGESLIAQYQKSSPHNTAGLGCESCHGGGAQHNGVGPIPFA
QPDASRCADCHDGTTAVATNSDTAFAESRHNIQTIRSGATCRRCHTHEGA
VLSNIAGYTGDLATLEDTVNQNKVPLVSSYSQISCATCHEHGGGLRTIKA
TNGAAGPVVNWDPNNNRTVDQFDLCTSCHNMYSYNGSTLLTNGVPVNGVA
TGTVGHHETTWYRIIATTHFDNYSTGPQAGAGASGTNAKVEGYVLRRTGA
NPCFDCHGHEAKTNTRPGRDATIHTDWAKSAHAGGLLTAKYNAVGALTGA
AAVNAAMNAYVDDTTAIAWTHYNWDASSRGSCQRCHTATGAANFMSNPAG
YDPTGAGNSFSHLQGWSAANGSKQNELLYCWGCHTNAGTGELRNPGAITE
NYAGVNSTSTGTTGTAVTISYPDIAGSNVCMTCHLGREAGENIKAITDAD
GILGFVNSHYLAAGGQLFGKTGYEYATRSYAKPTFFAHDKIGTAAAPGTG
TNGPCAGCHMTTPNSHSFLPVTKDGTGAVTAITSTACATCHAGAYALTPE
ALTAEEEEYVASLEALKAALAGKGILFFNAHPYFYRDTNANGIGDPGELV
SSNAFTNWAGVYGLALWKDVMGAAFNANLLIHDPGGYAHNRFYVKRLIWD
SIDFIYDGVLNNDVTAAIDAQVTATRLDSATATAAKAYLGTTRP
>GSU1744 hypothetical protein
MRRLIPACLVIALGVLTGAPAFAAEKQPREEVAFIRSSGETFSLDRSVRK
QLDAAAARVAARDGRRIIMIEGGARRGGSDEERARNSLNLAMMAEKYLRR
SHSLSRDILLAAAPEPTGKGREFIRILTLPDDFTAVHVSRAGTSSQP
>GSU2466 hypothetical protein
MHRDIRLHGQIDERIEYYAIVAGDDSHQRYFFNAAEGPGRELRFFAPGNE
FVLGPGGIRHEGNGGSFCEYMFGVDQPVPDLAKGDVINRLVMYGARMEEE
SGNLIFDDRTGGQLGFEKMFFEGNAVVNYFFFISSARVTGPLRRQQESIV
RTIGKTLKRSAAVGEQDENALIAEVLALLSDPTALFFLFKLINVHHREYH
DTFRRLYFRNKKIADDDFAMLTAVAARHDIDRYQQERIRIDVMYKHPANR
RIVDEYKNILIGCHRKGEISRLENARLTRLKTLSVRNKIPGALFYTLDEV
LKNDKKLVAPEEQDYLSDIRQILEGIFLTERDIESSIDREDMARLLFAKK
KAAENRDHAFEELLLDASRLCDEKMRDGAELAIIEGFSYLITFFDRYDTT
SQAVNQLAFMENVRITEEMIRSLLGNKTAFDSLRDGLFDELFIAGLLENK
YLGRYGRMKITNLRRGLAEIELNRLTVAALMEQLLTIDREERLAILLLSH
VRDRIRNFYSKYTTKADQQALRREVNEELIAKKLVDGNVPDRLFDETILT
IQKEAVYLHGLLPRIIADRDIALREDFLENSGLDRFYVEELEREYFELNE
LDLEELYQIRKGLS
>GSU2116 hypothetical protein
MRGSPTYQVQQIFDRSGINQIGQSKHDAKQSIRAELNEQGKPCGCHELGQ
NMGIHSYATADAYRDVWRQILEYARAEYGVRDVEKLTDQHVSGFLEQKVS
ESLAYATFQQYAAATEKLAVALNGYAERHGTGQHYDFHGALKDVRETARE
LLVRFDGSRAYDRPTDLVAAVGKESHNLAACIQHESGTRVHEHSQIKAEA
LRGTRVDRFTGEERGYFAMEGKGGKEGLKAVSVETYQRLEQHIAVHGVFT
INEGSYRESLRRAAEISGQRYEGSHGLRWNWAQERYATLQNHGQNELQAM
CEVSREMGHERADITMHYLRGK
>GSU3351 hypothetical protein
MSRVTHKSVVEEQGQRTARSTDVYLPKEGHEAALCKKCGALYRNKRWAVA
GEEAGTGSLAKVVCPACQRMADNNPGGIVTFAGDYLLAHEDAILNSIKNI
EAKSRVKNPLGRIMEISQDKNVLTIATTEDKLAQKLGREIFKAHRGELHY
RWSHGESFVRVSWFR
>GSU2037 hypothetical protein
MRQGGFSLVELVVIIGIIGILIAIGTLDYGNWSRKYNAEAQMKTMYADFA
SAKLDALHSKRQYQITLGSSQYVIRNYSSAFDATGTVTQTKTLKTPVTWS
HDNIITFDTRGFTPLSSARTVCLASSGATIVDCVIVSQTRIDLGQKINPG
GECASANCRPR
>GSU1367 hypothetical protein
MTNALLLKSIPYEEFDYQTLLDVVHGYARPRMKISAMLAKGDIIRVKKGL
YILGEPLRRRPFCRELLANLVYGPSYISLEYALHYHGLTPERVEAVTSVT
CGRSRTFATPVGTFSYRMIPLEAFRTGMDRVELDDGRSFLVAIPEKALAD
RIVADRGAGISTQKELHEYLLSSLRIDPGGLRELDPVRVAEIARAYRSRR
VKLLADLISRLHRRGGR
>GSU2162 hypothetical protein
MKITISNDKASATVICRDLILDDSVAGARNLFYLTAYGPTQEIRAFAQIL
AMKGSLECHGKEDRSINIWSNHHLRVIPKMGEGYSGVYITPSSDSSFLIG
SSKADCYQVFTRILDQREFVHRDWYEVLFNEVSTEIEPFVGNKRCWKFRG
HELKSEIVNRLKYGGLKMPPATAHFTIEKEERHALSN
>GSU0839 hypothetical protein
MLWPVSSPWMTIVYALQYSPSSVREARARRKRSTRPAGSGPGRINGAHPV
VHFHIPASSPSVRSRSRYGSDACLLEGFPGPLPGLAILASTATTSAGFWS
RYVMNRGSRQCPRETVSARLSGRTRTVFVSRKAVRMVSRRSSSSLSRTTR
TVIAGCSSIRRNRSASAAIDRYTSRPGITPETVFRISSPTLSGA
>GSU0872 lipoprotein, putative
MKRSVLVALAVVSLAAAGCKDKPKAPETAAPAPQEQSGMPAGMPAGMPPA
PQGEMPKDAVHGGGDPHAGLKPVEVPAGVGHKGKVLSTMDAAGYTYLEVE
EKGQKLWVAVMQTKVKVGDQVEFPDSPPMLNFHSKTLNRTFEKIIFAPGI
RIGGK
>GSU0838 hypothetical protein
MPLPGQVRAEGVIGHARAVSSMTEGSAAGTSPPAGNVVLMGIDSTGDVVF
DTPGSFALFTAHTAGQRCSGQSPALG
>GSU0901 conserved domain protein
MDGGEEGVEQPLPLQQVPDRVRPPGEQVAQPVFPLKVDEDREAAVEGEGK
TGHERQGGPLFELFAQVPGGRHVVHGKGNHRAVETFHEASFFLAGEFREV
RPAPAALPGGQKRPTGKGADGIDAAGPIMIQEGTVARLPPMEAQPAGEVP
GPGEKLPQPRHMPGNGRCLEIADDVRDPDCFGEGAVDGQGAAGAAAPAAV
DPSAPLGVAVDPAPDHPVEPLLQYPHFIAHFSPHGVPLTGRCPSHFPASL
SQDPRRGEGEIPR
>GSU0965 hypothetical protein
MKMQAHVVQHGVFLLVWSTLDGCEDVVGKPLGQFLVLHALAVPAAAELDD
HVDVVGRDIPASHIADVIALTVEGADYSVLHGYLYLLNRVRSD
>GSU0558 hypothetical protein
MRDNVPQIGCFLYFNFNSLLIHITFDSVFNASSIKFRTVVLPSPQAPYNP
ITMLLGSVLLAMTSEIRLEKGHLENLSLSGLDIGASDE
>GSU1647 hypothetical protein
MVVLVRCIDDTITVALEANLKQLIREGSIKSFLRAGEWVDAVTSLERKKR
PVTLSEGRYLTLVSGF
>GSU1726 conserved hypothetical protein
MRGDRIRQLIAQEAARLMYVEGVSEYRDAKRKAARRFGLEKSLSLGSHLP
SNAEIHRELQRLIAIHEEKAHPERLHKMRCAALRCMEMLAPFRPYLVGSV
LTGAVTAASDIDLHLFAGSSEEVEEYLRSRNIPYESDVVTIRQGGEFIEY
SHIYLEEDEVIIECTVYAPDDIYRVPKSSITGKPMERASITKLRKLIAEA
DPANV
>GSU1187 hypothetical protein
MATAERRSILCPNCRRLINRDEPSCPYCGTSRPVRG
>GSU0157 fibronectin type III domain protein
MGGNHRIVVKLVAGIALAAALAGCGRKGPLVPPESFVPASVSTLTVEQKD
DAFYVSWPAPEKDEGGRPLKDLAGFRVYRRPVLPPGQDCEECPTAYTLIR
TVDLEYPQDVRSYDGRYVLADRGLTNGATYQYKVISFKKDGSESGASNRA
RRTMVAAPPAPRLAASSSATGVALQWEPSPPATGEPVGAYIYRRRGDDLA
SLVLLTPSPVTGTSYEDQRLERGVTYVYLVRQVVAVDGQLVEGAASNTVR
GALTEPE
>GSU2356 hypothetical protein
MESTTPYHYFSKLKLPKQYNFQLKAISREIVNPVKLQAKYNTF
>GSU0703 hypothetical protein
MILLAVVLAVCSACAGGDTGEVEATIRRYNDLLVQGYRAQNMNPLVEVTT
HEHALKLYHHMAALGEGQLRMESKLKRIRFLNIERRGNSEATVETEETWD
FTHYRMATNEKYAEEKDFIYRMGYVLNKENGRWLITGVNTISGTSTNNVV
PWPVLDRKGRVVNQAPGGAQGRHP
>GSU0502 lipoprotein, putative
MRRLAALLFLLSLAGCGSDRRAVEEVLRLREEALSRGDAALYASLLAPAY
RQARPDLEQNLLISGPIGYRSLERRIRLDGTTAEVAGRYVMKANVKGRIL
ELAGQETIVLHREKDGWKIAGGL
>GSU2908 hypothetical protein
MTLLKRIGRFLASMELAVALFLIVAVAAIPGTFAGTRALYAHPAFLCLLA
GVALNLVCCTLRRFRSLSVPVLILHLGVLVICGGVAARTFGHVATVNVYE
GTSVDQAYRWDLERDAPLGADLTVTRINREFYPIPVRVGVLRGSAKEALH
TLKTGDSFTAGPYRVTADSLQFPSEVLRLSVFRNGQLVGTCDTSGERSLP
AEFPYDFRLVAFQNPVLKRLWVDLALSRDGVAIAEGTAEVNAPFIWNGLY
FYNTQVGTDAMGMSFAGIQVVRDPGRPCVFAGFAMVGLGAVLACARRLRR
KQ
>GSU1389 CRISPR-associated protein, CT1974 family
MYLSKVLINGTACRNPYEIHRVLWKLFPEDADAERDFLFRVERSGQQSVE
VLLQSRREPTMAASREVLLMGSKPYLLSLQQDQQLRFMLVANPIKTINDE
SARLNSANEIKKCRVPLIREEDLRAWLKRKLEGVAVIEAVEVEKRPAMNF
RKAREKRVGKVQAVSFHGVLSVTDPVGLISLINTGIGPAKAFGCGLLSLA
RT
>GSU2133 lipoprotein, putative
MKRLCLLVMAISLLSGCAESSALIKANSVSLRTDVFEELTNGVTAPQGDV
DVRLTATFKTHKPGIYSAKDIHGTPDYKLLLNIDGQAVLLQGSLQKENNE
PMKLVDPEAGDGIRYRFSKNLRLKAGIHKIVVALPDDEVAIAREITLNEG
DRNSLVVEPIYSTKPGKRRPGMYSTTSFTEGIRSLRVMLNGKEV
>GSU2646 hypothetical protein
MSADKNMYDTVLAYFEQEFAAVRERLKEGRYAGLKERVLVSLKISDALTL
LAPYTRGDQRARKLVKAGEALRTELLSVRDVIEKKRAPRKKRQDMLICQV
LRKRKPLLLV
>GSU0249 membrane protein, putative
MTRLIVFGILAAFFFSSTFVLNRAMSLEGGHWAWTASLRYGYMLLLLCTW
LLLKGRRGVLIAAWDAFRRNWLFWTTAGSVGCGAFYALISFSASFAPGWV
VATTWQITILATPVVLLLFGRRVPARGVLFTAMIFLGIVLVTLEQATAAP
LGDILRGVVPVVAAAFAYPLGNQMVWEARHGTSSLLPAITDPVVDDSFGR
VLLMVLGSIPFWVVLLIVAAPPPPSPGQLVNTALVAVFSGVIATSLFLHA
RHCASTTYELAAIDATQSTEVIFSLLGEIVFLHGALPGSSGTAGLVLVVG
GLMLYVRAQTAGPLKEMVHSRAEKG
>GSU2205 hypothetical protein
MYNDAEARMNGLYFPSFAELGYTDPVLTSAFYGLQNLPRIQEYSDLRYRQ
IDLSAGGTYQFTPNLFMTAQAGFQQFMDDAPYVYGDQDGITYTGSVGIGY
KF
>GSU0716 hypothetical protein
MPKGIALALGLNAVDPKHYGGWAGKLNACEADAEDMAAIAAERGFAVTTL
MTKAATRAKVIDAIGKAAKALGKGDIFMLSYSGHGGQVPDTSNDEPDGVD
ETWCLFDGELIDDELYALLGKFAAGVRVLVFSDSCHSGTVVKMAYYNGTT
AARSAGPDEGEIRYRAMPQSVAMRTYRANREFYDTIQQKTKKVDLADVKA
SILLISGCQDNQLSQDGAFNGAFTGQLLRVWKNGLYKGSYRSFHKAIVRR
MPPDQTPNFFTAGTPDPAFLKQRPFTV
>GSU3137 cytochrome c family protein
MLRLLAGNIALAALAVAATGGTATAATPPAAGSGCVGCHGNPSIMKKLGY
PHFTVTPQEVRAQTGMPADCHLCHGGSPDAKEKDKAHAGMGRLVAVRKKG
LTGETVERKHPLSLGSNPMERIKLMTDKAGTAAPDQSVSIILWQDKRTDT
LSQDFGRMEKSCGACHARQFAEFTRSTMARNGKQSAYRTWTDRERGPHNC
GAWFGDNHEAIAANTAAPFDRATNALNQRQCNTCHVGCLDCHYDPQPADP
ADPKKGMHGFRRTPPPESCYGGGRGQTCHAGPEERRRGAGYFGGVYANPE
GAPPDVHLQAKVACLDCHESSRNGNGLTHAMVKRQAACDRCHGAIVASHG
TSVHRTLTCEACHVRNVGGYQGTYWGPGILGGIGTPFFKYKEYYGIMDDP
ILIRNQKGRWIPVKPYPMAVMNVKDAGLKPGLHWRWPATLPDLERTDDAY
GYVGLVGGLPENNKALLWIQMDKVSHKYGPPRPCDSCHGAADGAQRREVD
WDFTDAGALPFSGSHTVVADARGLAIHNISTQERIETTAGTTVSSFAPWF
YLKNAWTIPGDFSLPTIGDRTRYERFKDNRDVARNAGIIHR
>GSU2108 hypothetical protein
MNRIADVFERPIDRTIEEVIKVDQANEATVQNELEEYIATDSIKDQFAEV
YREIAAGPSTPREGIGVWVSGFFGSGKSSFAKILGYTVAQQKVGATTATE
LFKKRLRDARVVDLLDLVTMRIPFHAVIFDVSMDRGVRATNDRLTEIMYR
ALLRELGYAEDFDLAELEITLEGDGRLEAFERKFLELHGSEWKKRRQLGL
AINEAGAALHDLDPRTYPTADSYAQGIGAGRADVDPNKLARRAFELCARR
HPGKALIFIIDEVGQYVSRSVDKMLDLQAIIQAFGVEGKNRTERQEAVSP
FWIVVTSQEKLNEVVTALDSKKIELARLQDRFRITVDLKQSDITEVTSER
VLKKKAGAVDLIGKRFDADEPRIKQCCTLERTHRNLEINRSSFVRLYPYL
PYQIDLCIDIVAGLRLKRGAHRHVGGSNRTIIKQAQQMMINDRTRLAEQP
IGTLVTLDKVYELLEVGNLLPTEVSREVANVAQRLPGNPLAHKVVKAIAL
LESVKDLPRTPHNIAVVLHPSIEANPLTKDVVAALKELEEAQFIRQSEEG
YKLLTVQEKNWETQRNGRDPREADRNRIHRELIREIFSEPKLRTYRYKDL
RGFRTSLLVNGEWVESEGEIPLNILLTAREEHQDTLVDAREASVSKTTEL
FWVATLTDEINSLVTELFRSREMIGEYDRLAAQQRLTVEETGCLADEKSR
RDRTQRNLRSKLLACIEGGNAFFSGVSYDAPVLGSSLSESLGTLLERAVP
VLYPKLEVGVLRLNGDEPVKFLTSANLNGLPQIFYHDKAERSLVIKQADR
FVPNLGCELCRELLDYLKREHAYGNRITGKMLESHFSGLGYAWERESIRL
GLAILFRGGAIEVTHQGRKYRNYTEPTSRVPFVNNPAFRSASFSPREALD
LKVLANAARMYEEISGRDVDIEEGAIAQAFKQIVVSDREKLLPLAARLGA
LSLPGAKIVQEQLNWVEGILEMPADDCVKTLAGDGKAYLEGRRQSFALDK
AATDENIQAIARARRVLMEQWPVLVLNNDDPELKDAAEKLGETLQSEDAL
ARIDNLRFYSESLSNAYRTLYENLFEKRKAAYTAALDQIKGRPEWLAVAE
DPDIAQEQLDLILQPLTLKAEAVLDLPHGATVCQRTGATLAQLESDLAAV
EAVGRDVLRRILELAAPEEKIERVAVARLYPGRITSKDELEDFITNLRER
LSKVLAQGGTIILE
>GSU0053 hypothetical protein
MNDLVQKYDHWLENSGPAALVIREQLMPVEGRDGVLFPATFADTGYNIDK
FDDGGNVCLIDSVGSQANRIEPIFMTKDYAGLVPQIVVQAGNKKVNLLEA
GHRAGDAIIRCSELQQTLRAAFNNVLNGNAEPLARIAPTSLVFGVWDSRD
TQAKLPRLVASTIRAYNVRPLTRSAQYVPAVDYNAEGLLEEPGDLRDAEG
KVKSKHPFAQRGFVHVPATGALGGVIATGGIRRDATLHLAALRLLSAGQD
EAKSKALRRYILSLALTAFTVPVTGYLRQGCNLVLDPENPLEFKEVFNDG
TRNDVGITHTEAIVYAKAVAKEFGIDPERNLDEKKAPDREVPFDKVLAKK
DVSDAGGSKKKAK
>GSU2469 hypothetical protein
MKDRDYTAPGIVMGVIFYITILSSAAPTLKMIDIVAPIATLLAALVGAEA
AFRLQREKTEKTEMTRRKSAANQTVHLISNIYSNLRQYRREIIEPTETLP
VPWLGLRGLLAYHYEVNLPREGLLFMFEGEGDGAQIYAELIMEEQRYNLA
IGMIKQLHETIKNKLQPELERLGYYPGNVDIQQLLSQLPPSLIIDLNSLT
HETYKIVRENIESIQEVYEKFQQSMKSQFPNIKVARLIFDDGMPNQAETP
A
>GSU1994 hypothetical protein
MKKTLSALLLLAILVALPMLGNATPYSPTSSLIQNFGYISENPVTAGTKL
FDVQALENGAKFIGNIYPTTSGSWAEIRLGTTGAFDLSSYDSFMLQIGNF
NENPWAYSLYITGQTNGVDYLVQSAWSTINNGSTGTLKLDFTGLNVDLSN
VKGLGFNIGAIVPLPGQDYTFETVAAPVPEPGTIMLLGAGLVGVGLYIRR
KRA
>GSU3274 cytochrome c family protein, putative
MGLLVAATAWGGEVTYRKDIKPIFDVRCAGCHGADAAPEYHAFKAEKEKW
LAKGQGMRMDTYSHLIFYTAWPDTGALMRRLDDGKNSKDAKPGNMYRHLG
ATEEERQRNLAVFKAWVGVWNLKKWPDITKEELNAITVTY
>GSU2917 hypothetical protein
MIVIVMGIMLGIVGQSWQMVMKREREEELLFRGNQYRIALEQWHTPTRSG
GGAPRPLNDLKDLLKDPQSAARQKYLRRLYKDPITGKDFATIVQPGKGIV
GVMSTSEQEPLKKTNFPDELATFENQDQYKKWVFTIVAPAGQQNQGARTQ
ATTSSQQLIQ
>GSU0620 hypothetical protein
MVLRKVVRKSVGAPDCMRTSSLTEKLSEVREKGRQAALTVVTERTYSSIS
LLRRWNEMAEPDTVLVPLLNPLFTVIAELLKLRGLVPLAADRSSSTSKRQ
VGDAAYATRPRTFWLR
>GSU1407 hypothetical protein
MSSSTDTFSSVCSIKRARPAAMGYWLQVGQNRILTFIVRPPVLR
>GSU3275 hypothetical protein
MAESSPDFLFIQNHLSGLSGEILARHLISGQTDGRPTIVLFGDATECTPV
PGVIDSCLDLTQADVVLSAAIIGLITGAMHRQDVPAVTEAGAETEEAATP
SQATQVADDVPPTPEPTEPSPFDQKLQSVLEETLPPVPLAEIEEHVALGR
QAEPAHEDAPQEQPARSRPGALRSTVRPFAIGIAVLIVAAAVIYLAVSLG
TNSTTHPTQTPVAEVKPTPSGTAQQKPASQPPAPAPETLPPAIAPAQTPP
AGMTELPPFIPRQKPDRAYGEKNPGWERYLGARTEFKVYREAGIIKAIQS
IDRSGVGIPESFMRGALEQMTRVRQFSINGKETKGSFHVEKGTTTAGARI
LIYRTAPKGSIRAFVVYFP
>GSU1645 hypothetical protein
MFFCSLEELFKKYLIEKLTWVEVVKSFVAARG
>GSU0324 general secretion pathway protein I, putative
MRTIGARGFTLLEVMIAVAIIAGVVFTVIGVVNNHLAVAGRDRDEGVAVL
LGRQMLDDLDTRQDIPEKSNGTFAPVRPDYAWVMTSTATQVPGFRKVTVA
ISWDNARRTVSLVRYVAK
>GSU1430 hypothetical protein
MWKKLNGAILKRLRLRESWVIFFVLGIIMMNYPFTQIFNKPWRIFNIPQL
YLYFQAGWVVSIFVIYLFTKAIDLPDDDHDEGDHR
>GSU2585 hypothetical protein
MEFFVCPCPETGTVILDGVDQGPNKDDAGVLQAKQCNPGRHTVALRCPAG
KTCVPLKRTVVIKDTDPVAPQEVAFQCV
>GSU1667 hypothetical protein
MYLFQILLPLYDNDGTNFPQEEFVRVRDELTERYGGITTYVRSPAKGLWK
DSATTTVHDDIVIYEVMAEHLDRPWWHRFREELTERFRQDVLIVRVSEVQ
LL
>GSU3248 hypothetical protein
METFTPGQQKDFKVLIDFVRVYCHANHDPAAREPFAVPPELERRYRKGVR
LCPDCAALLYHGIAKRRKCPLDPKPSCKHCRIHCYSREYRAKIREVMAFS
GRRMIMRGRLDYLWHYFF
>GSU3258 hypothetical protein
MGDGKDDLESLSREIRKIIDTNRQFLDRIMDDDFEADGDEAGEPHGEAEE
EFEEL
>GSU3170 hypothetical protein
MTISATLRRALLFLLITPIVTLLACTSSEGTQKKSTQPSTDKGVPRMTTY
PIGRFAIDVPAEMKLVHQWQRLRYAEIEEFAWSGSVPRERARDEAWKKRL
TEIGKLTPPKGKERVIIETREFPGVGAWARGILFYGNRMSPKDGNWLILM
DVGQTAAWLELKGLVQYEQDMLHDLTEIAKSYQARRFGDLKLPPGNWFYT
ERGAINLPYLEQETAAARFEGHPLGLEIDIETTETHKVEEAGLIKKTMAA
MATGFATGLDIDKIRSRKKTVAGLDGEEEVLKMNDGNKTVLNFAWEYRGR
KDSGEYPEIRITMDSADGKLEQKLQVWDAILNSLKPMYGTGR
>GSU2512 hypothetical protein
MRQSIAHTLAGTAARSTLAVMIAVVLGFGPDAALAKRGGRDDNRTEFYGI
VQERPEQGLHGTWVIDGRTVATDSRTEFDQSEGRLDVGSCAKVHIRNGRV
HEIDSEPMGHCR
>GSU0077 hypothetical protein
MQSGWPVLGFFRQNRTRGRDAMKEQDRTAVCNRTEHKEENVRVRNVKTGE
IGFTFGCTEEGGTIQVRLTNGQLDSWVAADCEEAPE
>GSU1100 hypothetical protein
MWKEDVIDLKLTKFFEVSARVLLSLLITAILLAILGGIIRTFYDMRLVVS
HDFHEAFRTILVDVLTVLAVVEVLRTALAYFSEGRVKVTYIIDTVLVTVL
TEIMAFWYRDMSWDKVAMVIALVLALAGVRVVAVRFSPRRIREEL
>GSU0960 hypothetical protein
MKRAHRWCEAQLIPVSGEGQNIQKLKRIDSVM
>GSU0565 hypothetical protein
MAWGLLGMRGAVGSVTINGAAGDLVQVYRAD
>GSU2897 hypothetical protein
MHLRVRKGRDQMKKRSYVKPRIVGSANVHPC
>GSU2249 hypothetical protein
MILVCEPVCHGFEHSSFNAAMLLVISHACPEEEIVFLAERDHLGFVGAET
LVSSRKAIQFEPIDIPKRKSIKAERFELEHRLVRMLFERAERSGASRVVF
TCISNETLRAVKLQFPAFPGVKCLMVMHSILSSVAERPSLLPWKNRDTFR
NWLLRGNCDRLTYIVLGDSIKRELLDRVPSLAPYVTSIDHPYHFEPAVKY
SPFAGNSITFGALGVLRKVKGSHEFLRLAHEIQKTRTSFCSRFVCIGPII
DRKLRGLLSDDVELPSPDAPLSPQDYALHVRSIDYAIFFHKPETYTLTAS
GVLFDAFSYLKPVIALRNPFFESYFERMGNIGYLCDNYGELKRTVLSLLD
SPPYEEYESQRQNILKGRADMEIPALAQRLRSILSPSRCE
>GSU1135 hypothetical protein
MDQFQQKTSPFESAEVLAVFTDNGLQEFSEMRRG
>GSU1366 hypothetical protein
MTNALLLKSIPYEEFDYQTLLDAVHGYARPRMKEARLGSDQANKSKL
>GSU1212 conserved hypothetical protein
MKQFAYHQGARGRLALAAVIAAVAMFLSLIASPSTAATVRQKTFASPDEA
VKALVAAVTAHDEKELLQILGPGSKELVSSGDEVADKTDRERFLNSFEQA
NRLERKSATTAVVHVGPDQWTMPIPIVKKGSRWIFDVRKGKSEILKRRIG
RNELHVIDVLDAYVDAQLEYASKDCRGGGKVEFAQRLFSTEGNRDGLYWE
AKEGELQSPLGPLVAQAAKEGYAPERNLSPFHGYYFKILKGQGKHAVGGA
YHYVIKDKMLLGFALVAYPAEYGNSGVMTFIVNQAGKVYEKNLGKDTRRR
AESMELFDPDPTWKPVKTEQTPARK
>GSU0988 conserved hypothetical protein
MPIVPIPDLDDKRYADLVREAVASLAVRAPAWTDHNASDPGITLVELFAW
LAEMQIYGLNCLTPDHYRTFLRLVGIRPVPAVPATVALTLSSTAGRELMI
PRGTRLTAETGGEPLPFETTADLFLLNNRIAAVISAWGGGFRNVTDANAR
DAFFFPAFGEQPAPGGTLLIAFERMLPAGREVRLAIDLYEADLPPVGAHR
GEPAAVVPPVDLVWEYSTEAGYRRLDLLEDGTTGLTRSGQILFSVPADMT
ATTSPYLPATVPAPRFPMIRCRLPEESTAVPLPSEGLSACRRAAARQGTG
FYEIPPRIDSIRLNTVTARQAVSVVDENLVSSSGTDWGNGMPGQVVSLAR
RPAMEGSLRVRVLTGTDWEEWAERDNLDASGPTDRHFVLDPAVGTILFGD
GRNGRVLPEGARVRADYLSGGGAAGNLRPLASWRFDDPLLADLAARNDAS
ASGGRDAEPLEEAIARAPLELREVDRAITSDDFEYLALNTPGLRVARARA
LPLWEPEAPEDRPVPATVTVVVVPWSFTPRPYPGPRFLRAVCDHLDRHRL
VTTRVRVIPPLYAQVTVRTRVSAGEGVRPEELRARVAERLLEFLHPLKGG
EDGTGWPFGRGVYRSEIIAAIREVSGVECVLETTLSGDTCTRVDGEGNLM
IDRDALVYSERHEVDVTARSGRCTVTY
>GSU0467 hypothetical protein
MLWRGRLLLPLFFACALLCLSWTAALASERRDVSVEGRWGAVGLGVDFAT
GDYGSGTRTDFISVPLIVDLYPTERLDLELVIPWVYQTNGDNAYGTVMPY
RQGYARGAATAAGTRFQAGAGGNSGSTAGGSTGSANGLGDITLTAGYIII
PEGTVAPRVRPLLYLKFPTADEDKGLGTGKFDAGGGLSLDKWLGDWRPFG
EAVWIVQGSSDLYATKDYLSYEAGLGRQFTPSLYAAILGRGATAPAEDSD
APFEGRLKGVYAFDSMALEGYVAAGLSDASPDFAAGLAVFYDF
>GSU2152 hypothetical protein
MNAPAQIIPISPTASSLTHHAVNISELKAQVNLIQYVMRDVMKSGEHYGT
IPGCGDKAVLLKPGAEKLMLTFRLANDVEVETIDMHLGHREYRIKVTLYS
PAGQRLGTGVGSCSTMESKYRFRVGPVELTNKPVPKEYWDIRKENPAKAQ
EIIGGRGFSAKKDDSGNWKIARQGEKVEHDNPADFYNTCLKMAKKRGLVD
AVLTSTAASDIFTQDIEEDPDLYGQTEHNGGNSAGNTSTTSGTSQTAKPD
SAPSSNEAEPVSTSEIIGYLTAKGIRMELSADESEISAFPDFNDTSARQW
LKDHGFKWDGKGKCWQYR
>GSU2126 hypothetical protein
MADNKQESSGKKPSGKQSWQDRKLKKVVRDDIDVISRKTLDQSDFITMLN
THDNVLHRLRMTMGRNKNITFEKAQEFIERSQKIREEIWRPRFSWTRK
>GSU2114 hypothetical protein
MKKEVIKALQLLEKVAMEVGDGSIERYFNGFKNGSGTLYEYVLDILEDQD
NVSEREALMALFGVHSAINHDPQAMAFFDRVQCDCCTKGMGYPKYLQHLL
EKGGVVSTRVLSGNQMKHFQGIETRER
>GSU2417 hypothetical protein
MSDLMTTRAALQETMTFLGAIASGMEEAIGESANSITYLAGKRLGMQFSA
DIRKTDDVEEALAAVRTVLQDNNCLWHFEPFQAHDRPALIQATDKGDEDI
HLVFRDCMIRQSLFRFGHHQKGSLCTMMFGFFSGALLNVMGVDSTLEIMH
AGENACLKRLVIHKKREEKP
>GSU2155 hypothetical protein
MKIYRTGTLPPDLKSFTRENRFEDLKEALQTIASLILIAVMLFVFMFAAA
TPPEPQVYPKGWEEVRG
>GSU1563 hypothetical protein
MGPSGCSGSIHGFELLSLSDMNLFRGYPIFS
>GSU1092 GH3 auxin-responsive promoter family protein
MIRSSIDLMLKTGAAALAKNFENRDPIAGQREILARLVVAAAVTRFGRDH
GFAELAGEPFDRLYAEFRRRVPIRTYADFWRDYFSAWQSDANGSQRLLLE
DVTWPGRIPFFCETSGTTAPTKYIPFSREMFAANKRAAIDLTACYLHRTP
RSRLFQGKLLYMAGNTSLTDLGNGVQSGDMSGITLRHRPFYLRPFVAPGM
AVSALPWEEKVVELAGLLLSDRSIRGISGVPPWIILLLQRVEEMGHRPIA
ELLPNLELIIHGGTSMKPYRREFDGLFPRRRPHFLEVLPSSEAFMAFQLP
GEDRMRLVPHYGAFFEFIPIEDLDEGGTPAADAPTVPLEAVETGRRYAVI
LTTCAGLWRYHIGDTLRFTALSPHFIEFTGRDRFLDRFEEKVTQGEVEEA
VARLNQLDGIEVREFMVGPDIGERRHLWVLAVGEMNERSGDVLGRHLDAT
LRSLNADYATFRSQGRIAGPRVVTVSGDVIYRWSKEVRGKLGGQSKIPHV
DPTVEGEMVASLAAFARRFQACWIADSFQGFGPTGRVKVQNSSNFTGSRS
LPERTRPSSSVV
>GSU3012 hypothetical protein
MRDSKGFSPFSLLPSTVGVEAPERLKYSIVQRLVAERAVKVNAHDGRSLR
GAIH
>GSU1449 hypothetical protein
MPGCRLGQAWREREILMVKDKGIALAVVAMGLYVLGGVLTFAGLYLIGFM
EGRDFFGWGDGRTLGYLFFCVGISLSILGVLLMRIFRNRGFS
>GSU1719 hypothetical protein
MSAELLEGQMPLFDVPAPAEKSEAPAAEPAVKEARKSTKKKAAPEPVTTA
RRAKEKTKKAPAKGSAKAVGPRGGRGLSGLVPEGDVRLTANMREDIHLKL
KIAAARQRTTIGEILEELVEKYIK
>GSU2589 hypothetical protein
MLPNNPPHISYDLKRLSGFSMESRYGNVPAPS
>GSU2332 hypothetical protein
MPENLIIPSVDLRIGALEEYNRRQKEKAHHRQKGPKPRPCITLSREFGCE
GYPVAESLLKLMPEQTGDQWVLVDKDVLDEVARRHHVSREILEALGTKNR
ILSQVLATFSERWKTDHEYFELLCSHIVSLAEQGNVIIVGLGGAIITHHR
EHCYDFRIFGSREFKIGSICQRMNLPPDKAADLIEKQQAQIDNFTREFLF
EDEANPAIYDLLFNNDRMCPETMARTIADYVAGRIAAGTADKRHV
>GSU2672 hypothetical protein
MKAYSAGLFLLLACALLPACAGLQIGGGGDSPSALAMTSHGQVRQTIDRL
IAAYEAKDSRGFSELVSERYTGEAAILETAVRRDFSANHNLAIRYTVNNI
TLDDGGKAFAAITFTRSWTDVKTARTMNETRETSLVFIRENGTYRLYSQN
GPPLFGLH
>GSU2299 cytochrome c family protein
MRMKTGAVIAASAITMAAISAFAGADHAEFVKGPFKTGQDVTKACLECHS
EAGDQVLSSSHWKWKGPPKLVKGLEKSKVEYGKANLINNFCISIEGGDRC
DNQEFCAKCHPSYGWTDNTFDFTNKNNIDCLVCHTSDRQYKKGLAGQPDP
KAIAAGKLDLEKAAQKVGKPGLANCGVCHFYGGGGDAVKHGDLDSTMEKP
KRDHDVHMGTTDSGGLGMTCQQCHKTSHHQIAGASTFLATYSERVACEDC
HTGANAPHQKSKNGALINRHLATVACQTCHIPVFAKGQATKMSWDWSDVG
KDIEAGEQFDKETFMKHKGTFIWGKDVVPTYTWYNGTIERYLKGQKIKDP
SKVVNISRPLGDITDKKAKIFPFKVHTGKQPMDSLSNCLLIPQTHNALWK
DYDWENALKKGAKGSQVPYSGKYQFVKTAFYGSINHEVATRDKALQCGEC
HMGGTRMNWKALGYKGDPMQTGGRFAKTAAKK
>GSU1385 conserved hypothetical protein
MNLLTEAWIPVRPLPAGAACKLTLRQLLCEDGRWELCLPRDDMELAALQL
LICMTQVFFPPQSTAALKQRIVAPLSLDEFESGCAPYREWFRLDHPDFPF
MQVRNVSAKDPTPMDKLLAGLTGATNSCFVNEPGLGACLCGGCAAIALFN
QSSNAPSFGGGFKGSLRGGAPVTTLVQGEHLRQTVWLNVLSEERLTQIIP
WHHATRSQQPTWVAPIKAGETIPSQAIGVLRGLFWQPGHLELMPPQGEQA
CICCGFVEKQVYAGFNKAKFNYTIDGLWPHPHGPRISSVKKGEREEKFVS
FTTAAPAWTHLSRFVVQRQTTNSKEEGQEPAAVVRQAQGLLNKLRLAVGG
YRNNQASILERRHELIFLNDGWSGHTEVVHALVGQAVGYRDALNKALFVF
STGFKEIKGAGVKVHELAKEQFYRRSEDTVLDTLARIDFEQAALALLAMG
DGLHGIVVGLFNESVAPYRNDPELIRTTAVARKTLYKHLNALKPQRDGRR
DDGTNA
>GSU2983 hypothetical protein
MSEVQYCYVTVDRLDCADILPCFKAGRSYMGTRRTTIAIASLQLLLVLFV
QVVPLIHAVAGAQAGFVKKAGCHCTDGECCAAMAGMAQECRCEGSCAANR
HKKEPDCCSTGEEVPVISCGCCNRDPLSPAAVPLIKWECIPPLFILEAGV
TRCTVNLSSYSGMLTAFRPEPSIPPPEA
>GSU2163 hypothetical protein
MRYPIEQVSDNWERRIINNGYVQHREVYRNGSRGEWQFYISGFGPTVEGG
TGRCTVLKEGGSYDHFVPIDANNRIKINGRWYDRRYWDH
>GSU2893 hypothetical protein
MKDYSLESDFVRELDRDILSYVEKGLENRDEDGFNDLALRCFELQFNTVE
PYRRFCLDKGRSPGKVERWEDIPAVPSMAFKKFVMTSFPAERAEQRYFTS
GTTDPLNKGKILRDPAGVTLINAANGLLTREYVFPDVDRMRMFLMVPSPD
IAPGMGMAVGLDVVRRMFGSPDSRYLIDRRGLDLAFLLSALMEAERTGEP
IVIIGSTAGFVYFMNACERDGVRFRLPPGSRLCDGGGYLAQFGECSREEF
YLKSAEILQVDEHHCVNVLGMGEVSTNFFDNVLKDHLAGRPLARAKVVPP
WTRTRVVDVETLEPVPDGDPGLLRHYDLVNRGMVVAVQTDNVGFMTPGGF
EIIGRWKKTSWELETEAIKQAHGPRFMTPIIEFLLKRSLKKVGKLHDKIT
RTNAGR
>GSU1418 hypothetical protein
MDKRFLLKALYETGTNALLSGGDFRLYVLFLAAAGTNGRGLLPYSVIEEA
LGPIHPPGRLTFMCRRLEELGLIQLHGPPVATIGVGYRLKEPVPVIHNAP
MESAPSAGNGDPHETQ
>GSU0594 cytochrome c family protein
MRTLRKTAIVLSLLAILPAAALAEDKTADACLSCHSSKDIGKSGSHLYID
AAKYSRTTHAIIGCTSCHDSVTASHPDDGIRPSRAACRECHSPIAKEYTA
SLHANNAGCADCHDPHTVKPPALTSGRDMNAMCAKCHETAKMIEVHSKWL
PQADLHIDALPCITCHTGSENYVITLYIQKRAGEKPQSDFKTATYEDLSR
TLPGDRDVKAIIDTNNDNYISLAELRSFNKSARHDGMRLWGMMTPEKVTH
TYQILDNRWDCTFCHASGPRAMQTSFVAFPDKNGTYSRVAVEKGAVLDLL
YGTPDFYMMGSTRSMALSIIGGLIVSGGIAFAGIHGTFRFLTRKNRKEH
>GSU1084 hypothetical protein
MTIACPQCGATAEARTDTRFHGCAFCGSSFMVERGVGIAEYHLDHERDDR
VAWSAVASTLERDGAGQAVERTGCEYLVAPFWLSRPESGGNRLVPALRHP
WLQGPLPGLPGGDLLFVPSGDDFILPDIPADEALGEGADRGGTSLSLVYL
PLYLLSFRLAGHDCRALVSAVDRRVALLTPLPARSESVPFRHAALVAGFA
ITLTASGLAVKSHFLRAAVFAAVLAVAWPVCTALLRRGE
>GSU0824 conserved hypothetical protein
MTELDKALTVLRQDMTDAKSQSAFYDLFLNATFYVPILDEAAEVEDNAAQ
KGEVLPLVIEAEGNDYLMLFDTKERLQEWTQADARYVEVPGYVLAATTMP
PLHWALNAGTEYSKQFLPDEIAWLRDVVERCNAAAAKQQG
>GSU3462 lipoprotein, putative
MKIRILKIIAAVLLPTLILTACTSYRSQYVGFRPAEDYINSQVVNGVTIG
GEAFADTASATDAFGFDVIGTGVMPVQVVMSNKGPRSLEIVSSQTFLVND
QNRYFPVIPNTTAVDRIEKSTQFASFFGKGAGKGALLGAAGGAILGAAIG
IVSGHSVAEAVGKGAAVGAAGGAIAGGVSEGTSGERERTIIEDIRNKGLE
GKALPADSIASGFLFFPAEAGSAKELRLQLRERETGVTSNVVLRFK
>GSU0995 hypothetical protein
MYTVYGGILLKNSQFVEQESAGGENAEFFHETIKCFFGRVDKDSQGSETP
SVKQVAAGCGCSGPIS
>GSU2542 hypothetical protein
MSSTDPPKKHPCPDCRFCQWCGDDRCSLCLRGCKKGKKLSLAEQVELFEK
VNRKASATEDTE
>GSU1413 hypothetical protein
MIDLSCSVSAAVAMAVTPAGGWHDFLEGAASEQGNVFPHRGRL
>GSU1133 hypothetical protein
MDTELFSDLERRLDMLLERYGALKRENEQLRADNARLQQEREGVKGRIDG
ILRKLEGI
>GSU2892 hypothetical protein
MRAADGSGERISDRMLGTGEAMCTCAALVADMIIDAGQEGASVAALLEKA
KREGAG
>GSU3386 lipoprotein, putative
MKFLNSFVPSRVFPPQVCFVVVAVLLGLSGCADTDKPVPQASSPKDAAVK
TETYSSAVLPAPAPAFTPASSARAATPAVKYTAGEDPDFARQCGWPVKCP
PTLPGAILPSKRIVAYYGNPLSKRMGALGAYPKDEMLQRLKSEVAAWQKA
DPSLPVQPALHLIAVVAQGEPGKDGKYRMVMPDAIVNQVYGWAREANAIL
FIDIQTGHDDIRTVLPRFEWILKNPDVHLGIDPEFNLISSKAKPGKKIGT
YDAADINYASGYLKDLVTKYNLPPKVLTVHRFTRNGVTNSKKIVLRPEVQ
IVMHMDGWGAPWLKRDSYKDYIVSEPVQYTGFKLFYHNDTKKGHPLLTPR
EVLRLHPQPIYIQYQ
>GSU0953 hypothetical protein
MEAAQLLKQTRRVTNCFILIAAFLVSTGCVNYVRIDGPYEGKVIDAKTGQ
PIEGAVVFGEWSKAHPGAGGASHTYYDSREVLTDGNGEFSIPGLGLLVLT
MIEEMDVIIFKAGYEQITPNPWSGLKNVWPKDKVIWQGDKATFRLKRLSM
KERRKRHVSFPSCAVEHRGKMRNLIRESNIEMREMGMPANMLLPEE
>GSU0395 conserved hypothetical protein
MFWLKKKDNAPADSLAEQVGLDAEGFYRTGRMHCAEAVLMAMRNAYSPEM
PQDVVRVAAGFGGGSNAGCLCGAVSGGTMAFGLAVKEDKRRVNRLTRQLH
DWFKHEYGATCCRIATKNAKGKGGCAVLTGEVAKKAAELLEE
>GSU0438 lipoprotein, putative
MMFRLGWGILVCLFLALAAGCYHLRLEPAALPVQRAVTARSAADVFDRVR
DVLVQDGYAVERDERRDNGEGGVITCGYRHFSTRAGGISQPVGGRLYYHR
LMITVNGGEEGAEILMESTGLEIRSSYVYEEGGAVRSFAKRYPYEQYPGM
FDLKAVDRELARVRGMLEAALRQGER
>GSU2468 hypothetical protein
MKTRLTLKPGQHGTKSLIKKYGDALVCVRFRYDPETKQRLKTVELIVERS
DWTPPPPRFTNDTLVPLRVKATDVAIRNQVKAVGGRWNPEKKLWFVTYGN
IVGTPLEKHIDVDGFG
>GSU3227 hypothetical protein
MKRNRRHPAAGPVLISGAALIVGLASGSATAAYNAAHKDIVLKNAAGSAI
KATDTVNAFSIKNTCFGTAGCHGDTNAGGTLRFSYDDIERHSYHAQLGAN
EIRGWNPWNPDSADAFRKGAGPVGKNWVQSPGHVGSW
>GSU2573 hypothetical protein
MEEQRLSILIGRFTAAATAHGEALEAMDAGRADRHATMLARLYREIAAFG
AAGRAGLLDAAERGTGAAAGMAAVYSLSHDAPRSLAILRRLAREPGLLGF
RASVAIERWEKGELTLS
>GSU2427 hypothetical protein
MSKKPEVTFKYIFTYDYNPVYVNGAHGGISPRGDLVVNFYLERPPLPNEV
THELTPSGTIGAETSVEPEDFAQSLVRFVPTGVVLNYHTARELHHWLGEK
VRELDALEQHKAALRAAAQQPGDEDVQH
>GSU0984 hypothetical protein
MGREAAGLPLRKNPSVTGGGHGAVIAGASAFPA
>GSU1337 hypothetical protein
MSPLSITPGHKKTSTSGSVSSSGKGLACRKQRPHARAAMPSWRTMGRPLW
DRWLLPFTERGNYTQPVRGMSIMKLPGSSPPAAGSFLEFGQCFALFCKFV
HISGRVAMNIRTMTAVAAVLSVAAVAGCGGGGGQSAAPQTTISGTVADGY
LFKAEVFLDRNGNYQQDSGEPATTTDVNGRFSLAITPGEEALYPVVARAV
GSNTYDMDAPAQPLTQGYILTAPRGMHGFISPISTLLREKYETGNYASVS
DAMHEIRQQLGLTVLNPASAVLSTDDFVALSGSTNTTTHTEYQRVRTVAG
VIADLMAGQISSDAAVNVNRYRYIMGRINLSLPQLCFEATQSANPDVANM
RTRYIGTIPGTFVNMTGMFRNMTTTTFWNLSGSTMVPRNRGGMMR
>GSU3358 hypothetical protein
MSGRPISAGDIIEARCTKCRAVYNHTIVAMIGTRVVRVQCNTCGGQHNYH
SPQEEKKTVERSSARRPAGEKSRRASAVAGPSEWEHAMEGRDSASAVPYA
MDASFRVDALVNHPTFGIGIVTAVIKPNKVEILFRDGKKLLRCSL
>GSU1062 cytochrome c, putative
MAVNWWTLRILVAVAASLWLIADASAMSAFARKYGMQCDGCHSRIPKLNE
FGTAFLKNGFAIPAGAHPPVSAARKPESEIEPMASTPVPRESREPAASAA
PAGSPEGVVASAPSAEPLPEQTATEPPPPPPPMVLYKLKARDGSAYYTDN
PRAAADLQEPAAAPARKAAGRTHGKAVPEAKAPARRRSPVAPDAAPPLFR
SFSECMEHQLVDTPPPGSAGEMMDLLTEAERVCAGYPAVKR
>GSU0753 lipoprotein, putative
MKRTRYLVGIILTAIALAGCGDDKNDTGTAGSEYVFPAGTATLAFTAMST
AQLSVPVSGIELTLALPAGMGVTTTTGGTGQIVNSTLTPGSALSGTNLAY
GNYSASTRKAYLSMATTSSSYRSGEFLRLACTVDTGTSITLSGLRALNTP
VTLVRAVGYDALTNSTVDLTGKLAVTLDAVR
>GSU2514 hypothetical protein
MAGLYRLVVTSLPSPAGAATAQKKPGRRPIVV
>GSU2971 hypothetical protein
MADSKHTETAAILAFFAGAALAAGAALLLTPKTGREVREKLGDVTDDAVC
KLRSAVREAKYKVAPKSKDDTGEYEGGGCWI
>GSU2972 hypothetical protein
MRGQIQQPPPSYSPVSSLDLGATLYFASRTADRSLHTASSVTSPSFSRTS
RPVLGVSRRAAPAARAAPAKKARIAAVSVCLLSAMLSLLVVDWMFLYT
>GSU0840 hypothetical protein
MDVLFYAPERVGEDIRKTVSGVIPGLEVYRSMAALAERFRRILEQPAITV
LVVRDRDELERLLTMRTAFRDTKTVLVLPDRRAETVSRGHCLEPRFMTYL
DQNPAEVVAVLARMASPGNGPGNPSRRQASLP
>GSU1412 hypothetical protein
MNWAELYQATTDKPLGEFVAALAKAGSRRGFVIHNEEKMEMAHTFGAHGV
MVGEGFDLHMIQVCKPKKAAASLQKNPERAALMPKFITTFTRDGRTEVRM
LRYRRPMIEALVDDTEFASSLDESFDAIVAMIEEAR
>GSU2644 hypothetical protein
MTAQLILFAAVLAAPAVCRAVDLGISSDTIVRVYERDDRNGGGRSLLPVY
EFVRLDYGSVTTPGLSLHAYGWGRLDLRDSHEDGDAEGDLLYAYLEYIDP
DRDHQMRLGRQYLFEGVMRESFDGLYARTELVPTVALSAFAGMPVGAGSA
AGRSGDLVYGARIVQGQRGRYDAGISYKYVANDGSRSEELLGADLSLSLP
AGISMFAHSTLNLLTGGWGEHSYEARIPFASMELRPFYQRYRYSDYLNDR
SGASRPFRFQTVLGDRVEIAGAEGFWYPTESVEFGGRYKYLTYDSRFGNA
HMATALATVRRGIFSEAGLEFGRVQGDLTENRYYLGRGYLYGDFAPWFAT
GDVTYVRYDRAIYGENSALFVSAGVGRKMLDRALSLKLSVDYSNDPYFHK
DYRGTLAVTYAFRR
>GSU0722 hypothetical protein
MKHDERDERMIQTVRRELDRSTSDLDGAIAGRLAEIRAKAVAEAGRRHFP
FLPRWVTVGGVATLTAAAVAGIIWFSSPSVEPVPVAVQDDPEQIELIAAN
DHIQLYEDIEFYHWLAAQENQ
>GSU1676 hypothetical protein
MTCATLKAFIAADEQLTDIHFLVQTKARRGDRPAVHYNASRFFDHHEARL
VSHLIELRGDVFDNALSRSLMRLGCLIIVGGTAMLVRNAAAYIAVPVFAL
LLYSEIRLVRRAYLMDSSLKGYISYLGRTRLQRRDDFVRDVVEHSAHIAE
CISR
>GSU0592 cytochrome c family protein
MRFRCLPLLAVCIISLAGFAYGITIKDAVFQTKDAGRVVFSHKAHIGKKG
IENNCKACHDGIFSLRKKVSYTMADMEKGKSCGACHNGKEGVFPLKECAR
CHAVKEITYQVKSTGPTPFSHQKHLAVYDNCNACHPKLFNAGPNKRATMA
DMEKGKSCGACHNGKTAFGLNECATCHMVKEVLLSSPGTGKIIFSHKLHA
GKMKCDQCHNKLYVPGRNKPVGMAAMEKGKSCGACHNGKSVFDVKQCAKC
HPVKEVNYKVKGAGPATFSHALHLSMYTCSDCHNKLYKTGRNTKVVTMHE
MEKGKSCGACHNGKTAFSVREDCVKCHNM
>GSU0754 fibronectin type III domain protein
MHNQTRSATVFTPLQRPISRFPGRCRPLGTLGIVLGVILALMVILGGVAQ
AAITARITMTTYPEFASQNTPPGHITFMRASIPNAGNNYDAAVAWQDITG
NVTNLAVNLGTTPGYFTMGFNDNSYMSASSCTSNVNFSTSSAYMTDGGSF
DVLITCTVPDTTPPTPQSAATNAAGTTVTITFDESIEASDGWASTTDFTV
TVNSTPVAVSGLSFSGASVTLTLATPVGIGQTVKVGFDDSNMGLVDAAWN
QVASFSNLAVTNNSTVVTNAAPTFVGATTALTVNANSGAADIKGLLHVSD
SDGSQTLTWSQNTAPSHGSLSFAGATASSGSADITPGGTITYTPAANYAG
SDSFAVRVSDGTDTATRTITVTINAVAPGAPTIGAATAGDTQVSVAFTAP
ASTGGAAITGYTATASPGGATGTCASSPCTVTGLTNGTAYTFTVAATNSA
GTGSASAASNSATPKAAQTITFTNPGGQNFGTTPTLTATATSNLAVSFSS
ATVGLCTITSEGALTFVATGTCTIEANQAGNSVYAPAPTVSRSFAVNAVA
PGAPTIGSATAGNAQASVTFTAPASTGGATITGYTATSNPGGLTGTCVGS
PCTVTGLTNGTAYTFTVTATNSAGTGSASAASNSVTPKAAQTIAFANPGA
KNFGTTPTLTATATSGLTVSFTSATTEVCTITSGGALSFVTAGICTINAD
QAGDSSYLAATTVSQSFAVNAVVPGAPTSVTASPDDGQATVSFTAPASTG
GAAITGYTVTSTPGSITGTCAASPCLITGLTNGTAYTFTVTATNSAGTSS
ASAASNSVTPTPGPAVISVAVPANGSYGTGSSLDFTVTWDSTATITGTPR
IALTIGGTTVYADYASSPTATTSLFRYVVTAGKNDTDGITIGALTLNGGS
IRNSSGVDATLTLNSVASTTGVLVDTTIPTVSSVTVPGNGTYTAGQNLDF
TVSFSENVTVTGTDSTLGLTVGAAARTATYLAKTATAITYRYPVQSGDLD
SDGITVGGLALNGSTIIDVAGNGANLTLNGVGATAGVLVDAEAPAVTAFT
VAARAAGLTVPIASFTAADGSGSGVAGYLITTSDTPPPAGDPGWSALAPT
TYAVATVGTYTLYPWVKDAAGNVSAVHGAPATVGVYPVLYAVPGGRESGF
CESWANACELRYALANAVNGQDIWAAAGTYTPTAGTDRSATFQLVSGVAL
YGGFAGTETARAERNFRANVTILSGDLGAQGTAGDNSYHVVTGADSATLD
GFTISGGNASGNLQDGYGGGMYNDYINSPAVANCIFTGNYASLGGAMFNL
GSSPTVTNCSFSGNTASSYGGGMYNTDANPVLTNCTFSNNLSYGYGGGMV
NGALSNPTLTNCTFSGNAANISGGGMYNTHGNPSVKNCTFSGNSAGTSGG
GMYNANNSPVVRNSIFWGNSNGEISDNASTPTVSDTVVQGGYPGGSNIMT
ADPLLAPLADNGGVTRTMALLSGSPAIDAGDCSTGPAADQRGMSRPQDAT
CDMGAFERGVPDAVAVEGGAGQSAAVNAAFAAPLRLKVTDSLGAVLDGIN
VAFAGPGSGAGIAAGGPATTDSAGIASFTATANGTVGSYSVTATVDSLSA
GFPLTNIKGDQAISFNPPATATFGDAPITLSATGGASGNPVTFTVASGPG
SLAGTTLTITGTGSIVVTASQAGNANYNAAPDVQRTITVGQAGQTVAFGA
APTVVVNGSGTVSASATSGLTVAFSSTTPAVCSVSGTTVTGLTAGSCTIA
ANQAGNANYNAAPQITQTFAVGKGSQTVSFGAAPSLTVGGTATLSATATS
GLAVTFASATPTVCSVAGSTVTGKALGTCSITASQPGNADYNAAPQAIQG
ITVVYGTTPPMMALSILSDNAVTTDTTQNICGIATDPAGIRSVTVNDEDV
SVNPDGSFSYPVQLVAMANSIRIVVTNNAGISSAVTRTINLDASAPRLTV
ASPDDNAVIWQQHITVSGSVTPLDPTTVVSWSVNGSAPQVAALSGVDYSF
TTNLQEGMNTILITASNAAGQTVETKRTVTRATVFSLAVTDPAADIRTAL
GTYTLTGEVADNTSPVAVTVAMDGRVYTPTVVNGAFRQQLTFSEAKVYQV
AVTGVDQNSNTLTVQRNIIYAQPSATGTTGAGVTIVDALQALRMTVGIIR
PDSSQIARLDVAPMVNGASVGDGRIDITDVIVILRMALGMIH
>GSU2881 hypothetical protein
MKVSVSSELPLSFQFPEQGLAFIPVEIAEPGRPPGNAPAGCSRGGRQFTG
RGAEGQGRPQDNEKEPQEHAGHAGEFHEDDGGPALHAGKSHPVLAEGAGG
PRRKGDPPRFPLVVHHPKDKKGTLAVDSLTLRIHIVFCLELEGVPGEAEI
VVTGLRQALRMIDGETEDLAAAGWIGELYAEGVPRVGGLIHHDSPPIHHG
VHFHGRTMFGRAGRKKAQIRCDLPHLVIGTEGEAHPIEADRLRQLEPSGQ
EPPLLNAPGIGRVEHLPLEHGEGTSLLSEGDEERGGDDRHHAAVEKGIAP
VSPGVLDEAERLARRDGQIATAPEPGRQFFPPELNRSPLINGRRRGKGLP
PGGPALLRGGEGQGNEGKQEDAREPRGGEDMMKPQDPGQGDQRRICPMPG
HHLIGSGEECLGHQGRRHDGRQQERNQGDDCCP
>GSU2322 hypothetical protein
METAVEVRAPAAAAWELLTDTRRWSEWGPSVSGVECPERFIGPGTIGWVR
TVVGLRLPFRITEFEPGRRWCWSVAGIPATGHRVEPLGQERCRVVFEVPM
PAAPYLAVCRVAALRIRSILEGGGGKEGGAAL
>GSU2548 hypothetical protein
MLTDMHETCEQCRFAYARIDTECWGETSSRRVMCPICGWTKYEEQVWNFA
LATTVKRSVMQGYGAYRLIPPGGFSGYNAFHVPPSGEVVTHIRQLLDSGW
KGYLTLWDEEKGKARLLAGHPLQKFEIPAAGDPSP
>GSU2643 cytochrome c family protein
MLLALVVSCAQQARYRLPVKHPPIFELGERREFCTKCHGYRKEPVDFERY
NHTPLFTDSHRMVAYQNQNICAICHEQSFCNDCHASRTELKPSEKSPTET
YRRMQHRGDYLSRHRIDGRLDPSSCFRCHGNPRAAATCRPCHG
>GSU1858 IPT/TIG domain protein
MVLHGRNVGIFRVVLAVVLTVIVATGAWAATLTGNVQNSSGKAGRVYLRA
SNNSFGTSFTIAAGQTLPFSIRGVTSNNMYTVDAFLDVTGLGVQHANDPT
GISSSVQVTSDTTYNVGTITLANPSVPFSGEYSPLATPLAANGAVFLMLD
SDFYYSEQYNREYPVAETFDVERATNGSANPAICASLTNITTVKTGIKNR
DQGAWADSAGQATSCYRVTAHASGQASVTSSWMPVMPKQGSRSVSGTITI
NGVTPSGPLYVAVVDESGDPPKVAAAAISNPSSPQAFTITGVEDGTYFLF
AFLDMNNNGTEDFGDVVFGEFNTTHVVVNGQNVTGVSATMTAVDSLASIM
TNHWTGNSQWGYFDNYSLNFAIDGMRKKPVTVVVTSGPNIAVPIDIGLND
WGFSSWVFINSTRPTPGSQYNLDITYEGDSSPTPTTVTISTVLDAFATNL
APTGNIQFNQNQLFSWAAPSSPPASYTYTVQVSDATNYNQIWYIEDLPST
TTSVTYNQNNEAFGPLEANKTCQWSVSVRDAAGNAAFRQTVFTPVTGPAI
NGFNPAGGNTGTTVYIDGLNFSTTPANNVVTFAGSMGRVAATVTNATASR
LTVTVPSGVTTGTIQVTSSGVGSAESTSSFSVGTAGSFAGLVVNSASTPL
GGVAVSLGGNPSVTTTSQVGTGSFSLGGLPANGTYDLVAEKTGYLPAVSA
VINGGSAITTTAPFLLFTQAEVNSWGVYAGKGVITARVVDQSGNPIGGAV
ASIQSGMGKTYTAYYSSDGITFGGTSTSANNGLIVIPNLEPNDWVTLGAS
KTGWTFYTTSFRVRPFAVGEGAVFGAPSMPSVSGISPQSGKAGSTVTITG
SNFETTQSVTLAGQNASFVVNSATQITVTVPAGASTGAFVVNTLGGDAGS
GLFTVLQTLAVTTTGTGNGTVTSVSPDSRIACQSGSATNCSADFDKGITV
TLTATPDNSGSIFSGWSGACTASAGDCSVTMDADKAVTATFSVTPNLKLI
NGMTETTFASLADAYAAAVTGDTIMARALDFTGPFNFNRNVAILLSGGYD
ASFTPTAGYTKLLGGLTVSLGSVTFSNVALQ
>GSU2688 hypothetical protein
MKPARIALYVLIALLAMMAALYVLTREPHPDWPINLIHLLFKH
>GSU3016 hypothetical protein
MGSWGRGIWRLAVMRGGLVVVVSALLLFGGAAVAMQLPEPGEIERLKESG
EFENRKKFAEEIGNHKIDPELVKRAVNKAKKKALRQQGFTANEADQMAGA
YAPPPAWSGMPTTGSVKLFALLIDFSDYPAVTPQATVHDKLFGAGNPAEA
PYESLAAFYSRSSYGLLDFSGGTTLGWYRPGYSRSAVTQTPAGRENLIKE
ALNYFNTQGHDFSQYDNNSDGKIDYFLVIWTGPAGPWASFWWGYATSFSD
FNYLLDGKRLGKYSWQWEGSPYGQQFGPLVTIHETGHALGLPDYYDYSAT
AGPKGGVGGLDMMDHNWGDHSAFSKWVLDWIDPVVVAGGSQTITLNPSAT
STDAVLIMPGITSSDLFDEYFLVQNRHRAGNDVHYPNDGLLIWHVDATLN
SSGTNYLYNNSDTARKLLRLMEADGLEEIEKNLSYNTADAGDYYVPGASF
GPATVPSSRSYGGLNTGVTVTNITRSGNQVTAAFSIGAAAPSGTVKINGG
AAATNSTTVTLNLTAGGGANPVSQMRFSNDDATWSAWEAYKTTRTGWTIP
ASDGEKTVYAQFRDNLQIESSSYYDTIDLDTVAPVAVVVANSPLAGPTKA
TTFGFGVNGGDVVSYKYKINSGAWSAEIPVKTSMNIVGLVNGSYTLSVVG
KDAAGNWQSTASPTTVSWDVDTIKPVTTASPAGGTYAAAQTVTLSSSETA
TIYYTTNGTTPTTASPQYSGPITVTPTSTLKFFAVDQAGNAEAVKSLVYQ
LPPVAALTGVPAARTKATVAAITVGGATAVAYKFRLDNGTWSAETPIATK
ISLSALAAGSHTVDVIAKNALGAWQTTATTASWTVDLAAPVTTASLAAGI
YKPGQTVSLSASETATVYYTTNGTTPTTASAKYSAPLAINTSTTLKFFAV
DQAGNAEAIKTNIYTITLVSGTPPTTTKLATATLTVAGIGVSAYKYSLDG
GALSAEIPVATKISLTALGTGLHTVSVIGKTAAGVWQVAPTAVSWTVEQT
APTTTASPAGGTYAASQTVTLTADETATIYYTITGLAPTTSSAKYTGPIT
VAPTKVLKFFAVDQAGNAEAVKTEIYALPVTAIIAGTPAAKTKYTTATLT
VGGSGVVAYKRSVDGGAWSAETPVATKIALTGLAAGAHRVDVIAKNAAGG
WQDDQAPTTVQWTVDLTPPVTTASPAGGTYATPQTVNLSADETATIYYTT
NGTTPTTASAKYVGPIPLPATATLKYFAIDPAGNAEAVKTQSYALPPLAV
INGVPVASKATSVTLTIGGVGVVAYKYKLDGGSWSAEIPVATKTAITGLS
EGAHTVSVIGKSVAGTWQATSVASSKSWIVDLTAPTTAASLAAGVYKGGQ
AVTLSPDETATVYYTTNGTTPTTASAKYTAPLTVNATSTLKFFAIDTAGN
VEAVKTFSYVIVTLAGAPGGTTKLTSATLTVGGSGVAAYKFKLDNGAWSA
ETPLATKISLAGLAAGTHTVSVVGRNGSGVWQANADATTASWTVEVAPPV
TSASPAGGTYDAPQVVTLSANETATIYYTLTGAAPTTASSKYAAPVSVAP
GKTLKFFAVDQHGNAEAVKTEIYAPSPLAVLSGTPATFSRATSATLTVGG
TGVVAYRWKLDSGAWSSETPVATKIVLSGLASGSHAVAVLGKNLVGTWQD
VPTAVAWTVDISAPVTTASVPGGTYASAQSVELYSSETATIYYTVNGAAP
TTASPKYTGPIVIGTTQSLKYFAVDQAGNVEGVKAQSFTIKPAAVINGIP
ADPTTATTATFTVGGPYVVSYKYKLDNGVWSVEIPVTQARSLAGLADGTH
MLSVLGKNAGGTWQTVETTAVWTIQ
>GSU0336 hypothetical protein
MNDDKRRLIQTLGLVSSMGISVALAIIIGVLIGRQLDKWLGTHPWFFFIF
LFFGIAAGFRNIYIIAGRAIKKDDGDKDSRE
>GSU0251 hypothetical protein
MRGYSTPTVTVTAEATVREAAGAVRTEPLNLVAETRVEEDCPEYRLSRDL
ESPSQCAIRGRRQGYTWRSPHDGTSAICGTEREAVAGRAAHRNPARACGA
E
>GSU3079 hypothetical protein
MSTAINDLVLVHIDGKPGFYARIEDVSPDVKPGWWRVKLLVLTVPLEIYT
WILEEAQINGTPFTMGGTPIRMEKVVSPLARESVPQPQSLEPPLDEPPDK
GKGAKVVSLLDRKKDK
>GSU3413 lipoprotein, putative
MRCQVGAIAVLIVLALGGCASSSSWTGVYHATVEGVTGTARVSASGDRVR
MEFSSQRKTSVSILRYDRGVAWLLAPTMAVYREISLADLRRDVPLFFDPA
LRVNRTELGKELLDGRDTMKYAAEITQNGTTFRGYLWEAVPPLPVPLRWE
DKRGAVVSWEQVVPAPIPSGQFEIPGGYAEAAAALGPNVMPRAGRHSE
>GSU0600 hypothetical protein
MVTRSTKLVAIVNLARRLRRTAAVPDYLTELL
>GSU1080 hypothetical protein
MTDCWKDRGERGVRSPLFVAVQAAGSYGISVSSGRPLPKRWLS
>GSU2324 hypothetical protein
MNLYELFQQPGLGVMATAAPDGSVNTAVYARPHLIDETTLVWGMTDGRTV
RNIARNPHASFLFKAGTPGFSGVRLALELIRTEEEGEMLAHIKENAETLV
GPGTGNAVTHAAWFRVVEVRPLI
>GSU2103 hypothetical protein
MDTLKRRSAAVLPFDRPPCPLIADAPALPSLALSADEPP
>GSU2577 hypothetical protein
MVLAADVHDRTGRLLLGAGVELERKHIVILMTWGVTEADIAGEDSSGSAS
PLSPEVPPALMEAAEAELRPLFGENDMEHPVMRELLRLAALRKVIHGNP
>GSU2943 hypothetical protein
MKQLSAAALFISGALLLLLPRILAPVCIHASNIDPKLGAFMPCSDAAAMT
YVIGAITMACSCAVMFAPDTATAKGALWLCLTLAGLLFYGYLLTPGVCHA
EGMPCRASTLPAALALGAFQVFFSLLGLFSIGSLDRFRASLRRRT
>GSU3352 hypothetical protein
MPSSPRRVSLPIQSVRTAHRSSIGRGVILLAGRGRAVLRPHALLGAAALR
AAALAAFLAAGAIRRGVAITTATAASLPIPSLLLHAAAATRLLLCRRSRS
GRNLLHCTEWGGGKAENHGSSNRDKNVS
>GSU0673 hypothetical protein
MRIYAVLAAAVVVLWGNSTVGAAPKVPEKLVYELSWTGVPVGTATQEISD
AGDLRTIVSTARSNDWLSVFFPVEDRIVSSYDRHRAPFPGLARHYRLQTR
EGNRRRDREIVFEHERGVARFTDHKTGEKANIPIPVDTIDVYGSFYYVRY
LPLEVGTSHHLNILDSKRQRLIEVRVLRKEKLKTVLGEVETIVIQPLVYS
EGVFEGKGTVHIWLTDDEQRIPVRARTRVTVGSVTATLVDGTFEKKR
>GSU1476 hypothetical protein
MTITTLLGLLAGTLTSVAAIPQVVRAFRTKRVRDISIWQPLILVTGLFLW
LGYGLLLRDLPLIVANIVSIICNSAVIVMKMSYGGDDNRPADDYPCE
>GSU0992 hypothetical protein
MTSQKAGPKPINTARRPERPAPVPSPARSGTPAGPVRQLQQTIGNRRTGE
MLKAATAADATGETAPGVMELKGMPTFVPPPPIADFLKERTRGNVNVRFG
SLAAGQLEVRRVGQRYSVREQAIPLTHPLFTGALAPSLLVAVGDGGKIRG
RVGFDGGKGNLAVLIRKSPDLVGLGALDLSRLGSAINTLENGALHLGIKG
TPIRMGGAFTGTLTLEAVNEAITFEGSAAVTVRGLASGSLELKRSTEGIV
TGTATVGLTLPKGFSGNVAVGWDGRAITGEGKVGYTGEKFSGEVLLRLME
KGAAARLEAANTAPERTPAPAATVGPRSDKVDYVVFGEGDLTFAFTDWLN
GTAHVIVDPRGNLTVIGKITPQKEFILFPQKDFNKDLFKVEARASYGIPV
VGNIFIFANVGMGAFAKLGPAKFYNIVVEGTYSTDPKKAQNFTIQGSLNI
SAAAGLRLRGEAGAGLEILSHDIKAGAGVNALAGIRGYAEATPVIGYREK
GAEGEDKKGEFFIRGDMEIAAQPFLGLSGDLFVEIDAPWWSPVPDKRWTW
PLGGKEWPIGGSFGVGATVDYVFGSSQAPSVEFKPVEFSAEKFMTDLYSD
KATGGSGDKGEKKGAWKEKNTRAAEPPPKQAKKGAAQEGEAPKQSPAKAR
VTPGGPKKAKKPADPNARTADGKTVRQYQAEAEKRGKRPGAKEPEKKADK
PATDVAARMGRVKTALDQALAYAEKTGIGLNELNTVLKSIRRRKEYGLKE
LKARDGGENWIVAATLNPTQDLKTVKKKGKAAAAGDLYYEKPFPIPSYKE
HAGAGAKHKTLYTGRAKADKLFHEKRDRKTGQVGKWESWARQNLSEPLKK
KAADLGMSDRDIVRPRFNRAGEKMTFHIDHIVEYQLQGDDETENYWMLRG
SENSSSGSTLKAAIAAVRTESDKKRKEDGQPATVKIFFKNPVLEGNAEDT
IYWRKSEIKKGDHIAAYEKHRDSLKNQLKE
>GSU2752 hypothetical protein
MSKGDTKDDNTVQEAVLIMKIVFYRDISGGKPA
>GSU2922 hypothetical protein
MKRSTPRRRSLVLLAAFTAAALVIGLLMFDKYRQRHELPTVQPVPGATGS
FRVTLFFAAPEGDGLAREGREVPACDQLADCVDAVLDELINGPLGALQPT
VPPTTTVRHVRIEGHTAVIDFGRELADGLPGGSSSEMTAVYSVVNTVCFN
FPQLKQVRFLLDGAEAESLTGHLDLRQPIEPDYSKERPAEPEPRQPQ
>GSU2660 conserved domain protein
MSHLGCGCPGSMARVIEREETTTNDTARTPSALRQWPVQLHLVPPSAPYF
RNADILISADCVAFALGSFHQDLLKGKALAIACPKLDETGTYVDKLATIF
RNNDVKSVTVAIMEVPCCRGLDIMVQQALAKAGNPVPLETIIIGIDGERR
N
>GSU3185 hypothetical protein
MNYLKWLYFTLISIILFSTNAIAHGGSQMEPSYLYEIQDFTSAQSLEPRI
FTIGGGLFGGYRNVSVFPVPFDNAVGSNDMGNAITIISFPKGKIDYDRYF
RNAADDMSGGGEFVSQTAKDILGYGQVRRFLLFDFKKKLHREYRIVFPIT
QYIEKIALADSERKRFLFEIESQKRNAKEPFDSNNFLQLIDLSDDKHKVI
KEVFKPAGTIWTTTKERVFLYHILEKKLQVFDMKLEVAHHPLEDALKQYK
DKIDFSRIYVHPQFPFAILRGKKGAVYIGWEKKNIPVPHLLISGASQFVI
SPDGQWVVFKYERTGAAKTYLMPVSEKYPHYLGSPILLADTYFDEGKFAW
TTNPTGFVGSRLDKIYHWDLENRDFPEKGKMSFHDYIVKNDLEKLTREKK
QGLGNKH
>GSU1751 hypothetical protein
MIPVMRTNRTDSDRTGEGALLSDTWDTTVAGGHRGYGAMKGELTREGA
>GSU0182 lipoprotein, putative
MKTRLSVISVTLILAAALAGCATSREMEQVQADQRLLDAKIEQALQEAQA
AKAAADAAKLEAQDATTRAESAEKAAQERERLADEKAKKADAVFQKSMRK
>GSU1209 conserved hypothetical protein
MKRLLLFACLATLVTAGECRAGFLDELTQRAAPLLQGSALDDATVVKGLK
EALATGTERAVNAVAKPDGYFGNQLVKILLPEKIRTAADVLGTLGYRKQV
DEFVLSMNRAAEKAAPKAAGLFGDAVRQMTFDDAKKILNGGDRAATTFLE
GKTRSKLFEEFKPVIAKSMSEVGTTRAYQEMIGKYEALPLAALAGGTSLD
LDTYVTDKALDGLFTMLGQEEKKIRTNPAARTTELLKTVFGSK
>GSU0052 hypothetical protein
MSSNSKSAISIRVDPTNPGQFFACCGLLELADRLWDGAEGWFDKDGFLFS
LRPYQEKAQKCAPATLLAEITRCRLTNTMSDSELKRRDQLMAVPKKIIES
DPSLEAEKKVLDKLWREAPLVLHSPFNIRVDWFLDEYCGGTMFKTWAGQQ
SVFEIGRGMKAALDSGDWSHTSPDDWLRRRMLNDSLPFNFDSELGGVGSD
LDVGFSFDPLKTIKVQARPLLEFLAFVGLQRFRPMKINAENRFQYFLWFG
PLVSEVAAPAACGLFGPVRLKAFEFRLLYRTKYLKSFLPATPIQRS
>GSU1060 hypothetical protein
MNMGKKIQEMAVWLEFLVGSGLAIFFHLVLHYEQAAYVIFGIGILLALGT
YLIREEIEKNRSHLVEQYQQAHEIPFAMAGIADPECLAKAHELIAGTRRT
ITILQQGYIPLDENEFYLEGAKLSDQAVREIRAVDPLMAGWWTRGALVNF
YQANLRALDRGVRIRRIFVVNREELGEPEVQKVLLSHYRDDIDIRIVYRD
ELPAAGEIIGRDTNSSFDFALYDDRVATDVFAHPGKYYGRKTAQPMEVAK
YRHLYELIEHSAHTVSEDGDRIVPSAEVMPLAS
>GSU2910 hypothetical protein
MRGAALRATLWLALTGFLAGAADAAVLRGRVTDVDGKPMAGVKLFVYDSA
HVRRPATFISPPSAGDGTTAVRVPPGTYWVVARLKLDESYGPLMPGDKHS
GEPAVMDLTADAEIEQDFTVADIRDLGRMRRPLVADFVKLAGRVLDGDGN
PVANAFVFASATSDNSRMPDYISAWTGADGRYTLIVPAGHKQYVGSSRRF
PPSTWRASAEFVPSPGTADSALDVGLGPD
>GSU2671 hypothetical protein
MKRLKKARRLRLLTAWLAVVALAPAAALAAEHEIHGSNDFSITYNDVQGA
GGSQSSLTEGTRYLEILNLFGNGRAGDFDYNWTVGAKATDDRRNDQKTFS
LTNLQGRVSNKIHTITLGDTFEAFSQYSLNTAVKGGSYKFTPEQSYLPQL
SLLYGIAYSRWDNFWGVDAVERQVAGGKIRQNIGPDFWIGLSGVQTNDHV
RVLGGELTNHQTWTVDWEYRPIPGLTIQGESSWLNGDRSPSDGAPFDAYH
GHAHKVVAIGDGGPSRVTLEYERVSPEFRTTIGSATPDREKVKAKWRYKY
SKDLSITSAMLWYHDNLDGQKAYGTDHYKPEIAVSLKKPFKRQYAVADIS
YKLDRAYSPVMRSLDHYITMGYRDRFGVLDSDSTFGIIVYDQNGTAKRKD
VEFTYNTSLSSRHTLGRFVLKPSIYLGGWTHNDEVAENTDQIVEYSAGLG
VDVPDLKITSTIRVGENRLLKEAGTDTAKTFAGVNVFWRPEMLAKAQGML
YAKASVNDFRYDPSLAAGSRNFRETSVTAGISIQF
>GSU3440 hypothetical protein
MTAPSDDFGGLVTRATRLWTGNFGNLLVLSLVFSLVAWIPVANIGFLAGY
IRAVLKVARGGKAEPGDIFRAWDCFGDLFVYFLLVVVAMFALNHVPVIGQ
VAGFVLSVVVAPGMYGIIDRRMKFMDALRWSLETIRGDLAGWILAVLVGS
IFTAVGALLLGIGIIITLGWGSLVMALQYDKGEQPRVIIL
>GSU0498 hypothetical protein
MKTLLVFLAVTLLNGAFLYPLFYLGLEREVSWWLVAAMVVVGAFCIYLLV
RYRKSL
>GSU2474 hypothetical protein
MIFVAMLGVRSLIIVEILLANLKPEIYTSNIY
>GSU2500 hypothetical protein
MLAVDGQVRGTAAAGDEIARQERKGVEDVDHGVAPLGDVDLVGGRVDLDR
PRPHPHVNPLGDEPVGKIDIDQVGGKLVDDQQPVAPPVQLRPEGRGFGRQ
RGDHGAPRDVHHRHRVGQGIEHVGRASQPVHDHGAGPAPHDDPGQFRLGG
RVHEAERIGAVVGHQQGPAVGGQGQFDRAVADAEPDAGQGIVGHQVDDLD
GVVARQGHVGPARRLIEDDAGRRPPGLDELRDPGLPLVEHRDGVGPLVGD
EARESPQGPPGDLGLEDRREAEALADRCIFIQREGDEDTLLGFHHEGEVD
HDRLLGGQQGHSPLVGLGARPRDPERQAAVAPPRQDRRPQHPLPLHGGGA
RLGQPEGEGYLRAVALDRRNPRHGEVEPAGVPGERLQVEEQFPLGEQLGL
GDDNGRVAAAARQRGEHGDPQNDEEGSPAGRHALGRILVHGAPRSPCHGH
GTDVWHLVQSVDCSGEKSDAGRS
>GSU0738 hypothetical protein
MWVGSEMFSRVLGIFVSSEVFRLLCVIKCY
>GSU0311 hypothetical protein
MCRIIASLLKSIRSNFLFSTLLLRLPAETGCHCQPYRPFRAKL
>GSU0834 hypothetical protein
MKKLLVLAAIVSFGLNALPVFAQPEPTGKDDRLVNCTSRDSLPERITRLE
KEIAKGERVYTAAELDRLERKRDEAKDMLRVLMLGGKR
>GSU1213 conserved hypothetical protein
MKSIITRALRLACIIIVMLMIPAGLLAQDEESAPEQATTFSKEELTQMLA
PIALYPDSLTAQILMASTYPLEVVDAERWRRQNMGLKGEELDKALQEKSW
DPSVKSLCHFPDILFAMSEKLDQTRKLGDAFLSQEDDVTATIQELRRKAD
EQGNLKTTAEQKVIHEKEIIRIEPADPTVVYVPVYNPLYVYGPWWYPAYP
PYYWYYPPGFAITGGYIGFGPRVFIGFDLFAWAWFDWPVHRVYVNIGKTR
RFHQHYYRRDVGSHVWQHNPVHRRGVAYRDRKTSDFYGKRPPRMAPVRPE
TRGYPGKFPGRPESRQQQPARERRDVPGAPQTGKGPGQVPRRDSTVAPQA
QPPQREQVQKPARRDTPFQGVGEGSFERRAIKRGESSIRSVTPAPQGGGR
GGWSGQRAPSGGGRGGGEMRMPGGGGGGGRGGGKKEFR
>GSU1079 hypothetical protein
MAGLLIALFINEPLPPSQREGTVMKTQHMFILSLLIVMLTVTTAQAVSLT
YTYSDKDYLGGASWGSMTAYVEDANTLGIRYTAAPTSVIGAGSTATGFAF
TFSGALPIAIYNPSRGAFGDDQDVLKWVVFEKKLSTLPPPQNGDEFNPVI
DNKYVFEFAATEGDGRSINSTGVKAGQSDIFYMDFNSSAVDFLAFSLADL
KSFVELTAVRLMGLPSDINGGSLFLAGKLEDSTPAPVPEPGTILLLGIGI
AGLGLYRMKRG
>GSU0325 general secretion pathway protein H, putative
MPTSRAGTCNNHGFTLMELVVVIAILALAAALVLPRLTPGDTANLHRAAR
EFAATLRFIQDRAITAKTAYRMKLVPGESAIAITTILPDGTEGEPDDPIL
RRRLLPEGISIASTFTPRVGKRAAGEVVVTVGAAGFEEFTVFHLKAEGSD
AVMTVMAYPSSGKVKVAEGYREEPL
>GSU0563 hypothetical protein
MAAIAVAKVVDINERPKRVASILNIPSLPKSARVIDCNAAFIPTDVVDTC
SIEIDANDFPDLLSGYNFSESPTTETSHSAVFTKVGPEFPLAVEFTAVPK
ELKHGGHVKVLADRERRRAVIDLYVE
>GSU0050 hypothetical protein
MIGRRDFFKVLGLAALIPAAPLKPASAGRRIDLLEVHIAGFQYHEGMSAG
VFSMLKRGGELCLRREPDNPYDPFAIAMETQAGNKLGYLPRRINGIPAAI
LDQGVTTRAEIVEIDALAPPWERVLVRVWQEV
>GSU2251 hypothetical protein
MNLLLFPLYALEKLLIDLDSQWWWFPLAAFAGGRLCKEHKPELIYSTGGA
ASAHMAAFFVSRFTGLPWIAEFQDPLVFRDWKRSKRAYAVYAFVERVVCA
RASAVIYLTEAAREHASKRTELGDKGYVIYAGAEPATPEYDDSRPCCRDT
WRIAHFGSLGGSRNLSVFLRGLEMFLSNRPELAEIVRFDQYGTSDRVSCE
LVDNFPVSGVVRNRGRVSRNEALSAMADCDVLLLIQNTEDISAETIPSKT
YEYFHMAKPILGLLHRNPELSRMLAELGHRPLAADDPVAVKQGLMEMIYE
WQEGGRREMAISPYTVKAAVGRLVAIADSILKN
>GSU0788 conserved hypothetical protein
MLTVIIGAVALFAAGVLFSLPEVHHVYGFGGIALGALLACTTLLAETRAL
SDFWSRFRPRGGGEV
>GSU1375 hypothetical protein
MDRSSRNRLTERQIPRFAGETLFDGIARAVCRAGCLPRKELYEAWEVARR
VRRRFRGGRVVDLACGHGLLAWIMLLLDDSSPGALAVDRRLPPSAAGLAA
VLETEWPRLRGRVRLLEADLAEVELGRDDLVVSAHACGGLTDLVLARAVA
AGARVAVLPCCHDLAGADLAGLQGWLDGPLAMDVLRATGLRERGYRVFTQ
QIPGDISPKNRLLLAEPASFSPCR
>GSU2159 hypothetical protein
MSRYRALKSSVDLHLQDLIRNHDKVYHLAVFERNDGGYDVLYGYGRRGAS
MRLGILKGGENLRDYQTAIRLMHKKETEQVQGDGYQYADGITPGVWNNLM
PKPSAAPTAPATPQPKEPGPEATEPHIGGNGPKSWFW
>GSU2154 hypothetical protein
MSKLAAFLGRAIPTHQQKQSSHLGDRTLYVGASDIAGCPRKAALGKQNPT
GHDIKTLLRFSRGHAAQAMYADFFRTGGALFEEEVEVRHPAIPEIRCHID
FLFYANRQTKRLHIVEMKSTDGIPDEPYASWVDQLHVQMGLLQLTLDPAV
EIGGSILVVDLNAGSYHEFNSYTPNRLVFDQLVEKGKHILAAARGECAPR
TEPGILCGYCPFRTGCPSHAVALDLPREILDTGRKYLELNEQKKALENRL
DVLKNDILTFADGTFKGASDGILINVTSVADSTTIDSYKLKRNYPDIYDQ
VTKPKSGYVKLEIKPFTPLAAQAA
>GSU2740 hypothetical protein
MDLLSSFYLQLEACSGRITAAADITSFGRLPSLSQPLITFGASDQRGYDA
LA
>GSU2181 hypothetical protein
MIFTLFSPQKKQIIHGKTLNKRGVHQMATSFLLPRIKNLAPSGTFTPVEI
AFLIEKGYLQYGFERHDVTREIQEAGVAYIEHIIRDIQGELQAHVGMHAD
FDRVLLFGGGAAFLKNGLPARNIEVIVLPEPEFANARGFQTLAFAKGV
>GSU1225 hypothetical protein
MDSRPLGLVLLTGLYLFFFIVTATSYGNPFPFLGRIYVGTAAKAIVFVDS
LICLYLFLGIYKRQLLTWYCLIGYNLFEVVNTIINLNMITTAELERIIGE
PIDKEALVVNNIASALAILLLTQYIYRHRAFFTNRRHYLF
>GSU1931 hypothetical protein
MTVLFFLALDTDAEVHGSARKRLRGVPLAVLADAVACRTLHPRVLDALAR
LHYRYGELATQIAVHPNCDDGTRAFLAERGIAAAVGVTTVHPVHTTEHPP
QDEVPAEEDDEPVDEESEQFKSKYQLAQVMGVAEKIKIAQTGDKEWRMIL
IKDSNKLVSGTVIRNPRITESEVLAIVKSSIQNDEILRVVCASKEWLKNY
QIRKALVENPKTPIHSALRLLTTLTEKDLSHLAKSKNIQTVLSTQARKLL
IAKKEKR
>GSU2499 hypothetical protein
MRILHGFMAAAALAAALLPATAGAEGLRGFLELSYVNSDNQSTDAGNVTT
TTKSSSILQRYNLSLDRMLYPTLRFTGGGTVDYTMTRAELNDNQLDTTLL
KVSPFLDLTLGTPLYSAGLGYNRRQEISDTEGIASSNIIHENYNAKLSLR
PEGLPSLEFLYTRTNSYDADRQFQDVVLNSYNLGSRYKPVEQVELNYQGL
YTDATDNLTRNNTTVLTHTGRVTYNDQFWQDRVTLYANYNISSQETNSAL
FRLFPTAGLSALTDTPLLVTLAANPALVDGNLNTGSNVTIGIQPVGGDIR
PRNLGFDFGGAVTVNRIRVWLNTDLPSDIENRFSWTIYTSSDNLNWTLVQ
LLPSAPFRSNEAGTGWFYELGFSGATSRYLKVVTSPLPLGTVSLVNPTAN
LTDIQVTELEGFVQFEDVQRRTTSQVSHVLDTNMRVRILENPNLNYELSY
YYDRSDSDFFSTSRYILSNGLSASHRFNQYLSTSARFAREDSKENQGARV
SYIYSALLTATPLPTLTHSLVFSGRNEEIDGRKSDTNSLFLNNSATLYQG
VDVNLNGGYSMATAESGQKTNTTIVTFGLTLLPHSTLTVNASYSGNMTEQ
TGGGLPDNSTFTHRGDLSAAFNPLRSVYLFGAFGVIMEKNKDMVTTRNIG
GTWSPFRDGALQLSFSYNEAYNSSANQKDRTIVPSMRWNIRGGSYLDMSY
LYLKSTSVSQSSESRTFSSTLRFAF
>GSU3428 cytochrome c family protein, putative
MRRMLATCLFATLFGTLPAFGAEGDAVPRARMAADRLSATLRERVMIVMK
EKGPAEAVKVCFAEAQKLTAELARTEHLSIRRTSLKIRNPANSPDPFERS
ALERLAAMKEQGSLPGEMIVPARVGGKQVLRYVKPILVQPGCLVCHGAED
SIPPEVKTHLQACYPGDRATGFAAGDLRGIVSVVVEE
>GSU2293 hypothetical protein
METLYTMMVVLTTVVSAVMIPRIMLDWLRYQEFLRDRNDEELKMLIAGHK
GWIIRHGLCALGAVALVTCIKCLPELARYDELAGVTAAYGMMTLAFAFVE
SLLAQRIESSLQSGLVPVSTDSQFEQ
>GSU0317 conserved hypothetical protein
MKYVNAFVLMILLASQSIDAHAFGSMGGMGGGGAVSVQSTPLAGKVVETF
NSGGYTYVSLERDGKTVWAAIPETSLEVGREVTLKPGMEMQQFFSKALNR
TFDSIYFSEGLATQAASTKGKSSAGSKGGVVTPAEKIAVEKAAGANGFTV
AELHGNREKLDGKTISVRAKVTKVSAGIMGKNWLHLQDGSGDSSKGSHDL
TVTTQDLPALGEVVTVTGTLRKDKDFGGGYKYDAIVEDGTVSH
>GSU1368 hypothetical protein
MHEAVARMLAKYEPKSVDDSVRALREIIQEVALLGLWRSKFFEHAAFYGG
TALRILYGLDRFSENLDFSLLEPSPDFNLARYTASLEEELSAFGFNVRVE
MVGKAVESAVQSAFLKANTRNELLVIETGGELAGQVAAGQVLKVKIEVDT
DPPTGFTTSTRYLLQPIPFAVRSYSLPDLFAGKMHALLFRRWKNRVKGRD
WYDFVWYAANHPQLNLAHLEQRMRQTGHWSGDLPLSAAAFRELLSDSIDR
LDVDQARNDVAPFVKDQQALALWSHDFFRDVARRVQTDEG
>GSU3389 hypothetical protein
MKLVGFLCAVAIILLPFHSPGADSRDAELTFVAGEVFVKYEDVEEWVAAD
RGMPLAEGDQLWVAEGGRVEVSLRNGGAIRLDEFSVVDVLVLDRDNAQFF
MVAGDLYGVYPSGKPEMEVETPFGSLQPRARTTFRVALADEGDAVQVDVL
KGNVLAETAGGTTRIEGGTSWYLEEGSTELTPLDPPDEWDEWNLERSRPA
GPKDHGQRM
>GSU2776 conserved hypothetical protein
MNAPLRIVIAALLATAAPAVGGATETGTMMCAKGIVSIGTTAGEVLAKCG
QPATATSREDRRVAREWQSSGGRRLTTVSIDDWIFNFGPNQFQYQVILEN
GRVVRIESLDYGY
>GSU0730 hypothetical protein
MSFVHGPLCVSQILRRKKKKELLQNNCSLKIYYFEYILLTEYRGIPMMAL
TFTQARRTTLSWSTWTAPSPFPSRNRLRPPIAGRWP
>GSU1508 conserved hypothetical protein
MISVGKGKIMITPDHNIIKPIAIVLPQFHPIRENNLWWGEGFTEWTNVVK
GRPRFRGHYQPHLPKDLGYYDLRLPETRKAQAALATKYGIYGFCYYHYWF
NGRRILESPVDAMLESGEPDFPFMLCWANENWTRNWDGSYNKVLLEQKYS
KEDFVNHARHLIRYFKDDRYIKIDGRPVFAIYKDDIGEDIEKYLLLFRNE
LLKNSINVFLCRFERNIGTSRDLNRAFQVFDAGIEFQPLARQFSYLINYN
NRLSRRIYKTLKKHFCFCKNIDDVYVYSHLVDNDLKYDFQQGWPIFPGVC
PGWDNSARRRDTTAIIFDKSTPEIFKLWVREKIRITDWNLLPERFLFVNA
WNEWAEGNHLEPCEKWGTQYLAALQAGINEM
>GSU0302 hypothetical protein
MPHRLHLEQFGPVHALPVLHYRLEFAHLVREAVRRVKPDCIAIELPSTIE
APFLRAVERLPEISVIHYEGRQRRDGAESVYLLVEPADPLVEGARLALER
RIPLRLVDVDTDSYPRHVEALPDSYAIHRIGLTPYYEEYRRAAASVAPGR
EDRRRERGMAWRLQELAKEHGSILFVCGMYHLERIKDDFGRPQAAPLERV
RRQGVRLFNLHPDSCREILDEFPFISAVHELRRGPLPPEPDDRGETLRKR
FSAFELIVGGRKDLPAEELLRHAVERGARHAGRGEEFPDRQRIIFRLFQE
AARHYRQETGDPVHLWQKRAFFRFARNYALASGALLPDLFQLLMAARGCV
DDNFAYALWRLATFYHWQRAEADIPTISISPEEIWGGSRRIRFRPRERRR
KGLSHLGFLKRKKEKRPGEWLEGFTDPSICSYPPEDVLIEEYGRFLKKKG
AMQLSEELSRTEPFTSSLLDGIDLRETLRNVADGRVYVRESQRAKGGVGS
VVVIFDEDRENGNYPYRTTWLGEHEQESDMAFYATPPEDNIVGPGICRCE
YGGFLLSYPPRRMMDVWRDPDYVFARSKPEVLLLAALDYSPEKHVVHVAA
RPPRSIFRQIAARMGKKIVHIPLGSLSSVKLKSIRVLHILHGHDKRQVAK
DYIW
>GSU0714 hypothetical protein
MEKETKLSAETVKALLNEDINREDFQFVLQQLLDAWRPILEEELKLSESA
ERLVAVAEKQPHSCEDEQLLADRLFAPLATADVALRTLTPQAREALGPID
QWQWCLRKILCCLRFGWLLSRSRTFPVSVYYLYRYWLCIRRLFQNDPTGR
QPTPEERADFRKLTAGFAEVFRPWLVQEAKAMDHSMELADGAVSGQVDCH
SGGDAAEALFEKFLTVDNARLLMGAEIFEKLSKDPRFWLCRCWCICAFRF
GWCLGRSRSLIDLVRCLVAYFRCLRRCFQPLVCELTDPIGCVAEEVNTDL
KALVVAVKGTATGGGFLRYVLEWSRDGIAWHASDFHYPPIPPGGGTQGNS
PVAGGLLAYFDTTARDEGAYTIRLTVYGVQGTTCVRTITFSLFKQDVRIL
GFDGAFTLDTTAYDPAAMFVETVPALCTRPSGIHEISFGECLSIWGSAFV
GGCEGRKIKRYLIDYKPGFETDPTTGGWINIWKVEYNTVWQYRDMNMRKD
TSVLTASWVTDCVVPVPFPPYCLMNVPEARLSPSCWQTHVSTCGLSGLVT
LRLVVEDTGGTLYYDTQKVWIDNKPICAMIRIDAVPRCADIRVSAFATPP
DCAVPWNLPLSGIAWDEYIDPALPLTRPNDNFDFYWVKVSKQGGTEVQIP
VSWSMGTPCFFGTNRVGDPGTSCTPCDPANPLPAAVFGTLAQFDLRAIDP
LCSASVGYPVPADLLLPRGECCVYVFKLRVQDRTYTPGGPHWREALWPVR
ICNDLKPA
>GSU0827 hypothetical protein
MMTQLMKAWPVRMRSSVLKKMIAVMLLVALVAIAGNGRCFALDAGNSLQP
SYGKAGETVLKSCCGEETPCCPTDNDSASDHCSTCFSCPCSAPLSSHGAL
VSYAPLIASISFLEHRNAPPDVYLSIFVPPQILA
>GSU3267 hypothetical protein
MGTTDMIISAAILGVAAWLLYRSVWKKKGHCQGCEGGCCDKK
>GSU2580 hypothetical protein
MPVTQEAVEKFLGRLLTDDGFRQRAIVSLADACREEGYRLSSEELRAVNQ
DYFDLLEQLAGQLDRSIKRFSPAEPEECGRESRLSQPAND
>GSU2784 hypothetical protein
MDIFEELRRAVARAREENDLPDFLAERLCRIAGQPERYRHLAADIADLAG
QVALYDTYGQTGYMGMGVNNAVLEGSIRRLEEAGRSPAGQ
>GSU1786 cytochrome c family protein
MPVAAVLICLVLLSVMPAFAADYTGGVHLDRSKVPRGCSTCHLKFNFKDG
GGPETCIVCHGDPSRLTRQYAGMPRGFAPRESDMKNIEDEFRKPFRHPTF
DVRGVHRGGETLPEADPRMPRHAVCVDCHNPHHVSPANKFAGIKGRRVGN
LISTVTAEFELCYRCHAESANLPGRFNNKRAELASTNPSFHPVEAEGKNT
AVVSLIHPYKERKVTPGDVSIISCGDCHGSESSSGPRGPHGSLNEHILVE
NYSTRDNQPESPHVYALCYRCHDRSSILGNESFKFHAKHIQGTGGIAGSN
GTSCYTCHNSHGSVEYKYLIRFNRSVVSPNSQGQLKFVEKGISTFRGECY
LSCHGVDHNPRVY
>GSU1386 conserved hypothetical protein
MERTHDFMGLYQAWERLAPGPRAELRRVERPDDLLEVPAFYRLFSGRGTT
EWEKGAYQRLIFCLPCIKHSGENISLGRALAKGRGVGEKRLFQVVRSSEP
NDMIQLRRILRMAEPTVNWDLAARQLWYWNDRSKRDLLEDYFLNHSN
>GSU0912 hypothetical protein
MNQSTRIVVGIISLFLSLFLAWRIGIWLEPAPAGPSLPAGDPKSPPYATG
TVQEDLHFEIRNVRISGDGATLNGIGIIRFDTDRERIKPAVLAMLTAVKK
KTPAAKMITLELKPAVECTQCTLARATYREGRTVIRYGIPSQEQIERHNA
LIGTTDGTGRRIDRPRLYRPDKETFGAGLAVTMALEAARQKNPSAGEEQL
LDQAAATVGISPVVAARHRDFMKAYFTGDGYGEETLDTPLQ
>GSU3226 cytochrome c family protein
MAARPMITALVFFLGTAAPAAALSPSLSVTDSHNRHNLSVSGPGLTGATR
ASTEERVCIFCHTPHHATSITPLWSRELSSALYTPYDSSTLKAIQKPGQP
TGASRLCLGCHDGTIAPGMLSGGKMIAGLPTLPTNVRSNLSTDLSDDHPI
SFPYAGSISVAAVLRLPAELPPEISLEDGRIECTSCHDPHKDRYPPADHP
QKSGKFLVLDNNNYSALCTACHSRAGLTEGAHYLPGDPCESCHQVHRADQ
PARLLKGANVQETCVLTCHNGTGTVETSGTDIRSASVFGKAHQTGRLVLS
GRHDADENPLDFEPSQPHVECVDCHNPCITRHESTPLASPPTLNGRLVGV
TIAKSPDGIKTYADSEYAICYKCHGDRSFVPPAVPRRVQTGDQSLRFAQE
NPSYHPVSAPGKGMSVPSLRFEIVGFRPARTLSISSLIYCTDCHSSNKGS
KVGGSGASGPHGSDYQPILMDRYEHDTYPLAYAESNYSLCFRCHDQTILL
DPGRSAFPLHQSHLVNHQVPCSVCHDPHGVPLVLGGTTAANSHLINFDTR
FVTTGSYDSPGRSCTVSCHSANPRTY
>GSU2056 tripartite ATP-independent periplasmic transporters, DctQ component, putative
MERLFGILSKAFMVVGGVALLALVLLATGNVASRIGRAPFAGTYEIVSFL
GAIVIAGALGHTQRRKDHIVVDILSERFPAPVKRLLDAVNYAVTCGLFGI
AAWQVWVWGDKLRLSGELSETLKLTYYPFVYVVAAGFGVLAGVLFIDCMK
TVLRGKESEE
>GSU1959 hypothetical protein
MWAHCSAIARTMVYPSFFCSGDWLKVAAENLCPGDEAYILLAKDADDIRG
VLPLVRKRNALGGTDLHYLGSDFYPDPLGLICSPADRADCAAALRNHLLN
APDWDRLILDFLLEDEPADWTLPGKPVSVAPFKVLPRDFSELLGEFKKKK
RYNLRAMVKKLLDAGGELAASSGPESNIAYLDALFFLHEKRASERSLDSS
FTGPRVQSLHRALVAASDNVRFFGLRLNGQMIAVIYGFEFCNRFFYYQVA
HDPDHGHLSPGTVLLYLVIEACCVNGLTEFNFLQGDETYKGIWTNDSRIL
YRCVLNRSTWRSRVFSAVEESRGYVKRAMGLMSRGN
>GSU1232 hypothetical protein
MAGKEITVTFYRQNHVRDNWKVDLQGAESQRFFGTLADDLSRSGVILKQA
VNDGFTIDINSYGDLLNSIRISSPTDGFANVCIGHVIGQSPNLDLLDDIR
RAVTRIAFAPETVPPEDHNRKVCHNCGCGC
>GSU2273 hypothetical protein
MVMGTPLLSGSGAIRTPLSAGHDLVQNAGSHSLERHRAKILALAKADSLT
RHESGGGGIAKGKGRTALERGKGLRPFDGENATIHDSEVLRHGERRGAVG
RVRHGQRDDAPFPGTPWKNGPKRDLHRIRAVESEVVKRGPAVSLHQVTVA
DTALSGESEPGHAAARGLRSLECLQPVIHRPDPLLGLPTGPIDLLQLRQG
VLGPLPDRREGFGVALGLSRRLAALRFCRRLEYLFPRFDGSEPILKALAP
FGQAGDLTAESGSGPEGLHFHADHAGTAGRTHPEGPGDVGRRERDFHRAP
GGACFARNPGPVQIVGDSAAGHGRHGSQQKTDYIKRESHPPSLA
>GSU2668 hypothetical protein
MYQESEIFNLVLGVISLIFFAGFGTRRLSIPRSLLIAYGCLLLSYLFTVV
EGFIAPDAFNLLEHLAVAVAGVVFAYGCLHISRKQSSEGNGTS
>GSU1823 hypothetical protein
MLSHDVFSFLYDAVFVQSFTWSREGRIFSCKVAVCQAGIPEAALAGSSMP
FFDVDRR
>GSU1682 lipoprotein, putative
MKYFLIILTSLAVLWGCSPVGTQMRAWKGAPLDELIIRAGAPSSVSADMG
GYRLYHWYEDRGAVYTYGGMVNLSCDRTIGVDDDDVIISVDWRGNCIAPA
DSPWERTNDMQNTENTGVSQ
>GSU1558 hypothetical protein
MGKDRMKQLLAGLGIASLVAGAGAMGPGPALGTSG
>GSU3403 hypothetical protein
MGFKKSLIAVTATAAIAASAVPALALENEFHGMFRLKGVVNNFNATPVIR
LHGTVDTPLGYGFYDPQGKEKDVPTATYLEQRARLMYIAKANDDLKLVTH
FEIDSRWGDTSYTSGRNRGGAIGADTVNIETKSVYLDFNLPATGVNFKVG
IQPYNDAFKGVLFDADMAGVLASRTFGDLDASAGFFRFDDKGTVPGKKTR
DMFAATGKYAVTKDIKVGGAYYFIDDDRSEDDGFSLPYLFQEQGFTNTYH
NVGVNAEATFGPVTVDGFFLYQFGTLRAPANRHVSAFAGNVGARAQVGPG
TARTEFLYVSGDKGGSGTTNAFQSVQDEHGYYGGNMQILFRDAYAMTIDN
AIVYTTNNKDQGVIAGFVGYDLPITSKFSASANAGFAAVAKKNGNTSNAD
NSKYLGTEINASLEYKMFDNMSAIARGAYVVLGDYYSGVAAGGQDPDDPY
MASLILNYAF
>GSU0641 hypothetical protein
MPRVAMFGSFRGVIFACGKVFFCLDRPKTGGL
>GSU2778 hypothetical protein
MNQTVTVEEWVARFRAIGLDDEAMAQWHRLFERENPAGHQSFLEWLGLPA
ERIAEIRRTYA
>GSU0793 hypothetical protein
MRQFRKLFCLALSLTMTIVVFTAGVGAASSHGKDVSAHKSCQYCGMDREK
FAHSRMLIEFEDGTTVATCSLHCVAVDLANNIDKAPKSIMVADYSTKQLI
DAEKAYWVIGGSKQGVMTRNAKWAFAAKDAAEAFIKENGGRPASFDEAIK
SAYEDMYSDTKMIRDKRKQMKMKREGHAKGSDMK
>GSU1743 lipoprotein, putative
MKAHVYALLITIVFLAGCSGIKVVPLQVPQGVVNLSDNSQTVTKDGVSIT
VANSNTEMVSYNLDGTVTAFVVTIDNQSSREIGFDTGSFLLLDGENRQYL
PLTPEQVKELMSRDSYYLIPYPYVGFYYLEDYERTSFYNTTESQLPYYYE
VYPQDIQTRALAANPVIPGAKSTGLVYFRIDLGAVKEVKLLIYRSGSSKS
VAPDFLFPFAVGK
>GSU0469 hypothetical protein
MTRKVLAVAVMVAAGALALGGEAMARGGKGPMAGQGTGDRLQLRDGSCLS
STATATAASAATSSRQQLRDGSGAGTARQGVGTGAMRRLGPGDGTGNVTP
PQDGTGYGSAYTGGTAR
>GSU0232 hypothetical protein
MRFSASADGLRKGFLKQFRFFPIGVLEGALVIEFN
>GSU2211 hypothetical protein
MTHWNPRAACREKNDESPPDCRRVYSSLIREFP
>GSU1026 conserved hypothetical protein
MLQDKELLEALSSARVLTSMITPAVLISAAGTLIFSTSNRLGRVFDRVNS
LKSEIESVLAGTLPFAAERMEFLKVQVQKQRVRARLIQHALAALYTATAL
FVASSLGIAFNVAYGSRETSWIPTALALSGGLFLFAASAILIYESRHNLT
FINRHIDFVEYLEEQYGKQNGKHTKDI
>GSU2058 hypothetical protein
MMVNMISSKVIADMGVPLDQWFLSFLIAQQPENFNNFSGCAVTDPLLH
>GSU2173 lipoprotein, putative
MRTVGGVCGGTTGATGCGMNIYIFGDGQVEGLAEMKNCWAAR
>GSU1668 hypothetical protein
MLLEDVGNEVYKTWSNTKRRDEIAKLVQGYRSGLPAFILCRMTEAIAGSR
KRARKFLHEMMPPAERQEAVTRESGPMAEFVKDCLL
>GSU2040 hypothetical protein
MFPTHDLCHFFTQRVPSHRCRVQEKSDALKTIAIALPDVADWI
>GSU1396 hypothetical protein
MLKQLLAPLLLLTLALPPCVHCQEHEGERVFIVKDYRYQPAEGSQADPQM
GQDLCGTRCNALSFDYLNVIEPGGWRMIKVADHSEITVGLDNPFLGGACV
CIGDEYIVKIDDLNRVR
>GSU1965 hypothetical protein
MICASGARIVGAVTEDDLKFSLLYKQSDPTESLAQKIVKQFIPPLVTKAR
MLIDRILPAAGSDDMIPASWDQFESAADTVLRDPHVRSSLVKVIEDADLA
VIHGDGCMVGNTRIARAELFLSYLIKKRFGKPVIMVNHTADFDHPTLRAM
AENVYPLFDDVVFRDPISAQRCTSLCRGRFSADTAFLFKPAPLADWLPVA
QRPTYFDVWPDTACFDPAKPYLCIGGSSIFSYNGTPQELIERVAALVTHI
GSIYRGQIVLTVSDIVDQAVFRPIAQQLNLPLVGLTTPVQQAVDIIGNAQ
AYVGGRWHPSIFALRGGTPVVPISSKTFKMQALIEMSGLASCTFDALSID
QEKERMGQQLMSYIAQGDGLRHQLKLWAEEKSKNCWDNVTYLKQFPGSAT
TA
>GSU2533 hypothetical protein
MSIVRFFCAALAAVTVAVAGDARALDLLGEKLALHGFASQGYLDSTGNNF
LADTRDGTFQLNEFGLTLTSRLTDHLHLGAQLLSRDYGDVGDNEIRLDWA
LADYRVHDLIGVRVGKIKRPMGLYNEGRDSDFLRPMIFLPQSIYDETRRD
MLIACIGGGLYGNVPAGPLGDFDYQFFRGWFDFPDDSVIARATRANAQTF
ATRKGLGTVTDVTMDNKYVVGAQLIFNTSIDGLRAAVSWQRGRHDLTLNG
GALPPGELAIRGKYVYSLEYVNGPLTLASEYAETDREQRLFGQLTMDGRT
QEWYVMGTWAFNEQVSLSLLYDVFYNDKHDHDGKAYLRQGRPDYLGWRKD
FGAGLRFDINPSWTVKAEWHQVDGAALYLELYNSPQDMKRRWGYGAVKVS
YNF
>GSU2142 hypothetical protein
MIFINCLASDGSDYNRVSSDKIRSNIKSVCATLSGNGFRFTWLPLIIAIH
IHEYFCSTQITVCYDTADGAHAISATTTATTTCQDTHQHH
>GSU2968 hypothetical protein
MKVLGLTFAVMLLALLVVLSVLAHAAAPVPPEPEDPGMLAGALLGFASFV
VATKLIPGIIRFASIIDEETVSPNHAAIAETGDGGT
>GSU3355 hypothetical protein
MRVSMHSILREMGGEYHHHRRGKSGLLPEGTGWSRFGCKKAQAMKPR
>GSU0672 hypothetical protein
MPEKLLVPSIEQRIAGMLEVTRRMKDESEGRKKGKIRPTVTISREFGCEA
FPMAERLKEILDRQSGEPWVIMDKGLLEEAAKRLDVSTEVFSSLGHRPRF
LDDMIATLTPHWKSERDYFRILCDQIASLADAGNVILIGRGSTVITQSMA
NCFHFRIYASHEYKVRSIARRAKLPLQEAEHLVERKQKERDRFIRDFLDR
DISDLSLYHLAINNDRNSSDLIARTIADYLVARKSG
>GSU2402 hypothetical protein
MRLVVSLSPAFIIAAMCFSVSVCPGIRAEDSATYEWTDDQGTVTFSDNPA
RIPDKQRKRVKKRATIAGEASQAVIEQKTAPVKKQQPQRVESYGGHGEGW
WRQQFSTIRAEIAQTRADLETKRQNLKTLHYKMTVSNATTQGIFGNPRKN
RNNYRAAHDDIREGEKQLEALEQKLVDLESEAARHGVPFEWRR
>GSU0974 hypothetical protein
MTEAVMLQPPDRSVSRLSFAMRLVDDFAPEREPPAGVAVTLTGGGTQGIR
NPGGWHLFMDLPPGTYRVSARSELYLAEEKEIDTALLDPGLPVVELLLKP
SAAYPFPPGTTLVRGVVRNEAGQDVPDARVEAVPWMPVPSVKGRVRQGGA
QAGQPTIRLEHLSGGLAAGDMLLVRDGDPSRQEIVRIAAPLPSNASQPFT
LAAPLRFGHAAGTSVHLLAAAAPFTTATAGTGEFVVYQRNASARLFVIRL
STSVAGYLPDERDLEIAEGTTQSVGVISLHQL
>GSU3110 hypothetical protein
MDKFLLPLFVINLLLTLTDAAVGYHAAPALMRHFTPDEVTAEHSTAGMRG
MLGLVVALYMFFNCYGYYRGSGVILLVVTGVILADMVAQIALRKRADRRG
GE
>GSU3111 hypothetical protein
MSYKKVVGLALLLLMSAAPARPADGPAPAQGKDTCLLYHDNCPDRKDDIY
QRIARLRREIAKGPAVYTPEELRTLQQMLDEYEQLLDRLLYHNTD
>GSU3305 hypothetical protein
MKKMAIMVMAAFMMSATVPALAAEMTKEEKDMCLLASKNCATEVDSLQKK
IKKLNAEIKKGKKVYSADEIKKLQQKLDEANDLLDSILKGGGN
>GSU1536 hypothetical protein
MSLILDALRKMEQERKARRGATENIRPDVLGHRGAQRSDSRKRWLFAAGA
AVILLAGLAVGLLLKGGNDATRTAAVQRRDAGRPAPLTAGYQDEGTESAP
APAPVPQPPVPQPQPAPALEAPRPVPAPPAAAVPRPVPAAPRPDEEEEES
APVVRAPSTPAPVAVAQPADLSVTGIAWQDERRLRRAVVNGTLVAEGNVV
AGARIVSIGERKVRFSKDGRTFDVPLSGGH
>GSU0956 hypothetical protein
MEYIELLKNDTVGYYIAVVVAAITAILQLLKKTNEYFEKFHVTKKLARYS
ALIECCDENTQEHKFMTKLKLLEAFKIATNINTTPIKSKFIMRLYDIGMF
TIKDLKQVYTFFEVSDNEKAMCKFSIFNKLEVIWSSVVALFISIVFIAVF
IAITPHNLKQFFGYIAIMSLYLLCMYGVGEPIRKAITYRKFKLILKQEGL
WEEMSSQPANQLEKSELAESN
>GSU3458 hypothetical protein
MKRGMSWCMMSSSRLRTRPSLEPSSFITASIIATTS
>GSU0873 hypothetical protein
MAYAGGDFNRLETGVGISSAVNGVLGHFALGCGGHTCRHACGHAALLLRS
RRSGLGGLGLILASGGGERNNGERYQY
>GSU2176 hypothetical protein
MNCPFPDEAMKTVVSYLRRSGQTVVFSESSFVLAKGNPNISVIEQACADG
AVSLTEDGSIQVCGTRIMAELDTVKLRRRVEDHLRKSATKQDIIRIAACL
GIRLK
>GSU2150 hypothetical protein
MYIEGHISKIILIYKWICKYDMEFSLFADP
>GSU2747 hypothetical protein
MTIETIKAWCTVEEAVAKFGVERAKLLGWVEDGLVRAEEEKGKVVRLNMD
DVELKVEEMTGL
>GSU3410 hypothetical protein
MKANVAASTLAMVLGSVTSALAASGAREDNSGVLVWIFLGFCALIIVAQL
APAVLTMIGMAKGVADSVGRKAEAKN
>GSU1869 lipoprotein, putative
MTMNQARRFIRIMACCLAVTGFGLLTGCATLTELMGTWHVQPSPVETPPT
APADTSAAAPPAVATERPVDQGNVSPPPEQVLPVPAAGSGIDTPRRTLPG
LDRLAPAAPAVATVLRNLLITEDTTWRGEVLIEGWVRIAPQTTLTLESGT
VVRFRRTAPGITAAPGLLVQGRLVVRGSTDAPVRFTPANGTPLAGEWYGI
VLLASEKKNIIEHCRVEGAVMGLDASHSALTMNGASFSSCGTGVRLAESV
VTFAGGEVSGCEVGMELDGSEIDLRDVRFNGNVDGLTVRRSSLYLAGARV
SDCKGRGIDAFDTRLKLEGAVVERCESGIVLTECDGSVAHSLIAGNRRVG
LILSRSGVKISGSEIARNDVGLRVEDGRGIAWGNSFSGNGTYDIYNAGSE
DFRAMANWWGTGQSAQNLDSRLYDGNDDPVRGMVLVAPVLDSAPLPAVPN
SVAK
>GSU2077 hypothetical protein
MIFICILPLVCPSNLFYSLVKESNAKEKRAMKCPICNDHTGIAIDMHSDG
YADNLLECTTCGAIWIQEFEGIVLLNKKVA
>GSU2595 hypothetical protein
MGRSWRWLRTLSAIFFKLSKILFGFFYPLIGYLRSCR
>GSU1565 hypothetical protein
MYLARDKNNDLYLFEALPIRGSECWWAEKGVDGTYLRLNPVLYPEVTWDK
EPLPVRLMVMEGE
>GSU1229 lipoprotein, putative
MRLRAKLALAFVLLAAAIPAGAGCEELVAAAEPEDLSYVSVVPQESVTEE
GEYVEVPLEEVNVRTRQIVDLSAGYRFVDVSRDGSRAAEYDRLRSSGAGR
FTVGSVGRDLKLVVEGEYLNTKDYNASLLFDYKGYYRLNLRTEALYHNLD
HEAPHVPIAADLNPTDEYGMYVRQDSINFRYKFHDYPIHLNIGHWMLVRD
GYSQLRFSDYGFDESPNPADLDANNQDRRNRLLFQSRPVKNHTQEGTIGF
DAHLGPVNILYTFLIRQLDDGIAPPGFQFAPRYDENGDLLRSAGLLQHNT
TPESRYYSHTAKLYTSLTGGIIGSASYSYGKRDNQSSLTTVAGADQTSNT
LQNVAGDFVYTPCQWFSFSTKYRHQEIDRDAPSVVQVPSLSNPVVSVRSG
MDTRRDVISVNFSLRPNTILTFNGEYRGDFQQRSNTGTDVDRWRLPDTTS
SNRGTMTLLYRPFKGMRLRALYAYTATDSPSYGTEAENRHEGNILASYTM
AGRWGVTASYKIVRENNDHITQDVTTFDDPPLHGTLTLPRDRRSVHGAAS
VWFAPLEALMVSVSAGYLRDHSDQAILLSRFLLGGYAGTKYTSEAQIISL
NASYRFTELVDLSLALQQVRSLSDFKPEDTTADGISTFLIRDVSQAKTIE
NSLSARVNYHLSRLVSCGLDYSYRDYDNKLSSLYEGNVHQVTALVRAKW
>GSU2694 hypothetical protein
MFRSIDQFLVDYFKHSMKLRSVSVIALRFPARYSLPPPSTL
>GSU2790 hypothetical protein
MDTYMIDAELEHYLGERLSSSSRQALCRLHVRACARFHGAELEEYRRRLR
AHARAYARLGRMLQSPFRHMETALFLSSFTLFVAGIIMVVNGDLSALVAC
GTAAGLVGMIECARKLAGHWHRYGVMEAVYRELEEQLANG
>GSU0198 conserved hypothetical protein
MPSACPRNSSISLTLFRRTHLKRILLLSMVMLALAGCISGKDWRTASREP
AGIAPDPAVTKEAVLQVYGARAWGWRGWFAIHTWIAAKGTGEPGYTVYEV
VGWRLHRGQPVVRIEKDLPDRFWFGEKPRLLKEHRGAGVDELIDAVDRAA
RSYPWAGTYQAFPGPNSNTFTAWISRQVPQLDLDLPFSALGSGYGD
>GSU2924 hypothetical protein
MEKQELLDKTRETLSPFETGNIIAYIKSLTLKSAMENPWIISILAIVIFY
GVFKRSKFVLSTVFAIISITLLIRLTLPTGEGNELTLGSTLPFAFGGLVI
GAFLIYFIFIKSE
>GSU0137 hypothetical protein
MHDYYQLVTVADLPQDNRQILDTLTAIGAGKLPNDLRLLNYYRSIPVNYG
ATVESVERGSVELTVNQQQAIVMQLEKQVFLKSGHFPKDVLAAVTYVNID
KSVAILANFAYAVIRSERRQFVRVEVKDKIEASFSGDGFTVAGLLNDISL
GGVAISATIPTHPDAGIKGKVTLRLPGGPFEMPAQVLRVIPAKQTDLCII
EIQPDNRTEKGISQYIFQRQVEIIRELKDSIF
>GSU2003 hypothetical protein
MAILFLLLGPWRSMAFTAHQVTTDARIAFEQILDLWRAEDLERLYERLDH
PAAWGWDYFAERMVYASRVPACCWEKLQDVSVAARDADYVVINAKVGFEV
EGVGTRFVTRDFHLRRIDGIWKLPMNDVLTLSKYNFQRIPRKIYQRTP
>GSU1240 hypothetical protein
MSDELFNYYKYFDLDQRIEVRFPRAQNVSFREWGVVTLFDGDLLVLQLSR
DSLPENVSARTGTILDLRLGKGGYGYCCRAIIVEEHGDARLTVRLIGEVI
PDELREYYRIDAYIPLRYSVPRGLSDHEIRQRWRARRYPPPTTVEAQGAS
LTTPPQEPPREEELSQPPPLAANISGSGIRIRITEELSVDTLVDMELFLS
QDKLRVVPVVCQVVHVAALRHKAGEPPLFSTALRFLCIDERDRDMVVGFV
SAAQLEHLRIMRGGNVSVTELEYAEYSRHRRLRRIIVGIIVVTLIIVAVV
VLIISRINGKKGEIEQTYEREIKKYRHIVPWR
>GSU0727 lipoprotein, putative
MKRMKRTAGALLLGLALLLGGCGGGEVRISAEIPVVYPVDYAPAITWLSF
WKDADRYFVPGSIDFTDPDADTDIMTIVVSDSLGRVMARTVADLDDYVGY
TGGTVSFSIDYLTYPPGTYTFTIYLTDRGGNLSNPVYGTFQVW
>GSU0540 hypothetical protein
MRPKKRKRCTVLTYKVVEIGTVTEDVIEEALNEWTAKGWRFDGMQFAMRE
ASRRPSMAFLIFTREAEEA
>GSU2123 HD domain protein
MAKLAELSHLWKDREVQIKDAANLWREQSQENNSATPRPSFTHTEIERFF
SEMIEHRSSINGIRRDLIIWLLNLLDKEGDCPSVVRKHKDEAERIYSEDS
YAMLEAVPLYRHTLTVARNFIAKADQEALLADIILIALAHDIGKIPSYHD
GMYSSGDHPIIAGLILNGIPEFVSLPNKEDILRAVTGHHLMKSDNILTDG
LKQSDHEARQSELSALYLKMREQKNRRTENGVDTQFAAEEMEPTFARPAT
TELPSAERADPLGFALAKEKYTPQLINIPECIDPDAVIESLKNRINVVES
TPKGDQWSVVSMPNGVIYVNPDALWETIKEVSNYDPLVYASEGFEAEKRN
LMHSVITELARTKQALATEYVADGYYTSKVSIITGGGKRLSHLLIPLTAQ
SFGVTASELEEQKSAQLKRMVKDIKHKNKEVEQCVGR
>GSU1867 hypothetical protein
MRKVAAVKGKGMPVFVAGVRAAAAAFVLLLAVAGCSLPPERPVTKDELYG
TGIYSFYQIKESPESVLAALNREGEVILDARYRERPVYIKILALSSGLQV
HVIDK
>GSU0068 cytochrome c family protein
MKSHVWRPLYVALSVAGALLLARSFLVPNDFGVHEQGYMYGWHRKGNENE
WKQVPVKYRTKAYCVGCHAAKVASIDRSPHGVIPCENCHGPALGHPKDPP
TLTIDHSRRLCLRCHAKLVTPSSGRSQIRGIDPDTHNPQAECRLCHDPHH
PNLEELKRYE
>GSU0005 hypothetical protein
MISPTRRSWRSSRDGPRSVTGLFIPVRGFRSQAPVMERHRTIDMSLRERR
DILDLLERLKKEDLSPAEMDEIGSSLRKEGRRALSPLFRCLRRETDPEMI
ARYAALLEFFEDDAWLDQLVAITLARRDLDDEGKSALLSILNEYGVDVSS
PLFAPITDRAGGLRLTLPRLLDRGERGLVTFMEGLVTYTPEAQQAVVREL
PLVDDRRVVELFRVLLGFDEPETLPTVIETLGRVRYPEAADLLRDFLAAA
PDAYRAPAERSLRRLAFLGFTPAGETPAPLPFAAAFTGTADASGYSCVMV
ARWTGPGIIDTLFMELHESEGMRDAWGWSGLSSEGFQDLLREHHVEDTLV
AVDPAFAVLLVRDAIQRSIGSGFYLPPEFYVRRSIFAEVDLTPVAYEPPF
GEDEVAKACTPSRIAVGDDLIRDWFFDGWFIFNERVRLLADDLDRLEGDP
FGRGDEAAVDRFLESWCRRYVAPRRERLVRRLLLAADLMVRSGRECSLVE
QSLAAARSIREELVPLHRQPFIRRWLLDLIEMVREARAEGYEFPPTRRDE
EYEGEWD
>GSU0919 hypothetical protein
MKARIGTLTFVLGGVSPAFASVGEAEGFQGVFTTVFICYCAIVVVAQTIA
AIRSLIGSGSTKGAVPPAESQKT
>GSU2939 outer membrane porin FmdC, putative
MMNMRKISTISGVALGTALLAGTAFAGPRITFGPEDQGALQIDYKGQFQM
SIRDNGSGANGDDTTTNFNFRRNRLAFMGKYGDMLSLYVQTEFTEDPNVG
TLGVGDNSADTEFQLLDAVMRFKFHDGFRVNVGKFKHNLTRENLEACEMP
LTLDRSLFIRAPYVSTRDVGVAVWGNLFNDVFQYRLDAMEGRKAGDRDAN
GYSSPDSNFRYTARAHVTLLDPEKDYGYKGTYMGEKQVLTVGAGVQYEPN
VAYGNAATQSDSKDYTAWTVDGYFEYPVEGFGTITASAAYADFSMDDAYT
VSTNVDSGAIGLNGEKNGWYAKAGYMLPNFPLQIFGRYERWSFASLNNVV
DQDINWYGVGANYYIWGQNLKLTAEISKTDFDKEGTYNGVKSEDFTTFIT
QLQLLF
>GSU0821 conserved hypothetical protein
MTHQIDELIRRIKELQEELETEFKQKREEFQFVIEKKRIRFAEEVARQQR
RLKTGLFRYLIEARPLNILTAPIIYAGFIPFMALDLFLFIYQAICFPVYG
IPKVKRADYLIFDREDLPYLNIIEKFNCFYCSYGNGLAAYAREISARTEQ
YWCPIKHARRIKAAHDRYPRFFEFGDAESFSKGLERLRQELEKEREQGEE
RR
>GSU0163 hypothetical protein
MGATLVLRNKKVREGIIIELAVWILDAPVDGCSHRFKYRLFCGVLETGSS
LVRYDNERGKGDHRHVVDGEQPYVFTSLEKLFADFEGDVREVLKK
>GSU1247 hypothetical protein
MKRAVLALSLALTAVPCFAASDVGFDVNINVGNRPQVVVPAPPPVVVPGP
PVAIAEPPVFLVPRSLGFYVAVGVPYDLFYLSGSYYLWSGDVWYRSSHYN
GPWGVVKYKNLPPGLRKHKLDRIRYHRDGEYRVYEVERDHYRGRHFRPGK
ESKRERKMDHERWKEERRRDKDGRRDHGGKGRGRGHDD
>GSU1946 hypothetical protein
MLVLSALFCRTVVTRITKTGHVAKKIQPHDLPSIFVAVGFPKKNKKKPGP
YTISGISATRLSL
>GSU0059 hypothetical protein
MVPARLPKITTHATTTKRPARTTPMMSEMVGAGSDRVLLFLSVICFWIRA
SVEARKGYWLDMICLLSGKTLLPCGSVVDNLPAAGGCLR
>GSU1284 cytochrome c, putative
MKGIAWAIALLSVAAAATVHAAGASLPKGYEKWDMSKEKVIADKSSLFYG
IHNIYVDKKAMAAYRKGGPYSEGSRFVVVQHTIKDVGGKPTKGRRSMIVL
MTKDKKQTATGGWLFAGFTAEGKPSGVDTVKNCYECHLKEAGSRDLVISR
YTDFR
>GSU1051 hypothetical protein
MVEKRKTGRSKKRLSLRFGTDTPSRLAFTEDVSPRGLFIKTTNLCPPGTL
IQIELELPDSEPVFLEGMVRWSKKVPPQVIHLVRKSGMGIKITQFIAGEE
RYRRFIDELRRTP
>GSU0312 hypothetical protein
MENVSLKGLFVRTDQKIPINEQVDVSMFFFGSSAELSFSLEASVVRITDD
GIGLNFRKIDMDSLVHSDTAVTTTGADRQGVIEEFYGFIEKD
>GSU0355 hypothetical protein
MGELKPMTTESVKELLEEAGAKAMARSGRTESYSAPRDFSFEVKAIFPNG
MGLHIVARQFNYRDPWEATGRVNDLVDVSLLRDGIYTPLPKGYKWFQGRD
LEEGVDEARLRELIACVRDVNPKLYLLQELTGDL
>GSU2739 hypothetical protein
MGIKGFTRLLLLGSVLTASTAWSAEIHGRSSTQLLWFNNYFNDQRQIELG
EYLRMSVTNLDKAGKASLFGYGRMTQDLNNGEGFNGRLYYLYGEYRGLFD
KLDIRLGRQFVNLAAGTAIIDGGQVDLNNAGPIGLTVFGGRNVIFSLDEE
NGHGGDLALGVAAYLNGFKSTDLEVSWFRKWDDWNIARDTLGASVKQYLF
NNLRVYGNTRYDVVSETFSEVLAGAKYYPTSNLVLTGEWYQSYPVFDATS
IYSVFAVDRYQEAVFRADYTITEQIAVHGGYTRQWFEGGANGNIYQAGLS
VRPVEPLQLNFEYDNNQGYNGKTHGFIADAYYDVTKSLQLSGGIAYDTYQ
RDVLSDDEIARLYWVGGKYRLAKNMSASLRVENNVNAVYDNDVQGRFVFD
YDF
>GSU2497 lipoprotein, putative
MISSRSCSNRLLALLPLVLLLLAAGCARLSVEAGRPSGPLVAVYPMEDLS
GADAPLAAMRSGFTERLRQNGVRIMDEADLRRFMEQQRIRYTGGLDLETS
AAFKAKAGVDAVLIMSLEYYSDVSPPKVSLVARLVATGEKPEILWMDSVG
MAGDDAPGILGIGIVEDPSVLWNRALDRLSGSLLHKLAGEEAPAPGWERR
FKPKEAYRFMELDPRRRYSVAVLPFFNESDRTNAGEIMVLHVVKQLVSSK
AFAVLEPGVIRGKMLAMRIIMQNGISLPAADLLANNLETDFILTGRVFEY
RDMPSGSGKPRVDFSLQLLERSSKMVIWATKSRNTGDDGVFFYDWRRVST
ANVLTEGMMRAFLGVMVEETREVKLTPPRMSQEELERILMSP
>GSU3459 hypothetical protein
MPVKLKASDLFYKYPKDVVNRDQPKFRCKPDPSPFNRDDLYEVVAMMEAV
MNELGSSDGRVLNLLEDIMHQDMPRFIESREMVFDCLVETARERLGLG
>GSU3231 hypothetical protein
MLATRSGPVGFLFLMFLSFPFHGMAARGDRPSAVGAVDA
>GSU0258 hypothetical protein
MTPEVSALDKALAKVCELCPVCLHARYHQQGVVFEFVRNVERDICPFCKA
YERVHGKKSHERRQPCRR
>GSU0550 conserved hypothetical protein
MFVIAIASLIVITQRAQATDNKGQAVLERMGAAGKGRRHTPHHHLNKRIV
R
>GSU0086 hypothetical protein
MVEPFYPSASFAARSENAWPFTLSSTGRLLSPKPMVITPPSRWFSISANK
TALGLISRLPDCTKPRFSRLAANCSCSVLDTFLKSCFRMAGPSNQTSMIR
LASVTSGLSSGVVIWTATTRSMLMKRSMGCPLTSY
>GSU0667 membrane protein, putative
MENSLRSTLLDIVKGCIATVALFIAYLELPVIGMLAGVVVPLPALLYDLK
RGKWSGLAIVLASALIFMLIAGPAGALLYVLQAGIFSLALPRFLTMGGGP
ARALASAVAVTVAVITAAAVGYGVTRGVTVEGQVAESLQSSITQAIQLYE
KSGVSGADLDELREGMEQAAKSLAQLYPSLFVVGIAMAGGLNLLLLQRFG
RRLGLAVPGGSFGKYRNPDHLVWLPICAGFALLAGHDAVTTIALNVLVLT
GFLYFIQGMAIIIHLFDRYAAPAFLRYLLYFLLFVQAYLVVAVALFGLLD
LWGNFRRPRIPTNL
>GSU0199 hypothetical protein
MPQQTVKIRQKTPLESSAYPPLVDKKNAPDSSEAFRDHASPFSRNLHAHR
QKFLPFSP
>GSU2641 hypothetical protein
MMAEALQRENPSGFEGSVAGLPLTDVIQLKGQNRFSGCIAVEYRERQGMI
FFRDGDIIHAEQGKLMGEMALYEIIRWPGGRFNIQPKVTTTSRTIQQSIS
FLLLEAHRLMDEEQAAVRADEGTRREEAAAPAAVQNRMSAVAERIVRIAG
VSYAVLMKKDGTPLEDDSFEAEVLSAKGMSLAGIGSRLGELFGLDEVKSA
AVHARGKQILLFEAKNHYVSIAVRGDSSLAHVENEIRSTFAGRK
>GSU2153 hypothetical protein
MSLLPGTTHRQGDKIMTTHIIKHVTIKGCQVPVYAPVKERQISIDVTRII
AGVIFGSLLTAVGTLPFLI
>GSU1335 hypothetical protein
MPHRLGNAMTVPPERESAPVPSDRTAPPSEKGRSL
>GSU0618 cytochrome c family protein
MRSEVKIGLALTALLVAVTAAGAASIKNTKHDLSSGSTGATFKATNTDQI
CVFCHTPHNAQQDIPLWNRGNPTASTFTLYSSSSMNNVPVKQGFTADSIS
LFCMSCHDGATGLGGAVHNDPNGAAIAMVGGNDLITGEANLGTDLSNDHP
VNFEVTPAGIAADGNLGALDTGTNPPTMKTGDVTNGLPLFKSARGATTLE
CGSCHKVHDNTDAPFLRTTMAGSKLCLGCHKK
>GSU3241 hypothetical protein
MILLTYGRKSRAGFPCPASGHEKSRTPWGCGFFRESGRGDYFFSAAGAAA
SSFFSSFLAFLAFLAFLTGFFSSFFTSAAGAAGAAGAASFFASVAKTVPA
KATATRAATRVGQNFFHYEYLQKIG
>GSU2076 cytochrome c family protein
MKKKVLIGASLAAVVLTGAAMVGAAVPPPPVNQFLGIYDTKFPNLTKADC
LECHVSDTVLVQQHHALINTVTPPASCINTSGTVPPTLATGCHVMVPDGS
GGFTFQDFRNCFNCHTQTPHHTSPAAVAKDCKYCHGNFIDNPLDGHYIPT
YSASSVTPMPSGRSVTATDGNVVIVQGCEACHQAAPNAIDPKTNTVRPIF
SNQDTHHGTGITDCNLCHNTSSNVPIRQCEVCHGVNSLHNIQKDSPNAAN
LGTVKPGLEDLGWGHIGNNWDCQGCHWSWFGNSSPYTNATVPAINGQSSY
TVTAGKEAVLTIVGSSFVNVGPDGVTTYQPTVALVSGSTSLTLTPFSVTE
SEIKVSVPALVEGVYELRITKANKVSNLAKLTVAPARIIASATLATGKTL
TITGTGFGPAPSSEYDAGIGVYAGTTQANVISWSDTKVVATSPDFATNGY
VTVKTINGPLSGKILAAPKKVKR
>GSU2504 cytochrome c family protein
MKKGMKVSLSVAAAALLMSAPAAFAFHSGGVAECEGCHTMHNSLGGAVMN
SATAQFTTGPMLLQGATQSSSCLNCHQHAGDTGPSSYHISTAEADMPAGT
APLQMTPGGDFGWVKKTYTWNVRGLNTSEGERKGHNIVAGDYNYVADTTL
TTAPGGTYPANQLHCSSCHDPHGKYRRFVDGSIATTGLPIKNSGSYQNSN
DPTAWGAVGAYRILGGTGYQPKSLSGSYAFANQVPAAVAPSTYNRTEATT
QTRVAYGQGMSEWCANCHTDIHNSAYPTNLRHPAGNGAKFGATIAGLYNS
YKKSGDLTGTQASAYLSLAPFEEGTADYTVLKGHAKIDDTALTGADATSN
VNCLSCHRAHASGFDSMTRFNLAYEFTTIADASGNSIYGTDPNTSSLQGR
SVNEMTAAYYGRTADKFAPYQRALCNKCHAKD
>GSU2728 hypothetical protein
MDLPVKSYRFAVFVVLSGLVHVAGLCVFPSIGILDLGTPVLPMADVSISL
REPETSVPPLPEVQSPRESVDPGETVPPGERSVATAPERADQPNASSVAR
EAAPAAEALSPVAPEVAAVAAPVSEPAASFEAVADAAPAARPQHLEVMPP
LRRAGEFLAPGREKLTYRITMLGIPVGEAMIEAVRGREREEVTITTRVRS
YPVISAIYPVDDVIETRLVAGNYLITRIRQREGTFTGDSGFTLMMREKKA
FWADRLRNRYATHLLPREDVTDIVSGFYFLRNKRLEVGQPVVLHLFDSNE
YAPTTVAVLRREGVTLPGGRTVDTLVVHPLLKTAGIFRRTGDMMIWVTDD
AYKVPVRMDTAISLGRVRAELISAETGD
>GSU2634 hypothetical protein
MPEFFDVPSDMDVHESILSKETKNGFLVDVRMVKRHRQYEAALFLNGRYK
PGPPLPRPLDNPSGDTTHWMGVRPSVGFTYDEAQTILEDVKAQNDLHHIT
FRDTWGREYGD
>GSU0737 hypothetical protein
MDDIKALYGPRGLSQFLEEAAQEKLAAVKREAYKGLIEGYRSTADETVEV
LKKLEAAVTEERDV
>GSU3134 hypothetical protein
MSKKLTFQHSLAVCIGILAGAAVLGATAHAAEIVPFRTINMSPLALIHPA
PAAGSARLLAPRETEVTLIADAANNFAIDSNANESVHLDGETYRVALDLR
YGVAPWFEAGIQVPFIGASGGALDSFVEGFHKVFDFNNGGRDKYPQDELL
FSYHRDGTERLHYDDGGFGIGDVRLNAAVRLHEGTGESRTSLALRGSLKL
PTGESSRLRGSGGTDFALWLTGSTDLPVGEWGHLTFFGAGGGMVMGDGDV
LADQQRNAAGFGTLGFGWSPASWIALKVQGDWHSAFYGTSGLRELGSDTI
TITSGGTIAFSDRTALDIGVAEDASVKTAPDVVFHLALSHRF
>GSU0449 hypothetical protein
MLDAPLHISPGTGSSDFSLRYTIFFPQGKKMTVP
>GSU3232 cytochrome c family protein, putative
MASMTAAGLAVGAGIALGAHPVPIQLKTFEEVAAQYGAPVMPVMVDPVTK
KGFPYSPKQTCGGCHDYNSISDHAFHSAQGRSEWVDTANGAFDATKAKPW
TQGTAMYGKW
>GSU0445 hypothetical protein
MGAAGVVAVFVSLSSQLKLSRSKSPAAAGSAAGAGVSPNPPKLSVGTSQG
LSISGTCAAPGAAAVISLLSPVSRSVAKSSVPASASVVEGAGTAGAAGVS
SKEKSPRSSPADAGSAGAGDGESPRSRSEKMSSPGLKDSGAAGTSQVKLS
SAFAAFAADGAATATAGAATVSPAKGSTTARIPWVCLKAERSWLHFLQLS
RNSKLL
>GSU1278 conserved hypothetical protein
MTQKFTKDMTFAQALQTHPGVAGVLRSYNLGCIGCMGAQNESLEQGANAH
GLNVEDILRDLNALA
>GSU1388 CRISPR-associated protein, CT1976 family
MRDYLILKLQGPMQSWGEHTFEGKRPSGNFPTRSALLGLLGACLGIRRNE
AGRLQQLADSVAFAVRKEERFRIRHDGRKTSLPIIKATDYHTVRDAREDY
FGLKSHETIQTWREYLFDATYTVAVWNTDVATISLVELEQAVKSPHFTPY
LGRRSCPLSRPLFEMSFSAANACDALRMIPSDGGVIYSEEPGETRTIRLR
DVPLVRQPRQFASRNVHVYGGVDVSE
>GSU1022 hypothetical protein
MPLQSLRSCRAITCDSFMVTIFTPLQENQQ
>GSU0208 hypothetical protein
MSRLPGKRIVALFLVLGALLCAGIPAWGAAPLVMVDQGHGQCFVIEQEGE
LQLSRFGAILGEEGLRVAASRERLTDESLRQATGLVISGPFAPLAPAEVD
AVIRFIERGGRVALMLHIGPPVGALMQRVGIAFSNGVLHEQINLVGTDDI
SFQVRDLTAHPLFAGIDHFSLYGGWALNGQAPLARTSAETWVDLNGDQQL
TERDAMDRFTVVAEGNVGAGRILVFGDDAIFQNRYLDEANSRLARNLAQW
LGGGKLSGGKVRGRDM
>GSU1018 hypothetical protein
MKAAVLSLVILFAASASALAQPLQVVLATTATALTGTVIKPAPTEFYGLE
VVLQLPVGITPKLDPLTPGKYFLDPATVVMVDNSMLNTGTLVSSALTPST
DPSVGDTVKFLLVNPYGVKFGTFGSLWFDFKPGFVPNYNTTTQNVSNADF
KVLSYRVLDVSGAEISTDQAAASISKTYVWKQP
>GSU3171 conserved domain protein
MSISSEINQIDIKWWELFDTIRGLRTEVQNRIVIEREVIPIIFVPGIMGS
RLKFAKGKKQGTRAWDPDSSGFMFLNYGRFDVSAKKKKALLVGPEFDPDY
LEVDTNNPKLDKSFLRRYPRADKRGWGGVMWGSYGPILRALHDHQWEEPV
GHCFEFPVHAFGYNWTASNMEAGRKLKEYIGATIDHYAKGLRDEAGRPLE
RLCTRVILVTHSMGGLVARSACVLHGAAEKVLGVVHGVQPATGSPAAYWR
MKAGFERPAGGPRKSIWDWFLNPVKMFERRAMGYASAWILGTDGEEVTAL
LGNMPGGLQLLPNKDYRDNSGSRQWLAYPGADGRTLSLPRSDPYEDIYRV
RDNVPWRMVNPDWLDPGNPTGSSKSYRSKTSWDHYEEYLLEAEEFHDNLG
SRVHDDTVQFYSTTLDSPDRIVYTREPHVHSTPHQFRNKGAFRAYVDSND
EIRETFAGATAVVALEPFSGTGDGTVPDSSGKALKVPTARIGRDSPADFF
SIDHQGIYGTDTAKKIVFTTVWNLAIKRIEEQVGGPAGD
>GSU2775 hypothetical protein
MRMTGGLGTMNNETAESGTGCPSNRRRTRNLAPLLCICFLLALPRAVPSH
AGYPAPSEAQCREMVDSMLQTMKTVPVERERDRREARVVLDKAEKIVRDN
RQKGASECETWAAIGKLVVGQ
>GSU3397 hypothetical protein
MFSMVLPLQRKGIALLLLLLMVSVALGGAVCLGADHHPTHAAADTVAPTA
GDEAPCCPDEEGHSDGDHCQSCLSCPCQAPLAGDGLSLSYAPSIHVLSFS
EHRVPPPDVYLSKFVPPAESRLIL
>GSU0964 hypothetical protein
MKDGIVCPFYGKSDDICDVGCGYISPHDVNMIIKFCSCRYRECMKYQELA
ERFPHDILTPVKC
>GSU1040 hypothetical protein
MRRSPSNKQKKLIIITNKADVIMHKTMHMKSGLH
>GSU3249 hypothetical protein
MLMGGFITQNPLKSKNAPFDLRHDMETPFGL
>GSU2598 hypothetical protein
MKRAHRWCEAQLIPVSGEGQNIQKLKRIDSVM
>GSU0744 hypothetical protein
MACGRRTAGSGKVRGGAPAPKISKRHLSYFTAPEPELPEFSLREVYWWFS
RDLFSAGASPRTGGRRGEHAAQLERTMKC
>GSU1938 hypothetical protein
MKFRLLAAVLILILPTGCAPVRKVVTDIRQRSAQEEKLSLAVELIARGRI
DNATVLLDAIIIEKPVRGVTDEALFRLALLRLPSEPRTGDIAKATKLLDQ
LQRDYPDSPWAHQALPLSDFIAEIPARIEAAGELRRQIKSLRDLNLSLTR
ENKELRLNLEKLKTLDLELERKLKP
>GSU3409 hypothetical protein
MKTTAIALIATIATATTAFGASGAAIDEGGFLAPLFLAFGAAIVAFQTVP
AIMLMGSMLKGLFSPKVEEGATTH
>GSU0976 conserved hypothetical protein
MAGRKDPYRNFRFLVEIDGIVQAGFSEVTVPDTSTDVVEYREGSEEPRLR
KLSGLNKFGNVTLKWGTTDSLEFFNWRKLVMQGKMKDARKNMAVILMDEE
GNPAARWEFENAWPNKYDPADLNAKGNDVAIESVEIVHEGMKRVS
>GSU2039 hypothetical protein
MKLWLTLCAVVLLFWTSETTALTHHTYMNGTVLGISGDTIKIDDKVFTIA
PKAKIVVQDKRNGAYFEERASRSYISVGNSVTVRAMGNLVNEIIIERWKR
>GSU0725 hypothetical protein
MRDARNPRYGKEFRPVCVLARDMKSQQQRYQTGGHAMKTNDLVPIHDAAR
ELGTTHLRLLMLVKQGTLAGELCDGEWFLPRHAVEQFRAFGGDSRADLAC
RATCKAPSCGCKG
>GSU3115 hypothetical protein
MLALSFVLIGGGLLVTAWGLPAAHRLRRPWGAVAALAVLGGVVSALTGVL
LAVVPHFFG
>GSU2937 cytochrome c family protein
MKHKTAWLTLAAAALALCAAAPVFAEKAGIGWQETIVAKSGKAKTMAELA
KMYDSSSCIECHQEVHDEWEQSIHARSIFGTGRTAATFMTAVVNGLMEWE
YSGVKSPSDVKVEHLMGCAKCHLPQLADAEDSVAKEIISTIGSWQDALRK
KDSAKAVEEADKLKSLNINCLVCHNRNAITHKWTDGYPQAGVVYGSKEGE
HPSAAFPTMKVSPIMSESIQCGQCHGLGPNLELDNPTQCCTSYASYLWAY
KAEGGRENCQECHMKKSKLGHNMQSYRDPGMAKAAVEFKAEAYGYHWRDG
ALVTPKAVVKVEMTNHAGHSIPDG
>GSU1597 hypothetical protein
MRKRTLRMGIYALSLAVALVTVATFFHEQIAGVLGLSPPAASFFVSLGFW
GGGISGGCGVVLAVAGLFLRSGPDRQHPCRLAPSLIILCAFTFIFILLLV
AHFRGTEYPPPLRPGETITI
>GSU2334 hypothetical protein
MAINKLPWVISMCNRAVIRLFGNDVRRGRIWPQHH
>GSU0118 hypothetical protein
MWFLRTGVLLWLLSTVAIGTVSAAPPPARPETGKGVLLVKNSLAHDNAGG
GSFVVIYREPGIGRIGELRVASLPALAGVVKTPPGVLALAVTATKGEWLR
VHYDDAGRSGWLEPRRTWTYLTWEEYLRGKTARLLRGLRKEFYLARSGPD
GSAPELPPLTPERSMRILDVDGDRARVLVDLSVMAWLRWRDPDGRLLITV
E
>GSU2564 hypothetical protein
MGRDGSKVVSLEEERTRRAMVGKATGPDAEAASAVPGGATAWKQMNALLA
DPRAREVVRALPEQQFYWILTEVGMTDAVEFWELASPRQRMFILDMELWD
SWNFSEAKSHQWLNHLLEAGEEAVLEQLPHMDAELLILMFKKEIIVGGGI
GDLVSDEERVADWDHSFDNVYFLTFRNAERARAVGTFIDIIFRNDRELYL
GLMEGIKVELESELEELAYRFRSGRLADLGFPELEDALTIYARLSPDSFI
PSAEKRLDLASGITGLPMPAGYEESFLNRALARVEGPVDQELTALVNSAL
VADGAALRSREGMEAIFRRVHGYLSIALEHLSDGDVERAAAIIGREYLSR
LFRLGFGMVMELRKHHESLVIDDYATGRVIEGLRATPPRFYRALDPDGID
GFREFRELADIRRINELLDRHGDPA
>GSU0749 hypothetical protein
MHTKWNGRCHNGIARFVCSFPASLLPRHRWFKSPGISGFPYLDSASPS
>GSU3272 hypothetical protein
MERRTQRPEQHWTSGIGMNAWTVYAQQLKMEFIFMVVDGADGGICRLERH
DKEGTCGT
>GSU3364 hypothetical protein
MISTGYNFFLHPSDLACIWHSNYAKSISTMPINTRRIPDMGRMRENPRYN
VISMRISDEERDRLQAIMEATHKSVSDIMREAMELFSVQLEQSQPGDQKA
A
>GSU0701 cytochrome c family protein
MKSLQAGCVSAVLAAICIATPAGAFHSGGAAQCEGCHTMHNTSEGAAMTG
SSARFQSGPFLLRGSDQSSACLNCHQQAGDTGPTSYHVSTAENDMPVGSP
PLQLTPGGDFGWLKKSYSWAPAPGSASVEIRGEQHGHNIVAADFNYVADT
TNINAPGSSTPYPSDSLHCSSCHDPHGAYRIVENGVQATSGKPIRGAGSY
GSLPDAGSAVGTYRLLAGVGYKPKSLAGGPAFTHAAPFAVAPVVYNRSEE
SSDTRVSYGKGTSAWCTNCHALMHSAPGSSLHPADKQLGDVAQTYNAYRK
TGSLDGTVATAYLSLVPFELGDVTDLTALKDAIATTAGPAATDKVSCLSC
HRAHASGFSHMTRFAIDSNHITVADASGNAIWPDPVMNPSQAQGRTVAET
RQAYYGRHPQAFSPYQRVLCNKCHARD
>GSU2320 hypothetical protein
MKAPIVILCLLCFALTPPAASHGRSVQGVTFLERIQAGDATLTLHNAALL
RYLKVIKAYVAALYLPEGVPPQRVLTDVPKRLELSYLVSIKGSDFGKGAE
PVLRRNLAAEELARLRSRIDRINAAYRDVKPGDRYALTYLPGRGTELTLN
GTPLITIEGADFAAAYFGIWLGRRPIDDTLKNDLLGSG
>GSU0934 hypothetical protein
MSKDKDERKRPQLRLVVNNVEKRSGRPVAGEDDFITLDELILRRHEFRGH
FYAGVGRLQEKAYRSLERYLERKGWAYGLDPMHGRLMVIPAGLVCPEAIV
PDSSQQDEVLVFAGEDLTGVGLCLSLEMIIPFWSDDDAVMEDALLSAPIL
PYGMLFLEENRQDGYLDLIYRVALPLYPPAPTVRVLDKLFAVARTELAET
LRTLADFSD
>GSU2730 hypothetical protein
MQRFFAVLLVMIFIGSAALAAGGLDGFLGNLNVEARADPTGFAGRLSAQF
GVPGAQVSVVLGSVREPADAFMVYQLGQMTHKPYETVLKTYQANQGKGWG
VIAKRLGIKPGSTEFHALKRGDFALTGKPSHASHDDHHHGTEKGKGKGKG
KGH
>GSU1608 hypothetical protein
MARILTGVALMALLISAGIVQALETRVDVRVRTKDAKFLGTSMGGALVSI
RNTDTGELLSTGVTAGTTGNTATIMVNPHVRGGVLSDAQSALFRAVLDID
VPTRIEVRAAGPLSQRQAMGEASVTQWLIPGKHVTAGDGLVLELPGFAVD
VLAPPTNHAVTLEDGLATVTVRASVMMMCGCPVAPGGLWNADGYEIRAMV
RLDGSAPEEVPLHFAGESSQFMGELAVRKPGVYEVTVYAFDPATGNTGLD
RTTFIAQ
>GSU0977 conserved hypothetical protein
MAFQTEFEFTLPKGYVDGEGNLHRQGIMRLATAADEVLPLKDPRVQQNPA
YLTVILLSRVMVKLGSLPEVNPRIVEGLFVSDLSYLQEFYRQINDGGAAQ
VKAVCPRCEHAFTMELAPGE
>GSU1288 hypothetical protein
MPLLISSNPNTVLADIARTAHMPLESGEADRFRENGVIPSREEKPLLSRT
GGAMGRAEQLQAGNIDGTHAAHVYLHRAGLIQGFKQGTSEFSGKGYGQVT
PDTEDVPLTRIPSVLSFNRQSAHDIS
>GSU1556 hypothetical protein
MVRAKKSAMGVLGILAAALAALMLMHGCGGGGDGGSASTAASTGTLKLSI
TDKQSDSFDKVIISIREIRVVPAGRENAPDNDPGLPVIARFTPSRVIDVM
QLQFVQEALGEAVLPAGAYSQIRLVLDPAPANYLTLKADPVTQIPLTTPS
GQQSGVKVLGPVEVRAGVINAVMIDFDPNTAIVARGNGDYNLKPTGIRVV
HLSDPLTQFGSISGNVMSTFMPWSSATVSVKRRGSTNDNDPIAAGLIFSN
YTGGAWQAPFAAFVPPNTLPVTYKAFITTNGFQVYSSSAVSVVTGQATDL
GTIPLIPLP
>GSU0054 hypothetical protein
MSMYFVLTIAFLDGRFHGRRDGDEPEWPPSPLRVFQALVAASARMNGGAL
SSDGSSALQWLQAQPAPAIVAPSGILSASPYRLSVPNNAMDIVARAWGRG
NETNSGDANPATHRTMKSIRPIHLIDSSSVHYLWRVSEPVAPEIADYVHA
IVEMAQNINVLGWGIDMVVGNGAMLTEEQMESLPGERWLPHAETGVDGLR
VPVNGTLADLQARHEGFLSRLAHGIFTPPPPLAVYDKINYRRAIDPPPRA
IAAFSLLKTDASGFRAFDTARWALTVAGMTRHAARRAAQGAGWKESRING
CILGHGESIGDEKHLPTGPQRFAYLPVPSLEARGAGKAPVIGSVRRVIIT
AFDGACGDEIDWARRALSGQMLEKIKKDESDDKEHVALLSLLPGSDKVIR
SYLRPSSSWATVTPVVLPGYDDPAHYRRRLQHVTNSDEQKRLLWHLHERI
DGLLRKAIVQAQFPEILAKNALIEWRKVGYWRGADLADRYGVPDHLKKFP
RYHVKIQWRNDCQMPVRIDGPICIGGGRFYGLGLFAPVD
>GSU2036 hypothetical protein
MRTAGPGDKGFTLIEVLVATVIILVGLLGLLQAVNVSMEHNLRNSLRDEA
VRVGESTVYGMRRLTFDNLTGETTLKVNRNLRAFTKEYTVKRSLSDLGGR
STELVVTVEWDYRGTTYQHRVSTIISNPQ
>GSU2726 hypothetical protein
MKQRLQFLALAIASLPGMAGAADVAVDATTLIRLEERSVPGFKDETLAPA
TQFLGIDVEKLADGNLSFHLYGWGRLDLADKSTDGGSTDGDLAYAYLAYR
FPRADGLLKLGRFYVYEGVAAEHVDGVSVRTDLAGGFTGSLFGGVPVRLN
MTDRNKGDAIAGGRLGWRLPGYLEIGASALYEANTDTGLTGTPENYRQLV
GGDIWLAPFRMVELRGTTSYNTATDGFAEHSYLLLVTPMKGVSVSGEYND
YNLEHFLATTNNRSLFNIDRDTTLRSYGGSVAWTVAKPLELIADYKRINY
HRADRGNTHRYGGEVRMTLAERTVRAGLSYHRADADVSANAYHEVRAYIL
RDATRYRASVDAIAHLYDDAVQPYGGDNAYEVIASLGWKMLPNLMLSGDI
SYGENPRLDSETRGVVRLVYNFSHSGKGAAK
>GSU2883 cytochrome c family protein
MDVNGRSTAVRRIVSTVAALALALVLGGHARAASTCTDCHGMPPIDAAYR
NITTGGFKGSHQTHQPATATDCAVCHAGSATFATDHMNEKVELSSNINAS
PLAATYSKGVFFNQTSNPVMGTCSNVNCHFEKTTPTWAGAKLVSPTDCGV
CHGSAPATGSHPSLLAAGKKHGDYYGTGTGSCVKCHPDHTVEAKPFAHAT
SAGKRPLSLRFTAAPNNGGTYSKTANLTYPNYLPSRNPARDGSCSAMYCH
SDGNGGAARATATWGGTLAADCTGCHGGNAASAAPIITGLHAQHVDAAAV
LGTTIECARCHNGPVSAGNDRAVTGVAAHVDGVKTVAFVGGGTWNATAKT
CSATTCHSSGKATAPQPPTPAWTGAAMGCNGCHGIVGGTFTSIAGEPNYA
NAGAGLPLANSHRKHVVIAADCGVCHANTSTTGTAIKAGSTIHTNGVINV
NFNTQAGATATWAASTKTCSNISCHGTANPVWGGTLPADCTGCHGGGVTS
AAPIATLNHPTHLSRAYGPGNYLGTATAACQSCHDYSATQPDAKHADGTV
QVLNAAGSACVKCHPGALPAWTSTTRIACTSCHAATPSVLANGVAAPYKA
NYATRGHGQAAVTYNASRACESCHDANGAHISGVLGDAVRLALPNDNTQC
ASCHNDPAKVPTPNEQNVLSHVTVAGGAATSLCKSCHDPHGTTNLAMVRT
SIGGKTITFTNLSSGFVKTVAPYDGLCQVCHTATGHFRSGQAMDGHPTKN
CLSCHGHKGGAYAFAPATACDSCHGYPPVPAGFVTGTGNYAAGKPEDYAG
GGGAHVVAKHVPKTAVPAEGWANCTMCHGNGALNPATHQMALPVIPSKIT
VDVRDRLKFNHSLPLGPQQYDGKLLDGGANATGSCFNVNCHFKPSRKWST
SR
>GSU1109 hypothetical protein
MPMGLFREKKVTNYFFSAGAAWSAGAGAWAEVIGAWAGAGAGAACSAGAG
AGASSFFLQPAAKATAIARHSVRESSFFIDVFTSFPLKLPFSREAHGFLT
QRMRNILALEIKSRFFYTVYNFIFSRIQYVSETKLKIILIFHRFKHTNFM
FVISSMA
>GSU2385 hypothetical protein
MDNNALGPGHATVVSCGIFRPEIEHLARQGRFPWPVRFLESMLHMTPSLL
EAEITRLIGSLDDGPVVLLFGDCHARMSEHGSLPGVRRVRGVNCCEILLG
PERYHRMLRDGVFFLLPEWTSRWEDVFIRRLGFADSRLAAQFMGEMHTRI
TYLDTGVTAVPLVELERIGHFLGLPVTVEAVDAAALVRVVEESTGDGGAD
C
>GSU2770 hypothetical protein
MISSVHNFVLTTATFIAAISMISCGADSADSTPSLPAQRPFYDFQYNLDQ
LTVGIRLFHPDYPYSPRIDIYEGATLCDSHPLTSQTGNLTLQNSSKCRDF
TINLQYVPAANACQPGGLYLQSGSVTNPADGQTTNYTAMTLAAWDQNYNR
VVPWLTPFDTPNLLSQMSQSDYVHADPLSTVGQWTYRLDYASSAIPPDGC
SLQTNLDVSISPADETLGARVHNYTIRETNVGGQITASVTPNTAGAPTAS
VTLDGTIRHLSFVGLDVLREANIQASMAFDSPAQVSPVTAMFAPMRPYVK
IESELFLNSLNGFPIGSLASHVAKFFSTITMPLMGFYDPLSLEPQVNANI
GVSLATPLSIQGQVEYVVTPIALIPNYTFRPGFDDRTDCTAVPDPGLSCR
LDSIITNYNATNGVPDSAARYILDVTLYSRTGGTYYPVIDMPKVVIPVSE
TN
>GSU1964 hypothetical protein
MTVFVIESPFQVMNAIEAVHHFKFKDNVLIVLLSGLFSKESYERIIDKKQ
WKEVRYIPFRYKLTNSDFGLHPPKNVYERMQELYLTLDRSVKKKVINNMC
RSLGRVDNLVLGSYRRYYDIHMRHIANNVDHNRLFLIDVGTDTLEIGRQR
IEEHNAGSGVLGDEPAKSLKSMLRDKLVEWDGTGVDSLTFFTSYDFEVSD
PDRVIKNDYRYFKSLADRAAVSDCVWFIAQPLVEQEYLTHEVYVRYLAKV
KQYFSGRPVFYIPHPRESGRYIDLARDEFGFETKRFDVPVEYAVTVGGSR
PQCIASFFSSALENFAVIFGNRIDIKAFYLHVSDLLKNHESVDEVYSLYV
SKGMDIEVVS
>GSU1031 hypothetical protein
MTIANCRLKFFGIVPKLLLNTAFPDVSLKRPVVRQWIRLERLIVKL
>GSU1125 hypothetical protein
MLFAAILLIIIAVPVDAFAWGAGIHLQLGTAVINNLAAVNPAVAAVIGQY
PHDFLYGCIAADITIGKKFTHYLQHCHRWRIGMKVLEYAWNPPQKACAYG
YLAHLAADTVAHNYFVPYKIMGSFSTLTLKHAYWEMRFETFVDRDIWETA
RTVCLENYKANDALLRNVLSDTIFSFGTNKRIFNSILLVSRLEKWQQVLK
TMSDSSSYVLENGDREEFMTLAEEAVFDFLAKMEESRFYLADPTGERALT
AAEAVRKNLRLLYRSGKLSKDDAFAQVASLKGRLRQAIWEPQKLLQILSS
A
>GSU3080 hypothetical protein
MRYPTAMNLKKSTLFMNDQNVWHAAVAAITGQYHRLPQPVRAMLREKGGA
ICSAKEELHRLAHEMGSSVICASCGGECCLRGKYHFTVADLLIYRSTNAE
LFEPRFGRDFCPYLGDAGCLMQPSLRPFNCITFNCERVEGLWEPERIEEF
YRRERELRRLYNELEVALGSQVRQGLLMGQSD
>GSU0987 conserved hypothetical protein
MNADLPLIDGRTAADVVEKIRATAPFYVPEWSGGVDGDAGSALTGVFGDM
VVEVLQRLNRVPERHLAAFLEVLGLRLLPPRPAETVVVFTLAKDADRSAL
VASGTQVMAEKTTDHPELLFETDENVLAVPSRITALYSTIPDTAGGREKV
FGHTEAWTAGSSFTIFHGTENLQEHALYLRQTGLFTVRSGVEIHLSGVPR
ALADQVAWSYTDADGRETLFGARFDGAGGTLILSTGAGRPELGRCVVNGI
DGIWLKGVPKVGADGTTALARVKDAVLRTVVGVGTRSLPTAVIHPDVAFA
GDVPQDLTVTAGGDFIRNLLPFGDKPMPLAAFHLASREVFSKRGARITLR
ITCATDLDVPIERVQGIGAVFSARLRGAGIATAGELLSRSDAQVAGIIRA
PGSARPASSYLLRARNIREATAKAYYDKTGAVSYRSGRAAALGPGLSWEY
WNGTGWCAISDVTDSTGGLLATGEVTFTCPADMAEVEVSGQRNWWIRVRL
SSGDYGREFEIVNGQAVAAGFTPPRLGRITLSWDGTDPSAMGEPDSILTR
NNLEWDDGLAVLRTGAVFRPFRPLADLRPAFYAGFDRPLAKGPIGLYLDL
AEIDYARDFRPRVQWEVYDAGACEWLRLDAEDGTAGLTRAGIVRVVVPEG
GVPAPLFGTRLHWLRAVLVAGEYAPTALGAGLPYRLRRGYLNIPRIIGRP
AHLKGSILNPARWRSWRQLPSNSPPIARGIYLNAARALQLTSVRNERPGS
GNGLPDQSFTLARKPVFDDQVWVNEFGLISAAELDRLAADAPGRVSRVTD
REGRVSEVWVRWEGRDDLLASGPLDRHYGIDRSSGLVSFGNGKRGRVLPA
GTNNVRVSYRTGGGAAGNLGWGAIARMRTGIPLVDKVVNPGPCGGGVEGE
DISALYRRGPQSLRRRDRAVTVEDYEGLIREQFPGMALVKCLPVCDDRGM
TRTGWLTVIIVPRAADDRPIPSAALRRRVEEFVAGHGANVVTAPCHAVVT
RPSYVRVSVDAALVPRSLDQAPAVETAALAALGRFLHPLTGGWDGGGWPF
GRLVCHSDLYRLLEEIEGVDRVASMAVTAVDEMGRRMELGEADELTRPVD
PYLLVSSGDHRVAARAGDV
>GSU0764 hypothetical protein
MNYDGSEKSRLFCILKHTWSWLLCYTPKRRTNG
>GSU1845 hypothetical protein
MNRIPLGLLIALLILPVFPGYVWALASSNIPLDSPVYLYLDKLAGFGLIT
SDTKGIRPYSKAETARLLLEAEANLVKNEEQIPPLAFEFIKRIRELIPRE
ASLREEPEKAPLVDYTPVTSARLRYVYLDGQPRSYNRDVLDPANQSAFGF
IGGNLRPQPPGVVHQSGTEGTPLVENNEGVVYGDGHNLEFRWSMEGYLGR
YATVLVEPNALHTDETNRLSLQKGYLKLGSGGIELEVGRDANWFGPGYRG
ATTLTNNAANFDMVKLSSPEPVDVGWIKEYLGDFKYALIGSRFDATGSGA
DYRKPWFFGMKLTLKPKPWLEIGANLVRQEGGPGFTGDVSLFDQIFGGAD
NDHVNSIAGIDLRVRIPWLRNTEIYGEYSGEDSASFWPFVESYVAGIYIP
CLTPSGRDDLRFEYYRGSVIHYTDWQFPAGYVYKGMTPGHSQGGAAEEFF
IRYSHWFSARNSLALEYFHTERGQEGRLKVNSAGRYDPVNGVMQAVERKD
ALRAFWSLPFYGDVDLNVNYGWERISNVDLVLGNDRTNQLVTLALTYRY
>GSU2965 conserved hypothetical protein
MDVVDKDSAVSRRQVLKISVDAAGVLLSGRGLAVAAEAAGPIEKPVDRPA
KASKRFLASMNCSQAICEVYGPSFGVPADVAGKMGTGFAGGMGLGSECGA
VAGAVAVLGLKYGPDTGRTMVKVTEFIAEFRKRCGETSCSKLLKVDMGTA
EGIKAAADKGLFTSACPGYVKTAGEILEKMLA
>GSU1395 hypothetical protein
MKRLMGAVTGLLISGILVASGMGAEPSKAPRQKRQKGEPAARYDVNAAGS
KVPQAQRNVVKKREDAKKRRDELRKVRDANVRAAD
>GSU3011 hypothetical protein
MKYAFFREGAASKVIREPGNLPERMSSGLRGRGAEAIVNSSTVFEPRPLS
GGGVFHSWHRSFS
>GSU2724 cytochrome c family protein
MSSALHATDRRLTVKSTIVRFLTVFCVGLALWGCSSGSGDAPAIDPTTGR
HPANWIETHWGEFRKDRDQCVSCHGSYLDAGQSGGISGVSCFSPNRGAQV
CHATGPAGHPAGWGAADQHGRSGAMAAPAVAAGFAYCSRCHGALFDNGPA
RSCFACHTNAPHPDAPWHGTTASGTNHIFVDQGNAPECAKCHATGANSTL
QPTTPAPAGTPAGCFNNTLCHGAEAGHPAGWSDPNSANFHGPRAKAADSP
SGGFSYCRRCHGANYQGVGTAVSCFSATVGAFSCHNGGAPAIAPHSPAPW
RAARTHTNTDTTNAASCAACHANGANSTRQPTTPAPAGTAAGCFNNTLCH
GSDVQPPHPVAGAYLPSAQHGADAINRNNNGTGLNTCQPCHATPASGANP
RFNVPRGATLTAGCETCHTARTAHPTPWLIGRGTTQGVTNQLRHSTLAVL
ATAGTAVTNYCTLCHGANLDGAGGVAPSCVSGSSRLNIAGVNAVCHFNAV
AVKDSGTGLFNIQTGCVSCHGNPPVGSTFPNRDLQHTEHLFPNVTCGSCH
SGAGYATVLHGNGTANVVLSATFQDKDGGAPAYNAGTGTCSNVSCHGGQT
SPPWPSGQINVSTDCASCHAYGTTQFNSFSSGDKRHHTEITLVCSECHDT
TKLATTHFTNLQTSAMEGPASSTLLDSLNYDPATRSCLFTCHIGNENHTR
SMTW
>GSU2785 hypothetical protein
MTVFEELEQATLAQFEQGELPHWLADPALAVARTPECYAGKEYLVEILVA
QVREYDPYAETGCCKWAYDHEDIKRTLRWLDE
>GSU2736 hypothetical protein
MGHGKRIERAFTFDNRARQGNLLVRRPISGTCAQPSLMEDRVYVMH
>GSU2179 hypothetical protein
MNVLVCDLGFSSAKWIYGERKGRIISAFSYNGDNLLIGEDSLMASGSSYL
KTMEELVRYYPVFVEQCLKQAGCEGDILLAVGLPYSYWQEQHKPGGAVPA
LAKSLTDGAIKDVAVFPQGLGGLRDYLDGLPERPTGNVLGIDIGFNTIIF
THCCPIVSKAAPVVAKPLLSVV
>GSU3183 hypothetical protein
MGVNNDSNAGPSSGEQKVVPDKRTFRLKVVEYLSWDTCDENSRTFKLGHE
DNKPIGGRVFKIKMPDGSIVEKSTDEKGIIELTDQDANATFEVIFEPEEA
ALNNKYTLFYNRTAVVQTKLPGDTP
>GSU0192 hypothetical protein
MTIKVGNIVSHTGGLGWGSGKVLEVTATLAMIQFSDGKSRKIAASYFCTL
EPAAPDSYVPPPEAAVVAKPSRPPRMTKKK
>GSU1489 membrane protein, putative
MRASQSAIFRRIDPFAILSFTIPLAVYLLTLAPSVTFFDSGEFITAIHSL
GTAHSPGYPLFVNYGKPFTWLPFGNIAFRVNIATAVSASAACFAVYFLTH
YLLKREELVPDLRFSDTFARLCGLAAALAFAFSARLWLQSNHDKPYPLVA
FMTAMVFYLLVTWRERYDEGEMRPSHIYLGAFICGLATGAHQTIVLMLPA
YAWLILSKNWRLAARVKELALTVAFGLMGFAIQLHMPIRAMRNPLLNWGD
PRTLDLFLWNFLRKGYPVEPPDRNLALAWQQLSAFNLPREFTWVGLVLLL
VGIGAFFTRRRDEIIAYLLAIVCSLLVIVGYFNTPGDLIFLTEEFFTPIY
LLSAVFIGLGLFTLLRLALRNNPVEKLRTVPIKLMAGLLLLVLPSAICAM
NYYENDQHENYVAFDYATNTLRSLSPGAVLFTWGDSGAFPLWYLQGVERM
REDLDLLHTPHLVFDWYLDAFPHLFGRSVLRTMPERRQSSEVALKLAVAE
QIVSRPVFIDFSTRYSLPFAEYGLKQRGICYQLINDGGSVLRPPDLAVWR
LYASRGYTSQMGFRDLDTGKAILIYASCRMEAGEILLRMGYRQQGAEELS
AAAAIAPELRAQVEQVLAAYGAR
>GSU1295 conserved hypothetical protein
MYRRILALLMVLAAFIGNVHGATLELSTSDAPPYSTPDGSGLADRIVLEA
FRRTGVTVKIVAAPSERSLVNANLGIVDGEYLRISGIEKMYPNLVMVPES
ICENEFTAFARDPAIKLKGWESLGPYSVGFITGWKLLEENVAKVKSLTKV
KDDEALFDLLANGRIDIAIFDRRRGEAYLKHSRERGVRPLTPPLARRTMY
LYLNRKHAALVPILAGALSQMKRDGSIRRIMISALGTAD
>GSU1943 hypothetical protein
MKKLIGLVAGLLLMGGVAQASPLTFTDTTLFTSTGTLAAQDLVSYGGSSV
NFLEGTGDYVAWKHQFTFTPPAQQILSGRLTLTLLDDDADKLLNPLTWEL
AFGYAEDGSWALGGVNTGAYGYNVNVEFLKDGEFNVKLASLLGDFYITRS
DLTITYEPESAPVPEPGTIVLLGAGLLGLGAYSRRRMKRQ
>GSU3452 hypothetical protein
MVYRQQQAIDRLEDELKALRTQFLAVAPSPTRPPDEETPPPHY
>GSU2927 cytochrome c, putative
MKLAKKDWFFIVLIVVVVGVFWAISGEVRTKKVPLDTNHKRFYDAFAQGA
GKLDLDRQCVECHHEKPGGIPFPKNHPVKPADGPMRCLFCHKFK
>GSU3034 hypothetical protein
MNTYAEANKRQTPPGQTPEDGTLIVRAVLDDGASHAAELQRFTLTTAGLF
RAGNERAALDHYIELLGGFDMLIRALADVAFVLGVDFSTTPVGNVTLERF
VSQLNTLLQETMAAQQRKDWVLIADLLEYELAPHLEQWRELFAALGDKAA
R
>GSU0225 hypothetical protein
MLWYGYSMPYPGRIAIDGSQGRRLRFFAATAAPCRRRRQMPRRAKNPPTA
VRAQMVAVTISSLVVMAFLLR
>GSU1393 CRISPR-associated protein, CT1978 family
MLVIVVENAPPRLRGRLALWLLEVRAGVYVGKVSRRVREMIWNTTVQGID
DGNAVMAWSTNTESGFDFLTLGANRRVPVEMDGIKLVSFLPENGAQQAAE
P
>GSU0282 hypothetical protein
MLASGDFFRFYHVTWISHVKAEKKCGHGGETQELPRMLVRCRREYEIYVG
ESKGVCGLFCRRIGYLREGSCIPPRVRRRAPVRPRT
>GSU0275 hypothetical protein
MAEAAMRIRDGIRLNASMHEEIRRRTTSPDLWNRHHIMMAQHAAHTPDAA
GFVTSAAQPAALP
>GSU2642 cytochrome c family protein
MKRLIALAGLVTLTLAGCGGGSNAGNASSPVATSAHTPIWVTYHRFPTTE
SFSNNGAEALNQCKVCHGTNLLGAAEGAGAPACLSCHVVDPVPYPERCYS
CHGSDPVKPFFSWYSTMRSTRNGMPIDPAFVQRVRNGSLRHLKHDAVPPA
EMENPDACLRCHAPQTEFPDRPLPEIHHKLVFTVRDITGDGIPELVDCTS
CHVYTIDPELNIPIRAVRDCVLCHPTIVR
>GSU0938 hypothetical protein
MKMTMKILSLTTALVTMITGVALATPSTQVWIPSTDVQAFGTAHLGIDNY
LRVSKNPDGSRQPNTYDLGLTFGLLPFEKLQMEAGVDYMVNATSPGDSHP
FYLNVKLATPEGSLFGYSPAIAVGAYNIGFKEDVSDQNIVYGLVAKTLPV
VGRLSAGYFVGNSAVLNDANGENDNHGVLLSWDRTMSEISDKLWVAVDYQ
GSDSFMGALNFGVSWAFSKNVSLLVGYDIYNSSVTGGKNTFTTQLDINFP
>GSU1186 hypothetical protein
MTDSVPHGRRCRCCRDAIRQERGNLLKKANRIVRPSPWSRKEHVGAARER
TTDGLG
>GSU2478 hypothetical protein
MKVYFFDCATGVYQGEGFEDEAVIEMIEGVTVVAPPPYRKGEIPVYQCER
KAWTVKPLSPETETGPTPG
>GSU2125 conserved domain protein
MKRTVCTRTTAACLAAAIALNPVLSWSGWVDDWVQQKSSTSPSYYEGSKR
GYYTGGSFSARWANTDDHLFTASLPKLKSGCGGIDMFLGGFSFLNVDYLV
QKLQNILSAAPAAAFDIALKTLAPQVADTIKTLEAITDRLNSLQLNDCKA
AKALVATSASPFSSVMSDSLKAEMKTAQTDFMVSSGAKDLYQDVSKLFES
EQKANAGNKPGVPGTTQTSATSATAGCPAEVLQIFGDGSVLENLASKKGM
NSDYVKLVRGFIGDVVIQSPATTGTTYQAQYVPPCDKNNSFDSFISGSAQ
GRDTAGACADITDANKNLMVYVAGKMQSIAGKIKSKTALTADEEAFLKST
PLSVGLILKNATATNTESEVIGKLAELTARAYGYYMLLDLFGRAVQLQEM
ARNIMSTQKGNKSGASPETCQLALLGEGMQHIQTLEEKTLQLLTVAQQSY
ANAASEINAVEMIVQNMKKFDDTVFSELSDRFGKGIALRATGRI
>GSU0560 hypothetical protein
MGPIFNSLLGLILGDMALRNNYWKDSSITLLMLLTQSS
>GSU0495 hypothetical protein
MDHDLYASSGYPLQGGLPTPARRLMVAGVTMSPLSFAAGGFFMSTVGAFC
RF
>GSU2393 ISGsu5, transposase, truncation
MRFYTKTHKHYCGIDLHAKKMFLCILDAQGEVLLHRNISSSPEAFLKAVA
PFRDGLVVGVECMFAWYWLADLCRKEGIEFVLGHALYMGHKKGTGYFFRS
IFGLPLSRLVDSNPNSLHIISAHRSFPYGACLLTLRRSLSISLLVIAVST
YCVQFAGNVLGRAASILRLVPATRSGCEDHCQSLALAVIVALTGLASI
>GSU2734 hypothetical protein
MDLLSSFYLQLEACSGRITAAADITSFGRLPSLSQPLITFGASDQRGYDA
LA
>GSU0316 hypothetical protein
MAQEALCRKRHEGFNGSMVGVSLADLIQLKGTNRFSGCISVEFGGRCGVI
FFQDGEIVHAEKGSCEGEEALQRIMRWPGGRFTAYPHLATDRHSIRKNRQ
HLLLDAHRAMDERHRHVDPKGERAEQTPLRERIRAIGMVSDAITLDFDGV
PTDEKSPSADRLAAGACFLARLADRIGSRLGTGSPTSMVLEGTREHLFIY
TGKHHLLAVAAGADHQPPMVEEAIRRAVQTA
>GSU0062 TraD protein, putative
MDNTPFVRQPLDKRCVGWTIFNDLETTIDIQALAASLIPAAAGNTDPFWN
GAARDVFTGILSSLWRDNYRSNRGIWNSLTAPIEEIAEMIHGVAGGEATV
TTAGTRMVSW
>GSU1196 hypothetical protein
MIKAGITFMVLALVFYTYAVFSGRKEGLHAKHLLVFGAGLLFDYLGTHQM
SLYARTFGKAPEWHNLTGILSLAGMAFHFLLALAAAFARKTERINHTFHR
VSLTIYTLWCIAFASGAIAGMLKAAKH
>GSU2110 hypothetical protein
MFVEACGRLVLIRKIYGLVKVFMVKKTIRYMRISLMLTNDAIFTTRLQKG
GALLDDMRQLVCQWTEAVAKPESQAGEILHKATRSRVADTYRRAFLPRFL
KGSPQDAWKLARALEEAHPSIEIIRPFYYWITARAEPILYAYVTEELVEK
AKTSDRAVRTEETASWINNTLRSCGRVWSPTVQLKVARGILAALRDFAIL
EGAVNKRLVHGHLPIETFCLIAFLLNLMGIGARELVEHSDWKLFLLAGPE
VERLFFEAHQHGWLHYQAAGRVHRIEFPTNDFKDYIRVIFGTES
>GSU3337 hypothetical protein
MVTFAGTLRIRAVLLLVVACSWLFFGCTPRPAVSVSPDFRPTGGVETAYV
VPFASALVPELFSETAFNDFVDLLNGRRGETGIASFVILKEEVKEVDPTW
LDRQHYISGDIWSYVEDSGCCATSIRVKARAYLHEPGRKDPVAEIFVPLE
AFFDHDRSTVERERDRLARTLARELADRFAVILAPRR
>GSU2596 lipoprotein, putative
MSKKSSYIALLTIVLLFQGCAALTSNPYYKPSTAESAEGSADRGGALFIR
FGGTNSLTVNKPEVEVSVRSNGTGVYYPMAMGPVLPVIPTFMTAPDKAPP
TALLEIYLRPLRENITFNPMLVKIRNSKGIEVFPTSYINSHARAFTFYNG
KEITSETKEIVLNKGSRYCFVLEYDMRDSPDWDLHMTISGLRAEGKEVPT
PEFKFERGSAFYLIWGGRGKYCSQ
>GSU0554 conserved hypothetical protein, interruption-N
MSDLYHIQEPKLTFGYGQKLQDPRDGLMLFGPFTRNKLKGQVNIGIVGPE
PQRGYLREYLKRIHRPVTPETPDKARPSFPGLEAAFDIFINFDNLAEIDV
KSSEVERCLRFTDGYQRVHNLTNLYATPLEKYGNEEEMPVTVWFVVIPDS
IYTYGRPKSTIPKATDNIRAGLKKKDRDSAQSFLFDDMNKMQEAYNFEIN
FHHQLKAKLLSAKVVTQIIKESTIAYDQIWSDEENILSARKFDTAKAWNI
ATTLYYKTGGIPWKLGDVREKVCYLGLVYKKTDTTESNTNACCAAQMFLD
SGDGMVFRGNLGPWYNPTTKEFHLRKSDASELLNQSLVAFKEKSGFYPEQ
IFIHARTNFNDDEWEGFTGAAEGKTNVVGVKIRSDGTFKIYRDHPYCVPR
GTVLQVSDKQGYLWSRGFVPRLQTQMGLEVPNPLSIEITKGMAAPVFLDT
KMR
>GSU3338 hypothetical protein
MIEEGLKGDEDLGYRIFSARFVEVGSRLHPDAGGAAP
>GSU1001 hypothetical protein
MGVSRECCRRWGKAVVAQVVLFVVAVVAIPALADSIPAYLASATSHDSPA
IVSPPPASPASQSENATFFSISASPTGDVTLNCGGVPVEIGYTDLAPHDE
IREQSQRITRQQDPLINGFSVKVTFLF
>GSU2709 hypothetical protein
MDNEKGRHESRPFSLSEQKNTGTTYVMPVLKWYDLSVRLR
>GSU1406 conserved domain protein
MELDKQAPRSEFAVYLDGEIIFSRLERGRMPEPLDIIPAIRARRHGTSG
>GSU0262 hypothetical protein
MAVQKEGHRAVKLEEPLAEDYDLDMLEGRRMGRSAL
>GSU0076 hypothetical protein
MRKTEGPFTREIACQQILMEDSSVFSVQWTTVPSDLRPRLSAEFLLERYL
AYIRRFTLTLIRPMVAADGIAFRLAGTRRSLILFTPPARREGPGHESLSL
RICGGFLVQGRQCDRGELSFMLDDDVSGVRLTLRLTDYCPLLLGSSEPSR
LRKWLYRFTQAYIHKVVTVRFLARVYADLAGGGGCVRVVRARVREGEDL
>GSU0532 conserved domain protein
MPHRINGTTKEKQLNKLIVRRNLPRDIRWLHLLNHDDLRVPDLEHKAERI
RPAVPFIKGDRHLTATEPHLFVADIGKPVKVARHPDLIFQFH
>GSU2034 hypothetical protein
MSLVIIMVVLYLVTQGTRLSSAQKRYQNSLEATYGGLDLTMRDLLPELVR
TAVDSPTLASFQTDITTIETRLGAVDLKVSDESDCLRQKLSLSTDQWTSC
ADEQRTENPKIFPDMSFVLKDTGDKPEFKVNAKIIDTRQGITNTSNLNLF
TGGVTAGSDSGIQGPGTVNPYLYTVEIEGERESNPNEKAAISVLYAY
>GSU0669 hypothetical protein
MQALFAFDRGIRRPRTLLVPPPRIAPPPGTATRWLLRA
>GSU0078 hypothetical protein
MEGRKSKRLSVEVGCWLVEMDGATCLYTFDLSDEGVAVITEDPLPVGRSV
ILQFFTPRSASAVTLKAQVVWSRLEPEGAMGLRFVEMDVRGRETLREFMR
LLLEQRQRERPVR
>GSU0061 hypothetical protein
MQQPTASVVSYVAEYHKATETTMGRYKKVIEITGHDEVAAKLLEGLIDAG
TRYFSKVVEMEHRMASARFRLDGEELRELTETLDRSRRLAHESLISSLHV
FNRYIVKEYGEELKEAGIEGGIFPKPEANRDRIAIADWAGELLTGIYENR
HR
>GSU0835 hypothetical protein
MKCPICNDRTTIEIVMHSDGYADNLLECTRCGTIWIANVDGIILLNNNVV
SG
>GSU2208 lipoprotein, putative
MTRSLMTRFLIAGLAALVLAAGGGCGYRAARIGGEGSALADKTVHVDIFT
NKSYRPNVEAVLTNSLIDEFVRREGSRAVEDSGELTLSGAVVSYGTAAVS
YTATDTVKEYRATVTVEATLRRTDSGQVLWKGSLSAYQDFPANDDLVLQQ
NSEEQAIAVICRRLARELTLRMSENF
>GSU0717 hypothetical protein
MMKPVYLFFLLVFVVPGVTGGILAANRARNVFVWSLLSAVFPIFLLVIYF
HKPLREVEGKFRRCGGCGEFIPWRETACRYCGREQPTHP
>GSU1857 hypothetical protein
MILLAAGILVAMPMDRDAEAVVSKAFVTVGLTPSFTGVGSISFLVNNPAG
TSYLSTTLLNQAASPNLVETTVTDANTISFFWAEVISGINTTSLPLFEFE
YTVANDQYPAFSLGLDQNAGMGIYDVNILPLSTSLSSFVLATRYLTDTGV
TQYPVTVTVSGAGSGTVTSNPAGLTCTSGPCLYGFDANSQVTLMPTPSSD
SYFKAWTGSCTGTGNCVISSLAAGATVGAEFAKYQAIRIAGTTSYFDTVQ
DAYNTATTGAVLEAMATTSASLGALTMNTNKTITLKGGYNGDYTPSTTLT
TTVGTITVAAGTLIAQNIVVN
>GSU1553 conserved hypothetical protein
MRVFDTIQQTVSTPLVSYEPGGRVLAISGESYPENSFEFYAPIAQWVKAA
LADPEGLTLDINVSYMNSSSTKCMLDLLDLFEDAYGRGAAVSITWRYDRE
NPRSLDLAEEFKEEVTLPFAIVELEE
>GSU1362 hypothetical protein
MDLIEIADVREWLSQFSEADIPIAISMLRSLNLVSHNAFSAGIENSIKTI
VKKHGKAAVFPVKRALVKKVPGKDYRHLYQSADAIGYLLTNLERSMPRSI
SVEPTHASMKAEKVKHIVFIDDLIGTGKRVNSFWQEWANKSYKSWLSFRY
CELYILSYAASRSGIEYLLNEIPYLKPENILTSISLPRVSPLANDEMKRI
CNYYGEKTAKSVVKFGYGNQMLNVIFQHACPNNAPSILWSPGKQWKALFP
NRSIPASFHKVFGGTFPFKIADSLWDIHQPRLALEILDRLESGRIQRETV
ELLTVLGFLARGVGLIRIYECTLLEKSRVDEAILILKQVGLVTKNFEITS
FGKEVLNRFKATDKRKYELVCDQLLYYPHQFKGLQRKSSEGPT
>GSU1015 hypothetical protein
MFGLFKRKKADPAIFDSAEAAFDYACRNLENRILLEAVIPALVEEFVRVS
PEGERYFRIRLANSQGGQAIEACTLKESSRAPSVGDLIGYKVVRYDPDMP
PPFDLLGFIAYRFEFVHVPGKGWRTAESFIPDTIKPTLRM
>GSU2479 hypothetical protein
MSRPVGTEKTIQQNAARHRLHCPLTNAPVRNAIRAGPGEKP
>GSU2748 cytochrome c family protein, putative
MEPATRRSVMPVYEFYCADCHTIFNFLARRVNTTASPACPRCGRPGLDRR
VSRFAITRNRPDEPLDGMPDLDDEQLERAMRSLAGEMEGLNEDDPRQVAR
LMRKLQEATGMNLGPAFDEAMRRLEAGEDPEALEEEMGDLFDAENPFSRE
GIKGLKRRLSPPEHDDTLYRLPVGDEEAGTP
>GSU3214 cytochrome c family protein
MVWSSGTLRHLVSLISAFALVAAGSIAAEAAISRGVCTFSTQLVDPEMRA
GYIGMVATAADASYSTRTAAVDAVEDESEGFAIFGTLLDAGYHEPVASRT
DTITTACLGCHDGVSAPPVEAAQRITRVSTGGFTAQSVQTASPMKTHPMG
MNYAQVAASGRGFRDPALVEQSLMLFDGTVSCLTCHNMLNPRPMHLAVED
YRSALCLTCHVK
>GSU2138 hypothetical protein
MSKKTGAAIILRLLQLNLFERRDLESLFKPPKVEASPQLVLL
>GSU1715 hypothetical protein
MSGFVLKIDSCGGMDYITGIDVDDHSVWMKQGNVCRYIEMANLPETTRNT
MAGFNHRIKLLVERKYTELLKDIEPEIHIPKP
>GSU2255 conserved hypothetical protein
MARPRLYIDEFVGITNRLETLPLAFAIQKEYGHEIILDWSELDSFSVAGT
RRAKLRFWQKPGALRVRNCDAAQFATFAGKKIILRSLDGPSDKLDPIYLD
VASRVKLAPNLADAIRRIFARVQGRPVVAVHIRHGDYRIVDESRYDPLAC
EWPAVPLWWYEAVMERIALKQKDVSFFLSCTGDPATYETLHRRFDVFTLD
VTSPYGYKGPDHDSRVNPVADLFALACCPVMLATPMSGYSHWAANALGIP
AVCIIPRAGATLDNPLMGSLRLYGKRLPAWRAAGREEGAEPISDAFEGID
LTGTADTSWL
>GSU2801 cytochrome c family protein
MRQRSALFTILAVVAALLAAAGAVQARSPVFDVDKEFYPYYPSLIKWNKS
TVPFNAPEVCGSCHEQQFNEWNGSVHSLAFRDPIYQGELNKAVKAVGHSI
SRQCEGCHSPVGVVTGEIKGPGFQGLSSMAMAGVSCDVCHSISGVTHWQT
PSHEPENGSFILTPGVETANGQVLVKRGPFKPSPECGGGFHQCEESDLHL
RADLCASCHQVYHYDAHFPIEATYLEWKHGPYAQKSILCQDCHMVDLDTF
KRSADQFVIPDRKEYRHYFNGANFLLTYLAAGAAKKAGDEELARNLMRQY
EMAIQRLKMAADLEVTPVYRSGRLAELKVRVKNIRAGHNLPTSLTNVRQM
WLEITARDEQGTVLMTSGTLNPDGSLPAEARAFTSDGMGSDFHFAIDPWV
VTAFSKHETIPPKGWKDVHYGIQVPEGVGRITVEAKLRFRQADQKVAEAL
LGAVPKDIDLEQIYGIKSVPPLPVVDMVVKQETVKTAP
>GSU2321 hypothetical protein
MKALLTTICLIVLAVMLFVTVTASIHEDVVTAARLLLPDPWFRATLADAY
FGFLFFWLWVAWRERSAGRAGLWFILIMTLGNFAMAGYLLWHLRRWEPAD
GIGKLLMGERHGNGR
>GSU2148 hypothetical protein
MIRRISAAFAGGALGAFIDSFNIWFMGKAGISDLIGLSMKPEFTAPWLYQ
RMVWGGIWMLLLILPVLNQRAILRGCLFSLLPSAMMLFLVLPGMGKGTFG
FGFGTITPAVVIGLNCIYGLVASLWYERAIRNSDGMKNG
>GSU2169 hypothetical protein
MSKKTGAAIGNGAFYRYKNKKATTWAASIQTGRVSGLKMLAGSPAGIYGK
NLEFATDNVVPVQRLLKFLPAQREFPTVDTCSD
>GSU0039 hypothetical protein
MSFMGLFMTSSRKIAELDFFRLKNIGKTLLPMPT
>GSU0852 lipoprotein, putative
MHAKHLLLALMLFLTAGCAQTAVSLRPAETYPNSYEFFDFAFSWKTTIGP
DGATIDGVAENRTYYYIRDLELTATLLDEGGTPIGTGSFLFFPNQLAIGE
RADFSIPVRAVMTAKPKTIRFFYRYYLSEERMSGISRFHTFYGSP
>GSU1361 Piwi domain protein
MADNLSQLAAHSTIPEPLLLFKDNRTDTHPLRGLSQYGPYSACFNLPGQV
RLAYLAPTEHMRKLDAIVRELQNPATPKEATNYYVEYGGFEKVFKVPLVM
PQEHLRCLALDECHGVAANGNGLALADKIVQSMSGLFRQKHAFDVLLVYL
PASWKKCFEYDGFDLHDRIKAKVAPLNLPIQIINDTALTRQCRANVMWGV
SVALYAKAGGIPWKLADWDKDEAYIGLSYAIKKNAEGQEYTTCCSQVFDP
DGTGFEFVAYDTREFITDRKGNPYLSYQEMQSVLSKSLHLYQSSHNGRMP
RKIFIHKTTHFTEDEIQGAFDSFSSSTEIELVQIIQSTNWYGLKVDGKKG
DKPVAPASYPVDRGLYQPLTESECLLWTQGSVMGVNQQNPGQPVFKEAAL
TPLPNPIMLRRFSGNGGWHATCSSILALTKVDWNNNTLYKKLPVTLVYSQ
VFADVVKQTPEIVNEIYDYRFFM
>GSU0833 hypothetical protein
MTPAPAAVTVATARTINAIIATTRTSMACLDGNGGVIACTRPLNVVFSGY
FRRRDNRAGEVKKRRTPEGRVQRSAPIAY
>GSU0959 hypothetical protein
MQVTDKAYTEHRVFIELERYAEFYEQLAMSVFLFTTMGTKSICNIDTYVY
SSMQGTVESIKTILLAGRINDAYALLRKYYDSAVINVYSNLYLRNNFSIE
NFVVEKIHNWLQGKESLPEFRVMSQYVRTSEALKPINDLLYVDDRYKRLR
ARCNDHTHYNFYRNVMLNDNEIYIENRGWWLDRFAEDVRDIFVLHLGYIF
FLNDHYMMSSDYIDALDCSMQPEEDSQYWVASFIQEIFNEVVTPGRPDIT
ATIKANSAMQLS
>GSU2298 hypothetical protein
MEELQALQVRLEQKIREMECSPTVSSDELLAKNVRHLQDLSANTRAQRQS
MADFEKFVTWMSSSLAGYEKYIQAGSVVAGFARVLPIPYAGQASVLTKFV
SQGVLSLNAASGSIASYLKTSDRYLAMVDAIDPVRPDAVKVSAAARYADG
ELLRAMTDVQAKLRTSADVSASTLSFLETLDHYAGNTDEYWAKTKAFVTR
NDAYKNDKSYLAASIEGLRNRAASFNGRLRLFEESAKKDLPLIKNLVAYD
DLIRELDRKRASGDGGGAAAQAVK
>GSU1217 hypothetical protein
MEEPVYRFSFLSVAQVHSFAMDQPVSIVLGPDNMYWVVPDAMVGELHRRG
FQFFR
>GSU0056 conserved hypothetical protein
MEKSTIFAKMHSLSGQFSIRGYAMGRLAKENILDLSISERIQLVEDIWDS
IASVPESVQLTDEQKIELDRRLDAYHADPGKGSPWDIVRERIRNRRDV
>GSU3141 hypothetical protein
MTLIRLTLFLTAVLATTHAFAARQATPVAESEVRLALESILDLWRDGNYD
ALYERTSITGGGSREAFARRLAGASRRPACCWEKMQEVRVMVRGTDSVSV
RASFGLEGGPGGTNHVTRAIRLVREDGVWKASRADILALAGEKRATYRYV
KQEGNAGP
>GSU2160 hypothetical protein
MNNTTISKVLQGLVLVHLSFSVWSGKKKLRPEDLKGANLPPDKLASLGSK
RIFDPDALKVFATLKRQAERACEEVGVRFLGGYAIPEEKLAPLMEKMEQV
EQSFATAKTNFLSAYDDNLKDWLAEAGEWSEIVARAVEPANRVAERLSCG
FTAFKVEAPGETVSAVQPLERQVGGLADQLIHEIGALAKSTWEESYRGRT
SVTRKALRPLKAILDKLSSLTFVGPDKISGLVANVHAALATVPRSGPVTG
ATLMGLVGVLCELSDIAGFITTEEREEMLQPLPEPEVEPELSAPLPPKTV
TIMPQPVQQIEAPAVWFW
>GSU1381 hypothetical protein
MIRCGYCGHEFAEEEGIRSCGKCGKPGGCRMVRCPKCFYENPPEAKAPKT
LRKVIDLLKK
>GSU3039 hypothetical protein
MPFTPRLSAFIFGQRAKRRLLVHVSAAAGKALEKKSLRLHFFRKKLPSHE
QQEGSSKITAGYQITPP
>GSU2712 hypothetical protein
MVTSPPFMLYHFKEAMKRIQPWFKIFSAIGERACANGSPHY
>GSU2014 hypothetical protein
MLCQTVLPGTECTFWGKQGCIFTGGSCQNVVENCEGCERIVEGTIGKVCS
AYPAPGKKWVGGICNFATHVKVEIKSEDVRVNPLKASKKASGGKKK
>GSU2359 conserved hypothetical protein
MERFICIHGHFYQPPRENPWLEAVEIQDSAHPYHDWNERITAECYAPNGA
SRLFDDEGRIRDIVSNYARISFNFGPTLLAWMEEYSPQAYRAILEADRLS
IGWRSGHGNAVAQVYNHIIMPLANRRDKETQVRWGIADFERRFGRFPEGM
WLAETAVDVETLDVLAAEGIRFTILAPHQAARMRARGGRKWHSVEGGHID
PTRPYLCRLPSGRSIVIFFYDGPISRAVAFEKLLDHGERFARRLLSGFDD
GRHHAQILSIATDGETYGHHHMHGDMALAYALSYLEDNGLARLTNYGEFL
ELHPPGHEVEIVENSSWSCIHGIERWRGDCGCNSGGYPHWNQQWRGPLRH
AFDWLRDELAGPYEMAAGALLRDPWQARDQYFQVIADRSPDRLESFVNRH
ALRPLNDDERIRVLMLLEMQRHLLLMYTSCGWFFDELSGIETVQVIQYAG
RAVQLAQELFGNGSETSFLQRLAAVRGNNPDYRGGAGIYEAFVRPAMIDL
VKVGAHFAVSSLFQEYGEETRIFSYRVKSEDVRLIPAGQTRLAIGRVWVG
STTTGEEARITYCVLYFGNHALNGGVRIFRGNEAYEAMVIEIARAFESSD
FAEIIRLMDAHFGMHNYSLRDLFRDEQRRILGMVIGGVLTEFEDRYTALY
DNYRLLMGFVRQTGMPVPYRFMTTAQLALNLKLDRAFTKPGIAEAEVGEI
LREIADWEVEIERIDLEFQIRRRLEKAMTCLRDNPLDSTILTEAAGLLEL
AGHVPIDLNLWLAQNIYLEIARPLWRELTERSRGGDTHNDWWFGQFRTLG
ERLNFNANAVLPPVPEA
>GSU0711 endonuclease/exonuclease/phosphatase family protein
MGTVKITSWNVAHLERLTRAHPTAKERRRRDAVVREIRELAPDILCLLEG
PAGEAGIDLVANDLLDGEWVAVKAADGNYATRGDQWIWFLVRQDLAARAS
LLPVGTWDAFAGVSWTCHLWGEFTEGTHRHYRHPQVLILDWNGLRVEFIG
LHLKSKFINQGERLWRDERERFIREALTARIKMATEAANVRAYIDAKFRQ
VGNPALFVLGDFNDGPGKEYFEEQYLFFDLIANIQGNVFATSRFLNHGLF
DIPDELCWTVRFRDFVQPDRRPEILLDHILFTQGLVNGSLPWQVKPHAGK
VEHEVHELANAGLPKDSQTSDHRPVSLLVTTGPNP
>GSU2711 hypothetical protein
MPLSVIEKSEITDSAPFFAVAAFFRCPTSHRGVTNDHPVCAMQQQIPA
>GSU0362 hypothetical protein
MAQVFDFTRLRRLTIIHVFVQAFLLILLVGTTSVLLGKVPSQVFMNSVIR
VVVLQLILFYPVYKLAASDAEREVASAVTGLPAEEMQALRRKRVFSDILK
GALIIFFFTFILRAPGAPQIQFSILAVFLLSYLAYFQCFNSVAKRLMRAK
S
>GSU2584 lipoprotein, putative
MRVTGRGLRLFLAGAALVAGGCAHYPENPRLAAVSTEAGYRYSVVRPRPT
ADKPFVLLAFSGGGTRAAAFSFGLMEELNRVEYAGRDGARHRLLDDVEII
SSVSGGSFTSAYYALYPDRFFDDFPGTFLYRDIQGGLVRRLFNPYNWWRL
ASPDFSRIDMAEEYYNDTVFGGKTFADLVAAPGKRAPFLVLNATDISIAH
RFEFTQDQFDLLCSDLAGVSVARAVAASSNFPVAFAPLTLTIHKEPCGPL
PEWIGLGLDRESNPKRRVADAEAGRSYREPDRLYAHLLDGGLSDNLGLRG
PFQAVTTTDSAWSILRYANLNRLGRLMVIAANAKTTKQRSWDARSAPPGI
DAVLDVALNGPMDDVSFDSVEMIDGHFQQMRQLARTVDSCNRRLGQECPS
APPIPNPIVTEFTFAELTFDDIPDPRLRLCLQALPTSFALPAKTVDLLRA
AAGYLLMGSAEFVGGMQRLDSTWQPRPVAIDPALVDEVCGPASDGGS
>GSU1652 hypothetical protein
MEESQKQPRLCSEIQLFDLCEVGAADVCGHRDGRYCTKGDLLARFEAIAD
DDRSPEQYFAEELDDAEGADDLGYDEAFGVDEYGEEDEE
>GSU2503 cytochrome c family protein
MKRFKTLPVTAAALLVLGATGTGWAFHSGGVAECEGCHTMHNSLGGQQMN
NATAQFTTGPYLLQGTDQSSSCLNCHQHAGDTGPSSYHISTAEADMPAGS
PPLQLTPGGDFGWLKKTYSWVPRAGAATEWSHGERKGHNIVAADYNYVAD
STITAAPGSMNTPFPANQFSCISCHDPHGTYRITSLGVQAKTGKPIRSSG
SYGAVPDATTAVGVYRLLGGAGYKPKSLSGGDAFMFAAPYAVAPNSYNRS
EATSDTRVAYGRSMSLWCANCHGQMHTTFGTLVHPADQNLGPFVAQIYNS
YKKSGNLTGAVDTAFTSLVPFQSDNTHDLNALKAQTTSTAGPISTDRVMC
LSCHRAHASGFDSMMRYGLGNEFMTVADASGNPVWPDPVTNAAQAQGRTV
AETQQAYYGRPPTNFAPYQRVLCNKCHAKD
>GSU1283 hypothetical protein
MSLVTKPILNARERDNCMKMRYFTAAMLVVAGVFLASGQPAVAGHAHHGH
DGAAESPVAAPADAVEYPDCVHCGMDRVKFAHSRMLIVYQDGEKVATCSI
RCVALDLKTKKNKPVASFQVGDFTNGTLIDAEKAHWVIGGDRRGVMTKTP
KWAFADKDAAEAFFARHGGTPATFAEALKAATDEL
>GSU1360 hypothetical protein
MDVLTDNEFYQHYLQNSQHMMWFLGAGTSRSAGLPTASDIIWDLKHRYYC
LHENQDYQKHDINNHAIKSKIQSYMDSKGFPLQWSPEEYSFYFELVFRDD
YEAQRKYLLEALASRKVSLNIGHRVLAALLEMNQTKVVFTTNFDDVIETA
FSDISGKHLSVYHLEGSYAALSALNTEAFPIYAKIHGDFRYQKIKNLTPD
LQTNDREIHKCFLAAAIRFGLVVSGYSGRDENVMTMLRAAIDQNNAFPHG
LYWTVPSISKSEPAVQDLITYAQGKGVRAYLVETGTFDEMLSKIWRQVKD
KPAAIDAKVRTARVCPVSIPLPGPGKSFPALRTNALPVVTQSIRCGVVTL
ASPITFSELKERISQKSPKALLTYTEKVLFLGGEPEIRKIFSNDEINSIG
QYYIDEIAQSVAASTFLKSFVEEAILTALLREKPILHRVRHRTHYAVIPN
ASAKDDRFLDLRKAVGFKGDLGYITGNVTNAKELSWAEAVSIRLEERGGK
LWIMLKPEIWIKPLDRREEATDFIRSRRRYRFNQCSYQILDAWIKILFGS
IGGGGTVNISCFPDAEFKAEFEIGTRTAFSLGVGYGR
>GSU2349 hypothetical protein
MPFFSMVTPFYSGWSHATAGAVPGTLKRRSAHRTQIMAFLHGTGVEESYA
>GSU1978 hypothetical protein
MRSVSCTKLLAVSVLLAVTAFLIHGRSQAVTANGSGAPLRSEFERVAGWN
PLGDQPLEPRVVEELRLDDYLYRSFIRDGAVVTLYIGYYHTAGKVGAAHD
PLVCFNGQGWRITGRSKGQLSLAGSPELRVNYSSMIVERDGQREQIVYWF
QTNGKTAATTLAQKVHMVQDRLFGSGENNAFVRISTPVAGDSTARPLERI
THFVEAFYPSFHRYAAGAGPLETR
>GSU3163 hypothetical protein
MMVRSLTAMLVLAATVALTPALLHSAPDKPGRTAESRNPSVVIFVGEG
>GSU2640 hypothetical protein
MREILTQLNAVPGVVGSFVCGGGGELMVHAFPPLFDESIVRQASAAVADR
NSGVRAAAGSFDLLDFRYNDGRIVVKPLSEAFLLLLCTKAINLQVLNISL
NVAKGKLDALIAAAGQVFAAPASVQASAAKAVDGVLTLPACHIADSTVGS
SFEQLGMGALNQTTAQQISNFYKAGAVKKLKLTSTASGISAIFPFMVVNE
NDAAYDGKIILCRAIEKKLKVGPGDHLTVEIP
>GSU0038 lipoprotein, putative
MKVLFRLVVVALTVMVAVAGCSKKEEQKAGGDAGAQHGQAAKKESVVVVP
DNVKGKWKSVKIAVTDKAANKESVYTINVGAELAIPESNLTIAVDNFLPH
FTMDGTTLTSQSNEPKNPAAQIRILEGGKEVFKGWLFSLYPTTHAFNHPK
YGFTLVDFIPAN
>GSU0161 hypothetical protein
MTLAQSTFFRLKRTPYIRIAALSGISFIQYETGGFIDDGKDQ
>GSU0996 hypothetical protein
MNMQQVKEIAKQRGIKAGSMKKAELVRAIQSDEGNEACYGTNRADSCGQD
SCLWRDDCN
>GSU1932 hypothetical protein
MNIEFDQEPAGREPENKGGNTRVLLLVLLLLVAVFGYLYYFTGFIKPREE
VKAPAPVQPAQVKQPIPPRTEAQPAGEGAKDAAKIESAAAPKTEPAKPEA
GKAEAQKEPAPKAAPSKDAAKKPEPAPAKTAKSAPAPQAVAKTAPAKPAP
APAADKAKAEAAKKAEPTAKPAANKGAKPEAAVAAKAAAGAPGKKKPAPV
EPATAAAKAGDTKGAAATARKDKPKAAAVAKSTATVSKVAATGPYTLRIG
EYVVPSAMDKDKDRVRNAGLTPVVKEGSKKKEPMVRLLLTEAADQEGARR
ELARLKDATVEGFILNEGGTYRVYAGSYFVAERAQREQARLASLGIKVAL
RKSSVSVPTLVLTAGMFETKDEALKAAAALKKAGLKPVLAGAGK
>GSU1192 conserved hypothetical protein
MDKRRFTRVPFAISAMVTHGESSFTGEVADLSLNGMFVKTGGEKPAIGEE
VEVVISLSEGEPSLNVELAGEVVRADANGVALRFSRIEVDSFIHLRNIVE
YNTPEPTRVMDEFADYVEDRIGRK
>GSU1169 hypothetical protein
MGRIVVIALLLALSGTAAAMTVGRFSAGDLSGWEEQTFSGKRKTVYSLVK
DGERTVLKAESRGSASGLIRTMSFDPKTQPWLRWSWKVAGTLTKGNEGVK
EGDDYAARVYVIFPGTFFWQTRAINYIWANRLPKGASIPNAFAPKNVMMV
AVESGAAGAGRWLSEERNVYEDYVRLFGEEPPRAEGVALMTDSDNTGGEA
AAWYGDIVLAEK
>GSU2934 cytochrome c family protein
MLCIAACLLAAPVAPWPADAATGENCRVCHRDVLTGPHAGIGCSPCHGDD
RATVRNPALVADRAARCVACHRGYDRLFDHPMTTRSAERRFVADTLGRAD
GAFFRNQCGGCHVTGCLDCHPGGGHALARPRDAQCLSCHRGYFVGTDYHG
RAPREDALRYQRGELANGETFLPMLPDVHAEAGMGCADCHSMASLAAGKR
TAKRCTDCHTPDPRVVEHRIAAHQDRLECVACHAAWAPQEYGTFWLRFTN
SPRLQSRFQVVTRQDDWVKSAYLKRQDAPPLALNEAGRVSPARPQFILYH
SDFRNERVVGQENRLLAARWKTFAPHTIRRETVMCDGCHDNPRRFLLEPP
HERIYRTRADGLSLDSFWDRTGQRIANGTFMEPERVRRMGVRTPEYTKGY
LERWRDLTGDAGGSSRR
>GSU3242 hypothetical protein
MKKEEKKPVKKAKKAKKAKKEEKKEEAAAPAAEKK
>GSU3332 cytochrome c family protein, putative
MLTRVFTLLAFVAMLSLASGCAMLMSWKAIPPPGGCDQCHTVPISANWQV
SYRTVALTDERGSDRPYFQTETYNVPPTDRPRSSLDIRKVDELPCFECHK
SPSPAHKGRVGKFHH
>GSU0966 hypothetical protein
MSEKELELLARIEELERHTWRNSAILRAVAMVAGTAMIGIIVASGVVGGG
HGSHWTGPAWNTMWLYGLFAANAVSMFWTTHKD
>GSU1064 hypothetical protein
MQQQARLAMTAMERDLRMTGYGLSDVGNLKIRFYNKGVGTVTDNYYAVAE
ANVFGSPKVAVPAADTDSIAIRYCDQPSDAMPDITLSGSVPPPSSSNTPV
SSTDGINANDLIIIYDPTDLAKYGSILMVTDTTSAGGLSVKHGNGAGDGP
LASFYNPSNPAVNLFPPGGYPVGAKVLRLAPEALRTVRYYVDAGMNLRRQ
SQAGLTGAVEERPVASGIEDFQVKYYFRDGTWLDAPVTDDPDHDVDKLRA
VRISIIVRTAKPDRQYGAGDSIRLTGDDGNGVARSGGGYRRMVMSTTISL
RNLAVRDM
>GSU1191 hypothetical protein
MVRHRVRDGRRCNRSILLVKGRRESFYQVELMV
>GSU2295 hypothetical protein
MKSDGKRTQTFVMILVALVVICMATAVGILHLASLADDGLLAGQAELFSF
SVLPTVHARQLSASGRKEPEAVYTMDLVYETFEGTDVGQKTLDDLCRQLE
IDPRHARYKLQQRGVKIGPSETLGDAARRYGVNPVALLQALLVGEKVRSS
QRRTEEVPRLRFMP
>GSU1472 hypothetical protein
MFLLPKGNPLYENIAASKVKIPDMFEKLKSSSFTGYLSFTFPSSIAIFFF
EAGKLISAMLEHDGKRLTGFEAIAGVCTQIFGGGGALSVYKLSKDLTMCL
HAMLHGDVLYQGQELKLIDIKGLLEKMKAQRLNGCLRIYTEDRTALIFYK
EGAPLGFFHDGSHDIETSASESQKIAGLPGSKIDVLSTKSAEELMHYDLL
EMVNIAKLWESTTARFSSEREKIRKEAEVVGQQHADDALKELEDDLKEVA
MAYVGKMGASLVEKELNDRGGRRALTEGSQVGAFLAGVEKGAKLLTSISK
TREMLETMKNEIAERTKSLL
>GSU0166 hypothetical protein
MGDKFFFSIPFVLAIFWQIQKHAYATQFQFLM
>GSU1248 transposase, ISL3 family, truncation
MPINSTKRVQTLPDQGRGIPSMTEAALIAATLGLTAPWEIKAITFADTER
RMDITIDFYHGGTFRCPRCGAHAHVSESVQEIWHRENFFHYTAYLHVRVP
RLACEKECGTSNAVLPWAQRNSLFTLIPLVLGV
>GSU2300 hypothetical protein
MAALFHFRTSSTNPLPTRKAATTGITLPKHHYQRPFQC
>GSU0675 hypothetical protein
MCSGIIRIVSLLTLVAALLVCSGIAECSVFATDMPIADTCCPVQPQEQDT
DRDNMPVPCSDAACACPSCLVADEHVHHAPAIPHAEGSCGAVAVRSLPPS
PVAPPIEYPPESA
>GSU0640 conserved hypothetical protein
MVREGERVAIFNSIHRVMKAEKLLKGQGMVILLIPAPRDLHADCGLAIRY
ASADAVAVESSLREADLEPEEIHEKTAVGYSRLP
>GSU0105 cytochrome c family protein, putative
MVWLKLPRLRMNMHLVLAMAMVWFLAGDISLAGAFECNVCHSKNPAMVKM
HEAVRGRGCFGCHKVGERLMGKGQPKDKASLMERRVKDAACLECHKVKPQ
AGE
>GSU1850 hypothetical protein
MRLENIMKALNKYVRLATVFTLLLVFYVFYYPSNANTNDVISVKIYPNYS
QQTLIGWGGNIYPFPVDDDFMKYLLTSLPVNHLRIFSDFPDVPGEIEDKF
LSYINSKGINIILNFRLNDRTLTINDLPKKICDYIEYLNKRGISVRYLTL
NEPDAKLSSKFTFDNYVSLVKLIKTELLLRNLSTEMVGPDTATPNGRFIE
RLAEVGALSYFKAIAYHSYEYKGLDGFAAIANTAKFYDKPVWITEQNYDA
VGSNMYSNTYAANNIRNLFWGINKGNAELCLFFSFAWGRDGGVVLFDKGK
KVKPIFYELQKVFNTIPKGSVVLKTDSEIPVLAVRYNNKIIIMLYNDQNT
DKNVRISIGEKQVLRSIKGMTMDIITY
>GSU1309 hypothetical protein
MAIPQIPLFRWPKRRLFTLAALLVVVFAAVTSLRFRYYGHWQGIYILRGV
DGHYLVKDDLLLGDENALLLALPAEPVFRFLNGNVSHATGGPRLEYEWFR
RDGSGFVRSFAADGTEYLTCLSRYLDSGGKESRGLFVGGGLPFDLERDRR
QSMNETGMAFFDGASWHHIWCNVNEAISPWSNPAAVMPPSAWKYRGSRVL
ESTPRRLVLGSSHWFELDGVPVSMDRFMIFRAGTRYVTLVIRMTNIGTQP
TGYFYVYGDEPWVGNYGSAEGNVGWVVDGLVPYEGSLDLKRHSWAGYVDY
GNEAAGETHDRNGLANFIEWQGPVTPDTVYFSNRIGTFANVSARVPLSHP
QNRVIVLQWGPRLLAPGQSDIFVLSIGMADREPNTGFPLKPDTSFDLASF
QQFLATQGF
>GSU2132 hypothetical protein
MDATTGSGNVGKRRRLTMAELAEILDDVAERLKSKTVCSLAPLAEERAAE
KIHQAKAKKYFSKAAALGAPEKILALVALNPERGTQRLKKSLADQGISLS
RQSIHKILLKYNLNRPAMRKAWQATQKETNP
>GSU0920 hypothetical protein
MGASFRGADPGCSDVMESCERYDADAATAEEASYHAHQKQGHRHELDEYR
GYADECEKGQQKVIHAEQGDDDGRPCIVKDGE
>GSU2522 hypothetical protein
MKREETECPESSVAEAVSRGDFLRTIGMGLGAAGLAGLMGGQAFAGDPEG
KGKYVIVITNGGNDPNRAVFALLMAQVVADKGWGKLHVWMTWKGPIWPTA
TRPGGSNPPSSRSSAMPKAS
>GSU0713 hypothetical protein
MHGKTQQPNQTSPRVDLTAILEVLKSVGLVVGGIGGGAILFFWIGNAIIV
ARLRAYNLYGVVHYTDEYVTEAGFQFFQDIFTFFQDWRLIPFFVILMALL
LLTVPVGGGKAANERPAGKGRITRSVSLILRYRINYIIFLTLALSAAFIL
TSGWFVRKLSSDIVCQERVLADLSKDLGGMVLILLPRKGALERADEFQQR
FYEELTFGVEPTLHWMNYALAEISRREGTPLSLEVFQKRFSIREPSPFRS
DADFEKSETYVALRNVWLSHALQQRLRERVNLTLQDFRSLLSGHLTSEGD
TSSLVLIPANYELAGESMRRLTRLRENISAFFVPGDEDTSRICTSLGSLK
PIAFGQFMISYSFWVLIGILVYLLLNSTRLIRLPLWEGGYFVVIFLLFLT
VLITLPTAYGRFKFEFKVQRLKDIVFTGEDKGVHPLRKKLDEARQRGAEL
YILGPTKGREIIIGAFEHPEEADAGSTRIIMLERSAYSYMSVEPVRPEKI
PGIIRLLRQQAGTTKHENVALAAERGGDGPP
>GSU1127 hypothetical protein
MAKGYQANRERLEQIGLLGKALAKRAGFACEWCEGKDDLRPWDYQPDDEP
SEETLALLCGRCRELAGGRRGDAHELRGIRNALWSQVPAVAEGAARVLAT
SREPWVREAIEESLIDETVKAEILGI
>GSU2213 GAF domain protein
MSVEEEKGLLARADEFLQAFKKGAEFTQELLRENERLRFRTLQLEADQKG
EGVAAALAVENRKLAARVEELEREIAEIIGQIRQIEAENNDFAARYVEIE
EENNNLANLYIASYQLHSTLDFSEVLQIITEIIINLIGAEEFAVMLVDEK
TGSLDAVAAEGISLEELPRVRLGEGVIGIAAGTGENYFAEGPETYERDLL
RPMVCIPLKIKEHVIGVIAIYKLLIQKKSFASVDYELFTLLAGHAATAIF
SSRIYSDSERKLSTMQGFINLLTT
>GSU0444 hypothetical protein
MKCPKCGYNSFEFLDNCKKCSHDLSAFKQTHGIRAVVLPLAGLTVAAPAV
AVAAPSAANAANAEESFTWEVPAAPESFKPGDDIFSDLDLGLSPSPAPAE
PASAGLDLGDFSFDDTPAAPAVPAPSTTDADAGTDDFATLLETGDNSEIT
AAAPGAAQVPEMESPWEVPTDSFGGFGDTPAPAAEPAAAGDFDLESFSWD
DKETKTATTPAAPKGPEAGVESFPGTDFDSLFGDAGEEEK
>GSU1874 hypothetical protein
MGIEAAVGAWNNTPRGDFAYKGTNLDIKDELKYDDETRLMGRVKIETPLF
FPNLYLMATPMEFDGQGSKSGTFQFGDVTFDGTVPFTSELKLDHYDLGLY
WGIPLLKTATAGVVNVDLGIDARLIDLKAEVVQGGVTESKSLVVGVPMIY
AGVQVRPLSWFAAEGEARGMAYGDNRYYDLIARGKLILFDHLFAAAGYRY
EKIEIDESDIKADVDFGGPFAEVGFQF
>GSU2165 hypothetical protein
MTETNTVKRLIIPMHDNSEFFSDNADYAIIDLDAALIERIKKLAVVVRQM
KANKISEFNHTCDFRTVDWDAEPDNGKVAMKEFEGRMECSTLNVTDSDFF
WSGYYKHTDVRWETDTVLIKSLDEPDILDER
>GSU2459 hypothetical protein
MTGSRKRRRFLRGGGSSSLLKALRESFFLGERKEEAENRFLLVKCLKSCM
YVHFARTSRPAAPARMRGHLNG
>GSU0746 cytochrome p460, putative
MKFMLAVLALTLAGAAASGAGELPVAPNGIACLTDYRAWEAVAPSWRDDR
GQIRVILGNPTAMKALRSGTRPFPDGSTFAKLAWSAKKHPKFPLATVPDA
FAQVEFMVKDGEKFKPTGGWGFARFVGQELKPYGKDASFVLECFGCHTPV
KDNDFVFTHLATIP
>GSU1552 conserved hypothetical protein
MDLFQLREQFSRDGILMCFNGPFSHSIIEEIGIAIRNHLAAENIARMAVQ
DVFAVYIEMTQNARNYLTRRDISPAEAGSATIVIARRDEFYSVTSGNVIL
NDDVEQLRTRIDHIKAQAPDELKKLIRQQLRAEVQPGAMGAGIGLMEMAK
RASGRLEYSVRPVDGRHSFFTLTAHI
>GSU1195 HD domain protein
MDILTLDLGLDPGERRLAGVVALLHDVGRFEQYRCYGTFRDSESENHGTL
GVRVLTGEGVLDGLAPEERRMILGGVALHNVFRIPDAIDGPARRLLHLVR
DADKLDIWRVFLEFYGLPPEEQASAVGLGFPDLPVCTPGVVETVMRGELV
NLATLKTLSDFKLLQLSWVFDLNFAVSRRLVVERNYVEQMAATLPPGEDA
ARVVAFVREYLARSG
>GSU0826 hypothetical protein
MLIRVQYPDGRYDYVKHTRLDDLIDSVKISRFLRSSGWVVIGEDPVRRRG
NRAPYVGPERRLAA
>GSU1841 membrane protein, putative
MLTPGSRRKLYTVLLAVWGVAILVLTLMPASKAEPLPFPGWDKIEHAVAF
GALAWFAGRFLVVYGKCVRRCWLWAFCATVTYGALIEVAQATLTTTRTAE
LGDLAADAVGAGLVCLLASRTVVAGNREG
>GSU2421 hypothetical protein
MSAITEQVTLDDHVNAATEVAFLLDIFAATVDNVMGGATASVGRIAGREM
ARKLPVHLTNPTLDEVVAILNSRMEAGYRFRLEGDGTERTLLFDHCVLRE
VCAKRGLTMGTALCRLFHAYLDGILNELICRPVKSEVVSCGDQCRLNLRT
Q
>GSU2663 lipoprotein, putative
MKGDRMRHMVCNVEDSQTLHCLTANATVSCARCGAKAHSPSNVCDPVSLT
IEGD
>GSU2931 hypothetical protein
MVGAVAGAASLARRRSDAGGQKSGDQDQKGQTSDQFSPRKTMASTVRATM
TAKAPSVSHEEARLPSPNQTARSGRCLPSHGMSRNQRERATMPQAKPLPW
LTTSWVQRRNQADLAGFGGAAGLAGARSGAGTTRARSASRSSPAAKARLT
GRKK
>GSU0748 hypothetical protein
MRIKDPFSYSARWDSVGLDCAYCVHFAGPSSWPDTDRTSRCRLHAIPLDI
GLGRCGYKEDEWFCRDFSDDGRAHGTAVEEFEALKAGLQSETLYGAYGLR
GYLKEIEFKRLR
>GSU1436 hypothetical protein
MTTYYSPAAELEGELVIDKADFVIASVEDEIRADNLCRKLLSRFYFSLLE
QGVDPREATLLASGADHFVRDFVIGFMHRSLFDDQPGIVRRFGGNWYIVH
TVEPSVGELAIHLAGVRAFYRFLLERDLLDGPFFEEIDRECSDLPYYEER
IKAFWAIEGDGYLAWERECSLKQW
>GSU2470 hypothetical protein
MNFAEIIVVALAIILILGLIAIILGPMFGWRIAARNEEFIPFSSRIEARE
KIRVALMEQGWEIEHDGPDRMKARTKISWRSWGEIIKIEFADGGAKVSSE
CSLPTQAIDYGKNKINVQKLVDALTKRNA
>GSU0323 general secretion pathway protein j, putative
MLLLAVVAGALYSSYFTVVRAKERTDEGAEERRELRGTLDLLRREIDGAF
FRGNDVRLRFVVEDRDIFGKPASSLEFAAVAPPRAGDLPASDLEKIRYAV
KETDSTLSLTRESTELFGSVKPMPYPQMEAIEGFLVECYDAGKWVKTWNT
ELVPRLPERVRITIRVRQGDQTVEFTTLARPKGRQS
>GSU3314 lipoprotein, putative
MSRVRQWFFVALVALVAAGCAHNPFNIPREDYEQRVKVLGVAPIFMDADS
DIRHPERDTLVTLVREYNRVNEKELVNMLRESGAYFMVRPIEAAAPDELF
RTLLFRSERRDDAGVVYNKHFYKPEEIRTLVTQNSVDAVLLVVVSGLTMR
DRTFASNLVKYLDTDYNVLTMTAQVLDANGTALWEYPNFRVGSPSTPVLA
ELQYPDFDQAAANMDERVEVKFKTIPGITRYLGKKEKDILFRERKATAPY
YHLFDDMVGLLKPPMDLFNRKDKKDASPAAAAPPSVQAVRPVERPVSQPA
AAPAQVASPQPVSDPEPVVAVPEPAPIREVPLDGSGAR
>GSU1679 hypothetical protein
MKINSITAVAACLIPCFLANSMAFGGTCDFMSLTSAGGNDYIIKTNGSIS
NVQSIILKVGYSDENRLASKQPVSQEGTAQWTFFNSTPRSDGLLIYASTQ
QPLHLSGTVARIRFARVTTPPTLACTITADYVKSEKNQDTSNNSTTSGSS
GNSSATVADETVKSGDSFTTQNDQGTSSNGLETSQFGATSMTVRPDVHDG
GETPPPPHEEQEQPEDELQHDEEVTPTAQPELPSQPDATPPPSPSRSPEH
VEYRSILSRFQAEAGERTPSALMDLFSEPVAQGISQTPPIGLSDGESIVT
LTLKATGDDGATPLFILKGARLLALTREPGGFWAVKTLPDRGVSEAVLTV
LSGEDSVVYPLTVAPRINIKPKEGEPLTEADFTAFIAQRGGEQGQAGDLN
GDGKRDYLDDYIFTANYLAATSTPLSGARTKEPSPSPAQP
>GSU3276 LysM domain protein
MALPSRSPRNHMNTCHFYRVQLLRASLLLMAISATASASATSFELDVKDL
QTVAPAKRAPKRKAPAPAASAETKRDTVPAETKRGVSRYTVKPGDFIFKI
LMREYGLSNAAAEALIPEIQRINNLRSITRLSVGQTILIPLEGPRAAVRV
QEFGERRETAEPAATVAVAPPTEPSPQVVESGPAPPSEQTAPPVTEAAAP
VPPRPEPAPATRPAEAAPPISPAPVVVSAPSSFSRRLITLWQSLVPGQER
VEPITLNGRVLPPEEFPLLLAADGGKILVDMKGTLLPRDKSTLAERHPDI
RLVSRGASSERDFFTVLLRTAEFAHVEEDAVATIGSDPQLTVKADYRITR
LPAVATRGPENVFLFLERNGSCLPAALISALADNGINSVEFCDVAPQPPS
VPGYELRSVTGTTPCELAVQLLGVLAVKLDRNRIVSGSMGENAESRFSIR
VDGYFENDGKKFILSCNDNDSYNYTLFRLLQLEGYGIIQLGDKDDFATVA
DKILTVLRYPHSFGRHDFTHDRYTVSALGFRVTRLGPVSGRMLIIDRPVD
PAFAELLQWER
>GSU0370 hypothetical protein
MIVTQEIARIAQALLKDSTYSLVEARWQTLTPLNLTPGQMVQAEVLANLP
DSRYLIRLANQLLRMEIPLNLQPGQTVELTFVSEEPRLVFALSKEANSGV
PVRISDTGRWLNQLATSRNDAVQSTPLPRPSIILQEPPRDTGRLAEGLRN
ALTRSGVFYESHLSQWVKGERPLADLLREPQGSLSRLAVATTSGGNNTSP
APANGGAPPQQNAAPPASQSPGGPPAATTDAGQKPAAAPLPAANQPTGSQ
PAAGGGNAPSTVPASAGTGTTGSLGSPSVATPATGDAPQAPHQSGTPAPS
VTTTPAGGAPPTDQAPPPLRPDGGTAALPDTDQATPRPQGPAPDGTRPPT
PQEPLRPDSTPVAGKPLPAPSQPDQSSPQGPTLRQVPLPDPAAQNSPTAP
GTTVVSQARGERIAVDQHAASQPRAPMALAADGDGVAVAARAGDPAATER
AGGLRHIPPPPQGGVEPQTIPIIKEQLTTLATGQFTWHGQVWPGQDMEWK
VEEREADGRGSSAERSWQTEVALDLPRLGSVRATLSLGSSGVTVNLAARN
EETIAAMKEGRPRLEEALDAAGIRMTGFRVSHDDE
>GSU1262 membrane protein, putative
MILHPAVIALFVGSLLTSGMLVATAFFAVKILRHWDLASGSERQLSFERR
TYLLATIMTCCCVFQLLSLFLFIHVADSLSALFPGAMCAAGTLNLNRFGY
PTLVLKITTFLLAGFWLILNHADNQGYDYPLIRIKYSLLLVITPLLLAET
GLQGGYFLSLQPRVITSCCGSLFGGESSGMVPTLTTLGDVPALATLGGVL
TLTLICGGIYLWLGRMGYLFAALSAATFAVGVVGLLSAIPVYIYALPTHH
CPFCMLHREYGHVGYLLYATLLGGGVAGVGMGGLQPFRRVASLASSLPML
QRRLVLFAMVCYGLFGGIAAWQVMVSELRLV
>GSU1944 hypothetical protein
MKVLVFLLVTFLSLTSVGVHAAPLTYLFGGKVTSSTMGAFEFSEVQVQVG
SPFIGSFTYDSLGSNPQPVNFEVAVLYEYITGLGQGWTVLVDKTQAQLGV
STPDVLGFDSLAIVSNAPVFGVFAGKLENNQLLLYESTYWPSILSDTTNT
AANLLLQGPGGAMDYWLNLDAFTIQKFFGVHGLFEGEPWNFVGQIEYLQQ
TSSPVIVGVVPEPGTGLLIAFGLAGLWGVTACCRKQNKG
>GSU1373 hypothetical protein
MLSSAAPFQSGVFVVVAGMRIVPTGVEFKSGSCRNSA
>GSU2394 hypothetical protein
MYLIHPSCSLQLARFEFNYLVYYFLIDFHLFFLLRPTP
>GSU2405 hypothetical protein
MTAQQYEIIISRKHELVSVPGITLHELSRLCECHLSVAKRLFAIGLIEPQ
STGTTPLFDRSAVVRARKALRLKRDLRLSFDAVALVMELLDRIDDLERRL
>GSU2501 cytochrome c family protein
MKAPKAILALLVVLPVLLPATTGRSLAFHEGGAGECEGCHTMHNSEGGFA
VATNGRPVGMGNRYLLKGGDSSSTCLNCHEKAGDVGPTSYHVSTPGFEMP
FGVPPKQLGPGGDFGWLRKSYTWIPAFGSTLSSSEGDRHGHNVVAGDYGY
LQDALHLTAPGGSYPAAGLSCVACHDPHGRYRRDQDGSIAVTGKPIRGSG
SYATGVEPDAAMAVGAYRLLGGNGYYPKSLGPAFSFINGPPAAVAPADYN
RSESTTATRVAYGQGMSEWCRNCHPNIHRDGSGGGLVHPAGTEAPLGGAM
VDYYNRYIKTGDLSGVEATSWLSLVPFEAGTNNYTLLRNIVTATPTKGPS
TADGTPRVMCLTCHRAHAGGWDSATRWNTKTPYVVYNGKYSQEGEAYQPY
GQGRSEMEALRAYYDRPASLFSPEQSTLCTKCHTSVP
>GSU0619 hypothetical protein
MHIGLASVSVLAGLVLCSPASVRAGISGEAELGYVRYEADANGARVSDAH
SFYQRYSLLYETSGELYKGRVGRYDLALGYEWGSFDTKIKNPSNPTDPQT
NYNPSVSAGHLLYRGEVVLDPLELPLRLRAYSYDTNRINMHEDLSGIDGS
SIFAPRLITNLYDGMHINSGATLVFGVKNGLTNGYNAIFRHIPLVMLDYR
DDVNRDLKSLAPVDTRMRRLAFVSLNKRDNWFHYRTTDFRDYLNTDQSFK
ETQMQIGTVDHLLARRWIDLTNWVKISADGQFTKHASSMPANSFEAYELN
LFAVASRKTWDARTFSSFSRLLDDEGITLERTVPFYASGVLGADVDWKAS
LYSNEKKIQSPSGSVVNNSNYSASVRADMFRRSPFTLGVVVRGESTESYG
SKLLSFEGGVETASTARLSRDYSLSGSYTVKYFDATGGTGSGDGYLNQNV
LGRVAYAASPTWRFEVEEDLSAASGTNPRNFNNSAITVNSGFSSGTSTVS
GSAISFQRRNSEIDEYVRSVTTVSAAWRPFSRTSLSFSVSEDVLMQSGAP
TDFLTTFRSTIDYSTPTFLARMRNSYNVRMVGGESIDYLESVGILEYRPV
RQLEGLLTYTYNVGDVDVNTRSEFLDLRQRLGYSFYTLSGANRKLLELYE
EFTYNRTLNTGTVANELNTTRRFTLGVRYFPLRSLFVGADARYSLIEPGS
ITEQLYTGVVGLNFRKLQANLEYTYGKRDGSDNRIEKRIAANVKKFF
>GSU0597 hypothetical protein
MKTTSKALMVTLATGGTVWAASGAESEANGLLVTMFLAFGALIIVFQLLP
GLMLFAAMVKGLFIRPAKDIATGTSGNETA
>GSU3228 cytochrome c family protein
MARLYNDGAGTAYDSANAKFIDNLFGWTNYLAGTKNLNGAFEPFNFGSAA
GFKSKVDNSLAGEMRDCGECHVGGGLMEYMSNTIPTSDYTGGVSYDPAKR
TSYRDVVFGGLFTAFNALIDVFNPDPARRGDVVINDYAQTGVMEMDCLIC
HFEGYDWEKRKEAVRKGEFDASRALGSGLATSAASGVQVTYDTNLVKTNA
SGDLVVDLSAKLSPVPPADNCASCHLSQYNVDWKKRGDMWIEGTEAHYGI
GCMGCHQRKDAASPTVGTSGLVSSTTLGLCDPAKGGASPFDAMWNKLDKS
QFKECVDCHAPTSTPTYDTYNAPNPVSAHAIKGLTDKIAFDKDGNSVSHI
ELIDCSACHIRKSGFTGGAFVDGTGADEEGRLALHDEPQVSRDMTNGVAL
HWLGGKLYSANLLTSFFWRDVNGLAAGLDVNNDGRTAGMDALLQTHVNDI
NLAAGLQALTKDGIVTPAEITTSINALKGNSLGTGGGDTGIKAYLGIDDP
TNAKVFLPKLSFLMVPFKSSHNIARGSEAWGAKGCKECHADGAGFYNGTY
PVNGNMEGKFNYSSAQAATFTKVNGLSDPSDSHPNVVTKKGDRSMPVQII
TADNPYSTTAVNANATIRNIDRSEMLYEATFKAVNTAWYDETAPIMGATR
PAACSGATSPFYCAAPGGAAAKAGATSTLGWLLKIDTTADNGATVVSRTK
QISSDMVASVDDIITNLGATFTGGFEFTMVAIDTNSDSVNDALKITAKPG
YKIRINAQCDAGPFGFAGSLYQADPIVRAAGTFAGRTAYVGYLNAIGSAP
AAVIATVGTTDATGNPAEITVTQGNVALTAAAAGALPADKYSYSWTCSDA
AGTSEGQSVTRAFNSLGTYDLTLRVKNLYTNEEKVDQIKVKVTAPAPAAG
VAVAAAGISYNSPSAGYAIIPLTITGVTFNKVKVVWGDGNTTIYTTSDAN
FVLPSHKFWGYPAKKSFTAKVYVYNGTTLAAQNDNITVLFP
>GSU1559 hypothetical protein
MKKDAASDTAADKAKKKAKKKKADKPAEKKTETPAKQ
>GSU1872 hypothetical protein
MKRILVALALTFVAGFAFAADENPLPKPDVLTEETVYTSQRLSAVYDTII
IRDLTTEGAELSNLDSEEMTKLEAMKPLLVRTVTDSIEMELKLNKLFKTI
QKNTQPKGKAVILEGAFTEFNAGNRAVRFWVGFGAGKTYLKVKGRLIDAE
SGKELATFEDRETGYRGTMTLENFADLFPHQAKSLGENIVKFIQKLY
>GSU1673 hypothetical protein
MTIIDSSMNSDQLDILRDFGYSLPNWRIQPPVAVPGGFMGRYIREERTAK
EKIQDTVNTIIAFFLMIVASALVVLLVFLFYENSKG
>GSU3182 conserved domain protein
MGKSTVFVNKQGMSNKGSGAVVVSGPDVCLTPMGTTVVPIPYTNVARSAS
LADGTKTVKIDGTMGAIDGCCYKKSSGDESGSKKGVGSGTIQDKAEFINY
SFDVKIEGKGACRNFDPMTQNNKNAM
>GSU1093 hypothetical protein
MLTNTVIPLAQRTVARHPALLSVASAATGLLRHMAIKPHTPSSRPRELAL
RRRLPSVHAVRNQIFREKGTGTQPTIVIGGFVPDATEAVEFQRELFRRHG
SVYYLNYARHGFSVEMFFAQLADLVEDLNRRGEKPVIFAISFGCSLLSRF
LRERYSGEGLTVGGIVMTSPVLCTQDLVRPEREKGGGVRMLESSLRRILR
AEATKEEELERQIERARRCFQALFEAGAVNRILSHRHLSIRKKIMSVLQT
TSCLGGYERVVALKDAAFAAAGPLFAGPALVMLAEDEENILAPSSPTLAA
LRDTGVRGELFPRGKVRTVASPMNGDTVPHASLIFHHHCYNPLIEAWYDR
LASPLRIAVVS
>GSU1670 lipoprotein, putative
MRTNNGFPRGKRIRAVLSVVVVAVALLLGCAAGADEGIKEGGKKVGEGFK
TIGKETGQAFKEGGKEIGQGVKQMGKEAGQEAKKTGRSVGEWFRETGRKT
GEAFREMGRSIKRFFTGE
>GSU1761 cytochrome c family protein
MTARKGLISLQTQQGTLPRKTFTSLIMLMITALMTLAGCGGGGGGSTAAT
STGTPTTAAVSGVAATGAPLVGTIRLKDSSSPAVEKTTSSATDGSFTVDV
TGLTPPYILKADGTSGGTAVTICSFAAGPGTANINPLSNAALASAAGVSD
PAAAVYASPSPAMLETISANLPAAVAALRTQLKPLLDQYGANVHPITAPF
TANHTGLDAVLDVIRVQLGAGTMVVANRATNAPIFSAPLMNINGGTFTMG
NMPTPPTPATDGQGLYAANCAACHGALATSEKKGTTLARLQSAVSANAGG
MGFLSSLTSAQLQAIVDVLAVAPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PNGSALYGNNCQACHGSITNSDIQTRTVSAIQSAISGNRGGMGFLSTLTS
AEIQAIATSLASAVTPTPTPTPTPTPTPTPTPTPTPTPTPTPTVDPGKTV
YDSRCASCHRLGTYDASGSAPNLSRAGTKIDGKFTAGVSGHKGITLTAAD
LANLKTFVNAN
>GSU0584 hypothetical protein
MVVDEVTYLFSACRHSIPFTRTEEVFCGELVAAFAVLYSGFRQEGYAAHF
RTALLASLMDITVARFLRGDHRKAFWSIQQLIQLLKNLSYQRYEGKPATT
GFLVYRTTLVEFRKRVRYQKSAWLDLNPQQALTPTLFDNPITYRFIDGTN
SLFAANVQMQLTGIVKPSGLAERDEVDRHAHRSVFALLRNAGAGAFAVKV
NESSEIEAIICPDKLLVRRRGQWGIFDPDIFRSFLAGCLDPEEIEELAWT
LYTLSKARHGTVVLILERGDRQLARLKRGSVGGDDPLGMMLAKRVKGKTI
SQLKQSGELMRLLSSDGMTVFSRRGRLLDTGVIIDTSHARELVTGGGRTT
AATAASFSGRVIKVSEDGPIELYQGGARVYSFG
>GSU0414 flagellar protein FliJ, putative
MTHKHGFQLQQVLNYRKEVEKVKKLEFATARHEFEHATDVLSRHEAEADR
ARVEYNNKQAAGTTANELQLYADFFARKHMDIQFQRIEVDNLNRKMSEKR
EDLMDAAKDKKALELLKEKQMLAFRREMAERERAFLDEMALQKVAR
>GSU1439 hypothetical protein
MKRDHAAPGPDKESREQQAERGPLHSKASAGKPWQQKWSANGSGRDRRPP
GDSRGNRPLWRECPIRKSRGQHNQSHDRQPPFTGGNQLDGGENKPFDSCE
RELPHGRRRDHAVIQICRQVCGPDSGIPDLKASALSHQMGSHTGSPQHWQ
QQRAVIHYPRQTGEADSLDDLIGAEGPLAGGAPDINAVGFQNQKPDVSAG
QLPPDTIVAHLVVGSDHHRGPGAGSANGVQTGRHQVRHASDTPACLPGSE
GIAHRQLRGDDRHAPRRGIEPTKKTRHIFVNGPADRRRLWTSTGGAIHVG
TGLTHASSSSIVSLLRTTVVPMDRCSACSHQRSLSLTPSRPARSSIN
>GSU3044 hypothetical protein
MDKGLSRLVELLNARGASMEEMVRLLEEERASIVALQAERLQAAVEQKVQ
VAARMEALDGDCRRVIADEARVRGLGEATLSPLLAGNPSPEKTELAGLQT
KLSTLAQEIKGMIADNRRLLETSMVTVGRSLAFFQSRLTVAETYGNSGSM
VERSSGGILRKEI
>GSU0320 hypothetical protein
MSALNRIRDTWNDMDRPTRLRWGYGVIAILALAVIFSFVYNRIGLLENKR
QRREADLVEMLRLKGRYQEARSTSMRLANRLAAVRADDSPAKLVEETGIR
GKSLRITPLKTELKNGFTEEAADIKIDSLTANEAVNLLHRIEKGQRPAII
RKALLRTRFDDPSKLDLTLTVALLRGAPQGAR
>GSU1662 hypothetical protein
MRFPSALEDPNISVRLLSAEETIHSLQKREFIGEELDSCHFWLQEGKEVF
RDAVCGILPYDSYDKIWATLNQIRHMFCRIVPREELQALVVEIRAGLSYV
AARKCRDEYDKDLNEIEKSLGLSAHHSPTLQNERELRRALERLSRIAADA
RESHWRKVNLFRKRLFLTSLSLMVLLLMSLVLLPFVLPAGAPSWIEIFLI
ISFGTMGGLVSALNSREPLESQTSAYYIQRTLLFLRPFIGATAGLIVCLM
QATGLISLLPAMATGADSRVFHPAYLVLAFVAGFSERFFITQLEKVSGGA
RDRKD
>GSU1569 transcriptional regulator, CopG family
MPHFRKDCSTKEKKARRTLGNRLDNVISIRISDQEKRNLERLIKRSSKSV
SEVMREALELWSYKQRALFSE
>GSU3178 hypothetical protein
MRKKASAISFGINGTLSEGFRVGRIVSATDNRFIVDYPENSLGPIAARLT
TSVKNRLSGHGQPGRGKKILLAFENNDPGQPIICRHNVLTT
>GSU1806 hypothetical protein
MRVPLELRNVATGYTVADKGPGELVVTLNGPNILLLKLRDEKIIMPLDMT
GVGAGTVLFTGFEARLSVPSKVRVTRVYPAEISVRVEQSPARPESPKTHE
SR
>GSU2970 conserved hypothetical protein
MITHIVFFKLTDPSAPTIAATREKLESMRGKVPLLRHIEVGVDVIRSERS
YDLALVTRFDSLADLQAYQVDPYHAGEVVPHMKSVCSSIVAVDYES
>GSU2373 lipoprotein, putative
MKQVLFAAFILLFLSGCSLFLSEPRVAVKDVAVTGFGADGVDIELLLGVT
NENSFALSLTGYSYDLQVLALPLTTGGDRRRIEFPGGATTDVRLPFRVPF
RSVMEILKRHPDPAAIPYRLTGSLEVETPLGLTLVPVSSTGTFSVPQRYR
PAEILKGFTEVINGLRR
>GSU0263 hypothetical protein
MAHNFNLYQPATMYYGPHMQTGLKKFSGTPLRIAVIATIFVVLALNGVAS
SGFLEHGGSLQPKCAISKSQSATAHSTTLKHVQIVVLGQRLFEFHSPVSF
LLNGHQDPRPQTSLTASIVFTRAPPARSSLS
>GSU1890 hypothetical protein
MVKTSKIRHILAVVIILVTLYLVASLAIRLVGGNEKEAGLPKLPRNIDLS
LKTLHYAETREGVKKWDLYADRGEYDRQRDVTLLTGVRIVFPGTDKTGEI
TLRSDKAEYFNATKDVTLTGNIHARSTSGMEFTTGRASYRASRQLVVTDD
RVSFRDARFAVEGVGMEFLVPTRTLRILNDVRATIAPAPKD
>GSU1993 hypothetical protein
MPTMKNQQKQCDMWDNILMNLIFVNKSDAHFGQAYMVLMRENPQRPV
>GSU3273 hypothetical protein
MGQMISEMRHGVRRPVWHDLAERFDEPRECLASFLALEAAEVLAGEKPAN
LIGIANRTRACGRNLYVLWKQWGAMVLGESGLAVRELSDRGDSLLLLLYR
PEALEALLRRPSASAVLARAGYGAATGLDEVLTELSTRIDGDRFPHEIGV
FLGYPLKDVAAFMGLVRIPFACQGPWKIFGDPRESLRLAEVFRCCRARMA
ERLNRCTSALDCLADGSPRPVVGFDEIESHSYGGKAMCIAVIGGMDRLEK
HYREEAVRSGVELRMFSQSENNIAVKLKRFDALVIFTNKVSHRVKNEAMA
VAKQNGIPVFMHHSCGVCTLRDCLSCLTATVAHVGESDRPDLGRRKNHS
>GSU2689 hypothetical protein
MPRESEPWVMRGDVAAAIGDARYYLKTRKEMS
>GSU0590 hypothetical protein
MKRTTERSPSHDNGISPPNRDLVRQEAVGRIRRYRVTASRGLWAMALFLA
VSTAALWDFSFLPPMSDQVRAFLGKPPSATMISGVLMLYTFSAIILILAR
MMGGAEQYSGFAHVGYLTAFYLFYHFARALNDNYWAVFVAGITILGLESY
HIWNFCSEAIRKDQEIIDTIDRNRRE
>GSU2104 lipoprotein, putative
MNVMSIRAKSIVSLTTMMLLGCIVAEISLNGTVYGYDKQASANDSSKEGV
ASNSIERLSEALRKALGEFDHAFQSRNVDALKQLFAPDIVMYEQGTQNRG
RDDVLTNHLGPELSSFQELAATYSDIRVMESERMAIITRNFSIKGKRQGR
FFGIRGTETQGWQLRDGRWELAHLHWSFPSSH
>GSU3224 hypothetical protein
MVLLVLSFPCPADAARLTLLRFGKLTQNISLGYEFDQQSSQSDGGTSLDS
TSHRFDERYRAAFRYAVLSPRLLNGSVRVGANLEQENEETTNRSASDGSS
HQLEYNVNGTWFQRSWYPVTFFTYQQYNRVRQAFTPSYDIKTDGVGATLS
LQNNYVPTLFNYSLRSNATSGAGADTENTIEAFLLSFSHNYRSLSNATLS
LSLNKSDTVTKGEATTTSSDTRSVVFKHLLNLGTTDLTRTLNTLASYVDE
TGTVRNHTLELNESLDWWVGKALQTHAIYIHRDRTFADRKTVEDSGVGWI
QHRLYKSLTTRLDLRGKNTTFDNNGSITEIGGGGTIDYEKKLPEESSLLV
GVGYNYGITDNKLGADALPVINEQTTVPDLPPYELVMGQLDVNPDSIVVR
NRERTFIYTPGTDYQVVTDGRETRIVFPQIPTVGPFPITAGSILSIDYEY
RIAPSITYSQSVYTGYATLSLFQERHRLFVRITDSSQDVLENRGAGSPQL
SDYRTLNLGFESRREFLSYGASYLRIDSDINPSQTFEGYSRYVRSFGRDL
VRLNLRERYTLHDNGDDRTESSYTNTLNLTGEYLRPVTNWATMRLIGDYL
MVRGRSPSDEVSASVELNAQFGRTELSLFSELAWKFLPQNTNRSTAIRLQ
ISRSF
>GSU2156 hypothetical protein
MNGPGNIICSGDEIHLPAVAMEEVNLSLHGANRFLERDIPLEAVEGVRHL
LPLLNEKPLRFRYKGVLVVARLVNGSPRIITAWKVPPSA
>GSU1325 hypothetical protein
MGIDFTDILPSPCAARKGILAAGNHAPLDSPTPPCHSSIRDWRRKPCARG
EYVLITFHPLTSTCRLLKSYCGAAASACGAAMQDQALQ
>GSU1785 cytochrome c family protein, putative
MSLYAPGSRKLVFFNAFVAVSLWATLNAQSVSAAERVSPHDAQGACLSCH
IATERDLRSDRLSGDGKSQLKSDANGVCLQCHGIDFGHGVGKKPEMNRKG
LPLDGDGLIACAITCHDMHVVAAENPHQKAYHLRIPVEDLCFSCHDR
>GSU2315 hypothetical protein
MITQIKKFEKMIIFSLVVMMALVLFLATLELAWIIVKDIVSPPFLLLEID
ELLEIFGLFMLVLIGIELLETIAKTYLAQSTDHARIVMAVAIIAIARKVI
ILDVKDLSGQVLLGIAAIILALSIGYYLIRRRSGGSEREGDTSSEPERL
>GSU2765 hypothetical protein
MKEYLLLFDARACELAAGAVDAGHEADRRG
>GSU0639 hypothetical protein
MHAFFMISGLTLTALLINIPLGYLRQGCEKFTFGWYFYIHISIPLIIYLR
VKAGFSWKFIPLTLGGAVAGQLIGGFIHRRRQNNG
>GSU2810 hypothetical protein
MERKRLTTNSVAPLCLKADQAYGTEVVDTQAIPLNDVTAD
>GSU2130 hypothetical protein
MKNLVHIRVWNRLRKIREFKFNYLEGGLYFNRLNSVGNSAHTKMADRCGA
RERTALRPVWDAMIIADHSNRWRRLGGCHESNT
>GSU3310 hypothetical protein
MSNLSKYRQSQVDCNYVNCCDTQGMMMDGEKKICAVCGKVFTPKELRSLF
EEAGEWLAAELWNDAGSLCPLCLENRAKLAMMYVIDR
>GSU3234 hypothetical protein
MRKKLSEAAGPLEGMEMNSKGSRRKKARSTSSGPFHDGSCFSLNQLAG
>GSU1548 hypothetical protein
MELERRGRTFYESLAIGCGNGRISALAAALAKAELDHLETFTRMRERLPE
SMRGPNLTDEELMTAADRVRRMVLPRPGVVSETVIASNLAKALDMAIVME
ADSVAFYAEAAAGLDGIDADVIAAIVAEEREHLVMLQEVRNLCAAVFTE
>GSU0710 hypothetical protein
MKKQALIATVAAGILSLVAGVAPATDQERVRENVREKVRTEAKEQERVYG
SQLMTRKERAEYRARMRAAKTVQEREQIRKEHHERMKERARERGVTLPDE
PPVPGGGMGPAGMGAPGKGMGPGGGRNR
>GSU0832 lipoprotein, putative
MNRITRIVLTVFTIALLSGCARSAELIRTAGIAERQDVFVTQAEGQPLPP
GYAELRIVSLLKTHKPGLYSSADVHGTPGYKLLLNIGGQALELSGDPRPE
RKEPVGPRDPEAGDGMRYQFSRTLHVKAGTHRTVMAIPDDTIAVEKEITL
ADGSRNTLVLEPVYGKMPVRSRPWNYNVTDFNEGIRSLRVVFNGKEK
>GSU1318 hypothetical protein
MEDVTLVMQSEVKINGTTYAIHVYRREDGHCFAVTHLKNGDIIINDGETH
EEALTRQEGVLPLAVGSRAILDEFRRSSGLLLKFSH
>GSU1376 hypothetical protein
MESAIGKAIRLETEPVAVLFADEKPEGAGQFAKGSWGCVMFLLAAASRGK
TGAFDRETYGCVGGGVGLGFGNAYESFPGGIPGFCRFLSTGNASDPAGSA
MAEAMKNAGARSEFVEHFLHGERYKKSPELVEKFVAAAPITEVPAPYVVM
KPLSLVDPERESPVSVTFLVTPDQLSALVVLANYDRPGFENVAVPYLAAC
QVVGIMSYREAQSSAPRCLIGLTDISARNYLKSQGASDKLTFTIPFARFK
EMEANVAGSFLEGETWGSVIGRD
>GSU2973 lipoprotein, putative
MNVRRNLSLLLLALVLAGCRNPYLIAQRFEDSSREHNRLMRWQGLEQSCV
IFAGEQVKDVCLERARAAKDVSVADYRVTSTELDVNKGTATVRVEVDYFI
LPSTRLKTLVDEQQWRYVEDEEKGSRWLLVTPPPEFR
>GSU2727 hypothetical protein
MRDALAELSDEKRKLIGKYRLVPKDINGKIYWVRRFGNRPDHPYATQRAL
RNCHIIELVFSFYDLCVAKMTYFKNNFADYIPCKNNYRVGQMEECEPWDM
EILVQRETGIVIDLRNLAEISDIEVFKEMCKWLESRLNEADACSRKVMGL
>GSU2106 hypothetical protein
MGKVSDKVCELLNRQVKERGIVVWYDSEKVYDGLAKRLQIPETTVLCYEN
GFFPLRERMEQFLEFVTEDGRPQDDCGVPPKLIVYVPKSRSETGFALIEA
ESVGAVVEPGAPTADRNSRLSSLVEQVFNGVAPEKAAHVARQTEEGLLTL
AELDQMAEEAGTVTTGALKLIFGAASPVDVILSFLAGNEFDVKLEKKHSL
SELAVVAREELGLDVELIPTATELRTAIRRHLLLCELIQGLPEEGFPPSL
STIALPRKPVQFDTIRHVLNVWRQRLDLQEAYRETAENVQAMTQLDKAVF
PTEALRGLETFPFIEYRLLLSARNNLAQRDTDSPLALAKERRASFWSRVY
PEISLRWSLVEVAARLLQVSGLICDALKRRKFTAEELITAYTRHGEPWML
LDRYARHLESRYVRCETFGSTDDAGMDELMHQCRVEYANTLAVLAEAYSG
ALEREGFTPAGFDQQSQTFRNAVAPLLKDGGKVAYFLVDALRYEMADELC
EGLSADFTVHLEPALGQLPGITSVGMAALLPAAENGLALEAVGGRMRVCV
QGETVNDRSARVTWFQEKSDSPTKVFKLGDIVRLTPKRKRELAEAQLVIV
TSQEIDRHGEEGGEDEETRLYMDEVLEKLRRGLRNLVQAGIMELVIAADH
GFIFAEALESGLKMDPPGGKTIALHDRVWIGQGGVAADGFFRVNASDLEL
GGPLELAFPKGLGIFKVPGGGCSYFHGGASLQEQIIPVCRLTAKAPKRSA
VITVNLKLQLAKPKITNRFFSVTVSLEAEGLFTDVAKRVRLEILSGKDEV
GHTAMAAYGFEEGSREIVVDYGKPNSVTIMISETKKLEHIEIRAVDCESQ
LLLASLTGVPVELAI
>GSU2733 hypothetical protein
MGIKGFTRLLLLGSVLTASTAWSAEIHGRSSTQLLWFNNYFNDQRQIELG
EYLRMSVTNLDKAGKASLFGYGRMTQDLNNGEGFNGRLYYLYGEYRGLFD
KLDIRLGRQFVNLAAGTAIIDGGQVDLNNAGPIGLTVFGGRNVIFSLDEE
NGHGGDLALGVAAYLNGFKSTDLEVSWFRKWDDWNIARDTLGASVKQYLF
NNLRVYGNTRYDVVSETFSEVLAGAKYYPTSNLVLTGEWYQSYPVFDATS
IYSVFAVDRYQEAVFRADYTITEQIAVHGGYTRQWFEGGANGNIYQAGLS
VRPVEPLQLNFEYDNNQGYNGKTHGFIADAYYDVTKSLQLSGGIAYDTYQ
RDVLSDDEIARLYWVGGKYRLAKNMSASLRVENNVNAVYDNDVQGRFVFD
YDF
>GSU0760 hypothetical protein
MILLITLPHPFYLPLPVRLPVPLLLPLPELPLPPQPHMVVRTMVIRIKSR
ILCARLMV
>GSU2109 hypothetical protein
MSSLEQNLDLLRQDLVAEPMRIAAHSDMPFAIFRYSPEEEFQLRKRLRLL
AYSLSENHGRKVAFLSISRLVWGIVRRFEGTDYLFKTESIRGFQAAEEHI
NRLLSSEDYRPIADEVLERIKGFDHDRDIVFLVRAGGFAPFIYRCSSLLD
GLHRRTKVPVILFYPGSAEAGTDLKFFDLPSEANLGVYNYRVKIYGVDK
>GSU2967 hypothetical protein
MKVIDFALEMEEMGKDYFRRLASEASMNGVRTVFTILAEEQQELYDTFLA
IKRGASAHCNADSQALERAREAFAKIFAPDGAHLALLKTDLAAYEHAMQV
EAAIVSFCEELAERERDEEARALLREIAAEERRHYDTVESIHDFVAAPRW
HLCWGESCNLREM
>GSU3215 hypothetical protein
MDTWPAPTGTETTRNRTLREGGGNDKAEHHAQTSRAVRPEGPPVDLR
>GSU3164 hypothetical protein
MERFFTEKGIPYTCRDIRRDRAAFREWRERYGGEIVPMVVLDGGKKVIDG
CDIPAIERALADIRSSRP
>GSU2294 cytochrome c family protein
MPSFLKPLRAAIPAALAVLCSLPPAASALTLTSNDCGACHVGSNLDRHHR
LISEAGMECLACHQLHDLPAGGYTILLSRECLACHNATVHNGVSHTVTTP
ADCARCHRESLETTHAENGWHGGDPAINAEIFPCLRCHTSTSTSIIETIS
AGLAGETPSCMNCHPYGMKRSRGTSSVRR
>GSU3220 hypothetical protein
MISPAHPQRWWKSPDRKAIGIYEEQTAHFMTFK
>GSU2729 hypothetical protein
MESYRQPDVPAPHPRRHAIAERPPSTHPGHDHASPEELPGPMNPGAIRCN
AADPIVT
>GSU2767 cytochrome c family protein, putative
MKLSRRDGIFLVIILALITVLVLNRGSEQGKPLPADDAHRPFSEALARGA
DRETTEQGCVRCHATAARPLPPGHPPKEQCLFCHRPAPATR
>GSU1546 hypothetical protein
MAPEQPAMTHPIEISPNHDLIRDRLRLCLDSARHDTATFAMLTVVLTPIF
MVVGTIFLVFTLMIVDVPLVDHLGYDASVATGANLSLAFMAASFFLRPTQ
DCQRGRRGYPWFVVGALLFCVLLGISYGTTLPATAPGLFWSLYLVFALAM
LGCIGHVYEPHDDYYVGWVAGPILIDDPFTIEDDIDRAHLSLGLVGALSH
LIISSYGEIFGSLWLWRGLNDHELSASVEFLKALAAKDSNRARTRMLSVG
NASARDIMRALQKLEMILVVDGAPRLSMKGRELLGLKP
>GSU0125 conserved hypothetical protein
MSLTTELVDWTHEQKCRKAVAALEKNGFTAVYCPSGREAFDYILTEAVDA
RSIGFGGSLSVVDLQVIDRLREMDKELLIHGLPGLSLEERVAIMRRQLTC
DLFLTGTNALTLSGWLVNIDATGNRVASMFFGPRTVIVVAGRNKIVDGGV
TEAIARVKEWASPPNARRLSYKTPCATTGFCSDCNSPDRICRITTVIDRN
PRLTDLRVLVVNEDMGL
>GSU2966 hypothetical protein
MNSTGIPPVEPHQYSRTIRSSNRPAGTDPSRQLPATACRSFQQVSCRTLL
RIATAFNHGTFDNSPFRATIGPA
>GSU1065 hypothetical protein
MVRPLTNNRGAALALALVLMVVLSLMVAILYRTTLMDMLFSTHYQESQKA
FFAAESGVKAGLAWLANQGSAPENQLNPAPWFSNTTTSRPTAATVWSDDE
PVTADGVVSGRFRYYIQHLKNAPASYAGGESAKQGTSTSAGAMVHYYRIT
SEGENRNATVTRQIQVVTTAAY
>GSU2131 hypothetical protein
MKKIIVSVCAAALALAAIPAFASVPPEGKDDCLLYGKNCPNVLDSLPERI
AKLNKEIAKGEKVYTSEELNLLERKLKEDNRTMRVLNKPGK
>GSU2920 hypothetical protein
MYFVSVFIVCQEKIEDNLIEAGRFFLWFGGRFRWWG
>GSU0481 hypothetical protein
MGYKVKTFGMELRPLKAMQELFALDAMVNAFVAENGVKRIVSVSDSPTTD
DKGETIGLVRVVAYED
>GSU0100 hypothetical protein
MFGSAQRKKPKETLGLFVVICRLVANREGFSAPF
>GSU1020 hypothetical protein
MGPGGACGAAVRLFPQDVSCKRVGLPTQKGPHMVAGLFASGRETLAVLDR
HGQFLQFILQVGNQSSHRFPVAVRQPVHPCQGGFEAPGRGKGAPDRDCRV
PHRLLTAPGNLLELLEHRGVAPLLLHDPLAQVGVFLVGKPLFFPGLAQEV
DGAVVGKGAGLFPPCQELGFVLLHEAVGQRQLLGRFLLDADFPRAVFLEA
AVSPVTVEIAVVHAEQPVAQLHPG
>GSU1675 hypothetical protein
MLYHVADKVVPTLKACSPQIRDIPLEGGIHQVRATDEPDLTVEQKSKNGH
GYVGGSIPDQHGRSADNDQATESHETAGQGIIENIAPQFNQVGNEACLVV
VEKTGGIVVNGRSISPPGLGLHEEVDVGQLFIGRNECFERGTGHKPSR
>GSU2742 hypothetical protein
MTIVVRLLSAVAILALTAGSVLAAEWVVLDEDPMLSNFYYDSSSVSRDKE
GMVSVWTRAIYSEEGKADALDTMGNPPAFKDLSHTHFLYTIDCKAAKSRL
EKVIHFDDKAEPIREYNLSGKTDWEAIDEFTRLGLLQEAVCQ
>GSU3149 hypothetical protein
MPAYPGSGKLDRNTPAVPLAIIRPAFSELRVIDGQPAPL
>GSU1543 hypothetical protein
MFSSYRFCARRIAALISRDWLPLSPPAKFASSLEEINSVTRTVVDTHFRD
SIADRFDVSGISGRKPLNPHEDPSLGLYITKIVKPFGKQFGFADFNHDYT
VASWLQAVNP
>GSU2886 TonB-dependent receptor, putative
MPGRGSGPTIQENTRDEHPVMQPARTGKRPAANDGCRPRLPLPVLVAGTG
EGGAVPLPVGGRRRCLPEVRHDLRLLAGGIDIGIGRGVGVVADEPAVVQA
VRTHRMARDTGHIVLVVAFAVNVVVPAAGAATSLEGVTLGAEQGGLPVAG
VVTHVSQGPELAVGARGPGGAGNQHGLTGRAGGVTVAALHHLGADGGPGC
LHLVCPQSPRRGGAGLAVEGHLDRVGRRVGRVELPGLVGKGLARRADDGA
HAVGGGERIGGVTLIAQLVLLGRRLHGRVPGRRGPRRRSIRLDPVVLAEV
GDVVAVLVQVGLGAVAIGAGILLARHTGGDGVHYILVRTVVTGLADGGRK
AAVLRDQVVLVTEGAGPLPHRGHMPRHGRNGHGEQCCECHDYAFHRAPPL
IGQLDVVHVSPPRYCTRKAPSAT
>GSU0139 hypothetical protein
MRYQASPQTRLLLKYIHTFSAMMWIGGAQAILILLYKDRLATNGDELFAV
NDAIRSLDNWLIGPGVAGSILSGALISLTTHWGFFRHRWVSVKWVVTVVA
TLFGIVFLGPWLQDLSEITGLNRLAVFDNLAYVQTYRLGVIFGVAQMVVL
LVLVLISIFKPELEPCSPAARQLERRLAVLLRPVVAPFMGLQGQICNLLR
LERD
>GSU0943 hypothetical protein
MAERNEESMKTEEPVRLVFVVEGRVAVAPYLDTVRSEVEDVTVVSSIGEL
FPLMAREAFNGILLDVPTLVRATSTEKAALYDLIQVFPTLRVKWDSRSAT
VRALFYDRVPGPGAGLGSFVRQVCACFVPRIIRRRDRVAAHLNVMVSETP
AFSPEEVTRTVSLNLSLDGCFLIAVDPWPQGARLWLRMLEFDDQTPIAAE
VRWRKPWGVSPGIAGAGVRFLELTDEQRKCVVDLCEQNRVVGGI
>GSU1863 transcriptional regulator, Ros/MucR family
MAPTLLELTASIVSSHAAVSELSTEELVQEIQKVHATLQQLEGGAAAPEV
AAEEAKAPAMTLKKAFQADQVCCMICGKGGMKTLTRHLAQVHQMKPGEYR
KQFNIPSSQPLTAKKFSEARKQMAKDRGLAENLAKARAVRAAKIQEKKAA
AEKPAKTKATRAKKATA
>GSU1661 hypothetical protein
MRYRILAIAMVSLLAALPAAGADVRDLATCTTKVFNEINLTRQWSGKAPA
GCTATVAVEKRADGIFVTAWTVEATAGGWVLTALSSAAGYAEVADKKILA
RANREIVSRAARLGKCLDSIKAVNDPLECRTHATKSYLVDEVTGTEHNRL
IWLDDNGRHTVVEYSFGDTEATPTPPADLFEGTPIQPGVIIQLYREVNAL
PSLKPNFAS
>GSU3223 cytochrome c family protein
MRRSVAVIWLLRVGLGALALCLTACDPVVRHKTLSTIFDGVPTLPPTDEY
CVDFARQYHQEQLGLAKVEVKEEGPKGSTHLPYGEKRCADCHGSDKDKSG
GLVVPKQELCFKCHPGFLKGAFQHGPAAVGDCLACHLPHAAPNPDLLGAP
RDQICGRCHTEERLTNRMHDRLKDSKIACVECHDPHASDARYLLR
>GSU0209 hypothetical protein
MTLVCLTTKRKNARRTSKAATVPPCFVFGAVSGLLLECWLDLLTAYVSML
QIMTLACLTIYKIP
>GSU0060 hypothetical protein
MAGSMTLSLHGANRFLERDVPLEALLGVKLVTPILNDDP
>GSU1969 hypothetical protein
MSETVYVVHCIDTEGPLHESTEATFDRLKSIFKLDLEPSTALLRRLQAGE
VDLGGIESAVMKVVDPHLLAYNDTWDKVDAMLAECLSDEFRMRVRDSFGG
GWVYNWFCVDHVDYDYNPRRRDMGYHNVFDHYRHLMRETGSMQDGLHFHY
HPHPFNREAHRCATHWWANSDSLPQILSRRVIDRNWFPSAHRPGFHVIRP
DSHWFLEQFIPFDFSSQAMVPTDDDRAQFGLKGGRFGDWRRSPVNWQPFH
PAPDDYQVPGSCRRWVARCLNVGTRFRLMTENDVRQAFREARDGKPVVLS
FTNHDFRDMRPDIEGVLHLLGRVAAEFPDVPFRFAEALEAMRGALGLPLP
PPCELDLTLTTIGDSAHVLEVSSETPTFGPQPWLALKTAAGTYHYDNFDI
EIPFHRWQYVFDEETFPLKALSAVGVAANNAAGTTTVAVMDTATGTVTRR
HWNGTSSASHQE
>GSU2895 hypothetical protein
MRIPFTLVCLLAVGLWGPATRASAAGAGVVNFVKPVVPPSALDATDDGKG
NPLCGDGLVLQKNGRCCPEGTANDLWPGVCTPVGTIEDQLFNQDDITYYC
PSADMYIVSDRSRNWFSCADSGSVPLYAAARQSTFCRDHGYGYLLKIRPW
GVAGCVRVGDSLGDGCIEAGTCSFSGP
>GSU1405 hypothetical protein
MTLGSSGKGQVAKRCPDFLYAHGRAEASRPGKGIVLPDLRNKGLHQMGTV
TIPLRVVGDAQYIGQQPFREFTLRILPEGNLSEFFEDVHTQSFQPEVP
>GSU1024 cyd-4, cytochrome c3
MKRLIAAAALTLFCAGLAVAHDKVVVLEAKNGNVTFDHKKHAGVKGECKA
CHETEAGGKIAGMGKDWAHKTCTGCHKEMGKGPTKCGECHKK
>GSU1760 cyd-5, cytochrome c3
MKRTVILFAAMILTASVGLAADVILFPSKNGAVTFTHKRHSEFVRECRSC
HEKTPGKIRNFGKDYAHKTCKGCHEVRGAGPTKCKLCHTG
>GSU2802 dRAT, NAD(+)--dinitrogen-reductaseADP-D-ribosyltransfe rase
MLSSGFNLCNLPPWVIASRHFNDNPHPLEVQGVRPANRFLFEKLDGIASA
EERGTVFNDYMSVKFQLHHWQAQQTDTARKSLKNSYLRYLRGWMMDSNSV
EGAVLKGWVESRMGIVPTFHKARIGGIQSESYYTYVMDRTAGSKRTNAIN
SQLDILYEFCQYEQARRMAGERWITLYRGTFDADDYDVVEELGKREKIVR
FNNLVSFTSVEERAWEFGFTVWEIRAPLSKIFFFNDLLPNSIMKGEGEYL
VIGGEYRVRQIMCTI
>GSU2731 ferA, polyheme membrane-associated cytochrome c
MSRKVTKYSAVLAVSLFAAALAGCGSENKEGTVGTGPGGVATVGDTACVQ
CHSAVVDPLTGESIITQYTRSFHYSKGVGCEGCHGGGAQHNGVGPLPFPL
AGQSEAQIAARCASCHNGVIAPLSSSPNFVNGNHANPFGGEEAKENLCSR
CHSHEGAIFGAQAGFTGDGNILRNAAYQPVYPQDPETFNVMTCATCHQHG
GAQRQVFTQISTAGVPNSRRTVAWDPNRNSINDQYDLCTSCHTVNTMTGT
LIGSGNVLQIFTSNAVGSGTKSVTTAPFYHNTRWFRTLPSTHYDFPESKT
TASGTTIEGYVIRRNTANPCFDCHGHEFQTNTRRLAGADRPNTIFLDWGQ
SAHGGKLLQAKVAAAALASSGAAEVDDVMKAGATDATAPGWTHYNWDDTA
SRGACQRCHTSTGASNFLNNPAGYDRTGAGNSFTHLAGWTSSNKRSDQNE
LLYCWGCHTKAGTGELRNPGAITEVYPGINSTSTGTTGLDVTVSYPDIKG
SNVCMGCHLGREVGDNIKAITDADGILGFVNSHYLTAGGQLFGTTGYEYA
TRSYANPAFFQHDKIGTAAAPGTGTNGPCAGCHMTTPTSHLFLPVTKDGT
GAITAITSTACVTCHAGTFALTPEGLTAEEEEYVASLEALKAALAGKGIL
FFNAHPYFYRDTNANGIADPGETVSSNAFTNWAGVYGLALWQDVMGAAFN
ANLLIHDPGGYAHNRFYSKRLIWDSIDFIFDGVLNNDVTAAIDAQVTAAR
LDSATATAAKAYLGATRP
>GSU3045 flgM, negative regulator of flagellin synthesis FlgM
MLIDNDIVSLSSVGALPVAPPKADAGQAAAAGTTPVAERVELSIGKSAID
SLKDAAGNGESFRAERVAEVRQQMIEGAYRVDARSVAMRMIGTNV