GenColors

Gene list

Applied filters:

COG category: Function unknown
Gene type: CDS
Genomic element: chromosome

Number of genes found: 227

# Geobacter sulfurreducens PCA, PCA

>GSU3139 conserved hypothetical protein
MKRLGTFAGIALVLALTAVVARTCFSHDLIVRIKDRKEISFEEMLRDLKA
GKVIYVGETHDNPYHHDLQLRIVRELHRAGVPLAIAMEMFTYESQEELDR
WVAGKTDPALFQQIYLKNWNFPWALYGDILLFARDRRIPLVGLNVPREVT
RKVARQGFESLSREERRKLPPSITCDVDDAYMAMIRRSYSDHDTSAKTFK
NFCEAQMLWNKSMAYHLVEYLKNNPGRTVVVITGSGHAVRGGMPVQVDRE
KPGLASRVVLPDPVVQNGAISVEDADYLWLSH
>GSU3295 conserved hypothetical protein
MPRPFRLSCALLHEPVSSNLMDLKYGDTSFSLAIPPERLMGVIRPSVPAP
DSDPAAIISEALDRCADAIATFRTGERVVIVTSDITRYTGSEIYLPLLVE
RLAAAGVRERDIEIVVALGIHRKQTEHEHQRITGPLYGRIRITDHECDNP
GKLVLIGRTSTGVDVVVNRTVAEADRVILTGTIGFHYFAGFGGGRKSILP
GVAGRASCMASHYAVLNPGEGTGRNPLATTGNLEGNPVHQTMVEACAMVN
PEFILNTVLSPAKRIIAAFAGHWREAHEEGCRFYADRFSFPLKKPADLVV
VSCGGYPKDINVIQSHKSMEYGCQALRRGGVMVLLAQCRDGYGNATFFDW
FRFRELDAFEARLRSHYEINGQTAFSLLQKARTFSIILVSDLPPEEVRTM
GMTPARTLDEAMTAAASLLPSDYTAYVVPEGGTVLPIMM
>GSU1788 NHL repeat domain protein
MRLSRGVFLLLLLLALASPLMIVGCGGPSFLPAASLRDPSVDMAWPPAPN
PARIRFLREISGPEQVKAEPGAIARFLEFVTGEQFKHVPFVTPYGVVSDG
GTLLFVSDSSSGVVHRIDLARQKVSYIVRAGDEFLSSPVGLALSPSGDLY
VSDSVNAKVYVFSRDGEFLRVLADGQVDFKRPAGLAVNSKGVLFVVDVLA
HKLKVFNVSGRFLGDFPPDDIGGKLNLPSHVAVDKDDKVYVTDALNFTVK
VYDSARRYLRSIGEIGDAPGSFARPRGVAVDSDLNVYVIDAAFDNFQIFN
QEGQLLLFVGKPGKKSGEFYMPSGIHIDRNDRIFISDSYNRRVQVFEYLK
EENR
>GSU0446 conserved hypothetical protein TIGR00046
MRRFIVTGLDLSGKNVVVRGDLFRHMARSLRLKIDTEVILADGKEAECTG
VITAMGRDSLTVTISERRQIRGDGDGLWLTLCQGLPKGDKMELILQKSTE
LGVSEVVPFLAERSVPRISPEREQERLRRWQRVAEEAARQAGCPAVPRVQ
LARGLTEAVRGTDHDLKLLLWEDERTTTLAGVLAAAQTPKRIAVVVGPEG
GFTAAEAAEAREAGFVPVTLGRRILRTETAGIALVAILQYLYGDLGSGPH
PRPAPAQPIP
>GSU1218 conserved hypothetical protein
MPMITTLTCPRCRAETVWEGNPHRPFCSARCKTVDLAAWADEEYRIAGPE
APSDNDENDRE
>GSU0970 conserved hypothetical protein
MNSKRYRSDTRPAACAIRLLAVLVLLAGLAARAEAGPALPVMVKDINTAS
VPVSSSPSGMTANGGILFFSAGDGANGPELWKSDGSAEGTVLVKDINAGP
GPGTPQNFSVMNGITYFSAMDSFRGVELWKSDGTAAGTVIVKDIYQGGES
SNPLELTVAGNTLFFSADHPVYGKELWKSDGTAEGTVLVADIAAEASSTP
QWLRAVNGTLFFAADDGLHGRELWKSDGTPEGTVMVKDINPFGGSDPGEM
AVSGGILYFTADDGENGHELWRSDGTAEGTYLVADIAPGEESSYPFEPVG
INGLLYFTANDGYTGYELWQSDGTPEGTTLVKDINPDGEDSMPWGIVGMD
RYVYFAADDGVNGYELWRTDGTMGGTEMVADIQPGMGGSMYSSPRLVNGM
LLFAADDGEHGIEIWKSDGTAEGTLMVRDIIPDAMSWPSELMVHNGTLYF
AADDGVNGTELWKSDGTAEGTVLVRNIAPETASSLPYQLAVMGTTVFFAA
ADTDLDFDVWKSDGTADGTVLVKEINPEGWAYLDRLMVVGDTLYFLAEDN
YGEASHGIELWKSDGTAEGTRMIKDINPGPQGIFFPGNPNYPFSMAAVGT
TVYFPGFTAGNGHELWKSDGTAEGTVLVKDINPVFDFSSFPDSFTAMNGA
VYFVADDGTHGAELWKSDGTADGTRMVRDIYPDGIGSSPLSLTVMNNVLY
FSAAGDEGGYGLWKSDGTAEGTTFVKDTSPFNHSLLPAYLTPVNGTLFFA
AHDENAGFELWKSDGTTDGTVLVADILPGEGASNLRLLTGVNGTLFFVAD
DGVHGEELWKSDGTPEGTVMVKDIFPGDGISGITWIKVMNGMLYFAADDG
VNGLELWQSDGTAEGTVLVTNIVAGQGSSSPSYPVVAGNTLYFAATDGGS
GVELWKFSPDPPDGDLTGNETLEIPDVLRALRIAAGIAAPTVADFIHGDV
APLDGNGRPAPDGVIDMNDVLVVLRKMLGVVSW
>GSU1069 conserved hypothetical protein
MAEKQYNWSAIAKNPKFIELHRKKTTFLVGWWVFSTVFYFLLPIGAAYAP
GLFKIKILGRINFGYLFALSQFFVSWGIAMYYAHVANKDFDRLTRELVDE
LK
>GSU0390 conserved hypothetical protein
MLNKEKWKKNIKAILSLDSHPGHIAAGFAVGVFISFTPFFAFHTLMAIAA
AFIFRLNKLTCITGAWVNTPLTVVPVLAISYKLGRVMRGLPPAELSFHGL
DWHALKPHATSLLLGTSVIGFVAAVVAYVICYWLVVRFRRKDETLATLAE
EMEEVGEELE
>GSU2301 hypothetical protein
MFLTKIIIMLYKIICMRTVSIFHTCLAAVLATVIFIFSTVGFGVACQLVV
PKVVDSSHHYHTDRLCPADSERNDGSSTPASECPLAHGDLPPCCADETIL
PAYCPSFALLDVHDPLRAPPQVYLDWFVPPQNQT
>GSU2440 hypothetical protein
MKRLAIVVRDDAYDRLYTPLTFAYVAASKGIAVDILFVLWAVRVLTEKGA
KAVKIDSRHAAEEEWFKDRVRRDGDPLEIYDFIKLVKKTGNAWFYGCRLA
AATFDVDESQLIPEADGIVDSLWFLEEKAIKADHCQYF
>GSU0131 conserved hypothetical protein
MAEHPPQKRLFIALMGLTCLLIVGGIYLLWWVPTTGLANIHPSLPRVVGI
VLGGLSLLALLGTAVLVLTTALGKDMYFTRFLRGVVIKFLLPVIELLGRA
FGISTDTIRQSFIAMNNSLVLSQRYKVKPDRILILLPHCLQLFDCEIKVT
GDINKCIRCGRCDIKGLAELAEKYRIDISVATGGTLARKVIIEKRPKLVV
AVACERDLTSGIKDCYPLPVIGVLNDRPFGPCFNTSVDIAKIDESLQVIM
G
>GSU3167 hypothetical protein
MPSVDPCWRWSAFGKHPAAADYFRLGEDSPFVEGLAAWVENGYRHLISRH
EATMPFCSWRFWARGFGRDSLVLGVVRLSSDSLGRSYPLLIMGSGPLEGW
EEQWDLLPFACEPSWCQIEYLATHSFDGLKHVGEELRRVRPPGSDWDELA
ERRRGLNRLGSPLDPYASFLDIPSLERLVAEHSGREEFSVRLDRGPVHDK
ITLVSLWHLLTKRTVKGVPHTLFMGGNLEHSFLASFRRALAPGDFLHLWS
APEAGGWHDSIGTGHALDIASIGKEPVCPERPAGEDIRYDPLFDALQAEV
DKLTSPAVAGAVDWERVVHMAADILATRSKDLLVASYLAVGLVQTRGGDG
LALGLTVWRDLLERFWCTLYPTRTRGRQRSVEWWLDRTVIALHHQGNWSL
PPEQHGIVLETLGAIDRFLGERLEDAPSLTKLRQLLADMAPEGGEETAPE
AVVATPAQAEAPATGLFAPPRQTAAPAVEPPCQAIATGSSLQMLEEALRQ
VGEAAGALIQQDPAAPVSYRLSRLAAWGKVTELPPSINGRTRIPPPERQV
LTLLQELASHGDGEALLKAAEARLPQFIFWLDLSRLTAKALCRLGDRFAA
AREAVCAETAAFVGRLPALPDLTFADGTPFAAPETRQWLTGLAQRGASVA
DCPAVGHNDGRAAAIAREVEEAQALIRDGKLLEAVERFQKQLGNGASRKE
RLHWRLALAQLLVNTNRAKLALPHLEQVVADIEAFGLEEYDPALALRGLK
LAWAGFDSQIEPRFKEKAADALHRIARLDPVEMVRLTKG
>GSU1852 membrane protein, putative
MSLYTVIIMQRDRFQRIVITFTAIIATSFLCIASSLLALSVLAGFIVVYG
LVFDKESITHPVYVISGLVGFYFIFGSLNISTYRGEISEHTCLLEYMFLC
SMIIGVLIYDIKPNEHAPIHLKVPNLLLILAGLPAFVGLLWIGLSPGFPL
FDPNLFTKVRGKAYFLSETIFVLFVLVINNIYLKEFSKIKRALIIAGLLF
FISLPGYRGWPIIAILCLCLLSLRYRQKKLFTTLAMYSVLVLGLITGLAY
YRRLHSDELILAELVVQKFDAEQLGVFGALLHFALRESIAISQFLIERYQ
QNVREIHGSLFLSDFMTMFPGSRDSGGIMIASIFGEYSGVGLTPGALGAL
IYEFGTINTFFIAMLIGIILSYFYKISLKYAVPGYSCLYYLIIIYIIHYI
HRGIPKPSYLTNPLLIIFLLTLSKQLSKNIKHLKDIK
>GSU0450 conserved hypothetical protein
MKRSIKHIYLLSDATGETVERVVRAALSQFRDVEARFHRVTRIRSREDVI
WALEEVLREPGMVVYTLVDTELAQLLRDEAEAHGLDAIDLISPLLFKLSD
FFGEAPQKEPGLLHQINSEYHKRVDAVDFTVKHDDGQDPRGLAKADFILV
GVSRSSKTPLSMYLAHKGYKVANVPIVKGIDPPPELYKVDQKRVVGLIID
AERLVQIRTARLRNLGQMPKGSYADYERIEEELEFCRRLYRRNPQWLVID
VTKKSVEESAAEIIQKLAG
>GSU3327 conserved hypothetical protein
MKITREELAVQVDVAQWSWLRAHLERGGLIVVNRGLDLVEVGVSIASDDT
AVVGDWIQTGLLAKPSAEQISAWDANEEISFSALIISPYVLIQEQKQ
>GSU1664 conserved hypothetical protein
MDLCEQSISGKLLQALGEFNRGDWFECHETLEDLWIGSEGEIRDFYQGAL
QLAVALHHWRNGNLGGAMSLLQGGAGYLRRVRPVCQRVDVAGLISAADRL
REELSRLGPERMAEADRSLFPRMVLVAVPGGEGHRVK
>GSU2563 conserved hypothetical protein
MIIDTPLADRYPHAMKNQDESGDGWESLCERCGLCCFEKIEDENGTVFFT
ATPCRYLDVTTRQCVIYDKRFDINPECVKLTEELVRTLPWLHDDCAYRKA
LGVRLSKVRMNGRNKGKRG
>GSU0224 conserved hypothetical protein
MSTGNETLAGRAIGMSELAQYQPGSVVSRTLIDKKIGTITLFAFDEGQGL
SEHTAPYDAFVQIVDGVADITIDGQVHRVAAGQMIIMPADRPHALRAVER
FKMLLVMIRA
>GSU2498 lipoprotein, putative
MRSRFVTHTGALMRAFCGIFGLFAFLLMQGCSHVEHFRDPNMDFGAIQSV
AVMPFGNLSKDTQAAERVRDVFNTKLLATGALYVIPVGEVSRVTGVAGVM
NPTAPTPEEVVKLAKMIKADAVITGMVREYGDVRSGSSMADVISLSLQVM
EAQTGRVVWSASATEGGIGFTDRLLGGGGRPLNDVTEKAVDDLISKLFQ
>GSU3092 YqeY family protein
MLRDRLNEEMKAAMKARDEVRLSAIRLVRSSVKNREIEARRELSEQEVTE
VVSSLVKQRRESIRMFGEAGRTDLVEKEERELQVLLGFLPQQLTREEIGG
LVASAIAETGAQGVKDMGRVMKALMPHVAGRADGSLVSAIVKEQLA
>GSU0019 pentapeptide repeat domain protein
MLRSSLQTHEKEAYPTMLRYLLRPFLSLMAVALICGPAVESVASASGNGG
KKRPLDYEEYVRLITGGPRKVARKESRKESRKSKVEVAAAEKPAAVQQPV
VMAATPAPRRAVTPSPGRDGWRPSPEQVHEILRTSRNLAGAVLRGAVLAG
FDLRGVTLAGADLFGANLAGANLDGANLRGTSLEMVNLRGASLRGANLAG
AGLFKADLEGADLQGANLSGVYAVCANLRGASLAGVTTVGGHFAQATFDD
RSQGAAVAQAQNATRNDIVPVVGGEKALPSGGEKGRILLLNF
>GSU2197 conserved hypothetical protein
MHIVMTIHGFALDSIAQMPVVLLKDERGEVTLPIWINGTDTLYIVAELIR
RDASSSGERKDLLTALLGHLGAEVLDISIDGGKDGKFICSVRLMVGEEEL
RLPVRISEAIVLALKNALPVMVPRHLVEEATAPAGTMGDHFTEADERRFV
DFLEGLDPADLGKYPM
>GSU1056 conserved hypothetical protein TIGR00149
MIRHLEFKSRTRTDMIDITSAVQEQVRSAAVRNGVCHLFVLHTTAGITIN
EGADPAVQRDMVNFLDRLVPIDPYFTHAEGNSDAHIKSTLTGTSLTVFIV
EGKLLLGSWQSIYLCEFDGPRHRRVAVKVVPDP
>GSU0257 conserved hypothetical protein
MPTVTVIAEVRAREEAVDAVRVELLKLVSETRQEEGCLEYRLHQDGDDPA
LFIFYENWQSPACLERHLGSGHFRAYLAAVEGMIAGKTVRRLSELT
>GSU2638 conserved hypothetical protein
MTTALWLLAAVLILIGIAGTLLPALPGIPLVFGGLVLAAWIDGFSRVGWL
PLLVLGLLTVASFAIDPYASGKGARRYGASRHAVIGASIGAVVGVFFGLP
GLVAGPFLGASLGELLARGDLERAGRAGFGAWQGFVIGTVLKVALALAMV
AFFLAAYLF
>GSU1474 DedA family protein
MQEFLGQYLTTYGYGVLFVWTFLEGEAGLILAGFLAFQGYLDIGGVILTA
LGGAFAGDQFYFYLGRWKGPWLLKVFTLIARKFRKALRLIERYGTFVAFV
SRYTYGFRIILPIILGMTTFPARRFLWLNLCSAFLWAVLFSLAGYFFGKS
ASLFVEDVSRYESHLLAILAGLVFCMWLFHFIHARMRRRPARERLRRMRQ
REFD
>GSU1684 conserved hypothetical protein
MAAPAALKRISATVWELPVSYKKGMLVPARIIATEKLINAMDAGVFEQVS
NVACLPGIQKYAFCMPDGHWGYGFPIGGLAAMDPETGVISPGGIGFDINC
GMRLVLTTLTYEEVKPRLRELVDALFYRVPAGVGSHGFVRLSHDEFCRVA
EQGSSWCLKHGYAWPEDLEMTEEHGCFTGADATKVSQRAVDRGYNQIGTL
GSGNHYLEVQVARPENIFDEDTARAFGITVPNQVVIMFHCGSRGFGHQVA
TDYLQLFLSVMEKKYGIRTNDRELACAPFRSREGQDYFAAMKCAVNMAFA
NRQVILHRIREVFSDVFGRDPGDLGMDMVYDVAHNTAKLETHLVDGKKRE
LLVHRKGATRAFGPGMEGIPARYRETGQPVIIGGSMETGSYLLAGDPGGG
DTFFTTAHGSGRTMSRHQAKKLIKGQKLQRDMEERGIYVRTASWGGLAEE
AGQAYKNIDDVAEATELSHLSRRVARLVPIGNVKG
>GSU2551 LysM domain protein
MTMKRMLAASLLLALSLVAPLGVLAAAEEPTVYVIQKGDTLWGLSERFLK
DPYYWPNLWARNPAIGNPHFIYPGQRVRVYPDRIEIEPRTPATPEAAPSP
RPSEEPVAERSFLVSGSEGYLVEKGFKPAGRIITTSQSRVIVGEDDIVYV
DIGADQGAKVGDRFSVYKKLEAVSHPVTNVILGEKMIPLGTLQLTEVEGK
VSKAIVTKSYQEIGPGSLILPWRDKRRLVPLKAAQQDMNGFVVDSQGGNK
AISEGDLVFIDLGKSQGAQPGHMLYVLRDVVPDQQYADISVKKLPPEVIG
ALVVVASGETTSTALVVKSIDTIYRGDRVEVRAAR
>GSU1585 conserved hypothetical protein
MQKDDVAGRITAVAEQVLTPQGLELVEVEYKREGRQMVLRLFVDKPGGIS
LDDCAAVSRELSEILDVEDFIRENYTLEVSSPGLNRPLKKEADYERYAGR
LVKVRTFELLADEEGNRRKTFLGDLVGLSDGVVTLTLREGQLARIPLDKI
AKANLEFEF
>GSU1527 conserved hypothetical protein
MSIYHDVIKSEVFPALGCTEPIAVAYAASLAAERLGAEVETVTASVDPGV
FKNGFAVTVPKTGGLKGNVIAAALGALIARPELKMEILSGADERLLAQAE
LLVSSGRATVALVKERTDLYIDVVVTGGGRTARAVLEGGHTNIVRLECDG
RILLNADEPVSAVDSHAYRAVLRQMTFSEMIGLLDDLDQGDLVYLKRGVE
MNLRIAEEGKQLTKVGHYVEELVRKGFLLADVVSSSKILTASASDARMAG
LPYPVMSSGGSGNQGIVAILVPYNVGMFFHVPEETILRSIALSHLVNAYI
KCHTGDLAPICGCAIAAGVGAAVAIVYQQAGPDMHKIDLAVNTIISDIGG
MLCDGAKGGCALKVVSSTDAAIRAAYMALNGHGISEEEGFVGKSAEETIH
NLSRIADKGMALVDDTMLCIMLQKRSTEP
>GSU0415 hypothetical protein
MATVMVMSGVLFLAPPVSAPPAGAQPAAGEPQAPAAAAAGKSVVGEMALV
EAKRQQLAAREAALAVKEQELKNLSAKLEARVKELESAKAALDRSLDARK
KVQSANYQKMLKVYKALKPAEAAQLLDKMDEGEVLEILNEMDQKRVAKLL
PLMKQERALRWTRHSLAAK
>GSU2257 conserved hypothetical protein
MALSIELFEILACPRCTGEVKPVNNGSALVCEACRLRFPVRDDIPVMLLD
EAERIGDR
>GSU1138 Ser/Thr protein phosphatase family protein
MPVNILFIGDIVGSPGRQALVRELHRLVDHHRVDLVVANGENSAGGFGIT
EETAKELFSLGIDVLTSGNHIWDKRESFSFIGREERLVRPANYPPGTVGR
GSTVVRTAGGVPVGILNLEGRVFMNNLDCPFRAADQEIERLRESTPLIFV
DFHAEATSEKIALGWYLDGKASAVVGTHTHVQTADERILPGGTAYITDAG
MTGSFDSVIGVRKELAVERFVTQMPVRFEVAKKDVRLNGVVIGVDPASGR
ALSIERISLICS
>GSU3078 mraZ protein, putative
MFRGIYETTIDAKGRTSLPAKFREVLVDVHGDDRFVITNSAPVDLGAGTF
SSGLLIFPYAKWVEFEENFRSSKGLTSAQRNSIMRTIISPAVECCADKLG
RLLIPPHLRKGAALERDILFVGVMDKIEVWSQAEREKVRIQDLKNFPSDS
ETVAELGL
>GSU0494 iron-sulfur cluster-binding protein
MHTVAVERAARYDPSEVAEAMDRALASLGGMDTFVRPGERVLIKPNMLAA
KAPERAVTTHPEVLRAVIRLVRKAGGIPLVGDSPGIGGFRAVAEKSGMAA
VVREAGAELVPFDEAVAVPGSGLFRRMDVARPYLEADRLINLPKLKTHEM
MTMTCAVKNLFGAVVGTAKAGWHLKAGADRELFARLLLEIYLLRPPDLTI
VDGIVAMEGDGPGSGDPRPMGLILAGANAVAVDVVAAELAGIPKQLLWVE
RAAERLGIDGWDRSRIATVGLPPDDARVPDFRLPHLSDVQFGIPGFLKNR
LRHHLTARPVPNPEGCHLCGACLDACPPRAISVRDGRLHFDYHACIRCFC
CRELCPDGSLGVRDGVLLKLLKKFKE
>GSU1929 MgtC family protein
MDFTFEMIGRLVLASVLGALIGLERELHGRPAGFRTHLLVSLGACLFVVT
SIEFHRLYANTSGVGSIGADPGRVAAQVVAGIGFLGAGAIIREGTSIRGL
TTAACLWVAAAIGLSCGIGLFAISLFVTAISLAALLLLKRVEGLLSRDTY
TSVKVWSDDLEGQLERIEQILQECRLQVLTMSVERDMTAASLRLTYQVKV
TSRGHACGIMDAVVSVAGVKRVRID
>GSU2276 conserved hypothetical protein
MALIKPFRALRPPKHLAEKVAALPYDVMNVAEAKAMASGNPYSFLHISRP
EIDLPAEMDPYAEPVYEKGRENLERFTAEGILVQEPRECYYVYRQKMGEF
VQTGLVVCAGVDDYESGVIKKHELTRADKEEDRVKHIDYLDANDEPVFYT
YRNDPAITAIVARVASAEPAYDFTTDDGVSHTLWVVDDRAVIDELTARFA
SIATLYVADGHHRSAAAGRVRDLRRSANRGHRGDEEYNWFLTVIFPDSEM
TIMPYNRVVKDLNGFTVAEFMARVGEHFSVSPEEGRFEPKSRHQFGMYLE
GRWYQLVPKQGAFDEHDAVSHLDVSILQNNLLGPVLGIRNPRTDQRIHFV
GGIRGIGELERLVAGGEYRVAFSLHPTSMAELMQLADAGKIMPPKSTWFE
PKLRSGLFVHLLT
>GSU1087 conserved hypothetical protein
MIELFEKVFLTGLGVVSLSQKKVEECLVDLKEKYKVGEDEGKAILEKIQT
MAKDVKGRIEEMADVEVKRAMDRLGLVPREEYDRLVKRVEALEAKAGVGD
PSTEC
>GSU0432 conserved hypothetical protein
MGSPPWSPPASLETQLFERGHEFSFAQAVRLLRLLAAAQGREGACPCGVR
VRPELSLSFPVADVARIERQSDGYRITARFLGLYGPSSPLPTFYTEELID
EEREDGSASRDFLDVLSHRIFDLWVAGDAKYRLFNRVVEDGSGDDLERLF
CLAGLDLAQGQTVLPESGRWLRYVGLLGQVPRSALGLRTMVADALGVPVE
VIPCQHRQVSIPPGQRLAVGIAASSLGIDTVVGTNLDDRLGMFRLCLGPL
SREQFADLLPGAPGRSTLDLIVELYLDAPLAWDVELTLAPGETPEARLGA
CRGARVGWDTWLGVAAGESLPCVVFPGHFP
>GSU0472 conserved hypothetical protein
MYQLSRTQILPVPLTVAWDFFSDPRNLAAITPPDMGFVITSPVPERTHAG
MVVTYAVSPFGGLRLPWVTEITHCAEPSLFVDEQRFGPYRFWHHQHHFLE
VSGGVEMRDIVHYILPFGIIGRLSAPVVAKRLKAIFDYRRDTLAVRFAAL
SRGA
>GSU0985 conserved hypothetical protein
MPMAARTGDMTSHGTPLGPGPGCATVLIGGMPAWRAGSDFHACPLMNPGP
SPHVGGTVAAGSATVLIGGLPAARQGDSIVESGPPNSITLGCPTVMIG
>GSU0857 membrane protein, putative
MDAAVFFTTFGIIFLAELGDKTQLTAMALATRYPWKKIFIGIALAFAVLN
VGAVALGKFLFAVLPIFWIKLVSGGLFLFFGISTLRGGDGDNDGEKGPAS
ARGPMLTAFLMILLAELGDKTQLVTTSLAAQHESPLSVFAGSTLALWGVS
LLGIFIGKQLMRVIPLGTIHRVAGVLFLVFGLVILYQTFTGP
>GSU3422 conserved hypothetical protein
MGTRSKRSERCARCRLHIHRCVCPAMPRYSLATRVVLVMHHREYPKTTAT
GPLALEILPNSELRIHGEPGRSLNLSDLDTPARRILLLYPGDDVPVLDRE
LLERDGRPVTLVVPDGTWRQASRMGRRLPGLARAEMVRLPPGPPTEWGVR
RENHPQGLATFEAIARALGIIESPDVQSGMEHLFRLMVRQTLGARGCAVD
RD
>GSU3336 hypothetical protein
MSISEDRISHLAHRIYDRLWKDDLADFADERQALHCLKEGIASFFAVAGE
VDAAVRRKLASYSQAKVPGSRDYEILYQKFYQEEMAKRKW
>GSU1081 conserved hypothetical protein
MKLLLHICCAPCAIYPVSRLRDSGADVTGFFFNHNIHPYQEYRRRLDTVL
EYADRIELPLEVRDEYRLEEFLAAVAGNPADRCWYCYFSRLDAAAAAAAA
GGFDGFTSSLLYSRYQKHDDIRIAAERAAARHGVAFVYDDYRRGWQDGIR
VSKEMGLYRQQYCGCIYSEKDRYHPRQGKK
>GSU0981 conserved hypothetical protein
MALEKAKLVNADTGEEVAVLFNPTEYAVEKGNQYAEIAIPGLEAPLLQFS
RGTARTLTMDLFLDTSETGQDVRVHTKRITSFLDIDPETHAPPVCRFVWG
GGESFTGVLERATQRFTMFLADGTPVRATVGVALKEFRTGLNREKPLQSP
DRTKVRTMGEGDSLWLLAAREYGDPAQWRFIARESGIVNPRRVKAGTDLV
IPPIE
>GSU1769 conserved hypothetical protein
MALVVIAVIAAVFFLDRAYRKQPVVQQTPPPVVERHKLPPREPEQPVAHE
DYTGVIHHPAEPQRAQRPTGPGTLAVIIDDMGKGLPEARALMDIGIPLTF
AIIPGLPKVRRVAEEARQRGIEVMVHLPMEPKGYPERRLEANGLLLSQGD
DEIAGRVNGYLNEIPQAVGANNHMGSGFTENRQKMAVVMGVLKERGLFFV
DSKTSPVSVGDAVAREMGVRTAVRNVFLDNIQESGYITNQLRQAASIARK
RGNAIAICHPHPATIQTLAVELPRLRDEGITFVTVSRLVR
>GSU3176 lysM domain protein
MSKTFEDFLTALAQKESSRRYDIENYSGYLGKYQVGEYALIDAGYYLHDG
TGARKNAEGKYVDNDWIGHWTGRKGVKSKVDFLSAPEAQEDAIRHHVANL
WKQIKALHLDLYEGTTVNGIVITKSGMIGGAHLKGVGGLKSYLKSAGRNI
PKDGNGTSIEHYVSTFGGYDIESILKEACGATDNAENHRAMTTVGQSRSK
TKLAKKHVGNGGNAQHYVVRPGDTLSRISRMHNLSVAEILTVNPSITDRN
RIAVGQKLVIPSDKTASHSYGKTPYAPASHPQVGSTPRQPWWGETFVGGF
RKKWN
>GSU3277 lysM domain protein
MKSALRVVSASLVLSMASLAAGGEYLLYTPGASEGSRPSGPDEGVLVKSI
TVQKGDTLYSLSRKYNGKGGYYPQILLFNEIKNPDLIYAGNKLMVPVAPA
GPKAVFSQQPSAVPAAAAKRESAVRPAREKVAPVAAHREAKRESTVTTSV
RTPEPAAATPPVPSRPAAEAHRPQTGKADDNEHSLYEKGVSAYKSGAYSQ
SVELFERFLARYPSSPLVPDATLYRADAFLKMAGQ
>GSU3000 cbiX protein
MKTAILLMAHGSRIPEANDAVREIAAMVKEMTGFEIVEVSFREQHLPDIQ
QGIDACVAQGAERVLLMPYFLFVGAHVQEDLPEEMAEARTRYPAVEFAMG
GHLGVHRKLAEVAADRIAEALAATGWR
>GSU1680 conserved hypothetical protein
MPYRYLPDIATADVAFEAWGETREEMFCAAADALTNVMVDDLASVTPTEE
IVISLANEELDLLLFSFLQELIFLKDARCFLLRVPRIMITETEEVLHLDA
IARGERIDNERHPMMVDVKAVTLHLFSVWRQENDWWARVVLDI
>GSU0485 conserved hypothetical protein
MSAVNPLPMSLQPCRKALMIFAKRPMAGRVKTRLTPPLSPGDAAELYRRM
LLDILAKCARMVGVDLLLFYEPGEGSGRFFEEAAPGWACRPQEGGDLGAR
LDSAFRLAFGEGYGEVAVIGTDSPDLPEEYVRLAFDLLDHRAVDAVYGPS
EDGGYYLLALKQHRPELFRDIPWSTGEVLEHSLARAKAVGVRVELLPVWY
DVDSIADLWRPGLAGAGTVAPLTAEFVSGLISAHPPDTPPPAGER
>GSU3402 hypothetical protein
MKKMAVLMLTAFAFSATVPTFAAEMSKEEKDMCLLASKNCAGEVDSLQKK
VKKLQAEIKKGKKVYTAEELKKLEQKLKEANEMVDVLLKQGGGGK
>GSU3088 YbaK/EbsC protein
MAKDKAPVTPAIRQLRAAGVTFTDHVYAYEEKGGTAVSARELGVDEHCVV
KTLIMEDEAKRPLIVLMHGDRQVSTKELARVMGVKSVSPCSPDTANRHSG
YQVGGTSPFGTRHAMPVYMEESILGLPRIYINGGKRGYLVGLDPCEACRL
LDPILVRVAI
>GSU0899 conserved hypothetical protein
MQAGNPMSLQSMLPYVTPPVVGAVIGYVTNDIAIRMLFRPLKPWRVFGIR
VPLTPGVIPAGRHEFAATIGRMVGTHLVTGTDVARALGKDSFRRELQEAV
SGKLDTLLDRDLGTVGSLLPEGFESWLRDGVDLAARRAGETVAGYLQGED
FRRELSLFLRDLEERVLPRELERILAGETGDAVRAGAGRALAALLQSRGV
SRAVAGLVDDKLDELLASERPLRELLPPELVELATAQAREAMPVILGRVA
EFVRSPEIRDTFEAKLREGVQQYLANLKGTLGFMAGFISVEKLNSYFPGL
VGSVTDEVVRWLGEETTHRRVAAMVDQGLGRILDRPLAATVEQLPYRRVA
LLRRSIRRGALGVVRRPETAGFLLALAERELVRLAADPSRPILERLAPED
DPSRLREGLVGWLTDRLRDPGIRATLERIVAGRVEEWLFNRPLGRLSARI
SAEVRRDLDQLIYRQVADLLEREAPPLIDALDVARMVEEKINTLDILAVE
GLLLSIMQKHFLYINLFGALLGFVMGSFNVLLMGLGR
>GSU1168 hypothetical protein
MCSRDEYPWDFTMTRFIAFCAALALALVLTPSPVTAERARDVLAQMNAAR
TDPQGYAEHLREFRSRFRGRNYTVPCSRTRIVTHEGTAAVDEAIRFLLRQ
RPISPLAWSDGLARAAAAHVGVQGRNGETGHGEGQGGMRARIERQGTWKK
TIGENIGYGPDNARAVVIQLIVDDGVPGRGHRKNIFDPAFAVAGVACGPH
PVFGTVCVIDFAGGFSD
>GSU0384 conserved hypothetical protein
MSEQGAVCYTFEAAVEMAITMEEEGFRHYLDAIRRVKNKGAKQILKEAAL
DELEHKLSLEKALLEGQMEGAGSMERQIPTMNLGYVLAKKELSPESDARE
ALAYAIHLEKGAIDFYQRMAQGCAGAPMAKLFDRLLADETKHLQQLEDMY
EQHFMTEN
>GSU2488 conserved hypothetical protein
MTPFPAYVPPDFTRPDLAAAPPVRVAEAPGSGVLPAGFHATSNYPEYVHL
GGGRWLLASGSRMDAVLVLDGDMLRVVEPRLVRRGDRVVVGRTENGEEGI
YVHTTGFDAPAGAGGDKFTFRSRGTRESPFSRSYEELYDILRHDRDHGHI
VWVLGPAVAFDRDSRAAMAGLIGGGYCHALLAGNALATHDLEAALFHTGL
GQDIYTQALVAGGHYHHLDVINEVLRHGSLERAIAELGIRDGIIRACLEH
RVPVVLAGSIRDDGPLPDVITDSRQAQDAMRAHARRATTVIALATQLHAI
AVGNMTPSYRVLDDGTVRPVYFYIVDMSEFGADKLANRGSGQARAILTNV
QDFMVNLWNNLKG
>GSU2679 conserved hypothetical protein
MSNLVLFLLMMCGGVFIAVQPSINARLAQKTGVIESSTVSFAVGALALLI
VSILVGRGSLRGVAAANWWELTGGLCGAFYVTLVIFAVPRIGTAAAMAAT
IAAQLATGLLLDHYRLFGYQGAPFDLKRGIGVVLLLIGAALVFRR
>GSU1207 HesB/YadR/YfhF family protein, selenocysteine-containing
MTITDAAKAVLAPIVGEHPGKILRVVFEGFGUGGPRLGLVLDEPADNDAR
MVLNGIEVAVTSNFRSLLDDQILDYITNEQGEGLVFRRESGDVCC
>GSU0868 conserved hypothetical protein TIGR00159
MFHLIRWQDIADIIIMSFLAYRLYSWFRHTRAMQVLIGLGILAGVYFVTR
NLGLFMTSWILQELGTVLFVLIIVVFQAEIRQALYRFSLLRTFIGRQEGG
GELDLAELGRTVFGLARERTGALIVLQRQEALDDYLLHGVKVDGLPSSHL
LGSIFRNGTPLHDGAVIIKDGRVSQASCHLPLSMKTELPQNFGTRHRAGI
GLSERSDAVVIIVSEERGEVGMALAGEYRKIASPEEFAEVIQGLLYPQRP
ENVAFTLRQRLLRNLVPKIVTTLIVIAGWLVVTTKEGGIFTVTVPIKFHN
LPPRSVLVKSVPESVEVQLKVFTSLIPSPRQLDLVADLNLAGVHDGVNSL
AVKDDDLNLPLGVVVTGINPPMVKVTIAGKERKQLRVRPKLTGQLPGRAK
LRSVTADPDTVVVEGPGHLLEGLESLPTETVDLAGLRRGGVVERRVVSPS
PQIRVLRDEPVRVVVVTSVK
>GSU2914 NHL repeat protein
MVHRSPGRSLFTAFSLALIVISLVSSAFAIPAPGVATHAPITEGIRSPLR
IAGDAAGNFYVTDALSGGVLKYDNSGYLTDVIRTVKSAQGVAVASDGSLI
VSQGNGVVIVDGAGSVTGQLGIGAGQFKMANGIAVDDTGYIYVVDSLDNC
VQVFNPAGGFVRRFGTFGAAAGQFSTPTGIAFEKRARHLAVVDTRNGRIQ
FFDTNGTFVRSIGAFGSGPLKFTAPQGVAFEYSNDPTPVLKRMYVVDTFQ
GQVQVVEPAAVPVFLAYIGGYGTTNGKLMVPSDLRFDQANGRLLVVNGYG
NLTVYGIDGGGLTTDTIPPALDIDPLVSPFYAPSLELHGTVEAGASVTVT
AGGRTTVGTLSFPTPTTWRVSLAGFAPGETVLTVIARDAAGNTSTKTASV
TYLQQAPYLTVDSVPAAVTNVFAQHISGAVEPGCAVTVTNAATGITANAT
VFGDTWSHTVALAPGVNSVTVKAVRPLSAAALAAFSAILDTAAPVLTVSA
LADGSYTSEQVQNVRIEASDAHPGEVAVNGRPVTMTNGSGSTAVTLSPGA
NVITVAAADLAGNLTVNTRTIFYDCDLPVVTFTSPADGAFVAVDHVTVSG
TVDQAATVTVAGHPARFDGSAWSADVPLVQGLNTIEVVAVDFAGNTTAVK
RTLTYDAGSPAIVITSPAQDMAVNRQVVGLVGSANDTSPVTVTADVDGVP
VEVSTVEGSFSLSVHLGEEGAHAVTVRATDAAGNAGVVTRTLIYDVTPPV
LTLNEVNTVYPAELSGTVERGATVAVEDHSGTVGEVVITEGAWHAVLTLG
SYDPHSLAVKATDAAGNSTVRSLVVRAPDGDVDGDGRITVADALVALRIF
TGQLSPSASHLASGDIGPLYQGKSRPNGVIDLVDALLILRKALGLQSW
>GSU2183 fic family protein
MALNLKKLPILFLSDSTTSVAISREAKAGRIRKIGPRLYTSNTSDDPAYI
IRQNWLQALTLLFPGCVVSNRTALESRVSPAGRVYVTGDYARTLELHGTK
FVQVKGSPPVEGDTPLLGIFMASRARALLENLTPSRERSGGELKNLPREM
IERRLAELLNVEKEDSVNRLRDQARQIAPLLGLEKEFDELDDLIGTLLRT
RKATLADPVAKAHQLGTPYDPHALERVETLWSVLASMPHNYRPTNAGTGT
PFYTVSFFDAYFSNYIEGTRFKVDEAKEIVDSGVIPAIRPADGHDILGTY
RVVSSLENMRRTPKNPDEFIELLQSRHADIMVGRPDKRPGEFKEEVNYAG
ATRFVDPDLVRGTLSQGFSLYKSLEHPFARALLIMFLVAEVHPFDDGNGR
AARAMMNSELITAEETRIIIPSVFRNEYVASLKRMTNHLQPESFISVMSF
AQEFVSRISFDSYASARVQMEGCNAFDDPADDKRLLQPSIS
>GSU2437 conserved hypothetical protein
MDKPAKNTICLWYDGDAEDAARFYARTFPDSSVGAVFRAPGDFPSGKKGD
VLTVEFTVMGIPCLGLNGGPEFTHNEAFSFQVATVDQDETDRYWNAIVGN
GGQESVCGWCKDKWGISWQITPIALTKAVTDPDTAAAKRAFDAMMQMKKI
DIAAIEAAYRG
>GSU0751 conserved hypothetical protein
MADQAVLQKLFQDGGARDYRWIDPEEIVVGQWVRMKCMFGCSDYGRNASC
PPNTPPVGECRDFFREYRLGAIFRFTKQLDDPEERHGWSRELNQKLLELE
RAVFLAGYPKAFLMFMDNCKLCRECARTRAGCRNLKHARPSAEAMGVDVF
ATVAKFDYPIDVLGRYTDEMNRYAFLLIE
>GSU2493 NHL repeat domain protein
MKAPEGIACGKRQLVVSDTGGGRLLSYSIENGSVRGGTEIKAEQVPYPTK
VQLTSKGEILVIDGKLRRIAHFSPEGSFAGYLDLKGVPAPATIAPRGIRV
DDKDNIHILDIFGARVIVTDAAGTFQSQMPFPEGYRFISDLAVSASGTTL
LLDSVASRVYAAARGETAFKPLTDGMKEYLQFPTSIATDKQGKIFILDQT
GNAVVTLGQDGSFQGRQLNIGWKNSFLHYPSQLCISEENDIDVVIADRNN
NRIQVFRVVR
>GSU0173 conserved hypothetical protein
MAPDRSGGPAAAELIAGLGLERHPEGGWYRETYRAAGTIPGTLLPGQVGG
ERSFSTAIYFLLERGDISALHRIRSDELWHFHAGAPLIVHVITPAGGSYA
LTLGSDPASGETFQAVVPAGCWFGAETTGDYSLVGCTVAPGFDFADFEMG
SRADLLGRFPTHAGIIRRLTRDGD
>GSU2331 conserved hypothetical protein
MTHGLLVFLALLPFFLLLDYLWLGRLMRGFYLRELGDLARSEGDAIKPRL
LAAAGVYLALPGGIVLFALPRVDPARPLVSALGWGFLYGLVVYGVYDLTN
RATLSEWPLRMATVDICWGGLLCAVSTLIAALLDPLLP
>GSU3204 conserved hypothetical protein
MPSFDIVSKVEMQEVDNAVNQTVKEISQRYDFKGSKCEIKLEKDAIKLLA
DDDYKLKAVVDILQSKCIKRGISIKSLQYGNVEPASGGMVRQAVDIQQGI
SKEKGKDIIAVVKESKLKVQAQIQDDQVRVTGKNRDDLQDVIKLLKGKDL
GVELQFVNFRD
>GSU1071 conserved hypothetical protein
MAEKQYDWASIAKNPKFVELHRKKTTFLVGWWVFSTVYYFLLPIGAAYAP
GLFKIKILSNINFGYLFALSQFFVSWGIAMYYAHVANKDFDRLTRELVDE
LK
>GSU1160 hypothetical protein
MKRLLLIVMVLFVGVVPALGNGAGEKVEFRATIGDDGVQRVRIVGGEYYY
RPNHIIVKVNVPVEITATNDSRVVPHDLIVKAPEAGMDFRIDLKKDAQAV
RFTPTKVGVYPMYCDKKLLFFASHREKGMEGVIEVVE
>GSU0747 conserved hypothetical protein
MSPHDPNPKRPRYPMPDFFRQALEEHGLMADYLARPAYQRNDYIGWVNRA
KRSETKEKRLHQMLDELEKGGVYMNMAHPPSRKDKE
>GSU3166 conserved hypothetical protein
MMKSLKWLLLASVVVLTVFLVAGIVLALDWPWWVTFCLLLLLAGIAAGAI
LLRSLLLRRRERHFVREVIEQEKDTLSTLSDGSRTHLDQLAQQWKEGVGA
LRGSHLKRRGNPLYVLPWYLVIGESGSGKSTSLASARLPSPYTDSRRSAD
DAGTRNCDWWFFEESVVLDTAGRYAVPVNGERDRDEWQKLLALLVRHRRR
EPLNGLVLTVAADRLLAGGREENAEEGRTMRRRVDELMRALGVRVPVYVL
VTKCDLIEGMNCFAELLPEKALRQPMGMINRDLTANVGSFTARTMETVTE
RLRSLRLQILHRPEARDATPALAFFPEEFAGLREGLTAFMEGTFRENPYQ
ETPLLRGVFFASGRQAGSPRSRFSGTLGTVAGPQPLPDTSRGIFLHDFFA
RVLPADRALLTPTQRSVQWRTVTGNLGLLSWLLVGLALCGLLSFSFVKNM
ATIREVSRQFERTPPINADPVNNLLVLESLRQGILRVEEQNRAWWVPRFG
LNESRRVEAVLKDKFCNQFRDGFLAPYDRHLAEELNGLSAATPDELFARY
AIHLTRRINILTASRAGRGLDELRALPQPASLSFMAAETPSVADAHKRFG
ELYLHYLAWRSDSPDLAREAAQLQGWLRRSLALKTDSLSWLAPWVDRHGG
LSPVTLAEFWGGSVPAPDEPTITPSFTGKGKDRIDGLMRELETAVGDSRL
LAKARAGFVPWYRALCLQAWQGFAASFPRGAERLRGSGEWQQAATRMATE
HGPYFAFITRMTQELHTLAGTEGLPPFAAQLFAFQIARTGGAVSHDAAAN
VTAESRRLMASIRRRLGNEAAASTIDPTFMASRSWQEYQAALAAIAPAAS
SRQQAFQLASQTFTDDPATGKTPFHAAWNAAARLKRDIAGAGADDAFWRL
VTGPLDFLWGYVRREAGCQLQTLWEEQVLAPTLGMPPQQAGPMLLGADGL
AWRFVKGPAAPFVRGTAAGYAPRQALGGALPLEGSLFAFLGKGAQLNAMA
SGRQSNYTVAIKGLPTDANPEARIKPHGTRLELQCAGQAQALANHNFPAG
KTFYWSPDSCGDVIFQIEAGDQVLTKRYAGPQAFPDFLREFASGHRTFAA
GEFPGEKDALARMGITSIRVSYQFTGSQQVIRQGGAMAGATAPRMIARCW
>GSU3342 conserved domain protein
MERLGASGLSPEREERFLRELNDTRTHELPEPPLTGPPVPGIYDAHDLME
LQSMAQPQVSHTTRIRSLDELLERDRLREEDGLPRKIRIGKLIKPGAGGK
EKIVVVPTTVEEKLIHDRAPEETEEDESMGGTGDGDEGEIIGEQPVRPQQ
EGGSGTAGHGEGEGHELESTAYDLGRILTERFDLPNLKEKGKKSSLSHYS
YDLTDRNRGFGQILEKKQTLRRILETNIALGTVADVAEIDPTRLVISPRD
RVYRILSRELEYESQALVFFIRDYSGSMEGKATEAVCSQHVLIYSWLLYQ
FARQVETRFILHDNDAREVPDFYTYYNLRVAGGTRVAAAYRMVNEIVEKE
SLARDYNIYVFHGTDGDDWDTNGEETIPELRRMLAYANRIGVTIAEHTYG
SSGNTEVERYLKRSGLLEEKPELLRMDVMGEDAEESRIIEGIKRLIS
>GSU1832 conserved hypothetical protein
MSEGETLTLGLEGAASPYTVRLEVFEGPLDLLLHLIKKNEVDIYDIPVST
ITRQYLEYIRMMKELNLEVAGDFIVMASTLIQIKSRMLLPAPDEEPGAED
EEADPRAELVRRLLEYQRYKEAALTLSERELLGRDVHVRAAAGDEPEPAP
EEEPVEVELFELIEAFRRVLDRVSQESFHEVGSESITIAERINDILTALE
GKESLLFDDLFPEGSNRDFFVATFLAVLELCKLKMVRVVQASRYGSIWIA
PAVSDSADAAGELTDAPA
>GSU3177 Rhs family protein
MTIQKAAASLLSQGSTSSFQFEIPATRHLLSVAGFTVDERISHPYDIHLT
LATKDNVDLDEVIDKEAVLSVDHEGGTRYFHGVVREFTSLGTDGDYDLYH
AHIVPALWFLSLEQDCRVFQFKNVQDIVAEILEESNITSDRYRFALSRED
RLRKFCVQYRETDLNFVSRLLEEEGIFYFFEHYEDKHVIVFSDTGSGYLY
MPGKRQIPFNTNDGMVPGKESVFDFIYSRRVRPGKVSQRDYCYKHTNLDL
TTQRQGKVSAQREVYDFPGNYFNEERGTYLANVRLERLLVLGATAEGQSS
CPRMMPGHEWELSGHDYAGKYLPVAVIHHGAQPQVLGEHAGDGGFRYDNE
FIAVPAAVTVRPQIVAGRPAIVGLQTAVVTGSPGEEIHADPDGYLRVKVQ
FPWDRRGRKDGRTSCWVRVGQPWGGGGWGTQFLPRVGDEVLVTFLEGDPD
RPMVIGSAYNSENQPLYALPASKTQSGIRTRSYPNGGTDNFHELRFEDKK
GSEEIYLQSEKDWNILVKNDKGQTVGRNESLTVGNNRSKTVGVDQSESVG
VNKSIQVGANHNESIGANMTLSVGGFKNETVGINSLETIGGAKELAIGGL
YQVAVGGVMNETVAGAKTEEVGLAKAVFVGNNMSENVKGDRTTNTNGNYT
ETISAKYYAKADEYVIEAPKITLKAGSSSIVMDGSSITIKASKIFQN
>GSU0647 conserved hypothetical protein
MTCGECSRPAVSVALVHYPVYDKNRQVVATAVTNLDIHDISRSARTFGLN
HYYLVTPVEGQKELAGRIIRHWREGWGASYNPKRKAALDLVRISNSIEEV
LEELGNEYGAPARLVTTGARQHPRSIGYEQMASIMAHDREHPYLIVFGTG
WGLTEDFFDRADFVLAPIQGPGEYNHLSVRSAAAIIMDRLFGVR
>GSU0504 conserved hypothetical protein
MTRFIGEKILMRIFIGEGDRWGSRPLHEALLELLRREGCAGATVLRGVAG
FGASSVCHTARLLDLSADLPMVVEVVDDQERLDALMPKIDDMMTGGMITL
EKATVIRYTPAGKGGAATP
>GSU1642 conserved hypothetical protein
MFDIDVQDAIFKSIQTEKNAMNFYQFGAGRMKNPDAVKVFELLAREEREH
AGHFHKIYQRNDIPDLEAFLNTPPDHASDWMASLAKTIDADFTEQKAMEL
AMEKELGLEKALRETAARIADPQVRAVFELNARETRNHYEMIESEYARLM
AMVHESDMDTYVRE
>GSU1091 lipoprotein, putative
MEKVTKMISRMMVRVVTVAFALSLVAVACHEAEARAGGGRSFGSRGTRTY
RPPTRTVDPAPASRPQPAPAVPQTFPQQRQGGSFLRSMAGGIVGGLLGGM
LFRSLGFAGPGAGGVGLFDILLIGGILYLVYRFVASRRREAAATAGRAPD
WSREEPVTPQQYRYGQPAESVPAAGADSGLAHVRQMDGSFSEEQFRQLAQ
DVFFRVQGAWTRRDLSSAREVLAPEMERALQGDVAELKARGQVNRLENIA
VRQVELVEAWQEEGCDFITVRFLANLLDYTTDEEGRVLSGSDREPVKFEE
FWTFTRPVGPNPWKLSAIQQA
>GSU1802 YjeF family protein
MKVVSGETMQRMDRRAIDEFGIPGLVLMENAGRGCADAIREMFGRDGCIP
VLVVAGKGNNGGDGYVIARLLAGEGWPVHTVVLARKDEIGGDARENLDRL
DPSTVSYLPAGGTLSSLTARLDAAALVVDALLGTGLKNEVQGAYAEAIRH
IAASARPVVSVDIPSGIDAATGKVLGVAVTASLTVTFALAKYGHVLYPGA
LHCGRLRVVNIGIPESVAREADGILYVDAAEAAAVVKRRDPCSHKGSFGH
SLVIAGSVGKTGAAAMAANSAVRSGAGLVSLAVPASLNAILELKTTEAMT
IPLADGGVGFLGDESLVPLRDAIRGRDAIALGPGLSWQPATAALVRHLLA
DIMVPLVLDADGLNAISEQTELLKGARPDTVVLTPHPGEMARLAGTTTAA
VEADRIGVARDFAAQFGVYLILKGARSVIAAPDGRIALNGSGNPGMASGG
MGDVLTGVVTALLGQGYEPFDACILGAFVHGHAADLVAADKGETGMSALD
VQERLPYAFNSLIRLKGEQ
>GSU2631 conserved hypothetical protein TIGR00149
MKHFRKELWFETRQRRQFINITPTVRECLRESGIREGLLLCNAMHITASV
FINDDESGLHHDFEVWLEGLAPEKPYDRYRHNGYEDNADAHLKRTVMGRE
VVVAVTAGELDLGPWEQIFYGEFDGKRRKRVLIKIIGE
>GSU0354 conserved hypothetical protein
MQSTEASAHRELENTIVRLLKRRPFYGQFLLAMRREQRPGTFPLGVTFRD
GVPVLMVNPHRLEAESPAIREGLLEHCVRHVIHLHMVRRKGRNGHDWDVA
CDLAINPSIEHLPADAPLPVHFSADDGLAAEEYYSLLSNPFDAGSLEGQG
TGRASRDEGGATGDGCDRDLNVTTVDDHSAWEEADSTPFRLAEEVVRGMV
REAWRQADGHVPADIRRVIEGMLAPSPIPWQQVLRQFVAAAGRTGRETSW
LKEHRRFVHMTPGIRKRRRLNLLVGIDVSESTDTVELREAFARELVRISR
GRDAQVTVLYANSRIQQMESFRGALGLTEAYYGGGFTDLRPVFEHARTMI
PRPAAVIYLTDGVGPAPEQMEFPTLWVLTRAGEKPVPWGVELRLEV
>GSU2505 NHL repeat domain protein
MRQIGNRRVAPLAILAAVLMGGPTNASTGEFKVAYLYNLSDFTGTIPYSL
AKIALDATARETYVISGETVKVFNNSGMEVYRFTNTMESGIVYDAAIDEQ
GRILVLAYNNGTPSLLLCNYRAEPIKPIEFKGLPEKLAAFKPNRLMLRDG
KICLVSLGSMMVVMLDPEGNYVKHIDLAEASSVTEEDRLNTGIGAVAFDS
DGSVLFTSPVTGKVFRISANGTVESFGKRGSAPGKFGVPTGIAVDGLGNY
YISDKLRCAILVFDRQFKFIYEFGGRGDAPGSLVGPDDLAIDAEGKLYVG
QLGNRGVSVFKVTHD
>GSU0757 lipoprotein, putative
MTRILAVALLLTTACAQAGGGLLSAPTTVVPCSAPWNQAVEESVPTGDGQ
GHGPDIGSEEWKSAIEFRLGVRGNPDVPDRTGDAWCGYIDRLVRERAPAG
ATAPEGPSYDCDTVTPGSIEALVCGDRELSALDRKLADVYAAASARAVNE
HPPRLGAEQRGWIKGRNDCWKSDTVRECVRDEYLRRTAELQATYRLVPGI
GPVRFVCDGNPANEVVATFFQTEPATLIAERGDEVSLMFVQPSGSGAKYQ
GRNETFWEHQGEASITWGYGAPEMRCVKAP
>GSU3085 conserved hypothetical protein TIGR00486
MPLMMTPKVSDILGIINKFAPPVLAEEWDNVGLLVGDPTCAVSRIMVALD
GTRETVDAAIAADCQLLLTHHPFLFRPLKRITANDPTGATVLRAISGGLA
VISLHTNYDIADGGMNDLLAQRLDMCSAHPLSVTATEELVKLAVFVPLGH
EEQVSEALFRFSGTVGTYRDCSFRSGGTGTFRPLEGARPFLGTVGVREQV
AETRIEVLVRKDAVSSAVSALLKAHPYEEPAFDLYPLLNRGAAQGLGRIG
YLAEETTLALFADAVKAKLGLAGVRLVGDAGKRVKKVALCGGSGMSLLRD
AHRQGADVFVTGDVKYHEARDAEALGIALLDAGHFGTEAIMVSGVAERLT
ADLVLKGFEADVVAFNGERDPFVWR
>GSU3225 NHL repeat domain protein
MEQSVPARGTSPARRLATAARHCLVLAVASILAACTTITAVTPNEPEQRL
VWPGPPLQPRIEWVREVYNQKGLGVSPGFWGRIARFVLGEKEERFIRPHG
ILADEQIFALVDSGAGRVHLIDLKRGTYRLLPEEGKTPMVSPIGIARDSR
GAIYVTDSGTGLIHRFSDDGDSFVALDLRPLHRPTGIAFNPVTGLLYVAE
TGAHRIVAFDSAGKETLRIGGSGMEPGAFNFPTDLAVMADGRLLVTDSLN
SRIQIFTADGKPAGSFGEAGDTPGRFTRPKGVAVDSEGHIYVCDSQQDMV
QIFDETGRLLLAFGDKGSLPGQFWMPSGIHIANDMIYVSDTYNQRVQVFR
YLKEEPWGQDPHTPD
>GSU0507 membrane protein, putative
MLNELILTAVAYIVGSIPTGLLLARASGVDIRATGSGNIGATNVYRTLGR
TVGIATLLGDCLKGLVPVLVARKLGFADPWVAAVGLAAFLGHVYTIFLGF
KGGKGVATALGVFLGVSPLSVLGALALFIGIVATTRYISLGSIIAAAAMP
LFVAAVERRPLLVGMTLVIAVIVIVKHRENIRRLREGTENRFKA
>GSU0476 conserved hypothetical protein
MEQIDDSRLRGIIHDRIKERGGRIPFADFMAACLYEPGLGYYTSPGRKVG
AEGDFYTSINVHRVFGRLIGREICRMWEVMGCPAPFTLVEAGAGHGRLAA
DVLDAVRELNPELYASLTLRLVEAEPSLAEAQRQVLAEHLDRVAWNDPAE
LMGGTLTFTGCLYSNELIDSFPTHVVEMTPAGLREVFVTADGDGFAEQLD
LPSTPDLADYFRRIDVNLQPGQRTEINLNACRWLEGVARCLERGFVLTVD
YGFLSPELYGPMRQNGTLLCYFRHTIQEDPYQRVGHQDITSHVDFTTLIL
RGEELGLHKAWFGEQYRFLMAAGLMEELMALEAAAATEEERIKIRLVLKK
LVLPEGGMGDTFKILVQAKGVENPRLLCMRDWSKLF
>GSU2558 conserved domain protein
MTSFTAFALPGELDLNRLAADLGFPRRYRWEEPMVLDQASLKPLSGDQGT
TKRVYLYFFGGVVFVNCTEEEARAFFWSMAHYAEPFKSVPDEKYRDDYAL
ALGESSGPAVTNDLATMPVYDPAYVDTICFVIAKSVALERIEERVDQVLD
EMETVIGMLDRGKLGISDRRLAKLAANVLTYKYQSIAHVMVLEKPEFTWE
NPEADRLYLTMANVFELNQRYNEIKHKGETLLDITEVFTSLAHARRASRL
EWTIIILIFIEIVIYLFELAR
>GSU0181 lipoprotein, putative
MTFTRHRMAARFFPVCICCMLLLCGCSHHRAASIFQEANDLFSQGNYSAS
LDAYTRIGETHPAAKDRVLFEKGFIHAYPRNEHKDYQKALECFEQLVREF
PESRYRQDSERMIFGINTVVLKDGTIAAQQARIEALRQDLDDRNKDIAAL
RETIKILEQKVFAIATRKGAVDKILIEKKDRRLSLLSMGEVIRTYKVALG
GNPVGPKERQGDNKTPEGSYVIEGRNKGSRYHLSLRISYPNEKDKKRARE
MGVSPGGDIMIHGIKNGLSQVGAAHAEVDWTQGCIAVTDEEIEEIAEAAP
NGTVVEIRP
>GSU0095 conserved hypothetical protein TIGR00103
MSKGLAGIMKQAQMMQQKMAKLQEEAANRTAEATAGGGAVTAVVSGKNQI
VSLAIKPEAVDPEDVEMLQDLVVTAVNEALKKVQSQFAEEMSKVTGGLNI
PGLF
>GSU1184 conserved hypothetical protein
MNILAHLAFSGDDPEIMAGNLMGDFVKGPLAGRYPPRLTLGLELHRAIDS
FANGHESFTRSKRRLAPSFGHYRGVLVDVYYDHFLASEWERYRAEPLQSF
ITRARAIAIGFASLMPERLVQLLPPMFDEWLPSYAESAGIGRVLRRMSAR
VGRPNPLALGEGELLRCYRELRGDFLQFHPALTTFVVDFIARRE
>GSU1185 conserved hypothetical protein
MNLEIHTIKVDIPQDCNVILGQTHFIKTAEDLYEVVATTVPQARFGIAFT
EASGPCLIRTEGNDEELIRVCVRNLQAIGAGHVFCVLLKEAYPINILNQI
KNCPEVCRIFCATANPLQVIVASTSQGWGVLGVIDGHPPKGVETDDDRQH
RRDFLRAIGYKR
>GSU0269 conserved hypothetical protein
MTSPNQPENEGVVPIELILARLLRIGSIIAAILLAIGIAATLLTGAAYAP
RFITAGLVVLLATPIMRVLVAGLVFFRERDWLFTLFCLVVLCSLAAGVLL
GQVG
>GSU2231 conserved hypothetical protein
MVKIVKVQFHTAGKLYDFGSGDLDLKQGDRVIVETERGRSIAMVVTPPRE
YEDTHVPEGLKNIVRLAEPSDLASAARNAAKEQDAYHFCLRKIKERGMDM
KLVKVEYLFDGSKAIFYFTADGRVDFRELVKDLAHQFHTRIEMRQIGVRD
ESKMIGGIGICGRELCCSSFLRDFEPVSVKMAKEQNLALNPTKISGQCGR
LLCCLGYEFETYCSLKKCLPKCGKIVKCGTAEGEVVKQNILEGTVTIRTE
EDREMVVKGEEIKPENIFDRPKAPRKEGGREKDQKSPQDGERRERPRDRD
KERKEGSGGERRERQEREQAPPREEGNNRERRGGKGRDRDKKEKK
>GSU0141 conserved hypothetical protein
MKILYFDCFAGIAGDMTVAALLDLGVPFEVVRDAVGCLRLPHSSYSLATE
RTSRKGITATRFVVHVEEHQPNRHYGDIAAMIEESPLADGIKEKAQRIFF
RLAEAEAKVHGVELGRVHFHEVGAVDSIADIVGAAAAIDWLGIESIHGGA
LPLGSGFVETAHGRLPVPAPATAELLRGIPVHGEAGPGERVTPTGAAILA
ALAAGFGPIPPMTVTGVGCGAGTRDFADIPNILRVFQGEIDRGFERDDVV
VIEAHIDDTSPEILGYVMERCLAAGALDAAFSPLQMKKNRPAVRLTVVVH
PEQRDELAALILRETSAIGVRFHPAGRLKLRRLVEERDTTLGRVRVKVIN
GDGVARVAPEYDDCCRIAAERGMPLMEVYRIVERECGQ
>GSU2092 conserved hypothetical protein
MALPSTVHRAVVQLSHVDRGIYETLQTTLARHPSETAERVVLRLLAFALW
YEPELTFTKGICAGDEPDLWCKGPDGRVTLWVEVGTPDPERLLKACRHAE
RVVLLACGPARFRWDDQHLARLTGVPNLSVLGIDHAVVAQLAAGLERNIS
WELTVTDGTLYLTTGGETLEAVLESLAGSAPAAG
>GSU0165 conserved hypothetical protein
MKTSVKTISKKLQHSLQKGITELSVAGYKSIGRLQTIDLKPLTILSGSNS
SGKSSIMQPLLLLKQTLEAGYDPGPLLLNGPNLKYNSASDILTQCKNNKL
NSFSVGIRVAKNELLTTSYKKQINKGFKIDEMTYGDKNEIHFNQSMSASD
TERIVPNEFKDLYKVLPKKYRPEIEWHITRNRCFLEAVPKSKDGSQMGPN
VSPSSIVANAIRNIIHLPGLRGNPARTYPITAVGRSFPGTFETYSASILA
DWQDSKNNKLRELCADLERLGLTWKVAASRINDTQVELKVGRLPHATRGG
AMDLVNIADVGFGVSQTLPVVVALHVAAPGQTVYLEQPEIHLHPRAQAVM
AEVITDAVNRGVKVVVETHSSIFLLTMQSLVAEEKLPSDVVSLHWFSRDD
DGLTIVTSASLDSSGSFGQWPEDFGAVNLEIESRYLDAFESKLGVDSSGC
K
>GSU1115 transcriptional regulator, putative
MADAVHAGKGDASVGTLLREAREARSLSLDEAARVTRLGKNYLVALESDE
FDKLPNLAYARGFIRVYAGFLGLSADELLRRYDAVGDDGGHRSPVEDAMP
APQGKAADSISPRNRWSLPLVLLLLVVALALMLRLQDEEPSRPIETGQLT
AAAPEARQPATPAPQQQLSTARQPETSPPAPADDTVAEQQAVEGNAASSP
ARGVILKLKINKDSWLNITIDESVSQQYDLKAGDLIEWKGERVFALDVGN
AGGVEGEFNGKPLGVLGEEGKPAHLVLSADGGGD
>GSU3343 SpoVR-like family protein
MQLIDQHTKKIMEGCKERAREAGLRFSDETLEYIVTNRDMLELHPKVMIP
TLYDYWVHDVEVLKEKGKYELYPHNPYETVINTRPPISFYNDNNPDWLNV
MIFYHVLAHIDFFQNNLYYRHTWDYDLTGKALADKRLIARLRSEHGRRLD
YVIEFARSIDNLVGYYGELSALFRGETPPLPRRLDYYFDVFLQRVKKVKT
AGYVQEIERYNRCMRDFGDLGEETFFAEVMARYPEFEALYLKSRSEERRG
RPDLMQFIMEHSPVLNREEHRWMKSVLEVIRSTSVYFQPQMRTKIMNEGW
ASYWHEKLFLTDDRIRGHEVDFARVHSGVTCLHRVGLNPYAIGMRLFQHI
EEQADKGRISLEFQQLADIHARQEFDRGTGGGTAFIFAIRENLCDFSFIN
TFVDQDFVNRHKLFVAGRRLNKERMTWQYYVKSRTAEAYRAMLAESLYHP
PVIAVDEAKGEGKYLYLDHRFEGKPLVKDFIENTLMGIEYLWGGPVKLET
SEVGLPPPDEAVKPPERQQKIHWRRYVYTMEDRKLSRTLL
>GSU3172 conserved hypothetical protein
MSNEASVAPKERVNIVYRPADGEGREEVELPLKILVLGDFTGQPDERPVE
KRDPVPVDKENFSEVMKAQRIMLNIAVSNRLYDAPDEELPVKLKMESLRD
FGPEAIVEQVPEMKRLLELREALRALKAPLSNIPDFRRKIQELVTDDAAR
AKLLAEIGIEG
>GSU2976 DedA family protein
MHQAIDWLVATIGAMGYPGIFILMAMESSVFPVPSELVMPPAGYLAQQGE
MNIWLAIFLGTAGSLAGAYANYFAAHYLGRPLLLKYGKYVWITEEKFARV
ESFFHKHGEISTFIGRLLPVVRHLISLPAGLAGMHHGTFTLYTLLGAFIW
CTILAWIGYVIGENRDLIMEYSHQALIGVVIFSVALVAVYVWRHRRKK
>GSU0506 methylamine utilization protein MauE, putative
MDAVKRHLTALLRVALGVVFLYAAVIKIANPPAFAGNVAAYQLLPYAGNY
LVAAILPWIEAICGLLLVTGWRARSAAALVAVMNILFIVLLISTVARGLD
IDCGCFRQGGEKTSAWTAIFRDIMLLVAAVFVFRKTKQ
>GSU0369 FlhB domain protein, putative
MTTNDRDRKAVALSYREGHYAPQVVAKGYGVTAEAIIACAREAGVYVHQS
PELVRQLIQVDIDSCIPADLYRAVAELLAWLYWLEHAEGD
>GSU0463 membrane protein, putative
MFARGAAAEADPVPTIASPQGPEAAVAAPAAPPTACPRTLRFSFTGAARE
YFGIWIVNTLLKILTLGVYSAWAKVRKRRYLYGNTMLHGAPFDYLANPRV
LFRGWLIGVLAFLLYTLGTNYSPTLSFVIGALFFAAVPWLVVRSRLFNLR
NTSYRNLRFTFRADYRQAYLVYGLLSLLVPLTLGLLYPYAAYRRKRFLVE
NSAYGTTSFSFTATTRDFYLLYLKAVAGFVAIVIVAVPFLFLAGGAFLPA
GTGGPWRLAAVLPAFLIPLAYLYFVIYISTNETNLVWSGTQVAGARFTCS
LRARHMAWLYLSNAVAIFLTLGLMIPWATVRVLRYRMENLTVLATGDLEE
FAAAPTEEVTATGEEIGDIFGIDVAL
>GSU0728 conserved hypothetical protein
MFESAELDHAVDKKTWKARVPPLREALLDAQYDLLEARGFPVVILISGVD
GAGKGETVNILNEWLDPRHVETNAPGDASDEERERPPMWRFWRSLPPRGK
IGIFFGSWYSGAISDHMEGRSKQAKLDQALERIRRFERMLADEGALVLKF
WLHLSRDQQERRLKALEKNPRTRWRVTARDWKNFKVYERFRDLATHVLRA
TSTAEAPWTVVSGVDPRYRSLTVGNAILSALRARLATPENPHPPRTALPV
SPATDAVMLLRSLNLSRTITKKRYEEELEELQGRLSLLTREPRFRKRAVV
AVFEGSDAAGKGGAIRRVTQALDAKIYRVVPIAAPTEDELAQPYLWRFWR
TIPRLGRFAIFDRSWYGRVLVERVEGFCSRGDWMRAYSEINDFEEQLVES
RIVVAKFWLAISPEEQLRRFREREETGFKRFKITEDDWRNREKWGEYETA
VCDMIDRTSTEIAPWTLVEAENKQYARIKVLRTLCERIEAVL
>GSU3145 MOSC domain protein
MSARVVAVCISRNKGERKTPVTGVELRENHGVVGDAHAGDWHRQVSLLAK
ESIDKMRAMGLDVDSGDFAENITTEGIDLPALPVGARLTVGETLLEVTQI
GKECHTRCAIYYQAGDCVMPKEGIFARVLTGGAVRPGDVIVRVP
>GSU1580 ErfK/YbiS/YcfS/YnhG family protein
MLVSRKNSMGRQPLSFFPGPTMIRRLAVVIALIILAAIVVHEPEITAEPA
ASPLDPAKEDLSRVDYPSQRDLDWYPRFIRPNDSLESLFGDDWVYIARFN
RIDRRHTYPGMTIKVPRDMAVARAYTPLPKEYEPAKRYAKYILVSLTEQW
IGAYERGTLKFSMPAATGKKGNETPTGLFRIDARDRTHTSSLYQTDDNSA
QYPMDYAMRFFIDKQNVGYWIHARDLPGKPASHGCVGLFDEPMQNRMYGI
PARPVLHDSKKLYDWAVGEADHGPDSGTIELIDGGPVVEVIGENPVYQPA
PLRPMVATR
>GSU3024 hypothetical protein
MPRPCRASRRCSSLPPTTKGGWNCGKRSPKPCRKAERLAMSHFEKNIAAL
HRRNPPLAECLEAVTPGGDLHLGEARNGEPTARSGEVWLHSPYNPTREAQ
EWARAQAERFTPDAPLTVVGFGLGHHLRELATLGFGGTVIEKSLALLKAA
LEASDLVPVLERFELMAGIPADLIRRRHNHLLRGNTTAHPATVRISPDMA
TLAQYADGYDPAARGGLKILLVNPIYGGSLPAAHHAARALRALGHQVIPF
ESERLAAAMDFGKEFIFNQSRRAFHAGLTSLLSQAVELRARETRPDMILA
LAQAPLLPETMKRIEARGIPTAFWFVEDYRVLPYWRDSAPAASYFFGIQT
DNFAAELARAGVTRYAYLPTCAEPSVHRPMELTPAEREEFGSPLSFVGAG
YYNRQIFFKGLTDFPFRIWGSDWPFPSPLVPFIQRGGARIDTETTVKIFN
ASAVNLNLHSSTTCDGVVPDGDFVNPRTFELAACGAFQLVDRRALLPELF
DEGELETFGCLDEAREKISRFLADPEERRQVAGRGMARVLAEHTYEHRMA
ELVALMTGTFPHLAERIRQRADQRDEIMADLDRHPGLAGLLATLPDQRWF
TLNDVLGTIVTGQGKLSRAEKLFLMLQNVEVLWEKIPE
>GSU0478 conserved hypothetical protein
MNEHGKEMLDAIMRAMEIEKETFDFYTRAEHKTFNPEGKRIFRWLARTEE
QHYLKLNELYQSLHEGGRWVFYGGSTVSLDPAGPGEKQVAFDTDDRQALE
IAMEIEKKGIAHFEELMEKATDPQGKSMLRALRDEEAEHLRIVTEKYNAL
QR
>GSU0221 cytochrome c oxidase, subunit IV
MTSDSTTHRPVGYGTLASVWAALLILTAATVFVTRLDLGGYKVAAALTIA
SVKGGLVIAVFMHMKYEGWLLRWLLFLALVTLALFIGITFFDVLYR
>GSU2554 conserved hypothetical protein
MSKAAMLTMGIGFAGQALFFMRFFVQWIHTERRKESVIPEAFWYFSIIGG
LFLLVYAVIKRDPVFIVGQSTGTVIYLRNLYFIRKNKRKDVIDALES
>GSU0917 conserved hypothetical protein
MPRAVIRSADERNSCVIKWLFLTVGVIATGLGVIGIFLPLVPTTPFLLLA
AACFARSSDRFHRWLVEHAHLGPMVRGYLEGTGIPRRAKTVAIVMVWLTV
PPSAFLLVPMPWVRALLLVIATGVTIYLVRLPTAPADNH
>GSU2460 ribonuclease BN, putative
MGNFMDDQCLLHASALTYSTLLSIVPFFALAFAVLKGLGVQNTLEPFILD
QVAAGSHEIVDRIVTYINNTKVGSLGAVGLVTLVVSTVTLLGNIEETFNS
IWGVRETRSLYRRFSDYLSVVVFGPILIFAAISVTTTLESQKAVQWLIGT
AYLGDVLVAAFRLVPYVSIWLALVFLYMLIPNTKVRFRSALVGGVIAGTL
WQAAQWGYIHFQVGVAKYNAIYGTLAVLPVFMIWLYASWSIVLFGVEVVY
AHQHRRTFRHETHIPDLEPAARERLALALLLEVCRTFFRDGAPWNAERLA
ERLDVPERTARETLGMLVAKGWLAESCGEETLYLPARELEHMRVRELIAS
LRGHAMPLATGRFDPAVEWLAARMEGAVAAELGEVNFRTLAEGGGVD
>GSU1083 conserved hypothetical protein
MRTEGPVAINSMNTVFLEVVEWFDDSGREMVRRIPPEGSAEIKLGAQLVV
RESQRAVFFRDGKAADCFGPGRHTLTSANLPILTKLLSLPWGGTSPFRCE
VCFVGIQTFTDLRWGTKEPVAFRDSRFGMVRLRAFGTYTLRVVDPQLLVN
ALVGTRGLYTSSELEELFRDIIVARLNDYLGETIDSVLDLPARYDETSAA
LKERLAGDFGGFGIELAELYVNAITPPPEVQKAIDERTSMEAAGDVDRYL
KFKAARSLEAAASAEGGGEAAQGMGIGVGAGLGMILPGMVANAMAQGADA
SAPVGTGSCPRCMAPLVAGGNFCHQCGAPVESGFCSGCGKPLPTEARFCP
GCGRQAGA
>GSU2323 membrane protein, putative
MTAILLTTAGFVLASMTGWFVASRLKGRNDIADVAWGLGFILAAAVSLVA
GGHYAPRGLLVSLLVLVWGVRLALHIHTRNRGKGEDPRYRQWREEWGRWF
VLRSFLQVFMLQGVLLVLVAVPVIFVNGAPPTPLGWLDSLGFFIWLTGFL
FEAVGDRQLLHFIRNPENKGQLMTGGLWRYTRHPNYFGEVTLWWGIWLIA
LAVPGGWWTVIGPLAITVLILKVSGIPMLEKHYEGRPDFEEYKRRTSAFF
PLPPRG
>GSU3040 conserved hypothetical protein
MNVTTTRFGEIAVEEAKIITLPDGMLGFSEKRFVLLTPQNITPFCWLQSV
ENPELAFVVVDTKECASDYAVKLTAEESEKLCVNDGDEVVLLAVVTMASD
PFNITVNLQGPIALNPKRMLAKQIVLEGSRYTTKHPFFDQAARSKAPGKR
NASGEVTAA
>GSU3219 fibronectin type III domain protein
MKLRRCIASLATVGMLAMLLVQAEPRQVTGSGYSAETGKRQRPAPKESGS
ADKAIRKRWLVQFNGPVRPEQRRQLEALGCRIGDYMPTNAFVALMDDKAA
KRVALLSFVEDITRFAPADKLVGTARKDLTAAPTSEIRIRKVLRVDDPAD
RAAVIAATLRGNGRILNVGARTITVEVPEELLAPLAQQEETAWIGEVGEL
RLHNSDAAWVVQTNEVDNRTIWEKGITGAGQIVGIADSGVDYDMPWFADP
NGALPGPGHRKIVGYDATLGDNHDVADGHGTHIAGTICGDRGPGMPGNGI
APGARIHVQDLVGTDGTLTGSLELETVLKKAYDSGARIFNGSWGVDSGNY
DALAAALDDFSWRHKDFLAVFANGNGGPAEQTATSPAIAKNATSVVATGN
GTDAATVSAESSVGQAPDGRANPSVGAPGQGVVSARSDGLLGSGNSGTMA
MSGTSVAAAVTSGAAALIRQYFTDGFFPTGSPVATNKLQPSAALLKAVLV
NSAEALLSDDPGDSCPSKRGGWGRPKLINTLFFNGDSHSLEVVDGGTGLE
TDGVWQRLYFSPGGRRLKITLAWTDAPAAPGATSPLTNDLNLVVVAPDGT
TYLGNDLNCSHGDYESRTGGFSDRVNVEEQVVIKRPVAGTYLVKVIGASI
PVGPQPFALVMTGVTGVTSDGRIALTNSTNGTLEAPGQVSVMVTDRDINR
DASAIETMTVDLLGETESNPEQVVLTETGPNTGTFTGTCRIALGGTAIHD
NGALEARHGETISARYTDEINLSGYPRLVSVSARIVDSVPPTISAVGVGA
QLSETSATVAWTTDEPADSKVSYGSDGSLGLSVIDGAFVSNHLLALSGLS
EGQDYFFSVTSSDAAGNVSTDTNGGNNYTFRTASLPPSLEVFCSAENNET
YLPTVRVFGTATDPAGIDRVTVNGQPAVWRATDGYYEATASLVPGSNTIT
VIATDTLNNPAGQTLTVNRTLPPFDLLVTSVVSTTNLLPSSAIRVDGTVR
NEGTADAPSAEVAFYLSRGGAAAGIPLGSIPISPIPAGQSGAVSFTASLP
PEVVPGVYFIVATVDPADQVAEAREDNNSLTGNPVTVGRPDLVPLTVSNT
TLMSPGGTISTSLSVRNDGAASAPVSTVAAWLSVDTNLSGDDILIGTAPA
AALSPGASTTVGISGVLPPGIQSGTLNIIAVVDALGEVAEADENNNRATG
QPLTIGTAELSVTTVTMPASIVRGSTASATATVANTGHYAATGVRVGVYL
SSDTAITTSDMFLGSGVIASLEPGASAPVSIPVPITDIVAAGTWYVGAVV
DDLGMIAESDEGNNALAGNQVEILADGLDLTVQGVTAPASGTTGQPVTIT
ATVAATMPAAASAVQFFVSRDPVITSADTYLATKAVGSFGAAGAQTVTAT
VTLPTTLTSDTWYLGAIADAYGVITEINETNNASAGRAITVNGPELVVES
LTSASDTAYTAGTVSLASTIRSMAGAAPTHRVEFYLSTDPAITTSDIYLG
YRTASLPAGGSSTATTILTIPRYLTGGDYYIGAIADPGNVIAEANENDNS
LGIPLHIIGPDLQVDGLSLPGSALSGVPLTISSRAFSTQGGSGSFTVDFY
LSSDQTITTGDVYLGRRTVSSLAVAGASTATATVTIPNYVFTGRYYVGAI
VDPYNYVKEETETNNSTGTDQAVAVEVTGAELALASLSAPASAKPGETIA
VVNALATTAGSAPSSYMEFYLSTDSIITAADRYLGGRTVSALPAGGANNA
ATGLKVPADILPGTYYLGAVSDPYNTVREANEADNTRTVQLTVTGRDLTV
EALSGPAAALAGATIGVANAVKSAGGAVPGFDVTFYLSRDAVITRSDAYL
GTRFVSGLDIDGANTVTTTLKLPNDLEGGRYYLGAIVDGGNLIPETDESN
NASAAAPIDLVGADLAVSALTAPATASAGETISAQVIVTTRAGGSPYSLV
NYYLSTDETVSPDDIYLGVSTIPSLGPGGGATVGKSVKLPADMEPATYYL
IAVADPANAVAEADETNNTSQPRAIAVTVP
>GSU0040 conserved hypothetical protein
MCGRFTLTLPPDLLAEIIGEIEAARVQPRYNIAPAQEVAVVRQDAGGRRH
LDYLRWGLIPPWAKDASVGNHMINARSETVAEKPAFRHAFRSRRCLVLAS
GFYEWKAEGNRKQPLYIHMKDGGPMVFAGLWESWKSPEGAIVESCTILTT
YSNSLIRPLHDRMPVILGRSDWDIWLSREATSEELTPLFQPYPSDLLAMY
PVGTGVNSPRNDSPDLLEPLNEP
>GSU1014 smr domain protein
MKKKSTQSGQKTKKFSPSPFSGLKGFRPEPAEPEKKQPAVAAPPPRSTAE
PDDLNLFLREMAGVNRMDRSGKGPSEQRPPKEPAGQARTADDDLVKGLEA
ADQAAFAEAIARLKLDVSFRESLPGDSGAPRPVSRLRQLKSGQIRIDLEL
DLHGLTREEAVASLERFITGAHRRGQKAVLVITGKGNNSSGEPVLQGAVL
SWLRERGKSMVAEFAPAPRDMGGSGAVVVFLRTTRVKAG
>GSU1562 conserved hypothetical protein TIGR00106
MNVMVDLCIVPIGVGVSLSPYVAACQKVLDEAGLKTSLHSYGTNIEGEWD
AVFAAVRRCHEVVHGMGAPRITTTIKLGTRTDRVQTMEDKVRSVQEKMG
>GSU0866 YGGT family protein
MFVVANFLLAIAKVADILLTIYLYIIIARAIISWVNPDPYNPIVNFLYRS
TEPVLSRVRRILPDLGGLDLSPILVLVAIYFIQSFVIRTIYDFAFRMKLG
TGAM
>GSU3318 conserved hypothetical protein
MKPHRIAVLFAVLSLAVPCAAAPVVSTEADQRGVSLTIYNQNLGLVKDRR
EIRLPKGSGELRFMDVAAQVIPSSVSMAAPEGDGIRVLEQNYEYDLLSPQ
KLMDKYVGREVKLYQKNPTTEREEVVAATLLSTTGGPVFRIGDEITFGHP
GRIIFPGVPDDLIARPTLVWLLESGREAARQVEATYLTNGISWRADYVVT
LAEKDDRADLAGWVTIDNRSGATYRDAALKLVAGDINRVREDEGRAKMMR
AEAMAAAPAFREESFFEYHLYTLQCPSTIKDNQTKQISLVTAAEIPVRKE
FLLRGESYFFHGPAGEPRKEKVAVYTEFGNRRESGLGMPLPRGVARVYKR
DGDGSLQFVGEDSIDHTPEKETVRVKLGDAFDVVAERRQTEWRKTASDTY
EAAFEIALRNHKAENITVRVVEPVPGDWRILTSSLKPTGGDARAAEFLVP
VPKDGEAKLTYRVRMRY
>GSU3128 hypothetical protein
MQITGWNRIVPMLGHEVHGKGAMAMKIRKKILDFEYEEVLDARTRELIRV
GCAVAVGCPT
>GSU1339 hypothetical protein
MQTGTVKQMMKLGVIALAAVLVVAGAVTAFSLPGFGKFEKAKVVKGAVSI
PLADLQDGKARFYRHSEGGRDIAFFAVKAPNGSYRAAFDACEVCFQEKKG
YVQDGGFMTCRQCNKKFATDRIGEGPGGGCNPAAIPARQAGGNLVINVAD
IRTGARFF
>GSU1431 conserved hypothetical protein
MEQHHVRSLLKPSAYPDPTGSVELVQTHVSYIFLTDNFAYKVKKPVDFGF
LNFSTPDRRRFYCEEEVRLNRRLCPDIYLGVAEVRDTPAGAAFVGEGQVI
DHAVKMKRLPADRMLDRLLSAGLAGEPEIRAVARVVGTFHLEAEHSSEID
AFGDLATINANWVENFQQTRPYCGITLSAGDHDFIRGWVERFMAERVDLF
AARVGDGRIRDGNGDIHMENICLGADGRVCIFDCIEFNNRFRYGDTAADI
AFLLMDFDFVLRPDLGAAFLDEYTRITNDASVVEMIDFYKVYRAFVRGKV
ESMRLSDPQIPENEKQAAAARASRYFRLARGYAVRQGLPPTLFITCGLTG
TGKSRLSRELSLDLGLEIASSDVVRKELAGLRSTERPDQDYGAGIYTDRF
SLLTYGELERRARVALEGGRSIVVDATFRRRADRERFGALARDAGATFVI
LHAVCPEPTIRSRLNSRQKDRSEPSDGTWEVYLRQKDEFDPPSDQEGNLL
CIDTSDTASAMVDKVLHGLELLPCGKS
>GSU0863 conserved hypothetical protein
MPLYEYRCDACDKQFELRQKFSDAPASECPSCGGPVHKLISQSGFALKGG
GWYAQGYSSGGAGKSEAPACPSGGTCAGCPSAA
>GSU2353 conserved hypothetical protein
MAEKQYDWAAIARNPKFVELHRKKTTFLVGWWVFSTVFYFLLPIGAAYAP
GLFKIKILGRINIGYLFALSQFFVSWGIAMYYAHVANKDFDRLTRELVDE
LK
>GSU2333 membrane protein, putative
MAKNRFLLIIAAVFVVLPAALSLITFYTDWLFFRETGYTQVFTTALAAKV
GAGLASGLFMFAFAMVNLYFANRASLPHTPRGVFFEGGNVYRLQRDEMVQ
MVKPLSILAALVLSLLAGRWGALQWQNLLLFTNGVTVGTSDPIMGKDLGF
YLFSLPLLEHVKGFVAFTVLVTGIMVGAVYFFRGGIILSDRGADVDGAVR
RHLAILLGIFSLTLATGFYLDAVRLLLAGGNSFHGAGYVDVNARLPLYRI
LTLATPLAGAVVAFGLWKGAWRLTLIPPIIVAAVYGIGIVGYPAMLQKFK
VAPNELALETPYIANSIRFTRLGYDLDKIKTVPFDVELNLSAADIAKNDA
TIRNIRLWDHGPLLKTYSQLQQIRTYYKFFDVDNDRYLVNGQYTQVMLSP
RELSYNDLPSRNWINERLIFTHGNGLAVGPVSRISREGLPEFFIKDIPAV
SLADIRVTRPEIYYGELSNDYVIVGTKVPEFSYPTATGNINTTYGGKGGV
ALDSMLRKALFAARFKTEKILLSSDITDQSRILYYRTVGERVKTVAPFIR
FDGDPYLVVADNGTLKWIIDGYTHSSRLPYSKPLRGGINYMRNSVKAVVD
AYDGTLDFYISDPDDVMIKVYARIFPGLFKPLSAMSADLRGHIRYPHQFL
QVQAAMFATYHMTDPKVFYNRENLWEIPVLGEAPMEPYYTVMKLPGEARE
EYILLLPFTPSKRDNLAAWLTARCDGENYGKLLAYTFPRDRLIYGPKQID
ARINQDSHISQQLTLWSQRGSQVIRGSMLVIPIEQSLLYVQPLFLAAADK
AGLPELRRVIVAYGDEVVMEESLELALQRIFGGKRAPVAGVAAAPEDGKA
STGDLAREAMSIFERATNLQRQGDWAGYGEELRKLQQVLKQLAR
>GSU3467 conserved hypothetical protein TIGR00278
MLKGILYIIGIYQRYLSPLKGPTCRFYPSCSRYAHESLTRYGLVKGLWLT
TIRILKCHPFHPGGVDPVK
>GSU1275 conserved hypothetical protein
MRTIILLALSNVFMTFAWYAHLKNLKAAPWYIAVVVSWGIAFFEYLIQVP
ANRMGYGTFSLGQLKIMQEVITLAVFVPFAVYYMGQPLKLDYLWAGCCLA
GAVFFIFRA
>GSU3165 conserved domain protein
MLNGTAVRPCPVMPIDAGFDPAFGRRTAAAGREVYPSMHLTDCFTDVMAY
LNHSLRTIGVRQPPYGEIRCEIERFMATADAAARSEGLDGEEFDLARFAV
CAWIDEAILSSSWEEKNTWLKEQLQRIHYATTDAGEEFFTRLTALGLNQR
EVREVFYLCLAFGFTGKYCKPGDEYHLEQLKTAQLKLLMGSSVGLPSLER
TELFPEAWPSGPAPATVRPGRGQGSRAFLVAVAAVPVALFLILLLAYRFT
LTGVGDTFLRTVPY
>GSU1568 conserved hypothetical protein
MKCPVCTTVNLVMSERQGIEIDYCPECRGVWLDRGELDKIIERTQTKETA
ATHSPQPAAQPAYAPQQTGHAPAGYGYSHGHGHKPYRKKSFLEELFD
>GSU0430 tail lysozyme, putative
MREERLLERIRSLERDPSRRGGTDRGRLVDSILAHVRRIINTRRGSVPIA
PDFGIPDMLDVLQSYPDSVREIERSIRAAIQGFEPRLADVRVAFVPQEDD
VLALRFAISARLGSDGGAVCFETLVDTDGKVTVRR
>GSU1308 conserved hypothetical protein
MMKIRFCEHNKGKNKVVKRLHEQFPKLDVRIKDCVGKCGPCHKTPFALVD
GKTICGIDSEDLYGKIVKEMGK
>GSU2552 lipoprotein, putative
MNLLRHAVAGLCLVASAGCSGQDVMVRKQSEMETRLEHLHQASTTSGVRL
AELSAEVTALRERLAAQGAELEQIKAGQRELQANITERLAQVAPAAGTPS
RIEVVSRDGAPREKDGPPDAYLKAFGLYSANNFAGAVEAFQAFLAEHPDS
EYAGNALYWIGECHYSRSDLPRALDAFRLVAERYPASTKVPDALLKSGYT
LYAMKEPERAREILESLAAKYPRSPAAAKARERLAVATPKKQ
>GSU2404 pentapeptide repeat domain protein
MFNKLATSVAFGLLSIATAHAFDPLVIERAKSLGECEHCDFVSADLKGVD
LKGIKLDESNFTGADLSAAAIDDCGACNFTGANLTGAQMDGASLDEAIFD
TADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNANFS
GAKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLLR
DAKLKGADLRQSRFHSVSIYDTATNRLGESFDPVRCADLQEAGALFDATT
KCGK
>GSU1074 conserved hypothetical protein TIGR01033
MSGHNKWSTIRHKKGAADAKRGKVFTKLIKEITVAAKIGGGDPNGNPRLR
AAVDKAKAENMPKDNIERAIKKGTGELEGVSYEEIIYEGYGPGGVAVLVE
CMTDNRNRTVGEVRSTFTKCNGNMGEAGCVSWMFDKKGLIVLSKDVDFEK
LFETALEAGADDVADEEEQYEVLTDPAAFIDVREALEKAGFPFESAEITM
IPQTMVKLDGKNAENMLKLMDRLEDNDDVQNVYANFDISTEEMEKLM
>GSU2560 hypothetical protein
MRISGHTPLWIAARALLAGRVDDFFDRWRRAAETFDPEDIHDLRVSSRRL
REGFLLFAAAYLPESVRRAARGVRSVTRLLGALRNTDEALVFFRHMAETA
DGSAGRHLARIIARHEDLRREQRRQLRLRLRELDPRRTREALARIVACPT
LFSPPPSGIDPFTTIAVFAAERLRERLDDVLALVPDACREADVEAQHRLR
IAVKHYRYRLEILSFLPGNDFDSLHAAIKAYQDVLGTMHDLDVFAGIVRH
EGLPSADECVILNHIAGERHRQFALFAGLLAETPFEAIGEQVRRAL
>GSU2167 CHC2 zinc finger domain protein
MTLNTTLQAARDMGVPERYAVMRRLAFLQPQIKHLEREIWGARRRTDRSR
DLLIKALSESLIRDQEKELRPMKREATALLNHVNGKETVQAPGGITQEMI
DQARQYPITSIIEFSKGRYRCCPFHEDRNPSMALYENHVHCFVCNRTWDS
ISATMALDGVTFREAVLALQS
>GSU0171 YaiI/YqxD family protein
MKIWIDADACPRAVKEILFRASTRLRVPLCLVANRSLAKHAGPLVESVVV
ADGFDVADDYIAEHAAPTDLVVTADVPLAARIVAKGGVALDPRGELYSEE
SIGERLAMRDLLSELRDTGMIQGGGPAPFSMSDRNRFASALDSLLHRMLR
R
>GSU3173 conserved hypothetical protein
MSSEMNGTAPFQDQGATDVSLLEDIVRVTKLQPADEAYSITRRGVAALMA
QLLEPGAEARKVSKAVLDDMIAEIDRKLSLQMDEILHHPQFQALESSWRS
LHFLVDRTDFRENNRIEFLNATKEELLDDFEDAPEVAKSGLYKTVYSAEY
GQFGGKPFGAIIGNYDFGPGAQDIKLLQSLAAVAGMAHAPFMAAASPQFF
GCDDFTALPNLKDISSILEGPQYAKWQAFRESEDARYVGLALPRFLLRLP
YGEATRPAKCFNYEEQVSDSHDRYCWGNAAFAFATRLTESFANFRWCANI
IGPQGGGTVHDLPLHQYQAMGAIQTKIPTEVLISERREFELAEGGFIALT
MRKGSDNAAFFSANSVQKPKFFGSSKEGKEAELNYKLGLQLPYMFIVSRL
AHYLKVIQREHIGTWKERGDLENELNLWIRQYVSEMDNPMPGVRSRRPLR
QAEVTVEEVPGEPGWYRVGLKVTPHFKYMGAYFTLSLVGKLDKE
>GSU2178 hypothetical protein
MKVEDIEELLNTRIIEAFLRGFSVVEITRALRKTTATSVYNLLRDTGRIK
SMARSEYRRQYDIDPRLATACRQKGFSFGRWCLSWRLDPHMAVAELKSAP
DTEEESAAHVALKRDFPDIYLRLYNGAKIKSEKKGKYRSKPASLMIEWSS
ERKAFLATVPECPGIEASGKNWDDVFSAIKSVHLMHEHVKRIDWLLSRSA
DEFCPASGAE
>GSU2239 conserved hypothetical protein TIGR00255
MIKSMTGYGKSVVETDTGRTIVEIRSVNHRYGEVYVKMPRTLLAFENDVR
KAVGDRLKRGKIEVFVQREEAVGGENLPNVNVPLAKAYRDAFEQLKRELD
LADPVTLPLILSQRDVLSAREEDGNEDALRGELLGAVRGAVEAMETMRLR
EGEALLADLTARRRTLSDIIERIALRAPAMVAEYAARLRERLTQLLSGTT
LDETRFAQEVALMADRSDITEELVRFRSHLVQFDDTLKLSEPVGRKLDFL
MQEFNREVNTIGSKAGDADTAALVVELKAELEKIREQVQNIE
>GSU0236 conserved hypothetical protein
MPTPPAFPLTVFYDGACSVCAAEMAVYRRKNHGGRLVFVDISDPAFDPSP
WGITLEAFMAQMHAIDRDGRAYRGVEGFWAIWQAFPASTFYGLLGTLVML
PGFNLLARLGYRGFARIRRYLPKRASRCDGGTCRMDRH
>GSU2906 conserved hypothetical protein
MKSQVLDVQTLKTFNDEKRHQETIWSDDHARVSLICMKPGQEIVTHTHHG
SHIWMVMEGTGQFQSGGKTQSITTGQIVIVPAFEDHGIRNASQENLVIAS
ITAQGD
>GSU1269 hypothetical protein
MKNTDRSIITFSFIALLLLLAAAVCNAADGTLTGKVANSASTAAIAGATV
SATGTVGTRSAVTDSSGGYRLALPGGTYAVTCTAQGFKAFSKTGITITEG
RSTTLNIALAPLAGAAVENLGAMPRNLVERTASSITLTASVSGTPTSYSW
TQVKGPRVPLSPASARSATADVSSLNVAVDTELVFRLTVSGENGVPASRD
VSAFIEPADMEPVLGPDVQVGGSTTAVQKYAVNGVEWSLFNIGNKLCATP
IGMTKGPVYSIHLPGFVNDIDIVAHNGLTYALISAGSEGIIVVDVTDPSA
MTRTATARINYYRGGLSFTEGGGSILTGQEVSGVKGAVAALVTDGVTLYI
ADNDFGIHRTALTNLLASGGPVLEPDGTLLIDHEVFTLQYAGENPWGRPV
DLKLHGGKLFALLKELGLGIFDPVTLEQVGRYNLYTDTMMKEDWFADMDV
RQTVAKDPATGEPFVDSFTGMPDFRQTSFEILQVMKKDVAASTPWADFDR
YGKFYYKAQGVDFATFNGRTIAYIAYSLGGLVAVDITGFETATPATFLNG
RYLGYIPAVPANGPKEPTGTRSRSILPYYGAGMLKESGIVDVKIRGTRAY
LTDHFAGLMIIDGADIPDQHWKQAGGPFNNDTNGIPGDHWPDTEFVTSFD
MSPYDPLDNESMPKWMYQAPCLLVTAEINGHGNRLLLMDTMATDAAGNID
LLECAGAGGFNFIDLINLRAPAMSDRYAIPVYIPTTDEIGAKADSTAGQT
ISIGHSAGISASDRYVYVADGPHGVSAWRITDDAGYPTDDIHLVANTLAD
EYPEVVNGVKIYPASHASNVVFDPVNHVAWSGSSSLGLRRVKVAGVEADL
GRVGAPLLLPLSLSDCFEHNAEWGTVKPVQYQDHAYDVELRGNYAFTADG
SNGITVYDVTKDPSTAASGFLVANIGAGKERPPLGTASGIALWTDSATGK
SYAFVAAGPRGVGVVDITDVKNMSLVKVFEPIKLEDGKVGAADGQAVDVK
VAGSHAFFSYDSFGVVCYRIADLIAPLPSGISPTDVWKKSQTGRLVYDYR
PVAASRFKLQLVPGYEDWAGGAVKMTFTQVSGKLIFYVAFAEAGLLKIDW
TDPTAPVLKDVAPTVNECTDVTIANGRLFAADGSGGLVFFK
>GSU3179 conserved domain protein
MYSLLDEIADATAAETAKAVPNEVIVDGTRITFDAREEIELKCGKASIVL
TSAGKVIIKGAYLLSQSSGENRIKGASVGIN
>GSU1253 hypothetical protein
MSALIDTWFLGCRAAATAVLYVLFAVVAGGAVSAARAATVAASAPAPVTV
GSGKKAVRQGISLELLVKPTHAEDGFSAAVVEGAYADVSLKMTDAATGHP
VTVMQPRVWIDLRKGTGDFREKQMACSDKVRTYLQGTLSFRPDIDLNSYF
VLTLNNDASISVIDPIIGVAGYSQLYAMILLKRPGEDWVFSFDGKRLYVT
MPKAGEVAEIDTDSLAVVRNIPAGAMPVRMAFQPDSRYLWVGNDSSDPAR
SGVTVIDPFSGTQAAFIQTGAGHHEIAFADDSRYAFVTNQQSGTLTIIDI
QSLAPVKTLRLGDGPVSVAYSALSKCAHVVCEGDGNLLVVDGLSHEIISR
KTFEPGFREVRFAPGDRWYFTVNGKNGQILVFDAAGNELLHSTVVAGRPE
HVSFSTEFAYIRSLGTADVTLIPLAALGKAAILTPLVVSGGSRPPADSPL
LPPTADPIVVTPEGNSVLIASPSDGAVVYYMEGMGVPMGSFKTYGRLPRA
VNVINRQLRQNVPGVFSSRVKIPLAGTYDVAVLLDSPRIVQCFEFSADAN
PALQRLKTERPVMVEFVGSALRVEAGRESVVRFRVTDPVTHEPVADLRDL
IVVTSLVPTGAWRQQLVSRPVGNGIYEVTFTARKVGIYSFNFAIPSLKVK
LYQLAGMMIHAVETPAEAGSDVPAGQKKD
>GSU1314 membrane protein, putative
MNAQRIIVISFCVALVALFFILDLGRLLTFASLKANHGALLAFYGEHRTL
TVAVFLAIYIIQTALSLPGATILSLAAGALFGAVAGTAWAVTGATIGATL
AFLLTRYLFHDAVQRRFGPRLEGINRELEKAGLNYLLFLRLVPLFPFFLI
NLGAGLTRLPLRTFVLGTFVGIIPGGFVYVNAGASLAAIASPADIASPRV
IGSFALLGLFSLVPVLYKKITAQRRT
>GSU1889 conserved hypothetical protein
MKRTIISALFLLTVTAMAVAGPLPASRGGQPITVKSNELSADSRNRTATF
SGKVTARQGDLTIYSDRLIVHYRDDGGDVEKVEAVGNVKIVQGNRLATAK
EGVYYNTEQKIVLSGDPKVYQGENMISGKVITYFVNEERSVVTGGGDSRV
EAVIHPTDKGKNGGTKR
>GSU1369 conserved hypothetical protein
MKDIPGESRPNATVADAGRSIWSSIAENGGYDPPQSPLFHRGEANDESRG
VVSMKTYTKALIPRARDMRSNMTEPEKRMWYQCLKHLPLRFRRQRPVGPY
IVDFYCAELKLVIEIDGASHATDEGIVYDAGRTAFLEGLGLRVIRFGNHE
VMNNIEGVFESLQKEVRPPSIPPIP
>GSU3293 conserved hypothetical protein
MEFATLEDIIRFAVQREETAYRLYKTAAEKATSIAARKMFEEMANEEAGH
KHAFENLNIEGAEHYTFAERPDMKLAEYMVDLPFREDMDYPEILRYAMKT
EESAYKLYMAASEMTDDPKLKRMLMVLADVEKGHKLKVEALYDEKVLTEM
>GSU3131 hypothetical protein
MPHSDQMETPQRRGGRRMAMAVVAVLLLCLLILILVKLYLASPLATRHLS
RLLSRTLGCPVTITALHTSGMGIAVRGIAIGNPAGFPAGVLAAADAVTVV
PRWGALIRGERTLHRIILDGVRLTLVRDRSGTWNLARLATGMKGKKPAPD
TRIGEVVVRRGSLAVEGHRMEGIDLNLRDLSTQGTTDSRLELAFADAAGN
RFSLAGRGRMGARPAFDLTLSAPSITLASFSQRSSGRALDLAGAHGSLAV
RAQLREGMIRIDGGGAVQNGVLMARGQRIPLGGNLQAAVSYDMQADRAVL
DRLDLVLSDLVAARLTGTMEQVRRKGRFDARLTLDEVDLARLGRFLPASG
TRIVTAGMLRTRGIRVVGDRARGVTSVDGAVTLSGGTIAMGARVVARDVD
GTVSVSPAARGWRLSGQLLSPRTAGDVMVDGLRIPFEAEVSSHFKPTEAR
VAGMSCRVMGIPLEGGGRFRPTAAQPLQLSLRVPSSSLMLAATRLEPYGL
RPERGTAALSLDLAGGVRGPFVGELAAAVNGAALSIKGKPATLGKGDLRA
RFAWRQGAASAAGTLLLDDASYGGRQGELAAPFRLDGRSLMLDAPRFRFG
PSAGTARLVVVRLPDAPAGTVRPFTLEVTEGAVTHGTGGAGGINASIRGT
YHGGTPGWPEVEGSGAAAHLTLQGKPIGAPSVRFTLGREKGEAVVGGHLL
GGALSARIDFPARSPLEDLRFSGSLRQAKLAAIAPLMPATGRIPAVAGGE
ATISVDGGFAGGAGLHCGLEARASGIALDGAGGKRLLSGAEARVAARVAG
ERLDLREAVAVLGEGASVRIAGSMDRFRSPERTGTFSVTLPRTSLTSLVD
PLINVLPRPVQEATLEGDVAVAAEVALAGKAGMVDGTATLSGVRLDFPSQ
KLNVSGIGGTIPFSLRTAAEAAKRPRQELSFSRENFARLQEMLRQQGPGD
HTLTVGKIRFGPLELGETTLGLRAANGTLEISSLRSGLYEGQLLGRGFVT
ARGGIVYGGDILVYDLSLARLCDAVPKIRGYVSGRVDGLLSLHGQGAGMD
GLTGFTDLWARESTRERMLLSREFLQRLAGKKLRGFFFRDDRPYDRGEVS
AYLENGYLTFTTLDLAHTNIFGIKDLSVSVAPVQNRIALTHLFGAIKEAA
TRGKAVAGEPAPAEKPVEMEFQWRD
>GSU0617 NHL repeat domain protein
MNVLKLIRRFVGAAALCALCAGCAGQQVREERRYFWPPLPERPRIEWLGA
YSSQNDFPKQGFASFMAAIAGEEQAMSLTKPLDVYADGQDRIYVADPGLR
GVVVFNMKERSVSMLGGPQAANQFNTPVSVTGDSQGNIYVSDAEKGGILI
FDRFEVPRRFIDTKAAVKRNTDIAVDEKGQRILVVDAREHRIAILDMQGG
LLSAFGKRGIEDGEFNFPVAVAINHKGEIIVGDAMNARVQIFDQDGKFLR
KFGRRGDGPADFQIMKGVAVDSEDHIYVTEGKGHKLIIFGTNGEYLLTVG
GLYSAITTGKQAPGGFVIPQGVFIDDKDVIYVVDQLNRRFQVFQYISDDF
LKRNPIPGWQE
>GSU2346 membrane protein, putative
MPNAYIRYRIKRLREYIAERTAGIDHRAIIKNAVAGAELSGSYVSLLLLA
SLIALLGLLTNSVAVVIGAMLISPLMGPILSFGLAFTIGDLALARRGLRV
IAVSVGLTIALTAFVTLLSPLKEPTTEILSRVRPNVYDLFVAVLAGTAGA
IALCTKKNYMITATGVAVATAVIPPLSVVGFGLGSGHPMLGLGGFLLFFT
NFVAIVLTADLVFFLFNFRSSMVSEETYPARKRLLILGTVLALLSIPLVH
TLVTDLRKVKLTKRVERVLKLHLEKEAHSRVTGFSVQEKDGRVGVNVTVN
TVRPFEKKAEEEIEKEIGSFLGRNVELNLEQVVVTAGSIEPRAELSAVSP
VGQPASARDESAAAVQASASRLLRDVRTELEALTSPFAVEDIGLGFGEKP
GPARVSVVLRRDYPLTDDERLLLARLLERRLQVPVSLRVELAPFLPRVIF
DEKGDLSQESVAAMALIKNLPGGPGAFSFRVEAGGKDSRRHADRLKRYLQ
EELGVPAANVTAAVRPGTGASVLVVRR
>GSU1571 conserved domain protein
MRIAIEEYSSAWERAFLAERKRLLGTALPVDAHVEHIGSTAVTGLAAKPV
IDIMIGLLREKELDVLVGPVRSLGYEYLPEYETVMPFRRFFRRIGDAGVS
FHLHAVVKETAFWQDHLLFRDLLRGDDALRTAYEALKKQLSEQEWPSGDQ
YAAAKFGFIKEALSRGRIAGAGATGRVVYQVSSVPLAPGAWLRPYYLQRY
GAMINAAIAALSTGEKALLAYCQKEAWLLLEMPQEIPSLKPEQLKMMIVL
EALFEQVRHEEAPTLPSRLASIFTWPVPGVAQRFRDAYFPGGVIHRCVIR
SGSALEFDGALLPPGINLDQAGPLAMAEEVRRVRQRAERYWRRAHEPEFP
ELLVQGTVEVVGREG
>GSU2348 conserved hypothetical protein
MKSALVLAIALAALMVTFSLQNSQTVQVRFLGWYFEGALVIVLLMSFAIG
VLTMYLASLPARFAHRRQLAEYRHQLDTCNRSLDRLRSEAADEKPPQA
>GSU1464 LysM domain protein
MKVRAGISALVTIAVLAGCASHLPVYRIDALARLSAVKMMGAEKYAPPEL
ASLQQAIVDGDALMQSGEHELADSYYLLAIRKSEAVERSIAREREHERAM
QVRSENERLARERQKALEELKRELAEKEKEEKKRIERAPQPVRDKEKEKE
VRSFPLHHTVKRGESLPQIAAQPDVYGDAALWPLLYRANRDQIRDPGRIW
PGQVLRIPRNISRDDLAEARRYAQERPAP
>GSU0602 conserved hypothetical protein
MGLFARWRRKRLRRGGFPPQWLKIVERNVPFYGRLCVDDQQELLRHVQVF
VAEKSFEGCSGIQVTDEIKVTVAALACLLILHRPGDYYPYLSSIVIYPDE
YVGQRQRWDEGGVVTEGPEPRVGESWELGTVVLSWKDVVLDAQAPDDGFN
VVFHEFAHQLDHEDRLTDYDGFLPNEPEQSPWREILEREYRRLIDDDEAG
RWTFLDPYGAESPAEFFAVATESFFEMPGELKARHTGLYELLRSYYRQDP
AVWE
>GSU0887 conserved hypothetical protein
MAKSALRHKLIVIAGPNGSGKTTFTSQVLRHDWSEGCIFINPDEIAKNEF
GDWNSPEAVMKAAARAQELREECLRGKRSMLLETVFSVPEKLDFIRRAKE
ADFFIRFFFIGTDSPAINAARVARRVMAGGHDVPIAKIISRYQRSIANSA
LAISMVDRAYVYDNSIDDREPKKLFRTRDGRIFKTYRDLALHEWARMVVE
GLPD
>GSU0900 hypothetical protein
MKRILLPLLICPACLPKEHPLDLSGAGEQAEDIVSGTLSCRRCRRRYPIR
EGVALLLPEPEEGPWGGQWKYEEAATVNSYLWSHFADLMGDLDAGTAYGD
WAGCLAIGGGRAFDAGCAVGRLTFEMAQRSEIAVGCDLSVAFVRTARRLA
AEGRIDFSLPLEGNLREEFRIELPGHWQTDRTEFVVADALRIPFARGSFD
QTASLNLVDRVRHPLAHLYEVNRVALAAGASFLFSDPFSWSTANTPEEAW
LGGTAGGPYAGRGIDNVRALLEGKDGIIAPPWRIDRQGSVDWKLRSHHNH
FELVRSRFLAASR
>GSU2680 conserved hypothetical protein
MEPNSRDIAGVRTVALFEGAKGALVIAAGLGLLALIHHDVQALAEEIVSH
FHLNPASRIPRIFLEAANAASDGRLRLLALGAFGYAGLRFTEAWGLWRAR
PWAEWLGIVSGGIYLPLEVYELVVSISAVKIGTFLVNLIVVAVLVRARIR
ARR
>GSU3032 hypothetical protein
MGEDGTETCLHSLYDPIHEADSFVAGPVDAAVLVFLGTGLGYHLPRSLAN
NPHVSRVILVERYPELAELAAARLAQGWSGRIDVVTLPSAGRFPPDPEEL
DAAALYVVPHPPSVRAHPAWYDHYRILLATVGRTNSATDRPHAAGRPFTI
LVPFEAYYVQRECIHGLESLGHRVVVLDCRGREGDEASLFGEALRGERPD
LVLSVNMRGLDRRGVAAEMLRRLGIPLALWFVDSPEFILYGEALPPADGC
HVFLWEKSYLPAVAAQGYRVSYLPLAADVGLAKAARVDERFSAGLSFVGN
SLVSGFLARLAVKFPVTPQVMAFAGEAVERIIAARGDQLRLADQLVAEGA
RFLPDDDARLFFRAYLLHSATSAYRTRLLGQLLPLGLTFFGDPDGWRKVF
GSTIDARPDVNYFRETPAVYASSAVNVNATSLQMPHTVNQRVFDVPLCGG
FLLTDRQGALDELFAENEVALYDGISDVAETARFYLERKGLRREIAERAR
CRVLGEHTYGHRMQKVIAEIFGRPV
>GSU0484 conserved hypothetical protein TIGR00294
MPDMQTSRDTRKIPISKVGVKDISYPIVVMDKNRKFQQTVARVNMYVDLP
HHFKGTHMSRFIEILNAYREDIALDKMEPILQEMKKKLGASSAHLEIEFP
YFIEKRAPVSGARSLMEYTCTFTGTLAETFDFVLGVQVPVTSLCPCSKEL
SRYGAHNQRSHITVRVRYAGFVWIEELVELIEGCGSSPVWSLLKRADEKF
VTERAYENPKFVEDIVREATLALAAHEAITWFSVEAENFESIHKHSAYAA
IEQDKRKA
>GSU3209 iojap-related protein
MTDKLTLTPLERALQCARFALDKKALDVKVLEIGRISSIADYLVLATGRS
DKQAQAIADSVKKGLKKYGKVIDMEGLREGNWIVIDYGDVIVHIFREELR
SYYDLDGLWSAAGQVAIPAEYLWEGKEGGGE
>GSU1683 conserved hypothetical protein
MRISDRLSSRDVDKKGKQESLSRSDVAGSPFVRTLTRNRSEFENYEQELQ
VLKDELDKVGNELEREPTIANFRTFRDLIGRITKNVTSHAYRLQRVGGTT
LNPRCFEIITVIDREADNLYRLIMTENKDRLAITNKILELKGLVVDLKI
>GSU1107 conserved hypothetical protein TIGR00296
MPRLTDDDKKLLLMLAREAIVTYVREGITPATELSVPSLCEHYGCFVCIK
KGGELRGCIGNFTSSQPLYQLVREMAVSAATRDPRFYPMTSKDITDFSLE
ISVLSPLEKISSPEQITVGTHGIYIEKNFFRGVLLPQVATEYGWDRDTFL
MQTCVKAGLKPDDWRDGADIYIFSAEVFS
>GSU1046 conserved hypothetical protein
MEIPCHPDSRPLALADKPLLDALFTELQPRVSELTFANLYLFRGIHDYRL
TRLGDALVVLGRGYGGEAYALPPLSGDVTGALRTLLADGFTIYGADDTFL
ERHGADAAITVEEDRDGFDYLYLRSDLADLPGNRFHKKKNRINYFAARHP
FEVRVFGPDHRQGCLALLDEWRRVRDAAGSTSLDPETAAAAEAVTLSAEL
GLEGVVIAVEGRVGAFALGERLNRETAVCHFEKSDPFMEGISQLVNREFC
RLFTDCTFVNREQDLGEPGLRTAKLSYHPVELVRKFWLRKTPEP
>GSU0353 membrane protein, putative
MEQQLAADGLIVLFVVSFLAATLIPVGSEWLLVALLTQDYNPLAVTTTAT
VGNILGACTTWAVGMWGGVYLVRRILRISEDAERRAEQFYRRYGYWSLLL
SWLPILGDPLCLVGGILRVGFGRFVLLVGIGKLARYSFIAWVTVHAVN
>GSU1208 membrane protein, putative
MITALDVFGTFVFALSGAFRAIKYELDLLGVLVLAVATGVGGGMIRDLLL
GTTPPMVFRNESYLAICVAGGLLVFLAAGRLAPIWDWVMVADAVGLGVFA
AIGAQKGAAGGLGGFGIVMMAAMTATGGGVVRDILVMEIPAVLRTDFYAS
AAILGGACFVAARAAGAPEQVQLFTCLAVTLVLRLLAMRFGLSLPRIKGL
APAPGDGGTGDGKE
>GSU2827 conserved hypothetical protein
MPDFGNPFAGLANGRKLTHAELVRAIRFMVAAEYEAIQLYMQLAESTDDE
LAIAVLKDIADEERVHAGEFLKLLYHLAPDEEKFYAEGAEEVEELAEEVK
KRAKTKGGKSKKS
>GSU2275 conserved hypothetical protein
MNRERRYWLFKSEPSCFSFDDLGSRPNGTEHWDGVRNFQARNLLRDEIKP
GDGVLFYHSNVPDPAVVGIARVVREGYPDWTALDPAGEHFDPRAGRNNPI
WYMVDVRYVKPLARPVTLAELKMHPEVADMVLLQRSRLSVQPVTPAQWEF
ILRLGGIHEPFNL
>GSU2763 conserved hypothetical protein
MRRPVFPAPYRHEHAPVKNVNEVVRDQLTAGQRAADWIASRVGSWQFIIG
QSVLLTVWVILNITAWIRHWDPYPFILMNLVLSLQAAYTAPMIMMSQNRQ
AARDRIEAHNDYEVNQKAEAEIRAILDNLAAQNAAIAEVHAMLDELRGRL
ADRGQPNA
>GSU2233 conserved hypothetical protein
MALFFRGIISLLIVVLIIVAVLFVDFAYKTFSLRQRDVTTDAIVVLAGGR
GRVEEGVRLFRERKGTYLYLIGVDPVVKKRDLFRDREGERLADRVFLENL
SRNTLENALYGREIIMRKEVRSIRLITSRYHMKRATLLFRQVLPRDIAIY
PHPVDTTNLKERWWDDKGSFRLLFSEFYKYCLFRFFFLFASAELRIPAR
>GSU1073 conserved hypothetical protein
MPTTTLLAPLVRLVRRVLLAGTAAALLAACATIPQMADETDRDLPPILAQ
DEIFRPYVKIGTVEVNLKRYGSIEELQREAEEWAHDALSLEASKIGADAV
ILPEVRIEKDTYIIFPVIYVKGKGTAVKFQ
>GSU0792 conserved hypothetical protein
MCLILFALDAHPRYRLVLAANRDEFYARPTAPAAFWDDAPQVLAGRDLTA
GGTWCGVTRDGRIAAVTNYRDPGAHRVGARSRGELVAGFLGGDEAPSRWL
EHLQRNGHDYNGFNLIFGDGNGLHYHSNRGAAASPLSPGIHGLSNHLLDT
PWPKVARGRDALARLLATADEPAVDDLFAILANRTPAPDHLLPDTGVSLD
WERLLSPLFITSPTYGTRSSTVILVDRSGQCTFVERSYNGAADHPRTVEY
RFEVIT
>GSU2648 conserved hypothetical protein
MAFVIMAPRTIPRRPRRSMKNRLIESTRYIILIAVIGSFAAAVTLIAYGG
ILAFRTITETLASGYVSSKGMKSLVLSFIEVVDTFLLGTVLYIISLALYE
LFIDDTVPVPQWLTIHNLDDLKYKLVGVVVVVIGVLFLGQVMTWDGQRDL
LGYGVAASLVIAALTYFLSLKKK
>GSU1048 SEC-C motif domain protein
MTNLCPCGTGKSFGECCEPLVTGARAALTAEELMRSRYTAYTRAEIGYIH
DTTHPDHRADFDEKGTREWAESSQWEGLEILATAGGGPADTEGRVEFIAR
YRDTGGRRTHHELAEFRKVDDAWYFTDGYGIKPQPAVSTKIGRNDPCTCG
SGKKYKKCCGA
>GSU0520 conserved hypothetical protein
MARDEHDSRYLTAELPGTGGLFKETPEDFLVEEISLYLPCGEGEHTYAVI
EKRGITTLEAIRRLCRATGAPERDTGYAGMKDARGITRQTVSLPRVTPAE
VMGLNIPGIRILSADRHRNKLRLGHLAGNRFRLRLRETVPDAADRARAIL
DVLARRGVPNRFGEQRYGIQGNSHLVGRAMLAGDWRGAVDLLMGDPAKVE
GERWRAAIEAYRQGDLAGSLSLFPGHCRTERDVLQRLAKRPDDFERAVHG
VHPRLKKLYLSACQSALFDRVLETRLQTIDHVMEGDLAWKHDNGACFLVT
DAKAEAPRAERFEISPTGPLFGCRMTSPEGEPAALEKSILDAAGLTPAAF
NLAGGLRMEGERRPLRVPLEGPELSCEEEDLVLRFSLPRGSYATAVLREV
MKRPGNGA
>GSU2818 membrane protein, putative
MMQLNRCRKECDMNGSNLRRGTFTILLALCATPWVGTAQALVMGIALGLL
QANPWPRQTARYSKMLLQASVVGLGFGLSLGEVIQTGKDSIWYSVIGISC
TLLVGYGLGKLFKTGTNTSALISFGTAICGGSAIAAMAPVLKAKSDETAV
ALATVFTLNSAALLLFPLVGHWLQLDQNTFGVWSGLAIHDTSSVVGATSA
YGATALAIGTTVKLTRAIWIAPVVMAASLIKGGEQQARIPLFIIGFLGAA
AIRTLLPSYEHFWGELAGVAKQCLVVTLFLVGAGLSREVVKQVGIRPLVQ
AVSLWVLVSALTLVALKLPWSA
>GSU2456 UDP-2,3-diacylglucosamine hydrolase, putative
MRAIFIADAHLRQPGDRNYRLLMEFLAGLRGNVDTLYILGDLFEFWIGYD
PVPFTHYLPLLDRLRELVDSGVRIEYFEGNHDFHMGPFFTETLRATVHPG
PAVIDIQGERLYLCHGDEVNRKDYPYRLLRFILHSSLTRAATRIVPPAVT
CRIAVGMSRESRKNHGRRRHRWDYAAILRAFAAERFREGCRAVVAGHFHQ
PFIDRNDDGTVLLSLGDWLTHYTYGEWSDGTLSLKTYGTP
>GSU3289 conserved hypothetical protein
MGKEYSVQEALKLAIKGEKDSMDFYRKAATVTKNERARKVFDLLANEEVG
HLKAFFDHYKGGDFGDITVYMAQPVDTKNPTYMALVKAIDEETHEQKALE
IALKEEKACIDQYTVLAKDIIDPLVKGIFQQVIKETEKHLALIEEEYRHV
MTMVHESDQDTYVRE
>GSU1807 conserved hypothetical protein TIGR00159
MTEPLQIFGWRDAIDIAIVAWIIYRLIIMLRGRVANRLLLVLAFLAALYS
LSRLAGFEAFHWIVGSLFSSLILILVILFQHDIRRALVTHGKHRHALTED
RDEQEERDHASLIVGELIAAATSLSSRRIGALIVIEREMGVMSHVETGTE
VDAKITSEILTSIFLPYSPIHDGAVIIRRGKLTRAGCFLPLSQDPTINKN
LGTRHRAALGLTELVDCVVLVVSEETGTISVAVGGRIISVSDAASLRKIL
KKLLEPRWLTE
>GSU1313 carboxymuconolactone decarboxylase
MDSYYVPDDLAKFGDIGKDAPDLAKKFFDYYGAVFAEGELTEREKTLIAL
AVAHAVQCPYCIDAYTRACLEKGSNLGEMTEAVHVANAIRGGAALVHGVQ
MRKIAEKLSL
>GSU0636 membrane protein, putative
MGALLAILVLFFAKILFTGKIIRAPDITNEFYWTIKHYKEMGFLDLFRVH
LRAGWDWLTNGGTTEGGGTLSLQFLFYRSLIFWLFPAPANVAWFIVFHLF
VGGAGTYFLCRAIGTGRAAALLGGLIFAIAPENASLINAGHAQKIATISF
APWAFYFLERGYQSRRTIFFLASAVVLAIQFFNMHWQIAFYTCLAIGAYG
LCRTAGIIAGDPDSRTGKGIARLVGLNAVILCFFLSTVAISLIPLADWSR
ETTRGLQSGSNQGQGGLQVEEAMSWSMPPEEAVTFVIPGFFGLSRQEGGY
DDPSHGTYYWGRMVFTQTTDYMGLLPWLLAPLPFIFRRDRYAWLAFGAVV
GGLFFSFGKYTPFYWLLYEHFPGIDHFRVPKMMLFVTTLGLAVLAARGAD
LLLDDEVRATSAFSRYLAGAIALVPALLALLGITMAARPYVMDLLSPMIT
QPTRFEQGPALVAQRLQTIQREIGIAAAFAAVYGAVLWSWARGWFSRRAL
PYLLVGVFLVDVGRVNAKFMLLQDVPQKVKGEKSPVVEFLAPMPKTGRVL
PIDGSDPMEYVSHGIPVMFTSNPVQIARWQDFLESFSFDSAMPDMMNVRY
LVHDAQQYEDDRQALGPRYVPVFASPDGSRLVLENRGVLPKAWLAPSALL
LSDPRQILGIMQQPSYNPRSFAVVEEQPPIPMPPPMAQPKGDAGEVTVTR
YEANDIACDVRAGRNALLVLGEKFHAGWRARVDGTPTEIHRVNYILRGVY
LSPGRHRVEFTFDPLPFKVGKYLTLTSFAIFLLMVGREWLLSRTRRGGGE
>GSU0431 conserved hypothetical protein
MITRHFQDELARLKELGAEFSVTHPALAPMLGGPSTDPDVERLLEGVAFQ
TALLRQKLDDDFPEVVHDLVRLVAPHYLRPVPATTIVAFEPKPSLTRSRL
IPAGTELASIPVEGTRCLFRTTSPVELHPLELLEATFAQPAGSAPVVTLS
LSLTGLSLDHWEPRSLRFFLAGDHAPAAELFLILSRHLKRIVITPEERGA
SASLPAACLQPVGLNDDEHLIPWPSHAFPGYRLLQEYFAAPQRFLFLELT
GWERWTTRGSGSRFTITFELGELTIPPPPVRRDSFVLFASPAVNLFSHDA
EPILLDHRVGRYPVRPAGLEPGHGQVYSVDRVTGIVRGAATERIYHPFEQ
FRDGADAQPTFHTAVSASPVRAGFDVHLAVAYPQGEGLPETETLSIALTC
SNGRLPENLWLGDLCEPTSSSPESATFHNITPLTPTLLPPLGKNLLWRLV
SHLSLNRLSLASADNLRTLLALYLFEEGGDRGDLAANRKRLDGIEAVIAK
PAGRLVGGHLLRGSEIVLTLRGDHFASPGDLFLFGAVLDRFLGGYASLNS
FTRLTVRESVRGEVYSWAPRLGHRQLL
>GSU2791 conserved hypothetical protein
MLSMISTPRKETKNAMIITTTPTIEGKRIVRYCGVVAGEAILGANLFKDL
FANIRDMVGGRSATYERELQRARDIALRELEERAEELGATAVVGVDLDYE
VMGQGNGMLMVSASGTAVVVE
>GSU1479 conserved hypothetical protein
MKAELFFSSAEMERIRNAVAAAEAATSGEIVTMVVAESDSYREAETLGAV
LLAGLVSIAVAVVLHHVTIWTYIPLVFVLFPAARAGMRRMPRLKLPFVGR
ARLAEAVRERAVRAFYERGLYRTRQETGVLIFISLLEHKVWILGDRGINA
KIPPGFWEGLAGELARGIREGRGCDALCAAVAGCGAELARHFPRRADDTN
ELTDDLMT
>GSU2661 conserved hypothetical protein
MKRNITQTLVDEHKLILRMLDVLERNASLTAEGLYTNYRFYLDAVDFIRH
YADRFHHAKEEDVLFEALVNNGMPRENSPVAAMLMEHDRGRAYVKAMEEA
ATAALAGSTGRDGDIAENALGYLIMLREHIAKEDDILYPLAERVLPEEVR
GGILAGYESAEARTPADFTERYTKAVEHYEAEEQNRAA
>GSU1328 conserved hypothetical protein
MSEFTNVTIIREANVYFDGGVVSRTVVFPDGTKKTLGIMQPGEYTFTTGA
PEIMEILSGELDLKLPGSDAWNRVGGGESFDVPANSSFTMKVLSLTDYCC
SFLG
>GSU0136 membrane protein, putative
MAKSNMGAVADSLALLMIRVPLGAIFIAHGSQKLLGAFGGQGLTATFRVF
EEKLGIPPIFTLLAIIAEFGGGVGVLCGFLTRLSGFGIASTMAVAMYKVH
WANGFFLNGPRGNGIEYNLALLGMALALVFAGGGAWSVDRYLFKR
>GSU0983 conserved hypothetical protein
MGLYDLLDAEGKDDQKGKSPGVAVGVVADNQDPEGMGRVKVRFPWKDDAD
ESTWARVVTPMAGKGRGLWFLPEVGDEVLVAFDHGDVQHPYVLGSLWNGT
DTPPGDNGDGNNNIRKITSRSGHELVFDDTSGAEKVVIRTKAGHAITLDD
AGGGEKIEIKDKSGSNKLVIDSAQNSIGIESSMKLTIKGQMVEIQSDANM
TLKAGAVMTIQGSMVKIN
>GSU0233 conserved hypothetical protein
MEILLNDVEVRVLGCLIEKELATPEYYPLTLNSLTTACNQKSNRDPVMAL
EESEVVRALDGLKMKHVAIQAADSGRVPRYRHILSERLRFSPAELAILAE
LLLRGPQTLGELRTRAERMHPFADLAAVEQVLGELAERTPPLVMRLPRQP
GRKESRFAHLLAGEPDLSAEERTAPPEGARLQVMAENERIAALELEVATL
RAEVGELRQVMEEFRSQFE
>GSU0183 lipoprotein, putative
MRRIVTLPTVAYWLRSAVLGICVIALTGCAATRQAWFSPVHPESELERNR
FQVAPGTDVIGRLGLVKLEEGDTLPDVARHFSVGINAISAANPGVDVWLP
KAGQRITLPLNFILPDAPRKGIVINLATMRLFQFKENGGALAVSTYPVGI
GTKERPTPQGPTRVARKASRPTWYVPASIAEDHRKKGDILPARVPPGPEN
PLGDYALYLSRSGYLIHGTNKPASIGLKASNGCMRLYPEHIEVLFRDTAV
NTPVLIVNQPYLIGQRDGVLYMEAHTPLEDSGTSELARIHAKLKSFESAS
ARTLDWDKIKQVQAEARGIPVPILELRPGAEQEAASPLRIEHPAALYGRP
ELPALKLDAWYVMVGDMRDEMEARRMAAIINHQGPPIPARVLSKDSSYRI
IAGPFSDDGKARDAAKRLKIDLDIDGIVIEPGRTI
>GSU1477 LemA family protein
MKRLVMVVTALLGLSLLSGCGYNVMQANEEAVFSAWGDVEAAYQRRADLI
PNLVEVVKGYAKHEAETLTAVTEARAKVGSMQVTRDAVNNPETLAKFQQA
QGELSSALSRLMVVVERYPDLKANQNFLDLQNQLEGTENRINVARTRYNK
AVQDFNTSIRTFPNSLTNKLLLHLERKEPFKADEGAKAAPKVKF
>GSU2193 conserved hypothetical protein
MNLIDCALRMEEEAAAHYSQLAAAAPVEELRSIFGLLAAAEKEHHDKLVA
MLGDADAAATAFTALDDASCVFKPLLGKRDLVAELRRDPDGYRHVVKEEE
ESIKFYEDLAAKAEREDTRAILLKLADEERRHLSIVENIYSFIEEPKTYL
AWGEFSNLKEY
>GSU2146 conserved hypothetical protein
MKRFEITVKSEKSFDQAVSAIEEKAAEKGFRVLHTHDVAKTLTEKGFARE
PLKIVEICNAKYASQVLEKDVRIALMLPCPISIYEQKGETFISTMLPTAI
VDFFPEAGIETLASDVERIVLDIINLAK
>GSU3442 conserved hypothetical protein
MVRVKRIYDEPATEDGTRVLVDRLWPRGIAKDKARIDEWLKEIAPSDELR
QWFGHDPARWDEFRERYRRELDAKAELLDGLRKLAAGGTVTLLFAAKDEQ
HNNAVVLKDILVEQ
>GSU0897 conserved hypothetical protein
MIRAVHSAAIVLLVFSYMAEFSGAASGGYAPASVGTQPVGTSFRDRLADG
SPGPVMVVIPPGRFRMGAIFGGGDPDEKPVHEVSIPRAFAIGAYEVTFAE
YDRFCEATGREKPKDGRRWFGPLSRNWGRGNKPAMNVSWDDAVAYVKWLS
DQTGHRYRLPSEAEWEYAARGGKDTPYWWGGTVGQNKANCKGCGSRWDKK
ITAPVGSFAPNPYGMFDTAGNVWEWCVDTWHESYDGAPADGSPWIGGEDS
RRVQRGGSFGSKPRYIRSSARGRGAQDGRYVYLGFRVVREL
>GSU1898 membrane protein, putative
MEDVLRIAVGTVTMRPYVFAFFAAYLVAAVPHLGWRKTLLFTAAGYLISF
ASEFSSINTGIPYGWYYYIDTTRDRELWIAGVPFFDSLSYVFLAYCSYAT
ALFVVSPIKAWRWDLVTLESRSIRGSFAVLFLGALFQVFLDIIIDPVALQ
GYRWFLGQIYGYREPGIHFGVPISNYVGWWVVSVIMVFVLQRIDLWCEGK
DGKPAGVANPPLRSLYGPILYLSVLVFNLAVTLWLGEHLMALTGILIYVL
AGSIAIVTIVRRTNRYRKEELAEHLRDYPWSAVSGRCAKE
>GSU1151 conserved hypothetical protein
MKRVIINADDFGLSDGVNRAVIKAWQEGILTSASLMVGGDAFHEAVHLAK
ANPGLHVGLHLTLVQGRAVGEHGGFPSICDSKGAFSDDPVQTGMRYFFIR
SFRKQLYREIEAQIVRFLQTGLPLSHVDGHLNIHMHPVVFDLLCGLMRKH
GITTFRLCRENLRANLTLDRERRFGKMVDAFIFSKLAGRCRPILDHLGIG
YAQEVKGLLNSGRMTEEYLLRVLDTLEDGTTELYFHPGCLPCAEITRRMP
DYRHEEELAALMSPRVKGRLKELGIELGNYRGEVKTYV
>GSU2412 hypothetical protein
MREEERQVRFRLIFREPERFPAGQDPPDGLHRLVFLEPRPCTEDELRRLS
PAISFDRAMIGVNLNADGKLQIWGVVHSGTRWMQAVHGGTQLINPVPDSL
IVFVTGPGRISVSVGSTMIAGLRGGQVVSMAREVFAASWLREIFATHRDE
LWDLHLDARQKSGDIWPPIDPEFPVILGQNLLRRVISLMRSYRHGGTLLI
IPSERTDELKQANPYLNIKYPFLTDIGRRRIFPHIVGIMNEFARVASRSQ
RECRPLGWDEYLALSDPVLTRMDEALFELAHFVADLSLVDGAVVLNRRFE
VLGFGAEISGGLENVTRLHVALDIEGETRQPEAVKGRGTRHRSAYRLCNA
LHDALAIVISQDGQVLFVHWHDGAVTCWDQVATSLLDF
>GSU0519 conserved hypothetical protein
MNDPQKLCDVNFFRPRKGYMTGEVAIIAAVLVGWAVANFGFQGVLALLAQ
SPDGEGILTRLTFLSFPWHFWFTGQFLPLWFIILCVLFNIYIDRHTEQHS
RRRDRSHD
>GSU3186 conserved hypothetical protein
MDLLNSTPFAATPLFLSDRHGAETLLVIVKGTWRINRDGTLSVAEEQVPI
RFEPLYSGDPASSSLIHDTDIILEKPGTDCILLGHAWAPKVGVESVDVTF
AVGPVRKAVRVFGERIWMKCLGMVSMSRAAPFEKIPLVWERAFGGADTSW
PDPKNHEFCLENPVGRGILAKRTKQEIDGLRLPNLEDPTHPIRKTDDHPR
PMGFGPIPPHWQPRAKYAGTYDDHWRKYVNPLLPEDMDSRFHSSAPPGLM
SNRHLSGTEQVLVANASRNGRLEFTLPGIAPRVSVAIGATVHELKMQLDT
LIVEPDEERLVLVWRGNHNVHGKLHSLKRVCIEAR
>GSU3208 conserved hypothetical protein
MKLRVLWVGKTQEEWVRRGIDEYAGRVRRYAPLEIGEARDEKGAAAEAMR
ARECERLDKLVPRTSRLILLDERGDQLTSPEFAAYISRCRDTAVPELAFA
IGGAYGFADEFRRRADRVIALSRMTFTHQMVRVVLLEQIYRAFTIIGNEP
YHH
>GSU1012 membrane protein, putative
MNDLLRTLAVLVVVVILLRRKMHLGLVMLIGAAILALLYLTPPLDFLAGA
WVALVSPSSLEMTATLIFTMIMENILRSTGTLKRMVESLSEVFPDARFVM
ASMPAMIGMLPSPGGAVFSAPMVSEAASRLSIPADQKAFVNYWYRHIWEY
VSPLYPGIILVAGLAHIPYQKIVLANLPYALSVVLWGGVFAFSGVGPTPA
SSSTSVGRGKALRVFLITISPIMAALVLVVVFRVNPVPAMGGVTVLMYLA
HRYSPAAIVRSLRESISLKAMTLVFGIMIFQETLRLTGALDGISRFFAES
GLPTVLIITVIPFLAGTMTGLTVAFVGITFPILMPLMGGDVPSLGLLSLA
FGSGFAGVMISPVHLCLVLTREYFGADMTKVYHRLWIPQALVLAAAVVPV
YIFQ
>GSU0464 S4 domain protein
MPRGEPMKIDTDHIKLDSFLKAANLVASGGEAKIIIAEGAVRVNGETELR
RGRKLRPGDQVEVAGECFVIE
>GSU3278 hypothetical protein
MQATGKVKPVNFMKSFRTTFVRCLAGYLIIASLLTSPAFAVTSEDSQIFI
AGFNAYQKKDYQAAIDRMKTVLDKYPDTPLRDMAIFWLARASFKAGYERD
AARYMSQFFKEYPDSPLKGTVEDDLLGLVTRYDKGEQLPAMARRGAEAPV
SETSVARQVLAEKTASARQAAEQAAQKAASITAGAATTTAEAVGAATAAA
TAASQAVARKAAEEKAAEEKAAAERLAAEQAARERVEAERLAAQKAAQEK
ATAERRVAEQTAREKAEAERVAAQKAAEEKVAAERRAAEQAAREKAEAER
VAAQKAAEEKVAAERRAAEQAAREKAEAERVAAQKAAEEKAAAGRLAAEQ
AAREKAEAERLAAEKTAAELRAAEQAAAAKVEAERVAADQARRDREAAER
LAAEKAARQKEETEQSAVRRAEEQKRAALREKAVAEYKSLIDRFPGTRAA
AAAAAKLAEMGITHVAPSPAVAAAPPALPEKNAQILTLEVGQFADADLMI
QAPAQGLEAGKRHEIPFEVVNRGNGTDSFYLESGFPSEFGAQFADAGRKE
MPINLTPSLAPGERFRGILSIAVPREAIDGQKLVYPIKVASRLDREASQA
RDIFLTASAPLLRAVVKTDKNQVLPGEKVSYRVVLLNIGTAAAQGVALRL
NYPPQYEPVGFGEAGFKQEMKAALVLDGMRLVSGESREFEVTFQLKDEAI
ARQELFLRADLVNSELQTRDSFISAAAVVKPVSGVAVRTASEKLVVIPGQ
QVSIPLVVTNTGNQREDVIIKPNLPPNVAYTVYQDLNRDGIRQNNEPIIN
HVGPLAPKEESYVILDLSTPFSESDGTAATVSLAFEPETDQARGALASLR
LLYSRPVVDLVMSGKGGRLKPGEVSSFELNFTNRGSNMAKSVELQSTMPK
DLELVASDPSFVRNANGDYLWTFDELGASEKRTIKVTFRVRSGTAVGTSI
QVKNVLKYQDQLGNRY
>GSU2319 conserved hypothetical protein
MKFLTVRAMLAGILAGLLPACAGERPASLGVRDGRLSPCPSSPNCVSSQE
SDDRHRIAPLAFTGDPDAAFAHLMLVLEARGDTTIIEQTGEYLRVELRTT
FFVDDGEFLLDRGRRVIHVRSASRLGYSDLGKNRGRMEDIRRAFSPAGSA
P
>GSU3290 conserved hypothetical protein
MPIYEYRCEDCGGTFSLLQKMGAGERETCCPACGSDRVRKLISAPAVGSS
AGVAPAGGHACSIGGG
>GSU0429 conserved hypothetical protein
MNAHPPLYWHQGLFLQPHHFQLQDLSVQGRLAPLFRHLHPHFWGAGELEI
EASALGTRVFSILSGEFLFPDGTWAVIGENALCEPRSFDDSWIEGDRPLP
VYVGVRKWSGDRENVTVTERLENLGGITTRFAAAADPEEMRDLHAGGPAG
QVKRLRHVLKILWDSERDQLGDWHLIPVARLERFGADVRLSGRFIPPCLS
LEASEPLFRIVREIRDQVAARARQLEEHKKQRGIQNAGFGSRDMVYLLAL
RSLNRHVPLLFHLTEARQVHPWDVYGALRQLIGELSSFSERVSALGKLDD
GVRLLPPYDHRALGERFFVARDLIARLLDEITAGPDYVIRLTHDGTFFSA
DLKPGIFETGSRYYLALRTTTEPAAFLPSLEAAAKLSARNHLPVLAARAL
PGIGLSHLPVPPQELPRRADTHYFAVDTAGDQWPLVEREHALALHWTGAP
ADLEVELMVVAKG
>GSU3222 NHL repeat domain protein
MKPRFAVAHRPFTQAARLFLLCSLLWISGCAGKTGTAGKTFFPPPPNLPR
LQYLMGIANSTDVEGKDSSFSLFGGLAEQREKIRYIVKPYGITEAGGKLY
VSDVGTAQIVVIDLPGKKFELLKGAAGPGKLTTPANVAVDKDGFIYVADA
GRREVVVFTPEGDFLKAIGGDRDMKPVDVVVSGDRAFVLDIKSSDIKVFN
VKSGQYLESFGTAGGPFERLAMPINLAMDSKGFLYATNGVSGRVLKFDRD
GNLLLSFGQMGDGFGQFARPKGIAVDPTGLIHVVDGGHQNVQLFSDTGRL
LLFYGDAGKDSTASLNLPAGIAYSTANLEYFQKMADSSFKLDGVVFVTNQ
GGKANKVAVYGYGKREGIDYEQEYEKIRKELEERARKAREKEAQEGKKAG
QAEPKAAEPAAK
>GSU0083 conserved hypothetical protein TIGR00726
MEMKRADKVHYVEPSLLAAAGVAVQGFTTRHEGVSRTPYNSLNLGTGTAD
ASHSVEGNRSILARAFGGTVERLVTVTQVHGTDLLVIDAPNPDYGYFQRL
EADGIITNQPGVMIGVCVADCVPVLLLDPVKGVAAALHAGWKGTASGICR
KGVDAFVSVFGSDPRDILAAVGPAIGPCCYEVDTPVFQAFRQAGCEWDEV
ATLSGVARWHLDLARANARQLAASGIAERNIETSGQCVCCSPEQFFSYRR
DKGDTGRQMGFIMLKG
>GSU1157 conserved hypothetical protein
MFVHSLCLHLHLPSHSLKGKRGIVKSILARARQDFNVSAAEVDLQDVPDE
AVLAFATVTGDRSPGRHLLERLEDRIAEERPDVTIVAAEFEER
>GSU3251 conserved hypothetical protein
MPTPIKIGVSSCLLGEKVRYDGGHKHDRYITDTLGRFFEFVSVCPEAECG
MPIPREAMRLEADGGEVRLVTSRSRVDKTEQMLDFCRRKVLELESEDLCG
FIFKKDSPSSGLFNVKLYRRGMPTKAGRGLFADAVTKHFPLLPVEEEGRL
HDMGLREHFIERVFAFRRWKDLLLGGKSLGRLVAFHADHKLLIMAHSPEV
YREMGALVARGKELGCDDLFACYQELFMKALSLHATVRKNTNVLQHIAGY
FKKQLTREEKAELQEVIGEYHRHLVPLIVPVTLLRHYVKKYGQEYLLKQV
YLSPTPLELMLRNHV
>GSU0864 conserved hypothetical protein TIGR00251
MRMPDPPSPAPRITDSANGVTFSVHVQPRASRNEICGVQGEAIKLRLTSP
PVEGEANRLCVEFLAKRLGVPKSCVAIIAGEKSRHKTIRVSGSDAAAVLA
LLENSRR
>GSU0195 conserved hypothetical protein
MYGKDRIYKKVEIIGVSGVSIEGAIETALVRARNSLDKLSWFEVQEVRGH
IGADGKVAEYQVVLKVSFELKD
>GSU1967 membrane protein, putative
MERDRRIDTIRGLLVAIMIIDHVGGYLKLATAQTLGYVSAAEGFVFLSGF
VCAKVYGRYLDNTRQLLTKTFRRSFLIYRYHIAVALLLPLLALLIPPYQR
SWMELLHPYDVAPLKTVLLDLVLLHQGDYLDILPLYAWLLLLCPALLLLV
RRVGAVPVLAVSFGFWVAGQFVDPMQLIASSFGAQYGTGFFNLISWQFLF
TLGLCLGISGQWLDRLCANKMMLATVAVCCLVFFAFRHSDLAVNHSLLVD
RSKLPIHRLLNFCFLMVLLRQMLQNVPRDFFVPYVEYIGRHSLQIFSLHV
LVIYLFRPVAWRMDSMFGEAGLTALSLVIFALSTLPVYVYNRYRQTSSLL
FRQGLDSGLR
>GSU2807 conserved hypothetical protein
MVLLDNADRVVLLTEKERPRGRKIDISLTFLLPHDNWFSVEDGVSDHYFS
RRGFLRASLLGVLCLRGIGSALATEFLEESYPVGRLSLRNIHTGEHLSVT
YRTPDGEVDLDALNSINWLLRCHFTNQHTEMDLAVIEYLNMVDKVLGGGR
EFRIISGYRSPEYNRILSEHNGAVAKQSLHMEGKAIDIAVPGVSLAVLRD
LAAGFRCGGVGYYPHSGFVHLDSGRFRTW
>GSU0023 TPR domain protein
MKQLTMPLTALALFTLGGCVTNSDLDVVRRDLDELKNRQFQVEKEVGGVR
TEATARIESSFKDLDTERAGVRKGLADLQAAMDGIKVDMQVLAGKVDDVG
LAAKKPGDDLILLREDLERRLTAFDQRLAKAETGIEALQKKVAEQAPSPK
ETEKPTPEALYQKGLDAYRAGNYGVARESFTRFLEQHPKHELAANARYWT
GETYYSEKKFEQAILEFQEVIKNYPGKEKVPAAMLKQAAAFSEIGDAKSA
RFVLRKLADDYPSSEEAKRAKDRLKELK
>GSU2682 conserved hypothetical protein
MERRTFLKLVGLSSLFAEGAFRRLFAAPQPTVAVARGKDYARTTRSALAL
LGGMKRFVKPGDVVVLKPNMGWDRTPAQAANTHPLVVRALAEEALAAGAK
RVKVFDRTCNDERRCYVQSGISDALKGMKGVELKFIENERFRRVPIKGAV
LPAWELYDEALSADVFINVPVAKHHGLSRLTLGLKNVMGVMGGNRGSIHR
HLDDALADINSVLKSHLTVIDATRILTDHGPQGGSPADVKVMDTVIASTD
IVAADAYATTLFGLKPEDIPVTVAAHKRGLGEMNLKRVRIVTA
>GSU2411 conserved hypothetical protein TIGR00104
MHPSQLQSPPMTHDSLFTYRPIGTLYSPYSRRIDAPHQGTVVEGTETGEP
ALATLELHEWLDESAIRDLSGFDRLWLIFAFHLSEGWKSRVKPPRGGPKR
GVLATRAPHRPNAIGLSAVELVAVEGRTLHLRGVDLLDGTPVLDIKPYVP
YADAFPDARAGWIDEVDAEQGRHSAPGPRKPR
>GSU0168 Fic family protein
MFTEVSPRGGRLVVQQKGAEGYSAFVPHPLPPNPPLQIDDEMGWLMERAN
RALGRLDGCTYTLPNPDLFLYMYVRKEAVLSSQIEGTQASLDDLLEYEGE
IEGKSSPDDINEVSNYVDAMNYGLERLQELPLSLRLIKEIHARLMAGIRG
GHKSPGEFRTSQNWIGGTRPGNAAFVPPPANEVVTCLGDLEKFLHDESVP
PLLKAGLAHAQFETIHPFLDGNGRMGRLLIAFILCHDQVLEKPLLYLSLF
FKKHRQEYYERLNAVRRDGDWEGWIKYYLQGVYEISKQATDAAKAIMDLM
ARDRQKVTGLGKAAPTALALLEMLYRKPYVTIPYVARELRISSPAASKAV
NNLAALGILIEVSGKKRDRVFLHESYLSIIREGTELYR
>GSU3174 hcP-2, hcp protein
MAMPAHMTLTGEKQGKIDGSCELQGRENTIQLYEMKHDIHMPRNPHDGLP
TGKRVHGPLSIVKMFDKSSPKLYQALCTGEHMKNVQIKWYRINKQGLEEH
YFTTTLEDAIVVEMKPYMPMTLLPANEPYGHMEEVAFSYKKIKWTWEPNG
IEAEDSWSVPK
>GSU2473 vapB, virulence associated protein B
MKTAKIFQNGQSQAVRLPKEFRFEDSEVFIKKSGNVVQLIPRSDSWNSLF
GSLKKFSRDFMSERIQPELDKRDGF