GenColors

Gene list

Applied filters:

COG category: Unclassified
Organism: Symbiobacterium thermophilum, IAM14863
Gene type: CDS

Number of genes found: 662

# Symbiobacterium thermophilum, IAM14863

>STH871 conserved hypothetical protein
MGGGDGLGLVVTVIAAMGGAAVVMLVAWLVVGGLVRAGLVGAIVAYRRGE
DVGFATFWSYATRYAGKMVLLGLIYGVILLVSAVVVIFPLIGQIVYLLWL
PTAFVTLGIYPAYLIIHDGYGVVSAVKQGLRILERQTGQTVLGGLIMALF
YLAGSLLCAVPIVNIVGALFVAILGQPLVYYFFVERFESEVRPQVIW
>STH1899 conserved hypothetical protein
MDDGFAAKLLSNFLIALGVTLGGSLSGAAASLLSGGHPLDRLRYLSEDLR
LWGILVAVSGSFEPLRTLEQGLLFGQGRALVRQVVALVVAMAGANVAYFL
IQTVAGRRL
>STH618 hypothetical protein
MAPEVALRPMDVRNEALKLIRERVGDRTIGPATTLTALTEEYETSMEELV
EALEAEFGVELSEELLVDVETVGELCSLVARVEE
>STH2217 hypothetical protein
MWGAHLERGSEPEGWSSYSISLPTLIGARNAFTNTEPAIEIRQTPNGPET
HLYVTLRNYTDTGVGLRDGPTGADGAVVALGPGGELKWYRKFGPADRLNG
RKASLNTAPLVLVSRAALIFGDVNGYIYSYALDTGNPAGGDGRPTVIPPE
GKAPTDRLFLLKNNEQPNTGPYNFSQVSGVGVDPAFANGLMLVGVNYEDA
GGTSGRLVAYRAGETYDLLWLDAPEPMKLQPGKPAAMQPRLQLEMTERTL
SALCPGPLSVQWYLTDQSGQPIRPLGATPLPGDLAPQQPYPVPVTVTLTG
SDPAEGQIVGVIDLPSVYALSAPHTANPRLVVARAIAEKMGLTAEKSCAG
AVAEVVEREGEPGPEGGLANNVLMVPYTSQQPAQAIVDDPSVVELAVPEI
VEAGRPFEVGIYLGYQNNLGRTEIRVPLRLWARPAAGGSPPAGGWQAVTI
GRLPCCTLKPGTIPGLSAGTWDIIAEIDYPDDTRPENNRLIRRVEVHSSQ
GGQRGGAEGGAITD
>STH2019 hypothetical protein
MDRLSTLTQTEPLTAHELLLLNELIRLESGDVQKTKSMLAMVSDADLRQQ
LTGCLQTGQAHVRALHDFCRARGIGG
>STH2395 conserved hypothetical protein
MDLHMFVPDFQAMREELTRLGFEELRTAEEVQEKLPTAKGVTLLAINSMC
GCAGGIARPAAALALRQVKPDHLMTVFAGQDKEATAAARALFPQYPPSSP
SFAVLKDGQAVAMIPRSQIEGSDPQTVAQRIVEAVQAARG
>STH2545 conserved hypothetical protein containing LPG-repeat
MPLPPAAPQSQLNQPAGLPTGAEAAVHLFPGLTSPAVPAGQPQIQPGVAA
QGLPGLAPQIQPGVAAQGLPGPAPQIQPGVAAGGLPGPAPQIQPAIPAGA
FPGPSPQVLPGIPVQGSPGPVGQALPGMPDQTSPGPVGQVLPGTASQALP
GTPAQGFPGGVPQGLSGMAAAGMPGPVPQKAGDGPARAASGAASQGQPVM
AVPAGSDPFWAQRQALAQQLLAQQAVQQQAAMAAMLPDLANQVLGADGAA
PPTNGATRPESDAPAPATQRRLRAGTYTTRGLQGGYSTRG
>STH1007 hypothetical protein
MPIACRFARRGMLVAPRRLTAPGGPVSRVEGSTAGSGGDRSGGRDPIVEL
LAELRGRAVTLQVGQRLLTGRLILADPVVIVDGQGRATCARPEAVVAVTF
>STH271 hypothetical protein
MWNRCLELHRQGRQQGRWLDEGELKALTKGGQFALHAASVQAVVELFVEC
LERTRAMRRAGRCVFRTDSTTHSGRIRPLITDEIDHRLRG
>STH2467 hypothetical protein, glycine-rich
MRRRLDLTTPADVFFDPVVQLAALCPICALAALRPEAALAALAPGLVLFN
PAINPAAALAAFAAGSVPARRRPHRTGRPAVGPGRPWLHDAGVAWEGDRV
AADAGLPAGAGGGSGAWCAFPGAPDAVVVRLGGSRWVFRGDGRA
>STH386 hypothetical protein
MTVPAATWNPPAPQHGAVILTRTWRIVVYGDAVPRGARLYVALGPASLAE
GYERLFPFLEAQRLCHRHVRTPELLRQLESLAAWAGKSVAVDPPESADPE
ELAGALDELLAGAGLTGPSLIAGARPFGGRTGLVFIRRAEPRPEQGGARH
WLARLLGEGRPGHAPRGLGVHG
>STH183 hypothetical protein
MSCAYHPDREVRGICSHCGRPVCAECLVDLNGQPYCKRCLAARMQQPVRE
INGLVRFVLSIAPGVGHFYMGYFHRGTQLFLITLVGAAVLNLAFPGLLGL
YIPAAIFFSIFDAREIHLRLSQGLEVEDRGFFDVQALPQRFGQRQAGIAL
IVVGALALWRVLTTDLLRWIFGANYFLVQRTLNGFTFGALALALGVWLLM
RQPRDR
>STH2405 hypothetical protein
MGYADHRAAFREGRRPFSIARRRRGIGRVRALPVALACGSAHLPSTSDQQ
ARDASSKAPLEQRVPPVGMPDRMAPSLRLTRAAFTAYFAIG
>STH2079 hypothetical protein
MEEGYVRGVSTFAGLVKVDHSENDRWRPESARRIFVAPEEARRLHPHWNR
MLERALSAALALGPRRPPASDPQRTAKAPRSVV
>STH1911 hypothetical protein
MRRFWIALLLVLGLSATCLGDSAPLAPVGPHVQPLDSTTVRLAEERIEIF
LRRDTRHDDRLSTSAVGEYRIWFRFEPEADEEMMVGFPLFIFDPEQAIFG
AHIENLRVEIDGREVATEVRESAVHQDADAGPVNWAVFPVTFRAGQPLEM
VVSYTMNVLPYGKGYDAPLWVAYVLRTGAHWAGTIGRAEAVLAMDRPIRA
EDIRSEENPFRITTPGWVLEDGALRWVWEDVEPDFDIHVVLENPYWRDMG
VEIREMLQAGIADPAALTAVIEATRSLAEAGNMGMNSSLRDGSLPGEAAD
RLLPDVLAAAESFLAEHPDDDEVRVQYLLLLRETSLPLVWESADWVWRVR
SEDHLALFLREAVARGMTDRLYVQRWRPWAEEELAGYPWRPETQDAIAAF
LTEVMPGTFASEDAARAWVEANAGEALTPERADRLAAGAVRRAATAGSGA
AQPPAADPADRPSPPAGETPAPGAAEPTDAGAQRGWAPAVGVAVGAVLLL
ALGTVVVRRRHRSDGEARKG
>STH264 hypothetical protein
MRPVCLGSVNPCRPPALRECLAAPGEECTKISGAGELQLDGTSRSIPEHE
YVTWVRPAVRLTIEVDLNRKLRQHGRSQ
>STH1024 hypothetical protein
MTVQATTPATSERSGRIPRAVPVLAFLCAALVGLWWRLPLLVAPVADFEQ
GPGGTEAYVTVATQIREQSASFWTGGDVHLRLSEAEVSGMLSSALLSGGS
PDGPLARVRGGVDDGVLLVEAVVAPPADRVPARLAGPVGLRLRLAPEVDE
TGVIQFRITGAHVGKIPVSPHLIRLAGWLLQPRWQGYDAREAAFILPVSD
MISQALGRRIEIRSFTAGQGQLSLTIAMPEY
>STH240 conserved hypothetical protein
MRYFPSERSLWFGLIVWVVVPGVVAFLLVLLLREPTRWWDWILAGAAPAA
LEGLLIWIWFGTGYAVTENELVIRSAFLTWRIPLAAIRRVRPTRSPLTSP
ALSMDRLEVRTNKGSAPLISPRNRSEFLALLRERCPQADIVA
>STH669 conserved hypothetical protein
MYLSLLRLNPASAAVQRDLRDVQALHQRVMSAFPDVLDPEVEARAYFGVL
YRLELNRYSGQVLLYVQSRVEPDWGRLPAGYLTPADGLPNPAVKRVDEAY
ARIREGRVLRFRLRANPTRKIDTKSGPNGEKRNGRRVPLSGLDAQLGWME
RKAREHGFELLEATVAAAGASERVRSYTTGRTFQGVLFEGRLVVRDAGRF
REALERGIGPGKAYGYGLLSVGPG
>STH2128 hypothetical protein
MAAIAGVPRLPDGLSPPADRPAVDQAEADRVTLDVNELIARAAGEGPVHE
APAGGAGTVRVCIREEADGRRKAVVDLTEAPWVVPAVDPYALTELGLDSV
VEFILWKAEGASGRLGRPAPPPRPGGVAGSAASRHPRGWAYPSAGGLPPS
GSSGSPCAG
>STH400 hypothetical protein
MTGMNSRTRYLLAALCAAAAGLLTFLYLGGIGVAAGSGTEIVWVARREIL
PGTQLKEEMLQRVEVDGPTRQLLAREALPRTASDTPDGWYATRAIRPGEP
LIPAGNVSPVPPSADVTPPEALRVVSLRTEWVGPPELQPTEEVDLYVVTG
DGEALRILTGARVVQAESDRVSVLVPEEQVPLVIAAADGVTVKVVRRLEG
LLR
>STH1477 hypothetical protein
MAEQDRRTVHEGPMADRRGPGWRQDAAELEARDRERKENPMGALPNNLTT
IDKAHAARAKSDEGSVDPQADLGGGAAGESEE
>STH2981 hypothetical protein, glutamate-rich
MSASALLREMIELTRLQVQRVIQRDHQGLLEGSRQQEALLKALQEAEIDG
SPEKMRAIYEELEREKVKLQSLLEAESQRVDFLMRLLLGGPELNSVGYPS
TVTKRAGGRMLNKKT
>STH2929 hypothetical protein
MSAGKGSEAGFLRQLAEAIRHHGGPAYDAQRARSLAEAVWRDLGHRAPDD
FPHLVQEGDLRFVCQWYEQPPIMTVMVWDGEMWRDMERLLLPGEPPEREG
ETR
>STH2582 hypothetical protein
MEPFAEHLYRRLAEGLLVSECPRLEEAYVLSLYVDFDDGPQRMKLWLYYN
TPWQLRRAISQGEQPDEARWYFALWEPEFITAVGLSEQECDRMADTESHR
LLARWLARRGLYRAPEELDALLADPGRYDAWESAARQSLLDLVIRVARRL
HDTGIILCRFGRTIPLLVHQWEYDQWCLEATRKVNPPGLITEFERWWWLE
WSCG
>STH999 hypothetical protein
MEWPHAYWEVPAMSEILPPEVRQKIDEWEIRQRYNPVDVEKVEGNLRMTK
VRRTIPVTQHFHANHPGETSR
>STH2669 hypothetical protein
MDAAEIRKYGVEPDFEQYTYEENACSWCGQPVLIIQSKERLPGYRTAATA
CENCMSKMTKRTRAKEFRWVFLPGWRPA
>STH184 hypothetical protein
MSERELRSLDLKEFLALVEADIRELDRRQLARHRLPSLPLRPVSCCGKRL
GAFPPSARVRCPFCGTWIAPGEDPRPADPSAR
>STH2573 hypothetical protein, alanine-rich
MHYGGFGKIPAFSPFIGAQAGADDLFIAPGAAAYPAANATVYRVYEYAVP
VTTVATVPSAYPTTAGIPGIAAYPAGKGFGI
>STH1088 conserved hypothetical protein
MKRRYRPYLHAVVLTAILIGAQPRPAASQPVPAALAAVPPASQARLRAPG
QRSPGQPSRPAEPRPSPSPDPGQPIRAAEPGQSPPAAEPGQSRRTREPGQ
PPRAPEPGQPSRAAEPGQSPAPAPSSESQPVDPPEPEKPDLTPRLREIYE
RRARRFLTDWEGPPLEEDFLLERKTAQWALLHEEGKYRYVKAWAEARGVR
FVEARTRIWVKELKVTEDRARFYVAQTLQLGYQYPGEEAVNRFGVGSRHI
VELHRTADGRWLIGLEWYSDPLGDETEAPAELPARRSSGNRGNSGLHAVT
ALAAQRYDRRGAVEYADTYCGLAAGCGNDHKYNPKFRNYMGEGGDCANFV
SQALRYGGKLQMPLFTRADALIGHLRYAGKGDLAVRGDFGTVWRIAAGRP
EGFRSFLKPGDVLGYEEKGKMTHVALITGFDSRGYPVANSHTADRYRVPF
DLGWDRKTIYWFVAMRD
>STH1395 hypothetical protein
MDRLNIGAVKVTNVSGVVSTGFAYIRRSRTSGKGTVGGSAITGDCGIAPL
YFGVLVDSDAVDAPFWWTVNGWGPA
>STH288 hypothetical protein
MAHGGRTNEQSPGRRLLACPGTAQSNRMLRSNSSILANRRMRTRMSGGVR
GRGLTAPSYSIEPIRLSAPDTGAGTDRNHHQVTAASRRFRSLPCTALPAA
ASGLH
>STH2940 hypothetical protein
MGDYRRCLAHGRRLLKAGVLTPGERLRVLVLVERCRRATGSVRQICSARL
RLVSAGMLRAAGRPQQAVDEAFRALQEAEPLSPLQVRAQLFLSRVAGATG
RLVEALAFALAARVTAATCGMGWLERRASDLLALQLRAGGWTAVSELASA
LAAQGVDVYEYLDLADLISFARE
>STH3177 hypothetical protein
MHARIPAAGYLRPLFWSLALRIRGGDGLFDKLRFSVPGGTEHLIPFAHLS
RMQEAVELLGAEDFPLLMRLGAVDGLRLQPVRAGVLHDEALRASQRLVAH
QVPTLTFHSPSGAALGSLFGGQGEADVAASDSARVSLTPRGIRIALRQFP
PPVGFRSTPGLERGWFACFFASLRFGEDGICGLRTPEMGGSGAPVLLPEL
PKFPPVTRWHRAFVAGRPDVAEVRFAFTPAQDVFRDVLHALTAATQESLR
LKRALEIELV
>STH143 hypothetical protein
MARDTDATAGTETIVVPGPIRGGHCPPPREIVLVDARKVFDFCTQEDLLE
RCFTIPNLGTGATILSCRITQIQCEEVADRQPVNNGGDGRAIVSIQITLT
LQIQVLPAMGMQPVTVERTISFPKQVVLCAPEGTDVTCDVEGSCICTVQP
NAAQGEPNVCCTIQLCTVLTVTADVKLLVPTFGSVLPRRCPSAASPAGCP
PEVEPCEPPVRLAVRRRERDRDRDRDKDCGCGN
>STH551 hypothetical protein
MMVSARGGAPPVPGLAGRPCAPRRAYRAGAGGRMKGRWLVMNQSTRVRTV
VTLLLLIPLLLVRPGRPAAAQEADPFAYCAAVGTVDRPDHRWTGPPVPDA
VIEGLIRAAGLPEDAPRDPLRRSTFWRCMGGHVYACFVGANLPCQEKADT
RRIPRAAMWRFCRANPGADSIPAVVTGRATVYQWRCTGSRPTIVRQVDAP
DARGFLKRIWYRISPK
>STH1630 conserved hypothetical protein
MNGAQTTGALRALAAAAWRAAYNDGWRFSRTKAGLWTLLAQAAVAGLAAW
RVAAHPPAVPAGEGLWLGFAEGWLVGLLMALLRGRERLFTGPLVTLVHLS
PAPARTLVLLHVLRSLPGRAWFALLFCAALVPALGAGGVTGGPLAVRAAA
VWLSTAAGGVLGHLTGTGALVGVVRRWPAALAAIPALAMVLFLGLVGLTL
YVFTAGLWQLGAAPAGPPAPAGRPGPGLPAASAVLFALVGLVALAALVRP
AVRRSGTPTPDAHREAWLAVREALDRGSRPVRSRWPAAAGGPPGALQALA
WLLAVRNWFSLVRLGLWAAVLAMPFVLGPAPSRMEPFRAAALCIGMGLVA
ALFNYGEQAAALFSVDGERAAIGVLAGIRPAQLVLGKWLAALPLVAVAAL
TTLVWAAAAGCGPADVARLTAACGAIALAATTWLVGAAAFDAAPRPGGAA
PAGEVLAAAFEQVPTRPGGIVGLAGAAVLAAAGVWLYLREPRALAALAVP
ALAAALAGWRWVDRVWRRGALG
>STH521 hypothetical protein, alanine-rich
MILGGGWAKVKERLEGGSPSSRRAARRRHELPDAGLPDRPGRSGRGPAAS
DVERRLPAPAGAGRPPAPEVARTEAAGMEGFLTEGVRTEGFGTEGLQTEG
LRQEGFESEGVGSEGVPREGLRPTVAPSALRSSLQRSAGGDAGGTARAPT
GADAGPAVDPGEAAALLSTREGLARAVLLAEVLGKPRALRPYGR
>STH2939 hypothetical protein
MEKPTGKLCSAGGVAVRSRLFAPGRRLVAEVVSVYLDRAVLAFGDGVRIE
VAARAPLREGERVILRVERARNGAVVLRLAGDEAGLDRLV
>STH2721 hypothetical protein
MCILLPLGQNVQTYLRKYGTCGPDLPLHCPGCGGRMRRHGRYWRWVFTAH
QKAYIPIYRWWCPGCRKTCAILPDFLKPYARFITLVREAVVRGRVRRGLP
WSTLARRLSSPTVSWLSEKTLRRWLVRARALAGEWSQYLAERVLRFWPDT
DLDALTPRREGPDATLHFLLDVGDWYRRQMGRRPEEHGGVFAALNRLGEG
TASL
>STH2203 hypothetical protein, alanine-rich
MTQRERLQALCQAALAAAAGLQAMLGGAEIVLEDGPREVGRCGVGSRGVA
PRGDSLRGASLAEAVLHLARSAAAHLPGRHAASPAEEAGHPPAQSAHRLV
ETVHLPPKAARQPVEAAPPPSEAAHRPGEAAHLPSEAAHRPGEAAHLPSE
AAHRPVVAAHPPAKAAHPPAEGAHPPAEAAPAGQKGPALTLAIYPAPELA
PLAAALGLDGEPLAPDGDHAADATTRELAWLPLLPGWSTYRTFPGSRLLA
VAGYRRAALVAFEAAGRPAPMGRPPESHWQAQMWAPLGCDALAAALAPTE
RGPLFMEAPARLRDAIRTGRVRRALLGGADLAWALEQPGARAVRPLTGPG
LDWMPVWGLFSRGVRPNLPDPPPELDAHLVLVPAEAVPGLTGWLEGAVVT
>STH2268 hypothetical protein
MSSMRRTILLLVVSALLVGCSARANLTQLPDAVTFTPITEATDFNIDAEL
VLQPHPELPNSWQGSMTFRFVSLVDEDLHNVRVAIFYPEQMVEMLLIPEE
TLVLPPSGRRFDVTPERPEWRFTHTVAAFDWEQLGEVKAAILNPIRLRIV
WDGAERFLEIPPSEIAVREPNG
>STH2576 hypothetical protein
MNSMSRVITGLGALVIVLGVIGAVWAARATPPGAIRVGLTTAAGLGGAFA
GAVLLGLGAIIDRLDLLTGRGAQRAAEPAPPPVTPVTCPSCGERQDARAD
RPHHDCVKCGRPVIRDWVYLGRHA
>STH1445 hypothetical protein
MGRAAWIVVGWLMIPLGIGLAVVQAVPGEPPGVAGIAPPAAEAQARAAEA
QAPATAVLAGEGADAGPSGVELPEEWRRAIDQVVIPREVLDRLPELAAIQ
QKLMRVDGVLMSWTDHRNGKVFVGVRSLEARERVEAAIAREGIPPEWVEI
FGGAGLLSEERPLEGCQLADPPRPEPGPQYLEVTPAVVRPGQGISLVIRG
VPEDQLMRGVEAYLECWDGEGWSPRFYLITAYGGGGPHSVLYGGPLVIVD
LGLFGTGPERHLVPDQLEPGWYRIRKPLGRTRFVPAEVVGYLQVR
>STH313 hypothetical protein
MLHEPWQIWLRPSCFNPHPALGPAATGALLPLQGVHSLVSILTQPTPRAT
GRASDGGREVLGVSILARPWIRVLPAVTVEASRVTGFQEPGATPRVELNC
PLIEHVSILTRTLGRVLRATASYQSEQKWPILTRSRGRVLGNLIRFQTIL
SLTRMTPQVTLPLFQPSHSLAAGCYGTRGKPPGRSD
>STH1573 hypothetical protein
MVPVGRVFARGSGWLPRTRVPAGPGRAVGRVGPLVIWLVVGGVIGIVLIA
VPQPLNVAPGFGLLLGVRMHTSALLEQLLGRVEGRRREPVRPDPDTLLER
LARARLAPRRKDRIRRKAQGGR
>STH284 transposase-like protein
MEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRLNLVEILNRHLR
IKQKTYAHTPVDKVVEALVAILGNCRYMKDLNFDPEPLVADPAVAQAWGQ
ERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQEVAAVAGPDRSG
MVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIAAAFLSGKQQRF
AIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEWVEACLAQQKAR
VRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQRLRQYRQENRT
NLAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGSNVAYKHLFDAV
PAENWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRWDADGREVRSVI
LTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGTPRLRKYEANAA
FTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAARCRATIRCLGD
TLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSRET
>STH2169 hypothetical protein
MRRLVRTWDRLVRRFLGIRPARPDGVLGYAVRRYRGRGIAVEGGASLRRG
DLVLELHLDSRRLAEKTAGQSPHRRILWLRRTLLADLRALARAVESDPAL
AGAAGLWGMTVLHRGVDALGFAVAEPAAGFVGRLATWYMRGLLAAYHPEA
TDRVQHRAEALVAREIFLPMDRLRALIGPAPADGEEAFQTVGPARPGRSS
GPGGPTASSEPPT
>STH2481 hypothetical protein
MQNAEVRELVQRSLERYAGDERVLDAFWQWASNPTPPSAGYALEENAGFI
ATNLMLRYGEPAVDLVMSVNEECRRLAGRASIQELQRTIVTVLNGPVGEQ
LRQGIVRRLSQPTVARRALTAWLVNRARWRRMPGREPVVAGEFDWSLGAD
LEGSGADCQSAVLRALYGPEVAQVDLAREAMAVGVVNRLFYRNVAGRMEA
KLRPGPRLPVSALVY
>STH2276 hypothetical protein
MPLLLRRNVGIDSPLQEPLNVLLTEVPCIGRNLLRHLIDIGFDLLHHGDQ
VRSICRLVCYGSSDDNLRIGIDDGLGVVPLDELLIRAFHDPGIGIRKVAL
RCWCGFCLRPLPPTTALGPLGSRLLGRLMFVSAGPQAHFGFQSSLGFLNL
GQPPLTEAEFFGQFIPTTMWAEAVILGLVSLLGFLEQLCDLRLQALFFLH
HPAVSSSPCAWRRWP
>STH557 conserved hypothetical protein
MQDYARRCETGNGTVQNALRSLIEAGAVQVEARGHKGTFLVQADYRKLRE
IAGLASLLGLMPLPYTRRYEGLASGLYEAVRQSDLPFSIGYMRGARNRIA
ALHRGQADFAVLSRLSAELAVAEGGVVIVMGLGPQTFVSQHGILLARPEY
TGITDGMRVGIDPQTLDQAWLTAAECRGRQVELVEIPYSHLLQRLREGQI
DAAIWNLDELSSGTMEIYSRPLQSPEARRIAESSSEAVLVIDANRPDLER
LLPEIIDPALVRRVQDEVLEGKRIPSY
>STH2828 glycine-rich cell wall protein
MGRAGARPTVASAPRRPPAPDALRINALPLVNAKGKPVRWPVGGGLRTVR
ARWAAMAFCALLLSGCMAQQQPQQSPEPSSEQVRKAVRESLSEPEMKDYI
RLQVKEQAGVEILEMALDTQEGKDALTDAVKEVMGSPTGEQLIQQKLAEM
LEDPIVQQQLQQAIRETLQDILSKGTQDSRRGGGQGGGQGGGQGGGQGGG
QSGGQQGGGGGGQ
>STH3225 hypothetical protein
MFSGFCGLVSTCRRDAAGLSVWPLRPSGEMRSASKPLSLVRSRFSWRRNV
RWAEAVGCTFRLESDAWITMFEFTGGRVKSRRPSRTGTVLSSSTCFSATG
AASPRHRESD
>STH335 hypothetical protein
MIAHNPAYSCLHYSSPPWPAHQTRPDKCLCTTSSCRSKPAKNTKVAYPPE
RILRPWIWEIGDRYLLLTNDFLFGERLPFFRRRPAIKLLPSLNGNFIIPL
VVK
>STH2379 hypothetical protein
MQHAVLQSRCGWRSMAMHEDSADQHFHGEMTVARHRVLLVVAAILVGCSR
PTDTVTSTPSPRPSSLRRSRRHLQTPLRIPTLCAQTLIWS
>STH2756 conserved hypothetical protein
MIRDLLLRIGDGVVDYHLLLDEQEIALNPLLGGRVAIRFTGERRCVYCGR
RASKLFNNGSCWPCFRRLAINDLCQVKPTLCHYETCREPEWGDAHCMIPT
YVYLARSSDVKVGITRSLPGRWLDQGAVEAVPIARVPNRKMAGELEAFLT
QYVADKTNWRRMLKGEVADADLLAERSRLLELIPAEFRPYVLPDEEVRSF
TYPLKAPPPKLASCDLEKGDVEGTLLGMKGSYLVLDTGVLNVPKFAGYVV
TFEAG
>STH2069 hypothetical protein
MGRRPDAGWSCALRRTGEGGGWTGPQDDLGEERFLALQLQRSPHGLHRGP
GREGEGEDDPLRVRGQVAEAAVVPSCTCPVRRAEPGSVTGPAARPSPVIR
ARSAAMATVKCRRMLPPP
>STH393 conserved domain protein
MVEFALLAPILLYLVLCIPVFGMFTHSWMVVSGAARAGARAASLLRVQGS
REAVARQAADQNMYLRRSDGDLVLFDPARDVQVRVQGGTVTVQVTYHQPS
YLPLLSALLGGSGEAGGDTVPITAAATFVIEQGGSLK
>STH197 hypothetical protein
MEAVRDRVVRSAPGRTGGRISVAVPAASTLRSWEQRVYVPPARNKGRRNK
GRHRLYTEADPIPLRPGGRRVVAEAQKAARLRSRTVGRLPTRAVGPLRAG
>STH2288 transposase-like protein
MKKHQSEPNDTWWDLPLLNHHVEVRLMQSHGTTPNIAAQAIKSTTRHGFL
VALGWVAQRLNLVEILNRHLRIKQKTYAHTPVDKVVEALVAILGNCRYMK
DLNFDPEPLVADPAVAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALA
EIQAPLLQQEVAAVAGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQ
GKLARGYQIAAAFLSGKQQRFAIDGLLKSGKANSRSGACLLELIPRIEAR
IGRPLRRVEWVEACLAQQKARVRQLYQQLQTVSGKGSARRKQKLQREFQE
EVQHLREVNQRLRQYRQENRTNLAPLRILLRADSAFGTPEVIQRLLELGY
EFTIKSYSGSNVAYKHLFDAVPAENWVEVEKNRFASEAVTVPGPTLLAPY
PVRLVAMRRWDADGREVRSVILTTLQPEELTTTEVVKLYHGRQTIEAGFQ
EWKGTFHFGTPRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEA
GSRLLVRVAARCRATIRCLGDTLRLVFSRGTPLAGAEITLNRATPYPYAL
LTPRMSSCSRET
>STH2523 conserved domain protein, histidine-rich
MSDPFFGGGAFRLFDIMFVLVPILVIGGFVFVFGSMIYTAVSNIQKPLLT
RRARVVAKRVRHSTHHHHHGDHHHHSHHTAYYVTFEFEDGRREEFSVRGD
QYGLLVEGDEGTLQSQGTWFKGFYRD
>STH1962 hypothetical protein
MFSGGTAVRPRACCGGQIGASCSVYDEVDGVRLTDLPPPEVEDPGNTQWN
RVPCRIRLEPVRSTPHWEDARPELAGPGLGENAWEAYNAARQVFAPAPQG
GEPAGRLARLEPGPGVALVSGMRPAHAADLAAGLRA
>STH812 hypothetical protein
MQSPEGLHKSGENFVLATKRYLEATLVIDVLWTVYDGNGMTVLRGLDGQP
HTFDALARLKEGEKRPIYIEAKKRTSANINAEYDDFIAKSFSCYMAERWR
PDWNPYFMFVTDHPFKMKDYARLLEADYLLTFLGTFEKYGIANAALEHVV
DFKDRVWLVIWSNRQELMCVTNPALYQKLVVDGGG
>STH2384 hypothetical protein
MEEILRGYRDRAYALLAENRAQSREGIELLVRERLPHVAAAMQAGGKPLT
TAQLDALYHELLEISFTMFSLGYTMAHTVDQTAGTRA
>STH682 hypothetical protein
MADKKEQKKGGFLTALKGVLYGMASHGNVTAALRTRMHMEHLFMFITLGD
MLGIPVLPPYYSLRILPYAVPNIESWKQRVFRERDFTDAIY
>STH2649 conserved hypothetical protein
MPVEERLIWGMRLARMLSACVEVCVALMLLRMADPKAMLRLNALAGLVGP
AVFIAVSALGLAASLGRLEPGRLLVVLLGIALVVWGTR
>STH479 hypothetical protein
MTLHEVAAELARRMNCTVEPAHGDAQSVTVRGKGYHFVVAGFFGGWQATL
YLPDQDPVTFYGEAVEALEIRLKGRLSGRPVD
>STH281 hypothetical protein
MGNDPVERHIRTAFDYVVRVYEELHFLFQDLRTALQERDRNLQPLVGRIY
GFQSGMDKPRHWLARYVSECWVDDRHGLMEGRRAAYLSATVSLLTPDDQP
IEPLFSYGVVAPMNHPQAMDFCHQGVWYATANHQDLFSYRVDDNEVSGHS
LPMNTEIWFRSKLPDGSYAWPKRGVMVIQPLTAIRDTNSVYEIAGKMVSL
WNKYADQVARG
>STH59 hypothetical protein
MRRSLRKIDSQEVQQYAQSEPLFTVTSLDEARQRLVDMGQDPARYCFYTW
MRQGVRYYRVWEDQRLKEGIVKPAPFGQEAARLKRADAEKLYTVAKRFAG
DRVEFAQRDGVLWVIDGETGKAKVGISVPVVGGGLIFYQGEWNEHQAAHF
VNALLVRGQRPDAAFVEAEALAGVPS
>STH2644 conserved hypothetical protein
MQGRVELLGRMREFAWNREGWFVPLEEALRGLTAREAAWRPLVGGLTIWQ
ILNHLNYWNTYMLRRITGVSPDTPVLENEETFGSQGDPNDEDGWHALVAQ
TRQIAEELRAALADLSESDLDRPLPNMGTVADSLAAWVMHDAYHTGQIIL
IRKLQGAWPPEG
>STH2247 hypothetical protein
MIHDRGRDIVVVTDRPNYPGEENRPVGRPLCSRSHGKVWVVQGLDRPSDM
VRWQYYEAETTKQDESGFGGFITPSAVMAPPHGSNPSFVIGTDGFEGGRA
LRLALDRDNDYRPYRVWTVDSTAGFAGNFTSDGTHAYWLDTRGRLWGANL
HDGLEPAGWKGHMVDVPALIGARNAFTNTEPAVEVRQTPDGPKTHLYVTL
RNYTDTGAGLRDGPTGSDGAVVALGPGGELKWYRKFGPADRLGERMASLN
TAPLALISRGALIFGDVSGHIYSYALDTGQPMGGDGRPALIQPEGRAPTD
RLFLLKDDEQPNTGPYNFSQVSGVGVDPAFAYGLLLVGVNYTDASGASGR
LVAYRAGEAYDLRWLDEPGPLELEPGKTVALQPRLQLELTARPLAALCPG
PLSVEWFLTDESGKLVRSLGTAPLPGDLAPQQPHPVPLTVALTETDPAEG
QIIRVIDLPSVYALSAPHTANPKVAAARGLAAALGLPAEKSCAGAAAEVV
EREGEPGPEGGLANNVLIVPYRNPQPPDEPERPGEEGGGGKMIIDDPWVA
ELAVPEVAEAGRPFEVGIYLGYQNNLGQDEIRVSLRLWARPAGGASPPPD
VWQMVTIDKVPCCSLKPGTIPGLSAGTWEIVAEIDYPEDTRPENNRVIRR
VEVLSSKPVEVGGPEGGAITD
>STH395 pilus subunit protein
MSRIAKGFRRLVVRQEGQGMTEYGLIIALIAVVLITTLTGLNKTLDKTFN
KVTTQLNNTVNKK
>STH2050 conserved domain protein
MAPKWLRGRPALTVDLVEPLQEGPPGRIAGGGMGKRRALAGPVLPLPVFM
PMYTSVPAKGSGRYTLQVYTKVTVSGPEPRKYRSTSRVTSALFKPIPTSR
RPRSAPAAPGAVAREVPQVRRRIAIASLLAVALAAALFAGRSFAGLPGRP
GVRWEARHIRIDEARLAAIDDVMDLIEPLWWTVNISSPEAYEASLAPFTR
GQRLVYALQMYRLEVDNGGHGQFYGNSGGIVWRDALEGFELLGLRELADN
LRASAALLGGDPPLDRGEREALLSELRPDFAEITEQFWKLEAQHDVDGAI
LAYARAHPEEFLFDGVVRVPTSDTVILRQFIIWFFP
>STH2469 hypothetical protein
MSKKTRVVTSTACPVDVSRPVVSVSCPKTVPTATAGVTDQSDKKGFFDKK
GFFPFSFGFGGKSVPGQVFNNPAVDLAPFSKLASLAAFSPAAALAAKSPV
LAIFNPAINPAAAFLQFNPGAAPTLALTGKGKPFGKFGI
>STH406 hypothetical protein
MFWLQVKQFFVIALATVNLVLLIWALLAHRQRRVLPAGYYRLLPASAAIG
LVQVSIGMFFLIQGRIPYWQHVLYGALVGVGAVLQFVLLPGTPAGQQYRN
RPLVHAAVALFVALVGIRSWMTG
>STH2809 conserved hypothetical protein
MANVLERGIAVWRRHWPGLFLACLAVAVVPGLTVALVGALLGLLPVSLAD
ALVLLLPVQFAGQGADVTLFFGQKGSWVLLVLYGVLVLLGSAGILGILWT
VVHEGRAPKASDFLGGIRRYAGRLVGVRVLCGLVLVAAALAAGLVYVLAG
SVLAGWLVPIAGLAALLAGISAVLYADFALVVEEGTATEAMGDSVGVLTE
RWREAVGAALMFILISVAGEAAGWLVARIIPGLPGLLLSLLPVALAESIL
LAYLAVRYVEHVHRGIYGRDGRFAIDERKAG
>STH2580 conserved domain protein
MRKRTRTLSATLALGLAASLSVGSVALAGTEVRVPVDGALDAYLAITSVV
AAYPEEFYYEAEAPVTVTFHGDNLSWEDIEYLGEGYVEDGFVYLPDSGDT
VRFAVKNYTYYEEAQVHDELTEEPMDEVFYVSGNYAVLTTPGYYSVVAAP
LAVAPTRAIIQVRSSSAESGHTGQAASEPAAESPPESVVASPTTARVLVD
GKEVAFEAYTIDGYNYFKLRDLAMALNVSEKQFQVTWDGEFNAINLVSGH
PYTAVGGELTVCTDPEARQAVPTASKVFVNGRQVDLTAYNIGGYNYFKLR
DVAKAMDFGVTWHQATSTIGIDTTSGYSE
>STH560 conserved hypothetical protein
MGLSYPILPLRPDPFSSVRPAANEAGRDVSPFGHPGTRLWYRPTLPAHNG
ADAEDRLTLFRGGLYNIQETVDWKGWIGMIRFAVGGQVDKKKVADLVRKV
GGSAVQVTEHSDLEAAMQVKNGKADYYIGACHTGGGGLAMAIAMLGKARC
LTVSMPGRPPKEEEIRKAVADGVKAFAFTADHAEQAVSMIVRAILERGQA
>STH2494 hypothetical protein
MWPRTVDWEELRFGVEIEFVGGNPAAVELLPGWVMAFDELQVDDTGEDSG
AELKPPPLLWRDREQMRVMLDRLKATGARANWSCGLHVHVGLEPWGQDIV
LPLVDEALAHQDALRGLLRTAEDRLIHCPPVVPAMRDRYLAKPERASLVR
RGRPQSHRCGINAAAWFDFGTVEIRYPNGSLEYDEVVNTVELCLRFVAAV
GAGRLVPPAAAGATGGGAAASGPAASAADAAALAALLGAPADGYPLPRPA
PRWHRERAWLEDALVPVLEPLVRSRVPEAEILAIRPVPGGFRVTAEEPGD
RRMAFLTRPGPDGWSLEPLESERG
>STH738 hypothetical protein
MRRPIRQRPIRPEYSPGTMSRELDWLVDRWQSSRPEERLQIVLAKIRAEE
RLEEERHEELAPVAR
>STH717 hypothetical protein, glycine/proline-rich
MLHFLQVRISGGRTKGMKRRVLRQLQSLLAVLLSVAGLVVAAMPAFAREP
ETSAAWLLAMEAEELLWEKAGGPECRNTTYQVTIPQLTEDRVYPLPIECP
DLQNLAPGDSGPVGTYRVINGGTAALRLQLRGRVEGVLFSGSVPVMLTIQ
DDTGRTYAYDAAYHDMPDLPAGETRTITVTYAWPFTAENVYQGELGKIFI
DLLAKGTGDPGGGGPGGGGGGGGGGGGGGGSPPPDRPPTPETESVAPASG
RIIVRVLGDPAGNGDLMPLEGAEVQLADLSAATDPLGQVVFDGLPFGAYT
ALATAVNPLTGADPLSGTGAAQIIAEQPEATITIVLTWPPRPLPPPQEAQ
PEPETPTGSLTVRVVDASSRNGGREVPIAGAIVIVGDRAGYTDRAGELHL
TELPLGDYTVYAESGDPTDPEGPIRSGQTRVRLTEEEPHRVVVIRLMWEQ
TEPVFDVAPGSIAGRICAPRTGAARVWATRESGETVGTAVAATGRIGIWL
DYTLSDLAPGVWTLTLQNPGDRPVSQQVIVQPGQVTLARDFTLACTGDGL
TPPPHLGYYVAGGLLLASGWLLRRMGRKAMA
>STH1818 conserved domain protein
MMISTPTQRWIAIGLLAVALTGCSFLSKAPSEEPVAPVQEPSDPTEPSEQ
PTPGSGDGEGPAGDERPPDDEPAAEEVTMLDCLRRFGLSELAPGVDELIR
GLPDDGMWQCVARTARFDRNYRAWVAHEVLVSGVDHIDASLPGYRAFVRT
TELHFAEPQTLPVPEGWTLQDALPSPSGRYLAARLADGGVGWWAADGRAQ
ERYDVDGYDLVWHPEEDQLAFISDGRRLHLVRPAGPAVHQVYEAPAEAPV
RFPYWALETWYAGYNESYPPGTLLVLADPEGEPEGLALRPDTGRWGRFPA
KRIFDPVARAVRFPSWLTQPWSTLFGEYLHLVPDGGWLAYPNPGGSPYTF
YQVPDGVEPVTLTWSPNARFFALVERRDGQLIAQVLRGSHDYTGRQYAVP
LENAHIAVWDDGVTVFTVTGATVRAKNHLTGTEHAWQVEGEVKAIRLGGF
RLYIVLEDRIIVVPFSEAASLSEAPRAEHPADAA
>STH1380 N-acetylmuramoyl-L-alanine amidase-like protein
MTWNAPIMGRSLARPEQMTAYALRDNPFAPDLAGLYLGLASRLGVRGDVA
YAQALVETNGFRMDGWAERGRYNFGGLGVTRPGDPGISFRSPEEGVLAHL
QALSRQAGGGPLPPDMPDLTPFLSPEDRGRIRRVGDLRRPGWPEDEEYGE
AVARTLAAILLEPAEGEPYRIEQDLLPPGSPARPGARDGTGRWQGGGGIV
IYRSASPHLDGPGMRALLERPVAGEVRSFHFVIDGRSIRQLVPLGEIAYH
TDGRNHRDIGVMICERGWGTSEWEEGYRRLVWLAARLMVVLGIGIEGLSG
GFWWNPVRHPYDPTHLGWKPEDGPATGLFHWNRLVADVAAARPAVEKLFG
LHPAEPQVSPAAQGRNPPAGDRPGTATQGRNLSIGHTGAAGGGKPPRGNW
VQAARTRSARYSSRRR
>STH464 hypothetical protein
MLVSWRVGEGRIFWSADTAWLSNARIGEAQNLDLALQILLPPDGGQVAFD
EYHHGFVAPTHWWQVLRGSLQAFGLMLAAAVALLFWSYGARFGSPLPAPE
RPPRAAVEYVHSMSRLYRRAGARAVVLRALHRSLRTELSRLTGGAEGLSH
GEIAGRAAARCGVPAGEIERLLERTADLNHIPADAELIDLARQAEKLRRR
IEHATYRDG
>STH1331 hypothetical protein
MRPGRHEIWLTFVRRTGTSEERVMPGRRDVRSVMTGQELAHAVAESIVLT
VFGGEIQVVDQLPNVKLRLLDRRADLHLPGAPPLDQDGQVWLRVTVRRSE
PGLLERLFGPRVHTG
>STH1552 conserved hypothetical protein
MAQIRDYRPEDAPTLQDLLRQIWGQDTWTVQYYRFGASAPTAQGGFLRTL
VADEGGTLVGFGSAWTNPFHPHALYVGINVHPRFGRRGLGTRLLMALAQR
SPGRLPLQGSTWEHSESGMRFARRHGFTEVRRTWEPELPLGDPQEDLHAD
CESRCAALGYAIVPLADLQAASDGLRQLAPLLAEVYTATHGPNPPRTMDA
DGWTDLLRRDPPDPEASFIALRHGRPVAMSAAHPDPDAGLILSWRGVAAG
HRAHERDLILALTRRQVLAAHRRGLPALKGEFDSTDPWAMIQMEAFPFRP
APCWVTFRRAP
>STH865 conserved hypothetical protein
MSKPMMPYGRPGWPVMPQMTPGMMPGMMPGMMPGMMPGMMPGMMPGITPG
MVPGMTPGTTPGMMPPMTSPQMMMPPGAPPELVYGGQVMIPQEQSYVENI
LRMNRGKTATVYATYDNNPEWAARVFRGQVENAARDHVVLSDPQTGMRYV
ILMVNIDYLTFDEPLRYPEEIYRPPGTF
>STH2436 conserved hypothetical protein
MIAPQGCHGGQVSGMTRRTLFWLGPLVFLVHDLEEVFLARAWVEQNLLLI
AGTPFEPVVEAMGYEPGKFGLVVALATVVYGVIAWSAARACQPGLSLNLY
VATVLTLFVNVITHVGQALLLRMYTPGVITAVLVFLPYTVAAFRMLRAQR
LLTATTWKTSPLMGIGMLAALFGLMIAL
>STH1575 conserved hypothetical protein
MARRIRLIDICHARSGDKGDASDISLFANDDAAWEIIRKHVTKERVAEYF
SPVATGPVERWEVPNVQALKFVVHGALGGGAPRSLRSDNLGKTFAAALLR
MEITVEE
>STH2563 conserved hypothetical protein
MQAITIGTAAGIDEIRQRLDDEFRFLREEGLQVRIGQSHRGSFAFLDCAV
DGAGRSSQADALLRHSVASALSDVIVEKWESDLIRKIIRGSYSYFSRDEQ
DLIADYTGRTLNGGSDSPNGYKVNRKSLILHRLRDYFDTADELVLEGFVN
FRLRDYIEELEDAVDRAVDEFLMEREHREFIRLLRHFVEIQEPRIEHVHV
LLGPGHAFRLLDDDGGAIRSEALEEFVVEMVESEVSYEDLLISALITLAP
RSLTVHLADRPPGRDEALETILGVFGDRTVRCEGCPVCSQATAAACEPSQ
ATLHH
>STH2952 hypothetical protein, alanine-rich
MTPMSHFYPPPMRGPAWYPPAGFGAPAWYPYTAAYALYPPAAMAPAGWYA
VADRPPAPDPAPAEWMALVEPALNSVIPAIHGLLEISGLLLRHLPAAFDM
TGSGPEEPARDDPLTMRTGLGHIGLFGPAEPDVIDIEAARSDAALPPPEG
NGRDGAGAADLSGDQTEQA
>STH3105 hypothetical protein, glycine-rich
MAEGAAGVTGAIAGAALGSAAGPLGTVAGAIAGGLGGAAMANGAGNTTGA
ENQNGRGRNRAGDQAGQNPTR
>STH2527 hypothetical protein
MSRYADITELIAEQAADPTEQKALEAVARKVARELERPVPYRPEFRDALR
EELMRTARRRLRPWYRRPAVVGPGLAAVAATVALAVGLQLAQPPAGQGPG
QVAQHEPAPSVAPPLSTDPAPPGAQGGADTPYLVALPADLPEVRLADERT
TEESLLAPAASPVIASVQLMRLTARPGEAEFRTMAARLGFRGESRRTDRG
WVVADEDRTLTMTMDGTVQYADLTEPDADAPRVDEQAAGQAAQRFLDQAA
LPVHSQPDITAREDGFMVVYTEHVEGRPVVNARTEIAVNQAGTVVRAKAY
VPSGVTIHATYTEFVSERQAVEMAESRGGSFRRAELVWVRSVGDGTVYLQ
PAWRVLGTNAQGTPVARYVAALTQQDGAGE
>STH1739 conserved hypothetical protein
MNDPRALPWWAWVLVGLILGAQGVLLFLDARRRGARAWFWGLIGLIHFPM
PTLLYWLLVVRRGRPH
>STH671 conserved domain protein
MPPRPYTPWQRNFKVSRMSHLNWSTRAKECVARQAASSKPWSFCRPPSGS
VVIETPVGSDVRVRAIKVALEEAARAAGIRALSVSDSGDFFADPLRYGMV
GALVYALASLGLFGVAMPIIAMRQQGILRLMRTTPVTRLTFVLAQVPARL
ILGMALTLCALLAAWALWDVTLPQLAAALGTSVLGFWMSAAFGYLVGGLG
GRHGWW
>STH2946 conserved hypothetical protein
MQVQLIRSPKPTADGVSVEMIHSCQDMRVLLIQLSPGGEVPLATSSSSVS
LQVVSGRCELVAGCEWVAAAAGTIWFYPPGEPFGVRATGEPATVLATYAP
RP
>STH2240 hypothetical protein
MILGVTSWLSRIAPTTQGRKIGPQVARSALVRMERFGWCKVWIVKMDIQA
GGSTKLQPPST
>STH2374 hypothetical protein, glycine-rich
MSVGLPGVPTRLRGGGPAARPGGRAGGGRTHSGAGSAGGAAPENGQGGLA
AWVKDRAAGLTALFTLSVPVLLAVGGAALVSGALYELVVGGRLFNWVEAV
IGSVIGGISAGVGGWLPGSGSQPPTSRRVERRVWSATWSTPRSAPSAARW
APLPRVASPPACSWV
>STH465 hypothetical protein
MDRLVVWNVDGLDPDEVDALEAWVFAGGTALVGGPLSGFRGTFQPAAEGT
ARPAAPHPATLGIRAVSVGSGRFGGARGSG
>STH767 conserved hypothetical protein
MKRIVSVSLGSSRRNSVATVSVLGEEVTVERIGTDGSIEQAIALIRELDG
KVDAFGMGGIDLYVYAGGRRYAFRDAKRIAAAAVRTPIVDGSGLKNTLER
RAVYWLDRNVMPLRGKRVLMVASVDRFGMAEALVDVGADIVFGDMMFALG
IPIPIRSFRTIHVLARILLPIITQLPFTWLYPTGAKQETHQPRFTEYYQW
ADLIAGDWLYIRRYAPDRLEGKTILTNTVTPADIEWMKQAGVATLVTTTP
NLGGRSFGTNLMEGLIVAAAKKRADEMSPADYEEWLDRIGFTPRVERPAA
RGSAAGD
>STH995 hypothetical protein
MVPVRITPVYRDVLLSLEAAELGSPLWPAFYHMAYLPHRAYFDGLADTYG
PLLMGQGGLPGIVTRLAPALRRALEPAPGYRMEERVRSVMDRIQPLLPGR
MPHVWLATLFFAAPAATIAVGGRPAIALGMERFSPAPPPGPRYWYPPEAV
EEMVPHEAAHVARMEALGLPPSPQHLTLLDMVMLEGTALLFTDLLLGRET
LATFMDPDRLAWHRAHDAEAVAAAAREFHAGGLTVFARYFAADAPVSGYW
VGYSLCRRYLDRHGAGAMQEMLCLPSEEILRRLG
>STH1720 small acid-soluble spore protein
MAQFTPTRRVLPHELLDRLKWEVAEQAGLTEQIVQHGWPQMSSRACGHIG
GRIGGRMVKVMLKYAEQALAEGSATLK
>STH2530 hypothetical protein
MRHSLPRHRAILPVEVRTMPRSYERVLAVLPYIGAIRLPVRPWVIPEWLY
TVPAGLVMAGLLWLGARGRSPFVSYHARTGLLWALQANGLLTALSLLAEL
WYRAWYHTGVPLWNQVWHLNAELFRWAAVLTTLLTGLAMWRAAKGQTGDP
LGLPSPSFFRGHDVHLEGSEEGRY
>STH2085 conserved hypothetical protein
MAVGRAQGLSAGAASWCKSTLGVWGRSDSGIVPLRRGGLAMRDVHSSGVP
QALAVPFGLVEDAFRRLEKRLEGMSEEEFHFKAPDNVNSTAMLLRHLVLV
DMTYLHLIMGDSERVPEIRGKYGPFQDENGRIPEAQGATLSGLLGEYREL
MDTARRYISGLSDEDAERAVVVPWWPEPATVRYMLWHMAGHSCHHQGQIA
RLRAAYRQR
>STH2727 hypothetical protein
MTVHFLPSQVKVPRHILLHDQLTPAAKVLWAGLQLRPKNQEHLAQLCGLS
QASIKPGQTQLERFSLLNPGPFPRADWAFLPVDLLRTKPVHPQAKITYAA
IQLLPTFADGVAEATTEQLTQLTGRHPATVRRALRSLVDAGWLQVRRSGR
TNVLQILVLNPQLEAQKSMIAQINRRLERAPYLGEAIMREWLTLTVDRDN
FEDDASPGFLVNPYTGEELKIDRLYPPDVGFEFNGPQHYGPTDLYPNEEQ
ARRQMGRDLIKQAICTQRGILLVTIHAEDLSLEGMLKRLPDRLPLRSLDH
KELVIAHLQSVSRAYRQKVPLPPPVPKQACPPGYATGGNAQQYASR
>STH1570 conserved domain protein
MSEREGIAMAITPASGLAVARLTELRPARSHRHQDGKAMEGFCLVTGDAN
FSRVTLFVENGRTLTDRSGQGGFYYEGWLVGPAGVVSLGAFNVGPDGRGS
ATRVVAASHLRPGRAELVRVTVEPFGGTADGGIAVLEGRLIWLDAAAEPR
PVTGMATTVSQQPTAGAATAVAQQPATGAPTIVAQQPTAGMATPWAQPAA
GPSAATGRGPVAPAWSSGSTAGSSSASGTGQPGRTAGPFISDQPAGTTAS
AVADEPAGTHVSVVADPPAGTTASADVVPSPGTTTLTDQPDGTSVGNAPG
LPATESVPARAEPFAAEGESGLRGSPGDPDTWETAPIVADSGSAPAQSAA
GAVADVPAQEGVHASGDQTHSTGATGTTDQDQAPADASAQEEATSQSQPA
PRYVNPLAVQVQLVQRHPMTPRATGTATLNLRRGHLTLSLRGLPSPTALG
RDGKSGRPFNAYRVWLVNQKTQMRTPAGYCERVWGENFRFEADGLPLNRS
DTILITAEDRSVATSTGHPAPQVLIGTYDPRL
>STH1291 hypothetical protein
MHEVRVIHAKLELEGLFEFLDDVLTHVATQTTQHERLRFWLRESTRSAYD
APSHPAVPFLSKPPADTVVLLGYVRSPEHLRWIHEQRLYNMRTGGRRGSV
LPGSRVLSAELVVLYGPHMRTAEMWRVAGTPLMLSEEEVRELHYPTPRGR
YVCLPLEPLPSVELLQKMSSDHVRRVKERLSPTSYPGEPVAVTWFELLQ
>STH2620 hypothetical protein
MMAEILILLGLLSIVLLRPTSVEIRRRELQHGYDVFGSALLAAPGNPGRE
PSGPHRTPAGTAGPSAA
>STH263 hypothetical protein
METDPEYRATWQRVRLVEHALEVLTPVERQLIQAIYIDQTHNLVGAALLI
GYSRAQTARIRQRALARLAMAMGLI
>STH1943 conserved hypothetical protein
MIQTPVVVNANTAQVLVVSEIPLTPPAFKIDHIDKLVEVDDCVAACDKVI
INGRLIKNITYKTAKEWDHKGGLNRVCGDVRHCTVEIPFHLFIDVPGSRD
GDDCEIEDAIVAGEFDKLVDRNRDGTFSKLLEKSVIKVRAKVVRRRWLKV
NAQDVTPRRLRCPETSEVVTVPGLDIPTQDKGDKPFAPGDEW
>STH1784 hypothetical protein
MKLGMELVARHPLPLSLGTFFETWIATAAGSARRSLGRDEYHLRIHVDTG
ARLAATGRTHVCQVDLEAAGLGRNGARPALLTPVDVPDGSPVIWGIRTNR
FLDVADLIASAGARLLREGSEPAPFALDDPRVRLECVAPGRYIVDITRLL
AD
>STH65 hypothetical protein
MATVLIATSVTVALRCPRCGRLELSELSRFALGRAGSQRVECECGHHLLT
VGVRPGQVWMQVPCFLCEGTHFRYYSPGQFWQPGLKQIACAETDLQLGVL
GDEGAVVEYVRPGLSDLERFMEDDAFDGFFDDPVVMYQVMNQVQELGAQG
LLRCRCGSRDVGVDLFPDRLELYCVACGRQRTVPAATEQDLDALMRVAYL
EIGGDASSRCRGHKK
>STH765 hypothetical protein
MKEVLQWPNWAILATFIAVVISLVLNIVLYQRVQVLYSKFGAQALPFRNR
DELEAAFQAGTIDRAEYEQLKRKIS
>STH2932 small acid-soluble spore protein
MANNNNNDLVVSSARSALEQLKYETAAELGIPNYAQQYKGDLPSRVNGSV
GGYMVKKMIALAEQQLNGGTVR
>STH2003 conserved hypothetical protein
MRPVQIDLCTLTEDLLPLYADDLLSPATRQLLEQHAAACPACRERLRAAG
IQHPPPAPTDLPRVERPARAFFRRLSRLLYAGLAAVVALLVATGSLGYVA
GRQSLHDRQRIPQRVAGAEELARQAIPGWDRATAHGLVMDVGVTERIPRT
DAAITVEKAWFSSRQVYVLYTVTAPEDRYWFPVEALLRDDNPDRILDDGR
TPWNRLADWGGVSPEGFHSVLIFNRVEPYPGRSRLRLVLRQWMRLDPRSG
PQHLGTDNTFWEDLEILLPWNEAYLAEPPPEVIPWRHQHTWLGRTLALDA
LEVGVGETRLTGTISLPAGERDPRLYATLVIGNQELEGGYYALEPAGEPG
RYRFTLSYDGPDRWPAPVELRLHAIDFVTDQVLEWPVNWAKYREPAPADD
RPMDPEDQVSLPFYDSELVSTHADDSGVAIEQRTPKRKAPYVKSSLHMGG
RGIPPDLGPGFEIVNDDGEVMTNLGGFGGLVYEGPDGKDVRERVAAMWWD
ELPESFRRSERLIVRYVHPSATLVLDETWTLPVRE
>STH1997 hypothetical protein
MCWHCSGHDWDWDWDWNCDCQCPCRCRGDRRRRFDRKRRCDLHDDWDGDW
DGHDRGRCRRSRRRRCDHRDDDPKSDLCSRPVVVHGHGGAKRFLFI
>STH3150 hypothetical protein
MSDENRPQTGAEHGHESPPVPKFLLLVYAVIAIFFVYYLAANLRFGPNSP
TGF
>STH1136 transposase-like protein
MVGVWPLPIHHVEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRL
NLVEILNRHLRIKQKTYAHTPVDKVVEALVAILGNCRYMKDLNFDPEPLV
ADPAVAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALAEIQSPLLQQE
VAAVAGPDRTGRVIVDIDLTGQKVRGETKQYTGTDFGYIQGKLARGYQIA
AAFLSGKQQRFAIDGLLKSGKANSRSGDCLLELIPRIEARIGRPLRRVEW
VEACLAQQKARVRELHHQLQTVSGKGSARRKQKLQSEFQEAVQHLREVNQ
RLRQYRQENRTNPAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGS
NIAYKRRFDAVPAEEWVEVEKKRFASEAVTVPGPTLLAPYPVRLVAMRRW
DADGREVRSVILTTLQPEEFTTTEVVKLYHGRQTIEAGFQEWKGTFHFGT
PRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAA
RCRATIRCLGDTLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSR
ET
>STH428 hypothetical protein
MRNRRNLLIWIATLLSAVGVFLLVVNGQMMLGLSAAVAVALFGAGRRACR
LGPDSTPSLDPE
>STH1432 small acid-soluble spore protein
MFGTGTTGQTGNSSNQLVVSRARAAMEQLKQQTAAELGIHNYDQQYKGDL
PSRVNGSVGGYMVKKMIAIAESQLAGGTTTGQF
>STH853 hypothetical protein
MRNSDMDELARRRAAREREAGGDDDTVRFTWTDTLAMIIAAYQVLMPIVL
LMIGILLVVYLLFRVFFH
>STH2283 hypothetical protein
MQVLHLVRQLLYLVMYALHLVMQVLKPCTYGPVSRRARCPSIREGQCGAR
QGLGRGTLVWCPAFSP
>STH1183 hypothetical protein
MGVTFALFTDSREGGSHELVAGTVRIDGERVNDTVRGPMFYIDGASDGVT
HDGQEGLLPTGLWAPGDAHHRGFQIENVGTLDVKLTGMTAELKAGDEVLA
GALDVKIYDDLYLDEDGHVLNPEATLIYSGKLAEFLGSGMSFPTPIELAP
GDLASIGILVSFPLDAGDAYQGTTIKVTFSAHAEQLRNN
>STH2109 conserved hypothetical protein
MLQIAFPAAAAGVSLTVDPAFQGHFKEGEWVTLWVEVRGDEEAASGEVVV
QTEAPPVFGPVSGTRYAVPYSVSAGEAQRVAVSLPNEYSWPVSVSLYADG
ELVDVQTPDLSWEPRYTLMVGVLAADDDILNTLSGLRGGDNVRVVSLTAE
TLPDSPGLLNSLDLLVVAGYDTGSLTARQVQSLEAWVDRGGTLLLFGGPE
GERTLGPLPASLKPVNVSGSAEVSLAPLGDLSGVPLEGTAVVSAGPLARG
TVLARAGEPGGGGEPPVLAAAAPLGSGRVVYLAYDPAGAPVARWAGQTAL
LDRLVGLSAGRPPAFDTDWRVQYAIQQVPDWALPSVWTVVLVLGGYLVAV
GPVNYLVLRRLDRREWGWVTVPLLSVIFLGAVYGMGSGRFQEGITHVMTT
TELVPDSRTGVMTGYVGLYAPGRSRLSLPLPGAGLVRPLTTGTFVGGVES
RIVAGDPLTLELSGLTNYNMTAFALEQPVTVPGGLELVDVEVTEFLVTGR
IRNTLPVPVSGVEVGTAYDVVSVGDLAPGQTSEPFTAGRKARVDSGRKGP
IPLPGTGTPDPDADPRREQLRAYVWESGQGRLGSGVLVMGWTEEPLAQPP
VPDLGRRVTGANLVYGFQPLPAPADGDLPAGVVLGRPVDSERVQLIGQNV
YYAPPGSHRFVLSLPVLDPGQVAEVTVDLQAPVREAVMSVYVRNQRTGEW
ELLAGQRATLVDWQDYVLPGGVMELRYDLSVEAEFLAPTVAVKGVR
>STH662 transposase-like protein
MEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRLNLVEILNRHLR
IKQKTYAHTPVDKVVEALVAILGNCRYMKDLNFDPEPLVADPAVAQAWGQ
ERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQEVAAVAGPDRSG
MVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIAAAFLSGKQQRF
AIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEWVEACLAQQKAR
VRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQRLRQYRQENRT
NPAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGSNVAYKHLFDAV
PAENWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRWDADGREVRSVI
LTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGTPRMRKYEANAA
FTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAARCRATIRCLGD
TLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSRET
>STH1290 conserved hypothetical protein
MKTSYPAPAKELDSVSPLFRKALAYVQHDAEDIYVLSAKYGLVTLDQPLE
PYE
>STH2397 conserved hypothetical protein
MRVVTSAPRQEMLWDERNLSRDQGKGAVLGYMFDSAEFAALREEYLQGAR
ERAGRLREAVAALREGGPVDLRQLRQEVHKLRGSGGFYGFRALSAAAAAA
EDALLMVLDGEQERDDHALADLVARVVAEIEAATL
>STH929 conserved hypothetical protein
MGASSCAVTRVTGPIGCTAVWLKRKVRPDRRSTEMHAANILLRSSARWLA
VVLPLGVLIRSAFVWLYTPGPFYWGDLIHAHSHTAYFGWAGLGLMGLILH
LLPRFTGRPVAESRPLTWLLRLAPWAVGGALVTFAWWGYAGPSIGFAALN
EVLWVLFAVVFWREVRGRPLRDWPAPLVLMGAAVLLLLLSISGTTLVIIS
EAVLDGAFPLLYQAGVYLFLELYADGWVEIGLMGVVLALAEREAAGGQAD
EGPAARRVATAALLLVGPAALRMLIPRGLEGPLVWLSIAAGALFGLAQLY
FLAIARRLRLPAAAGPWWRLAALSLALKAALEFMPVLPAWSALALDRNPV
IAYLHLKLLLLASSGLLGALAAVRGAAEPWSFRLYAAGSVVMVAALAAHG
LLARAIPAVNWPLYAVAFAAAFPAAAGGIWGAQPWR
>STH3033 hypothetical protein
MVLVLLFVPFDLRLCGRADLAADDWESPLAGRAEWRLRLRWGLLPVTAGA
QWSDGRLGQPEVRVLGFPVRGGNKGGRGHRPKTPGATKRKAGRKAGRTKP
DPELLLTLAREAVQLPGRLWRSLGVRLTAEGSYGLPDPALTGLCEAIRWS
AGLGRSLRLTPDFERPCLVGRGELTGRVYGFRLAAIAWHVARRPAVWNRL
VGTIRFRPLRTILLRGGA
>STH3265 conserved hypothetical protein
MGFGAYQTRLKNQYLREAENKYMAAFHKLKWTSENIEERMAKLMATNDPR
LQESLLADLRVFSAQAVEHMATLPFTTMNTPRITNFLNTLRAQSDEKHHK
LNTGQGLTDEDWNQLMELRHQAVYFEQELSNLLGLVGNNLIRWQPTVQAT
SPAQSGQASTPITKSVMLLEENLPVPPGEENALAPEKAQLPRPQTDPGPR
VDAAAAAEAIKRFVDMPLAGEPVQTGVYDPEDTEKGLSLYYFDARKQNGL
PMNFGVSVHGGHVVYMIDGRAVTEKKLSHEELIAKAKELLAKWGFDQVAL
VSSAENDGTLVMDFAPVAGGVAVHTEMIRVMLAMDNGELVGYDARAYWVN
HVERDLPAPQIAAAEAAARTSPRLTVTGEPRLALIADRRGQERLVWEVPG
RVEDQYFRVFVDAMDGTEVDLIRSVGDPAPPMQEG
>STH2429 hypothetical protein
MAPWTDDLVYRVQALAELADDLHNRLAQVEAGLERQRAESARLWAEHREV
VAEMRAALLPLKSRIGFVEGRLDGKKETASLLISSFLSAVAVLVSALLGI
LNLQ
>STH2241 hypothetical protein
MVYRTIDQHRLRCALAQFLRFPVRHNPYETVGILHHRHTLRSPTFRDPVA
GQS
>STH2983 hypothetical protein, proline-rich
METFLYLVLALAAVGALAYAVYQVTVRQQRMLAEITRLERLTAEVIMSAE
ALLDEIDQRMARLNDLAAQLEIRAVAEVQAKARSRAKSGTQPQADAPPDG
RPPAPAPPDAGDQEAPQPQPERAPEVEAAQQKPKRGRRSRAGAGSTAVPS
GPAAGSQQAGGSRQAADSGQFASPGQPSQPAEAPPAEPSPAEPARPQAPA
DRYGDLRQAVWRLADEGKSPVEIAEALGVPRGEVLLLLNLRGKKAPR
>STH2909 conserved domain protein
MSRPTLHNALRQLHDKEWVQYRRTGLRTVFCLVNTDDGLRLPSDILLQPL
PRPAKWIWAVLRTLDRPVTYQELRSLTGFSLDTIRESIQQLLHSRWLVME
GTARRPMRLKARNPAESKRQVEIEQWEKQFTDGVRSGLSRGQCLLYLIVR
MALPGSKILVNARLPGLTNPLTDAELEIDLYLPDYRLGLEYNGPQHYTPT
EWFPDEVQFRSQRARDLIKLGLCRERGIDLFVVTLQDLWPDQIWRRLARK
APVREIPDEEWHLVRWLYKHIRRYQESVHRENGAAG
>STH58 putative middle wall protein precursor
MFQRLRTSAALVLLLCMLLALLPTRPAAAAEPLSDAEAVELLQEYGIVLG
NPDGSLGLEDSLTREQAAALFVRSYGMSDLAKMLAATVPFPDAQGRWSAG
DIAMAYKLGLMKGDPSGLFRPTDRITYAEVLTVLLRMVEQEPTGTWDPDD
ILARARSLGIAPSGVGAREYATRGPVFWALASTLVNVPLRDAPNLLRKHI
DQVPPSLTVDPVQTPTTEARVTVTGKAIGARSVTVAGQQATFDPRTGTFS
HAVSLDLGVNNVTVEATDRAGNTAVRTLTIERKGVASRITISGPSIVPAN
SSVRLEVTATDSRGNTVPLEDLEATVTGDLATFDTRTMTLKTSDQMGRGT
LTLRSGNARGTYSFQVYGPSEKAAALEILEINKGHAPAVGKETTVTVRVL
DEGGKVVTDDYFRTVTLRTSGMSGLTVSDSTVQTEKGVATFTIKASRTGT
ATLTVSSPGLKSAEVDVQFLESTRIVLTPNPKQLKPDGSSKATIRAALVD
ENGRSITNQSDRDILVLLTASGTDGYFTNDLVVIPRGRSNSSGSDAVFVA
GEMPGTATIRGEVISDHKYSVQTLSLPVDQGMTGSRFEVTFSKTNPKPGE
AVTVTVEVRDASNRLVTTGSYAFQLKLSTSNNDRLIDGIPEGVTLTFPGS
GYTPVSDGRSASDPNKNPMSVIGRTEKGVARLTLVYERSGTVKITPVPVG
ATYEAFNGTDFGAAASSLNFYAPSREVSFSGTPAKVILTVDSDLGKDQPG
GAVKSAKALTVRAKVVDAYNNPIPNFREYATLERLPNGTGVTRIAGVNRR
MTQDGVAEFTVYATSEEGWDQYVVTVGSMKSQPLTIAVNRTAPPAPEVIA
IHGVKQGGLSPVGGYVGPDADFMEITLARQDALYGNQPTNYVIAKVYRKG
ESRPFFTSEAIDLANGVPTIRIPRSALKVGTYYYEVVVNNAAGDSPRSLA
LDDYTMATVVDYNSDYRLNSATYDALTGRLTLSTSRLTSTGTVDPSKIRI
VDGKEVLRLDPSVVTVSSVTSSSVVILLNDQKDELTPDRFHGSDVYVEAD
MGWFVNKERTQFAQPATKVPVKPMATIAEAALDLDGKRLYLYGEGFRQGT
LNLTAIGITDQAETVKLTSQDKTAVTPTDEQIVVNLSNATVSQLSKLSGS
RLYIAADAGWLYTGSGSSANRVGAITGTMAPVYSRATVTRAEYDRTTGIL
RLTGSNLAGAVLDPGRLRFVLNTQDGGWSPKTSPTAVGDTTGVITVQFSA
EDAAEFVNRFNGRVVYMNTLDGWLVDAAGRPVVRLPDFSVQFAVPNK
>STH2090 conserved hypothetical protein
MHSDAGGPPPGTPGGRGPAAFARDEGGRDEEGTSGPRGPHGPLYHPPPPP
PRSRRQSIVGPLVLIFLGLFFLGQSLGLIEWSLWEVVWRLWPVWLIVAGL
DMVFGQRGGWARALLVLIAVAIVLGIVLGIQPQRAIDRPTGAVPSSPSRT
FLMEPSERVPAAPSLPVTGQQPEPVTIAQPLEGISAAEVRIESAVSVLEI
RGGDLPDLLIEGTVVPLIGEQVQWDYDAVDGTGVFRLYSDKPTRISSGPR
QGEWDLVLTDRIPISLHLSTGVADSDIDLTRLHVPELNVEASVGEVSIKL
PETGAVKGTINAGIGEVILWIPEGRPARIRVETGIGSTSVAPGFLYRDGY
YVTPQYADQEDAVDLVVTGGVGTVDLRIMD
>STH716 hypothetical protein
MLQKLQLVAREGSLSVTDVYLGHIKRDEDRKLLAHADAYRWSDLDPQLRA
ILTQHLYQAIEGAQGEVNFPCMYTGEVLPVDKVASAEYDRLHNRPSVEVL
EAIRDMEEVHPQSLVDLGNVLAYKVMAAGAIKNHLVAILRFRAVPELGAT
PLRFSFATVLDLEDREESLFDEHTGSFRTQVLNNVIKRGSVSRAVFFPCL
DAEGREIADLMVYAGSGAAAWFKALEATRRLSPRREGQALLRMITEQNAT
GEVPHDLFRRMGADLLDEAADGLSAESVVRSMEKAVGHGVDRLGFQARWE
SAFGDLSYRPAYNSLFGGAEKPTRMKMRAGSIEITLTPADLESFRQVTVG
DRTFILFAVPERARVVVGKDLDLKIKPVGLADIQRWMTGEEDPAS
>STH2372 hypothetical protein
MKPVWVHPQTLEYMGRSIVEKVYAVKVPKGLRGVRRAGRQPGRALSRRRA
HHAGVHSQPVGH
>STH1515 conserved hypothetical protein
MGLWVGSAAGFALSAPKIFAAFGPDRQSAGDLAGDIIYLLNNVGLLLGVG
ALLTLLPRLRSGVNKARLALLLGCLGLALTTWLYIFPQMERAQPPEPIQT
YAETDPARVEYNRWHTLSGRVYGAAMVLGVGVIVLGPFAETRGGRR
>STH380 hypothetical protein
MTMRRWLALLLGAGLLCAGVLYAGAAPGEWRSAYQIRRRAAELAQAADVM
VLAEVGEQVLPSYPAEGRVYTDTVVSVVRGGGFAAGTELTLRLPGGRTDD
LEVIVDYLHPFPDKGGQVFLLLKEAGEKYDVLAFMELVDGRPRWPEGRAY
MRYLSQLE
>STH2475 hypothetical protein
MLDVGEIGVRDVSIVTAGALKPLAQVQALPRLLRLHPDMVR
>STH1559 conserved domain protein
MTECGPVVLWSVVPVEVILAGAEAGVPPLREVRVDGRLVLVAAGADGRGT
VVRLISGQAQDYLDARFQPGATVMLERG
>STH2678 hypothetical protein, alanine-rich
MIWFLLFLAVLGIAWFLFVPRTGDETALPLDGDRQDTDLPNVSAHATSNV
EPTLTHPAALFTADGTDMPTAPAEAGQDGGGQFTAGQDAGERAAAVQGAP
EPTYAGQDAAHSAAHAAQEAAPDAVQNAARKTAELGWMPDNYQEPRLDAP
PGNFE
>STH2733 conserved hypothetical protein
MRAAGIGQPSGRRLVRNRLEGMGMSSGTEDRNYGMIAHGLVVLNGASAFM
GSLGSLGWAAAVASVVLYFVWKSRSPFVVRHAKQAAGVQVFLFLLSVVLF
PFTMLFTVGAAASGSLGGVVALVFLVSLFNLAVGVATIVCGVMGLMRAQK
GEEYTYPVVGALVDRIDV
>STH1869 putative membrane transporter
MKGQSGERTLPFSVLLSSPGPIVVGAGLLIGRSSTQLADFIRRTAELAAI
VVSWAVYRVLRRGGEPDSARRRRLERTANGFVGGAMCLSGAAMLVLALFS
GGAEKGNVIPGLVIAGLGVTTNGWFWFRYRTLDRREPSAILAAQSRLYRA
KTLVDGCVTAALATLALAPASPAARYVDLGGSAAVALYMVISGVLIVRGG
SRAERTV
>STH2488 hypothetical protein
METFSQASVPPANYGKYFIDGGICTMADTEPKAQGNEAYYATARTLGTGT
TSSPTMKIWRVMDDYLARFGRAVSASELAQETGLTVEQIDDIFAQEYYQK
HYGFRRFDSLEEWRAWAMETGVLYNPELHRPDPEEPEEVPAEQGEPSEEP
SPESPNG
>STH2657 hypothetical protein
MPPSTTSCWPVMNPDWSGSARKSAAAAMSSGRPTRPTGCWAWSSGRSSSP
VTSIHPGRMAFTRTAGPSVMARAWVRARMPPLLAEYASVLGSDCSARVEA
MLMIAAWPAARSSGAANRDMR
>STH3285 conserved hypothetical protein
MTHPPASGGRGMRLRLLAKPGAAYRAPPVGASKTTIQRAFGRLEAGPMEP
IDREVVQQRLESFVNTDVYMHLETTNGAYAGEAAGGNGMAVCAFVRNARV
RFSRAQITGSGPYRVGLQLHSGEGWTYAEGLTDYEVDEQGRLLLAGHDAN
GCLAIALELSHTPFPMK
>STH185 histidinol phosphatase and related hydrolases of PHP family
MRIIADLHVQVRRGPAEGPVAAQVSAALVRGLEAVALFAPHSPAALPVLR
AVRAIDARTPTLRVLSGVSCRILSPEGDLRLHTRAGREHDLVLAVAAPQV
ATALARWVPRYRPAARRALTDAVVAAVYRHDVDLVALPARPEVDPAEVAR
ALRDRGVALEVCARRPRPPAALLRQLARLGVRFAVTSGAGRPEEVGDCGA
AVALLAAAGVPPEQVVNSDRGGLTEWLAARRRLSDPGGWADWSRQGGTEG
RERPGGRERRPGDWADWSAQGGEIH
>STH2008 conserved hypothetical protein
MERIMDVVAEAHRIGGAGVLWPGFDLAAIPILLYGTEEAWLIGHPAPPEG
YGHAGEVAGRPVYRGPVRPELAGNTAAPVGGHLTALVNLSDSPPTDPGAL
ARLILHEAFHVFQQTAFPDLPRMDAAALQAMAAYPENDPANNAMAIVENR
LLARALEGDPAAPGAFVSMRQHRHRQLVRMDRGDAVVYEQTVEWVEGTPT
YVELKAGAPTDGLVERLQTHNLGGRHAAYRRFYDTGAAQALLLDRVAPGW
QARIADAGGCLQRLLEESLTEPLPPVNQVVVAEGLAALLEAEQEAEAERQ
ARIAALLRQLDEGPGLAVEIRLPPEVQGLMWDPTNLLTISPGRRLHTRFC
GAVGPDGLRVTIHALCLEEWGERGRRFRLRLPGRPEVQRGGRLRIVTEAL
SVDAPYGHIEEEPGLMRVQLWLSRR
>STH266 hypothetical protein
MLASLAVGVLACLAARRLVTLLSLIVYLAVGGRRTASGTVGRWLALSAGV
LAPIGLIPLIWWVLLTDPATWTASQPYHIVLMMVSLGLSLGYFRDRRRPG
TVMWCLRQEGLALAARRAGRRQL
>STH2201 hypothetical protein
MCTAIGLMAFSLVITFIRMRYSVMIRGSAAPTNVRFSITMVTFLYMVITQ
LPGIRDKVDWKRPLGRTGPHSTPGGLALMVAGLFTAISPWGVGWTHVFDG
VNYALLMAKPLAITGGLLMLAGAGLLLSARLGRPPGEWLADGVRWRIAAR
PPETAAKGGRS
>STH402 hypothetical protein
MRKVLAGVRGSIAVETAITLPLVLLLTVGGISVLLWLHHKTWMQALVAGT
ARERAADAAWTGYYKDIRDSLRASGSGLVLADVRLFSFHLPVDPPFVVAG
ACAAPAGRVPRIGTYGAPGAGSGVTAPTDGGGWLSPVQALRGQISRWLER
LEGLAAEAEDHADAAVMLAEQAVWYRRVADNLAGGDPFRVRQAVDYLAGA
AVEEVAALPCRTDGSGEVVLTAKAVIQGERTFGQR
>STH1462 hypothetical protein
MQELIFPAVGRWVSAQREWEDLVAEMRHELPARLCQDLVRQAQHRAQQVN
WAFGRQMERRFKEIEHARRDGARKKGRKPEAKALRRRLTLMDPAAVARDL
AAGLTPPDLSEYNQIVADLSRAMPHPGIATRYTLPPQALRALADQIAAAG
RPMAQALSEHAQNLKQYAELVDRLRAQPLVQKGARLIGSAVATALEPLVG
REAAAQLMGVDRSLDRALEQVEEGWAAFQREYAEFAAERQRRHQLIYLSL
VGGLFLRVQRDLRRLGCELVVDEQGNCRTVATPERQAWLRRHFQAAAPAV
RTALDEGRLEEAEALAARLVAQVVTDPVAARTELDGKSVGYLAYALHLSA
WLARARELWGQGKPEAAAAIWRQVFTDYPCLVADADLHPVPGEPLPLVTA
GFRLAAYAEWVRQSNRADEADALLMAQVRFAERLSRLNPEHALPGEALSP
EMREWAATLSAYLRRSAADPGALGLPERPVPLSRYPGVLRRFRQAVGTSL
ETSPLDRYLRGRVNLLRGATAAAATGGVGLLAALAWLLL
>STH458 hypothetical protein
MPAGVPVSLWNAEGNELAVMELKQSYPSLVIAVPGLVLGEPYAIHAGEGK
AVGVPDDGHFVGWEARRPTWLTGAGVTTAPRTFPGWRR
>STH283 hypothetical protein
MKLELDPEAYAGWFERLKGYVGEVVAADVLARQGYQVELADVPNQPGWDL
LVEGQPFNVKITDDPDQILEHFAEYPDIPVITSPEVAAELPPELQDHVLA
LEGLEADQIAELTQETLDGVEGLDIGPSFPVVPAALSALREIQHLALGQT
DLGTSLRSAALDIAGTGIGGWVGAKVGAAVGTAILPGLGTVIGALLAGLA
GSMVGRAITNDIKLAALRKALEEYQEQYARFHKTATEEHNILVAKVVRTI
DIHRAELLAEIQAIRRRHEEAARQARNRFWDVAQDLGKRLPELLLRAEAE
LAEEQQRVLAELPRSPVWRRILWPSADDLRYRAVELWFTRQRQELRRMAS
EFSKNLDKLTPEGMLAEVRKCLSGKIFDSAALKEAVDRVVSEYNATRARI
ALERRLAVQEVRAAVGKRAAELHRYTAEQVKTLTQRVRPLAEALRPYREK
VEAEAAKLGYQLK
>STH1058 hypothetical protein
MNLSESTKAALILDSTSARDQLNLLWHEVSARLSR
>STH3227 conserved hypothetical protein
MLRMELQFYALFMTVLSGIGVGLLFDLLRAVRRFLRPGPLLAALGDLLFW
GAATAMVGTGLFLGNWGEYRFYVLVGLLTGLVLHFALASPAVLWLADRLL
HAVAWVLKVLWELAMRLVWFPLQAVAAVLWRLGRGLGTWLGRRLRGPYRW
LRLRYLLARRRLRRAWRHWFRGPKT
>STH1468 conserved hypothetical protein
MSLQILRPVTIKARVTESLKARLTAELNAAIKQLDDEMAELESQVKRAQL
TATISPQQQLQLHQLVEQERAVRAEKKAQLQEEIRRVQALPLGAEVVQGT
IQATATVKVGDSLDALMTAEIVVEDGKVIEIRGEV
>STH2877 conserved hypothetical protein
MSLSEPNARPPQALLYVQDLPRAAAYYAGLPGFRAVPAGSADVLVTAPEG
ETLVLLRAGTEPPPALAAPPAGPGAWVYLHRRDLPDLAARLRQAGLMPDG
PTVTYPGYRQMLLPDPEGYVLAFWEELPVTDDQILELYRTGPARLADAVG
RLGDAGLDVPRAPGKWTARQIVHHIVDSDAETFHVLRIALALPGRRIATT
VWENDEWMRGLACDRRAVEPAIALFAAARAWILEALSHLPDALERTVTWP
SGYTASVRDLLRQQGGHAVHHILQLEEMQRRLKK
>STH1292 transposase-like protein
MVGVWPLPIHHVEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRL
NLVEILNRHLRIKQKTYAHTPVDKVVEALVAILGNYRYMKDLNFDPEPLV
ADPAVAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQE
VAAVAGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIA
AAFLSGKQQRFAIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEW
VEACLAQQKARVRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQ
RLRQYRQENRTNLAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGS
NVAYKHLFDAVPAENWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRW
DADGREVRSVILTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGT
PRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAA
RCRATIRCLGDTLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSR
ET
>STH2378 hypothetical protein
MPNARRAATLLAILVLLTGCAGSQTDADATPVEKQASAADENDTPDRDAA
DRNIDPVNDREWALRPATRVDLNGDGLDELIIGELGRWSTEKVKVLRQDG
SDLLKLDQWLPDTTQTRVDLVTMDGLPNPVLVIHVGDPDEWHSMEFTWME
GEELVSVWGWHPKHNVAFGEDYAIRPDGTVEITGDIAGFTIVRRYRIEPY
ADAYMPYVARKQDEVVIPGSYPTTAADLLTALFIARWYDLPDLVDQYLPD
PDIRKAFMKEDFGEIRYRPIPVETGLLVYTDCGPQIEPADPDDNGLVEFL
VSVASYEERLYWTGEAVISTERDGRLVVKDIRITDRGGFLPRRG
>STH2603 hypothetical protein
MHSAQLEYAEHSPARIRALLNRIDPAAVGVETPPQWYAEDVYFEVAYESY
GVAVPWAREKGREVRPVDWTADPLEQVNALLWPAVNPEPDPGGSGEERIT
YEDLLQSWDVPPLLFADTPEWHDAVNRSYANADANPNPYGEAGRRYMLYR
NLMIAREIVNMAADHEGKRVVVLIGAAHKPDLDLFLATVPNIVVRQASEW
DGATAAEVAAEERRADHLAILWYNLAGTRRIKDGDVDLTRMDRLLAGLEK
ATPLDPEVRFLRARWHHVRGDTARAVAQYAALAWGQTWEDRPFTFPDVGL
YKRMMDWSNAYASMGQTVEYGIDNVFSPVGNLTVRQRVLYELATQEHDPA
ARERARHELLAEPLNVRQRQQLQALLNPPAAEQP
>STH2273 hypothetical protein
MKIRGISAVLGVLMVVQLLVVPAYGAEVQDSSARLYTTEDRLHSGSGTAI
SARDPLVQVNPGGIALEGSITIDGLEVPYRITGNITRSAVNTDRLVVEAT
DTLDNYEVVFLALEQGEAVAPFFTDSAPETVAMLRLYMRDKRTDTFVVQE
LSSPLITKAIKEAWRFSADAPVNTVEAELWHAKVYEPVTVEEIDVTPVQA
TNFRTYDRTILYRVTYQISGKTVYEDFRVRHYLQCPTVITIQEECGAVLE
IVEDRSWSPNDPNFGGPATFTEMREAEINFATDAGDYIQSAYWGGAWKNK
GSVNFSVRLGYSFKLPLLPLSLSFSTGYKSSEQVASEGSKRYTVPNSNGE
YVRNHGIAYPTTAILNDADYGHNIWAEVLVGHYTQPGQKQLMAGWKYTLI
NMGNTTWASTETVTGSVVYSNY
>STH1850 conserved hypothetical protein
MKWEWLKEVAAEHRYKIIGGLGGFLFALLVVWFGWWALFILFCTGIGYWI
GKRMDEGPESLVQIFERLLPPRRR
>STH1174 hypothetical protein
MGLRFVWSSLLSVILTGGLLGATTLALFTDQAESSGQFTACTLPLRCSTV
VNTESALFGGSAPARITKAGPTEHVLQPGEQHDITITVELPKEAGNAYQN
AQGTLTITFFAPQRTNMEWVPLGPKLKADNVAVPATGRVYDLCQVELTGG
EVVNLDKARGIWIQVWDPTPKWNWAPIIGDDHNLWSDNDLPSGGYNVYIF
MEDGTVYRTLTIPHTRNQTNERRTS
>STH2462 hypothetical protein
MSGDRSSLRPEGGGASVGAVPRRWIVVACAVLAALAAGIVWVYTHPPLQS
TGMHTRVDDQRMPVAYGFDLENTGRWPITLVAVKVDGQEILPPFAIAVSN
FTEGHLAGSAAIALETYGHKLTTGPVYGWKMQPERYTGHARYAVRLGTRG
LPPAHEKIILHHRYLGLPLRHDLTPWIW
>STH1838 hypothetical protein, proline-rich
MQRTYPAPAPLPWPPPVLPAPGPAALGPAGVQGGRGTGVPGVAFVPVTAV
APPAQPGGPHPLPALPPCPLPWGFPNPFAIRAAVVPAPPAQSGSQSWPAR
PGMVTVTGPAGYLPGIGYFDPLQVAAAAAPLLEAAAQTPPGPYAPPAP
>STH477 conserved hypothetical protein
MMGAIVRFVVSAVVLLVVAWLTPGFAVNGFWAALFASLVIAGLGWVAQAM
LGRGASPAGRGVAGFIAAAVIIYLTGWLFPGWLTVTWWGALIAAFLIGLA
DAFVPTELR
>STH1163 conserved hypothetical protein
MHEEASHMQKGILLEVEGKTGIVLTPTGEFRRVPLPAGDPAVGDEILVPA
LDAGTGLRRWARIWAAAAAAVLLAVLAPVGYWHWSLAQPAALVMIDINPS
IQLTVNGRRHVVEAEGLNADGQEVLDQIEWRRRPVEAVTRAIAAQAIRLG
KLNPAADDSAVVVAVAPLAGTGRSAGLTQSIVEQSRSALQFVVVQEAESQ
GAAPLAGVAVVQATPDEVEEARRQDLTVPRLIILEEVQADHPEVTADAIR
DVPPGQFLKSLGIDPGEVFSRAEQRRAPAQPAATPAAAGDRGVHGRADDG
REERHSNSWWNLWTGNSGKDSSGKGDRGKDKPGKDSPGKGNPGAANPGKS
NPVKDGPGAANPGKSNPVKDGPGKDNPGKSNPGKDGPGKDNPGKSNPGKD
GPGAANPGKSNPGKDGPGKDNPGKSNPGKDGPGKANPGKSNPGKDGPGKD
NPGKSNPGKDGPGKDNSGKSNPGKDGPGAANPGKSNPGKDGPGKDSPGNG
GAPQGDSKGSRLDDGRPGNAGKANAGKDGSDKGGPGKNGSANGRSARDDD
GGDRPVQPGQPRPGQPGPAAEPAGAGKPGPAMPIRAEAHRSGGR
>STH2970 conserved hypothetical protein
MDKLIGIILVLVGLAALFRITLSGAWLFLLIAAALALAVKSGWLGRGAYV
AAVVFVLLAVAGLAARTVIFGLAMLFRLAPLILLALGIYYVYKAFVR
>STH1178 hypothetical protein
MRALGRQIRESGSRLPLVLIPVTMVISSLAAGHVTGATFALNQDVGQAQS
IQFQALTTSQVLRFTSGRVQLVEDGGPDQVGSEVVRVFRIMNISDRRVDL
SVELTGIPWARAALSRSVLGPGETAEIVVSGTPDQPGQYQGYLTVYGMGG
FLELGMEFTVVIVPAPDPCRPAEWPAPGARGGEASGDPVREPDPEETEES
EAVIGLHPDLLGLPAEDAPEPAAAEPDGEPEAPEAADEAGHEPGEGAGEP
AAPGDDGSASESGDEAVPSGDGVDGGAAAPPDTTDPCADAPGGQLPDPAP
DCGVPAEAVVEEEEGEQAAEPDDCAPAAAPPGDEAAEPGDGDQDGRKEED
EDAAGEPVEDDAEGTGDPHDDELTEGDTAGEPEDDADDSGSGAEQPGDAV
GGTDEAADDGIAGGDGDAGSGETDSDEPDSDGPEDGAPDDGDPGVSDDAD
GPGAEDADADTGDADGGSPGGEQAHGDSAGDDGADPDDTGQPLTGSDAAD
DADAGGQTT
>STH1887 hypothetical protein
MSRILNRPVDVTTGPDGRPAAFRFAGGRERVCAVLDTWMETGRWWEQEPE
LIAYRVETESGGVFELNFIPREQRWLLYKAYD
>STH1954 hypothetical protein
MDVTAAFDPLNPSVEAISLRQRVNFLATAADHAYGRLLELFPEAASAGRP
QIRLYESHEAFRAAVGAAAPPDALAWYNPGDPLRLSPEFLRGLMRWETER
DLGYEFVKHISAAASGQPVALIDPIAMGLFERSTAGDLPYLPDPRRLVGT
PLPDLAALFSTPVQSLGAAGQRAYATAAAELVRFLQDRLPAEELQGPAPG
RGWSLGALADRLGQTPETLAAEFEVFLHRQLQATSVLNVPAAQSRVPEGL
PDAIARRAEAAAGGDVEAFLRRTSPAHRDGWSAWLAAARRYGLVRYEASL
LDWERNEGVALVLERLQFRDGRTVIGVVRQHWALEDAGWAAGPVESVWTG
ADGP
>STH1401 hypothetical protein
MDLNGDGRVEGIIGALGEWHRDEVIVVDGGGAPLLILNDTLSSFSQGRVD
LVAVPGVADAVLVNQVGDPGDWHSIQFIWMEGDRFVSPQGWLPKNSLAHG
QGYTIRPDGTVDITGSLAGYTFVRQYRMEKAPPESSWPYQAVLVSQEVTP
GPYPTSAADLLTAVFLARWYGLTDQLEQYIPDPVVREAFLNQEMDEIRYW
PLPVQVGRLVEGETGPRIEPASPGDDGAVEFLAVVQQYEGGTYRTGRAVI
ATAEDGRMVVTDLEILEQGWSL
>STH420 conserved hypothetical protein
MASQQIRCTVSSCYYYAAGDNCAAEEIMVRADPAALGKSGMEVGDIGGEA
RQSNHTLCETFIPHQQGPKPGIRRIAK
>STH3030 conserved hypothetical protein
MLVALYLAAIVAANLSVVYFGPYASIINSFVFIGLDITTRDALHERWEGR
NLWGKMALLILSGSVLSWLLNRDAGRIALASFTAFAASGAADTVVYHLLR
HRPKLWRVNGSNAVSAAVDSVVFTSVAFGSFMPLIMAGQFLAKVAGGYLW
ALVLNRVYWRRAQAQER
>STH931 hypothetical protein
MQHTHPEVIAMNPEELSATCQYIISELGRIETVAGTLAMIEREHYDALNR
FDDRALLDLATEEQSAARQLSMVKHVCGELARRMADIQSALERRPEGGED
RAPAH
>STH2674 hypothetical protein
MSQTISREEMRQAYLDILERAEDATALRLAIICSDQFPETLKERARLAPS
LEALRALLAEHRRRSVPVG
>STH1606 hypothetical protein
MSTLSVRVRNPFLLRGSLEVVLEVARMDLANAEIEEIRGLLAAIPNSVRP
TELQVPVAAARAALLAVRYFNQSRTRHWLREEMVNALLDLERALERHLRD
AAGGG
>STH153 hypothetical protein
MLLHGDFGWRNLLLCDGGTLVLLDFERAVIGPAWLDLAKCLDRELRLPQD
REGFLQGYEKASGMPLARPPEAYLTCLRLWVAAGILLFTSKHADEPFAEH
GRRLLQQVTGDLDLA
>STH860 conserved domain protein
MRLRLVTVPDRLWADRSIPEGAKLVWCYVSALCHLRSEFTYKELREGAGI
SLPSLHKYLSVLSRAGWLNCARISLRVVRCEITGPMGGPRLVLPTDILFE
RRLPRGACWTWGLIGRMGGRFEYTELRKATGYSQDTLSKHVRALIAQRWL
VGGPHRKARRVIYNVRGANPRAIQRAQELQELERGLQVARSTPGYSQGQY
ILARLIREMYPGVEVLENAEITGLDNVETRGRLQVDVYMPDQKLAIEFQG
HQHAGPTERYPSVEEFRARRRRDLIKRGLCLEKGIELISLWPQDLSCAGI
ARALGHRLHFAGAREDRWHVARFLEQRAEWYRKAAARSQCSSF
>STH2177 hypothetical protein
MHRGLPLSRRSREDGVNLWMRRCGRLRGRAVGGTPAPGQQDQPRPQQP
>STH2118 hypothetical protein
MSKSYQRWMLICTDCAYIIAAALAVWAVPLLYQLGPLAVLPYVGIAGMVW
SAVTAAGLAVIWFRHERPDSRFRVRLTAWVRRPAWRESEA
>STH325 hypothetical protein
MFPIPEQIRSLLKSRSMVGPSAPSAVVTFDLRSHLAGKPFGEWPTVPLPN
LRGITWRRSDRTYFMVQNSPRALVQCDQAGNVIKTLDVNASLIGLDLDST
DNDLLWLADYYRGVLAVRISTGAIEVDRVITDLPYITAVCALPDTIWVGS
GYNFQRKIAVYDRQDWTLLATIATPVGPRDMTWDGRYVWLAGDRDIVALD
TETLEAVPGQVVSVQGASRLNGIAVADDSIVVTDGGRIYRVHMPSLTVRP
ERVRISKDKGAVAQRAQVSWPNGNPYNPRDLPGYYSPDRGIDRPETLNGW
KDVVVPGADITIEMGYGEDRSLAFAGQVDEVSIDVEGGDSGADYSISIDC
RDHGWRLLDQTVVNDQGEYYLAYEDPEGIEASLTARDLLIRAGFAPDKVF
TEPTGIVVQCKVFERQSYADALEWLSNVTGYELLLYDDGSAYWRYPSDRQ
PAEWRQPLRLEGTDWVTLGHAPIVAGSMVLEDPTETDEGGTPVCYTEGVD
YEVDLAAGAVRRLDGGSIPDGGQVLASYVYAAWTFREGEDLFRIGYKLTR
RDQYARIRVAGEATDPDDPTQKFPVYGTWTHPAAAANGLPPAKVQFVEIR
ELDSAEKCQAAANQLGHDMLPHAREVRFAAVAVPWLQPGDCIQIVESSTT
ISEVYRISELEIEYGPDGAIMYGVAHHYGYAPPPATEEVTPDA
>STH595 hypothetical protein
MARTRPLLICRPGLPAALPAALSKSRSRLLRTLFAALLALASACGSPAAT
GTFTTLARGTHSGITAQEAALITSLAEWEALWRRHASRFDPPPALPPVDF
SRSSVVALFAGERPTGGYSLYITDVAPQGDSLRVTALELRPAPGRPVTQA
VTQPYHVIAVPRVKKDTRLEVRWRVRTDSAGLPAGAVLLGPAHHPLDGDA
LTGLRRHPAAEPADHLMDLRAHDQRVGARHPLEEGLEEQVDLVGLPGADA
PGAHEVRGVILEVQVVVDLRDPTVHHEAIFVHAGPVGPAAAQVEADRVAA
LHA
>STH1179 conserved hypothetical protein
MGLFAARRAAVGVAGVSISVLGAWAVRIIANARVTVIGCAVFGSVAVGLV
AVGLTGTGISVPACDPVVGGLVGPADRVAGLLSARPTVVGVVLRLTGGVT
LGEFVVVRIAGPLSVVLHRLPGRIFVLFLAAVLVAIAGFGRLIAGRSCSG
GAVIGLRSLLAFLLLHDGLGGNPAVRSRIGQLAAGSVSTRICRVRGRGGA
TIHAVPGRNGLVA
>STH2873 hypothetical protein
MYTLRYPYHLPYFGGGLVAALLAWGFLQMAVLLEPEVKPGVQFIVGAGFA
VVSWAHISVVACLTIIPQVVRAGRLLDTFDPIWILGGLSPLVVPAAAVAL
RIAGRPLLPSLAAALFRFSVPLGFLSLIPRWLHLPKQFRWLFLFTGVMFL
VLDGLGFDFRPKGRARSA
>STH2122 conserved hypothetical protein
MDDRAGIFSAAELERLENDLTGRRFGFRVIILEEAFAGAEPVDAEARFQA
MADELLADVPRDAVLITIAMEEGLVDFRVWRDGAVQSAFREATGRAFERS
VEQIMDAFVPPAAEGDIAGAIVAAADRIESLAAAPAAQPLPATGGSGAGR
PSPGPRGSGVAQPGAGGAAARTSAPGRIGAVLAGLAALIAAVVELALFLQ
YRRTHRKCLELRNSFVSDLVKMHEQDLPLARNYDGEETRGHVTAAAAASD
RAFDAYRAGSEKLAEAARLARRWRFGAGTRALDEVYRAFQQAAAANREAQ
EAYAPVSAAILGWDGAVGETGARREAAERTLADLQDQTGWEFPALRERMD
AAARLQQAAEAARGKDPVRALRIMREAGETFAAIGADLTRLSGLREACAA
QRRDADQARAEIEQARVGLGLRFVEEDPVQALERALQEQARAGERMARGE
VDGAEQALAGGQAALDEARAILARYREAVEQYPGKRQALVEGAGWLAAQQ
GPARNVLDQLAARYAPEDWADVRDLPAALADLERRAQADLAEAERFVRPE
VQRHLQAYRLLNERLAELAALRERAALLTTLPDRLAAAGEAAREKLTRAE
REWAAARELVSREGLALPRDAAERADRIEVALDRARRLLEERPLAVSRAG
REAAAALELAGGLRQAVAELARRAGEARARLRQARAEATAALVHSRFNPG
AAAALQRALDAGEQAMAAGRYDQAVAEAESALRSARALVAAYQRHKAEER
RRQMAMAAAAAQAMRRASEPRGHFRPGPGGRGSGGGGGFGGGPRGSGGGG
RFGGGGRGSGGGGRWK
>STH333 restriction endonuclease R.SthI
MASARKPPKPLNLPSPLVLPDPAGAIEFGASLDALRQLFLVQALLATVKT
LDLNVLNAELAQYVPAHRLQEMAGVGLRAEIVYPVPCLIEANPRLIAYYR
LLLGYSAKRLYRMNSPLGPFETLEKKGVINPRLRPHIPTICSAMIASAEK
LVDGLGIQRITRELLDDLQLLTLGSQLQGARNVAIGMAGMHAVRLIIEDI
VAPYITKSDANSIVLTNSAGRTVTIRIGADPDVRIEEKVGRNVKRTLSIE
VKAGEDNSNIYNRLGEAEKSHNKAKKKGFTEFWTIVNTSNIDLVKARVQT
PTTNEFFLLRLLKDKSTPEYADFRQRIEALCGIPTT
>STH1431 small acid-soluble spore protein
MASNNQSSNNSILVSQARAALEQMKNETAAELGIQNYAQQYKGDLPSRVN
GSVGGYMVKKMIALAEQQLAGGTGTTTIR
>STH1734 hypothetical protein
MGVVVQSFDGRMPGDLYERLVRLRRRHRTVLQAAAAGDFSNEAELHALSR
RLGRMLAETARMPVEQRGGLRMLEISLAVDRMFPELRAWNQQAAFRYPPA
GRAAGR
>STH2412 hypothetical protein
MPPRGRSTPGAASALFALRVGEGAGGYLSERYLLCSHCVLVRAPADTSPG
GICSVRTACWCKRGGVPLGAAQRWGHRLAQPTVTAYRPGASGRGHPRRGW
PHGARTDRKPQGRVACRGGPTARANRPNASGRDHPGRGCPHRA
>STH36 hypothetical protein
MTDRLHAGPADDLAAGLLRAVLGGRDRGRISLVGLSKNAGKTVTLNRLIR
AAAGLGIPLGLLSTGRDGEPVDAVTELPKPRIWAPEGTLVATAAAGGPDG
GSGAAGEDRERTARVAVLRETGITTPLGPVVIGRVVRSGEVLLVGPGSAA
RIAALLDELEAAGAALCLVDGSLDRRAAAAPAVTGRAILAAGAAYSASMD
ATVSQVRYLLELFDLPEAPPEWRHGARPPVCLLLPDGPPVAVPVPSALTD
PAVVAEAAAAAPAGAVLSVTGALTGGLLLALLSRPCAHIPILVPDPTHVL
ADRGAWRRWRRQGGQVFVEKRVEIAAVTTNAYSPVGPSYDAAAFCARVAE
VAGRPVVDLVAGIAHHLPADHPHRDPVHR
>STH2759 hypothetical protein
MAMTLLDSGNLFLTWLFVARRSLPGGVLVLSGDVLLGSLLLGRGGLPAIA
GSRGP
>STH2334 conserved hypothetical protein
MRSAVRPRGPDARRRGPHRAGEGSPAVVMTRERTARGGACMRGARLVGLL
CLMALLLAVPAPRAQVAGAAGQDRAAVLQELFVLNRRLEEARAALADVEA
RLRAVSSQEEAALADLARLQEELAARREQYGRRLRYYREQGNRGPWVLLL
TAGSLADFLWRLDALKQIMDYDARLARVLMETQAAVAAQAERLAQARAEA
DRLRDEQLARVAELEAAIADREAILAGLGDERAEVEEALAAVEADWQQSA
MPVLDALGQALQQLGTAGLQPDDLRFSLFPPSAVATITEEQLNAYIGGYD
ALAGLRVDLVGEEEAFVLSGTFADVPVEISGGFLVLPDGRLRFEPSLMQV
REFRVPEGVIAEIVSQGLLDIDVSSLVAPLTLTGVELTDGRMTVRAGLR
>STH243 hypothetical protein
MMDLLGRLFGRRQPESPAAAWLRRRAQAEPAEVVVEHFRAIEAHDLEWIL
ATLTPERARLYNSPSTFDRRRLSVKAARVTRVAPAPDAPVARVPGYAEQQ
VFRVEYELELAPGEARRDPTLAEGPQWAYFLLVRERAGEPWLIADWGR
>STH1057 hypothetical protein
MRGKQQGVRSGRPAQCEVTSVAKAGVDRRREETRTPTCSVLVFSCFGARR
RRATVVSVRQGLSRDVGRFWRSDSAMESRAFTDRLASTSTHMRARVIPLP
RLTLPATPGFHCRRARERLMPQT
>STH268 hypothetical protein
MITRIEYVVCPTRLVPDAPYISQLASADALADLIRKELRAAYPAAEVVVR
VARLMEMLAPDACLQVVAEDGCDQDARLAVEQIIQSVCTDRQAEWIIIDR
TEPMLQSDVAAIAENILRCHLGRLHPPGTIHSTVREWASRIISDPSYAET
LAEVGGEVPYHQLTQYQAGITALVSDLCHDLVRFRRGEATDLAPAIIWAY
ERHLSVLTDPPAPSPSAAWHLPSPSEKRQ
>STH2136 hypothetical protein
MLGPVHPLPARRPGTAVAPHRGRRANERRRLSVYRIQKGTRKSGAGRHVA
VETLRLLLLLQFAALLYLIYRVHRLEQLVNGRIRTTPRLTQRSGRAGKVV
PLLRDDIPPGPFSSGNPDRDDNS
>STH3288 conserved hypothetical protein
MAVLLLALSGVVAGCSGQTGNHSRDAAQEATLPEGFPEAPDWYHHAPKPI
QETYLAAAHHHDELQHIPCYCGCGAFHASNSDCYFQRDETGAVVAFDRHA
VGCQICLDITRQTVAGLESGRSLTRIRQEIDEAYQAMGLTPTPTPLPPPD
Q
>STH2386 hypothetical protein
MRYNGAFISGMLAGLALGAILVVALTPQTRQPVMQGMSRMGKGMRRMWND
GMDAVADAMTGDVD
>STH1728 hypothetical protein, glutamine-rich
MGLPFPVGEGGVQEEDSGMTVHGDLERAIAMAQAARGNYLLFATQSEDEK
AHQVFKQMAEDMERHVMILESRLQYLEQYNALVQHSRGQQQGQDQGSGQG
QRKAQGQGQTHGQGPGLHQAQGANQPPGQGSGQGQGQGADQREGPQGMGW
QFPRMDGQGGGQQAQGQRQDGQQQAQQDVPQQQVGGHGGAPHGQGGPWQG
GTGQSGTGQGRWVQQAQGGGENPAGDGQAGGSGQFSQAGQPDPSAGAGTD
QQMASQCGQPQSDRQQGQQQSGQDGQPWQQQGQPSQGAEPWQQQA
>STH2256 conserved domain protein
MRRVLALLMMLTMMLVPALAQAETAVRLVVDGVEVQTDVAPVLENDRTLV
PIRAVTEALGFEVEWDQETRTATLTKGETTIQLTVGSPEAVVNGEKVALD
VAPFIVSDRMMVPVRFVAEEIGLLVDWEQETRTVLITSQPPADEAAETGE
SDEDGEADEAGEPAGVVEPAALELLAQAHVASEVNVRQTGHFTVTIEGGL
IPVDSEIFLEMYQEAPERALGYNTVRAFGMEQTIGVAVLDGQYWMQDETG
AWAQMVMEEMAPTDRLSDPSALVNLNPAEYDWASATVTREAYEEAELQVV
TVVMDKSGLAALIGEASELITDVRMEARYWLNDDGTLHHIDVYVETIADE
PVPIRVVMQGTVFIEPWDGTIEFPPEITGAAE
>STH1255 hypothetical protein
MSFVEHTTGGPRNQGGWSRGALPSGVGGVYPGRGTPPRLEAQARCGWAAP
AGAGEVGIMLRVRKWPIAALVLAALLSACSAGAGSPAGETDAGSAPAGSG
APAGDTAPGALPAVLAEQVTDGLVRIAFAAGDTIDRPGIYLLDTATGQGE
GWLPPPEDDWTYFFGQVTDDNRFLIGTAYQEDDGYIVDRQTGAMWRWDAS
RYRLLLAAEQGFLFAELAEEPDGFRDETGRLIWTGPDLRVRHTFVLADPV
RSAVLAPDGRRAAILLSRGEVIALDLESGAAVTLADLQLGPDAWPAMAPF
DDLLQVAVEIPDDQAPPAAQPHRLVRLDWDGTVIADLRAPGYPLTFSPDG
RWMTWVEWPVDRLAPVTVVADAWTLEPGLRAFGVTTCFPAVGSGGARWLS
DGSGLVVHSSDGYRLLTPAGELRALPAFSGQDWKGEPQPAPDDPDLFALG
RIAVSDGAGTDQMGITLEGFVTPYGHVNPWGRDSAVLRFVLPPKPGGGMC
EERPPMETAVQVQAGPIPEFPLVVAGVEGCLPLIPREFGTGEACLPNGTR
LVPYRPRSDVPALGWYADAWRLWVRTEDGQIGEISLEGTPLRWALD
>STH2053 hypothetical protein
MKTTRRNALLTLLTGALAALAVAAGGRGLPGARAAARRLARPAQRQAKVE
MYGYTEVNLS
>STH419 small acid-soluble spore protein
MARRRSLLSDETKMHFARLQGAGDRAQPGDYGQLTSREAGNMVKYAIQAA
QQALAGQTPDPSRIQ
>STH2512 hypothetical protein
MRVTVMTGNRKDAQPSARERARRYRVRRVRRRIEPGEERDVRGPAEAADD
ADPGTPEQDRALQSGCMSMVRVVGLFFVVMILSIIATWFFR
>STH2479 hypothetical protein
MHKNMHTKTQRPAAPPETERPAGKCPQVHVEEHVIEEILRAAGLEELD
>STH2024 conserved hypothetical protein
MLTDETTVFATGQVRSVQTLAVPGVRFVPDRHQVQEAGFAYFAFLDRLAR
PLLRVRFGERGTAAVRLCGITLLSFRPPEVREAPGMASIRFPIAAGVLVQ
RPMRGRGELRFEMHADRLVMAVEGYYAALAGAGGSDVRHWIYERTQAAIH
RRVAARYLDLWLGRLIAARSIKS
>STH2895 hypothetical protein
MTNIVHPRPTDFAHEPLVHWTADGLFAAPWTGRPDTEEPADVEELSARLN
QEPAHREDITLALGHLLAVRAHCRRAASALDHLTCRRHYADALKECESAL
FRLHSCEGADAAALTAWALGVHALIRAQTTLTQDPTEARALAAEALGRLT
GPQTALALGAGADALSALKLQQETVELEDRLLRLPAEVRVMTEAAEQAAE
SRLNALNLAVRALRRRYAQAFAWAVGWGVLNLLLMAAAPINALLLPAVPW
SIPCIVALPILWWATWAVPFRDGLPFFGFVRRLRLEAIADLRAAAGDLEP
PGDRLAAEIGPLLEEGAQAERRLRQFYLFRLPEQYETVWKARAIAEGALA
RLKGDWVADMIDAPAAPLREVMEETERIAALIGVRLAEP
>STH617 hypothetical protein
MEAEPSCPLPAVQGGCGRDASETILLEVVGFEGEDDTVRISGSEVHVRLS
AELRNLVQQRLNEAAVGR
>STH1396 hypothetical protein
MFGFAPVWPAAAPLFAAPVINVWWYELKNVSSGGVVSTGVLDLGFLTSNS
KGNSLTDQGGDATYSATWYGFLSDTDWVDTPYANMAGFQVGG
>STH2780 hypothetical protein
MASMLAMMNRLMRIRRIRFSPFRCLSMKQPVGLPVRCTGSLFASARTSKA
VSGAGLAPLI
>STH820 conserved hypothetical protein
MRIQDLILERQRSLFVGRARELRLLQELVTCPADDWHLLHIHGPGGIGKS
TLLRLAAERIGPDRAILIDSSCGFGRPEDVLARIGQELAARGAMVSEAAD
RSAGSVARALNAHAARQGGILLALDAFEKWALVEEWLREEWIPGLDHRVR
ICTAGRHRLIGPWQAGGWNLLIRYLELPPLSRQEVDEYGRRLGLSDEGVL
EDLRRISGGHPLALSLAGPLLVRRGRLGAGCGEARALMQQLMEAVLADVA
DPLLHRCTEAAAVLLRFDHDLLEATLGEPVPTDRFRALCSLPFVTRCGDC
WMLHDSVRQWARADVRARRPEAYARYRSRAGQAIDRRLGAGPAEQSEWLF
DRLYLSESDFMHGLLFGQEDELEVHGLSASDADEVERMYRRLLRATRRSP
REERLLALIRPLLAAAPEAFGALRRQGRLMAFGSVFPLTGRTVPILAAHP
VTAPVARRYVPGQSRYYLGLAGHDPQGVSGLEALMARDLLRLLPDQGLVL
GQLEENGWDRFFRLIGFERAPWLDAASPGGTSYRGYVLDLRTERLHAKVL
RRLRAEEGAPERGPVRTAPRQPPAKPGRTPAQPPARQANLTAAALAQRLR
RALRHFDRLYARPELVEPLRFLATCAPGEDAADPALQVQEQIRRAIQRLE
DGNEAERWDGQILRMAFLERRGSHEQTAERLHLSVPTYYRHLRLAVHRLA
RELLRAAPG
>STH2129 hypothetical protein
MAHLDDPAADRPKGAAARSRTHASRMPSRPRRIVPPFPARCRRIWVFERY
TPS
>STH2894 hypothetical protein
MRDHSQVGRDSVDGPYLERPDTRAEQYLVGALLAAVCLAAGWLVVRIRST
PGAAHPLEFTGLFLLLALAAYLGHAGYELSTVRYALRDQRFRVAQGRRAV
ELDLRQPLRLHRWLNRWDGSGAAAAELGVAEVEWYPPVALVRTACWVVVG
HDPAGLCRAVAVRPSPRLLALLREMAVPKWGEEVGETADR
>STH324 hypothetical protein
MAVMSDLWGKTPLRVVSMARPKTQVGLTEKPVIPAPGASGPQSILMGTGR
LRRRREIRGLATPAEYDQLEADYEALVVRTVTLADGMVMRAMIASLEADE
VEGTGGSLLSYTMTLVEA
>STH2067 hypothetical protein
MSPGGTSPSPVLRIAHMWGDRAVRNRTAGWMLFSSACLLLGLSLQPVSPN
AVLVGFFAGFACALAAVVSYLLGY
>STH2480 hypothetical protein
MAGTGNKWYQSPAVVGLIITLLGFGLFTLAFWLISIWLLS
>STH2519 conserved domain protein
MLTEVAVPHQLLDDGELTDSARVLWCYLRAIRKQKTRLTWAELRRVTGLS
QYRLKQHMRALDALRWIQLDSLGTREFECCALWRGTRAFTIPVDLVLDRR
IPHGAKWLWAHIRGVGEFDYAQLQHVMRCSRISLRKYVRILSHYGWLVGE
VVRAGRCKVYRFETVNIIDENRHADVQDFYEGLRLARLKERYSVGQYLLK
CIVAVLVVDQLLIENGEVLRLVNPFTGGQLHYDLLLPVSRVALEFQGPQH
YRTTQRYPSDTQLQEQRMRDDIKRRFSETHGITLIEVRADDLSFGTISSL
LRRHNVPIRHSLDDQRHLCDAIEAEAANYRRWAQRLERRLSSG
>STH1553 hypothetical protein
MKAGRVPDGLLSPRCRGHDGSNAGVAPMPAPPVNRGDGRPVFMAKGGSAM
RKIGLGLLILLACAPALYWAPWLSADAAQQRAEASFTSGWTGVVDGCGIN
CQGCGSVGAERVPFGWRVELEYACGLLPADLPEHHRRTVLFVSAFGTVHR
VNQQ
>STH2579 hypothetical protein
MLAKRSLSSVGRARALHAWGRGFESLSDHQQPQGFTGSGSLAGACALTRL
VGRVPRCGADPMAAWKKASTFRAASSHSLSSACADRFAVIVTVERPRAFC
VTVSETPSRPGVGTAAEGCGMALIVTLPVPLRVRCRCFPPALVPSRGPSQ
PTLRPELRCLVP
>STH1428 hypothetical protein
MASSRIRRQLRQLFRLVRRLDGAGVSENLILAVVFFALSRDDRGTLAAFW
QDDGSGEDDGSGDEDD
>STH246 hypothetical protein
MELLILPLFVLGVAAIVAFWNRAGRWSDDDVGTAVGNALETLNAALGGGE
HLAALEYRRQVHDEGEQAGGAGPQAFFPGVRDDVVVIRPPRGADEGGPPR
PDEAALTAVVEVSGAAVGRVRFRPLGAGAWRLEELSAPPDVYVRTARLLL
AYLFEIEEAGRVELAADVAGLGACGFRDGAVTPETVRPPQHRFSLRLQAD
>STH2396 conserved hypothetical protein
MRLRPGPVPLPREHGAWAMVLTPAVVAVLALGPQPLGLVALLGWLTAYAA
RGPLEVLLGRGASGRAGMASAEPEVAGFWLLALAAAAAALLLPVAALRPA
VLGLLAGALLLACLVFWLAQRGRTRSLLSGFLSVTGLMAGAPLYELAGAG
GMSLRGWALTYACFAFFAGSIFRVKTMARERKRRSFHDLSVAVHFAFLAA
AAYPAVRGWAPPLVPLALVPPLVWSVVCAWRARRRGAARLDRVGWSEVFL
TVLFAGLTVLALRLPAWPPPAAHP
>STH3143 hypothetical protein
MRGYLITLLLIAFLVGSLFRASEVTGMRIFNMFATFGIFLFGVVGSLALT
RLQQRSGQRGVEEALKALEPDWVITDWSAAPGERPDYLLVGPAGLLAVCV
EHSPGQARSARARRRLAGAADRARQCADWVRSELSGWREAADLPVFPLVL
LSRMRAPDDQEEDGVPVLNPESLAGFVANLEPGPALSRPLRYRITRRLRE
GRLPETG
>STH262 hypothetical protein
MAAILYRHGLAVVKAGGLAEINLLAVAGERLVGAVEVKRRYHEAARYPTL
YINLRKVAAMATWSVALGVPAWFVVVYDDQVQAISLADLPGAVGGCRVDG
YRVRHGATALEPVLEVRVNAMRLIGRTEETAEVEIGGPASIPDQSRMVDL
DGQPLLRDQATLDQAVAIAGRLAPLLGGDRVGHFGPISPLDCWAARGDRP
LVLVLTTCLPLPPEWHRVVTLPLRLVLALQATAQLAGLRAVVAVQADTDL
CWLERTSRSHRTG
>STH1916 hypothetical protein
MDFTHPTYVILPIKPPVADQVLRIRERYGHPSPNPVEVTVAGSSGVDVLE
PDQDPAAVVQALERIAASTPPFQTEFGEVRRFPGTHIYWLSLKDEAPFRA
AEELRHSLPAQPLPLPAPLHAAGAGALRGGGGGAPPPASSGHGGLRRDRP
GGGGGAGEPAGGGPHPVDGQAHRRMSPQGHASQADPPRGSAWLHPCACG
>STH2567 hypothetical protein
MPHGDLEDFHQGHLKKGFQNLPKGLPNFPYSTGPTSAPMLCRGLTLC
>STH1374 hypothetical protein
MVFKLRTRNAEFETGTFIPWVASGDATVSDDVACSGFWTARLGPGTGYIQ
QDGVAYPNRSFEVQFRVGAVGGMATSPLVMTLDFYDQPQFGRLFQWEQQR
ERLGGAYLPEPEPQPQGRLLRRFTLTIPAADIPTVDGATPVGCLQLNVDA
GLSPNNTRRFRLTFAKPTAGGADLIVDAIHVIERTVP
>STH596 conserved hypothetical protein
MNENGFVVDGRVTKIHDDLNFEYDPADFMRPWRIRTRETDQVNLLFEPFF
ERVARTDALVVRSEVHQMIGRFSGWVTTEAGERIAIERMVGWAEEHRARW
>STH2931 hypothetical protein
MKARDTIDLYATLADMKAVDYRSVLALTALIDLLVAKGIITREEMAERAS
QLDAAGEKSLPVGG
>STH2763 hypothetical protein
MGSGRGSGQVLQSPVLWGRGLRLGRTRIDGKLMERSGLDEVNLRRARRAF
QQRWLRTPPPERRRLAEAGTVARFRYVIDTLRAAWSPEVWSLWLRYLEHT
DEEIDEMMAGEYPISPYMVRVSSALLGVTVDFLEAGCWPAQDYDGHDIDV
CPQWQFTHA
>STH1123 conserved hypothetical protein
MRRFSRSRPLRGMGRGPVRLRFRRGPRLRLRGGLRAWLGRLSLRLRVRRI
WLRNLILAAAVLYLLIRIGDLLLMQPLKAVAEVEARRLGAAAVNRVVAAQ
VSKSLAGAQVVEYVRDESGRIAAYHVNTPLVNRVASEAAVAVQEELKRLA
ETPVRLPLGALTGSALLSNLGPRLSVQLVPVGSVSITVQQEFKGEGINQT
RHRVWLEAAATMRIILPLTTQEVPLVQELPLADTVIVGPVPNALYGGSLG
GVTLPAGR
>STH472 hypothetical protein
MTPETHLLLCLLLMAVAVIAAMVIMDLPAPAQPPRRRRAKRRACRRGRAH
LRVISPRKAMEI
>STH2324 hypothetical protein
MRSSGGTRRARKPNTFRCCGHQVEKGVIDPVGDRLYYAGRYMRHHVAEAH
GSVFAYLIGLDKLQTSGERGDNLPRELEDDDDGGQ
>STH2888 hypothetical protein
MNLDLRGSTPDQIRGYLESLGGTVRPDGTVAGPDWTATLEAGEHRAFGAV
WPRVVVTFSGDPQSVARVVSGLRLRAMRGGG
>STH1394 hypothetical protein
MRPRHMVTERGVALTIEFHIHMIKVNGIDSSSAFTIGTNLLIGFGSQSKS
VSGGNAVTGDYSSLPSLVNAIDDRDYIDTPAWQVAQAE
>STH939 hypothetical protein
MQVNGGISVGVDLRLIAEARRLLADETPLPFDCGTLCGHRCCTDFGPDEG
IYLIPGELPLFDGTEDFTRWQFHSTREYEFAPSWEERFDAVPFLQCTRLC
DRTKRPFECRTYPLLPYLRPDGELEMRYSFLARGLCPLPERYRLDELDPA
FVEAARRAWALLMEDEAMREHVEWLTAQFDALELPGLDEEGPCGARGGGE
PS
>STH1015 hypothetical protein
MNELQLLFQALASLLPEAILSRVLTYAAITAALVALLRRIPWFEARRQVA
APVAAVVLGQGFGWLAVGFDTAQWGTAVGAGVVIALAAVGGYSGTKNVAQ
HVRKAS
>STH2883 hypothetical protein
MPMQVVRAGSWQRWTVAGAALLLVLVYALQWLAGGGEIELHQVMNYFQVV
SSMVIVTWVGDRRNRPTMLTGSVLRVARAFGWEAIPLAAAEVKPLRTGWQ
VRWNDGREVRRLSLVTPPEFREAVLRAAEQARAAPPGEAPRTVREALLAV
PLAERSAVRGRGAAYLALLTVLPLLSLWLNHPWPLFGLPVLVWFQDRIFA
GTTVVLSDRILWLLGAQGPVDQIPLERVRSVEGVKRNRVLLRLDDPRHPA
LKLFRYHEGVDVLAQIEKAMAGQPLFTPAGEGVSQAAAARAGDDPTAGEA
VSPAGGGDHSHLRALPAPGPVRGAGGRPRPAGKGAAAHVRAAGSP
>STH2400 hypothetical protein
MHGTKGGYGCASICDRLKTGTAFGRRRRKEDPPVSQSVMTERLRVEEAVT
RFFHMCNSHCPGNVAPLLTADVELTAERTASGQEGVHSYLVWLWQTYPDL
TFRVEHVLVDGPAAAAEVTAEAQGERRARCVIFHFRDGLIRRMRWY
>STH862 hypothetical protein
MAVAAVPGHRRRVPFECEVWRGRGLRYRRGVAVAAPKRHEVSYGAHDQAE
RRRSRSSLTWPSVPTRKTPAPSRFVLQRASDVPQQVSCTVRAGALARFQV
SVGAGAGKRSGPSPSSVTAGVGLAPATSSRNAAQVDMGVPRTAELSNPRA
SFDSKRSPNVLVPHMRTGNRMGGRLARSNCAQASPGDSKSRIWPIPTGGS
>STH46 hypothetical protein
MFGSGQERPEHSNRRSMSSTVLCEIERLRAIYRAESPTSHFKRRISLILR
IDNLRFAKACPRPLVRRCYPQFGRSRQNPVMVDHAASL
>STH1405 conserved domain protein
MLCVRLARLIAGFLILPLVLGFGAARPAAAAEAAGVAAPGSGFRLERSEL
GGQEGPAAVSVFDGVAYLLDAEEHRIVAFGAIAGKKMDVIRYPDRYAATD
MAFDGKHFHLLDISRQRYAVLDARGNVVSDSPAPADAAFRQVGTDVRLAE
RTAGTQPLPVHGGLMTMSRTDGFGAISVSVDGGASFAIESRYLLGGADLL
GSNAYGYYFVVEELLDSHEVTAERSVRWYGPDGQLRGIYEVPFWEYAAYP
TRDVAFDPVSGAVYNLIVRDEEIEVRTVTWTDPGAYTGRLARLSRQGEAP
EGRPATEGASAAPPGPEVLAGPALQNRSGRELPGDLSRLAAALAAESPVR
YRILVVASAEGEDLTGYLDRVVAQWGPPEPDMLYLLIFAEQNYNIRFYMG
ADFRTSGVDVDEMFNLVYAYYLLGKRDGDVAGSLARLIGAVNRRMAGECL
VYTPYYSPGLRRSAALDAQTVDALLRCYFAGQEAEAFPDDQRPLDARWDP
GLVRAIGELQADPADGPDAGARLLFEVPFDVLPAAGADSVWAKQGGEPGA
DRWITGLRWRVTVAQEGKFWRLERFGPYQQDGTVGR
>STH2377 hypothetical protein, glycine-rich
MRSWLAPSAPPVQHAPPDAGEHQCGSRDHQAAGSHHPARGGQLGDGRPRG
FSGSLRRGRRFGLAGRLRQRGAYNRSGWLLCRRKLRGDGFHGFCGLLRLP
GLRLHWLRIAGRGGAGRPAARWVGVTGILRPAGVPRLCGIPRITGIAGIT
RISRVAGVAGLFRGARILRLARFLRCAGILRFAGLTGRPGLPGLPRPTQL
AHLLRDTRFHGIIRVAGILGRARVLRIARLPGVAGVPGVARVPGPAPTTA
AGRVHLSGLTGIHRVIRTTWGTGVAGVPGWAGLPGASGPPGFRPVTVGSP
PVTPASPLAGPCVRRQGL
>STH1667 hypothetical protein
MTLGVTGVQRPAEGLDWREERDREAVEAFQARHRRLADAVERVLMRLVVL
GLVILALAQMVGMDRFGLLAALEGTPVHEVTDWSRSLARTANIPYGR
>STH2764 hypothetical protein
MGVTTGQEVKGVPAVLFLHGLQWLLAYLSLDLVFRTVDVRPGLLARSRAR
CRRRYVLAAAAATLVIGAAQVWTAGGAGAAAAGTAGLGWLPWWLLWRFGW
RKQFPHLYQEPPEGKRV
>STH1017 hypothetical protein
MLSLVLIRLRGQDRVDPADWARIRMPDCRVVDTDAMQVVQPFGAWGG
>STH1461 hypothetical protein
MARLDALFFQHIPPLRAAAVALPEGGVLFSDDGFADFLRTGRPTSDPAGP
QVLSTGVAWEELRSAWSRVLPLEPFAHDGWLAHLQTYAGLDWSGLDALGL
DL
>STH279 hypothetical protein
MEIASTSSLFRSVSGAPNRRYSVCIYKSHIWSTRRALKAANPPSNNEIRP
TARKMVRVMPRAPCLINGLPSLRRPMTDDNSAKDTVTATKFAFAFSSIPF
LRSFSEACNLSNVPRSISTYLSLSGGSSPVSIIGRISSTRRRCATQAPAY
GRISSSIMSINLLTNDN
>STH2531 hypothetical protein
MHRPEVGRHYKLVRDFEIHDPDTGTLKHLIRKGEVVKVRKIEEEKDLIYL
EGISVPAFFRAFQMHIVPAEE
>STH3104 hypothetical protein
MLRQTHRSYYAAFRSSHWLARLNLIYLAVVPAFLVGSLIYLVWFYLTV
>STH2612 conserved hypothetical protein
MRAPSRLALWALCAALLIGVLGDQLILHAPDLGMGGFLWVAALAAATVVL
ARWVPEAETPAAESAPALQVERGRHGSLQGEGRWLLAAAVVFAACLAWRA
SPVLQFLNLLAIGLSLGLAALTLRDGGLSRAGILTYPLGLVLSAAHHAAG
PFWLLFSDVDWKGLSRGPGSRRAAAALRGLVIAVPLLLLFGSLLAAADAV
FAALLDRAFVWRSGSWLLHLFGTLFFGFLAAGFLRTGLIRQPGGLPRLDA
SGVGERGSFALGPVETGVVLGLLNLLFLAFVLVQIRYLFGGADLVMATAG
LTYAEYARSGFFELVQVTALVLPVLLALHWALPEEDRAAHRLFRRLGTPL
IALLFSVMASAVQRMWLYVLEYGLTELRLYATAFMVWMAFLFVWFLATVM
RGARERFAAGALAGGLAVLLLLHAVNPDALIARINVARAVETGRFDAGYA
ASLGPDALPVLVEALDRLPDPARAELEARLEERIHRPPSADWRAWNWGRA
RAAQVASSLRTE
>STH2073 hypothetical protein
MAVGRLPPDVEAARRRPHAATVRSAPACVAASEPHTTSHGRGAIRPFPAL
VTGERPRVAVLNPETGQAQDPTPCA
>STH1778 conserved hypothetical protein
MNKRRGDEPSARTIQLLVSYAIRYPEIATVRYDPGRQALRLSFLVKGPLT
DEEFSAAAGRIRETLEVYHLMNQREADLIEVRRTAFGELNTIAVERDVHS
LSPEELYTVVAVVRSLFQQRLLTEPMEYFGEEELMAQDELIEEVMASLSN
QRQQPNLIAIRENGRLMIFHK
>STH2239 hypothetical protein
MIGTDGFEGGRAVRLALDRNDNYRPYLLWTVDGPAGFAGNFTSDGTNTYW
LDTRGHLWGASLISGSEPESWGTYSIPLPALIGAKAAFTNTEPAVEVRQT
PDGPETHLYVTLRNYSDTGVGLRDGPTGSDGAVVAIGPSGELKWYRKFGP
AERLDGRMASLNTAPLVLGSRGALLFGDVNGFFYSYSLDNGNPSGGSARA
ALIHPESRTPVDRLFLLKAGERPNVGPYNFSQVSGVGVDPAFAHGLLLVG
VNFETDGGTSGRLVAFRTGESYDLRWLDAPAPLELTPGTPVALEPRLQLE
MTTRTLAALCPGPWTVRWFVTDADGQLVRPLGEVPLPGELGPQQPYPVPL
TVTLAEGDPAEGWIVGVIDLPTVYALSTAHTSNPKVALARGMAAAQGLPA
EKTCAGAVAELIEREGEPGPEGGLANNVLIVPYTIRQPAQVVVDDPSVVD
LAVPSMADAGRPFEVGIYLGYQNNLGRSAIQVPLRLWARPESGGSAPAGG
WQAVTIDRVPCCTLKTGTIPGLSAGTWEIVAEIDYPEDTRPENNRVIRRV
EVLSSKPVEAGGPEGGAITD
>STH3242 hypothetical protein
MAHRPASWFRNVAVAGLVVALAMVPAIAALLHGALPLWLHIPLTVLALSL
LALGSAAAGVAAVEFRQRQQPAKSGE
>STH2455 hypothetical protein
MKVVVVLGTLLVAVYTLNYARWAWRRQLRFGAAGLVLLAVATVAVPAWIM
WFLN
>STH403 hypothetical protein, glutamate-rich
MSGRSASGRRCEAGYASVYVALVVSLVLVPLTLIVIDFTTLAYWRAKLQG
TADALAIGAVLETREFHIPIPTEFVGYFPVNGFLINALHLSNLKPGGVHT
RVEPKLERLVELNRDRMGAQVEVRAGDARVLPAGNFLSPFMYVHIPVEAE
VPLLTPLLGTLLGSDHPNRVTLRAESCAAAWYRPDAWVHRWWDADPETLV
KTIDAVINVTDEPQKYYRLINCVDEAADVLSLLEIVAREHLARDPEAREF
LEQWMGNRPQPREMQRARKGLERLPERCGPDDPCVDGSRESIEEWARSTV
ERRRAEEEEAKKREEAARRAAGKDAERNAAGGDDP
>STH446 conserved hypothetical protein
MRPQNGDDPDRFRVENILGGKLYVSPSGPPSGTDQPLADGAYVSLADGEG
NLWFLPDPDLNSRPGSGTEFGFQVRSVYDQVLSDPTPVSITVRPVDDPPV
GGPDRVGPFREDSQNNAIAVGDLLKNDAPGPADEAAYDGQTVSLVGVAAG
VHGTVTLQGNQILFTPDPDFHGTATFTYTIRDSAGLEADVPVEVEVLPVA
DTPSVGGGTTREDEPLEGIAVARSDKDGPEVTHFYITGVQGGDVYPPGSS
QPVADGSFIRAADVGAGLTFVPAPDAFGESGFGFTVHGAVGESLDFLSPE
GARAVIRVIPVNDPPVAVDDDLDALIPQRAVEDGDPITFPAALLTANDSA
GPANEAGQGLRVVGVSPVEGVAVSLDEAAQEITFIPDQDFHGTASFTYIV
EDDGTTDDQSDPQRSAPATVRLTVEPRADLPEVPQVVTTAEDALSEPIRL
AKNANDGDEIAAFRIRNVEGGTLYHGDGKTPLILADGVATIPADQGEAYV
RFLPAPDRHSALGDTFSFQVQAVAPAYSSESSSPSPKGRRAWSSYPTRTA
TARRGIPSASW
>STH1099 conserved hypothetical protein
MLIDNKGRLFGKINLLDLVVLLGILAVAGRFVYGALTGPAATPTGQDQVI
EMTLRIPAVTQWTIDAIQVGDEVYDSKSNTRMGQIVDAWWEPAVVVREMP
DGIVPHESDTHFDLYVTVRGPARVSPNGVTMSGIEVKVGRSNQYKSAFWA
ATGTTVAFDLNPPER
>STH1193 hypothetical protein
MATVCRVDTLYHYLMGGDESLPGILDKGLLPASAFPESERWQRFESFYRS
LYAQWAEPLLGPFRNSGVYFTPIDFRLLPGTYLHRRTRIAVPLSEIPLEQ
AVLTYELDGRRTVVPLTAEALEEAAALWTADLVRAWYARNPNMSFYYVPQ
VAVFPDGGIAVHTAWVERAPAEA
>STH1125 hypothetical protein
MNLAMHCHPDTAYRIAEMRSLEYNPHLAVHRHAIEEMLRARRAEAALRRR
FFLRFWLRSILFRLAPGLARRLAL
>STH2223 conserved hypothetical protein
MSAREAFAAMREAGLEPVLAAARQKVYEADGLRGRVRLTLGAAQLDELAN
LLGRVSVRDGQVLVPLQDLDAALTRSRFRTGLLDVLEAGYGPIVTRRQER
EAAASRWASWLKQVGDALPPVDGVRAWFERVVAGESPSARWVRRVYRTDA
EQAARAVAAVGTALARLPVDRGEHELLAVFAASLAGDPHAFDEKEPAGAL
LEHALRERFGPPPEELRPHEARAFLLDQAGLGADQVSSTVLVAHLAGARW
APAPGASHPMVAAMTACTGAWAVTLGELRRWSAARAHRGRAYVVENPPVF
EWLLRRLAGAPPERRATLICTGGFLSAAGFRLLDLLAAEGTEIWYGGDFD
RSGITIAVGLAARYGGLFRLWRFGPEDYRAAALGPGGQPLSEADRAALLS
VTGPLAPTARAVATGGVTAYQERLVEVLAQDLMEQ
>STH2845 hypothetical protein
MMAPLSVVLRRNRYLSEAFLLAFGVCAAIGMALIFVAGLLALGT
>STH1055 conserved hypothetical protein
MTETELRELTAGFVPWNQWVRPVAESKVALVATAGVYLKHGLQEPYDDGR
PGGDPSFREFPVVVRYEDLAVAGPLPAPAREDLNLVFPLERLRAMAESRA
IDAVAPFAYSFSGTITDPVPLLAGYGPSVAYRIRRMGADVALVCAAGGTG
RLTAALVARLIELAGVPTVVLDGEADGLRGLGIPRGVAIRRPAAPGDAEG
QQRLLRAALEAAWGLHEPGVVEL
>STH675 hypothetical protein
MGIRIAGNLLTADGMDCTRPSPLAPWRLTDDGHGLPGNGLYKRSATVYG
>STH567 conserved hypothetical protein
MLRIGLALPSLSPTLSGGHGPLMISGFLGTVIGLERAVAMGRPWMFAAPA
LAAAGGLLILCGAPAAAGALSMALSSAALILLFTRIVRVQPTLFNYVLTL
AAAVWLGGNLSWLFGTPITHVVPAWAGFLVLTIAGERLELARLVRISRGG
TAAFLAVLAVVVAGAALTPFHYSLGWRVTGLGLAALSLWLLRYDIARRTV
RTRGLTRFIALCMLPGYFWLLVGGLIPLRYGFIYGGIYDAALHAVFLGFV
FSMIFGHAPVIFPAVLGVRMAFTPRFYSHLVLLHLSVLLRVGAGLAEWLP
GRQWAVVLNVAAVLLFLLNTVRSVRRKPAQLR
>STH2001 conserved domain protein
MRRASVATLLAATAAVFVACTSPPPPPGTSMEVEELSAQVAELTAVNQRL
EAENRELEARVAELDAKVKELEFRNQNLADRLSPEEREVNLINPRFPPAV
GGEPGWEYHQVLSADLDNDGVEERVSVTTNAFWMEDRKEFGWDDGHPWHV
YVEEPDGTRTYLFSDWVQLGKLDVILDREGPGVFIVYRRDGGMIIYRATY
QGPGQFRTVRSYQIPLSYSATWANPDMFR
>STH2142 hypothetical protein
MEHGGGKTLAISREDVVSKLQALVESRGPWYIERLQVSDSEGNRYTPEVG
EDGSVELVLRSGPSRRRR
>STH665 conserved hypothetical protein
MIRLSGHPDRLSLRQALAEAHVVREVCDPSPLVVVAIHRLLMALIYRVYR
PVTRADWAALWNAGRFDPGPLDGYGAFWMDRFELFHPERPFYQVPFIDGE
KVHPISALVLEAASGNNPTLFDHGRVEGGVALPPDRAACHLLAHQLFALG
GGVSKPFNRMDAPLTKGLVVEALDTNLFRTLLLNTLPLEDWERLIPPTDD
DAPFWEGDDPPEPVREGTPVKGPLHYLTWQSRQLHLCTDEESGLVTGCQI
RQRYALPKDGVRLDPGKVYQQSPKEGFVPFKLNKERAVWQYTHVLLQTSG
QDYSRPYLTDWLATMHRFRSRYGIAFPSRVILAVTGLTTDPQKAAKVELW
RRERLPLPMTILDQPELMAEVEEMLAEARRVEGLLSRTAQALVWASAERK
ALGDAVTYTWTGKLPPGKKLDQVKGLARSLGMVARYWPQLEEPFRRSIED
LAVKSAGEVRSAWREAVMMAARDAFRSGRDGLLHTEASFEVLTCVGSAFH
GKLSRIFAAAGEEETESDEGAIE
>STH2041 conserved domain protein
MISVRGLRPAIWSGLLILMLALLYPRSTRVQTAGAWETTGTFEVPAVAAD
GTAFSYTMRGHEGQFGFIDSPFVAGKTDKYMWHFWGNQEDFLYKTLEVTG
VNRRGQRVPVLTTTLGGEHNGAVAHTPSMMSLPTPGLWRLEVRVDGNHLG
DIVVEVLPSA
>STH1938 conserved hypothetical protein
MKAGHHCAIITPPIPCGMGGYAARSGPAEGVHDPLFARALVLEAGGERLG
IITCDILHLERPVVEAARARAAELTGIPPERVMLLASHTHSGPSPGDWST
HPSPPEYRAWLPLQLAGCLLQATRQMAEVDLAWAEAPVEGLGKNRTDPDL
PFDPTLRLLALLAGGRPRAVLLNYGCHPTVMGPENRLISADWPGAAVAAL
RRALGDDVWVGFAQGCAGDVSARFTRREQSFAEVERHGWLLAGAALTALG
RLGGPEGGSLSRPLRLGARSRVVRLEARRLPSAEEAALQVESARARVAEL
EAAGASHAELRIAQTALQGAELALRYATEGIPVDLNCEVQAFAIGDAALV
SLAGEPFSALGQAIRSRSPFAVTLVAGYGNGYCGYIPDRAAFARGGYEAL
SAPSEPGSGERLVDAAADLLSELKAEVAGA
>STH905 conserved domain protein
MEALANARLHQKRPLDVRVPDLLWRDRGLTEGARYLWCFLWMTRTTCTKY
TFAELRRATGLSQHSLLRHLEALSRKLWLQYTRSGRTVEVQPLWPRSCRY
IVLTDDLLLDRSLPHAARWVWGVIRRLGRRFSYQKLIQLTGYCHNSLTKY
LRVLQEKGHLTGTTCRVGRRKAFDLTASNPAESRRLAALAKFEKSKSLVQ
QWRAYSFGQFLLARMVEMLTGTVLLENGAASCLDNPETGARMQFDLFLPK
YSVALEYQGPQHSRVTQRFPDAAELQRQQQRDRLKRQLSEAAGIRLIEVH
PPDLSFRRLAELLREAGVPLRDLPDEERYVYQALLRHSQRYRAAVRQEAA
V
>STH61 putative S-layer associated protein
MRSPVCRVGLIALLILLIPGAALAYLSDVAGHWSAVWLSALEARGIIAGD
RAGRFYPDDPLTRAELARIAVTGLGYGTEALALQGVPSRFTDVAPDDPLA
GYVELLWELGLTEGYPDGRFRPEAPLTRVELAAMLVRAAGLEQRAAVLRS
ARLSYRDAGEIPAWAVGAVAAAAEAGLMSGDGDGYYRPWALVTRAEGAAA
VAKWLDQYGLLFDLEGTLVAFDPREGEGTVRTPMGEEHSISLAVDAAILR
GGERVLPPALQVADQVSLILNDAGQVRFVEAWYADLLAEGLEPAGGNGVW
ATLGTGQRRRLWVQPGALVFVNGRPATLAEALGQGPTYMALDVKTEEVRY
LDAVRTPFGGRVAVVEDDRIWLEDGDGFLRYDLASDAILVDAGRRVEFYE
IAEGDQVAFALDGDGRITYLEIVR
>STH1128 transposase-like protein
MVGVWPLPIHHVEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRL
NLVEILNRHLRIKQKTYAHTPVDKVVEALVAILGNCRYMKDLNFDPEPLV
ADPAVAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQE
VAAVAGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIA
AAFLSGKQQRFAIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEW
VEACLAQQKARVRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQ
RLRQYRQENRTNLAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGS
NVAYKHLFDAVPAENWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRW
DADGREVRSVILTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGT
PRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAA
RCRATIRCLGDTLRLVFSRGTLPWLELRLP
>STH1048 hypothetical protein
MEQDGRRAPGRDGGGRLTEQELYFVTRALDAELLALQKNDHYAEEMADEN
ARRLMQEHADIHHRRVRLLLGLLDAPDDITTHAKLLLQSGEGVQAGAHD
>STH2070 conserved hypothetical protein
MQLPPWFMPGWPVPLAQRPPWGRPDWNNNDWNDRDDDDRDDGRDVEWNLR
QEGGPLSRTGWSNWRRIGAGTTLDSPAPATYQNDLHLFVRGTDSGIYWNR
MSRRTGWGEWWEVPGQGRTPSAPAATAYRGSLHLFVRGTDNRIYQNRLTG
NRWSRWTEVPGGGLTTSSPAVTVHQGALYLAVRGTDNAVYVQRYDGQWQG
WQRVPTGATSDAPAIVSYMGALWLFVRGTDNRIYYAVMRAPNQWSGWTEV
PGGGLTPSGPGAVVYRGNLYVVVRGTNDGIYLNVRDPRGQWSGWTEVPGQ
GRTPSGPSGVRFADRLYLFVRGTDNGIYTTVRTRR
>STH622 hypothetical protein
MRERLYRAGYPATAAGWTGFLALMAFALALTVVATGVNGLALIGLVGLTW
LLNVATVHWAVPWRLVCHVFGWVVLYRSVVPDRAWYLVVPAALLLAGSII
WEQERRLARA
>STH1906 hypothetical protein
MLLEWRRFEDGQTAVGIQAFVRRSGTFAHDRLYGWGLKHAETTPAHRASF
TEDGALLVEWDMGDPARHTRIRRYALNLEQGSVAIVEERFVPEGSDLVYP
EAPEAVLHAAQIAAAYRLEDELPRYFASTDVAGAFLEAIPPWPYEVTGVE
LASVTAFDEYCVPTTEPAQPDADGKAPFVVWLAGQINLIVKVGTVWFETD
DAGRTIIRDFRLDEGCA
>STH2354 hypothetical protein
MTADLNAVRLEAALYFQRNPFARETPESLALRLGRPETLVEPALGQLVEL
QVLERRGRLYRYCRPYVAPAWDEAPSTR
>STH2859 hypothetical protein
MTSLGSIIVGALGGLIVTVVVMYFWPRSRNRHLGALLAAVLGVAFAFVVM
TQVRI
>STH2789 hypothetical protein
MASVYALAADLFFAERLERALRDLGHKPEVRDLSAGGSGQAPGNAAITQV
SLPGGVDLAIVDLEAGEAALALVRAARERGIPVLAFGAHVDVEARRAAEG
LGARVVTKSRLTRSFTDLVAAMLGPSV
>STH806 hypothetical protein
MTEQPWIASFVVRLTGPPLRVLVRHVQSGQEMRCLRLEEALAFMEHCLQL
SGKTEGE
>STH3329 hypothetical protein
MDTEPFCSYRTGLLRQAPSPGAPVGDVPRGTTRKVVGPQAEKYTRPAVMQ
GTRTGSDRTPSGLMRRFSTDVPRGTLPPIERADMPPQHRPALLVKRRQVA
RVTRHMVLTVRHDTPRGRRFTGMGDCPDLADNRRSVTGRTGTLPARGLAK
ALTTCRLRMARRKMTP
>STH639 hypothetical protein
MGTRQRRPGTGTRRLRRGRPWMRGRGRRRTGWLAALVGRLRRRWRSFWHW
EVLHALRERYPQLSRTRTLSPGELRRFPQRTWP
>STH928 hypothetical protein
MSWKSVRDLLPLVVLHVAAAAVHAASAVGLSGVRPGAAFEVALAFVALVV
PMAAVLALAAGMRRVGAALLVLSYLGVAGLTLVGHLGLNLLVNALDSAPS
AFRTAFFASGLVMPVIQVCGVVEALRTWGLIAEPRRNRLSVK
>STH1840 hypothetical protein
MEAPEVKIEVECAHCGNRKLVQLPGQWTPMPLNENNQPFPGPAMPVVLLG
CTRCGYVQMFSPQAIKPKPFPERPAQQQGAQQDSGAK
>STH1800 hypothetical protein
MGNGPPPLWQDSPRQCGKGFRGWVKMGWLIAVGCLAAAMALLWLAPVQLH
LRLAQANLKAEVFVELRVGWLRFARAITLGEKAPRVRARTRRRPGAKKLD
VRDLLAASVQGLRYLGRATWCERLRLRVEVGGLDACDSALLAGLGWSLVC
NLLAQFGRLVQVNPAGVAVAVVPNFQRPLLRTDLDCILRVPLGKATWAAG
MVLRAALRRRALLARIRERQRRKGDRTDGRSPDSSPHEDGDGQPEEHGGR
EHGHR
>STH1578 hypothetical protein
MLLTDVALMEHRITLVGALGGLLAHAGRRLEPWYLAGVTGHAFRLTLDLV
ISPSAPHELNFHEVFPLWERLGVWFKRVAARPRDPGYADVRAEVIERVVQ
AVQRGRPAIVYDLLGCEEYGLVVGVEGDRWACLTPGAPTEPQWMEVASWP
PADHAAFTRAEVIALLDLTPDFDRRAAEVASLRFAVDHFWAPASRDMWLQ
HGRQAYQFWRTTLAATGMPLHGPEPGRGHSYNLAVLHAARSDAAAYLADL
AGRYPEADALQRAVGAYRRVADALAEASALAPYPGAPLAERRAEVAACLD
RALAAETEGIEAIERALRALR
>STH2806 hypothetical protein
MADVQSYQKCLTDYCPLPHYRNWVPTPGGPGWQPGPTEGEGLGMAGFLVA
LLWLASILIFLWIWDDATRRHGQNIGCLWALAVLALGPIAIIAYFLFGRS
EN
>STH1378 conserved hypothetical protein
MPEERLQPAQLEPICIRTTQVYDWCYLNGTTSVLLPDIVFPADTSVVAGI
EAAIISVECAEIDRDQDPSGLANVTIRRLVTFELTFVDVNGNPVPVIVNG
VEVTTQTRSRFFDEFIRVCAPDGTTLTCEITDAAARAFMTVINGTPAVQV
ELFICQNIQVTADVTVSININDYCLPDACQPGQKPFQCPPPTTPPPACGP
LTQGRARKAGSSQG
>STH1282 conserved hypothetical protein
MLTFLLWLLLLVIAPPVALVALLLYPLVWLILLPFRIVGIAVDGVLRLVW
AVVTLPARILTRRW
>STH209 hypothetical protein
MFMCKYCLEQFEDERLAYILFPESRKNHPAADAFALKFCSRAHLVAFLQH
ISHQHQPYSLTRVAGNSRETFPAAPPLDLLHQMSQIA
>STH2955 hypothetical protein
MEGQELFAGGGEPVVYLPTEAGTAIAPDGRKLVFFSVPALDVMIKQVLAA
QPREYTYRWGYHPGERLHVLLFGWPTGHGAGLAIPEGVGDAILNFMQGTT
DVYITAAPVGDKLRGPVTPEVIDELRFGMTVYLPEVKFKPEGWT
>STH2167 hypothetical protein
MHGGVRGFLITLGIWVFLTLTHMTIEWIGGRSPLAMFAREPRARYIAITV
AVVLALGTAIGVASDRVRRRAAQAAGAAGGKEESAWPG
>STH1433 spore protease
MQARDYLSPFTDLALEATAAARGDARTELPGVVMREERYDVATVSWVEVM
PGVGEQTIGKPAGNYVTIDAPQLRRRNRDLQEQVGKILVEQLNKLLHLKE
DDTVLVVGLGNWNATPDALGPRVTGKLMVTRHLREFVPADVAGGLRAVAA
IAPGVLGTTGIETAEVILGIVERIKPAVVICIDALAARSVERIGTTIQIA
DTGIAPGGGIGNKRQALNRESLGVDVIAIGVPTVVHATTIVQDAFEAMAQ
SFAGTKPFFQLFNSREERQKLLSEVLSPTVGELVVTPKEIDELTEDMAKL
IAGSLNAVLHTSVTAQDLLRYIN
>STH497 hypothetical protein, glycine-rich
MNWDDMGTMPYGGTAMPMGMDGMPYGKWNMPYGKGGMPFGKGGMPFGKGG
MPFGKGGMPFDKGMPFGKGGMPFGKGGFGKMPGWGGVPGGGFAKQYPPGM
VGIPYGFYPGYGLVQ
>STH2624 hypothetical protein
MNRLRVCAWCGSAIAEEQEEHRNLCPECSQVDELEAATPLYPVGERQPFQ
D
>STH2048 hypothetical protein
MRVLLWFLGRGACACARLDSRVAAEVAGWAEGQRLCLRIGSAGPAMALAK
EGGRLRFLGLRTVPEADLVIHFKHLEGALPVFLGQQSIAGAYAQRRMTLR
GDLTFAMSAVRVLLITEAYLFPGIITRRIMQRVPRREVSMGRVYLRTLLP
T
>STH280 conserved hypothetical protein
MSGGSSGWSAPRHWKLGILYYHYPEDRRMVVPRRRGMGMTLNSAHKHTWF
FVGALFLPLVTVLLLILVSTAAGEISVP
>STH3145 hypothetical protein
MFDLAYLLGYASPIPTPWYHGVRVSEATELLIVLTLFLGVLAAVYFIQRW
IGDY
>STH1905 hypothetical protein
MNRRHRVHPRILPWLLIAALLVSGCSAGGPDPSAAAPTDRTENPGQTADP
QMPEDPAGPDSAADDPPVAGAVPADLDGDGAPEWIRGLTDTWTADPVEIY
DAQGEKLLTWETFG
>STH1753 conserved hypothetical protein
MIAEARACSEQLAFGLRTIQGAIRDLTPEQLAFVPPGLANSIATLVLHVA
ATEVVVAEQVLGRRAAEDLRRAVLLDRYTPAPGAPIPAADPGETAESLIG
KLERARAEVLGALEALTPADLERTFPFRPGGRPQPLTFFLQLLPFHIASH
YGQIALIKRFLRQDT
>STH201 conserved hypothetical protein
MLIKTEEAWFRLARYLNAADVLPAELLAEVQRYAAGEQLYIPRPQERLPW
GTRTGARAQLALRNEQIRQRRREGASIEELMREFHLSYDSIRKILSGKNG
SNCSRG
>STH306 transposase-like protein
MAAVAGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIA
AAFLSGKQQRFAIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEW
VEACLAQQKARVRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQ
RLRQYRQENRTNLAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGS
NVAYKHLFDAVPAENWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRW
DADGREVRSVILTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGT
PRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAA
RCRATIRCLGDTLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSR
ET
>STH1309 conserved domain protein
MKGRDEDMGSPLEREFHEAMVSICERATRECGYNPRDLRRMIFDHGGLGA
ARLLLRTDRVSDGFVRLWELKRLDLTVEAHVLLPRFAALFTEEEHRIARR
RLQDYGYPVPEPRPAATRPSTQAGEEAVPPAAEAVATSQMATGGSRHHLQ
RLVGDNPELLNCLLLSASESLRAFAGGHPHWVWPLATNGYRENRDVTYLD
ALGLADLTPRLAEFWPSDGPVWDGLAQVSGRHGIRGVLLLEARSHVQEIT
GPGYQVGRESLERMQQRLEQVKAVLGADPAADWVGKYYQAASRIAHLYFL
NILAQVPAWLVNLHFVGDREQSGPQTVAEWEVSFKSLDTALGLPPGHLLA
GRIITAFLPVVV
>STH858 hypothetical protein
MMEPVDAAGDAGRAALPPVQTDPVTSAEEALARIPPAALEGRTFDALLVE
DYGHRSPDRQWHERECSTWIVQILHPSSGVTFLIDAATGELNAFVQSEAQ
APAKLPKSAQDAVDLVFEQWGHLPYIRQFPSAIGSSESPLRLPDESEIDR
FSALSTTAATADRRGEYLVTPPRTGRKTAAKDRPGGSSPCIPPVSSRRPP
>STH666 conserved hypothetical protein
MKGPSSRQVEFAGWLSGLDDRGTLAALRRGLMLEEEQLFELFGYVPPRFL
TGLRPGEERLYLMVAALYAYHPVSFGEEELAERRRNLGESLRRLAEEKAR
QRGGLDEGEELLPESLKRRMEALLSAPRADLFGHLRQVISLLKSEEIPVD
WAQLLSDLQRWEAPDRRVQWAWSRSFYVGYQTEGGDETDVR
>STH1240 conserved hypothetical protein
MPRRWRVKRRAEKGAAGPAHTAAGESLAQMVERLSIALERSNVLDYTVLL
QHPWRMLWINFVGGMARGLGIGIGFTVLSAILLYILRGLMMANLPFIGDL
IATIVRLVEQQLRP
>STH1819 hypothetical protein
MREPVTLSREEAVAATLDQLGNRNLQMVDATYTGEYEWRGNRCRAWALRL
STDIGEEVTVTVAVHACTGDLLSP
>STH947 hypothetical protein
MQKAGTAYAAVPAFCLAAPALPSRYLSPMSSAVSDQPPGAELVLRHRRVG
VGYGLWVPAAHRLPGRGSPFARAVGRSGRDGLRTGGFPVGRTAPIRRAHA
QGLQGPGAFVRNRPAAGRPADSVPRSHHDHVAVGHHGLMCVRPFPGPRHA
RLTSWCSVCPSGSVPLPRARPAIRLPRGLTLTFFDLWC
>STH1181 spore coat protein
MKRILLSLATVGALSALVTGATFALFTASTENTNNTFTAGTVRFGKDFNA
SCDLSADNIAPGDSGSCSYTITYVGSLEAFVGVEHSVSGDLFSGDNPMTY
TVNGKSDSGTILLGKASNGDTVTADVQWSFPLAAGDEYEGKSGTISLTFK
AVQVRNNEDASGNPISWNEDGT
>STH2418 hypothetical protein
MKVRRSWVAVAVPSGLLQVRSSRLRARGTTNFHLAGGRSGSSPWVDRRCR
AERVTSGMKGTPGGAKCDGFSLGWLSK
>STH2655 conserved hypothetical protein
MSLAYGAGRFVACADGGLSASTDGLTWTPVPLDARYWHIDLSCTGKEFVA
TGCITSRTCEVILTSADGVSRYQRVGLAAFDQVAYGNGRYVLSFGNAAWT
WSSADLTNWTTVNTLLFLALTFDQGMFVAADADEGTIRTSADGVEWTEVF
HPDQPVYISDLAYGGGRFVAVGGTNSGPRRAVVLTSGDGRQWSLQVQEEA
EGLVRVVYGAGRFVANSEAALAVSEDGTGCRVQVLRGDEVRARRI
>STH2794 hypothetical protein
MNAKGCWDDMTLDLFRMGLLEEAEAEAVRGHVQRCAACQVRLGALDLEDL
RLEAALALTREEAKALAAADLPGRLAARVSVGRNGARALAVGLSGLGLAA
VAGLQGAVLPWLQGLPLGGWMTAGHWAPELAGRLLRSARSFLFAPPTSRH
NVLVPLLAAGLVIILLNLRHRRRRLSPFHH
>STH1027 hypothetical protein
MKMEFVEKELKSWGEIIITTSGGEKFEIHLGDTEFDYENRVIRLKSPNAH
YVIEGDSVESITKHYGHPDND
>STH269 hypothetical protein
MPVYARNPREAAEIKRLAESEYLYCVPRGPLAYYVGTDPALIDPLYWYTI
QTAQHTV
>STH2317 hypothetical protein
MTMPFGWGRDDDWWDDDWDGMGRGNPWRRRRRRRRFPSFGKFPFQKFNKF
PWGKSPWGKWGKRW
>STH327 transposase-like protein
MEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRLNLAEILNRHLR
IKQKTYAHTPVDKVVEALVAILGNCRYMKDLNFDPEPLVADPAVAQAWGQ
ERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQEVAAVAGPDRSG
MVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIAAAFLSGKQQRF
AIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEWVEACLAQQKAR
VRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQRLRQYRQENRT
NLAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGSNVAYKHLFDAV
PAENWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRWDADGREVRSVI
LTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGTPRLRKYEANAA
FTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAARCRATIRCLGD
TLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSRET
>STH1133 hypothetical protein
MTFETFPQSTETIIDQVDNVKLHVRSHQSYTEFWSESDSVLLVEGRGWSP
VLAADGSVIFVDSEDLSLKKLLINGDEQQLLPPGSAAGFLRISSDRKYLG
YAIPINLSPDSNWAGLYGAGIMDLETGKVLVSRTKENFFTSPIGWYGDQL
IVHEWDATQTQDSLDLYALSLDGELTKFALGLPQANSYPEISPDHQWLVY
EDQDGNTILVNLEQVSFGIISNVVLPQWTSEGLTGLVDGQRFLIRFTIE
>STH178 hypothetical protein, glycine-rich
MRWEVCSMPQWLMRAVEPIVAHGLREMRTEGADREHILREVALMSALVGT
GMPAVQAIRLVEAMEPQLLGMPRGEAREPEHAGYPDMGGWGMGFPGMGFP
GMGYPGMGYPGMGYPGMGYPGMGYPGMGGWGKDGFGKGMMMGPWGKMPGT
GPMGWPQGGQNYMQGTQPQ
>STH973 hypothetical protein
MYWLIMAHNVMRWVILVAAVATLAGALAAGKKAADGWAGRAAQAYTVALD
VQVLIGLVIWLLRSGWNHDAFLAFIHPGTMILAMLVAHFGRTLQKRSVPV
GGFVAFLVSLVLVIAAIPRWAWPV
>STH2347 conserved hypothetical protein
MRLRDLASTIRSKNAGVDLITFDILFKTREAYERAKACPALSREGIARLF
RIPGDRIYSYVTFDPALAIKFTLRRQRPSGSAGEHDLFGAQQYAPLLDVE
VS
>STH2671 hypothetical protein
MDVRFSSGQEGFAAHPPNSLRRQTRRSGDMAKGALTQKDLMIWVMNHPNG
TKYYYLIEREPFEREGIKKAVHRMGSTGDAVFLYRDQAEAELAKFPPGDE
EPEGAAG
>STH782 conserved domain protein, glutamate-rich
MSSIRLQEIMTRAVVGRCDRRVVWTHTAPADGADCVLGVHVGNAQLTVEP
GPSGPEVRLTAVCEVWCGAGDGTRVERITCTHTEPADVPLVARVVGETET
TGRLLRGARCLEAEVREGLFHLTIEMHIALEAVGTARLWVKAFDLEEEGA
ADLDAFGSGTSDSSQSVEEPVADWEGQTEAETWAEPETWAEPEEWTGTAP
DGEPEEGTERAGKAEGQADPEPEPLAEPEPPSQADGEPVAWDAFRTDMAA
VASVQDRESRRTSTAVTRFRGPTSARITIIQ
>STH2284 hypothetical protein
MRWPWWIAGVISCALALSPASDALAGNWTTKGGSPQRMSVETDPHGLVEL
RSYWETQPLGESATQPVVVDGVLYHLAGDYLWVLDLSEVQPPGAEASAVR
EPVSKLKIPFNRVPDGPFIAPQSSPTYSPETGILYFGTGYGWLWAYHTRE
GWAQPAELELGCPIIGSPLVIHDRGRDIVVVADRPNFPGEENRPEGRPLC
PRAHGKVWVVQGLDQLVGSVHRQVYEAATTKQVEGGFGGFVTPSAVAAPP
HGENPSFVIGTDGFEGGRAVRLALDRDNGYRPYLVWTVDGPAGFAGNFTA
DGTNAYWLDTRGYLWGASLAKGREPEGWSGHSISLPALIGARNAFTNTEP
AVEVRQTPDGPKTHLYVTLRNYTDTGAGLRDGPTGSDGAVVALGPGGELK
WYRKFGPVDRLGERMASLNTAPLALTSRGALIFGDVNGHIYSYALDTGQP
MGGDGRPALIQPEGRAPTDRLFLLKGDEQPNTGPYNFSQVSGVGVDPAFA
HGLLLVGVNYTDASGTSGRLVAYRAGEAYDLRWLDEPGPLELEPGKTVAL
QPRLQLELTARTLAALCPGPLSVEWFLTDENGKLVRFLGTAPLPGDLAPQ
QPHSVPLTVALTETDPAEGQIVGIIDLPSVYALSAPHTANPKVAAARGLA
AALGLAAEKSCAGAVAEVVEREGEPGPEGGLANNVLIVPYRNPQPPDEPE
RPGEEGGGGKAIIDDPWLAELAVPEVAEAGRPFEVGIYLGYQNNLGQGEI
RVPLRLWARPTGGASPPPDVWQMVTIDKVPCCSLKPGTIPGLSVGAWEII
AEIDYPADTRPENNRVIRRVEVLQVKPGESGGAEGGAITD
>STH1574 conserved hypothetical protein
MRTVRIGAGQGFYGDSLLPVLDVARYGDVKYISFDTLAELTLAILEKGRR
KDPTGGYTRDVVPMMRNLLPIAKEKGIRLITNAGGINPRGAARAVAEVAR
ELGLSVRIACVTGDDIYDRLDELEARGVTFADKETGQALGDVRDRVLFAS
VYLGARVVADALATGADVVITGRTTDTAQFLGPLIHEFGWAPDDWDRLAQ
GIVLGHLMECSGQVTGGNYQVGWEDIPDLHRIGFPIAEVSEDGTFILTKA
PGTGGRVDLKSVKEQFLYEIHDPTSYVTPDVVCDLTTTRLEQVGENRVRV
SGTKGRPAPPTLKALLGYADGWMGEGYISFSWPKAYSKAKRAAQIIRARL
EMQGVKPEEIHEEYIGINSLWGALAPEPVDEDQINEVRLRIAIRTRNKKD
CEILAREFPPLLLSGPPTASAVAGTPQPRELMGLWSTLIPRELIEPYVKI
EVEEV
>STH3268 hypothetical protein
MYTGFPRPVENSRSPKRTRGAALFVPTAGLSPGIVDNLWPTWGDAVDSFR
ATNRKNLHVQTAGPGPA
>STH2746 hypothetical protein
MCTPALVWPPIQRAAGNPDSDVPAAPVPAQNLNSPPTPDPRFCAPATSPP
PAHAQNLNLPPTPNHRFCAPAPCSPPVHAQNLNQPSTPDFRFCAPAPCSP
PAHAQNLNLPPTPNHRLCAPATSPPPAHAQNLNSLPTPDLRFCAPTPCSP
PVHAQNLNSLPTPDLRFCAPATCSPPVHAQNLNSPPTPDLRFCAPAPCPS
HAHAQNLNSPPTPDLRFCAPATCPPPVHAQNLNSPPTPDPRFCAPAPCPP
PLTRKT
>STH2099 conserved hypothetical protein
MGVILAFITAAEFLILYVRGMSALVVTTLAALSAAKFFLVAAYFMHLRFD
PRLLTAIFAVGVTLATLITIAVRFISLA
>STH536 hypothetical protein
MGWRRMTAAVLAAMTVTGCALRADRASRWAVEGATDAVEHRMYDSTLEIP
NPGFYIVDLSTGRAETWLLKGRAAEDTDYALSEDGRWITLQGDGRIYWVN
RETGEAFSWQPAELRLRATAPDRFLLEAVQDRAERLTFHVADGRFRLLQT
LTADLGGQQAAGAAFSPDGTQVVLSTSREGQIEDTTADVRVYLVDLATRK
VRDLGGPPEPEEGTVVSAELQPHADQLVVSYLVQGMEPTGHVWASSIVRL
YSWQGELLAEYPVAGQGARPSPDGRLLVSYDDLGHFADAVVVTDLATGRP
LFRVLNAWARGWTADAADLLLRVPAVGNFLVSREGGDLRPAPEVSHAGTP
LVWRAGLEPSPDAPDRFLAGLAVVDGAGTVLRRIELPARDDWRIDNPRWG
ASGQEVAFTINPLIGQSSNEWGLPMPAAVQKPPFPDPYLLAVQDPQGDCL
NLREQPSLQGRIIRCLPTGTRLAVADLSDAPVRLNQAWTCDEQHCWAWVR
TEQGENGWVSLSTGIVVWAD
>STH2757 hypothetical protein
MRSWMLHLMAIGGGAALLQAARGFQQLVVDPRTFTYAWGYFFLAVALQVL
AGLVFLLPVLWARVRGEGGVRPHWGLVALYVLAGLVLAFLQPLQIVLGRS
GFIAAADFVRILPTGTSPNIFGAFLIAIGIVYAVGRGPEPGQKPAER
>STH2945 conserved hypothetical protein
MKGPKKAGPQVIDPALSPPIYVPIGMALAGVLGLLVLQALVIAKAPMIFG
PFRFNPAALAAVHLFTLGFATSVMIGAFHQITPVMMRGRPVEGGWALLQG
LAYLAGSWALVWGFYRSAGPWITAGGTLVVLALVVFAALVVRAMRSATQW
TVAGRFMAAALSFVLGTVIWGLVLAFNLRRGFLPDSLDYSPLGAHVLLGL
GGWFMLTVFGVSYQLFPMFALTNRPSAASGHAVLAKLSAGLWGAFASLLL
HLPALTAACLLLAAAGAARYAADFFAMMRQRRRPELDLSMRYGVTALAFL
VPAALLLLLSVRHRAFAVPGAWLFLFGFVATMILGMLYRIIPFLFWHHLM
RNRRSKTQRLPKLDEMFAQRTARAGYFCWTAGVLLTGLLLLGGALGWWDA
RYLVRGSAILNLTGSLLFAGTILQVLRARPLDGRERSMG
>STH2615 hypothetical protein
MTDKRFSPDEQAFLTETGLKAGDLAPLVADAPPLPDDVLARIRGRAREKA
GLAHAAAAAPAAAPPDVPTASRRPRRWLVAAAAALLLVGGTLALSNPEGA
LAAIHRLVRLVPGIGLTETDGETWILPEPVSVEQDGVRVTVTGAISNRDG
TQVRYRVDWPADEPLDKVEFASARESAFPELRLPDGTVIRRWGGSLAGGT
RDMVGQYNFGPLPPGTSEITLVLPYLSGVAEPVEIPVTLVNAEEAGLAGA
HPGNWSEERLGVQVGVPYWTAVDDRIVVSLDTRLPDGTRVIDYRDWIEPH
GFVEPVLIDDQGHTYPLIIDEADRSGPWRRAVFQGPLAPDARQLTLTVPY
LILADYTAEARLTVPLEQLAEGKPLRLNEELQIGSHRFTVETVTRIDADT
FGFTLDLGPEEGGVLLQKVGIRHRMFGGPRAWSAHGENGQFTSMEVDFRS
APRRNLVIVFADPGYRVHGEWQVELPVPAGRSDAE
>STH1637 conserved hypothetical protein
MNLRQLRWPVVLVTLAVTLAGLFGAGQLVRSQTIDQPLVTALAGVDGLAA
YHLAAVGGVSEITIEPAPGASLRKVYGEVDRRVRQILKDGQYVIAVAGSG
AGELEPLVERLNLFVQEAVATGAFTGMADRIAAEAAAAGARAHMAVDDRR
VYLTVWQADAYAYRVVERPAWPPAAPQGGGTGL
>STH331 hypothetical protein
MTMRDQDRLAAVTAVVTRAVNQTDPMGLLSIGAPEDEYEPEIRELVAGLG
DAWPDPTAVLRLCESVFSHFFGARYVNAKRMKRLAGRIVMGLRELERTP
>STH1608 hypothetical protein
MNYLVVIVLALTAVVVVSVIRTRRDRELLADEVRRRGGEVIRLIRARRGS
PFPDTGRGWWAWKVEWRDAGGERTSWALTTRDGLGEWRD
>STH2578 hypothetical protein
MPAESEKGVRTLPQDARILGTLSVQLAAEGRWQDALHTAHACLRLAEPGS
VLYLWALHMLACTYVDAGLPQRARPYAQAYLRQAAGNPQLASYTPFVVRA
MGHIAYQEHRFLSAFRWYKKAHALFCRQGDHVQAAVTSHNMAWALTRAGR
PHRAREVLAPRHAFPAELAYLFDGAMAAILAAEGRLSDTIQRGHEALSAA
GRRAHDLVDAAEVALFLARAYWRLREHGAASAWISRAAEFAALQGWRFVD
VLHLNERAGGGEVPHAASPRGSANLHHRGCFTTGIA
>STH1397 hypothetical protein, proline- and glycine-rich
MTARRRTESPAPPRKEAGGRRPGWWSPPCRPPSGSGGTAPQPWSARMRRR
RRVGMRADRPQHPGDPGRSPAVEQLAAQLDRVIQALGRLVLVLEELTARP
PVHLQVERLEVQTLAFHLGDIDVEQLEGQLNIGITHSLTLESGPSPGEDE
PVRPPGAPRARPFEVDPARPAPFGAGTPGRRAAGESAPDGRAGGKPGPGD
DDSGRPPRALRNSVVSRWLAQPGQGAPARSGRPAGDRPEQAGPACETAPA
GTHPEAAGPADAGPAAADGSSPPRIQIWPPPDAEGSETDGEGQPGAVR
>STH2949 hypothetical protein
MPLAVAPALQEQLAPFRPADAEQVWVRLVDTQETDEAFSFQVQVIRRDSA
GRLALEGDDEVLVSYLEGGEPLVTGYAHAAAGAPDLAVAVWYGAGPEDAA
APALLDLPAAVLADGAGGYRVSVEALAVLGEIAWQEDGSVSLVLAGGTRT
EPLSVAEIEGRSYVQGTDLEAAVDGRRVDLSDAGYTYAVDWRPDLGQLGL
WRAVRRPMS
>STH1348 hypothetical protein
MNHTPVSFRVTASDRDEERLAIEIRGTPAGEAVIRYGSGTATVDARVQLS
EFPQVIMALDRFHHEFAAELLAYVLDRGAMAQESDGSIIYDLGSALHAVP
LPANHEVGLGYPNSW
>STH2901 conserved hypothetical protein
MLQLRLQTAFLLLLGQLLLGGLGWFLAGRWPWIGLIPALVLLWFAWRLGV
VFREEAERRTRRYRTAAAAWTAVMSQIPGLLLLPFWAPDWIYSLWQGTFL
PVAALLERYRPGLGQVVVPWLWASAAAIVLLFAGAGREVPAPAPPPVRAA
PGEWVPARRLADVQKRGVRVR
>STH2656 hypothetical protein, serine-rich
MDSRSPNWEMPALFTSTCSAGPTCSRNRRTSSSWATSQRCISTPSTGSGS
MSAPSTLAPRRANARAMARPMPRAAPVTATTLPANVFSIRLSSFRPEVRP
EATPVIVCPTAGASRPARVLLSPSQTAPSTPSSIGMMKG
>STH1839 hypothetical protein
MMFYGFGKPFYGKSFLGKPFVGKPFFGKGFPATKASGFNSFIPNTTVATT
PFPGAI
>STH2243 conserved domain protein
MWGEQTRRVGWLTLLVLTVLLFQPVVGVAATDEPKPVVVDYPPENYEPGP
GWPEGWSPPPPPEGMSEQMRLRVEQALRERPDLYERYTWWVWEPLPEWTK
EYPWFWREDWMGCGTTLHVYAYPVPDTQLITRVFGRSYGWTRLYLEPCRY
EPVQFIVEALEMSEEEKLAEGRRAEYLYWVDKSHPGLGRDDGRDRNGSPD
HIYVYLNQGDASKRFDTPAYLDTTVNRVRVPIRFVSEMMGAEVTWDETGR
RVTIHFPATTREVMKVVPAEGYDYPDLFDAEEYLPNGYRFYFEQRVVRIP
ERTITLTIDHPVAIVDGREVRLDAPPVIRNDRTMVPVRFVAEQMGAKVYW
VGKEPIFRLDDGTMSGTYQVHIFTPTFPLYEYPSWYLENRAVRY
>STH448 conserved hypothetical protein
MRSRDITLTGTYLPNTPVTLVVDGVAQVQGMTDSAGRFRLTGTLEPGRNR
VYVMGEGPLASREYAVRYLPPYTDMAGHWAEAVVDALHERGVVSLYPLPR
FEPEAQVTRMEFAVMVARALRLPAAEEQAPFTDVAVMPQWALLEVAAAVK
AGIIKGMPDGSFAPDRLVTRAEMAAMLTRALAWAGLEVKPEAPEFADQAE
IPDWAVEPVAAAVRHGLIKGYPDGTFRPGSPTTRAEAATMVHRLLQTVIG
LNP
>STH2960 hypothetical protein
MKPDPRVTPTGDAAADNPAISRLTVERLGMELATELGVDPGRIGALGASP
DPAVQRYEAETHSPKSRPNR
>STH2219 conserved domain protein
MQRGYARRARQLACLVLAALLLHTPVRTGAAEEMKPEVAEYPAENYEPGP
GWPEGWTPPPPPEGMSEQMRAEVERILREHPDMYDRYDWWVWDPLPAWTN
EYPWFWREDKMGCGTTLYVFAYPVPDTQLITRVFGRSFGWSRLYLKPCRY
EPIQSITETRELSEEEKLAEERGIEYEYWMDKSHPGLGRDDGRASNGSPD
HVFVYFNQGDASKRFDTPAYLDTTVNRVRVPIRFISEMMGAEVSWDQAKR
QVTIYFPAVTREVVKAVPAPGYDYPDLFDPEAYLPNGHRFLLEKQTISTS
ERTIVLTVDHPAAIVDGMEIPLDAPPVIRNDRTMVPVRFIAEQMGAKVYW
VGAEPIFRRDDGSMSGTYQVHIFTPFFPLFEYPSWYLETRAVRY
>STH579 conserved hypothetical protein
MTRAELDAHVRDIVRWHFSPDTGSPYWLRKAADLGFDPVAEVRGFDDLAR
FPAVAGEWRSVPVGDLIPRGCSEPFDVWETGGTTGPPVRIVDAQERTRGI
QRVDRMLDEHGFPRREARTRPGGPTGAIAPAHGGTDTTEGRAHTGGGDAY
PAWLHLGPTGPHLVGTNVARLARLRGFLYHTVDLDPRWVKRLYREGRADE
ARRYTAHLVEQALAVLRTQPVRVLSTTPPLLQALCEHPEAYAALAGRVEG
IIWFGTSLSAEGLRLLEEELLPEARFVGWYGNTLMGIACQRPRREGDTHR
CIFSPPGPAAVVRVVDPQDPSRRVAPGEAGQVRISLLTRELFLPWHLERD
RAVRVPWPGSELWDELAEVAPLDGGGARVEGVY
>STH1289 hypothetical protein
MNAENDVRRRTSRLGQRSVRATPWSIRKDLSELTFEFHMGREYRQHLIPL
LDQVGAHCTCPVAGLPIAERLHFYGQGCPPLV
>STH445 conserved hypothetical protein
MSVHGVVKRGRGSARVLRMGLTMLLIIGLVSGVPAHGSKLADVGAVEGNP
VGQPDEESPSDPGGEDPDSVGQSDEEFPSDEGDEDSGSVGQPDEESPSGP
GGEDPDSVGQPDEEFPSGGGDEDSGFVGQPDEESPSDPGGEGSDPGDVDP
GDDEDGPAFPVAPGTEDDGGTDPAQEGLFAALAWCCPGPRWSPTTAAPPP
>STH912 hypothetical protein
MGACPSGICAGGAGVIPLLAARRFRSRVKGGGFGGGFREILGKEGREPVL
KNVFRKLIGLLVVALFAVAVVQYFRQTSGNPYVWSALGSGLWILLPLAAV
LSAVWWADRRMRARGE
>STH685 hypothetical protein
MRHEQIIVAAVAGVFFVWYFAGAWLNRRRGAQILGAVRQAVGAVGRGATV
RWHGRSAFEVAVAEPLAPLAAFRLLCLLEPRDFPLALAWNRLRGRRDQVV
IHAEFARPPREGRPLALAECDIPGLTGAALSGSPPHLRLTLQVAVGGEDA
IARAFALARQLAEQPPRGQPPGGVRAASAGGAAT
>STH48 hypothetical protein
MHLCMCILLPLGQNVQTYLRKYGTCGPDLPLHCPGCGGRMRRHGRYWRWV
FTAHQKAYIPIYRWWCPGCRKTCAILPDFLKPYARFITLVREAVVRGRVR
RGLPWSTLARRLSSPTVSWLSEKTLRRWLVRARALAGEWSQYLAERVLRF
WPDTDLDALTPRREGPDAALHFLLDVGDWYRRQMGRRPEEHGGVFAALNR
LGEGTASL
>STH743 hypothetical protein
MTPPLGPSSARRASPGPGSRAKSEPGLCASGQILRVNPVPAPGSRAKPEP
GLCASGQILRVNPVPAPGSRAKSEPGLCASGQILRVNPVPVPGARAKSEP
GLCASGQILRVDSAAAPGSRAKSEPGLCASGQILRVDPVPAPGSRAKSEP
GLCASGQILRVDPVPAPGSRAKSEPGLCASGQILRVDPVPAPAPAPSRLA
AYSPAEVEQREAPGVTRPRSPGTPVEARLGPGTGAFTCGIGTPDLAWTVS
PCAQNLGQA
>STH2277 hypothetical protein
MCILLPLGQNVQTYLRKYGTCGPDLPLHCPGCGDRMRRHGRYWRWVFTAH
QKAYIPIYRWWCPGCRKTCAILPDFLKPYARFITLVREAVVRGRVRRGLP
WSTLARRLSSPTVSWLSEKRGCRETHAMKRESLREQEPRKRFA
>STH282 conserved hypothetical protein
MYTPPMQKSDMEPSLLARLVISPCGRNCGRASPETPICAQRRDGWVPHAE
LRGKISQSGSNAFDRLEDLLTSDVFGRLRYLPPDRGLIPVLQQAVNLQGN
RGCPLPQTVSPEPDYQFWPWLKRCEPDVLIQLSGDEGKPFLVLVECKYRS
PKSDRPGAEANGDVEERSALPDQLAREFLDLTDWLSEQGGDGILLYVTGH
SVLPRSDLETSAAALAGEAGSNRGNRLRERFLDRTYWIGWPAMWAVFREL
YTQEQDPYRRRILGDLLDLLAHKGYRRFRGWPEPNPAGPLPPPPSPVWYR
RDT
>STH1817 conserved domain protein
MGAQMAWFGLMYGFSRGQMQLYQGLLVPLFQITPARPLAFLLGRVIEAVP
TRAWTTLLWAWAYSAMVPGPGRWPAMALLWTAGLATGMLAHLAGLLLLTF
WSRYHPRSMRHGNTAFGVLTIAMMTAAVVYLMEGGTATELALALRAARRS
VLTVLTAAAGLPGLMLALALLIRPQWVESHYREGLYRVLELGEQDTVRSG
RSWWLPLRGDGVLRAVLSREWLQWVRYRMARTQLLIFAAGVVCVWFAGRS
AAGRAMPLAGLVGSVGGLSLLAWYNAFGHWVAKVFQQERITIALYRLAAV
PTVRLVIAKLVSVLVPSVLLVAVAAAVGSAAARLPLSTAGSLLLWTELGL
VFGVLGGFGAAAATANQEPEEPEAPGAPRMEQGGSAMVQSSAWSSLARTF
GLTVSTALPLWAAAGRPWLDLPPASAWTIALGVPLALFLGGVGWMLRTWR
L
>STH316 hypothetical protein
MLLTFFLFYPLHIDVSILTRPRGRVLPVNCAKEIAMVEKFQSSPGLAAGC
YMSWRSNRFSSWGFQSSPGLAAGCYRTVTINRAATVVFQSSPGLAAGCYT
RPGGGRKHHPPAGFNPHPASRPGATWPKFASWPATSSFQSSPGLAAGCYR
QKTDAEYSADLFQSSPGLAAGCYPYPRGGRFLSQVSVLVLRTALQRPFRQ
YPTVTGETRNPLPTWPWRHLRTPRVFVSITGSHLENERPFKVDRTENTEL
LYVLFLGFHQAVDAQAVFRLVNLVQEVPDQFLVLGFM
>STH2793 conserved hypothetical protein
MNRMRRWRIWLVLTLAALAAAVSTVGAAAAGGPEAGRPPIAAGEEQVIIP
EGERLEDDLVVNAEKGAAVYGTLAGDLILLGADAVITGTVEGDVIGYANR
VWISGQVLGNVRVLALEAVTIEGRVGRSAITVSQRATLGPEAEIGTTWFA
LSGDVELGGTVGRSLFATASRVTVSGEVGGDLSLHGYDQARVLPGAVVRG
GILAVADRPPVVDDGASVGEVRFVAREGATQPLLTMDGFALGRLAGFAAV
GLLVTWLAPGLLGSFQRRVSGHFWATLAVGAGLLAGVPVLALVLMLTVGG
IPAALMVVLPLYAAAIYLGQVFVAGWLGWAILGRVRGDGQAPRSAAFLLG
LVCLTLFSRLPYVRYVGSFLAVSLALGGLSLTLGPWIRRIARED
>STH2714 putative transposase
MKKHQSEPNDTWWDLPLLNHHVEVRLMQSHGTTPNIAAQAIKSTTRHGFL
VALGWVAQRLNLVEILNRHLRIKQKTYAHTPVDKVVEALVAILGNCRYMK
DLNFDPEPLVADPAVAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALA
EIQAPLLQQEVAAVAGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQ
GKLARGYQIAAAFLSGKQQRFAIDGLLKSGKANSRSGACLLELIPRIEAR
IGRPLRRVEWVEACLAQQKARVRQLYQQLQTVSGKGSARRKQKLQREFQE
EVQHLREVNQRLRQYRQENRTNLAPLRILLRADSAFGTPEVIQRLLELGY
EFTIKSYSGSNVAYKHLFDAVPAENWVEVEKNRFASEAVTVPGPTLLAPY
PVRLVAMRRWDADGREVRSVILTTLQPEELTTTEVVKLYHGRQTIEAGFQ
EWKGTFHFGTPRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEA
GSRLLVRVAARCRATIRCLGDTLRLVFSRGTPLAGAEITLNRATPYPYAL
LTPRMSSCSRET
>STH506 hypothetical protein
MPRLNRRDPLLDPLLPPSAWEIARRRRRVRRWVALALAAALVGLGAWGWL
TFTDRTRSLPPVQVAEAAGEALLRAARYDFRAAVTGSSPDGFFPNAQLAG
EFQAEPLLLHLRGEVGSGAAMMPIEYWLSGDALFVRQARGGWVRSPTADV
PDVVAFLPEQLAAPLLAGVNRAELVGRERVDGAQTAVLALDLDPAVMQVA
PPGQDERVEYRLWVDVRRLRPVRMAIRVERPADAGSSFDYQLDLTYPAPG
SLSLPDEVLQAAGPGE
>STH1287 hypothetical protein, histidine-rich
MRLIIILMPDGKSQHHRPVHPGGVCMQDDTAHIPGGTWSDSPLRPSLERF
AALHTRADVRWGHPEESDDGYRYEALYDPGFIGRFLDQIAAEFAGPPAFS
GTRLFRVAHFLPLRLAGYLFAAEGRVIRLRENLVLRNAAHRQGSHPHGHH
PHGGHPHGGRPPGLRVLEPRAVVLPDDPAAGAPGIETAPDRAALADALFA
EVTALAEPLMRRLHDLGVLPPGTGWGILLDFLAAGFLAAGRAGTGLDAAW
AAWEEAIAGRTFPARRLPRRLQFTADGRTDEIMVHAHCCLRCALPHMKDR
EDRHCIECYLESDENRIARLLGRRAAGAHAH
>STH2801 conserved hypothetical protein
MRLPAQGQPMGVPCREEDDARMQYQTGGMMPLTEKDLSYLKDMMSWELLA
AKKAYHYANETQDAECRQAMFQIAEQHQRNLERLLTHLHEHVNQPTTIAV
SGADAPTVVM
>STH3139 hypothetical protein, alanine-rich
MAGNPTSRGRGPRRHWPASALAVALAAFLTVPCTAVCRASPATDAAPTAR
TVPAGKEAAPPEAVRTMSAAQGEGLPIPGDWIPPLPAVAGFRLVLSPPAG
PGGAPQPLYATHPDHRPVIALVLDRLVGSRLTAGTAEPSAAGGVLHVALR
RGGHLAVEPAGECPDAADPGERGYVCPPSPSDVVLRLSDGRAVRVRNPWM
AGWLQEGWRAHLPVGAPQLLDRDAAIALAREQSGLPAWQARFVDAYPVER
SGGTEVRPAWLLEAELPAGQRIRLVLDAQTGEVLRLVQLEALP
>STH835 conserved hypothetical protein
MIGMAQAEEALSSVAERARRALRDNEAAPPQPPAHVDSPRPVRSPAAGSP
TELIAQRMQAIARVKELEPRLASARARAGELSQRLTSTPPDSVHYAELLR
WLADVHVQLRQLEAEYEEAQHIIRNSAWVMQLLADPLSS
>STH1336 hypothetical protein, glycine-rich
MKRMPLKWLGLAAAAALLASGCTAQPQNANRVNQSNLFTTTGDADTCAAA
LGNTVGANAFWGMAGVNRNVTANGVIIGNVALVALPRESQNTPVNGAATG
TGGQNGAAAGNGVQTGAAGGNGVPMGTTGNGVGTGTAAGNGLSTGVQRDG
DVTGNLGTLTPPGTVDTTPGHVGTVPGNTGSTVMTPGLGTTRGATGVVGG
RVESPSLPGTGTTTGTGTRAGTGTRAGTGTPAGTGTAAGTGTMAGSGTRG
GTGATTRAGTAFGTRTGTTAADPLERVRTACPTLADIRVVNDETDRARLA
EITAAVRSGRPVTEFMSELATMAQRATSAGPGASGRQRNRPENFRNRLQQ
GNLVPGRAGTPTTPAPAAPRTPPAPNAPGTDGRGSTPARPGTGTATDPGT
SATD
>STH2130 conserved hypothetical protein
MAALVSRAGASGRNTGRAGLTPPSRQANIQSIRIRNACGERGVLQMMTIA
NSVLVVLSLLLGGRLLMHYRNKPRPHSLWYGVGLILIACAALPELVYELT
GQVPAVLWWLYWSSASSCVAFLSVGSAYLISQKFGKGALVTAAVTTAWVV
AATLLTGGMGPAVLSSEAFRSAPTGAIKLPFLLQNICGAMLILVTAIMSF
VRTRAPFALLIAGGVFMFASGGASAGLLEFSQIFAFTQTAGILLLYAGVS
LSLRPRQRPQHQSVSG
>STH3083 hypothetical protein
MVPKNGAEGKSARRRAPPAAEVDESPLIEAARAGVDGAFDSLVRPHLDRA
YRITVPFENVREFAQRFVSR
>STH3182 hypothetical protein
MTSLRLRAASALLALTLLVTGCARQTPQQQEDVPTLEQLRERAAAAFDGT
ASSAQRRTAAENAVRTLLGLLQRPESFQISDDEWTASPRPGLEAMVRHID
LGAGIHLYALALPGRTLVDEIDRVAVQVRSGNSPRAFELSPLPAARLVSA
YLHDTPGARRITLALQENERSGYIAHFSGPAAGPFELVTDAFAGMEGTYG
PARLTVVDGTLQVALEQGTWSPAFDPKDGSLHLATEIFLKFDGRFSLADE
SRFDAFALLDIAADPLRRCQREGDCPEAVTARVGTSREQAAEAAWELAKA
KLTRQLEAGWSDSLTARLPTGSRMLSDEGRGLSVSLLSIPAPGQYLKGRY
FNVVQFRTSGVPATRALPLPGLVESVRGFAHRGMPAMLVVVDDSAGQEDV
GTAKRTLYLLRLDAGNDWQFASDWVGYVTRAPYWNIGDVTADEITISWEP
ALYPLFSVELQDGDEPHVTVCQYPGRCHQLTWVDGRLHSLPLLTHYMGEL
TRPHGEEDLVWAASQMAEFLALVDPAALTGGRLSQLVDPDGSLGIRVFDV
GENTRVITLPTGPGGMGMAVIHAPGQAELVKTYDGVVTRWEGAQIVQAGQ
EKRLLLLGRSDRAAVLLAYRQQGGRWVPVDALDEAVDRNALLTLRVTHTP
GAERPARGVVVQGAFGVSASLTGGGASFCEGGALCMNYRYDGGWVLR
>STH1044 hypothetical protein
MAVRVAPDGATQPGRAARRASEREVQARVHRQMGITAAVMGVAAAGVGFW
WIGRVFGVGPLEALQAAWPLFLLVLVILALGMLLSLGAARAAWLTERFLA
RRRQRKVDSR
>STH3151 hypothetical protein
MDIEWTRFSDLSVSSGFLLIAVLVLWAAVAVVMYWAFMSGSTSMNEDVKF
KYVEDDKPITR
>STH337 conserved hypothetical protein
MAVGMFGMPALGHTMALAHYIAALLVGLTFRFYGIHEHERTAEPPREGNM
LSRALAALIKARQEDGRPLGQMLGDAIKDSMRTMALICGYIMMFAVLARM
IDVTGLFPYVSAPFRWLFAVLHIDPALVRAAVTGLLEIDLGTLAASRATD
APLAQQVAIAGAIIAWSGLSVHGQVASVLSGTDIRMGPYVAARLLHAVLA
FIWTLVLLPMAGPLSLGAARVLPTLGGLPAALWQTPFWEHLMRGTRWSLV
IPLCLALIGAGAALLTGGVRWVHFHTRR
>STH1056 hypothetical protein
MNYEWVEVYKAPDHVEAELVRGLLEASGIPVVTEARGAQSLPFLLGPARV
GGHISIRVPPEHAQAAREVLAAREEPSWPSGPGGGNGEDEM
>STH2501 conserved hypothetical protein
MERGIALTREAAAPAASAAYCAVTVAEDRVSAAVPGVAWQSRSLARLLAG
ADAASLVAVTLGAGVDELIRDLFAREEFALATIVDAAGSALVHGLAEWVR
GALLQEAEAGAAAGVTLTPLYGPGYGDWKVEEQPGLVALAGGEAIGLRCT
PTCYLVPQKSLVGLIGWLPSGGRDWPAVGCTRCTLADCAYRVRTSRR
>STH3126 conserved hypothetical protein
MKDLLCDQFQETVSECLIRHRSVLDIMAKYQESNARVNRALTKSVTSCGC
IEIHGTKQRYPQDATLEQLRDHAMEHITGKLCDHCREVIEAEIGNNLFYL
AALCNVLDLNLYDVLLKEHKKLSALGLYNFAD
>STH131 conserved hypothetical protein
MALYPCNAFAAGDHQGTRAGPEGLVGDGTWTAPEQEAAPFDWAVASWNGE
GELIEVSLRVRVGDDWSPWFSFGRWSSTGERASVPDQVHPPFGRLETDTL
LLDRPARAYQFRVTLRAAALRRLWLAAARKDERSGEPPHRDAWGVELDVP
MRSQMVFPDGGNVWCSPVSLAMVMAYYGHVESIPGETVPGVYDAAYRGHG
NWPFNTAYASTRGFRAYVDRFGSFAELERWIARRRPLIASVAYDRSWLPN
APIARTAGHILVVRGFTPEGDVIVNDPAAPSDPEVRRVYRRELFRRAWLD
RSGVVYVLEPLSAPGGL
>STH1569 conserved hypothetical protein
MMRRYLPIVLIVLLLAGCGTRTATPTEPTTPETPGQSTEKPPVSESVHVP
EPEPEPVEPTLTTDPVYLLAPSAATLWGGPVSVVVENSPGARPQAGVNQA
DLVVETLTEAEITRFFTLFWTKPADKIGPVRSARQGFVDMADAYNTPFAH
SGGSAEALAMLRAAWGPRNLDEIYTAGGYFYRTGDREPPHNLYTSTDLLG
QAIVDRGIDMTSVPTTARSAAVPTVGDHTAVDVFWHRLNEARWVWEDGRY
VRYTDGVLHTDEEGRPITAVNLLFLGVGGVNRGVDLGWTLYLWDGGPATV
LVAGHRYEGTWRLEPGGFVLYPPEGGQLPPPGARAHLGAPDHRRV
>STH2758 hypothetical protein
MDKGAGPSDGSQATAPERDWEEQARRADERLAELRRELSHPVFVPTQVPE
GLHPQMPSQSDRLVSIAYVDDDLNRVLRVSSGPLGCCIDADPNKMVAGGT
PIRDGREARFIPYQPEFGGNILWWQEDGAYVALSGPHLTEEDLFAIAEFL
SPTATLTGPPDALPPDVYPPDYRTEVPQVDAVIDAFLAKDVHRLASCDML
RCEEGAEGALEPVVTWGSCEVNNTWTGWDTVRSLVEGWVGEPRLLYAAYR
VQGIGDWAYLVQFAKYRHDQRPKSAVLDGEGRILQLWMGCGALFDMQADH
YEPIAPPRTGL
>STH922 hypothetical protein
MSSHTLHCMGGGGDVFNELQLKLLNGRTISIHVEPGFDVDDLEYLPGLRA
VSLGNLTVLYEGLVYDVRPRRTLAVHGAAPSLTR
>STH182 hypothetical protein
MEGRDLLADLEDPELAQLVLNEPPPLPEDFTARVMARVEAERPRPFALLW
PWLQGHWSVHQYASVAYALAATLVVVSLGNRFFLWSQATDRLAVWTAKGQ
AYWAAIQAWSGPLSERLLSMWYALTAFLY
>STH267 hypothetical protein
MYTRSQDHCRAISTWIATGISTLISTGVSTPFSTDVSTSPSTSGFHAGER
GPMPTSQAWCGPAGSEPGELGLRIHRLGYRRSPLAGLRNSPFALPH
>STH1016 hypothetical protein
MSARLPRQHVYRSGHGCVRSTEARHPSQWDLDPEGWPALERLAGLPHWAS
ACRIVSSTELKAVRTAECLARRNRLPPPEAVSALGERHKGSVVPNHAGVV
VRLFRFPSQPAAQGWETVRAALAASTTRSRRWWRPPGGGTSSWSPTASCC
RSS
>STH2269 hypothetical protein
MGLGRSGTQRAPKRRGRERHYFSLDRHGHFSARRSIKQKCNGRWWSRSSR
CTRILPWRTPSCSVFARCSVSTKWAS
>STH83 conserved domain protein
MHRRSPRRSPYLFAAIDFGYTLLASLGLFGGLGWWLDGKLRTAPLFLIAG
ILLGLAVAFNGLLRRLNAIDRAVKAAKKEETQKTRDGQP
>STH2893 hypothetical protein
MARLLTAELLLLALTAAFAAAGVPWLAGQPPAWPLLSVAFLMALAFSALY
ALANLAARRLRQPALLAAGLVALASVAVDALRRGPFAWGEGLALAAGLVL
SALGGALGRGAGAGLWAYAAPLAAVRYALVPLAGSGEPVRALAVQSAAFY
LFLRGLVRLAFPTSAEGNPSAGPLVHRPVPDRVVGLVEATARRRALPYAT
REDGSRDEEAVAVRCPRAKAAELAERLRAVLEGHPVTVTQGVQAGDEVEL
VIRVVPPPRDG
>STH2468 hypothetical protein
MHFGGFDAPDQVFFNPAVDLAPFSKLASLAAFSPAAALAAKSPVLAIFNP
AINPRAALLAFNPAAAPTLALTGGLGGKFFKKPFRKGCC
>STH2461 hypothetical protein, arginine-rich
MRNQDAAGAASWFSRIPGPHSLPVQRWVGVRDAGRVSRAGGRSGAKAAER
TPRQRMCTFVESLRGRRFGARGAANAYGARASRKCTFASVVGFSAHVRTY
WGPECRADQCRGNRCALSPSPYAAGISGLAGWEKPMGRGFHESAHSRQRR
GRGQERNRGRGRAETRAGTTGTNVHLPRILTQQAFPGPRGGKSPWGAGFT
KVHVRVNGRVQGECVNVSGPRASTGVVPRQRMCTFGESLRGRRFGARWPG
EAYGTRVSRKCTFAPTSGARAGPSPGRGRTETRAGTTGTKVHSRVNGRPS
GRVHERVRAPSFDGGSAEAKDPGRSGRGVGAVAQVTR
>STH909 hypothetical protein
MNFWLWIIGWIAASAVVVFALSINGKTESGEALEGGHGHGHH
>STH124 hypothetical protein
MRTAAIIFEGGPRAENPLQETLTGLRHAVTLDTVAKFCAAGLDQVVLATN
HPALAEAAARLGARIFDTRGDRFHFGRSLVQAVRESGADAVIYLSGAALP
LIGQEEIAWIRDALRRDEPTVVVNNVQSVDLVAWRPASRLERVDPPENDN
ILGWRLRDTGMERVLLPNSAAVNFDLDTPTDYLILALSGKGGPRAQAALR
AAGWSDARLRAAADVLAAELPEVALIGRVGSAIMEHFNRNLPVRLRVFSE
ERGMKALGREAAGLVRSVVGDLIEDLGPERFFARLGETCDAAFFDTRVLF
ANRGRRVSEWDRFQSDLGNVEQIGDPFVRAFTRAALECPIPVVLGGHSVV
AGGLWVLADRAIALRGGPMRF
>STH3173 hypothetical protein
MALALPDWIRGMLLRPAETFAQAQTQMRFAYLWILLTVFTVEAVMLLFHP
SVRSAVPSLPAGALLWSLLNLMLTLFAVQAVLLFWSGRIFGWKIELQEAA
KYTGLVWVLFLVEDIVTFVPYLTQRDWLVLWASVPFLVWRVVAQTAGVHR
ISGLPLVRCLGLVLTATLPWNLPLLYLNWSGLVAP
>STH443 hypothetical protein
MPQTTGLLPGSIAMIIFGAALLWGGLAFFITTALRAEQKK
>STH2165 hypothetical protein
MGHGQPPRGRTGEKVTESGTYQCEDQTRWTYQAGERFRECPSTGKPTVWE
KTSEPEHADSR
>STH1756 conserved domain protein
MRRFFVVLLMLILIPTAVPGTALAEEPVRLFVNGSQVVPDVAPVIVSDRT
LVPVRFLAEPLGFAVEWDGATGTVTLSGARVIRLAVGRPEAIVDGTVVAL
DVAPVNVDGRVMVPLRFVAEQMGAAVEWDPQNRVVTVTAARILPEGQVGG
DLALVGNVEALGAWAEGHTTGQFRLSAVAGVLSMDYTFALDGYREGDDLL
VYLTGAGPGLDYRSALAVRAGKVWRLGPGGTWVRRPEEDAAFGPDGLPGW
GGDLPAEGGLFDPDTTELAGARILREERTLEGVTYDVVVVEWDQAALSPL
FAEFPPTVGDASLLLTVYALPDTGGPVRVDLSMHAADATGVTVDLSGSLA
VKPLEGAIPFPPEILSE
>STH310 conserved hypothetical protein
MACFSRPEFKVERVSYPVMTPSAARGILEAIFWRPEFRYQIRAIGVLKPG
KTISILRNELAERQGNAPIFIEDQRQQRASLVLRDVAYLIHADMVLRPHA
TDPIYKYVDQFRRRVERGQYHHAPYLGTREFPAFFSPDNGETPPDLNLDV
GPMLFDIAFIPSSQRPEMEFHRHGPGGACRVTGYAQPLFFDARVEHGWLH
VPPQKYDDLYRMEGDDAAGTGQGSRPFSG
>STH3328 conserved hypothetical protein
MAVLLQQDPLLVAFTALGFCIVAIVIMLIVLVRQSILLRRYRSLLRGNTN
ASLEDLLIQQQQATADLRAAQESIRRRLSDLESASQKYLQRIGIVRYNAF
PDVGADLSFSCALLDGEDNGVVVTSLYGRSECRTYAKPIRGGSSSYALTD
EEKQALRLARGVDNEKA
>STH784 hypothetical protein, glycine-rich
MDRQVAAGTVRLRWRDANVAQSGDLMARLLNLVQEEKRDMESLYAEEPVR
AHLLQATGDDGSGDGSGSGDGSGGGSGDGSGSGDGSGSGDGSGADDADAA
MTGGGSGGDEADADGSGADAGSGGEDGSGDEGVDATMTDDGSGADAGSGG
EDGSGDEGVDATMTGGGSGGDADGADDDSFGSGM
>STH2409 hypothetical protein
MPSPVPGCQLNPGYLTREVSLFRSLQRSALRILATSRLNDRPVPSPVPGC
QLNAGQLTREVSFFRSLQRSALRIFCHFTSERPPGAFPGPRMPAEAGILD
M
>STH2843 hypothetical protein
MQQPGMVTYSPQAPVQTQPQAALAAPVPFAAGPFGSAPPFAQTGLPVQGQ
QPNPAQVLTQQLVQTARSLEQLFPGYQALLSLLLEASSNAPSPALNEAVR
SVEEGLYYHGAALGAIRRILCGEATPNVLMNLAAGFHGLMRTQPRVRAAA
EQVLSLMPSARQSLTGTLVQSIGAADGVLAAAATAIQTLVGPQAWEAART
T
>STH2022 conserved hypothetical protein
MEQIRAAVDFSFQEIERLAWQQTLDCFREALVEALSAIDTALYESRDPSR
YVYKEMRSRSVVTKFGPITFQRRYYWDREEERFVFLLDEVLGIAKRQRVS
DSVRADAVEASVTAGSYRGAAAELERRDCQVFVSHEAIRQWNLQTGRALA
AAEKQQQMTLAGTRRVRVLFIEADGFWPARQRGKKAEVRLFVIHEGWIQR
GPGSKEYSLVNRRDFVPEPGRDSWEQLSELLESEYDLSETWVIINGDRAL
WIREGVTWFPKALYQIDRFHLKRELNHVLRHRPQRLEQAHAALEANDAGR
LLAVLDEAHKAETDTDRRGKMRRLLADLRMMPESIRDYRIRLQERGVNVE
GLRGVGAAEGAVERYSARLRKVGRSWSESGLMAMLQVMAAYYRGALRGAV
QYVERALGLESVNAAAEKVRHRVRETVGRGVDAARHARMPILNAGRNNSG
GYSKTFRGFAGLVTR
>STH2619 hypothetical protein
MSMMEILVVLGLLSIVLFAPTSPEIRRRELQYGFGVLTPEAVARPR
>STH2071 hypothetical protein
MRARRCRSPRDRGRITGVERDQPAACRAGTTASPEVAKMRKAILFLAALA
AVLGGVWLYPVGGRIWTATQVSGVALVREGRRAELSNPAALPHAAQDALL
AARLDRFRRLAPALRGGPCVPLDLALRKGNVWVNVSLKCTDGLTFDDIDR
DAWR
>STH1712 conserved domain protein
MPRWVRFLSCTVTGAVLSFGGVLLAAVAAERPPGFWRALQAFAVTPGFWA
LALLFCLLAAGAVAAARALVSLYGLPPAAAGALGGAVLATCYLAALVSRH
LDAWGGWEGTLPRLWPAALWIALPFAASGAASGWLWERLD
>STH1049 hypothetical protein
MTDADRLRDCLCSSRDLAKLYAEAALEAANNGVREFFLAMHGEETHNQEV
LFHFLHTRGEHPTRAADPGHIAAVRQRYGEAYRALGLTDPPSFRRYATAD
PRVPPAAAQAPEHFRPH
>STH1202 conserved hypothetical protein
MSAAPQWNRVPKVQPAPQVAPAPRPVPRPARPRVSIPAVLGLAGAWALLI
FLAFAVVQTRMEIRAVEAATAQAQRQIAHLQEQNRALEAMIANAASVEEV
QRWALAHGMRPPAGVDGTLEGRAEAVAVRTPAQPAAAEPVATEEAPTLWQ
ALRDRFTERVGALAAGLR
>STH790 hypothetical protein
MASAFRAVLDIWWLMFLVFVPCALLGYLVSLLSPGGRRRRISRSGVRALE
QLDRMSGIEFEEFLKALFEEVSKYR
>STH715 hypothetical protein
MDSFQLVFLFVAFLVLVFTVLKWVRLLRKLDAALQAAQEAARRQRGQDAG
LQDAGSSSPVIQR
>STH1705 conserved hypothetical protein
MRALWTLFRQDLRSRWRLGHGHPRRRVRTALLLAAVALLHLDALTGQGPA
PGEAGDLPGILGLAAAYAALEALALLSVALVGLVRGGRTAALAGYVALPL
IGLPAAALGHALASLSLRVLGRGLTHLLVRALQLAALGAALAGLDLSGGF
GWWAGPGHALARGAVAVLASTPLLVALAARLTAFRAHEGAGGRRSAEGIV
LRPVGFSALLRKEMRLIVRDPRVWGRLLAAPVAAGAIVLSIFRSQPLLSD
PAWTATALGVFCAGMVGMMTDVFTHGEADRVPLLRLSPTPLSRLIAAKTL
PALPLVLLTAAAGGVFLAAEGIPGAPLLIAAGALAGTACAWFNVEYRFRM
FTTRSTVRSYAAAFLMVLALFAASAAVLGHVSFGPVVGVTNLVLAGLLFA
GAFMLAGRPQW
>STH884 putative general stress protein
MFKHSTRCPISAAAHREWSAFLAGPEAERADHFWVRVIQERPVSLALASR
VGVPHQSPQVLLIRDGRAVWHASHYAITAGRLKAALDQAAAQR
>STH2245 transposase-like protein
MPLLNHHVEVRLMQSHGTTPEVTAQAIKSTTRHGFLVALGWVAQRLNLVD
VLNRHLRINQKTYTHTPVDKVVEALVAILGNCRYMKDLNFDPDPLVADPA
VAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQEVAAV
AGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQGQLARGYQIAAAFL
TGKQQRFAIDGLLKSGKANSRSGACLLELIPRVEARIGRPLRRVEWVEAC
LAQQKVRVRQLHQALQTVSGKGSARRKQKLQSELQETVQVIRELNHRLRQ
YRQDNQTNPAPLRIVLRADSAFGTPEVIQRLLELGYEFTIKSYSGSNPAY
KRLFDAVPAEGWVEVEKNRFASEAVAVPGPTLLAPYPVRLVAMRRWDTDG
REVRSVILTTLQTEALTMTEVVKLYHGRQTIEAGFQEWKGTFHFGAPRLR
KYEANAAFTQLVLFAFNLVRWAWRYLSTSSSKLAEAKSRLLVRVAARCRA
TIRCIGDTLRLVFSRGTPLAGAEITLNRAVSYQCTFSTPRMSSCSRET
>STH1416 hypothetical protein
MVPPCLVRSGADPLVPGTIIPSALGNGGVPVYLLDARTRRLVVRHRTDSL
VQVRRPDGMRQSFSLRSRVYSNPPSAPACTNRWLSAALRPWVYFSRSSLY
NIEA
>STH1329 conserved hypothetical protein
MRMEAVPTPSGAGVTIQNRLAGPEKGSERLLVLLPGNNYSCDAPAFFYLK
QAAVQAGWDVLSTAYAFQLTGGEVDGPAMLADVRAALDEVLPRGYREICI
AAKSLGSAIALPLARSLAGYRVSLLILTPVPQFLTEPVGDLRTLVVIGTN
DPVYHMTECGAARKRADAEWAVIPGLDHGFNVPGDWKASAAALEQVIAPC
VRFLAGPAGR
>STH1886 hypothetical protein, glycine-rich
MRGSAGAEGGHAQNLHPGSGRRFRFCAGAPGQRGGHAQNLHPGSGRRFRF
CAGAPGQKGGHAQNLHPGSGRRFRFCAGAPGQKGRHAQNLHPGSGGRFRF
CAGAPGQKGRHAQNLHPDSGRRFRFCA
>STH1035 hypothetical protein, glycine-rich
MRRRKILIWAAAALLAVLVAGCGKYADQTGPNRTDPSPSSSAGSAGGNQP
ATGGGGDGGGDAVGNAAGGAAGDGAGPGGGGGAEPAGKVQLGPLAVGESA
QVGPLTVTMHRAETVDQAPAPGYTYLLIEVTVENGGSAPYTVNPTEQHKV
QTPEEKNAPYNLQATALRTPKFQGTLRQGESGSGWLGFLAKKLEGTYTYT
FTHPEYGEATWEFTLQ
>STH2253 conserved domain protein
MGEWEPMKGDHTRRARWLAWLMLAALLVHGPVGAGAEEPLQPEVAEYPAE
NYEPGPGWPEGWTPPPPPEGMSEQMRAEVEQALRDNPDMYDRYDWWVWDP
LPAWTQEYPWYWEEDWMVCGTTIHVYAYPKPDTHLIQQVFGRSYVVGRLH
LKPCRYEPMQYIVAVRELSEEEKLAEGRGIEYEYWMDKSHPGLGRDYGYR
HNGSPEEIFVFLNFGNASSHFDTPAYLDTTVNRVRVPIRFISEMMGAEVT
WDQAGRRVTIHFPAVTREVVKAVPAPGYDYPDLIHPETHLPDPYRYLLQE
QTVSVPERTIVLTIDHPVAVVDGHEVALDAPPVIRNDRTMVPVRFIAEQM
GAKVYWVGAEPIYRLDDGTMSGRYQVHIFTPFFPLYEYPSWYLENRAVRY
>STH1456 hypothetical protein
MGKEDLRLDESGRPVEEDGGHRLPGWVYGLLLALLVVILLGVESILNWLL
P
>STH1069 hypothetical protein
MRRRLRLLAPLLAILLLTAAGCGEPEAQGPAAGGDQTDVQSEPPAEGEGQ
QDGQQIPLTLVYSHPDTGLTLGVPEGWVVYPLEDAVVALISPERGEEDFF
RENILVTADGQFPDPALDVYVEALEQEVRHRYPDTETLESGEATVGGVPA
RWIVDRFTGEKGETRVYRVVLVRDGIAYVLHGTAPVWTFDDYRPVFEAVA
QSITWVEPRAVEPEPEAPSETGP
>STH527 conserved hypothetical protein
MQQRRRTLGERLVSLLELPGDAVLDVPRAVLIGSMELVVENHRGLVEYRP
ERVVLRMPEGKMTVDGADLRIGFLSPDQAVILGRIDGLRYAPPEGGGA
>STH1895 hypothetical protein
MGKTVVGLFRHSGDADLAAAHLRDAFALDADRLNVIGEADLGAMSGARIS
DTDAWLLAAFADVGMEVGLDAGSDPGLDLAGSRAPVFRRWGDQVRRGRTL
VVARADDPDEAVAIAGEMRRAGADRVDLIGEENWPAR
>STH1503 hypothetical protein
MVAPVAAEPGVAVTVRYLTDVTWDRLTHTVVRDGEVVRRGAIRGREASFR
LPDAPGEYTVTFRYTWGGPLAGTWGEGLWQFVVRLQERMQRRPGCGIAAA
RSLCPLLSTPGRTRRCAGPPCRSRGRPASEPAGGRGCSGR
>STH942 putative nucleotidyltransferase
MDMSLPPDVAELMQSFASGLRRVLGEKLVGVYLGGSVSLGDYCAGSSDLD
FLVVTDGRLSPADLDALAVFHREFLTAHPSAGRLEGDYAPRECIIPEGTT
VPVPRCKRGAFVPEVDQVMLSADNICNMRENGIAFYGPPPAEVLPPVSPD
QVRAAVRAMLAEAPKPAQRPEEQADELLDLLRSLCALETGKPTTKAQGAE
WARRHLEPRWHPVVDAALAVRRGGPAAGWDEGMARSAAELDRLLRERYAI
AAQPPS
>STH1320 hypothetical protein
MANVIGAAVLSSVLQVIFSIVVLRRPMPRF
>STH979 hypothetical protein
MESDAGGAGGSRVPTSRKGGLRTAPVLVPALLAFGAAALCGGGLIASVAP
AAGCPPLAGAQPVAVTARAADGDAHGVTGYLFRNGEALIPVRGLVERTGR
IYWDESGQTVTLLGPRDVLSVHVPGGRAETRIAVLNGEVIAAHAVRCEEQ
VYLPADLVATVLHLEVAVPSDDRGERMSR
>STH1104 hypothetical protein
MPFSAVMCCSLAVAFPLWYTYSHQVHGTRVEGGNGGMSNSAAAEVFWTIF
SNTGSVIAYLLYKRFTVQ
>STH1077 hypothetical protein
MRIRTIATGLLLAAALVGCSLGPDRPLTELPEGAEFAPAGAVSGFAIRGE
LHLYPHHELVGGGLASFIVHVESLSDEGVRGLRMAVFYPEEIALVSDVEP
YLESPRGYDLTPDQPSYGLGRTFTFPDWGQAHVVRQLAEKPLRVHLVWDG
GEQFLEVPPEAIRVTQSDEPRRLANDPDLFSFPTHPAFLDHPPFSEGRYL
SRARDVTAADLLAWYRAEMPASGWEALPAPDRALLFRKGEMFLSLAAADE
PDGTTVMWSHLRGTAEVPEDAAVRIVRARYPESRENQWVATYLADGAGSG
ADSPVWEVRGLRDGKVWVTAWVDAVTAELRVAE
>STH1693 hypothetical protein
MGHAGTSRASSGGGSLTTIWRRGRTDPALRTDRNLCDNIMESGGGDMKEQ
ITRTEVYSPSGRLYYIDVSDAGYVCVYDADGRRMLRVNLKGISAKAMVKR
IRENIETGAPAYLAGWPPKDVDRILTQHAIACER
>STH850 hypothetical protein
MQYLVFEFPSEKELRDTVKKLWDGYNVSGEMAIKPIGNGKWRLEMYTEKE
LRESTLEKFADYRIEGD
>STH1663 conserved hypothetical protein
MQTPLRPLGISPADLEAELAARRASHRAQVDLLKARLAEAGARNAQLRQR
RAGLQAERERLEQEIQRVLEEFDRTGTEFEVRRREVAERHRRELADREAE
LEAVRKERDNWIALERQIAEGILAAVQPFLQLQAYLREQEGGRP
>STH1087 conserved domain protein
MRVNGLYMPSQVKVPAFILFHEQLGASAKVAWAGLQLRPRGQNHLSELCG
LSRTAIKPSYRQLEQLSLLRPWTCSKGQHAYLPVDLLRSRCVIAQAKITY
GALQLAPTFRDGQVEITVNQIARLTRCDPSTARRALRSLVDAGWVEVRRS
RRTGPFQLILSNPQLDAQKKAIAEINRRLEKAPYRGEALMREWLTLLIDL
DNFEDDASPGFLVNPYTGEEMQIDRFYPPNAGFEFNGAQHYGPTALYPSE
EQARRQLGRDLIKQAICLRRGIHLAVVHAEDLSLQGMLRRIPDCLPRRRL
DGQELVIAHLESRSRDYQQRTPLPMPVPR
>STH2147 hypothetical protein
MEMEPVQMRLKAPRERVFADWTEPDLLVRWWGPEARAEVDLLVRW
>STH499 conserved hypothetical protein
MSRDLPRPAAGVPPLLDVHFEDPTAPERLAAAIARQFRALPHPERHLPVF
FLVGATSSTGDSLGPFTGWFLRRKGFRGEHVGDLADPVHATNLRERLAEA
RARALRRERLPYIIAVDAAVGRPGRITVNRGPLRPGAAMGKALPQVGHLH
IMGGTANFPFMIWFTDLDQTVGMAEVIADGLLAFWAAYESGELFGRPSVA
RTGLA
>STH1639 hypothetical protein, proline- and glutamine-rich
MRRLARSFRSAGGPQQNPAGSQAPAQQPGGPQDAPPQGRDQAYLEPQGAQ
QPPWLPSLNGETAASMLTALRAAKQQLVQELEANLKQLKAVLEETHRIQK
QMEAVLMEARAEIGLQQRAGGHGQDQQPQRQQQPQARQPQAWGQMQQPQQ
LQQPQAQGQPPAQGHLQQPQGHGEPQPPQAPGHGRPQQPQAPSAGATGGP
RGAQPPSPGAPGPRRPRPPLSSDPPPPWEPPEPASEPGWSPWQPPV
>STH326 hypothetical protein
MPNPVREIMRIQDRRAALGSVPGQQQTISERSGWSSQVTGLGVDGHPLLG
GAIQLVAGAGVTLTQDPVTGRIIISASGGGGGGVGYPRFDPDEPPAIPSQ
WDDEFNTSVLHQKWTPLAVHGQGGFQAGEVISMLSCYGGHNDVNPMALLQ
SAPSGPWVMTAKVINAQQYLNGGGGDGSACVGLAVAPAADQGDAMDTWVI
GHYSNANTYIRSEIWAKPISWGGSKGMFTFSSPSAYLRIVWDGTRLSYWA
APDGLAWYQWVSPYAPNWTPGRIGITVRGAVGPSYVDWFRVTTSSSFTRT
TRHSWC
>STH1781 hypothetical protein
MAYTLQTFIHHKVFGNHLSKCKPLTYRKEDWLHLTRGRERSVRLIIRVML
GVPSAHHGPNEHAMWCFQFPKGSLLTVHLHRGTVAEISTYEADKDELEEA
VDYLLEEVAARLRQL
>STH32 unknown protein
MGIAVLATSFLLLVSARQDFHRLSPEPREPGQLAAQLVSLFLEGAAAG
>STH425 conserved hypothetical protein
MPEEEGDSMEEREKLEKMDLLRNRFKISYARAREVLEQNGWDAVAAAVQL
EEEQTPSGFFTEELKVSGRDLVETLKRILHEGNVNRIIVRDPKGYEILNL
PVGGAVVFALVLPMLTALSAVVILALDYTVVVERSS
>STH2031 conserved domain protein
MRRWGLAALPAAALAAAVALAGCGGKQSAEPKLAVTASQEDGALVVRIST
ENWQPGKDGHVHIYLNDGPEAMIYGYTYRVPGIEPGRYKIHVELANPRHE
HIGVSETIYFDVQP
>STH2274 hypothetical protein
MVTIRVLAALLIVGSLLLTACDYGGEAYPTQLNELVQMTNDLSESIAVLL
QADLVTTAEASSTKRLVTGLQSQVHDAVKALEKEKISVSLFWLNNMLEQL
DLVVTAVHARTSQGGDVLTPDERTALSNGLSLLSEMGRTALDLQGEEMTA
SRRLSETLRAWDELASDMPSLY
>STH2717 hypothetical protein
MVVATILLTTVTISHAMLSFGGARHSRPHRSRRTCHSYVASMEVDVSFGL
LCAVHRSQVETLNHEPYLVVGMPHRSPEPGLGVDLEGCPI
>STH1012 small multi-drug export protein
MFVTRLRRADTANPGGFLVFNWSAALKVMAISAIPFLELRFGIPVGIVSG
LDPAVAVALGVLGNVLQVPLIIFIMYMLRRIAQLVPWAARILARIDRAAE
RHEAKVRRYGWLGLALLIGIPIPGTGLWTGAAVANLMRMPLMLTALSMAA
GVAIAGVLVGAVTTGAIAVIDLF
>STH522 hypothetical protein
MVRHRHRTGRLRPGGVSLVNLFSARAGPGAPLTLALLALLEHSDRSVLEG
FLALAGLSPPEGAEASFHYPAPDGPPGAGEIRLGDRRIRVAAVAPGEPPP
EFPPGTAEGLLVGGPAPAGPGGLPGAEEGEPVRRLSWERLDRWLEEAAAV
HDPDTRTGFLIRQFRAYLPEAGITWFRGFDAEELDRAPRAFRELSAFHRR
VGELFEQVGSGFAASWPLLRTARPEDLLAGYWFRDYAVGSGGGDFLRVAL
DLGAGELQIALWFEPGGGAHGRLQLALAEEGGLSRALARLTPPPVLRLWS
AADERHIPVGDLDPSRIHAVDWEAYTAAVQVARPLTDLAGEGLPDRLAGW
TRALLDALAPVLSEVVH
>STH1678 conserved hypothetical protein
MDVESPPVTAGGRAHREGRPMMPVGPILILLLLVLAATGFLTGPLRRIGL
SGRAALLLLAAMLLGSALELRLAPRLTLNVGSGLLPALLSLYLLRTMRRW
WEPLAAVGGAANAAASLALISLYFPPGLPTELNLFYLDAQYLYAAVAGTL
GSVIGHTRPASFAAAVWGSLAADVYHYLHYSGAGHGDIVHRMGGGGFHGT
ALVAGVLALTLSELLQVGAPERRAAAPPHLPTS
>STH1854 hypothetical protein
MSPSFLQQLVDRLRQEPRVLAVYVDRGLTAPRTGEVQEITLHLAVEGSFA
QAADAWAASLAELVYSGPESHGWTLITPDGTEWRLCFHGPSADFAEEGLQ
GVFDRRAPSRVREGAQEGASPSPGGAAGQAQTDVAAVAAAFWRDLYRAAR
AIRAGRALTAHHWLFACGGRMIDLYRLALQPGPSGRGWEGADEVPGMARA
LEPVREFLSSPLECPAQRRAARRLGEAFEGLVLPLCRRIGATYPMALRTL
TFRALEPEPDAAGGGGAAPHPSDATARGRGSGSGSGMNV
>STH3325 conserved hypothetical protein
MDDAGAPAVIAAAVERCAAGLIHRCDGLVALCIGTDRSIGDALGPLVGSR
LAVALPEGVTVLGTLDKPVHAGNLTEALAWVARNFRRPLVLAVDACLGRA
ESVGFLAVGSGSLRPGAGVSKSLPPAGDVYLTGVVNVGGFMEYFVLQNTR
LSLVMRMAQVAAAGLEQGLRALLMRHGPRRQEQ
>STH1347 hypothetical protein
MRRCLAYLLGHLIVAAEYLDTDVLQQYVAWSEELGVTLPEVPYRPLPGYP
PYPQVEAREG
>STH3228 conserved hypothetical protein
MEERTHEGATHSLSIANRERVQITGVVAVDSFDDAEVNLETEMGLLTIRG
EELQIKQLDLEKGQFEISGFVSALQYASPRQRTGRQQRGRSFLERLLR
>STH1631 hypothetical protein
MLLLLSGAAGAASGTLENVWILRNGFTLTVNGRAVEADNLLLEDRAYVPL
REAAAAMGMAVAWDPATRTASVSPRSAALRLAAPVDPFNPVVDLASLGVR
AEDVEAIGLETAGASYVAVRFALEWDQLHMTDDQAAAEGRPVLALHTDYV
LKLFTRTGGMLRVGFTTGGLPELTPTGRRQIVLVPPSPEQGFHWPYFLVL
PSDGHRQANAGHRRYLIVDINNTGPSNSVADTVARTRVELERGQLPSARL
AEELWAPMLMPAIPRPVITYRYGGQQNTFLTHALDRDTATLHLMMQEPGL
AALLERQFRQAGVTVDSLMHLDRQVAAMIQHAVGYLNEHGFGVEEQAFLV
GFSASGTFVDRLAALHPDQVKAVVAGAALDNMLLPLAEHGGERLIYPIGV
ADYEAITGRAFDLEAHNRVARLVYMGENDDSNTVLYQDSYGDEERRIITT
LWGEEVLPRARALTALYGESGGHGMFILDRGVGHEYSEVMYQYMKAFLQA
NRDADAPAYPLPDDPEQLPFTLYAP
>STH2238 group II intron-encoding maturase variant
MEQVVARENMLAALKRVERNGGAQAFGRGGGVMMRSGRHVSTWRKGTTGS
WTWTWRSSSTGSTTTC
>STH233 hypothetical protein
MLEGKEALLNACRKRNMRVCETPINLLESDVQAEGNLDGMEVALEMNVIS
GMPGESRGTLTFQGNLSSLLAVLARVWPVPPEQDEAVSPGRQSWPAPPRK
VGKDTPPM
>STH332 hypothetical protein
MGFWGRRKERLAPLVLTTAPCRVNGTVCCSWQGACSGTNPTRRSRIRRFP
YPLIQKHACIFFGYLPYITSGIRADKGGLSPVIPPITCDKGYPCVQTQGS
TKRRAPAGLIGGGCFVFQGLHCQVVGIPQRASMRCRKSAYSGVLLSFRRR
RRKNSLVVGV
>STH216 conserved hypothetical protein
MGRTLMAIGALLLVVGAAIALFERFLPGIGRLPGDIIIRRGNFTFYFPIA
TSLLASIGLTLLFWLWQRLGR
>STH2230 conserved hypothetical protein
MKKHDRIGAVVLIVIGVAAAVYSVIKLKVGTLRVPGSGFMPLLASLAVIG
GSIGWLWEVRGPDPDPRPLWPDKNWLRPLLGLLFLILYALLFKPLGHLFS
TLVFLGLWQFLVERVNWRRATLVTLLGAAGMYLLFVVLLGVHVPTSPLGL
>STH2213 hypothetical protein
MLISTGQAGAEDWITKGGTPQRMSVVAEPLGLTEITVYWETKPLGESATQ
PAVVNGVIYHLAGPYLWRLELDQLGRPKGDATAVTDPVKAYKIEFNRPPG
GPVIAPQSSPTYSPDSGVLYFGTGYGWLWAYNTREGWYRPAELDLGCPIV
GSPLVIHDRGRDIVVVADRPNYPGEEDRPVGRPLCPRSHGKVWVVQGLDR
PDDVVRWQNYQAPTTKQAEDGFGGYITPSAVSAPVHGLNPSFVVGADGFE
GGRAIRLALDRNNDYRPYMVWAVDGPAGFAGNFTTDGTNAYWMDTRGHLW
GATLHKGHEPTGWGTYSIPLPALIGAKAAFTNTEPAVEVRQTPDGPETHL
YVTLRNYSDTGAGLRDGPTGSDGAVVAIGSSGELKWYRKFGPADRLDGRM
ASLNTAPLALVSRGALLFGDVNGFFYSYSLDNGNPSGGSARAALIHPESR
TPVDRLFLLKAGERPNVGPYNFSQVSGVGVDPAFAHGLLLVGVNFETDGG
TSGRLVAFRTGESYDLRWLDAPAPLELTPGTPVALEPRLQLEMTTRTLAA
LCPGPWTVRWFVTDADGQLVRPLGEVPLPGELGPQQPYPVPLTVTLAEGD
PAEGWIVGVIDLPTVYALSTAHTSNPKVALARGMAAAQGLPAEKTCAGAV
AELIERKGEPGPEGGLANNVLIVPYAIRQPAQVVVDDPSVVDLAVPSMAD
AGRPFEVGIYLGYQNNLGRSAIRVPLRLWARPESGGSAPAGGWQTVTIDR
MPCCTLKTGTIPGLSAGTWEIVAEIDYPEDTRPENNRVIRRVEVLSSKPV
EAGGPEGGAITD
>STH1323 hypothetical protein
MQNRRAHMYEFQGRDWTELARAWGISLEHEDDELAARVRHYMRTHVSADA
TPDPAMVADLRRFVAGFCENAKERPDAPLWQGLRDIQHDLTFVQFCDVLL
RHMWC
>STH2451 hypothetical protein, glutamate-rich
MRRAAIPGPARARRRPRASYERHPTRSQKGGIGTVRVRQRLFSLMLALVL
AVGAVTVSGCMPQQQPESQQQHSQGGGEDREKQEREEGEKDQEEKETGGG
KGGKSGGSGGSGQGGGQQQGGSGGGSQDSTGGSR
>STH970 hypothetical protein
MGMAAVLSSRPEVSREVSMPMPTGGPVRLPAQVAGSMSRMQGLLQAAQPE
IQSAYAAHWLAARQAAGRPYFQDLSQHMMMGLYGTTALGGLLRYALSGRA
TPEVVGGIIDQVQLIRQSYEGAQQALERFLEDEEVRSLSGVQLMSRTLPH
LDRFYQQMEQPGQAVLNGIEWQPSEEVQGWPVRRAEVGGPARGWQPPLPR
WESPGELDRTGDVSELPPPEAEA
>STH679 hypothetical protein
MREQLLAARNLVMGTTNILLLAVPDGWRVLEGPSQPEIDRWSERADQRWM
SEGRATYRLVAVAPEAPEFARAEVELRLTATPSAPDVHDVRHRFTTVRRG
LLPRREVPAAEVEIDCPFTGRRLRLELSPAMRGNRPTARPEDLQRLVHTM
LEGLRCH
>STH1740 hypothetical protein
MKREPEPWERSLRDGLDAVAGAVDGEQPPDLGALIMLVDDVQRAQRLALR
RDLRRFVAVAALILFGWLWAWLQFPAYFLVAQGLLAAALAAGAALAQAAG
RRVSHE
>STH1429 hypothetical protein
MRPQALLLPLVSAAALLPAVAGLSQGPVVGMAAVFIAVIVIAIVIIVVIA
AAVVFIGMRAGHRPGVPFFLPAFLPAVRPNRRRGRRFSPTSCTPAAGVT
>STH2426 hypothetical protein
MVLTVIGLALLGAALLLPNVLALRLAFAGAAEREAARVEARAAARRSGGQ
AAARVRVRIPAPRLVRRGYLWKALSPLLFALSVMFGIAVGGTTGYVLMGL
GLLNLLFGLWGWDEYVPGHTGKR
>STH3306 conserved hypothetical protein
MTKPSGRAREVLLDLLLAALVLTSVVLTVRVWYPEPLFGDSDTTEPSLQQ
QPVPIVREMPEIFRPERIVVARADGGRAELHAGSPTYSTSWQRIRKALTG
LDVRGGATLIDQVPRGDAGAPSLELFLPTALQVGQWADLLQWQAPFLRNG
SMLVDRVIVTLGERPAVYLSGPLGFELYLADLPEDQRAALVAHVERLDPT
LFGPYRELVLDGLQVTAAPGLMVPDVKAWPAAEVTVLMPDLWEEEARYFP
DLSVVRQIDEQDARSLTDGRRLLRITGAGVLQYRTAEGSAPAASPELEQA
LEAAGQWVGSRGGWPQDVVLHRYERDAGVGRLAFEVHTGGRYPVQSLPGA
MQVHVSAAARVVYFERTPTVANVTFEEALLPLVPPEDALAHALPAAPLLS
SEPVRAMYLTYLLRPPAGAGEPWTAEPTWVIQAGSTEVYVNAVAREFPLP
PKVLH
>STH1295 hypothetical protein
MGGRLSPQTVVDFIRNKWTDSTGMSGRIRPEYARAGRTFVASSVMEARVS
ALERTVTQHDKRIGTLDRLQTTTEATDAAMQQDIGEAKEAAQHL
>STH170 hypothetical protein
MRVDERKIRQYLRSEATRVVPPPDMWERIQREMDRDRLRQERRALRRRLF
SVPRVAVAAFAVVMLGFLVAPYGLAARQMTLLPGRWFGIHRTWIAALTPT
AWSEAGRPAVTDLRWSSQEITLLR
>STH2382 hypothetical protein
MMIIYLTARPDRGPGLLLGLRPTRAQRRLLGLAPWRRLPVPLPRRRGTCV
SLARYRYKRLAPGETAFPIVGSRRQSLRSAPTFFRLRVLMRLPGADSDFT
GSGARKFLTGPSGSCGLRLPICPARSSTELSRYETVVGKVRSQV
>STH1334 conserved hypothetical protein
MGDQSSLLLASLLLREPVMPDPAAVSARLAARCGGRYTVSWDEAEGEGDI
TLSCSVSGHTVVLGLMRAPVPAADLEAACARNPFWPEAAEVCRQHRAHLM
VTLMKGWGDPIRRHLVLTDFVAALSEAVGGLAVLWGPVGVLQSAEYFREQ
AAEASADHLPLFLWVEFALVRHDGVPFVATYGLDAFGVMEVEGGSSRMKP
IQLLERVFDVAHYLCLEGPVLNDGDTIGGSERERIPVRHTDSILDRPGPV
IRIEFDAAGGRRGLLSRLFGR
>STH1796 hypothetical protein
MTEQEWQQHPEREVPALPGQEPHFPGGDPGGRPQGIHPGRSEGQAALSSG
DSVPVLVAWFADASSARACLRALELRGVGVTLAEEAPPGHTGPVAIHASP
SVGEKDPVMTERGMGRGAVLGATVGATAGFLAATYLVPPLGAASATGAML
TTLAGAGLGSFFGNLADQARDDSGNPAGQGGDAAAAQESATLRFRLEVSA
RPTQVDEVMALVSAWDPEEMRLLPTGAGEARGYPAGRRGDGDGGRIPVEG
GPA
>STH3323 hypothetical protein
MDLSAMSPLDWVVVVILLLAVGAGWVRGIVRVLLGFLSFLAAVMIAGRAT
GPVVAWLDGMWNLTGRIADGILDRSVDAGGALSASLDQVAIPQPYKAALV
QDVARATAESGEASALELAAQQIAEGAATAVCFVLLVILLAAALRWLGSL
FADVVQSLPIVGLSDRLLGAAALGAAAVLALNLVLVWVMPTLSVFGLRGL
GELVSQSVTPPYLIQAFEWMRRLVIGGGLRLWNG
>STH1385 conserved domain protein
MVRLTRCRRRTLLLLAALALALLPAGALANAGPPRRAGDTGGPLLPGTSD
QVHVLGEELRFDLGPDLGSATVTARYRLANRGPDLEDQRFVFVVQDVGGR
TDLAVSWNGEPVPVRLGMDQLGPEELAEMARAWTSVDQWLDPVTGEPYEA
EFYGSDPVLRYYLFSLDMPAGAEGELAVTYDHTAGSDRTRYTYRVHHYSY
LLLPARGWASFGPLQIRVAAPAGSRYYFAANLDFRDEGGEYVAEYPGLPE
ANLTFAVMNRAGLFLGPQPGPYYGLGFALVLVLSAAVGVGIGRLAGRLPR
PALATGVGVLAALVPTGPALVWLSVLLLSSGLPQLRDQAYASAFAGFATG
LVGAALCAAVAGWTARGTWRRRRPAGGPG
>STH699 conserved domain protein
MKLWKKTAALSALVMGVSAAPALATSVPADTPVDATPISAPVEILPIVAP
EEAGSFLRLVGPVEWVDLEGGFWAVAGMRLIGDQEEIAQFAGQEVVVEGT
EFTGISFHMVPAIEVMSIRLAGEVEAMAPVLRDVGADAPLPREILVNGKP
VARELGSPVVADGVLLVPLRAIAEAMGAEVAWDGEGQLVRVGLADRTVIF
RIGQQEAEVLDAAVAPAGSARISMAQPAQIVGDRTMISADAITTVFGLRQ
TEAEEGVMSLVPGDVADLTAPEPVTLPDDAFHDLLTGRIRQVEDGRILLE
GPPMANGEPMLIWLTVSDDTQITVGEGAGTAADLQVGAEVIVSLSGPILE
SFPARGGADSIQVLPAPKADILTGVIKEIEGGRILLEGEPMDSGEPFLAW
LAVDENTVITIGDAGATAADLQVGARVEVELTGPMLMSYPAQGGAARIRI
LPAE
>STH466 hypothetical protein
MRRALATLLLVLWLPSAAASAPLPAGAYAGRLEEAAALLDEAERLAQQGD
ETAARRRIYAAADRLSGVDAVWADAGDVEADLTDLMDSLRGAAEDPEALS
RARGLLAEHLHAAADLVEAEPVHAPGARASLDRALAQVAGQSLLDRAREW
VLGLFTRPLQGKELGPVPPSVWWIGGAVGALALAWAGFALYRGLTGHGAG
TEGTWREGRRTEPARPPTPAELLHAAQEAADRSDYLAAVRLSHLALLQHL
DGLGIIRYQPAQTDREHERQLARRRPDLVPALRSLHDLLEGLLYGGRPAR
ADEFRQAESLVMQLWREGDATSGSAATPGPSSSA
>STH2473 conserved hypothetical protein
MPRRRGRWRPGRRGLDVPALQGTILVVDSRDRRDIAALTSTTESRRSAAS
GTLTYGTILRLFLPLSLSDVIMVIAGPILTIGLTKLANPEISLAAYAVAN
NVAILLESPIIMLLHASNLLSRYRETYQPLRHFMLWANALLTALYALLAF
TPAYDLIFRTLLGQPDAIAAAARPAFQVQLLWPAAIGWRRFYQGILIQHK
RSSVVAYAGFARIGSLALVTALGVMGRAHGATLAGLALVVSVIVEAAAVT
WLARPVLQAGVTDPHAEAPDWAPRTVGQIALWYWPLAMTQILVSVVRPLL
SGGIARSVDPELGLAAWPVAWSTILMVANAVRMVQQVALTLLQDRRSYLM
LRRFTLTIGAVACCVMALLAVTPLGRGYMEWVLGLKGGLASVADAALPAL
GFGVVFPFYVALQNWLQALLIKDGRTLVVNAGAIAGGAVTLAAVYAGALI
WRLPGAVLGIVSLLCGAAVELVVLVRASRPQRQAWLAR
>STH1931 conserved hypothetical protein
MPLEHLTTTSGIGAPKHRCTVACRTGTGESRAQRGAAEYGHNGVPTKRLL
LGRGGLLVLLARTNTLVRLGILLALSVLGAYIKLGPSSIAFDAMAGFVAA
LLMGPAAGALICGLGHVAVAAVTGFPLTLPFHLASAAAMAGVGCLGGLAA
RRFGLVAGAAVLVVANGILAPALLALLPNPLGLGLFAALALPLTVAAGAN
AAVALLVVLGLRRAGVEG
>STH2226 conserved hypothetical protein
MPPRAVTAEMPEANYLIGHPDGKYYRLLMRCFYERSLSHITYVRTDDLVA
FVQQYLPYDDATCRQHLDQMEKWGLVTLIPEQSKPANLMELRQKPRVYQA
SRIALRLEALRAELEQEEGAASLDPAALDLLVQRVRELADFIEQDGLHRD
GAAPEAHRLWAGVYESFGAFSRRIREYLEDLPRHRPREVLDYEAFRAYRD
LLIPYLQDYARRLFDRREQLRMRLRVLARSKELLATTAAVVEAQQVRADG
SRPDFDRQKERFLREIDALIGYFGDQGDVDVLLERAQAWVSEITRHARRL
SEQHLGGSVREQTLLDLARRFSECTSLEQAESLAQVVFAATLPLHWRGEA
PPPADKDPWEAEPVTVPLYAVRRGQRPRQQPEATADRTSEALQKMLSAKA
ERDRAAQELAELFGDESELDLGDLTVREPRQRQLLLRLLYKALSQQGPVG
VGYRNWSVTAEVKADAPLGRLSAADGTATLPHVILRLHKGGPR
>STH1758 conserved hypothetical protein
MGGDDVAVATRRDVWLLARELAEAIAETPEVQEYRRTEDAVLADPDAVAL
IREYEAAKRAVKLSRGRPPEEQKALVERFLAIEERFNAHPVIQAYWNARV
ALDAFMERINAVVTFPITGETAPRAKGGCGSGSCGCGG
>STH2583 conserved hypothetical protein
MADLLSTQLLKLQCLVCENTGEACIESRFKLPRKAKKIAEIQARIVDLEA
DVIDDQVCISGVVHKQIFFVGEDHHVHHVPEDVPFTTFVNCPGAKAGDIA
EVKANIAKVNFNLEWGQEVVQRVIVQFVVRVSEDCQVNVRLNPRGPLVKA
ECVVGEATKAVTIENCIELDRRAIKIRDLQVSLEDVKAEATSDQVMFQGT
LVKNIFYISDEDVELFQEERIPFSGIADVCGAEPGDNVTLQAQIVRVDKV
LSDGVELRQRVVLSLFIKVTRTCEINVAEDPTGPQVMASRVIATGERQVL
VENVTDLNKPAQKIQEIQARVEDVAVEVIPNKVIITGTLHKQIFFVGEDD
IVRHQGEDIPFTTFVDLPGVNPGDEISVTPVVEHVGFDLLDKVWVSHHGD
DCDDDEGRPVFRRLLQRTVILLCVTGSQEHPIRIAQAPFTGLAAG
>STH1305 conserved hypothetical protein
MDDRRKVGEMRPSQLITTFGPGAIVDLPDLSVLVAGLNHWDEDLCQPIPE
SRLTARLMIDRVLAPPMNPRNPKEPNLPVFRFPTFMVCPDCRTLARYDRF
VADRNGVLYCYHKGTEKETDKVSEASRVFPVRFMIACPSGHLSDFPWHAF
VHRDKPCPLGDKGRLTLDESGTSGSIRDVVVRCSCRPEKGRSLGDAFGKD
AGKVLGRCPGRRPWLGRDATELCDETPRALLRGASNAYFPIVESALAIPP
YTRPEFDYLDKVRDQLEEASSFEEFRDQAWRWVNREVRELFTPEQIWTAW
EERRKLRRGPDQDLLFPEYRALLSTPYSDPEQDFEVEDQPVPPLFEGLID
RLVQVRRLREVRVLAGFTRIDPPSDITAIVTGDEETRSTARRAFLGPLQP
EGKRWLPGIMVRGEGIFFTLNLDAVRAWEERVKDAADAMAQAHKKYCTDR
NIYPHPPFPGARYVLLHSLAHALIRQLGIRSGYSTTALRERIYARNTPDE
QMAGVLIYTATPDSEGSLGGLVEQGQTDRFGEALWHALQEASYCSSDPLC
AEHEPEAHGDLNGAACHACQLLAETCCERSNRFLDRSFLVPTVSNPHLAF
FGGVL
>STH1182 spore coat-associated protein
MTERPPGSTGPGGRGLGGNPMSKKTLMVLGSAVAISALSIGGTLALFTDA
TDASKDFVAGTLCLRSDRNDGDPVPGPMFYITPEQGKTPSGQLGINATGP
WAPGDSHTRTLTVYNSPDCSSMDAWLVSVEARLTEGIEDQYEPMAEKLHV
KVKTPKRGGPDEVVAEAPLSDFLAGPVSIAYPDGTKIPLHLTSNRHMKFE
VTFDKAAGNDYQDKTLVVDFIVHAEQMKNNP
>STH1599 conserved hypothetical protein
MEQIRAAVDFSFQEIERLAWQQTLDCFREALVEALSAIDTALYESRDPSR
YVYKEMRSRSVVTKFGPITFQRRYYWDREEERFVFLLDEVLGIAKRQRVS
DSVRADAVEASVTAGSYRGAAAELERRDCQVFVSHEAIRQWNLQTGRALA
AAEKQQQMTLAGTRRVRVLFIEADGFWPARQRGKKAEVRLFVIHEGWIQR
GPGSKEYSLVNRRDFVPEPGRDSWEQLSELLESEYDLSETWVIINGDRAL
WIREGVTWFPKALYQIDRFHLKRELNHVLRHRPQRLEQAHAALEANDAGR
LLAVLDEAHKAETDTDRRGKMRRLLADLRTMPESIRDYRIRLQERGVNVE
GLRGVGAAEGAVERYSARLRKVGRSWSESGLMAMLQVMAAYYRGALRGAV
QYVERALGLESVNAAAEKVRHRVRETVGRGVDAARHARMPILNAGRNNSG
GYSKTFRGFAGLVTR
>STH1281 hypothetical protein
MPKRSGEGFAGTRNRPGGSPAFAVPGSRARRSARRWARGVWVGCTICSHP
VLAEAAPVSVWSVRIEYGSERCPGSVQSLQNELLFGVNLGGVRFTRTE
>STH813 hypothetical protein
MYRTLQEKDPWQAVHSIQAYVMRALAWAFSDRLEATGAFQHTFLPDYMVH
LQPGPKLAVHLRFSEVSALALAQSMPTKSRHVKHVILVNTLPDHLVEDRK
NDTEYKRRNVLVTSFDIIDQIIRVVELVEEHGPYSTTLRIRNLQRELVRW
FLDSYGWRPAPRLEDFIRHQPVEPRRTLRLRSTPVTVWVQRHPDGYAAYR
YLRDSTSLGVRSIQKRSFKEMGRGASFSRVNRIRDVAKPTVLLTWTDQRV
HIPVLACTFTRHAPMGQNAFQILPQAVACAEIGTPFILVAPEKAPVRRMT
GEVTVEKAHPLLYQALTRMMHIYRIPILMLPWPTDDDLNGLPTLRCDRRY
PSLPDRATSAIQLLFRCVDFVLEGHSYTSAVSNLLGYYGVSRRRLEMEEA
RFRLGMRAIQEPPRNSGLLLHTGDTLSYIRETTGVRNARIPFALRHRPTT
VVFTTRSKSLRADPYLGTLLAFDFAFCRTGPSVVDRSSNLFAQFQDVHLD
EALTQLFENAKLEQPRFNRNALFLQYADALIFKDGMVTKRGDKWVVYR
>STH3331 hypothetical protein
MATLRRTQALPRRDLDCCRRTKSHEQHSSPTRTPRSHGLSRPDRLRSVSR
GVWTVLVFHAEHRAPEGSAISRRWPFVPVAYRLQCRSCPPLSPRVSCSTV
FHVEHRVCPALTTAASRASVSTPCQRSTWNTGPSGQHKLTTANRALPFTR
TPYTARVRRRHTPQAETASDPSPTAPRCSTWNTLSGCLPPLLPSHGDTNL
PRPPAHLLAARERARHAVRSTSRP
>STH1144 hypothetical protein
MAVFVCTQCGAEREGRCKPQKCACGAKGTFVKKEEAKKEQ
>STH1717 hypothetical protein, glycine-rich
MRRADGGEALVDLGALASGVLWGLLLMVAAALTLGVLDFWFPPEPQAEAA
RTLIVQGVAAALAGARSTWLAGRGGFMHGLAAGVGLVAAVTVIIGIFRDL
PGILGLARGLAVGAGAGALAGVAAVSLGRR
>STH328 hypothetical protein
MQAAGSEVRRMEEGLMKLAIEQGLWAALFVGLLLWTLRQNNARETRYLEI
IKTLGEEVRARLDRLESLASRGRRDE
>STH2570 conserved hypothetical protein
MENVSSDTPILESFEQWKQFLSDRVKEFKRTGASQQTITNMATSIGNFLA
EKVDPRNREQRLLKELWDVGTEEERKALASMVTKMVSDGKVH
>STH1676 hypothetical protein
MGVRPTRDALRGRAPAPAGRRRRVAMLLSALAAVFSAQVLLTGCSARPLP
VQPADAVAAALAHPEVADWYAAHSAPRVLAGLNPAAARGLQRFRPDALVD
LAAEGLVVRFTSALGEPPKRVDVLIDKDSGQVLDVRMGGVGWRGWR
>STH2373 hypothetical protein
MPRREIGRDPGAWWRPRLRSRIPSRGPLPLRRGARSAPGSSGTSWTRAGP
VRLPASGSMCGRRRSGKTSPGGGRPMVTSTMCSGRGPWAACGR
>STH1871 conserved hypothetical protein
MVQERAQVMVLGTFHMENPNLDYVKTGYGDVLSEPYQQQIQEVVARLLRY
RPTHVAVEAPPDAAPALQERYDQYRAGGYTLGRNEIDQLGFRTAAAMGHS
RIFGIDYRLDLDFSAVLDEAGKLGLTGFLNAFERMTRSVAEAQERAEADG
GVLGLLRYLNSPDHDAMHGVYLQIAAVGAGASYVGARLVADWYRRNLFMF
SHIADLARQPEDGILVLVGAGHRPLLRQFITESPDLEYVDPLPYLT
>STH1090 conserved hypothetical protein
MLAVWGAWTVHLYRQYQALAYELEAERQRNFAEMISHVEAMRGLMGKSLA
AGSTRQNALYMGEVYRRASLAAANFMALPLPEELGAATGKFLNQIGDFAY
SVVRHEAAGRTMDEAQRQELARLYQAATDLTATLRDTGQASVSEGFRFAK
AGVGLSDLFTAWRERRAAGGADLTQDQAQKSLIPPGLNQVGPQMDQMPVL
VYDGPFSDHLEQRTPAMGGPAITPEEARARALAFLPEGVTADALEVTERN
GRVPVFAVRLPPGGGRPAVTVDLAREGGHLVSYINARPAGEPRLTLEDAR
EAGLAYLAAHGWSEMEPTYGEVADGFATVQFVHAPGGVRIYPDQVKVRVA
LDNGEVVGVDARSYVMSHRERGGLTPSVTREQARAAVNPELQVEEARLAL
IPTEAGDGEVLCWEFRGTLGEETYLVYVNAHTGLEERILQMLITDSGTLA
L
>STH1771 hypothetical protein
MDCLTPSRICRSGRGGSGRDSQSKNGSPTKAGGSSRQPSGTTALRLRTEP
PFRICLRFTVTSSTSCSLLLRPLYQPRHAK
>STH1185 hypothetical protein
MALSSIHAAQAGSSIRPTWSNIAFNSAAVAGSIRLARSAPPSFTRREAAA
QRAENWPSLAATWGMDSTWYRSQTARDIGTSDLSRALLTMRPLISDATFP
GLLPYTRTRWAKGSIRADQVSADPPEPWFQRVGRTQATCRELRPGATRRS
RSTGSPARWCCAACSQSTP
>STH1489 conserved hypothetical protein
MSARGPGIPMWVAGVLRASGGRLPADLLRGGMGMAGGVPGVDRRSAPFWR
GLLVGAGAVAAALAMGLGYLATAGVTVTVTGDLLVARMGEEVEAAVRREL
SGVLRQAQAEWPQRVQEAARSQIAAVRVEVAGLSLELPEAARTQVERAAA
EAAQAAVESVAGNLPVDELAERIGARAAALARERLPQVLAEQRIVVEAAP
WLRVPVRLQVR
>STH1706 hypothetical protein
MNRTSENRLSPTGPRSVPVPWLRYLAPLPLLGFLSLRDPLYGAFFAFGAF
LNWGSENRWLRAVSYLGFLGFLR
>STH227 hypothetical protein, proline-rich
MRRPWLRAAALILSTALILSGCGSADNPPPSPPPQTEPKPTPNPTPPETP
ETPSPPADAPEDPAEPEPDVPALPGDTEALREALRVQSPITYTIVLAHDA
PREMDKTAYLDQMLAEQGYPGKNEILLVLFPADNYNIRFAMGSLVFDRRI
TLQQMLELVQSQYLTRSRQGDPAGGLAALINAINERAK
>STH1436 conserved domain protein, glycine-rich
MILRKVREEGGFRVRLGLRLGLRLGLRLGLRLGLRFGLRIGIRHGSGRCA
GFRVGVVIGRLIGQRRCVGRRLRPGGVSGRRGSLL
>STH1662 hypothetical protein, glutamate-rich
MRQSNGRPLRRKLLGVAPQEAERQLREQAERFALEIQHLREALEQARAEE
ADLTAQCEALAAEVQEAQRKLERLQKGLERSRTMAPIQALVLAREIADLE
DEHAARLAELAAEQERIRAEIAERRASLQQWVTSLLESVAERGGG
>STH2199 conserved domain protein
MAVRGWFTVRVDLVSGAGLHFAPQPGRIFLVGPSHTFLDLATAIDLGFAR
WDLSHLHEFRFPDGRCFGIPDDEDPRVVDYDTVRVASQVKRGDRFTYIFD
FGANWEHHCQVLASNLDPLEVLGHVPQQPTPIWGWGWIPDQYGRRSYGAE
A
>STH988 hypothetical protein
MVVAYFERLTPTRWPADAHIQTPCDPLVAAAEFSYSNPGFAGLLEVSVQG
LVGGRPVLKPVLVQVDGPGRCRLAPPDSRLAVPTRELV
>STH842 hypothetical protein
MADDLEALLLHAFIDLIEERKAAGRRELVATHETIAQWLSDRTGLNVTPR
HVQYLTLALRDGQIIDIGGGGIGRPNTYDTREAQMGTDAFWDQVEAFLMV
WRMPGREALRKADPGA
>STH774 hypothetical protein
MLRRVAAVLLALLLVGSAAYVNRDRSPETAAAGTPGGPVRLLLAVPAGSG
SMALRPVEPDTLADLPGFDPIPVPLGLTVIDTAGLRAVGCLDLPVSDLAL
APDGRRLLLWGASWDCTTGCAREGHGVYVVDPARLTQVAHLLPGNVATPR
GFSPDGRYAYVEEDTGQGKLRHRVLHLATGRLGAGPVMRGPALRVLDAEL
LRRDREPLRPVRGHQEGGLQPHRAHAGEEELGLHRDHVAGLQRGGLLRSQ
DRPFVQLEPHAVADEPYPAVARPHEVLLQARPARDVHASLPELLAGGAGA
RLPGDHVQDLPGGVVGLHELPG
>STH672 hypothetical protein
MRPHDLPACGWRPASRCACSPSPAFDCGWPYGWPLRSAGCGDRARKTNGP
HSSRCGSGRQPYPQSPGLPDPVRCSPPGRMRQYNHSRVIIRLFGRPSFLP
GFRC
>STH2020 spore coat protein F
MSSVISAIGEAMGMKPDDRVIAEGLLTAAKLKATAYCAAVLETTNPDLRR
LLTTHLTDALAEHERCTRLAIERGWYRAHASPEDLVSQSLQDARHVLDG
>STH2718 hypothetical protein
MERLVPSSWSSPAPEIFVHSSPGAARHSRSAGGRHGFTAAQANGSHPELY
KLHCSKLSRGGRLAGRPRPWRQTNASVDERHCRQSSHCQIARATTMR
>STH1766 hypothetical protein
MDRVREGDARNSPVFGKPAHRSPSRKPMIGYAISAEIATPDLNLLCVSGA
DRPSGRPDPSARGFRHRVPARNRPRSWTWLLRRASRPDVGRESAAAPAPP
LACRSCCPAYAHLLVP
>STH1966 hypothetical protein
MVKWIAGLVALAVAVGAAIWVGYSRGHGGLPAVFGSLERSVAIRADYQVY
TTLEELVEEASVIAVGMVEGVEGTRNLARDPDNPDLPHPSLEIIGYDYRF
VVEEYLKGDGPDTILVTEARKFLSHGREGIPAEPPAPEMESGGRYILFLN
RAADAEGRWIGVAEPWRFALRDAEVQVESMWEAAPRAFPSVGEGEFVDQI
RTLVSAL
>STH2472 conserved hypothetical protein
MRMGGAEVNGIDDLAAVMDVHLARFDRDGDHRAAFLRVYRHMTLAVRERL
RRPFFLDPAWVERVAVRFGWYYFDALERFERGDDPPPAWAYAFDIAARKR
AFLLQDILLGMNAHINNDLPLVVAEILRAEGDERSVFETVRRRFDHDQIN
RVLHDVIPGVQDEVARHYGRLIRPLGRLMGDLDRQLTTYGLKTWRDQVWR
NARCLLAAAGEEERQLVIRFIQQDALHLAREIYRFAPLRLLRPLARWMRR
WRLF
>STH1620 hypothetical protein
MTPGPPTGPPGRGMGMRSLLAVIAVAVGIFLLVTVVLPFVGILLAAVLAL
AGLGVVAYLAAPVLARLPWFRDRIHVEEHPLGRTIRFGAVRVHQRAHARR
ADDFIDVEGRTIDVEEDLLPPGDGRD
>STH2800 conserved hypothetical protein
MRRPSSCEEGRAMQQQGVQPQFSMQGGHLIMAHQEQTGLPKVKDPSMNDR
DRMQDLLAQEKYLTSGYNMALIEASHDALFDVIKQNCDASYQMQRQIFNL
MFKKGWYKLPVADAQMVQHTFQQFVQYQTQFPFPPGPQVAQMAAQTASGQ
AGGQGGGMQANRQ
>STH474 hypothetical protein
MRRGPAVLLLICLLFGAAYGAWRAAAFGYGQLSAFQPVAARAERPAIGSA
PLADRVVLLLIDGLTADRVYRLPSLDWLRRHGAAYRLEAAEAGGTSDWTA
TLLSGTPPAHGGFPAAPGGGTAGVDTLIDAAARSQVPAGGAGGPALAALA
GGALSPWREGATLAELGDAVYAMLEPGGPRLVIVQAADLAGGEVDDGALA
ELDLRLVALFDRIDWRDTAVVVAGSPDGRAARTGSPLILAGSGVLAGGGG
DATVYDVAPTVAALAGLPAPVSPRGKPILSALAVEGRPLDALTQVHLESR
RAWAAAALAAYGSHEELPPAPASAVEGAAYLERMEQRLRTARETWMKAGA
LDRLPYVGPALLLMLLYLLFVYRSPSGGAAFRAHLAYLAAFHLLFFALGG
QYGTGAAGLDGPWAWAALRYVLAAAGAGVLAASAAGYAVSRRPFRKSRYV
AAAGLHAALSCAALLALPVGALVLATGWEFPVALPPAGLWVWFFLTCLQV
MVLGALAPVWAVVTVQAARFARHRWPVPEVGDPEVNADRVVRLKALRRAR
RRS
>STH37 lysine 5,6-aminomutase alpha subunit
MSLTLNIDPARVARARELAGRIVDQVQPFIERHTTTAIERAVVRLFGVEG
KGPDDVPLANLVVESLQDGGGLDRGAAYWLANACTQTGRTPRQVAEAVAG
GELDLMQLPRLPEEEIRAEAERLARMALDRIRQNRRTREELIGRYGERPA
PWLYLIVATGNIYEDVVQAQNAARQGADVIAVIRSTAQSLLDYVPEGPTT
EGFGGTFATQENFRIMRRALDEVGAELGRYIRLTNYASGLCMPEIAALGA
LERLDMMLSDSMYGILFRDINPKRTLIDQHFARMIQAEAGIIINTGEDNY
LTTSDAYEAAHTVLASQFINEQFAHRSGMPDSQIGLGHAFEMDPSLEDGL
LYEMADALLSRTCFPNAPLKYMPPTRHMTGDIFMGHVMDAMFNLTSVATG
QQIHLVGILTEAIHTPYLHDRFLALENAKYVINYARHFTEEFQVRPGGRI
ERRASEVLERTLGLLEQIAERGLFQAIADGVFAGIRRPADGGKGLDGVLA
RGEGYLNPFYDLCLKGGGGR
>STH2878 hypothetical protein
MTSLFLYSFYVGFGLTLVSVILGFLNLGGDAGDLSHGGHLGPGGDLSLDA
GGDLSVDLATDVPMAHDGSVGKATGKAAALSPVNFQTLVAALMGFGGAGY
LTSHYGAPVALAGAAAGGFATGWLVYRFQRFLLRGERPLPPTRYTGIVGR
LTVPIREGGTGEVVYTLNQTRTVSAARSVDGHPIPKGEEVVILRYERGIA
YVQPWREFMKEENLKEV
>STH2260 hypothetical protein
MLHPDALTGIKEQIIRLAEADRALLDDLRREVRDLAGSVRVIRPRSSTAI
SLVASDGGNNKLVFDPFYVQVVRVVDSYGKELLVDVVSPTTDPDSLLTNH
FTPSGDPKSALGRLMADLGVRSLSALSHMIPDSRRVRERPETVSPSWVQV
YRDICEWAVLYERICYQTFASDTLIVRDGLLRSKVFRGEGFIRMRERMEE
AIERVWRQDRRRIYLVGLAKHSQVLNRYGLAMAIEETLPAGDARYVAIPR
ELEAKAYKWQEWARGAETEEGEAPKFVAGDMFFVRFGPHSGDPVWAVDIF
SAQRDRADEILGHLLADARDGFPVPYYPRCLQRAHEHAQLADFDWAILQD
QILDAVRGLLPREKQFLLDAYRLRADVTGERYG
>STH2510 conserved hypothetical protein
MRTKDVLTWTVIIGTGIGAFWDGFYAPGQQMVAVALSALLSLAAPSIQLG
RAEGAALLLMVSGTAASLARPAAAGAAAHGPVLVAGWILALIAGRAWGAH
RPEQAAGALARFWAVTGALMAFGGLLFISFTPPHHSGRLASFLGYPIAVG
MIGLLGLAGSLPDLAAGRIWAAPLAMGGALGFLLSGSRGVWLVGLLLAGF
LCWAAPALVRACLPRTGRALAAALAAAVWAAPAVQQHDAPRLWPIVALAL
LTLVFTEQFQHVRAVRAAFAAATGLALLTAPGWGWFLGRAATLPLTDTSS
AERLAFLRDGLALAADLPLGAGYRAWTALHLQAASYGYYSAEVHSAPLDI
ALAFGWAGAAGFALLAAGFLLRLRRARGWGWSRTVALGGLGALLLHALVD
WELSYALFAFPLWFGFGLVGGPAAEAGPRSAGRALPWPRALTALLAGVAL
AGAVLLGAGDGLTELAARSLGRGAPEAALRHAGAAAALTPWNDLTQAYRG
AALAQLGRGEEALAALARARQLGPYEPWYAQMAATELGRQGRWLEAAAAW
TDYVRLWPWETEAYAEAVAAHLRYLDRAQASGDRGAAALLAESGQRLLAA
LAEQKAREPADAPRRGMDVERPVFAEARERFAAVLTR
>STH2248 hypothetical protein
MVPSPAVVAGDWITKGGSPQRTSTVLDPHGLVELSPYWETEALGESAAQP
AVVDGVIFHLAGPTSGGWSWMTSND
>STH1042 conserved hypothetical protein
MKVNHSTLQGVPVLSHLCYATGLQLLSWSLGLAATLVLLDSRSGFLRDHP
PLAWRVAGVLLGPLAVLSLAAVAVGGGPRWVTLGLVLPYALMALGASWVW
YRGARADLRRQREREAAVLGLLAVLFASLAQTGGDGLPVLSLAASLGSAA
LVGGLGILALGCTVRRRRDEVEVDPNCDGIPVRVVATGLGIMALAASDVG
RALLTGSAPLPASLGLWLGCSLLVPVLVLLAGRRLSDWNRPLLWRTACLA
ALVGQLALQTTLVA
>STH2032 hypothetical protein
MRRRLDGRRLPGALAALLVTLILLAPARPALAHAVDLFGYVSVDEQGTVT
ARLVDVYGGLVEGQRVELYARPQGGRAGPPVVMEELEPGTYRGRAVPNRA
GPYEVVIDLPLAGDLHRITLPVEPGVPVAERYVIMEQIEPLLTWTRVLFA
AAAVVLAVGSVIAWRRRPGGPEPEAAGGGVKE
>STH2264 hypothetical protein
MAVHLGRLSPQGGVHLCMCILLPLGQNVQTYLRKYGTCGPDLPLHCPGCG
GRMRRHGRYWRWVFTAHQKAYIPIYRWWCPGCRKTCAILPDFLKPYARFI
TLVREAVVRGRVRRGLPWSTLARRLSSPTVSWLSEKTLRRWLVRARALAG
EWSQYLAERVLRFWPDTDLDALTPRREGPDATLHFLLDVGDWYRRQMGRR
PEEHGGVFAALNRLGEGTASL
>STH256 conserved hypothetical protein
MPASRREAGEGAGVGCSGSKGVPPPHPHPLAIGRRCHGGWPYMPASRREA
GEGAGVGCSGSKGVPPPHPHPLAIDRRCHGGWPYMPASRREAGEGAGVGC
SGSKGVPPPHPHPLAIGRRCHGGWPYMPASRREAGEGAGVGCSGSKGVPP
PHPHPLAIGRRCHGGWPYMPASRREAGEGAGVGCSGSKGVPPPHPHPLAI
DRRCHGGWPYMPASRREAGEGAGVGCSGSKGVPPPAGNRQAVP
>STH892 conserved hypothetical protein
MTGEVLARAEAELGRPVVLTWRYPILPDEMAMVVASTRGQQRLHDVTLFI
FDPAGRLALIRKHHYPPGIWRAPGGGVKPGEEFAAGAAREGWEETGLAVR
VTRYLLRVHVTFTCGGQEQPWTTHVVLAEGEGDPATRDPREIGGVKWGSM
EELCGPIADAMLATGRGLFAYRVALHRQVAQLLP
>STH56 hypothetical protein
MLASLRATVLPQFTVEVYFDCEPDCRTHPRYVLVLRDGTAGAVELEFASA
RDLRAFLARSRQALTECRRTARIHRSPSKRVASGTVQATLF
>STH2064 conserved hypothetical protein
MFSIARFLAACLVMCLVFGVLVFGMLLMNPVSFINSYPPEIQAEYYRSQS
KEAARRKLTVIMGIKKAAALTAFLFLFAWMAKWAGADTFGEGLMAVYGYG
LVLAVFDTCFLDWVLFPNIRRIRLPGTEHMDREYRQKWFHVKAMLPMLPL
FAVGGVAAALIMVWIW
>STH2272 hypothetical protein, serine-rich
MSSSAVGGASLVKTAFSTCSPNSPHSSAAAFANLGSRSIPFAHSIRPFTR
HGLPNCRCYGSTWTTNRVMHSARCGWASANRRSNAKNLTENGTPSNDLPS
AAMACSTVRVTHPIPVARSVRSSGSKRRLCLDSSAVPYSYSAICQASGST
RTFTVEGSTGPGKSMSTTIGMAKRAAVHPGFTFQSP
>STH2186 hypothetical protein
MKQPVAFPGRKVFSSEVSLPPFAVPPSSTAHTRTVVEPLASCAGVKLRWP
TSPSWVRVWGPASSRTVMSPPGVKSGGALTAALMVTSPYAIAGASLPQGP
GPNRQTRK
>STH1563 hypothetical protein
MPGCRHHLLRRLGSAAIPSQARTCPARNSRRTRHRGVAPGSYAEPSKRPK
RHPCAASPNRRQQKGGRVVRKRTRGKWLLLSLTAVTAVVFLAHCSPSREA
EVAARWTTDDVQRITIDVPRPCRPLSVDEFVCDEVILRRP
>STH3156 hypothetical protein
MPMPRPLHDLRQPQIGPTAGDLGQSAFAANLRDVDGPDAGPQIEATVRPW
EAPEPRNPPPGQRRPSGPALRLGADVKD
>STH1843 conserved hypothetical protein
MAKEPPPIPGWRRAVRVGLLSFLLAVVVNWSSNLALHRVHAIIAFVIVLL
LIAVNIVFDILGTSVTAAELPPLNSMAAKKVPGAKQAIWLVRNADAVANF
CNDVVGDVAGAVTGAAGATVAMKLARMMQGASWVEEALGLLIIGVISGLT
VGGKAAGKSFAIENATTAVMAAGRIIYWAENITGRSFTGGRSGSNGRRRN
S
>STH1534 hypothetical protein
MDSERETRARIEELRQRLHRQVSGPLTPHQLQGLLPISQEIDRLAVDFIR
RRWQQTAVKQAQRK
>STH1896 conserved domain protein
MQRGTAFGLPEGGMAVRRLLYTIAALLLAVGSLGLAVLPRIAPGWLGFGV
LGACFSGIGAAGWVLGGVAAGNWPRLRAGWFLLPAGLLGVVLSAMDHAGL
PLFWAAGEAVTGLLLLVPLFPRHDGGESSGRPPGPVQAPPVAGRQEGR
>STH2524 hypothetical protein
MGMMAGQPIPFLRPALHALVRGPWRTLWIVSPSLHLAALQGLLPELERAG
AEIRVLTDLSYQAIGDGRVELAALQRLRRLEGCEVRWQPDLGACIYASDE
GGALVGSGPLTLDGLDGPRQYGVHLAEAGPVLSDLEQWWESARRLSPQEW
AALEERVGLRRNALRIGAEITRLGAFVRVSAHGTRRTRRLDPRDFGVLAL
GNVRLRPMDVTLCRLDLVSRAKDELDAILAERGLEWNGQYLVPRHFLEQE
WPGIFAAREKQLAEDLRSPEQQAAVAAQLAQARSAIAGFLGDLYPFVDAD
GRPADQWVAEQLERLVPEGLAAAVCAEAGVEYRVLTILPEDARSIAEMDE
LLRDPRLRSVQLTLPI
>STH1964 hypothetical protein
MTDALKALEAVHDRYLTIMERAYRTGETDVLA
>STH2078 conserved hypothetical protein
MISPERTLTPEQRDELIRTLKTRFERNMHRHPGLDWTDVQARLEARPEKL
WSLHEMERTGGEPDVVGFDSQTGEYIVYDCAPESPKGRRSLCYDHAALES
RKQHKPEGSAVGMAEAMGIELLTEQEYRELQKLGAFDTKTSSWVRTPPEI
RRLGGAVFCDRRYDTVWVYHNGAESYYASRGFRGSLRV
>STH2814 hypothetical protein
MTKKGMSWGTSGAHHMAQIRSLEAEGQLKHWLGGWQKRRWPEAKAVDRRH
GPCRVIERLATTDPAGWLAARLPLLASAAGNSELGRRLKRLAQAVPTSEL
ISRGQPAPWRSRMIPTRQASA
>STH2027 conserved hypothetical protein
MDELNRRLAAAAERQARKEKLERQAARVEEEYRSAAARVRELKRQLEKEQ
RDVEQLESGSLGSLLATLFTDRTERLDRERREAAEALVRYEEARRWAEQL
RADLEAIRAEVAELSGAEEEYRSLLAEKERRIRQAQGPAADRLLALEAEE
AQARGRAREIAEARAAGTAARSGLGRVLEQLQSAEGWGAWDMFGGGWFST
MVKHSRIDAAEAELQMVRHDLDVFRRELADVGAVVNLPSVDLDGFTRFAD
YFFDNFFVDWMVQSKIGEARRRVEEARRQVDRLLEWLDGEERRVHEALEE
IRRRREQLLTGHTEDW
>STH2638 succinate dehydrogenase membrane subunit
MNQVKGSLWAWILHRITAVLLVVLVAIHFGIMHFVDPTAVITFADSQMRL
QSALYLVVDSGLLVLGLYHGLNGIRNVVLDYWPRSGRVVGWILAVIGVVA
AGYGSMALAAFLSQ
>STH2910 conserved hypothetical protein
MSPIAYLIVGIEIGFWVLVISGLAVRYVLNRPRLSALLLAMTLVLDGILL
VVTTRDVLVNGATPTLAHGIAAFYLSVSVTFGKRIVRWADERFQYYVLKR
GPRPRQLYGYEHARYNFSGSMLYLLAYLLGGGYLGLLIYLIGDLGRSVAL
LRVLGFWTVSLVIDMAISVSYFIWPRRPESESAD
>STH1459 conserved domain protein
MPPDWADRQNPDGGWRSREFSSPVSNMGTTSVTLMRMIWCGLGDSPECRA
TVEYLRRTQQPDGRWTEDPAQYGANPPEWNRPGDLAVDLWETANNAACLC
ALGLAQDPVTEKAVAWLRANRREDGTFPGYIHTTYAMAAVCHMRGEGVEA
ERYLADSLRILHKFKDESWFDVMDLTWALILWGLAGLDVRTEAVRSYRDE
LRRRRNADGTWSSRYPGCGPQYTLEAVEILRALG
>STH2548 hypothetical protein
MNFGGFGTDIATFSPFIGAQAGANDLAIWPGAAIYPSANATIYRVYEYAV
PVTTVATVPSAYPTTAGIPGIAGYPGWYGI
>STH1138 hypothetical protein
MAEGRAKPMLTICNAGGFARAFVRSFKGSC
>STH5 conserved hypothetical protein
MYLHVGADVVVALRQVIAIVNLRTVRKGGPTQELLDKLRRENRLIQVEGG
EARSLVLTDAGGILSPISATTLKRRSESAVLLG
>STH857 ABC transporter permease protein
MKGVWRKSVRRGEGPTPGESAWHVANRASGGAAARQAGPTPSPPHRRHGD
RPPGLMDLIWAGLRRSPGQTAAGLLAGLLAALALVGGLSLFQGMGRMLEQ
GLDRLGADMVLARPEHRPLVEQWLATGATEPIPATIDVARWRQGVDEAQI
LGLAGVEAVDLSAGGPGVPAGELASILVLRLQFWESAMMARAALADVLPE
ADAVVGEQATRHVLTDLQPLVRHLGRAAAVAALGASLVAGLLASVRVGQR
RAELGMLRAMGATRGYLVALTVGESCALALAGALAGGLLVVGALWLIPAT
ADVLRFLGVLPALGLVGLAAAAITVLSALASLPTALQAAWLDPLEAARRH
R
>STH1324 hypothetical protein
MSGHDGRDRTMENVRSLQDLGRPVEEARKARTARSATSTATEATARPLTY
NGRGG
>STH2414 hypothetical protein
MNPGSTSERSRKMEARRRAVDARFVEGWMSYIGAVQGVLNAAGMGPWEYW
RLMGDTGMAFHLVMHETCCVSSVTAYDWMNTHLAALSRIGVHSEVYGAHP
AMPTFDAARRRALVHIRESIDRGIGVVLWGVDSGEFGVVWGYDEEDGVLL
TSGCHGPRGNPIPYENLGMSFPGAPELHYQIVLERVPADERQTARSSLRY
YVDLMERQGHMGPGYRAGLSAYDNWMRGLQREGFDQFGCRYATFVYAEAR
ACAARYVEHLAEAERALAPAAEAFRRTAAIYGRMMEVLGQDLADPSALDT
PVSAEQAAALIPLLREAKESESETVARVAQYLRTSE
>STH277 conserved hypothetical protein
MAPSKSVVAGKTPTQGGSSLVMVEIRADNGFSFEELESGVWQQVVECFRE
VLVEALSRVDTLLYENRDAQRYVFKEMRSRTLMTKFGPITFKRRYYWDQE
EKRFVFLLDEVLQIAKRQRVSESVRADAVEAAVTAGSFRGAAAELGRRDC
QVYVSHEAIRQWSIETGMALAAAQKQRQVTEGGTRKVRFLYIEADGFWPG
RQRGKKAEVRLFVIHEGWVERTPAGTEYRLVNRRDFIPDSGRDSWEQLSE
FLEREYDLSDTWVIINGDRARWIREGVTWFPKALYQIDRFHLKQELNHVL
RHQPRLLERANAALETNDAGGLLAVLEEARTAETDSKRRGEIRRLVADLR
TMPESIRDYRVRLQERGVSVEGLRGLGAAEGAVERYSARLRKVGRSWSER
GLKAMMHVLAAYFHGTLRGAVEMVERQLGLESLAAVREKVRHRVVETVGR
GIEGVRHGRMPILYAGRNASGGYSRMLRAAAGFTK
>STH290 hypothetical protein
MYEYKVLMLSAAEQLEGALNRYAREGWRCRFQYVVEGFGGFTKLVVTLER
KVGSVPDDPAEA
>STH2285 conserved domain protein
MRRGHVRRSLAVLVLAALLSAPVGAGAEEELTPEVAEYPAENYEPGPGWP
EGWTPPPPPEGMSERMRAEVEQILREHPDLYDRYDWWVWDPLPSWTNEYP
WFWREDKMGCGTTLHVYAYPVPDTQLITREFGRSFGWSRLYWKPCRYEPI
QSITETRELSEEEKLAEGRRAEYLYWVDKSHPGLGRDDGSESNGNPDYIY
IYLNQGDAAKRFDVPAYLDTTVNRVRVPIRFVSEMMGAEVSWDPDGRRVT
INFPAITREVPKVVPAPGFDYPDLFDPEEYLPNGHRFAFEERTVSTPERT
IILTIDHPVALVDGREVPLDAPPVIRNDRTMVPVRFIAEQMGAKVYWVGA
EPIFRLDDGTMSGTYQVHIFTPFFPLYEYPSWYLENRAVRY
>STH204 stage V sporulation protein AE
MMYLKAFLVGGLICLIAQVILDNTKLSPGHVLSGLTVAGGVLGGLGLYDK
LVEFAGAGASVPIASFGNALVKGALQELDQNGLVGVLTGMFEVTSTGIAA
AIVFAFLTAVTFNPKS
>STH1562 conserved domain protein
MFSPGISVPDALIRTRHLSEGAKMFWCYLRTVRKEPVSFKQLRARTGLAQ
NSILKYLRELSRTGWLEYRRGRYSLTCHAIWKDGRPAFRLPVDLICDRGL
PAAAKLVWGAITQLKDGFSYAELVRCTRYSRNTVAKYVRLLLQKGWLRGK
AHREARRKRFLVQPANPHHERRQADLNLFQRAKKLAEKRIGDSIGQLYAT
YMVSLIVREDIIICNAQPWGLVNPRTGARLQYDILLPRSRVAIEFQGPQH
DGPTALYPDEAQFRAQQERDHLKRQRSAELGIRLIEVRAQDLTFPRLAQL
LQEAGVPVTSQPCAAPHLYELLEKETAAYRKAVSEVTRAG
>STH2218 hypothetical protein
MLLRSRSRFPNFQSHLTGHPPAHLSPPSPALVTARRLAFSTLVLATVGSG
RITQQKDGLGQPNWNWAAPSWDPRWSSTTGGGILWWWLIAQTTPERRTGR
KDGPSAPVTTGKSGSYRGWISLTGQFTDRHIKQRRARRLKKGSAASSRHL
RC
>STH401 hypothetical protein
MEIALVLPALVLLVIGGWVAGYMTFAKMALAMAANRAAREFAAAVSLHDR
LKEPERAYRYDGGFAESFGLPRWAVHALIMRTQAAPGPSRSDQAVVVAMC
YRLPLALPGGLPAGVRTAEAAAGSWREIRSEAEEWEALLRRAEALAARGE
QLTGQARWAADLWRQVIEGPPADFTPLDGADLQGPEGLEQAARKLCEPPA
GDGRSLIVTARAAYLLQTVFEPEGGGRR
>STH614 conserved hypothetical protein
MRHVNTLCLKLAAYILIFALALPAAGHRAFPQSLMMAAVHTLVLWLGDLI
VLPRYGRTTALAGDAAALVLGSLLMLRAMGAALRFGGLLAAVGMAVLFEA
WFHYHLTENRLLEG
>STH2568 hypothetical protein
MGVNELILWWVLFLIANAILAAATGMLVKAGPFARVRVPALLSTFLFICG
MTLFLVAGKQNLLQETIAAIFGN
>STH1153 hypothetical protein
MSETKWSDAQVEQYLRSLTYADALPDAPALSPELAGRALAAPQRRRAGAR
WKVGVAAAAVAGLLLVGSNSSMVADAVERILGVGIRNMTQQEYAQRVSGD
WPADMPMPEYYSPEETAQLATFPLRSPAWVPEGFTLLNGPAGAFEWLHTE
DGQWELLEDPEHFYVTEAYQSADGRRIHIRQSLFGEAQWISWPPGTEQLE
VAGHPAFLREDVEPVHVEPPEGVEPTDDWTPDSLNLLYLWVEEPDGRITE
ITLDGDVDPEVLIQVAESLFAGDGSAN
>STH1683 hypothetical protein
MKDRGRDRLRLALAAAAASAVLLTGPALAAGTAGSDPAKDAAERALDVYR
GFVPWEDFGFASQEELRDAVVAEGVPVFRLDPSALDGRLHRLGDAIRDVR
LVEFVIAGGDRAVTRLTLRRTEDGYERVGFGGDGEHLRRGLSLLPAPEGA
RLVLLGPAEFLYWSGEEAEYLVNINGFALAGLEPLRLYTPEEALPGLRQY
ARGLLGLPDEPGLPPAAAWTLAATGALFVGWVVHGFWQIARR
>STH2661 hypothetical protein
MRRNEILERLRELYHQAHETLERAEHLPERLPLDVQPQSLNYDRDLSRIQ
GEYNRLVGLAQAEGLLSASDLEAQGLPERMGD
>STH753 conserved hypothetical protein
MAFPGRDLRLRRLSALTLVLLLAAAALAGCVRLELNLRVNPDGTGEKEII
AAIDREFLAVAAAMRARDPLARLEAELREDPKATVTRYQLGDMVGFRAVA
PFRNELILNDETWKGHFLIRDRIFWRDYTLDLETLLDMAELDAVAASFDS
PVDFRVSVELPARIGETNGELDPSGRKVTWTLIPGRRDHLVLTARQYLYG
RIAAAGAAALAAAVGLAFGIRRGQRRRRPVGGIGQAKT
>STH2042 conserved domain protein
MHPTSGIQQVPGGWTGHGKVYPYEFRSQKIPGGAGTCRSPLEERLFRALA
RNPAVLSFAVRPMRIAFHVGPKAHHFTPDVLVQYADGRKVLVVVKPGQTV
PDPAEQAMFKAAADYAAAHGMAFEVWTGPRWVSATDPRGGNGDFDIATAA
RLQTEAAAADQPHRREPAKGTLTFWTVIVVVVVLVMLMRLAH
>STH1350 hypothetical protein, glycine-rich
MRNHTRMLPIAATLLLALLAAGCAASPGDLASFEDAGSREIAHTGGREPA
GGTLVDGGGNAAAWVDPDALPASGGRLAQAVDPDAPVSSGQLMPAPDAPI
PSRPVGSGNGGASGGGSSGMEPGVIYPTAPAEGAGSSGFAGEVGDLRLLY
CGEDKYVHGRGRDLDARKCLWDAYLSNRAAAFRTVAYTIEGDPIAYDLDA
LGPDRIQVLVDSEDRFGATGEFLYTCTAMEFTDDLGFVLKGCAPNEPEGS
RAFLSEEGELYVP
>STH1488 conserved hypothetical protein
MAVRPIDSLTMLPRLQEAGRAVQQTEQYPHAFQQVLGAQVQQEGERSQKQ
VRRRTEAEQPTVTPDGRRPGGQGQPSGRGEHHRRRQARPEPGDAPGGTGR
LDVKV
>STH1006 hypothetical protein
MDGVVIRRQKVVAFDNDVVYTVDRNGLARATRIDQIDSVDF
>STH2200 conserved hypothetical protein
MPGGGGGGFLQYMDSMAMAFGFLLFLWILHHACGRCVKLHHGAFLSGSMV
SLIVVFLVGRKAELALDPFVGAMALLAGLLYATGFLALIDGRRRLTPYFV
AYGMVGILGYALAAHYGNVTGAVWQMPLLWMTAIPSGAVLLGGPVWLMSR
KGDGSHALWVPLGTILLYAAFYYMVGTTNGTIALGDNASSLINTLLAAGY
LCHLLGFVLVRRWLAADRPDAAEPA
>STH2581 hypothetical protein
MGQVLFYRAPLDRPAAALHVTVSPVRTAVLPSLTEAPVLPGRVRVVVRYG
PRTGPVGVHRGWLHTDLPPARRLRLAGCQQDDGGLLLSLATVPTRAGRES
LTDRFRLPLHMTTWGIEAISGVRVELFCIPSVLPGRLVLRGRAAVAVETP
AERLVRAVPFCRGRAVALNPHLSWRARAAVEGLDLRVAPGGLVEGWLHVA
VACWGEPEEAMGGAAAAPADPVAVRRVDAAIARVEAEAVRDGLALVSGAV
ELDVAWADRSGRGRWTCRAVPFSALVPLAGLCEGDQLEPVAQVEQLSRVG
AGASARATLLLGIGLTALRPVHREIGGAWYRMAQVVGQAVATVQLDEPLF
PREERSAPTEPWRRVRLDLGQHGPWKALRARIRRTCGRSSLEVQGEPFGA
ESGAATGVRLELPGGADAQVSLAAVGPQAELRVRRPARGGVEVPLPRGTG
RGGWHLLAGPARWVVDASPCADGLRCLVRDAGGLRHVVLVAEREAGGEAA
EDGHGQGAGGNLPAAWTVAGTAVRGLGMDRVWAEVEG
>STH484 hypothetical protein
MRQQAGNICRRCGRSSEELFQMLDNGGRWAVVAGACTCGHRWTEIHTAVP
NLSAVRRWLRQGCQPPDGPVGPMQGGPWQTWRVEGLGPEPVHSPGR
>STH2735 hypothetical protein
MRRVLFPVLVVLALALSLAGCGRAQGPQWGNLNLTLEDVQQNENHWTARM
VLSNPTDKVQVIEYTADARYTMIVTRNGQTVLEQPFDILRDGDPRILNLS
PGVSKEYVVAWTYRNKEGERVEPGTYEVSVRLDAVTVVQEAGKTQPTVIE
PRTVGPVKVQVQ
>STH854 conserved hypothetical protein
MSKLTRDWRPLLFIVLGAGAVLVSHFFLHTCTDHGHFMETAAGTLVPMRC
HWSERAFQGVGALVAFAGLAMYVFPDAARGLSFAVAGAGLLMILIPMWLV
PTCDMPGMVCNLSFKPGAYLTGGITALSGLGGTLRLRRLDVMGGERSAV
>STH861 hypothetical protein
MERFAIPGWDSFLRSSECSLRSSLCSLASLYDRLPPAERVLVDRRAPVLG
DRHTPGFHLVSSCQVRLQRIRVRTGSDLGLRMPRGTCVCSRLRLQATRYE
LPPPAARVLVDRRAPVLGDRNPPVFHLVNMRQVKIQRIRVRTGGDLSLRS
PRGTCGRSRLRASASRYELHSPAARVLVDRHPPVPGDRHTPGFHLVSVRQ
VKIQRIRVRTGGDLSLRSPRGTCGRPRLRASASRYELHSPAARVLVDRHP
PVPGDRHTPGFHLVSVRQVKIQRIRVRTGGDLSLRSPRRTCGRPRLRASA
SRYELHSPAARVLVDRHPPVPGDRHTPGFHLVSVRQVKLQRIRMRTGDDL
DRFRRCGPVVAPVCAPARLQVRVSEGPRRAAVAAE
>STH1294 conserved domain protein
MGVRARLVSGMLALLLLIGGCGQVDAAGLVEEYLTALTIGDFDRAAQLST
EALVELNGDDSAEARFVQTYLAQVRFTVREAQKEDDDRYRVPVELIAPDL
RPIVGRVLANLLGEAFASAFMLTPLDEKTMTQRLVDSLTAEITRSDASTI
TVERTVMVERIDGQWKVTTPVFAESGLLDIFDDGFLANSYAEESRARRAE
SLRRRIAAIEEQLTELRRDREFAEQSRSKLSKFVVSNAKFYWAVDGWWEE
PVIELTVQNNTEFPVARAYFHGRLVSPGRSVPWVEEDFNYAIPGGIEPGE
IQTWKLVANSFSTWGSAPRDRDDLILEVEVTRLDGPNDQVLLDAEFPDYK
AEWLADLEADLAELQEELRALRTEE
>STH1382 hypothetical protein
MDRRTFPNRPAHHKVLKVRGDAGMDEAALIRAAQQRECLQRDYGTGYGTA
GCRWTVDSMVVILVGPDLPSVVELAGSAVNHP
>STH2975 hypothetical protein
MPRFHDSPCTASRRPCLRRGIRNPRQLSHEPAHLGVERLLLLSKGLKLLT
KSDFVPWQEQRNRRLHRRIPATNEFVHLVHRRRGRGLLQGLQELPTRFVR
PLVRLVEALLPIR
>STH323 hypothetical protein
MEAVANVKAAGAEVVDAAKDQFAAMKEFGRDAAGVVRDFFKEKDKPPLPE
VGDDAKYASDQLGDLSGTANTAAGGMDALRESAEKAGSAVREAFGGFKAY
ALGEVAYRLGALAPDWQSLAAQQYAAANRFSAGLPAAAYATAPANSTLTI
DGSVRVEGVNSRGDLVAVTRLLADDLRAEAERYPAAPSQRRWR
>STH175 conserved hypothetical protein
MAAVQEDRLMVLQMLADGRLTVEEADTLLRSLEETAAHADAAGAGPARRA
DSAADRHEPGARLRVLLQQMLDEANAALDRALRSVEERLAELERREPYRQ
IRQLADSLDRVAAREAVREARAAAKEAVREARAAAKEAVEEALEEVEEKV
ADAIETAKEAVADAIEAAEEAVADALEAAEEAVADAAHAAEEAEENVRFR
ASGRFGS
>STH1867 conserved hypothetical protein
MEQIRAAVDFSFQEIERLAWQQTLDCFREALVEALSAIDTALYESRDPSR
YVYKEMRSRSVVTKFGPITFQRRYYWDREEERFVFLLDEVLGIAKRQRVS
DSVRADAVEASVTAGSYRGAAAELERRDCQVFVSHEAIRQWNLQTGRALA
AAEKQQQMTLAGTRRVRVLFIEADGFWPARQRGKKAEVRLFVIHEGWIQR
GPGSKEYSLVNRRDFVPEPGRDSWEQLSELLESEYDLSETWVIINGDRAL
WIREGVTWFPKALYQIDRFHLKRELNHVLRHRPQRLEQAHAALEANDAGR
LLAVLDEAHKAETDTDRRGKMRRLLADLRTMPESIRDYRIRLQERGVNVE
GLRGVGAAEGAVERYSARLRKVGRSWSESGLMAMLQVMAAYYRGALRGAV
QYVERALGLESVNAAAEKVRHRVRETVGRGVDAARHARMPILNAGRNNSG
GYSKTFRGFAGLVTR
>STH3305 hypothetical protein
MVNLILAYSLWGPKGELDPGEPTARSQLVQLRTRLDERGLILPNAVTLPT
TPHPMRFLRVEFRPNPTKAPPVDEPSEGAPAASRLRSRYDAAARETVLEP
VDGAGGTPVDLSDRLLLRQVAEDFLRANGLFPHDVQLSGIFLRENGAMVE
FVPTYDGLPVFSGYLRVYLSPQGVERVAHHWVEPVGFKPGAPKAVRAASE
ALLRLAGHLEPGDGQVRTIVDVRLGYYSGPSVTVPGAEEISAWETVPVWR
ITLDNGQVYYVNAFNGELES
>STH2387 conserved hypothetical protein
MRCPNCGSRSIGRVGAEQYYCWDCCVEFNTARGAVRIYHLDSDGELIAAA
EHRVEASAPLGPTNERRR
>STH2188 hypothetical protein
MGNTTGAGVSGMDRTEELLRASFSRLEGLIGEVRDLRYKIKSVQGDVESL
RAALGGGGNGNAEPSGSAEGEPSAAARSDGTGSEPGESGSAAAGAKAGPG
DSAFEERLRRIEQRLDYLASKWLELDEKLFRLQKRLGSTTG
>STH2434 hypothetical protein
MLPTVQEVLQADWQETAQSVQPANCTVVCSRGADRVRICLAKENPPIAWI
TPS
>STH684 hypothetical protein
MEVGGQHGGHGHGHQPWVKHAPLHARPRTLREKVRQFFWGLLFFEWYHEL
RHERAKYSDVLNLVLFGELLGIPLMNSSIGLRLLPFVLPELDGWKHRQLE
EREVIEEVPHVH
>STH2900 hypothetical protein
MLRRDRRSHRPVFLVLRSRPGDSGARPLAAPFASPDITVGPDGRPRAVVL
NLGSREVVASTEFYSVPAGLPVTAENARLVGTGNPAIIRPGEAVTVSCNE
PWLNRQADVLVVMAFHPELDPVARPFDVLADRHVGQMNYAWVGTYAGSLP
DGDVRVEIRPAPQGLFRLKLYQEGALYPRCDRIMKPHGHRFHWMEVEGDL
RMLFDLAVVDNNRLTLGVGPRGGSPRSGLLTRVTA
>STH2529 conserved hypothetical protein
MDLYLFGAGASAAEGAPATRDLFPMAYRALGPRFDPQVEEVWEFLGAVFQ
RPVTGPDTFRYLPALDEVISLVDWSLHVDQGLGPDYDPPRLYRIRQALEH
LVCSVLERARGRQRPDDGPHARFARALASRAPDSYALVSLNYDTLLDDAL
RAAGLTPDYALDPGKSGGPLLLKLHGSLNWVHCPACDQIAVLREPVAHQL
TRAAGLACGRCGSRRLAGVVISPTWLKRYSPGRLTRVFEQALEAVRRARR
IVIIGYSLPPADVAVHHLLRRGLLTRAHPEPPAVEVITRSPSEQLLDRFR
RLFGPGVAFDCTGFRGQT
>STH528 conserved hypothetical protein
MLRRLVRYLLGTLRIEVTGGDIERFLNGCLEEGILLWDLRRTPERLHATM
MLADFFQLRPVARAGRCRVRIRARHGLPFLTRRLRRRPFLLAGALGGLAA
LVWAGSHLWVVDVRINGPGYLDPRAILAVAAEAGLRQGAWKARIDLDHVT
QHLKSRVEELSWAVVRVDGTRAVIEVVEKGAVTPPDQAQCVHLVARKAGV
VEQVIPLQGEPLVKKGDVVQPGEMLVECALRYWAGGRPVVIPGTPPPPRT
DLARTVVAQAAVKARVAYRQYYEYPAVEQVKEPTGRRHVQWVLNWNGRSI
ILRGRGPVPFAEYEVTTRTLAPGQWRNWSPPVEIVIREALEVSTRAEPVP
VEVLTERARAAMERRLAWILGPNDRVLTPLRAEVVQQDGGYVGILVTVET
LEEVSAPLEGPMLTMGR
>STH615 hypothetical protein
MRLEEFTYLPLDVKEKALRYLANMMGEPERPLDQLTADLERALDRAVRGG
LFSYLREEQPVEDRARADAVLFPTGHTTPTGEEILAKCILNKNPNRQPWF
GLFFQAARREGFVIGDLYFRTWNEGARFLEELASMAIPERWNYSQYQSKQ
QHPILKSYVEKTYERLKQQGRVLRNESKLLFNTGLLNVYFKEIYVLGEAD
PEYPQRVINARPVLENDRAVLELFLNQKPPMATYFDRITDVIFDPDLEIN
TDDIHIIDDNFDRIPPKYRNRKKSEIFALFQAAIEFARIMARRNYKLVVP
QYYMGQIQFLMPIYLSGEFSGPPDFALVLQKMGDVYRGNTILTLDMAYQN
ARLIAKPDTTWLSPDKF
>STH1264 transposase-like protein
MIELLAESLPLRRVASRLNISVSTAFRWRHRALTVLSARDRKPLSGDVRV
ETFLVKYSEKGSRVCHGPGSWGYWNVVRKGEEPVGRVRSGQSAGRRRFRL
LIDGRPLHVMVAETDTGYEFDILGQGRRNAEMVAAGLARLIRPGSRVFAF
GGSEYRRACETLGCEHHDGVAAVSRWFQCLRGAEEGGASRGAEEVTTPEV
RFPNLPIWWLRRFRGVATKYLGHYLAWFRDIVRIVEFPAGANMASADGRA
LLRRPADWGVPPT
>STH1430 small acid-soluble spore protein
MPGNNNSILVSQARAALEQMKNETAAELGIQNYAQQYKGDLPSRVNGSVG
GYMVKKMIALAEQQLAGGTGTF
>STH3025 conserved hypothetical protein
MQRLEGPGVAFVPLLLRIPLAAVFLFAGWPKLTNLAGAAALFGNFGLPGW
LGQFAAVLEVVGGLLLLLGLGTRVMGLLFLMEMVVATALVNWGQAWAGGT
FDYTAVRMQITAMFACLALVLTGGGAASLDALWLRRRSQGAADGAVIRR
>STH2346 conserved hypothetical protein
MTEIRLLTPTGHLGFTMLEPESFLRGMAQGPDFVVADSGSSDIGPYPLGA
DEPCSPVEWQTHDLEIMLVACRRQGVPMIIGSAADTGTDRGVDTYVQILR
TVAERHGLPPFRLAYIYAQVSPDEVRRRLRTGVRIAGLDGRPDLTEADLD
RTDRIVAVMGAEPIIAALEAGAEVVICGRSSDPAIFAAPLLWKGLDPATA
YYAGKVLECASFCAEPFMGKESVLGVVRPGEVIVEPMHPGQRCTPLSVAS
HAMYERATPFFEHLPGGVLDMRNCRYEAVDERRCRVTGFAFQPEPTVRVK
LEGAGKRGERCLAIVGFRDPDTVARIDAVLDWARNKVEARFGPGGYELHF
HVYGRNGVMADLEVVDRPAHELCVVVEGVAPDRRTAAELTAMAARQMFYA
RLPGTKGTAGTAALMSDEVLPARPAYEWTVNHVMPVQDPRELFRLHLTTV
P
>STH1288 hypothetical protein
MQHIRNPRCFLVYALAPEGGSPAEANRLLNAYVADERRGLAVFHDHFIGR
PGGVAIFFAETAEQRAAIADLGPLEGWHVEIRPLIYSRSPAGFDEQTAYT
LRAYRNADWEKLRVEQRPRYGDPAREAETAEEDVGE
>STH3271 conserved hypothetical protein
MRSRLLRLIYYGLPALVVAVSVLALNSGILLKRPSGEHDDVAGHLRLLLH
HADTQRWEEARDAARKAGEAWLRMRGRIHLTSARDEVETFDLELAGLRGA
LETGDPVQARIAVHRLLALWEDLGS
>STH34 Zn-dependent alcohol dehydrogenases and related dehydrogenases
MTMGCPYGTHRVLEPAGSLPQPAWRLNDDPKIYDNEILIDVQRLNIDSAS
FRQISEEAGGDPDRIAAIIRRIVTERGKMHNPVTGSGGMLIGTVREVGPA
LNADVRPGDRIATLVSLSLTPLRIEAIHRVLPESDQVEVTGTAVLFETGI
YAKLPDDIPDNLALAALDVCGAPAQTARLVKPGDRVLILGGGGKSGLLCS
YAARRAGAAEVICFDYSDESLARARRLGAAHRYIQGDARDALGVMRAVGE
KVDLTLSLVNVPGAELGAILATKPTGKIYFFSMATSFTAAALGAEGVGSQ
VEMLIGNGYYPGHADLTLNLLRESPELRALFEEIYA
>STH1959 conserved hypothetical protein
MSLWSLLRTEGRLVWKGRWVALLTLALAAASVWGGPFGTYRYAVRSAERI
HALRAAYEAEFAELILDDLVAELQGTLALLHPAMGPNYLLAILAVLGTMV
LPIWGAQLVGNEFRHRTAKARAAHVGWGAMVAAKVAWLLLLSAGLAALFA
GIGALSGQVTWRAAQDALLLAGEVTPPPLKAPLFAQALTAALGLFFFGLV
GLLAALVTRSALAGALVGLALPYVEAFVIGTPEWGWLLPRIAYGNLMVDH
FVYLPGGMVGEPLALVPAPAPWVSWAVVAGWTVLACLAALRLGQRQQILS
>STH2959 conserved hypothetical protein
MDMRRVVTWLSVSLFAAVAVAVWAAWTANLNEHNRLLANALEAERQRNFV
DMVHHVEQIQALLGKGLATGSERQNMRYMADVHRHASAAMTAFSSLPLPT
EVSSTTGKFLQQVGDFAYALLRDEAAGRSLTEAERAELQRLREAAAALKE
QLGTMLTQYQRGGFRWHQPAQLSLRTLMRPPGRVLSDPAASGGPAAAALL
DGGWAQLATSMEQMPTFIYNGPFSDHVNQTPPALSGPQLSREEAGGRLAT
ALPDAAAYRVVDVVESGGNLPAYTFRLAPAGTRGSAYTATVSLTRQGGHL
LEFLSGRVLGQPALDLEQARAVGQEYLARIGYADMVPTFAQIQDGVAFVA
YVYQQGEVIVYPDQAKVKVALDNGEVLGVDARQYLTNHRRRTLPRPRLTP
DEARSRLNPHLEVSRVQLALIPDAAGTGEVLTYEFQGKIGDEVYLVYINA
ETGAEEQILQLIVTDGGTFTL
>STH1961 hypothetical protein
MNLWTTPVPRRIMEQQPRGGVQPHTEGCGMARRRVLVPALAAGLVLLMGS
TALAQPAVRLPQAQDSKFIEDRVEVSVRREDGGLSTQATKYTKTVTITRS
FKSLAGLTVAKHYSRTTWTYGGGTLYSSPRGIDVDWWTAPLNNYKTHSAK
WDWYTTGKGGTGRSNTQVQFVFGVPTPWGPVGSSYSSRIYTTVNGNGGYS
YF
>STH2220 hypothetical protein
MAGGIALGLLSTLAMVLLKPMGVSTVFPSSLAIALKPLFPGFVEANAYFQ
TIPLSIG
>STH1780 hypothetical protein
MHWWEETEARIHRTTSAATLGQGLWQRLVQAGLEPDAAYLAAYDLAALQL
LCADLVKLVDSLLEVADGDRAGWRRHALALLRWANSAESWARETAAATNR
LLDSLDLEPSTLAAREEVEAEPNGALSPEEEAKLDGRYRHWHLLYERLDL
KLSTMGLEEAVQRAVARTLARLYEEAVVTFRLLSGLSRESRPRYGQVARL
LLQINTTWHFDLGPYLLGLGQARPDGRGTVGLATWLLLASREPDMHMG
>STH1554 hypothetical protein
MRSTLHIVSACDYLRIRQALQPVLSRDQFYCSRDAQREQDDPKPALCVHL
MQREDPAPFPTLHGHMRIRQ
>STH2364 conserved hypothetical protein
MVGAGYTSFRKRKSPLPVGRGLFAFARVQAPPGGGTCPTREERSEMRDAR
CVQCERYAQNQWLGRERRTGGVVELARHLVGIQSQLRTTPDLAARARLAG
ARPGDVRRELEETRRLVRTWTVRGTLHVVAAEDLPLLWRALRPEWESRWS
KYLDKHVTRVQREAAAAACLQILARGPATRAELLEGAQQILGCRDEWVAY
LFSSWGGVLKDLAYAGRVVYGPERDGEVQFVRTEDWLDLPVADMDPDDAL
AVLLERYLRAYSPARIQDFAHWSGVSVRRAREALTRLGGRVAVRDGWLVP
EGEGGGSCAPDRQAGGPGADAGCVRLLPKFDPFLLAHADKFYLDEDHYKA
VFRAAADVAAVILSGGRVIGTWRAEGSRVVPALFGPVDDAAAAELAQEVE
AVSAWLRQG
>STH889 hypothetical protein
MSLNFLREARVLFHGRRGLAWLVPLAVLLLRFAGFDATSPENAAALELLL
PLVGPAIAVVLLSLDQHGMGEVVLAGSRAGHFVWVYRLAWVVLWLLLLVA
VPAPSPDLLLALGPAALLMGVVLSLSRRISLRAGLSVALVWWGISYLAIR
LADPLLYFSPFSWAMLQLVQTGLFPEEIWLRKGAQLLLGLLLALDWARQN
MKGHHH
>STH1357 hypothetical protein
MRVRLGRRRPPRRTLVGFVLILAGVSIVLTALPGFVYAAVIGGLIAYIGY
TLIGR
>STH2225 hypothetical protein
MATTGAAEQRQLALQDLMDRYLISQADEPEAYRRVALHEGYLKAWFHERP
RWRILSGRGVYRLERMPSQVLFHRGLPRLRSPLSYACLCWVLWFAETLVT
TARDWFVISELAQRIATVSEGRFTLAERSHREALVQALQFLIDLGGLLLR
DGDADRWVAGQDYLGEPPEVMYEFTEMAPRLLANFSYESLAVAVTADHGR
RTAPPTGEEAPPLTRAWRALLLGPVFWKADDPEAFAALEANYEAVYRDLE
ASLGWQVELTSSFARIWRTTTARHAGAVLLDLYPDPGEEAEERHTRYLFH
PILLLLGRCQEGVAAGRWSADADGAVAISAGELEDLLTELRSQHRPSWGA
ALGALSIAELVTVVLSEMRRMGLLRGPDRFGRCWLLPAAAGIRGRYVTRE
TGARRAPEPEQPQAQQIRLF
>STH1768 conserved hypothetical protein
MAALVETAAALPWMLFLYAADGEAGWFEAVPGAWLPLAVFLTAAVWESGS
RGDARPARVGALVAGAALAYLAAYALLPAPLRTGVFGLNPALAFVPVAVY
LWYAGASCALEGLEYGRLFERAWRLFGGQLAGIVCLLLLGEARSGPVQLV
LGWSVVLLFGAGLSLLILTRERALLGAFDAEEQAAEPTSPAVTGFVLALL
AGTVAASAVLTVDRVATLLRAVGRLVAPVYNVLVQVAAYLAMGVAMLLEP
LFNLLKWLASQQEPTEQGGGEDVAPGPPDLPPDAGAQLDLEPVLKAGLIL
VAVALAALLLWRMSAVRRRTSEVDEERTNLGFWASLWQDLRGLLQARRTA
GAVGDGAGEEPVPPGSARELYRRLQRWGSSRVRPRRPAETPNAYRSALAG
AQPRAADAVDAVTAVYNQERYGSRPPDAETVAEAAALWEASCERDFTV
>STH756 conserved hypothetical protein
MKMRKLHLLLATAATIAAFGVGGATLALLTDQASPSSHEFVAGTLRLDGR
RGGGDTVPGPMFYLGGDQGLNPTGEWAPGDTQVRWFEIQNVGTLAARMTK
ISATMSEGADTTLAEQLFVRVTDEYGLEVAAGRLSDFLDEEGEPFRGPAI
ELEPGDILTLTFEVSLPLGTTDAFQGLSAESVTFSVSAEQVRNNP
>STH3138 hypothetical protein
MTGAHATGHHLNEAAERGEPARDVPLEGLVWIEKEDQAKQGMARKGDMRD
AAAAPDRPHPAGRPDQGGRQSH
>STH2371 hypothetical protein
MREVLTELQRLLQAEGLDGQYGYLLRRCFELLSWAGNEPDAAAALKEYLL
GIPNAPGTLWDLVIWRDSFEERKRVNERLDQLRERLMALARSL
>STH2645 hypothetical protein
MIAMMNETFYPPRPARSQGLKGISFVTLAGLTLAVLALASSMLSALNMRY
EVTPEALRIRSGFSTREIPLDEVTGVWRPEALTGGVRKFGTSVPNLRTGR
WRFNETGDITLYATRLETLVVVDTADGRYGVTPENPDAFTDALRSRLPAG
FSPAGGSGTAAASMLIPLLLVAVAVPTTLYAVGLPSRFPQTLRYELGPDG
LTIRTGFRPVQVRYADVERVEVASPKGYPIRIYGTAVGSLLWGKFRWPDA
GPNLYLYCTRMKPLVLLRLRDDRTIGITPEEDERFVAELRKRMG
>STH2659 small acid-soluble spore protein
MCIFGFTGRDSFPFVYEIASAAKIGAGGSHMQKKTKSPTAKGRQKKPKEL
TPLDRLKLEIAQELGLAEKIARSGWAELTAAESGRLGGILNRRLKELKLA
IGPKGTLIPTAKS
>STH3088 hypothetical protein
MDERQIRQRLESLVDRWLKASPAEKRILIQEKIRLEEMLEEERA
>STH2216 transposase-like protein
MVGVWPLPIHHVEVRLMQSHGTTPNIAAQAIKSTTRHGFLVALGWVAQRL
NLVEILNRHLRIKQKTYAHTPVDKVVEALVAILGNCRYMKDLNFDPEPLV
ADPAVAQAWGQERFAHFSTVCATFSKLTEENVQQLSDALAEIQAPLLQQE
VAAVAGPDRSGMVIVDIDLTGQKVRGETRQYTGTDFGYIQGKLARGYQIA
AAFLSGKQQRFAIDGLLKSGKANSRSGACLLELIPRIEARIGRPLRRVEW
VEACLAQQKARVRQLYQQLQTVSGKGSARRKQKLQREFQEEVQHLREVNQ
RLRQYRQENRTNPAPLRIVLRADSAFGTPEVIQRLLELGYEFTIKSYSGS
NPAYKRLFDAVPAEGWVEVEKNRFASEAVTVPGPTLLAPYPVRLVAMRRW
DADGREVRSVILTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGT
PRMRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLAEAGSRLLVRVAA
RCRATIRCLGDTLRLVFSRGTPLAGAEITLNRATPYPYALLTPRMSSCSR
ET
>STH776 hypothetical protein
MLLEIISEAVVGRAEHTFTWTHTLPADGVVQVLGVRPGPSEARVRAEGGS
PQAEVTVDVDLWFLGQDGTRVTRTRCQTVQPAPVALRGNLLSDPSYDLRL
LGNARTTDVRVEDGSVHLDMAVTVEVEARALTRYWVRAEDHVPAR
>STH3309 hypothetical protein
MKGQRVVWRLLSAVVAGALLLCGLAGWPRARAAVYRYYRHQGRVRTLGGL
AGYRTAESGRFVLYYLPEDGDLAALILEQAEAVYERVVAEVGYPPAERVP
LILYPDRTALRAAYGWGSEQSAVGVYWRGTVGLLSPRVWIAAGDPAELAS
EFRRLSPVAHELTHYLLDELTEGNYPRWFTEGLAQYVEELATGYVWPEAA
AAPLDPPLYTLSELTRRFDDLPDRPLAYRQSHLLVAHIAEAHGRPGLTGL
IALLAEGVPFDRAVETALGRSMASIYADWLDQAEGRSAQGRNVRLSERKG
LLEER
>STH1262 conserved hypothetical protein
MPIYEFRCNACNHLFEELVPLNTTGENQKCPECGHVGARRLVSAFAAHGL
ENGHIAVGQKLTSLDKARASGKSESKSGESKSSGESASKSA
>STH2511 hypothetical protein, alanine-rich
MYQHIHYPDAPAPRSLPRWTVWAFAPVALALGLIVGVAASAYGPWSAPAG
EPAVQAPPEQETPAPADSTALPLPATGNVPTQTPGPEAPSAVGTAPLLPP
AVDDSTYVTPVPVTAGPSLAEVRTLVDARTVLASEFGTFTAGGRQWFQVD
YQMARDINYDTMLVGIVKIADYNNWLTAVRDYRDELTRWLTAAARRVRDA
AQNEGFKLSWALFEVVPEPPYGFLASEVTQMPRGQGYLVTRPLAAVSDFS
GPTVTLAAVAGTGGGPVTDPSPASVYAPVLRFDPTDLYRPPTAP
>STH1557 hypothetical protein
MNRMPNFREWTPSEIMAQMPEARSVLAKHFGREGISTASGFRLRDLARQK
DVELSDVIRDLNDVARATGMPW
>STH1019 hypothetical protein
MINALTREQVAGLRPPELQSYEEPLYEVGVPTLDLENRVPHRRLTSIEAA
QLQAAFWQAEQHFGKEYDEVYRTLDELRERTEESGPLNEEWRRAIDAASN
LHWNFARAIGVLAFLAGQREGFRLGAAAVTRGAQDQAIATALIGALVDDK
QLDRLIGQLATGREDPEYIV
>STH1834 hypothetical protein
MGFPSPPALALWCRPDARAPSRYVCIDLDCKIGFLWRTNCGRQKMRSLFA
LGDASCSGMITGLVPNRDL
>STH295 hypothetical protein
MPSGGEHQGRDEWPVQHLLQFLRFIATVALTILIIAVLGLALGPKDAGTV
TLYALVIGLNALIALLYWPVVRFVQRRGR
>STH1944 hypothetical protein
MRRHGMMGFVAGFVTAAVLFGGLAVAAGYERQITVHFRPLKYVFDGIERT
PQGGEAGFIYQNRTYVPLRFIAESLGRPVEFVDGTIYVGAIPGKTPEVWN
RLTEQGEGSFKVQFFADRALSLQGEEMPAATVVTLVAPGGDVEDKRGVTS
QLWADYDLPAGVGRMSGTLYVPHQYFGLAGERRVGRLVVLNELNRPIYTS
PDLTTKSDPVPFSVPLDGVKRVRIVVTLHPYEGVPLGDQLVMAQMGIAGL
KFE
>STH969 S-layer associated protein, putative
MVRLRKRWDGTDRHVDEGVCMTMTTPRRTDITHRALKRAVAAVALAAALA
LAAPPVTHGAALWDTAGHWARAEIAAGVAAGYISGFPDGSFRPDQPMTRA
EFYTLLTGALGLAPRPGDSAPYAAGHWAARQGRLQAAAAAGLLDPADYGG
WLAPDEPVTRREIVLAGVRALGRAHLVGGRALASADAASYPDWLQAWAAE
AVHLGILQGYADGALGLDRTATRAEALVMVQRIRYALLAEVAETDEDAVP
GARRYPAPGEPTWTVDSGSPARPVFSDGRQGYALNAAVAGYTLLPAPGRA
AWLSTVDGGGTYRLYWLADGRAAEVARGAEPIAALAVAEDGRLWFSRGRE
ILAAAEGGIVERHTLAGQATFGALDGAGALWAVDSTNLYRVAGGKADRYL
LPSPLLARVRYVAPAADGSVWLLLRSEEMGGGVEAVRIRGGRVAQEVSLI
GGGPQARTAHVQAAVLARRGDDLLILVTAPERAVIRFDLAEGAAARLVLP
PEVGEGAQVVPAADGGALVLGRQGRRWRIVD
>STH100 conserved hypothetical protein
MRPRNAHPRPVRRAGPRLAPEQTLTRQQVSLPLLLSLALALALVAAPLWA
FPGLRPAAERPRRESATPLPGPALEMLNRAWEAAGDRLEQALLFLQEPVE
DPQVAERLAAGLGWGADAPTGEERSLRLVDGIGGPYVEVTWRLRGEAAAR
WDERYRELRQVLAAEGLWPQVQVELSGRAAGGQPLALAGAALDALGARQR
EPWSDGRAASVAGYTPLLPPGPYAVNVQAAVRRDGDGDGNRLWIGWPTVR
SDY
>STH1140 multidrug efflux protein variant
MIAFAWVYEAAASMVSQFGQVYLPALGLSMLGAGVVFSASRLLGAAGGWV
AARLQGGGADRWLRWGPLGQAALILLAGLARTAGGGVALAVNEGVDGLVY
PLLSARINQAVPSAQRATILSFQSLGSSLLIALAFPAAAALPSVFTVYAI
VGGALAAAALGWALPRQRAAVPAPEGR
>STH317 hypothetical protein
MAAGSSSTPPCFNPHPASRPGATRRSASPSRPRPRFNPHPASRPGATLPL
DLLHRDGHHVSILTRPRGRVLRWAFFRPAAIRSFQSSPGLAAGCYPCIAA
QIGCRVMFQSSPGLAAGCYPAQGLAHLEE
>STH1772 hypothetical protein
MGYDEVQALRDRIRALEERTRLLEERIRGLRTSRRILMNLLAAQERERRA
LVARLEAENGRLQRKTARFARAVLERNIRIVRLEESLRRQTDVSSG
>STH2258 hypothetical protein
MDELRTTGVPWGVIVGKIKEHLPTEWHDLDNRAYQSVKPFLDETFGQGNW
DTEKRPTRDGSRSVTWAFVRNRVDE
>STH217 conserved hypothetical protein
MRNVWPLVRVQLRQTWRGSLERMTGTRSRLGLLLIPLLILAFLPLILLFV
AMYVGLYWGLEPFGQAPFVLTVALTAGQLLCLIFGVLYVISVFYFSQDLR
LLIPLPLRPGEIVLAKLVSILLGEYLSMAPVVIPGLAVYGFLADVGPLYI
PFAVLIYLMLPVFPLVLSGLFSLLLMRITALRRNRDLFRVFGALLGIVLA
IALQFATRFQQGNLQAEDVQRFLENQQPLIQGVSRWIVTSAWGTQALQAG
SPALGIPYFLLFTGAVAASLALLVAGAERFFFGGLLSGGERATSGREISR
SELAERTGRVQSPLRALFLREVRLLNRNPSHLMTAISTPVLIPLFIAIPM
ATGNLVVEGFDLARHADSPWVPAILLAAVFGMSSTSVIASSAVSREGRWF
WISQSLPVAPRVQIHAKLLHSLLFNLLNLLIVGAAALWLGLGTPRNLAVL
LAGGLPVCVLSGYSGLLIDVLKPHLTWTDPQQAMKGNINSLLGMLIHLGV
TAVTVGAAIVLYAAARPFFLPGLVIVLALEAWLLGQIVGALADARYAQYE
L
>STH1682 conserved hypothetical protein
MDSVSLVLLAAALSLDALMAGLLYGLRGIRVSPGAAAIVSGATALLLGAA
MGAGAFLAARLPPAAAEQAGAAILAATGLWITAQTLRSHPGVRRRAGEPT
PRRVWRLRLGSFGIVVEILREPSVADLDRSGHINPAEALLLGVALALDSV
AAGLGAGMTGLSPLGLPLAAGVTSFALLSAGSRLAGHLPLRLGGRWAALH
GVMLALLGLYRMMR
>STH594 conserved hypothetical protein
MNVERELLEPVPLCDDRGRLNPDAVGWSRRPLHRCNLRGHWLRKKRWNYW
AITTETHLFSLTVTDLDYAGLGFAYFLDFGTGDFTEQTVLHLLGRGLTLP
DDVEGEVRFDDEKMRLYMYDDGETVRLTAASPDFGGVPMQADFTVTRPPE
QESVNVVIPWSRHRLSRRPCRPGGRCGSAIGASPLRASPAWTSGGASGPT
GASGTGPPLQACRAATRSASTWAAAGPTGPA
>STH1851 conserved domain protein
MGIIDRVVLSIYTFSLAIISGLMVLLAIAPEWIPVHIWLQDLLLTGRGRV
VLGLVGSAFFVVSVRLIVFAFSSQGGGRPVIYETSMGEVHISLGAVESLV
KKTARSVKGVRDMKAVITHAEDGLHAHLTGTVSPEVSIPEVSEEIQSAVR
QYVRRVVGVEMAEVRIDIENIATDSRTRRLD
>STH207 conserved hypothetical protein
MEKNRNRPEDALQGSVRSETQRVLSQLREGEAWSHRAVAMGYLAGGNEGQ
FLTADQSLGDAGDTQSPTDIDTGR
>STH1495 hypothetical protein
MESGSWRWDLVGLRWEEAQAILQDRGVAYTWSVTAPPNRPVGIGELRVVA
QRETPAGPAFVLAHREYARREQPAGQGGQGGA
>STH3130 conserved hypothetical protein
MPRGRDGMDYGVALFHTTSMALKAEQRLKAAGLAVKLIPTPRQFSSDCGF
ALRFAWADCARVEELLRTGGVETAGLRRL
>STH2134 hypothetical protein
MGSPAEGWPLPDAHDGIDLFRGGQLLRNAAEAYGRALTPEEYWWLGVGLA
AATLAELGGREYAARRIQQLLGRLGGPGDPEEWFEQFHNELAALDAEGLL
WPLYQRDERGKMARTDAVVAAGFDADAGWALALAHLAIARAVLEERQEAI
PTLAEVIADLIEGAPAAEHLPRVQAALGMQPEGAEEP
>STH1649 hypothetical protein
MGGGGTHWISALPVRRALPFLLAAGPGLTAGNSSGWCASLRSAWPPRLPA
PRRCSSGPRNRLDRLRGSAPPGGRAFPPERTRRRRKHRSRMTAGAVIPCA
RPGWITDCTGSSAHARTARPDDDTRRSAPVTRSGPAACTLSSGPP
>STH668 conserved hypothetical protein
MYTLLLRLAGPMQSWGTASRFTERDTGMEPSKSGVIGILCAALGKPREER
PEDGGRWPSLAELARLRMGVRIDAPGRVGVDFQTAGGGRLGTRPYGVAKA
DGSKPDSVMSWRYYLEDAAFLVGLESGNRALLERLHVALREPVWPLYLGR
KSYVPSSPIFLPDGLREGGLETVLAQYPPLVVSGQERVRVVVDQAEPSGE
VRMDVPLDFARRLFGIRYVCSYSIDLPDEAGT
>STH3216 conserved hypothetical protein
MRVQWKHVREAVNSTVKIGLVRHFRVKHAYPAKLLVTPDEVAEWFAGYDN
ADIEYGHVDLSGGDWQHCFCSDLPRAVKTARAIFPGEIIILKDLREIEPY
PFRGNRIKLPFLVWAVLVRLVWYFKSHPNVESRRAVRERVRRVVDRVLQS
DSDALVVSHAALMPFLRSELKRRGFRGPWFGHAANGLLYVFER
>STH2566 hypothetical protein
MSRKGHGKRGSSAVWAVTAAAVALVAVLICASVWSARSGRAPAGEAVRDA
DLGAVGNAIGPADAPVTVVEYMDYA
>STH1530 conserved domain protein
MRQRVEEAMAEGVLDLPANLVAHVERCPHCSAEVREVEMLLHRLRSLPAS
LDLSPVPAAVDRVLQATASNLTAAGAVPAPAPVEKKRRRTPQWQWVLGQV
AAVAAVIAIAAGGLTLLGLGIHGAVSGQEPGRIVERWVAPLRDWTQALFR
NVR
>STH2393 hypothetical protein
MRSQLAAAGLRERLEAEIYDFELELRGTDKSNLLIRLDSFRLEILGARPD
LLLHNLAAIILTEAGAFRLNTVEVGFTAWLKAGQGRPLNLVAQAFTPAWG
AAEAEFLDRRFAMTWDWATPTTGYTFHASSAEDEELMLNFKAREGYMTLT
ELQTGLWIQEQRQRFEQAARLFLNLLGWTL
>STH2520 conserved hypothetical protein
MNKAEMYYEYLKEEGYVPRYDSDGDIVFKVEGLTYLLFASEDDEPFFRLC
LPNIWKVESAAERERALAAACRVNAEIKVVKLHLVEDNVWASVEMLIDPP
EGFKPVFDRALKIVRLGAERFVSYMNTPVQ
>STH859 hypothetical protein
MSVKEAHACDPGSLRSVTHCCSSRAAPFLNLRRHVPIRERVPRQ
>STH3112 conserved domain protein, histidine- and aspartate-rich
MSRAGCRSYAGGREGGERMSWRDDHDRRSGLCADLEDDRDCGCGGHGWNG
RHDQDGGDRWDGDHGHGHHDHRGDHDHHGHHDHHHNHHGGHDGHHHRHRR
RQRRLIPIGSVTVDCTPVTFRTPGQAINVVSCPVSWIIPGTRCIATIHTP
PVQTPVLPIHRHRKACGCH
>STH250 conserved hypothetical protein
MLLQSGKSAGLSGSIAGAGEQIFGKKKGLDEILNRFSMAFATLFVLTSVL
MLFLEARG
>STH908 hypothetical protein, alanine-rich
MSDADKELRQLIKDEFDRSLAGWTFTPAMRQAVLDRIAGEGEPEATPVPT
LSRRFRPAYWVAAAAAAFVLAINLWPRVSTDGRLTGTASDARSGGAPESA
PGAEAAAYGLQATGELQALGAAPGQSGTADSAEADAASTLKAGGVRALSA
ISPVRLTLSVAEVPDVFVVERAGEKVSLHVSETTGRALTDGAEEAQAGAT
AAAVPDALDLNPLPDGNVVVVQRWAVKIVDQHGNPVAESPVEPEAAAVAE
GPAGETVVVSSDALTLFTVTGKPAELIGLDQYPSLVAVSGTRMAVANDGG
VEVYEGGARALTLPRLQPRAMALAPDGALAVLTGSPPDARLLIYAADGSL
LLDRPVAPDGEGFGFIGDGLQVAVGSVVYDRTGDERWHFPFAPARVTALA
DGVSVLAWNSRQAARVLADGGTQVWMAEVTDADLVRASGSARGDLVVLVA
AADDGAAVWVIDQGGNQRHAERLSRVPVDVTASGDRILLLNAEGLEVRPL
NR
>STH2351 conserved domain protein
MSRSGVRPPELPAAREGSAAVRDGRIAVTDPQGDGRPAVLIPGDDVILMI
EGQRVTLPVAVFSWQDVQVRPRRPACPVRGFRVAVSPDGLSAYLEVEEQG
PPGYRIADAGPARVLALTAEPLPGLQANPRDADVHQALRRAGVSAGVIPS
AVAYALASPAARTVQVTWLPELPAARTVQVTWLPEVPAARTVQVTWFPEV
PAARTVQVTWLPEVPAARTVQVTWLPEVPAARTVQVTWLPEVPAAHTVQV
TWLPEVPAARTVQVAWLPEVPAARTVQVAWLPDVPAPPAVTRRRRDR
>STH399 septum site-determining protein
MTLRVALCGLPPAAARRLEGWLAAHPALQLSVSFSAPDVDSLLDRLSESP
AHAVVLDAGLGLEGLSCAQELAAAGYAVLCLAGGPASAPLRRRAADLGLT
LCPDADPARAATLLRRLLGLGAGSAQIGHVIAFHSPRGGAGTTSLLLHAA
RSLHGRGQTVAVVEVSGGGGAAPLLGLRPGGGWEELVGLPPEELLSDPCG
PERVAAALRAVEPGLHLLPSAGPAVMDELHPDLVEAVLRLLGPCGCTFAL
VDTPAEMTLTAAAAIAASDAVCLIGLPDAVSAYRFVQVESLLAGLQVPPE
RVHPVLNRWREPAPPSVEEALAFLPYRPAVRVPEESRPAVDPSGRFCGFR
PGGGAARALERLVDALVQEVAGT
>STH2999 fliJ, flagellar basal-body protein
MKRFRFRLQRLLEIRQQETKVALNNFARARLATRAAAMRLEQAAARRAES
AQRLLARRQGRMTVLEWRQSTELHEALVAAEWLAAEQLAAAQQEEERRRW
ELTEAERREKVLERLRDRRAAEFERAMLAAEQAAIDEMAQTVYREGGGRR
>STH1642 gerKA2, spore germination protein KA
MSTRSGSTPHRLRVHRLAGQSLQLEAYFRSLGLAREADLARRMAVSPRSE
TRLFELVHRNPALPVPARLGQAEALMDLYFGHSQDLTVRHISLEPGAVPA
MIVYLPTMVDEPRLDQSLHSLLQPASAAERPPAARDVVGWIRHHQVSAPD
TSEVVQIRQAAFAVSEGMAVLFVEGSGCALAFDVTGGPQRAISESKTQRV
VRGPREGFIESIHVNLTLIRRRVRDPRLRVDMLTVGELTRTRVAVCYLAT
VCKQSLIDEAMRRIRRVKVDGMLDSGQLMELIEDTPWTVFPLVRATERPD
AVVGGLLEGRFAIVVDGSPWVLVAPSTFMDLIHSPEDYFERFPAVVLVRI
LRVLFAAVALFGPSIYVALTTFHRETIPTNLLLTIMAAREGIPFPAAMEA
FMMELGFEIIREAGVRMPSQLGQSVSIVGALILGESAIQAGIVSAPMIIT
VAVTALANLMLPDYSTALALRMLRFPLLILAGTYGAYGLILGATALLIHL
LSLRSFGTPYMAPFGPLLPSDLRDTVVRSPLWARQKRPAAVEQTDPVRAG
HGMKPGPGPVRRAGARR
>STH1640 gerKB2, spore germination protein KB
MSVGHRVLISHQQLMFLVYTLHFSATELLLPSSLAETGKSGGWMAPLVSF
FLSAVPVALMLGLLVRRHPHLGLGALSHHLLGRLPARLLMLLTTLFNVGL
TALCLRDMVEAIPVAILPVTPTLAVALPFLLVAGYGAYCGAEVLARLAFF
FMVIAVSLFSLVVVTLLRLVRALHLLPLWDQSPLQLVAAAWPTTGWYAES
WTFLPLAAMVDRPQYAGRGLIAGALIAAVHLMSCTALSIGIFGHSLVAHF
AFPIHALFQQITIGEFVERLDVILITICLLGMIVKTATHLWLAVDAAQFA
LGLRNQRPLLPALGLAAVLWMLSIPNLPWLFAFSTTVWTPFSLCLGLGVP
ALLLAASWIRERQYRPPDISS
>STH1641 gerKC2, spore germination protein KC
MRRVAALLLLLPLLTGCWGRLELADLALVIALGVDLADNGLYEFTFGIAG
PPVSHMKNAPQGDAGIPSRIVVTQRGRSFAGVLREIELRLPRRINLTHTL
VVFIGERLAEHGLGDALDFVLRAPEFRLQGLIVMVRGGPVRTLLETEPLM
ENLQTKALTEIAKAHIGLEVRLWEFFSARATTYRAPLLPVIELMESPDTG
AKGQRYAARLGGAAVLRGDRVALYLDAREVRAIKWLRGRGRDGVITVPCG
DEPGPESVSFRVVQAGHRVRLHRGGKPAFAVALRGRLRVTEMQCPRPLVE
PQVHEQLVQRAEAELRDLMEQVIANLQEAGVDPVFFGEHVRALRPGLWRT
VGEERWGETWREVPVTVSVDLRLHTTGLMRNPLRS
>STH2104 gerKC2, spore germination protein KC
MRRRMRLLALSVLLALGSALLSGCWSRVELNDIGLVLGLAVDVGEEEPVR
VTLYVPRPLSPEQGGGIGGDQEPIWVVAREADNFSDALALIRLASARRLV
FHHLRVVLIGEEYARKHGIGDVLDVLATNHEIRLTVRPFLVEGRAQEVLE
TLPQLRALQPFNLTGILQTKGGLEWRLKDVLVARASDTHSVWMPTIQVVP
RPAITANSPPTAVTLSGVALFRRDYLQRILEPAAYQVIAWFLGNPSGFTI
TAPCPTGGNGSVSAQVVSGRTRVRPRWQDGGIAFKVEITSNVNIMRSECE
MAEVKHAEVREHLEQVLASDLRERIAQFIRITQEAVTDPVGFGKHAQLAF
PRFYKTMEDKWGENFWPNTPVEVAVKMTVDQAGLVTGPVHPTERELRERV
H
>STH2454 gerXC, spore germination protein GerKC
MPRSLLVLLLVLPLLAGCWNKMEIEEGAYVLALGVDEGRGGSLAITVVIA
KPRALAGKEGGSPEEPPVLITTVEAPGIAAATNILHGYIGRRVQFHHVQA
VFVHEQLAREKGLVFLDEVARFRQLRETAFLIVTREPAAEFLQRVAPELD
INPIKFIEQLTYHTRTSGTLPSASQISSFIALLNAEYQEPIAYYAALSAE
ESGSETISPERESQIEAGVLPRSGGPAVEMIGAAVFRGRRMVGVLNGEEV
RALLVLQNRFQGAFDAIPDPGDPDEFIILHVSRGRPTRIFVDRLGEKPSI
RAYITLEAEVVGVPSGIDYALPDRQDELEQAIGQHVLQTMERLIEKTQRW
ESDVAGFGRHAVSAFPTVQTWEAYNWPRRYAEAEITATVDVRLRRFGLTL
APTRSSEERMTR
>STH1220 spoIIGA, sporulation-specific protease
MSGLPAVRPDVFFLVNFGLDLALLWFAARAARVRYRAWRLWVAALLGAAL
AVLPVLLPDVPAAAPASPSAGPAPKAGWLFSGAALLAGSAALAWLLVWPG
PWAQFAAVLGFFWTGLILSGGLLFLLAERYPALFAAPPAALVAGGAGVAL
AGAQLLWQAYRERAEVDDGLYELEVRVDGRREVVEGLVDSGNLLRTPVGR
MPVAVVEGGRLRSLLPPAVLEAASSGPMGLDGLPAEWQSRCQLVPFAAVG
RSDGWLLVIRPDGLSVRPRGRGDWVQVEGRVGLAAGPLDPEGRYAALLPT
AMIAAARRAHGRARGRPVEAQSGERGEGRPDVHVR
>STH1861 spoIIIAB, stage III sporulation protein AB
MTLKLLGAALTVLAPSWIGFQIAARYARRPAELRGFQNGLAVLVTEVEYG
ATPMPDALRSAARASGPVAGAVLADAADRLEGGGGITPGEALAAALEERR
GATCLTPADQEILAALVPVLGASDRRDQVRHLRLALERLAAAEAEATDER
RRYEKMYRYVGVLSGLALVLILI
>STH1860 spoIIIAC, stage III sporulation protein AC
MSRVDVQLLFRIAGVGIIVAVMATVLKQAGKDEQGQMVTLVGVLVVLMMV
VTLLGRFFNLVKATFGMY
>STH1859 spoIIIAD, stage III sporulation protein AD
MEIVQIVGLGLLAGMLISVLRQHRPELAMQLSIAAGVMLFALMMAKVLRV
VEVIQSLAARASLEQAHIDTVLKIIGIAYITDFGAQVLSDAGEKAVATKV
EMAGKIIIMLLAVPIILGVLDAILNLLG
>STH1858 spoIIIAE, stage III sporulation protein AE
MRSVKRTWWWVLLLLAVALTALPASAEGPAAPGGLLWEQAAALDTSAIER
FLDEVNRTWEGYGPQITLGDFLRLSSGEDSPSLSAGAILQGLLRYLVREV
LANADLLMKLVVLAITAAVLGQMQSAMGADAAGSVAYWVIYLVLVGLAIT
GFGLAVGAARQVMESLNAFMLAVLPTLLTVLIALGGAATAAIFQPLMVTM
LNVTSTVMVNVVFPLAFLAAVLDIVSGLHEKYRLSNLAHLLRQGATVTLG
ATGTVFLGTVAVKGAAGAVADGLSVKAAKFVTGSFVPVIGKTLADATDLI
VGSSLLLKNALGMLGATAIFFIVAFPLLKILSISWVYQVAGALVQPAGAG
EIARMLTTMAKSLYMLFAAVGMVALMFFISVVVIVGSANVAVMVR
>STH1857 spoIIIAF, stage III sporulation protein AF
MAALTEWVRGLVVLVVLASLLEMLLPMGGMKRFVRLAMGLVIMLGIVRPI
LGLLGGQVAVDPEPWLEPTTSLPSVSEIARQAERFQARTQALLLEELQDR
IRRAAEEAARSVEGVAEAAAAVELAGGPRLETVSLERVTVTVVLGSRFGQ
VRPVAPVRIGGEEQAAGGGAGTRARAPTPAETPLAETVRRQVAEQLGLTD
ASQVTVWIESVDAAGR
>STH1856 spoIIIAG, stage III sporulation protein AG
MGERKAPGNGPGSPWSWLGLGLSKEQAKLVPLIAVLLFIGILMLQSDELF
GIDADSPPLDPAPGASLVGPVGAEDELTRLERQKAAELEEMLGQIEGAGR
VRVMVTLAAGPAIQVVKNTTVDQSTTTEEAADSSTRRIESINTREDHVFT
RSGSSEQPVIAQTSAPEIAGVLIVAEGARDVRIRARLLDAAMVALNVPAN
RIQVVPADGR
>STH1855 spoIIIAH, stage III sporulation protein AH
MFVVKRGSDLLRFALYVLVVASLVWYVVSRFGEWRAAQAPSRGPVLADGP
APVVGPEPEEAEEALVRPDEEVLAEGAEGTDYFAEYRIERERTRGALGDR
LREVMVAEGASAEVRQEAAAQYLELGRRAALESQAEALVRARGFTDVIVH
LTDGSAQVVVKARSLSQQQVAQIIDTVSRTTGVRATAITVMARDD
>STH1823 spoIIM, stage II sporulation protein M
MQLYAAWRENLEELLLERGGLVLWHTALFVVGLLFGALALRSLEVETQLE
LARQISGALQALREGEPPAPGPLLREALFRQARTVALMWVLGVSLVGSVA
VMALPLVRGFTSGFAVAYLTAELGARGVLLAAAGHLPQTVLEVPGVILAA
SASVGFAVEVLTSWRVRRRLSGYYDALARYSNTLLCAAVLLAAAALVEGY
VTPHLVRLVLEAP
>STH481 spoIIP, stage II sporulation protein P
MVVAATVGLLVNRRPPESGSLPAMTARASDGRTHESERTEESFWTALFRP
GLPTARQMLRRAVPALAVRGPLGEPDDRVLRFLWTGPGPQRPQTLFQAAL
PFLRPGALPEEPMTVDPSLPAGESPGPEPRPSPPAAAEAPFPRPEPGATV
LNDGLPLVGIYHTHDYEAYISEFPDLAVTSDQDLQRIASYDHSKRTIVDI
GAILARRLRDLGVTTVHAPFKHQELGYEYAYQSSRNTARRILREAPTVKV
LMDLHRDGNMDLDSTVWIDGQPVARVRCVIGVRDDLTHWQENLAFCNRLM
EKMEEANPGITLPTLTPQARYNQDLLPGAILLEIGNALNTFEEAERAVYY
LADALVELLRAGEYPGK
>STH1223 spoIIR, pro-sigma-E processing factor SpoIIR
MNRKLITGWLLIVLGLGQAGGGVLLAAAARAEDAALADPNLVRIHVIAHS
DSPEEQALKLAVRDAVLDRLRPALSGARSRAEAEARIAVLLPELEQAARA
VAAAWGAAHPVRAELGTYAFPGKGYGDLYLPAGRYRALRILIGDAAGANF
WCLVYPSFCYTIREVQVPPTEAVGTTLPEGCADGCAGSCTGRYPEPWAEG
CPGICTAGCVSPL
>STH1811 spoVAE, stage V sporulation protein AE
MLGVRGPSKAAASPTARLADPPPAGPPVHVSDPAPGGIGLKPPVRVILLT
DGDSAARRVAENVAQRLGLRCISASAGNPTPLSGPELVALIKQAVHDPVL
VMVDDKGDPGTGPGEEALAYICRHPDIRVLGAVAVASNTRARGVEVDVSI
DREGRVRDGPVDKDGQPRRRGRLKGDTVDVLNRLSVPIIVGVGDPGKTGG
EIDLEAGCEITARAIRQILERS