GenColors

Gene list

Applied filters:

COG category: Unclassified
Organism: Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334
Gene type: CDS

Number of genes found: 428

# Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334

>SP_0164 hypothetical protein
MRSLFKKIVAVLVIGLILLGIARTPQVHKMARGIDPGPANFI
>SP_1548 hypothetical protein
MKRIVFELIFIATTWYIFLPPLNLTSWEFLFFLCGHLLVVAILFGFGKGI
NLVKTVHVRHGKAEAALNLEGFKINRLGKILLASIGGILLLAALVSLVTS
SMFQAKNYANVVTVTEKDFTEFPKSDTSKVPILDRSTAEKIGDRYLGSLT
DKVSQYVAADTYTQLTIDGKPYRVTPLEYADPIKWFNNQAKGIGEYIKVD
MVTGNADLVDLKTPIKYSDSEYFNRDVKRHLRLKYPTKIFKTPSFEVDDE
GNPFYVATVYQKQFGLAVPRPASVIILDATNGETKEYSLSDVPEWVDRIY
PAEETIEQINYNGKYKDGFLNAMISKKNVTQTTNGYNYLSIGNDIYLYTG
VTSANADESNLGFILENMRTGEITKYSLASATEESARESAEGAVQEKSYK
ATFPILINLNDKPLYIMGLKDNAGLVKEYALVDAVEYQNVIVATTVEEML
SKYANKNDLEIDNATTESINGVVADLKSAVIKGDTVYFFKVDGKIYKVKA
SVSDDLPYLENGKTFEGQVGKDNYLKTFKLR
>SP_0039 IS1381, transposase OrfB, truncation
MRNIGQAGKILADSGYQGLMKIYPQAQTPRKSSKLKPLTVEDKACNHALS
KEISKVENIFAKVKTFKMFSTTYRNHRKRFGLRMNLIAGIINHELGF
>SP_0513 hypothetical protein
MNSRVEFRIFTIVDLDKEEEHLHEMHLKGWRYRTSRFGLFYFDQCQPDDV
IYRIYDSRFLKKV
>SP_1292 SAP domain protein
MNFFSKLFNLKQNNHNRDTNSDCNNFYLNELECGLTPGQLILIDWTQKTG
RNYNFPRYFKYSLQIDPESTHNQLYKLGYFTKNKTLSYLTVVELKTILSK
HNLATSGKKAELITRIINNVNIDNLDIPFEFKLTKEAQNLIIEHSDYIKA
YYDKDITMEDYCKEKNNISFKATFGDIKWSLLNKQAHRNTVSGDFGCLSN
TRKAQGRHLEQEGNIKHALIYYIESLIITISGLENNFSATDYPVYYPDSI
PDYSLKHIQTLMESLSDDDYDFAFDEALFRFSILNANHFLSKEDIDYLRV
NLPRSTAEEINNYLKKYECYSPLNNLELDDFE
>SP_1132 hypothetical protein
MEILSKEIQLQGLQLLKQTLETLVELEKQRSSKLDLISRKELMDLLGISA
TTLDNWEDLGLKRYQTPMDGAKKVFYRPSDVYLFLAIK
>SP_0162 hypothetical protein
MSMWRDWAPMWWSFSVLSEIWYNSTNQFLGK
>SP_1487 hypothetical protein
MAKCKKYEEFGLDSLLQETRGGRNHAYMTVEQEKVFLARHLKATEAGEFV
TIDALFQAYKKELGRSYTRDAFYQLLKRHGWRNITPRPEHPKKADAQTIV
ASKNKVSIQEDK
>SP_0818 IS630-Spn1, transposase Orf1
MWYNLLMAYSIDFRKKVLSYCERTGSITEASHVFQISRNTIYGWLKLKEK
TGELNHQVKGIKPRKVDRDRLKNYLTDNPDAYLTEIASEFGCHPTTIHYA
LKAMGYTRKKKELHLL
>SP_1866 hypothetical protein
MHVVQNLVNIPKLTRIFISKQKNNTPSKLFGFFMKFTENS
>SP_0195 hypothetical protein
MKKSSTALWRSDTVWETVSAQLPEMGKSWNVQ
>SP_0201 hypothetical protein
MLSHAWDDTATLYRKSERLSPSAILSPLHYTATEENRNKLLNDLKEKQPK
VIVVNDKVVVWSEVETLLKENYQQVKTDYSEFKVYKIK
>SP_1707 hypothetical protein
MMEHLFKMIILLPCFYFFSWIDKDNRESKFFPIFYYFYWIYITLYALFSL
AWTVFSVLFFNIVLRNLTDIKLWGIWLLLLLIAFASDWLAYVFFKKMLDL
RRELGKSKGGRH
>SP_1822 conserved domain protein
MKDYKINFDLGKIEYFDNNCLIQVYKFISFYDICEMVFAFHLPPDELITN
VIFKEKINSMLKCYIDRLLYVFINPTHFTEKVNLQFYGSFFSYEFICREV
GNILKNKGVKCNLNFFEGEEYL
>SP_0990 hypothetical protein
MGVAEKIEEVSMGKSLLTDEMIERANRGEKISGPPLLDDNEETKILPTSS
SRFGYANPKDHGFSQETLKIQVEPSIHKSRRIENTKRNVFNSKLNKILFA
VIFLLILLVLAMKLL
>SP_0514 hypothetical protein
MILDFLKKYKHELQDFRDRGWELIGAGSCSILRKSSSDLLPEDQVYMSKG
LKWEVMRSRLRSCTATFSGGLVVCMSLFREDLSMSFFLIFVLYAFLISYL
IYGYFRLKRKYRVDE
>SP_1481 hypothetical protein
MICQFIRDMLDLPAKNVTILEGSNIHVLPSMPYSA
>SP_1042 hypothetical protein
MMKMATDKNRIMISLDDKNLEKLENLVEDARDRRGMRLTKSQVIELLLNT
VDYFDDIMGAIYSKK
>SP_0832 hypothetical protein
MVIGVLDSKEELKESENDAPKLETPLREEPRLAPQTLPEASEVLENKREE
SKVEIT
>SP_0089 hypothetical protein
MFLADEKGSEHTAAELIDNLKEVIAKLKANA
>SP_1109 hypothetical protein
MFEEFPKLPDLKQVTFPNDKEKSQNSKEKLDDCFPTTPI
>SP_1756 conserved domain protein
MSEEDLFYKDVEGRMEELKQKPIKKEKETRGEKISKTFSLLLGLMILIGL
LFTLLGILR
>SP_0361 IS1167, transposase, truncation
MGGYFRNFENFKKRIFIALNIKKERTKFVLSQA
>SP_1347 hypothetical protein
MAERFWENLSIILAERNISWIELTRKMFAGEFHYPSELNRLYQKIRH
>SP_1436 hypothetical protein
MDNKKLKVKDLVSIGVFGVIYFAFMFGVGMMGLIPILFLIYPTVLAIVAG
TVVMLFMAKVQKPWALFIFGMISPLVMFAAGHTYVVVVLSLIVMIIAELI
RKIGNYNSFKYNMLSYAIFSTWICSSLMQMLLAKEKYMEWSLMTMGKDYV
DVLEKLITYPHMALVALGAFLGGILGAYIGKALLKKHFSNGLYCVGYFTP
CLILWCYLN
>SP_1304 hypothetical protein
MVFLKIDKKTNVFLGNIFLKDKILKYAIRKENV
>SP_2117 hypothetical protein
MGMFVGMFKARVESHEIILDVKALMPWISAICLLIGFISMFLTFNFLKKS
RKFHSLYQEEMDDDLNETYYVQMYRNLEFGTIAFNITGVAIPLAIFISLS
EVIILHTNPQTFFLSFLLFVVFLVAQKSLFKTIAIVRQFDLEFFATPKDV
LNYINSYDEGERQANLEQSFRILFQLHQYVLPALYIFLIIISFLTGEIQL
LAFLLVGAIHVYINVMQLPMVKRYFK
>SP_1425 hypothetical protein
MRITMSGNIQNSELFKFFNENSIKVVDFETKKETLKDIYLNRSK
>SP_0343 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0138 hypothetical protein
MHSFGIHPRYPYKMSEIVEMSKKISDLNDKKEFHKKNCELRFQDSVLDNI
NSVPGTTHLFSCLNIQTLVKIILYIFRYNSCMKIFNFRDKELIFFNKIVN
GLIQNIEENLEDDIERILKYLYICLFNEIFIIKNKVNFFDDVEFNQTLSE
FLDKL
>SP_1658 hypothetical protein
MGGILLLIGPFVLLGIAVNTAATTLNGGATAGAFSGVALLLNALKIANLV
LGIIAIVYYKGDKRVGAAPSVLMIVSGGVSLILFRS
>SP_0692 hypothetical protein
MIIMQDNFLFEEIEEISVPVNDFSAGLATGIGFGLAILALAGC
>SP_1038 hypothetical protein
MINKLSRYMESSGKTYQNHYVTILKWYEEDKDKLRQKGLNKKMNYDVGES
L
>SP_1760 conserved domain protein
MIITQRQSIHWGEVGGTYMYGTTVSYYPDKSVRLYNPLLPSGEILKTWFS
SVNYQAARTQPQLPLLKRKQEYQLSLVFDCQPENGVYTKITFFDRYGDIL
EKKVEKVKDFIFTYPEDSYTYRVSLLSAGFESLTFYHFSIKEIRSV
>SP_0861 hypothetical protein
MSKKDKKIEIQVADAKVNVGKDSFEGYTLTIGKKVIGEIAELDGQFAIIK
NGNVDSFYKKLEKAVEILIENYNLAK
>SP_0142 hypothetical protein
MLNLQFAETMELTEAELQDVRGGNLVNSMGGGGRSGISGWGVPGIYPGWG
NQGMSPNRGAFDWTIDLADGLFGRRRR
>SP_1779 hypothetical protein
MTMYQDLLRKIAEEKPNYNQEEIQWLFDHLGNPSPEIRNVLLNQGLHYLS
KEKDTRGFSSQYGWVHAFAHGADLLTEVVCHPDFPKNRVHEVFDILGQLF
KRMSIRFTDDEDWRLARVIYEPILQGKLEQEQVASWIKTVDFPIEEREDF
YKFSNFRSCLVEVYVQLDQRNSLQDDLKEAIQSFQY
>SP_1078 hypothetical protein
MAFGDNGNRKKTMFEKITLFIVIIMLVASLLGIFATAIGALSNL
>SP_0398 hypothetical protein
MSLIFVVIYKVKEAGQKVFKIGKRQPIGCSKILIGCPLLWKALLDFLLRL
SF
>SP_1307 hypothetical protein
MKKENEYVILTTASLGVMIGIVFAIFLDFPVEYGISLGLLNGIVLGSLIV
YKNNKN
>SP_2124 hypothetical protein
MINILYFLIILTIWQVFDEFSEKYDKMKKIRNQGEVYGADWKSL
>SP_0116 hypothetical protein
MSLKLLDCILDYQERFNGKTCQVSTNYKYLEIFKVNFCLTDLHHLFDLHK
ITRDYASQTKPAIQDGVFILEDFRNILCTMM
>SP_1570 hypothetical protein
MERSLFGLFTAFLCFICFLAGAQAFRKKRYGLSILLWLNAFTNLVNSIHA
FYMTLF
>SP_1657 hypothetical protein
MRIDSIASSVTTGVSKVIVSFELLDVATVFSSLLVEVFFELYVPHATRRE
RDNRATPIPINFFVLFMLKYLLFF
>SP_1146 hypothetical protein
MDFLNHSFDTKKVINTKINAVNSKNNVGKNFIDVYREMKEVPNNKIHQSK
VNVALIKK
>SP_1819 hypothetical protein
MMNLSSIYSSMPTTKSKQKGWTNTKKASNTQ
>SP_1729 IS1381, transposase OrfA/OrfB, truncation
MREYRTYEEIAADFGIHESNLIRRSQWVEVTLVQSGVTISRTPLSFEDTI
MIDVTEVKINRPKKQLANDSGKKKFHAMKAQAIVTSQGRIVSLDIAVNYS
HDMKLFKMSCRNIGQAGKILADSDYQGLMKIYPQAQTPRKSSKLKPLTAE
DKACNHALSKERSKVENIFAKVKTFKMFSTTYRNHRKRFGLRMNLIAGII
NHELGF
>SP_0834 hemolysin-related protein
MSAQITINHKKARYVRIELEGYNALSLAEVEVFCFIATNAETATQVSKPV
QPISQTPVKDKTLTIQHSGAYIARYSITWEEVPVDKDGNQVVRSHSWEGS
GRNQTAGFVLNLPIKENMRNLRVKIEKKTGLLWNRWQTIYENRPILAQPH
RKITHWGTTLNSKVSDDDVL
>SP_1141 hypothetical protein
MPEFIIVEGNNDLGEFFQIDGELFSDNELLENLKKWREWEVPVIIDDWCN
RILNEDETEILYFPTHEDKMNYIRVEKDLEPLYHTSNKIYATISKSEWLE
LLN
>SP_0854 hypothetical protein
MSAYLKEALKGAAKTQTKTNFGKPDFHIEKYKQDSLVTYRN
>SP_0059 hypothetical protein
MGIAIFLPLFSFFHRKFYHKSEKNSSLLNKKLE
>SP_0774 hypothetical protein
MLSVNTILEKFYKEHQVKPFISPERELDTWLLSPKPVPKRNMDLLVDDSL
AGDIILLWRIQFGTFTTET
>SP_1003 conserved hypothetical protein
MKINKKYLAGSVAVLALSVCSYELGRHQAGQVKKESNRVSYIDGDQAGQK
AENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAII
SEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLKDAAHADNIRTK
EEIKRQKQEHSHNHGGGSNDQAVVAARAQGRYTTDDGYIFNASDIIEDTG
DAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPA
QPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDP
AQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPD
SRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVF
EENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREF
YNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVDDIL
AFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITS
DEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSG
NTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHN
IKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHV
RKNKVDQDSKPDEDKEHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTD
TEETEEEAEDTTDEAEIPQVENSVINAKIADAEALLEKVTDPSIRQNAME
TLTGLKSSLLLGTKDNNTISAEVDSLLALLKESQPAPIQ
>SP_1914 hypothetical protein
MKKKAFGIVLLVLAAWILLQGNFGIPSLDGKIWPLLGIVFFAYKSIESIL
RRHLTSAVFTGLLALIIANYAYDLLPVTNHSLIWASILVVLGVGYLTHSS
KFWNEKKWWYNGKKTVVTDKEVAFGSGTFYKQDQDLVDDQVEVAFGDAKI
YYDNAEMLGDFATLNIEVAFGNATVYVPQHWRVDLKVETSFGAAKADAPV
APTSKTLIIRGDVAFGKLEIVYVK
>SP_1233 hypothetical protein
MSGLLYHTSVYAVKKEILVNTRKKTQFMTMTALLTAIAILIPIVMPFKIV
IPPASYTLGSHIAIFIAMFLSPLMAVFVILASSFGFLMAGYPMVIVFRAF
SHISFGALGALYLQKFPDTLDKPKSSWIFNFVLAVVHALAEVLACVVFYA
TSGTNVENMFYVLFVLVGFGTIIHSMVDYTLALAVYKVLRKRR
>SP_0126 hypothetical protein
MSRMSKFVIELSSFFLVHFYIRKRKGKVSIFLNYF
>SP_0188 hypothetical protein
MSRKKYENDEKSQKKLKIGRKSDVFYGIID
>SP_1986 hypothetical protein
MLMCEKIRIRRVSDYPSARGGLEDILIMENMTNHLLLVQIRVHGYLLDFA
SIEGQRQKHYRLKNLPQTVELTVDDVEEDVDLTLPENRSYQEADFFERMF
RENC
>SP_0906 hypothetical protein
MAVKFTKRDDLDKMFEEFAKLPDLKQVTFPDDKEKKVKAEKKN
>SP_1054 Tn5252, Orf 10 protein
MKRDVRDIRKQFRLTEAEEKQILALMRERGETNFSDFLRKSLLSSDLQKQ
METWFALWQSQKLEQISRDVHEVLILAQSERQVTQEHVSILLTCVQELIQ
EVANTIPLSKEFREKYMR
>SP_1696 hypothetical protein
MALILESSKRNEDSHMTVTIKVNYQTTFQKKEAKN
>SP_1253 hypothetical protein
MAAKLWEEGKMVYASSASMTKRLKLAMSKV
>SP_2139 hypothetical protein
MILWSFDFANDHAHAFFMDNVEWSHADSYFRSFVSDDVEERYTENVYLDS
LSVKQKFKFIFDFGDEWRFECQVLREIETEDEEAYLVRSVGTSPEQYPDY
DGFDYEEW
>SP_1060 hypothetical protein
MADMKNKYDVKRIIPDELSESLDIFLKNYSETGLSDYNTYLFYGFILKSY
KLPRENRYSIKLLVKELQNRGLKVTLIINIYYHALNCLALNDGLKIYGED
FLI
>SP_1787 hypothetical protein
MVLSGGKSAMPMTQKEMVKLLTAHGWIKTRGGKGSHIKMEKQGERPITIL
HGELNKYTERGIRKQAGL
>SP_0076 hypothetical protein
MIDVTIGQKSKTGAFNASYSICFSGENFSF
>SP_1353 hypothetical protein
MRNPYLPVFESDKRLIETDKLIWFPAKNSLAGFLF
>SP_1757 conserved hypothetical protein
MIELYDSYSQESRDLHESLGATGLSQLGVVIDADGFLPDGLLSPFTYYLG
YEDGKPLYFNQVPVSDFWEILGDNQSACIEDVTQERAVIHYADGMQARLV
KQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVL
LENHVTGDILLTLPGQSMRYFANKVEFITFFLQDLEIDTSQLIFNTLATP
FLVSFHHPDKSGSDVLVWQEPLYDAIPGNMQLILESDNVRTKKIIIPNKA
TYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEA
IAGALPDVTFRIAAVTEMSSKLLDMLCYPNVALYQNASPQKIQELYQLSD
IYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFESSEVA
ALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGG
>SP_0072 hypothetical protein
MQIAGIIFNSSTTNGDKVDFNPTENVDLRNNFASLVK
>SP_0810 hypothetical protein
MTVEEKKVFLARHLKAAEAGEFVTIDALFQAYKKELGRSYTRDAFYQLLK
CHGWRNIMPRPEHPKKADAQTIVASKNKISIQEEKKAL
>SP_0296 hypothetical protein
MSLEAISEALTHSDTVTTKTYVNTSNIVPLSAGQVAYQHLKNK
>SP_1788 hypothetical protein
MKNRIIDVFEVVNRLLVITVENPDFEDLRVNHTQ
>SP_1533 conserved domain protein
MLENGDLIFVRDGSDMGQAIQTSTGNYSHVAIYLDGMIYHASGQAGVVCQ
EPADFFESNHLYDLYVYPEMDIQSVKERACKHLGAPYNASFYPDAAGFYC
SQYIAEILPIFETIPMKFGDGEQEISDFWREYYIELGLPVPLNQAGTNPS
QLAASPLLQCKERNLHDSDF
>SP_1772 cell wall surface anchor family protein
MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKT
VEKTDALATNDTVVLGTISTSNSASSTSLSASESASTSASESASTSASTS
ASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDY
VASVTNVNLQSYAKRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINA
SLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPK
TNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQ
VLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSF
TFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASAS
ASTSASASASTSASASASTSASASASTSASVSASTSASASASTSASASAS
TSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTS
ASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSAS
ASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASAS
ASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTS
ASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS
ASTSASASASTSASASASTSASASASISASESASTSASASASTSASASAS
TSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASESASTSASASASTSASESASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASASASTSASASTSASESAS
TSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTS
ASASASTSASASASTSASASASISASESASTSASASASTSASVSASTSAS
ASASTSASESASTSASASASTSASESASTSASASASTSASASASISASES
ASTSASASASTSASASASTSASASASTSASESASTSTSASASTSASESAS
TSASASASTSASASASTSASASASTSASASASTSASASTSASESASTSAS
ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS
ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS
TSASVSASTSASESASTSASASASTSASASASTSASESASTSASASASTS
ASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS
ASTSASASASISASESASTSASASASTSASASASTSASVSASTSASASAS
TSASASASISASESASTSASASASTSASASASTSASASASTSASASASIS
ASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS
ASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASASAS
TSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTS
ASASASTSASASASASTSASASASTSASASASTSASASASISASESASTS
ASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSAS
ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS
TSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSAS
ASASTSASASASTSASASASISASESASTSASASASTSASASASTSASAS
ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS
TSASASASTSASESASTSASASASTSASESASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASTSASESASTSASAS
ASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASAS
TSASASASTSASASASISASESASTSASASASTSASVSASTSASASASTS
ASESASTSASASASTSASESASTSASASASTSASASASISASESASTSAS
ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS
ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS
ASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASASAS
TSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTS
ASASASTSASASASASTSASASASTSASASASTSASASASISASESASTS
ASASASASTSASASASTSASASASTSASASASISASESASTSASESASTS
TSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASES
ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASVSAS
TSASASASTSASASASTSASESASTSASASTSASESASTSASASASTSAS
ASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASAS
ASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASAS
TSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTS
ASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSAS
ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVSAS
TSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSAS
ASASTSASASASISASESASTSASASASTSASASASTSASASASTSASES
ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASAS
TSASESASTSASASASTSASESASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASAS
ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS
TSASASASISASESASTSASASASTSASVSASTSASASASTSASESASTS
ASASASTSASESASTSASASASTSASASASISASESASTSASASASTSAS
ASASTSASASASTSASESASTSTSASASTSASESASTSASASASTSASAS
ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ESASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASES
ASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASASTSASASASISAS
ESASTSASASASTSASASASTSASVSASTSASASASTSASASASISASES
ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASESASTSASASASTSASASASISASESASTSAS
ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS
ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVS
ASTSASESASTSASASASTSASASASTSASESASTSASASASTSASESAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASISASESASTSASASASTSASASASTSASVSASTSASASASTSASAS
ASISASESASTSASASASTSASASASTSASASASTSASASASISASESAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSVSNSANHSNSQVGNTSGSTGKSQKELPNTGTESSIG
SVLLGVLAAVTGIGLVAKRRKRDEEE
>SP_0009 hypothetical protein
MENLLDVIEQFLSLSDEKLEELADKNQLLRLQEEKERKNA
>SP_2141 glycosyl hydrolase-related protein
MVRFTGLSLKQTQAIEVLKGHISLPDVEVAVTQSDQASISIEGEEGHYQL
TYRKPHQLYRALSLLVTVLAEADKVEIEEQAAYEDLAYMVDCSRNAVLNV
ASAKQMIEILALMGYSTFELYMEDTYQIEGQPYFGYFRGAYSAEELQEIE
AYAQQFDVTFVPCIQTLAHLSAFVKWGVKEVQELRDVEDILLIGEEKVYD
LIDGMFATLSKLKTRKVNIGMDEAHLVGLGRYLILNGVVDRSLLMCQHLE
RVLDIADKYGFHCQMWSDMFFKLMSADGQYDRDVEIPEETRVYLDRLKDR
VTLVYWDYYQDSEEKYNRNFRNHHKISHDLAFAGGAWKWIGFTPHNHFSR
LVAIEANKACRANQIKEVIVTGWGDNGGETAQFSILPSLQIWAELSYRND
LDGLSAHFKTNTGLTVEDFMQIDLANLLPDLPGNLSGINPNRYVFYQDIL
CPILDQHMTPEQDKPHFAQAAETLANIKEKAGNYAYLFETQAQLNAILSS
KVDVGRRIRQAYQADDKESLQQIARQELPELRSQIEDFHALFSHQWLKEN
KVFGLDTVDIRMGGLLQRIKRAESRIEVYLAGQLDRIDELEVEILPFTDF
YADKDFAATTANQWHTIATASTIYTT
>SP_0853 hypothetical protein
MKENNMLVNWKLESDVNDYVKKQFENLGLKKLQDY
>SP_0094 hypothetical protein
MFSLVLILTIQEISRTLYNFQSNKLHSFSQAGILV
>SP_1418 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0374 hypothetical protein
MSKKRRNRHKKEGQEPQFDFDEAKELTVGQAIRKNEEVESGVLPEDSILD
KYVKQHRDEIEADKFATRQYKKEEFVETQSLDDLIQEMREAVEKSEASSE
EVPSSEDILLPLPLDDEEQGLDPLLLDDENPTEMTEEVEEEQNLSRLDQE
DSEKKSKKGFILTVLALVSVIICVSAYYVYRQVARSTKEIETSQSTTANQ
SDVDDFNTLYDAFYTDSNKTALKNSQFDKLSQLKTLLDKLEGSREHTLAK
SKYDSLATQIKAIQDVNAQFEKPAIVDGVLDTNAKAKSDAKFTDIKTGNT
ELDKVLDKAISLGKSQQTSTSSSSSSQTSSSSSSQASSNTTSEPKPSSSN
ETRSSRSEVNMGLSSAGVAVQRSASRVAYNQSAIDDSNNSAWDFADGVLE
QILATSRSRGYITGDQYILERVNIVNGNGYYNLYKPDGTYLFTLNCKTGY
FVGNGAGHADDLDY
>SP_0759 hypothetical protein
MRLEAVFQPLILGEKNEIFNTQYSQLDGERSRGKIPDFA
>SP_1028 hypothetical protein
MSTSSRVLVLKKFHGIMDGNRNVAVFFVGQ
>SP_0560 hypothetical protein
MEFLLVLCNLDYHLDKFKEPIQYLKHYLVKQQLDCKRDDHPKEFHNPNNR
FDKKNSKKTKKISFSLLWLNEPPSRIH
>SP_0997 hypothetical protein
MVASASASSTSTQAQEQVDKSELRALSQELDQRLKALATVSDPKIDATKA
VLLDAQKAPEDSALTE
>SP_0068 hypothetical protein
MDHTRLSSKDLWSAFPTSNSIMGENLAWNHDGFLKAIEQWRAEKADYVEK
KIVVQTTGNLVTMSR
>SP_0683 hypothetical protein
MKPLSYVIRITFLLFVVKEKIEFFRYFTILPL
>SP_0684 hypothetical protein
MEEYGHMEKVQEVVRLFQITIMVKNIIIHLS
>SP_1926 hypothetical protein
MKGVTNMTPEEMYLTERLDVQIAHFLKKSVQHRRRYKVLKITEIVAGFLI
AVFCAIPMPGDRYRLISVALSSLGLLCEGIINLYNAKENWISYQKTAQLL
EKEKFLYQCQTEKYAGKTKAFALFVKTCEGLISEEINQWESIQSKEVAAS
ADAPVKKE
>SP_1503 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0491 hypothetical protein
MCFVPYYKVNHCEEAFAWYQDVNLVYLVDGVKLPYSQADLGSHVFLFGSA
W
>SP_1677 hypothetical protein
MAQKGVSLIKAAFDTDNFLMRFSEKVLDIVTANLLFVVSCLPIVTIGVAK
ISLYETMFEVKKSRRVPVFKIYLRSFKQNLKLGLQLGLMELGIVFLTLSD
LYLFWGQTALPFQLLKAICLGILIFLTIVMLASYPIAARYDLSWKEILQK
GLMLASFNFPWFFLMLAILVLIVMVLYLSAFSLLLGGSVFLLFGFGLLVF
IQTGLMEKIFAKYQ
>SP_0124 hypothetical protein
MMKDLNNYREISNKELQEIKGGFGVGVGIALFMAGYTIGKDLRKKFGKSC
>SP_1924 hypothetical protein
MIKRGDVVALYLPFPTISSDLAVKNHMYICIDNSMTKNKELVKNQTFKPA
LLTRRLVKNFMIEEPDLARNPFTRPTLIDLDKVFMLDNTVIPTSYLARRR
RNVSEELYEEILDYLVQPRLISLNKSEFMQLNPGTY
>SP_1363 conserved domain protein
MFEHYSVADLFANLYKKRKANILALIALFALIAVPFTIKAVRNKNTVKDT
TSYSTYLIYKITPPKESDKTILNHQIGGYSDFYGKLIDGNLNGAYLFNDV
EPSELKKIASELDTTETTLKNSTNDYWWKKLTVYYMIDDAGVGVKILTSS
KDANNLLEKKIDGLIEKFKHAYANVKIEKLETINSKELNANGETALGLNV
KNLILRLVVIGVVCVILVVMGNVLVYLFNPTINRVGDFSQYQIDFVTEIT
TIANLADVLSYKNTGQELTIVSSNKAILDKLKQSQEALKGMHFVDLQDVS
SLLERDTVLLVEEYGVTRYKKFEQSLQILRNLNRSILGVATFKL
>SP_2102 hypothetical protein
MLYNNDKEEISMLKEVLTVAKVAKKSSLFLGGVAFGTLGLKILASKEAKK
GYSKALAKAYKLKDELDASVSVVKQHGDDVLQDAKYLYEQEKKEEQLDSL
IGE
>SP_1640 hypothetical protein
MLSTRFTGKLSKWGNYFGIVNTILSGAIDYILGNKAAIITYPVTFLIYTF
AIKKWEASQEGRPNQMSQKQVKLAAIIISIIAFLFAFVTNYIGYGGKMNL
LAYVTTIAFALSLIANALNALKLTTQWGFWLIYNFVQLTKAGIQGNFANI
GKYIFYILNAIGALFVWNDEEVR
>SP_1480 hypothetical protein
MAELDNGIQVIIEIQVHHQNFFINRLWPYLCSQVNQNLEKIRQREGDTHQ
SYKQIALVYAIAIVDSNYFSDDLAFHSFIVK
>SP_2104 hypothetical protein
MLKKYFSKYKWTDLFWILFVILTCLYIGNHDLFTLNHQEFSFRGSVWGLV
LALYHLLFIDKFVISNRK
>SP_0911 hypothetical protein
MKHKEHILIGLLYLLSPFIGQLLVEHTHFISTEFTGTAYVICWLSVVISI
HHFSKNVLSQQQK
>SP_1349 hypothetical protein
MTFLDDYHKKHNYPLFYESYLQNVMEFLESQDIKNGVDAFVDDHQNLVFV
LYGQGYRAEGKEGILTTQVTVKAYDEDKKPINFANLLDSLIY
>SP_0693 hypothetical protein
MRIVDKIKILPTPYEGHYHLYIPSSKKHVLVGKQEKNG
>SP_0198 hypothetical protein
MKLKRFTLSLASLASFSLLVACSQRAQQVQQPVAQQQVQQPAQQNTNTAN
AGGNQNQAAPVQNQPVAQPTDIDGTYTGQDDGDRITLVVTGTTGTWTELE
SDGDQKVKQVTLDSANQRMIIGDDVKIYTVNGNQIVVDDMDRDPSDQIVL
TK
>SP_1252 hypothetical protein
MVSMVDKDGKLIPEQGGARSTSPAPVVIRKGLDIDKIMMHLSDTFNSWDY
RQVEYY
>SP_1805 hypothetical protein
MSVEEKLNQAKGSIKEGVGKAIGDEKMEKEGAAEKVVSKVKEVAEDAKDA
VEGAVEGVKNMLSGDDK
>SP_0924 hypothetical protein
MILMTKNINLTNEELELIQGGADPYGKNPNGRYDWEIEPVLTLLVHGFCP
RGTYDSGYIGGGNHLCKGSAARF
>SP_0573 hypothetical protein
MYNFSQSCYNQPIGIKEVTLMAVFVSLDGIVVEVLDVFSSFNGDSEFFLC
IAF
>SP_1345 hypothetical protein
MTKQMKLMECDLVHSVQIVAVTGVLLVGKIVNSFKL
>SP_0297 hypothetical protein
MNNLDNMRFIMEIFASFSPEIELLLSYFSLFLMIYFNFLPLGKNNKDMN
>SP_0558 conserved hypothetical protein
MIRCKKEIRSLYMAEQDLAMQVLQQVVKLPVVKVDRSKFLVDKFSKELDP
KDIPTLLEQGPTTLLSQEILDRVANACIRDNVLLASGTSVLAGLPGGLAM
AITIPADVAQFYAFSLKLAQELGYIYGYEDLWASREELSEDAQNTLLLYL
GVMLGVNGTAALLRVGSITIAKQVMKIVPNKALTKTLWYPILKKVLKIFG
VNLTKGGLAKGMGKFIPILGGIISGGLTFATMKPMGESLQKELSKLVNYS
EVQYQEDVETIRKEAEIIKGE
>SP_1025 hypothetical protein
MPSHYTRTKTFMDIYIKKAIIHQFSPDDTELFLADKFLNITPKIEEYLRK
KIEHVYSDEAKTGIFEEENPFFNHITDDLLETSVTLANLWKEEFSISENL
KTNDLIFVQFSKEGVEHFAFLRIALRETLTHLGGEVDNPIKLTQNNLPGF
GTGADEALVVNLQSSKYHLIEKRIKYNGTFLNYFSDNLLAVAPKISPKKS
IKELEKTAQRIAKSFNTDDFQFQSKVKSAIFNNLEESNELSPEKLANDLF
DNNLTARLSFIDQVKEAVPEPVQFDEIDASRQLKKFENQKLSLSNGIELI
VPNNVYQDAESVEFIQNENGTYSILIKNIEDIQSK
>SP_0595 hypothetical protein
MQILCYFTITVVAKPNNSGEVHLDVSIEDNQGGSGYNFSSVSSSSQTAKY
EGTVYNNNSSLYITIDKTSDATALLKLKLNNVDNQPATEVPSSGITVKLN
AKDNAGNWTSASNKKEVTVKIVSAKPTYPDKILVKNPDNIKDTEKMPLLK
N
>SP_1832 hypothetical protein
MVKINKICSIQGSSVENEDIVGSQNQYFWIIGGATDLYNSKEEIGYSVSE
VVHILSESLSVNCKESKTLKQIFETALLEVKDEIGLNSYKLTEYSKMK
>SP_0108 hypothetical protein
MQRLEVYKNYQRLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKN
KKSYDRLAICYVPIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVEIY
YQAKER
>SP_0790 conserved domain protein
MKKMKYYEETSALLHEFSEENQKYFEELWESFNLAGFLYDEDYLREQIYL
MMLDFSEAERDGMSAEDYLGKNPKKIMKEILKGAPRSSIKESLLTPILVL
AVLRYYQLLSDFSKGPLLTVNLLTFLGQLLIFLIGFGLVATILRRSLVQD
SPKMKIGTYIVVGTIVLLVVLGYVGMASFIQEGAFYIPAPWDSLSVFTIS
LVIGIWNWKEAVFRPFVSMIIAHLVVGSLLRYYEWMGISNVFLTKVIPLA
VLFIGIFVLFRGFKKIKWSEV
>SP_2241 hypothetical protein
MSTHFVLQELKKTRIRRCLMKSLARLLIIHVFISIFLFFALTSGAISHTV
LLLLLLFLPALNKGLEKIQSKRIPVLNAALFFLLISFPQLLTNPVQWKFS
IFLVVTIISSLAYFYNFYQVVKEVDQKQLI
>SP_0316 hypothetical protein
MVYLRAISPNHQPAPKDAGFHVVQALLLLTRHLPL
>SP_0691 hypothetical protein
MMKKVTHMSDEVFLFEEIEEIVAPTDGEFLGEVLLGTGVVLLIGVACC
>SP_0311 hypothetical protein
MSLDIDKEKMTIMGIAFENRSVFKSVWYALSTNMIEGWRPTVSDVEKLRD
EALALGMT
>SP_1556 hypothetical protein
MNTDYLPLEKRCLSCLSWKVSAIPYFSEAAKVLCILIIEHAWNPV
>SP_1945 hypothetical protein
MKSMRILFLLALIQISLSSCFLWKECILSFKQSTAFFIGSMVFVSGICAG
VNYLYTRKQEVHSVLASKKSVKLFYSMLLLINLLGAVLVLSDNLFIKNTL
QQELVDFLLPSFFFLFGLDLLIFLPLKKYVRDFLAMLDRKKTVLVTILAT
LLFLRNPMTIVSLLIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVI
VTLIFF
>SP_0809 hypothetical protein
MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVL
>SP_1047 hypothetical protein
MNCRGHETRQRIVRDFEVQPKAHIKLLANQQKHSDAGATIEDEYYVFIAE
SKIDGKKEVIQCCMGAARDFLELINHKGLPLFNPLVGDSHVNNRQEYDNT
GSGNL
>SP_0543 hypothetical protein
MVSPRTNQLMFIGLADFMFVICLYRGISETEFYQQLIAYIGVFSACLSRF
CSCGA
>SP_0172 hypothetical protein
MIFIEYTTILLPLARDFVYVEGLGSYVVELFCSF
>SP_0167 hypothetical protein
MDKKLDILDKVKEYLGNKTTQILDNQYKEFLKLNDIRRAFGISEKVLNNS
FNFTSKEFNDLINNENYLFEYACRIREEWRKKCFNHSYRFLCSPIITDDF
LNTKTLRSSQIEYKYERYLSKSSIGDRAVDGFVSFNTLTANGMSAIKLCL
EILNSIFFKKKIDLLYSTGYYETRFLLNNLAKSGISCYEVSNCELDKDKF
YNVFMMEPNRADLTLQKTDFKIVEYFVKYKNNSIKVVILDISYQGSNFKL
VEFLEKFKFANVIIFVVRSLIKLDQMGLELTNGGIIEVFIPNHLRKLKNF
IEEEFNKFRNSHGANLSLYEYCLLDNSLTLKNDWNYSDLVMKFTSNFYAD
IKDLFMENSDIEIIHEEGVPFVFLDLIGEGKKEYEMFFQWLNFFYKQLGI
TLYARNSFGFRNLTVEYFGIIGTERYIFKICPGVYKGLSYYLMKFLLKSF
SNEYLKTTDEVNR
>SP_1174 conserved domain protein
MKINKKYLAGSVAVLALSVCSYELGRYQAGQDKKESNRVAYIDGDQAGQK
AENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAII
SEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLKDAAHADNIRTK
EEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTGD
AYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQ
PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPA
QITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDS
RPEEPSPQPTPEPSPSPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSR
YIPAKDLSAETAAGIDSKLAKQESLSHKLGTKKTDLPSSDREFYNKAYDL
LARIHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLVEDILAFLAPIR
HPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYV
TPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGA
EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFD
EGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQ
ADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEES
EEPQVETEKVEEKLREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQD
NNTIMAEAEKLLALLKESK
>SP_1810 hypothetical protein
MLNQDLFDSLEAQKIVDTLMKGQKDYVDERLEKRETMIVSNGYAWTRPNH
IDTAFASADLFEYKLQLAGQTWGYLEFETNTEKYGKVLLIIKGKKRLTNQ
FPLVQKNKSGYLFEYAQMNTLYLNQHSSYKNDENSHSFPIQMELVSDEMI
QEIEQATKNSNIEKFMILTYEADSENNIISVDVVMPDARTGQLHLIQDLS
EYIQSSSYHFEEAKYQDIPNFSELSETEDFEIIPRIEKQEGQK
>SP_0328 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0067 hypothetical protein
MALTLAGAVLTNDVFANDRLVATQTTDGKNENVLTSEVLKPSSGNVLVGI
KGEFVAPHQQSILDAINAICKEAADEGLVDKYVPIK
>SP_0031 hypothetical protein
MIAKKIFSNPEITCQFIRDMLDLPAKNVTILEGSDIHVLLSMPYSVQDFY
TSIDVLAELDNGTQVIIEIQVHHQNFFINHLWAYLCSQVNQNLEKIRQRE
GDTH
>SP_0110 hypothetical protein
MLLLLLCTTFFVFNVNYTREVVRIQEMGKTVDSLDLYLKDINEPAASVLR
FFEDVSKEYKVSIIKTDSGDEVVKSGVFDKDTFPYQEFGISSLDFTTDGE
GVYSNKEISNKLGTIPTFLKAKPIQLMTFQTYIKDTSRSLNGRYTITSTQ
EMDKDRIVQKWSDFFKIDQATLLEPTYKSAVEVINRDLLLSAIVFVLAIL
LLVLVTVYQPMMEMKRVGVQKLLGFQDRAVLADVVKGNLYLLLGGALVIN
LGVFFLLDYKPKDLFPMLWLSHFLLLQLYLFISWLTYLLIQKMTISSLLK
GFSSFKFGLIFNYVMKIGTTILLTALLIGVGRSLEQENKELAYQQQWVSQ
GNYLTLETFKLNDNLWQEELAGSGKSTDYFYRFYQDLVEKTQAGYVQSSS
LPVKNFVQSEQIQQYQLTDTVDVYYANRNFLKSKGFKLPNTGIKKVILMP
ASTKGEEDKNQLLGKLIAFHSMKYEEQQKRTIEEMDVEIAYYEGDWSFFP
YSDKRKENLSNPIISLVNDSDMMWDEKASLSTTGLNNPIKIENTVQHQKE
ITELVEKLSDGNYLKFSSIQAIQQEKVDSYRDAVRNFNLLFALFGLLSMM
ISYFLLVTTFLLKRRDIITKKFMGWKLVDRYRPLLVLLLLGYSFPLLVLI
FFAHAFLPLLLFAGFTCLDILFVLGLASRMEKRSLVELLKGGIL
>SP_0542 hypothetical protein
MCNNGLTFLLGPFAIGIGVTGAAGGAILGGVAYAAICWW
>SP_1303 hypothetical protein
MKSTKEEIQTIKTLLKDSRTAKYYKRLQIVL
>SP_0471 conserved hypothetical protein
MDWYDYMIQASKQSQFNASHWFRYLRKVIFEDYSYLTNQDVEKLLDSKEL
TRFQKISLKYAFQEHTPTHKYVISLNKPAKLTNVQKLMEKYKHG
>SP_1643 hypothetical protein
MILSLVSLSDIPLFLQGTLLILGHLIPSYRICQSLKRDFPQAYQEPISFW
SIL
>SP_1379 hypothetical protein
MLFYSSFKKWYTRLPAKLGSKCVRITVKNALPSWRSISFYQRKSNSKL
>SP_0653 hypothetical protein
MDKQYLHEKLDAMRQNFVESTHHERAMGVLDQAHMSKKMLKIKKKLVALE
MERCQRKIEHKDCSKIDQKIKEQKEIFESCCKKD
>SP_0183 hypothetical protein
MQDQHAIKNKKTIKATAGAVAFSLTFLSYIQ
>SP_0934 hypothetical protein
MTASFMVAEMRRHKKIVTNPYFFDRIEVVKKK
>SP_0329 hypothetical protein
MNDLGKYNELERSSKLTKRQFFENQMLDYTIIAHESFEIIRHSVYQTDDR
EVENALAFEVKNDETDKLILLLSEDIGVGEKLCLVDGTKMRGKCLVYDKI
NERMIRLQC
>SP_0407 hypothetical protein
MSSCLPCPFGAFTVSPEFRPFTVMENQTIPHFY
>SP_0563 hypothetical protein
MSYEQEFMKEFEAWVNTQIMINDMAHKESQKVYEEDQDERAKDAMIRYES
RLDAYQFLLGKFENFKVGKGFHDLPEGLFGERNY
>SP_1892 hypothetical protein
MIQIVVRSVKDYSENRKFDAETLEFRKTYSKMKYGRNNVILEFKLNYNNI
VEVSF
>SP_0495 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0327 hypothetical protein
MKEERRQFFERVDGNQCRDYILSHCSKDYEKVKSSLERLMDNRFMFDSPW
DMEPCSKIHQIQPMVWDQVFEDDPEWSYMLNRQEYLLQFMIGYLVEGDKD
YIQKCKFFLFDWIEQVREFSPQSLMTRTLDTGIRSFTWLKLLLLLLKFDL
LEEKELEKILVSLEKQIDFMKSYYRAKYTLSNWGILQTIPMLAIYHFFSD
KMDLEEAYHFASEELKQQIETQILGDGSQFEQSILYHVEVYKALLDLCLL
LPDLQDSYQELLEKMATYIQMMTGLDGRTLAFGDSDSTETTEILSLSAVV
LNQEDLLNGLDVKVDLLSLLFLGREKVKRLQEFEKRAWQPKSMIFEDSGH
VCIKDEHRYLFFKNGPLGSAHSHSDENSFCLQYQGQPIFIDAGRYSYREI
YERYLLKSAWSHSTCIVDGKAPERITGSWEYEYYPHSLFCHHKEREGVHY
IEGAYWSAEPDLPYLHRRKILMLVEDVWLLVDDIRCQGQHEVLTQFILDK
DVTYQDGKINQLRLWSEVDFDLEDTIISPKSPIIHPKSKNHPESLP
>SP_2047 conserved domain protein
MWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQVAHYQDYALNKEKLVAF
AMAKRTKDKVEQESGEQFFNLGQVSYQNKKTGLVTRVRTDKSQYEFLFPS
VKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS
>SP_1459 hypothetical protein
MREIMLLQLFSLYFESLILTTILVLIFLGIWIGLRAMSGVDKTARARQAH
LYDMIMIGVLVVPVLSFAVMSLILVFKA
>SP_2200 hypothetical protein
MIAQNKKNVKETAYRKEQEISPTPIFYTRSPSLFTL
>SP_1314 IS66 family element, Orf1
MELLLYTISKVKLLEDILMPQPIVPVEIPQSRRFDSKKRNDILLKIRIGK
LEVSFFQSLNLEMVEQLLDKVLLYDNSSI
>SP_0114 hypothetical protein
MQRLEVYKNYQHLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKN
KKSYDRLAICYVRIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVETH
YQPKKL
>SP_0451 hypothetical protein
MAKSNFEKVESVVGWVRDKKITGYRISKETNAREMSIIALAQGRAKVKNI
SFETALGLIDFYEKNYEKFED
>SP_0561 conserved domain protein
MEVVMDNIIDVSIPVAEVVDKHPEVLEILVELGFKPLANPLMRNTVGRKV
SLKQGSKLAGTPMDKIVRTLEANGYEVIGLD
>SP_0772 hypothetical protein
MDFFYNGIAITPNTYLSAWFVNFIAALPLNFLIVEPIARFILSSFQKPFT
GEEVEDFQDDDEIPTII
>SP_0639 hypothetical protein
MMFVIEEVKDENQKKAVVAEVLKDLPEWFGIPESTQAYIEGTTTLQVWTA
YQESDLTRFVSLSYSSEDCAEIDCLGVKKLIKVEKLGANCLLL
>SP_0596 hypothetical protein
MKEANKNHPAGAPTFAKGEGEHANDIVATYSDGTTYYVPLNDVTKYAR
>SP_0633 hypothetical protein
MLEVGLNFLISLFTFTFDILYPIVKVGHTDDYSHGAVKLSVSLVDKDAMK
KIFVTVIGYFEINIDENITDILYVNGTAILYLYLRSIVSIVSAIDSSEAM
LLPIINVLELLDKSQPFEEE
>SP_0487 hypothetical protein
MERIPLSILTFYIPKVPSYSIKEKQSKWLQSGYKSIKTDKAILSSSPIQT
ILSVVESHHISLRSRTSLKRRK
>SP_0598 hypothetical protein
MEVTNESNPKILGLCQQKATEKFYFISDFIGLIGRNFSLFDSDEVQQNSR
KQSL
>SP_0821 hypothetical protein
MKIFVNLDYKKILFVRQKGFYLDMQGQSQLVLD
>SP_1762 hypothetical protein
MYYFIPAWYGSERTWHADITPWYFSHFRLEFDDTFHQIRLFQEQDIDSRL
LVLAYQPHLRYFLYRHGVLEMDTYSVFDVMQDFHNLHTQVLSIRDIEWDD
DCEFIYSPFTIIVQKNGKKFAKVEHGVEGFISDIQYFEPNGQIHMHHIVD
DRGFVSSIIFFEDGQAAYQEYLNLKGEWQFRERLKEGGQVEVNPILGYRF
KMLTYQNMGDLVAEFFENYLQTYVKDQDIFMLPSHSHHDQLVLDRLPSTN
PKLLSLFIGRNPQDTFRDLDVTFEKSDLILVDREDSLRLLQELYPERMHQ
CYHLSSFDTRLRLGRSQTKKESIIYFQLDFEQGIDNQALLQVLSFVAENK
DTEVIFGAFAASQEQMNEVEGIVESFIQENIQSENLGKAIDYGDAENPLE
ENQHQDLRLQFVNLNDELDLIKTLEFVRLIVDLNRHPHLYTQIAGISAGI
PQINLVETVYVEHLKNGYLLADVTEFSKAAHYYTDRLKEWNESLIYSIDK
IKEHTGQQFLGKLEKWIEEVKNVKGT
>SP_2039 conserved hypothetical protein
MLNKIRDYLDFAGLQYRNPDKAGAEREKMLAFRHKGQEARKVFTELAKAF
QASHPEWQLQQTSQWMNQAQRLRPHFWVYLQRDGQVTEPMMALRLYGTST
DFGISLEVSFIERKKDEQTLGKQAKVLDIPTVKGIYYLTYSNGQSQRWEA
NEEKRRTLREKVRSQEVRKVLVKVDVPMTENSSEEEIVEGLLKSYSKILP
YYLATRK
>SP_1604 hypothetical protein
MAKEPWQEDIYDQEESRAERRHRNHGGADRMANRILTILASIFFVIVVVM
VIVLIYLSSGGSNRTAALKGFHDSDASVVQISSSSSSQPEQSSEPESTSS
SSEEAANPEGTIKVLAGEGEAAIAARAGISIAQLEALNPGHMATGSWFAN
PGDVIKIK
>SP_1842 hypothetical protein
MGDRYYRALNGSEPDKYLLEKVELYKTDAIELVDVNK
>SP_1092 hypothetical protein
MVSLQNRKDRAKMFELTYKDCYHVERTLKYEDHEALMLTLSGCVTLPDTL
YVTSLTFRGKKVYQGLVGDLYRFLSHADFLHQN
>SP_0699 hypothetical protein
MSYFNDYKHKWEGKNELIFLTAILQNSLIAIF
>SP_0070 hypothetical protein
MNDKKEVDGEHWPLLIYKDWILVASISDFSIVS
>SP_1031 hypothetical protein
MNDVAIILETKSEERDISKQIFIDELMKNIDII
>SP_0874 hypothetical protein
MTPFLAKECKGIPKIKIKNVDLTTFYQGMQKNAKE
>SP_2154 IS3-Spn1, hypothetical protein, truncation
MKLSYEDKVQIYELRKQGQSFKQLSKRFGVDVSGLKSSESLR
>SP_1165 hypothetical protein
MANTVKVFLKEISQNKKENPVKTRDFLVKNIFSQTF
>SP_0888 hypothetical protein
MVVKTRKQGNSITITIPSEFNIPSGVKYEAKLLPSGEIIFTPEELGQQVS
YVSDDAFDLNLDKIFDEYDDVFKALVEK
>SP_0077 hypothetical protein
MTYSHIYQVLFLPKLSIKRWHFLGLVLVDFALSYLSHFELFMVQWKHVIQ
II
>SP_1133 hypothetical protein
MKIVTFKPTKQIDDGFYLPGIDILFVSDKADAKDKEDVILFLSRNGLNKS
>SP_0879 hypothetical protein
MYHETELATRKGNFSFFKIFLKKQSIINHNQRRECMSNYRRTSKPKTEHI
KKGFTVFQKTVATIASILGLITASITIMNALDNNKNIKKEPTTSQTTTIV
KEIQKESPKENTSPTKETNTSQEKTQQEETPKSSVKEEKKEDQKTATQDS
STPASSKPATENEKQSNAPTSENKSNQ
>SP_1834 hypothetical protein
MSDIQVNIPGECLYDKVFVLSFIIYNISTNLNIVNGIFYIQAKKDSITFE
WKAKEQTRKLAIDSSKPCFEVVDIVK
>SP_1455 hypothetical protein
MKEILICDIIKRKGLHKKVGRTKIQRRQNRNQGGLAFRFMKGLVNFLGVI
ASGAIRDLWRYSC
>SP_0995 IS630-Spn1, transposase Orf1
MWYNLLMAYSIDFRKKVLSYCERIGSITEASHVFQISRNTIYGWLKLKEK
TGELNHQVKGTKPRKVDRDRLKNYLTDNPDAYLTEIASDFGCHPTTIHYA
LKAMGYTRKKEPHLL
>SP_2147 hypothetical protein
MLLQNLSSKECCIFLSFHTNISLYTDPSDNQMIIGHFLIICIFFEKFFKK
ILYPFYRSLCICIYVFYPMLCIIT
>SP_1802 hypothetical protein
MYMSKAKKICFIIFCILILTIFLPVLIDYHQVSDLGIHLLSWRQNSVVEF
YLARYVFWGTVVLSTLVLLSILVVMFYPKRYLEIQLETKNDTLKLKNSAI
EGFVRSLVSDHRLIKNPTVHVNLRKNKCFVHVEGKILPSDNIADRCQIIQ
NEITNGLKQFFGIERQVKLEVAVKNYQPKPQNKKTVSRVK
>SP_1585 hypothetical protein
MPIFVYHNIIKDNIIILFVFSHSVSLYKKAIHFEPLFLIYRLCYE
>SP_0899 conserved hypothetical protein
MKKSRKLATLGICSALFLGLAACQQQHATSEGTNQRQSSSAKVPWKASYT
NLNNQVSTEEVKSLLSAHLDPNSVDAFFNLVNDYNTIVGSTGLSGDFTSF
THTEYDVEKISHLWNQKKGDFVGTNCRINSYCLLKNSVTIPKLEKNDQLL
FLDNDAIDKGKVFDSQDKEEFDILFSRVPTESTTDVKVHAEKMEAFFSQF
QFNEKARMLSVVLHDNLDGEYLFVGHVGVLVPADDGFLFVEKLTFEEPYQ
AIKFASKEDCYKYLGTKYADYTGEGLAKPFIMDNDKWVKL
>SP_1611 hypothetical protein
MEQTLFELELLPEEDIIVTGLPKYCSFTCLITGR
>SP_1189 hypothetical protein
MVSGSVFADSALTTVDKANDIVLNVDGNKFYNVSVSEDIVNAGQILEDYF
YVDKFGNINLKGTPEELAKNIGISVQEASLMYGAVKELPNVYERGPVGFR
FNLGPQVRGMGGWAAGAFATGYAGWHLKQFAVNPVTSGFVAVISGAIGWA
VKTAVENYWTVAVATVEVPFVNLVYTIDLP
>SP_2043 hypothetical protein
MKLKKYSIQGVGKVIFPASFSDEIVQNLAMIGFFEKYGIIVVI
>SP_1352 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0115 hypothetical protein
MELVLPNNYVALEQEEMMYLDGGGVGRNWWNSRGSFATVLDVDLAIYSGG
ATIYSAYAIKKAISANRGAITRTLRSLIIKHVGSAAGHLVNTALNVALTV
TGFSLGGAIAYGADWADGSLDGYIFA
>SP_1333 hypothetical protein
MAERFWENLSIILAERNISWIELTRKMFAGEFHYPSELNRLYQKIRHYKM
EQRMPQSPWVERIVQILDLDYEDLFRR
>SP_1490 hypothetical protein
MLEIDLTVLNDLPYSCFILLYKPETVYIFSK
>SP_0429 hypothetical protein
MTGTNTFTVLSTEDLEQTSGGLAVWEDGYSRWLYYREFAPYMRQGALNSY
IDAWKYGFRAG
>SP_1641 conserved domain protein
MKKLYRIHFIAIAVIDLLLFAFFITRLETSFEWLLLSGLIFFLAQGLLLF
LLVVRLKHQFAEIYPQINKKIRFYYLGVLTIDFLFFVLLAFISSQRFSSL
MPIITACHSTFYYMTADYLRENYPDFYDKHISLWECL
>SP_1210 hypothetical protein
MANLSQGLSLYLMTHHYQAPKSVIDFGLWIAKAPSQERGRLAFLQMLAQT
LQGFR
>SP_2187 conserved domain protein
MYLGDLMEKAECGQFSILSFLLQESQTTVKAVMEETGFSKATLTKYVTLL
NDKALDSGLELAIHSEDENLRLSIGAATKGRDIRSLFLESAVKYQILVYL
LYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEH
QIHYFYFCLFRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLD
LVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLRLLRKVPSFFAGQH
IPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMK
KEELLGDYTEDHVTYELSQLCAQVYLYKGYILQDRYKYQLENRHPYLLME
HDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQHMRIGL
DLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKE
QTPVYYLKNDLDMEDLVAIRQLLFT
>SP_1621 putative transcription antiterminator BglG family protein
MSRKQEQMETLLLLLRDSKDYISAKVLGEKLNCSDKTVYRLVKGINKDCP
VEAFILSEKGRGFKLNPRSSLVDVDGNFTEAFDPEVRREKLLERLLLTAP
KPHSIYDLGEEFYVSESVVLKDRQILQESLAIYGLDLKMRQRKLFIDGDE
AQIRSAILNLLPMFNQLDLEQITQNKVQPLDGELAHFCLGLLITLERELG
VNIPYPYNINIFSHLYIFISRNRRSTSIHVVAPSKPTIVDEKIYSVCQKI
IQEIEQYFRMKVDAVEIDYLYQYVVSSRLQKPFSSGKLPFSQRVLDVTHY
YFSRMCMDNREIETTDPDFVDLASHISPLLRRLDNRVQIKNSLLSQILLT
YPNLVKELTTISKEVSLVFGFASLSLDEIGFLVLYFARFQEKRARPLKTV
VMCTSGVGTSELLRARLEKQFSELDIIDVVAYHQLDELINLYPDLDFIVT
TVALQEPASVPFVLVSVFLTEGDKQRLQAKIQEINYE
>SP_0612 hypothetical protein
MKTDLHFENHQKNGIMVGKDSAESIRTFRIRG
>SP_1728 hypothetical protein
MSVNLLTLLFIPVMVSSSGSEFQSGWQEHQLIAEKVSKTLDKTFDKDVRE
IPTSQFYQKFVDEMGRTYSGNLILQELITVNGAYKATYIGELSSN
>SP_1775 conserved domain protein
MRAQSFFLTFSFIRSKIKLALNKGVLNMIEITYIDASKNERTVTFESYED
FERSQQACLIGVADYYPVQKLTYKGHNLDYHGTYGDIFFYLMKQDLSQYN
>SP_1188 hypothetical protein
MNHSFKKITVFCFIVSCVLCLLDLMNFKNVATFLFFCLPVFVLIYKNK
>SP_1006 hypothetical protein
MICLAQKTFYFFLAICRRLLVAIYHVLLKQESYNPRLQGLTEIRNPDKTM
FVQDAIRFAQQHGFNML
>SP_1036 hypothetical protein
MFIISPDLFNIAVILYILFFIHDILLLILS
>SP_1140 hypothetical protein
MAGNENDNLTSKQIKFIDAMLTEPTIDKACQKAGVSRATGHKYLKVAAVK
KTLRIKQDEMMDKTTQMLYLASSNAVSVLNDIMMDSKVNPFIRTQAAKAI
LEQSYKTHEIFGVVRQIEELRLEIEEVSKGNQRVTRTQGVIK
>SP_0191 hypothetical protein
MKKIVLVSLAFLFVLVGCGQKKETGPATKTEKDTLQSALPVIENAEKNTV
VTKTLVLPKSDDGSQQTQTITYKDKTFLSLAIQQKRPVSDELKTYIDQHG
VEETQKALLEAEEKDKSIIEARKLAGFKLETKLLSATELQTTTSFDFQVL
DVKKASQLEHLKNIGLENLLKNEPSKYISDRLANGATEQ
>SP_0800 hypothetical protein
MPVRKLQSYEVDYQEELNQQLPHYQAYTPEAQSDANLKEILFFINIAVFC
ICIAIFSFIFLALKLSTALAFAAAIGFSLLVLKVQRSIIKRKRRR
>SP_1755 hypothetical protein
MIEILIVLAIILSLALIVLVTIQPRQNQLFSMDATSNIGKPSYWQSNTLV
KVLTLLVSLALFILLLTFMVITYK
>SP_1300 hypothetical protein
MSISPRFETLEQAIASKDLEKVREAFKKMNSTWTINESVVRDNSIAHYGR
VETAISFLPSSMEIEPTDESGT
>SP_0449 hypothetical protein
MNITNLFSIKTGCDETDRQLQKLFFQLDLQLGELTDQLRKLDSNFVPRSQ
FVDTLDLNDVEYKEILNYFIFHRNDSEESLVEWLYDWISTNRYELPKEFS
IRMAHKYHESVTEVFGDE
>SP_1962 hypothetical protein
MIVGSNPSTAFYNLIYQVSNEQKAQFEGLFLFSLE
>SP_2071 hypothetical protein
MEHLVVLSFIFSKFLECGILRKRLNSDKNWRNSTKKLEKNEGKRYDRKEE
ILEEEHVTY
>SP_1820 hypothetical protein
MAPRYQRPHTEVYSVCGLFSIRRLVYLLLGP
>SP_1048 hypothetical protein
MWLIILWNAKPDTPLFNFKDEVIKYKTYEPFESSIKRVNTTIKNGSKGKT
LTEMINGYRADNDIRDEICNFNILKNKIRDMKNQQGNTMESYF
>SP_0093 hypothetical protein
MVKIRTRENMDIYILVPKKPLPSPDQPEESSDSYFRS
>SP_1004 conserved hypothetical protein
MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK
SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF
SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTK
DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA
YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT
QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR
TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV
VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF
HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII
AEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS
HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPH
GDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV
VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKV
LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT
VPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDALVRVFDEFHGN
AYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE
VPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSE
TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP
SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP
EAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK
FTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA
>SP_0018 hypothetical protein
MIMLQKIYEQMANFYDSIEEEYGPTFGDNFDWEHVHFKFLIYYLVRYGIG
CRKDFIVYHYRVAYRLYLEKLVMNRGFISC
>SP_0635 hypothetical protein
MLRMIVSKQYLLFYLIHEKEVHTLRIINSRIDYLNQLDHLFRTCRKLFSS
QIISL
>SP_1492 cell wall surface anchor family protein
MVPKTATSTETKTITRIIHYVDKVTNQNVKEDVVQPVTLSRTKTENKVTG
VVTYGEWTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYD
RLSTPEKPIPQPNPEHPSVPTPNPELPNQETPTPDKPTPEPGTPKTETPV
NPDPEVPTYETGKREELPNTGTEANATLASAGIMTLLAGLGLGFFKKKED
EK
>SP_1579 hypothetical protein
MYAFDSSLSNNRLELSDNIILCYNEEKTEVFKCQKQVISF
>SP_0322 glucuronyl hydrolase
MIKKVTIEKIKSPERFLEVPLLTKEEVGQAIDKVIRQLELNLDYFKEDFP
TPATFDNVYPIMDNTEWTNGFWTGELWLAYEYSQQDAFKNIAHKNVLSFL
DRVNKRVELDHHDLGFLYTPSCMAEYKINGDGEAREATLKAADKLIERYQ
EKGGFIQAWGDLGKKEHYRLIIDCLLNIQLLFFAYQETGDQKYYDIAESH
FYASANNVIRDDASSFHTFYFDPETGQPFKGVTRQGYSDDSCWARGQSWG
VYGIPLTYRHLKDESCFDLFKGVTNYFLNRLPKDHVSYWDLIFNDGSDQS
RDSSATAIAVCGIHEMLKHLPEVDADKDIYKHAMHAMLRSLIEHYANDQF
TPGGTSLLHGVYSWHSGKGVDEGNIWGDYYYLEALIRFYKDWNLYW
>SP_1931 hypothetical protein
MMPANTKVIFQEMFADFQNYYVLIGGTATSIVLDSQGFKSRTTKDYDMVI
IDEVKNKEFYTTLNHFLELGEYQGSQKDEKAQLFRFTTTNPEFPSMIELF
SILPEYPLKKDGREIPLHFDQDASLSALLLDEDYYNILVHEKETIQGYSV
LSNCGLYSSKISSNHVSFHLQPQNSVLSSLQLAS
>SP_1494 hypothetical protein
MLNVDQDFMSISKSNKSGSDWKKTFTVRITNRLANDLNNVLKQVDKDTPN
TPTWLNSAASKAKDDDRVYKLLKTLIPGENYLSC
>SP_1120 hypothetical protein
MFLLYYLFREDSSKLLYFFNYFENLQQVHLLVQL
>SP_0682 hypothetical protein
MVYLVLGILLLLLYVFATPESIKGTVNIVAMVCILVALLILLVLSFLKIF
QLPTEIFLAIAMLILAYFSVRDITLMPVKKSKRR
>SP_0958 hypothetical protein
MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLE
SRIAANERAINENEEKLSQMSDTSSEEYQFAKNNLDVQKNLLTRKTEILT
LLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALY
PLNIKAHTLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNH
LDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLVGSLISGFGQL
DYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQ
KMPVLFLSLIGIVGLLFGIQTIQPLQRIAHLIPFTYLRSVEILSGRLPKQ
IDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRF
>SP_0190 hypothetical protein
MQKHVCVQLYKHQGWEHFPVLFYFNFKKKEERNEKNSSC
>SP_1947 hypothetical protein
MRKKRGIKKLVTFALLGVFMFSNTIPYQQFIQKNKQLEIRVQSQKKSNGL
DVGKAD
>SP_1224 conserved domain protein
MTEHLKSNTMVLPLKKGAQKMTTITLKVSEADKTFMKAMAKFEGVSLSEL
IRTKTLEALEDEYDARVADLAYQEYLEDLEKGVEPITWEEMMHDLGLKDE
>SP_1305 hypothetical protein
MKEFLENFCFFFTVKKNSAIMSYVIKYDNKRRTI
>SP_0475 hypothetical protein
MQINAILKKKKLLLEGNKMVIRVFDQQKNTYSSFALEELSYYMNRVFKTN
IELVEEKEADIFVGLVNKEDRKDHVLISLDKGKGRIESNTIVGLLIGIYR
MFHEFGVVYTRPGRRHDFVPELRFEDFLDKQLSIDETASYYHRGVCIEGA
DSFENILDFIDWLPKIGMNSFFIQFENPYSFLKRWYEHEFNPYLNKEQFS
NELVQELSDRLDKELQKRGLIHHRVGHGWTGEVLGYSSKFGWESGLSISE
EKKPYVAEINGKRELFNTAPILTSLDFSNPDVADKMVEIIKDYAKKRPDV
NYLHVWLSDARNNICECENCRQELVSDQYIRILNQLDRALTSEGLDTKIC
FLLYHELLWAPQKEKLDNPERFTMMFAPITRTFEMSYADVDFDNSIPTPK
PYMRNKIILPNSLEENLSYLFEWQKAFKGDSFVYDYPLGRAHYGDLGYMK
ISQTIYRDVSYLSNLHLNGYISCQELRAGFPHNFPNYVMGEMLWKKTRSY
EELIEEYFSALYGENWQSVVEYLEKLSIYSSCDYFNAIGSRQSDVLANHY
YIAYNLADNFLPIIEENISKLLNSQKDEWKQLSYHREYVVKMAKALYLQA
TGKTRQAQDEWRNVLNYIRGHELLFQSNLDVYRVIEVAKNYAGFHL
>SP_1642 hypothetical protein
MKKIIFIKTIQLLVIDGIMLAFLTFKRGLTWDWILIYSGWLIFFHPVLLT
YLSNQLCDHFS
>SP_0309 hypothetical protein
MDKKERQKIEQQRREMALTNTFFNRYLLLRYSIALFFFGNIYWLLSQFIS
PSPIIIFPIMLIVFSILATVEQFKLYGNRKEKLGITLMFVRIQMLISIGL
LVLTWTSWFKNLFPIFENNQVARLFVFVVLLLGLVLSLLDIRRIKKIYKR
TDKAYQQFVQLEKNSLSL
>SP_1528 hypothetical protein
MIFWKKQLTKQTKSCIIKQIFKAGGSGNEKV
>SP_0368 cell wall surface anchor family protein
MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLP
ADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKP
AEEKPKEDKPAAAKPETPKTVTPEWQTVANKEQQGTVTIREEKGVRYNQL
SSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLK
FKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITL
KSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVS
VKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRV
KEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDD
AHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDESKLLSSISF
LGNALVSVSSNQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYG
FVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQ
WEKAYKGIVFPEYTKELPSAKVVITEDANADKNVDWQDGAIAYRSIMNNP
QGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLK
GYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASET
YPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDL
KKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHG
GEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGA
ANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWEN
GTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTV
TLNGRVIQDGSAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSD
WAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTNPE
MSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKE
KVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSL
ALNYVKAYAHNTRRDNATVDDTSYFQNMYAFFTTGADVSNVTLTLSREAG
DQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVE
GVEDNRTHLSEKHNPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQ
TIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLE
MHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNA
NFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTKESMDAL
KEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTA
PAQAQEGLANAFDGNVSSLWHTSWNGGDVGKPATMVLKEPTEITGLRYVP
RGSGSNGNLRDVKLVVTDESGKEHTFTATDWPNNNKPKDIDFGKTIKAKK
IVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALVKAQKLT
DKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKPDAPT
VEKPEFKLRSLASEQGKTPDYKQEIARPETPEQILPATGESQSDTALILA
SVSLALSALFVVKTKKD
>SP_0258 hypothetical protein
MMELVLKTIIGPIVVGVVLRIVDKWLNKDK
>SP_0705 hypothetical protein
MFGCLWYIFSTFRGLCIMKQFVQFYKKDFLAVLVYFILLLSCVLSSTVYL
LRCRQYSIHPNVLEWILVLLQDMTTGVYCFPFTYILFFFYLMNNYFNRLE
CRIRLKSIKHFTSFSFKLAALSTGIWTATLFLLIFLIAFSNGFSFSLEIK
EVDFLREFYGISIANNASFFIGFFFSYIAYYFFLSLLTISSFSWFKKSNM
SLVFLFTFLFVESLFWIYQLDNGIIGLLPIFQYMVNSNPYALIYWLTLLS
IIIPLTVFSVHRNWRRV
>SP_2182 hypothetical protein
MFVFADDSLSANSKVVSGEAQFENGSSVRFGDTQVNILSDEVLEVVNPDG
SVDTIERRADGVYINGAFYMAYQKNEIDLNISFRSYDPNVWNYVNTIHGN
KQANTFANFMTGAGISYMIGRIGALLGGPWGAIIGGAYFGIQAYQSYLDS
QSPYPYYITSTYIHVAQRKWKFITEYYRNSNYTGYVKTVTTYVNF
>SP_1103 hypothetical protein
MGLMAMLLITIRRENQALVNNKDYPLEMKGTLEIL
>SP_0098 hypothetical protein
MRKWTKGFLIFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEK
LTFGKEVENLEITLHQHTLTITDSFDDQIHISYHPSLSAHHDLITNQNDR
TLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRLPKGRTLKGIN
ISANRGQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNIFDT
VLTDSQLESTENHFHAENIQVHGKVELTAKDYLRIILDQKESQRINWDIS
SNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTP
SRR
>SP_0503 hypothetical protein
MPDIVFEIEFFHSDSLIFFYTSDDKSNSQKSQEDFSKNK
>SP_0654 hypothetical protein
MNPVVKKIKEDVRGITDLPHPIFTGFDCLKYNQ
>SP_0414 hypothetical protein
MSVKLFHCQSISFLGAESKKKSTESVSADKKGSL
>SP_2025 hypothetical protein
MRGLVLSAISNQCFELNGTRFLVCSLILIGP
>SP_0125 hypothetical protein
MTNFDILDNQFLSLSENELSDIDGGLAPLVIFGVAVSWKAIAGGTALIGS
GLAAGYFLGGD
>SP_0472 hypothetical protein
MLIINRFFSLFVLDWYRTELWINTLVSYPVPKYVGRKMKQLLRVSVLEKI
LSIEFFK
>SP_1843 hypothetical protein
MVSITTYQNNQVSNNKFQTSLHFIEVVSKDL
>SP_0792 hypothetical protein
MKNVELKEKNMTFEEILPGLKAKRKYVRTGWGGAENYVQLFDTIEQNGLA
LEMTPYFLINVSGEGEGFSMWSPTVCDVLATDWVEVHD
>SP_1965 hypothetical protein
MRQVMKMNKKSSYVVKRLLLVIIVLILGTLALGIGLMVGYGILGKGQDPW
AILSPAKWQELIHKFTGN
>SP_1949 hypothetical protein
MKNDFVIGKSLKELSLEEMQLVYGGTDGADPRSTIICSATLSFIASYLGS
AQTRCGKDNKKK
>SP_0650 hypothetical protein
MILTLVVCIILTKLFRLKKLGRNFADLAFPVLVFEYYLITAKTFTHNFLP
RLGLALSILAIILVFFFLLKKRSFYYPKFIKFFWRAGFLLTLIMYIEMIV
ELFLMK
>SP_1265 hypothetical protein
MSVRKNKFFKSRDYKACLRKNSKTLTNKNKMVIIKNGLK
>SP_1493 hypothetical protein
MPMNIILIAKLLRENTNTKANALNNGWARSGSEEFKKFSHFVGVDKGIVR
TNVLTGKKLSDKIRKEVGSGDSKLGKGGYFSTGDVLLGKDVVSYTVQVFS
ENNERVGVNTQSHRVQYNLPILADFSVIQDTVEPSRTVVEKIIPKLNIPE
EEKGKITEEIKKKKKTSELAELISENVKVRYVDEQGRLLSLKNDTGIGEK
ESDGTYITNKKQLIGTSYNVTDKKLSSMTTTDGKYYTFKEADTNSASLTG
NIVSEGRTVTLVYRESEAPTTATVTANYYKEGRQEKLVESVIKADLAIGS
EYTTESKTIEGKTTTEDKEDRVITRKTTYTLVATPENAYQKTVQQLTITT
VRMLRKQWFPKQQPLLRRRL
>SP_0773 hypothetical protein
MKIYFLKKWENIDSKRILNHIRMGVFKIMFQW
>SP_0170 hypothetical protein
MGRRFCFICSLKKVTAVITDDSTEQNYEELEIYTQVIV
>SP_1703 conserved domain protein
MAHLKSFITRYSKVYIGLVLLIWLSFFFIPWDKPLLGIRIDIFIIQKILL
AFGILSILMALLSKKVSLFVFGLICCLSLWINLFITFAILPIFGN
>SP_0461 putative transcriptional regulator
MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQ
ETFEEELTFNLDTQQVQLIEHHSHQTNYYFHQLYNQSTILKILRFFLLQG
NQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHF
SILVALTWKRREFPLEFPESKEFEKLKNLFMYPILMEHCQTYLEPHANMT
FTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPL
YHISKAIVQEWMTEQKIEGVIDQHRLYLFSLYLTETIFSSLPAIPIFIIL
NNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAM
IAKKAHYLL
>SP_1476 hypothetical protein
MVGHFLDDFDGYDSYIWFEEGMVEYISRKYFLTEEEFQAEKICNQSLVEL
FQKKYSWHSLNDFGSSTYDKNYASIFYEYWRSFLTVDKLVENLGSVQAVL
DSYHLWANTEKTFPLLDWFVQQKLIEKEI
>SP_1835 hypothetical protein
MSEIVETRVFFCFQNIVQNFVPVFHVLVILLHDRIYSMLIL
>SP_1080 hypothetical protein
MGDKPISFRDADGNFVSAADVWNEKKLEELFNRLNPNRALRLARTKKENP
SQ
>SP_1454 hypothetical protein
MAHGDLLYHDGLFFSAKKEDGTYDFHENFEYVTPWLKQGD
>SP_2177 hypothetical protein
MMKQRKELYLFLGRTALYFLIFLGLLYFFSYLGQGQGSFIYNEF
>SP_0455 hypothetical protein
MKKWTFSRAFCRALKSSPNHQIEIRNSLDKTIDFSYSLSFFYLPLYHTFS
VYGSSL
>SP_2160 conserved hypothetical protein
MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSD
RNNPDSLLHLKKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELY
IEHIDIQSCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFTSFNKNI
LCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGRKGVQ
FKVVCHSKVTDGEVSVLGETIVIRNATEVFLYLKSMTDYWGNIDISSLQG
EFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCLSIPTNLLLENTKKY
SNYLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTININ
TQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNT
DGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEHFEMIK
EAFLFFEDYLFEVDGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQIL
RYFCDSCIGIAKQLGDNSDFISRVKELKKKLPKTKIGSNGQIQEWLEDYE
EVEPGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQ
EREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLNN
ATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPALPSAWSEG
EVKGFRVRGGYKVSFAWKNGDITFLKLEGGNKDQKVRVRIYGKNTDVQNI
ELVFNSEKIIELNF
>SP_1660 hypothetical protein
MIQYRDLKHRKNLLQFYKKYSENNILSLYFQRAGDGVSPVIPIDKIILSK
TQV
>SP_1818 hypothetical protein
MRMMFSSLVSDEPNFTAFSACFQQPQALCERKNCNFSIYYFLASSSLQSQ
LGPCLHDQRH
>SP_1093 hypothetical protein
MAFEKIIQLKNCRYDYTLSPSVKKFTLKDNTFFETKVGNYELTRLLEKVP
NSGEGFQLKIIINKELTGAKINITDKFGLRLVDIFKSEDHHIHQEKFYFL
MDSLVERGVFTKSER
>SP_0900 IS1381, transposase OrfA, truncation
MNYEASKQLTDARFKRLVGVQRTTFEEILAVLKTAYQLKHAKGGRKPKLS
LEDLLMATLQYVREYRTYEEIAADFGIHESNLIRRSQWV
>SP_1912 hypothetical protein
MNGMKAKKMWMAGLALLGIGSLALATKKVADDRKLMKTQEELTEIVRDHF
SDMGEIATLYVQVYESSLESLVGGVIFEDGRHYTFVYENEDLVYEEEVL
>SP_0511 hypothetical protein
MKIKNTFQLKELPPVTNFEKNGSRAKKRNKIGSIRIVKIRLA
>SP_2063 LysM domain protein
MKKRMLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYT
VQYGDTLSTIAEALGVDVTVLANLNKITNMDLIFPETVLTTTVNEAEEVT
EVEIQTPQADSSEEVTTATADLTTNQVTVDDQTVQVADLSQPIAEVTKTV
IASEEVAPSTGTSVPEEQTTETTRPVEEATPQETTPAEKQETQASPQAAS
AVEVTTTSSEAKEVASSNGATAAVSTYQPEETKIISTTYEAPAAPDYAGL
AVAKSENAGLQPQTAAFKEEIANLFGITSFSGYRPGDSGDHGKGLAIDFM
VPERSELGDKIAEYAIQNMASRGISYIIWKQRFYAPFDSKYGPANTWNPM
PDRGSVTENHYDHVHVSMNG
>SP_2105 hypothetical protein
MNKLMKFISVFLTSIVLIVSAIPSVSAVYASEQVSQIETNMELQPVTSLT
EEQINTLANEIQSFHPDVSQQWIKEVINRQLQGDYTIPPTYSPFRAVWQG
ITVNQMGALLDTAIALALGGTTAGLANLIKVKGKHAAKSAIRSAISRYLG
SWFVNDVALEFAMNLLSPGTYLAQLWDKNDAIPNNGRINF
>SP_1039 hypothetical protein
MDNDWNGLADLIANLIAKYAGALDLDNLPDPTPAKNQEMKNSFDMAKTQI
ETD
>SP_1789 hypothetical protein
MEMMELPSQEILIFTKQIRHWILSDQVISGKRKLFFREDTPKEILDLYEN
IKSKLDFAYQEVHSNNGLKKYEK
>SP_1495 hypothetical protein
MYQNEDLYKKGLNVELAHQQIKGFFEAEFKNRINGVLNTKIKNSTLNRVN
KKTIHQSNKNSMINLKQKQRKMLKNKAILC
>SP_1629 hypothetical protein
MLIYETVALVGMDSGISIKHILQKMKNKKLSQNP
>SP_0815 hypothetical protein
MLLCVLLLKDLLDFLSNRVFTQFSYLIGIEIPINFSE
>SP_0634 conserved domain protein
MFSSYFNPLYIMVSNLHQNDKINQLISDYKQNMKAFYITIEKFIRDDESL
KCYFIKVISSRSKVTSLDQIEADKTIQRKYSSELKKFIGFYNEIICEENS
FLHVRKRWSSWFR
>SP_0534 hypothetical protein
MVSIQYKEESMFFMLAFLIFTIQEVLMTIYDLSDPRSK
>SP_1705 hypothetical protein
MFKNFNNILLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVIFFKRELIS
IFSYNDYSEANLEIPKLLLNLSLFMVGWLSVILLESDLADHYHHLIRYQS
SSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMM
LYLLLSYLIALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVM
LIVSRLEEKFKKG
>SP_2004 hypothetical protein
MKRGIIYFFIGLSLLVWLVEMFTGWFDQTLLRQFIRGALGFGFMIFVVFL
MRMEWLKGEYHEYD
>SP_0087 hypothetical protein
MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVLFCLMGKSYKEIIELL
>SP_1138 hypothetical protein
MVSLPHLVYMVVESMAITSQRAISHPMKSVYFCLGL
>SP_1718 hypothetical protein
MNLFLIKMLSETISLFPEVLEEENFTRKKELLK
>SP_1150 hypothetical protein
MEFFITLYYTMISLVNLLKYLGHPFFSSSSAKS
>SP_0025 hypothetical protein
MKKSNILFIFILLLCIGLQYETIYYTDGSRSGAEYGLMGVSIFLALFYMI
PALYFLFRIGKNGNCQRRF
>SP_1339 hypothetical protein
MAAEVLNLQLVSVQVDETDEVDGMRFSTFSTNRCGNWSAFSWENC
>SP_1904 hypothetical protein
MKLKKLLKDDTKVFEKSTFKFVEGYKIYLTESKESGIKQMDNVIKYFEFI
ESKSIALYFQKRLNELID
>SP_1971 hypothetical protein
MLMDKTFLHRQLLKNLINVLYTYFQEKKRENLKKISVTQNTDFIDLLVIA
TKDT
>SP_1216 hypothetical protein
MKYRKRFLKPKVCDIIIKKKDLGVYYGFFGIYLKD
>SP_0196 hypothetical protein
MERPVNIFTPTPRNGEELERPVDVFSPYSHS
>SP_1668 hypothetical protein
MNLWDIFFTTQATEPPKFDLFWYVSLFTLLALTFYTAHRYREKKVYQRFF
QILQTVQLILLYGWYWVNHMPLSESLPFYHCRMAMFVVLLLPGQSKYKQY
FALLGTFGTLAAFVYPVPDAYPFPHITILSFIFGHLALLGNSLVYLLRQY
NARLLDVKGIFLMTFALNALIFVVNLVTGGDYGFLTKPPLVGDHGLVANY
LLVSIVLVATISLTKKILEFFLAQEAEKMIAKEA
>SP_1930 hypothetical protein
MRERSATGAQGLSKSIKKHLNDLTRLTASLLGDEKLSAITSSSAVKADMH
RFVIELEPVKSTILQNNDISLDQNEIFEILKNFLDG
>SP_0174 hypothetical protein
MVKHNFDVTDKTGKISSKHCFEITDKTDVV
>SP_1058 hypothetical protein
MRSLFRKIVALLVIGLILLGTAGGTQVHKMARGIDPGPANGIYR
>SP_1432 hypothetical protein
MSSKLLKAKEQVKSQDKDKKSILGQIKSFKTDDKSKSNKKDHSKGAER
>SP_0428 hypothetical protein
MRLERSHFLWKNVIQVEKLPVEEMGYKIDKKRNIL
>SP_0244 hypothetical protein
MSHSFKKSLQKEILHRSSIAAFVTSRAFSDTVSPI
>SP_1181 hypothetical protein
MVNFSCLSILSPRFLLLSYHKLAHHSNTKTTK
>SP_0621 hypothetical protein
MSVIEKLNHEKSLQALSNYGRMEAVELEKEIDYEIS
>SP_0714 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_1938 hypothetical protein
MKRTYRDCKNILLKIFLVWGVIVDRMQTLSVLFTVSK
>SP_1562 hypothetical protein
MFNVKWTRIGKKMPENGIIGRKCFDREDIRDEFSTIIQSAILDQFVCKSM
DDSYQSD
>SP_1694 hypothetical protein
MIFKAFKTKKQRKRQVELLLTVFFDSFLIDLFLHLFGIVPFKLDKILIVS
LIIFPIISTSIYAYEKLFEKVFDKD
>SP_2005 hypothetical protein
MFSGLDESFYHFPWELFAGFGMMSWLVREGLKLVGDVKKELEE
>SP_1932 hypothetical protein
MTLKDDDDPRIEEESEALENMILQYLGEDDAS
>SP_0133 hypothetical protein
MVTMQYSCGKININIPDGYGDIKDIVFSAHIIVRYNNGHCGGIDPHIIGL
CKKQIRRMSLYPILIIVSRDSKVIDDYKNLDIAYVDCTQCSNNFETALHV
KNILKLLKIQLIHCHGYSTNYFLYMLKKLDKNGFGKVKTVITCHGWVEYN
LKKKFLTYFDFWTYSMGDAFICVSETMKKKIGEYNKK
>SP_1708 hypothetical protein
MSVMEHLFKFLLLAPYFYFDNWIEKANRNSKFFPIFYYFYWFYIPFYSLF
SLAWTVVSVLFFNTVLRNVTDIKLWGIWFLFILLAIGMNWLTYSCFKEMF
RLRQELGKSKGGRH
>SP_0367 hypothetical protein
MFPIAFSAIICYIKSIIIFYLSELSIALSEEGVFFKKKM
>SP_1175 conserved domain protein
MILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGI
NAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYKLKD
EDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHRE
GGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHY
IPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNP
GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPA
QITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDS
RPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEEKGI
SRYVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFYDKAY
NLLTEAHKALFENKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAP
ITHPERLGKPNSQIEYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDA
YVTPHMGHSHWIGKDSLSDKEKVAAQAYTKEKGILPPSPDADVKANPTGD
SAAAIYNRVKGEKRIPLVRLPYMVEHTVEVKNGNLIIPHKDHYHNIKFAW
FDDHTYKAPNGYTLEDLFATIKYYVEHPDERPHSNDGWGNASEHVLGKKD
HSEDPNKNFKADEEPVEETPAEPEVPQVETEKVEAQLKEAEVLLAKVTDS
SLKANATETLAGLRNNLTLQIMDNNSIMAEAEKLLALLKGSNPSSVSKEK
IN
>SP_1439 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_1158 hypothetical protein
MLEIDLIVLIVLSYFYFISLYNCYCFLFHDFNLTVQI
>SP_1925 hypothetical protein
MSQSSYLSPLLWLKKEADKEKMSATQCQIFFFYYQMFELLFARESDMKDL
CLGTKGFYFSQLEKNLLSGVSRFLKNLEGKVTLKANQEVSARKALFLALT
TSQSDWQELAPVFDFYQTIGRLENPSLLSSQDRQHLMWIYQSALEKDYIV
KVIGDKHFVLKRQDATKLTARQTQTLEILSQSEDLVNPVYVTLGEKGVLL
LD
>SP_1380 hypothetical protein
MLVEKRRLRMRLKVIKKLVDINILYSSQEANLANLRKKQAKNPGKKVNVS
ARVLSSYIFSSLLMIICFSNIAIHFPFEEIPIYFSSMIAILLVIAFSTSL
TAFYNVFYESKDLVSYRPYAFKESEIIIAKGLSVLLPALTGIVPILAYFL
VLYIRLAPSLWLGLPLMLLSLTLLFVSVALVMVVAVHFLAQTRVFRKYQS
IFSNVMIGIGVLIPLIFIFFLQSTFGSIVDKVRDIPFLLYPLHIFYKIAV
EPFSTEALVGLLAWIGLTLFLLYLTKKKVLPRFYDVILLNSEEKVKKERR
SKERISTTKKGFFRMVLRYHLTLLGQGTGVITVLFTSAFLPYLMMIGLIS
KIRDSQIVPDIHPPYWLPLFFIALFIAVVNNNITSLHSIALSLERENVDF
LKSLPFDFARYVKVKFWIIYAVQSFLPVLTLLGLSLYLGLPIISMIYLLV
VWIIASVILSCHHYFKDVKNLSTNWSSITDLVNRSNGIVAIVLLFIYSAI
LMALVIGSIFLVQSLSPILAISLGVGALIVLLALAIFGYHYYLSRILAEI
EKR
>SP_0548 hypothetical protein
MPLFPGMGVSVSSLSPKSISFLRIKFPYYSTSFWPIRQPL
>SP_1217 hypothetical protein
MLEIDLTVLIDLSYSYFILLYFYENLKKSVKSWISLICINKIVYMVD
>SP_0833 hypothetical protein
MYMTGHSLGCYLAQIAAVEAYQKYPDFYNHVLRKVTTFSAPKVITSRTVW
NAKNGFWDVGLESRKLAVSGKIKHYVVDNDNVVTPLIHNNRDIVTFTGNS
RFKHRSRGYFESPMNDIPNFNIGKQATLDKHGYRDPKLDKVRFFKKQALP
RSSSQPSAEPMENIASGKQVTQSSTAFGGDARRAVDGKVDGNYGHNSVTH
TNFQSKPWWQVDLAKEETIRQINIYNRTDTAQDRLANFDVILLDSSGKEI
E
>SP_1628 hypothetical protein
MIIMRRFYSHLPYYLVILFFYWPLYELFLLVVSDPLTLKGLYINNLLFFT
PLVILIVSLLYSYRFRFSL
>SP_1785 hypothetical protein
MATYGFLDILEEELDKNFPFDFEISWDKRNHAVEVSFLLEAQNAAGVEMV
DEDGEVSSDDILFEEAVLFYNPAKSTVNEEDYLTVIPYLPKKGFSREFLA
YFALFLKDTAEVGLDVLMDFLEDPEAEEFVMEWNQEVFEEGKIGLEKGEF
YPYPRY
>SP_1595 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_1679 hypothetical protein
MVSSKYAARASFLDGQGITVDEMAWIIRGIVNALIGRYIKLGTYAAKYGI
SMARSILSRVAATAAARVGLLTKISGWILRVAVNVADVYGNFANNIAAAW
DAYDKIPNNGRINF
>SP_2118 hypothetical protein
MMNKYKVIYYVVVIALLVSVFLLIGMDLSWFNPYQSDQFVWVYFALIPVI
EWIEKKSKNLASEKGE
>SP_1844 hypothetical protein
MKFYSYDYVLSQIGQQNGIMVGFGIVLLAVTVFFAFKAYHNKKGSEFREL
VMISDLALFSSAFGQHHDLSKQSSF
>SP_1091 hypothetical protein
MQIFYIKTKIFLSFFLFLLIFSQCFYKIEE
>SP_1350 conserved domain protein
MKRITANQYQTSERYYKLPKLLFESERYKNMKLEVKVVYSVLKDRLELSL
SKGWIDEDGAIYLIYSNSNLMALLGCSKSKLLSM
>SP_2159 fucolectin-related protein
MNKEKIKRKLITILFVCIGMLCFGLLAGVKADNRVQMRTTINNESPLLLS
PLYGNDNGNGLWWGNTLKGAWEAIPEDVKPYAAIELHPAKVCKPTSCIPR
DTKELREWYVKMLEEAQSLNIPVFLVIMSAGERNTVPPEWLDEQFQKYSV
LKGVLNIENYWIYNNQLAPHSAKYLEVCAKYGAHFIWHDHEKWFWETIMN
DPTFFEASQKYHKNLVLATKNTPIRDDAGTDSIVSGFWLSGLCDNWGSST
DTWKWWEKHYTNTFETGRARDMRSYASEPESMIAMEMMNVYTGGGTVYNF
ECAAYTFMTNDVPTPAFTKGIIPFFRHAIQNPAPSKEEVVNRTKAVFWNG
EGRISSLNGFYQGLYSNDETMPLYNNGRYHILPVIHEKIDKEKISSIFPN
AKILTKNSEELSSKVNYLNSLYPKLYEGDGYAQRVGNSWYIYNSNANINK
NQQVMLPMYTNNTKSLSLDLTPHTYAVVKENPNNLHILLNNYRTDKTAMW
ALSGNFDASKSWKKEELELANWISKNYSINPVDNDFRTTTLTLKGHTGHK
PQINISGDKNHYTYTENWDENTHVYTITVNHNGMVEMSINTEGTGPVSFP
TPDKFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTR
ADNPSWWEVDLKKMDKVGLVKIYNRTDAETQRLSNFDVILYDNNRNEVAK
KHVNNLSGESVSLDFKEKGARYIKVKLLTSGVPLSLAEVEVFRESDGKQS
EEDIDKITEDKVVSTNKVATQSSTNYEGVAALAVDGNKDGDYGHHSVTHT
KADSNAWWQVDLGEEFTVSKVDIYNRTDAEPQRLSNFDVIFLSSSGEEVF
RRHFDKVVDGLLSLKVPSVGAKLVKIELKSAAIPLSLAEVEVYGSKRTPK
KLSNIALTKETRQSSTDYNGFSRLAVDGNKNGDYGHHSVTHTKEDSPSWW
EIDLAQTEELEKLIIYNRTDAEIQRLSNFDIIIYDSNDYEVFTQHIDSLE
SNNLSIDLKGLKGKKVRISLRSAGIPLSLAEVEVYTYK
>SP_1630 hypothetical protein
MKVEPRCDVLSRMSHFFIRILIMELQELVERSWAIRQAYHELEVKHHDSK
WTVEEDLLALSNDIGNFQRLVMTKQGRYYDETPYTLEQKLSENIWWLLEL
SQRLDIDILTEMENFLSDKEKQLNVRTWK
>SP_0518 hypothetical protein
MSVLDEEYLKNTRKVYNDFCNQADNYRTSKDFIDNIPIEYLARYRELY
>SP_0685 hypothetical protein
MSRWDGHSDKGEAPAGKPPMHGFGLNGENK
>SP_1488 hypothetical protein
MKSIKEEIQTIKTLLKDSRTAKYHKRLQIVLFRLMGKSYKEIIELL
>SP_0996 IS630-Spn1, transposase Orf2, truncation
MVAGLTNGELIAPMTYEETMTSDFFEAWFQKFLLPTLTTPSVIIVK
>SP_1037 putative type II restriction endonuclease
MKIHCLKLKNKELNKEVAFYLTSIIRQALKNTEYKDQISSTVLPDIKIKL
PIDSRGTPDWNYMERYRDR
>SP_0444 hypothetical protein
MNVIFVFIKKIPISFTKKKKELISQSFINLIPH
>SP_2115 hypothetical protein
MKRVILLAVIQAVVLFFIIGALAYAFKGDFFYNYLAVVFAPIAGVLRFGT
AYITEIVLPRKAAEIAEKRKAGKNSK
>SP_1723 hypothetical protein
MQAFYVKKEEKLSKDYHKINTGNQSNFYENVKDNEIKYFLTKVSNLFFKE
FLMKQSKTINLKLAH
>SP_0733 hypothetical protein
MTVEEEKAFLARHLKATEAGEFVTIDALFQAYKKELGRSYTRDAFYQLLK
RHGWRNITPRPEHPKKADAQTIVASKNKISIQEGKKAF
>SP_1678 hypothetical protein
MFMVGTYESFTDKKENSLQRKMMEEQTWHKKE
>SP_1794 hypothetical protein
MEELMKNNERLGIKLSRDSVLGLREVRRLYLGSSDIPVSDGYVIEVAYNQ
ISHEIDIIDWVELNKSKIKISEISESVDIDATSLRTTLTLDTLVYEGMRD
IQLKLRELTKGRVFFSFVVKLVLFASILKKKDLLEKFQEKC
>SP_1170 hypothetical protein
MSEVDFNEAVNYEFTSDTCQLANSIYQSLFKFFDKKNFSGDLIFTWKSPS
LVKEGDYIGRRDSQVDNLRVIGNIFPNYLTNRKYSLNMNRNGCMGDFPHD
FFDIYLDHVAKYAYEQKVNNIKEYYPLKRAILHQENALYFRFFSNFDDFL
EKNYLKTIWQVSKETPFSEMDFNMFKNISEKIIFERGSKMLNDLKSNYKK
>SP_1145 hypothetical protein
MTAEIGILNKNGVVLASDSAVTLSDGKNSKVFNSARKLFTLSKEHSVGIM
IYGNASFMEIPWEVILNEFKQAIGTDLLDNTAQYVEKLIEFLISFKHLQV
EDLLRNYIVRSTRSILDSIAYEAQETADLRVSNGETITLDDFNKILLNAI
TNFSLEISKVEAESNFEFFEAELELIKSIVDDVFKSFPHTDDEVEVIAKS
LYKAIFIGYDSTNITGLVIAGYGTDEIFPSIRQIELYGIFSKRLIWKVIN
ESVINHHKTCHIIPFAQSEMVETIMNGIDPNLNVYIAEQVSSVMEKNGLG
DEIENIFENISSIQQKYYINPIIDLIGMQPLNEMASTAKTFIELTSFKRK
IVNTLETVGGPVDVLAISKGEGPIWIDRKYYFDIDKNLDYRMRKES
>SP_0587 hypothetical protein
MLGSLSFFLRNETKIQVIKIPSRQATQRKIGNRTTERLLLGRFIFFHRVV
GKFSFQDTSLERFNTKVSKAFTLIAIGRLAKCFTNSLVK
>SP_1351 hypothetical protein
MIWVKATQLVDEMEQVGLIRFDEFGNVGILVLEGQ
>SP_0038 putative acyl carrier protein
MTEKEIFDRIVTIIQERQGEDFVVTESLSLKDDLDADSVDLMEFILTLED
EFSIEISDEEIDQLQNVGDVVKIIQGK
>SP_1977 hypothetical protein
MKEKQDFCLFFRKQSVFQFHFQSIIRLFFKIEAI
>SP_0559 hypothetical protein
MNPIKAFAKIYGNYFLTVQGVKVMKTIKKADHVVVGLGKLFIADKLMDTA
RWLIKPEERE
>SP_0512 hypothetical protein
MFLIFIIDWVLLIVFAIQISYIFWRLSQKWKELSNK
>SP_0670 hypothetical protein
MAKGFAKGLVTGVAGTVAAVAGAVYAFKKKVIEPEEQKAAFIEENRKKAA
RRRVSR
>SP_0040 hypothetical protein
MSCYKLDKFSITVSINFFHPNLELASKRLELAFLFQNHLYF
>SP_0277 hypothetical protein
MKLRIFAEDKPAKKVFEYQLELADRTILLSTALLSGAIALAGIFSALKEK
>SP_1635 hypothetical protein
MYLVIGIVLAFIVSFWKDNRSLWNPVLFLLSLISSYFYLSYLFYKNGYEN
VQLAFYIFAFVLLPFLLFLSGIFLIYNGVILLKREGRSKPHYLSMLFDFY
>SP_1049 hypothetical protein
MWSVATTKKVIILDFKLFYKHFVFVDKGNFDNKNPKR
>SP_1065 hypothetical protein
MLLFYVGRLNGQKYHCISFVSWHIKYRSILLMILKSLLSWRAIQICSTNE
ESTILGYSVVKGQIKNLTHFQTL
>SP_0763 hypothetical protein
MTIARFSRATESWNGLMVEISPCCLDFIQKSVTQALFKLLIH
>SP_1948 conserved domain protein
MTNFNSNEKFCGKSLKSLSADEMSLIYGASDGAEPRWTPTPIILKSAAAS
SKVCISAAVSGIGGLVSYNNDCLG
>SP_0866 hypothetical protein
MNKQQFIIMALFTAAETYFFNEAWMTGRYIMAAFWAILLFRNFRVSYVMG
KIVDVIDQHFNRKD
>SP_0203 hypothetical protein
MGKYQLDDKGRAQVTRYHEKHSKGGAGKKERLLSFREQFLNKNKKK
>SP_1385 hypothetical protein
MKMSTFFKKSFWPTFMIVNQTAILFHLKDGLDRQYLTTESIYWVIGTFIF
GNILVAVFSNMKIWDKKKNGSKKKYILKK
>SP_0520 hypothetical protein
MKLSNLLLFAGAAAGSYLVTKNRQTITDEVLNTTDRVQAIKDDVDIIQNS
LQIINQQKELIKEYQEDLTYKFKVLEKDIQTRLAVIKEMQGTEDK
>SP_0564 hypothetical protein
MSKKLNRKKQLRNGLRRAGAFSSTVTKVVDETKKVVKRAEQSASAAGKAV
SKKVEQAVEATKEQAQKVANSVEDFAANLGGLPLDRAKTFYDEGIKSASD
FKNWTEKELLALKGIGPATIKKLKENGIKFK
>SP_1581 hypothetical protein
MHKTLENIGEFEEDNLYYSSIDKSRNKDQFSHIFGLYNICSG
>SP_0134 hypothetical protein
MESIIKNKKIVAINNGINVSNSDLDVVGVQDFKKEFCIPNNKKSFVMLEG
WIQKKGRIDSLNLQKNYF
>SP_1209 hypothetical protein
MHAILRYFIRRLFYHIFYKIYSLISKKHQSLPSDVRQF
>SP_0679 hypothetical protein
MGILSIILGLLFPIVGLILGIIGLVLAISYQKESQLDYKIEKILNILGIV
ISVVNWIVAIALIFR
>SP_0293 hypothetical protein
MIFNPICCMIREKKGDRDMAFTNTHMRSASFGIVTSLPDDIIDSFWYIID
HFLKNVFELEEELEFQLLNNQGKITFHFSSQHLPTAIDFDFNHPFDPRYP
PRVLVLDMDGRETILLPEENDLF
>SP_2179 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_0706 hypothetical protein
MEMGKLSSHMWRLNQIIYTKYFWGYVLFWILICLGLWYWLEGNDRLVIEI
LKGPNLSQNSFLVLSIWLLHWFIIHTFFLAVVYRRRASDFFMEVIRFSSI
KLWIRYQIWTCFLYGLILIMVKVLVIQFMLQLPNWDIGVLFIVDSLNACV
LVLFCFMLYALGANVQMNFACVSFFLLMIVFGGLFVGNRTNYLFYILNRG
NGDIGRDLFLQLLFLVFLFQSIFYFTRQKRRFIE
>SP_1952 hypothetical protein
MLKILNNSVIYLCLIPLLFLLLIIFPNDSLIYYFRLILISLISITLHELG
HFLVGRCLSYKLEMLATPFFFYFRKKIYFKFPVLLAFGYCQMSNRNITNE
KNSDRNLVFYFFGGGGANLIVAILALLGFVPYASEFFILNIILFLVTVCL
PIDGTDGNAIREIVLYSKDSKTYQRFFANSLYNNPYITIDDFAKLTSKEK
SFFSKFEKCLINLYFEVKGEGSAFTIANHEVFDSNIQENIIHEYYKLLHK
SSDWIIPQNLSLEEVVFTLSNYIYSKNNKYLEKIKYLKKLVDFRQEEIID
FILNKEEVL
>SP_1831 hypothetical protein
MYYSSFNILYYRLPIFAKLSEKKLEYMMLGDCVMLVNEMEITDHRVDNLF
EKGKNEIKDSIGTNSVLNKKIILQKIRKLSNQPSGYWIGSLDERFLDHAI
INQIDVTSEQIVLMSDGFYEFYQNNQNKTFEELIKMRFNSSAIDPIYGKK
DDASILVIDV
>SP_0470 hypothetical protein
MDKMKPVFQALNKELIQENLTLTIICVGGYVLEYHGLRATQDVDAFMAL
>SP_0269 hypothetical protein
MFSAIFIQKLISNITNKKEKYLDKQGKILLQ
>SP_2140 hypothetical protein
MKQKNYLVSNATVRQTYDKIAESECFLRAIGGIMLHLKLVKQEIEAEKPA
SVEAWIISVKFKKGCYRHI
>SP_0902 hypothetical protein
MKKYFIGGLGSNAYHSKDFLQELDSQVYFLNPYEKHLRDETELKSWFKNE
IVEEESICLIGHSLGGDLARYFASEFEEVKKLILLDGGYLDLDKILPLDT
ELEETKNYIKSQIVSDLDVLTSKEKSEAKHWSENMEKAVRQSYHWNVEYN
RYELAINYENIEAILRLRRKIQAFKREVGDTLFISPRYPNEATWREEALK
ELPDYFDTIFLENFGHELYTQAPKEIASLMNEWLAYFL
>SP_0080 hypothetical protein
MMPKMANRDRSPLSSSKSSSKAGLYGKIERSDKRE
>SP_1929 hypothetical protein
MKREQDKLIRTVKSISNNVLEAEVYYSSFNLL
>SP_1059 hypothetical protein
MSSQMKAFLNELQGNMEVVGEFFDNPEKVIRKFGIVGREKEALLIRDLND
MDDYALSIQNSVASPSGAHSSTCHFVRQDLQSA
>SP_1443 IS66 family element, Orf1
MEFLLYTISKVKLLEDILMPQPIVPVEIPQSRRFDSKKRNDILLKIRIGK
LEVSFFQSLNLEMVEQLLDKVLLYDNSSI
>SP_1793 hypothetical protein
MKQKQPIVSRTKQHTFEELIQDQKLERLAKLSPDLVGRYGFTASCASSFA
NLIKEAYGGKNLNVVYASRMLALWNIACSCYHKADGYSLADALFSDKKIC
LDSYYYHKNTSNTITSDVIKDVYDNYNNYMVLTREATPEYIYVVQTEMPK
DSDLYFYIREVLGLSFSTMHYAFLVKVLAGALARKYKPYRN
>SP_2089 transposase, IS1380-Spn1 related, truncation
MTNLSSVDSEELFQFYRERGNAENFIKERKAGFFGDKTDSSTMIKNEVRM
MMGCLAYNLYLFLKQLAGDEVKSLTIKRFRRLFLHIAGKYVSTARRHILK
FSSLYAYSKQFQALFDTICQINLILPVPYRARGQGKTCLTE
>SP_0747 hypothetical protein
MTFSNMINPLSKKSLPIIPKKANPVEFAFYHH
>SP_1254 hypothetical protein
MTSDKAGLERKFAAKERKRNKPGVVLCGSMDELCALAQLNPEIEAFY
>SP_1836 hypothetical protein
MMSSKDSKCYTKLLTSYFKPRDSHKKGKSYKNSLSEEKGASTTGVNYYQL
KTTVFTLN
>SP_0052 hypothetical protein
MKERLDDNPIVQGNWKTLGFQFRHETPLVAAAVPHKLR
>SP_0465 hypothetical protein
MFLPFLSASLYLQTHHFIAFPNRQSYLLRETRKSHFFLIHHPF
>SP_1211 hypothetical protein
MKQFKILSDKYLESITGSDGNLGPGFGVIIP
>SP_0840 hypothetical protein
MKILKRYILELCFILSFALPFIKGTNADNGRCFVETYYGFTFLMEHAIVT
AVFICSFLIAFLLKNDGRNGLLRVVIAF
>SP_1172 hypothetical protein
MLYAVPFYFNRSETIVFLNCESIKTDCDGAILALETFKN
>SP_1465 hypothetical protein
MKTTFSYPKWAEIPNIDLYLDQVLLYVNQVCAPISPNKDKGLTASMVNNY
VKNGYLTKPDKKKYQRQQIARLIAITTLKSVFSIQEIAQTLNTLQTQASS
DQLYDAFVDYMNQGIDPANPIIQTSCQTVKLYHQTLDLIDHTQEEVIQ
>SP_0497 hypothetical protein
MIIMVLEFDKKLQNIKRFLENNKINLLGGENEESIF
>SP_1656 hypothetical protein
MENNDSFTKLKESTQKLFDAQKKRLNNEDRIETTKNNVIAKHCQTVLSFL
VLTSFFVKNCVK
>SP_1052 putative phosphoesterase
MRGFNNKIKSVYQELTNSKEKFGSFHKTLIHLHTPVSYDYKLFSNWTATK
YRKITEDELYDIFFENKKIKVDKTIFFSNFDKVVFSSSKEYISFLMLAEA
IIKNGIEIVVVTDHNTTKGIKKLQMAVSIIMKNYPIYDIHPHILHGVEIS
AADKLHIVCIYDYEQESWVNQWLSENIISEKDGSYQHSLTIMKDFNNQKI
VNYIAHFNSYDILKKGSHLSGAYKRKIFSKENTRFWSLILTRKNLRNNLI
FSIKKLVY
>SP_2120 hypothetical protein
MIDKVVRNLLLTFFFCKMTKIIIFLTTILVKKKKICYNEFKLRNRKQKGV
IMWVLGFILFMIFFYSNNSKKIKKLENKIKRLERKEKGNAEMSRLLQEMI
GKEPIITGVYIGPDNWEVVDVDEEWVKLRRVDNTGKEKFKLQRIEDIQTV
EFDGE
>SP_0816 hypothetical protein
MKLINTTNSHSQLVKSQLESTDATLVEVYSAGNTDVIFTQAPLHYEILIS
NKHRAIREPEIETIQEFFLKRKIDKASVDEANIKTLYSEKLIGISIPIK
>SP_0956 hypothetical protein
MTVTKSYKYDWNTVWEYSTNYHDHQYAWIPSWSRYDSYSEYKVGGGWNYA
RYEVINYYSGGY
>SP_1183 hypothetical protein
MKSLGKWYVSTGKEWICHSDDELEEFKNLFLNFINPEEWDTISFDSDFMP
FQQS
>SP_0223 hypothetical protein
MKKINFPRNFSFFVKFPIFTWSFDALDILNYQDAFVTPGI
>SP_1401 hypothetical protein
MDKISCLSLPPLSPIIVISCLFFPFLMQGKLLSLDDNPKARKVSQTSLLS
QTSLQLKGKDSIL
>SP_1958 hypothetical protein
MKKFDNYIIEKPCDSNSDKLQKILIIESLVDDILQFSLRINNSVGEIFLL
QPF
>SP_0504 hypothetical protein
MNPDRAEEYFCRGCQGENPEDIEFYDEQLQAEKVEDLNIRLEVKN
>SP_0734 hypothetical protein
MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVL
>SP_1864 conserved hypothetical protein
MVEPNLESLIKDLYNHARHDLSEDLVAALLETTKKLPTTNEQLQAVRLSG
LVNRELLLNPKHPAPELLNLARFVKREEAKYRGTATSALMYEELFKML
>SP_1337 IS1380-Spn1, transposase
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL
VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST
HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE
ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL
FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT
GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH
>SP_1043 hypothetical protein
MVVYIRQSKLPSEVSINKYNAQVGAYLQGEEVILYQSFSEIKELTSEDIV
VDYIMETRALLKMMGLNVPVHDYPIELKEFYGRKIYAGILGEIVNIPDNW
GKFIKPKAGSKVFTGRVVNGTHDLIGIGLPFDYPIWISEVVEFIAEWRCF
VLDGRVLDVRPYTGDYHAQFDASVIDEAISCWKDAPIAYGLDIGVTRDGR
TLVVEVNDGYALGNYGLSPLKSINFHRARWKEMVKPYFEKNEIFKIQQDV
IF
>SP_1761 hypothetical protein
MSKELNILQIGLANWENHYDIPENMSWYYFYPNSSKALREIIEKEDINRF
HAVLIEDGQYSRDLFSYVKYFEPYTLFYNQNLQINDREVVDFLKKRCAQA
IDFLSPQQLINDLSKSLFGGGYGDKLFPPTIQVNPNFTGAISYQGLDYVS
LEGEFGQDFAQLAYWAYNIMVQKTLPIELWLEYEKEGNCDFRLVIRKMWS
GSVDDFFEEVIVSEKDLEQALFMDSRDGDYFLSISVEARGRGTIKLGNLH
QRWSRKQFGKFVLGGNILHDSKRDEINYFFHPGDFKPPLTVYFAGYRPAE
GFEGYFMMKTLGCPFILFSDPRLEGGAFYLGTDELEGKVKDTITHYLDYL
GFDHKDLILSGLSMGTFPALYYGASFEPHAIIVGKPLANLGTIASRGRLD
APGVSNLAFDCLIHHTGGTSSQDMTELDQRFWKIFKQANFSKTTFGLSYM
KDEEMDPQAYEQLVSYLCNTGAKILSKGTAGRHNDDTDTNISWFLHFYRM
VLETGFGREKR
>SP_0490 hypothetical protein
MKVNIADLHLTQLYLSEKKLQDIQMLYQSAETIQVDPISILAFGDCLLIT
DGHHRAYQALLAGRDTISAEWDRDGGDELYHLYAQACEERKIYSVFDLED
RILAQDGYEAKWYNWCDGFNQAATLLLKR
>SP_1335 hypothetical protein
MMPNYPCEFEVTFLDDYHKKHNYPLFYESYLQNVMEFLESQDIKNGVNAF
VDDNQNLVFVLY
>SP_1452 hypothetical protein
MGDEENAKWTERGVLMDVTIKKKDGKTTIGTAKAHPTWVNRTPKGTFSPE
GYPLYHYQTYILEDFIEDGSHRDQLDEATKERIDTAYKEMNEHVGLKWY
>SP_0448 hypothetical protein
MYNVITPSVIVLADQNKADWSYDENAVINIYDDANFEDGRLHMNFEQFFK
LAQIAREEGLEIHSPFERAGATKSARYIAKWILRNKKH
>SP_0270 hypothetical protein
MLQKYTQMISVTKCIITKNKKTQENVDAYN
>SP_0389 hypothetical protein
MKKTVYKKLGISIIASTLLASQLSTVSALSVISSTGEEYEVSETLEKGPE
SNDSSLSEISPTYGSYYQKQSEVLSVMMI
>SP_0582 hypothetical protein
MIKTFLSALSVILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATA
TMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLILLLFGTDKTLSSASN
KTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQR
IKLLFHQVGLSMANYDIFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTT
RFGTIVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNIIHNQLASKYPK
AIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATID
HILLPKNHYYVKDLDIVSFQNSDHRCIFTEITF
>SP_1108 hypothetical protein
MTAFQQLPSSVLQTGLFFSPSVSLDSQTVSAKEYLFPYQKERLKPFRQVK
GRQANI
>SP_1921 hypothetical protein
MILSEKITWDFFNQENSSHRNLIILQRTIFI
>SP_0528 blpC, peptide pheromone BlpC
MDKKQNLTSFQELTTTELNQITGGGLWEDLLYNINRYAHYIT
>SP_0531 blpI, bacteriocin BlpI
MNTKMMEQFSVMDNEELEIVSGGRGNLGSAIGGCIGAVLLAAATGPITGG
AATLICVGSGIMSSL
>SP_0532 blpJ, bacteriocin BlpJ
MNTKMLSQLEVMDTEMLAKVEGGYSSTDCQNALITGVTTGIITGGTGAGL
ATLGVAGLAGAFVGAHIGAIGGGLTCLGGMVGDKLGLSW
>SP_0533 blpK, bacteriocin BlpK
MDTKMMSQFSVMDTEMLACVEGGGCNWGDFAKAGVGGGAARGLQLGIKTG
TWQGAATGAAGGAILGGVAYAATCWW
>SP_0539 blpM, bacteriocin BlpM
MDTKIMEQFHEMDITMLSSIEGGKNNWQTNVLEGGGAAFGGWGLGTAICA
ASGVGAPFMGACGYIGAKFGVDLWAGVTGATGGF
>SP_0540 blpN, BlpN protein
MNTYCNINETMLSEVYGGNSGGAAVVAALGCAAGGVKYGRLLGPWGAAIG
GIGGAVVCGYLAYTATS
>SP_0541 blpO, bacteriocin BlpO
MDTKMMSQFAVMDNEMLACVEGGDIDWGRKISCAAGVAYGAIDGCATTV
>SP_0524 blpT, BlpT protein, fusion
MTDTDPIKRAHTLITDLNKAYQACKQASADDVRFQEQLNSILGFLAKAET
VDNRFLIELEKFYQTSSLLMGLSALDPDAPTRAAWRAYDRFHFDQVKTKL
ILNENQRAN
>SP_0041 blpU, bacteriocin BlpU
MNTKTMSQFEIMDTEMLACVEGGGCNWGDFAKAGVGGGAARGLQLGIKTR
TWQGAATGAVGGAILGGVAYAATCWW
>SP_0544 blpX, immunity protein BlpX
MEVFNMKYRLFFVIFLSSVLDILLGTFLQISIVSIGWLVLYSGLFEAGVF
LLANKGVAVKIKEVDIRNRFKFIFGKTLWFQILLLIFLIIKLYLGLDARL
ILFYGHIFIVFNALMYLLSSSQVSLKKNKLSS
>SP_0546 blpZ, BlpZ protein, fusion
MYKHLFFLDSKTLDRLTPYILVLASDTIAFNVFVLTFVSAVVFNFLNSML
ALMAIFIGAGYVVGFWLLILNENQRAN
>SP_0123 ccs1, competence-induced protein Ccs1
MGWNFRVVNLLSLHSQTKNPSISRASKHLIQSKIQTKRHRSRGGEKASRE
SQRSFSTRTWFVVVPVWQIEDSPHKRKQQKQ
>SP_0200 ccs4, competence-induced protein Ccs4
MSVYGRVEEVHKENREPLEYQIEQESHHRESSRLPLVKILLWSTLVTGIT
LGVPLLLDLMSAQEVQDFYAGWALHQTGKIYSDYYGSQGLLYYLLTYVSQ
GGFFFAIFEWLALVAGGFFLFRSADTLTEQGDQAGHLVTIFYMLVTGLAF
GGGYATLLALPFLFAAFSLVAAYLSNPSHDKGFVRIGLALAGGFFFAPLS
SLLFIAVVSLGLLVFNLGHRRFAHGFYQFLAVALGFSLVFYPTAYYSAAT
GSFGDAISGIRYPIDSIRFDFTSKILENMFFYGLLSLGLGFVVCIFLGLF
QSKPFKLYVISVPASLVVILGLILLFFSQEPLHASYLMVVFPVFLLLLVT
NIKSQQRGRSARRSRRETPVSLWSRFFKGNLYLLVFGFVYLLSVPFLMKF
VLYPVPYQERNRLADLVKEETNTEDAISCMG
>SP_2237 comC2, competence stimulating peptide 2
MKNTVKLEQFVALKEKDLQKIKGGEMRISRIILDFLFLRKK
>SP_1449 cppA, cppA protein
MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLE
KLVLEEAPSMRTRKVEGRKKLARLIVKVENPLEIEGILSKTDSIHRLYKG
QNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEI
SMELHLPTDIESFLESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLV
NELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEV
>SP_0352 cps4G, capsular polysaccharide biosynthesis protein Cps4G
MRVLFILSDNIYLTPYFNFYKELLKKLSISYDVIYWDKNINEIITKQNYY
RISFSGKGKLSKILGYVKFRKEIKKKLKENDYDMILPLHSIVSFILVDFL
LFSFKNRYIYDIRDYSYEKFLVYRLVQKQLVKNSLMNIVSSDGYKFFLPM
GEYFTTHNLPNMIELNEVKQLKNNSTFPIQLSYIGLIRFQEQNKKIIDFF
ANDSRFQLNFIGTNAGELREFCQEKNISNVNLVDTFQPKDTMSFYKNTDV
VLNLYGNHTPLLDYALSNKLYFAALLYKPILVCEDTYMEKVSIENGFGFV
LPMKDESEKDCLALYIQNLDRKQLIKNCDNFMDRISLEKQKTEIELEKRI
LSLRKKND
>SP_1850 dpnC, type II restriction endonuclease DpnI
MELHFNLELVETYKSNSQKARILTEDWVYRQSYCPNCGNNPLNHFENNRP
VADFYCNHCSEEFELKSKKGNFSSTINDGAYATMMKRVQADNNPNFFFLT
YTKNFEVNNFLVLPKQFVTPKSIIQRKPLAPTARRAGWIGCNIDLSQVPS
KGRIFLVQDGQVRDPEKVTKEFKQGLFLRKSSLSSRGWTIEILNCIDKIE
GSEFTLEDMYRFESDLKNIFVKNNHIKEKIRQQLQILRDKEIIEFKGRGK
YRKL
>SP_1849 dpnD, DpnD protein
MKTKQLVASEEVYDFLKVIWPDYETESRYDNLSLIVCTLSDPDCVRWLSE
NMKFGDEKQLALMKEKYGWEVGDKLPEWLHSSYHRLLLIGELLESNLKLK
KYTVEITETLSRLVSIEAENPDEAERLVREKYKSCEIVLDADDFQDYDTS
IYE
>SP_1964 endA, DNA-entry nuclease
MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPS
QALAESVLTDAVKSQIKGSLEWNGSGAFIVNGNKTNLDAKVSSKPYADNK
TKTVGKETVPTVANALLSKATRQYKNRKETGNGSTSWTPPGWHQVKNLKG
SYTHAVDRGHLLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTG
QNYYESKVRKALDQNKRVRYRVTLYYASNEDLVPSASQIEAKSSDGELEF
NVLVPNVQKGLQLDYRTGEVTVTQ
>SP_1573 lytC, lysozyme
MKTKIGLASICLLGLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKT
QTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYKAWFYINSDGRYS
QNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNK
WYYFKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTY
ANQEWQKVGGKWYYFKKWGYMARNEWQGNYYLTGSGAMATDEVIMDGTRY
IFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVIDISEHN
GRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYL
YTYAENETDAESDAKQTIELIKKYNMNLSYPIYYDVENWEYVNKSKRAPS
DTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLLQTRLKHPDILKHVNWVA
AYTNALEWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWY
>SP_0602 pep27, pep27 protein
MRKEFHNVLSSGQLLADKRPARDYNRK