Gene list
Applied filters:
COG category: Unclassified
Organism: Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334
Gene type: CDS
Number of genes found: 428
# Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334 >SP_0164 hypothetical protein MRSLFKKIVAVLVIGLILLGIARTPQVHKMARGIDPGPANFI >SP_1548 hypothetical protein MKRIVFELIFIATTWYIFLPPLNLTSWEFLFFLCGHLLVVAILFGFGKGI NLVKTVHVRHGKAEAALNLEGFKINRLGKILLASIGGILLLAALVSLVTS SMFQAKNYANVVTVTEKDFTEFPKSDTSKVPILDRSTAEKIGDRYLGSLT DKVSQYVAADTYTQLTIDGKPYRVTPLEYADPIKWFNNQAKGIGEYIKVD MVTGNADLVDLKTPIKYSDSEYFNRDVKRHLRLKYPTKIFKTPSFEVDDE GNPFYVATVYQKQFGLAVPRPASVIILDATNGETKEYSLSDVPEWVDRIY PAEETIEQINYNGKYKDGFLNAMISKKNVTQTTNGYNYLSIGNDIYLYTG VTSANADESNLGFILENMRTGEITKYSLASATEESARESAEGAVQEKSYK ATFPILINLNDKPLYIMGLKDNAGLVKEYALVDAVEYQNVIVATTVEEML SKYANKNDLEIDNATTESINGVVADLKSAVIKGDTVYFFKVDGKIYKVKA SVSDDLPYLENGKTFEGQVGKDNYLKTFKLR >SP_0039 IS1381, transposase OrfB, truncation MRNIGQAGKILADSGYQGLMKIYPQAQTPRKSSKLKPLTVEDKACNHALS KEISKVENIFAKVKTFKMFSTTYRNHRKRFGLRMNLIAGIINHELGF >SP_0513 hypothetical protein MNSRVEFRIFTIVDLDKEEEHLHEMHLKGWRYRTSRFGLFYFDQCQPDDV IYRIYDSRFLKKV >SP_1292 SAP domain protein MNFFSKLFNLKQNNHNRDTNSDCNNFYLNELECGLTPGQLILIDWTQKTG RNYNFPRYFKYSLQIDPESTHNQLYKLGYFTKNKTLSYLTVVELKTILSK HNLATSGKKAELITRIINNVNIDNLDIPFEFKLTKEAQNLIIEHSDYIKA YYDKDITMEDYCKEKNNISFKATFGDIKWSLLNKQAHRNTVSGDFGCLSN TRKAQGRHLEQEGNIKHALIYYIESLIITISGLENNFSATDYPVYYPDSI PDYSLKHIQTLMESLSDDDYDFAFDEALFRFSILNANHFLSKEDIDYLRV NLPRSTAEEINNYLKKYECYSPLNNLELDDFE >SP_1132 hypothetical protein MEILSKEIQLQGLQLLKQTLETLVELEKQRSSKLDLISRKELMDLLGISA TTLDNWEDLGLKRYQTPMDGAKKVFYRPSDVYLFLAIK >SP_0162 hypothetical protein MSMWRDWAPMWWSFSVLSEIWYNSTNQFLGK >SP_1487 hypothetical protein MAKCKKYEEFGLDSLLQETRGGRNHAYMTVEQEKVFLARHLKATEAGEFV TIDALFQAYKKELGRSYTRDAFYQLLKRHGWRNITPRPEHPKKADAQTIV ASKNKVSIQEDK >SP_0818 IS630-Spn1, transposase Orf1 MWYNLLMAYSIDFRKKVLSYCERTGSITEASHVFQISRNTIYGWLKLKEK TGELNHQVKGIKPRKVDRDRLKNYLTDNPDAYLTEIASEFGCHPTTIHYA LKAMGYTRKKKELHLL >SP_1866 hypothetical protein MHVVQNLVNIPKLTRIFISKQKNNTPSKLFGFFMKFTENS >SP_0195 hypothetical protein MKKSSTALWRSDTVWETVSAQLPEMGKSWNVQ >SP_0201 hypothetical protein 
MLSHAWDDTATLYRKSERLSPSAILSPLHYTATEENRNKLLNDLKEKQPK VIVVNDKVVVWSEVETLLKENYQQVKTDYSEFKVYKIK >SP_1707 hypothetical protein MMEHLFKMIILLPCFYFFSWIDKDNRESKFFPIFYYFYWIYITLYALFSL AWTVFSVLFFNIVLRNLTDIKLWGIWLLLLLIAFASDWLAYVFFKKMLDL RRELGKSKGGRH >SP_1822 conserved domain protein MKDYKINFDLGKIEYFDNNCLIQVYKFISFYDICEMVFAFHLPPDELITN VIFKEKINSMLKCYIDRLLYVFINPTHFTEKVNLQFYGSFFSYEFICREV GNILKNKGVKCNLNFFEGEEYL >SP_0990 hypothetical protein MGVAEKIEEVSMGKSLLTDEMIERANRGEKISGPPLLDDNEETKILPTSS SRFGYANPKDHGFSQETLKIQVEPSIHKSRRIENTKRNVFNSKLNKILFA VIFLLILLVLAMKLL >SP_0514 hypothetical protein MILDFLKKYKHELQDFRDRGWELIGAGSCSILRKSSSDLLPEDQVYMSKG LKWEVMRSRLRSCTATFSGGLVVCMSLFREDLSMSFFLIFVLYAFLISYL IYGYFRLKRKYRVDE >SP_1481 hypothetical protein MICQFIRDMLDLPAKNVTILEGSNIHVLPSMPYSA >SP_1042 hypothetical protein MMKMATDKNRIMISLDDKNLEKLENLVEDARDRRGMRLTKSQVIELLLNT VDYFDDIMGAIYSKK >SP_0832 hypothetical protein MVIGVLDSKEELKESENDAPKLETPLREEPRLAPQTLPEASEVLENKREE SKVEIT >SP_0089 hypothetical protein MFLADEKGSEHTAAELIDNLKEVIAKLKANA >SP_1109 hypothetical protein MFEEFPKLPDLKQVTFPNDKEKSQNSKEKLDDCFPTTPI >SP_1756 conserved domain protein MSEEDLFYKDVEGRMEELKQKPIKKEKETRGEKISKTFSLLLGLMILIGL LFTLLGILR >SP_0361 IS1167, transposase, truncation MGGYFRNFENFKKRIFIALNIKKERTKFVLSQA >SP_1347 hypothetical protein MAERFWENLSIILAERNISWIELTRKMFAGEFHYPSELNRLYQKIRH >SP_1436 hypothetical protein MDNKKLKVKDLVSIGVFGVIYFAFMFGVGMMGLIPILFLIYPTVLAIVAG TVVMLFMAKVQKPWALFIFGMISPLVMFAAGHTYVVVVLSLIVMIIAELI RKIGNYNSFKYNMLSYAIFSTWICSSLMQMLLAKEKYMEWSLMTMGKDYV DVLEKLITYPHMALVALGAFLGGILGAYIGKALLKKHFSNGLYCVGYFTP CLILWCYLN >SP_1304 hypothetical protein MVFLKIDKKTNVFLGNIFLKDKILKYAIRKENV >SP_2117 hypothetical protein MGMFVGMFKARVESHEIILDVKALMPWISAICLLIGFISMFLTFNFLKKS RKFHSLYQEEMDDDLNETYYVQMYRNLEFGTIAFNITGVAIPLAIFISLS EVIILHTNPQTFFLSFLLFVVFLVAQKSLFKTIAIVRQFDLEFFATPKDV LNYINSYDEGERQANLEQSFRILFQLHQYVLPALYIFLIIISFLTGEIQL LAFLLVGAIHVYINVMQLPMVKRYFK >SP_1425 hypothetical protein MRITMSGNIQNSELFKFFNENSIKVVDFETKKETLKDIYLNRSK >SP_0343 IS1380-Spn1, transposase 
MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0138 hypothetical protein MHSFGIHPRYPYKMSEIVEMSKKISDLNDKKEFHKKNCELRFQDSVLDNI NSVPGTTHLFSCLNIQTLVKIILYIFRYNSCMKIFNFRDKELIFFNKIVN GLIQNIEENLEDDIERILKYLYICLFNEIFIIKNKVNFFDDVEFNQTLSE FLDKL >SP_1658 hypothetical protein MGGILLLIGPFVLLGIAVNTAATTLNGGATAGAFSGVALLLNALKIANLV LGIIAIVYYKGDKRVGAAPSVLMIVSGGVSLILFRS >SP_0692 hypothetical protein MIIMQDNFLFEEIEEISVPVNDFSAGLATGIGFGLAILALAGC >SP_1038 hypothetical protein MINKLSRYMESSGKTYQNHYVTILKWYEEDKDKLRQKGLNKKMNYDVGES L >SP_1760 conserved domain protein MIITQRQSIHWGEVGGTYMYGTTVSYYPDKSVRLYNPLLPSGEILKTWFS SVNYQAARTQPQLPLLKRKQEYQLSLVFDCQPENGVYTKITFFDRYGDIL EKKVEKVKDFIFTYPEDSYTYRVSLLSAGFESLTFYHFSIKEIRSV >SP_0861 hypothetical protein MSKKDKKIEIQVADAKVNVGKDSFEGYTLTIGKKVIGEIAELDGQFAIIK NGNVDSFYKKLEKAVEILIENYNLAK >SP_0142 hypothetical protein MLNLQFAETMELTEAELQDVRGGNLVNSMGGGGRSGISGWGVPGIYPGWG NQGMSPNRGAFDWTIDLADGLFGRRRR >SP_1779 hypothetical protein MTMYQDLLRKIAEEKPNYNQEEIQWLFDHLGNPSPEIRNVLLNQGLHYLS KEKDTRGFSSQYGWVHAFAHGADLLTEVVCHPDFPKNRVHEVFDILGQLF KRMSIRFTDDEDWRLARVIYEPILQGKLEQEQVASWIKTVDFPIEEREDF YKFSNFRSCLVEVYVQLDQRNSLQDDLKEAIQSFQY >SP_1078 hypothetical protein MAFGDNGNRKKTMFEKITLFIVIIMLVASLLGIFATAIGALSNL >SP_0398 hypothetical protein MSLIFVVIYKVKEAGQKVFKIGKRQPIGCSKILIGCPLLWKALLDFLLRL SF >SP_1307 hypothetical protein MKKENEYVILTTASLGVMIGIVFAIFLDFPVEYGISLGLLNGIVLGSLIV YKNNKN >SP_2124 hypothetical protein MINILYFLIILTIWQVFDEFSEKYDKMKKIRNQGEVYGADWKSL >SP_0116 hypothetical protein MSLKLLDCILDYQERFNGKTCQVSTNYKYLEIFKVNFCLTDLHHLFDLHK ITRDYASQTKPAIQDGVFILEDFRNILCTMM >SP_1570 hypothetical 
protein MERSLFGLFTAFLCFICFLAGAQAFRKKRYGLSILLWLNAFTNLVNSIHA FYMTLF >SP_1657 hypothetical protein MRIDSIASSVTTGVSKVIVSFELLDVATVFSSLLVEVFFELYVPHATRRE RDNRATPIPINFFVLFMLKYLLFF >SP_1146 hypothetical protein MDFLNHSFDTKKVINTKINAVNSKNNVGKNFIDVYREMKEVPNNKIHQSK VNVALIKK >SP_1819 hypothetical protein MMNLSSIYSSMPTTKSKQKGWTNTKKASNTQ >SP_1729 IS1381, transposase OrfA/OrfB, truncation MREYRTYEEIAADFGIHESNLIRRSQWVEVTLVQSGVTISRTPLSFEDTI MIDVTEVKINRPKKQLANDSGKKKFHAMKAQAIVTSQGRIVSLDIAVNYS HDMKLFKMSCRNIGQAGKILADSDYQGLMKIYPQAQTPRKSSKLKPLTAE DKACNHALSKERSKVENIFAKVKTFKMFSTTYRNHRKRFGLRMNLIAGII NHELGF >SP_0834 hemolysin-related protein MSAQITINHKKARYVRIELEGYNALSLAEVEVFCFIATNAETATQVSKPV QPISQTPVKDKTLTIQHSGAYIARYSITWEEVPVDKDGNQVVRSHSWEGS GRNQTAGFVLNLPIKENMRNLRVKIEKKTGLLWNRWQTIYENRPILAQPH RKITHWGTTLNSKVSDDDVL >SP_1141 hypothetical protein MPEFIIVEGNNDLGEFFQIDGELFSDNELLENLKKWREWEVPVIIDDWCN RILNEDETEILYFPTHEDKMNYIRVEKDLEPLYHTSNKIYATISKSEWLE LLN >SP_0854 hypothetical protein MSAYLKEALKGAAKTQTKTNFGKPDFHIEKYKQDSLVTYRN >SP_0059 hypothetical protein MGIAIFLPLFSFFHRKFYHKSEKNSSLLNKKLE >SP_0774 hypothetical protein MLSVNTILEKFYKEHQVKPFISPERELDTWLLSPKPVPKRNMDLLVDDSL AGDIILLWRIQFGTFTTET >SP_1003 conserved hypothetical protein MKINKKYLAGSVAVLALSVCSYELGRHQAGQVKKESNRVSYIDGDQAGQK AENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAII SEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLKDAAHADNIRTK EEIKRQKQEHSHNHGGGSNDQAVVAARAQGRYTTDDGYIFNASDIIEDTG DAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPA QPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDP AQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPD SRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVF EENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREF YNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVDDIL AFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITS DEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSG NTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHN IKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHV RKNKVDQDSKPDEDKEHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTD 
TEETEEEAEDTTDEAEIPQVENSVINAKIADAEALLEKVTDPSIRQNAME TLTGLKSSLLLGTKDNNTISAEVDSLLALLKESQPAPIQ >SP_1914 hypothetical protein MKKKAFGIVLLVLAAWILLQGNFGIPSLDGKIWPLLGIVFFAYKSIESIL RRHLTSAVFTGLLALIIANYAYDLLPVTNHSLIWASILVVLGVGYLTHSS KFWNEKKWWYNGKKTVVTDKEVAFGSGTFYKQDQDLVDDQVEVAFGDAKI YYDNAEMLGDFATLNIEVAFGNATVYVPQHWRVDLKVETSFGAAKADAPV APTSKTLIIRGDVAFGKLEIVYVK >SP_1233 hypothetical protein MSGLLYHTSVYAVKKEILVNTRKKTQFMTMTALLTAIAILIPIVMPFKIV IPPASYTLGSHIAIFIAMFLSPLMAVFVILASSFGFLMAGYPMVIVFRAF SHISFGALGALYLQKFPDTLDKPKSSWIFNFVLAVVHALAEVLACVVFYA TSGTNVENMFYVLFVLVGFGTIIHSMVDYTLALAVYKVLRKRR >SP_0126 hypothetical protein MSRMSKFVIELSSFFLVHFYIRKRKGKVSIFLNYF >SP_0188 hypothetical protein MSRKKYENDEKSQKKLKIGRKSDVFYGIID >SP_1986 hypothetical protein MLMCEKIRIRRVSDYPSARGGLEDILIMENMTNHLLLVQIRVHGYLLDFA SIEGQRQKHYRLKNLPQTVELTVDDVEEDVDLTLPENRSYQEADFFERMF RENC >SP_0906 hypothetical protein MAVKFTKRDDLDKMFEEFAKLPDLKQVTFPDDKEKKVKAEKKN >SP_1054 Tn5252, Orf 10 protein MKRDVRDIRKQFRLTEAEEKQILALMRERGETNFSDFLRKSLLSSDLQKQ METWFALWQSQKLEQISRDVHEVLILAQSERQVTQEHVSILLTCVQELIQ EVANTIPLSKEFREKYMR >SP_1696 hypothetical protein MALILESSKRNEDSHMTVTIKVNYQTTFQKKEAKN >SP_1253 hypothetical protein MAAKLWEEGKMVYASSASMTKRLKLAMSKV >SP_2139 hypothetical protein MILWSFDFANDHAHAFFMDNVEWSHADSYFRSFVSDDVEERYTENVYLDS LSVKQKFKFIFDFGDEWRFECQVLREIETEDEEAYLVRSVGTSPEQYPDY DGFDYEEW >SP_1060 hypothetical protein MADMKNKYDVKRIIPDELSESLDIFLKNYSETGLSDYNTYLFYGFILKSY KLPRENRYSIKLLVKELQNRGLKVTLIINIYYHALNCLALNDGLKIYGED FLI >SP_1787 hypothetical protein MVLSGGKSAMPMTQKEMVKLLTAHGWIKTRGGKGSHIKMEKQGERPITIL HGELNKYTERGIRKQAGL >SP_0076 hypothetical protein MIDVTIGQKSKTGAFNASYSICFSGENFSF >SP_1353 hypothetical protein MRNPYLPVFESDKRLIETDKLIWFPAKNSLAGFLF >SP_1757 conserved hypothetical protein MIELYDSYSQESRDLHESLGATGLSQLGVVIDADGFLPDGLLSPFTYYLG YEDGKPLYFNQVPVSDFWEILGDNQSACIEDVTQERAVIHYADGMQARLV KQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVL LENHVTGDILLTLPGQSMRYFANKVEFITFFLQDLEIDTSQLIFNTLATP FLVSFHHPDKSGSDVLVWQEPLYDAIPGNMQLILESDNVRTKKIIIPNKA 
TYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEA IAGALPDVTFRIAAVTEMSSKLLDMLCYPNVALYQNASPQKIQELYQLSD IYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFESSEVA ALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGG >SP_0072 hypothetical protein MQIAGIIFNSSTTNGDKVDFNPTENVDLRNNFASLVK >SP_0810 hypothetical protein MTVEEKKVFLARHLKAAEAGEFVTIDALFQAYKKELGRSYTRDAFYQLLK CHGWRNIMPRPEHPKKADAQTIVASKNKISIQEEKKAL >SP_0296 hypothetical protein MSLEAISEALTHSDTVTTKTYVNTSNIVPLSAGQVAYQHLKNK >SP_1788 hypothetical protein MKNRIIDVFEVVNRLLVITVENPDFEDLRVNHTQ >SP_1533 conserved domain protein MLENGDLIFVRDGSDMGQAIQTSTGNYSHVAIYLDGMIYHASGQAGVVCQ EPADFFESNHLYDLYVYPEMDIQSVKERACKHLGAPYNASFYPDAAGFYC SQYIAEILPIFETIPMKFGDGEQEISDFWREYYIELGLPVPLNQAGTNPS QLAASPLLQCKERNLHDSDF >SP_1772 cell wall surface anchor family protein MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKT VEKTDALATNDTVVLGTISTSNSASSTSLSASESASTSASESASTSASTS ASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDY VASVTNVNLQSYAKRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINA SLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPK TNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQ VLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSF TFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASAS ASTSASASASTSASASASTSASASASTSASVSASTSASASASTSASASAS TSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTS ASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSAS ASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASAS ASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTS ASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS ASTSASASASTSASASASTSASASASISASESASTSASASASTSASASAS TSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASESASTSASASASTSASESASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASASASTSASASTSASESAS TSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASISASESASTSASASASTSASVSASTSAS ASASTSASESASTSASASASTSASESASTSASASASTSASASASISASES ASTSASASASTSASASASTSASASASTSASESASTSTSASASTSASESAS 
TSASASASTSASASASTSASASASTSASASASTSASASTSASESASTSAS ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS TSASVSASTSASESASTSASASASTSASASASTSASESASTSASASASTS ASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS ASTSASASASISASESASTSASASASTSASASASTSASVSASTSASASAS TSASASASISASESASTSASASASTSASASASTSASASASTSASASASIS ASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS ASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASASAS TSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASASTSASASASTSASASASTSASASASISASESASTS ASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSAS ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS TSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSAS ASASTSASASASTSASASASISASESASTSASASASTSASASASTSASAS ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS TSASASASTSASESASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASTSASESASTSASAS ASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASAS TSASASASTSASASASISASESASTSASASASTSASVSASTSASASASTS ASESASTSASASASTSASESASTSASASASTSASASASISASESASTSAS ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS ASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASASAS TSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASASTSASASASTSASASASTSASASASISASESASTS ASASASASTSASASASTSASASASTSASASASISASESASTSASESASTS TSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASES ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASVSAS TSASASASTSASASASTSASESASTSASASTSASESASTSASASASTSAS ASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASAS ASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASAS TSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTS ASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSAS 
ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVSAS TSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSAS ASASTSASASASISASESASTSASASASTSASASASTSASASASTSASES ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASAS TSASESASTSASASASTSASESASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASAS ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS TSASASASISASESASTSASASASTSASVSASTSASASASTSASESASTS ASASASTSASESASTSASASASTSASASASISASESASTSASASASTSAS ASASTSASASASTSASESASTSTSASASTSASESASTSASASASTSASAS ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ESASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASES ASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASISAS ESASTSASASASTSASASASTSASVSASTSASASASTSASASASISASES ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASESASTSASASASTSASASASISASESASTSAS ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVS ASTSASESASTSASASASTSASASASTSASESASTSASASASTSASESAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASISASESASTSASASASTSASASASTSASVSASTSASASASTSASAS ASISASESASTSASASASTSASASASTSASASASTSASASASISASESAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSVSNSANHSNSQVGNTSGSTGKSQKELPNTGTESSIG SVLLGVLAAVTGIGLVAKRRKRDEEE >SP_0009 hypothetical protein MENLLDVIEQFLSLSDEKLEELADKNQLLRLQEEKERKNA >SP_2141 glycosyl hydrolase-related protein MVRFTGLSLKQTQAIEVLKGHISLPDVEVAVTQSDQASISIEGEEGHYQL TYRKPHQLYRALSLLVTVLAEADKVEIEEQAAYEDLAYMVDCSRNAVLNV ASAKQMIEILALMGYSTFELYMEDTYQIEGQPYFGYFRGAYSAEELQEIE 
AYAQQFDVTFVPCIQTLAHLSAFVKWGVKEVQELRDVEDILLIGEEKVYD LIDGMFATLSKLKTRKVNIGMDEAHLVGLGRYLILNGVVDRSLLMCQHLE RVLDIADKYGFHCQMWSDMFFKLMSADGQYDRDVEIPEETRVYLDRLKDR VTLVYWDYYQDSEEKYNRNFRNHHKISHDLAFAGGAWKWIGFTPHNHFSR LVAIEANKACRANQIKEVIVTGWGDNGGETAQFSILPSLQIWAELSYRND LDGLSAHFKTNTGLTVEDFMQIDLANLLPDLPGNLSGINPNRYVFYQDIL CPILDQHMTPEQDKPHFAQAAETLANIKEKAGNYAYLFETQAQLNAILSS KVDVGRRIRQAYQADDKESLQQIARQELPELRSQIEDFHALFSHQWLKEN KVFGLDTVDIRMGGLLQRIKRAESRIEVYLAGQLDRIDELEVEILPFTDF YADKDFAATTANQWHTIATASTIYTT >SP_0853 hypothetical protein MKENNMLVNWKLESDVNDYVKKQFENLGLKKLQDY >SP_0094 hypothetical protein MFSLVLILTIQEISRTLYNFQSNKLHSFSQAGILV >SP_1418 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0374 hypothetical protein MSKKRRNRHKKEGQEPQFDFDEAKELTVGQAIRKNEEVESGVLPEDSILD KYVKQHRDEIEADKFATRQYKKEEFVETQSLDDLIQEMREAVEKSEASSE EVPSSEDILLPLPLDDEEQGLDPLLLDDENPTEMTEEVEEEQNLSRLDQE DSEKKSKKGFILTVLALVSVIICVSAYYVYRQVARSTKEIETSQSTTANQ SDVDDFNTLYDAFYTDSNKTALKNSQFDKLSQLKTLLDKLEGSREHTLAK SKYDSLATQIKAIQDVNAQFEKPAIVDGVLDTNAKAKSDAKFTDIKTGNT ELDKVLDKAISLGKSQQTSTSSSSSSQTSSSSSSQASSNTTSEPKPSSSN ETRSSRSEVNMGLSSAGVAVQRSASRVAYNQSAIDDSNNSAWDFADGVLE QILATSRSRGYITGDQYILERVNIVNGNGYYNLYKPDGTYLFTLNCKTGY FVGNGAGHADDLDY >SP_0759 hypothetical protein MRLEAVFQPLILGEKNEIFNTQYSQLDGERSRGKIPDFA >SP_1028 hypothetical protein MSTSSRVLVLKKFHGIMDGNRNVAVFFVGQ >SP_0560 hypothetical protein MEFLLVLCNLDYHLDKFKEPIQYLKHYLVKQQLDCKRDDHPKEFHNPNNR FDKKNSKKTKKISFSLLWLNEPPSRIH >SP_0997 hypothetical protein MVASASASSTSTQAQEQVDKSELRALSQELDQRLKALATVSDPKIDATKA VLLDAQKAPEDSALTE >SP_0068 hypothetical protein 
MDHTRLSSKDLWSAFPTSNSIMGENLAWNHDGFLKAIEQWRAEKADYVEK KIVVQTTGNLVTMSR >SP_0683 hypothetical protein MKPLSYVIRITFLLFVVKEKIEFFRYFTILPL >SP_0684 hypothetical protein MEEYGHMEKVQEVVRLFQITIMVKNIIIHLS >SP_1926 hypothetical protein MKGVTNMTPEEMYLTERLDVQIAHFLKKSVQHRRRYKVLKITEIVAGFLI AVFCAIPMPGDRYRLISVALSSLGLLCEGIINLYNAKENWISYQKTAQLL EKEKFLYQCQTEKYAGKTKAFALFVKTCEGLISEEINQWESIQSKEVAAS ADAPVKKE >SP_1503 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0491 hypothetical protein MCFVPYYKVNHCEEAFAWYQDVNLVYLVDGVKLPYSQADLGSHVFLFGSA W >SP_1677 hypothetical protein MAQKGVSLIKAAFDTDNFLMRFSEKVLDIVTANLLFVVSCLPIVTIGVAK ISLYETMFEVKKSRRVPVFKIYLRSFKQNLKLGLQLGLMELGIVFLTLSD LYLFWGQTALPFQLLKAICLGILIFLTIVMLASYPIAARYDLSWKEILQK GLMLASFNFPWFFLMLAILVLIVMVLYLSAFSLLLGGSVFLLFGFGLLVF IQTGLMEKIFAKYQ >SP_0124 hypothetical protein MMKDLNNYREISNKELQEIKGGFGVGVGIALFMAGYTIGKDLRKKFGKSC >SP_1924 hypothetical protein MIKRGDVVALYLPFPTISSDLAVKNHMYICIDNSMTKNKELVKNQTFKPA LLTRRLVKNFMIEEPDLARNPFTRPTLIDLDKVFMLDNTVIPTSYLARRR RNVSEELYEEILDYLVQPRLISLNKSEFMQLNPGTY >SP_1363 conserved domain protein MFEHYSVADLFANLYKKRKANILALIALFALIAVPFTIKAVRNKNTVKDT TSYSTYLIYKITPPKESDKTILNHQIGGYSDFYGKLIDGNLNGAYLFNDV EPSELKKIASELDTTETTLKNSTNDYWWKKLTVYYMIDDAGVGVKILTSS KDANNLLEKKIDGLIEKFKHAYANVKIEKLETINSKELNANGETALGLNV KNLILRLVVIGVVCVILVVMGNVLVYLFNPTINRVGDFSQYQIDFVTEIT TIANLADVLSYKNTGQELTIVSSNKAILDKLKQSQEALKGMHFVDLQDVS SLLERDTVLLVEEYGVTRYKKFEQSLQILRNLNRSILGVATFKL >SP_2102 hypothetical protein MLYNNDKEEISMLKEVLTVAKVAKKSSLFLGGVAFGTLGLKILASKEAKK GYSKALAKAYKLKDELDASVSVVKQHGDDVLQDAKYLYEQEKKEEQLDSL IGE >SP_1640 hypothetical 
protein MLSTRFTGKLSKWGNYFGIVNTILSGAIDYILGNKAAIITYPVTFLIYTF AIKKWEASQEGRPNQMSQKQVKLAAIIISIIAFLFAFVTNYIGYGGKMNL LAYVTTIAFALSLIANALNALKLTTQWGFWLIYNFVQLTKAGIQGNFANI GKYIFYILNAIGALFVWNDEEVR >SP_1480 hypothetical protein MAELDNGIQVIIEIQVHHQNFFINRLWPYLCSQVNQNLEKIRQREGDTHQ SYKQIALVYAIAIVDSNYFSDDLAFHSFIVK >SP_2104 hypothetical protein MLKKYFSKYKWTDLFWILFVILTCLYIGNHDLFTLNHQEFSFRGSVWGLV LALYHLLFIDKFVISNRK >SP_0911 hypothetical protein MKHKEHILIGLLYLLSPFIGQLLVEHTHFISTEFTGTAYVICWLSVVISI HHFSKNVLSQQQK >SP_1349 hypothetical protein MTFLDDYHKKHNYPLFYESYLQNVMEFLESQDIKNGVDAFVDDHQNLVFV LYGQGYRAEGKEGILTTQVTVKAYDEDKKPINFANLLDSLIY >SP_0693 hypothetical protein MRIVDKIKILPTPYEGHYHLYIPSSKKHVLVGKQEKNG >SP_0198 hypothetical protein MKLKRFTLSLASLASFSLLVACSQRAQQVQQPVAQQQVQQPAQQNTNTAN AGGNQNQAAPVQNQPVAQPTDIDGTYTGQDDGDRITLVVTGTTGTWTELE SDGDQKVKQVTLDSANQRMIIGDDVKIYTVNGNQIVVDDMDRDPSDQIVL TK >SP_1252 hypothetical protein MVSMVDKDGKLIPEQGGARSTSPAPVVIRKGLDIDKIMMHLSDTFNSWDY RQVEYY >SP_1805 hypothetical protein MSVEEKLNQAKGSIKEGVGKAIGDEKMEKEGAAEKVVSKVKEVAEDAKDA VEGAVEGVKNMLSGDDK >SP_0924 hypothetical protein MILMTKNINLTNEELELIQGGADPYGKNPNGRYDWEIEPVLTLLVHGFCP RGTYDSGYIGGGNHLCKGSAARF >SP_0573 hypothetical protein MYNFSQSCYNQPIGIKEVTLMAVFVSLDGIVVEVLDVFSSFNGDSEFFLC IAF >SP_1345 hypothetical protein MTKQMKLMECDLVHSVQIVAVTGVLLVGKIVNSFKL >SP_0297 hypothetical protein MNNLDNMRFIMEIFASFSPEIELLLSYFSLFLMIYFNFLPLGKNNKDMN >SP_0558 conserved hypothetical protein MIRCKKEIRSLYMAEQDLAMQVLQQVVKLPVVKVDRSKFLVDKFSKELDP KDIPTLLEQGPTTLLSQEILDRVANACIRDNVLLASGTSVLAGLPGGLAM AITIPADVAQFYAFSLKLAQELGYIYGYEDLWASREELSEDAQNTLLLYL GVMLGVNGTAALLRVGSITIAKQVMKIVPNKALTKTLWYPILKKVLKIFG VNLTKGGLAKGMGKFIPILGGIISGGLTFATMKPMGESLQKELSKLVNYS EVQYQEDVETIRKEAEIIKGE >SP_1025 hypothetical protein MPSHYTRTKTFMDIYIKKAIIHQFSPDDTELFLADKFLNITPKIEEYLRK KIEHVYSDEAKTGIFEEENPFFNHITDDLLETSVTLANLWKEEFSISENL KTNDLIFVQFSKEGVEHFAFLRIALRETLTHLGGEVDNPIKLTQNNLPGF GTGADEALVVNLQSSKYHLIEKRIKYNGTFLNYFSDNLLAVAPKISPKKS IKELEKTAQRIAKSFNTDDFQFQSKVKSAIFNNLEESNELSPEKLANDLF 
DNNLTARLSFIDQVKEAVPEPVQFDEIDASRQLKKFENQKLSLSNGIELI VPNNVYQDAESVEFIQNENGTYSILIKNIEDIQSK >SP_0595 hypothetical protein MQILCYFTITVVAKPNNSGEVHLDVSIEDNQGGSGYNFSSVSSSSQTAKY EGTVYNNNSSLYITIDKTSDATALLKLKLNNVDNQPATEVPSSGITVKLN AKDNAGNWTSASNKKEVTVKIVSAKPTYPDKILVKNPDNIKDTEKMPLLK N >SP_1832 hypothetical protein MVKINKICSIQGSSVENEDIVGSQNQYFWIIGGATDLYNSKEEIGYSVSE VVHILSESLSVNCKESKTLKQIFETALLEVKDEIGLNSYKLTEYSKMK >SP_0108 hypothetical protein MQRLEVYKNYQRLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKN KKSYDRLAICYVPIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVEIY YQAKER >SP_0790 conserved domain protein MKKMKYYEETSALLHEFSEENQKYFEELWESFNLAGFLYDEDYLREQIYL MMLDFSEAERDGMSAEDYLGKNPKKIMKEILKGAPRSSIKESLLTPILVL AVLRYYQLLSDFSKGPLLTVNLLTFLGQLLIFLIGFGLVATILRRSLVQD SPKMKIGTYIVVGTIVLLVVLGYVGMASFIQEGAFYIPAPWDSLSVFTIS LVIGIWNWKEAVFRPFVSMIIAHLVVGSLLRYYEWMGISNVFLTKVIPLA VLFIGIFVLFRGFKKIKWSEV >SP_2241 hypothetical protein MSTHFVLQELKKTRIRRCLMKSLARLLIIHVFISIFLFFALTSGAISHTV LLLLLLFLPALNKGLEKIQSKRIPVLNAALFFLLISFPQLLTNPVQWKFS IFLVVTIISSLAYFYNFYQVVKEVDQKQLI >SP_0316 hypothetical protein MVYLRAISPNHQPAPKDAGFHVVQALLLLTRHLPL >SP_0691 hypothetical protein MMKKVTHMSDEVFLFEEIEEIVAPTDGEFLGEVLLGTGVVLLIGVACC >SP_0311 hypothetical protein MSLDIDKEKMTIMGIAFENRSVFKSVWYALSTNMIEGWRPTVSDVEKLRD EALALGMT >SP_1556 hypothetical protein MNTDYLPLEKRCLSCLSWKVSAIPYFSEAAKVLCILIIEHAWNPV >SP_1945 hypothetical protein MKSMRILFLLALIQISLSSCFLWKECILSFKQSTAFFIGSMVFVSGICAG VNYLYTRKQEVHSVLASKKSVKLFYSMLLLINLLGAVLVLSDNLFIKNTL QQELVDFLLPSFFFLFGLDLLIFLPLKKYVRDFLAMLDRKKTVLVTILAT LLFLRNPMTIVSLLIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVI VTLIFF >SP_0809 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVL >SP_1047 hypothetical protein MNCRGHETRQRIVRDFEVQPKAHIKLLANQQKHSDAGATIEDEYYVFIAE SKIDGKKEVIQCCMGAARDFLELINHKGLPLFNPLVGDSHVNNRQEYDNT GSGNL >SP_0543 hypothetical protein MVSPRTNQLMFIGLADFMFVICLYRGISETEFYQQLIAYIGVFSACLSRF CSCGA >SP_0172 hypothetical protein MIFIEYTTILLPLARDFVYVEGLGSYVVELFCSF >SP_0167 hypothetical protein MDKKLDILDKVKEYLGNKTTQILDNQYKEFLKLNDIRRAFGISEKVLNNS 
FNFTSKEFNDLINNENYLFEYACRIREEWRKKCFNHSYRFLCSPIITDDF LNTKTLRSSQIEYKYERYLSKSSIGDRAVDGFVSFNTLTANGMSAIKLCL EILNSIFFKKKIDLLYSTGYYETRFLLNNLAKSGISCYEVSNCELDKDKF YNVFMMEPNRADLTLQKTDFKIVEYFVKYKNNSIKVVILDISYQGSNFKL VEFLEKFKFANVIIFVVRSLIKLDQMGLELTNGGIIEVFIPNHLRKLKNF IEEEFNKFRNSHGANLSLYEYCLLDNSLTLKNDWNYSDLVMKFTSNFYAD IKDLFMENSDIEIIHEEGVPFVFLDLIGEGKKEYEMFFQWLNFFYKQLGI TLYARNSFGFRNLTVEYFGIIGTERYIFKICPGVYKGLSYYLMKFLLKSF SNEYLKTTDEVNR >SP_1174 conserved domain protein MKINKKYLAGSVAVLALSVCSYELGRYQAGQDKKESNRVAYIDGDQAGQK AENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAII SEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLKDAAHADNIRTK EEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTGD AYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQ PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPA QITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDS RPEEPSPQPTPEPSPSPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSR YIPAKDLSAETAAGIDSKLAKQESLSHKLGTKKTDLPSSDREFYNKAYDL LARIHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLVEDILAFLAPIR HPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYV TPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGA EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFD EGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQ ADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEES EEPQVETEKVEEKLREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQD NNTIMAEAEKLLALLKESK >SP_1810 hypothetical protein MLNQDLFDSLEAQKIVDTLMKGQKDYVDERLEKRETMIVSNGYAWTRPNH IDTAFASADLFEYKLQLAGQTWGYLEFETNTEKYGKVLLIIKGKKRLTNQ FPLVQKNKSGYLFEYAQMNTLYLNQHSSYKNDENSHSFPIQMELVSDEMI QEIEQATKNSNIEKFMILTYEADSENNIISVDVVMPDARTGQLHLIQDLS EYIQSSSYHFEEAKYQDIPNFSELSETEDFEIIPRIEKQEGQK >SP_0328 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL 
IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0067 hypothetical protein MALTLAGAVLTNDVFANDRLVATQTTDGKNENVLTSEVLKPSSGNVLVGI KGEFVAPHQQSILDAINAICKEAADEGLVDKYVPIK >SP_0031 hypothetical protein MIAKKIFSNPEITCQFIRDMLDLPAKNVTILEGSDIHVLLSMPYSVQDFY TSIDVLAELDNGTQVIIEIQVHHQNFFINHLWAYLCSQVNQNLEKIRQRE GDTH >SP_0110 hypothetical protein MLLLLLCTTFFVFNVNYTREVVRIQEMGKTVDSLDLYLKDINEPAASVLR FFEDVSKEYKVSIIKTDSGDEVVKSGVFDKDTFPYQEFGISSLDFTTDGE GVYSNKEISNKLGTIPTFLKAKPIQLMTFQTYIKDTSRSLNGRYTITSTQ EMDKDRIVQKWSDFFKIDQATLLEPTYKSAVEVINRDLLLSAIVFVLAIL LLVLVTVYQPMMEMKRVGVQKLLGFQDRAVLADVVKGNLYLLLGGALVIN LGVFFLLDYKPKDLFPMLWLSHFLLLQLYLFISWLTYLLIQKMTISSLLK GFSSFKFGLIFNYVMKIGTTILLTALLIGVGRSLEQENKELAYQQQWVSQ GNYLTLETFKLNDNLWQEELAGSGKSTDYFYRFYQDLVEKTQAGYVQSSS LPVKNFVQSEQIQQYQLTDTVDVYYANRNFLKSKGFKLPNTGIKKVILMP ASTKGEEDKNQLLGKLIAFHSMKYEEQQKRTIEEMDVEIAYYEGDWSFFP YSDKRKENLSNPIISLVNDSDMMWDEKASLSTTGLNNPIKIENTVQHQKE ITELVEKLSDGNYLKFSSIQAIQQEKVDSYRDAVRNFNLLFALFGLLSMM ISYFLLVTTFLLKRRDIITKKFMGWKLVDRYRPLLVLLLLGYSFPLLVLI FFAHAFLPLLLFAGFTCLDILFVLGLASRMEKRSLVELLKGGIL >SP_0542 hypothetical protein MCNNGLTFLLGPFAIGIGVTGAAGGAILGGVAYAAICWW >SP_1303 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYYKRLQIVL >SP_0471 conserved hypothetical protein MDWYDYMIQASKQSQFNASHWFRYLRKVIFEDYSYLTNQDVEKLLDSKEL TRFQKISLKYAFQEHTPTHKYVISLNKPAKLTNVQKLMEKYKHG >SP_1643 hypothetical protein MILSLVSLSDIPLFLQGTLLILGHLIPSYRICQSLKRDFPQAYQEPISFW SIL >SP_1379 hypothetical protein MLFYSSFKKWYTRLPAKLGSKCVRITVKNALPSWRSISFYQRKSNSKL >SP_0653 hypothetical protein MDKQYLHEKLDAMRQNFVESTHHERAMGVLDQAHMSKKMLKIKKKLVALE MERCQRKIEHKDCSKIDQKIKEQKEIFESCCKKD >SP_0183 hypothetical protein MQDQHAIKNKKTIKATAGAVAFSLTFLSYIQ >SP_0934 hypothetical protein MTASFMVAEMRRHKKIVTNPYFFDRIEVVKKK >SP_0329 hypothetical protein MNDLGKYNELERSSKLTKRQFFENQMLDYTIIAHESFEIIRHSVYQTDDR EVENALAFEVKNDETDKLILLLSEDIGVGEKLCLVDGTKMRGKCLVYDKI NERMIRLQC >SP_0407 hypothetical protein MSSCLPCPFGAFTVSPEFRPFTVMENQTIPHFY >SP_0563 hypothetical 
protein MSYEQEFMKEFEAWVNTQIMINDMAHKESQKVYEEDQDERAKDAMIRYES RLDAYQFLLGKFENFKVGKGFHDLPEGLFGERNY >SP_1892 hypothetical protein MIQIVVRSVKDYSENRKFDAETLEFRKTYSKMKYGRNNVILEFKLNYNNI VEVSF >SP_0495 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0327 hypothetical protein MKEERRQFFERVDGNQCRDYILSHCSKDYEKVKSSLERLMDNRFMFDSPW DMEPCSKIHQIQPMVWDQVFEDDPEWSYMLNRQEYLLQFMIGYLVEGDKD YIQKCKFFLFDWIEQVREFSPQSLMTRTLDTGIRSFTWLKLLLLLLKFDL LEEKELEKILVSLEKQIDFMKSYYRAKYTLSNWGILQTIPMLAIYHFFSD KMDLEEAYHFASEELKQQIETQILGDGSQFEQSILYHVEVYKALLDLCLL LPDLQDSYQELLEKMATYIQMMTGLDGRTLAFGDSDSTETTEILSLSAVV LNQEDLLNGLDVKVDLLSLLFLGREKVKRLQEFEKRAWQPKSMIFEDSGH VCIKDEHRYLFFKNGPLGSAHSHSDENSFCLQYQGQPIFIDAGRYSYREI YERYLLKSAWSHSTCIVDGKAPERITGSWEYEYYPHSLFCHHKEREGVHY IEGAYWSAEPDLPYLHRRKILMLVEDVWLLVDDIRCQGQHEVLTQFILDK DVTYQDGKINQLRLWSEVDFDLEDTIISPKSPIIHPKSKNHPESLP >SP_2047 conserved domain protein MWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQVAHYQDYALNKEKLVAF AMAKRTKDKVEQESGEQFFNLGQVSYQNKKTGLVTRVRTDKSQYEFLFPS VKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS >SP_1459 hypothetical protein MREIMLLQLFSLYFESLILTTILVLIFLGIWIGLRAMSGVDKTARARQAH LYDMIMIGVLVVPVLSFAVMSLILVFKA >SP_2200 hypothetical protein MIAQNKKNVKETAYRKEQEISPTPIFYTRSPSLFTL >SP_1314 IS66 family element, Orf1 MELLLYTISKVKLLEDILMPQPIVPVEIPQSRRFDSKKRNDILLKIRIGK LEVSFFQSLNLEMVEQLLDKVLLYDNSSI >SP_0114 hypothetical protein MQRLEVYKNYQHLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKN KKSYDRLAICYVRIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVETH YQPKKL >SP_0451 hypothetical protein MAKSNFEKVESVVGWVRDKKITGYRISKETNAREMSIIALAQGRAKVKNI SFETALGLIDFYEKNYEKFED >SP_0561 conserved domain 
protein MEVVMDNIIDVSIPVAEVVDKHPEVLEILVELGFKPLANPLMRNTVGRKV SLKQGSKLAGTPMDKIVRTLEANGYEVIGLD >SP_0772 hypothetical protein MDFFYNGIAITPNTYLSAWFVNFIAALPLNFLIVEPIARFILSSFQKPFT GEEVEDFQDDDEIPTII >SP_0639 hypothetical protein MMFVIEEVKDENQKKAVVAEVLKDLPEWFGIPESTQAYIEGTTTLQVWTA YQESDLTRFVSLSYSSEDCAEIDCLGVKKLIKVEKLGANCLLL >SP_0596 hypothetical protein MKEANKNHPAGAPTFAKGEGEHANDIVATYSDGTTYYVPLNDVTKYAR >SP_0633 hypothetical protein MLEVGLNFLISLFTFTFDILYPIVKVGHTDDYSHGAVKLSVSLVDKDAMK KIFVTVIGYFEINIDENITDILYVNGTAILYLYLRSIVSIVSAIDSSEAM LLPIINVLELLDKSQPFEEE >SP_0487 hypothetical protein MERIPLSILTFYIPKVPSYSIKEKQSKWLQSGYKSIKTDKAILSSSPIQT ILSVVESHHISLRSRTSLKRRK >SP_0598 hypothetical protein MEVTNESNPKILGLCQQKATEKFYFISDFIGLIGRNFSLFDSDEVQQNSR KQSL >SP_0821 hypothetical protein MKIFVNLDYKKILFVRQKGFYLDMQGQSQLVLD >SP_1762 hypothetical protein MYYFIPAWYGSERTWHADITPWYFSHFRLEFDDTFHQIRLFQEQDIDSRL LVLAYQPHLRYFLYRHGVLEMDTYSVFDVMQDFHNLHTQVLSIRDIEWDD DCEFIYSPFTIIVQKNGKKFAKVEHGVEGFISDIQYFEPNGQIHMHHIVD DRGFVSSIIFFEDGQAAYQEYLNLKGEWQFRERLKEGGQVEVNPILGYRF KMLTYQNMGDLVAEFFENYLQTYVKDQDIFMLPSHSHHDQLVLDRLPSTN PKLLSLFIGRNPQDTFRDLDVTFEKSDLILVDREDSLRLLQELYPERMHQ CYHLSSFDTRLRLGRSQTKKESIIYFQLDFEQGIDNQALLQVLSFVAENK DTEVIFGAFAASQEQMNEVEGIVESFIQENIQSENLGKAIDYGDAENPLE ENQHQDLRLQFVNLNDELDLIKTLEFVRLIVDLNRHPHLYTQIAGISAGI PQINLVETVYVEHLKNGYLLADVTEFSKAAHYYTDRLKEWNESLIYSIDK IKEHTGQQFLGKLEKWIEEVKNVKGT >SP_2039 conserved hypothetical protein MLNKIRDYLDFAGLQYRNPDKAGAEREKMLAFRHKGQEARKVFTELAKAF QASHPEWQLQQTSQWMNQAQRLRPHFWVYLQRDGQVTEPMMALRLYGTST DFGISLEVSFIERKKDEQTLGKQAKVLDIPTVKGIYYLTYSNGQSQRWEA NEEKRRTLREKVRSQEVRKVLVKVDVPMTENSSEEEIVEGLLKSYSKILP YYLATRK >SP_1604 hypothetical protein MAKEPWQEDIYDQEESRAERRHRNHGGADRMANRILTILASIFFVIVVVM VIVLIYLSSGGSNRTAALKGFHDSDASVVQISSSSSSQPEQSSEPESTSS SSEEAANPEGTIKVLAGEGEAAIAARAGISIAQLEALNPGHMATGSWFAN PGDVIKIK >SP_1842 hypothetical protein MGDRYYRALNGSEPDKYLLEKVELYKTDAIELVDVNK >SP_1092 hypothetical protein MVSLQNRKDRAKMFELTYKDCYHVERTLKYEDHEALMLTLSGCVTLPDTL YVTSLTFRGKKVYQGLVGDLYRFLSHADFLHQN 
>SP_0699 hypothetical protein MSYFNDYKHKWEGKNELIFLTAILQNSLIAIF >SP_0070 hypothetical protein MNDKKEVDGEHWPLLIYKDWILVASISDFSIVS >SP_1031 hypothetical protein MNDVAIILETKSEERDISKQIFIDELMKNIDII >SP_0874 hypothetical protein MTPFLAKECKGIPKIKIKNVDLTTFYQGMQKNAKE >SP_2154 IS3-Spn1, hypothetical protein, truncation MKLSYEDKVQIYELRKQGQSFKQLSKRFGVDVSGLKSSESLR >SP_1165 hypothetical protein MANTVKVFLKEISQNKKENPVKTRDFLVKNIFSQTF >SP_0888 hypothetical protein MVVKTRKQGNSITITIPSEFNIPSGVKYEAKLLPSGEIIFTPEELGQQVS YVSDDAFDLNLDKIFDEYDDVFKALVEK >SP_0077 hypothetical protein MTYSHIYQVLFLPKLSIKRWHFLGLVLVDFALSYLSHFELFMVQWKHVIQ II >SP_1133 hypothetical protein MKIVTFKPTKQIDDGFYLPGIDILFVSDKADAKDKEDVILFLSRNGLNKS >SP_0879 hypothetical protein MYHETELATRKGNFSFFKIFLKKQSIINHNQRRECMSNYRRTSKPKTEHI KKGFTVFQKTVATIASILGLITASITIMNALDNNKNIKKEPTTSQTTTIV KEIQKESPKENTSPTKETNTSQEKTQQEETPKSSVKEEKKEDQKTATQDS STPASSKPATENEKQSNAPTSENKSNQ >SP_1834 hypothetical protein MSDIQVNIPGECLYDKVFVLSFIIYNISTNLNIVNGIFYIQAKKDSITFE WKAKEQTRKLAIDSSKPCFEVVDIVK >SP_1455 hypothetical protein MKEILICDIIKRKGLHKKVGRTKIQRRQNRNQGGLAFRFMKGLVNFLGVI ASGAIRDLWRYSC >SP_0995 IS630-Spn1, transposase Orf1 MWYNLLMAYSIDFRKKVLSYCERIGSITEASHVFQISRNTIYGWLKLKEK TGELNHQVKGTKPRKVDRDRLKNYLTDNPDAYLTEIASDFGCHPTTIHYA LKAMGYTRKKEPHLL >SP_2147 hypothetical protein MLLQNLSSKECCIFLSFHTNISLYTDPSDNQMIIGHFLIICIFFEKFFKK ILYPFYRSLCICIYVFYPMLCIIT >SP_1802 hypothetical protein MYMSKAKKICFIIFCILILTIFLPVLIDYHQVSDLGIHLLSWRQNSVVEF YLARYVFWGTVVLSTLVLLSILVVMFYPKRYLEIQLETKNDTLKLKNSAI EGFVRSLVSDHRLIKNPTVHVNLRKNKCFVHVEGKILPSDNIADRCQIIQ NEITNGLKQFFGIERQVKLEVAVKNYQPKPQNKKTVSRVK >SP_1585 hypothetical protein MPIFVYHNIIKDNIIILFVFSHSVSLYKKAIHFEPLFLIYRLCYE >SP_0899 conserved hypothetical protein MKKSRKLATLGICSALFLGLAACQQQHATSEGTNQRQSSSAKVPWKASYT NLNNQVSTEEVKSLLSAHLDPNSVDAFFNLVNDYNTIVGSTGLSGDFTSF THTEYDVEKISHLWNQKKGDFVGTNCRINSYCLLKNSVTIPKLEKNDQLL FLDNDAIDKGKVFDSQDKEEFDILFSRVPTESTTDVKVHAEKMEAFFSQF QFNEKARMLSVVLHDNLDGEYLFVGHVGVLVPADDGFLFVEKLTFEEPYQ 
AIKFASKEDCYKYLGTKYADYTGEGLAKPFIMDNDKWVKL >SP_1611 hypothetical protein MEQTLFELELLPEEDIIVTGLPKYCSFTCLITGR >SP_1189 hypothetical protein MVSGSVFADSALTTVDKANDIVLNVDGNKFYNVSVSEDIVNAGQILEDYF YVDKFGNINLKGTPEELAKNIGISVQEASLMYGAVKELPNVYERGPVGFR FNLGPQVRGMGGWAAGAFATGYAGWHLKQFAVNPVTSGFVAVISGAIGWA VKTAVENYWTVAVATVEVPFVNLVYTIDLP >SP_2043 hypothetical protein MKLKKYSIQGVGKVIFPASFSDEIVQNLAMIGFFEKYGIIVVI >SP_1352 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0115 hypothetical protein MELVLPNNYVALEQEEMMYLDGGGVGRNWWNSRGSFATVLDVDLAIYSGG ATIYSAYAIKKAISANRGAITRTLRSLIIKHVGSAAGHLVNTALNVALTV TGFSLGGAIAYGADWADGSLDGYIFA >SP_1333 hypothetical protein MAERFWENLSIILAERNISWIELTRKMFAGEFHYPSELNRLYQKIRHYKM EQRMPQSPWVERIVQILDLDYEDLFRR >SP_1490 hypothetical protein MLEIDLTVLNDLPYSCFILLYKPETVYIFSK >SP_0429 hypothetical protein MTGTNTFTVLSTEDLEQTSGGLAVWEDGYSRWLYYREFAPYMRQGALNSY IDAWKYGFRAG >SP_1641 conserved domain protein MKKLYRIHFIAIAVIDLLLFAFFITRLETSFEWLLLSGLIFFLAQGLLLF LLVVRLKHQFAEIYPQINKKIRFYYLGVLTIDFLFFVLLAFISSQRFSSL MPIITACHSTFYYMTADYLRENYPDFYDKHISLWECL >SP_1210 hypothetical protein MANLSQGLSLYLMTHHYQAPKSVIDFGLWIAKAPSQERGRLAFLQMLAQT LQGFR >SP_2187 conserved domain protein MYLGDLMEKAECGQFSILSFLLQESQTTVKAVMEETGFSKATLTKYVTLL NDKALDSGLELAIHSEDENLRLSIGAATKGRDIRSLFLESAVKYQILVYL LYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEH QIHYFYFCLFRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLD LVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLRLLRKVPSFFAGQH IPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMK KEELLGDYTEDHVTYELSQLCAQVYLYKGYILQDRYKYQLENRHPYLLME 
HDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQHMRIGL DLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKE QTPVYYLKNDLDMEDLVAIRQLLFT >SP_1621 putative transcription antiterminator BglG family protein MSRKQEQMETLLLLLRDSKDYISAKVLGEKLNCSDKTVYRLVKGINKDCP VEAFILSEKGRGFKLNPRSSLVDVDGNFTEAFDPEVRREKLLERLLLTAP KPHSIYDLGEEFYVSESVVLKDRQILQESLAIYGLDLKMRQRKLFIDGDE AQIRSAILNLLPMFNQLDLEQITQNKVQPLDGELAHFCLGLLITLERELG VNIPYPYNINIFSHLYIFISRNRRSTSIHVVAPSKPTIVDEKIYSVCQKI IQEIEQYFRMKVDAVEIDYLYQYVVSSRLQKPFSSGKLPFSQRVLDVTHY YFSRMCMDNREIETTDPDFVDLASHISPLLRRLDNRVQIKNSLLSQILLT YPNLVKELTTISKEVSLVFGFASLSLDEIGFLVLYFARFQEKRARPLKTV VMCTSGVGTSELLRARLEKQFSELDIIDVVAYHQLDELINLYPDLDFIVT TVALQEPASVPFVLVSVFLTEGDKQRLQAKIQEINYE >SP_0612 hypothetical protein MKTDLHFENHQKNGIMVGKDSAESIRTFRIRG >SP_1728 hypothetical protein MSVNLLTLLFIPVMVSSSGSEFQSGWQEHQLIAEKVSKTLDKTFDKDVRE IPTSQFYQKFVDEMGRTYSGNLILQELITVNGAYKATYIGELSSN >SP_1775 conserved domain protein MRAQSFFLTFSFIRSKIKLALNKGVLNMIEITYIDASKNERTVTFESYED FERSQQACLIGVADYYPVQKLTYKGHNLDYHGTYGDIFFYLMKQDLSQYN >SP_1188 hypothetical protein MNHSFKKITVFCFIVSCVLCLLDLMNFKNVATFLFFCLPVFVLIYKNK >SP_1006 hypothetical protein MICLAQKTFYFFLAICRRLLVAIYHVLLKQESYNPRLQGLTEIRNPDKTM FVQDAIRFAQQHGFNML >SP_1036 hypothetical protein MFIISPDLFNIAVILYILFFIHDILLLILS >SP_1140 hypothetical protein MAGNENDNLTSKQIKFIDAMLTEPTIDKACQKAGVSRATGHKYLKVAAVK KTLRIKQDEMMDKTTQMLYLASSNAVSVLNDIMMDSKVNPFIRTQAAKAI LEQSYKTHEIFGVVRQIEELRLEIEEVSKGNQRVTRTQGVIK >SP_0191 hypothetical protein MKKIVLVSLAFLFVLVGCGQKKETGPATKTEKDTLQSALPVIENAEKNTV VTKTLVLPKSDDGSQQTQTITYKDKTFLSLAIQQKRPVSDELKTYIDQHG VEETQKALLEAEEKDKSIIEARKLAGFKLETKLLSATELQTTTSFDFQVL DVKKASQLEHLKNIGLENLLKNEPSKYISDRLANGATEQ >SP_0800 hypothetical protein MPVRKLQSYEVDYQEELNQQLPHYQAYTPEAQSDANLKEILFFINIAVFC ICIAIFSFIFLALKLSTALAFAAAIGFSLLVLKVQRSIIKRKRRR >SP_1755 hypothetical protein MIEILIVLAIILSLALIVLVTIQPRQNQLFSMDATSNIGKPSYWQSNTLV KVLTLLVSLALFILLLTFMVITYK >SP_1300 hypothetical protein MSISPRFETLEQAIASKDLEKVREAFKKMNSTWTINESVVRDNSIAHYGR VETAISFLPSSMEIEPTDESGT 
>SP_0449 hypothetical protein MNITNLFSIKTGCDETDRQLQKLFFQLDLQLGELTDQLRKLDSNFVPRSQ FVDTLDLNDVEYKEILNYFIFHRNDSEESLVEWLYDWISTNRYELPKEFS IRMAHKYHESVTEVFGDE >SP_1962 hypothetical protein MIVGSNPSTAFYNLIYQVSNEQKAQFEGLFLFSLE >SP_2071 hypothetical protein MEHLVVLSFIFSKFLECGILRKRLNSDKNWRNSTKKLEKNEGKRYDRKEE ILEEEHVTY >SP_1820 hypothetical protein MAPRYQRPHTEVYSVCGLFSIRRLVYLLLGP >SP_1048 hypothetical protein MWLIILWNAKPDTPLFNFKDEVIKYKTYEPFESSIKRVNTTIKNGSKGKT LTEMINGYRADNDIRDEICNFNILKNKIRDMKNQQGNTMESYF >SP_0093 hypothetical protein MVKIRTRENMDIYILVPKKPLPSPDQPEESSDSYFRS >SP_1004 conserved hypothetical protein MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTK DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII AEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPH GDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKV LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT VPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDALVRVFDEFHGN AYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE VPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSE TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP EAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK FTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA >SP_0018 hypothetical protein MIMLQKIYEQMANFYDSIEEEYGPTFGDNFDWEHVHFKFLIYYLVRYGIG CRKDFIVYHYRVAYRLYLEKLVMNRGFISC >SP_0635 hypothetical protein MLRMIVSKQYLLFYLIHEKEVHTLRIINSRIDYLNQLDHLFRTCRKLFSS QIISL >SP_1492 cell wall surface anchor family protein MVPKTATSTETKTITRIIHYVDKVTNQNVKEDVVQPVTLSRTKTENKVTG 
VVTYGEWTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYD RLSTPEKPIPQPNPEHPSVPTPNPELPNQETPTPDKPTPEPGTPKTETPV NPDPEVPTYETGKREELPNTGTEANATLASAGIMTLLAGLGLGFFKKKED EK >SP_1579 hypothetical protein MYAFDSSLSNNRLELSDNIILCYNEEKTEVFKCQKQVISF >SP_0322 glucuronyl hydrolase MIKKVTIEKIKSPERFLEVPLLTKEEVGQAIDKVIRQLELNLDYFKEDFP TPATFDNVYPIMDNTEWTNGFWTGELWLAYEYSQQDAFKNIAHKNVLSFL DRVNKRVELDHHDLGFLYTPSCMAEYKINGDGEAREATLKAADKLIERYQ EKGGFIQAWGDLGKKEHYRLIIDCLLNIQLLFFAYQETGDQKYYDIAESH FYASANNVIRDDASSFHTFYFDPETGQPFKGVTRQGYSDDSCWARGQSWG VYGIPLTYRHLKDESCFDLFKGVTNYFLNRLPKDHVSYWDLIFNDGSDQS RDSSATAIAVCGIHEMLKHLPEVDADKDIYKHAMHAMLRSLIEHYANDQF TPGGTSLLHGVYSWHSGKGVDEGNIWGDYYYLEALIRFYKDWNLYW >SP_1931 hypothetical protein MMPANTKVIFQEMFADFQNYYVLIGGTATSIVLDSQGFKSRTTKDYDMVI IDEVKNKEFYTTLNHFLELGEYQGSQKDEKAQLFRFTTTNPEFPSMIELF SILPEYPLKKDGREIPLHFDQDASLSALLLDEDYYNILVHEKETIQGYSV LSNCGLYSSKISSNHVSFHLQPQNSVLSSLQLAS >SP_1494 hypothetical protein MLNVDQDFMSISKSNKSGSDWKKTFTVRITNRLANDLNNVLKQVDKDTPN TPTWLNSAASKAKDDDRVYKLLKTLIPGENYLSC >SP_1120 hypothetical protein MFLLYYLFREDSSKLLYFFNYFENLQQVHLLVQL >SP_0682 hypothetical protein MVYLVLGILLLLLYVFATPESIKGTVNIVAMVCILVALLILLVLSFLKIF QLPTEIFLAIAMLILAYFSVRDITLMPVKKSKRR >SP_0958 hypothetical protein MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLE SRIAANERAINENEEKLSQMSDTSSEEYQFAKNNLDVQKNLLTRKTEILT LLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALY PLNIKAHTLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNH LDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLVGSLISGFGQL DYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQ KMPVLFLSLIGIVGLLFGIQTIQPLQRIAHLIPFTYLRSVEILSGRLPKQ IDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRF >SP_0190 hypothetical protein MQKHVCVQLYKHQGWEHFPVLFYFNFKKKEERNEKNSSC >SP_1947 hypothetical protein MRKKRGIKKLVTFALLGVFMFSNTIPYQQFIQKNKQLEIRVQSQKKSNGL DVGKAD >SP_1224 conserved domain protein MTEHLKSNTMVLPLKKGAQKMTTITLKVSEADKTFMKAMAKFEGVSLSEL IRTKTLEALEDEYDARVADLAYQEYLEDLEKGVEPITWEEMMHDLGLKDE >SP_1305 hypothetical protein MKEFLENFCFFFTVKKNSAIMSYVIKYDNKRRTI >SP_0475 hypothetical protein 
MQINAILKKKKLLLEGNKMVIRVFDQQKNTYSSFALEELSYYMNRVFKTN IELVEEKEADIFVGLVNKEDRKDHVLISLDKGKGRIESNTIVGLLIGIYR MFHEFGVVYTRPGRRHDFVPELRFEDFLDKQLSIDETASYYHRGVCIEGA DSFENILDFIDWLPKIGMNSFFIQFENPYSFLKRWYEHEFNPYLNKEQFS NELVQELSDRLDKELQKRGLIHHRVGHGWTGEVLGYSSKFGWESGLSISE EKKPYVAEINGKRELFNTAPILTSLDFSNPDVADKMVEIIKDYAKKRPDV NYLHVWLSDARNNICECENCRQELVSDQYIRILNQLDRALTSEGLDTKIC FLLYHELLWAPQKEKLDNPERFTMMFAPITRTFEMSYADVDFDNSIPTPK PYMRNKIILPNSLEENLSYLFEWQKAFKGDSFVYDYPLGRAHYGDLGYMK ISQTIYRDVSYLSNLHLNGYISCQELRAGFPHNFPNYVMGEMLWKKTRSY EELIEEYFSALYGENWQSVVEYLEKLSIYSSCDYFNAIGSRQSDVLANHY YIAYNLADNFLPIIEENISKLLNSQKDEWKQLSYHREYVVKMAKALYLQA TGKTRQAQDEWRNVLNYIRGHELLFQSNLDVYRVIEVAKNYAGFHL >SP_1642 hypothetical protein MKKIIFIKTIQLLVIDGIMLAFLTFKRGLTWDWILIYSGWLIFFHPVLLT YLSNQLCDHFS >SP_0309 hypothetical protein MDKKERQKIEQQRREMALTNTFFNRYLLLRYSIALFFFGNIYWLLSQFIS PSPIIIFPIMLIVFSILATVEQFKLYGNRKEKLGITLMFVRIQMLISIGL LVLTWTSWFKNLFPIFENNQVARLFVFVVLLLGLVLSLLDIRRIKKIYKR TDKAYQQFVQLEKNSLSL >SP_1528 hypothetical protein MIFWKKQLTKQTKSCIIKQIFKAGGSGNEKV >SP_0368 cell wall surface anchor family protein MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLP ADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKP AEEKPKEDKPAAAKPETPKTVTPEWQTVANKEQQGTVTIREEKGVRYNQL SSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLK FKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITL KSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVS VKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRV KEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDD AHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDESKLLSSISF LGNALVSVSSNQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYG FVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQ WEKAYKGIVFPEYTKELPSAKVVITEDANADKNVDWQDGAIAYRSIMNNP QGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLK GYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASET YPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDL KKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHG GEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGA ANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWEN 
GTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTV TLNGRVIQDGSAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSD WAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTNPE MSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKE KVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSL ALNYVKAYAHNTRRDNATVDDTSYFQNMYAFFTTGADVSNVTLTLSREAG DQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVE GVEDNRTHLSEKHNPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQ TIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLE MHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNA NFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTKESMDAL KEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTA PAQAQEGLANAFDGNVSSLWHTSWNGGDVGKPATMVLKEPTEITGLRYVP RGSGSNGNLRDVKLVVTDESGKEHTFTATDWPNNNKPKDIDFGKTIKAKK IVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALVKAQKLT DKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKPDAPT VEKPEFKLRSLASEQGKTPDYKQEIARPETPEQILPATGESQSDTALILA SVSLALSALFVVKTKKD >SP_0258 hypothetical protein MMELVLKTIIGPIVVGVVLRIVDKWLNKDK >SP_0705 hypothetical protein MFGCLWYIFSTFRGLCIMKQFVQFYKKDFLAVLVYFILLLSCVLSSTVYL LRCRQYSIHPNVLEWILVLLQDMTTGVYCFPFTYILFFFYLMNNYFNRLE CRIRLKSIKHFTSFSFKLAALSTGIWTATLFLLIFLIAFSNGFSFSLEIK EVDFLREFYGISIANNASFFIGFFFSYIAYYFFLSLLTISSFSWFKKSNM SLVFLFTFLFVESLFWIYQLDNGIIGLLPIFQYMVNSNPYALIYWLTLLS IIIPLTVFSVHRNWRRV >SP_2182 hypothetical protein MFVFADDSLSANSKVVSGEAQFENGSSVRFGDTQVNILSDEVLEVVNPDG SVDTIERRADGVYINGAFYMAYQKNEIDLNISFRSYDPNVWNYVNTIHGN KQANTFANFMTGAGISYMIGRIGALLGGPWGAIIGGAYFGIQAYQSYLDS QSPYPYYITSTYIHVAQRKWKFITEYYRNSNYTGYVKTVTTYVNF >SP_1103 hypothetical protein MGLMAMLLITIRRENQALVNNKDYPLEMKGTLEIL >SP_0098 hypothetical protein MRKWTKGFLIFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEK LTFGKEVENLEITLHQHTLTITDSFDDQIHISYHPSLSAHHDLITNQNDR TLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRLPKGRTLKGIN ISANRGQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNIFDT VLTDSQLESTENHFHAENIQVHGKVELTAKDYLRIILDQKESQRINWDIS SNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTP SRR >SP_0503 hypothetical protein MPDIVFEIEFFHSDSLIFFYTSDDKSNSQKSQEDFSKNK >SP_0654 hypothetical protein 
MNPVVKKIKEDVRGITDLPHPIFTGFDCLKYNQ >SP_0414 hypothetical protein MSVKLFHCQSISFLGAESKKKSTESVSADKKGSL >SP_2025 hypothetical protein MRGLVLSAISNQCFELNGTRFLVCSLILIGP >SP_0125 hypothetical protein MTNFDILDNQFLSLSENELSDIDGGLAPLVIFGVAVSWKAIAGGTALIGS GLAAGYFLGGD >SP_0472 hypothetical protein MLIINRFFSLFVLDWYRTELWINTLVSYPVPKYVGRKMKQLLRVSVLEKI LSIEFFK >SP_1843 hypothetical protein MVSITTYQNNQVSNNKFQTSLHFIEVVSKDL >SP_0792 hypothetical protein MKNVELKEKNMTFEEILPGLKAKRKYVRTGWGGAENYVQLFDTIEQNGLA LEMTPYFLINVSGEGEGFSMWSPTVCDVLATDWVEVHD >SP_1965 hypothetical protein MRQVMKMNKKSSYVVKRLLLVIIVLILGTLALGIGLMVGYGILGKGQDPW AILSPAKWQELIHKFTGN >SP_1949 hypothetical protein MKNDFVIGKSLKELSLEEMQLVYGGTDGADPRSTIICSATLSFIASYLGS AQTRCGKDNKKK >SP_0650 hypothetical protein MILTLVVCIILTKLFRLKKLGRNFADLAFPVLVFEYYLITAKTFTHNFLP RLGLALSILAIILVFFFLLKKRSFYYPKFIKFFWRAGFLLTLIMYIEMIV ELFLMK >SP_1265 hypothetical protein MSVRKNKFFKSRDYKACLRKNSKTLTNKNKMVIIKNGLK >SP_1493 hypothetical protein MPMNIILIAKLLRENTNTKANALNNGWARSGSEEFKKFSHFVGVDKGIVR TNVLTGKKLSDKIRKEVGSGDSKLGKGGYFSTGDVLLGKDVVSYTVQVFS ENNERVGVNTQSHRVQYNLPILADFSVIQDTVEPSRTVVEKIIPKLNIPE EEKGKITEEIKKKKKTSELAELISENVKVRYVDEQGRLLSLKNDTGIGEK ESDGTYITNKKQLIGTSYNVTDKKLSSMTTTDGKYYTFKEADTNSASLTG NIVSEGRTVTLVYRESEAPTTATVTANYYKEGRQEKLVESVIKADLAIGS EYTTESKTIEGKTTTEDKEDRVITRKTTYTLVATPENAYQKTVQQLTITT VRMLRKQWFPKQQPLLRRRL >SP_0773 hypothetical protein MKIYFLKKWENIDSKRILNHIRMGVFKIMFQW >SP_0170 hypothetical protein MGRRFCFICSLKKVTAVITDDSTEQNYEELEIYTQVIV >SP_1703 conserved domain protein MAHLKSFITRYSKVYIGLVLLIWLSFFFIPWDKPLLGIRIDIFIIQKILL AFGILSILMALLSKKVSLFVFGLICCLSLWINLFITFAILPIFGN >SP_0461 putative transcriptional regulator MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQ ETFEEELTFNLDTQQVQLIEHHSHQTNYYFHQLYNQSTILKILRFFLLQG NQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHF SILVALTWKRREFPLEFPESKEFEKLKNLFMYPILMEHCQTYLEPHANMT FTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPL 
YHISKAIVQEWMTEQKIEGVIDQHRLYLFSLYLTETIFSSLPAIPIFIIL NNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAM IAKKAHYLL >SP_1476 hypothetical protein MVGHFLDDFDGYDSYIWFEEGMVEYISRKYFLTEEEFQAEKICNQSLVEL FQKKYSWHSLNDFGSSTYDKNYASIFYEYWRSFLTVDKLVENLGSVQAVL DSYHLWANTEKTFPLLDWFVQQKLIEKEI >SP_1835 hypothetical protein MSEIVETRVFFCFQNIVQNFVPVFHVLVILLHDRIYSMLIL >SP_1080 hypothetical protein MGDKPISFRDADGNFVSAADVWNEKKLEELFNRLNPNRALRLARTKKENP SQ >SP_1454 hypothetical protein MAHGDLLYHDGLFFSAKKEDGTYDFHENFEYVTPWLKQGD >SP_2177 hypothetical protein MMKQRKELYLFLGRTALYFLIFLGLLYFFSYLGQGQGSFIYNEF >SP_0455 hypothetical protein MKKWTFSRAFCRALKSSPNHQIEIRNSLDKTIDFSYSLSFFYLPLYHTFS VYGSSL >SP_2160 conserved hypothetical protein MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSD RNNPDSLLHLKKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELY IEHIDIQSCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFTSFNKNI LCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGRKGVQ FKVVCHSKVTDGEVSVLGETIVIRNATEVFLYLKSMTDYWGNIDISSLQG EFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCLSIPTNLLLENTKKY SNYLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTININ TQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNT DGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEHFEMIK EAFLFFEDYLFEVDGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQIL RYFCDSCIGIAKQLGDNSDFISRVKELKKKLPKTKIGSNGQIQEWLEDYE EVEPGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQ EREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLNN ATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPALPSAWSEG EVKGFRVRGGYKVSFAWKNGDITFLKLEGGNKDQKVRVRIYGKNTDVQNI ELVFNSEKIIELNF >SP_1660 hypothetical protein MIQYRDLKHRKNLLQFYKKYSENNILSLYFQRAGDGVSPVIPIDKIILSK TQV >SP_1818 hypothetical protein MRMMFSSLVSDEPNFTAFSACFQQPQALCERKNCNFSIYYFLASSSLQSQ LGPCLHDQRH >SP_1093 hypothetical protein MAFEKIIQLKNCRYDYTLSPSVKKFTLKDNTFFETKVGNYELTRLLEKVP NSGEGFQLKIIINKELTGAKINITDKFGLRLVDIFKSEDHHIHQEKFYFL MDSLVERGVFTKSER >SP_0900 IS1381, transposase OrfA, truncation MNYEASKQLTDARFKRLVGVQRTTFEEILAVLKTAYQLKHAKGGRKPKLS LEDLLMATLQYVREYRTYEEIAADFGIHESNLIRRSQWV 
>SP_1912 hypothetical protein MNGMKAKKMWMAGLALLGIGSLALATKKVADDRKLMKTQEELTEIVRDHF SDMGEIATLYVQVYESSLESLVGGVIFEDGRHYTFVYENEDLVYEEEVL >SP_0511 hypothetical protein MKIKNTFQLKELPPVTNFEKNGSRAKKRNKIGSIRIVKIRLA >SP_2063 LysM domain protein MKKRMLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYT VQYGDTLSTIAEALGVDVTVLANLNKITNMDLIFPETVLTTTVNEAEEVT EVEIQTPQADSSEEVTTATADLTTNQVTVDDQTVQVADLSQPIAEVTKTV IASEEVAPSTGTSVPEEQTTETTRPVEEATPQETTPAEKQETQASPQAAS AVEVTTTSSEAKEVASSNGATAAVSTYQPEETKIISTTYEAPAAPDYAGL AVAKSENAGLQPQTAAFKEEIANLFGITSFSGYRPGDSGDHGKGLAIDFM VPERSELGDKIAEYAIQNMASRGISYIIWKQRFYAPFDSKYGPANTWNPM PDRGSVTENHYDHVHVSMNG >SP_2105 hypothetical protein MNKLMKFISVFLTSIVLIVSAIPSVSAVYASEQVSQIETNMELQPVTSLT EEQINTLANEIQSFHPDVSQQWIKEVINRQLQGDYTIPPTYSPFRAVWQG ITVNQMGALLDTAIALALGGTTAGLANLIKVKGKHAAKSAIRSAISRYLG SWFVNDVALEFAMNLLSPGTYLAQLWDKNDAIPNNGRINF >SP_1039 hypothetical protein MDNDWNGLADLIANLIAKYAGALDLDNLPDPTPAKNQEMKNSFDMAKTQI ETD >SP_1789 hypothetical protein MEMMELPSQEILIFTKQIRHWILSDQVISGKRKLFFREDTPKEILDLYEN IKSKLDFAYQEVHSNNGLKKYEK >SP_1495 hypothetical protein MYQNEDLYKKGLNVELAHQQIKGFFEAEFKNRINGVLNTKIKNSTLNRVN KKTIHQSNKNSMINLKQKQRKMLKNKAILC >SP_1629 hypothetical protein MLIYETVALVGMDSGISIKHILQKMKNKKLSQNP >SP_0815 hypothetical protein MLLCVLLLKDLLDFLSNRVFTQFSYLIGIEIPINFSE >SP_0634 conserved domain protein MFSSYFNPLYIMVSNLHQNDKINQLISDYKQNMKAFYITIEKFIRDDESL KCYFIKVISSRSKVTSLDQIEADKTIQRKYSSELKKFIGFYNEIICEENS FLHVRKRWSSWFR >SP_0534 hypothetical protein MVSIQYKEESMFFMLAFLIFTIQEVLMTIYDLSDPRSK >SP_1705 hypothetical protein MFKNFNNILLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVIFFKRELIS IFSYNDYSEANLEIPKLLLNLSLFMVGWLSVILLESDLADHYHHLIRYQS SSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMM LYLLLSYLIALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVM LIVSRLEEKFKKG >SP_2004 hypothetical protein MKRGIIYFFIGLSLLVWLVEMFTGWFDQTLLRQFIRGALGFGFMIFVVFL MRMEWLKGEYHEYD >SP_0087 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVLFCLMGKSYKEIIELL >SP_1138 hypothetical protein MVSLPHLVYMVVESMAITSQRAISHPMKSVYFCLGL >SP_1718 hypothetical 
protein MNLFLIKMLSETISLFPEVLEEENFTRKKELLK >SP_1150 hypothetical protein MEFFITLYYTMISLVNLLKYLGHPFFSSSSAKS >SP_0025 hypothetical protein MKKSNILFIFILLLCIGLQYETIYYTDGSRSGAEYGLMGVSIFLALFYMI PALYFLFRIGKNGNCQRRF >SP_1339 hypothetical protein MAAEVLNLQLVSVQVDETDEVDGMRFSTFSTNRCGNWSAFSWENC >SP_1904 hypothetical protein MKLKKLLKDDTKVFEKSTFKFVEGYKIYLTESKESGIKQMDNVIKYFEFI ESKSIALYFQKRLNELID >SP_1971 hypothetical protein MLMDKTFLHRQLLKNLINVLYTYFQEKKRENLKKISVTQNTDFIDLLVIA TKDT >SP_1216 hypothetical protein MKYRKRFLKPKVCDIIIKKKDLGVYYGFFGIYLKD >SP_0196 hypothetical protein MERPVNIFTPTPRNGEELERPVDVFSPYSHS >SP_1668 hypothetical protein MNLWDIFFTTQATEPPKFDLFWYVSLFTLLALTFYTAHRYREKKVYQRFF QILQTVQLILLYGWYWVNHMPLSESLPFYHCRMAMFVVLLLPGQSKYKQY FALLGTFGTLAAFVYPVPDAYPFPHITILSFIFGHLALLGNSLVYLLRQY NARLLDVKGIFLMTFALNALIFVVNLVTGGDYGFLTKPPLVGDHGLVANY LLVSIVLVATISLTKKILEFFLAQEAEKMIAKEA >SP_1930 hypothetical protein MRERSATGAQGLSKSIKKHLNDLTRLTASLLGDEKLSAITSSSAVKADMH RFVIELEPVKSTILQNNDISLDQNEIFEILKNFLDG >SP_0174 hypothetical protein MVKHNFDVTDKTGKISSKHCFEITDKTDVV >SP_1058 hypothetical protein MRSLFRKIVALLVIGLILLGTAGGTQVHKMARGIDPGPANGIYR >SP_1432 hypothetical protein MSSKLLKAKEQVKSQDKDKKSILGQIKSFKTDDKSKSNKKDHSKGAER >SP_0428 hypothetical protein MRLERSHFLWKNVIQVEKLPVEEMGYKIDKKRNIL >SP_0244 hypothetical protein MSHSFKKSLQKEILHRSSIAAFVTSRAFSDTVSPI >SP_1181 hypothetical protein MVNFSCLSILSPRFLLLSYHKLAHHSNTKTTK >SP_0621 hypothetical protein MSVIEKLNHEKSLQALSNYGRMEAVELEKEIDYEIS >SP_0714 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1938 hypothetical protein 
MKRTYRDCKNILLKIFLVWGVIVDRMQTLSVLFTVSK >SP_1562 hypothetical protein MFNVKWTRIGKKMPENGIIGRKCFDREDIRDEFSTIIQSAILDQFVCKSM DDSYQSD >SP_1694 hypothetical protein MIFKAFKTKKQRKRQVELLLTVFFDSFLIDLFLHLFGIVPFKLDKILIVS LIIFPIISTSIYAYEKLFEKVFDKD >SP_2005 hypothetical protein MFSGLDESFYHFPWELFAGFGMMSWLVREGLKLVGDVKKELEE >SP_1932 hypothetical protein MTLKDDDDPRIEEESEALENMILQYLGEDDAS >SP_0133 hypothetical protein MVTMQYSCGKININIPDGYGDIKDIVFSAHIIVRYNNGHCGGIDPHIIGL CKKQIRRMSLYPILIIVSRDSKVIDDYKNLDIAYVDCTQCSNNFETALHV KNILKLLKIQLIHCHGYSTNYFLYMLKKLDKNGFGKVKTVITCHGWVEYN LKKKFLTYFDFWTYSMGDAFICVSETMKKKIGEYNKK >SP_1708 hypothetical protein MSVMEHLFKFLLLAPYFYFDNWIEKANRNSKFFPIFYYFYWFYIPFYSLF SLAWTVVSVLFFNTVLRNVTDIKLWGIWFLFILLAIGMNWLTYSCFKEMF RLRQELGKSKGGRH >SP_0367 hypothetical protein MFPIAFSAIICYIKSIIIFYLSELSIALSEEGVFFKKKM >SP_1175 conserved domain protein MILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGI NAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYKLKD EDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHRE GGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHY IPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNP GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPA QITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDS RPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEEKGI SRYVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFYDKAY NLLTEAHKALFENKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAP ITHPERLGKPNSQIEYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDA YVTPHMGHSHWIGKDSLSDKEKVAAQAYTKEKGILPPSPDADVKANPTGD SAAAIYNRVKGEKRIPLVRLPYMVEHTVEVKNGNLIIPHKDHYHNIKFAW FDDHTYKAPNGYTLEDLFATIKYYVEHPDERPHSNDGWGNASEHVLGKKD HSEDPNKNFKADEEPVEETPAEPEVPQVETEKVEAQLKEAEVLLAKVTDS SLKANATETLAGLRNNLTLQIMDNNSIMAEAEKLLALLKGSNPSSVSKEK IN >SP_1439 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS 
RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1158 hypothetical protein MLEIDLIVLIVLSYFYFISLYNCYCFLFHDFNLTVQI >SP_1925 hypothetical protein MSQSSYLSPLLWLKKEADKEKMSATQCQIFFFYYQMFELLFARESDMKDL CLGTKGFYFSQLEKNLLSGVSRFLKNLEGKVTLKANQEVSARKALFLALT TSQSDWQELAPVFDFYQTIGRLENPSLLSSQDRQHLMWIYQSALEKDYIV KVIGDKHFVLKRQDATKLTARQTQTLEILSQSEDLVNPVYVTLGEKGVLL LD >SP_1380 hypothetical protein MLVEKRRLRMRLKVIKKLVDINILYSSQEANLANLRKKQAKNPGKKVNVS ARVLSSYIFSSLLMIICFSNIAIHFPFEEIPIYFSSMIAILLVIAFSTSL TAFYNVFYESKDLVSYRPYAFKESEIIIAKGLSVLLPALTGIVPILAYFL VLYIRLAPSLWLGLPLMLLSLTLLFVSVALVMVVAVHFLAQTRVFRKYQS IFSNVMIGIGVLIPLIFIFFLQSTFGSIVDKVRDIPFLLYPLHIFYKIAV EPFSTEALVGLLAWIGLTLFLLYLTKKKVLPRFYDVILLNSEEKVKKERR SKERISTTKKGFFRMVLRYHLTLLGQGTGVITVLFTSAFLPYLMMIGLIS KIRDSQIVPDIHPPYWLPLFFIALFIAVVNNNITSLHSIALSLERENVDF LKSLPFDFARYVKVKFWIIYAVQSFLPVLTLLGLSLYLGLPIISMIYLLV VWIIASVILSCHHYFKDVKNLSTNWSSITDLVNRSNGIVAIVLLFIYSAI LMALVIGSIFLVQSLSPILAISLGVGALIVLLALAIFGYHYYLSRILAEI EKR >SP_0548 hypothetical protein MPLFPGMGVSVSSLSPKSISFLRIKFPYYSTSFWPIRQPL >SP_1217 hypothetical protein MLEIDLTVLIDLSYSYFILLYFYENLKKSVKSWISLICINKIVYMVD >SP_0833 hypothetical protein MYMTGHSLGCYLAQIAAVEAYQKYPDFYNHVLRKVTTFSAPKVITSRTVW NAKNGFWDVGLESRKLAVSGKIKHYVVDNDNVVTPLIHNNRDIVTFTGNS RFKHRSRGYFESPMNDIPNFNIGKQATLDKHGYRDPKLDKVRFFKKQALP RSSSQPSAEPMENIASGKQVTQSSTAFGGDARRAVDGKVDGNYGHNSVTH TNFQSKPWWQVDLAKEETIRQINIYNRTDTAQDRLANFDVILLDSSGKEI E >SP_1628 hypothetical protein MIIMRRFYSHLPYYLVILFFYWPLYELFLLVVSDPLTLKGLYINNLLFFT PLVILIVSLLYSYRFRFSL >SP_1785 hypothetical protein MATYGFLDILEEELDKNFPFDFEISWDKRNHAVEVSFLLEAQNAAGVEMV DEDGEVSSDDILFEEAVLFYNPAKSTVNEEDYLTVIPYLPKKGFSREFLA YFALFLKDTAEVGLDVLMDFLEDPEAEEFVMEWNQEVFEEGKIGLEKGEF YPYPRY >SP_1595 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG 
QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1679 hypothetical protein MVSSKYAARASFLDGQGITVDEMAWIIRGIVNALIGRYIKLGTYAAKYGI SMARSILSRVAATAAARVGLLTKISGWILRVAVNVADVYGNFANNIAAAW DAYDKIPNNGRINF >SP_2118 hypothetical protein MMNKYKVIYYVVVIALLVSVFLLIGMDLSWFNPYQSDQFVWVYFALIPVI EWIEKKSKNLASEKGE >SP_1844 hypothetical protein MKFYSYDYVLSQIGQQNGIMVGFGIVLLAVTVFFAFKAYHNKKGSEFREL VMISDLALFSSAFGQHHDLSKQSSF >SP_1091 hypothetical protein MQIFYIKTKIFLSFFLFLLIFSQCFYKIEE >SP_1350 conserved domain protein MKRITANQYQTSERYYKLPKLLFESERYKNMKLEVKVVYSVLKDRLELSL SKGWIDEDGAIYLIYSNSNLMALLGCSKSKLLSM >SP_2159 fucolectin-related protein MNKEKIKRKLITILFVCIGMLCFGLLAGVKADNRVQMRTTINNESPLLLS PLYGNDNGNGLWWGNTLKGAWEAIPEDVKPYAAIELHPAKVCKPTSCIPR DTKELREWYVKMLEEAQSLNIPVFLVIMSAGERNTVPPEWLDEQFQKYSV LKGVLNIENYWIYNNQLAPHSAKYLEVCAKYGAHFIWHDHEKWFWETIMN DPTFFEASQKYHKNLVLATKNTPIRDDAGTDSIVSGFWLSGLCDNWGSST DTWKWWEKHYTNTFETGRARDMRSYASEPESMIAMEMMNVYTGGGTVYNF ECAAYTFMTNDVPTPAFTKGIIPFFRHAIQNPAPSKEEVVNRTKAVFWNG EGRISSLNGFYQGLYSNDETMPLYNNGRYHILPVIHEKIDKEKISSIFPN AKILTKNSEELSSKVNYLNSLYPKLYEGDGYAQRVGNSWYIYNSNANINK NQQVMLPMYTNNTKSLSLDLTPHTYAVVKENPNNLHILLNNYRTDKTAMW ALSGNFDASKSWKKEELELANWISKNYSINPVDNDFRTTTLTLKGHTGHK PQINISGDKNHYTYTENWDENTHVYTITVNHNGMVEMSINTEGTGPVSFP TPDKFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTR ADNPSWWEVDLKKMDKVGLVKIYNRTDAETQRLSNFDVILYDNNRNEVAK KHVNNLSGESVSLDFKEKGARYIKVKLLTSGVPLSLAEVEVFRESDGKQS EEDIDKITEDKVVSTNKVATQSSTNYEGVAALAVDGNKDGDYGHHSVTHT KADSNAWWQVDLGEEFTVSKVDIYNRTDAEPQRLSNFDVIFLSSSGEEVF RRHFDKVVDGLLSLKVPSVGAKLVKIELKSAAIPLSLAEVEVYGSKRTPK KLSNIALTKETRQSSTDYNGFSRLAVDGNKNGDYGHHSVTHTKEDSPSWW EIDLAQTEELEKLIIYNRTDAEIQRLSNFDIIIYDSNDYEVFTQHIDSLE SNNLSIDLKGLKGKKVRISLRSAGIPLSLAEVEVYTYK >SP_1630 
hypothetical protein MKVEPRCDVLSRMSHFFIRILIMELQELVERSWAIRQAYHELEVKHHDSK WTVEEDLLALSNDIGNFQRLVMTKQGRYYDETPYTLEQKLSENIWWLLEL SQRLDIDILTEMENFLSDKEKQLNVRTWK >SP_0518 hypothetical protein MSVLDEEYLKNTRKVYNDFCNQADNYRTSKDFIDNIPIEYLARYRELY >SP_0685 hypothetical protein MSRWDGHSDKGEAPAGKPPMHGFGLNGENK >SP_1488 hypothetical protein MKSIKEEIQTIKTLLKDSRTAKYHKRLQIVLFRLMGKSYKEIIELL >SP_0996 IS630-Spn1, transposase Orf2, truncation MVAGLTNGELIAPMTYEETMTSDFFEAWFQKFLLPTLTTPSVIIVK >SP_1037 putative type II restriction endonuclease MKIHCLKLKNKELNKEVAFYLTSIIRQALKNTEYKDQISSTVLPDIKIKL PIDSRGTPDWNYMERYRDR >SP_0444 hypothetical protein MNVIFVFIKKIPISFTKKKKELISQSFINLIPH >SP_2115 hypothetical protein MKRVILLAVIQAVVLFFIIGALAYAFKGDFFYNYLAVVFAPIAGVLRFGT AYITEIVLPRKAAEIAEKRKAGKNSK >SP_1723 hypothetical protein MQAFYVKKEEKLSKDYHKINTGNQSNFYENVKDNEIKYFLTKVSNLFFKE FLMKQSKTINLKLAH >SP_0733 hypothetical protein MTVEEEKAFLARHLKATEAGEFVTIDALFQAYKKELGRSYTRDAFYQLLK RHGWRNITPRPEHPKKADAQTIVASKNKISIQEGKKAF >SP_1678 hypothetical protein MFMVGTYESFTDKKENSLQRKMMEEQTWHKKE >SP_1794 hypothetical protein MEELMKNNERLGIKLSRDSVLGLREVRRLYLGSSDIPVSDGYVIEVAYNQ ISHEIDIIDWVELNKSKIKISEISESVDIDATSLRTTLTLDTLVYEGMRD IQLKLRELTKGRVFFSFVVKLVLFASILKKKDLLEKFQEKC >SP_1170 hypothetical protein MSEVDFNEAVNYEFTSDTCQLANSIYQSLFKFFDKKNFSGDLIFTWKSPS LVKEGDYIGRRDSQVDNLRVIGNIFPNYLTNRKYSLNMNRNGCMGDFPHD FFDIYLDHVAKYAYEQKVNNIKEYYPLKRAILHQENALYFRFFSNFDDFL EKNYLKTIWQVSKETPFSEMDFNMFKNISEKIIFERGSKMLNDLKSNYKK >SP_1145 hypothetical protein MTAEIGILNKNGVVLASDSAVTLSDGKNSKVFNSARKLFTLSKEHSVGIM IYGNASFMEIPWEVILNEFKQAIGTDLLDNTAQYVEKLIEFLISFKHLQV EDLLRNYIVRSTRSILDSIAYEAQETADLRVSNGETITLDDFNKILLNAI TNFSLEISKVEAESNFEFFEAELELIKSIVDDVFKSFPHTDDEVEVIAKS LYKAIFIGYDSTNITGLVIAGYGTDEIFPSIRQIELYGIFSKRLIWKVIN ESVINHHKTCHIIPFAQSEMVETIMNGIDPNLNVYIAEQVSSVMEKNGLG DEIENIFENISSIQQKYYINPIIDLIGMQPLNEMASTAKTFIELTSFKRK IVNTLETVGGPVDVLAISKGEGPIWIDRKYYFDIDKNLDYRMRKES >SP_0587 hypothetical protein MLGSLSFFLRNETKIQVIKIPSRQATQRKIGNRTTERLLLGRFIFFHRVV 
GKFSFQDTSLERFNTKVSKAFTLIAIGRLAKCFTNSLVK >SP_1351 hypothetical protein MIWVKATQLVDEMEQVGLIRFDEFGNVGILVLEGQ >SP_0038 putative acyl carrier protein MTEKEIFDRIVTIIQERQGEDFVVTESLSLKDDLDADSVDLMEFILTLED EFSIEISDEEIDQLQNVGDVVKIIQGK >SP_1977 hypothetical protein MKEKQDFCLFFRKQSVFQFHFQSIIRLFFKIEAI >SP_0559 hypothetical protein MNPIKAFAKIYGNYFLTVQGVKVMKTIKKADHVVVGLGKLFIADKLMDTA RWLIKPEERE >SP_0512 hypothetical protein MFLIFIIDWVLLIVFAIQISYIFWRLSQKWKELSNK >SP_0670 hypothetical protein MAKGFAKGLVTGVAGTVAAVAGAVYAFKKKVIEPEEQKAAFIEENRKKAA RRRVSR >SP_0040 hypothetical protein MSCYKLDKFSITVSINFFHPNLELASKRLELAFLFQNHLYF >SP_0277 hypothetical protein MKLRIFAEDKPAKKVFEYQLELADRTILLSTALLSGAIALAGIFSALKEK >SP_1635 hypothetical protein MYLVIGIVLAFIVSFWKDNRSLWNPVLFLLSLISSYFYLSYLFYKNGYEN VQLAFYIFAFVLLPFLLFLSGIFLIYNGVILLKREGRSKPHYLSMLFDFY >SP_1049 hypothetical protein MWSVATTKKVIILDFKLFYKHFVFVDKGNFDNKNPKR >SP_1065 hypothetical protein MLLFYVGRLNGQKYHCISFVSWHIKYRSILLMILKSLLSWRAIQICSTNE ESTILGYSVVKGQIKNLTHFQTL >SP_0763 hypothetical protein MTIARFSRATESWNGLMVEISPCCLDFIQKSVTQALFKLLIH >SP_1948 conserved domain protein MTNFNSNEKFCGKSLKSLSADEMSLIYGASDGAEPRWTPTPIILKSAAAS SKVCISAAVSGIGGLVSYNNDCLG >SP_0866 hypothetical protein MNKQQFIIMALFTAAETYFFNEAWMTGRYIMAAFWAILLFRNFRVSYVMG KIVDVIDQHFNRKD >SP_0203 hypothetical protein MGKYQLDDKGRAQVTRYHEKHSKGGAGKKERLLSFREQFLNKNKKK >SP_1385 hypothetical protein MKMSTFFKKSFWPTFMIVNQTAILFHLKDGLDRQYLTTESIYWVIGTFIF GNILVAVFSNMKIWDKKKNGSKKKYILKK >SP_0520 hypothetical protein MKLSNLLLFAGAAAGSYLVTKNRQTITDEVLNTTDRVQAIKDDVDIIQNS LQIINQQKELIKEYQEDLTYKFKVLEKDIQTRLAVIKEMQGTEDK >SP_0564 hypothetical protein MSKKLNRKKQLRNGLRRAGAFSSTVTKVVDETKKVVKRAEQSASAAGKAV SKKVEQAVEATKEQAQKVANSVEDFAANLGGLPLDRAKTFYDEGIKSASD FKNWTEKELLALKGIGPATIKKLKENGIKFK >SP_1581 hypothetical protein MHKTLENIGEFEEDNLYYSSIDKSRNKDQFSHIFGLYNICSG >SP_0134 hypothetical protein MESIIKNKKIVAINNGINVSNSDLDVVGVQDFKKEFCIPNNKKSFVMLEG WIQKKGRIDSLNLQKNYF >SP_1209 hypothetical protein MHAILRYFIRRLFYHIFYKIYSLISKKHQSLPSDVRQF 
>SP_0679 hypothetical protein MGILSIILGLLFPIVGLILGIIGLVLAISYQKESQLDYKIEKILNILGIV ISVVNWIVAIALIFR >SP_0293 hypothetical protein MIFNPICCMIREKKGDRDMAFTNTHMRSASFGIVTSLPDDIIDSFWYIID HFLKNVFELEEELEFQLLNNQGKITFHFSSQHLPTAIDFDFNHPFDPRYP PRVLVLDMDGRETILLPEENDLF >SP_2179 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0706 hypothetical protein MEMGKLSSHMWRLNQIIYTKYFWGYVLFWILICLGLWYWLEGNDRLVIEI LKGPNLSQNSFLVLSIWLLHWFIIHTFFLAVVYRRRASDFFMEVIRFSSI KLWIRYQIWTCFLYGLILIMVKVLVIQFMLQLPNWDIGVLFIVDSLNACV LVLFCFMLYALGANVQMNFACVSFFLLMIVFGGLFVGNRTNYLFYILNRG NGDIGRDLFLQLLFLVFLFQSIFYFTRQKRRFIE >SP_1952 hypothetical protein MLKILNNSVIYLCLIPLLFLLLIIFPNDSLIYYFRLILISLISITLHELG HFLVGRCLSYKLEMLATPFFFYFRKKIYFKFPVLLAFGYCQMSNRNITNE KNSDRNLVFYFFGGGGANLIVAILALLGFVPYASEFFILNIILFLVTVCL PIDGTDGNAIREIVLYSKDSKTYQRFFANSLYNNPYITIDDFAKLTSKEK SFFSKFEKCLINLYFEVKGEGSAFTIANHEVFDSNIQENIIHEYYKLLHK SSDWIIPQNLSLEEVVFTLSNYIYSKNNKYLEKIKYLKKLVDFRQEEIID FILNKEEVL >SP_1831 hypothetical protein MYYSSFNILYYRLPIFAKLSEKKLEYMMLGDCVMLVNEMEITDHRVDNLF EKGKNEIKDSIGTNSVLNKKIILQKIRKLSNQPSGYWIGSLDERFLDHAI INQIDVTSEQIVLMSDGFYEFYQNNQNKTFEELIKMRFNSSAIDPIYGKK DDASILVIDV >SP_0470 hypothetical protein MDKMKPVFQALNKELIQENLTLTIICVGGYVLEYHGLRATQDVDAFMAL >SP_0269 hypothetical protein MFSAIFIQKLISNITNKKEKYLDKQGKILLQ >SP_2140 hypothetical protein MKQKNYLVSNATVRQTYDKIAESECFLRAIGGIMLHLKLVKQEIEAEKPA SVEAWIISVKFKKGCYRHI >SP_0902 hypothetical protein MKKYFIGGLGSNAYHSKDFLQELDSQVYFLNPYEKHLRDETELKSWFKNE IVEEESICLIGHSLGGDLARYFASEFEEVKKLILLDGGYLDLDKILPLDT ELEETKNYIKSQIVSDLDVLTSKEKSEAKHWSENMEKAVRQSYHWNVEYN 
RYELAINYENIEAILRLRRKIQAFKREVGDTLFISPRYPNEATWREEALK ELPDYFDTIFLENFGHELYTQAPKEIASLMNEWLAYFL >SP_0080 hypothetical protein MMPKMANRDRSPLSSSKSSSKAGLYGKIERSDKRE >SP_1929 hypothetical protein MKREQDKLIRTVKSISNNVLEAEVYYSSFNLL >SP_1059 hypothetical protein MSSQMKAFLNELQGNMEVVGEFFDNPEKVIRKFGIVGREKEALLIRDLND MDDYALSIQNSVASPSGAHSSTCHFVRQDLQSA >SP_1443 IS66 family element, Orf1 MEFLLYTISKVKLLEDILMPQPIVPVEIPQSRRFDSKKRNDILLKIRIGK LEVSFFQSLNLEMVEQLLDKVLLYDNSSI >SP_1793 hypothetical protein MKQKQPIVSRTKQHTFEELIQDQKLERLAKLSPDLVGRYGFTASCASSFA NLIKEAYGGKNLNVVYASRMLALWNIACSCYHKADGYSLADALFSDKKIC LDSYYYHKNTSNTITSDVIKDVYDNYNNYMVLTREATPEYIYVVQTEMPK DSDLYFYIREVLGLSFSTMHYAFLVKVLAGALARKYKPYRN >SP_2089 transposase, IS1380-Spn1 related, truncation MTNLSSVDSEELFQFYRERGNAENFIKERKAGFFGDKTDSSTMIKNEVRM MMGCLAYNLYLFLKQLAGDEVKSLTIKRFRRLFLHIAGKYVSTARRHILK FSSLYAYSKQFQALFDTICQINLILPVPYRARGQGKTCLTE >SP_0747 hypothetical protein MTFSNMINPLSKKSLPIIPKKANPVEFAFYHH >SP_1254 hypothetical protein MTSDKAGLERKFAAKERKRNKPGVVLCGSMDELCALAQLNPEIEAFY >SP_1836 hypothetical protein MMSSKDSKCYTKLLTSYFKPRDSHKKGKSYKNSLSEEKGASTTGVNYYQL KTTVFTLN >SP_0052 hypothetical protein MKERLDDNPIVQGNWKTLGFQFRHETPLVAAAVPHKLR >SP_0465 hypothetical protein MFLPFLSASLYLQTHHFIAFPNRQSYLLRETRKSHFFLIHHPF >SP_1211 hypothetical protein MKQFKILSDKYLESITGSDGNLGPGFGVIIP >SP_0840 hypothetical protein MKILKRYILELCFILSFALPFIKGTNADNGRCFVETYYGFTFLMEHAIVT AVFICSFLIAFLLKNDGRNGLLRVVIAF >SP_1172 hypothetical protein MLYAVPFYFNRSETIVFLNCESIKTDCDGAILALETFKN >SP_1465 hypothetical protein MKTTFSYPKWAEIPNIDLYLDQVLLYVNQVCAPISPNKDKGLTASMVNNY VKNGYLTKPDKKKYQRQQIARLIAITTLKSVFSIQEIAQTLNTLQTQASS DQLYDAFVDYMNQGIDPANPIIQTSCQTVKLYHQTLDLIDHTQEEVIQ >SP_0497 hypothetical protein MIIMVLEFDKKLQNIKRFLENNKINLLGGENEESIF >SP_1656 hypothetical protein MENNDSFTKLKESTQKLFDAQKKRLNNEDRIETTKNNVIAKHCQTVLSFL VLTSFFVKNCVK >SP_1052 putative phosphoesterase MRGFNNKIKSVYQELTNSKEKFGSFHKTLIHLHTPVSYDYKLFSNWTATK YRKITEDELYDIFFENKKIKVDKTIFFSNFDKVVFSSSKEYISFLMLAEA 
IIKNGIEIVVVTDHNTTKGIKKLQMAVSIIMKNYPIYDIHPHILHGVEIS AADKLHIVCIYDYEQESWVNQWLSENIISEKDGSYQHSLTIMKDFNNQKI VNYIAHFNSYDILKKGSHLSGAYKRKIFSKENTRFWSLILTRKNLRNNLI FSIKKLVY >SP_2120 hypothetical protein MIDKVVRNLLLTFFFCKMTKIIIFLTTILVKKKKICYNEFKLRNRKQKGV IMWVLGFILFMIFFYSNNSKKIKKLENKIKRLERKEKGNAEMSRLLQEMI GKEPIITGVYIGPDNWEVVDVDEEWVKLRRVDNTGKEKFKLQRIEDIQTV EFDGE >SP_0816 hypothetical protein MKLINTTNSHSQLVKSQLESTDATLVEVYSAGNTDVIFTQAPLHYEILIS NKHRAIREPEIETIQEFFLKRKIDKASVDEANIKTLYSEKLIGISIPIK >SP_0956 hypothetical protein MTVTKSYKYDWNTVWEYSTNYHDHQYAWIPSWSRYDSYSEYKVGGGWNYA RYEVINYYSGGY >SP_1183 hypothetical protein MKSLGKWYVSTGKEWICHSDDELEEFKNLFLNFINPEEWDTISFDSDFMP FQQS >SP_0223 hypothetical protein MKKINFPRNFSFFVKFPIFTWSFDALDILNYQDAFVTPGI >SP_1401 hypothetical protein MDKISCLSLPPLSPIIVISCLFFPFLMQGKLLSLDDNPKARKVSQTSLLS QTSLQLKGKDSIL >SP_1958 hypothetical protein MKKFDNYIIEKPCDSNSDKLQKILIIESLVDDILQFSLRINNSVGEIFLL QPF >SP_0504 hypothetical protein MNPDRAEEYFCRGCQGENPEDIEFYDEQLQAEKVEDLNIRLEVKN >SP_0734 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVL >SP_1864 conserved hypothetical protein MVEPNLESLIKDLYNHARHDLSEDLVAALLETTKKLPTTNEQLQAVRLSG LVNRELLLNPKHPAPELLNLARFVKREEAKYRGTATSALMYEELFKML >SP_1337 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1043 hypothetical protein MVVYIRQSKLPSEVSINKYNAQVGAYLQGEEVILYQSFSEIKELTSEDIV VDYIMETRALLKMMGLNVPVHDYPIELKEFYGRKIYAGILGEIVNIPDNW GKFIKPKAGSKVFTGRVVNGTHDLIGIGLPFDYPIWISEVVEFIAEWRCF VLDGRVLDVRPYTGDYHAQFDASVIDEAISCWKDAPIAYGLDIGVTRDGR TLVVEVNDGYALGNYGLSPLKSINFHRARWKEMVKPYFEKNEIFKIQQDV IF >SP_1761 hypothetical protein 
MSKELNILQIGLANWENHYDIPENMSWYYFYPNSSKALREIIEKEDINRF HAVLIEDGQYSRDLFSYVKYFEPYTLFYNQNLQINDREVVDFLKKRCAQA IDFLSPQQLINDLSKSLFGGGYGDKLFPPTIQVNPNFTGAISYQGLDYVS LEGEFGQDFAQLAYWAYNIMVQKTLPIELWLEYEKEGNCDFRLVIRKMWS GSVDDFFEEVIVSEKDLEQALFMDSRDGDYFLSISVEARGRGTIKLGNLH QRWSRKQFGKFVLGGNILHDSKRDEINYFFHPGDFKPPLTVYFAGYRPAE GFEGYFMMKTLGCPFILFSDPRLEGGAFYLGTDELEGKVKDTITHYLDYL GFDHKDLILSGLSMGTFPALYYGASFEPHAIIVGKPLANLGTIASRGRLD APGVSNLAFDCLIHHTGGTSSQDMTELDQRFWKIFKQANFSKTTFGLSYM KDEEMDPQAYEQLVSYLCNTGAKILSKGTAGRHNDDTDTNISWFLHFYRM VLETGFGREKR >SP_0490 hypothetical protein MKVNIADLHLTQLYLSEKKLQDIQMLYQSAETIQVDPISILAFGDCLLIT DGHHRAYQALLAGRDTISAEWDRDGGDELYHLYAQACEERKIYSVFDLED RILAQDGYEAKWYNWCDGFNQAATLLLKR >SP_1335 hypothetical protein MMPNYPCEFEVTFLDDYHKKHNYPLFYESYLQNVMEFLESQDIKNGVNAF VDDNQNLVFVLY >SP_1452 hypothetical protein MGDEENAKWTERGVLMDVTIKKKDGKTTIGTAKAHPTWVNRTPKGTFSPE GYPLYHYQTYILEDFIEDGSHRDQLDEATKERIDTAYKEMNEHVGLKWY >SP_0448 hypothetical protein MYNVITPSVIVLADQNKADWSYDENAVINIYDDANFEDGRLHMNFEQFFK LAQIAREEGLEIHSPFERAGATKSARYIAKWILRNKKH >SP_0270 hypothetical protein MLQKYTQMISVTKCIITKNKKTQENVDAYN >SP_0389 hypothetical protein MKKTVYKKLGISIIASTLLASQLSTVSALSVISSTGEEYEVSETLEKGPE SNDSSLSEISPTYGSYYQKQSEVLSVMMI >SP_0582 hypothetical protein MIKTFLSALSVILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATA TMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLILLLFGTDKTLSSASN KTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQR IKLLFHQVGLSMANYDIFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTT RFGTIVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNIIHNQLASKYPK AIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATID HILLPKNHYYVKDLDIVSFQNSDHRCIFTEITF >SP_1108 hypothetical protein MTAFQQLPSSVLQTGLFFSPSVSLDSQTVSAKEYLFPYQKERLKPFRQVK GRQANI >SP_1921 hypothetical protein MILSEKITWDFFNQENSSHRNLIILQRTIFI >SP_0528 blpC, peptide pheromone BlpC MDKKQNLTSFQELTTTELNQITGGGLWEDLLYNINRYAHYIT >SP_0531 blpI, bacteriocin BlpI MNTKMMEQFSVMDNEELEIVSGGRGNLGSAIGGCIGAVLLAAATGPITGG AATLICVGSGIMSSL >SP_0532 blpJ, bacteriocin BlpJ MNTKMLSQLEVMDTEMLAKVEGGYSSTDCQNALITGVTTGIITGGTGAGL 
ATLGVAGLAGAFVGAHIGAIGGGLTCLGGMVGDKLGLSW >SP_0533 blpK, bacteriocin BlpK MDTKMMSQFSVMDTEMLACVEGGGCNWGDFAKAGVGGGAARGLQLGIKTG TWQGAATGAAGGAILGGVAYAATCWW >SP_0539 blpM, bacteriocin BlpM MDTKIMEQFHEMDITMLSSIEGGKNNWQTNVLEGGGAAFGGWGLGTAICA ASGVGAPFMGACGYIGAKFGVDLWAGVTGATGGF >SP_0540 blpN, BlpN protein MNTYCNINETMLSEVYGGNSGGAAVVAALGCAAGGVKYGRLLGPWGAAIG GIGGAVVCGYLAYTATS >SP_0541 blpO, bacteriocin BlpO MDTKMMSQFAVMDNEMLACVEGGDIDWGRKISCAAGVAYGAIDGCATTV >SP_0524 blpT, BlpT protein, fusion MTDTDPIKRAHTLITDLNKAYQACKQASADDVRFQEQLNSILGFLAKAET VDNRFLIELEKFYQTSSLLMGLSALDPDAPTRAAWRAYDRFHFDQVKTKL ILNENQRAN >SP_0041 blpU, bacteriocin BlpU MNTKTMSQFEIMDTEMLACVEGGGCNWGDFAKAGVGGGAARGLQLGIKTR TWQGAATGAVGGAILGGVAYAATCWW >SP_0544 blpX, immunity protein BlpX MEVFNMKYRLFFVIFLSSVLDILLGTFLQISIVSIGWLVLYSGLFEAGVF LLANKGVAVKIKEVDIRNRFKFIFGKTLWFQILLLIFLIIKLYLGLDARL ILFYGHIFIVFNALMYLLSSSQVSLKKNKLSS >SP_0546 blpZ, BlpZ protein, fusion MYKHLFFLDSKTLDRLTPYILVLASDTIAFNVFVLTFVSAVVFNFLNSML ALMAIFIGAGYVVGFWLLILNENQRAN >SP_0123 ccs1, competence-induced protein Ccs1 MGWNFRVVNLLSLHSQTKNPSISRASKHLIQSKIQTKRHRSRGGEKASRE SQRSFSTRTWFVVVPVWQIEDSPHKRKQQKQ >SP_0200 ccs4, competence-induced protein Ccs4 MSVYGRVEEVHKENREPLEYQIEQESHHRESSRLPLVKILLWSTLVTGIT LGVPLLLDLMSAQEVQDFYAGWALHQTGKIYSDYYGSQGLLYYLLTYVSQ GGFFFAIFEWLALVAGGFFLFRSADTLTEQGDQAGHLVTIFYMLVTGLAF GGGYATLLALPFLFAAFSLVAAYLSNPSHDKGFVRIGLALAGGFFFAPLS SLLFIAVVSLGLLVFNLGHRRFAHGFYQFLAVALGFSLVFYPTAYYSAAT GSFGDAISGIRYPIDSIRFDFTSKILENMFFYGLLSLGLGFVVCIFLGLF QSKPFKLYVISVPASLVVILGLILLFFSQEPLHASYLMVVFPVFLLLLVT NIKSQQRGRSARRSRRETPVSLWSRFFKGNLYLLVFGFVYLLSVPFLMKF VLYPVPYQERNRLADLVKEETNTEDAISCMG >SP_2237 comC2, competence stimulating peptide 2 MKNTVKLEQFVALKEKDLQKIKGGEMRISRIILDFLFLRKK >SP_1449 cppA, cppA protein MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLE KLVLEEAPSMRTRKVEGRKKLARLIVKVENPLEIEGILSKTDSIHRLYKG QNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEI SMELHLPTDIESFLESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLV NELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEV >SP_0352 cps4G, capsular 
polysaccharide biosynthesis protein Cps4G MRVLFILSDNIYLTPYFNFYKELLKKLSISYDVIYWDKNINEIITKQNYY RISFSGKGKLSKILGYVKFRKEIKKKLKENDYDMILPLHSIVSFILVDFL LFSFKNRYIYDIRDYSYEKFLVYRLVQKQLVKNSLMNIVSSDGYKFFLPM GEYFTTHNLPNMIELNEVKQLKNNSTFPIQLSYIGLIRFQEQNKKIIDFF ANDSRFQLNFIGTNAGELREFCQEKNISNVNLVDTFQPKDTMSFYKNTDV VLNLYGNHTPLLDYALSNKLYFAALLYKPILVCEDTYMEKVSIENGFGFV LPMKDESEKDCLALYIQNLDRKQLIKNCDNFMDRISLEKQKTEIELEKRI LSLRKKND >SP_1850 dpnC, type II restriction endonuclease DpnI MELHFNLELVETYKSNSQKARILTEDWVYRQSYCPNCGNNPLNHFENNRP VADFYCNHCSEEFELKSKKGNFSSTINDGAYATMMKRVQADNNPNFFFLT YTKNFEVNNFLVLPKQFVTPKSIIQRKPLAPTARRAGWIGCNIDLSQVPS KGRIFLVQDGQVRDPEKVTKEFKQGLFLRKSSLSSRGWTIEILNCIDKIE GSEFTLEDMYRFESDLKNIFVKNNHIKEKIRQQLQILRDKEIIEFKGRGK YRKL >SP_1849 dpnD, DpnD protein MKTKQLVASEEVYDFLKVIWPDYETESRYDNLSLIVCTLSDPDCVRWLSE NMKFGDEKQLALMKEKYGWEVGDKLPEWLHSSYHRLLLIGELLESNLKLK KYTVEITETLSRLVSIEAENPDEAERLVREKYKSCEIVLDADDFQDYDTS IYE >SP_1964 endA, DNA-entry nuclease MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPS QALAESVLTDAVKSQIKGSLEWNGSGAFIVNGNKTNLDAKVSSKPYADNK TKTVGKETVPTVANALLSKATRQYKNRKETGNGSTSWTPPGWHQVKNLKG SYTHAVDRGHLLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTG QNYYESKVRKALDQNKRVRYRVTLYYASNEDLVPSASQIEAKSSDGELEF NVLVPNVQKGLQLDYRTGEVTVTQ >SP_1573 lytC, lysozyme MKTKIGLASICLLGLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKT QTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYKAWFYINSDGRYS QNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNK WYYFKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTY ANQEWQKVGGKWYYFKKWGYMARNEWQGNYYLTGSGAMATDEVIMDGTRY IFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVIDISEHN GRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYL YTYAENETDAESDAKQTIELIKKYNMNLSYPIYYDVENWEYVNKSKRAPS DTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLLQTRLKHPDILKHVNWVA AYTNALEWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWY >SP_0602 pep27, pep27 protein MRKEFHNVLSSGQLLADKRPARDYNRK