Gene list
Applied filters:
COG category: Unclassified
Gene type: CDS
Genomic element: chromosome
Number of genes found: 428
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334 >SP_0098 hypothetical protein MRKWTKGFLIFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEK LTFGKEVENLEITLHQHTLTITDSFDDQIHISYHPSLSAHHDLITNQNDR TLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRLPKGRTLKGIN ISANRGQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNIFDT VLTDSQLESTENHFHAENIQVHGKVELTAKDYLRIILDQKESQRINWDIS SNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTP SRR >SP_2120 hypothetical protein MIDKVVRNLLLTFFFCKMTKIIIFLTTILVKKKKICYNEFKLRNRKQKGV IMWVLGFILFMIFFYSNNSKKIKKLENKIKRLERKEKGNAEMSRLLQEMI GKEPIITGVYIGPDNWEVVDVDEEWVKLRRVDNTGKEKFKLQRIEDIQTV EFDGE >SP_1834 hypothetical protein MSDIQVNIPGECLYDKVFVLSFIIYNISTNLNIVNGIFYIQAKKDSITFE WKAKEQTRKLAIDSSKPCFEVVDIVK >SP_0800 hypothetical protein MPVRKLQSYEVDYQEELNQQLPHYQAYTPEAQSDANLKEILFFINIAVFC ICIAIFSFIFLALKLSTALAFAAAIGFSLLVLKVQRSIIKRKRRR >SP_1756 conserved domain protein MSEEDLFYKDVEGRMEELKQKPIKKEKETRGEKISKTFSLLLGLMILIGL LFTLLGILR >SP_1003 conserved hypothetical protein MKINKKYLAGSVAVLALSVCSYELGRHQAGQVKKESNRVSYIDGDQAGQK AENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAII SEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLKDAAHADNIRTK EEIKRQKQEHSHNHGGGSNDQAVVAARAQGRYTTDDGYIFNASDIIEDTG DAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPA QPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDP AQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPD SRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVF EENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREF YNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVDDIL AFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITS DEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSG NTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHN IKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHV RKNKVDQDSKPDEDKEHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTD TEETEEEAEDTTDEAEIPQVENSVINAKIADAEALLEKVTDPSIRQNAME TLTGLKSSLLLGTKDNNTISAEVDSLLALLKESQPAPIQ >SP_2182 hypothetical protein MFVFADDSLSANSKVVSGEAQFENGSSVRFGDTQVNILSDEVLEVVNPDG SVDTIERRADGVYINGAFYMAYQKNEIDLNISFRSYDPNVWNYVNTIHGN KQANTFANFMTGAGISYMIGRIGALLGGPWGAIIGGAYFGIQAYQSYLDS QSPYPYYITSTYIHVAQRKWKFITEYYRNSNYTGYVKTVTTYVNF >SP_1353 hypothetical protein MRNPYLPVFESDKRLIETDKLIWFPAKNSLAGFLF >SP_1986 hypothetical protein MLMCEKIRIRRVSDYPSARGGLEDILIMENMTNHLLLVQIRVHGYLLDFA SIEGQRQKHYRLKNLPQTVELTVDDVEEDVDLTLPENRSYQEADFFERMF RENC >SP_1480 hypothetical protein MAELDNGIQVIIEIQVHHQNFFINRLWPYLCSQVNQNLEKIRQREGDTHQ SYKQIALVYAIAIVDSNYFSDDLAFHSFIVK >SP_0692 hypothetical protein MIIMQDNFLFEEIEEISVPVNDFSAGLATGIGFGLAILALAGC >SP_1300 hypothetical protein MSISPRFETLEQAIASKDLEKVREAFKKMNSTWTINESVVRDNSIAHYGR VETAISFLPSSMEIEPTDESGT >SP_0077 hypothetical protein MTYSHIYQVLFLPKLSIKRWHFLGLVLVDFALSYLSHFELFMVQWKHVIQ II >SP_2105 hypothetical protein MNKLMKFISVFLTSIVLIVSAIPSVSAVYASEQVSQIETNMELQPVTSLT EEQINTLANEIQSFHPDVSQQWIKEVINRQLQGDYTIPPTYSPFRAVWQG ITVNQMGALLDTAIALALGGTTAGLANLIKVKGKHAAKSAIRSAISRYLG SWFVNDVALEFAMNLLSPGTYLAQLWDKNDAIPNNGRINF >SP_0714 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1842 hypothetical protein MGDRYYRALNGSEPDKYLLEKVELYKTDAIELVDVNK >SP_1337 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0504 hypothetical protein MNPDRAEEYFCRGCQGENPEDIEFYDEQLQAEKVEDLNIRLEVKN >SP_1772 cell wall surface anchor family protein MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKT VEKTDALATNDTVVLGTISTSNSASSTSLSASESASTSASESASTSASTS ASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDY VASVTNVNLQSYAKRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINA SLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYTVTYVNPK TNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQ VLSYNTSTMTTQGSGYTWGNGAQMNGFFAKKGYGLTSSWTVPITGTDTSF TFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASAS ASTSASASASTSASASASTSASASASTSASVSASTSASASASTSASASAS TSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTS ASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSAS ASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASAS ASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTS ASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS ASTSASASASTSASASASTSASASASISASESASTSASASASTSASASAS TSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASESASTSASASASTSASESASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASASASTSASASTSASESAS TSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASISASESASTSASASASTSASVSASTSAS ASASTSASESASTSASASASTSASESASTSASASASTSASASASISASES ASTSASASASTSASASASTSASASASTSASESASTSTSASASTSASESAS TSASASASTSASASASTSASASASTSASASASTSASASTSASESASTSAS ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS TSASVSASTSASESASTSASASASTSASASASTSASESASTSASASASTS ASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS ASTSASASASISASESASTSASASASTSASASASTSASVSASTSASASAS TSASASASISASESASTSASASASTSASASASTSASASASTSASASASIS ASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS ASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASASAS TSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASASTSASASASTSASASASTSASASASISASESASTS ASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSAS ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS TSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSAS ASASTSASASASTSASASASISASESASTSASASASTSASASASTSASAS ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS TSASASASTSASESASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASTSASESASTSASAS ASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASAS TSASASASTSASASASISASESASTSASASASTSASVSASTSASASASTS ASESASTSASASASTSASESASTSASASASTSASASASISASESASTSAS ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS ASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASASAS TSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTS ASASASTSASASASASTSASASASTSASASASTSASASASISASESASTS ASASASASTSASASASTSASASASTSASASASISASESASTSASESASTS TSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASES ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASVSAS TSASASASTSASASASTSASESASTSASASTSASESASTSASASASTSAS ASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASAS ASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASAS TSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTS ASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSAS ASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVSAS TSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSAS ASASTSASASASISASESASTSASASASTSASASASTSASASASTSASES ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASAS TSASESASTSASASASTSASESASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASAS ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS TSASASASISASESASTSASASASTSASVSASTSASASASTSASESASTS ASASASTSASESASTSASASASTSASASASISASESASTSASASASTSAS ASASTSASASASTSASESASTSTSASASTSASESASTSASASASTSASAS ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ESASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASES ASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASISAS ESASTSASASASTSASASASTSASVSASTSASASASTSASASASISASES ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASESASTSASASASTSASASASISASESASTSAS ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVS ASTSASESASTSASASASTSASASASTSASESASTSASASASTSASESAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASISASESASTSASASASTSASASASTSASVSASTSASASASTSASAS ASISASESASTSASASASTSASASASTSASASASTSASASASISASESAS TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTS ASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS ASASTSASASASTSVSNSANHSNSQVGNTSGSTGKSQKELPNTGTESSIG SVLLGVLAAVTGIGLVAKRRKRDEEE >SP_2159 fucolectin-related protein MNKEKIKRKLITILFVCIGMLCFGLLAGVKADNRVQMRTTINNESPLLLS PLYGNDNGNGLWWGNTLKGAWEAIPEDVKPYAAIELHPAKVCKPTSCIPR DTKELREWYVKMLEEAQSLNIPVFLVIMSAGERNTVPPEWLDEQFQKYSV LKGVLNIENYWIYNNQLAPHSAKYLEVCAKYGAHFIWHDHEKWFWETIMN DPTFFEASQKYHKNLVLATKNTPIRDDAGTDSIVSGFWLSGLCDNWGSST DTWKWWEKHYTNTFETGRARDMRSYASEPESMIAMEMMNVYTGGGTVYNF ECAAYTFMTNDVPTPAFTKGIIPFFRHAIQNPAPSKEEVVNRTKAVFWNG EGRISSLNGFYQGLYSNDETMPLYNNGRYHILPVIHEKIDKEKISSIFPN AKILTKNSEELSSKVNYLNSLYPKLYEGDGYAQRVGNSWYIYNSNANINK NQQVMLPMYTNNTKSLSLDLTPHTYAVVKENPNNLHILLNNYRTDKTAMW ALSGNFDASKSWKKEELELANWISKNYSINPVDNDFRTTTLTLKGHTGHK PQINISGDKNHYTYTENWDENTHVYTITVNHNGMVEMSINTEGTGPVSFP TPDKFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTR ADNPSWWEVDLKKMDKVGLVKIYNRTDAETQRLSNFDVILYDNNRNEVAK KHVNNLSGESVSLDFKEKGARYIKVKLLTSGVPLSLAEVEVFRESDGKQS EEDIDKITEDKVVSTNKVATQSSTNYEGVAALAVDGNKDGDYGHHSVTHT KADSNAWWQVDLGEEFTVSKVDIYNRTDAEPQRLSNFDVIFLSSSGEEVF RRHFDKVVDGLLSLKVPSVGAKLVKIELKSAAIPLSLAEVEVYGSKRTPK KLSNIALTKETRQSSTDYNGFSRLAVDGNKNGDYGHHSVTHTKEDSPSWW EIDLAQTEELEKLIIYNRTDAEIQRLSNFDIIIYDSNDYEVFTQHIDSLE SNNLSIDLKGLKGKKVRISLRSAGIPLSLAEVEVYTYK >SP_1660 hypothetical protein MIQYRDLKHRKNLLQFYKKYSENNILSLYFQRAGDGVSPVIPIDKIILSK TQV >SP_1779 hypothetical protein MTMYQDLLRKIAEEKPNYNQEEIQWLFDHLGNPSPEIRNVLLNQGLHYLS KEKDTRGFSSQYGWVHAFAHGADLLTEVVCHPDFPKNRVHEVFDILGQLF KRMSIRFTDDEDWRLARVIYEPILQGKLEQEQVASWIKTVDFPIEEREDF YKFSNFRSCLVEVYVQLDQRNSLQDDLKEAIQSFQY >SP_0167 hypothetical protein MDKKLDILDKVKEYLGNKTTQILDNQYKEFLKLNDIRRAFGISEKVLNNS FNFTSKEFNDLINNENYLFEYACRIREEWRKKCFNHSYRFLCSPIITDDF LNTKTLRSSQIEYKYERYLSKSSIGDRAVDGFVSFNTLTANGMSAIKLCL EILNSIFFKKKIDLLYSTGYYETRFLLNNLAKSGISCYEVSNCELDKDKF YNVFMMEPNRADLTLQKTDFKIVEYFVKYKNNSIKVVILDISYQGSNFKL VEFLEKFKFANVIIFVVRSLIKLDQMGLELTNGGIIEVFIPNHLRKLKNF IEEEFNKFRNSHGANLSLYEYCLLDNSLTLKNDWNYSDLVMKFTSNFYAD IKDLFMENSDIEIIHEEGVPFVFLDLIGEGKKEYEMFFQWLNFFYKQLGI TLYARNSFGFRNLTVEYFGIIGTERYIFKICPGVYKGLSYYLMKFLLKSF SNEYLKTTDEVNR >SP_1892 hypothetical protein MIQIVVRSVKDYSENRKFDAETLEFRKTYSKMKYGRNNVILEFKLNYNNI VEVSF >SP_0691 hypothetical protein MMKKVTHMSDEVFLFEEIEEIVAPTDGEFLGEVLLGTGVVLLIGVACC >SP_1810 hypothetical protein MLNQDLFDSLEAQKIVDTLMKGQKDYVDERLEKRETMIVSNGYAWTRPNH IDTAFASADLFEYKLQLAGQTWGYLEFETNTEKYGKVLLIIKGKKRLTNQ FPLVQKNKSGYLFEYAQMNTLYLNQHSSYKNDENSHSFPIQMELVSDEMI QEIEQATKNSNIEKFMILTYEADSENNIISVDVVMPDARTGQLHLIQDLS EYIQSSSYHFEEAKYQDIPNFSELSETEDFEIIPRIEKQEGQK >SP_0990 hypothetical protein MGVAEKIEEVSMGKSLLTDEMIERANRGEKISGPPLLDDNEETKILPTSS SRFGYANPKDHGFSQETLKIQVEPSIHKSRRIENTKRNVFNSKLNKILFA VIFLLILLVLAMKLL >SP_2160 conserved hypothetical protein MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSD RNNPDSLLHLKKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELY IEHIDIQSCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFTSFNKNI LCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGRKGVQ FKVVCHSKVTDGEVSVLGETIVIRNATEVFLYLKSMTDYWGNIDISSLQG EFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCLSIPTNLLLENTKKY SNYLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTININ TQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNT DGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEHFEMIK EAFLFFEDYLFEVDGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQIL RYFCDSCIGIAKQLGDNSDFISRVKELKKKLPKTKIGSNGQIQEWLEDYE EVEPGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQ EREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLNN ATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPALPSAWSEG EVKGFRVRGGYKVSFAWKNGDITFLKLEGGNKDQKVRVRIYGKNTDVQNI ELVFNSEKIIELNF >SP_1921 hypothetical protein MILSEKITWDFFNQENSSHRNLIILQRTIFI >SP_1789 hypothetical protein MEMMELPSQEILIFTKQIRHWILSDQVISGKRKLFFREDTPKEILDLYEN IKSKLDFAYQEVHSNNGLKKYEK >SP_1788 hypothetical protein MKNRIIDVFEVVNRLLVITVENPDFEDLRVNHTQ >SP_1904 hypothetical protein MKLKKLLKDDTKVFEKSTFKFVEGYKIYLTESKESGIKQMDNVIKYFEFI ESKSIALYFQKRLNELID >SP_0038 putative acyl carrier protein MTEKEIFDRIVTIIQERQGEDFVVTESLSLKDDLDADSVDLMEFILTLED EFSIEISDEEIDQLQNVGDVVKIIQGK >SP_2117 hypothetical protein MGMFVGMFKARVESHEIILDVKALMPWISAICLLIGFISMFLTFNFLKKS RKFHSLYQEEMDDDLNETYYVQMYRNLEFGTIAFNITGVAIPLAIFISLS EVIILHTNPQTFFLSFLLFVVFLVAQKSLFKTIAIVRQFDLEFFATPKDV LNYINSYDEGERQANLEQSFRILFQLHQYVLPALYIFLIIISFLTGEIQL LAFLLVGAIHVYINVMQLPMVKRYFK >SP_0635 hypothetical protein MLRMIVSKQYLLFYLIHEKEVHTLRIINSRIDYLNQLDHLFRTCRKLFSS QIISL >SP_0563 hypothetical protein MSYEQEFMKEFEAWVNTQIMINDMAHKESQKVYEEDQDERAKDAMIRYES RLDAYQFLLGKFENFKVGKGFHDLPEGLFGERNY >SP_1455 hypothetical protein MKEILICDIIKRKGLHKKVGRTKIQRRQNRNQGGLAFRFMKGLVNFLGVI ASGAIRDLWRYSC >SP_1339 hypothetical protein MAAEVLNLQLVSVQVDETDEVDGMRFSTFSTNRCGNWSAFSWENC >SP_0110 hypothetical protein MLLLLLCTTFFVFNVNYTREVVRIQEMGKTVDSLDLYLKDINEPAASVLR FFEDVSKEYKVSIIKTDSGDEVVKSGVFDKDTFPYQEFGISSLDFTTDGE GVYSNKEISNKLGTIPTFLKAKPIQLMTFQTYIKDTSRSLNGRYTITSTQ EMDKDRIVQKWSDFFKIDQATLLEPTYKSAVEVINRDLLLSAIVFVLAIL LLVLVTVYQPMMEMKRVGVQKLLGFQDRAVLADVVKGNLYLLLGGALVIN LGVFFLLDYKPKDLFPMLWLSHFLLLQLYLFISWLTYLLIQKMTISSLLK GFSSFKFGLIFNYVMKIGTTILLTALLIGVGRSLEQENKELAYQQQWVSQ GNYLTLETFKLNDNLWQEELAGSGKSTDYFYRFYQDLVEKTQAGYVQSSS LPVKNFVQSEQIQQYQLTDTVDVYYANRNFLKSKGFKLPNTGIKKVILMP ASTKGEEDKNQLLGKLIAFHSMKYEEQQKRTIEEMDVEIAYYEGDWSFFP YSDKRKENLSNPIISLVNDSDMMWDEKASLSTTGLNNPIKIENTVQHQKE ITELVEKLSDGNYLKFSSIQAIQQEKVDSYRDAVRNFNLLFALFGLLSMM ISYFLLVTTFLLKRRDIITKKFMGWKLVDRYRPLLVLLLLGYSFPLLVLI FFAHAFLPLLLFAGFTCLDILFVLGLASRMEKRSLVELLKGGIL >SP_1145 hypothetical protein MTAEIGILNKNGVVLASDSAVTLSDGKNSKVFNSARKLFTLSKEHSVGIM IYGNASFMEIPWEVILNEFKQAIGTDLLDNTAQYVEKLIEFLISFKHLQV EDLLRNYIVRSTRSILDSIAYEAQETADLRVSNGETITLDDFNKILLNAI TNFSLEISKVEAESNFEFFEAELELIKSIVDDVFKSFPHTDDEVEVIAKS LYKAIFIGYDSTNITGLVIAGYGTDEIFPSIRQIELYGIFSKRLIWKVIN ESVINHHKTCHIIPFAQSEMVETIMNGIDPNLNVYIAEQVSSVMEKNGLG DEIENIFENISSIQQKYYINPIIDLIGMQPLNEMASTAKTFIELTSFKRK IVNTLETVGGPVDVLAISKGEGPIWIDRKYYFDIDKNLDYRMRKES >SP_1831 hypothetical protein MYYSSFNILYYRLPIFAKLSEKKLEYMMLGDCVMLVNEMEITDHRVDNLF EKGKNEIKDSIGTNSVLNKKIILQKIRKLSNQPSGYWIGSLDERFLDHAI INQIDVTSEQIVLMSDGFYEFYQNNQNKTFEELIKMRFNSSAIDPIYGKK DDASILVIDV >SP_1459 hypothetical protein MREIMLLQLFSLYFESLILTTILVLIFLGIWIGLRAMSGVDKTARARQAH LYDMIMIGVLVVPVLSFAVMSLILVFKA >SP_0475 hypothetical protein MQINAILKKKKLLLEGNKMVIRVFDQQKNTYSSFALEELSYYMNRVFKTN IELVEEKEADIFVGLVNKEDRKDHVLISLDKGKGRIESNTIVGLLIGIYR MFHEFGVVYTRPGRRHDFVPELRFEDFLDKQLSIDETASYYHRGVCIEGA DSFENILDFIDWLPKIGMNSFFIQFENPYSFLKRWYEHEFNPYLNKEQFS NELVQELSDRLDKELQKRGLIHHRVGHGWTGEVLGYSSKFGWESGLSISE EKKPYVAEINGKRELFNTAPILTSLDFSNPDVADKMVEIIKDYAKKRPDV NYLHVWLSDARNNICECENCRQELVSDQYIRILNQLDRALTSEGLDTKIC FLLYHELLWAPQKEKLDNPERFTMMFAPITRTFEMSYADVDFDNSIPTPK PYMRNKIILPNSLEENLSYLFEWQKAFKGDSFVYDYPLGRAHYGDLGYMK ISQTIYRDVSYLSNLHLNGYISCQELRAGFPHNFPNYVMGEMLWKKTRSY EELIEEYFSALYGENWQSVVEYLEKLSIYSSCDYFNAIGSRQSDVLANHY YIAYNLADNFLPIIEENISKLLNSQKDEWKQLSYHREYVVKMAKALYLQA TGKTRQAQDEWRNVLNYIRGHELLFQSNLDVYRVIEVAKNYAGFHL >SP_1794 hypothetical protein MEELMKNNERLGIKLSRDSVLGLREVRRLYLGSSDIPVSDGYVIEVAYNQ ISHEIDIIDWVELNKSKIKISEISESVDIDATSLRTTLTLDTLVYEGMRD IQLKLRELTKGRVFFSFVVKLVLFASILKKKDLLEKFQEKC >SP_0076 hypothetical protein MIDVTIGQKSKTGAFNASYSICFSGENFSF >SP_0270 hypothetical protein MLQKYTQMISVTKCIITKNKKTQENVDAYN >SP_0654 hypothetical protein MNPVVKKIKEDVRGITDLPHPIFTGFDCLKYNQ >SP_1487 hypothetical protein MAKCKKYEEFGLDSLLQETRGGRNHAYMTVEQEKVFLARHLKATEAGEFV TIDALFQAYKKELGRSYTRDAFYQLLKRHGWRNITPRPEHPKKADAQTIV ASKNKVSIQEDK >SP_0639 hypothetical protein MMFVIEEVKDENQKKAVVAEVLKDLPEWFGIPESTQAYIEGTTTLQVWTA YQESDLTRFVSLSYSSEDCAEIDCLGVKKLIKVEKLGANCLLL >SP_1945 hypothetical protein MKSMRILFLLALIQISLSSCFLWKECILSFKQSTAFFIGSMVFVSGICAG VNYLYTRKQEVHSVLASKKSVKLFYSMLLLINLLGAVLVLSDNLFIKNTL QQELVDFLLPSFFFLFGLDLLIFLPLKKYVRDFLAMLDRKKTVLVTILAT LLFLRNPMTIVSLLIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVI VTLIFF >SP_1054 Tn5252, Orf 10 protein MKRDVRDIRKQFRLTEAEEKQILALMRERGETNFSDFLRKSLLSSDLQKQ METWFALWQSQKLEQISRDVHEVLILAQSERQVTQEHVSILLTCVQELIQ EVANTIPLSKEFREKYMR >SP_0198 hypothetical protein MKLKRFTLSLASLASFSLLVACSQRAQQVQQPVAQQQVQQPAQQNTNTAN AGGNQNQAAPVQNQPVAQPTDIDGTYTGQDDGDRITLVVTGTTGTWTELE SDGDQKVKQVTLDSANQRMIIGDDVKIYTVNGNQIVVDDMDRDPSDQIVL TK >SP_1581 hypothetical protein MHKTLENIGEFEEDNLYYSSIDKSRNKDQFSHIFGLYNICSG >SP_1188 hypothetical protein MNHSFKKITVFCFIVSCVLCLLDLMNFKNVATFLFFCLPVFVLIYKNK >SP_0125 hypothetical protein MTNFDILDNQFLSLSENELSDIDGGLAPLVIFGVAVSWKAIAGGTALIGS GLAAGYFLGGD >SP_0682 hypothetical protein MVYLVLGILLLLLYVFATPESIKGTVNIVAMVCILVALLILLVLSFLKIF QLPTEIFLAIAMLILAYFSVRDITLMPVKKSKRR >SP_1108 hypothetical protein MTAFQQLPSSVLQTGLFFSPSVSLDSQTVSAKEYLFPYQKERLKPFRQVK GRQANI >SP_1570 hypothetical protein MERSLFGLFTAFLCFICFLAGAQAFRKKRYGLSILLWLNAFTNLVNSIHA FYMTLF >SP_0997 hypothetical protein MVASASASSTSTQAQEQVDKSELRALSQELDQRLKALATVSDPKIDATKA VLLDAQKAPEDSALTE >SP_2043 hypothetical protein MKLKKYSIQGVGKVIFPASFSDEIVQNLAMIGFFEKYGIIVVI >SP_1078 hypothetical protein MAFGDNGNRKKTMFEKITLFIVIIMLVASLLGIFATAIGALSNL >SP_0039 IS1381, transposase OrfB, truncation MRNIGQAGKILADSGYQGLMKIYPQAQTPRKSSKLKPLTVEDKACNHALS KEISKVENIFAKVKTFKMFSTTYRNHRKRFGLRMNLIAGIINHELGF >SP_1844 hypothetical protein MKFYSYDYVLSQIGQQNGIMVGFGIVLLAVTVFFAFKAYHNKKGSEFREL VMISDLALFSSAFGQHHDLSKQSSF >SP_1488 hypothetical protein MKSIKEEIQTIKTLLKDSRTAKYHKRLQIVLFRLMGKSYKEIIELL >SP_0470 hypothetical protein MDKMKPVFQALNKELIQENLTLTIICVGGYVLEYHGLRATQDVDAFMAL >SP_1493 hypothetical protein MPMNIILIAKLLRENTNTKANALNNGWARSGSEEFKKFSHFVGVDKGIVR TNVLTGKKLSDKIRKEVGSGDSKLGKGGYFSTGDVLLGKDVVSYTVQVFS ENNERVGVNTQSHRVQYNLPILADFSVIQDTVEPSRTVVEKIIPKLNIPE EEKGKITEEIKKKKKTSELAELISENVKVRYVDEQGRLLSLKNDTGIGEK ESDGTYITNKKQLIGTSYNVTDKKLSSMTTTDGKYYTFKEADTNSASLTG NIVSEGRTVTLVYRESEAPTTATVTANYYKEGRQEKLVESVIKADLAIGS EYTTESKTIEGKTTTEDKEDRVITRKTTYTLVATPENAYQKTVQQLTITT VRMLRKQWFPKQQPLLRRRL >SP_0429 hypothetical protein MTGTNTFTVLSTEDLEQTSGGLAVWEDGYSRWLYYREFAPYMRQGALNSY IDAWKYGFRAG >SP_0558 conserved hypothetical protein MIRCKKEIRSLYMAEQDLAMQVLQQVVKLPVVKVDRSKFLVDKFSKELDP KDIPTLLEQGPTTLLSQEILDRVANACIRDNVLLASGTSVLAGLPGGLAM AITIPADVAQFYAFSLKLAQELGYIYGYEDLWASREELSEDAQNTLLLYL GVMLGVNGTAALLRVGSITIAKQVMKIVPNKALTKTLWYPILKKVLKIFG VNLTKGGLAKGMGKFIPILGGIISGGLTFATMKPMGESLQKELSKLVNYS EVQYQEDVETIRKEAEIIKGE >SP_0170 hypothetical protein MGRRFCFICSLKKVTAVITDDSTEQNYEELEIYTQVIV >SP_2089 transposase, IS1380-Spn1 related, truncation MTNLSSVDSEELFQFYRERGNAENFIKERKAGFFGDKTDSSTMIKNEVRM MMGCLAYNLYLFLKQLAGDEVKSLTIKRFRRLFLHIAGKYVSTARRHILK FSSLYAYSKQFQALFDTICQINLILPVPYRARGQGKTCLTE >SP_0497 hypothetical protein MIIMVLEFDKKLQNIKRFLENNKINLLGGENEESIF >SP_0810 hypothetical protein MTVEEKKVFLARHLKAAEAGEFVTIDALFQAYKKELGRSYTRDAFYQLLK CHGWRNIMPRPEHPKKADAQTIVASKNKISIQEEKKAL >SP_1495 hypothetical protein MYQNEDLYKKGLNVELAHQQIKGFFEAEFKNRINGVLNTKIKNSTLNRVN KKTIHQSNKNSMINLKQKQRKMLKNKAILC >SP_0818 IS630-Spn1, transposase Orf1 MWYNLLMAYSIDFRKKVLSYCERTGSITEASHVFQISRNTIYGWLKLKEK TGELNHQVKGIKPRKVDRDRLKNYLTDNPDAYLTEIASEFGCHPTTIHYA LKAMGYTRKKKELHLL >SP_2200 hypothetical protein MIAQNKKNVKETAYRKEQEISPTPIFYTRSPSLFTL >SP_2139 hypothetical protein MILWSFDFANDHAHAFFMDNVEWSHADSYFRSFVSDDVEERYTENVYLDS LSVKQKFKFIFDFGDEWRFECQVLREIETEDEEAYLVRSVGTSPEQYPDY DGFDYEEW >SP_0398 hypothetical protein MSLIFVVIYKVKEAGQKVFKIGKRQPIGCSKILIGCPLLWKALLDFLLRL SF >SP_0070 hypothetical protein MNDKKEVDGEHWPLLIYKDWILVASISDFSIVS >SP_1165 hypothetical protein MANTVKVFLKEISQNKKENPVKTRDFLVKNIFSQTF >SP_0548 hypothetical protein MPLFPGMGVSVSSLSPKSISFLRIKFPYYSTSFWPIRQPL >SP_0258 hypothetical protein MMELVLKTIIGPIVVGVVLRIVDKWLNKDK >SP_0472 hypothetical protein MLIINRFFSLFVLDWYRTELWINTLVSYPVPKYVGRKMKQLLRVSVLEKI LSIEFFK >SP_1036 hypothetical protein MFIISPDLFNIAVILYILFFIHDILLLILS >SP_1170 hypothetical protein MSEVDFNEAVNYEFTSDTCQLANSIYQSLFKFFDKKNFSGDLIFTWKSPS LVKEGDYIGRRDSQVDNLRVIGNIFPNYLTNRKYSLNMNRNGCMGDFPHD FFDIYLDHVAKYAYEQKVNNIKEYYPLKRAILHQENALYFRFFSNFDDFL EKNYLKTIWQVSKETPFSEMDFNMFKNISEKIIFERGSKMLNDLKSNYKK >SP_1109 hypothetical protein MFEEFPKLPDLKQVTFPNDKEKSQNSKEKLDDCFPTTPI >SP_0520 hypothetical protein MKLSNLLLFAGAAAGSYLVTKNRQTITDEVLNTTDRVQAIKDDVDIIQNS LQIINQQKELIKEYQEDLTYKFKVLEKDIQTRLAVIKEMQGTEDK >SP_0087 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVLFCLMGKSYKEIIELL >SP_1678 hypothetical protein MFMVGTYESFTDKKENSLQRKMMEEQTWHKKE >SP_0490 hypothetical protein MKVNIADLHLTQLYLSEKKLQDIQMLYQSAETIQVDPISILAFGDCLLIT DGHHRAYQALLAGRDTISAEWDRDGGDELYHLYAQACEERKIYSVFDLED RILAQDGYEAKWYNWCDGFNQAATLLLKR >SP_1630 hypothetical protein MKVEPRCDVLSRMSHFFIRILIMELQELVERSWAIRQAYHELEVKHHDSK WTVEEDLLALSNDIGNFQRLVMTKQGRYYDETPYTLEQKLSENIWWLLEL SQRLDIDILTEMENFLSDKEKQLNVRTWK >SP_0461 putative transcriptional regulator MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQ ETFEEELTFNLDTQQVQLIEHHSHQTNYYFHQLYNQSTILKILRFFLLQG NQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHF SILVALTWKRREFPLEFPESKEFEKLKNLFMYPILMEHCQTYLEPHANMT FTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPL YHISKAIVQEWMTEQKIEGVIDQHRLYLFSLYLTETIFSSLPAIPIFIIL NNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAM IAKKAHYLL >SP_1038 hypothetical protein MINKLSRYMESSGKTYQNHYVTILKWYEEDKDKLRQKGLNKKMNYDVGES L >SP_1528 hypothetical protein MIFWKKQLTKQTKSCIIKQIFKAGGSGNEKV >SP_0414 hypothetical protein MSVKLFHCQSISFLGAESKKKSTESVSADKKGSL >SP_1452 hypothetical protein MGDEENAKWTERGVLMDVTIKKKDGKTTIGTAKAHPTWVNRTPKGTFSPE GYPLYHYQTYILEDFIEDGSHRDQLDEATKERIDTAYKEMNEHVGLKWY >SP_1696 hypothetical protein MALILESSKRNEDSHMTVTIKVNYQTTFQKKEAKN >SP_1059 hypothetical protein MSSQMKAFLNELQGNMEVVGEFFDNPEKVIRKFGIVGREKEALLIRDLND MDDYALSIQNSVASPSGAHSSTCHFVRQDLQSA >SP_1658 hypothetical protein MGGILLLIGPFVLLGIAVNTAATTLNGGATAGAFSGVALLLNALKIANLV LGIIAIVYYKGDKRVGAAPSVLMIVSGGVSLILFRS >SP_0790 conserved domain protein MKKMKYYEETSALLHEFSEENQKYFEELWESFNLAGFLYDEDYLREQIYL MMLDFSEAERDGMSAEDYLGKNPKKIMKEILKGAPRSSIKESLLTPILVL AVLRYYQLLSDFSKGPLLTVNLLTFLGQLLIFLIGFGLVATILRRSLVQD SPKMKIGTYIVVGTIVLLVVLGYVGMASFIQEGAFYIPAPWDSLSVFTIS LVIGIWNWKEAVFRPFVSMIIAHLVVGSLLRYYEWMGISNVFLTKVIPLA VLFIGIFVLFRGFKKIKWSEV >SP_1080 hypothetical protein MGDKPISFRDADGNFVSAADVWNEKKLEELFNRLNPNRALRLARTKKENP SQ >SP_1380 hypothetical protein MLVEKRRLRMRLKVIKKLVDINILYSSQEANLANLRKKQAKNPGKKVNVS ARVLSSYIFSSLLMIICFSNIAIHFPFEEIPIYFSSMIAILLVIAFSTSL TAFYNVFYESKDLVSYRPYAFKESEIIIAKGLSVLLPALTGIVPILAYFL VLYIRLAPSLWLGLPLMLLSLTLLFVSVALVMVVAVHFLAQTRVFRKYQS IFSNVMIGIGVLIPLIFIFFLQSTFGSIVDKVRDIPFLLYPLHIFYKIAV EPFSTEALVGLLAWIGLTLFLLYLTKKKVLPRFYDVILLNSEEKVKKERR SKERISTTKKGFFRMVLRYHLTLLGQGTGVITVLFTSAFLPYLMMIGLIS KIRDSQIVPDIHPPYWLPLFFIALFIAVVNNNITSLHSIALSLERENVDF LKSLPFDFARYVKVKFWIIYAVQSFLPVLTLLGLSLYLGLPIISMIYLLV VWIIASVILSCHHYFKDVKNLSTNWSSITDLVNRSNGIVAIVLLFIYSAI LMALVIGSIFLVQSLSPILAISLGVGALIVLLALAIFGYHYYLSRILAEI EKR >SP_2177 hypothetical protein MMKQRKELYLFLGRTALYFLIFLGLLYFFSYLGQGQGSFIYNEF >SP_0190 hypothetical protein MQKHVCVQLYKHQGWEHFPVLFYFNFKKKEERNEKNSSC >SP_2179 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1314 IS66 family element, Orf1 MELLLYTISKVKLLEDILMPQPIVPVEIPQSRRFDSKKRNDILLKIRIGK LEVSFFQSLNLEMVEQLLDKVLLYDNSSI >SP_1802 hypothetical protein MYMSKAKKICFIIFCILILTIFLPVLIDYHQVSDLGIHLLSWRQNSVVEF YLARYVFWGTVVLSTLVLLSILVVMFYPKRYLEIQLETKNDTLKLKNSAI EGFVRSLVSDHRLIKNPTVHVNLRKNKCFVHVEGKILPSDNIADRCQIIQ NEITNGLKQFFGIERQVKLEVAVKNYQPKPQNKKTVSRVK >SP_1060 hypothetical protein MADMKNKYDVKRIIPDELSESLDIFLKNYSETGLSDYNTYLFYGFILKSY KLPRENRYSIKLLVKELQNRGLKVTLIINIYYHALNCLALNDGLKIYGED FLI >SP_1141 hypothetical protein MPEFIIVEGNNDLGEFFQIDGELFSDNELLENLKKWREWEVPVIIDDWCN RILNEDETEILYFPTHEDKMNYIRVEKDLEPLYHTSNKIYATISKSEWLE LLN >SP_0124 hypothetical protein MMKDLNNYREISNKELQEIKGGFGVGVGIALFMAGYTIGKDLRKKFGKSC >SP_1150 hypothetical protein MEFFITLYYTMISLVNLLKYLGHPFFSSSSAKS >SP_1347 hypothetical protein MAERFWENLSIILAERNISWIELTRKMFAGEFHYPSELNRLYQKIRH >SP_1439 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1224 conserved domain protein MTEHLKSNTMVLPLKKGAQKMTTITLKVSEADKTFMKAMAKFEGVSLSEL IRTKTLEALEDEYDARVADLAYQEYLEDLEKGVEPITWEEMMHDLGLKDE >SP_0067 hypothetical protein MALTLAGAVLTNDVFANDRLVATQTTDGKNENVLTSEVLKPSSGNVLVGI KGEFVAPHQQSILDAINAICKEAADEGLVDKYVPIK >SP_1641 conserved domain protein MKKLYRIHFIAIAVIDLLLFAFFITRLETSFEWLLLSGLIFFLAQGLLLF LLVVRLKHQFAEIYPQINKKIRFYYLGVLTIDFLFFVLLAFISSQRFSSL MPIITACHSTFYYMTADYLRENYPDFYDKHISLWECL >SP_0514 hypothetical protein MILDFLKKYKHELQDFRDRGWELIGAGSCSILRKSSSDLLPEDQVYMSKG LKWEVMRSRLRSCTATFSGGLVVCMSLFREDLSMSFFLIFVLYAFLISYL IYGYFRLKRKYRVDE >SP_1158 hypothetical protein MLEIDLIVLIVLSYFYFISLYNCYCFLFHDFNLTVQI >SP_1708 hypothetical protein MSVMEHLFKFLLLAPYFYFDNWIEKANRNSKFFPIFYYFYWFYIPFYSLF SLAWTVVSVLFFNTVLRNVTDIKLWGIWFLFILLAIGMNWLTYSCFKEMF RLRQELGKSKGGRH >SP_0512 hypothetical protein MFLIFIIDWVLLIVFAIQISYIFWRLSQKWKELSNK >SP_1401 hypothetical protein MDKISCLSLPPLSPIIVISCLFFPFLMQGKLLSLDDNPKARKVSQTSLLS QTSLQLKGKDSIL >SP_1785 hypothetical protein MATYGFLDILEEELDKNFPFDFEISWDKRNHAVEVSFLLEAQNAAGVEMV DEDGEVSSDDILFEEAVLFYNPAKSTVNEEDYLTVIPYLPKKGFSREFLA YFALFLKDTAEVGLDVLMDFLEDPEAEEFVMEWNQEVFEEGKIGLEKGEF YPYPRY >SP_1052 putative phosphoesterase MRGFNNKIKSVYQELTNSKEKFGSFHKTLIHLHTPVSYDYKLFSNWTATK YRKITEDELYDIFFENKKIKVDKTIFFSNFDKVVFSSSKEYISFLMLAEA IIKNGIEIVVVTDHNTTKGIKKLQMAVSIIMKNYPIYDIHPHILHGVEIS AADKLHIVCIYDYEQESWVNQWLSENIISEKDGSYQHSLTIMKDFNNQKI VNYIAHFNSYDILKKGSHLSGAYKRKIFSKENTRFWSLILTRKNLRNNLI FSIKKLVY >SP_0816 hypothetical protein MKLINTTNSHSQLVKSQLESTDATLVEVYSAGNTDVIFTQAPLHYEILIS NKHRAIREPEIETIQEFFLKRKIDKASVDEANIKTLYSEKLIGISIPIK >SP_1604 hypothetical protein MAKEPWQEDIYDQEESRAERRHRNHGGADRMANRILTILASIFFVIVVVM VIVLIYLSSGGSNRTAALKGFHDSDASVVQISSSSSSQPEQSSEPESTSS SSEEAANPEGTIKVLAGEGEAAIAARAGISIAQLEALNPGHMATGSWFAN PGDVIKIK >SP_0684 hypothetical protein MEEYGHMEKVQEVVRLFQITIMVKNIIIHLS >SP_1621 putative transcription antiterminator BglG family protein MSRKQEQMETLLLLLRDSKDYISAKVLGEKLNCSDKTVYRLVKGINKDCP VEAFILSEKGRGFKLNPRSSLVDVDGNFTEAFDPEVRREKLLERLLLTAP KPHSIYDLGEEFYVSESVVLKDRQILQESLAIYGLDLKMRQRKLFIDGDE AQIRSAILNLLPMFNQLDLEQITQNKVQPLDGELAHFCLGLLITLERELG VNIPYPYNINIFSHLYIFISRNRRSTSIHVVAPSKPTIVDEKIYSVCQKI IQEIEQYFRMKVDAVEIDYLYQYVVSSRLQKPFSSGKLPFSQRVLDVTHY YFSRMCMDNREIETTDPDFVDLASHISPLLRRLDNRVQIKNSLLSQILLT YPNLVKELTTISKEVSLVFGFASLSLDEIGFLVLYFARFQEKRARPLKTV VMCTSGVGTSELLRARLEKQFSELDIIDVVAYHQLDELINLYPDLDFIVT TVALQEPASVPFVLVSVFLTEGDKQRLQAKIQEINYE >SP_0633 hypothetical protein MLEVGLNFLISLFTFTFDILYPIVKVGHTDDYSHGAVKLSVSLVDKDAMK KIFVTVIGYFEINIDENITDILYVNGTAILYLYLRSIVSIVSAIDSSEAM LLPIINVLELLDKSQPFEEE >SP_1120 hypothetical protein MFLLYYLFREDSSKLLYFFNYFENLQQVHLLVQL >SP_0297 hypothetical protein MNNLDNMRFIMEIFASFSPEIELLLSYFSLFLMIYFNFLPLGKNNKDMN >SP_1210 hypothetical protein MANLSQGLSLYLMTHHYQAPKSVIDFGLWIAKAPSQERGRLAFLQMLAQT LQGFR >SP_1432 hypothetical protein MSSKLLKAKEQVKSQDKDKKSILGQIKSFKTDDKSKSNKKDHSKGAER >SP_0465 hypothetical protein MFLPFLSASLYLQTHHFIAFPNRQSYLLRETRKSHFFLIHHPF >SP_0542 hypothetical protein MCNNGLTFLLGPFAIGIGVTGAAGGAILGGVAYAAICWW >SP_0832 hypothetical protein MVIGVLDSKEELKESENDAPKLETPLREEPRLAPQTLPEASEVLENKREE SKVEIT >SP_1138 hypothetical protein MVSLPHLVYMVVESMAITSQRAISHPMKSVYFCLGL >SP_0471 conserved hypothetical protein MDWYDYMIQASKQSQFNASHWFRYLRKVIFEDYSYLTNQDVEKLLDSKEL TRFQKISLKYAFQEHTPTHKYVISLNKPAKLTNVQKLMEKYKHG >SP_0559 hypothetical protein MNPIKAFAKIYGNYFLTVQGVKVMKTIKKADHVVVGLGKLFIADKLMDTA RWLIKPEERE >SP_1004 conserved hypothetical protein MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTK DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII AEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPH GDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKV LEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFT VPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDALVRVFDEFHGN AYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE VPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTSEKVEKEKLSE TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLP EAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK FTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA >SP_1656 hypothetical protein MENNDSFTKLKESTQKLFDAQKKRLNNEDRIETTKNNVIAKHCQTVLSFL VLTSFFVKNCVK >SP_1924 hypothetical protein MIKRGDVVALYLPFPTISSDLAVKNHMYICIDNSMTKNKELVKNQTFKPA LLTRRLVKNFMIEEPDLARNPFTRPTLIDLDKVFMLDNTVIPTSYLARRR RNVSEELYEEILDYLVQPRLISLNKSEFMQLNPGTY >SP_0009 hypothetical protein MENLLDVIEQFLSLSDEKLEELADKNQLLRLQEEKERKNA >SP_0598 hypothetical protein MEVTNESNPKILGLCQQKATEKFYFISDFIGLIGRNFSLFDSDEVQQNSR KQSL >SP_0902 hypothetical protein MKKYFIGGLGSNAYHSKDFLQELDSQVYFLNPYEKHLRDETELKSWFKNE IVEEESICLIGHSLGGDLARYFASEFEEVKKLILLDGGYLDLDKILPLDT ELEETKNYIKSQIVSDLDVLTSKEKSEAKHWSENMEKAVRQSYHWNVEYN RYELAINYENIEAILRLRRKIQAFKREVGDTLFISPRYPNEATWREEALK ELPDYFDTIFLENFGHELYTQAPKEIASLMNEWLAYFL >SP_0367 hypothetical protein MFPIAFSAIICYIKSIIIFYLSELSIALSEEGVFFKKKM >SP_0534 hypothetical protein MVSIQYKEESMFFMLAFLIFTIQEVLMTIYDLSDPRSK >SP_2102 hypothetical protein MLYNNDKEEISMLKEVLTVAKVAKKSSLFLGGVAFGTLGLKILASKEAKK GYSKALAKAYKLKDELDASVSVVKQHGDDVLQDAKYLYEQEKKEEQLDSL IGE >SP_0195 hypothetical protein MKKSSTALWRSDTVWETVSAQLPEMGKSWNVQ >SP_0899 conserved hypothetical protein MKKSRKLATLGICSALFLGLAACQQQHATSEGTNQRQSSSAKVPWKASYT NLNNQVSTEEVKSLLSAHLDPNSVDAFFNLVNDYNTIVGSTGLSGDFTSF THTEYDVEKISHLWNQKKGDFVGTNCRINSYCLLKNSVTIPKLEKNDQLL FLDNDAIDKGKVFDSQDKEEFDILFSRVPTESTTDVKVHAEKMEAFFSQF QFNEKARMLSVVLHDNLDGEYLFVGHVGVLVPADDGFLFVEKLTFEEPYQ AIKFASKEDCYKYLGTKYADYTGEGLAKPFIMDNDKWVKL >SP_1476 hypothetical protein MVGHFLDDFDGYDSYIWFEEGMVEYISRKYFLTEEEFQAEKICNQSLVEL FQKKYSWHSLNDFGSSTYDKNYASIFYEYWRSFLTVDKLVENLGSVQAVL DSYHLWANTEKTFPLLDWFVQQKLIEKEI >SP_0025 hypothetical protein MKKSNILFIFILLLCIGLQYETIYYTDGSRSGAEYGLMGVSIFLALFYMI PALYFLFRIGKNGNCQRRF >SP_0374 hypothetical protein MSKKRRNRHKKEGQEPQFDFDEAKELTVGQAIRKNEEVESGVLPEDSILD KYVKQHRDEIEADKFATRQYKKEEFVETQSLDDLIQEMREAVEKSEASSE EVPSSEDILLPLPLDDEEQGLDPLLLDDENPTEMTEEVEEEQNLSRLDQE DSEKKSKKGFILTVLALVSVIICVSAYYVYRQVARSTKEIETSQSTTANQ SDVDDFNTLYDAFYTDSNKTALKNSQFDKLSQLKTLLDKLEGSREHTLAK SKYDSLATQIKAIQDVNAQFEKPAIVDGVLDTNAKAKSDAKFTDIKTGNT ELDKVLDKAISLGKSQQTSTSSSSSSQTSSSSSSQASSNTTSEPKPSSSN ETRSSRSEVNMGLSSAGVAVQRSASRVAYNQSAIDDSNNSAWDFADGVLE QILATSRSRGYITGDQYILERVNIVNGNGYYNLYKPDGTYLFTLNCKTGY FVGNGAGHADDLDY >SP_1843 hypothetical protein MVSITTYQNNQVSNNKFQTSLHFIEVVSKDL >SP_0900 IS1381, transposase OrfA, truncation MNYEASKQLTDARFKRLVGVQRTTFEEILAVLKTAYQLKHAKGGRKPKLS LEDLLMATLQYVREYRTYEEIAADFGIHESNLIRRSQWV >SP_1977 hypothetical protein MKEKQDFCLFFRKQSVFQFHFQSIIRLFFKIEAI >SP_1183 hypothetical protein MKSLGKWYVSTGKEWICHSDDELEEFKNLFLNFINPEEWDTISFDSDFMP FQQS >SP_1952 hypothetical protein MLKILNNSVIYLCLIPLLFLLLIIFPNDSLIYYFRLILISLISITLHELG HFLVGRCLSYKLEMLATPFFFYFRKKIYFKFPVLLAFGYCQMSNRNITNE KNSDRNLVFYFFGGGGANLIVAILALLGFVPYASEFFILNIILFLVTVCL PIDGTDGNAIREIVLYSKDSKTYQRFFANSLYNNPYITIDDFAKLTSKEK SFFSKFEKCLINLYFEVKGEGSAFTIANHEVFDSNIQENIIHEYYKLLHK SSDWIIPQNLSLEEVVFTLSNYIYSKNNKYLEKIKYLKKLVDFRQEEIID FILNKEEVL >SP_1835 hypothetical protein MSEIVETRVFFCFQNIVQNFVPVFHVLVILLHDRIYSMLIL >SP_0934 hypothetical protein MTASFMVAEMRRHKKIVTNPYFFDRIEVVKKK >SP_1048 hypothetical protein MWLIILWNAKPDTPLFNFKDEVIKYKTYEPFESSIKRVNTTIKNGSKGKT LTEMINGYRADNDIRDEICNFNILKNKIRDMKNQQGNTMESYF >SP_1350 conserved domain protein MKRITANQYQTSERYYKLPKLLFESERYKNMKLEVKVVYSVLKDRLELSL SKGWIDEDGAIYLIYSNSNLMALLGCSKSKLLSM >SP_1305 hypothetical protein MKEFLENFCFFFTVKKNSAIMSYVIKYDNKRRTI >SP_0815 hypothetical protein MLLCVLLLKDLLDFLSNRVFTQFSYLIGIEIPINFSE >SP_0223 hypothetical protein MKKINFPRNFSFFVKFPIFTWSFDALDILNYQDAFVTPGI >SP_0956 hypothetical protein MTVTKSYKYDWNTVWEYSTNYHDHQYAWIPSWSRYDSYSEYKVGGGWNYA RYEVINYYSGGY >SP_0792 hypothetical protein MKNVELKEKNMTFEEILPGLKAKRKYVRTGWGGAENYVQLFDTIEQNGLA LEMTPYFLINVSGEGEGFSMWSPTVCDVLATDWVEVHD >SP_0621 hypothetical protein MSVIEKLNHEKSLQALSNYGRMEAVELEKEIDYEIS >SP_1533 conserved domain protein MLENGDLIFVRDGSDMGQAIQTSTGNYSHVAIYLDGMIYHASGQAGVVCQ EPADFFESNHLYDLYVYPEMDIQSVKERACKHLGAPYNASFYPDAAGFYC SQYIAEILPIFETIPMKFGDGEQEISDFWREYYIELGLPVPLNQAGTNPS QLAASPLLQCKERNLHDSDF >SP_0996 IS630-Spn1, transposase Orf2, truncation MVAGLTNGELIAPMTYEETMTSDFFEAWFQKFLLPTLTTPSVIIVK >SP_0543 hypothetical protein MVSPRTNQLMFIGLADFMFVICLYRGISETEFYQQLIAYIGVFSACLSRF CSCGA >SP_0596 hypothetical protein MKEANKNHPAGAPTFAKGEGEHANDIVATYSDGTTYYVPLNDVTKYAR >SP_0888 hypothetical protein MVVKTRKQGNSITITIPSEFNIPSGVKYEAKLLPSGEIIFTPEELGQQVS YVSDDAFDLNLDKIFDEYDDVFKALVEK >SP_0595 hypothetical protein MQILCYFTITVVAKPNNSGEVHLDVSIEDNQGGSGYNFSSVSSSSQTAKY EGTVYNNNSSLYITIDKTSDATALLKLKLNNVDNQPATEVPSSGITVKLN AKDNAGNWTSASNKKEVTVKIVSAKPTYPDKILVKNPDNIKDTEKMPLLK N >SP_1216 hypothetical protein MKYRKRFLKPKVCDIIIKKKDLGVYYGFFGIYLKD >SP_0059 hypothetical protein MGIAIFLPLFSFFHRKFYHKSEKNSSLLNKKLE >SP_0650 hypothetical protein MILTLVVCIILTKLFRLKKLGRNFADLAFPVLVFEYYLITAKTFTHNFLP RLGLALSILAIILVFFFLLKKRSFYYPKFIKFFWRAGFLLTLIMYIEMIV ELFLMK >SP_1694 hypothetical protein MIFKAFKTKKQRKRQVELLLTVFFDSFLIDLFLHLFGIVPFKLDKILIVS LIIFPIISTSIYAYEKLFEKVFDKD >SP_1042 hypothetical protein MMKMATDKNRIMISLDDKNLEKLENLVEDARDRRGMRLTKSQVIELLLNT VDYFDDIMGAIYSKK >SP_0389 hypothetical protein MKKTVYKKLGISIIASTLLASQLSTVSALSVISSTGEEYEVSETLEKGPE SNDSSLSEISPTYGSYYQKQSEVLSVMMI >SP_0561 conserved domain protein MEVVMDNIIDVSIPVAEVVDKHPEVLEILVELGFKPLANPLMRNTVGRKV SLKQGSKLAGTPMDKIVRTLEANGYEVIGLD >SP_1335 hypothetical protein MMPNYPCEFEVTFLDDYHKKHNYPLFYESYLQNVMEFLESQDIKNGVNAF VDDNQNLVFVLY >SP_1140 hypothetical protein MAGNENDNLTSKQIKFIDAMLTEPTIDKACQKAGVSRATGHKYLKVAAVK KTLRIKQDEMMDKTTQMLYLASSNAVSVLNDIMMDSKVNPFIRTQAAKAI LEQSYKTHEIFGVVRQIEELRLEIEEVSKGNQRVTRTQGVIK >SP_0833 hypothetical protein MYMTGHSLGCYLAQIAAVEAYQKYPDFYNHVLRKVTTFSAPKVITSRTVW NAKNGFWDVGLESRKLAVSGKIKHYVVDNDNVVTPLIHNNRDIVTFTGNS RFKHRSRGYFESPMNDIPNFNIGKQATLDKHGYRDPKLDKVRFFKKQALP RSSSQPSAEPMENIASGKQVTQSSTAFGGDARRAVDGKVDGNYGHNSVTH TNFQSKPWWQVDLAKEETIRQINIYNRTDTAQDRLANFDVILLDSSGKEI E >SP_1958 hypothetical protein MKKFDNYIIEKPCDSNSDKLQKILIIESLVDDILQFSLRINNSVGEIFLL QPF >SP_0309 hypothetical protein MDKKERQKIEQQRREMALTNTFFNRYLLLRYSIALFFFGNIYWLLSQFIS PSPIIIFPIMLIVFSILATVEQFKLYGNRKEKLGITLMFVRIQMLISIGL LVLTWTSWFKNLFPIFENNQVARLFVFVVLLLGLVLSLLDIRRIKKIYKR TDKAYQQFVQLEKNSLSL >SP_1465 hypothetical protein MKTTFSYPKWAEIPNIDLYLDQVLLYVNQVCAPISPNKDKGLTASMVNNY VKNGYLTKPDKKKYQRQQIARLIAITTLKSVFSIQEIAQTLNTLQTQASS DQLYDAFVDYMNQGIDPANPIIQTSCQTVKLYHQTLDLIDHTQEEVIQ >SP_1971 hypothetical protein MLMDKTFLHRQLLKNLINVLYTYFQEKKRENLKKISVTQNTDFIDLLVIA TKDT >SP_0094 hypothetical protein MFSLVLILTIQEISRTLYNFQSNKLHSFSQAGILV >SP_1912 hypothetical protein MNGMKAKKMWMAGLALLGIGSLALATKKVADDRKLMKTQEELTEIVRDHF SDMGEIATLYVQVYESSLESLVGGVIFEDGRHYTFVYENEDLVYEEEVL >SP_0511 hypothetical protein MKIKNTFQLKELPPVTNFEKNGSRAKKRNKIGSIRIVKIRLA >SP_1947 hypothetical protein MRKKRGIKKLVTFALLGVFMFSNTIPYQQFIQKNKQLEIRVQSQKKSNGL DVGKAD >SP_1006 hypothetical protein MICLAQKTFYFFLAICRRLLVAIYHVLLKQESYNPRLQGLTEIRNPDKTM FVQDAIRFAQQHGFNML >SP_1037 putative type II restriction endonuclease MKIHCLKLKNKELNKEVAFYLTSIIRQALKNTEYKDQISSTVLPDIKIKL PIDSRGTPDWNYMERYRDR >SP_1379 hypothetical protein MLFYSSFKKWYTRLPAKLGSKCVRITVKNALPSWRSISFYQRKSNSKL >SP_0133 hypothetical protein MVTMQYSCGKININIPDGYGDIKDIVFSAHIIVRYNNGHCGGIDPHIIGL CKKQIRRMSLYPILIIVSRDSKVIDDYKNLDIAYVDCTQCSNNFETALHV KNILKLLKIQLIHCHGYSTNYFLYMLKKLDKNGFGKVKTVITCHGWVEYN LKKKFLTYFDFWTYSMGDAFICVSETMKKKIGEYNKK >SP_0455 hypothetical protein MKKWTFSRAFCRALKSSPNHQIEIRNSLDKTIDFSYSLSFFYLPLYHTFS VYGSSL >SP_1385 hypothetical protein MKMSTFFKKSFWPTFMIVNQTAILFHLKDGLDRQYLTTESIYWVIGTFIF GNILVAVFSNMKIWDKKKNGSKKKYILKK >SP_0773 hypothetical protein MKIYFLKKWENIDSKRILNHIRMGVFKIMFQW >SP_0444 hypothetical protein MNVIFVFIKKIPISFTKKKKELISQSFINLIPH >SP_0311 hypothetical protein MSLDIDKEKMTIMGIAFENRSVFKSVWYALSTNMIEGWRPTVSDVEKLRD EALALGMT >SP_0747 hypothetical protein MTFSNMINPLSKKSLPIIPKKANPVEFAFYHH >SP_0634 conserved domain protein MFSSYFNPLYIMVSNLHQNDKINQLISDYKQNMKAFYITIEKFIRDDESL KCYFIKVISSRSKVTSLDQIEADKTIQRKYSSELKKFIGFYNEIICEENS FLHVRKRWSSWFR >SP_1926 hypothetical protein MKGVTNMTPEEMYLTERLDVQIAHFLKKSVQHRRRYKVLKITEIVAGFLI AVFCAIPMPGDRYRLISVALSSLGLLCEGIINLYNAKENWISYQKTAQLL EKEKFLYQCQTEKYAGKTKAFALFVKTCEGLISEEINQWESIQSKEVAAS ADAPVKKE >SP_2140 hypothetical protein MKQKNYLVSNATVRQTYDKIAESECFLRAIGGIMLHLKLVKQEIEAEKPA SVEAWIISVKFKKGCYRHI >SP_1254 hypothetical protein MTSDKAGLERKFAAKERKRNKPGVVLCGSMDELCALAQLNPEIEAFY >SP_0582 hypothetical protein MIKTFLSALSVILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATA TMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLILLLFGTDKTLSSASN KTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQR IKLLFHQVGLSMANYDIFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTT RFGTIVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNIIHNQLASKYPK AIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATID HILLPKNHYYVKDLDIVSFQNSDHRCIFTEITF >SP_2241 hypothetical protein MSTHFVLQELKKTRIRRCLMKSLARLLIIHVFISIFLFFALTSGAISHTV LLLLLLFLPALNKGLEKIQSKRIPVLNAALFFLLISFPQLLTNPVQWKFS IFLVVTIISSLAYFYNFYQVVKEVDQKQLI >SP_1209 hypothetical protein MHAILRYFIRRLFYHIFYKIYSLISKKHQSLPSDVRQF >SP_0685 hypothetical protein MSRWDGHSDKGEAPAGKPPMHGFGLNGENK >SP_0924 hypothetical protein MILMTKNINLTNEELELIQGGADPYGKNPNGRYDWEIEPVLTLLVHGFCP RGTYDSGYIGGGNHLCKGSAARF >SP_0322 glucuronyl hydrolase MIKKVTIEKIKSPERFLEVPLLTKEEVGQAIDKVIRQLELNLDYFKEDFP TPATFDNVYPIMDNTEWTNGFWTGELWLAYEYSQQDAFKNIAHKNVLSFL DRVNKRVELDHHDLGFLYTPSCMAEYKINGDGEAREATLKAADKLIERYQ EKGGFIQAWGDLGKKEHYRLIIDCLLNIQLLFFAYQETGDQKYYDIAESH FYASANNVIRDDASSFHTFYFDPETGQPFKGVTRQGYSDDSCWARGQSWG VYGIPLTYRHLKDESCFDLFKGVTNYFLNRLPKDHVSYWDLIFNDGSDQS RDSSATAIAVCGIHEMLKHLPEVDADKDIYKHAMHAMLRSLIEHYANDQF TPGGTSLLHGVYSWHSGKGVDEGNIWGDYYYLEALIRFYKDWNLYW >SP_0407 hypothetical protein MSSCLPCPFGAFTVSPEFRPFTVMENQTIPHFY >SP_0116 hypothetical protein MSLKLLDCILDYQERFNGKTCQVSTNYKYLEIFKVNFCLTDLHHLFDLHK ITRDYASQTKPAIQDGVFILEDFRNILCTMM >SP_1914 hypothetical protein MKKKAFGIVLLVLAAWILLQGNFGIPSLDGKIWPLLGIVFFAYKSIESIL RRHLTSAVFTGLLALIIANYAYDLLPVTNHSLIWASILVVLGVGYLTHSS KFWNEKKWWYNGKKTVVTDKEVAFGSGTFYKQDQDLVDDQVEVAFGDAKI YYDNAEMLGDFATLNIEVAFGNATVYVPQHWRVDLKVETSFGAAKADAPV APTSKTLIIRGDVAFGKLEIVYVK >SP_0734 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVL >SP_0296 hypothetical protein MSLEAISEALTHSDTVTTKTYVNTSNIVPLSAGQVAYQHLKNK >SP_0203 hypothetical protein MGKYQLDDKGRAQVTRYHEKHSKGGAGKKERLLSFREQFLNKNKKK >SP_0196 hypothetical protein MERPVNIFTPTPRNGEELERPVDVFSPYSHS >SP_1252 hypothetical protein MVSMVDKDGKLIPEQGGARSTSPAPVVIRKGLDIDKIMMHLSDTFNSWDY RQVEYY >SP_1965 hypothetical protein MRQVMKMNKKSSYVVKRLLLVIIVLILGTLALGIGLMVGYGILGKGQDPW AILSPAKWQELIHKFTGN >SP_1172 hypothetical protein MLYAVPFYFNRSETIVFLNCESIKTDCDGAILALETFKN >SP_0089 hypothetical protein MFLADEKGSEHTAAELIDNLKEVIAKLKANA >SP_0368 cell wall surface anchor family protein MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLP ADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKP AEEKPKEDKPAAAKPETPKTVTPEWQTVANKEQQGTVTIREEKGVRYNQL SSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLK FKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITL KSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVS VKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRV KEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDD AHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDESKLLSSISF LGNALVSVSSNQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYG FVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQ WEKAYKGIVFPEYTKELPSAKVVITEDANADKNVDWQDGAIAYRSIMNNP QGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLK GYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASET YPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDL KKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHG GEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGA ANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWEN GTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTV TLNGRVIQDGSAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSD WAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTNPE MSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKE KVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSL ALNYVKAYAHNTRRDNATVDDTSYFQNMYAFFTTGADVSNVTLTLSREAG DQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVE GVEDNRTHLSEKHNPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQ TIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLE MHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNA NFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTKESMDAL KEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTA PAQAQEGLANAFDGNVSSLWHTSWNGGDVGKPATMVLKEPTEITGLRYVP RGSGSNGNLRDVKLVVTDESGKEHTFTATDWPNNNKPKDIDFGKTIKAKK IVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALVKAQKLT DKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKPDAPT VEKPEFKLRSLASEQGKTPDYKQEIARPETPEQILPATGESQSDTALILA SVSLALSALFVVKTKKD >SP_0564 hypothetical protein MSKKLNRKKQLRNGLRRAGAFSSTVTKVVDETKKVVKRAEQSASAAGKAV SKKVEQAVEATKEQAQKVANSVEDFAANLGGLPLDRAKTFYDEGIKSASD FKNWTEKELLALKGIGPATIKKLKENGIKFK >SP_1548 hypothetical protein MKRIVFELIFIATTWYIFLPPLNLTSWEFLFFLCGHLLVVAILFGFGKGI NLVKTVHVRHGKAEAALNLEGFKINRLGKILLASIGGILLLAALVSLVTS SMFQAKNYANVVTVTEKDFTEFPKSDTSKVPILDRSTAEKIGDRYLGSLT DKVSQYVAADTYTQLTIDGKPYRVTPLEYADPIKWFNNQAKGIGEYIKVD MVTGNADLVDLKTPIKYSDSEYFNRDVKRHLRLKYPTKIFKTPSFEVDDE GNPFYVATVYQKQFGLAVPRPASVIILDATNGETKEYSLSDVPEWVDRIY PAEETIEQINYNGKYKDGFLNAMISKKNVTQTTNGYNYLSIGNDIYLYTG VTSANADESNLGFILENMRTGEITKYSLASATEESARESAEGAVQEKSYK ATFPILINLNDKPLYIMGLKDNAGLVKEYALVDAVEYQNVIVATTVEEML SKYANKNDLEIDNATTESINGVVADLKSAVIKGDTVYFFKVDGKIYKVKA SVSDDLPYLENGKTFEGQVGKDNYLKTFKLR >SP_0293 hypothetical protein MIFNPICCMIREKKGDRDMAFTNTHMRSASFGIVTSLPDDIIDSFWYIID HFLKNVFELEEELEFQLLNNQGKITFHFSSQHLPTAIDFDFNHPFDPRYP PRVLVLDMDGRETILLPEENDLF >SP_0329 hypothetical protein MNDLGKYNELERSSKLTKRQFFENQMLDYTIIAHESFEIIRHSVYQTDDR EVENALAFEVKNDETDKLILLLSEDIGVGEKLCLVDGTKMRGKCLVYDKI NERMIRLQC >SP_1425 hypothetical protein MRITMSGNIQNSELFKFFNENSIKVVDFETKKETLKDIYLNRSK >SP_0699 hypothetical protein MSYFNDYKHKWEGKNELIFLTAILQNSLIAIF >SP_1718 hypothetical protein MNLFLIKMLSETISLFPEVLEEENFTRKKELLK >SP_0866 hypothetical protein MNKQQFIIMALFTAAETYFFNEAWMTGRYIMAAFWAILLFRNFRVSYVMG KIVDVIDQHFNRKD >SP_0174 hypothetical protein MVKHNFDVTDKTGKISSKHCFEITDKTDVV >SP_1481 hypothetical protein MICQFIRDMLDLPAKNVTILEGSNIHVLPSMPYSA >SP_1929 hypothetical protein MKREQDKLIRTVKSISNNVLEAEVYYSSFNLL >SP_1595 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1436 hypothetical protein MDNKKLKVKDLVSIGVFGVIYFAFMFGVGMMGLIPILFLIYPTVLAIVAG TVVMLFMAKVQKPWALFIFGMISPLVMFAAGHTYVVVVLSLIVMIIAELI RKIGNYNSFKYNMLSYAIFSTWICSSLMQMLLAKEKYMEWSLMTMGKDYV DVLEKLITYPHMALVALGAFLGGILGAYIGKALLKKHFSNGLYCVGYFTP CLILWCYLN >SP_1938 hypothetical protein MKRTYRDCKNILLKIFLVWGVIVDRMQTLSVLFTVSK >SP_1705 hypothetical protein MFKNFNNILLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVIFFKRELIS IFSYNDYSEANLEIPKLLLNLSLFMVGWLSVILLESDLADHYHHLIRYQS SSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMM LYLLLSYLIALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVM LIVSRLEEKFKKG >SP_0733 hypothetical protein MTVEEEKAFLARHLKATEAGEFVTIDALFQAYKKELGRSYTRDAFYQLLK RHGWRNITPRPEHPKKADAQTIVASKNKISIQEGKKAF >SP_1949 hypothetical protein MKNDFVIGKSLKELSLEEMQLVYGGTDGADPRSTIICSATLSFIASYLGS AQTRCGKDNKKK >SP_2005 hypothetical protein MFSGLDESFYHFPWELFAGFGMMSWLVREGLKLVGDVKKELEE >SP_0487 hypothetical protein MERIPLSILTFYIPKVPSYSIKEKQSKWLQSGYKSIKTDKAILSSSPIQT ILSVVESHHISLRSRTSLKRRK >SP_0861 hypothetical protein MSKKDKKIEIQVADAKVNVGKDSFEGYTLTIGKKVIGEIAELDGQFAIIK NGNVDSFYKKLEKAVEILIENYNLAK >SP_1760 conserved domain protein MIITQRQSIHWGEVGGTYMYGTTVSYYPDKSVRLYNPLLPSGEILKTWFS SVNYQAARTQPQLPLLKRKQEYQLSLVFDCQPENGVYTKITFFDRYGDIL EKKVEKVKDFIFTYPEDSYTYRVSLLSAGFESLTFYHFSIKEIRSV >SP_0560 hypothetical protein MEFLLVLCNLDYHLDKFKEPIQYLKHYLVKQQLDCKRDDHPKEFHNPNNR FDKKNSKKTKKISFSLLWLNEPPSRIH >SP_0995 IS630-Spn1, transposase Orf1 MWYNLLMAYSIDFRKKVLSYCERIGSITEASHVFQISRNTIYGWLKLKEK TGELNHQVKGTKPRKVDRDRLKNYLTDNPDAYLTEIASDFGCHPTTIHYA LKAMGYTRKKEPHLL >SP_0653 hypothetical protein MDKQYLHEKLDAMRQNFVESTHHERAMGVLDQAHMSKKMLKIKKKLVALE MERCQRKIEHKDCSKIDQKIKEQKEIFESCCKKD >SP_2047 conserved domain protein MWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQVAHYQDYALNKEKLVAF AMAKRTKDKVEQESGEQFFNLGQVSYQNKKTGLVTRVRTDKSQYEFLFPS VKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS >SP_2118 hypothetical protein MMNKYKVIYYVVVIALLVSVFLLIGMDLSWFNPYQSDQFVWVYFALIPVI EWIEKKSKNLASEKGE >SP_0449 hypothetical protein MNITNLFSIKTGCDETDRQLQKLFFQLDLQLGELTDQLRKLDSNFVPRSQ FVDTLDLNDVEYKEILNYFIFHRNDSEESLVEWLYDWISTNRYELPKEFS IRMAHKYHESVTEVFGDE >SP_1723 hypothetical protein MQAFYVKKEEKLSKDYHKINTGNQSNFYENVKDNEIKYFLTKVSNLFFKE FLMKQSKTINLKLAH >SP_1028 hypothetical protein MSTSSRVLVLKKFHGIMDGNRNVAVFFVGQ >SP_2154 IS3-Spn1, hypothetical protein, truncation MKLSYEDKVQIYELRKQGQSFKQLSKRFGVDVSGLKSSESLR >SP_1585 hypothetical protein MPIFVYHNIIKDNIIILFVFSHSVSLYKKAIHFEPLFLIYRLCYE >SP_0759 hypothetical protein MRLEAVFQPLILGEKNEIFNTQYSQLDGERSRGKIPDFA >SP_0874 hypothetical protein MTPFLAKECKGIPKIKIKNVDLTTFYQGMQKNAKE >SP_1864 conserved hypothetical protein MVEPNLESLIKDLYNHARHDLSEDLVAALLETTKKLPTTNEQLQAVRLSG LVNRELLLNPKHPAPELLNLARFVKREEAKYRGTATSALMYEELFKML >SP_1562 hypothetical protein MFNVKWTRIGKKMPENGIIGRKCFDREDIRDEFSTIIQSAILDQFVCKSM DDSYQSD >SP_1492 cell wall surface anchor family protein MVPKTATSTETKTITRIIHYVDKVTNQNVKEDVVQPVTLSRTKTENKVTG VVTYGEWTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYD RLSTPEKPIPQPNPEHPSVPTPNPELPNQETPTPDKPTPEPGTPKTETPV NPDPEVPTYETGKREELPNTGTEANATLASAGIMTLLAGLGLGFFKKKED EK >SP_1611 hypothetical protein MEQTLFELELLPEEDIIVTGLPKYCSFTCLITGR >SP_1503 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1642 hypothetical protein MKKIIFIKTIQLLVIDGIMLAFLTFKRGLTWDWILIYSGWLIFFHPVLLT YLSNQLCDHFS >SP_0612 hypothetical protein MKTDLHFENHQKNGIMVGKDSAESIRTFRIRG >SP_1304 hypothetical protein MVFLKIDKKTNVFLGNIFLKDKILKYAIRKENV >SP_0080 hypothetical protein MMPKMANRDRSPLSSSKSSSKAGLYGKIERSDKRE >SP_0854 hypothetical protein MSAYLKEALKGAAKTQTKTNFGKPDFHIEKYKQDSLVTYRN >SP_1832 hypothetical protein MVKINKICSIQGSSVENEDIVGSQNQYFWIIGGATDLYNSKEEIGYSVSE VVHILSESLSVNCKESKTLKQIFETALLEVKDEIGLNSYKLTEYSKMK >SP_0164 hypothetical protein MRSLFKKIVAVLVIGLILLGIARTPQVHKMARGIDPGPANFI >SP_0018 hypothetical protein MIMLQKIYEQMANFYDSIEEEYGPTFGDNFDWEHVHFKFLIYYLVRYGIG CRKDFIVYHYRVAYRLYLEKLVMNRGFISC >SP_1454 hypothetical protein MAHGDLLYHDGLFFSAKKEDGTYDFHENFEYVTPWLKQGD >SP_2025 hypothetical protein MRGLVLSAISNQCFELNGTRFLVCSLILIGP >SP_1303 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYYKRLQIVL >SP_1707 hypothetical protein MMEHLFKMIILLPCFYFFSWIDKDNRESKFFPIFYYFYWIYITLYALFSL AWTVFSVLFFNIVLRNLTDIKLWGIWLLLLLIAFASDWLAYVFFKKMLDL RRELGKSKGGRH >SP_1761 hypothetical protein MSKELNILQIGLANWENHYDIPENMSWYYFYPNSSKALREIIEKEDINRF HAVLIEDGQYSRDLFSYVKYFEPYTLFYNQNLQINDREVVDFLKKRCAQA IDFLSPQQLINDLSKSLFGGGYGDKLFPPTIQVNPNFTGAISYQGLDYVS LEGEFGQDFAQLAYWAYNIMVQKTLPIELWLEYEKEGNCDFRLVIRKMWS GSVDDFFEEVIVSEKDLEQALFMDSRDGDYFLSISVEARGRGTIKLGNLH QRWSRKQFGKFVLGGNILHDSKRDEINYFFHPGDFKPPLTVYFAGYRPAE GFEGYFMMKTLGCPFILFSDPRLEGGAFYLGTDELEGKVKDTITHYLDYL GFDHKDLILSGLSMGTFPALYYGASFEPHAIIVGKPLANLGTIASRGRLD APGVSNLAFDCLIHHTGGTSSQDMTELDQRFWKIFKQANFSKTTFGLSYM KDEEMDPQAYEQLVSYLCNTGAKILSKGTAGRHNDDTDTNISWFLHFYRM VLETGFGREKR >SP_1049 hypothetical protein MWSVATTKKVIILDFKLFYKHFVFVDKGNFDNKNPKR >SP_1292 SAP domain protein MNFFSKLFNLKQNNHNRDTNSDCNNFYLNELECGLTPGQLILIDWTQKTG RNYNFPRYFKYSLQIDPESTHNQLYKLGYFTKNKTLSYLTVVELKTILSK HNLATSGKKAELITRIINNVNIDNLDIPFEFKLTKEAQNLIIEHSDYIKA YYDKDITMEDYCKEKNNISFKATFGDIKWSLLNKQAHRNTVSGDFGCLSN TRKAQGRHLEQEGNIKHALIYYIESLIITISGLENNFSATDYPVYYPDSI PDYSLKHIQTLMESLSDDDYDFAFDEALFRFSILNANHFLSKEDIDYLRV NLPRSTAEEINNYLKKYECYSPLNNLELDDFE >SP_1657 hypothetical protein MRIDSIASSVTTGVSKVIVSFELLDVATVFSSLLVEVFFELYVPHATRRE RDNRATPIPINFFVLFMLKYLLFF >SP_1755 hypothetical protein MIEILIVLAIILSLALIVLVTIQPRQNQLFSMDATSNIGKPSYWQSNTLV KVLTLLVSLALFILLLTFMVITYK >SP_1820 hypothetical protein MAPRYQRPHTEVYSVCGLFSIRRLVYLLLGP >SP_1728 hypothetical protein MSVNLLTLLFIPVMVSSSGSEFQSGWQEHQLIAEKVSKTLDKTFDKDVRE IPTSQFYQKFVDEMGRTYSGNLILQELITVNGAYKATYIGELSSN >SP_0705 hypothetical protein MFGCLWYIFSTFRGLCIMKQFVQFYKKDFLAVLVYFILLLSCVLSSTVYL LRCRQYSIHPNVLEWILVLLQDMTTGVYCFPFTYILFFFYLMNNYFNRLE CRIRLKSIKHFTSFSFKLAALSTGIWTATLFLLIFLIAFSNGFSFSLEIK EVDFLREFYGISIANNASFFIGFFFSYIAYYFFLSLLTISSFSWFKKSNM SLVFLFTFLFVESLFWIYQLDNGIIGLLPIFQYMVNSNPYALIYWLTLLS IIIPLTVFSVHRNWRRV >SP_0840 hypothetical protein MKILKRYILELCFILSFALPFIKGTNADNGRCFVETYYGFTFLMEHAIVT AVFICSFLIAFLLKNDGRNGLLRVVIAF >SP_1253 hypothetical protein MAAKLWEEGKMVYASSASMTKRLKLAMSKV >SP_1174 conserved domain protein MKINKKYLAGSVAVLALSVCSYELGRYQAGQDKKESNRVAYIDGDQAGQK AENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAII SEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLKDAAHADNIRTK EEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTGD AYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQ PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPA QITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDS RPEEPSPQPTPEPSPSPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSR YIPAKDLSAETAAGIDSKLAKQESLSHKLGTKKTDLPSSDREFYNKAYDL LARIHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLVEDILAFLAPIR HPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYV TPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGA EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFD EGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQ ADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEES EEPQVETEKVEEKLREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQD NNTIMAEAEKLLALLKESK >SP_0031 hypothetical protein MIAKKIFSNPEITCQFIRDMLDLPAKNVTILEGSDIHVLLSMPYSVQDFY TSIDVLAELDNGTQVIIEIQVHHQNFFINHLWAYLCSQVNQNLEKIRQRE GDTH >SP_0503 hypothetical protein MPDIVFEIEFFHSDSLIFFYTSDDKSNSQKSQEDFSKNK >SP_0958 hypothetical protein MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLE SRIAANERAINENEEKLSQMSDTSSEEYQFAKNNLDVQKNLLTRKTEILT LLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALY PLNIKAHTLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNH LDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLVGSLISGFGQL DYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQ KMPVLFLSLIGIVGLLFGIQTIQPLQRIAHLIPFTYLRSVEILSGRLPKQ IDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRF >SP_2104 hypothetical protein MLKKYFSKYKWTDLFWILFVILTCLYIGNHDLFTLNHQEFSFRGSVWGLV LALYHLLFIDKFVISNRK >SP_0573 hypothetical protein MYNFSQSCYNQPIGIKEVTLMAVFVSLDGIVVEVLDVFSSFNGDSEFFLC IAF >SP_2115 hypothetical protein MKRVILLAVIQAVVLFFIIGALAYAFKGDFFYNYLAVVFAPIAGVLRFGT AYITEIVLPRKAAEIAEKRKAGKNSK >SP_2071 hypothetical protein MEHLVVLSFIFSKFLECGILRKRLNSDKNWRNSTKKLEKNEGKRYDRKEE ILEEEHVTY >SP_0188 hypothetical protein MSRKKYENDEKSQKKLKIGRKSDVFYGIID >SP_2141 glycosyl hydrolase-related protein MVRFTGLSLKQTQAIEVLKGHISLPDVEVAVTQSDQASISIEGEEGHYQL TYRKPHQLYRALSLLVTVLAEADKVEIEEQAAYEDLAYMVDCSRNAVLNV ASAKQMIEILALMGYSTFELYMEDTYQIEGQPYFGYFRGAYSAEELQEIE AYAQQFDVTFVPCIQTLAHLSAFVKWGVKEVQELRDVEDILLIGEEKVYD LIDGMFATLSKLKTRKVNIGMDEAHLVGLGRYLILNGVVDRSLLMCQHLE RVLDIADKYGFHCQMWSDMFFKLMSADGQYDRDVEIPEETRVYLDRLKDR VTLVYWDYYQDSEEKYNRNFRNHHKISHDLAFAGGAWKWIGFTPHNHFSR LVAIEANKACRANQIKEVIVTGWGDNGGETAQFSILPSLQIWAELSYRND LDGLSAHFKTNTGLTVEDFMQIDLANLLPDLPGNLSGINPNRYVFYQDIL CPILDQHMTPEQDKPHFAQAAETLANIKEKAGNYAYLFETQAQLNAILSS KVDVGRRIRQAYQADDKESLQQIARQELPELRSQIEDFHALFSHQWLKEN KVFGLDTVDIRMGGLLQRIKRAESRIEVYLAGQLDRIDELEVEILPFTDF YADKDFAATTANQWHTIATASTIYTT >SP_0162 hypothetical protein MSMWRDWAPMWWSFSVLSEIWYNSTNQFLGK >SP_1093 hypothetical protein MAFEKIIQLKNCRYDYTLSPSVKKFTLKDNTFFETKVGNYELTRLLEKVP NSGEGFQLKIIINKELTGAKINITDKFGLRLVDIFKSEDHHIHQEKFYFL MDSLVERGVFTKSER >SP_1579 hypothetical protein MYAFDSSLSNNRLELSDNIILCYNEEKTEVFKCQKQVISF >SP_0670 hypothetical protein MAKGFAKGLVTGVAGTVAAVAGAVYAFKKKVIEPEEQKAAFIEENRKKAA RRRVSR >SP_0906 hypothetical protein MAVKFTKRDDLDKMFEEFAKLPDLKQVTFPDDKEKKVKAEKKN >SP_2063 LysM domain protein MKKRMLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYT VQYGDTLSTIAEALGVDVTVLANLNKITNMDLIFPETVLTTTVNEAEEVT EVEIQTPQADSSEEVTTATADLTTNQVTVDDQTVQVADLSQPIAEVTKTV IASEEVAPSTGTSVPEEQTTETTRPVEEATPQETTPAEKQETQASPQAAS AVEVTTTSSEAKEVASSNGATAAVSTYQPEETKIISTTYEAPAAPDYAGL AVAKSENAGLQPQTAAFKEEIANLFGITSFSGYRPGDSGDHGKGLAIDFM VPERSELGDKIAEYAIQNMASRGISYIIWKQRFYAPFDSKYGPANTWNPM PDRGSVTENHYDHVHVSMNG >SP_1668 hypothetical protein MNLWDIFFTTQATEPPKFDLFWYVSLFTLLALTFYTAHRYREKKVYQRFF QILQTVQLILLYGWYWVNHMPLSESLPFYHCRMAMFVVLLLPGQSKYKQY FALLGTFGTLAAFVYPVPDAYPFPHITILSFIFGHLALLGNSLVYLLRQY NARLLDVKGIFLMTFALNALIFVVNLVTGGDYGFLTKPPLVGDHGLVANY LLVSIVLVATISLTKKILEFFLAQEAEKMIAKEA >SP_1629 hypothetical protein MLIYETVALVGMDSGISIKHILQKMKNKKLSQNP >SP_1217 hypothetical protein MLEIDLTVLIDLSYSYFILLYFYENLKKSVKSWISLICINKIVYMVD >SP_1640 hypothetical protein MLSTRFTGKLSKWGNYFGIVNTILSGAIDYILGNKAAIITYPVTFLIYTF AIKKWEASQEGRPNQMSQKQVKLAAIIISIIAFLFAFVTNYIGYGGKMNL LAYVTTIAFALSLIANALNALKLTTQWGFWLIYNFVQLTKAGIQGNFANI GKYIFYILNAIGALFVWNDEEVR >SP_1635 hypothetical protein MYLVIGIVLAFIVSFWKDNRSLWNPVLFLLSLISSYFYLSYLFYKNGYEN VQLAFYIFAFVLLPFLLFLSGIFLIYNGVILLKREGRSKPHYLSMLFDFY >SP_0052 hypothetical protein MKERLDDNPIVQGNWKTLGFQFRHETPLVAAAVPHKLR >SP_0491 hypothetical protein MCFVPYYKVNHCEEAFAWYQDVNLVYLVDGVKLPYSQADLGSHVFLFGSA W >SP_0142 hypothetical protein MLNLQFAETMELTEAELQDVRGGNLVNSMGGGGRSGISGWGVPGIYPGWG NQGMSPNRGAFDWTIDLADGLFGRRRR >SP_0879 hypothetical protein MYHETELATRKGNFSFFKIFLKKQSIINHNQRRECMSNYRRTSKPKTEHI KKGFTVFQKTVATIASILGLITASITIMNALDNNKNIKKEPTTSQTTTIV KEIQKESPKENTSPTKETNTSQEKTQQEETPKSSVKEEKKEDQKTATQDS STPASSKPATENEKQSNAPTSENKSNQ >SP_0134 hypothetical protein MESIIKNKKIVAINNGINVSNSDLDVVGVQDFKKEFCIPNNKKSFVMLEG WIQKKGRIDSLNLQKNYF >SP_1418 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1132 hypothetical protein MEILSKEIQLQGLQLLKQTLETLVELEKQRSSKLDLISRKELMDLLGISA TTLDNWEDLGLKRYQTPMDGAKKVFYRPSDVYLFLAIK >SP_0093 hypothetical protein MVKIRTRENMDIYILVPKKPLPSPDQPEESSDSYFRS >SP_1762 hypothetical protein MYYFIPAWYGSERTWHADITPWYFSHFRLEFDDTFHQIRLFQEQDIDSRL LVLAYQPHLRYFLYRHGVLEMDTYSVFDVMQDFHNLHTQVLSIRDIEWDD DCEFIYSPFTIIVQKNGKKFAKVEHGVEGFISDIQYFEPNGQIHMHHIVD DRGFVSSIIFFEDGQAAYQEYLNLKGEWQFRERLKEGGQVEVNPILGYRF KMLTYQNMGDLVAEFFENYLQTYVKDQDIFMLPSHSHHDQLVLDRLPSTN PKLLSLFIGRNPQDTFRDLDVTFEKSDLILVDREDSLRLLQELYPERMHQ CYHLSSFDTRLRLGRSQTKKESIIYFQLDFEQGIDNQALLQVLSFVAENK DTEVIFGAFAASQEQMNEVEGIVESFIQENIQSENLGKAIDYGDAENPLE ENQHQDLRLQFVNLNDELDLIKTLEFVRLIVDLNRHPHLYTQIAGISAGI PQINLVETVYVEHLKNGYLLADVTEFSKAAHYYTDRLKEWNESLIYSIDK IKEHTGQQFLGKLEKWIEEVKNVKGT >SP_1211 hypothetical protein MKQFKILSDKYLESITGSDGNLGPGFGVIIP >SP_0191 hypothetical protein MKKIVLVSLAFLFVLVGCGQKKETGPATKTEKDTLQSALPVIENAEKNTV VTKTLVLPKSDDGSQQTQTITYKDKTFLSLAIQQKRPVSDELKTYIDQHG VEETQKALLEAEEKDKSIIEARKLAGFKLETKLLSATELQTTTSFDFQVL DVKKASQLEHLKNIGLENLLKNEPSKYISDRLANGATEQ >SP_1677 hypothetical protein MAQKGVSLIKAAFDTDNFLMRFSEKVLDIVTANLLFVVSCLPIVTIGVAK ISLYETMFEVKKSRRVPVFKIYLRSFKQNLKLGLQLGLMELGIVFLTLSD LYLFWGQTALPFQLLKAICLGILIFLTIVMLASYPIAARYDLSWKEILQK GLMLASFNFPWFFLMLAILVLIVMVLYLSAFSLLLGGSVFLLFGFGLLVF IQTGLMEKIFAKYQ >SP_0068 hypothetical protein MDHTRLSSKDLWSAFPTSNSIMGENLAWNHDGFLKAIEQWRAEKADYVEK KIVVQTTGNLVTMSR >SP_0115 hypothetical protein MELVLPNNYVALEQEEMMYLDGGGVGRNWWNSRGSFATVLDVDLAIYSGG ATIYSAYAIKKAISANRGAITRTLRSLIIKHVGSAAGHLVNTALNVALTV TGFSLGGAIAYGADWADGSLDGYIFA >SP_0763 hypothetical protein MTIARFSRATESWNGLMVEISPCCLDFIQKSVTQALFKLLIH >SP_1793 hypothetical protein MKQKQPIVSRTKQHTFEELIQDQKLERLAKLSPDLVGRYGFTASCASSFA NLIKEAYGGKNLNVVYASRMLALWNIACSCYHKADGYSLADALFSDKKIC LDSYYYHKNTSNTITSDVIKDVYDNYNNYMVLTREATPEYIYVVQTEMPK DSDLYFYIREVLGLSFSTMHYAFLVKVLAGALARKYKPYRN >SP_1930 hypothetical protein MRERSATGAQGLSKSIKKHLNDLTRLTASLLGDEKLSAITSSSAVKADMH RFVIELEPVKSTILQNNDISLDQNEIFEILKNFLDG >SP_1729 IS1381, transposase OrfA/OrfB, truncation MREYRTYEEIAADFGIHESNLIRRSQWVEVTLVQSGVTISRTPLSFEDTI MIDVTEVKINRPKKQLANDSGKKKFHAMKAQAIVTSQGRIVSLDIAVNYS HDMKLFKMSCRNIGQAGKILADSDYQGLMKIYPQAQTPRKSSKLKPLTAE DKACNHALSKERSKVENIFAKVKTFKMFSTTYRNHRKRFGLRMNLIAGII NHELGF >SP_1047 hypothetical protein MNCRGHETRQRIVRDFEVQPKAHIKLLANQQKHSDAGATIEDEYYVFIAE SKIDGKKEVIQCCMGAARDFLELINHKGLPLFNPLVGDSHVNNRQEYDNT GSGNL >SP_2039 conserved hypothetical protein MLNKIRDYLDFAGLQYRNPDKAGAEREKMLAFRHKGQEARKVFTELAKAF QASHPEWQLQQTSQWMNQAQRLRPHFWVYLQRDGQVTEPMMALRLYGTST DFGISLEVSFIERKKDEQTLGKQAKVLDIPTVKGIYYLTYSNGQSQRWEA NEEKRRTLREKVRSQEVRKVLVKVDVPMTENSSEEEIVEGLLKSYSKILP YYLATRK >SP_1349 hypothetical protein MTFLDDYHKKHNYPLFYESYLQNVMEFLESQDIKNGVDAFVDDHQNLVFV LYGQGYRAEGKEGILTTQVTVKAYDEDKKPINFANLLDSLIY >SP_2187 conserved domain protein MYLGDLMEKAECGQFSILSFLLQESQTTVKAVMEETGFSKATLTKYVTLL NDKALDSGLELAIHSEDENLRLSIGAATKGRDIRSLFLESAVKYQILVYL LYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEH QIHYFYFCLFRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLD LVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLRLLRKVPSFFAGQH IPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMK KEELLGDYTEDHVTYELSQLCAQVYLYKGYILQDRYKYQLENRHPYLLME HDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQHMRIGL DLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKE QTPVYYLKNDLDMEDLVAIRQLLFT >SP_1039 hypothetical protein MDNDWNGLADLIANLIAKYAGALDLDNLPDPTPAKNQEMKNSFDMAKTQI ETD >SP_0327 hypothetical protein MKEERRQFFERVDGNQCRDYILSHCSKDYEKVKSSLERLMDNRFMFDSPW DMEPCSKIHQIQPMVWDQVFEDDPEWSYMLNRQEYLLQFMIGYLVEGDKD YIQKCKFFLFDWIEQVREFSPQSLMTRTLDTGIRSFTWLKLLLLLLKFDL LEEKELEKILVSLEKQIDFMKSYYRAKYTLSNWGILQTIPMLAIYHFFSD KMDLEEAYHFASEELKQQIETQILGDGSQFEQSILYHVEVYKALLDLCLL LPDLQDSYQELLEKMATYIQMMTGLDGRTLAFGDSDSTETTEILSLSAVV LNQEDLLNGLDVKVDLLSLLFLGREKVKRLQEFEKRAWQPKSMIFEDSGH VCIKDEHRYLFFKNGPLGSAHSHSDENSFCLQYQGQPIFIDAGRYSYREI YERYLLKSAWSHSTCIVDGKAPERITGSWEYEYYPHSLFCHHKEREGVHY IEGAYWSAEPDLPYLHRRKILMLVEDVWLLVDDIRCQGQHEVLTQFILDK DVTYQDGKINQLRLWSEVDFDLEDTIISPKSPIIHPKSKNHPESLP >SP_0361 IS1167, transposase, truncation MGGYFRNFENFKKRIFIALNIKKERTKFVLSQA >SP_0679 hypothetical protein MGILSIILGLLFPIVGLILGIIGLVLAISYQKESQLDYKIEKILNILGIV ISVVNWIVAIALIFR >SP_2147 hypothetical protein MLLQNLSSKECCIFLSFHTNISLYTDPSDNQMIIGHFLIICIFFEKFFKK ILYPFYRSLCICIYVFYPMLCIIT >SP_0853 hypothetical protein MKENNMLVNWKLESDVNDYVKKQFENLGLKKLQDY >SP_1494 hypothetical protein MLNVDQDFMSISKSNKSGSDWKKTFTVRITNRLANDLNNVLKQVDKDTPN TPTWLNSAASKAKDDDRVYKLLKTLIPGENYLSC >SP_0328 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0201 hypothetical protein MLSHAWDDTATLYRKSERLSPSAILSPLHYTATEENRNKLLNDLKEKQPK VIVVNDKVVVWSEVETLLKENYQQVKTDYSEFKVYKIK >SP_1556 hypothetical protein MNTDYLPLEKRCLSCLSWKVSAIPYFSEAAKVLCILIIEHAWNPV >SP_1351 hypothetical protein MIWVKATQLVDEMEQVGLIRFDEFGNVGILVLEGQ >SP_0172 hypothetical protein MIFIEYTTILLPLARDFVYVEGLGSYVVELFCSF >SP_1822 conserved domain protein MKDYKINFDLGKIEYFDNNCLIQVYKFISFYDICEMVFAFHLPPDELITN VIFKEKINSMLKCYIDRLLYVFINPTHFTEKVNLQFYGSFFSYEFICREV GNILKNKGVKCNLNFFEGEEYL >SP_1091 hypothetical protein MQIFYIKTKIFLSFFLFLLIFSQCFYKIEE >SP_1092 hypothetical protein MVSLQNRKDRAKMFELTYKDCYHVERTLKYEDHEALMLTLSGCVTLPDTL YVTSLTFRGKKVYQGLVGDLYRFLSHADFLHQN >SP_1490 hypothetical protein MLEIDLTVLNDLPYSCFILLYKPETVYIFSK >SP_1058 hypothetical protein MRSLFRKIVALLVIGLILLGTAGGTQVHKMARGIDPGPANGIYR >SP_1043 hypothetical protein MVVYIRQSKLPSEVSINKYNAQVGAYLQGEEVILYQSFSEIKELTSEDIV VDYIMETRALLKMMGLNVPVHDYPIELKEFYGRKIYAGILGEIVNIPDNW GKFIKPKAGSKVFTGRVVNGTHDLIGIGLPFDYPIWISEVVEFIAEWRCF VLDGRVLDVRPYTGDYHAQFDASVIDEAISCWKDAPIAYGLDIGVTRDGR TLVVEVNDGYALGNYGLSPLKSINFHRARWKEMVKPYFEKNEIFKIQQDV IF >SP_0072 hypothetical protein MQIAGIIFNSSTTNGDKVDFNPTENVDLRNNFASLVK >SP_0428 hypothetical protein MRLERSHFLWKNVIQVEKLPVEEMGYKIDKKRNIL >SP_1175 conserved domain protein MILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGI NAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYKLKD EDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHRE GGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHY IPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNP GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPA QITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDS RPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEEKGI SRYVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFYDKAY NLLTEAHKALFENKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAP ITHPERLGKPNSQIEYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDA YVTPHMGHSHWIGKDSLSDKEKVAAQAYTKEKGILPPSPDADVKANPTGD SAAAIYNRVKGEKRIPLVRLPYMVEHTVEVKNGNLIIPHKDHYHNIKFAW FDDHTYKAPNGYTLEDLFATIKYYVEHPDERPHSNDGWGNASEHVLGKKD HSEDPNKNFKADEEPVEETPAEPEVPQVETEKVEAQLKEAEVLLAKVTDS SLKANATETLAGLRNNLTLQIMDNNSIMAEAEKLLALLKGSNPSSVSKEK IN >SP_1775 conserved domain protein MRAQSFFLTFSFIRSKIKLALNKGVLNMIEITYIDASKNERTVTFESYED FERSQQACLIGVADYYPVQKLTYKGHNLDYHGTYGDIFFYLMKQDLSQYN >SP_1628 hypothetical protein MIIMRRFYSHLPYYLVILFFYWPLYELFLLVVSDPLTLKGLYINNLLFFT PLVILIVSLLYSYRFRFSL >SP_1836 hypothetical protein MMSSKDSKCYTKLLTSYFKPRDSHKKGKSYKNSLSEEKGASTTGVNYYQL KTTVFTLN >SP_1866 hypothetical protein MHVVQNLVNIPKLTRIFISKQKNNTPSKLFGFFMKFTENS >SP_1787 hypothetical protein MVLSGGKSAMPMTQKEMVKLLTAHGWIKTRGGKGSHIKMEKQGERPITIL HGELNKYTERGIRKQAGL >SP_0451 hypothetical protein MAKSNFEKVESVVGWVRDKKITGYRISKETNAREMSIIALAQGRAKVKNI SFETALGLIDFYEKNYEKFED >SP_0138 hypothetical protein MHSFGIHPRYPYKMSEIVEMSKKISDLNDKKEFHKKNCELRFQDSVLDNI NSVPGTTHLFSCLNIQTLVKIILYIFRYNSCMKIFNFRDKELIFFNKIVN GLIQNIEENLEDDIERILKYLYICLFNEIFIIKNKVNFFDDVEFNQTLSE FLDKL >SP_1679 hypothetical protein MVSSKYAARASFLDGQGITVDEMAWIIRGIVNALIGRYIKLGTYAAKYGI SMARSILSRVAATAAARVGLLTKISGWILRVAVNVADVYGNFANNIAAAW DAYDKIPNNGRINF >SP_1818 hypothetical protein MRMMFSSLVSDEPNFTAFSACFQQPQALCERKNCNFSIYYFLASSSLQSQ LGPCLHDQRH >SP_0518 hypothetical protein MSVLDEEYLKNTRKVYNDFCNQADNYRTSKDFIDNIPIEYLARYRELY >SP_0448 hypothetical protein MYNVITPSVIVLADQNKADWSYDENAVINIYDDANFEDGRLHMNFEQFFK LAQIAREEGLEIHSPFERAGATKSARYIAKWILRNKKH >SP_1265 hypothetical protein MSVRKNKFFKSRDYKACLRKNSKTLTNKNKMVIIKNGLK >SP_1363 conserved domain protein MFEHYSVADLFANLYKKRKANILALIALFALIAVPFTIKAVRNKNTVKDT TSYSTYLIYKITPPKESDKTILNHQIGGYSDFYGKLIDGNLNGAYLFNDV EPSELKKIASELDTTETTLKNSTNDYWWKKLTVYYMIDDAGVGVKILTSS KDANNLLEKKIDGLIEKFKHAYANVKIEKLETINSKELNANGETALGLNV KNLILRLVVIGVVCVILVVMGNVLVYLFNPTINRVGDFSQYQIDFVTEIT TIANLADVLSYKNTGQELTIVSSNKAILDKLKQSQEALKGMHFVDLQDVS SLLERDTVLLVEEYGVTRYKKFEQSLQILRNLNRSILGVATFKL >SP_1025 hypothetical protein MPSHYTRTKTFMDIYIKKAIIHQFSPDDTELFLADKFLNITPKIEEYLRK KIEHVYSDEAKTGIFEEENPFFNHITDDLLETSVTLANLWKEEFSISENL KTNDLIFVQFSKEGVEHFAFLRIALRETLTHLGGEVDNPIKLTQNNLPGF GTGADEALVVNLQSSKYHLIEKRIKYNGTFLNYFSDNLLAVAPKISPKKS IKELEKTAQRIAKSFNTDDFQFQSKVKSAIFNNLEESNELSPEKLANDLF DNNLTARLSFIDQVKEAVPEPVQFDEIDASRQLKKFENQKLSLSNGIELI VPNNVYQDAESVEFIQNENGTYSILIKNIEDIQSK >SP_1307 hypothetical protein MKKENEYVILTTASLGVMIGIVFAIFLDFPVEYGISLGLLNGIVLGSLIV YKNNKN >SP_0277 hypothetical protein MKLRIFAEDKPAKKVFEYQLELADRTILLSTALLSGAIALAGIFSALKEK >SP_0343 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0244 hypothetical protein MSHSFKKSLQKEILHRSSIAAFVTSRAFSDTVSPI >SP_0587 hypothetical protein MLGSLSFFLRNETKIQVIKIPSRQATQRKIGNRTTERLLLGRFIFFHRVV GKFSFQDTSLERFNTKVSKAFTLIAIGRLAKCFTNSLVK >SP_0183 hypothetical protein MQDQHAIKNKKTIKATAGAVAFSLTFLSYIQ >SP_1932 hypothetical protein MTLKDDDDPRIEEESEALENMILQYLGEDDAS >SP_0911 hypothetical protein MKHKEHILIGLLYLLSPFIGQLLVEHTHFISTEFTGTAYVICWLSVVISI HHFSKNVLSQQQK >SP_1133 hypothetical protein MKIVTFKPTKQIDDGFYLPGIDILFVSDKADAKDKEDVILFLSRNGLNKS >SP_0774 hypothetical protein MLSVNTILEKFYKEHQVKPFISPERELDTWLLSPKPVPKRNMDLLVDDSL AGDIILLWRIQFGTFTTET >SP_1189 hypothetical protein MVSGSVFADSALTTVDKANDIVLNVDGNKFYNVSVSEDIVNAGQILEDYF YVDKFGNINLKGTPEELAKNIGISVQEASLMYGAVKELPNVYERGPVGFR FNLGPQVRGMGGWAAGAFATGYAGWHLKQFAVNPVTSGFVAVISGAIGWA VKTAVENYWTVAVATVEVPFVNLVYTIDLP >SP_1805 hypothetical protein MSVEEKLNQAKGSIKEGVGKAIGDEKMEKEGAAEKVVSKVKEVAEDAKDA VEGAVEGVKNMLSGDDK >SP_1703 conserved domain protein MAHLKSFITRYSKVYIGLVLLIWLSFFFIPWDKPLLGIRIDIFIIQKILL AFGILSILMALLSKKVSLFVFGLICCLSLWINLFITFAILPIFGN >SP_1819 hypothetical protein MMNLSSIYSSMPTTKSKQKGWTNTKKASNTQ >SP_1962 hypothetical protein MIVGSNPSTAFYNLIYQVSNEQKAQFEGLFLFSLE >SP_0772 hypothetical protein MDFFYNGIAITPNTYLSAWFVNFIAALPLNFLIVEPIARFILSSFQKPFT GEEVEDFQDDDEIPTII >SP_1643 hypothetical protein MILSLVSLSDIPLFLQGTLLILGHLIPSYRICQSLKRDFPQAYQEPISFW SIL >SP_0114 hypothetical protein MQRLEVYKNYQHLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKN KKSYDRLAICYVRIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVETH YQPKKL >SP_1333 hypothetical protein MAERFWENLSIILAERNISWIELTRKMFAGEFHYPSELNRLYQKIRHYKM EQRMPQSPWVERIVQILDLDYEDLFRR >SP_1345 hypothetical protein MTKQMKLMECDLVHSVQIVAVTGVLLVGKIVNSFKL >SP_1931 hypothetical protein MMPANTKVIFQEMFADFQNYYVLIGGTATSIVLDSQGFKSRTTKDYDMVI IDEVKNKEFYTTLNHFLELGEYQGSQKDEKAQLFRFTTTNPEFPSMIELF SILPEYPLKKDGREIPLHFDQDASLSALLLDEDYYNILVHEKETIQGYSV LSNCGLYSSKISSNHVSFHLQPQNSVLSSLQLAS >SP_1065 hypothetical protein MLLFYVGRLNGQKYHCISFVSWHIKYRSILLMILKSLLSWRAIQICSTNE ESTILGYSVVKGQIKNLTHFQTL >SP_1443 IS66 family element, Orf1 MEFLLYTISKVKLLEDILMPQPIVPVEIPQSRRFDSKKRNDILLKIRIGK LEVSFFQSLNLEMVEQLLDKVLLYDNSSI >SP_1181 hypothetical protein MVNFSCLSILSPRFLLLSYHKLAHHSNTKTTK >SP_1925 hypothetical protein MSQSSYLSPLLWLKKEADKEKMSATQCQIFFFYYQMFELLFARESDMKDL CLGTKGFYFSQLEKNLLSGVSRFLKNLEGKVTLKANQEVSARKALFLALT TSQSDWQELAPVFDFYQTIGRLENPSLLSSQDRQHLMWIYQSALEKDYIV KVIGDKHFVLKRQDATKLTARQTQTLEILSQSEDLVNPVYVTLGEKGVLL LD >SP_1146 hypothetical protein MDFLNHSFDTKKVINTKINAVNSKNNVGKNFIDVYREMKEVPNNKIHQSK VNVALIKK >SP_0108 hypothetical protein MQRLEVYKNYQRLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKN KKSYDRLAICYVPIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVEIY YQAKER >SP_1233 hypothetical protein MSGLLYHTSVYAVKKEILVNTRKKTQFMTMTALLTAIAILIPIVMPFKIV IPPASYTLGSHIAIFIAMFLSPLMAVFVILASSFGFLMAGYPMVIVFRAF SHISFGALGALYLQKFPDTLDKPKSSWIFNFVLAVVHALAEVLACVVFYA TSGTNVENMFYVLFVLVGFGTIIHSMVDYTLALAVYKVLRKRR >SP_0316 hypothetical protein MVYLRAISPNHQPAPKDAGFHVVQALLLLTRHLPL >SP_1103 hypothetical protein MGLMAMLLITIRRENQALVNNKDYPLEMKGTLEIL >SP_0834 hemolysin-related protein MSAQITINHKKARYVRIELEGYNALSLAEVEVFCFIATNAETATQVSKPV QPISQTPVKDKTLTIQHSGAYIARYSITWEEVPVDKDGNQVVRSHSWEGS GRNQTAGFVLNLPIKENMRNLRVKIEKKTGLLWNRWQTIYENRPILAQPH RKITHWGTTLNSKVSDDDVL >SP_0495 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_1031 hypothetical protein MNDVAIILETKSEERDISKQIFIDELMKNIDII >SP_0809 hypothetical protein MKSTKEEIQTIKTLLKDSRTAKYHKRLQIVL >SP_0821 hypothetical protein MKIFVNLDYKKILFVRQKGFYLDMQGQSQLVLD >SP_0683 hypothetical protein MKPLSYVIRITFLLFVVKEKIEFFRYFTILPL >SP_1757 conserved hypothetical protein MIELYDSYSQESRDLHESLGATGLSQLGVVIDADGFLPDGLLSPFTYYLG YEDGKPLYFNQVPVSDFWEILGDNQSACIEDVTQERAVIHYADGMQARLV KQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVL LENHVTGDILLTLPGQSMRYFANKVEFITFFLQDLEIDTSQLIFNTLATP FLVSFHHPDKSGSDVLVWQEPLYDAIPGNMQLILESDNVRTKKIIIPNKA TYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEA IAGALPDVTFRIAAVTEMSSKLLDMLCYPNVALYQNASPQKIQELYQLSD IYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFESSEVA ALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGG >SP_0126 hypothetical protein MSRMSKFVIELSSFFLVHFYIRKRKGKVSIFLNYF >SP_1352 IS1380-Spn1, transposase MNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYL VTNDQRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGG QLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDST HFTTYGKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEE ADSFITPVLERFNQLLFRMDSGFATPKLYDLIEKTGQYYLIKLKKNTVLS RLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNL FYDVISLVTNMTSGTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTL IKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFLHVVGKCVRT GRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMH >SP_0706 hypothetical protein MEMGKLSSHMWRLNQIIYTKYFWGYVLFWILICLGLWYWLEGNDRLVIEI LKGPNLSQNSFLVLSIWLLHWFIIHTFFLAVVYRRRASDFFMEVIRFSSI KLWIRYQIWTCFLYGLILIMVKVLVIQFMLQLPNWDIGVLFIVDSLNACV LVLFCFMLYALGANVQMNFACVSFFLLMIVFGGLFVGNRTNYLFYILNRG NGDIGRDLFLQLLFLVFLFQSIFYFTRQKRRFIE >SP_0040 hypothetical protein MSCYKLDKFSITVSINFFHPNLELASKRLELAFLFQNHLYF >SP_0693 hypothetical protein MRIVDKIKILPTPYEGHYHLYIPSSKKHVLVGKQEKNG >SP_0513 hypothetical protein MNSRVEFRIFTIVDLDKEEEHLHEMHLKGWRYRTSRFGLFYFDQCQPDDV IYRIYDSRFLKKV >SP_2124 hypothetical protein MINILYFLIILTIWQVFDEFSEKYDKMKKIRNQGEVYGADWKSL >SP_0269 hypothetical protein MFSAIFIQKLISNITNKKEKYLDKQGKILLQ >SP_1948 conserved domain protein MTNFNSNEKFCGKSLKSLSADEMSLIYGASDGAEPRWTPTPIILKSAAAS SKVCISAAVSGIGGLVSYNNDCLG >SP_2004 hypothetical protein MKRGIIYFFIGLSLLVWLVEMFTGWFDQTLLRQFIRGALGFGFMIFVVFL MRMEWLKGEYHEYD >SP_0528 blpC, peptide pheromone BlpC MDKKQNLTSFQELTTTELNQITGGGLWEDLLYNINRYAHYIT >SP_0531 blpI, bacteriocin BlpI MNTKMMEQFSVMDNEELEIVSGGRGNLGSAIGGCIGAVLLAAATGPITGG AATLICVGSGIMSSL >SP_0532 blpJ, bacteriocin BlpJ MNTKMLSQLEVMDTEMLAKVEGGYSSTDCQNALITGVTTGIITGGTGAGL ATLGVAGLAGAFVGAHIGAIGGGLTCLGGMVGDKLGLSW >SP_0533 blpK, bacteriocin BlpK MDTKMMSQFSVMDTEMLACVEGGGCNWGDFAKAGVGGGAARGLQLGIKTG TWQGAATGAAGGAILGGVAYAATCWW >SP_0539 blpM, bacteriocin BlpM MDTKIMEQFHEMDITMLSSIEGGKNNWQTNVLEGGGAAFGGWGLGTAICA ASGVGAPFMGACGYIGAKFGVDLWAGVTGATGGF >SP_0540 blpN, BlpN protein MNTYCNINETMLSEVYGGNSGGAAVVAALGCAAGGVKYGRLLGPWGAAIG GIGGAVVCGYLAYTATS >SP_0541 blpO, bacteriocin BlpO MDTKMMSQFAVMDNEMLACVEGGDIDWGRKISCAAGVAYGAIDGCATTV >SP_0524 blpT, BlpT protein, fusion MTDTDPIKRAHTLITDLNKAYQACKQASADDVRFQEQLNSILGFLAKAET VDNRFLIELEKFYQTSSLLMGLSALDPDAPTRAAWRAYDRFHFDQVKTKL ILNENQRAN >SP_0041 blpU, bacteriocin BlpU MNTKTMSQFEIMDTEMLACVEGGGCNWGDFAKAGVGGGAARGLQLGIKTR TWQGAATGAVGGAILGGVAYAATCWW >SP_0544 blpX, immunity protein BlpX MEVFNMKYRLFFVIFLSSVLDILLGTFLQISIVSIGWLVLYSGLFEAGVF LLANKGVAVKIKEVDIRNRFKFIFGKTLWFQILLLIFLIIKLYLGLDARL ILFYGHIFIVFNALMYLLSSSQVSLKKNKLSS >SP_0546 blpZ, BlpZ protein, fusion MYKHLFFLDSKTLDRLTPYILVLASDTIAFNVFVLTFVSAVVFNFLNSML ALMAIFIGAGYVVGFWLLILNENQRAN >SP_0123 ccs1, competence-induced protein Ccs1 MGWNFRVVNLLSLHSQTKNPSISRASKHLIQSKIQTKRHRSRGGEKASRE SQRSFSTRTWFVVVPVWQIEDSPHKRKQQKQ >SP_0200 ccs4, competence-induced protein Ccs4 MSVYGRVEEVHKENREPLEYQIEQESHHRESSRLPLVKILLWSTLVTGIT LGVPLLLDLMSAQEVQDFYAGWALHQTGKIYSDYYGSQGLLYYLLTYVSQ GGFFFAIFEWLALVAGGFFLFRSADTLTEQGDQAGHLVTIFYMLVTGLAF GGGYATLLALPFLFAAFSLVAAYLSNPSHDKGFVRIGLALAGGFFFAPLS SLLFIAVVSLGLLVFNLGHRRFAHGFYQFLAVALGFSLVFYPTAYYSAAT GSFGDAISGIRYPIDSIRFDFTSKILENMFFYGLLSLGLGFVVCIFLGLF QSKPFKLYVISVPASLVVILGLILLFFSQEPLHASYLMVVFPVFLLLLVT NIKSQQRGRSARRSRRETPVSLWSRFFKGNLYLLVFGFVYLLSVPFLMKF VLYPVPYQERNRLADLVKEETNTEDAISCMG >SP_2237 comC2, competence stimulating peptide 2 MKNTVKLEQFVALKEKDLQKIKGGEMRISRIILDFLFLRKK >SP_1449 cppA, cppA protein MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLE KLVLEEAPSMRTRKVEGRKKLARLIVKVENPLEIEGILSKTDSIHRLYKG QNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEI SMELHLPTDIESFLESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLV NELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEV >SP_0352 cps4G, capsular polysaccharide biosynthesis protein Cps4G MRVLFILSDNIYLTPYFNFYKELLKKLSISYDVIYWDKNINEIITKQNYY RISFSGKGKLSKILGYVKFRKEIKKKLKENDYDMILPLHSIVSFILVDFL LFSFKNRYIYDIRDYSYEKFLVYRLVQKQLVKNSLMNIVSSDGYKFFLPM GEYFTTHNLPNMIELNEVKQLKNNSTFPIQLSYIGLIRFQEQNKKIIDFF ANDSRFQLNFIGTNAGELREFCQEKNISNVNLVDTFQPKDTMSFYKNTDV VLNLYGNHTPLLDYALSNKLYFAALLYKPILVCEDTYMEKVSIENGFGFV LPMKDESEKDCLALYIQNLDRKQLIKNCDNFMDRISLEKQKTEIELEKRI LSLRKKND >SP_1850 dpnC, type II restriction endonuclease DpnI MELHFNLELVETYKSNSQKARILTEDWVYRQSYCPNCGNNPLNHFENNRP VADFYCNHCSEEFELKSKKGNFSSTINDGAYATMMKRVQADNNPNFFFLT YTKNFEVNNFLVLPKQFVTPKSIIQRKPLAPTARRAGWIGCNIDLSQVPS KGRIFLVQDGQVRDPEKVTKEFKQGLFLRKSSLSSRGWTIEILNCIDKIE GSEFTLEDMYRFESDLKNIFVKNNHIKEKIRQQLQILRDKEIIEFKGRGK YRKL >SP_1849 dpnD, DpnD protein MKTKQLVASEEVYDFLKVIWPDYETESRYDNLSLIVCTLSDPDCVRWLSE NMKFGDEKQLALMKEKYGWEVGDKLPEWLHSSYHRLLLIGELLESNLKLK KYTVEITETLSRLVSIEAENPDEAERLVREKYKSCEIVLDADDFQDYDTS IYE >SP_1964 endA, DNA-entry nuclease MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPS QALAESVLTDAVKSQIKGSLEWNGSGAFIVNGNKTNLDAKVSSKPYADNK TKTVGKETVPTVANALLSKATRQYKNRKETGNGSTSWTPPGWHQVKNLKG SYTHAVDRGHLLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTG QNYYESKVRKALDQNKRVRYRVTLYYASNEDLVPSASQIEAKSSDGELEF NVLVPNVQKGLQLDYRTGEVTVTQ >SP_1573 lytC, lysozyme MKTKIGLASICLLGLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKT QTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYKAWFYINSDGRYS QNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNK WYYFKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTY ANQEWQKVGGKWYYFKKWGYMARNEWQGNYYLTGSGAMATDEVIMDGTRY IFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVIDISEHN GRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYL YTYAENETDAESDAKQTIELIKKYNMNLSYPIYYDVENWEYVNKSKRAPS DTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLLQTRLKHPDILKHVNWVA AYTNALEWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWY >SP_0602 pep27, pep27 protein MRKEFHNVLSSGQLLADKRPARDYNRK