TitleGenColors Logo

Gene list

Applied filters:

COG category: General function prediction only
Organism: Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334
Gene type: CDS

Number of genes found: 230

Free access
Sort by:

 



# Streptococcus pneumoniae TIGR4, TIGR4; ATCC BAA-334

>SP_0955 DNA internalization-related competence protein ComEC/Rec2
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPW
KSAGKVLIICGIFGFWFVFQNWQQSQASQNLADSVERVRILPDTIKVNGD
SLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQR
NFGGFNYQAYLKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVW
IKTHFPDPMGNYMTGLLLGHLDTDFEEMNELYSSLGIIHLFALSGMQVGF
FMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLA
QHGVKGLDNFALTVLVLFIVMPNFFLTAGGVLSCAYAFILTMTSKEGEGL
KAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLI
SLALVYDLRKNIKGLTVLSLLITGLFFLTKYPLENEITMLDVGQGESIFL
RDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKI
DQLILTNTDKEHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTK
VRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKFLDKQFLFTGN
LEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVG
KSNRMKLPHQETLTRLEGINSKVYRTDQQGAIRFKGLDSWKIESVR
>SP_1627 conserved hypothetical protein
MKLAVIAANGQAGKAIVEEAVKRGHEVTAIVRSENKSQAESIIKKDLFEL
TKDDLTGFDAVISAFGAYTPDTLPLHSKSIELFNQLLAGTQTRFLVVGGA
GSLYIDETKTTRLLDTPDFPEEFKPLAKAQADELDLLRTKNNLNWTFVSP
AVDFIPDGEKTGNYILAGEIFTTNEKGISQISYADYAIGLVDELEKGHHI
KERISLLEK
>SP_1612 conserved domain protein
MDLLEKECLKCDKNFQQGDIWNYYYLSDKMPAQGWKIHISSQIKDAVNIF
KIVYKLSQLNNCSFKVVKNLEELKKINSPREMSPTANKFITLYPKSESEA
KSMICNLTNRLSEFKAPKILSDYQCGMHSPVHYRYGAFLKKQAYDEKNKK
VIYLLLDEKRKNYVEDKRQNFPSLPSWKMDLFSEEEKRIYFQTTCEVSSK
DSAINKYKMEKIIKRSNKGNVYRAIRKSDGQKVIIKQSRPFVNYDAEGEW
TALDDIKNEAHMLKKLADKSYTTNLTDEFYIVDDYFLVQEQVDGLNFEEF
IRETEHSLNIREKTLDNIVNIVSYIHKLGI
>SP_1518 conserved hypothetical protein
MSEKSREEEKLSFKEQILRDLEKVKGYDEVLKEDEAVVRTPANEPSTEEL
MADSLSTVEEIMRKAPTVPTHPSQGVPASPADEIQRETPGVPSHPSQDVP
SSPAEESGSRPGPGPVRPKKLEREYNETPTRVAVSYTTAEKKAEQAGPET
PTPATETVDIIRDTSRRSRREGAKPVKPKKEKKSHVKAFVISFLVFLALL
SAGGYFGYQYVLDSLLPIDANSKKYVTVGIPEGSNVQEIGTTLEKAGLVK
HGLIFSFYAKYKNYTDLKAGYYNLQKSMSTEDLLKELQKGGTDEPQEPVL
ATLTIPEGYTLDQIAQAVGQLQGDFKESLTAEAFLAKVQDETFISQAVAK
YPTLLESLPVKDSGARYRLEGYLFPATYSIKESTTIESLIDEMLAAMDKN
LSPYYSTIKSKNLTVNELLTIASLVEKEGAKTEDRKLIAGVFYNRLNRDM
PLQSNIAILYAQGKLGQNISLAEDVAIDTNIDSPYNVYKNVGLMPGPVDS
PSLDAIESSINQTKSDNLYFVADVTEGKVYYANNQEDHDRNVAEHVNSKL
N
>SP_0073 conserved hypothetical protein
MTYEYQSHIYLAEAVLNVKDLVSQTVFYQQIIGLEILSQTDTEVVLGLGG
KALVHLIQAQEGGEVREHYGLYHLAILLPTRKALADVLKHLTDLQIPLVG
GADHGYSEALYLEDLEGNGIELYRDKPVSTWDIREDGRIIGVTEVLAAQD
IYELGERVEPFILAEGTRMGHIHLSVKDSRKSRQFYQTVLGLEDKFSVPS
ASWIAAGDYHHHLAVNEWGGKGLDPRKQVLPGLAYYVIEVAHKEELLTIA
QRAQEVDAPIKWMTSIQLEITDSDGIVTRIRLAR
>SP_1951 conserved hypothetical protein
MNKIFIYAGVRNHNSKTLEYTKRLSSIISSRNNVDISFRTPFNSELEISN
SDSEELFKKGIDRQSNADDGGVIKKELLESDIIIISSPVYLQNVSVDTKN
FIERIGGWSHLFRLAGKFVVTLDVAESNGSDNVSEYLRDIFSYMGGQILH
QVSITNSLKDIAEAQLMEATYKIEDVLEGKIKYKTTDYQERAYQTLKLIL
ENYDSEHFEKMYWEKKRLFEANSLEEWYYVENIK
>SP_1290 conserved hypothetical protein
MNEKVFRDPVHNYIHVNNQIIYDLINTKEFQRLRRIKQLGTSSYTFHGGE
HSRFSHCLGVYEIARRITEIFEEKYPEEWNPAESLLTMTAALLHDLGHGA
YSHTFEHLFDTDHEAITQEIIQNPETEIHQVLLQVAPDFPEKVASVIDHT
YPNKQVVQLISSQIDADRMDYLLRDSYFTGASYGEFDLTRILRVIRPIEN
GIAFQRNGMHAIEDYVLSRYQMYMQVYFHPATRAMEVLLQNLLKRAKELY
PEDKDFFARTSPHLLPFFEKNVTLTDYLALDDGVMNTYFQLWMTSPDKIL
ADLSHRFVNRKVFKSITFSQEDQDQLTSMRKLVEDIGFDPDYYTAIHKNF
DLPYDIYRPESENPRTQIEILQKNGELAELSSLSPIVQSLAGSRHGDNRF
YFPKEMLDQNSIFASITQQFLHLIENDHFTPNKN
>SP_0628 HIT family protein
MLILRARWWTNMCLICQRIDLIKKEENPYFVKELETGYLVVGDHQYFEGY
SLFLAKEHVSELHHLKKETRLRFLEEMSLVQEAVAKAFAAEKMNIELLGN
GDAHLHWHLFPRRTGDMNGHGLKGRGPVWWVPFEEMTAETCQAKPDEIKR
LVKRLSSEVDKLLEIKE
>SP_0794 MutT/nudix family protein
MEIKNHFGVYCVCFENGKLLCIEKTRGPYQHRYDLPGGSQQLGEGLTETL
TREVMEETGFTVRSYSNPRIYDVFVREELKNFMVHHVIALYDVEMNESAP
QVTISEAVSDGANDSLGYIWMDIQEITEENASPLVLKVKSELLGFPELDK
TSYMNWKVNDEKSTCP
>SP_1686 oxidoreductase, Gfo/Idh/MocA family
MVKYGVVGTGYFGAELARYMQKNDGAEITLLYDPDNAEAIAEELGAKVAS
SLDELVSSDEVDCVIVATPNNLHKEPVIKAAQHGKNVFCEKPIALSYQDC
REMVDACKENNVTFMAGHIMNFFNGVHHAKELINQGVIGDVLYCHTARNG
WEEQQPSVSWKKIREKSGGHLYHHIHELDCVQFLMGGMPETVTMTGGNVA
HEGEHFGDEDDMIFVNMEFSNKRFALLEWGSAYRWGEHYVLIQGSKGAIR
LDLFNCKGTLKLDGQESYFLIHESQEEDDDRTRIYHSTEMDGAIAYGKPG
KRTPLWLSSVIDKEMRYLHEIMEGAPVSEEFAKLLTGEAALEAIATADAC
TQSMFEDRKVKLSEIVK
>SP_0989 MutT/nudix family protein
MEFEEKTLSRKEIYQGPIFKLVQDQVELPEGKGTAQRDLIFHNGAVCVLA
VTDEQKLILVKQYRKAIEAVSYEIPAGKLEVGENTAPVAAALRELEEETA
YTGKLELLYDFYSAIGFCNEKLKLYLASDLTKVENPRPQDEDETLEVLEV
SLEEAKELIQSGHICDAKTIMAVQYWELQKK
>SP_0580 acetyltransferase, GNAT family
MIRKVEMADVEVLAKIAKQTFRETFAYDNTEEQLQEYFEEAYSLKTLSTE
LGNPDSETYFIMHEEEIAGFLKVNWGSAQTERELEDAFEIQRLYVLQKFQ
GFGLGKQLFEFALELATKNSFSWAWLGVWEHNTKAQAFYNRYGFEKFSQH
HFMVGQKVDTDWLLRKKLR
>SP_0675 oxidoreductase, short chain dehydrogenase/reductase family
MPTILITGASGGLAQEMVKLLPNDQLILLGRNKEKLAQLYGNYSHAELIE
IDITDDSALEALVTDLYLRYGKIDVLINNAGYGIFEGFDQIADKDIHQMF
EVNTFALMNLSRHLAARMKESSKGHIINIVSMAGLIATGKSSLYSATKFA
AIGFSNALRLELMPYGVYVTTVNPGPIRTGFFDQADPDGTYLKSVDRFLL
EPDAVAKKIVKIIGKNKRELNLPILLNLAHKFYTLFPKLADKLAGETFNY
K
>SP_0379 conserved hypothetical protein
MKLFWTNNIYRQLLLNSCFSSFGDSIFYLAIINYVAQYNFAPLAILLISI
SEMVPLLSQLFLGILGDFQENRVKHALWIAKIKILLYAILTVFLVLSPFS
LVSVIMIVIINLISDTLSYLSAYMMNALYISVIKDDLHDAMGFRQSLMRV
VRIVANLAGAFLINVISIQTISLINTLTFVIAFLGLYVIRHTLYEVEKRI
EMSHTALSFKKYFQHLKQSLAVLLRLKDTVILLFLTTSMIAILDVSPRLI
ALRFIQQTLAQLSIGQLLALLSIIMSCGAILGNMTSSNLFKNIRFTHLLV
FCEISLLTLITSILCQAYIVIFMTSFISSTIIGILSPRLQAAVFAHIPSD
KMGTVGSALSTVDILAPSLLSLLALSIASGVSVQLALIFLYLILIALIFC
QWLVKFNTHN
>SP_1709 phosphoglycerate dehydrogenase-related protein
MALPTIAIVGRPNVGKSTLFNRIAGERISIVEDVEGVTRDRIYATGEWLN
RSFSMIDTGGIDDVDAPFMEQIKHQAEIAMEEADVIVFVVSGKEGITDAD
EYVARKLYKTHKPVILAVNKVDNPEMRNDIYDFYALGLGEPLPISSVHGI
GTGDVLDAIVENLPNEYEEENPDVIKFSLIGRPNVGKSSLINAILGEDRV
IASPVAGTTRDAIDTHFTDTDGQEFTMIDTAGMRKSGKVYENTEKYSVMR
AMRAIDRSDVVLMVINAEEGIREYDKRIAGFAHEAGKGMIIVVNKWDTLE
KDNHTMKNWEEDIREQFQYLPYAPIIFVSALTKQRLHKLPEMIKQISESQ
NTRIPSAVLNDVIMDAIAINPTPTDKGKRLKIFYATQVATKPPTFVIFVN
EEELMHFSYLRFLENQIRKAFVFEGTPIHLIARKRK
>SP_1634 hypothetical protein
MANIFDYLKDVAYDSYYDLPLNELDILTLIEITYLSFDNLVSTLPQRLLD
LAPQVPRDPTMLTSKNRLQLLDELAQHKRFKNCKLSHFINDIDPELQKQF
AAMTYRVSLDTYLIVFRGTDDSIIGWKEDFHLTYMKEIPAQKHALRYLKN
FFAHHPKQKVILAGHSKGGNLAIYAASQIEQSLQNQITAVYTFDAPGLHQ
ELTQTAGYQRIMDRSKIFIPQGSIIGMMLEIPAHQIIVQSTALGGIAQHD
TFSWQIEDKHFVQLDKTNSDSQQVDTTFKEWVATVPDEELQLYFDLFFGT
ILDAGISSINDLASLKALEYIHHLFVQAQSLTPEERETLGRLTQLLIDTR
YQAWKNR
>SP_0332 hypothetical protein
MKKETFTEKLIKRTYGISGPLDEYKRREADSIGNQVFIILFYLMIFGNLI
PLLLAYKYPQEVALIYPPLILVIALIASGYVTYQMKKTGITVIEPDMLNE
KESKQLHYPGLKAGLFFGLWMFFITPLLRILIDEGQDYFHSLLTIRNGVS
SILGSIFFGASIQFLISRRIAKTKKNQDED
>SP_0590 acetyltransferase, GNAT family
MIQARNKLSQEELSEAKKVINCCQNYDGTYRDPYLSNMLNFDPNMPAFFL
YYEKGELVGLLTVYADDQDVEVTILVHPGHRRQGIARALFTSFERETASF
PIRSVTFQTERIFLENHPDFASNWGLIEDEETETWLGKDRRPYPLANVSH
LEVLLADSSYQDQISQLKFQAFSEEHESREVVDRYVAEALKDPESRLYIL
LKAGQVIGTCTVDLSTNTNYLYGLAILEPERGKGYGSYLAKSLVNQLIEQ
NDKEFQIAVEDSNVGAKRLYEKIGFVKQTQVVYLNEKGARDSEV
>SP_0104 hydrolase, haloacid dehalogenase-like family
MTSITAIFFDLDGTLVDSSIGIHNAFTYTFKELGVPSPDAKTIRGFMGPP
LESSFATCLSKDQISEAVQIYRSYYKAKGIYEAQLFPQIIDLLEELSSSY
PLYITTTKDTSTAQDMAKNLEIHHFFDGIYGSSPEAPHKADVIHQALQTH
QLAPEQAIIIGDTKFDMLGARETGIQKLAITWGFGEQADLLNYQPDYIAH
KPLEVLAYFQ
>SP_1538 Cof family protein/peptidyl-prolyl cis-trans isomerase, cyclophilin type
MDAKLRYKAKKIKIVFFDIDDTLRNSKTGFIPTTIPTVFKQLREKGILTG
IASGRGIFGVVPEIRDLKPDFFVTLNGAYIEDKKGQVIYQHQIEKSDVEE
YISWAKQEGIEYGLVGSHDAKLSTRTDMMSEAINPIYPDLDVDPDFHEKE
DIYQMWTFEDKGDDLHLPDSLSDKLRMVRWHQHSSDIVPISGSKATGVEK
VVEHLGLKPEKVMVFGDGLNDLELFDYAGISVAMGISHDKIKEKADYITK
TLEEDGIFAALEVFGMVEKELHFPQVDIETVEGPLATIKTNHGDLRIKLF
PEHAPKTVANFVSLSKDGYYDGVIFHRIIKDFMIQGGDPTGTGMGGESIY
GESFEDEFSEELYNIRGALSMANAGPNTNGSQFFIVQNQHLPYSKKEITR
GGWPEPIAEIYANQGGTPHLDRRHTVFGQLADEASYAVLDAIAAVETGAM
DKPVEDVVIETIEIED
>SP_0793 oxidoreductase, short chain dehydrogenase/reductase family
MTKRVLITGVSSGIGLAQARLFLEKGYQVYGVDQGEKPLLEGDFRFLQRD
LTLDLEPIFDWCPQVDVLCNTAGVLDDYKPLLEQTAQDIQEIFEINYIIP
VELTRYYLTQMLENKKGIIINMCSIASSLAGGGGHAYTSSKHALAGFTKQ
LALDYAEAGIQVFGIAPGAVKTAMTAADFEPGGLADWVASETPIKRWIEP
EEIAELSLFLASGKASAMQGQILTIDGGWSLK
>SP_0287 xanthine/uracil permease family protein
MINDIILLLFVKIKRRLMMDKLFKLKENGTDVRTEVLAGLTTFFAMSYIL
FVNPQILSQTGMPAQGVFLATIIGAVAGTLMMAFYANLPYAQAPGMGLNA
FFTFTVVFGLGYSWQEALAMVFICGIISLIITLTNVRKMIIESIPNALRS
AISAGIGVFLAYVGIKNAGLLKFTIDPGNYTVVGEGADKAQATIAANSSA
VPGLVSFNNPAVLVALAGLAITIFFVIKGIKGGIILSILTTTVLAIAVGL
VDLSSIDFANNHVGAAFEDLKTIFGAALGSEGLGALVSDTARLPETLMAI
LAFSLTDIFDTIGTLIGTGEKVGIVATNGENHQSAKLDKALYSDLIGTTV
GAIAGTSNVTTYVESAAGIGAGGRTGLTALVVAICFAISSFFSPLLAIVP
TAATAPILIIVGIMMLGSLKNIHWDDMSEAVPAFFTSIFMGFSYSITQGI
AVGFLTYTLTKLVKGQVKDVHVMIWILDALFILNYISMAL
>SP_2224 peptidase, M16 family
MTKVVFEEKYYPAVKEMVYRTRLANGLTVALLPKKEFKEVYGSVTVQFGS
VDTFVTEVDGDVKQYPGGIAHFLEHKLFEREDSSDLMSAFTSLGADSNAF
TSFTKTNYLFSATDYFLENLDLLDELVTSAHFTEASILTEQDIIQQEREM
YQDDPDSCLFFSTLANLYPGTPLATDIVGSEESISQINLTNLQENFTKFY
KPVNMSLFLVGNFDVERVQDYFESKELKDSDFQEVAREKLFLQPVKPTDS
MRMEVSSPKLAIGVRGKREVSEADCYRHHILLKLLFAMMFGWTSDRFQKC
YESGKIDASLSLEVEITSRFHFVMLTMDTKEPVALSHQFRKAIRNFTKDL
DITEEHLDIIKREMFGEFFSSMNSLEFIATQYDAFENGEIIFDLPKILQE
ITLEDVLDAGHHLIDDGDIVDFTIFPS
>SP_0430 hypothetical protein
MEVRLPSRVIFQKSTMRNTKKLRQFGIFLLIILLSTYLPQTIGLYVTIIL
GLGADVYSLILTMGLVGSFLLLIWRLKKKKMLFIFEKKSWNWSFVFYLFA
TYVVYQILGNFWARYAHLINHRNIHDEYFTVLFSNGQPTFLSTILSFVLP
VIIGPVFEETLDRGYFMNTFFP
>SP_2178 conserved hypothetical protein, interruption
MTLTILYFPNAQMFLLFSALVGMLSQLKEVPESVFLQETVEENHLVNVYS
VLEVISTLAFSVFVLLMSYITESFGISISFWLSAICLMIEAILIYIRRDY
FK
>SP_0128 putative ribosomal-protein-alanine acetyltransferase
MIEIKRIQQQPDLAQAIYAVMAAVYLVSPWTLEQIQADLSQDQTWYALAY
DGAEVIGFLAVQENLFEAEVLQIAVKGAYQGQGIASALFAQLPTDKEIFL
EVRQSNQRAQAFYKKEKMTVIAERKAYYHDPVEDAIIMKREIDEG
>SP_0847 putative sugar ABC transporter, permease protein
MSKKLQQISVPLISVFLGILLGAIVMWIFGYDAIWGYEELFYTAFGSLRG
IGEIFRAMGPLVLIGLGFAVASRAGFFNVGLPGQALAGWILSGWFALSHP
DMPRPLMILATIVIALIAGGIVGAIPGILRAYLGTSEVIVTIMMNYIVLY
VGNAFIHAFPKDFMQSTDSTIRVGANATYQTPWLAELTGNSRMNIGIFFA
IIAVAVIWFMLKKTTLGFEIRAVGLNPHASEYAGISAKRTIILSMIISGA
LAGLGGAVEGLGTFQNVYVQGSSLAIGFNGMAVSLLAANSPIGILFAAFL
FGVLQVGAPGMNAAQVPSELVSIVTASIIFFVSVHYLIERFVKPKKQVKG
GK
>SP_1646 metallo-beta-lactamase superfamily protein
MKIHKTVNPVAYENTYYLEGEKHLIVVDPGSHWEAIRQTIEKINKPICAI
LLTHAHYDHIMSLDLVRETFGNPPVYIAESEASWLYTPVDNLSGLPRHDD
MADVVTKPAEHTFVFHEEYQLEEFRFKVLPTPGHSIGGVSLVFPDAHLVL
TGDALFRETIGRTDLPTGSMEQLLHSIQTQLFTLPNYDVYPGHGPATTIA
HEKAFNPFF
>SP_0805 hydrolase, haloacid dehalogenase-like family
MKGMKYHDYIWDLGGTLLDNYETSTAAFVETLALYGITQDHDSVYQALKV
STPFAIETFAPNLENFLEKYKENEARELEHPILFEGVSDLLEDISNQGGR
HFLVSHRNDQVLEILEKTSIAAYFTEVVTSSSGFKRKPNPESMLYLREKY
QISSGLVIGDRPIDIEAGQAAGLDTHLFTSIVNLRQVLDI
>SP_0741 conserved hypothetical protein
MKHFDTIVIGGGPAGMMATISSSFYGQKTLLIEKNRKLGKKLAGTGGGRC
NVTNNGSLDNLLAGIPGNGRFLYSVFSQFDNHDIINFFTENGVKLKVEDH
GRVFPASDKSRTIIEALEKKITELGGQVATQIEIVSVKKVDDQFVLKSAD
QTFTCEKLIVTTGGKSYPSTGSTGFGHEIARHFKHTITDLEAAESPLLTD
FPHKALQGISLDDVTLSYGKHVITHDLLFTHFGLSGPAALRMSSFVKGGE
VLSLDVLPQLSEKDLVTFLEENREKSLKNALKTLLPERLAEFFVQGYPEK
VKQLTEKEREQLVQSIKELKIPVTGKMSLAKSFVTKGGVSLKEINPKTLE
SKLVPGLHFAGEVMDINAHTGGFNITSALCTGWVAGSLHYD
>SP_0121 metallo-beta-lactamase superfamily protein
MAYTLKPEEVGVFAIGGLGEIGKNTYGIEYQDEIIIVDAGIKFPEDDLLG
IDYVIPDYSYIVDNIDRVKAVLITHGHEDHIGGIPFLLKQANVPIYAGPL
ALALIRGKLEEHGLLRNAKLYEINHNTELTFKNLKATFFRTTHSIPEPLG
IVIHTPQGKIVCTGDFKFDFTPVGEPADLHRMAALGEEGVLCLLSDSTNA
EVPTFTNSEKVVGQSIMKIIQGIEGRIIFASFASNIFRLQQATEAAVKTG
RKIAVFGRSMEKAIVNGIDLGYIKAPKGTFIEPNEIKDYPAGEVLILCTG
SQGEPMAALSRIANGTHRQVQLQPGDTVIFSSSPIPGNTTSVNKLINIIS
EAGVEVIHGKVNNIHTSGHGGQQEQKLMLCLIKPKYFMPVHGEYRMQKVH
AGLAVDTGVEKDNIFIMSNGDVLALTADSARIAGHFNAQDIYVDGNRIGE
IGAAVLKDRRDLSEDGVVLAVATVDFKSQMILSGPDILSRGFVYMRESGD
LIRQSQRILFNAIRIALKNKDASVQSVNGAIVNAIRPFLYENTEREPIII
PMILTPDEE
>SP_0286 Cof family protein
MTKKIIAVDLDGTLLNSDSQISDFTKRTIKKVAEKGHQVIITTGRPYRMS
KDFYRELGLDTPMINFNGSLTHLPDQVWDFEKCLTVDKKYLLDMVQRSED
IQADFIAGEYRKKFYITNPNEEIANPKLFGVEAFQPEDQFQPELVTKDPN
CILLQTRASDKYSLAKEMNAFYQHQLSINTWGGPLNILECTPKGVNKAFA
LDYLLKIMNRDKKDLIAFGDEHNDTEMLAFAGKGYAMKNANPELLPYADE
QISLTNDQDGVAKTLQDLFL
>SP_0095 conserved hypothetical protein
MAKDIRVLLYYLYTPIENAEQFAADHLAFCKSIGLKGRILVADEGINGTV
SGDYETTQKYMDYVHSLPGMEELWFKIDEENEQAFKKMFVRYKKEIVHLG
LEDNDFDNDINPLETTGAYLSPKEFKEALLDKDTVVLDTRNDYEYDLGHF
RGAIRPDIRNFRELPQWVRDNKEKFMDKRVVVYCTGGVRCEKFSGWMVRE
GYKDVGQLHGGIATYGKDPEVQGELWDGKMYVFDERIAVDVNHVNPTIVG
KDWFDGTPCERYVNCGNPFCNRRILTSEENEDKYLRGCSHECRVHPRNRY
VSKNELTQAEVIERLAAIGESLDQAATV
>SP_1089 glutamine amidotransferase, class I
MKKPVIGITGNEKTHPDDDIMMSYAAKGFVEGVKDAGGIPIILPIGDQEM
ACHYISLIDKLILTGGQNVDPKFYGEPKTIDSDDYHLQRDIFELALIKEA
IKQKKPIFSVCRGTQLFNVAMGGTLYQDIEDHWQDSSVEYTTQRLVTEPD
TVLQEIYGEISHINSFHHQSIKDLAPNLKVVAHDPKDGIIEAVMSTDDVA
FLGVQWHPELLFENRPKDKKLFDYVVNEL
>SP_1592 conserved domain protein
MIKLYSPYAFLSKYLGAILLSFFCYYAKEIGCNNLTLSVWNDNEGALRFY
QRQGMKPQETTMEMIID
>SP_0804 putative 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis protein
MVKVAVILAQGFEEIEALTVVDVLRRANITCDMVGFEEQVTGSHAIQVRA
DHVFDGDLSDYDMIVLPGGMPGSAHLRDNQTLIQELQSFEQEGKKLAAIC
AAPIALNQAEILKNKRYTCYDGVQEQILDGHYVKETVVVDGQLTTSRGPS
TALAFAYELVEQLGGDAESLRTGMLYRDVFGKNQ
>SP_1504 TPR domain protein
MLQALEEQDLTKAEHYFAKALENDSSDLLYELATYLEGIGFYPQAKEIYL
KIVEDFPEVHLNLAAIASEDGQIEEAFTYLEEIQADSDWYVSSLALKADL
YQLEGLTDVAREKLLEALTYSEDSLLILGLAELDSELENYQAAIQAYAQL
DNRSIYEQTGISTYQRIGFAYAQLGKFETATEFLEKALELEYDDLTAFEL
ASLYFDQEEYQKATLYFKQLDTISPDFEGYEYGYSQALHKEHQVQEALRI
AKQGLEKNPFETRLLLAASQFSYELHDASGAENYLLTAKEDAEDTEEILL
RLATIYLEQERYEDILELQSEEPENLLTKWMIARSYQEMDDLDTAYEYYQ
ELTGDLKDNPEFLEHYIYLLRELGHFEEAKVHAHTYLKLVPDDVQMQELF
ERL
>SP_1471 putative oxidoreductase
MLKLIAIVGTNSKRSTNRQLLQYMQKHFTDKAEIELVEIKAIPVFNKPAD
KQVPAEILEIAAKIEEADGVIIGTPEYDHSIPAVLMSALAWLSYGIYPLL
NKPIMITGASYGTLGSSRAQLQLRQILNAPEIKANVLPDEFLLSHSLQAF
NPSGDLVDLDVIKKLDAIFDDFRIFVKITEKLRNAQELLRKDAEDFDWEN
L
>SP_1519 acetyltransferase, GNAT family
MKIRQARLEDLDRIVELEFENFSVEEAIPPSVFEAHLREIQTSFLVAEKE
GRIMGYIEGPVGLHRHLQDQSFTEEIKDYSHEPGGYISVTCLSIAKEAQG
LGLGQKLLTALKEVALEDERDGINLTCHDYLIAYYEKHGFVNEGQSQSTF
AGETWYDMVWEMKK
>SP_2096 peptidase, M20/M25/M40 family
MLDLIQTRRDLHQIPEIGLEEFKTQAYLLDVIEKLTTGKDFVQIRTWRTG
ILVYLQGSQPERTIGWRTDIDGLPIVEQTGLPFASQHQGRMHACVHDFHM
TIALGCLERALEEQPKNNLLFLFQPAEENEAGGMLMYEDGAFGDWLPDQF
YGLHVRPDLKVGQIATNTHTLFAGTCEVKIRFKGKGGHAAFPHEANDALV
AASYFVTQVQSVVSRNVNPIEGAVVTFGVFQAGTTNNVITDTAFLHGTIR
ALTQDMSLLVQKRVKTVAEGVAAAFDMEVEVELKQGGYLPVENNPALARE
LMDFFDEKDGIELIDIEPAMTGEDFGYLLSKVDGVMFWLGIDSPYALHHP
QMSPKEEVLAIGVAAVSSFLKKKAAE
>SP_1090 conserved hypothetical protein
MKDKQFAIPKATAKRLSLYYRIFKRFHAEKIERANSKQIAEAIGIDSATV
RRDFSYFGELGRRGFGYDVKKLMTFFADLLNDNSITNVMLVGIGNMGHAL
LHYRFHERNKMKIIMAFDLDDHPEVGTQTPDGIPIYGISQIKDKIKDADV
KTAILTVPSVKSQEVANLLVDAGVKGILSFSPVHLHLPKDVVVQYVDLTS
ELQTLLYFMRKED
>SP_1750 conserved hypothetical protein
MAIENYIPDFAVEAVYDLTVPSLQAQGIKAVLVDLDNTLIAWNNPDGTPE
MKQWLHDLRDAGIGIIVVSNNTKKRVQRAVEKFGIDYVYWALKPFTFGID
RAMKEFHYDKKEVVMVGDQLMTDIRAAHRAGIRSILVKPLVQHDSIKTQI
NRTRERRVMRKITEKYGPITYKKGI
>SP_0320 oxidoreductase, short chain dehydrogenase/reductase family
MTNTSFSIEQFSLKGKIALITGASYGIGFAIAKSYAEAGATIVFNDINQD
LVNKGIEAYREVGIQAHGYVCDVTDEDGIQAMVKQIEQEVGVIDILVNNA
GIIRRVPMCEMSAADFRKVIDIDLNAPFIVSKAVIPSMIKKGHGKIINIC
SMMSELGRETVSAYAAAKGGLKMLTRNIASEYGGANIQCNGIGPGYIATP
QTAPLRELQEDGSRHPFDQFIIAKTPAARWGNTEDLMGPAVFLASDASNF
VNGHILYVDGGILAYIGKQPE
>SP_1742 conserved hypothetical protein
MTITGIIAEFNPFHNGHKYLLDQAEGLKIVAMSGNFMQRGEPAIVDKWTR
TQMALENGADLVVELPFLVSVQAADFFGQGAMDILDRLGIDSLVFGTEEV
RDYQKIADLYTEKGAEMEKFVENLPDSLSYPQKTQAMWKEFAGLDFSGNT
PNHVLALAYAKAVAGRNIKLHPIQRQGAGYHSVNKDVDFASATALRQHQK
DQDFLERFMPSVALFEQASKVIWEDYFPLLRYQILSNPDLTTIYQVNQEM
AVRIKEAIKTAQSVEELVELVTTKRYTKARVRRLLTYILMQARESDLPEA
IHVLGFTEKGRQHLKSLKGQVSLVSRIGKEPWDAMTQKADQIYQLGKPSI
AEQNFGRVPIRIETN
>SP_2098 membrane protein
MNQYQKKIVNGKIYSLLSGLIWGICGILGEYFFTHYQVSSGWITSMRLTL
AGSLVLIWSAIQLKSQVLDIWRDKKNYLPFLAYAILGIFSVQYFFYLCVE
YSNATTATILQFISPVFILFYNRLVYQKRASKSAVFYVLVAMLGVCLMAT
KGDLSQLSMTPLALITGLLSAMGVMFNVILPQPFAKRYGFVPTVGWGMIL
AGLFSNVLSPVYQLSFTLDIWSILICLIIAFFGTAFAFFISMKAVSLVSP
LVVAVISASEPLSSALLSVLFLGLVVDWSLLLAIALIILPMIFLSIEEAK
ESR
>SP_1270 alcohol dehydrogenase, zinc-containing
MINQIYQLTKPKFINVKYQEEAIDQENHILIRPNYMAVCHADQRYYQGKR
DPKILNKKLPMAMIHESCGTVISDPTGTYEVGQKVVMIPNQSPMQSDEEF
YENYMTGTHFLSSGFDGFMREFVSLPKDRVVAYDAIEDTVAAITEFVSVG
MHAMNRLLTLAHSKRERIAVIGDGSLAFVVANIINYTLPEAEIVVIGRHW
EKLELFSFAKECYITDNIPEDLAFDHAFECCGGDGTGPAINDLIRYIRPQ
GTILMMGVSEYKVNLNTRDALEKGLILVGSSRSGRIDFENAIQMMEVKKF
ANRLKNILYLEEPVREIKDIHRVFATDLNTAFKTVFKWEV
>SP_0153 putative membrane protein
MKKSILTTLLFAVLYFLCMGIGVLLGNLFDQTGNMFYAPAFTALVGGSVY
MILVAKVPRFGAITTIGLVIALFFLGTKHGAGSFLPGIICGLLADGVAHL
GKYKDKTKNFLSFIIFAFSTTGPILLMWIAPKAYMATLLARGKSQEYIDR
IMVAPNPGTVLLFIASIVIGALVGALIGQALSKKFAQKI
>SP_1074 conserved hypothetical protein
MAYTEEQIENIKTRILTALEEVIDPELGIDIVNLGLIYEIRFDGDTGQTE
IDMTLTTMGCPLADLLTDQIYDAMIEVPEVTDTEVKLVWYPAWTVEKMSR
YARIALGIK
>SP_1963 CBS domain protein
MEDPSSQNLLLQFVLLFILTVLNAFFSATEMAMVSLNRARVEQKAEEGDR
RYIRLLKVLENPNHFLSTIQVGITLITILSGASLTDTLGRVIASWLGNGE
TAQAVATFLSLAFLTYISIVFGELYPKRIALNLKDALAIRTAPIIIGIGK
LVSPFVWLLAASTNFLSHLTPMSFDDADEKMTRDEIAYMLTNSEETLDAD
EIEMLQGVFSLDELMAREVMVPRTDAFMVDIQDDSQAIIQSILKQNYSRI
PVYDGDKDNVIGIIHTKSLLKAGFVDGFDNIVWKRILQDPLFVPETIFVD
DLLKELRNTQRQMAILLDEYGGMAGLVTLEDLLEEIVGEIDDETDKAAID
VHQIGEDTYIVQGTMTLNDFNNYFDVELESDDVDTIAGYYLTGVGTIPTT
EKLSYELVSQNKQFILTNDKVKNGRVTKVKVQITEVEIEEETE
>SP_1045 conserved hypothetical protein TIGR00147
MKKAMVIINPTSGGEKALDYKEKLENKAKEYFEYVETKITEKALDATHFA
EEASREQYDAVVVFGGDGTVNEVISGIDERDYIPKLGIIPGGTGNLITKL
LEINQDIDGAIDELDFDLTNKIDIGKANDNYFGYIFSIGSLPEAIHNVEI
EDKTKFGILTYAVNTMKSVMTDQVFNIKVETENGNYVGEASHVLVLLTNY
FADKKIFEENKDGYANILILKDASIFSKLSVIPDLLKGDVVANDNIEYIK
ARNIKISSDSELESDVDGDKSDNLPVEIKVLAQRVEVFSKPKED
>SP_0938 tetrapyrrole methylase family protein
MQIQKSFKGQSPYGKLYLVATPIGNLDDMTFRAIQTLKEVDWIAAEDTRN
TGLLLKHFDISTKQISFHEHNAKEKIPDLIGFLKAGQSIAQVSDAGLPSI
SDPGHDLVKAAIEEEIAVVTVPGASAGISALIASGLAPQPHIFYGFLPRK
SGQQKQFFGLKKDYPETQIFYESPHRVADTLENMLEVYGDRSVVLVRELT
KIYEEYQRGTISELLESIAETPLKGECLLIVEGASQGVEEKDEEDLFVEI
QTRIQQGVKKNQAIKEVAKIYQWNKSQLYAAYHDWEEKQ
>SP_2055 alcohol dehydrogenase, zinc-containing
MKAYTYVKPGLASFVDVDKPVIRKPTDAIVRIVKTTICGTDLHIIKGDVP
TCQSGTILGHEGIGIVEEVGEGVSNFKKGDKVLISCVCACGKCYYCKKGI
YAHCEDEGGWIFGHLIDGMQAEYLRVPHADNTLYHTPEDLSDEALVMLSD
ILPTGYEIGVLKGKVEPGCSVAIIGSGPVGLAALLTAQFYSPAKLIMVDL
DDNRLETALSFGATHKVNSSDPEKAIKEIYDLTDGRGVDVAIEAVGIPAT
FDFCQKIIGVDGTVANCGVHGKPVEFDLDKLWIRNINVTTGLVSTNTTPQ
LLKALESHKIEPEKLVTHYFKLSEIEKAYEVFSKAADHHAIKVIIENDIS
EA
>SP_0950 acetyltransferase, GNAT family
MIDQLSKYYSCRILTEKDIPSILSLYESNPLYFQHCPPEPNFATVKEDML
CLPEGKAKADKFFVGFWNGSDLVAVMDFVYAYPDEETVFIGLFMVDQAYQ
RKGIGSHIVTEALAYFAKNFRKARLAYVKGNPQSQHFWEKQGFKSIGCEV
KQELYTVVIAEQSLED
>SP_0550 conserved hypothetical protein
MRVRNRKGATELLEANPQYVVLNPLEAKAKWRDLFGNDNPIHVEVGSGKG
AFVSGMAKQNPDINYIGIDIQKSVLSYALDKVLEVGVPNIKLLWVDGSDL
TDYFEDGEIDRLYLNFSDPWPKKRHEKRRLTYKTFLDTFKRILPENGEIH
FKTDNRGLFEYSLVSFSQYGMKLNGVWLDLHASDFEGNVMTEYEQKFSNK
GQVIYRVEAEF
>SP_0521 HIT family protein
MSDCIFCKIIAGEIPASKVYEDEQVLAFLDISQVTLGHTLVVPKEHYRNL
LEMDATSASQLFAQVPKVAQKVMKVTKAAGMNIISNCEEVAGQTVFHTHV
HLVPRYSADDDLKIDFIAHEPDFDKLAQVAETIKNA
>SP_0980 O-methyltransferase
MVESYSKNANHNMRRPVVKEEIVDLMRQRQKQVTGFLKELEDFARKENIP
IIPHETVAYFRFLMETMQPKNILEIGTAIGFSALLMAEHAPNAKITTIDR
NPEMIGFAKENFAQFDSRKQITLLEGDAVDVLSTLTESYDFVFMDSAKSK
YIVFLPEILKHLEVGGVVVLDDIFQGGDVAKDIMEVRRGQRTIYRGLQKL
FDATLDNPELTATLVPLGDGILMLRKNVADVQLSESE
>SP_0613 metallo-beta-lactamase superfamily protein
MSNISLTTLGGVRENGKNMYIAEIGESIFVLNVGLKYPENEQLGVDVVIP
NMDYLFENSDRIAGVFLTHGHADAIGALPYLLAEAKVPVFGSELTIELAK
LFVKGNDAVKKFNDFHVIDENTEIDFGGTVVSFFPTTYSVPESLGIVLKT
SEGSIVYTGDFKFDQTASESYATDFARLAEIGRDGVLALLSDSANADSNI
QVASESEVRDEITQTIADWEGRIIVAAVSSNLSRIQQIFDAADKTGRRIV
LTGFDIENIVRTAIRLKKLSLANEILLIKPKDMSRFEDHELIILETGRMG
EPINGLRKMSIGRHRYVEIKDGDLVYIATAPSIAKEAFVARVENMIYQAG
GVVKLITQSLHVSGHGNVRDLQLMINLLQPKYLFPVQGEYRELDAHAKAA
MAVGMLPERIFIPKKGTTMAYENGDFVPAGSVSAGDILIDGNAIGDVGNV
VLRDRKVLSEDGIFIVAITVNRREKKIVARARVHTRGFVYLKKSRDILRE
SSELINQTVEEYLQGDDFDWADLKGKVRDNLTKYLFDQTKRRPAILPVVM
EAK
>SP_1272 putative polysaccharide biosynthesis protein
MKSIKLNALSYMGIRVLNIIFPILTGTYVARVLDRTDYGYFNSVDTILSF
FLPFATYGVYNYGLRAISNVKDNKKDLNRTFSSLFYLCIACTILTTAVYI
LAYPLFFTDNPIVKKVYLVMGIQLIAQIFSIEWVNEALENYSFLFYKTAF
IRILMLVSIFLFVKNEHDIVVYTLVMSLSTLINYLISYFWIKRDIKLVKI
HLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSYYTIAQR
IVTVIAGVVTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRIFNFFIIPL
SFGLMVLGPNAILLYGSEKYIGGGILTSLFAFRTIILALDTILGSQILFT
NGYEKRITVYTVFAGLLNLGLNSLLFFNHIVAPEYYLLTTMLSETSLLVF
YIIFIHRKQLIHLGHIFSYTVRYSLFSLSFVAIYFLINFVYPVDMVINLP
FLINTGLIVLLSAISYISLLVFTKDSIFYEFLNHVLALKNKFKKS
>SP_0431 conserved domain protein
MDVILSAIIFGISHLILSHRDPISLLYYSLIGFFFALVYRSTDNLRLTIL
CHSFFNFLNHAKPIWIFVYNYIYYHFFR
>SP_0783 conserved hypothetical protein
MKKAHVYAIPAIGAALIAALAQISLPIGPVPFTLQNFAIGLIATVFRPRE
AVLSAGLYLLLGAIGLPVFAGGGAGFQALVGPTAGYLWFYLVYSGLTSSL
TNSKSGVVKIFLANLLGDALVFVGGILSLHFLAGMAFEKALAVGVLPFII
PDLGKLLAISFISRPLLQRLKNQAYFTN
>SP_1280 conserved hypothetical protein
MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYED
RMYFLDYQLSYTIVLASSRSMEPVELVESYPVTEVFMEGATNQLDQEVLD
DDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQIM
TEEEYQAQKAVKKEENSPFAGLQGLFDGDE
>SP_2125 conserved hypothetical protein
MRDDIKINDRALALQDQIIEKLEKVFDTDVELDVYNLGLIYEINLDETGL
CKIVMTFTDTACDCAESLPIEIVAGLKQIEGIKDIKVEVTWSPAWKITRI
SRYGRIALGLPPR
>SP_0776 KH domain protein
MDTIENLIIAIVKPLISQPDALTIKIEDTPEFLEYHLNLDQSDVGRVIGR
KGRTISAIRTIVYSVPTEYKKVRIVIDEK
>SP_1235 MutT/nudix family protein
MSRSQLTILTNICLIEDLETQRVVMQYRAPENNRWSGYAFPGGHVENDEA
FAESVIREIYEETGLTIQNPQLVGIKNWPLDTGGRYIVICYKATEFSGTL
QSSEEGEVSWVQKDQIPNLNLAYDMLPLMEMMEAPDKSEFFYPRRTEDDW
EKKIF
>SP_1447 membrane protein
MSNSLKGTLLTVVAGIAWGLSGTSGQYLMAHGISALVLTNLRLLIAGGIL
MLLAYATAKDKILVFLKDRKSLLSLLIFALIGLFLNQFAYLSAIQETNAG
TATVLQYVCPVGILIYSCIKDRVAPTLGEIVSIIFAIGGTFLIATHGQLD
QLSMTPAGLFWGLFSALTYALYIILPIALIKKWGSSLVIGVGMVIAGLVA
LPFTGVLQADIPTSLDFLLAFAGIILIGTVFAYTAFLKGASLIGPVKSSL
LASIEPISAIFFAFLIMNEQFYPIDFLGMAMILFAVTLISLKDLFLEK
>SP_0145 conserved hypothetical protein
MKVFLQNRDFRQLTINQWISTLGDTIFYLAFLNYVADASFAPLAILLITI
SETLPQVLQIFLGVLADFQHHRVLKYTVISFAKFLLYSIVSLSLSGQSFS
LLLVAFICLINLLSDTLSYFSGAMLTPIFIRIIGQDHLAEAIGFKQSTVS
LVKTISNILGGVLLGILSIQFISLLNALTFLIAFLGILFIKTDLLKVEKT
ISYQEGLSVKSFCQHLLQSSKLIWNMNKVLLVLFIISTSQAVINVTVPIS
TLFLRNQPFLNLQTGQSLALLATLELSALIVGSLVSGYLQHTISIKTALY
ASLVIQLLLLVGFATVRFDWILIFSTLDAFFAGVLSPRLQELVFKQIPEE
SMGAVQSSIGAITVVLPSLFTIALVTIATSFGTLAVSFVLLLFLLVAFVM
LLNIRESI
>SP_0638 conserved hypothetical protein
MKKYQRMHLIFIRQYIKQIMEYKVDFVVGVLGVFLTQGLNLLFLNVIFQH
IPFLEGWTFQEIAFIYGFSLIPKGMDHLFFDNLWALGQRLVRKGEFDKYL
TRPINPLFHILVETFQIDALGELLVGGILLGTTVTSIVWTLPKFLLFLVC
IPFATLIYTSLKIATASIAFWTKQSGAMIYIFYMFNDFAKYPISIYNSLL
RWLISFIVPFAFTAYYPASYFLQEKDVFFNVGGLMLISLVFFVISLKLWD
KGLDSYESAGS
>SP_1941 competence/damage-inducible protein CinA
MKAEIIAVGTEILTGQIVNTNAQFLSEKLAEIGVDVYFQTAVGDNEVRLL
SLLEIASQRSSLVILTGGLGPTEDDLTKQTLAKFLGKALVFDPQAQEKLD
IFFTLRPDYARTPNNERQAQIVEGAIPLPNETGLAVGGKLEVDGVTYVVL
PGPPSELKPMVLNQLLPKLMTGSKLYSRVLRFFGIGESQLVTILADLIDN
QIDPTLAPYAKTGEVTLRLSTKASSQEEANQALDILENQILDCQTFEGIS
LRDFCYGYGEETSLASIVVEELKRQGKTIAAAESLTAGLFQATVANFSGV
SSIFKGGFVTYSLEEKSRMLDIPAKNLEEHGVVSEFTAQKMAEQARSKTQ
SDFGISLTGVAGPDSLEGHPVGTVFIGLAQDQGTEVIKVNIGGRSRADVR
HIAVMHAFNLVRKALLSD
>SP_1346 putative membrane protein
MKRIIPVYIFQQVNVLLVSLYLLKFLCIGELTILQILYGSSLISFLWMYG
QRKQAHKVNMKSRMKWLGVEFVSLLIISLCFSLIHAQGSTNQANLIGLQH
QIPWFSFLLFLINASMVEEFLYREILWNLVRKLDIRVALTSVLFALAHHT
GTILAWCLYVSLGMFLGMVRYKSDLWGSMGLHLVWNLLVYSLLLF
>SP_1451 Cof family protein
MMIKVIATDMDGTLLDARGQLDLPRLEKILDQLDQRGIRFVIATGNEIHR
MRQLLSPLVDRVVLVVANGARIFENNELIQAQTWDDAIVNKALTHFKGRA
CQDQFVVTGMKGDFVKEGTIFTDLESFMTPEMIEKFYQRMQFVDELTSDL
FGGVLKMSMVVGEERLSSVLEEINALFDGRVRAVSSGYGCIDILQAGIHK
AWGLEELLKRWDLKSQEIMAFGDSENDVEMLEMAGIAYAMENADEKAKAV
ATALAPANSQGGVYQVLENWLEKGE
>SP_1234 transcriptional regulator, biotin repressor family
MTKDRKQALLQLLKEAPKALNGQRLAEHFHVTRQVIVQDIAILRADGSPI
LSTNRGYIYKDANANTYHHKLFKVKHEVEEIGRELLAIVDNGGRVQNILI
DHPVYGEIETLLKLTCRRDVQHFLEQVERSDFKPISELTDGIHYHLVEAE
TQQDLHYIEEALDQLGYLVKD
>SP_0570 conserved domain protein
MSEYWFSTNVDQIDEVDGKQCLIYSYYNIKASRNVEVLKGRSGTKKGLDY
WEPYAPQKQYEMERLPKNKYIGSSSTDRWDGIEKNVVFCDCKEYVSAFDL
FFYHYNFKKISTQRSKQDFIRLRSKPVADILKNNTSSYTRYKKEMVIDNV
KVDDKVCEIISEIMDESYTDIQILTHKLYSKGDDIKASKTIWMKKSGKEY
SEAFAGTGEARIILLVNDIVNAQSNSLILIDEPEISLHPSAIYKFKEFLL
QECLNKKHQIIITTHSTQLIKDFPREAVKLLVKNGEKVDVIENIDYQDAF
FELGDVYHSRKMIYVEDRLAKYILEFVITHSGSENLKQNLVVRYIPGGAN
QIICNNILNSSYLDSDNHYFWLDGDQNTNVSESNNLMNYLENGVVISDKI
PESDNKNLDDIIKLITGCPIKFNVSGNKGQKNNIELIAKQRSFIDYWAKY
VSYLPFPTPEFFLANLCNSVDREGYDFSKDGNGKEYFRKKTQVALGIENI
TSEDIFQEQRRAVSKIQPESSMFQCIKEKLEALF
>SP_0204 acetyltransferase, GNAT family
MELRRPRLADKKAVLDMMTEFEKFQSPHDGGFWDTENFVYEDWLESNQEQ
EMGINLPEGWVSAIQLVAFSEKGQAVGFLNLRLRLSNFLLEEGGHIGYSI
RPSERGKGYAKETLRQGLQVAKEKNIKKALVTCSVNNPASRAVILANGGI
FEDARNGVERYWIEVANE
>SP_1478 oxidoreductase, aldo/keto reductase family
MNTYQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHIDTAAIYQ
NEESVGQAIKDSGVPREEMFVTTKLWNSQQTYEQTRQALEKSIEKLGLDY
LDLYLIHWPNPKPLRENDAWKTRNAEVWRAMEDLYQEGKIRAIGVSNFLP
HHLDALLETATIVPAVNQVRLAPGVYQDQVVAYCREKGILLEAWGPFGQG
ELFDSKQVQEIAANHGKSVAQIALAWSLAEGFLPLPKSVTTSRIQANLDC
FGIELSHEERETLKTIAVQSGAPRVDDVDF
>SP_0905 conserved hypothetical protein
MTAFQQLPSSVLQTGAIFLSIIIEALPFVLIGSIVSGLIEVYITPDKVYH
FLPRNRWGRIFFGTFVGILFPSCECGIVPIINRFLEKKVPSYTAVPFLVT
APVINPIVLFATYSAFGNSFHVALLRALGSILVAVILGIFLGFFWQEPIQ
KENRLACHEHDFSYLSSAKKVFQVFVQAIDEFFDTGRYLVFGCLFASIIQ
VYVPTRILTSISATPLFAILLLMILAFLLSLCSEADAFIGASLLSSFGLA
PVLAFLVIGPMLDIKNILMMKNYLKARFISHFITIVTLVVLVYSLLIGVI
L
>SP_2031 conserved hypothetical protein
MPNVKEITRESWILATFPEWGTWLNEEIEEEVVPEGNFAMWWLGNCGTWI
KTPAGANVVMDLWSNRGKSTKKVKDMVRGHQMANMAGVRKLQPNLRVQPM
VIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGR
IWEGWGVPKERIIVVKPGDTIELKDMKIHAVESFDRTCLVTLPVNGADET
GGELAGLAVTDEEMAQKAVNYIFETPGGTIYHGADSHFSNYFAKHGKDFK
IDVALNNYGENPVGIQDKMTSIDLLRMAENLRTKVIIPVHYDIWSNFMAS
TNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQDQHLVEYHHPRGFDDC
FEQDSNIQFKALL
>SP_1071 ABC transporter, ATP-binding protein
MTAIVELKNATKIVKNGFDEEKIILNDVSLEIFERDFITILGGNGAGKST
LFNTIAGTLSLTSGTIRILGEDLTKFSPEKRAKYLSRVFQDPKMGTAPRM
TVAENLLIAKFRGEKRGLLPRRLTSYKDEFQATIEKVGNGLEKHLNTPIE
FLSGGQRQALSLLMATLKRPELLLLDEHTAALDPKTSVALMELTDEFVKK
DQLTALMITHHMEDALKYGNRLIVMKEGRIIQDLNQEEKAKMKISDYYQL
FE
>SP_1291 Cof family protein
MSIKLIAVDIDGTLVNSQKEITPEVFSAIQDAKEAGVKVVIATGRPIAGV
AKLLDDLQLRDEGDYVVTFNGALVQETATGHEIISESLTYEDYLDMEFLS
RKLGVHMHAITKDGIYTANRNIGKYTVHESTLVSMPIFYRTPEEMAGKEI
VKCMFIDEPEILDAAIEKIPAEFYERYSINKSAPFYLELLKKNVDKGSAI
THLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGNPEIKKIAKYI
TKTNDESGVAHAIRTWVL
>SP_1546 conserved domain protein
MQVLLFCCNIFYNNERVLEILRKRRHIMSKKVLFIVGSLRQGSFNHQMAL
EAEKALAGKAEVSYLDYSALPLFSQDLEVPTHPAVAAAREAVLVADAIWI
FSPVYNFSIPGTVKNLLDWLSRALDLSDTRGVSALQDKFVTVSSVANAGH
DQLFAIYKDLLPFIRTQGVGDFTAARVNDSAWADGKLVLEETVLNSLEKQ
AQDLVEAIK
>SP_1566 conserved hypothetical protein
MTKKQLHLVIVTGMSGAGKTVAIQSFEDLGYFTIDNMPPALLPKFLQLVE
IKEDNPKLALVVDMRSRSFFSEIQAVLDELENQDGLDFKILFLDAADKEL
VARYKETRRSHPLAADGRILDGIKLERELLAPLKNMSQNVVDTTELTPRE
LRKTLAEQFSDQEQAQSFRIEVMSFGFKYGIPIDADLVFDVRFLPNPYYL
PELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILPSYQKEGKSVLT
IAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRS
>SP_0181 conserved hypothetical protein
MMSNKNKEILIFAILYTVLFMFDGVKLLASLMPSAIANYLVYVVLALYGS
FLFKDRLIQQWKEIRKTKRKFFFGVLTGWLFLILMTVVFEFVSEMLKQFV
GLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQER
LSGLLSIILVGLVFALTHMHSLALSEWIGAVGYLGGGLAFSIIYVKEKEN
IYYPLLVHMLSNSLSLIILAISIVK
>SP_0922 carbon-nitrogen hydrolase family protein
MRNVRVATIQMQCAKDVATNIQTAERLVRQAAEQGAQIILLPELFEHPYF
CQERQYDYYQYAQSVAENTAIQHFKVIAKELQVVLPISFYEKDGNVLYNS
IAVIDADGEVLGVYRKTHIPDDHYYQEKFYFTPGNTGFKVWNTRYAKIGI
GICWDQWFPETARCLALNGAELLFYPTAIGSEPILDTDSCGHWQRTMQGH
AAANIVPVIAANRYGLEEVTPSEENGGQSSSLDFYGSSFMTDETGAILER
AERQEEAVLLATYDLDKGASERLNWGLFRDRRPEMYRQITD
>SP_1601 conserved hypothetical protein
MSKHYKLVFYSRIFLFLAAFTGVYLEITKHGGFGMLLYYTVLSNLLVTIF
TLYLLKVMSRVGENWQRPSLLRLKGGVTMSIMITCVIYHFLLAPIATNFY
TLENFLCHYIVPIWFLADTLFFDKQGQYKIWDPAVWTILPFLYMMFALFN
GLVLKLNIPNAKDNPFPYFFLNVNKGWNVVFKWCLIIFVAYMVAGFIFYF
IKQIKRKSS
>SP_0144 hypothetical protein
MKKIISHRYFIIVFLLVIADQKFSVLVLRSDLVTGLSDFAYYLSDMMLNF
LVVLFALIAMIWSGKWQKINSRKFKGSYLFYSFLALLAFVVWNFVTFFLF
PPTRNEISYQHAAPTFTGATAFLMYFFYPVIAGPIFEDMIYRGLVMTALE
KGKKWGLDVLGSAVLFGVSHISNHGWVLTDFVFYMGGGLIFAVLFRMTKS
IYWPIGLHIVYNGIGQLLMLL
>SP_1590 conserved hypothetical protein
MVYTSLSSKDGNYPYQLNIAHLYGNLMNTYGDNGNILMLKYVAEKLGAHV
TVDIVSLHDDFDENHYDIAFFGGGQDFEQSIIADDLPAKKESIDNYIQND
GVVLAICGGFQLLGQYYVEASGKRIEGLGVMGHYTLNQTNNRFIGDIKIH
NEDFDETYYGFENHQGRTFLSDDQKPLGQVVYGNGNNEEKVGEGVHYKNV
FGSYFHGPILSRNANLAYRLVTTALKKKYGQDIQLPAYEDILSQEIAEEY
SDVKSKADFS
>SP_1245 Cof family protein
MADIKLIALDLDGTLLTTDKRLTDRTKETLQAARDRGIKVVLTTGRPLKA
MDFFLHELGTDGQEDEYTITFNGGLVQKNTGEILDKTVFSYDDVARLYEE
TEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTY
NKCVTAFAQEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGL
AKLISHLGIDQSQVMACGDEANDLSMIEWAGLGVAMQNAVPEVKAAANVV
TPMTNDEEAVAWAIEEYVLKEN
>SP_1298 DHH subfamily 1 protein
MEICQQILEKIKEYDTIIIHRHMKPDPDALGSQVGLKALLEHHFPEKTIK
AVGFDEPTLTWMAEMDLVEDRAYQGALVIVCDTANTARIDDKRYSQGDFL
IKIDHHPNDDVYGDLSWVDTSSSSASEMITLFAQTTQLALADRDAELLFA
GIVGDTGRFLYPSTTARTLRLAAYLREHNFDFAALTRKMDTMSYKIAKLQ
GYIYDHLEVDENGAARVILSQKILKQYNITDAETAAIVGAPGRIDRVSLW
GIFVEQADGHYRVRLRSKVHPINEIAKEHDGGGHPLASGANSYSLEENEI
IYQKLKNLLKN
>SP_1578 putative methyltransferase
MSEAGHKFLAKLGKKRLRPGGKRATDWLIAEGGFSKEKRILEVACNRGTT
AIELAQRFGCKITAVDMDAQALEVAKKSAGTAGVAHLISFERANAMKLPY
QDASFDIVINEAMLTMQADQAKKKCVMEYLRVLKPGGLLLTHDVLLKEAK
ESIRQELSQAIHVNVGPLTQDGWEQVMIESGYCDVKALTGEMTLMKLSGM
IYDEGLLGTLKICVNACKKENRKQFLTMYKMFAKNKQKLGFIAMASYKSS
KR
>SP_1610 SAM-dependent methyltransferase
MISKRLELVASFVSQGAILLDVGSDHAYLPIELVERGQIKSAIAGEVVEG
PYQSAVKNVEAHGLKEKIQVRLANGLAAFEETDQVSVITIAGMGGRLIAR
ILEEGLGKLANVERLILQPNNREDDLRIWLQDHGFQIVAESILEEAGKFY
EILVVEAGQMKLSASDVRFGPFLSKEVSPVFVQKWQKEAEKLEFALGQIP
EKNLEERQVLVDKIQAIKEVLHVSK
>SP_1972 membrane protein
MFTEKEKWMNHTIIHDRAGLNQFYAKVYAFVGLGIGLSALVSGLMLTVFQ
SQLVYLLMQGRLWLTIATFAELALVFVASSMASRNSPAALPVFLLYSVLN
GFTLSFVVAFYTPGTVLSAFVSSALLFFVMAAVGIFTKKDLSGIGRAMMA
ALIGLLIAMVVNIFLASGFFDYMISVAMVLVFSGLIAWDNQKIRLAYEQS
QGRVATGWVVSMALSIYLDFINLFLSILRIFGRND
>SP_0674 metallo-beta-lactamase superfamily protein
MDIQFLGTGAGQPSKARNVSSLALKLLDEINEVWLFDCGEGTQNRILETT
IRPRKVSKIFITHLHGDHIFGLPGFLSSRAFQANEEQTDLEIYGPQGIKS
FVLTSLRVSGSRLPYRIHFHEFDQDSLGKILETDKFTVYAEELDHTIFCV
GYRVMQKDLEGTLDAEKLKAAGVPFGPLFGKIKNGQDLVLEDGTEIKAAD
YISAPRPGKIITILGDTRKTGASVRLAVNADVLVHESTYGKGDEKIARNH
GHSTNMQAAQVAVEAGAKRLLLNHISARFLSKDISKLKKDAATIFENVHV
VKDLEEVEI
>SP_0033 conserved hypothetical protein
MSQEFINPSDGVIRQYLATSKTLAVVGLSDREETTSNRVTKEMQARGYKI
IPVNPKAAGGEILGEKAYASLAEIPFPVDIVNVYRRSEFLPDVARDFLKA
DAKIFWAQLGLESLEAKEILRDGGCDDIVMNRCIKREHTRLIEEA
>SP_1378 conserved hypothetical protein
MVVMNRIRVSKRVEKKLAKGLVLLEASDLENVNLKDQEVEVQGQEGNFLG
TAYLSQQNKGLGWFISKDKVAFNQAFFETLFRKAKEKRNAYYQDDLTTAF
RLFNQEGDGFGGLTVDLYGDYAVFSWYNSYVYQIRQTISEAFRQVFPEVL
GAYEKIRFKGLDYESAHVYGQEAPDFFNVLENGVLYQVFMNDGLMTGIFL
DQHEVRGSLVDGLAMGKSLLNMFSYTAAFSVAAAMGGASHTTSVDLAKRS
RELSQAHFQANGLSTDEHRFIVMDVFEYFKYAKRKDLTYDVIVLDPPSFA
RNKKQTFSVAKDYHKLISQSLEILNPGGIIIASTNAANVSRQKFTEQIDK
GFAGRSYQILNKYGLPADFAYNKKDESSNYLKVISMKVSK
>SP_1944 conserved hypothetical protein TIGR00150
MYTKNEEELQALGERLGHLLAKNDVLILTGELGAGKTTFTKGLAKGLQIS
QMIKSPTYTIVREYEGRLPLYHLDVYRIEGDADSIDLDEFIFGGGVTVIE
WGNLLGDALPDAYLELEILKEADGRRLNFQAKGLRAEKLLEELQYGV
>SP_1155 GTP-binding protein
MATIQWFPGHMSKARRQVQENLKFVDFVTILVDARLPLSSQNPMLTKIVG
DKPKLLILNKADLADPAMTKEWRQYFESQGIQTLAINSKEQVTVKVVTDA
AKKLMADKIARQKERGIQIETLRTMIIGIPNAGKSTLMNRLAGKKIAVVG
NKPGVTKGQQWLKTNKDLEILDTPGILWPKFEDETVALKLALTGAIKDQL
LPMDEVTIFGINYFKEHYPEKLAERFKQMKIEEEAPVIIMDMTRALGFRD
DYDRFYSLFVKEVRDGKLGNYTLDTLEDLDGND
>SP_0738 hypothetical protein
MKMIFLTDLRKHDRIVKSINQTEGYLTTQVAFSYFEKGDQSLTMSEKSQW
GSKLGFILASAGWPSGLVPFGSFPT
>SP_1061 putative protein kinase
MNYEKYLNYLDYETEIEDAYHNLLLEYKISDNFSDEHWLYNLPSNITQSK
GFKIHLSASILNANLVAKKFFDFIFSREKKINFKILVSIKELSLQNTGLN
GYSQVGKFITIYPKDNKEFQRLLHKLEILYKGVKGVNIPSDFRFQLSEVV
YYRYGEFVKDSTFKDKRDKKIPSNVNVPIRDYYIPRYNTIPDQYIILEVI
SKNAKGGVYKVFNTQKRVYSLLKEASDLSLVDFTNRDSVNRLINEREILV
ELEKEEFTPKVFNDFYIKNSYFVEFEFIIADKLSEYNMVSGSYDWFLTLI
DYMEIINSKYNLSYRDLSFNNILIDNRNNIYIIDFEHALRKETQIEEKKL
PLFGTPGFYETNLNLLNNQPEDIMGLVLLLYWSQNLDDFNVFSKMSFLEA
MEYAMNFNSRRLDTLDKSDWLYDIYKKAFNYKYDTFSELKKDFIEVLK
>SP_1879 conserved hypothetical protein
MAKQTIIVMSDSHGDSLIVEEVRDRYVGKVDAVFHNGDSELRPDSPLWEG
IRVVKGNMDFYAGYPERLVTELGSTKIIQTHGHLFDINFNFQKLDYWAQE
EEAAICLYGHLHVPSAWLEGKILFLNPGSISQPRGTIRECLYARVEIDDS
YFKVDFLTRDHEVYPGLSKEFSR
>SP_1568 GTP-binding protein
MELNTHNAEILLSAANKSHYPQDELPEIALAGRSNVGKSSFINTMLNRKN
LARTSGKPGKTQLLNFFNIDDKMRFVDVPGYGYARVSKKEREKWGCMIEE
YLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADKIP
RGKWNKHESAIKKKLNFDPSDDFILFSSVSKAGMDEAWDAILEKL
>SP_1023 acetyltransferase, GNAT family
MLRDLRETDVKAICDINQEALGYTFSPEETASQLARLSQDSHHFLLGYED
AANHVLLGYVHAEVYESLYSKAGFNILALAVSPQAQGQGIGKSLLQGLEE
EAKRCGYGFIRLNSANHRLGAHAFYEKVGYTCDKMQKRFIRIC
>SP_0754 putative acetoin utilization protein AcuB
MAVKDFMTRKVVYISPDITVSHAADLMREQGLHRLPVIENDQLVGLVTEG
TIAQASPSKATSLSIYEMNYLLNKTKVKDVMIRDVVTVSGYASLEDATYL
MLKNKISILPVVDNHQVYGVITDRDVFQAFLEIAGYGEEGIRVRFVTEDE
VGVLGKIVSLIVEENLNISHTVNIPRKDGKVIIEVQIDGSIDLPALKEKF
EANGIQVEEIARTSAKVL
>SP_1529 putative polysaccharide biosynthesis protein
MLRGTAWLTASNFISRLLGAVYIIPWYIWMGAYAAKANGLFTMGYNIYAW
FLLVSTAGIPVAVAKQVAKYNTMREEEHSFALIRSFLGFMTGLGLVFALV
LYVFAPWLADLSGVGKDLIPIMQSLAWGVLIFPSMSVIRGFFQGMNNLKP
YAMSQIAEQVIRVIWMLLATFIIMKLGSGDYLAAVTQSTFAAFVGMVASF
AVLIYFLAQEGSLKRIFETGDKINSKRLLVDTIKEAIPFILTGSAIQLFQ
ILDQLTFINSMSWFTNYSNEDLVVMFSYFSANPNKITMILISVGVSIGSV
GLPLLTENYVKGDLKAASRLVQDSLTLLFMFLLPATVGVVMVGEPLYTVF
YGKPDSLALGLFVFAVLQSIILGLYMVLSPMLQAMFRNRKAVLYFIYGSI
AKLVLQLPTIALFHSYGPLISTTIALIIPNVLMYRDICKVTGVKRKVILK
RTILISLLTLVMFLLIGTIQWLLGFFFQPSGRLWSFFYVALVGAMGGGLY
MVMSLRTYLLDKVIGKAQADRLRAKFKLS
>SP_0074 acetyltransferase, CysE/LacA/LpxA/NodL family
MTSEYQKMIAGEFYRPSDPELRALAQASRQKQAAFNKEENPLKGAEIIKT
WFASTGKNLYINTRLMVDYGVNIHLGENFYSNWNLTMLDICPIRIGDNAM
IGPNCQFLTPLHPLDPQERNSGIEYGKPITIGDNFWTGGGVIVLPGVTLG
NNVVAGAGAVITKSFGDNVVLAGNPARVIKEIPVK
>SP_1411 conserved hypothetical protein
MLEVAYILVALALIVFLVYLIITVQKLGRVIDETEKTIKTLTSDVDVTLH
HTNELLAKVNVLADDINVKVATIDPLFSAVADLSLSVSDLNDHARVLSKK
ASSAGSKTLKTGASLSALRLASKFFKK
>SP_1282 ABC transporter, ATP-binding protein
MHYEHSRKGNHMIKINHLTITQNKDLRDLVSDLTMTIQDGEKVAIIGEEG
NGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYIPQKVPEDLKKKTLHDY
FFLDSIDLDYSILYRLAEELHFDSNRFASDQEIGNLSGGEALKIQLIHEL
AKPFEILFLDEPSNDLDLETVDWLKGQIQKTRQTVIFISHDEDFLSETAD
TIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDK
TMEKHRRVKQNVETALRATKDSTAGRLLAKKMKTVLSQEKRYEKAAQSMT
QKPLEEEQIQLFFSDIQPLPASKVLVQLEKENLSIDDRVLVQKLQLTVRG
QEKIGIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSP
IAYLSKTGEKEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDL
VLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVSHDRRFLKEVC
SIIYRMTEHGLKLVNLEDL
>SP_0143 conserved domain protein
MKKMKEVKFHLATGLLILTYYLIFNVTSDLDFMVALSDNMYYVFQVLLVL
ILGTIATIAFVKSEHWKECGRFQFRWSYLGVFLLSFFLLFVWANLTTYIF
PRTQNGSTVVEVATNLTGISYFVTRILYTSIIAPVSEEVVCRGLLMTSLS
KVKRYYLDVLVSAAIFGAMHVLQYGWITTDFIKYFGMGLIFCMMFRYTRS
IYWAIALHASWNSFLLIVTLLVFGY
>SP_2122 conserved hypothetical protein
MKLLFRNPAYRILTLSRFFNAFGVSIFNLVFIVYASTLSQASFAVAMANI
VMILPTLFTVFAGIRADYTRDKVKWMVYSGLFQAVLFFLAALVVQQASLF
AFSSLCLINVISDIISDFAGGLRMPLIKEKVAEDDLMEAYSFSQFITYIS
AIGGQAFGVWLLALSVNNFSLVAGINACFFLVSATILFLGKSKLSLSMSS
ADGENLKNEKLSIKDQFLTIYRNLRLVFLKSGQKNFGFMLFAVLLINSLG
GALGGIYNIFFLSHSLLNFSYTEALFINQFCVLVSVIISSLTGNDYFGKQ
SLPRLMMWETVGLSLVGLANLFNQVVLGLLFLFFTLYVSGKVQPKISAML
MKNLAPEVLARTSNFLGLLFTLSIPVGTACFSLVAVWNIQLTWMLFVGLS
LLAIFLTILNLKNDI
>SP_1019 acetyltransferase, GNAT family
MTIELRDVTMENYFDVLNLDVKEYQKQFIATNAISLAEAYVYTKNGDFVA
PLAVYDNDAIIGFVMIAYDKKIGISSGNYLLFRFMIDKNFQNQGYFKPIM
DKVLDYVRTAPAGLSNKLWLSYEPENEQARFCYLSYGFKETGEISENEVV
AIYDLTIEK
>SP_1997 Cof family protein
MEVKAVFFDIDGTLVNNRKSVLKSTKDAIKIVKEQGVLVGVATGRGPFFV
KELMDDLDLDFAVTYNGQYIFSKDRVLFTSPISKLHLRHLISYAKKEGTE
IALGTKDAMLGSKIMSFGLGSFSQRISRFVPSVLTRTVSQSFNRMVSKVV
PQKEEDLLHLMNQPIYQVLMLMTPEESEKAAADFEDLKLTRSNPFASDVI
NQGNSKLEGIRRVGKEYGFDLNQVMAFGDSDNDLEMLAGVGMSVAMGNGS
SSVKEAAKHITTSNQQDGIHKALEHFGVLSSEKVFVSRDYHFNKVKTFHH
MMDERTQEEPRAWDLEGATHRAGFKIEELVEFVRAASPSEEDFGQAVSQL
HQALDKAADKVAKKTPAQQDLIGQVDALIDTLYFTYGSFVLMGVDPERIF
DIVHQANMGKIFPDGKAHFDPVTHKILKPDNWEEKYAPEPAIKKELQRQL
KAYERHKERNKS
>SP_1482 oxidoreductase, Gfo/Idh/MocA family
MLKLGVIGTGAISHHFIEAAHTSGEYQLVAIYSRKLETAATFASRYQNIQ
LFDQLEVFFKSSFDLVYIASPNSLHFAQAKAALSAGKHVILEKPAVSQPQ
EWFDLIQTAEKNNCFIFEAARNYHEKAFTTIKNFLADKQVLGADFNYAKY
SSKMPDLLAGQTPNVFSDRFAGGALMDLGIYPLYAAVRLFGKANDATYHA
QQLDNSIDLNGDGILFYPDYQVHIKAGKNITSNLPCEIYTTDGTLTLNTI
EHIRSAIFTDHQGNQVQLPIQQAPHTMTEEVAAFAHMIQQPDLNLYQTWL
YDAGSVHELLYTMRQTAGIRFEAEK
>SP_2207 putative competence protein ComF
MKCLLCGQTMKTVLTFSSLLLLRNDDSCLCSDCDSTFERIGEENCPNCMK
TELSTKCQDCQLWCKEGVEVSHRAIFTYNQAMKDFFSRYKFDGDFLLRKV
FASFLSEELKKYKEYQFVVIPLSPDRYANRGFNQVEGLVEAAGFEYLDLL
EKREERASSSKNRSERLGTELPFFIKSGVTIPKKILLIDDIYTTGATINR
VKKLLEEAGAKDVKTFSLVR
>SP_1743 conserved hypothetical protein
MIMATYETFAAVYDAVMDDSLYDKWTNFSLRHLPKTKERKKLLELACGTG
IQSVRFSQAGFDVTGLDLSADMLKIAEKRATSAKQKIAFIEGNMLNLSKA
GKYDFVTCYSDSICYMQDEVEVGDVFKDVYNALNEEGVFIFDVHSTYQTD
EVFPGYSYHENAEDFAMLWDTYEDEVPHSIVHELTFFIKEADGSFSRHDE
VHEERTYEILTYDILLEQAGFKSFKLYADFEDKEPTETSTRWFFVAQK
>SP_1328 sodium:solute symporter family protein
MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVT
SVSIFATMLSPISFLGLAGSSYAGSWILWFAQLGMVVAIPLTIRFILPIF
ARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSIIMYLPSAGLSV
LTGIDINILIILMGVVAIVYSYTGGLKSVLWTDFIQGVILISGVVLALFV
LIANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGFTIL
SSYASSQDLVQRFTTTQNIKKLNKMLFTNGVLSLATATVFYLIGTGLYVF
YQVQNADSAASNIPQDQIFMYFIAYQLPVGITGLILAAIYAASQSTISTG
LNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLAVGLFSIGVSIVMAHS
DIKSAYEWFNSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMV
FIKYFLPPTAVSYWAYSLISISVSVVSGYIVSVLTGNKVSAPKYTTIHDI
TEIKADSSWEVRH
>SP_0889 hypothetical protein
MTIYLTEKQIEKINALAIQRYSPNEKIQTVSPSALNMIVNLPEQFVFGKP
LYPTIFDKATILFVQLIKKHVFANANKRTAFFVLVKFLQLNGYRFSVTVE
EAVKMCVTIAVEALTDEKMTSYSKWISEHSVREKVKK
>SP_0298 conserved hypothetical protein
MNYIKRPHYLDFLRKHRDRPIIKVVSGVRRAGKSVLFQLYKEELLATGVD
EDQIIFINFEDLSYYDLRHFQTLFAYIKDQLVSKKTYYIFLDEIQYVEKF
ELVADSLFILANVDLYLTGSNAYFMSSQLATNLTGRYVEIEVLPLSFEEY
LSGQSLTENLNTTEIFNNYLFSAFPYLLQTSSYDEKIDYLRGIYNSILLN
DIVTRLGKPNPTIIERIVRTLLSSTGSLISTNKIRNTLVSQNVSISHNTL
ENYLTTLTDSLLFYSVPRFDVKGRALLQRLEKYYPVDLGLRHLLLPDQKE
DIRHILENMVYLELRRRYSQVYVGNLDKYEVDFVVVTDLGHYAYYQVSET
TLAPETLERELRPLEAIKDQFPKYLLTMDTIQPTANYNGIEKKSIIDWLL
EK
>SP_0777 hypothetical protein
MDFFMKRFEVSTEIGSLSVTYQKQKKVLVCLNGAGLLPSYENFSLILEKP
PPTIGYLTIDFPNTGRSPIHDQAGKNLDNLADAVYEVLEELGISEYILCA
HSWSGILACKLLEKPIKRQTLVAIEPTTKKVMFADFSENPYPEMEEQMRL
IDECGPELYFKNLTQATFSPETNKKIWELMQEKGLELENQDPEFQISGEI
TEEDFEDVSIEAHIPVFVFCQPYREKEYRESEYWTSNTKLILGGNHHYLQ
WSESEKIAAIIRELLE
>SP_1356 amidohydrolase family protein
MKVFQHVNIVTCDQDFHVYLDGILAVKDSQIVYVGQDKPAFLEQAEQIID
YQGAWIMPGLVNCHTHSAMTGLRGIRDDSNLHEWLNDYIWPAESEFTPDM
TTNAVKEALTEMLQSGTTTFNDMYNPNGVDIQQIYQVVKTSKMRCYFSPT
LFSSETETTAETISRTRSIIDEILKYKNPNFKVMVAPHSPYSCSRDLLEA
SLEMAKELNIPLHVHVAETKEESGIILKRYGKRPLAFLEELGYLDHPSVF
AHGVELNEREIERLASSQVAIAHNPISNLKLASGIAPIIQLQKAGVAVGI
ATDSVASNNNLDMFEEGRTAALLQKMKSGDASQFPIETALKVLTIEGAKV
LGMENQIGSLEVGKQADFLVIQPQGKIHLQPQENMLSHLVYAVKSSDVDD
VYIAGEQVVKQGQVLTVEL
>SP_0055 hypothetical protein
MIQIIVNTFIEKYKTGAVVEVLYASADQDKVQAKYEELAAQYPENYLAIY
NVPLDTDLNTLDHYPSVFIGKEEFE
>SP_1466 hemolysin
MNTSLKLSKQLSFGEEIANSVTHAVGAVIMLILLPISSIYSYEAHGFLSS
IGVSIFVISLFLMFLSSTIYHSMAYGSTHKYVLRIIDHSMIYVAIAGSYT
PVVLTLMNNWFGYLIIVIQWGTTIFGILYKIFAKKVNEKFSLALYLIMGW
LVLAIIPAIISQTTPVFWSLMVTGGLCYTVGAGFYAKKKPYFHMIWHLFI
LAASALQYIAIVYYM
>SP_1783 MutT/nudix family protein
MELEISDFTGCKIALFCGDKLLTILRDDKASIPWANMWELPGGGREGDES
PFECARREVYEELGIHLDEDCLLWSKIYPSVIFKGKKSVFMVGQLRQEQF
DNIIFGDEGQGYQLMNVEEFLSSSQVVPQLQERLKDYLKVSD
>SP_0923 Cof family protein
MIKLLALDMDGTLLNEAKEIPQAHITAIHKAIEKGVKLVLCTGRPLFGVL
PYYKKLELDLQNEYIIVNNGCSTHQTSDWSLVDWKELSPADIEYLYDLAE
KSDVQLTLFDESHYFVLGDKPNQVIENDAKLVFSDLTEISLEEATSGKFR
MFQGMFLGTKEQTDDFEQRFAEELCQRFSGVRSQPVIYEAMPLGTTKATA
LSRLAEILKIDSSEIMAMGDANNDIEMLQFAGLGIAMGNASDYIKSLADA
VTSSNEEDGVARAIEKYIL
>SP_1777 conserved hypothetical protein
MQHYETVEAVTFAYGQRHHLEIQITREIAKEQGIRHHILDMSLLGQITAQ
PDFATIHISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNL
LDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGLAEQRLFQHDLYPEK
IDNR
>SP_0443 conserved hypothetical protein
MSKITTSLFQEMVQAASTRLNKQAEYVNSLNVFPVPDGDTGTNMGMTIEN
GAKEVADKPASTVGEVASILAKGLLMGARGNSGVITSQLFRGFSQAIKDK
DELTGQDLALAFQSGVEVAYKAVMKPVEGTILTVSRGAAIGAKKKAEQTD
DAVEVMRAALEGAKTALAKTPDMLPVLKEVGVVDSGGQGLVFIYEGFLSA
LTGEYIASEDFVATPANMSEMINVEHHKSVAGHVATEDITFGYCTEIMVA
LKQGPTYAKDFDYDEFRNYLDELGDSLLVVNDDEIVKVHVHTEDPGLVMQ
EGLKYGSLVKVKVDNMRNQHEAQVEKEATQVIKSAEEKEYALIAVVAGKG
LADIFCSQGVDYVIEGGQTMNPSTEDFIKAVEQVNARNIIFLPNNKNIFM
AAQSAAEVLEQPAVVVEARTLPQGMTSLLAFDPSKSIEENQERMTAALSD
VVSGSVTTAVRDTTIDGLEIHENDNLGMVDGKILVSNPDMHQTLTETLKH
MLDEDSEIVTFYVGEDGSEELANEIAQEIVEEFEDVEVEIHQGQQPVYPY
LFSVE
>SP_1587 oxalate:formate antiporter
MKSNRYIIAFAGVILHLMLGSTYAWSVYRNPIIEKTGWDQASVAFAFSLA
IFCLGLSAAFMGRLVEKFGPKVMGSLSAFLYAGGNILTGFAIDRQELWLL
YLAYGILGGLGLGAGYITPVSTIIKWFPDKRGLATGLAIMGFGFASLLTS
PIAQHLIAGVGLVETFYILGASYFIIMLLASQFIKRPNEQELAILSSSGK
EKTASLTQGMAANQALKSNRFYMLWIIFFINIACGLGLISAASPMAQEMA
GLSTSHAAVMVGVLGIFNGFGRLLWASLSDYIGRPLTFSILLLVNLFFSL
SLWLFTDSVLFVVAMSILMTCYGAGFSLIPAYLSDIFGTKELAALHGYIL
TAWAMAGLAGPILLAETYKMAHSYTQTLFVFLILYSIALALSYYLGRSIK
KESQKALT
>SP_0882 conserved hypothetical protein
MNQSYFYLKMKEHKLKVPYTGKERRVRILLPKDYEKDTDRSYPVVYFHDG
QNVFNSKESFIGHSWKIIPAIKRNPDISRMIVVAIDNDGMGRMNEYAAWK
FQESPIPGQQFGGKGVEYAEFVMEVVKPFIDETYRTKADCQHTAMIGSSL
GGNITQFIGLEYQDQIGCLGVFSSANWLHQEAFNRYFECQKLSPDQRIFI
YVGTEEADDTDKTLMDGNIKQAYIDSSLCYYHDLIAGGVHLDNLVLKVQS
GAIHSEIPWSENLPDCLRFFAEKW
>SP_0967 conserved hypothetical protein TIGR00043
MYIEMVDETGQVSKEMLQQTQEILEFAAQKLGKEDKEMAVTFVTNERSHE
LNLEYRNTDRPTDVISLEYKPELEIAFDEEDLLENSELAEMMSEFDAYIG
ELFISIDKAHEQAEEYGHSFEREMGFLAVHGFLHINGYDHYTPEEEAEMF
GLQEEILTAYGLTRQ
>SP_0637 membrane protein
MIKLWRRYKPFINAGVQELITYRVNFILYRIGDVMGAFVAFYLWKAVFDS
SQESLIQGFSMADITLYIIMSFVTNLLTRSDSSFMIGEEVKDGSIIMRLL
RPVHFAASYLFTELGSKWLIFISVGLPFLSVIVLMKIISGQGIVEVLGLT
VIYLFSLTLAYLINFFFNICFGFSAFVFKNLWGSNLLKTSIVAFMSGSLI
PLAFFPKVVSDILSFLPFSSLIYTPVMIIVGKYDASQILQALLLQFFWLL
VMVGLSQLIWKRVQSFITIQGG
>SP_1878 CBS domain protein
MIAKEFETFLLGQEETFLTPAKNLAVLIDTHNADHATLLLSQMTYTRVPV
VTDEKQFVGTIGLRDIMAYQMEHDLSQEIMADTDIVHMTKTDVAVVSPDF
TITEVLHKLVDESFLPVVDAEGIFQGIITRKSILKAVNALLHDFSKEYEI
RCQ
>SP_1236 acetyltransferase, GNAT family
MTVIIKSMETPEEIEGKSFVHWQTWREAYDDLLPAEFQETMTLERCRLFS
QKYPENTLIAMDGVKIVGFISYGNCRDETIQAGEIIALYVLKDYYGKGIA
QKLVKAALTDLNHFSEIFLWVLKDNKRAIAFYQKMGFTFDGQEKILELGK
PIKEKRMVFYSK
>SP_1116 putative transporter
MNRYAVQLISRGAINKMGNMLYDYGNSVWLASMGTIGQTVLGMYQISELV
TSILVNPFGGVISDRFSRRKILMTADLVCGILCLAISFIRNDSWMIGALI
VANIVQAIAFAFSRTANKAIITEVVEKDEIVIYNSRLELVLQVVGVSSPV
LSFLVLQFASLHMTLLLDSLTFFIAFVLVAFLPKEEAKVQEKKAFTGRDI
FVDIKDGLHYIWHQQEIFFLLLVASSVNFFFAAFEFLLPFSNQLYGSEGA
YASILTMGAIGSIIGALLASKIKANIYNLLILLALTGVGVFMMGLPLPTF
LSFSGNLVCELFMTIFNIHFFTQVQTKVESEFLGRVLSTIFTLAILFMPI
AKGFMTVLPSVHLYSFLIIGLGVVALYFLALGYVRTHFEKLI
>SP_2230 ABC transporter, ATP-binding protein
MLTVSDVSLRFSDRKLFDDVNIKFTEGNTYGLIGANGAGKSTFLKILAGD
IEPTTGHISLGPDERLSVLRQNHFDYEDERAIDVVIMGNEKLYSIMKEKD
AIYMKEDFSDEDGVRAAELEGEFAELGGWEAESEASQLLQNLNIPEELHY
QNMSELANGEKVKVLLAKALFGKPDVLLLDEPTNGLDIQSITWLEDFLID
FDNTVIVVSHDRHFLNKVCTHMADLDFGKIKLYVGNYDFWKESSELAAKL
LADRNAKAEEKIKQLQEFVARFSANASKSRQATSRKKMLDKIELEEIVPS
SRKYPFINFKAEREIGNDLLTVENLTVKIDGETILDNISFILRPDDKTAL
IGQNDIQTTALIRAIMGDIDYEGTVKWGVTTSQSYLPKDNSADFAGGESI
LDWLRQFASKEEDDNTFLRGFLGRMLFSGDEVNKPVNVLSGGEKVRVMLS
KLMLLKSNVLVLDDPTNHLDLESISSLNDGLKNFKESIIFASHDHEFIQT
LANHIIVLSKNGVIDRIDETYDEFLENAEVQAKVKELWKD
>SP_0547 conserved domain protein
MEFFDKFHALCFGFLVLIIVITVPYTINHGGFFQNESALILVSLLVTSLS
VAYARKFEMISFGMLSKKQLLLFIAIFLLSVLETLVYIHFFAVSSGSGVQ
HLAEVSRGISLSLILTTSVFGPIQEELIFRGLLQGAVFDNSWLGLVLTSS
LFSFMHGPSNVPSFIFYLLGGLLLGFAYKKSQNLWVSTLVHMLYNSWPLL
YYL
>SP_0817 MutT/nudix family protein
MNTDYIARYGVYAVIPNPEQKQIVLVQEPNGAWFLPCGEIEAGENHQEAL
KHELIEELGFTAEIGTYYGQADEYFYSRHRDTYYYNPAYLYEATPFKEVQ
KPLENFNHIAWFPIDEAIKNLKRGSHKWAIESWKKQHKIG
>SP_0119 MutT/nudix family protein
MTQQDFRTKVDNTVFGVRATALIVQNHKLLVTKDKGKYYTIGGAIQVNES
TEDAVVREVKEELGVKAQAGQLAFVVENRFEVDGVSYHNIEFHYLVDLLE
DAPLTMQEDEKRQPCEWIDLDKLQNIQLVPVFLKTALPDWEGQLRHIHLE
E
>SP_0354 putative membrane protein
MQTKYICRVTLVTLSFIFAFCYLFWTLDNWNNGFLISNYVPSIFIWVCFL
IIFQITGFILQKVSIYDFSVWYLILSYFFMFGLIFNEYMGFQTTLLWSPS
NFYNNEELFHSYIFIIWILFCYSVGYLFFYSDGKVHYHSEVQNYQENEEK
ILYNAGRILTGVGFISRVITDSKTVLAVRAANSYSAYSEAASSGIIDDLG
VLMLPGVFSLFYSDKLSRVIKRTIFWVMLFYLILIMILTGSRKIQVFSIL
ALVLVYTQSLGITFSKKRVLVFLIVTVFLLNVLVVIRGHRFDLNTIGIYL
FDSFSSLDFVKNILGEVFSESGLTSLTVASAVTVVPSSIPYEYGMTFLRT
ILSIFPIGWLVGDFFDKASATVVINKFLGLPVGSSFVEELFWNFGYYGGV
FWSFVLGIFSGWRLNFRAFQTSKISKVIYFSVISQLLLLVRSSSIDVYRP
IMYSLIMIFIFRRLKK
>SP_1332 conserved domain protein
MKRIIPVYIFQQVNVLLVSLYLLKFLCIGELTILQVLYGASLFSFLWMYG
QRKQVVKVNMKTRMKCLGIGLASLLIISLCFSLIHAQGSTNQTNLIGLQH
QVPWFSFLLFLINASMVEEFLYREILWNLVRKLDIRVALTSVLFALVHHP
GTILAWCLYVSLGMFLGMVRYKSDLWGSMGLHLVWNFLVYSLLLF
>SP_1240 conserved hypothetical protein
MKNKRIFKDFQASKMSLNIYTSPLLAFVFVFIGEFVAFTLYGIGLLALIG
LARNFGEAGQNLASYLQTLHQSLTDKTSDFRLILGLLAFGFILNTVFRWT
RKVEKRPIRTLGFYRENFLSNLLKGFSLGLALFLLTLLGLVVLGQYRLES
IHLNPYSLAFVVFTIPFWILQGTAEEVVARAWLLPQLASRTNLKLAILIS
SLFFTLLHMGNSGLTPLSLVNLFLFGVAMALYLLKTDTVWGVAGIHGAWN
FAQGNLFGILVSGQPSGTSLMTFLPQGNQDWLSGGSFGIEGSIMTSLVLL
LLIVYLANKLKKENERM
>SP_2116 conserved domain protein
MKLLKNLGWILLALLSFLFIYGFIQGLATASLALGASPYAVTLLYVALAG
VYVYGIYKWYQKAPVHIEKSGFNRFIWLPVLVWFLSLVVQFFLPDDPSVN
QQIATDLTLSQPLFSFFAVVIFAPLTEEIVFRGMLARYLFPKQDNSKRTL
IFLLVSSLLFALIHFPGDVQQFFVYFSLGFSLGLAYISRKGLVYSISLHA
LNNLVGFLMILML
>SP_0168 putative macrolide efflux protein
MKNLIKLLIIRLIVNLADSVFYIVALWHVSNNYSSSMFLGIFIAVNYLPD
LLLIFFGPVIDRVNPQKILIISILVQLAVAVIFLLLLNQISFWVIMSLVF
ISVMASSISYVIEDVLIPQVVEYDKIVFANSLFSISYKVLDSIFNSFASF
LQVAVGFILLVKIDIGIFLLALFILLLLKFRTSNANIENFSFKYYKREVL
QGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIF
LTIAGLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTL
SLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVNTTIDSIISFGMPIG
SLVAGTLIDLNIELVLIAISIPYFLFSYIFYTDNGLKEFSIY
>SP_0925 conserved hypothetical protein
MKKRAIQILLALSLIFYKSTWFWRLFNYLAKPYLPASREFFQILLLMESG
VLFLAVIYLLVFAGKKIFHFKWQLRYFIYLLLGYIISYMSDFLFSYFISL
SSNQISLNETVEMMGRQEFPYFLLIVCFIAPIAEELIYRGVLMTTFFKNS
PWYGDVLLSAIIFGYIHINFALTPLAFFIYASGGLILALLYRMTKNLYYP
ILVHILINITAFWDVWLLLFSGS
>SP_1114 ABC transporter, ATP-binding protein
MIILQANKIERSFAGEVLFDNINLQVDERDRIALVGKNGAGKSTLLKILV
GEEEPTSGEINKKKDISLSYLAQDSRFESENTIYDEMLHVFNDLRRTERQ
LRQMELEMGEKSGEDLDKLMSDYDRLSENFRQAGGFTYEADIRAILNGFK
FDESMWQMKIAELSGGQNTRLALAKMLLEKPNLLVLDEPTNHLDIETIAW
LENYLVNYSGALIIVSHDRYFLDKVATITLDLTKHSLDRYVGNYSRFVEL
KEQKLVTEAKNYEKQQKEIAALEDFVNRNLVRASTTKRAQSRRKQLEKME
RLDKPEAGKKSANMTFQSEKTSGNVVLTVENAAVGYDGEVLSQPINLDLR
KMNAVAIVGPNGIGKSTFIKSIVDQIPFIKGEKRFGANVEVGYYDQTQSK
LTPSNTVLDELWNDFKLTPEVEIRNRLGAFLFSGDDVKKSVGMLSGGEKA
RLLLAKLSMENNNFLILDEPTNHLDIDSKEVLENALIDFDGTLLFVSHDR
YFINRVATHVLELSENGSTLYLGDYDYYVEKKATAEMSQTEEASTSNQAK
EASPVNDYQAQKESQKEVRKLMRQIESLEAEIEELESQSQAISEQMLETN
DADKLMELQAELDKISHRQEEAMLEWEELSEQV
>SP_1505 membrane protein
MEQKEKHFSLSWFFKWFLDNKAITVFLVTLLLGLNLFILSKISFLFSPVL
DFLAVVMLPVILSGLLYYLLNPIVDWMEKHKVNRVIAITIVFVIIALFII
WGLAVAIPNLQRQVLTFARNVPVYLEDIDRIVNGLVAQHLPDDFRPQLEQ
VLTNFSSQATVLASKVSSQAVNWVSAFISGASQVIVALIIVPFMLFYLLR
DGKGLRNYLTQFIPRKLKEPVGQVLSDVNQQLSNYVRGQVTVAIIVAVMF
IIFFKIIGLRYAVTLGVTAGILNLVPYLGSFLAMLPALVLGLIAGPVMLL
KVVIVFIVEQTIEGRFVSPLILGSQLNIHPINVLFVLLTSGSMFGIWGVL
LGIPVYASAKVVISAIFEWYKVVSGLYELEGEEVKSEQ
>SP_1943 acetyltransferase, GNAT family
MEYELLIREAEPKDAAELVAFLNRVSLETDFTSLDGDGILLTSEEMEIFL
NKQASSDNQITLLAFLNGKIAGIVNITADQRKRVRHIGDLFIVIGKRYWN
NGLGSLLLEEAIEWAQASGILRRLQLTVQTRNQAAVHLYQKHGFVIEGSQ
ERGAYIEEGKFIDVYLMGKLIG
>SP_0405 conserved hypothetical protein
MISFLLLLVLVWGFYIGYRRGLLLQVYYLISAMASAFMAGQFYKGLGEQF
HLLLPYANSQEGQGTFFFPSDQLFQLDKVFYAGIGYLLVFGIVYSIGRLL
GLLLHLIPSKKLGGKLFQVSAGILSMLVTLFVLQMALTILATIPMAVIQN
PLEKSIVAKHIIQSIPVTTSWLKQIWVTNLIG
>SP_1717 ABC transporter, ATP-binding protein
MLEVRSLEKSFGSKQVLFGIDFQARPGRILGLVGKNGAGKTTIFHSILKF
LEYQGEIGLDGQDIRQETYARIGYLPEERSLMPKLTVLEQVRYLATLKGM
DAKEVKEKLPQWMKRLEVKGKLTDKIKSLSKGNQQKIQLIITLIHEPDLI
ILDEPFSGLDPVNTELLKQVIFQEKERGATIIFSDHVMTNVEELCDDILM
IRDGRVVLHGPVQDVRNQYGKTRLFVSSERSKEELENLPHVKQVSLTKQG
SWKLILEDESAGRELFSILTQGQYIATFDQQAPTIDEIFKLESGVEV
>SP_2064 hydrolase, haloacid dehalogenase-like family
MQKTAFIWDLDGTLLDSYEAILSGIEETFAQFSIPYDKEKVREFIFKYSV
QDLLVRVAEDRNLDVEVLNQVRAQSLAEKNAQVVLMPGAREVLAWADESG
IQQFIYTHKGNNAFTILKDLGVESYFTEILTSQSGFVRKPSPEAATYLLD
KYQLNSDNTYYIGDRTLDVEFAQNSGIQSINFLESTYEGNHRIQALADIS
RIFETK
>SP_1855 alcohol dehydrogenase, zinc-containing
MKSAVYTKAGQVGLASIERPQIIEADDVIIRVVRACVCGSDLWRYRNPET
KAGHKNSGHEAIGIVEEAGEAITTVKPGDFVIVPFTHGCGECDACLAGFD
GSCDNHIGNNLGGDFQAEYIRFHYANWALVKIPGQPSDYTEGMLKSLLTL
ADVMPTGYHAARVANVQKGDKVVVIGDGAVGQCAVIAAKMRGASQIILMS
RHEDRQKMAMESGATAVVAERGQEGITKVREILGGGADAALECVGTEAAI
EQALGVLHNGGRMGFVGVPHYNNRALGSTFMQNISVAGGAASATTYDKQF
LLKAVLDGDINPGRVFTSSYKLEDIDQAYKDMDERKTIKSMIVIE
>SP_0256 acetyltransferase, GNAT family
MITIKKQEIVKLEDVLHLYQAVGWTNYTHQTEMLEQALSHSLVIYLALDG
DAVVGLIRLVGDGFSSVFVQDLIVLPSYQRQGIGSSLMKEALGNFKEAYQ
VQLATEETEKNVGFYRSMGFEILSTYDCTGMIWINREK
>SP_0770 ABC transporter, ATP-binding protein
MSILEVKNLSHGFGDRAIFEDVSFRLLKGEHIGLVGANGEGKSTFMSIVT
GKMLPDEGKVEWSKYVTAGYLDQHSVLAERQSVRDVLRTAFDELFKAEAR
INDLYMKMAEDGADVDALMEEVGELQDRLESRDFYTLDAKIDEVARALGV
MDFGMDTDVTSLSGGQRTKVLLAKLLLEKPDILLLDEPTNYLDAEHIDWL
KRYLQNYENAFVLISHDIPFLNDVINIVYHVENQQLTRYSGDYYQFQEVY
AMKKSQLEAAYERQQKEIADLKDFVARNKARVATRNMAMSRQKKLDKMDI
IELQSEKPKPSFDFKPARTPGRFIFQAKNLQIGYDRPLTKPLNLTFERNQ
KVAIIGANGIGKTTLLKSLLGIISPIAGEVERGDYLELGYFEQEVEGGNR
QTPLEAVWNAFPALNQAEVRAALARCGLTTKHIESQIQVLSGGEQAKVRF
CLLMNRENNVLVLDEPTNHLDVDAKDELKRALKEYRGSILMVCHEPDFYE
GWIDQIWDFNNLT
>SP_0285 alcohol dehydrogenase, zinc-containing
MKAVVVNPESTGVAIEEKVLRPLETGEALVEVEYCGVCHTDLHVAHGDFG
QVPGRVLGHEGIGIVKEIAPDVKSLKVGDRVSVAWFFEGCGTCEYCTTGR
ETLCRTVKNAGYSVDGGMAEQCIVTADYAVKVPDGLDPAQASSITCAGVT
TYKAIKEAKVEPGQWVVLYGAGGLGNLAVQYAKKVFNAHVIAVDINNDKL
ALAKEVGADIVINGLEVEDVAGLIKEKTDGGAHSAVVTAVSKVAFNQAVD
SIRAGGRVVAVGLPSEMMELSIVKTVLDGIQVIGSLVGTRKDLEEAFQFG
AEGLVVPVVQKRPVEDAVAIFDEMEKGQIQGRMVLDFTH
>SP_2040 putative jag protein
MVVFTGSTVEEAIQKGLKELDIPRMKAHIKVISREKKGFLGLFGKKPAQV
DIEAISETTVVKANQQVVKGVPKKINDLNEPVKTVSEETVDLGHVVDAIK
KIEEEGQGISDEVKAEILKHERHASTILEETGHIEILNELQIEEAMREEA
GADDLETEQDQAESQELEDLGLKVETNFDIEQVATEVMAYVQTIIDDMDV
EATLSNDYNRRSINLQIDTNEPGRIIGYHGKVLKALQLLAQNYLYNRYSR
TFYVTINVNDYVEHRAEVLQTYAQKLATRVLEEGRSHKTDPMSNSERKII
HRIISRMDGVTSYSEGDEPNRYVVVDTE
>SP_0565 conserved domain protein
MIGVVARENAAEQIKQYQKFTVNISDETSMLAMEQAGFISHQEKLERLGV
HYEISERTQTPILDACPLVLDCRVDRIVEEDGICHIFAKILERLVAPELL
DEKGHFKNQLFAPTYFMGDGYQRVYRYLDKRVDMKGSFIKKARKKDGKN
>SP_1902 conserved hypothetical protein
MKITKLEKKKRLYLMELDNGDKCYITEDTIVRFMLSRDKVISEEELKEIQ
DFAQFSYGKNLALYHLSFKARTEKEVREYLKKYDINENIVSQVIANLKEE
KWINDSQYAYAIINANQLSGDKGPYVLTQKLAQKGISKSTIEEILNDFDF
SEVAQRVANKLLKKYEGKLPARALQDKIIQNLTNKGFSYFDAKIAFDELD
SQVDEETTQELIFKELDKQYTKYARKYEGYELKQRLTQVLARKGYDFSDI
ASALREYL
>SP_0666 conserved hypothetical protein
MKLIFLHGLGQSAESWKEVRNLLTDYPSEAIELFPSGVSNYQQAKERVYQ
HLAQETEPFVLIGLSLGAALTLELSSYDLPNLRALILSGCPLKLAGNILF
YLQLLIFKLLPKRVFEKQGADKTLMVGVSEELKTLDLTDIAGTYPYPTLL
ICGSKDKPNLSSMKALHRLLTDSQFQIIPDGPHVLNKAKPKEFVEKTRSF
LELLK
>SP_0667 putative pneumococcal surface protein
MNKRLFSKMSLVTLPILALFSQSVLAEENIHFSSCKEAWANGYSDIHEGE
PGYSAKLDRDHDGVACELKNAPKGAFKAKQSTAIQINTSSATTSGWVKQD
GAWYYFDGNGNLVKNAWQGSYYLKADGKMAQSEWIYDSSYQAWYYLKSDG
SYAKNAWQGAYYLKSNGKMAQGEWVYDSSYQAWYYLKSDGSYARNAWQGN
YYLKSDGKMAKGEWVYDATYQAWYYLTSDGSYAYSTWQGNYYLKSDGKMA
VNEWVDGGRYYVGADGVWKEVQASTASSSNDSNSEYSAALGKAKSYNSLF
HMSKKRMYRQLTSDFDKFSNDAAQYAIDHLDD
>SP_2072 glutamine amidotransferase, class-I
MIGLSLAFERRGIMARTVVGVAANLCPVDAEGKIIHSSVSCRFAEIIRQV
GGLPLVIPVGDESVVRDYVEMIDKLILTGGQNVHPQFYGEKKTVESDDYN
LVRDEFELALLKEALRQNKPIMAICRGVQLVNVAFGGTLNQEIEGHWQGL
PFGTSHSIETVEGSVVAKLFGKESQVNSVHRQSIKDLAPNFRVTAIDSRD
QTIEAIESIDEHRIIGLQWHPEFLVNEEDGNLELFEYLLNEL
>SP_1536 conserved hypothetical protein
MEEEQLLKSGERINQLFSTDIKIIQNREVFSYSVDSVLLSRFPRFPKKGL
IVDFCAGNGAVGLFASTRTQAQILSVEIQERLADMAERSVRLNGLEEQMQ
VICDDLKNMPAHIQGSKVDMILCNPPYFKVNPYSNLNESEHYLLARHEIT
TNLEEICRSAQSILKSNGRLAMVHRPDRLLDILDTLKRHNLAPKRLQFVY
PKREKEANMLLIEAIKDGSTSGFKVLPPLIVHNDDGSYTPEIEEIYYGS
>SP_0846 sugar ABC transporter, ATP-binding protein
MAHENVIEMRDITKVFGGFVANDKINLHLRKGEIHALLGENGAGKSTLMN
MLAGLLEPTSGEIAVNGQVVNLDSPSKAASLGIGMVHQHFMLVEAFTVAE
NIILGSELTKNGVLDIAGASKEIKALSERYGLAVDPSAKVADISVGAQQR
VEILKTLYRGADILIFDEPTAVLTPSEIDELMAIMKNLVKEGKSIILITH
KLDEIRAVSDRVTVIRRGKSIETVEIAGATNADLAEMMVGRSVSFKTEKQ
ASKPKEVVLSIKDLVVNENRGVPAVKNLSLDVRAGEIVGIAGIDGNGQSE
LIQAITGLRKVESGSIELKGDSIVGLHPRQITELSVGHVPEDRHRDGLIL
EMMISENIALQTYYKEPHSKNGILNYSNITSYAKKLMEEFDVRAASELVP
AAALSGGNQQKAIIAREIDRDPDLLIVSQPTRGLDVGAIEYIHKRLIEER
DNGKAVLVVSFELDEILNVSDRIAVIHDGKIQGIVSPETTNKQELGVLMA
GGNLGKEKSDV
>SP_0652 conserved hypothetical protein
MKRPLEMAHDFLAEVVTKEDVVVDATMGNGHDTLFLAKLAKQVYAFDIQK
QALEKTQERLHQADLTNAQLILQGHETLDQFVIKAKAGIFNLGYLPAADK
SVITRPQTTIEALEKLCGLLVKGGRIAIMIYYGHEGGDLERDAVLDFVIQ
LNQQEYTAAIYRTLNQVNNPPFLVMIEKLERYRHG
>SP_0288 conserved hypothetical protein
MWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLS
IVVLALFIMGARKTKLASFNFSFFRAKDLARLGLSYLVIVGSNILGSILL
QLSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKI
FRGKENLGFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEM
SILLHMIVNGIAFCLLALVVIMSRTLGISV
>SP_1069 conserved hypothetical protein
MLKEIKRRNRMKNKRLIGIIAALAVLVAGSLIYSSMNKSEAQNNKDEKKI
TKIGVLQFVSHPSLDLIYKGIQDGLAEEGYKDDQVKIDFMNSEGDQSKVA
TMSKQLVANGNDLVVGIATPAAQGLASATKDLPVIMAAITDPIGANLVKD
LKKPGGNVTGVSDHNPAQQQVELIKALTPNVKTIGALYSSSEDNSKTQVE
EFKAYAEKAGLTVETFAVPSTNEIASTVTVMTSKVDAIWVPIDNTIASGF
PTVVSSNQSSKKPIYPSATAMVEVGGLASVVIDQHDLGVATGKMIVQVLK
GAKPADTPVNVFSTGKSVINKKIAQELGITIPESVLKEAGQVIE
>SP_0873 membrane protein
MFRRNKLFFWTTEILLLTIIFYLWRQMGSLINPFVSVLNTIMIPFLLGGF
FYYLTNPIVTFLNKVCKLNRLLGILITLCTLVWGMVIGVVYLLPILINQL
SSLIISSQTIYSRVQDLIIDLSNYPALQNLDVEATIQQLNLSYVDILQNI
LNSVSNSVGSVLSALISTVLILIMTPVFLVYFLLDGHKFLPMLERTILKR
DRLHIAGLLKNLNATIARYISGVSIDAIIIGCLAYIGYSIIGLKYALVFA
IFSGVANLIPYVGPSIGLIPMIIANIFTDPHRLLIAVIYMLVVQQVDGNI
LYPRIVGSVMKVHPITILVLLLLSSNIYGVVGMIVAVPTYSILKEISKFL
SHLYENHKIMKERERELAK
>SP_0760 hypothetical protein
MTSSEVEVNDLYQALPAAEPIHQDHYVRLLVEKLSEQGKNYYWTWAYNHI
GYNRYHEGVAILSKTPIEAREILVSDVDDPTDYHTRRVALAETVVDGKEL
AVASVHLSWWDKGFQEEWARFEAVLKKLNKPLLLAGDFNNPAGQEGYQAI
LASPLGLQDAFEVAQEKSGSYTVPPEIDGWKGNTEPLRIDYVFTTKELAV
ENLHVVFDGNKSPQVSDHYGLNAILNWK
>SP_1424 hypothetical protein
MKLNKQKNRMIYVLSNFLYAISVSIIYALNGIVLLVIVSKLGIPGDLGLN
FIVAIVVNTILLVLFYFLLSYIFYLYKLKSGLVFGILVALLLFISNILNT
MMMNTSNDLFIKAIELLPFLFFTCICGFKYDVY
>SP_0737 sodium-dependent transporter
MTAANGGGGFLLIFLISTILIGFPLLLAEFALGRSAGVSAIKTFGKLGKN
NKYNFIGWIGAFALFILLSFYSVIGGWILVYLGIEFGKLFQLGGTGDYAQ
LFTSIISNPAIALGAQAAFILLNIFIVSRGVQKGIERASKVMMPLLFIVF
VFIIGRSLSLPNAMEGVLYFLKPDFSKLTSTGLLYALGQSFFALSLGVTV
MLTYASYLDKKTNLVQSGISIVAMNISISIMAGLAIFQARSPFNIQSEGG
PSLLFIVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITN
QDNSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVS
NLLMPFGALYLSLFTGYIFKKALAMEELHLDERAWKQGLFQVWLFLLRFF
VSSFQSSSLWSSLPNLCNQKGLE
>SP_2094 conserved hypothetical protein
MKEIFDRRYPVTSFFLLVTALVFLLMLVTAGGNFDRADTLFRFGAMYGPA
IRLFPEQVWRLLSAIFVHIGWEHFIVNMLSLYYLGRQVEEIFGSKQFFFL
YLLSGMMGNLFVFVFSPKSLAAGASTSLYGLFAAIIVLRYATRNPYIQQL
GQSYLTLFVVNIIGSVLIPGISLAGHIGGAVGGAFLAVIFPVRGEKRMYN
TSQRLGAVVLFVGLAILLFYKGMGL
>SP_0636 ABC transporter, ATP-binding protein
MAMIEVEHLQKNFVKTVKEPGLKGALRSFIHPEKQTFEAVKDLTFEVPKG
QILGFIGANGAGKSTTIKMLTGILKPTSGFCRINGKIPQDNRQDYVKDIG
VVFGQRTQLWWDLALQETYTVLKEIYDVPDSLFHKRMDFLNEVLDLKDFI
KDPVRTLSLGQRMRADIAASLLHNPKVLFLDEPTIGLDVSVKDNIRRAIT
QINQEEETTILLTTHDLSDIEQLCDRIFMIDKGQEIFDGTVSQLKETFGK
MKTLSFELLPGQSHLVSHYDGLSDMTIDRQGNSLNIEFDSSRYQSADIIK
QTLSDFEIRDLKMVDTDIEDIIRRFYRKEL
>SP_1749 GTP-binding protein
MEEILCIGCGATIQTTDKAGLGFTPQSALEKGLETGEVYCQRCFRLRHYN
EITDVQLTNDDFLKLLHEVGDSDALVVNVIDIFDFNGSVIPGLPRFVSGN
DVLLVGNKKDILPKSVKSGKISQWLMKRAHEEGLRPVDVVLTSAQNKHAI
KEVIDKIEHYRKGRDVYVVGVTNVGKSTLINAIIQEITGDQNVITTSRFP
GTTLDKIEIPLDDGSYIYDTPGIIHRHQMAHYLTAKNLKYVSPKKEIKPK
TYQLNPEQTLFLGGLGRFDFIAGEKQGFTAFFDNELKLHRSKLEGASAFY
DKHLGTLLTPPNSKEKEDFPRLVQHVFTIKDKTDLVISGLGWIRVTGIAK
VAVWAPEGVAVVTRKAII
>SP_0845 lipoprotein
MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVD
DKSFNQSAWEGLQAWGKEHNLSKDNGFTYFQSTSEADYANNLQQAAGSYN
LIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGY
LAGVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDY
AGSFGDAAKGKTIAAAQYAAGADIVYQVAGGTGAGVFAEAKSLNESRPEN
EKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKAER
GEFPGGQVIVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEK
>SP_0768 conserved hypothetical protein
MKPSIHSLVHQTMQEWVLEQGEKKFRADQIWEWLYRKRVQSFEEMTNLSK
DLIAKLNDQFVVNPLKQRIVQESADGTVKYLFELPDGMLIETVLMRQHYG
LSVCVTTQVGCNIGCTFCASGLIKKQRDLNNGEIVAQIMLVQKYFDERGQ
DERISHIVVMGIGEPFDNYNNVLNFFRTINDDKGMAIGARHITVSTSGLA
HKIRDFADEGVQVNLAVSLHAPNNELRSSIMKINRAFPIEKLFAAIEYYI
ETTNRRVTFEYIMLNEVNDGVEQALELTELLKNIKKLSYVNLIPYNPVSE
HDQYSRSPKECVLAFYDTLKKKGVNCVVRQEHGTDIDAACGQLRSNTMKR
DRQKAVAAVNP
>SP_1464 acetyltransferase, GNAT family
MEIRLAFPNEVDAIMQVMEDAKKCLADAGSDQWQNGYPNADVIIDDIISG
QAYVALEEGELLAYAAVTKSPEEAYEAIYEGNWQAGESEYLVFHRIAVAA
DVQGKGVAQTFLEGLIEGFDYLDFRSDTHAENKVMQHIFEKLGFKQVGKM
PVDGERLAYQKLKK
>SP_1070 conserved hypothetical protein
MIVSIISQGFVWAILGLGIFMTFRILNFPDMTTEGSFPLGGAVAVTLITK
GVNPFLATLVAVGAGCLAGMAAGLLYTKGKIPTLLSGILVMTSCHSIMLL
IMGRANLGLLGTKQIQDVLPFDSDLNQLLTGLIFVSIVIALMLFFLDTKL
GQAYIATGDNPDMARSFGIHTGRMELMGLVLSNGVIALAGALIAQQEGYA
DVSRGIGVIVVGLASLIIGEVIFKSLSLAERLVTIVVGSIAYQFLVWAVI
ALGFNTSYLRLYSALILAVCLMIPTFKQTILKGAKLSK
>SP_1171 hydrolase, haloacid dehalogenase-like family
MFYKFLLFDLDHTLLDFDAAEDVALTQLLKEEGVADIQAYKDYYVPMNKA
LWKDLELKKISKQELVNTRFSRLFAHFGQEKDGSFLAQRYQFYLAQQGQT
LSGAHDLLDSLIERDYNLYAATNGITAIQTGRLAQSGLAPYFNQVFISEQ
LQTQKPDALFYEKIGQQIAGFSKEKTLMIGDSLTADIQGGNNAGIDTIWY
NPHHLENHTQAQPTYEVYSYQDLLDCLDKNILEKITF
>SP_1909 oxidoreductase, short chain dehydrogenase/reductase family
MAKNVVITGATSGIGEAIARAYLEQGEDVVLTGRRIDRLEALKAEFAETF
PNQTVWTFLLDVTDMTMVKTVCSDILETIGQIDILVNNAGLALGLAPYQD
YEELDMLTMLDTNVKGLMAVTRCFLPAMVKANQGHIINMGSTAGIYAYAG
AAVYSATKAAVKTFSDGLRIDTIATDIKVTTIQPGIVETDFSTVRFHGDK
ERAASVYQGIEALQAQDIADTVVYVTSQPRRVQITDMTIMANQQATGFMV
HKK
>SP_1410 conserved hypothetical protein
MGKLSSILLGTVSGAALALFLTSDKGKQVCSQAQDFLDDLREDPEYAKEQ
VCEKLTEVKEQATDFVLKTKEQVESGEITVDSILAQTKSYAFQATEASKN
QLNNLKEQWQEKAEALDDSEEIVIDITEE
>SP_0848 putative sugar ABC transporter, permease protein
MSIITLLPLLVSSMLIYSAPLIFTSIGGVFSERGGVVNVGLEGIMVMGAF
SGVVFNLEFAEQFGAATPWLSLLVAGLVGSVFSIIHAAATVHFRADHVVS
GTVLNLMAPALAVFLVKVLYNKGQTDNLSQTFGRFDFPVLANIPVIGDIF
FKSTSLLGYLAIAFSFLAWFILFKTQFGLRLRSVGEHPQAADTLGINVYK
MRYLGVIISGFLGGIGGAIYAQSISVNFSVTTIVGPGFIALAAMIFGKWN
PIGAMLSSLFFGLSQSLAVIGSQLPFLQGVPAVYLQIAPYVLTILVLAAF
FGKAVAPKADGINYIKSK
>SP_2225 conserved hypothetical protein
MELVHGISTHFIQSKKFKTNKITVRFTAPLSLDTIAGHMLSASMLETANQ
MYPTSQDLRRHLASLYGTDMSTNCFRRGQSHIIELTFTYVRDEFLSRKNV
LTSQILELVKETLFSPVVVDNGFDPALFEIEKKQLLASLAADMDDSFYFA
HKELDKLFFHDERLQLEYSDLRNRILAETPQSSYSCFQEFLANDRIDFFF
LGDFNEVEIQNVLESFGFKGRKGDVKVQYCQPYSNILQEGMVRKNVGQSI
LELGYHYCSKYGDEQHLPMIVMNGLLGGFAHSKLFTNVRENAGLAYTISS
ELDLFSGFLRMYAGINRENRNQARKMMNNQLLDLKKGYFTEFELNQTKEM
IRWSLLLSQDNQSSLIERAYQNALFGKSSADFKSWIAKLEQIDKDAICRV
ANNVKLQAIYFMEGIE
>SP_1516 acetyltransferase, GNAT family
MFYFISLYFRLENKESHKSQEIGNLIRVYNRSKREEAESEPLNLYVEDEK
GNLLAGLIAETFGNWLEIEYLFVKEELRGQGIGSKLLQQAESEAKNRNCC
FAFVNTYQFQAPDFYQKHGYKEVFSLQDCLYIRQRYYYQKNL
>SP_1246 Cof family protein
MTIKLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLS
LEKLFAGVRDDIIFIAENGSLVEYQGQDLYEATMSRDFYLATFEKLKTSP
YVDINKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDITDDIF
KFTTNFTEETLEDGEAWVNENVPGVKAMTTGFESIDIVLDYVDKGVAIVE
LVKKLGITMDQVMAFGDNLNDLHMMQVVGHPVAPENARPEILELAKTVIG
HHKERSVIAYMEGL
>SP_1520 acetyltransferase, GNAT family
MEIPIKIIQASKSDLPEIGALQTSSFPAEKQQLSHILEESIRKCADTFLL
ARDENQLLGYILSSPQSDNPQCLKVHSLVIEFDHQRQGLGTLLLAALKEV
AVELDYKGIRLESPDELLSYFEMNGFVDEEATLLYATSQGYSMIWFNPFY
LEEQ
>SP_1910 conserved hypothetical protein
MIFTYNKEHVGDVLMVIVKNSGDAKLDVERKGKVARVFLKDNGETVAWNI
FEVSSLFEIAERGQVFLTDEQVARLNQELQAEGFTEEIVNDKEPKFVVGE
IVEMVAHPDSDHLNICQVAVASDKTVQIVAGAPNARVGLKTIVALPGAMM
PKGNLIFPGELRGEKSFGMMCSPRELHLPNAPQKRGVLELSEDQVVGTPF
DPAKHWTA
>SP_0740 MutT/nudix family protein
MNRREAVEFVNMCMIKNGDKVLVQDRVNPDWSGITFPGGHVERGESFVDA
VIREVKEETGLIISKPQLCGIKNWYDDKDYRYVVLFYKTEHFTGELQSSD
EGKVWWEDFENLSHLKLATDDMSDMLRVFLEEDLSEFFYYKNGDDWLYDL
K
>SP_1739 KH domain protein
MSLAIAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQA
EREADLLVNEAKRESKSLKKEALLEAKEEARKYREEVDAEFKSERQELKQ
IESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEV
ERQKEAELERIGALSQAEARDIILAQTEENLTREIASRIREAEQEVKERS
DKMAKDILVQAMQRIAGEYVAESTNSTVHLPDDTMKGRIIGREGRNIRTF
ESLTGVDVIIDDTPEVVTLSGFDPIRREIARMTMEMLLKDGRIHPARIEE
LVEKNRQEIDNKIREYGEAAAYEIGAPNLHPDLMKIMGRLQFRTSYGQNV
LRHSIEVAKLAGIMASELGENAALARRAGFLHDIGKAIDHEVEGSHVEIG
MELARKYKEPPVVVNTIASHHGDVEAESVIAVIVAAADALSAARPGARSE
SLESYIKRLHDLEEIANGFEGVQTSFALQAGREIRIMVNPGKIKDDKVTI
LAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAK
>SP_1995 conserved hypothetical protein
MMKNSFQKSNFLYYGIILVSVLVEVIMICQLESLVPLLYPSFIGFLVFHV
LYHLILFLVAKRSGRWDYLMIWGLFLMFNLLYDSFLGLLFSRFIFWYVMC
LCKIPAISTFLTSPLARFFFYTKLVTHFSKLVAFSF
>SP_0157 hypothetical protein
MYSFTKGAIILSGGPIMKKKPIYLWVLLILSALISVPSLFGIVSPLPSKE
ALRAAQKQVAGVNAQQLEDQLNYTYRVAEASHSIFNVALIVLSTILVVVA
IVFLVRKNLQYANYTYVGYVLLAIIGSIYGYVGLQDAVQLVQDESMRLTV
SIGSKAVSIFYIVINVLFLALVFYKMWRQQKALAEEEETEELT
>SP_1079 GTP-binding protein, GTP1/Obg family
MFLDTAKIKVKAGNGGDGMVAFRREKYVPNGGPWGGDGGRGGNVVFVVDE
GLRTLMDFRYNRHFKADSGEKGMTKGMHGRGAEDLRVRVPQGTTVRDAET
GKVLTDLIEHGQEFIVAHGGRGGRGNIRFATPKNPAPEISENGEPGQERE
LQLELKILADVGLVGFPSVGKSTLLSVITSAKPKIGAYHFTTIVPNLGMV
RTQSGESFAVADLPGLIEGASQGVGLGTQFLRHIERTRVILHIIDMSASE
GRDPYEDYLAINKELESYNLRLMERPQIIVANKMDMPESQENLEDFKKKL
AENYDEFEELPAIFPISGLTKQGLATLLDATAELLDKTPEFLLYDESDME
EEAYYGFDEEEKAFEISRDDDATWVLSGEKLMKLFNMTNFDRDESVMKFA
RQLRGMGVDEALRARGAKDGDLVRIGKFEFEFVD
>SP_1553 ABC transporter, ATP-binding protein
MSDFIVEKLSKSVGDKTVFRDISFIIHDLDRIGLIGVNGTGKTTLLDVLS
GVSGFDGDVSPFSAKNDYQIGYLTQDPDFDDRKTVLDTVLSSELKEIQLI
REYELIMLDYSEDKQARLERVMAEMDSLQAWEIESQVKTVLSKLGIQDLS
TPVGELSGGLRRRVQLAQVLLGNHDLLLLDEPTNHLDIAIIEWLTLFLKN
SKKTVLFITHDRYFLDALSTRIFELDRAGLTEYQGNYQDYVRLKAEQDER
DAALLHKKEQLYKQELAWMRRQPQARATKQQARINRFHDLKKEVSGSSAE
TDLTMNFETSRIGKKVIEFQDVSFAYENKPILQNFNLLVQAKDRIGIVGD
NGVGKSTLLNLIAGSLEPTAGQVVIGETVRIAYFSQQIEGLDESKRVINY
LQEVAEEVKTSGGSTTSIAELLEQFLFPRSTHGTLIEKLSGGEKKRLYLL
KLLLEKPNVLLLDEPTNDLDIATLTVLENFLQGFAGPVLTVSHDRYFLDK
VATKILAFEDGKIRPFFGHYTDYLDEKAFETDMANQVQKAEKEKVVKVRE
DKKRMTYQEKQEWASIEGDIETLEKRIAAIEEEMQANGSDFGKLATLQKE
LDEKNEALLEKYERYEYLSEFDS
>SP_1422 hypothetical protein
MKNRFYYSQLLDEREEQLFNKAGSESFYICIALSLLSYIISVLAPSLFNS
NMLLIVIIIGTFYFFNRARYLGVTYYGRFHFTILGCFFLTLAITALLMLQ
NYQFNIEIYQHNPLNFKYLSAWVITYIIYLPWIFIGNLGLKSYGEWAQKK
FEQDMDELESGE
>SP_1600 putative membrane protein
MKQFLERASILALSLVLITSFSISSALPAMFDYYQGYSKEQIELLVSLPS
FGIMMMLLLNGFLEKIFPERLQISLGLLILSLSGTAPFWYQAYPFVFGTR
LLFGLGLGMINAKAISIISERYQGKRRIQMLGLRASAEVVGASLITLAVG
QLLAFGWTAIFLAYSAGFLVLPLYLLFVPYGKSKKEVKKRAKEASRLTRE
MKGLIFTLAIEAAVVVCTNTAITIRIPSLMVERGLGDAQLSSFVLSIMQL
IGIVAGVSFSFLISIFKEKLLLWSGITFGLGQIVIALSSSLWVVVAGSVL
AGFAYSVVLTTVFQLVSERIPAKLLNQATSFAVLGCSFGAFTTPFVLGAI
GLLTHNGMLVFSILGGWLIVISIFVMYLLQKRA
>SP_0160 conserved domain protein
MKKQVFHDAATGVLIGLILSILFSLIYAPNTYAPLNPYSLIGQVMDQHQV
HGALVLLYCTLIWATIGMLFNFGNRLFSRDWSMLRATLTHFFLMLAGFVP
LATLAGWFPFHWIFYLQLIIEFAIVYLIIWAILYKREAKKVDHINQLLEH
RK
>SP_0356 putative polysaccharide transporter
MKVDRISFIKNTSSLYILNIVKLLFPLLTLPYLTRVLSLDAYGMVIYVKA
LIAYVQLVIDFGFMISATKNIVNACTTPSKIGRIVGDTLVEKIFLSIISI
LIYTILMWQIPIMRENILFSVFYLLATVTNIFIFDFLFRGIEKMHAVAIP
YIISKTIITILTFIVVKDDSSILWIPILEGIGNLVAAVVSYRFLHYYGIK
LSFSYLSVWVKDLKESSIYFLSNFATTIFGVFTTVISGFYLQSQEIAFWG
IAMQLLSAAKSLYNPIANSLYPHMIRTKDIQSVKSINRIMFIPIIFGVLI
VLFFSNQILSIIGGEKYTVSADFLKYLLPAFVASFYSMIYGWPVLGAIDK
VKETTMTTILASIVQTLGLGIFILSDNFSLVTLAICSSMSEVVLWISRYL
IYFKNRSLFVRSK
>SP_1325 oxidoreductase, Gfo/Idh/MocA family
MKIYKKRIKIMVNYGIVGAGYFGADLARSMNKIEDAKVVAVFDPNHGEEV
AQELGSDVCASLDELVAREDIDCVIVASPSYLHREPVVKAAQHGKHVFCE
KPIALSYEDCKAMVDACKENNVIFMAGHIMNFFNGVHHAKELITQGKIGK
VLYCHAARTGWEEQQPTVSWKKLRSQSGGHLYHHIHELDCIQFIMGGLPE
KATMVGGNVYHKGENFGDEDDMLIVNLEYSDDRYAVLEYGNAFRWGEHYV
LIQGTEGAIKLDLFNTGGTLRVKGEGESHFLVHETQEEDDDRTAIYTGRG
MDGAIAYGKPGVRCPLWLQTCIDKEMEYLHDIIKGGEITEEFEKLLNGVA
ALESIATADACTLSVKEDRKVSLSEITNA
>SP_0404 hypothetical protein
MNFMANLNRFKFTFGKKSLTLTSEHDNLFMEEIAKVATEKYQAIKEQMPS
ADDETIALLLAVNCLSTQLSREIEFDDKEQELEELRHKLVTCKQEQSKIE
DSL
>SP_1344 conserved hypothetical protein
MDYNFNLEHPFFFTNNDYSTDTSIKYQVSLPFNWHEVMNNDEWVYQYPIG
KFVERQGWKIHISSEYNSSHELLQDVAKICHEMRIPFKHLSTEDKFIMRN
GKLVSRGFSGKFITCYPNQNELESVLQRLESALKQYNGPYILSDKRWDEA
PIYLRYGVFRPSRDDEKKVAIDELIVGDEVVKDERLPVFKIPKGIVPPDF
LNKWLDKKDKKQGDFPFIIDNAIRFSNSGGIYNARLKEDGKKIILKEARP
YTGLGFDGTYSSEKLASECKALKILNEWSETPKIYWHGKIWEHTFLGIEH
MKGVPLNRWVTNNFPLYEVVDKTKDYLLRVSKIVEKLIDLTNKFHSENVY
HQDLHLGNILVKDEDEISIIDWEQAVFSNDEKVVHKVAAPGFRAWRETLP
SEIDWYGIRQIAHYLYMPLVTTSDLTYNYVSQTRIEGKKLFESLGYTREH
IDYVESLLSYLDSKCPQIENISRKKVLKPMHEIRTIESEQDIQDFIIKLL
RGFTLTYGQWRKEFQSRFFPVHYYGLNFNQGIAFSDLAILWSYQQLAKKV
KNFKFDDYYEIRTQVINEAVNNFKKSSLSGLFDGKIGTIWLIYEFGEIDR
AVELFTTHFIEIFENSQNKNLYSGQAGILLVGLYFLSKGGIDNKLGEEIL
IRLREYTLNYIENPETFCKVGASDVQSNDPYENFGGLLYGHAGVAWLFGE
AYKLTGESIYKNGLELAVDKELVAYKVDSNNSLQYSQGHRLLPYLATGSA
GLLLLINRNKEILSSKYLKYLTSLERATDVVFCVLPGLFNGFCGLEVANN
IYSDIDDNFSGQKKLIEQLYRYLCVIEEGFVIAGDNGLKITTDIASGFAG
VAIGLVSIMDNKLTILPQI
>SP_0791 oxidoreductase, aldo/keto reductase family
MRYITLGQDDKELSEIVLGMMRIKDKSVKEVEELVETALSVGINAFDLAD
IYGRGRCEELLGLVLKNRPDLREKMWIQSKCGIRIEEFTYFDFSKDYIIK
SVDGILQRLKIDHLDSLLLHRPDALMESDQVAEAFNLLYKQGKVRDFGVS
NQNPMMMELLKKDVKQPLAVNQLQLSAAFTPGFESAFHVNMEDSQAAMRD
GSIFEYCQLHDVVIQAWSVLQFGYFKGNFVGNEKFQALNQVLDRLAIKYG
VTSSTIAISWILRYPAKMQAVVGTTNPKHLREVSRAANFSLTRKEWYEIY
LAAGNNLP
>SP_0640 hypothetical protein
MSRRKKAYQGRKIGSQLLATLESEARKKVGYLQVKTVAEGSNKDYDRTND
FYRGLGFKKLEIFPQLWNPQNPCQILIKKLE
>SP_0545 blpY, immunity protein BlpY
MKKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNIL
SLLFIGVMIVVLVKTGHGYLFRIPRKKWLWYSILTVLVLVFQISFNVQTA
KHVQSTAEGWAVLIGYSGTNFAELGIYIALFFLVPLMEELIYRGLLQHAF
FKHSRFGLDLLLPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTKS
IYPSYAVHVINNIVATFPFLLTFLHRVLG
>SP_1980 cbf1, cmp-binding-factor 1
MKKDELFEGFYLIKSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHN
IEAFTAGKVVHMKGRREVYNNTPQVNQITLRLPQAGEPNDPADFKVKSPV
DVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDKEFYSYPAAKTNHHAFE
TGLAYHTATMVRLADAISEVYPQLNKSLLYAGIMLHDLAKVIELTGPDQT
EYTVRGNLLGHIALIDSEITKTVMELGIDDTKEEVVLLRHVILSHHGLLE
YGSPVRPRIMEAEIIHMIDNLDASMMMMSTALALVDKGEMTNKIFAMDNR
SFYKPDLD
>SP_2190 cbpA, choline binding protein A
MFASKSERKVHYSIRKFSVGVASVVVASLVMGSVVHATENEGATQVPTSS
NRANESQAEQGEQPKKLDSERDKARKEVEEYVKKIVGESYAKSTKKRHTI
TVALVNELNNIKNEYLNKIVESTSESQLQILMMESRSKVDEAVSKFEKDS
SSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEE
DRRNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQKIKQAEA
EVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGEL
ATPDKKENDAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQ
KEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRNEEKVKQ
AKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPA
PAPKAEKPAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQ
PPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSN
GAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGSMATGWLQY
NGSWYYLNANGSMATGWLQYNGSWYYLNANGDMATGWVKDGDTWYYLEAS
GAMKASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN
>SP_0377 cbpC, choline binding protein C
MKLLKKMMQVALAVFFFGLLATNTVFANTTGGRFVDKDNRKYYVKDDHKA
IYWHKIDGKTYYFGDIGEMVVGWQYLEIPGTGYRDNLFDNQPVNEIGLQE
KWYYFGQDGALLEQTDKQVLEAKTSENTGKVYGEQYPLSAEKRTYYFDNN
YAVKTGWIYEEGHWYYLNKLGNFGDDSYNPLPIGEVAKGWTQDFHVTIDI
DRSKPAPWYYLDASGKMLTDWQKVNGKWYYFGSSGSMATGWKYVRGKWYY
LDNKNGDMKTGWQYLGNKWYYLRSSGAMVTGWYQDGSTWYYLDPSNGDMK
IGWTKVNGKWYYLNSNGAMVTGSQTIDGKVYNFASSGEWI
>SP_2201 cbpD, choline binding protein D
MKILPFIARGTSYYLKMSVKKLVPFLVVGLMLAAGDSVYAYSRGNGSIAR
GDDYPAYYKNGSQEIDQWRMYSRQCTSFVAFRLSNVNGFEIPAAYGNANE
WGHRARREGYRVDNTPTIGSITWSTAGTYGHVAWVSNVMGDQIEIEEYNY
GYTESYNKRVIKANTMTGFIHFKDLDGGSVGNSQSSTSTGGTHYFKTKSA
IKTEPLASGTVIDYYYPGEKVHYDQILEKDGYKWLSYTAYNGSYRYVQLE
AVNKNPLGNSVLSSTGGTHYFKTKSAIKTEPLVSATVIDYYYPGEKVHYD
QILEKDGYKWLSYTAYNGSRRYIQLEGVTSSQNYQNQSGNISSYGSHSSS
TVGWKKINGSWYHFKSNGSKSTGWLKDGSSWYYLKLSGEMQTGWLKENGL
WYYLGSSGAMKTGWYQVSGKWYYSYSSGALAVNTTVDGYRVNSDGERV
>SP_0930 cbpE, choline binding protein E
MKKKLTSLALVGAFLGLSWYGNVQAQESSGNKIHFINVQEGGSDAIILES
NGHFAMVDTGEDYDFPDGSDSRYPWREGIETSYKHVLTDRVFRRLKELGV
QKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDN
LYGYDKVLQTAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGE
LKKIWDDNSNSLISVVKVNGKKIYLGGDLDNVHGAEDKYGPLIGKVDLMK
FNHHHDTNKSNTKDFIKNLSPSLIVQTSDSLPWKNGVDSEYVNWLKERGI
ERINAASKDYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQ
APDSTGEYAVGWNEIEGEWYYFNQTGILLQNQWKKWNNHWFYLTDSGASA
KNWKKIAGIWYYFNKENQMEIGWIQDKEQWYYLDVDGSMKTGWLQYMGQW
YYFAPSGEMKMGWVKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMK
QGWHKKANDWYFYKTDGSRAVGWIKDKDKWYFLKENGQLLVNGKTPEGYT
VDSSGAWLVDVSIEKSATIKTTSHSEIKESKEVVKKDLENKETSQHESVT
NFSTSQDLTSSTSQSSETSVNKSESEQ
>SP_0391 cbpF, choline binding protein F
MKLLKKMMQIALATFFFGLLATNTVFADDSEGWQFVQENGRTYYKKGDLK
ETYWRVIDGKYYYFDPLSGEMVVGWQYIPAPHKGVTIGPSPRIEIALRPD
WFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQR
SYHTLKTGWIYEEGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYD
EEKLKAAPWYYLNPATGIMQTGWQYLGNRWYYLHSSGAMATGWYKEGSTW
YYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGD
MKTGWFQVNGNWYYAYDSGALAVNTTVGGYYLNYNGEWVK
>SP_0069 cbpI, choline binding protein I
MGMAAFKNPNNQYKAITIAQTLGDDASSEELAGRYGSAVQCTEVTASNLS
TVKTKATVVEKPLKDFRASTSDQSGWVESNGKWYFYESGDVKTGWVKTDG
KWYYLNDLGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGA
MKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYYAYGSGALAVSTTTPDG
YRVNGNGEWVN
>SP_0378 cbpJ, choline binding protein J
MKILKKTMQVGLTVFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLK
ETYWRVIDGKYYYFDSLSGEMVVGWQYIPFPSKGSTIGPYPNGIRLEGFP
KSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYY
TNYYFNQNHSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDST
WYYLDPTTGIMQTGWQYLGNKWYYLRSSGAMATGWYQEGTTWYYLDHPNG
DMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQV
NGNWYYAYSSGALAVNTTVDGYSVNYNGEWVR
>SP_0978 coiA, competence protein CoiA
MFVARDARGELVNVLEDKLEKQAYTCPACGGQLHLRQGPSVRTHFAHKSL
KDCDFFFENESPEHLANKESLYHWLKKETKVQLEYPLSELKQIADVFVNG
NLALEVQCSPLPQKVLKERSEGYRSQGYQVLWLLGQKLWLKERLTRLQQG
FLYFSQNMGFYVWELDKEKQVLRLKYLIYQDLRGKLHYQIKEFSYGQGSL
LEILRLPYKRQKISHFTVSEDKDICRYIRQQLYYQNLFWMKEQAEAYQKG
ENILTYGLKEWYPQIRPIVGKFFQIEQDLTSYYQHFYTYYQKNPQNDWQK
LYPPAFYQQYFLKNMVE
>SP_0969 era, GTP-binding protein Era
MTFKSGFVAILGRPNVGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTT
DKEQIVFIDTPGIHKPKTALGDFMVESAYSTLREVDTVLFMVPADEARGK
GDDMIIERLKAAKVPVILVVNKIDKVHPDQLLSQIDDFRNQMDFKEIVPI
SALQGNNVSRLVDILSENLDEGFQYFPSDQITDHPERFLVSEMVREKVLH
LTREEIPHSVAVVVDSMKRDEETDKVHIRATIMVERDSQKGIIIGKGGAM
LKKIGSMARRDIELMLGDKVFLETWVKVKKNWRDKKLDLADFGYNEREY
>SP_0614 estA, tributyrin esterase
MAVMKIEYYSQVLDMEWGVNVLYPDANRVEEPECEDIPVLYLLHGMSGNH
NSWLKRTNVERLLRGTNLIVVMPNTSNGWYTDTQYGFDYYTALAEELPQV
LKRFFPNMTSKREKTFIAGLSMGGYGCFKLALTTNRFSHAASFSGALSFQ
NFSPESQNLGSPAYWRGVFGEIRDWTTSPYSLESLAKKSDKKTKLWAWCG
EQDFLYEANNLAVKNLKKLGFDVTYSHSAGTHEWYYWEKQLEVFLTTLPI
DFKLEERLT
>SP_0421 fabG, 3-oxoacyl-[acyl-carrier protein] reductase
MKLEHKNIFITGSSRGIGLAIAHKFAQAGANIVLNSRGAISEELLAEFSN
YGIKVVPISGDVSDFADAKRMIDQAIAELGSVDVLVNNAGITQDTLMLKM
TEADFEKVLKVNLTGAFNMTQSVLKPMMKAREGAIINMSSVVGLMGNIGQ
ANYAASKAGLIGFTKSVAREVASRNIRVNVIAPGMIESDMTAILSDKIKE
ATLAQIPMKEFGQAEQVADLTVFLAGQDYLTGQVVAIDGGLSM
>SP_0419 fabK, enoyl-(acyl-carrier-protein) reductase
MKTRITELLKIDYPIFQGGMAWVADGDLAGAVSKAGGLGIIGGGNAPKEV
VKANIDKIKSLTDKPFGVNIMLLSPFVEDIVDLVIEEGVKVVTTGAGNPS
KYMERFHEAGIIVIPVVPSVALAKRMEKIGADAVIAEGMEAGGHIGKLTT
MTLVRQVATAISIPVIAAGGIADGEGAAAGFMLGAEAVQVGTRFVVAKES
NAHPNYKEKILKARDIDTTISAQHFGHAVRAIKNQLTRDFELAEKDAFKQ
EDPDLEIFEQMGAGALAKAVVHGDVDGGSVMAGQIAGLVSKEETAEEILK
DLYYGAAKKIQEEASRWTGVVRND
>SP_2228 guaB, inosine-5'-monophosphate dehydrogenase
MSNWDTKFLKKGFTFDDVLLIPAESHVLPNDADLTTKLADNLTLNIPIIT
AAMDTVTESQMAIAIARAGGLGVIHKNMSIAQQADEVRKVKRSENGVIID
PFFLTPEHTIAEADELMGRYRISGVPVVETLENRKLVGILTNRDLRFISD
YNQPISNHMTSENLVTAPVGTDLATAESILQEHRIEKLPLVDEEGSLSGL
ITIKDIEKVIEFPNAAKDEFGRLLVAGAVGVTSDTFERAEALFEAGADAI
VIDTAHGHSAGVLRKIAEIRAHFPDRTLIAGNIATAEGARALYEAGVDVV
KVGIGPGSICTTRVIAGVGVPQVTAIYDAAAVAREYGKTIIADGGIKYSG
DIVKALAAGGNAVMLGSMFAGTDEAPGETEIFQGRKFKTYRGMGSIAAMK
KGSSDRYFQGSVNEANKLVPEGIEGRVAYKGAAADIVFQMIGGIRSGMGY
CGAANLKELHDNAQFIEMSGAGLKESHPHDVQITNEAPNYSM
>SP_0672 hflX, GTP-binding protein HflX
MIETEKKEERVLLIGVELQGMDSFDLSMEELASLAKTAGAVVVDSYRQKR
EKYDSKTFVGSGKLEEIALMVDAEEITTVIVNNRLTPRQNVNLEEVLGVK
VIDRMQLILDIFAMRARSHEGKLQVHLAQLKYLLPRLVGQGIMLSRQAGG
IGSRGPGESQLELNRRSVRNQITDIERQLKVVEKNRATVREKRLESSTFK
IGLIGYTNAGKSTIMNILTSKTQYEADELFATLDATTKSIHLGGNLQVTL
TDTVGFIQDLPTELVSSFKSTLEESKHVDLLVHVIDASNPYHEEHEKTVL
SIMKDLDMEDIPHLTLYNKADLVEDFTPTQTPYTLISAKSEDSRENLQAL
LLDKIKEIFEAFTLRVPFSKSYKIHDLESVAILEERDYQEDGEVITGYIS
EKNKWRLEEFYD
>SP_1268 licB, licB protein
MKSKNGVPFGLLSGIFWGLGLTVSAYIFSIFTDLSPFVVAATHDFLSIFI
LLAFLLVKEGKVRLSIFLNIRNVSVIIGALLAGPIGMQANLYAVKYIGSS
LASSVSAIYPAISVLLAFFFLKHKISKNTVFGIVLIIGGIIAQTYKVEQV
NSFYIGILCALVCAIAWGSESVLSSFAMESELSEIEALLIRQVTSFLSYL
VIVLFSHQSFTAVANGQLLGLMIVFAAFDMISYLAYYIAINRLQPAKATG
LNVSYVVWTVLFAVVFLGAPLDMLTIMTSLVVIAGVYIIIKE
>SP_0965 lytB, endo-beta-N-acetylglucosaminidase
MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTH
YQSWFYIKADANYAENEWLKQGDDYFYLKSGGYMAKSEWVEDKGAFYYLD
QDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEW
LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYI
KENGNYADKEWIFENGHYYYLKSGGYMAANEWIWDKESWFYLKFDGKMAE
KEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYD
SHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAW
YYFKSGGYMAKNETVDGYQLGSDGKWLGGKTTNENAAYYQVVPVTANVYD
SDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDAS
KDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGF
KLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHY
HINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVD
KGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKI
NEKLGGKD
>SP_0788 metG, methionyl-tRNA synthetase
MSEKNFYITTPIYYPSGKLHIGSAYTTIACDVLARYKRLMGYDVFYLTGL
DEHGQKIQQKAEEAGITPQAYVDGMAVGVKELWQLLDISYDKFIRTTDDY
HEKVVAQVFERLLAQDDIYLGEYSGWYSVSDEEFFTESQLAEVFRDEAGN
VTGGIAPSGHEVEWVSEESYFLRLSKYQDRLVEFFKAHPEFITPDGRLNE
MLRNFIEPGLEDLAVSRTTFTWGVPVPSNPKHVVYVWIDALLNYATALGY
AQDEHGNFDKFWNGTVFHMVGKDILRFHSIYWPILLMMLDVKLPDRLIAH
GWFVMKDGKMSKSKGNVVYPEMLVERYGLDPLRYYLMRNLPVGSDGTFTP
EDYVGRINYELANDLGNLLNRTVSMINKYFDGQIPAYVEGVTEFDHVLAE
VAEQSIADFHTHMEAVDYPRALEAVWTLISRTNKYIDETAPWVLAKDEAL
RDQLASVMSHLAASIRVVAHLIEPFMMETSRAVLTQLGLEEVSSLENLSL
ADFPADVTVVAKGTPIFPRLNMEEEIAYIKEQMEGNKPAVEKEWNPDEVE
LKLNKDEIKFEDFDKVEIRVAEVKEVSKVEGSDKLLQFRLDAGDGEDRQI
LSGIAKYYPNEQELVGKKVQIVANLKPRKMMKKYVSQGMILSAEHDGKLT
LLTVDPAVPNGSVIG
>SP_1168 mutX, mutator MutT protein
MPQLATICYIDNGKELLMLHRNKKPNDVHEGKWIGVGGKLERGETPQECA
AREILEETGLKAKPVLKGVITFPEFTPDLDWYTYVFKVTEFEGDLIDCNE
GTLEWVPYDEVLSKPTWEGDHTFVEWLLEDKPFFSAKFVYDGDKLLDTQV
DFYE
>SP_1469 nox, NADH oxidase
MSKIVVVGANHAGTACINTMLDNFGNENEIVVFDQNSNISFLGCGMALWI
GEQIDGAEGLFYSDKEKLEAKGAKVYMNSPVLSIDYDNKVVTAEVEGKEH
KESYEKLIFATGSTPILPPIEGVEIVKGNREFKATLENVQFVKLYQNAEE
VINKLSDKSQHLDRIAVVGGGYIGVELAEAFERLGKEVVLVDIVDTVLNG
YYDKDFTQMMAKNLEDHNIRLALGQTVKAIEGDGKVERLITDKESFDVDM
VILAVGFRPNTALAGGKIELFRNGAFLVDKKQETSIPDVYAVGDCATVYD
NARKDTSYIALASNAVRTGIVGAYNACGHELEGIGVQGSNGISIYGLHMV
STGLTLEKAKAAGYNATETGFNDLQKPEFMKHDNHEVAIKIVFDKDSREI
LGAQMVSHDIAISMGIHMFSLAIQEHVTIDKLALTDLFFLPHFNKPYNYI
TMAALTAEK
>SP_2136 pcpA, choline binding protein PcpA
MKKTTILSLTTAAVILAAYVPNEPILADTPSSEVIKETKVGSIIQQNNIK
YKVLTVEGNIGTVQVGNGVTPVEFEAGQDGKPFTIPTKITVGDKVFTVTE
VASQAFSYYPDETGRIVYYPSSITIPSSIKKIQKKGFHGSKAKTIIFDKG
SQLEKIEDRAFDFSELEEIELPASLEYIGTSAFSFSQKLKKLTFSSSSKL
ELISHEAFANLSNLEKLTLPKSVKTLGSNLFRLTTSLKHVDVEEGNESFA
SVDGVLFSKDKTQLIYYPSQKNDESYKTPKETKELASYSFNKNSYLKKLE
LNEGLEKIGTFAFADAIKLEEISLPNSLETIERLAFYGNLELKELILPDN
VKNFGKHVMNGLPKLKSLTIGNNINSLPSFFLSGVLDSLKEIHIKNKSTE
FSVKKDTFAIPETVKFYVTSEHIKDVLKSNLSTSNDIIVEKVDNIKQETD
VAKPKKNSNQGVVGWVKDKGLWYYLNESGSMATGWVKDKGLWYYLNESGS
MATGWVKDKGLWYYLNESGSMATGWVKDKGLWYYLNESGSMATGWVKDKG
LWYYLNESGSMATGWVKDKGLWYYLNESGSMATGWVTVSGKWYYTYNSGD
LLVNTTTPDGYRVNANGEWVG
>SP_0894 pepX, X-pro dipeptidyl-peptidase
MRFNQYSYINFPKENVLSELKKCGFDLQNTANHKDSLETFLRRFFFTYQD
TNYPLSILAADKKTDLLTFFQSEDELTADIFYTVAFQLLGFSYLVDFEDS
DVFRKETGFPIIYGDLIENLYQLLNTRTKKGNTLIDQLVSDGLIPEDNDY
HYFNGKSLATFSNQDVIREVVYVESRVDTDQKGLSDLVKVSIIRPRFDGK
IPAIMTASPYHQGTNDKASDKALYKMEGELEVKLPHKIELEKPQLNLVQP
QGKAELIAEAEEKLTHINSSYTLNDYFLPRGFANLYVSGVGTKDSTGFMT
NGDYQQIEAYKNVIDWLNGRCRAFTDHTRQRQVKADWSNGKVATTGLSYL
GTMSNGLATTGVDGLEVIIAEAGISSWYNYYRENGLVTSPGGYPGEDFDS
LAELTYSRNLLAGDYIRGNEAHQADLEKVKAQLDRKTGDYNQFWHDRNYL
LNAHKVKAEVVFTHGSQDWNVKPLHVYQMFHALPTHIHKHLFFHNGAHVY
MNNWQSIDFRESINALLTKKLLGQETDFQLPTVIWQDNTAPQTWLSLDNF
GGQENCETFSLGQEEQAIQNQYPDKDFERYGKTYQTFNTELYQGKANQIT
INLPVTKDLHLNGRAQLNLRIKSSTNKGLLSAQLLEFGQKKYLQPYPAIL
SARTIDNGRYHMLENLCELPFRPEAQRVVTKGYLNLQNRNDLLLVEDITA
DEWMDVQFELQPTIYKLKEGDTLRLVLYTTDFEITIRDNTDYHLTVDLAQ
SMLTLPC
>SP_0581 pheT, phenylalanyl-tRNA synthetase, beta subunit
MLVSYKWLKELVDIDVPSQELAEKMSTTGIEVEGVESPAAGLSKIVVGEV
LSCEDVPETHLHVCQINVGEEEERQIVCGAPNVRAGIKVMVALPGARIAD
NYKIKKGKIRGLESLGMICSLGELGISDSVVPKEFADGIQILPEDAVPGE
EVFSYLDLDDEIIELSITPNRADALSMCGVAHEVAAIYDKAVNFKKFTLT
ETNEAAADALSVSIETDKAPYYAARILDNVTIAPSPQWLQNLLMNEGIRP
INNVVDVTNYILLYFGQPMHAFDLDTFEGTDIRVREARDGEKLVTLDGEE
RDLAETDLVITVADKPVALAGVMGGQATEISEKSSRVILEAAVFNGKSIR
KTSGRLNLRSESSSRFEKGINVATVNEALDAAASMIAELAGATVRKGIVS
AGELDTSDVEVSSTLADVNRVLGTELSYADVEDVFRRLGFGLSGNADSFT
VSVPRRRWDITIEADLFEEIARIYGYDRLPTSLPKDDGTAGELTVIQKLR
RQVRTIAEGAGLTEIITYALTTPEKAVEFTAQPSNLTELMWPMTVDRSVL
RQNMISGILDTVAYNVARKNKNLALYEIGKVFEQTGNPKEELPNEINSFA
FALTGLVAEKDFQTAAVPVDFFYAKGILEALFTRLGLQVTYTATSEIVSL
HPGRTAVISLGDQVLGFLGQVHPVTAKAYDIPETYVAELNLSAIEGALQP
AVPFVEITKFPAVSRDVALLLKAEVTHQEVVDAIQAAGVKRLTDIKLFDV
FSGEKLGLGMKSMAYSLTFQNPEDSLTDEEVARYMEKIQASLEEKVNAEV
R
>SP_0972 pmrA, multi-drug resistance efflux pump
MTEINWKDNLRIAWFGNFLTGASISLVVPFMPIFVENLGVGSQQVAFYAG
LAISVSAISAALFSPIWGILADKYGRKPMMIRAGLAMTITMGGLAFVPNI
YWLIFLRLLNGVFAGFVPNATALIASQVPKEKSGSALGTLSTGVVAGTLT
GPFIGGFIAELFGIRTVFLLVGSFLFLAAILTICFIKEDFQPVAKEKAIP
TKELFTSVKYPYLLLNLFLTSFVIQFSAQSIGPILALYVRDLGQTENLLF
VSGLIVSSMGFSSMMSAGVMGKLGDKVGNHRLLVVAQFYSVIIYLLCANA
SSPLQLGLYRFLFGLGTGALIPGVNALLSKMTPKAGISRVFAFNQVFFYL
GGVVGPMAGSAVAGQFGYHAVFYATSLCVAFSCLFNLIQFRTLLKVKEI
>SP_0117 pspA, pneumococcal surface protein A
MNKKKMILTSLASVAILGAGFVTSQPTFVRAEESPQVVEKSSLEKKYEEA
KAKADTAKKDYETAKKKAEDAQKKYEDDQKRTEEKARKEAEASQKLNDVA
LVVQNAYKEYREVQNQRSKYKSDAEYQKKLTEVDSKIEKARKEQQDLQNK
FNEVRAVVVPEPNALAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEV
EAKELEIEKLQYEISTLEQEVATAQHQVDNLKKLLAGADPDDGTEVIEAK
LKKGEAELNAKQAELAKKQTELEKLLDSLDPEGKTQDELDKEAEEAELDK
KADELQNKVADLEKEISNLEILLGGADPEDDTAALQNKLAAKKAELAKKQ
TELEKLLDSLDPEGKTQDELDKEAEEAELDKKADELQNKVADLEKEISNL
EILLGGADSEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEEET
PAPAPQPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAP
KPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYNTDGSMAIGWLQ
NNGSWYYLNANGAMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVN
SNGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWA
KVNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLE
ASGAMKASQWFKVSDKWYYVNGLGALAVNTTVDGYKVNANGEWV
>SP_0370 recU, recombination protein U
MVNYPHKVSSQKRQTSLSQPKNFANRGMSFEKMINATNDYYLSQGLAVIH
KKPTPIQIVQVDYPQRSRAKIVEAYFRQASTTDYSGVYNGYYIDFEVKET
KQKRAIPMKNFHPHQIQHMEQVLAQQGICFVLLHFSSQQETYLLPAFDLI
RFYHQDKGQKSMPLEYIREYGYEIKAGAFPQIPYLNVIKEHLLGGKTR
>SP_2103 rrmA, rRNA (guanine-N1-)-methyltransferase
MNTNLKPKLQRFASATAFACPICQENLTLLETNFKCCNRHSFDLAKFGYV
NLVPQIKQSANYDKENFQNRQQILEAGFYQAILDAVSDLLASSKTTTTIL
DIGCGEGFYSRKLQESHSEKTFYAFDISKDSVQIAAKSEPNWAVNWFVGD
LARLPIKDANMDILLDIFSPANYGEFRRVLSKDGILIKVIPTENHLKEIR
QRVQDQLTNKEYSNQDIKEHFQEHFTILSSQTASLTKTITAEQLQALLSM
TPLLFHVDQSKIDWSKLTEITIEAEILVGKAF
>SP_1984 rsgA, ribosome small subunit-dependent GTPase A
MQGQIIKALAGFYYVESDGQVYQTRARGNFRKKGHTPYVGDWVDFSAEEN
SEGYILKIHERKNSLVRPPIVNIDQAVVIMSVKEPDFNSNLLDRFLVLLE
HKGIHPIVYISKMDLLEDRGELDFYQQTYGDIGYDFVTSKEELLSLLTGK
VTVFMGQTGVGKSTLLNKIAPDLNLETGEISDSLGRGRHTTRAVSFYNLN
GGKIADTPGFSSLDYEVSRAEDLNQAFPEIATVSRDCKFRTCTHTHEPSC
AVKPAVEEGVIATFRFDNYLQFLSEIENRRETYKKVSKKIPK
>SP_0977 tehB, tellurite resistance protein TehB
MEKLVAYKRMPLWNKQTMPEAVQQKHNTKVGTWGKITVLKGALKFIELTE
EGEVLAEHLFEAGADNPMAQPQAWHRVEAATDDVEWYLEFYCKPEDYFAK
KYNTNPVHSEVLEAMQTVKQGKALDLGCGQGRNSLFLAQQDFDVTAVDQN
GLALEILQSIVEQEDLDMPVGLYDINSASIEQEYDFIVSTVVLMFLQADR
IPAIIQNMQEKTSVGGYNLIVCAMDTEDYPCSVNFPFTFKEGELADYYKD
WELVKYNENPGHLHRRDENGNRIQLRFATLLAKKIK
>SP_1016 thdF, thiophene and furan oxidation protein ThdF
MITREFDTIAAISTPLGEGAIGIVRLSGTDSFAIAQKIFKGKDLNKVASH
TLNYGHIIDPLTGKVMDEVMVGAMKSPKTFTREDIIEINTHGGIAVTNEI
LQLAIREGARLAEPGEFTKRAFLNGRVDLTQAEAVMDIIRAKTDKAMNIA
VKQLDGSLSDLINNTRQEILNTLAQVEVNIDYPEYDDVEEATTAVVREKT
MEFEQLLTKLLRTARRGKILREGISTAIIGRPNVGKSSLLNNLLREDKAI
VTDIAGTTRDVIEEYVNINGVPLKLIDTAGIRETDDIVEQIGVERSKKAL
KEADLVLLVLNASEPLTAQDRQLLEISQETNRIILLNKTDLPETIETSEL
PEDVIRISVLKNQNIDKIEERINNLFFENAGLVEQDATYLSNARHISLIE
KAVESLQAVNQGLELGMPVDLLQVDLTRTWEILGEITGDAAPDELITQLF
SQFCLGK
>SP_1225 vicX, vicX protein
MSEIGFKYSILASGSSGNSFYLETSKKKLLVDAGLSGKKITSLLAEINRK
PEDLDAILITHEHSDHIHGVGVLARKYGMDLYANEKTWQAMENSKYLGKV
DSSQKHIFEMGKTKTFGDIDIESFGVSHDAVAPQFYRFMKDDKSFVLLTD
TGYVSDRMAGIVENADGYLIEANHDVEILRSGSYAWRLKQRILSDLGHLS
NEDGAEAMIRTLGNRTKKIYLGHLSKENNIKELAHMTMVNQLAQADLGVG
VDFKVYDTSPDTATPLTEI
>SP_1017 xylH, 4-oxalocrotonate tautomerase
MPFVRIDLFEGRTLEQKKALAKEVTEAVVRNTGAPQSAVHVIINDMPEGT
YFPQGEMRTK
>SP_1665 ylmE, ylmE protein
MNVKENTELVFREVAEASLSAHRESGSVSVIAVTKYVDVPTAEALLPLGV
HHIGENRVDKFLEKYEALKDRDVTWHLIGTLQRRKVKDVIQYVDYFHALD
SVKLAGEIQKRSDRVIKCFLQVNISKEESKHGFSREELLEILPELARLDK
IEYVGLMTMAPFEASSEQLKEIFKAAQDLQREIQEKQIPNMPMTELSMGM
SRDYKEAIQFGSTFVRIGTSFFK