GenColors

Gene list

Applied filters:

Gene type: CDS
Genomic element: pSS_046

Number of genes found: 238

# Shigella sonnei Ss046, Ss046

>SSO_P211 putative transposase
MDEKKLKALAAELAKGLKTEADLNQFSRMQTKLTVETVLNAELTDHLGHE
KSYIR
>SSO_P037 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_P132 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_P125 IS3 ORF1
MTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLYAWR
SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK
>SSO_P177 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P161 conserved hypothetical protein
MVGVAGAALAPLVKLLRHELLTRDVIHADETSLRLLDTRKGGKSCSGWLC
AYVSGERSGPPVVCFDSQTGRALRYPETWLQCWCGGTLVSDGYSVYKSLA
DNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKIAGLYRIEKLIRE
RCQRHDV
>SSO_P141 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW
VRQHERDTGGGEVGSPPLNVSV
>SSO_P082 IS100 ORF1
MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>SSO_P145 IS629 ORF1
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_P221 IS630 ORF
MTWELILDGYSESSYSATPRFAAARLPWFRVIYQPVYSPWVNHVERLWQA
LHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P126 conserved hypothetical protein
MNDNSLLRNSSLFIAYMGCVGWVSAYSYGWGTSFYYGFPWWVVGAGLDDV
ARSLLYAIIVMGILFTGWGIGILFFLLIKKRSKIQDLSFFRLFFAITLLF
FPVIFELLILKQYFILPLSLSCIISSLVISIIIRIYGRIFSVSCFSDIPF
VREHRIKLIMAGFLVYFWLFSFLVGWYKPQLKKEYQMLCYNNSWYYILAR
YDSRLVLSSSFKDDSNRFLIFNTEQSGFYEINDVYVRK
>SSO_P054 ISSfl1 ORF1
MVHKSDSDELSALRAENARIIKPLLLPEPATPRAGRPWAEHRKIINGMFW
VLCSGAPWRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGF
IDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQ
TEVASR
>SSO_P066 ISSfl1 ORF1
MACYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDAGDAANLLI
>SSO_P076 IS1353 putative transposase-like protein
MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY
RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI
TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE
>SSO_P176 conserved hypothetical protein
MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF
VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS
RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD
DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK
ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPKGREVTFSAFSDWLPRNR
AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP
TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV
QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV
NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR
LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV
KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA
EIYKFFTNALYVALTRATHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE
ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK
QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL
LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ
LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL
SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC
VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN
PGLKIRQGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE
RSEKSVGYWVGGIRRAAQKA
>SSO_P194 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P035 conserved hypothetical protein
MFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIPTVSDR
IAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCWRYSWILE
VDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGELIT
RTRGTPQGGVISPLLANLFHHYAFDLWMEREYRGYRLRGTLTIL
>SSO_P063 ISSfl1 ORF1
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>SSO_P238 putative IS91 ORF2
MARSAKPRKRKPASQRSKLPRYVVKLHEDDFFDEEDAEVLRFDSFDDAVE
CCADLNIPFFVDAGNKKLVFWFVRVDDEGYPEIARCTEREFATILSGISA
GGMYCPECGTVHWPDGVAPPF
>SSO_P187 putative antirestriction protein
MQYAKPVTLNVEECDRLSFLPYLFGNDFLYAEAYVYALAQKMMPEYQGGF
WHFIRLPDGGGYMMPDGDRFHMVNGANWFDRTVSADAAGIILTSLVINRQ
LWLYHDSGDAGLTLR
>SSO_P081 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P200 conserved hypothetical protein
MRKYIPLVLFIFSWPVLSADIHGRVVRVLDGDTIEVMDSLKAVRIRLVNI
DAPEKKQDYGRWSTDMMKSLVAGKTVTVTYFQRDRYGRILGQVYAPDGMN
INQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWIWM
HRK
>SSO_P237 IS91 ORF
MACGTTLMGYTQWCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYL
LSLVPDCPWQHIVFTLPCQYWSLVFHNRWLLTEMSRIAADVILEICHQAD
VEPGIFTVIHTWGRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSM
WRYRITRLLSRKYPDLVMPDALAAEGSSKREWNRFLDTHYRRGWNVNVSR
VMDNATHVAVYFGSYLKKPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYL
LMSGDEFMERFSWHVADKGFRMVRYYGFLSPAKRRLLEEVVYIITETVRK
TAMQITWRGMYQRLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHELL
ARMRWCG
>SSO_P172 putative IS orf
MAGRRLGVPKSTVCGMFVRFRNAGLSWPLPAGMSEQELDALLYGSASTVP
VVLTESTVMPKLPVVKKRPRRPNADQLRIS
>SSO_P006 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P178 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P030 iso-IS1 ORF2
MVTSDDWGSYAREVPKEKHLTGKIFTQRIERNNRTLRTRIKRLARKTICF
SRSVEIHEKVIGSFIEKHMFY
>SSO_P071 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRKNHGSLAAANRGVAEYELSE
>SSO_P055 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SSO_P151 putative reverse transcriptase, fragment
MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY
KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI
PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH
QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI
ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP
GVPFERYADDVVCHCHSQWQADALISGLRQRLAQCGLQLHPQKTRIVYCK
DADRRGDYPETSFDFLGYTFRPRLSMNRWGKTFVNFSPAMSARAGKAIRQ
EVRRIAVTSPCTSWRICSMRKSEAG
>SSO_P080 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_P122 putative transposase
MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS
RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGYYIIHRK
NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS
WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ
FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT
S
>SSO_P064 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SSO_P135 IS91 ORF
MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE
ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE
KNGEANHKERDVSAVTEG
>SSO_P074 IS600 ORF2
MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA
TTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHTDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFRIKYYQMTA
>SSO_P028 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P173 IS4 ORF
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P042 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_P033 putative transposase, fragment
MSEQKITGIDLAKTNFYLFSINAHGKPAGKTKLSRNQLLNWLVQQPKMTV
AMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYNDAQAIAEA
CQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRGLLAEYGIV
FSKGAADLRQK
>SSO_P174 putative IS orf
MLTELLTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQELDALL
YGSASTVPVVLTESTVMPKLPVVKKRPRRP
>SSO_P215 putative IS91 ORF2
MDAGNKKLVFWFVRVDDEGYPEIARCTEREFATILSGISAGGMYCPECGT
VHWPDGVAPPF
>SSO_P073 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P208 conserved hypothetical protein
MTYTVKFRDDALKEWLKLDKTIQQQFVKKLKKCSENPHIPSAKLRGLKDC
YKIKLRASGFRLVYQVIDDMLIIAVVAVGKRERSNVYNLASERMR
>SSO_P131 IS21 ORF1
MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF
MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT
VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ
DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA
DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE
QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW
DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS
SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_P014 IS91 ORF
MLPRFADIFQQGNRWLNWLEKQSVQMSRLEHYAGQDEIGLRYNSHRTKRE
ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE
KNGEANHKERDVSAVTEG
>SSO_P070 hypothetical protein
MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFTDFIAGHPSCTVCFWE
TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW
>SSO_P118 conserved hypothetical protein
MGVNFCNKIGIDQSEFEIESSIINSIANEVLNPISFLSNKDIINVLLRKI
SSECDLVRKDIYRCALELVVEKTPDDL
>SSO_P154 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_P045 ISSfl2 ORF
MALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRAFASAAHLAAY
SGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTR
KMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P157 IS100 ORF2
MLHEEKLARHQRKQAMYTRMVAFPAVKMFEEYDFTFATGAPQKQLQSLRS
LSFIERNENIVLQGTSDITNPRVGICV
>SSO_P133 IS629 ORF2
MFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYVSLA
YTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVE
LATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_P162 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIMLTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_P155 putative IS orf, fragment
MYLKIRDRLGYMSNTSSNFEMTGTLLGLELRKRKTPQEKIAIIQQTMEPG
MTVSHVARLHGIQPSLLLKWKK
>SSO_P214 IS91 ORF
MACGTTLMGYTQWCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYL
LSLVPDCPWQHIVFTLPCQYWSLVFHNRWLLTEMSRIAADVILEICHQAD
VESGIFTVIHTWGRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSM
WRYRITRLLSRKYPDLVMPDALAAEGSSKREWNRFLDTHYRRGWNVNVSR
VMDNATHVAVYFGSYLKKPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYL
LMSGDEFMERFSWHVADKGFRMVRYYGFLSPAKRRLLEEVVYIITETVRK
TAMQITWRGMYQRLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHELL
ARMRWCG
>SSO_P163 IS21 ORF1
MGYTGGRSMLRYYIQPKRKMRPSKRTVRFETQPGYQLQHDWGEVEVEVAG
QRCKVNFAVNTLGFSRSFHVFAAPKQDAEHTYESLVRAFRYFGGCVKTVL
VDNQKAAVLKNNNGKVVFNSGFLLLADHYNFLPRACRPRRARTKGKVERM
VKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETPEQRFA
LEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCGQPVSI
RISLDDELRIYSNEKLVASHRLCSASSGWQTVPEHHAPLWQQVSQVEHRP
LSAYEELL
>SSO_P053 conserved hypothetical protein
MSVKLRLPQCTDNKKTETDAIYDKVRSSYLLSCILKKNKNVGLILHAPSF
VSVSEKIARIVMANYSRNWSNSELASAVLMSESSLKRRMYKEVGSISTFV
HKIKLTEAIRKLRRTNTPISVISSELGYSSPSYFSKVFFKYLKTYPQNIR
KKNGR
>SSO_P210 ISSfl1 ORF2
MRQQQDEQGRFSICSRQAAVVHRDAYCNRNVVERCFGRLKEYRRIATRYD
KTARNYLAMVKLGCIRLFYQRLRN
>SSO_P156 conserved hypothetical protein
MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQ
KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE
QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF
NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY
ANSSSWKSKRLC
>SSO_P193 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P220 IS1294 ORF
MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC
YVQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM
PA
>SSO_P025 IS3 ORF1
MTKTVSTSKKTRKQHSPEFRSEALKLAERIGVAAAARELSLYESQLYAWR
SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK
>SSO_P029 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_P195 oriT nicking and unwinding protein, fragment
MNAERLFSTARELRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVA
LPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSR
NGESLLADNMQDGVRIARDNPDSGVVVSIAGEGRPWNPGAITDGRVWGDI
PDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPD
LPDGKTELAVREIAGQERDRAAITEREAALPESVLRESQREREAVREVAR
ENLLQERLQQMERDMVRDLQKEKTPGGD
>SSO_P139 IS3 ORF2
MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF
YASGPNQKWAGDITYYYSSPTAGKHGAPGY
>SSO_P148 conserved hypothetical protein
MNQKVKSVGSDNVIDDHHVFFADSRCDFVKVVSADVCDMGMQLLYFVFLL
LPVVAEFNLAA
>SSO_P128 conserved hypothetical protein
MKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAK
TIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVM
FHYGKLNMAF
>SSO_P222 hypothetical protein
MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFADFITGHPSCTVCFWE
TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW
>SSO_P228 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P127 putative transposase, fragment
MCWGRTALYMAALEAPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTI
MNAMLRKNEEWNESYL
>SSO_P034 conserved hypothetical protein
MDGKRISGVPFERYADDIVVHCSRMSDATRLKNRLSERFSEVGLVLNAGK
TNIAYIDTFKRRNVATSFTFLGYDFKVRTLKNFKGERYRKCMPGASNAAM
RKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSRNFNYRL
WSAMQSRLLKWMQSKYRLSNRKAQRKLTLVRKEYPKLFVHWYLLRASNE
>SSO_P159 sugar phosphate transport protein-like protein
MEPVYVILNALLDSGRFTRKLILLGLSGSFSYIFGSIVATLGMGLVVDYL
GWGATFIVLILSAVFAIIFTLMSRERSLEFEKE
>SSO_P199 conserved hypothetical protein
MDSETVHGTARSGVTSVPAGGPLFWKSVDAGWKRQKHGDGLPVLRPGQTG
SSLPEKGLNTATGAAGEGCNEKSSLHYSRSQKAERSL
>SSO_P002 IS2 ORF2
MVHATGLMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI
MPKPDGLTAAKNLAEAFEHYNE
>SSO_P043 IS2 ORF2
MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFVKTMKEDCI
AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST
>SSO_P130 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAFAAGDRVNRQFVAERPDQLWVADFTYVSTCVSASDIRR
>SSO_P206 conserved hypothetical protein
MAENGYGLAGLGMGKVKSVNQYRLTPGFGGFTPVSHVTAACRLTCRWRGI
RIIQAAFNAFAKV
>SSO_P068 ISSfl1 ORF2
MCRRCSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH
ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA
VIPRKSNEKMASDGRAQLDV
>SSO_P136 putative IS91 ORF2
MACDYRYKNRQYHCLSGSYMARSAKPRKRKPAPQRSKLLRYVVKLHEDDF
FDEEEAEVLRFDNFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP
EIARCTEREFATIPAGISADGMYCPECGTVHWPDGVIPPF
>SSO_P020 putative transposase, fragment
MISNEGEFMNEKQLTSNKLRALANELAKSLKNPEDLSQFDWMLKMKPYSM
LI
>SSO_P021 putative transposase, fragment
MTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRTLRDRNGTFEPQQ
LKKNQP
>SSO_P219 putative IS91 ORF2
MVCNYRYKNRQCHCLSGGYMARSAKPRKRKPASQRSKLPRYVVKLHEDDF
FDEEDAEVLRFDSFDDAVECCADLNIP
>SSO_P165 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P146 IS629 ORF2
MGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTW
QGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHH
SDKGSQYVSYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRK
SWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLA
A
>SSO_P001 putative resolvase, fragment
MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR
SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP
DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI
STVL
>SSO_P079 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P185 hypothetical protein
MIYPVEELLQIEVDHPVIPGPDIFLGLYHCLVGRTTGTEPVAVVAERAIS
QCLQYLHHSLLDEAIHHHLDAQQTFAAAGLQYGYSSHRGWAVSAGQQLRF
QLWPVVPQVVRQFTYAHAIDSRRTLIASCRWNSNQRARGRQR
>SSO_P048 IS1294 ORF
MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC
YVQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM
PA
>SSO_P235 IS629 ORF1
MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SSO_P217 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P061 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_P144 adhesion protein, fragment
MLPPNIRGYAPQITGIAETNARVVVSQQGRVIYDSTVPAGTFSIQDLSSS
VRGILDVEIFEQNGKRKHFQVEMCRCAFLIQTWSE
>SSO_P188 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P158 conserved hypothetical protein
MNAHWSSKKSNFFRKKIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG
VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK
>SSO_P011 conserved hypothetical protein
MKVSFKSLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNACHKVAIF
LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES
GFVSFVNREGKICHTAYVKSSDNSMAYYHANYSSIDKYITDMCGLICMRH
IESTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV
>SSO_P052 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P016 putative transposase
MEYRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCGMFVRFRNAGLSWP
LPAGMSEQELDALLYGSASTVPVVLTESTVMPKLPVVKKRPRRP
>SSO_P218 IS630 ORF
MQTMTSRSRQAAYSISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL
KENPKFTYRKLKN
>SSO_P075 putative IS orf
MLTDIFNSNYQCYGYRRLHAMLRHEGGRLSEKVVRRLMVEEQLVVSRNRR
RRYSSYCGEIGPAPDNLIARDFKAEQPNQK
>SSO_P010 conserved hypothetical protein
MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV
ESRRQAKGTRFLWQHSNKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD
IWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAKTIGKRLYGILNAMRHG
VSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVMFHYGKLNMAF
>SSO_P057 conserved hypothetical protein
MPGTTTAMSINFIGMTARTMNSNGSHGKPQIPVDYQKLLSIEDITFCRNR
WGNIGENALRRVAVGKKLSFFGSDRGGENAAII
>SSO_P209 IS1294 ORF
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>SSO_P121 putative transposase
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALVEHYGTVIIPARPRKPKDKPK
AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRLGEQHPAYASE
HENLRGPGYYH
>SSO_P027 IS100 ORF1
MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>SSO_P129 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_P023 conserved hypothetical protein
MPGATVADEFDKTLAFLEAIVNADNETTIGEIRSFADALDAVRFNRNKIN
RQLSKPNLASLALEHEVIWLGRSR
>SSO_P047 hypothetical protein
MSHNLEHQKVHTRMVKEVLKAVARANNHPYQSVFADFITGHPSCTVCFWE
TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW
>SSO_P189 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P168 conserved hypothetical protein
MAQVNMSVRIDAELKDAFMAAAKSMDRNGSQLIRDFMRQTVERQHNSWFR
DQVAAGRQQLECGDVLPHDMVESSAAAWRDEMSRKIADK
>SSO_P240 conserved hypothetical protein
MISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYGKKYISGI
TRSMAQLKIEEFINEKSRRLNYMKTMYSPCPEDFQPISRDEASTPEGSWL
TVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFPKENSDFLYIIVVF
RNDSPQGELRANRFIELYDIKREIMQVLRDESPELKSIKSEIIIAREMGE
LFSYASEEIDSYIKQMNDRLSQIKARMPVT
>SSO_P186 IS186 ORF1
MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS
LAFGEADYIVRVYWRGLRWLTAEGMRFDMMDFLRGLDCGKNGETTVMIGN
SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA
GHVLLLTSLSEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEL
ELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKKN
>SSO_P123 IS600 ORF2
MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP
AAFRIKYYQMTA
>SSO_P150 hypothetical protein
MFNAKIRGWIKYYGAFYKSALYLTLRQIDRKLVLWLPRKHKRLRGHRRRA
SHWLARVARSETRLFAHWPLLWGQASMRRAG
>SSO_P138 putative reverse transcriptase, fragment
MARTRSGRETSRTITAHRLRGNTGRRVIEGDLSSYFDTVHHRLLMKAVCR
RISDARFMRLLWKTPC
>SSO_P184 putative transposase
MCRLAVEYLLYAARKRGLEIGIFCTIHTLRLHFEEHLPLVVAGRRLGVPK
STVCSMFVRFRKAGLSWPLPAGMSERELDARLYGSASTVPVVLTESTVMP
EVPGVKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNW
RHQYRKGGLLPSGKNMPALLPVTLTPEPDNHGFDIIYMLSTHHQRFTFVR
LFDPYLIGSRPTFSHLAHHHIS
>SSO_P018 ISSfl4 ORF2
MISFPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRR
GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
DWKHPKRTERAGIRI
>SSO_P236 putative transposase
MWCFFNLFGVLIPIDERNLTRERTQVGLQAARARGRKGGRPKTLSKDKQA
LAVQLYNEKKHTVAQICVLMGISRPTLYKYIESARLFKK
>SSO_P056 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P183 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHQVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P207 conserved hypothetical protein
MVVSGISLRLVKQSQRCVMPNIILSETSASVSELKKNPMATVSAGDGYPV
AILNRNQPAFYCVPAELYERMLDALDDQELVKLVTERSNQPLHDVDLDSY
L
>SSO_P216 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P069 IS1294 ORF
MTRSGGDFQPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETCTNGET
VPES
>SSO_P017 IS4 ORF
MKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGE
MADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNL
VRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRD
LASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P019 putative IS orf, fragment
MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVL
PSPLRQSSAHKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVLEQLE
LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG
KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL
MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGSYRALYESGRITEAQQRIGELYAIE
AEVRGCSAEQRLAARKARAAPLMQSLYDWIQQQMKIHSLKMECLHGEHYY
PSGNSAGNSV
>SSO_P077 IS21 ORF1
MADVADRRELRQFRQTPEQRFTQEQEHLQPLLGTDFDIRHVSWDGYIEVG
GNRYSVPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVR
PGCSSVTAKSA
>SSO_P012 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDAARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P140 IS629 ORF2
MPLLDKLREQYGVGSVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL
KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVHTTVSRKAVAAGDRVNRHQGNMPRTPGGPQRLVYVVSAADKDKHTS
AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT
>SSO_P175 IS3 ORF2
MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG
DITYLRTDEVRLHPVSTEPHAF
>SSO_P036 IS21 ORF1
MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF
MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT
VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ
DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA
DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE
QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW
DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS
SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_P060 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_P124 IS3 ORF2
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD
SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS
RKFSPVSYRAHGLRCTGNSGHHHLFFF
>SSO_P134 putative transposase, fragment
MNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHVA
KQPGEVVDKTRQNEPPRVSWRVFYL
>SSO_P119 IS600 ORF2
MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA
TTSSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQHPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISS
AYGKTD
>SSO_P153 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P051 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNLFFEMKA
>SSO_P040 IS1294 ORF
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>SSO_P038 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_P078 hypothetical protein
MFISYSEVSIKNPDNSGQALPLTYVCCREQAEDGACWHLLTSGKAASAAD
ARRIVSHYERRWLTEEYHKAWKSGGTWNRCECRPGITLSAWWLSRRL
>SSO_P062 IS1294 ORF
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTLWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGCLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQWRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETCTNGETVPE
S
>SSO_P120 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P005 conserved hypothetical protein
MWCCRGTVCTIPYVDQYNRNDNFRFRAQPKYILGHLSNRLPDTAPFFNKK
SIIFESLLFIALSSIVSPLFAF
>SSO_P083 hypothetical protein
MVINKGKCRRCRVSQISDIIEPNIRCITLQIVESLQVKYTLVGVYVAKHV
IQVHLTNKDMSEVEDK
>SSO_P152 IS600 ORF2
MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA
PMESFWGTLKNGTGTE
>SSO_P180 conserved hypothetical protein
MNILFTESSPNIGGQELQAVAQMKALKKMGHSVLLVCRENSKIAFEASKL
GIDITFALFRNSLHIPTAWRLLGIVHGFQPNAIVCHSGHDSNIVGLVRLF
TWKHPFRIIRQKTYLTRKTKVFSINHFCDEVIVPGTSMKTHLEQEGCRTR
VTVVPPGFDFQKLYVDSRNSLPPNVLSWLASRRGCPVIAQVGMLRPEKGH
EFMLNLLFHLKMNGRQFCWLIVGSGSPELREHLQYQIDSMGMHDDVFIAD
NVFPAAPVYRVASLVVLPSENESFGMVLAEASAFSVPVLASQIGGIPDVI
QNNQTGTLLPAGNKHAWMCALNDFFNDPGRFYQMARQAKQDIEERFDINK
TALKILTLAKHK
>SSO_P137 putative reverse transcriptase, fragment
MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA
VKENWQWKPAVAYCCYADDCVPRRRVLGT
>SSO_P015 putative IS91 ORF2
MDAGNKKLVFWFVRVDDEGYPEIARCMEREFATIPAGISADGMYCPECGT
VHWPDGVIPPF
>SSO_P013 IS21 ORF2
MNKRAFFGAFLIFWGFKFLSMNCRYEKASIILTSNKGVADWGEMFGDHVL
ATAILNSCA
>SSO_P046 ISSfl2 ORF
MQPPGCRGSGKRLFDKALPNDENKLRSLISDLKQHGQILLVVDQPATIGA
LPVAVARSEGVLVGYLPGLAMRRIADLHAGEAKTDARDAAIIAEAARTLP
HALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRIRGLLTQIHPALER
VLGPRLEHPAVLDLLQRYPSPEKLASLGFAG
>SSO_P192 oriT nicking and unwinding protein, fragment
MMSIAQVRSAGSADNYYTDKDNYYVLGSMGERWAGQGAEQLGLQGSVDKD
VFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSVSMMAMLGGDK
RLIEAHNQAVDFAVRQVEALASTRVMTDGQSETVLTGNLVMALFNHDTSR
DQEPQLHTHAVVTNVTQHNGEWKTLSSDKVGKTGFIENVYANQIAFGRLY
REKRKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQTIREAVGEDAS
LKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRAYRDAAEQRAY
TRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQFMYTDLLARTVGILPPE
NGVIERARAGIDEAISREQLIPLDREKGLFTSGIHMLDELSVRALSRDIM
KQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSLAIVSGQGGAAGQRERVA
ELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLLEGMAFT
PGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAM
KDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQ
VSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMY
RPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGETQVVRISSLDS
SWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSEDAMTVVVP
GRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNA
TLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETLLE
TAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFAAEGTG
FADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEG
KEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTT
QFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDT
QLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAVGGGRAVASGDT
DQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAVYSLINRDVER
ALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEA
FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREK
AGELGQVQVMVPVLNTANIRDGELRRLSTWENNPDALALVDNVYHRIAGI
SKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRIRFTKS
DRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAI
TAHGAQGASETFAIALEGTEGNRKLMR
>SSO_P166 putative IS orf, fragment
MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI
CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLL
>SSO_P179 conserved hypothetical protein
MYHHVSHCPGLVTLSPVTFRKQMKWLAENNWKTLSSDELEFFYRGGKLPR
KSVMLTFDDGYLDNWFQVYPLLKEFNLKAHIFLITGFIGNGPVRHSPGKE
YSHRDCEHQIATGNADNVMLRWSEVNEMLQSGLVEFHVHTHTHTRWDKKF
SSREEQCKHLRQDLLSGREYLKEMTGKCSKHLCWPEGYYNKDYIQVAEEL
GFYYLYTTERRMNAPAKGTTRIGRISTKERESCAWLKRRLFYYTTPFFSS
LLAFHKGPRLPDD
>SSO_P239 IS911 ORF2
MQTMTSRSRQAAYSGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISH
GSAGARSIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEH
VAIPNYLERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGW
AMSFSPDSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRY
QIRQSMSRRGNCWDNSPMERFFRSLKNEWMPMVGYVSFREAAHAITDYIV
GYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_P039 IS629 ORF1
MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SSO_P234 putative aquaporin
MFKPFSAEFFGTFWLVLGGCGSALISAAFPQLGIGFLGVALAFGLTVVTM
AYAVGHISGAHFNPAVTLGLWAGGRFPAARVLPYIIAQVIGGIAAAAVLY
GIASGKAGFDATTSGFAANGYGLHSPGGYALSACMLSEFVLSAFFVRSDR
KTRSCGLCATGDWSGNHPVN
>SSO_P149 conserved hypothetical protein
MSDATRLKNRLSVRFSEVGLVLNAGKTNIAYIDTFKRRNVATSFSFLGYD
FKVRTLKNFKGELYRKCMPGASNAAMCKITETIKKWRIHRSTAESLLDFA
RRYNAIVRGWIEYYGKFWSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQ
RKLALVRKQYPKLFAHWYLLRASNE
>SSO_P026 IS3 ORF2
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD
SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS
RKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLA
VVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHSDRGSQ
YCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHGEHFIS
REIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>SSO_P164 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P086 acp, Acp
MIKEKILSIVAFCYGIAYSKLSEETKFIEDLSADSLSLIEMLDMISFEFN
LRIDESALEHIITIGDLISVVKNSTKSI
>SSO_P197 finO, FinO
MTEQKRPVLTLKRKTEGETPVRSRKTIINVTTPPKWKVKKQKLAEKAARE
AELAAKKAQARQALSIYLNLPTLDEAVNTLKPWWPGLFDGDTPRLLACGI
RDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV
TEHISQEEEAYAAARLDKIRRQNRIKAELQAVLDEK
>SSO_P201 hmo, putative regulator
MAKTKQEWLYQLRRCSSVNTLEKIIHKNRDSLSNSERESFNSAADHRLAE
LITGKLYDRIPKEIWKYVR
>SSO_P143 icsA/virG, IcsA/VirG
MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIA
FAIPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH
ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM
ILGGNGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG
GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT
FAGGNGGAAYGYGYDGYGGNAITGDNLSIINNGAILGGNGGHWGDAINGS
NMTIANSGYIISGKEDDGTQNVVGNAIHITGGNNSLILHEGSVITGDVQV
NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN
SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS
VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE
NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN
AGILKMGTVEAMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNIN
APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL
GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS
DKNAFIQKGRIVAGSYDYRLKQGTVSGLNTNKWYLTSQMDNQESKQMSNQ
ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL
YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI
GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL
YSSWFQNEKERTGLYMDAWLQYGWFNNTVKGDGLTGEKYSSKGITGALEA
GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGV
NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS
NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY
TF
>SSO_P094 icsB, IcsB
MILKISNFIDASNTKGPIRVEDTEHGPILVAQKFNLKDLFFRTLSTINAK
INSQILNEQLKNYRLANQKSLLLFLKTLASEKSAESAFAAYEAVKNSIQH
SFTGKDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISI
KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI
SDQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD
KVYIPLSGDNKTKDGKISYNLFGLDETNMSKFICQKKADAFRQLANYKLI
SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNNVYAYANKVRQRIES
LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS
LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMSRVLNELKTGATDKKEE
IIEKSIKTIDYYNSLKSPDLGTKLYIHDLLQVNKLLLNNSHSNI
>SSO_P241 icsP/sopA, IcsP/SopA
MKLKFLVLALCVPAIFTTHATTNYPLFIPDNISTDISLGSLSGKTKERVY
HPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVSGWTTLGNQKAS
MVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGWLLNNLDYRLGL
IAGYQESRYSFNAMGGSYIYSENGGNRNKKGAHPSGERTIDYKQLFKIPY
IGLTANYRHENFEFGAELKYSGWVRSSDTDKHYQTETIFKDEIKNQNYCS
VAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTNISGTIKNSASI
EYIGFLTSAGIKYIF
>SSO_P203 insB, IS1 ORF2
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P067 insB, IS1 ORF2
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P160 insB, IS1 ORF2
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P087 ipaA, IpaA
MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEK
ESFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEILEFSNVLYS
LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL
RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET
VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLSRDLNTVAVFP
ELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIH
YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLSRNGNSL
LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKVFNHTPDNSDGIGSDL
LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKS
SASLSHRVASQINKFNSNTDSKVLQTDFFSRNGDTYLTRETIFEASKKVT
NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANTIN
YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLK
NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD
>SSO_P090 ipaB, IpaB
MHNVSTTTTGLSLAKILASTELGDNTIQAANDAANKLFSLTIADLTANKN
INTTNSHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN
KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK
IKDLENKINQIQTRLSELDPDSPEKKKLSREEIQLTIKKDAAVKDRTLIE
QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT
QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA
EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV
QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL
KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN
SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ
EVIADLLASMSNSQANRTDVAKAILQQTTA
>SSO_P089 ipaC, IpaC
MEIQNTKSTQILYTDISTKQTQSSSETQKSQNYQQIAAHIPLNVGKNPVL
TTTLNDDQLLKLSEQVQHDSEIIARLTDKKMKDLSEMSHTLTPENTLDIS
SLSSNAVSLIISVAVLLSALRTAETKLGSQLSLIAFDATKSAAENIVRQG
LAALSSSITGAVTQVGITGIGAKKTHSGISDQKGALRKNLATAQSLEKEL
AGSKLGLNKQIDTNITSPQTNSSTKFLGKNKLAPDNISLSTEHKTSLSSP
DISLQDKIDTQRRTYELNTLSAQQKQNIGRATMDTSAVAGNISTSGGRYA
SALEEEEQLISQASSKQAEEASQVSKEASQATNQLIQKLLNIIDSITQSR
NSTASQIAGNIRA
>SSO_P088 ipaD, IpaD
MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT
LHNIRTTNQALKKELSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS
RNEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYL
KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALE
ELKKKYEDKPLYPATNTVSQKEADKWLTELGGTIGKVSKKNGGYVVNINM
TPIDNMLKSLNNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK
YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF
>SSO_P212 ipaH1.4, invasion plasmid antigen
MIKSTNIQAIGSSIMHQINNIYSLTPFSLPMELTPSCNEFYLKAWSEWEK
NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK
NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP
ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN
NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT
MRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWFPENKQSD
VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS
AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL
LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV
KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA
DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE
QQIYRQLTDEVLALRLSENGSNHIA
>SSO_P059 ipaH4.5, invasion plasmid antigen
MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRI
QAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPEN
SPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSA
RFNSLETLPELPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPE
FPQRLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPENIITLPICTN
VNISGNPLSTRVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAV
TAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQ
VAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS
EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM
LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWG
PWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAER
EAGAQVMRETEQQIYRQLTDEVLA
>SSO_P058 ipaH7.8, invasion plasmid antigen
MFSVNNTHSSVSCSPSINSNSTSNEYYLRILTEWEKNSSPGEERGIAFNR
LSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLK
ELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLY
NLPALPEKLKFLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNN
IRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRVLQSLQRLTSS
PDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHE
EHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVA
ADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE
ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV
TANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKY
EMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEV
LALRLSENGSRLHHS
>SSO_P167 ipaH9.8, invasion plasmid antigen
MLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPGEERDEAVSRL
KECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLTNLPELPVTLK
KLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLLTMNISYNEIV
SLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDRNQISHIPESI
LNLRNECSIHISDNPLSSHALQALQRLTSSPDYHGPRIYFSMSDGQQNTL
HRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSAR
NTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRK
TLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIE
VYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEF
TDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGL
SGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGSQLHHS
>SSO_P084 ipaJ, IpaJ
MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQ
GCFLARKQMYDNSCGAASLLCAAKELGVDKIPQYKGSMSEMTRKSSLDLD
NRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVE
ESNIFSQVISFIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIG
VPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNSNFIGYTKIGI
SIVITNEAL
>SSO_P093 ipgA, IpgA
MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL
NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY
LSPDELIESLYEFLFCIKLTIANITSEVN
>SSO_P092 ipgB1, IpgB1
MQILNKILPQVEFAIPRPSFNSLSYNKLVKKILSVFNLKQRFPQKNFGCP
VNINKIRDNVIDKIKDSNSGNQLFCWMSQERTSYVSSMINRSIDEMAIHN
GVVLTSDNKKNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK
ILKRYSSNMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA
VYRQSNTN
>SSO_P024 ipgB2, IpgB2
MLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSS
VSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSREIGDNLRKQIFK
QVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTS
NVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF
>SSO_P091 ipgC, IpgC
MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKG
RIEEAEVFFRFLCIYDFYNVDYIMGLAAIYQIKEQFQQAADLYAVAFALG
KNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAI
QDIKE
>SSO_P095 ipgD, IpgD
MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEA
LNRLYLQNQTSLTGKSLLFARDRAEVFYEAIKLAGGDTSKIKAMMERLDT
YKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKN
WGPVNKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRES
DHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE
LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVN
ALKGLNSKRGEPTKLLIRNSDGLLKEVSVNLKVVTFNFGVNELALKMGLG
WRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ
IKEIINKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSGKDRTGMQD
AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV
PGNKVMKKLPLSSLELSYSERIGDSKIWNMVKGYSSFV
>SSO_P096 ipgE, IpgE
MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL
PENINDLIYALSLNYSEKICLATDDEGGNLIARLDLTGINEFEDVYVNTE
YYISRVRWLKDEFARRMKGY
>SSO_P097 ipgF, IpgF
MSRFVFILLCFIPYLGRADCWDKAGERYNIPSSLLKAIAEKESGFNKFAV
NVNNNGSKDYGIMQINDFHSKRLREMGYSEEMLISHPCLSVHYAAKLLNE
FMMMYGRGWEAVGAYNAGTSPKKKKERLKYAEDIYRRYLRIAAESKQNNR
RI
>SSO_P009 mkaD, mouse killing factor
MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYY
FKVASNVPNYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG
DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG
AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV
RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE
>SSO_P171 mob9, plasmid mobilization protein
MSLAGNPCVIRLAAQVCMWLKFIIRDRGGFSGGLLLFLPVCCRDRTERIL
AVHTIKILR
>SSO_P182 msbB2, MsbB2
MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIG
RWVGRLSRKARRRATINLSLCFPEKSDTEREIIVDNMFATALQSIVMMAE
LAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAA
QGEKMAAMFHQQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGY
WGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQAMIIPLFPVYD
EKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLK
LLKTRKSNEADPYP
>SSO_P109 mxiA, MxiA
MKVIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLA
ILVFMGSFYIERILSFSTFPSVLLITTLFRLALSISTSRLILVDADAGKI
ITTFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGM
PGKQMSIDADLKAGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAI
AGIIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLVSQIPALLI
SISAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFP
FFVFFLIAVTLTALFYYKKVVEKEKSLSESDSSGYTGTFDIDNSHDSSLA
MIENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPT
ILYRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVV
STSYNERVISWVDVSYTENLTNIDAKIKSAQDEFYHQLSQALLNNINEIF
GIQETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLK
LIMESLALWAPREKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYI
EDAIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAKKNFVLLVS
VDIRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI
>SSO_P108 mxiC, MxiC
MLDVKNTGVFSSAFIDKLNAMTNSDDGDETADAELDSGLANSKYIDSSDE
MASALSSFINRRDLEKLKGTNSDSQERILDGEEDEINHKIFDLKRTLKDN
LPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVESLTKIINEII
SGSEKSVNAGINSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYI
EWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGFVLSKLIAIKM
IRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVI
EVSRASHEDSVVYQTYLSSVNESPHDIFKSESEREIAINILRELVTSAYK
KELSR
>SSO_P107 mxiD, MxiD
MKKFNIKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFE
RFSALLNYPIVVSKQAAKKRISGEFDLSNPEEMLEKLTLLVGLIWYKDGN
ALYIYDSGELISKVILLENISLNYLIQYLKDANLYDHRYPIRGNISDKTF
YISGPPALVELVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNM
RGEDIVIPGVATVVERLLNNGKALSNRQAQNDPMPPFNITQKVSEDSNDF
SFSSVTNSSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDVAKRH
IELSLWIIDIDKSELNNLGVNWQGTASFGDSFGASFNMSSSASISTLDGN
KFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS
LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSQSNYNYNNENTSVLP
EVGRTKISTIARVPQGKSLLIGGYTHETNSNEIVSIPFLSSIPVIGNVFK
YKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSE
TTLLEDEKSLVSYLNY
>SSO_P106 mxiE, MxiE
MEGFFFVRNQNIKFSDNVNYHYRFNINSCAKFLAFWDYFSGALVEHSHAE
KCIHFYHENDLRDSCNTESMLDKLMLRFIFSSDQNVSNALAMIRMTESYH
LVLYLLRTIEKEKEVRIKSLTEHYGVSEAYFRSLCRKALGAKVKEQLNTW
RLVNGLLDVFLHNQTITSAAMNNGYASTSHFSNEIKTRLGFSARELSNIT
FLVKKINEKI
>SSO_P098 mxiG, MxiG
MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDN
FPENIIPVTDSKSDGIIYLTISKDNICQFSDEKGEQIDINSQFNSFEYDG
ISFHLKNMREDKSRGHILNGMYKNHSVFFFFAVIVVLIIIFSLSLKKDEV
KEIAEIIDDKRYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYIL
VSNKEINRIQQYINQRFPFINLYVLNLVSDKAELLVFLSKERNSSKDTEL
DKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNK
VTYSVREELTDEKLELINRLISEHKNIYGDQYIEFSVLLIDDDFKGKSYL
NSKDSYVMLNDKHWFFLDKNK
>SSO_P099 mxiH, MxiH
MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEY
QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNFR
>SSO_P100 mxiI, MxiI
MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIME
MVSNPESLNPESLAKLQTTLSNYSIGVSLAGTLARKTVSAVETLLKS
>SSO_P101 mxiJ, MxiJ
MIRYKGFILFLLLMLIGCEQREELISNLSQRQANEIISVLERHNITARKV
DGGKQGISVQVEKGTFASAVDLMRMYDLPNPERVDISQMFPTDSLVSSPR
AEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLEEKNISSKPMHISVI
AIYDSPKESELLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPV
KEIKSEFLTNEVIYLFLGMAVLVVILLVWAFKTGWFKRNKI
>SSO_P102 mxiK, MxiK
MIRMDGIYKKYLSIIFDPAFYINRNRLNLPSELLENGVIRSEINNLIINK
YDLNCDIEPLSGVTAMFVANWNLLPAVAYFIGSQESRLINHSEMVISYYG
GKISKQGEAAIRSGFWHLIAWKENISVGIYERINLLFNPIALEGNYTPVE
RNLSRLNEGMQYAKRHFTGIQTSCL
>SSO_P104 mxiL, MxiL
MINQINASNALQQRLNSEEFVNLNERLSSSQSFDEDIIYEIMQYFSQSEL
NSIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA
SLSIYNNIDFYIKTTTFDSLISVFEAGREADDSTW
>SSO_P105 mxiM, MxiM
MIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPVSKDYFSIPN
DLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKEVDGCFMDAQ
KIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGI
>SSO_P103 mxiN, MxiN
MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVI
KDATKKAESLRIDAVCDGYQIGIQTAFEHIIDYICEWKLKQNENRRNIED
YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLE
LDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL
TKNDKKYFKELAHKKLRQIAEDLLKENPVND
>SSO_P003 ospB, OspB
MNLDGVRPYCRIVNKKNESISDIAFAHIIKRVKNSSCTHPKAALVFLGEK
GFCDSNDVLSIMGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQS
EPIVINNDDDALNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSH
QLGLGSELIDVQTIISRMKDCGILNVKDIRFTSCGSADKVAPKNFNNAPA
ESLSCILNSLPFFKEKESLLEQIKKHLENDESLSDGLKISGYHGYGVHYG
QELFPYSHYRSTSIPADPEHTVKRSSQKKTFIINKELD
>SSO_P049 ospC1, OspC1
MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS
ANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL
DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN
IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISSHQKKHPLNTK
HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV
NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY
GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN
YEEINKQVTNKKIALQALFLSIANQKEDVALYILSNFEITRQDVISIKHE
LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE
NAEMIKLLLKYGATSDNKYI
>SSO_P065 ospC2, OspC2
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK
HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS
NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR
FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIRLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDV
AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN
SGDTMLDNAMKSKDSKMIDFLLKNGAVSGKRFGR
>SSO_P072 ospC3, OspC3
MKIPEAVNHINVQNNIDLVDGKTNPNKATKALQKNILRVTNSSSSGISEK
HLDHCANTVKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLS
NNILNIKSFDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQG
FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV
AEMEKMNNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKAN
SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI
>SSO_P022 ospD1, OspD1
MSINNYGLHPANNKNMHLIIGSNTANENKGMKSNIINVTNSAISHAINEE
KSGGGYSGVSFRKLAKIQSISIPTKNNKEYNRHNLFSLIWHGNADAARKY
GESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT
FDLSPKETIKLLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLT
EIADRLNNNEQDMFNIISDKIQELF
>SSO_P008 ospD2, OspD2
MPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQFKNK
TAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKVNYQ
LLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMKKNG
DFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTVFTC
DSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKLLPD
ELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSDGTP
AFYIALQNGCSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLCMSF
MNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNGHAD
SIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDILKI
LPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSFTTR
RLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIAEQF
SKKMKKTFIEIINRFNHFL
>SSO_P050 ospD3, OspD3
MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL
NCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS
EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM
ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL
DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPNNLLHPKVIYHAMRMG
LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL
AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL
LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL
ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK
LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA
SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMEIIKNENITPEEIAEHLDK
KNGSDFLEIMKNIKS
>SSO_P213 ospE2, truncated OspE2
MDILFLIKKTAIYELQIPATNRTKRLKFTATEIQWLTKINEAGIDKKQSQ
RYSDF
>SSO_P044 ospE2, OspE2
MLTQTIFPCLPQKQENIILEVSNPVLLSSTVTTDGYTVFNKKAAIYELQI
PAANRTKTLKFTATEMQWLTKINEVGIVEKQSQRHSNI
>SSO_P170 ospG, OspG
MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSA
LYKKYDLIGNQYNEILEMAWQESELFNAFYGDEASVVIQYGGDVYLRMLR
VPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKES
ESLFPIDFRNIYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL
>SSO_P031 parA, plasmid segregation protein
MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKSAVAKLP
KLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPK
YRDRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLD
LDPQSSATMFLNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGV
YVMPASIDDAFIASNWDTLCEEHLLGQNKHAILRENIIDKLKHDFDFILI
DTGPHLDAFLKNAIAAADIMFTPVPPAQVDFHSTLKYLARLPELVQIIEQ
DGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIFGGDMLDVSMPRLDGFE
RSGESFDTVISANPVTYVGSGEALKNARMAAEDFAKAVFDRIEFIRANY
>SSO_P032 parB, plasmid segregation protein
MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTEIKVDHD
KVDTQTFVVEEVNGREQTALTPDSLKDITRTIHLQQFYPCIGIRTGDLIE
ILDGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREI
GLRLVRLKEAGMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSE
LTYADYRQLAELSERLRLGDISIDEVVKNISPSIELITADDNLSEDEVKN
SIMRLITKEMSSLLDSGVKDKAVVTLLWKFDSKDKFARKRVKGRTFSYEF
GRLPLEVQDKLDRMIALVLKDNLNSL
>SSO_P004 phoN2/apy, PhoN2 (Apy)
MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPPPPAEDS
VVFQADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISREN
TPILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDENMA
ITGSYPSGHASFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQ
SDVEAGRLMGASVVAVLHNTPEFTKSLSEAKKEFEELNTPTNELTP
>SSO_P205 repA, RepA
MTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFDFAIH
VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE
CGLATESDAGTLSITRATRALTFLAELGLITYQTEYDPLIGCNIPTDITF
TPALFAALDVSEEAVASARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF
VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF
TASREAVKREVERRVKERMILSRNRNYSRLATASP
>SSO_P204 repB, RepB
MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN
PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK
>SSO_P112 spa13, Spa13
MEALDKRIIYFLQLENDLEPVGAQSVSQLFNTRRKIAIVKKHIIQYQSER
ILLKGRIEEIQKDIDEANASKRKLLHKESKICKRIGLIKRNNFAKQLILD
ELSQEDMKYGIR
>SSO_P110 spa15, Spa15
MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNE
QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL
RVVIKDDYVHDGIVFAEILHEFYQRMEILNEVL
>SSO_P115 spa24, Spa24
MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNM
TLNGIALIMALFVMKPIIEAGYENYLNGPQKFDTISDIVRFSDSGLMEYK
QYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDA
FKIGFYLYLPFVVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGW
GILSKALIEQYINVPA
>SSO_P116 spa29, Spa29
MDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIFL
VASGLITSGKVDIGSSVFEHVYFLMFKEIIVGLLLSFCLSLPFWIFHAVG
SIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILES
IQLSYNICPLFSQCSFRVSNILTFLTLLASQAVILASPVMIVLLLSEVLL
GVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKVQFFLGEHKFF
TNLFVR
>SSO_P113 spa32, Spa32
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN
YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP
DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR
NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL
QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV
GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>SSO_P114 spa33, Spa33
MLRIKHFDANEKLQILYAKQLCERFSIQTFKNKFTGSESLVTLTSVCGDW
VIRIDTLSFLKKKYEVFSGFSTQESLLHLSKCVFIESSSVFSIPELSDKI
TFRITNEIQYATTGSHLCCFSSSLGIIYFDKMPVLRNQVSLDLLHHLLEF
CLGSSNVRLATLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDNNEAKI
NLSESNGESEHTEVSLALFNYDDINVKVDFILLEKNMTINELKMYVENEL
FKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE
>SSO_P117 spa40, Spa40
MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDV
MLLYRYVIINDFEINEGKYFFAVVIVFFKIIGFPLFFCVLSAVLPTLVQT
KFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYF
FWINDRKIIFSQVFSSVDGLYLIWGGLFKDIILFFLAFSIFVIILDFVIE
FILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEILSEQTKSDIRN
SKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPT
VRDVKLARKLYKTHTKYSFVDFEHLDEVLRLIVWLEQVENTH
>SSO_P111 spa47, Spa47
MSYTKLLTQLSFPNRISGPILETSLSDVSIGEICNIQAGIESNEIVARAQ
VVGFHDEKTILSLIGNSRGLSRQTLIKPTAQFLHTQVGRGLLGAVVNPLG
EVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGIKVIDSLLTCG
EGQRMGIFASAGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYL
KNSEKKNRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTEGHKVALFIDS
LTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITA
FYTVLLEDDDFADPLAEEVRSILDGHIYLSRNLAQKGQFPAIDSLKSISR
VFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK
ISVVESFLKQDYRLGFTYEQTMELIGETIR
>SSO_P190 traD, TraD protein
MFSQIANIMLYCLFIFFWILVGLVLWVKISWQTFVNGCIYWWCTTLEGMR
DLIKSQPVYEIQYYGKTFRMNAAQVLHDKYMIWCGEQLWSAFVLATVVAL
VICLITFFVVSWILGRQGKQQSENEVTGGRQLTDNPKDVARMLKKDGRDS
DIRIGDLPIIRDSEIQNFCLHGTVGAGKSEVIRRLANYARQRGDMVVIYD
RSGEFVKSYYDPSIDKILNPLDARCAAWDLWKECLTQPDFDNTANTLIPM
GTKEDPFWQGSGRTIFAEAAYLMRNDPNRSYSKLVDTLLSIKIEKLRTYL
RNSPAANLVEEKIEKTAISIRAVLTNYVKAIRYLQGIEHNGEPFTIRDWM
RGVREDQKNGWLFISSNADTHASLKPVISMWLSIAIRGLLAMGENRNRRV
WFFCDELPTLHKLPDLVEILPEARKFGGCYVFGIQSYAQLEDIYGEKAAA
TLFDVMNTRAFFRSPSHKIAEFAVGEIGEKEHLKASEQYSYGADPVRDGV
STGKDMERQTLVSYSDIQSLPDLTCYVTLPGPYPAVKLSLKYQERPKVAP
EFIPRDINPEMENRLSAVLAAREAEGRQMASLFEPDVPEVVSGEDVTQAE
QPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPQQPVSPVINDKK
SDSGVNVPAGGIEQELKMKPEEEMEQQLPPGISESGEVVDMAAYEAWQQE
NHPDTWQQMQRREEVNINVHRERGEDVEPGDDF
>SSO_P191 trbH, conserved hypothetical protein
MNRSAPVFSSQAAHTFKFPGVISHNNQPPTAGMTCDHLIKWPDRASLTGK
FCSYLAGVCGCSSVVIQNINAGNKSLDHSEITFRHLAFFCTIYQLHQGDR
TDTHSPLVQVKTLPDAGGFVLYRKNADVGIEHKLQHQNDSLSCMPGCSLL
SIKSALTLCPSNHSSHVSPAGVMILVRPTAITSTRFTFSGNATAFGSLTA
WLRLLRNTVVSIICLLMWICLVYIYCGIDAGICQRDIRL
>SSO_P147 ushA, UshA
MIPLKKNITLIMFTLSLLTGNPAIAYETDKVYKITVLHTNDHHGHFWRNN
HGEYGLSSQKTLVDNIRQKVINNGGSVLLLSGGDINTGVPESDLQKAEPD
IRGMNLIGYDAMAVGNHEFDNPLNILRQQEKWATFPFLSANIYQKSTGRR
LFSPWKIFIRQNLKIAVIGLTTDDTAKTGNSEYFTDIEFRQPAAEARSVI
DELNQQEKPDIIIAATHMGHYDNGESGSNAPGDVEMARSLPTGSLAMIVG
GHSQAPVCMASDNKKQWNYIPGTTCVPDKQNGIWIVQAHEWGKYVGQADF
EFCNGTMKLVNYQLHPVNLKMRITREDGKTEFSFYTPEITEDPQMLSLLT
PFQNKGKAQLDVKVGVVNGRLEGDRSKVRFVQTSMGHLILSALTERIDAD
FAVVSGGEIRDSIESGNITYKDILKVQPFGNTVVSIDLTGKEVADYLATV
AQMKPDSGAYPQFLNTSFVVKKGKIEMLKIKGKSVDLNKKYRMTTFSFNA
TGGDGYPRIDNRPGYINTGFIDAEVLIEYIREHSPLDAASYEPKGEVSWQ
>SSO_P142 virA, VirA
MQTSNITNHERNDSSWMSTVKSTTEVSWNKLSFCDVLLKIITFGIYSPHE
TLAEKYSEKKLMDSFSPSLSQDKMDGEFAHANIDGISIRLCLNKGICSVF
YLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKP
APEVIETAINCCTSIIPNDDYFPVKDTDFNSVWHDIYRDIRASNSNSTKI
YFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVNKIINSNFPQQ
DLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSEN
IENRLNFMPEQRVINNCGHIIKINAVVPKNDTAISASGGRAYEVSSSILP
SHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV
>SSO_P085 virB, VirB
MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTN
VRDQESLTEESLADIIKTIKLQQFFPVIGREIDGRIEILDGTRRRASAIY
AGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMS
YKDIAKKENLSRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYY
KGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNIIKKSKNRKQN
PSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRD
IISRHLSSS
>SSO_P041 virF, VirF
MMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEGQIAFIER
NIQINVSIKKSDSINPFEIISLDRNLLLSIIRIMEPIYSFQHSYSEEKRG
LNKKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSDEEALYTS
ISIASSLSFSDQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRKRLESEKL
TFQQILLDIRMHHAAKLLLNSQSYINDVSRLIGISSPSYFIRKFNEYYGI
TPKKFYLYHKKF
>SSO_P181 virK, VirK
MFSVSNLSFIGFLKRIVFSSDSLPGKWEHRKFRFMYILRCAINPVASIRY
YYELRSLQCIEDILAIQPTLPARIHRPYLHKGGRAWSRGQYILEHYRFVQ
NLPEKYSEFLFPQKSVSLVQFIGKDGEDFDIQCSPSGFDREGELMLSLFF
NKIVIARLTFSVILTQNGHTAFIGGLQGAPKNTGPDVIRCATRACYGLFP
KRIIFEAFCALMKACNVSECLAVSEHSHVFRQLRYWYQKRKTFVAVYSDF
WESVAGKTCGDWYKLPTQVVRKPLSNIASKKRSEYRKRYALLDYIHETAI
RSLDAYPVNSEHQDLN
>SSO_P224 wbgT, putative UDP-glucose 6-dehydrogenase
MFYVHFMMDKKMKFDTLNAKIGIIGLGYVGLPLAVEFGKKVTTIGFDINK
SRIDELRNGHDSTLECSNLELLEATKLTYACSLDALKECNVFIVTVPTPI
DKHKQPDLTPLIKASETLGKIIKKGDVIIYESTVYPGATEEDCIPVVEKV
SGLKFNIDFFAGYSPERINPGDKEHRVTNILKVTSGSTPDVAEYVDQLYK
LIITVGTHKASSIKVAEAAKVIENTQRDVNIALINELSIIFNKLGIDTLE
VLEAAGTKWNFLPFRPGLVGGHCIGVDPYYLTHKAQSVGYHPEMILAGRR
LNDSMGQYVVSQLVKKMLKQRIQVEGANVLVMGLTFKENCPDLRNTKVID
IISELKEYNINIDIIDPWCSTDEAQHEYGLTLCEDPKVNHYDAIIIAVAH
NEFREMGESAIRALGKDEHVLFDLKYVLDKKSIDMRL
>SSO_P225 wbgU, putative UDP-glucose 4-epimerase
MDIYMSRYEEITQQLIFSPKTWLITGVAGFIGSNLLEKLLKLNQVVIGLD
NFSTGHQYNLDEVKTLVSTEQWSRFCFIEGDIRDLTTCEQVMKGVDHVLH
QAALGSVPRSIVDPITTNATNITGFLNILHAAKNAQVQSFTYAASSSTYG
DHPALPKVEENIGNPLSPYAVTKYVNEIYAQVYARTYGFKTIGLRYFNVF
GRRQDPNGAYAAVIPKWTAAMLKGDDVYINGDGETSRDFCYIDNVIQMNI
LSALAKDSAKDNIYNVAVGDRTTLNELSGYIYDELNLIHHIDKLSIKYRE
FRSGDVRHSQADVTKAIDLLKYRPNIKIREGLRLSMPWYVRFLKG
>SSO_P229 wbgV, putative glycosyltransferase
MLLEYVERKISLALSKYPKVRDVIKFFYLYIASLFGIILNKNKTVIQSKI
YEISIDDSEESFFGYYDHSPMSSNGRYVLFHSSAFSTKRHPKKVKYISIC
VKDLLNNKVYKLYDTRAFNWQQGSRLMWIDDDNIIFNDYENNGYISVVYS
LSLMKVIKKINYPIYDVNNYKAVTLDFSWLAKYDSDYGYYNKKSFSTDIS
IINLNTGGIELFLSLDEMLKRTNFKCNIDVEHVVNHFMFAPDGRSVMFIH
RYYTPKGKRERLIHWNLINDNVRVLINESIISHCCWNGNDEIIGFFGAEI
DSLNYYRLSIESCNTEKLFFDARKYSDGHPTIVHNRYIISDTYPDKNRIK
KLFVYDLVKNDYRELGLFYESMSFFSYSRCDLHPRISVDNRFLFVDSVHS
GKRKLYFMRSGICE
>SSO_P230 wbgW, putative glycosyltransferase
MSDVLVSLIIVCFNAEKYIEKSLLAFINQDVGLDKFELIIVDGDSSDNTI
SIVQDVFSKHSNIKHKIINNKKRTLATGWNIGVLEANGKFVCRVDAHSDI
PNNYISKLLDDYFNIMQFDDSVVGVGGVLTNSYKTKFGSIVADFYASKFG
VGNSPFRCVDKNNRLKKTDTAVFALYNKDVFFDVGLFNEVLDRNQDIDFH
KRVLSNNLSLYTDNSLFVEYYVRDNFKDFIKKGFLDGFWVVMSGAYYFRH
IVPLFFVLYLIVSFSLFFATGDYIYLSFLFFYFLISILFSIRDGRSFIGR
VFLPFIFLSYHISYGCGSLLSFLKRYFK
>SSO_P231 wbgX, putative glycosyltransferase
MKNFIPFALPEIGEEEIAEVIDSLRSGWITTGPKAKQFEQEFSNYLGANV
QSLAVNSATSGLHLALEAVGVKPGDQVIVPSYTFTATAEIVRYLGADPVI
VDVDRKTFNISVDAIEKAITNETKAIIPVHFAGLACDMDSILSIAKKYDL
KVVEDAAHAFPTTYKGSKIGTLDSDATVFSFYANKTMTTGEGGMVVSKNK
DIIERCKVMRLHGISRDAFDRYQSKTPSWFYEVVAPGFKYNMPDICAAIG
IHQLRKIDDFQKKRQRMAKIYDDALKELPLELPEWPTNASDIHAWHLYPI
RLKTDSAINRDDFIKKLSDLGIGCSVHFIPLHKQPVWRDTYNLNASDFPV
SEECYLNEISIPLYTKMTDQDQLFVIKSIRQLFM
>SSO_P232 wbgY, putative glycosyltransferase
MKRIFDVIVAGLGLLFLFPVFIIVSMLIVADSKGGVFFRQYRVGRFGKDF
RIHKFRTMFIDSEKKGRITVGQDARVTRVGWYLRKYKIDELPQLIDVLSG
TMSLVGPRPEVREFIDEYPDDIREKVLSVRPGITDLASIEMVDENEILSS
YDDPRRAYIDIILPIKQRYYLDYVANNSVKYDCVIIWKTIIKILSR
>SSO_P233 wbgZ, putative epimerase/dehydratase
MIDRILELPRIVKRGIIICIDVVMVIFSFWLSYWLRLDEQTAFLSAPMWF
AAAILTIFTVFIFIRIGLYRAVLRYVSAKIMLLIPVGILASTLSLVVISY
SLSIMLPRTVVGIYFLVLLLLTSGSRLLFRMILNYGVKGSAPVLIYGAGE
SGRQLLPALMQAKEYFPVAFVDDNPRLHKAVIHGVTVYPSDKLSYLVDRY
GIKKILLAMPSVSKSQRQKVITRLEHLPCEVLSIPGMVDLVEGRAQISNL
KKVSIDDLLGRDPVAPDAKLMAENITGKAVMVTGAGGSIGSELCRQIVRY
KPAKLVLFELSEYALYAIEKELSALCDKEVLNVPVIPLLGSVQRQNRLQM
VMKSFGIQTVYHAAAYKHVPLVEHNVVEGVRNNVFGTLYCAESAIESGVE
TFVLISTDKAVRPTNTMGTTKRLAELVLQALSARQSQTRFCMVRFGNVLG
SSGSVVPLFEKQIAQGGPVTLTHRDIIRYFMTIPEASQLVIQAGAMGHGG
DVFVLDMGDPVKIYDLAKRMIRLSGLSVRDDKNPDGDIAIEVTGLRPGEK
LYEELLIGDSVQGTSHPRIMTANEVMLPWQDLSLLLKELDQACHDFDHER
IRSLLLQAPAAFNPTDDICDLVWQQKKSLLSQASNVIRL
>SSO_P226 wzx, putative repeat unit transporter
MIDAGGTFLLKAIFQIGVFVYFTHVSDITTFGIISYVFTVYWFVLNFSDY
GFRTKLVKDISDNSYSASELLSRSDGVKTYVFFFIFIIFMFYSYVSDSIS
LTLLVYISSAYFVCISSGRFSLLQAVGRFRCELYINIYSTIIYIGCNLFL
SLFIEPLYYSAISIFIYSISLLVFSSHKCNVPCFHIKRPSILVYKDFLDA
TPFAILVLLNVVLSSIDLFILKEYFSYNSVAIYQVVTRVNTGLIIVFNVI
YTVLLPSFSYYLKNSEWGNIRKLQRYISLLVLLLCLCYYFFGIYFVGILF
GDEYKVISSATFLIMFMALIKYNFWLINELYLVCSGNQSERVKSYCIGVV
ISMAVFFYFIPRYGWSGAVFGSAIATLVIGIFYIISVKKDCGKILHDKYS
LMMIFVPIFFYFIINGQQRLLY
>SSO_P227 wzy, putative polysaccharide polymerase
MLIYLYPVLLLFNILPVFFYGQMNSDLERFFGVPIGYIPDLIFYFFVVLT
SIITLRFHVSLWTKKLLFLGIIFLIYISIQMLLLSADISGVVILLSFFSN
FIALVLLVSFCIGKDELYLTHSVRNINVVMCFGIICGVVKLFIGYSEDSN
FIVYLNRNATAIIVVCFYCVYSYFYRGRKSWYVSSVLYSLFFLFLDSRAG
IISFAISLFFVFLQLTKKEKLLISLFFVPLLTLGISFTDIGTRLERMLSS
SQVIFSGGNTLTKSQNDYRRVELVFIGVDVLKENYLIGTGLGVANYVKAI
DKKFLGSTNFGLAHNFYLSYSAQLGIIGFILLISVFYIMLSPIFKCGGYI
GKGCVFALAFYVFFNEYILTPAIYIYISIFLSVVFIRNSK
>SSO_P223 wzz, O antigen chain length determinant protein
MPKAEDEIDLFELLGTLWKKKWVILCVTLLTTGLAAVYAFTAKEQWTAKT
YIQAPRIAELGSYLKFHQAYARILNQPLDTNALANGLFSDLILIAESPDT
KVKFLESTEYYKKETNNLSTEQDKKIWLAEQANKGLVITPPKEKGNTSYY
IIQASADSAQEAYKLLQGYLKNVNNQAVTLSLDEFGQNVNTLLVNLNKEI
IDIDFQRKSEKLDQIAHIQRDLTTAEQAGIIDYRSSKGGFDNAQSSYKFL
LGEKLLSAELKATKDAPIIYPFRYYEVKRQIDELEGMLRDNIQAQAYRYQ
MKPSEPVIKDKPNKALILILGALPGAMFAIVGTLVYATLKDKTKLD
>SSO_P169 yacB, conserved hypothetical protein
MAAIEPDERIGYSASSLAGQPYKGRNGRVEGTSGPHKVACNVILCENLL
>SSO_P198 yigA, conserved hypothetical protein
MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD
WQQFARKRAEHCHRRCRGRV