TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Shigella sonnei Ss046, Ss046
Gene type: CDS

Number of genes found: 785

Free access
Sort by:

 



# Shigella sonnei Ss046, Ss046

>SSO_0150 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0020 hypothetical protein
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGVSTPRCLTVICHFHAR
>SSO_2909 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1821 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2654 IS629 ORF2
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELAILTWVDWYNNRRLLERLGHTPPAEA
EKAYYASIGNDDLAA
>SSO_P164 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2670 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3872 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_0427 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1731 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3855 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0747 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1261 putative crossover junction endodeoxyribonuclease
MRHEFILPYPPTVNTYWRRRDNTYFVSKAGERYRRDVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVR
GQPVSGGRLGVKIYPIMHEEQVKK
>SSO_3606 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1017 putative transposase subunit
MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS
RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK
NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS
WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ
FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT
S
>SSO_1612 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_0238 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_3826 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P149 conserved hypothetical protein
MSDATRLKNRLSVRFSEVGLVLNAGKTNIAYIDTFKRRNVATSFSFLGYD
FKVRTLKNFKGELYRKCMPGASNAAMCKITETIKKWRIHRSTAESLLDFA
RRYNAIVRGWIEYYGKFWSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQ
RKLALVRKQYPKLFAHWYLLRASNE
>SSO_0325 IS2 ORF1
MVVSAIASTPQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQH
GVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGK
KTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>SSO_1669 putative transposase subunit
MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS
RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK
NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS
WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ
FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT
S
>SSO_2730 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P132 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1152 putative phosphohydrolase
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILKASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>SSO_3593 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVW
>SSO_0876 putative single stranded DNA-binding protein
MTAQIAAYGRLVADPQLKTTSKGTQMTMASMAVPLPCSQADDGTATIWLS
VLAFGRQADALAKHQKGELVSVAGNMQVSQWTGQNGETRQGWQVIADSVI
SARTARPGGKKGQQGQATDALNRAKQQAGNDDPYGDNIPF
>SSO_3501 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3617 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1311 IS1 ORF
MSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SSO_2625 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1319 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2507 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P200 conserved hypothetical protein
MRKYIPLVLFIFSWPVLSADIHGRVVRVLDGDTIEVMDSLKAVRIRLVNI
DAPEKKQDYGRWSTDMMKSLVAGKTVTVTYFQRDRYGRILGQVYAPDGMN
INQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWIWM
HRK
>SSO_3935 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_0277 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_0270 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P194 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1706 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEVISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>SSO_1672 ISSfl2 ORF
MALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRAFASAAHLAAY
AGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTR
KMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_1989 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3277 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1684 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0077 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1565 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDHKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_3934 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_2175 hypothetical protein
MSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPARISPA
IQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVARELAG
FIWDMGRIAMSVAQQPQCHK
>SSO_3790 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P221 IS630 ORF
MTWELILDGYSESSYSATPRFAAARLPWFRVIYQPVYSPWVNHVERLWQA
LHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_1665 conserved hypothetical protein
MHSLILGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI
QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPENDSYAISEKSHGREE
IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY
ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS
GIRHIAINILTNDKVFKAGLRRKMRKAAMGRNYLASVLTGSGLS
>SSO_3014 conserved hypothetical protein
MVIYAFNKRLMEYFMKGKSALTLLLAGIFSCGTCQATGAEVTSESVFNIL
NSTGAATDKSYLSLNPDKYPNYRLLIHSAKLKNEIKSHYTKDEIQGLLTL
TENTRKLTLTEKPWGTFILASTFEDDKTAAETHYDAVWLRDSLWGYMALV
SDQGNSVAAKKVLLTLWDYMSTLDQIKRMQDVISNPKRLDGVPGQMNAVH
IRFDSNSPVMADVQEEGKPQLWNHKQNDALGLYLDLLIQAIDTGTINAED
WQKGDRLKSVALLIAYLDKANFYVMEDSGAWEEDARLNTSSVALVTSGLE
RLSNLLSKKDSVFVSDLLREAKANELDEPLSTTRLNHLIDKGYERITLQL
DLGGESPGYLEKDKHYREADAALLNVIYPANLAKINTRRKEQVLKIVKKL
AGPYGIKRYEKDNYQSANFWFNDIKTDTDQNSHAKRDGFAPIFPDICYHL
THYKPAAADIPVASDNPAHYADAIRYNARTPLQGSLLPLTRLVWA
>SSO_2028 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_4296 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3565 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWK
YPTAPKKSQSVA
>SSO_3742 IS2 ORF2
MERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPES
NGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYR
SPREYLRQRACNGLSDNRCLEI
>SSO_1192 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0301 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P064 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SSO_0467 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P177 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1919 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P183 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHQVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_1049 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1916 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3747 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0730 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_2439 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0581 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_3226 putative transposase
MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG
MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH
KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLV
PKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
>SSO_3676 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1564 IS2 ORF2
MRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLR
VTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEW
LTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYI
SIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSD
NRCLEI
>SSO_3086 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDAVVIWM
TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS
VELHDKVIGHYLNIKHYQ
>SSO_1268 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P174 putative IS orf
MLTELLTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQELDALL
YGSASTVPVVLTESTVMPKLPVVKKRPRRP
>SSO_0326 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1730 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1915 hypothetical bacteriophage protein
MRTVITYLRFSSAIQGAEGADSTRRQNDLFKQWLKKNGDAQIVASFSDEG
LSGYKGKHLTGQFGDMLARIEAGEFPEGTILLVESIDRIGRLEHLETEAL
MNRILGNGIEIHTLQDGLIYTKDALADDLGISIIQRVKAYIAHQKSKQKS
FRVSQKWGQRAKLALAGEQRLTKMVPGWIDPETFKLNEHAETVRLIFKLL
LDGESLHNIARHLQSNGIKSFSRRKDANGFSVHSVRTILRSETTIGTLPA
SQRNDRPAIPNYYEGVVDIPTFNKAQEILDKNRKGRTPASDNPLTINIFK
GLFRCQCGASVHPTGTKNKYAGVYRCNNHLDGRCDVPPLKRKPFDRWMID
NFLGMIDVGNDGESERKIAALQHEVEIVTARIKKATALLLEMDDIDELKI
QLKELNQKRTELQTTIDNMRRKASLTDKELPQLKDIDLMTKAGRVECQLI
LSKHLKGLTLGKDSVTVTLQNDTEITIPTNPLPLNDGSPIFEIADKELLD
IDAYQL
>SSO_P236 putative transposase
MWCFFNLFGVLIPIDERNLTRERTQVGLQAARARGRKGGRPKTLSKDKQA
LAVQLYNEKKHTVAQICVLMGISRPTLYKYIESARLFKK
>SSO_2759 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_0407 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVLGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHHQ
>SSO_2401 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P122 putative transposase
MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS
RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGYYIIHRK
NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS
WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ
FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT
S
>SSO_2448 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1188 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1803 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0342 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_3612 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1253 putative resolvase
MAILGYCRVSTDDQSITNQQMQIEEAGYNIAKWFADEAVSGSVKASLRNG
FSSLLAYAREGDTVVVVAVDRLGRDTIDVLSTVKALQAKGVTVISLREGF
DLSSAMGEAMLGIMSTLAQLERSLIAERRKAGIERAKAEGVHMGRPVKAS
SEAVQMLISQGKTRLQIQEELGISRATYYRLAK
>SSO_P228 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_2447 IS21 ORF1
MLSREDFYMIKQMHQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHASLWQQVSQGGTSTTECL
>SSO_4216 putative transposase
MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG
MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH
KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLV
PKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
>SSO_1171 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0656 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P173 IS4 ORF
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_3989 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWK
YPTAPKKSQSVA
>SSO_P175 IS3 ORF2
MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG
DITYLRTDEVRLHPVSTEPHAF
>SSO_4309 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P141 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW
VRQHERDTGGGEVGSPPLNVSV
>SSO_3824 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_1397 conserved hypothetical protein
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFADGKVELDESQQQALV
RELNEELGIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALQYPLAPADIPLLEAFMALRAARAAD
>SSO_0729 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_4312 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0969 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTLRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2578 putative DNA replication factor
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>SSO_P001 putative resolvase, fragment
MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR
SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP
DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI
STVL
>SSO_0725 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3285 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTYEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P071 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRKNHGSLAAANRGVAEYELSE
>SSO_0880 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_1940 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_3590 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKHVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTIGGFNSETV
SAPVI
>SSO_1264 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_0659 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2141 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3596 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2746 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P217 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2449 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLTRLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2760 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_P152 IS600 ORF2
MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA
PMESFWGTLKNGTGTE
>SSO_0144 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P039 IS629 ORF1
MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SSO_1567 IS2 ORF2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMCQRQ
>SSO_3838 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1376 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2671 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P037 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_3129 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2291 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_0361 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1191 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2433 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_4439 IS1 ORF
MDEQWGYVGAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSP
FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRK
SLSFSKSVELHDKVIGHYLNIKHYQ
>SSO_2246 IS2 ORF2
MANYRLISLALTLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDN
GEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPA
SPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTI
KRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQAS
NGLSDNRCLEI
>SSO_1255 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3013 IS2 ORF2
MSRAQLHVILRRTDDWMDGRHSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_1758 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2472 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P140 IS629 ORF2
MPLLDKLREQYGVGSVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL
KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVHTTVSRKAVAAGDRVNRHQGNMPRTPGGPQRLVYVVSAADKDKHTS
AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT
>SSO_4004 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLWQRACNGLSDNRCLEI
>SSO_4033 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P137 putative reverse transcriptase, fragment
MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA
VKENWQWKPAVAYCCYADDCVPRRRVLGT
>SSO_2022 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3591 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_4491 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P036 IS21 ORF1
MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF
MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT
VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ
DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA
DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE
QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW
DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS
SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_3288 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0718 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2910 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P172 putative IS orf
MAGRRLGVPKSTVCGMFVRFRNAGLSWPLPAGMSEQELDALLYGSASTVP
VVLTESTVMPKLPVVKKRPRRPNADQLRIS
>SSO_1068 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_0044 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2182 putative integrase
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFRGKISSDGCLKKEQMVVSAIAS
TPQLPMLGEKRLVSSVTKEDLLFVRRDLLTGYQKLSNGKISSIKGRSVVT
VNYYMTTIAGMFQFATDNGYTSGNPFNGLTPLKKSKIEPDPLTRDEFIRF
IEACRHQQTKNLWIIAVYTGIRHGELVSLAWEDIDLKARTITIRRNYTKL
GEFTPPKTDAGTGRTIHLVQPAIDALKSQAEMTMLGKQHSVEVKQREYGR
STVHKCTFVFSPQVIKQRQFSGPHYKVDSIRESWTSILKRAGLRHRKSYQ
SRHTYACWSLAAGANPSFIASQMGHTNAQMVFNVYGAWMKDNNHEQIELL
NKRLSESVPCMPHKKVG
>SSO_0740 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P075 putative IS orf
MLTDIFNSNYQCYGYRRLHAMLRHEGGRLSEKVVRRLMVEEQLVVSRNRR
RRYSSYCGEIGPAPDNLIARDFKAEQPNQK
>SSO_1738 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1176 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_3251 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDAVVIWM
TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS
VELHDKVIGHYLNIKHYQ
>SSO_P119 IS600 ORF2
MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA
TTSSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQHPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISS
AYGKTD
>SSO_2756 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1708 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3112 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3819 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P176 conserved hypothetical protein
MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF
VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS
RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD
DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK
ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPKGREVTFSAFSDWLPRNR
AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP
TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV
QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV
NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR
LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV
KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA
EIYKFFTNALYVALTRATHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE
ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK
QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL
LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ
LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL
SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC
VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN
PGLKIRQGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE
RSEKSVGYWVGGIRRAAQKA
>SSO_2932 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0701 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_3377 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P165 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P130 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAFAAGDRVNRQFVAERPDQLWVADFTYVSTCVSASDIRR
>SSO_P052 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1668 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2775 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_1655 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P045 ISSfl2 ORF
MALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRAFASAAHLAAY
SGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTR
KMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_1459 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1755 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P035 conserved hypothetical protein
MFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIPTVSDR
IAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCWRYSWILE
VDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGELIT
RTRGTPQGGVISPLLANLFHHYAFDLWMEREYRGYRLRGTLTIL
>SSO_1727 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_4458 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_P025 IS3 ORF1
MTKTVSTSKKTRKQHSPEFRSEALKLAERIGVAAAARELSLYESQLYAWR
SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK
>SSO_3205 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDIPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWQGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_2297 IS600 ORF2
MSISPLFRWPNSVCHFTRLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>SSO_1779 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1351 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2649 putative protein encoded within IS
MNDISSDDIFLLKQRLAERQLKTKPLLKSLESWLREKMKTLSRHSELAKA
FAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKNYLFFGSDHGG
ERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVGELLPWRVALP
TE
>SSO_P026 IS3 ORF2
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD
SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS
RKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLA
VVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHSDRGSQ
YCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHGEHFIS
REIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>SSO_1247 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1654 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_3408 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2722 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLPLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1978 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3901 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1031 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0237 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_1918 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1616 IS600 ORF2
MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD
RGSQYCAYDYRVIQEQLV
>SSO_1700 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_4483 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_2299 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3231 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_1920 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_4474 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_3744 putative integrase
MALTDIKVKTAKPKDKPYKLADGGGMYLLINTNGSKYWRMKYRFAGKEKM
LSIGVYPDVTLAGAREKRSEARKLLAAGGDPGEAKKEEKIAQQMSLKNTF
EAIAREWHQSKADRWSLRYRDEIIDTFEKDIFPYIGKRPIAEIKPMELLE
ALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPDLASALATPK
KVHFPFLTANELPHFLNDLAGYTGSIITKTATQIIMLTGVRTQELRFARW
EDIDFETKLWEIPAEVMKMKRPHIVPLSEQVIMLFKQLEPISKHHPLVFI
GRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILHEQGFNSAWI
EMQLAHVDKNSIRGTYNHALYLDGRREMMQWYADYIDSLSSRES
>SSO_0719 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P127 putative transposase, fragment
MCWGRTALYMAALEAPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTI
MNAMLRKNEEWNESYL
>SSO_3072 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2330 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARQGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1464 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3127 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSCIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3825 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3214 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_2680 IS629 ORF1
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_1759 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1540 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1312 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGND
LPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFV
KTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQ
RACNGLSDNRCLEI
>SSO_1671 ISSfl2 ORF
MPNCDRFRCHFAPHALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRI
RGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLGFAG
>SSO_1991 IS2 ORF2
MDGRHSHHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN
AKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRC
DNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNEL
PASPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK
TIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQ
ASNGLSDNRCLEI
>SSO_1393 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1256 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2667 putative protein encoded within IS
MIPLPSGIKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTSDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>SSO_2776 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVWICPYISRHLLSLNPLQARCRRYSRGER
>SSO_1263 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1988 putative virulence protein
MRRFAGACRFVFNRALARQNENHEVGNKYIPYGKMASWLVEWKNATETQW
LKDAPSQPLQQSLKDLERAYKNFFQNRAAFPRFKKRGQNDVFRYPQGVKL
DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGKWYISIQTESEVS
TPVHPSASMVGLDAGVAKLATLSDGTVFEPVNSFQKNQKKLARLQRQLSR
KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK
VSNMSKSAAGTVSQPGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWRGG
QVLAVPPAYTSQRCACCGHTAKENRLSQSKFRCQVCGYTVNADVNGARNI
LAAGHAVLACGEMVQSGRPLKQEPTEMIQATA
>SSO_3992 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLTRLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P216 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2793 putative helicase
MITGVWKYRGKSSVHQPPHCRQDHRRGAQLVVVQLAQLPFLYQRCGCMTT
PVWRNDDLEGAVIGAFFLRGADHEVMDILITLPADIFSVRAYRDIYTGIC
RQARVSGVIDPVLLCNEMPELAPVITDTGRKTWVKSSLEHYVAALRRNAA
LRDAEKTLNEALQKLRDAHTCEAAEDALKDAQNMMVTLSTGKGVIQPVHI
DDVLPEVVERVECRNQGLEKSRTLMTGIDELDAKTGGMEPGDLVFIAARP
SMGKTELALDIIDKVTEQGHGVLLFTMEMANIQIGERMVSAAGGMPVSRL
KSVAHFEDEDWARFSQGVGRMTGRNIWMVDQANLTIDEICATTKHHLIKH
PETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIALSQL
SRGVESRPNKRPMNSDLKNSGEIEADADIILMLYRDEVYNPDTQATGIAE
INITKQRNGSLGTIYRRFYNGHFLPVDQESAQVLSTPMRQPQPRRYSNTR
TDSSKMERFF
>SSO_P082 IS100 ORF1
MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>SSO_4109 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1063 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_0824 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3611 IS629 ORF2
METTFVLDALEQALWARRPSGTIHYSDKGSQYVSLAYKARLKEAKLLAST
GSTGDSYDNAMAEIIKGL
>SSO_P151 putative reverse transcriptase, fragment
MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY
KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI
PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH
QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI
ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP
GVPFERYADDVVCHCHSQWQADALISGLRQRLAQCGLQLHPQKTRIVYCK
DADRRGDYPETSFDFLGYTFRPRLSMNRWGKTFVNFSPAMSARAGKAIRQ
EVRRIAVTSPCTSWRICSMRKSEAG
>SSO_4469 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3684 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1928 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3030 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
VVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1902 putative endonuclease of cryptic prophage
MTERIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKL
SLSGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLMDDEQFDEINI
VRAQPVSGGRLGVKIYPIMLEGQVKK
>SSO_1585 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2402 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P068 ISSfl1 ORF2
MCRRCSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH
ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA
VIPRKSNEKMASDGRAQLDV
>SSO_2157 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0516 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHHQ
>SSO_2724 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2669 putative transposase
MGELLLHPGISTTEASMNNNNTLYVGLDVHKESITVAYAINSEPVELMGK
IGTSPTDIQNLCKRLRSKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCA
PSLIPKKPGERVKTDRRDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARA
WASARDDLRHARQRLKSFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWR
QLAFDEHRRTIEDRQAQCERLESALKEAVTEWRLYPVVEALQAMRGIQFI
TAVGLISELGDLTRFEHPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSY
ARKLLVEAAWSYRHPARISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRK
LQAKGKNVNITIVAVARELAGFIWDMGRIAMSVAQQPQCHK
>SSO_1292 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2801 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P124 IS3 ORF2
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD
SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS
RKFSPVSYRAHGLRCTGNSGHHHLFFF
>SSO_3563 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2331 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1701 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0147 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_4342 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3241 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1686 IS1 ORF
MSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SSO_P210 ISSfl1 ORF2
MRQQQDEQGRFSICSRQAAVVHRDAYCNRNVVERCFGRLKEYRRIATRYD
KTARNYLAMVKLGCIRLFYQRLRN
>SSO_3553 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4475 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVLGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1643 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHHQ
>SSO_2992 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1707 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2814 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P056 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P073 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3947 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_2574 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P074 IS600 ORF2
MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA
TTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHTDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFRIKYYQMTA
>SSO_2274 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1990 IS2 ORF1
MIDVLGPEKRKRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_1907 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0575 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1619 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDHKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1692 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P155 putative IS orf, fragment
MYLKIRDRLGYMSNTSSNFEMTGTLLGLELRKRKTPQEKIAIIQQTMEPG
MTVSHVARLHGIQPSLLLKWKK
>SSO_P018 ISSfl4 ORF2
MISFPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRR
GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
DWKHPKRTERAGIRI
>SSO_3916 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVRRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_2934 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3837 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLPNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2248 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLTTAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_P076 IS1353 putative transposase-like protein
MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY
RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI
TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE
>SSO_P192 oriT nicking and unwinding protein, fragment
MMSIAQVRSAGSADNYYTDKDNYYVLGSMGERWAGQGAEQLGLQGSVDKD
VFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSVSMMAMLGGDK
RLIEAHNQAVDFAVRQVEALASTRVMTDGQSETVLTGNLVMALFNHDTSR
DQEPQLHTHAVVTNVTQHNGEWKTLSSDKVGKTGFIENVYANQIAFGRLY
REKRKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQTIREAVGEDAS
LKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRAYRDAAEQRAY
TRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQFMYTDLLARTVGILPPE
NGVIERARAGIDEAISREQLIPLDREKGLFTSGIHMLDELSVRALSRDIM
KQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSLAIVSGQGGAAGQRERVA
ELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLLEGMAFT
PGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAM
KDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQ
VSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMY
RPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGETQVVRISSLDS
SWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSEDAMTVVVP
GRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNA
TLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETLLE
TAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFAAEGTG
FADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEG
KEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTT
QFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDT
QLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAVGGGRAVASGDT
DQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAVYSLINRDVER
ALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEA
FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREK
AGELGQVQVMVPVLNTANIRDGELRRLSTWENNPDALALVDNVYHRIAGI
SKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRIRFTKS
DRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAI
TAHGAQGASETFAIALEGTEGNRKLMR
>SSO_0025 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1796 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1770 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0426 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIIGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYVNATMFDRYLPFSCPLNTFVTDAVRFPPF
HHH
>SSO_1399 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4313 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2651 IS600 ORF2
MAHIRTRETYGTRRHQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0007 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4248 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1577 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4457 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_4044 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_4389 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGYYLNIKHYQ
>SSO_1728 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P027 IS100 ORF1
MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>SSO_0341 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P123 IS600 ORF2
MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP
AAFRIKYYQMTA
>SSO_3740 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_2795 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_0324 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_2685 conserved hypothetical protein
MPRTVTHNPDSPNNDDVLAASEKWDACKPPYTSAHMKICVAAAKIILAAS
GVARRSKYEKENYLRIDFSKAGKVTFYAEFPKKMGLKGKKLGEWPELAIQ
LAREKALGMADGGLRAESVHAALEMYRDDLKAKVARQKLSPDSFTTYGVR
IDRIKATFGEREVFSDVTYNRLVEVLDEWIATRSNNNALELFAELRRFWK
FCAPTLCNGRNVAASLPDDYVSSRVQKPTPTRLFTDIESIARLWLNVAAC
TSVHQKNAVRFMIITGVRPINVHNLRWDYVYEEAGEIVYPEGVIGMRGAM
KTQKAFRLPITPEIRRIIDEQKAWRDSVPECNRDYVFLQPRDPMQPFSKR
SLDKLVKTYSPDGAVKGIKHDGTVKGKDGAFNTMCRKFLKSNVIALIDRN
>SSO_4046 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1977 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P021 putative transposase, fragment
MTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRTLRDRNGTFEPQQ
LKKNQP
>SSO_2935 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_0873 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3464 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_0039 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0743 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKAIGHYLNIKHYQ
>SSO_1539 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P033 putative transposase, fragment
MSEQKITGIDLAKTNFYLFSINAHGKPAGKTKLSRNQLLNWLVQQPKMTV
AMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYNDAQAIAEA
CQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRGLLAEYGIV
FSKGAADLRQK
>SSO_3114 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_1051 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P017 IS4 ORF
MKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGE
MADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNL
VRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRD
LASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1223 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P121 putative transposase
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALVEHYGTVIIPARPRKPKDKPK
AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRLGEQHPAYASE
HENLRGPGYYH
>SSO_0148 IS2 ORF1
MGWQKCSGIKRSYLVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFE
PGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAM
KQIKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>SSO_1016 hypothetical protein
MIKHIIIAERAHQESLVKQKRIAQVWNHKTQLARELKKPMGKQAPGWLEL
SEDGSYYIVDEDKASLVNIIYDKRLSGMSMFAICKWLNEQGYLTINQRKV
RISKTKKPDGNWSALSVKHILTSRSVLGYLPAKISTEDRKTVLREEIEGF
YPQIVTDSKFYAVQQLLEETGKGKTSSGEHWLYVNILKGLIRCKCGLVMT
PTGIRKPVYQGTYRCNGNKESRCSYGTVSRKLLDTQLCSRLFSKLSQLHD
EATDTAKLDELQRRLNTVDGELEKLTETLIQLPNITQIQEALRVKQEEKD
ELIVQLSREKARVKSVSSLDLSGLDMESVEGRTEAQIIIKRLVKEIVVSG
NEKLVDIYLHNGNMIRGFPLDGKDDHTLTLEEATDEMQSLDDMLIFGEPV
TRIYPAGDMEEVDA
>SSO_4108 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_3914 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGGHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKGRKIYPAPSLTEHGMLNSGFVRGIENFRPKERMSILQLLLLH
VSWRVLSGIWAE
>SSO_3284 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2249 IS2 ORF2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRHSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLVSTPRCLTVICHFHAR
>SSO_4433 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3319 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_2298 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_P128 conserved hypothetical protein
MKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAK
TIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVM
FHYGKLNMAF
>SSO_1739 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2038 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_2682 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_0018 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_0977 IS1 ORF
MATAVRLHRFSTRYAPENHLYGHEWRWMPGNCTHYGRWPQHDFTSLKKLR
PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAH
VFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQ
RIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SSO_1078 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3182 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_2991 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P157 IS100 ORF2
MLHEEKLARHQRKQAMYTRMVAFPAVKMFEEYDFTFATGAPQKQLQSLRS
LSFIERNENIVLQGTSDITNPRVGICV
>SSO_1160 IS2 ORF2
MSRAQLHVILRRTDDWMDGRHSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKST
AVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQQASNGLSDNRCLEI
>SSO_2176 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRH
>SSO_0723 putative DNA replication factor
MKNIATGGVLERIRRLTPPHVTAPFRTVAEWREWQLAEGQKRSEEINRLN
RQLRVEKILNRSGIQPLHRKCSFANYQVQNDGQRYALSQAKSIADELMTG
CTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDD
GQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVG
MLTNLNYEVMKTLLGERVMDRMVMNGGRWVNFNWESWRPNVSHSRVVK
>SSO_1070 putative transposase
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK
AENGVLIVELWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE
HENLRGPGYYH
>SSO_1783 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2407 IS2 ORF2
MLGAIEKRFGDKVPEQSIQWLTDNGSAYRAHETRQFARELNLEPCTTAIS
SPQSNGMAERFVKTMKEDYIAFMPKPNVRTALHNLAVAIEHYNENHPHSA
LGYRSPREYRRQRVTLT
>SSO_1467 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4481 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P054 ISSfl1 ORF1
MVHKSDSDELSALRAENARIIKPLLLPEPATPRAGRPWAEHRKIINGMFW
VLCSGAPWRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGF
IDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQ
TEVASR
>SSO_0784 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P030 iso-IS1 ORF2
MVTSDDWGSYAREVPKEKHLTGKIFTQRIERNNRTLRTRIKRLARKTICF
SRSVEIHEKVIGSFIEKHMFY
>SSO_P154 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_P034 conserved hypothetical protein
MDGKRISGVPFERYADDIVVHCSRMSDATRLKNRLSERFSEVGLVLNAGK
TNIAYIDTFKRRNVATSFTFLGYDFKVRTLKNFKGERYRKCMPGASNAAM
RKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSRNFNYRL
WSAMQSRLLKWMQSKYRLSNRKAQRKLTLVRKEYPKLFVHWYLLRASNE
>SSO_4310 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P162 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIMLTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_4440 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_4470 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3811 putative transposase encoded within IS
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK
AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRLGEQHPAYASE
HENLRGPGYYH
>SSO_0045 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2655 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKNDATAG
>SSO_1772 IS600 ORF2
MSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR
QRRHSRLGNISPAAFREKYHQMAA
>SSO_3290 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3543 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2132 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHHQ
>SSO_4366 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P028 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_4490 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P139 IS3 ORF2
MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF
YASGPNQKWAGDITYYYSSPTAGKHGAPGY
>SSO_1615 IS2 ORF2
MRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLR
VTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEW
LTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYI
SIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSD
NRCLEI
>SSO_P146 IS629 ORF2
MGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTW
QGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHH
SDKGSQYVSYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRK
SWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLA
A
>SSO_2882 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3581 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2362 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4511 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3609 IS150 ORF B
MSHIRPDFPEPYGLPVDGLRGKTSIFSHPVYLFLREFSLQDSRGGSLSKK
AESLSSGKITIGTKEKVIIINELRQCHPLSQLLVIADLPRSTFYYHVKRL
NAPDPYQLVKQVILRIYHQHKGRYGYRRIRLACRNESILLNGKTIRKLMK
ELGISSLIRRKKYRTYRGEQGRTCNNLLKRQFYADRPNQKWVTDVTEFKV
DGRKLYLSPIMDLYNGEIVSYNLTERPLASMVKSMLLDAVEQLNKDDKPL
LHSDQGWQYQMPRWQRWLSDNGITQSMSRRGNCLDNAAMESFFSTLK
>SSO_P129 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_P166 putative IS orf, fragment
MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI
CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLL
>SSO_1073 IS600 ORF2
MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD
RGSQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHY
RFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3588 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_2681 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_3465 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_P138 putative reverse transcriptase, fragment
MARTRSGRETSRTITAHRLRGNTGRRVIEGDLSSYFDTVHHRLLMKAVCR
RISDARFMRLLWKTPC
>SSO_1172 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P029 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_3561 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3823 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_0732 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P042 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_P019 putative IS orf, fragment
MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVL
PSPLRQSSAHKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVLEQLE
LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG
KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL
MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGSYRALYESGRITEAQQRIGELYAIE
AEVRGCSAEQRLAARKARAAPLMQSLYDWIQQQMKIHSLKMECLHGEHYY
PSGNSAGNSV
>SSO_0941 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3116 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1069 putative transposase subunit
MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS
RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK
NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS
WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ
FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT
S
>SSO_3769 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1417 putative excinuclease subunit
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQTIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE
EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>SSO_0293 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3948 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_1891 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P060 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_P080 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_1325 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLSIKHYQ
>SSO_0816 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2993 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
VVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_3675 putative DNA processing protein
MDADSSDSTVTPGRLPDGLSSCPCFYLGKKLMNLSANAQATLLLTSDFSR
AAASEYKPLSNSEWGKFALWLKHQRISPAELLVPQPQEKLTGWSDPRISQ
ERILGLLARGHSLALAVDKWQRAGLWILTRGDADYPVRLKNRLRTDAPPV
LFGCGNKALLQAEGMAIVGSRDAPTDDLRYTQQLAAKLAQQGICVISGGA
RGIDECAMASALEAGGTAVGVLADSLLKTSTLVKWREGLIAGNLVLISPF
YPEVRFTVGNAMARNKYIYCLAESAMVVRAGMTGGTITGAMEALKHQWLP
VQVKPNQDMQSANSRLVENGASWSAEQAENVTIRLPDVPGLMYDRALRNA
QPELFSLHEDDANYAVMPAYTPVDFYQLFVAELAILAKESISIERLASCT
GLTIEQISVWLNRAEEEGRVIRLGEGHYQFR
>SSO_2453 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
AHQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P131 IS21 ORF1
MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF
MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT
VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ
DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA
DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE
QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW
DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS
SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_2780 hypothetical bacteriophage protein
MAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAE
EDLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIAI
QGNTITGAMGGRVKNPELTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAG
QKKATNPFLKIIES
>SSO_0628 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P063 ISSfl1 ORF1
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>SSO_1756 IS600 ORF1
MWSAGIDTSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3594 IS629 ORF2
METTFVLDALEQALWARRPSGTIHHSDKGSQYVSLAYKARLKEAKLLAST
GSTGDSYDNAMAEIIKGLYKAEVIHRKSWKNRTEVELATLTWVDWYNNRR
LLERLGHIPPAEAEKAYYASIGNDDLAA
>SSO_0149 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGIGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_3667 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVRRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1322 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0327 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_2041 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P013 IS21 ORF2
MNKRAFFGAFLIFWGFKFLSMNCRYEKASIILTSNKGVADWGEMFGDHVL
ATAILNSCA
>SSO_1683 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_4066 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0019 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1628 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1729 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3583 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGLCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_4512 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYSCSRRIALFFWALIVLLLFLSNEAIWLMPHPLPNSRSTLR
SERVRMTER
>SSO_1890 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1867 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1581 putative DNA replication factor
MKNIAAAGVLERIHRLAPQGAVPPYRTVEEWREWQLAEGRKRSEEINRQN
RQLRVEKILNRSGIQPLHSKCSFANYQVQNDGQKYALSQAKSIADELMTG
CTNFVFSGKTGTGKNHLAAAMGNRLMAKGRSVIIVTESDVMSVLHDSYDN
GKSGEKFLQELCGVDLLVLDEIGIQRETKNEQVVLHQIIDRRTASLCSVG
MLTNLNHAAMSTLLGERIMDRMTMNGCRWVTFNWDSWRSNVSFPGVVK
>SSO_1894 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0868 putative integrase
MSSAIHRLSDTLLRKLSGSPTTKNAFFNDGGNLSVRHSTSGLLTWYFTYR
AGTGRQVSPERLRLGNYPDLSLKAAREKAAQCRAWLAEGKNPRYELNRAV
QDALAPVTVKEALTYWLESYAKEKRTDYESLKSRINKHIISQIGALPLEK
CELRHWLACFDQMAKRSPVSAGFLLQVCKQALKYCRKRRYAISNVLDDMV
VGDVGKKAEISERVLTNKELGELLRALDEKIYPPYYSALIRLLIVFGCRA
TELRRSEVQEWDFKEMLWTVPKEHSKTKVAIFRPIPEAILPFVTKLVEQN
RHTGLLLGELKGQSSVSEYGRTAHRRINQAPWTLHDIRHTFTTMLNDLGV
DPHVVEQLTAHQLPGVQRVYNHSRYLDAKRDALNLWVERLELLQNNDEKI
VVMTPRIYSQNS
>SSO_3012 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_0292 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1159 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0307 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0088 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P006 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_2679 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAEIIKGLYKAEVIHRKSWKNRT
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_1924 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P010 conserved hypothetical protein
MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV
ESRRQAKGTRFLWQHSNKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD
IWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAKTIGKRLYGILNAMRHG
VSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVMFHYGKLNMAF
>SSO_2794 IS2 ORF2
MSRAQLHVILRRTDDWMDGRHSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKST
AVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQQASNGLSDNRCLEI
>SSO_1239 putative transposase
MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG
MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH
KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIG
>SSO_1709 putative virulence protein
MKRLQAFKFQLRPGGQQECEMRRFAGACRFVFNRALARQNENHEAGNKYI
PYGKMASWLVEWKNATETQWLKDSPSQPLQQSLKDLERAYKNFFRKRAAF
PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYRNSRQVTGVVKNV
TVSQSCGTWYISIQTESEVSTPAHPSASMVGLDAGVAKLATLSDGTVFEP
VNSFQKNQKTLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANIRRDYL
HKVTTAVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL
DQGWYEMHRQLEYKQLWRGGQVLAVPPAYTSQRCAYCGHTAKENRLSQSK
FRCQVCGYTANADVNGARNILAAGHAVLACGEMVQSGRPLKQEPTEMIQA
TA
>SSO_1816 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3584 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_0315 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0288 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1072 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_4435 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0702 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_2700 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2451 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2452 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_P043 IS2 ORF2
MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFVKTMKEDCI
AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST
>SSO_4311 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_1071 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_0823 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3249 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_4243 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1797 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P163 IS21 ORF1
MGYTGGRSMLRYYIQPKRKMRPSKRTVRFETQPGYQLQHDWGEVEVEVAG
QRCKVNFAVNTLGFSRSFHVFAAPKQDAEHTYESLVRAFRYFGGCVKTVL
VDNQKAAVLKNNNGKVVFNSGFLLLADHYNFLPRACRPRRARTKGKVERM
VKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETPEQRFA
LEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCGQPVSI
RISLDDELRIYSNEKLVASHRLCSASSGWQTVPEHHAPLWQQVSQVEHRP
LSAYEELL
>SSO_3607 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_3585 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1323 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1054 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_0726 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0901 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P016 putative transposase
MEYRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCGMFVRFRNAGLSWP
LPAGMSEQELDALLYGSASTVPVVLTESTVMPKLPVVKKRPRRP
>SSO_1015 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3598 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGGHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAW
>SSO_0731 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_1566 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGDCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_2592 conserved hypothetical protein
MATGGIHMELHCPQCQHVLDQDNGHARCPSCGEIIEMKALCPDCHQPLQV
LKACGAVDYFCQHGHGLISKKRVEFVLA
>SSO_3632 putative virulence protein
MRRFAGACRFVFNRALARQNENHEAGNKYIPYGKMASWLVEWKNATETQW
LKDAPSQPLQQSLKDLERAYKNFFQNRAAFPRFKKRGQNDVFRYPQGVKL
DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGKWYISIQTESEVS
TPVHPSASMVGLDAGVAKLATLSDGTVFEPVNSFQKNQKKLARLQRQLSR
KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK
VSNMSKSAAGTVSLPGRNVRAKSGLNRSILDQGWYEMRRQLAYKQLWRGG
QVLAVPPAYTSQRCVCCGHTAKENRLSQSKFRCQVCGYTANADVNGARNI
LAAGHAVLACGEMVQSGRSLKQEPTEMIQATA
>SSO_2181 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1267 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1670 putative transposase
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK
AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE
HENLRGPGYYH
>SSO_1310 putative phage integrase protein
MQRKKYDPNLPRNLTYRRRDKAYYWRNPLTKEEFTLGKISRRDAVTQAIE
ANHYIYKNYSPAALIEKLKGFDSFTMADWIERYKTILIRRKVSRNTYKIR
ANQLETIKEKLGEILLTEITTRHIAEFLDLWIEGGKNTMAGSMRSVLSDM
FREAIVEGRISQNPVTPTRAPKIVVTRERLKLKIYNCIREAADQLPAWFP
LAMDLALVTGQRREDITNMRFSDIYDDRLHVRQIKTGMMIAIPLSLSLPV
AGLRLGTVVERCRLVSRGDYLISAGIRKNSPDGSIHPDGLTKKFVAARKF
TGIQFSENPPTFHEIRSLAGRLYKETCGEEFAQRLLGHTSEKTTKMYLDE
REKTYLLL
>SSO_2699 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P235 IS629 ORF1
MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SSO_P133 IS629 ORF2
MFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYVSLA
YTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVE
LATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_2174 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P188 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0748 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1884 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0289 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_1757 IS21 ORF1
MEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQDAEHTYESLVRAFRYFGDC
VKTVLVDNQKAAVLKNNNGKVVFNSGFLLLADHYNFLPRACRPRRARTKG
KVERMVKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETP
EQRFALEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCG
QPVSIPLKSSKLKLSERFLKINFRSVKALPDYLFLKAL
>SSO_3040 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSVTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_0405 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVLGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1437 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRSQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3222 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIKRHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2024 IS2 ORF1
MTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQ
IKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>SSO_0173 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1189 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3131 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P012 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDAARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_0910 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1926 putative resolvase
MAILGYCRVSTDDQSITNQQMQIEEAGYNIAKWFTDEAVSGSVKASLRNG
FSRLLAYAREGDTVVVVAVDRLGRDTIDVLSTVKALQAKGVTVISLREGF
DLSSAMGEAMLGIMSTLAQLERSLIAERRKAGIERAKAEGVHMGRPVKAS
SEAVQTLISQGKTRLQIQEELGISRATYYRLAK
>SSO_2757 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P134 putative transposase, fragment
MNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHVA
KQPGEVVDKTRQNEPPRVSWRVFYL
>SSO_2668 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1288 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1320 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1269 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P125 IS3 ORF1
MTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLYAWR
SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK
>SSO_0017 putative transposase
MANYRLISLALTGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_P002 IS2 ORF2
MVHATGLMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI
MPKPDGLTAAKNLAEAFEHYNE
>SSO_0881 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_1265 IS2 ORF1
MRTLRKQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIK
ELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>SSO_1451 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_P046 ISSfl2 ORF
MQPPGCRGSGKRLFDKALPNDENKLRSLISDLKQHGQILLVVDQPATIGA
LPVAVARSEGVLVGYLPGLAMRRIADLHAGEAKTDARDAAIIAEAARTLP
HALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRIRGLLTQIHPALER
VLGPRLEHPAVLDLLQRYPSPEKLASLGFAG
>SSO_4468 IS1 ORF
MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK
SVELHDKVIGHYLNIKHYQ
>SSO_3566 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPDNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSWRENTATG
>SSO_2025 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3597 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P218 IS630 ORF
MQTMTSRSRQAAYSISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL
KENPKFTYRKLKN
>SSO_2666 putative protein encoded within IS
MNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTR
KPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIR
TVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYR
QSEIYGRQGVELSRSLLSGWVDACCRLLSPMEEALHGYVLTDGKLHVDDT
PVPVLLPGNKKTKTGRLWAYVRDDRNAGSTLAPAVWFAYSPDRKGIHPQT
HLAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHDVHVRTPSAL
TEEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSLESWLREKMK
TLSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRK
NYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRV
GELLPWRVALPTE
>SSO_3677 conserved hypothetical protein
MEKHGAELLLQRMLSNTSATFREGQWEAIDAVVNQRRKLLVVQRTGWGKS
AVYFIASKIFRDRGAGPTIIISPLLALMRNQVAAAERLGITAETLNSTNR
EEWQRISDKLLRGGVDCLLISPERLANQDFLETANTPVLGTTATANNRVV
EDIRQQLGDIVIQRGTLARESLALDALVLGEQSSRLAWLATVIPQFSKSG
IVYTLTTRDAELVAEWLRTNGISAFAYYSGVTCEGAEDSNTAREYLEQAL
LANKIKVLVATTALGMGFDKPDLGFVIHYQMPGSIVGYYQQVGRAGRAID
SAVGILLCGGEDRAIHKFFRESAFPAEAQIHEILNVLSENDGLTLRGIEQ
RTNLRYGQIEKALKLLVAENPSPVVYTEKLWRRTIVSFSPDHERINHLMN
QRKNELADVESYITTKECKMQFLRRALDEPSAERCGKCSSCLQHPLLSPD
IDSGLLHAANLFIKHADLPLNLNKQVASGAFTQYGFKGNLPAGLQGSTGR
VLSRWGDSGWGKQVAQEKKTGRFSDELVEACAEMVRQRWNPHPEPTWVCC
VPSLKHLDLVPDFARRLAAKLGLPFIDAIEKVVDNPPQKMQQNRFGDAAN
LLI
>SSO_1313 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_1605 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGYYLNIKHYQ
>SSO_2287 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3162 conserved hypothetical protein
MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDDPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR
LETDVYVPLA
>SSO_2905 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_0304 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0143 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P077 IS21 ORF1
MADVADRRELRQFRQTPEQRFTQEQEHLQPLLGTDFDIRHVSWDGYIEVG
GNRYSVPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVR
PGCSSVTAKSA
>SSO_1617 IS600 ORF2
MSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR
QRRHSRLGNISPAAFREKYHQMAA
>SSO_1521 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWK
YPTAPKKSQSVA
>SSO_2650 putative protein encoded within IS
MIPLPSGIKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTSDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>SSO_0509 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2247 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_1610 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1238 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0305 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2652 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0428 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_P051 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNLFFEMKA
>SSO_4297 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1055 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_P038 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_0488 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1811 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1415 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4441 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_0198 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_0290 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3589 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQ
>SSO_2660 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3904 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_0500 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGECTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKRYQ
>SSO_3749 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1198 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_2165 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1196 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHHQ
>SSO_P189 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P161 conserved hypothetical protein
MVGVAGAALAPLVKLLRHELLTRDVIHADETSLRLLDTRKGGKSCSGWLC
AYVSGERSGPPVVCFDSQTGRALRYPETWLQCWCGGTLVSDGYSVYKSLA
DNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKIAGLYRIEKLIRE
RCQRHDV
>SSO_2434 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SSO_2758 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELDEVA
>SSO_3008 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3610 IS629 ORF2
MRKVWRQLLREGIRVARCTVARLMAVMGLVGVLRGKKVRTTVSRKTVAAG
DRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVW
>SSO_2799 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1383 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_3467 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1018 putative transposase
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK
AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE
HENLRGPGYYH
>SSO_3318 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF
HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF
NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF
THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY
FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA
SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL
>SSO_1315 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0287 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_4456 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_P066 ISSfl1 ORF1
MACYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDAGDAANLLI
>SSO_2408 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2725 IS600 ORF2
MAHIRTRETYGTRRHQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1771 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGAVWSENINVA
>SSO_1666 hypothetical protein
MEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVH
DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSA
AGEQFMSLVRSQQCTV
>SSO_1674 H-repeat-associated protein-like protein
MEIKKLMEHFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC
HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMYSLVLGQIKTDE
KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG
NQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDEL
IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT
AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND
KVFKAGLRRKMRKAAMDRNYLASVLAGSGLS
>SSO_0653 putative transposase
METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA
DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA
QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN
PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP
KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK
AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT
RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL
VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG
TWTPERLIEQGNRTGPSTGRVVESMLKAKLHPELAYRAVLGLLALQKKYG
PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE
HENLRGPGYYH
>SSO_3936 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3988 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSWRENTATG
>SSO_2048 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1908 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P120 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1316 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1266 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_0429 IS600 ORF2
MLDGADNPNGELPSYITGADSYDNAPMESFWGTLKNESLSHYRFNNRDEA
ISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0508 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_0504 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGECTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1318 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_4218 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGGHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_4058 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1785 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_0654 putative transposase subunit
MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS
RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK
NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS
WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ
FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT
S
>SSO_0874 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_2653 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SSO_1009 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLPAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_1741 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_1224 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P081 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SSO_P145 IS629 ORF1
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_P178 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3592 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SSO_0425 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1808 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0869 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1895 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_3586 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P150 hypothetical protein
MFNAKIRGWIKYYGAFYKSALYLTLRQIDRKLVLWLPRKHKRLRGHRRRA
SHWLARVARSETRLFAHWPLLWGQASMRRAG
>SSO_3032 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_P055 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>SSO_3218 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_4329 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1760 IS21 ORF1
MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH
KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR
KMRPSKRTV
>SSO_1899 putative DNA adenine methyltransferase encoded by prophage
MLNTVKISSCELINADCLEFIRSLPENSVDLIVTDPPYFKVKPEGWDNQW
KGDDDYLKWLDQCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFSVL
NHIIWAKPSGRWNGCNKESLRAYFPATERILFAEHYQGPYRPKDAGYAAK
GSALKQHVMAPLISYFRDARAALGITAKQIADATGKKNMVSHWFSAGQWQ
LPNESDYLKLQALFARVAEEKHRRGELEKLHHQLVDTYTSLNRQYAELLS
EYKHLRRYFGVTVQVPYTDVWTHKPVQFYPGKHPCEKPAEMLQQIISASS
RPGDLIADFFMGLGSTVKAALALGRRAIGVELETERFEQTVREVQDLVSQ
NG
>SSO_1769 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1246 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3031 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1929 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1618 IS21 ORF1
MVKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETPEQRF
ALEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCGQPVS
IRISLDDELRIYSNEKLVASHRLCSASSGWQTVPEHHAPLWQQVSQVEHR
PLSAYEELL
>SSO_2356 putative regulator
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>SSO_3225 putative superfamily I DNA helicases
MDKNALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIV
SKFFKGEKDDVETVDVILRPKVYFRLLQHGKDRSAGAPDIVTPIVTPALL
SREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFD
DSVDKTAETDEEREARYAALQQEWRQYLYDSERLLKSVAGDWIEKPEQYE
LAEHGYIVKTAQSGGASSHILSLYDHLIVCNKDVPLFNRFASREVHAAES
LLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTG
KTTLVLSIIATQWARAALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQG
SGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA
LLFYLEKAKAAFPGKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQ
IRAARELIANDIEQYLDNLNKLLSGQEQKVTLLKSAKTEWKKYRAGESLI
YSLFSWLPAVRNKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLN
SAEREQTTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQA
DELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQDEKKKKGAKGVTARW
QRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAG
QVLPEVAAASFALAKKALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGS
TQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR
RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGELASSGS
RYNLLEAETIAVWLAENQQNIEAHYGKSLHEVVGIVTPFSAQVSTIKQVL
GKQGISTGTNEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNS
MLNVAVSRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDY
KERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITIISPWLTWQRLE
QTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNFKAALEKL
NALGIATKLVNRVHSKIVIGDDGLLCVGSFNWFSATREARYERYDTSMVY
CGDNLKGEIEAIYNSLERRQV
>SSO_P153 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P061 IS629 ORF2
MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SSO_P193 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0927 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_2736 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0862 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1237 hypothetical protein
MARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLVPK
QTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKKRR
PASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
>SSO_P079 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_P184 putative transposase
MCRLAVEYLLYAARKRGLEIGIFCTIHTLRLHFEEHLPLVVAGRRLGVPK
STVCSMFVRFRKAGLSWPLPAGMSERELDARLYGSASTVPVVLTESTVMP
EVPGVKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNW
RHQYRKGGLLPSGKNMPALLPVTLTPEPDNHGFDIIYMLSTHHQRFTFVR
LFDPYLIGSRPTFSHLAHHHIS
>SSO_0996 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2039 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_0570 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHHQ
>SSO_P186 IS186 ORF1
MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS
LAFGEADYIVRVYWRGLRWLTAEGMRFDMMDFLRGLDCGKNGETTVMIGN
SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA
GHVLLLTSLSEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEL
ELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKKN
>SSO_2929 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2791 putative endonuclease
MRHEFILPYPPTVNTYWRRRGSTYFVSKVGERYRRDVTLIVRQQRLKLNL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDEINIVR
GQLVPGGRLGVKIYEITGDNDGA
>SSO_4314 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_P239 IS911 ORF2
MQTMTSRSRQAAYSGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISH
GSAGARSIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEH
VAIPNYLERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGW
AMSFSPDSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRY
QIRQSMSRRGNCWDNSPMERFFRSLKNEWMPMVGYVSFREAAHAITDYIV
GYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC
>SSO_1304 conserved hypothetical protein
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVRPDRPISPQAMAI
HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM
KLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDI
MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWLFN
NLDSMSPELRLTLKHYLENT
>SSO_2754 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1711 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_0368 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_1979 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_3608 IS629 ORF1
MVLESQGEYDSQWAVICSITPKIGCTPETLRVWVRQHERDTRGGDGGLTT
AERQRLKELERENRELRCSNDILRQASAYFAKAEFDRLWKK
>SSO_3898 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVRRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SSO_2931 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_1594 IS1 ORF
MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3741 IS21 ORF2
MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH
QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV
ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE
RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG
FADWGEMFGDHVLTTAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT
TPISDDEMVESGQHQ
>SSO_0466 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIIGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1620 IS2 ORF2
MALTLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAE
AFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>SSO_3658 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_1906 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SSO_1784 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SSO_0555 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_4188 IS1 ORF
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_3185 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCACSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW
QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>SSO_2271 ada, O6-methylguanine-DNA methyltransferase
MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE
NVSFYANASEALAAGFRPCKRCQPDKANPRQHRLDKITHACRLLEQETPV
TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV
TTSILNAGFPDSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR
CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI
ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP
KAVRAVASACAANKLAIVIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE
NEER
>SSO_2121 alkA, 3-methyl-adenine DNA glycosylase II, inducible
MYTLNWQPPYDWSWMLGFLAARAVSGVETVADNYYARSLAVGEYRGVVTA
IPDIARHTLHINLSAGLEPVAAECLAKMSRLLDLQCNPQIVNGALGKLGA
GRPGLRLPGCIDAFEQGVRAILGQLVSVAMAAKLTAKVVQLYGERLDDFP
EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGSLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKRRFPGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>SSO_2270 alkB, DNA repair system specific for alkylated DNA
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV
TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHDLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
GDCRYNLTFRQAGKKE
>SSO_3518 dam, DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKTYFPEAELYH
FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>SSO_1788 dbpA, ATP-dependent RNA helicase
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAVISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV
AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA
QRANIISDMLQIKLNWQTPPANSSIVPLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKT
SRVRLLK
>SSO_2018 dcm, DNA cytosine methylase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPSAHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>SSO_3308 deaD, inducible ATP-independent RNA helicase
MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ
AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT
RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP
GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT
ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN
EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL
REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI
GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR
LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ
GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV
GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL
PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFGGERR
EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>SSO_0778 dinG, probably ATP-dependent helicase
MALTAALKAQIAAWYKALQEQIPDFIPRVPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTANNQ
EEQKRCAKLKGDLDTYRWDGLRDHTDIVIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDEVLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRVEPSIDNEEQHIAEMAAFFRKQVESKKHLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>SSO_0268 dinJ, damage-inducible protein J
MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFD
LREPNQLTIQSIKNSEAGVDVHKAKDADDLFDKLGI
>SSO_0274 dinP, damage-inducible protein P
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK
FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL
SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAK
IASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTC
GDVQKCDLVMLLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA
EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ
EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG
L
>SSO_3652 dnaA, DnaA
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW
VRDKYLNNINGLLTSFCGADAPQLRFEVGTKPVTQTPQAAVTSNVAAPAQ
VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA
RAAARQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY
MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF
FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE
TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT
GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE
SHDIKEDFSNLIRTLSS
>SSO_4232 dnaB, replicative DNA helicase; part of primosome
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD
VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV
GGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQ
GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM
LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT
MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV
PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE
>SSO_4507 dnaC, chromosome replication; initiation and chain elongation
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>SSO_0196 dnaE, DNA polymerase III alpha subunit
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERVKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR
LRQSLEAHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI
GSEQVELEFD
>SSO_3203 dnaG, DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>SSO_3651 dnaN, DNA polymerase III beta subunit
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>SSO_0229 dnaQ, DNA polymerase III epsilon subunit
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV
YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA
AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA
RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA
TIQRIVRQASKLRVVFATDEELAAHEARLDLVEKKGGSCLWRA
>SSO_0457 dnaX, DNA polymerase III tau and gamma subunits
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVVPTAVMTPTQVPPQSAPQQAPT
VPLPETTSQVLAARQQLQCVQGATKAKKSESAAATRARPVNNAALERLAS
VTDRVQARPVPSALEKASAKKEAYRWKATTPVMQQKEVVATPKALKKALE
HEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDNAV
CLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPLEW
RQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI
>SSO_3099 endA, DNA-specific endonuclease I
MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGC
KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQD
GGRKNCAKDPVYRKMESDMYNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ
CAMKVDFKEKAAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNK
MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>SSO_2955 exo, 5'-3' exonuclease
MRGLFPISHPAVACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR
PPMPEELHDEMPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>SSO_3402 fis, site-specific DNA inversion stimulation factor
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>SSO_2289 gyrA, DNA gyrase subunit A
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>SSO_3649 gyrB, DNA gyrase subunit B
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE
ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP
QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG
VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI
GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE
GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE
QQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKI
LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD
VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE
AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR
YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD
VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL
LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL
WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>SSO_0966 helD, DNA helicase IV
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR
GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS
GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD
NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA
QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG
RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT
AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR
RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL
AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGERCDLDT
TYRFNSRIGKVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL
DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ
QADYVIIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL
TRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>SSO_0065 hepA, probable ATP-dependent RNA helicase
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPVQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRMEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>SSO_1446 himA, integration host factor (IHF), alpha subunit
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>SSO_0914 himD, integration host factor (IHF), beta subunit
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>SSO_0594 holA, DNA polymerase III delta subunit
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VTARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQTQLRQ
AVQLLTRTELTLKQDYGQSVWAELKGLSLLLCHKPLADVFIDG
>SSO_1119 holB, DNA polymerase III delta prime subunit
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA
ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHHGAAQVTNVDVPGLVVELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>SSO_4444 holC, DNA polymerase III chi subunit
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>SSO_4522 holD, DNA polymerase III psi subunit
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND
>SSO_1733 hrpA, helicase, ATP-dependent
MLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP
EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR
GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM
VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR
RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD
TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE
ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA
RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS
RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL
EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM
IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ
KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY
REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM
VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT
EKVTVYGLPIVAARKVNYSQIDSALCRELFIRHALVEGDWQTRHAFFREN
LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRISHDVISARHFDSWWKK
VSRETPDLLNFEKSMLIKEGAEKIRKLDYPNFWHQGNLKLRLSYQFEPGA
DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV
PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL
KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI
WSFGQLPESYEQKRGNYKVKAWPALVDERDSVAIKLFDNPLEQKQVMWNG
LRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDK
LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR
LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR
LEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIE
ELRVSYFAQQLGTPYPISDKRILQAMEQISG
>SSO_0160 hrpB, ATP-dependent helicase
MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR
AQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
HYLPLPAHQRFDEAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASR
IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFEPRTGLTRLITQRVSQASMTQRAGRAGRLEPGICLHL
IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPVVN
LMAAKRLLQMLGALDGERLSAQGQKMAALGNDPRLAAMLVSAKNDDEAAT
AAKIAAILEEPPLMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLAGAFADRIAHRRGQDGRYQLANGMGAMLDADDALSRHEWLIAP
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW
RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAAKWLPEYDWPAVDDESLLAALETWLLPHMTGVHSLRGLKSLDIYQ
ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGGVPLVLELLSPAQRPLQITRDLSAFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>SSO_4173 hupA, DNA-binding protein HU-alpha
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>SSO_0423 hupB, DNA-binding protein HU-beta
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>SSO_4466 insA, IS1 ORF1
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>SSO_P203 insB, IS1 ORF2
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P160 insB, IS1 ORF2
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_P067 insB, IS1 ORF2
MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SSO_2406 intC, putative prophage Sf6-like integrase
MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKIL
TVGKYPLMTLQEARDKAWTARKDISVGIVPVKAKKASSNNNSFSAIYKEW
YEHKKQVWSVGYATELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFED
RGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFL
PADQMPAFNKALATFSGSIVSLIATKVLRYTALRTKELRSMLWKNVDFEN
RIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKK
KPICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHEWPADAIEVQLAH
ANGGSVRGIYNHAQYLDKRREMMQWWADWLDEKVE
>SSO_0746 intE, putative integrase fragment
MPVHLIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGK
AASAKLIRSTLSDAFREAIAEGHITTKPVAATRAAKSEVRRSRLTADEYL
KIYQTAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKT
GVKIAIPTTLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVS
RYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSD
TMASQYRDDRGREWDKIEIK
>SSO_P090 ipaB, IpaB
MHNVSTTTTGLSLAKILASTELGDNTIQAANDAANKLFSLTIADLTANKN
INTTNSHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN
KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK
IKDLENKINQIQTRLSELDPDSPEKKKLSREEIQLTIKKDAAVKDRTLIE
QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT
QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA
EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV
QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL
KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN
SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ
EVIADLLASMSNSQANRTDVAKAILQQTTA
>SSO_2500 lig, DNA ligase
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
LPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLEQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKSAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEVLEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDEAEMLRLLGS
>SSO_1134 mfd, transcription-repair coupling factor
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ
NALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQDIISSRLSTLYQLPT
MQRGVLIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGY
RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ
RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK
GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTL
ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN
ANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDKVRNLALICESDLL
GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE
AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW
SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE
TTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK
QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVA
EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD
ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL
ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR
GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL
EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ
QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF
GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL
IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA
>SSO_2988 mutH, methyl-directed mismatch repair
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALAGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE
DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>SSO_4355 mutL, enzyme in methyl-directed mismatch repair
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG
AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA
SISSVSRLTLTSRTAEQQEAWQAYAEGRDMNVTVKPAAHPVGTTLEVLDL
FYNTPARRKFLRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQYR
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL
AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD
VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRS
IPENRVAAGRNHFAEPAAREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKLKAPEPQEPALAANSQSFGRVLTIVHSDCALLE
RDGNISLLALPVAERWLRQVQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL
EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS
VFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLL
QSVDLHPAIKALKDE
>SSO_3772 mutM, formamidopyrimidine DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>SSO_2880 mutS, methyl-directed mismatch repair
MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>SSO_0107 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV
VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGEPWGKEGQ
PGEWMSLVGLNADDFPPANEPVIAKLKRL
>SSO_3235 mutY, adenine glycosylase
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP
YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA
VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS
LCPLQNGCIAAANNSWSLYPGKKTKQTLPERTGYFLLLQHEDEVLLAQRP
PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD
IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>SSO_0665 nei, endonuclease VIII/DNA N-glycosylase with an AP lyase activity
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVET
RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT
ILLYSASDIEMLRPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLDIPRFSYATRGLVDENKHHGALFRFKVFHRDGEPCERCGSIIEKTTL
SSRPFYWCPGCQH
>SSO_4171 nfi, endonuclease V
MIMDLASLRAQQIELASSVIREDRLDKDPPDLIAGVDVGFEQGGEVTRAA
MVLLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPD
LVFVDGHGISHPRRLGVASHFGLMVDVPTIGVAKKRLCGKFEPLSSEPGA
LAPLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRL
PEPTRWADAVASERPAFVRYTANQP
>SSO_2215 nfo, endonuclease IV
MKYIGAHVSAAGGLANAAIRAVEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKFRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARIVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQD
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>SSO_2431 nohB, bacteriophage DNA packaging protein
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIRWY
AERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELK
NARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDF
LKRDIIKAMNKAAALDELIPGLLSEYIEQSG
>SSO_1525 nth, endonuclease III
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH
NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>SSO_1276 ntpA, dATP pyrophosphohydrolase
MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAP
QAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVTRNTE
SWFCLALPHERQVVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA
>SSO_1798 ogt, O-6-alkylguanine-DNA/cysteine-proteinmethyltrans ferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG
YERISATNPGGLSDKLRDYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVQRKEWLLRHEGYLLL
>SSO_3161 parC, DNA topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKVAVKGKSNQSVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASNGDS
EE
>SSO_3168 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL
QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPNSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>SSO_0657 phrB, deoxyribodipyrimidine photolyase (photoreactivation)
MTTHLVWFRQDLRLHDNLALAAACRNSSARVLALYIATPRQWATHNMSPR
QAELINAQLNGLQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHL
FYNYQYEVNERARDVEVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIEPSPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQALDGGVGSVWLNELIWREFYRHLITYHPSLCKHRPIIA
WTDRVQWQSNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDLEGEFIRQWLPELRNVPGKSVHEPWKWAQKAGVKLDYPQP
IVEHKEARVQTLAAYEAARKGK
>SSO_4036 polA, DNA polymerase I
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI
MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK
AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT
LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT
AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA
TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK
GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP
VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN
SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE
DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK
VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK
KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP
KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV
IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT
SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG
TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI
HQLMENCTRLDVPLLVEVGSGENWDQAH
>SSO_0066 polB, DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV
PRAQHILQGKQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG
VTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVAS
RPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRLPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHTSP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTF
VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYVRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>SSO_4104 priA, primosomal protein N (factor Y), putative helicase
MPVAHVALPVPLPRTFDYLLPEGMAVKAGCRVRVPFGKQQERIGVVVSVS
DVSELPLNELKAVVEVLDVEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL
PILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKI
WRDQVATLEFNDAALQTLRKKGLCDLASETPEFSDWRTNYAVSGERLRLN
TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV
MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV
IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI
PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQANNQVILFLNRRGFAPALLCHDCGWIAECPRCDH
YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP
LFPDVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD
VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH
PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP
LFLQQLRNLILSSPLADDKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR
LQHIINGTLALINTIPDSRKVKWVLDVDPIEG
>SSO_4384 priB, primosomal replication protein N
MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM
PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI
DSGD
>SSO_0454 priC, primosomal replication protein N''
MKTALLLEKLEGQLATLRQRCAPVSQFATLSARFDRHLFQTRATTLQACL
DEAGDNLAALCHAVEQQQLPQVAWLAEHLAAQLEAIAREASAWSLREWDS
APPKIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVE
AYEARLARCRHALEKIENRLARLTR
>SSO_2843 recA, DNA-dependent ATPase
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL
TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>SSO_2977 recB, DNA helicase ATP-dependent dsDNA/ssDNA exonuclease V subunit
MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF
PRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYEHL
LEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ
QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY
LQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS
GIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKFSQRFLEDRTKA
GGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRGELG
FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA
PGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK
MWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVR
ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL
QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ
IWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ
LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP
LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEVERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA
AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN
WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS
PGTFLHSLFEDLDFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA
PLNETGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC
PPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA
AMQAHRYDLQYQLYTLALHRYLRHRIADYDYDRHFGGVIYLFLRGVDKEH
PQQGIYTTRPNAGLIALMDEMFAGMTLEEA
>SSO_2979 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFSIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTHQLGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIRAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLSAFFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPE
TMEAIVEQSQRFLLPLFRFNQS
>SSO_2976 recD, DNA helicase ATP-dependent dsDNA/ssDNA exonuclease V subunit
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEASHPLLATCVSEIGELQNWEESLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPVSDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERAGQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRHSHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPIVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFSSRG
>SSO_3650 recF, ssDNA and dsDNA binding protein
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA
ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>SSO_3753 recG, DNA helicase
MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL
YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFS
AAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP
VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT
LHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP
LSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDVPMMRLVQGDVGS
GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFEPLGIEVGW
LAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH
RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE
LPPGRTPVTTVAIPDTRRTDIIDRVRHACMTEGRQAYWVCTLIEESELLE
AQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVAT
TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT
PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD
LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA
>SSO_3045 recJ, ssDNA exonuclease
MKQQIQLRRRAVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGETLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQVEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEADKFELFQQRFG
ELVTEWLDPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>SSO_2772 recN, protein used in recombination and DNA repair
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALEIARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKFD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLNKKARLQELARLLGGSEVTRNTLANAKEL
LAA
>SSO_2689 recO, protein interacts with RecR and possibly RecF proteins
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGATGTPEPALRRFELALLGHLGYGVNFTH
CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>SSO_3996 recQ, ATP-dependent DNA helicase
MAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMPTG
GGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNSTQT
REQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEAHC
ISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGLND
PLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAKVE
DTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGMGI
NKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWLRR
CLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCGNC
DICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIRDY
GHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAARP
VLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIADES
NVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMALIR
AHVDGDDEE
>SSO_0459 recR, recombination and repair
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>SSO_3949 rep, rep helicase, a single-stranded DNA dependent ATPase
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVGQTLGRKEARGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPS
QAAASAIGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR
GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>SSO_3951 rhlB, putative ATP-dependent RNA helicase
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>SSO_0776 rhlE, putative ATP-dependent RNA helicase
MSFDYLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG
QGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRKP
AAAQ
>SSO_0228 rnhA, RNase HI
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM
AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN
VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG
YQVEV
>SSO_0195 rnhB, RNAse HII
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALYEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS
>SSO_1504 rnt, RNase T
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLPAAEEV
>SSO_2443 rus, endodeoxyribonuclease RUS
MNTYSITLPWPPSNNRYYRHNRGRTHISAEGQAYRDNVTRIIKNAMLDIG
LAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVV
KMPVTKGGRLELTITEMGNE
>SSO_1280 ruvA, Holliday junction helicase subunit B
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARTDASSETLIREALR
AAL
>SSO_1281 ruvB, Holliday junction helicase subunit A
MIEADRLISAGTTLPEDGADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP
>SSO_1278 ruvC, Holliday junction nuclease
MAIILGIDPGSRVIGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA
GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF
EYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH
CHVSQNAMQMSESRLNLARGRLR
>SSO_2081 sbcB, deoxyribophosphodiesterase
MMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAIRTDNEFNVIGEPEVFY
CKPADDYLPQPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCIL
GYNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPE
GINWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTR
QPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWRGNTSWVAPLA
WHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKADLGDNAAVPVKL
VHINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAI
FAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVD
KRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYADELQML
AQQYADNKEKVALLKALWQYAEEIV
>SSO_0374 sbcC, ATP-dependent dsDNA exonuclease
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL
EKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLT
RQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT
WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLTLTADEVASALAQHSEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ
NVTLEQTQRNVALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALR
GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTTSLNITLQPQDDIQP
WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL
TGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET
LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ
QLKQDADNRQQQQTLLQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG
LTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTL
SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL
DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK
>SSO_0375 sbcD, ATP-dependent dsDNA exonuclease
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETTQTHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA
>SSO_2079 sbmC, SbmC protein
MNYEITQEEKRTVAGFHLVGPWEQTVKKGFEQLMMWVDNKNIVPKEWVAV
YYDNPDETPAEKLRCDTVVTVPNNFTLPENSEGVILTEISGGQYAVAVAR
VVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYV
AVQPKHH
>SSO_0641 seqA, negative modulator of initiation of replication
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR
VASPAIVEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS
LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN
TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI
>SSO_3426 smf, Predicted Rossmann-fold nucleotide-binding protein
MVDTEIWLRLMSISSLYGDDMVRIAHWLAKQSQIDAVGLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLVTTDYPGALFVEG
ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDG
VAHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLLEHGGALVSEFPLD
VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQRREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS
PDQQDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>SSO_P113 spa32, Spa32
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN
YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP
DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR
NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL
QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV
GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>SSO_2702 srmB, ATP-dependent RNA helicase
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>SSO_4239 ssb, ssDNA-binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>SSO_3841 tag, constitutive 3-methyl-adenine DNA glycosylase I
MERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVL
KKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR
AYLQMEQNGEPFADFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALK
KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYPGNKP
>SSO_4014 tatD, Mg-dependent DNase
MEYRMFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQ
KLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDF
NRNFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKL
PGAVLHCFTGTREEMQACVAHGIYIGITGWVYDERRGLELRELLPLIPAE
KLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAA
TTDANVKTRFGIAF
>SSO_1869 topA, DNA topoisomerase type I
MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADS
TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA
EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA
FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA
VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN
KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG
VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE
SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTIGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL
PAVNKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI
ISTIQDRGYVRVENRRFYAEKIGEIVTDRLEENFRELMNYDFTAQMENSL
DLVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP
TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED
AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR
IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP
PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG
WSAFYVDGKWVEGKK
>SSO_1392 topB, DNA topoisomerase III
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ
PDAYDSRYARWNLADLPIIPEKWQLQPRPSVTKQLNVIKRFLHEASEIVH
AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERALDRLR
SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT
PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ
DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE
AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA
ISVHAPDLLLQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLIARQYLMQFCPDVVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL
GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM
TGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHS
TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY
QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEVGSG
AIA
>SSO_2706 ung, uracil-DNA-glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQRGETPIDWMPVLPAESE
>SSO_4238 uvrA, excision nuclease subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>SSO_0758 uvrB, DNA repair excision nuclease subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>SSO_1205 uvrC, excinuclease ABC, subunit C
MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH
TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK
HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG
RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE
FEEAARIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR
QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN
LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ
QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR
AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL
AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP
DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML
LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>SSO_3986 uvrD, DNA-dependent ATPase I and helicase II
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE
NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA
HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK
DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW
LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD
QSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN
NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC
AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL
AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE
KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW
QDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY
VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV
SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV
AFQGQGIKWLVAAYARLETV
>SSO_2017 vsr, DNA mismatch endonuclease
MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGSP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
ISRLQELGWRVLIVWECALRGREKLTDEALTERLEEWICGEGASAQIDTQ
GIHLLA
>SSO_3777 waaP, lipopolysaccharide core biosynthesis protein
MVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSYFLK
WHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAFGEK
GINPLSRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVATMV
RDMHAVGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRVPRR
WRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFVAPLKDILKQEQGLLSQAE
AKATKIRERTIRKSL
>SSO_2104 wcaH, GDP-mannose mannosyl hydrolase
MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG
GRVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFT
THYVVLGFRFRVAEEELILPDEQHDDYRWLTPDALLASNDVHANSRAYFL
AEKRAGVPGL
>SSO_3984 xerC, site-specific recombinase, acts on cer sequence of ColE1, effects chromosome segregation at cell division
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ
CDVTMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>SSO_3047 xerD, site-specific recombinase
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>SSO_2591 xseA, exonuclease VII, large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE
SMQPAGEGLLQQKYEQLKAKLQAEGLFDQQYKKPLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPTAVQGDDAPGQIVRAIELANQRNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRTGQQQQRLTQRLNQ
QNPQPKIHRAQTRIQQLEYRLAETLRVQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGNVLKKVKQVKAGEMLTTRLEDGWIESEVKNIQPVKK
SRKKVH
>SSO_0399 xseB, exonuclease VII small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDTSLPPFTPDNE
>SSO_1408 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG
DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>SSO_0271 yafM, conserved hypothetical protein
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRNAIIKVKRDRPFEI
NAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWE
HAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDW
AGDVTDINAGERIIL
>SSO_0372 yaiD, conserved hypothetical protein
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG
SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK
KTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDT
LALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSL
LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL
KRLKFCDELRDQNEDIDREDFALRFDADFILMTGELAALIQNLIEGLGGE
AQR
>SSO_0430 ybaV, conserved hypothetical protein
MRFSTIVSVVTLVWGISPRQPSGKNIIRWLLKKRTNGSVRYCQYTSVETK
AEAPAAQSKAAVPAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIV
SYREEYGPFKTVEDLKQVPGMGNSLVERNLAVLTL
>SSO_0442 ybaZ, conserved hypothetical protein
MLVSCAMRLHSGVFPDYAEKLPQEEKMEKEDSFPQRVWQIVAAIPEGYVT
TYGDVAKLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTISLTGPDL
QRQRQALLAEGVMVSGSGQIDLQRYRWNY
>SSO_0259 ybfL, putative receptor protein
MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI
KTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF
AVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDV
PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE
KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI
LTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS
>SSO_0863 ybjD, conserved hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNMEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIVTTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>SSO_0893 ycaJ, putative polynucleotide enzyme
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLGTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI
AELVNGDARRALNTLEMMADMAEVNDSGKRVLKPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>SSO_1120 ycfH, conserved hypothetical protein
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM
RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY
YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC
GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRL
LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF
ARLFHIDASRLQSIR
>SSO_1667 ydcC, putative receptor
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHST
DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSN
EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEKKFPLTELNNPAHDSSAMSEKSHGREEIRLHIVCDVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKEPEMMVRYYISSADLTAEKFATAIR
NHWHVENNLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINVLTNDKVF
KAGLRRKMRKAAMDRNYLASVLAGSGLS
>SSO_1347 yeaB, conserved hypothetical protein
MEYRSLMLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>SSO_2240 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNIDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRSDEKALFRDCIYELPLRYMIKHSYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAE
KRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIENFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RPPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>SSO_2312 yfaO, conserved hypothetical protein
MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWALSGGGVESGERIEEAL
RREIREELGEQLLLTEITPWTFSDDIRTKTYADGRKEEIYMIYLIFDCVS
ANREVKINEEFQDYAWVKPEDLVHYDLNVATRKTLRLKGLL
>SSO_2547 yffH, conserved hypothetical protein
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQTSHLMD
>SSO_2753 yfiL, conserved hypothetical protein
MMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVGMEDAISGSAIKDD
DAFSDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSGKSFPASCNNVESA
SQLHEVWQKGADENASTIRVMLPTY
>SSO_2902 ygbF, putative inner membrane protein
MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQL
AGCGNVVMAWATNIESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ
>SSO_2987 ygdP, putative invasion protein
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNTSAYRRKRG
>SSO_3206 ygjF, conserved hypothetical protein
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ
LKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ
PQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLE
KLVEAYRELDQALVVRGR
>SSO_3301 yhbQ, conserved hypothetical protein
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD
>SSO_3403 yhdJ, putative methyltransferase
MTMRTGCEPTRFGNEAKTIIHGDALAELKKLPTESVDLIFADPPYNIGKN
FDGLIEAWKEDLFIDWLFEVIVECHRVLKKQGSMYIMNSTENMPFIDLQC
RKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAIL
VEAKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ
KPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEY
IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLIAK
>SSO_3703 yhhF, conserved hypothetical protein
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>SSO_0260 yhhI, putative receptor
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRKGAIHVISAFSTMHSLVLGQIKTDEKSN
EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR
NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVLAESGLS
>SSO_3759 yicF, putative enzyme
MMMKVWMAILISILCWQSSVWAVCPAWSPARAQEEISRLQQQIKQWDDDY
WKEGKSEVEDGVYDQLSARLTQWQRCFGSEPRDVMMPPLNGAVMHPVAHT
GVRKMVDKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNG
LKGEDWTQKVSLISAVPQTVSGPLANSTLQGEIFLQREGHIQQQMGGINA
RAKVAGLMMRQGNSDTLNSLAVFVWAWPDGPQLMTDRLKELATAGFTLTQ
RYTRAVKNADEIARVRNEWWKAKLPFVTDGVVVRAAKEPESRHWLPGQAE
WLVAWKYQPVAQVAEVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNI
GSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFN
SLTCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHI
FSWLLLTPEQLQNTPGIAKSKSAQLWHQFNLARKQPFTRWVMAMGIPLTR
AALNASDERSWSQLLFSTEQFWQRLPGTGSGRARQVIEWKENAQIKKLGS
WLAAQQITGFEP
>SSO_4169 yjaD, conserved hypothetical protein
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLVQQQRRHDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL
MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>SSO_4529 yjjV, Mg-dependent DNase
MICRFIDTHCHFDFPPFSGDEEASLQRAAQAGVGKIIVPATEAENFARVQ
ALAEKYQPLYAALGLHPGMLEKHSDVSLDQLQQALERRPAKVVAVGEIGL
DLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRIHDKLAMHLKRHDL
PCTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLA
SLLLETDAPDMPLNGFQGQPNRPEQAVRVFDVLCELRPEPEDEIAEVLLN
NTYALFSVSG
>SSO_1353 yoaA, putative enzyme
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGCAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDLLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>SSO_3103 yqgF, conserved hypothetical protein
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY
>SSO_3172 yqiE, conserved hypothetical protein
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>SSO_3294 yraN, conserved hypothetical protein
MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI
DLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS
>SSO_3528 yrfE, conserved hypothetical protein
MSKSLQKPTILNVETVAHSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV