Gene list
Applied filters:
COG category: Replication, recombination and repair
Organism: Shigella sonnei Ss046, Ss046
Gene type: CDS
Number of genes found: 785
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Shigella sonnei Ss046, Ss046 >SSO_0150 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0020 hypothetical protein MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGVSTPRCLTVICHFHAR >SSO_2909 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1821 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2654 IS629 ORF2 MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKAEVIHRKSWKNRAEVELAILTWVDWYNNRRLLERLGHTPPAEA EKAYYASIGNDDLAA >SSO_P164 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2670 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3872 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_0427 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1731 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3855 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0747 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1261 putative crossover junction endodeoxyribonuclease MRHEFILPYPPTVNTYWRRRDNTYFVSKAGERYRRDVALIVRQQRLKLSL SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVR GQPVSGGRLGVKIYPIMHEEQVKK >SSO_3606 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1017 putative transposase subunit MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT S >SSO_1612 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_0238 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_3826 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P149 conserved hypothetical protein MSDATRLKNRLSVRFSEVGLVLNAGKTNIAYIDTFKRRNVATSFSFLGYD FKVRTLKNFKGELYRKCMPGASNAAMCKITETIKKWRIHRSTAESLLDFA RRYNAIVRGWIEYYGKFWSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQ RKLALVRKQYPKLFAHWYLLRASNE >SSO_0325 IS2 ORF1 MVVSAIASTPQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQH GVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGK KTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >SSO_1669 putative transposase subunit MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT S >SSO_2730 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P132 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1152 putative phosphohydrolase MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD IDCCRWVSAEEILKASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK GVI >SSO_3593 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVW >SSO_0876 putative single stranded DNA-binding protein MTAQIAAYGRLVADPQLKTTSKGTQMTMASMAVPLPCSQADDGTATIWLS VLAFGRQADALAKHQKGELVSVAGNMQVSQWTGQNGETRQGWQVIADSVI SARTARPGGKKGQQGQATDALNRAKQQAGNDDPYGDNIPF >SSO_3501 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3617 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1311 IS1 ORF MSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >SSO_2625 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1319 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2507 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P200 conserved hypothetical protein MRKYIPLVLFIFSWPVLSADIHGRVVRVLDGDTIEVMDSLKAVRIRLVNI DAPEKKQDYGRWSTDMMKSLVAGKTVTVTYFQRDRYGRILGQVYAPDGMN INQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWIWM HRK >SSO_3935 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_0277 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_0270 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P194 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1706 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEVISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >SSO_1672 ISSfl2 ORF MALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRAFASAAHLAAY AGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTR KMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_1989 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3277 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1684 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0077 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1565 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDHKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_3934 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_2175 hypothetical protein MSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPARISPA IQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVARELAG FIWDMGRIAMSVAQQPQCHK >SSO_3790 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P221 IS630 ORF MTWELILDGYSESSYSATPRFAAARLPWFRVIYQPVYSPWVNHVERLWQA LHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_1665 conserved hypothetical protein MHSLILGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPENDSYAISEKSHGREE IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS GIRHIAINILTNDKVFKAGLRRKMRKAAMGRNYLASVLTGSGLS >SSO_3014 conserved hypothetical protein MVIYAFNKRLMEYFMKGKSALTLLLAGIFSCGTCQATGAEVTSESVFNIL NSTGAATDKSYLSLNPDKYPNYRLLIHSAKLKNEIKSHYTKDEIQGLLTL TENTRKLTLTEKPWGTFILASTFEDDKTAAETHYDAVWLRDSLWGYMALV SDQGNSVAAKKVLLTLWDYMSTLDQIKRMQDVISNPKRLDGVPGQMNAVH IRFDSNSPVMADVQEEGKPQLWNHKQNDALGLYLDLLIQAIDTGTINAED WQKGDRLKSVALLIAYLDKANFYVMEDSGAWEEDARLNTSSVALVTSGLE RLSNLLSKKDSVFVSDLLREAKANELDEPLSTTRLNHLIDKGYERITLQL DLGGESPGYLEKDKHYREADAALLNVIYPANLAKINTRRKEQVLKIVKKL AGPYGIKRYEKDNYQSANFWFNDIKTDTDQNSHAKRDGFAPIFPDICYHL THYKPAAADIPVASDNPAHYADAIRYNARTPLQGSLLPLTRLVWA >SSO_2028 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_4296 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3565 IS4 ORF MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWK YPTAPKKSQSVA >SSO_3742 IS2 ORF2 MERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPES NGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYR SPREYLRQRACNGLSDNRCLEI >SSO_1192 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0301 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P064 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >SSO_0467 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P177 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1919 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P183 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHQVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_1049 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1916 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3747 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0730 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_2439 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0581 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_3226 putative transposase MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLV PKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY >SSO_3676 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1564 IS2 ORF2 MRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLR VTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEW LTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYI SIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSD NRCLEI >SSO_3086 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDAVVIWM TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS VELHDKVIGHYLNIKHYQ >SSO_1268 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P174 putative IS orf MLTELLTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQELDALL YGSASTVPVVLTESTVMPKLPVVKKRPRRP >SSO_0326 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1730 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1915 hypothetical bacteriophage protein MRTVITYLRFSSAIQGAEGADSTRRQNDLFKQWLKKNGDAQIVASFSDEG LSGYKGKHLTGQFGDMLARIEAGEFPEGTILLVESIDRIGRLEHLETEAL MNRILGNGIEIHTLQDGLIYTKDALADDLGISIIQRVKAYIAHQKSKQKS FRVSQKWGQRAKLALAGEQRLTKMVPGWIDPETFKLNEHAETVRLIFKLL LDGESLHNIARHLQSNGIKSFSRRKDANGFSVHSVRTILRSETTIGTLPA SQRNDRPAIPNYYEGVVDIPTFNKAQEILDKNRKGRTPASDNPLTINIFK GLFRCQCGASVHPTGTKNKYAGVYRCNNHLDGRCDVPPLKRKPFDRWMID NFLGMIDVGNDGESERKIAALQHEVEIVTARIKKATALLLEMDDIDELKI QLKELNQKRTELQTTIDNMRRKASLTDKELPQLKDIDLMTKAGRVECQLI LSKHLKGLTLGKDSVTVTLQNDTEITIPTNPLPLNDGSPIFEIADKELLD IDAYQL >SSO_P236 putative transposase MWCFFNLFGVLIPIDERNLTRERTQVGLQAARARGRKGGRPKTLSKDKQA LAVQLYNEKKHTVAQICVLMGISRPTLYKYIESARLFKK >SSO_2759 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_0407 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVLGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHHQ >SSO_2401 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P122 putative transposase MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGYYIIHRK NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT S >SSO_2448 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1188 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1803 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0342 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_3612 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1253 putative resolvase MAILGYCRVSTDDQSITNQQMQIEEAGYNIAKWFADEAVSGSVKASLRNG FSSLLAYAREGDTVVVVAVDRLGRDTIDVLSTVKALQAKGVTVISLREGF DLSSAMGEAMLGIMSTLAQLERSLIAERRKAGIERAKAEGVHMGRPVKAS SEAVQMLISQGKTRLQIQEELGISRATYYRLAK >SSO_P228 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_2447 IS21 ORF1 MLSREDFYMIKQMHQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHASLWQQVSQGGTSTTECL >SSO_4216 putative transposase MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLV PKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY >SSO_1171 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0656 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P173 IS4 ORF MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_3989 IS4 ORF MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWK YPTAPKKSQSVA >SSO_P175 IS3 ORF2 MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG DITYLRTDEVRLHPVSTEPHAF >SSO_4309 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P141 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW VRQHERDTGGGEVGSPPLNVSV >SSO_3824 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_1397 conserved hypothetical protein MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFADGKVELDESQQQALV RELNEELGIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA LVWCSPEEALQYPLAPADIPLLEAFMALRAARAAD >SSO_0729 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_4312 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0969 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTLRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2578 putative DNA replication factor MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL >SSO_P001 putative resolvase, fragment MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI STVL >SSO_0725 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3285 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTYEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P071 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRKNHGSLAAANRGVAEYELSE >SSO_0880 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_1940 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_3590 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKHVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTIGGFNSETV SAPVI >SSO_1264 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_0659 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2141 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3596 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2746 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P217 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2449 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLTRLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2760 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_P152 IS600 ORF2 MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA PMESFWGTLKNGTGTE >SSO_0144 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P039 IS629 ORF1 MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK >SSO_1567 IS2 ORF2 MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMCQRQ >SSO_3838 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1376 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2671 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P037 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_3129 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2291 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_0361 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1191 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2433 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_4439 IS1 ORF MDEQWGYVGAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSP FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRK SLSFSKSVELHDKVIGHYLNIKHYQ >SSO_2246 IS2 ORF2 MANYRLISLALTLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDN GEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPA SPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTI KRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQAS NGLSDNRCLEI >SSO_1255 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3013 IS2 ORF2 MSRAQLHVILRRTDDWMDGRHSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_1758 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2472 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P140 IS629 ORF2 MPLLDKLREQYGVGSVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR GKKVHTTVSRKAVAAGDRVNRHQGNMPRTPGGPQRLVYVVSAADKDKHTS AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT >SSO_4004 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLWQRACNGLSDNRCLEI >SSO_4033 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P137 putative reverse transcriptase, fragment MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA VKENWQWKPAVAYCCYADDCVPRRRVLGT >SSO_2022 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3591 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_4491 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P036 IS21 ORF1 MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_3288 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0718 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2910 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P172 putative IS orf MAGRRLGVPKSTVCGMFVRFRNAGLSWPLPAGMSEQELDALLYGSASTVP VVLTESTVMPKLPVVKKRPRRPNADQLRIS >SSO_1068 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_0044 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2182 putative integrase MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFRGKISSDGCLKKEQMVVSAIAS TPQLPMLGEKRLVSSVTKEDLLFVRRDLLTGYQKLSNGKISSIKGRSVVT VNYYMTTIAGMFQFATDNGYTSGNPFNGLTPLKKSKIEPDPLTRDEFIRF IEACRHQQTKNLWIIAVYTGIRHGELVSLAWEDIDLKARTITIRRNYTKL GEFTPPKTDAGTGRTIHLVQPAIDALKSQAEMTMLGKQHSVEVKQREYGR STVHKCTFVFSPQVIKQRQFSGPHYKVDSIRESWTSILKRAGLRHRKSYQ SRHTYACWSLAAGANPSFIASQMGHTNAQMVFNVYGAWMKDNNHEQIELL NKRLSESVPCMPHKKVG >SSO_0740 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P075 putative IS orf MLTDIFNSNYQCYGYRRLHAMLRHEGGRLSEKVVRRLMVEEQLVVSRNRR RRYSSYCGEIGPAPDNLIARDFKAEQPNQK >SSO_1738 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1176 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_3251 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDAVVIWM TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS VELHDKVIGHYLNIKHYQ >SSO_P119 IS600 ORF2 MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA TTSSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV YTCEIVGYAMGERMTKELTGKALFMALRSQHPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISS AYGKTD >SSO_2756 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1708 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3112 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3819 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P176 conserved hypothetical protein MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPKGREVTFSAFSDWLPRNR AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA EIYKFFTNALYVALTRATHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN PGLKIRQGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE RSEKSVGYWVGGIRRAAQKA >SSO_2932 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0701 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_3377 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P165 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P130 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAFAAGDRVNRQFVAERPDQLWVADFTYVSTCVSASDIRR >SSO_P052 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1668 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2775 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_1655 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P045 ISSfl2 ORF MALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRAFASAAHLAAY SGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTR KMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_1459 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1755 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P035 conserved hypothetical protein MFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIPTVSDR IAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCWRYSWILE VDISAFFDHVRHDLVLKALEHHGMPKWVILYCRRWMEAPMQSCENGELIT RTRGTPQGGVISPLLANLFHHYAFDLWMEREYRGYRLRGTLTIL >SSO_1727 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_4458 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_P025 IS3 ORF1 MTKTVSTSKKTRKQHSPEFRSEALKLAERIGVAAAARELSLYESQLYAWR SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK >SSO_3205 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDIPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWQGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_2297 IS600 ORF2 MSISPLFRWPNSVCHFTRLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >SSO_1779 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1351 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2649 putative protein encoded within IS MNDISSDDIFLLKQRLAERQLKTKPLLKSLESWLREKMKTLSRHSELAKA FAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKNYLFFGSDHGG ERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVGELLPWRVALP TE >SSO_P026 IS3 ORF2 MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS RKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLA VVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHSDRGSQ YCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHGEHFIS REIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >SSO_1247 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1654 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_3408 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2722 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLPLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1978 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3901 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1031 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0237 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_1918 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1616 IS600 ORF2 MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD RGSQYCAYDYRVIQEQLV >SSO_1700 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_4483 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_2299 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3231 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_1920 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_4474 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_3744 putative integrase MALTDIKVKTAKPKDKPYKLADGGGMYLLINTNGSKYWRMKYRFAGKEKM LSIGVYPDVTLAGAREKRSEARKLLAAGGDPGEAKKEEKIAQQMSLKNTF EAIAREWHQSKADRWSLRYRDEIIDTFEKDIFPYIGKRPIAEIKPMELLE ALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPDLASALATPK KVHFPFLTANELPHFLNDLAGYTGSIITKTATQIIMLTGVRTQELRFARW EDIDFETKLWEIPAEVMKMKRPHIVPLSEQVIMLFKQLEPISKHHPLVFI GRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILHEQGFNSAWI EMQLAHVDKNSIRGTYNHALYLDGRREMMQWYADYIDSLSSRES >SSO_0719 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P127 putative transposase, fragment MCWGRTALYMAALEAPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTI MNAMLRKNEEWNESYL >SSO_3072 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2330 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARQGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1464 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3127 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSCIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3825 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3214 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_2680 IS629 ORF1 MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_1759 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1540 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1312 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGND LPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFV KTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQ RACNGLSDNRCLEI >SSO_1671 ISSfl2 ORF MPNCDRFRCHFAPHALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRI RGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLGFAG >SSO_1991 IS2 ORF2 MDGRHSHHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN AKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRC DNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNEL PASPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK TIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQ ASNGLSDNRCLEI >SSO_1393 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1256 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2667 putative protein encoded within IS MIPLPSGIKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS GSQVKLLWSTSDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI DWRQPKRLLTSLTML >SSO_2776 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVWICPYISRHLLSLNPLQARCRRYSRGER >SSO_1263 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1988 putative virulence protein MRRFAGACRFVFNRALARQNENHEVGNKYIPYGKMASWLVEWKNATETQW LKDAPSQPLQQSLKDLERAYKNFFQNRAAFPRFKKRGQNDVFRYPQGVKL DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGKWYISIQTESEVS TPVHPSASMVGLDAGVAKLATLSDGTVFEPVNSFQKNQKKLARLQRQLSR KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK VSNMSKSAAGTVSQPGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWRGG QVLAVPPAYTSQRCACCGHTAKENRLSQSKFRCQVCGYTVNADVNGARNI LAAGHAVLACGEMVQSGRPLKQEPTEMIQATA >SSO_3992 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLTRLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P216 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2793 putative helicase MITGVWKYRGKSSVHQPPHCRQDHRRGAQLVVVQLAQLPFLYQRCGCMTT PVWRNDDLEGAVIGAFFLRGADHEVMDILITLPADIFSVRAYRDIYTGIC RQARVSGVIDPVLLCNEMPELAPVITDTGRKTWVKSSLEHYVAALRRNAA LRDAEKTLNEALQKLRDAHTCEAAEDALKDAQNMMVTLSTGKGVIQPVHI DDVLPEVVERVECRNQGLEKSRTLMTGIDELDAKTGGMEPGDLVFIAARP SMGKTELALDIIDKVTEQGHGVLLFTMEMANIQIGERMVSAAGGMPVSRL KSVAHFEDEDWARFSQGVGRMTGRNIWMVDQANLTIDEICATTKHHLIKH PETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIALSQL SRGVESRPNKRPMNSDLKNSGEIEADADIILMLYRDEVYNPDTQATGIAE INITKQRNGSLGTIYRRFYNGHFLPVDQESAQVLSTPMRQPQPRRYSNTR TDSSKMERFF >SSO_P082 IS100 ORF1 MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC >SSO_4109 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1063 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_0824 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3611 IS629 ORF2 METTFVLDALEQALWARRPSGTIHYSDKGSQYVSLAYKARLKEAKLLAST GSTGDSYDNAMAEIIKGL >SSO_P151 putative reverse transcriptase, fragment MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP GVPFERYADDVVCHCHSQWQADALISGLRQRLAQCGLQLHPQKTRIVYCK DADRRGDYPETSFDFLGYTFRPRLSMNRWGKTFVNFSPAMSARAGKAIRQ EVRRIAVTSPCTSWRICSMRKSEAG >SSO_4469 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3684 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1928 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3030 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW VVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1902 putative endonuclease of cryptic prophage MTERIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKL SLSGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLMDDEQFDEINI VRAQPVSGGRLGVKIYPIMLEGQVKK >SSO_1585 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2402 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P068 ISSfl1 ORF2 MCRRCSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA VIPRKSNEKMASDGRAQLDV >SSO_2157 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0516 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHHQ >SSO_2724 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2669 putative transposase MGELLLHPGISTTEASMNNNNTLYVGLDVHKESITVAYAINSEPVELMGK IGTSPTDIQNLCKRLRSKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCA PSLIPKKPGERVKTDRRDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARA WASARDDLRHARQRLKSFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWR QLAFDEHRRTIEDRQAQCERLESALKEAVTEWRLYPVVEALQAMRGIQFI TAVGLISELGDLTRFEHPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSY ARKLLVEAAWSYRHPARISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRK LQAKGKNVNITIVAVARELAGFIWDMGRIAMSVAQQPQCHK >SSO_1292 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2801 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P124 IS3 ORF2 MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD SVVLAAAFTRSKQRYGAPRLTDELRAQGYHFNVKTVAASLRRQGLRAKAS RKFSPVSYRAHGLRCTGNSGHHHLFFF >SSO_3563 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2331 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1701 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0147 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_4342 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3241 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1686 IS1 ORF MSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >SSO_P210 ISSfl1 ORF2 MRQQQDEQGRFSICSRQAAVVHRDAYCNRNVVERCFGRLKEYRRIATRYD KTARNYLAMVKLGCIRLFYQRLRN >SSO_3553 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4475 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVLGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1643 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHHQ >SSO_2992 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1707 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2814 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P056 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P073 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3947 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_2574 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P074 IS600 ORF2 MAHIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRA TTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHTDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFRIKYYQMTA >SSO_2274 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1990 IS2 ORF1 MIDVLGPEKRKRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_1907 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0575 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1619 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDHKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1692 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P155 putative IS orf, fragment MYLKIRDRLGYMSNTSSNFEMTGTLLGLELRKRKTPQEKIAIIQQTMEPG MTVSHVARLHGIQPSLLLKWKK >SSO_P018 ISSfl4 ORF2 MISFPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRR GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI DWKHPKRTERAGIRI >SSO_3916 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVRRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_2934 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3837 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLPNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2248 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLTTAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_P076 IS1353 putative transposase-like protein MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE >SSO_P192 oriT nicking and unwinding protein, fragment MMSIAQVRSAGSADNYYTDKDNYYVLGSMGERWAGQGAEQLGLQGSVDKD VFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSVSMMAMLGGDK RLIEAHNQAVDFAVRQVEALASTRVMTDGQSETVLTGNLVMALFNHDTSR DQEPQLHTHAVVTNVTQHNGEWKTLSSDKVGKTGFIENVYANQIAFGRLY REKRKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQTIREAVGEDAS LKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKETGFDIRAYRDAAEQRAY TRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQFMYTDLLARTVGILPPE NGVIERARAGIDEAISREQLIPLDREKGLFTSGIHMLDELSVRALSRDIM KQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSLAIVSGQGGAAGQRERVA ELVMMAREQGREVQIIAADRRSQMNLKQDERLSGELITGRRQLLEGMAFT PGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAM KDAGVNTYRWQGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQ VSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMY RPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGETQVVRISSLDS SWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSEDAMTVVVP GRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNA TLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETLLE TAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFAAEGTG FADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEG KEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTT QFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDT QLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAVGGGRAVASGDT DQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAVYSLINRDVER ALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEA FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREK AGELGQVQVMVPVLNTANIRDGELRRLSTWENNPDALALVDNVYHRIAGI SKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRIRFTKS DRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAI TAHGAQGASETFAIALEGTEGNRKLMR >SSO_0025 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1796 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1770 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0426 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIIGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYVNATMFDRYLPFSCPLNTFVTDAVRFPPF HHH >SSO_1399 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4313 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2651 IS600 ORF2 MAHIRTRETYGTRRHQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0007 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4248 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1577 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4457 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_4044 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_4389 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGYYLNIKHYQ >SSO_1728 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P027 IS100 ORF1 MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC >SSO_0341 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P123 IS600 ORF2 MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP AAFRIKYYQMTA >SSO_3740 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_2795 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_0324 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_2685 conserved hypothetical protein MPRTVTHNPDSPNNDDVLAASEKWDACKPPYTSAHMKICVAAAKIILAAS GVARRSKYEKENYLRIDFSKAGKVTFYAEFPKKMGLKGKKLGEWPELAIQ LAREKALGMADGGLRAESVHAALEMYRDDLKAKVARQKLSPDSFTTYGVR IDRIKATFGEREVFSDVTYNRLVEVLDEWIATRSNNNALELFAELRRFWK FCAPTLCNGRNVAASLPDDYVSSRVQKPTPTRLFTDIESIARLWLNVAAC TSVHQKNAVRFMIITGVRPINVHNLRWDYVYEEAGEIVYPEGVIGMRGAM KTQKAFRLPITPEIRRIIDEQKAWRDSVPECNRDYVFLQPRDPMQPFSKR SLDKLVKTYSPDGAVKGIKHDGTVKGKDGAFNTMCRKFLKSNVIALIDRN >SSO_4046 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1977 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P021 putative transposase, fragment MTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRTLRDRNGTFEPQQ LKKNQP >SSO_2935 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_0873 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3464 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_0039 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0743 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKAIGHYLNIKHYQ >SSO_1539 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P033 putative transposase, fragment MSEQKITGIDLAKTNFYLFSINAHGKPAGKTKLSRNQLLNWLVQQPKMTV AMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYNDAQAIAEA CQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRGLLAEYGIV FSKGAADLRQK >SSO_3114 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_1051 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P017 IS4 ORF MKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGE MADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNL VRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRD LASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1223 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P121 putative transposase METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALVEHYGTVIIPARPRKPKDKPK AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRLGEQHPAYASE HENLRGPGYYH >SSO_0148 IS2 ORF1 MGWQKCSGIKRSYLVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFE PGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAM KQIKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >SSO_1016 hypothetical protein MIKHIIIAERAHQESLVKQKRIAQVWNHKTQLARELKKPMGKQAPGWLEL SEDGSYYIVDEDKASLVNIIYDKRLSGMSMFAICKWLNEQGYLTINQRKV RISKTKKPDGNWSALSVKHILTSRSVLGYLPAKISTEDRKTVLREEIEGF YPQIVTDSKFYAVQQLLEETGKGKTSSGEHWLYVNILKGLIRCKCGLVMT PTGIRKPVYQGTYRCNGNKESRCSYGTVSRKLLDTQLCSRLFSKLSQLHD EATDTAKLDELQRRLNTVDGELEKLTETLIQLPNITQIQEALRVKQEEKD ELIVQLSREKARVKSVSSLDLSGLDMESVEGRTEAQIIIKRLVKEIVVSG NEKLVDIYLHNGNMIRGFPLDGKDDHTLTLEEATDEMQSLDDMLIFGEPV TRIYPAGDMEEVDA >SSO_4108 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_3914 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGGHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKGRKIYPAPSLTEHGMLNSGFVRGIENFRPKERMSILQLLLLH VSWRVLSGIWAE >SSO_3284 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2249 IS2 ORF2 MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRHSRHTDDT DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA LLLVSTPRCLTVICHFHAR >SSO_4433 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3319 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_2298 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_P128 conserved hypothetical protein MKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAK TIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVM FHYGKLNMAF >SSO_1739 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2038 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_2682 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_0018 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_0977 IS1 ORF MATAVRLHRFSTRYAPENHLYGHEWRWMPGNCTHYGRWPQHDFTSLKKLR PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAH VFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQ RIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >SSO_1078 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3182 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_2991 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P157 IS100 ORF2 MLHEEKLARHQRKQAMYTRMVAFPAVKMFEEYDFTFATGAPQKQLQSLRS LSFIERNENIVLQGTSDITNPRVGICV >SSO_1160 IS2 ORF2 MSRAQLHVILRRTDDWMDGRHSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKST AVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQQASNGLSDNRCLEI >SSO_2176 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRH >SSO_0723 putative DNA replication factor MKNIATGGVLERIRRLTPPHVTAPFRTVAEWREWQLAEGQKRSEEINRLN RQLRVEKILNRSGIQPLHRKCSFANYQVQNDGQRYALSQAKSIADELMTG CTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDD GQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVG MLTNLNYEVMKTLLGERVMDRMVMNGGRWVNFNWESWRPNVSHSRVVK >SSO_1070 putative transposase METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK AENGVLIVELWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE HENLRGPGYYH >SSO_1783 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2407 IS2 ORF2 MLGAIEKRFGDKVPEQSIQWLTDNGSAYRAHETRQFARELNLEPCTTAIS SPQSNGMAERFVKTMKEDYIAFMPKPNVRTALHNLAVAIEHYNENHPHSA LGYRSPREYRRQRVTLT >SSO_1467 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4481 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P054 ISSfl1 ORF1 MVHKSDSDELSALRAENARIIKPLLLPEPATPRAGRPWAEHRKIINGMFW VLCSGAPWRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGF IDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQ TEVASR >SSO_0784 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P030 iso-IS1 ORF2 MVTSDDWGSYAREVPKEKHLTGKIFTQRIERNNRTLRTRIKRLARKTICF SRSVEIHEKVIGSFIEKHMFY >SSO_P154 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_P034 conserved hypothetical protein MDGKRISGVPFERYADDIVVHCSRMSDATRLKNRLSERFSEVGLVLNAGK TNIAYIDTFKRRNVATSFTFLGYDFKVRTLKNFKGERYRKCMPGASNAAM RKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSRNFNYRL WSAMQSRLLKWMQSKYRLSNRKAQRKLTLVRKEYPKLFVHWYLLRASNE >SSO_4310 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P162 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIMLTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_4440 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_4470 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3811 putative transposase encoded within IS METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRLGEQHPAYASE HENLRGPGYYH >SSO_0045 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2655 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKNDATAG >SSO_1772 IS600 ORF2 MSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR QRRHSRLGNISPAAFREKYHQMAA >SSO_3290 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3543 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2132 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHHQ >SSO_4366 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P028 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_4490 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P139 IS3 ORF2 MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF YASGPNQKWAGDITYYYSSPTAGKHGAPGY >SSO_1615 IS2 ORF2 MRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLR VTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEW LTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYI SIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSD NRCLEI >SSO_P146 IS629 ORF2 MGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTW QGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHH SDKGSQYVSYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRK SWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLA A >SSO_2882 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3581 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2362 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4511 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3609 IS150 ORF B MSHIRPDFPEPYGLPVDGLRGKTSIFSHPVYLFLREFSLQDSRGGSLSKK AESLSSGKITIGTKEKVIIINELRQCHPLSQLLVIADLPRSTFYYHVKRL NAPDPYQLVKQVILRIYHQHKGRYGYRRIRLACRNESILLNGKTIRKLMK ELGISSLIRRKKYRTYRGEQGRTCNNLLKRQFYADRPNQKWVTDVTEFKV DGRKLYLSPIMDLYNGEIVSYNLTERPLASMVKSMLLDAVEQLNKDDKPL LHSDQGWQYQMPRWQRWLSDNGITQSMSRRGNCLDNAAMESFFSTLK >SSO_P129 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_P166 putative IS orf, fragment MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLL >SSO_1073 IS600 ORF2 MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD RGSQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHY RFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3588 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_2681 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_3465 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_P138 putative reverse transcriptase, fragment MARTRSGRETSRTITAHRLRGNTGRRVIEGDLSSYFDTVHHRLLMKAVCR RISDARFMRLLWKTPC >SSO_1172 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P029 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_3561 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3823 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_0732 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P042 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_P019 putative IS orf, fragment MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVL PSPLRQSSAHKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVLEQLE LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS PDRKGIHPQNHLAGYSGVLQADAYGSYRALYESGRITEAQQRIGELYAIE AEVRGCSAEQRLAARKARAAPLMQSLYDWIQQQMKIHSLKMECLHGEHYY PSGNSAGNSV >SSO_0941 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3116 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1069 putative transposase subunit MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT S >SSO_3769 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1417 putative excinuclease subunit MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR AALQALQTIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS >SSO_0293 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3948 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_1891 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P060 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_P080 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_1325 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLSIKHYQ >SSO_0816 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2993 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW VVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_3675 putative DNA processing protein MDADSSDSTVTPGRLPDGLSSCPCFYLGKKLMNLSANAQATLLLTSDFSR AAASEYKPLSNSEWGKFALWLKHQRISPAELLVPQPQEKLTGWSDPRISQ ERILGLLARGHSLALAVDKWQRAGLWILTRGDADYPVRLKNRLRTDAPPV LFGCGNKALLQAEGMAIVGSRDAPTDDLRYTQQLAAKLAQQGICVISGGA RGIDECAMASALEAGGTAVGVLADSLLKTSTLVKWREGLIAGNLVLISPF YPEVRFTVGNAMARNKYIYCLAESAMVVRAGMTGGTITGAMEALKHQWLP VQVKPNQDMQSANSRLVENGASWSAEQAENVTIRLPDVPGLMYDRALRNA QPELFSLHEDDANYAVMPAYTPVDFYQLFVAELAILAKESISIERLASCT GLTIEQISVWLNRAEEEGRVIRLGEGHYQFR >SSO_2453 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ AHQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P131 IS21 ORF1 MIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRHKMVKLKPF MDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKRKMRPSKRT VRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQ DAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVFNSGFLLLA DHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSFTHVNQQLE QWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSYFDIRHVSW DSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVASHRLCSAS SGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_2780 hypothetical bacteriophage protein MAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAE EDLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIAI QGNTITGAMGGRVKNPELTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAG QKKATNPFLKIIES >SSO_0628 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPRSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P063 ISSfl1 ORF1 MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR >SSO_1756 IS600 ORF1 MWSAGIDTSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3594 IS629 ORF2 METTFVLDALEQALWARRPSGTIHHSDKGSQYVSLAYKARLKEAKLLAST GSTGDSYDNAMAEIIKGLYKAEVIHRKSWKNRTEVELATLTWVDWYNNRR LLERLGHIPPAEAEKAYYASIGNDDLAA >SSO_0149 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGIGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_3667 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVRRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1322 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0327 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_2041 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P013 IS21 ORF2 MNKRAFFGAFLIFWGFKFLSMNCRYEKASIILTSNKGVADWGEMFGDHVL ATAILNSCA >SSO_1683 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_4066 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0019 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1628 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1729 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3583 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGLCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_4512 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYSCSRRIALFFWALIVLLLFLSNEAIWLMPHPLPNSRSTLR SERVRMTER >SSO_1890 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1867 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1581 putative DNA replication factor MKNIAAAGVLERIHRLAPQGAVPPYRTVEEWREWQLAEGRKRSEEINRQN RQLRVEKILNRSGIQPLHSKCSFANYQVQNDGQKYALSQAKSIADELMTG CTNFVFSGKTGTGKNHLAAAMGNRLMAKGRSVIIVTESDVMSVLHDSYDN GKSGEKFLQELCGVDLLVLDEIGIQRETKNEQVVLHQIIDRRTASLCSVG MLTNLNHAAMSTLLGERIMDRMTMNGCRWVTFNWDSWRSNVSFPGVVK >SSO_1894 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0868 putative integrase MSSAIHRLSDTLLRKLSGSPTTKNAFFNDGGNLSVRHSTSGLLTWYFTYR AGTGRQVSPERLRLGNYPDLSLKAAREKAAQCRAWLAEGKNPRYELNRAV QDALAPVTVKEALTYWLESYAKEKRTDYESLKSRINKHIISQIGALPLEK CELRHWLACFDQMAKRSPVSAGFLLQVCKQALKYCRKRRYAISNVLDDMV VGDVGKKAEISERVLTNKELGELLRALDEKIYPPYYSALIRLLIVFGCRA TELRRSEVQEWDFKEMLWTVPKEHSKTKVAIFRPIPEAILPFVTKLVEQN RHTGLLLGELKGQSSVSEYGRTAHRRINQAPWTLHDIRHTFTTMLNDLGV DPHVVEQLTAHQLPGVQRVYNHSRYLDAKRDALNLWVERLELLQNNDEKI VVMTPRIYSQNS >SSO_3012 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_0292 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1159 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0307 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0088 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P006 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWMNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_2679 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV SLAYTQRLKEAGLLASTGSTGDSYDNAMAEIIKGLYKAEVIHRKSWKNRT EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_1924 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P010 conserved hypothetical protein MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV ESRRQAKGTRFLWQHSNKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD IWSRPWSEERRNDWQRWLSLAANSDVPMMKNVAKTIGKRLYGILNAMRHG VSNGNAEALNSKIRLLRIKAKGYRNRERFKLGVMFHYGKLNMAF >SSO_2794 IS2 ORF2 MSRAQLHVILRRTDDWMDGRHSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKST AVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQQASNGLSDNRCLEI >SSO_1239 putative transposase MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIG >SSO_1709 putative virulence protein MKRLQAFKFQLRPGGQQECEMRRFAGACRFVFNRALARQNENHEAGNKYI PYGKMASWLVEWKNATETQWLKDSPSQPLQQSLKDLERAYKNFFRKRAAF PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYRNSRQVTGVVKNV TVSQSCGTWYISIQTESEVSTPAHPSASMVGLDAGVAKLATLSDGTVFEP VNSFQKNQKTLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANIRRDYL HKVTTAVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL DQGWYEMHRQLEYKQLWRGGQVLAVPPAYTSQRCAYCGHTAKENRLSQSK FRCQVCGYTANADVNGARNILAAGHAVLACGEMVQSGRPLKQEPTEMIQA TA >SSO_1816 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3584 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_0315 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0288 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1072 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_4435 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRSLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0702 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_2700 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2451 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2452 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_P043 IS2 ORF2 MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFVKTMKEDCI AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST >SSO_4311 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_1071 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_0823 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3249 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_4243 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1797 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P163 IS21 ORF1 MGYTGGRSMLRYYIQPKRKMRPSKRTVRFETQPGYQLQHDWGEVEVEVAG QRCKVNFAVNTLGFSRSFHVFAAPKQDAEHTYESLVRAFRYFGGCVKTVL VDNQKAAVLKNNNGKVVFNSGFLLLADHYNFLPRACRPRRARTKGKVERM VKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETPEQRFA LEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCGQPVSI RISLDDELRIYSNEKLVASHRLCSASSGWQTVPEHHAPLWQQVSQVEHRP LSAYEELL >SSO_3607 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_3585 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1323 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1054 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_0726 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0901 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P016 putative transposase MEYRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCGMFVRFRNAGLSWP LPAGMSEQELDALLYGSASTVPVVLTESTVMPKLPVVKKRPRRP >SSO_1015 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3598 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGGHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAW >SSO_0731 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_1566 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGDCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_2592 conserved hypothetical protein MATGGIHMELHCPQCQHVLDQDNGHARCPSCGEIIEMKALCPDCHQPLQV LKACGAVDYFCQHGHGLISKKRVEFVLA >SSO_3632 putative virulence protein MRRFAGACRFVFNRALARQNENHEAGNKYIPYGKMASWLVEWKNATETQW LKDAPSQPLQQSLKDLERAYKNFFQNRAAFPRFKKRGQNDVFRYPQGVKL DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGKWYISIQTESEVS TPVHPSASMVGLDAGVAKLATLSDGTVFEPVNSFQKNQKKLARLQRQLSR KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK VSNMSKSAAGTVSLPGRNVRAKSGLNRSILDQGWYEMRRQLAYKQLWRGG QVLAVPPAYTSQRCVCCGHTAKENRLSQSKFRCQVCGYTANADVNGARNI LAAGHAVLACGEMVQSGRSLKQEPTEMIQATA >SSO_2181 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1267 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1670 putative transposase METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE HENLRGPGYYH >SSO_1310 putative phage integrase protein MQRKKYDPNLPRNLTYRRRDKAYYWRNPLTKEEFTLGKISRRDAVTQAIE ANHYIYKNYSPAALIEKLKGFDSFTMADWIERYKTILIRRKVSRNTYKIR ANQLETIKEKLGEILLTEITTRHIAEFLDLWIEGGKNTMAGSMRSVLSDM FREAIVEGRISQNPVTPTRAPKIVVTRERLKLKIYNCIREAADQLPAWFP LAMDLALVTGQRREDITNMRFSDIYDDRLHVRQIKTGMMIAIPLSLSLPV AGLRLGTVVERCRLVSRGDYLISAGIRKNSPDGSIHPDGLTKKFVAARKF TGIQFSENPPTFHEIRSLAGRLYKETCGEEFAQRLLGHTSEKTTKMYLDE REKTYLLL >SSO_2699 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P235 IS629 ORF1 MVLESQGEYDSQWATICSIAPKIGCTPETLRVRVRQHERDTGGGDGGLTT AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK >SSO_P133 IS629 ORF2 MFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYVSLA YTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVE LATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_2174 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P188 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0748 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1884 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0289 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_1757 IS21 ORF1 MEVAGQRCKVNFAVNTLGFSRSFHVFAAPKQDAEHTYESLVRAFRYFGDC VKTVLVDNQKAAVLKNNNGKVVFNSGFLLLADHYNFLPRACRPRRARTKG KVERMVKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETP EQRFALEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCG QPVSIPLKSSKLKLSERFLKINFRSVKALPDYLFLKAL >SSO_3040 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSVTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_0405 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVLGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1437 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRSQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3222 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIKRHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2024 IS2 ORF1 MTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQ IKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >SSO_0173 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1189 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3131 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P012 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDAARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_0910 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1926 putative resolvase MAILGYCRVSTDDQSITNQQMQIEEAGYNIAKWFTDEAVSGSVKASLRNG FSRLLAYAREGDTVVVVAVDRLGRDTIDVLSTVKALQAKGVTVISLREGF DLSSAMGEAMLGIMSTLAQLERSLIAERRKAGIERAKAEGVHMGRPVKAS SEAVQTLISQGKTRLQIQEELGISRATYYRLAK >SSO_2757 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P134 putative transposase, fragment MNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHVA KQPGEVVDKTRQNEPPRVSWRVFYL >SSO_2668 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1288 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1320 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1269 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P125 IS3 ORF1 MTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLYAWR SKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKRLK >SSO_0017 putative transposase MANYRLISLALTGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_P002 IS2 ORF2 MVHATGLMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI MPKPDGLTAAKNLAEAFEHYNE >SSO_0881 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_1265 IS2 ORF1 MRTLRKQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIK ELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >SSO_1451 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_P046 ISSfl2 ORF MQPPGCRGSGKRLFDKALPNDENKLRSLISDLKQHGQILLVVDQPATIGA LPVAVARSEGVLVGYLPGLAMRRIADLHAGEAKTDARDAAIIAEAARTLP HALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRIRGLLTQIHPALER VLGPRLEHPAVLDLLQRYPSPEKLASLGFAG >SSO_4468 IS1 ORF MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK SVELHDKVIGHYLNIKHYQ >SSO_3566 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPDNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSWRENTATG >SSO_2025 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3597 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P218 IS630 ORF MQTMTSRSRQAAYSISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL KENPKFTYRKLKN >SSO_2666 putative protein encoded within IS MNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTR KPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIR TVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYR QSEIYGRQGVELSRSLLSGWVDACCRLLSPMEEALHGYVLTDGKLHVDDT PVPVLLPGNKKTKTGRLWAYVRDDRNAGSTLAPAVWFAYSPDRKGIHPQT HLAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHDVHVRTPSAL TEEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSLESWLREKMK TLSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRK NYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRV GELLPWRVALPTE >SSO_3677 conserved hypothetical protein MEKHGAELLLQRMLSNTSATFREGQWEAIDAVVNQRRKLLVVQRTGWGKS AVYFIASKIFRDRGAGPTIIISPLLALMRNQVAAAERLGITAETLNSTNR EEWQRISDKLLRGGVDCLLISPERLANQDFLETANTPVLGTTATANNRVV EDIRQQLGDIVIQRGTLARESLALDALVLGEQSSRLAWLATVIPQFSKSG IVYTLTTRDAELVAEWLRTNGISAFAYYSGVTCEGAEDSNTAREYLEQAL LANKIKVLVATTALGMGFDKPDLGFVIHYQMPGSIVGYYQQVGRAGRAID SAVGILLCGGEDRAIHKFFRESAFPAEAQIHEILNVLSENDGLTLRGIEQ RTNLRYGQIEKALKLLVAENPSPVVYTEKLWRRTIVSFSPDHERINHLMN QRKNELADVESYITTKECKMQFLRRALDEPSAERCGKCSSCLQHPLLSPD IDSGLLHAANLFIKHADLPLNLNKQVASGAFTQYGFKGNLPAGLQGSTGR VLSRWGDSGWGKQVAQEKKTGRFSDELVEACAEMVRQRWNPHPEPTWVCC VPSLKHLDLVPDFARRLAAKLGLPFIDAIEKVVDNPPQKMQQNRFGDAAN LLI >SSO_1313 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_1605 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGYYLNIKHYQ >SSO_2287 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3162 conserved hypothetical protein MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN QSQTFGVAWDDPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR LETDVYVPLA >SSO_2905 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQKDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_0304 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0143 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P077 IS21 ORF1 MADVADRRELRQFRQTPEQRFTQEQEHLQPLLGTDFDIRHVSWDGYIEVG GNRYSVPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVR PGCSSVTAKSA >SSO_1617 IS600 ORF2 MSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR QRRHSRLGNISPAAFREKYHQMAA >SSO_1521 IS4 ORF MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWK YPTAPKKSQSVA >SSO_2650 putative protein encoded within IS MIPLPSGIKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS GSQVKLLWSTSDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI DWRQPKRLLTSLTML >SSO_0509 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2247 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_1610 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1238 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0305 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2652 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0428 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_P051 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNLFFEMKA >SSO_4297 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1055 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_P038 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_0488 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1811 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1415 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4441 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLATAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_0198 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_0290 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3589 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQ >SSO_2660 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3904 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGCAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_0500 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGECTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKRYQ >SSO_3749 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1198 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_2165 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1196 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHHQ >SSO_P189 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P161 conserved hypothetical protein MVGVAGAALAPLVKLLRHELLTRDVIHADETSLRLLDTRKGGKSCSGWLC AYVSGERSGPPVVCFDSQTGRALRYPETWLQCWCGGTLVSDGYSVYKSLA DNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKIAGLYRIEKLIRE RCQRHDV >SSO_2434 IS2 ORF1 MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA VEYGRAKKWIAHAPLLPGDGE >SSO_2758 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELDEVA >SSO_3008 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3610 IS629 ORF2 MRKVWRQLLREGIRVARCTVARLMAVMGLVGVLRGKKVRTTVSRKTVAAG DRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVW >SSO_2799 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1383 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_3467 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1018 putative transposase METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKPHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE HENLRGPGYYH >SSO_3318 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTVRFETQPGYQLQHDWGEVEVEVAGQRCKVNFAVNTLGFSRSF HVFAAPKQDAEHTYESLVRAFRYFGGCVKTVLVDNQKAAVLKNNNGKVVF NSGFLLLADHYNFLPRACRPRRARTKGKVERMVKYLKENFFVRYRRFDSF THVNQQLEQWIADVADKRELRQFKETPEQRFALEQEHLQPLPDTDFDTSY FDIRHVSWDSYIEVGGNRYSVPEALCGQPVSIRISLDDELRIYSNEKLVA SHRLCSASSGWQTVPEHHAPLWQQVSQVEHRPLSAYEELL >SSO_1315 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0287 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_4456 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_P066 ISSfl1 ORF1 MACYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP WRDLPERYGSWKTVYNRFNRWSKSGVINIIFNRLLSLLDAGDAANLLI >SSO_2408 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2725 IS600 ORF2 MAHIRTRETYGTRRHQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1771 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGAVWSENINVA >SSO_1666 hypothetical protein MEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVH DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSA AGEQFMSLVRSQQCTV >SSO_1674 H-repeat-associated protein-like protein MEIKKLMEHFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMYSLVLGQIKTDE KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG NQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDEL IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND KVFKAGLRRKMRKAAMDRNYLASVLAGSGLS >SSO_0653 putative transposase METCFKILQLKFDKKLTNRCIGLTLHISASTVFEVLARFKASSLSWPLPA DISHDTLEKLIFPPKDTSASELVMPDMLYFDTEMRKPGVTRQLLWMEYKA QAGDKAMGYSHFCRCYRKWKKTRRLSMRQEHRAGEKLFIDFCGPTVPVIN PDTGEIRRVAIFVAVMGASNYTYVEACEGQDMMSWLNAHSRCLTFLGGVP KLLIPDNLRSAVKKADRYEPVINDSYQALAEHYGTVIIPARPRKPKDKPK AENGVLIVERWLLARIRNETFHTLRALNARLRELLTDMNNRPMKGYGNQT RAERFRMLDAPALSPLPLEPYEYTEYKAVKVGPDYHVEYARHWYSVPHEL VGQRLSLKVGQSVVQLWHKGQCVAQHPRSTHEYKHTTNPLHMPERHRRHG TWTPERLIEQGNRTGPSTGRVVESMLKAKLHPELAYRAVLGLLALQKKYG PERLEKACYVALHYNAPDRRFIDNLLRHHRDNVELPLSRQGEQHPAYASE HENLRGPGYYH >SSO_3936 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3988 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSWRENTATG >SSO_2048 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1908 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P120 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1316 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1266 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_0429 IS600 ORF2 MLDGADNPNGELPSYITGADSYDNAPMESFWGTLKNESLSHYRFNNRDEA ISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0508 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_0504 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGECTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1318 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_4218 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGGHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_4058 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1785 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_0654 putative transposase subunit MSDNLLNKLTQLKLPAMAGSLIRQRETPQTYDELSFEERLTLLVDDELLS RENSRVARLRKNACLKYQATPEGLRYPASRGLRAEQMRELLNGHYIIHRK NLLITGPTGCGKSWIANALGEQACRQKYSVRYCRTGRLLEQLAQGRVDGS WLKYLKQLQKIQVLILDDLGLEQLSNAQCNDLLEITEDRYGQSSTIVVSQ FPVDKWHGLMENPTTADAILDRLVHNSHRVVLQGESLRKNPPTVESSEKT S >SSO_0874 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_2653 putative transposase MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA RELAGFIWDMGRIAMSVAQQPQCHK >SSO_1009 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLPAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_1741 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_1224 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P081 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >SSO_P145 IS629 ORF1 MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_P178 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3592 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >SSO_0425 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1808 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0869 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1895 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_3586 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P150 hypothetical protein MFNAKIRGWIKYYGAFYKSALYLTLRQIDRKLVLWLPRKHKRLRGHRRRA SHWLARVARSETRLFAHWPLLWGQASMRRAG >SSO_3032 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_P055 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >SSO_3218 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_4329 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1760 IS21 ORF1 MLSREDFYMIKQMRQQGAYIVDIATQIGCSERTVRRYLKYPEPPARKTRH KMVKLKPFMDYIDMRLAENVWNSEVIFAEIKAMGYTGGRSMLRYYIQPKR KMRPSKRTV >SSO_1899 putative DNA adenine methyltransferase encoded by prophage MLNTVKISSCELINADCLEFIRSLPENSVDLIVTDPPYFKVKPEGWDNQW KGDDDYLKWLDQCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFSVL NHIIWAKPSGRWNGCNKESLRAYFPATERILFAEHYQGPYRPKDAGYAAK GSALKQHVMAPLISYFRDARAALGITAKQIADATGKKNMVSHWFSAGQWQ LPNESDYLKLQALFARVAEEKHRRGELEKLHHQLVDTYTSLNRQYAELLS EYKHLRRYFGVTVQVPYTDVWTHKPVQFYPGKHPCEKPAEMLQQIISASS RPGDLIADFFMGLGSTVKAALALGRRAIGVELETERFEQTVREVQDLVSQ NG >SSO_1769 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1246 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3031 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1929 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1618 IS21 ORF1 MVKYLKENFFVRYRRFDSFTHVNQQLEQWIADVADKRELRQFKETPEQRF ALEQEHLQPLPDTDFDTSYFDIRHVSWDSYIEVGGNRYSVPEALCGQPVS IRISLDDELRIYSNEKLVASHRLCSASSGWQTVPEHHAPLWQQVSQVEHR PLSAYEELL >SSO_2356 putative regulator MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC DEFTPDSLKALALWMKRNAKNEAVETETAE >SSO_3225 putative superfamily I DNA helicases MDKNALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIV SKFFKGEKDDVETVDVILRPKVYFRLLQHGKDRSAGAPDIVTPIVTPALL SREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFD DSVDKTAETDEEREARYAALQQEWRQYLYDSERLLKSVAGDWIEKPEQYE LAEHGYIVKTAQSGGASSHILSLYDHLIVCNKDVPLFNRFASREVHAAES LLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTG KTTLVLSIIATQWARAALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQG SGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA LLFYLEKAKAAFPGKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQ IRAARELIANDIEQYLDNLNKLLSGQEQKVTLLKSAKTEWKKYRAGESLI YSLFSWLPAVRNKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLN SAEREQTTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQA DELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQDEKKKKGAKGVTARW QRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAG QVLPEVAAASFALAKKALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGS TQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGELASSGS RYNLLEAETIAVWLAENQQNIEAHYGKSLHEVVGIVTPFSAQVSTIKQVL GKQGISTGTNEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNS MLNVAVSRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDY KERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITIISPWLTWQRLE QTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNFKAALEKL NALGIATKLVNRVHSKIVIGDDGLLCVGSFNWFSATREARYERYDTSMVY CGDNLKGEIEAIYNSLERRQV >SSO_P153 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P061 IS629 ORF2 MPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAKVIHRKSWKNRA EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >SSO_P193 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0927 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_2736 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0862 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1237 hypothetical protein MARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLVPK QTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKKRR PASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY >SSO_P079 IS2 ORF2 MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_P184 putative transposase MCRLAVEYLLYAARKRGLEIGIFCTIHTLRLHFEEHLPLVVAGRRLGVPK STVCSMFVRFRKAGLSWPLPAGMSERELDARLYGSASTVPVVLTESTVMP EVPGVKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNW RHQYRKGGLLPSGKNMPALLPVTLTPEPDNHGFDIIYMLSTHHQRFTFVR LFDPYLIGSRPTFSHLAHHHIS >SSO_0996 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2039 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_0570 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHHQ >SSO_P186 IS186 ORF1 MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS LAFGEADYIVRVYWRGLRWLTAEGMRFDMMDFLRGLDCGKNGETTVMIGN SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA GHVLLLTSLSEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEL ELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKKN >SSO_2929 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2791 putative endonuclease MRHEFILPYPPTVNTYWRRRGSTYFVSKVGERYRRDVTLIVRQQRLKLNL SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDEINIVR GQLVPGGRLGVKIYEITGDNDGA >SSO_4314 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_P239 IS911 ORF2 MQTMTSRSRQAAYSGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISH GSAGARSIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEH VAIPNYLERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGW AMSFSPDSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRY QIRQSMSRRGNCWDNSPMERFFRSLKNEWMPMVGYVSFREAAHAITDYIV GYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC >SSO_1304 conserved hypothetical protein MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVRPDRPISPQAMAI HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM KLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDI MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWLFN NLDSMSPELRLTLKHYLENT >SSO_2754 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1711 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_0368 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_1979 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_3608 IS629 ORF1 MVLESQGEYDSQWAVICSITPKIGCTPETLRVWVRQHERDTRGGDGGLTT AERQRLKELERENRELRCSNDILRQASAYFAKAEFDRLWKK >SSO_3898 IS4 ORF MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVRRKGKVCHLLTSMTD AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >SSO_2931 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_1594 IS1 ORF MPGNSTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3741 IS21 ORF2 MHELEVLLSRLKMEHLSYHVESLLEQAAKKELNYREFLCMALQQEWNGRH QRGMESRLKQARLPWVKTLEQFDFTFQPGIDRKVVRELAGLAFVERSENV ILLGPPGVGKTHLAIALGVKAVDAGHRVLFMPLDRLIATLMKAKQENRLE RQLQQLSYARVLILDEIGYLPMNREEASLFFRLLNRRYEKASIILTSNKG FADWGEMFGDHVLTTAILDRLLHHSTTLNIKGESYRLKEKRKAGVLTKNT TPISDDEMVESGQHQ >SSO_0466 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIIGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1620 IS2 ORF2 MALTLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAE AFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >SSO_3658 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_1906 IS600 ORF2 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >SSO_1784 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVPENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >SSO_0555 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_4188 IS1 ORF MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_3185 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCACSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLNPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSCVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLW QALHDTITRNHQCSSMWQLLKKVRHFMETVSPFPGGKHGLAKV >SSO_2271 ada, O6-methylguanine-DNA methyltransferase MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE NVSFYANASEALAAGFRPCKRCQPDKANPRQHRLDKITHACRLLEQETPV TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV TTSILNAGFPDSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP KAVRAVASACAANKLAIVIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE NEER >SSO_2121 alkA, 3-methyl-adenine DNA glycosylase II, inducible MYTLNWQPPYDWSWMLGFLAARAVSGVETVADNYYARSLAVGEYRGVVTA IPDIARHTLHINLSAGLEPVAAECLAKMSRLLDLQCNPQIVNGALGKLGA GRPGLRLPGCIDAFEQGVRAILGQLVSVAMAAKLTAKVVQLYGERLDDFP EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGSLPMTIPG DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKRRFPGMTP AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA >SSO_2270 alkB, DNA repair system specific for alkylated DNA MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHDLC QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT GDCRYNLTFRQAGKKE >SSO_3518 dam, DNA adenine methylase MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKTYFPEAELYH FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS ISSNGGTRKKVDELLALYKPGVVSPAKK >SSO_1788 dbpA, ATP-dependent RNA helicase MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAVISGRVQR DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT KKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA QRANIISDMLQIKLNWQTPPANSSIVPLEAEMATLCIDGGKKAKMRPGDV LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKT SRVRLLK >SSO_2018 dcm, DNA cytosine methylase MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL KRALANDSAWHRLSEKEFAHLQTLLPKPSAHHPHYAFRFIDLFAGIGGIR RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKE GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRIIMQTLDELGY DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL EPKIKQAVALRQQEAQHGRRSR >SSO_3308 deaD, inducible ATP-independent RNA helicase MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFGGERR EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA >SSO_0778 dinG, probably ATP-dependent helicase MALTAALKAQIAAWYKALQEQIPDFIPRVPQRQMIADVAKTLAGEEGRHL AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTANNQ EEQKRCAKLKGDLDTYRWDGLRDHTDIVIDDDLWRRLSTDKASCLNRNCY YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDEVLEICQ RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI VIPRMRVEPSIDNEEQHIAEMAAFFRKQVESKKHLGMLVLFASGRAMQRF LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG IVKKKEKTKSPRRRRR >SSO_0268 dinJ, damage-inducible protein J MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFD LREPNQLTIQSIKNSEAGVDVHKAKDADDLFDKLGI >SSO_0274 dinP, damage-inducible protein P MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAK IASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTC GDVQKCDLVMLLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG L >SSO_3652 dnaA, DnaA MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW VRDKYLNNINGLLTSFCGADAPQLRFEVGTKPVTQTPQAAVTSNVAAPAQ VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA RAAARQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE SHDIKEDFSNLIRTLSS >SSO_4232 dnaB, replicative DNA helicase; part of primosome MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV GGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQ GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR LTFNGQWSRFDNYAGPQYDDE >SSO_4507 dnaC, chromosome replication; initiation and chain elongation MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY >SSO_0196 dnaE, DNA polymerase III alpha subunit MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLL ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL EERLAFLFPDEEERVKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR LRQSLEAHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI GSEQVELEFD >SSO_3203 dnaG, DNA primase MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR QEELIARERTHGLSNEERLELWTLNQELAKK >SSO_3651 dnaN, DNA polymerase III beta subunit MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE DAASQSAAYVVMPMRL >SSO_0229 dnaQ, DNA polymerase III epsilon subunit MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA TIQRIVRQASKLRVVFATDEELAAHEARLDLVEKKGGSCLWRA >SSO_0457 dnaX, DNA polymerase III tau and gamma subunits MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE MTLLRALAFHPRMPLPEPEVPRQSFAPVVPTAVMTPTQVPPQSAPQQAPT VPLPETTSQVLAARQQLQCVQGATKAKKSESAAATRARPVNNAALERLAS VTDRVQARPVPSALEKASAKKEAYRWKATTPVMQQKEVVATPKALKKALE HEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDNAV CLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPLEW RQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI >SSO_3099 endA, DNA-specific endonuclease I MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGC KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQD GGRKNCAKDPVYRKMESDMYNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ CAMKVDFKEKAAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNK MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS >SSO_2955 exo, 5'-3' exonuclease MRGLFPISHPAVACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR PPMPEELHDEMPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQ ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR >SSO_3402 fis, site-specific DNA inversion stimulation factor MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN >SSO_2289 gyrA, DNA gyrase subunit A MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYDTIVRMAQPFSLRY MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT ANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAE GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT IDGSAAEGDDEIAPEVDVDDEPEEE >SSO_3649 gyrB, DNA gyrase subunit B MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE QQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKI LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA NIDI >SSO_0966 helD, DNA helicase IV MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGERCDLDT TYRFNSRIGKVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ QADYVIIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL TRARHRVWALFNKENPSPFVEILKNLDVPVARKP >SSO_0065 hepA, probable ATP-dependent RNA helicase MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL INYLASPVQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR ALIDAARNEADEKLSAELSRMEALRAVNPNIRDDELTAIESNRQQVMESL DQAGWRLDALRLIVVTHQ >SSO_1446 himA, integration host factor (IHF), alpha subunit MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE >SSO_0914 himD, integration host factor (IHF), beta subunit MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG >SSO_0594 holA, DNA polymerase III delta subunit MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW VTARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQTQLRQ AVQLLTRTELTLKQDYGQSVWAELKGLSLLLCHKPLADVFIDG >SSO_1119 holB, DNA polymerase III delta prime subunit MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT LLMDALKRHHGAAQVTNVDVPGLVVELANHLSPSRLQAILGDVCHIREQL MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL >SSO_4444 holC, DNA polymerase III chi subunit MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLR TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK >SSO_4522 holD, DNA polymerase III psi subunit MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALT DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG AQVASPALTELRANPTARAALWQQICTYEHDFFPRND >SSO_1733 hrpA, helicase, ATP-dependent MLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT EKVTVYGLPIVAARKVNYSQIDSALCRELFIRHALVEGDWQTRHAFFREN LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRISHDVISARHFDSWWKK VSRETPDLLNFEKSMLIKEGAEKIRKLDYPNFWHQGNLKLRLSYQFEPGA DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI WSFGQLPESYEQKRGNYKVKAWPALVDERDSVAIKLFDNPLEQKQVMWNG LRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDK LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR LEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIE ELRVSYFAQQLGTPYPISDKRILQAMEQISG >SSO_0160 hrpB, ATP-dependent helicase MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKST WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR AQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER HYLPLPAHQRFDEAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASR IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR LVVDCAQERVARFEPRTGLTRLITQRVSQASMTQRAGRAGRLEPGICLHL IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPVVN LMAAKRLLQMLGALDGERLSAQGQKMAALGNDPRLAAMLVSAKNDDEAAT AAKIAAILEEPPLMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS SLIAPLLAGAFADRIAHRRGQDGRYQLANGMGAMLDADDALSRHEWLIAP LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL RLLCAAKWLPEYDWPAVDDESLLAALETWLLPHMTGVHSLRGLKSLDIYQ ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF GEATNPTIAQGGVPLVLELLSPAQRPLQITRDLSAFWKGAYREVQKEMKG RYPKHVWPDDPANTAPTRRTKKYS >SSO_4173 hupA, DNA-binding protein HU-alpha MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK >SSO_0423 hupB, DNA-binding protein HU-beta MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN >SSO_4466 insA, IS1 ORF1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >SSO_P203 insB, IS1 ORF2 MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P160 insB, IS1 ORF2 MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_P067 insB, IS1 ORF2 MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >SSO_2406 intC, putative prophage Sf6-like integrase MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKIL TVGKYPLMTLQEARDKAWTARKDISVGIVPVKAKKASSNNNSFSAIYKEW YEHKKQVWSVGYATELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFED RGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFL PADQMPAFNKALATFSGSIVSLIATKVLRYTALRTKELRSMLWKNVDFEN RIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKK KPICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHEWPADAIEVQLAH ANGGSVRGIYNHAQYLDKRREMMQWWADWLDEKVE >SSO_0746 intE, putative integrase fragment MPVHLIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGK AASAKLIRSTLSDAFREAIAEGHITTKPVAATRAAKSEVRRSRLTADEYL KIYQTAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKT GVKIAIPTTLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVS RYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSD TMASQYRDDRGREWDKIEIK >SSO_P090 ipaB, IpaB MHNVSTTTTGLSLAKILASTELGDNTIQAANDAANKLFSLTIADLTANKN INTTNSHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK IKDLENKINQIQTRLSELDPDSPEKKKLSREEIQLTIKKDAAVKDRTLIE QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ EVIADLLASMSNSQANRTDVAKAILQQTTA >SSO_2500 lig, DNA ligase MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD RLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG LPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLEQQEQL GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKSAQNVVNAL EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEVLEAASIEELQKVP DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAG KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA KAQELGIEVIDEAEMLRLLGS >SSO_1134 mfd, transcription-repair coupling factor MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ NALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQDIISSRLSTLYQLPT MQRGVLIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGY RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTL ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN ANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDKVRNLALICESDLL GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE TTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVA EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA >SSO_2988 mutH, methyl-directed mismatch repair MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALAGLVTPENLKRDKGW IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR GERILTLPRGFYLKKNFTSALLARHFLIQ >SSO_4355 mutL, enzyme in methyl-directed mismatch repair MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA SISSVSRLTLTSRTAEQQEAWQAYAEGRDMNVTVKPAAHPVGTTLEVLDL FYNTPARRKFLRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQYR AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRS IPENRVAAGRNHFAEPAAREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ QGEVYRQLLQTPAPMQKLKAPEPQEPALAANSQSFGRVLTIVHSDCALLE RDGNISLLALPVAERWLRQVQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS VFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLL QSVDLHPAIKALKDE >SSO_3772 mutM, formamidopyrimidine DNA glycosylase MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP IVATKHAQRATFYCRQCQK >SSO_2880 mutS, methyl-directed mismatch repair MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQL LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK SLV >SSO_0107 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGEPWGKEGQ PGEWMSLVGLNADDFPPANEPVIAKLKRL >SSO_3235 mutY, adenine glycosylase MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS LCPLQNGCIAAANNSWSLYPGKKTKQTLPERTGYFLLLQHEDEVLLAQRP PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV >SSO_0665 nei, endonuclease VIII/DNA N-glycosylase with an AP lyase activity MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVET RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT ILLYSASDIEMLRPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH ALLDIPRFSYATRGLVDENKHHGALFRFKVFHRDGEPCERCGSIIEKTTL SSRPFYWCPGCQH >SSO_4171 nfi, endonuclease V MIMDLASLRAQQIELASSVIREDRLDKDPPDLIAGVDVGFEQGGEVTRAA MVLLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPD LVFVDGHGISHPRRLGVASHFGLMVDVPTIGVAKKRLCGKFEPLSSEPGA LAPLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRL PEPTRWADAVASERPAFVRYTANQP >SSO_2215 nfo, endonuclease IV MKYIGAHVSAAGGLANAAIRAVEIDATAFALFTKNQRQWRAAPLTTQTID EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ GSNLGFKFEHLAAIIDGVEDKFRVGVCIDTCHAFAAGYDLRTPAECEKTF ADFARIVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQD DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA >SSO_2431 nohB, bacteriophage DNA packaging protein MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIRWY AERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELK NARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDF LKRDIIKAMNKAAALDELIPGLLSEYIEQSG >SSO_1525 nth, endonuclease III MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE DLCEYKEKVDI >SSO_1276 ntpA, dATP pyrophosphohydrolase MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAP QAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVTRNTE SWFCLALPHERQVVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA >SSO_1798 ogt, O-6-alkylguanine-DNA/cysteine-proteinmethyltrans ferase MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG YERISATNPGGLSDKLRDYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT GYAGGVQRKEWLLRHEGYLLL >SSO_3161 parC, DNA topoisomerase IV subunit A MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKVAVKGKSNQSVVFVDSTG RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASNGDS EE >SSO_3168 parE, DNA topoisomerase IV subunit B MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV SSDEVLASQEVHDISVAIGIDPNSDDLSQLRYGKICILADADSDGLHIAT LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV >SSO_0657 phrB, deoxyribodipyrimidine photolyase (photoreactivation) MTTHLVWFRQDLRLHDNLALAAACRNSSARVLALYIATPRQWATHNMSPR QAELINAQLNGLQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHL FYNYQYEVNERARDVEVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKV FTPFKNAWLKRLREGMPECVAAPKVRSSGSIEPSPSITLNYPRQSFDTAH FPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR QCLHRLLAEQPQALDGGVGSVWLNELIWREFYRHLITYHPSLCKHRPIIA WTDRVQWQSNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF NPTTQGEKFDLEGEFIRQWLPELRNVPGKSVHEPWKWAQKAGVKLDYPQP IVEHKEARVQTLAAYEAARKGK >SSO_4036 polA, DNA polymerase I MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI HQLMENCTRLDVPLLVEVGSGENWDQAH >SSO_0066 polB, DNA polymerase II MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV PRAQHILQGKQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG VTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPL KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVAS RPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRLPLRLGRDN SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHTSP GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFY GVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTF VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ ELYLRIFRNEPYQEYVRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVP PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >SSO_4104 priA, primosomal protein N (factor Y), putative helicase MPVAHVALPVPLPRTFDYLLPEGMAVKAGCRVRVPFGKQQERIGVVVSVS DVSELPLNELKAVVEVLDVEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL PILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKI WRDQVATLEFNDAALQTLRKKGLCDLASETPEFSDWRTNYAVSGERLRLN TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ AGLAPALITRMRQHLQANNQVILFLNRRGFAPALLCHDCGWIAECPRCDH YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP LFPDVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP LFLQQLRNLILSSPLADDKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR LQHIINGTLALINTIPDSRKVKWVLDVDPIEG >SSO_4384 priB, primosomal replication protein N MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI DSGD >SSO_0454 priC, primosomal replication protein N'' MKTALLLEKLEGQLATLRQRCAPVSQFATLSARFDRHLFQTRATTLQACL DEAGDNLAALCHAVEQQQLPQVAWLAEHLAAQLEAIAREASAWSLREWDS APPKIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVE AYEARLARCRHALEKIENRLARLTR >SSO_2843 recA, DNA-dependent ATPase MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN EDF >SSO_2977 recB, DNA helicase ATP-dependent dsDNA/ssDNA exonuclease V subunit MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF PRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYEHL LEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY LQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS GIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKFSQRFLEDRTKA GGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRGELG FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA PGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK MWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVR ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ IWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEVERLAEDLR LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS PGTFLHSLFEDLDFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA PLNETGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC PPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA AMQAHRYDLQYQLYTLALHRYLRHRIADYDYDRHFGGVIYLFLRGVDKEH PQQGIYTTRPNAGLIALMDEMFAGMTLEEA >SSO_2979 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL SQKFSIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE TGHLVEGLGEAQAWQAPLWKALVEYTHQLGQPRWHRANLYQRFIETLESA TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLL ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED PTLTPRDIIVMVADIDSYSPFIRAVFGSAPADRYLPYAISDRRARQSHPV LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLSAFFL PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPE TMEAIVEQSQRFLLPLFRFNQS >SSO_2976 recD, DNA helicase ATP-dependent dsDNA/ssDNA exonuclease V subunit MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH VCLPLSRLENNEASHPLLATCVSEIGELQNWEESLLASQAVSRGDEPTPM ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPVSDE INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ LASVEAGAVLGDICAYANAGFTAERAGQLSRLTGTHVPAGTGTEAASLRD SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG VAGLNERIEQFMQQKRKIHRHSHSRWYEGRPVMIARNDSALGLFNGDIGI ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA LILPSQRTPIVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL AALFSSRG >SSO_3650 recF, ssDNA and dsDNA binding protein MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV EKGKITD >SSO_3753 recG, DNA helicase MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFS AAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT LHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP LSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDVPMMRLVQGDVGS GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFEPLGIEVGW LAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE LPPGRTPVTTVAIPDTRRTDIIDRVRHACMTEGRQAYWVCTLIEESELLE AQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVAT TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA >SSO_3045 recJ, ssDNA exonuclease MKQQIQLRRRAVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR SLGIPVIVTDHHLPGETLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQVEALTLCEKLERS RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEADKFELFQQRFG ELVTEWLDPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV QLAYKLDINEFRGNRSLQIIIDNIWPI >SSO_2772 recN, protein used in recombination and DNA repair MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV TKHHQQALEIARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKFD EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH QHYFVSKETDGAMTETHMQSLNKKARLQELARLLGGSEVTRNTLANAKEL LAA >SSO_2689 recO, protein interacts with RecR and possibly RecF proteins MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL EYETRFSELFFDYLHCIQSLAGATGTPEPALRRFELALLGHLGYGVNFTH CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE >SSO_3996 recQ, ATP-dependent DNA helicase MAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMPTG GGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNSTQT REQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEAHC ISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGLND PLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAKVE DTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGMGI NKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWLRR CLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCGNC DICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIRDY GHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAARP VLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIADES NVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMALIR AHVDGDDEE >SSO_0459 recR, recombination and repair MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR F >SSO_3949 rep, rep helicase, a single-stranded DNA dependent ATPase MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI AAVTFTNKAAREMKERVGQTLGRKEARGLMISTFHTLGLDIIKREYAALG MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPS QAAASAIGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE ERMQKGQSHLANLKAMMAAKRGK >SSO_3951 rhlB, putative ATP-dependent RNA helicase MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT RPRTGNGPRRTGAPRNRRRSG >SSO_0776 rhlE, putative ATP-dependent RNA helicase MSFDYLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG QGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRKP AAAQ >SSO_0228 rnhA, RNase HI MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG YQVEV >SSO_0195 rnhB, RNAse HII MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS EKRRLALYEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL DIVFPQYGFAQHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS >SSO_1504 rnt, RNase T MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR WKRLGGWPLPAAEEV >SSO_2443 rus, endodeoxyribonuclease RUS MNTYSITLPWPPSNNRYYRHNRGRTHISAEGQAYRDNVTRIIKNAMLDIG LAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVV KMPVTKGGRLELTITEMGNE >SSO_1280 ruvA, Holliday junction helicase subunit B MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARTDASSETLIREALR AAL >SSO_1281 ruvB, Holliday junction helicase subunit A MIEADRLISAGTTLPEDGADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP >SSO_1278 ruvC, Holliday junction nuclease MAIILGIDPGSRVIGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF EYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH CHVSQNAMQMSESRLNLARGRLR >SSO_2081 sbcB, deoxyribophosphodiesterase MMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAIRTDNEFNVIGEPEVFY CKPADDYLPQPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCIL GYNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPE GINWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTR QPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWRGNTSWVAPLA WHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKADLGDNAAVPVKL VHINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAI FAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVD KRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYADELQML AQQYADNKEKVALLKALWQYAEEIV >SSO_0374 sbcC, ATP-dependent dsDNA exonuclease MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL EKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLT RQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA EHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA AITLTLTADEVASALAQHSEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ NVTLEQTQRNVALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALR GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTTSLNITLQPQDDIQP WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL TGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ QLKQDADNRQQQQTLLQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG LTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTL SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK >SSO_0375 sbcD, ATP-dependent dsDNA exonuclease MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETTQTHQVDAIIVAGDVF DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQ QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA >SSO_2079 sbmC, SbmC protein MNYEITQEEKRTVAGFHLVGPWEQTVKKGFEQLMMWVDNKNIVPKEWVAV YYDNPDETPAEKLRCDTVVTVPNNFTLPENSEGVILTEISGGQYAVAVAR VVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYV AVQPKHH >SSO_0641 seqA, negative modulator of initiation of replication MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR VASPAIVEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI >SSO_3426 smf, Predicted Rossmann-fold nucleotide-binding protein MVDTEIWLRLMSISSLYGDDMVRIAHWLAKQSQIDAVGLQQTGLTLRQAQ RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLVTTDYPGALFVEG ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDG VAHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLLEHGGALVSEFPLD VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQRREVFAL PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS PDQQDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW IAAVPGGYVRLRRACHVRRTNVFV >SSO_P113 spa32, Spa32 MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC >SSO_2702 srmB, ATP-dependent RNA helicase MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE >SSO_4239 ssb, ssDNA-binding protein MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG GAQSRPQQSAPAAPSNEPPMDFDDDIPF >SSO_3841 tag, constitutive 3-methyl-adenine DNA glycosylase I MERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVL KKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR AYLQMEQNGEPFADFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALK KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYPGNKP >SSO_4014 tatD, Mg-dependent DNase MEYRMFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQ KLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDF NRNFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKL PGAVLHCFTGTREEMQACVAHGIYIGITGWVYDERRGLELRELLPLIPAE KLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAA TTDANVKTRFGIAF >SSO_1869 topA, DNA topoisomerase type I MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADS TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA CQMTPAKYDSTTLTIGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL PAVNKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI ISTIQDRGYVRVENRRFYAEKIGEIVTDRLEENFRELMNYDFTAQMENSL DLVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG WSAFYVDGKWVEGKK >SSO_1392 topB, DNA topoisomerase III MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ PDAYDSRYARWNLADLPIIPEKWQLQPRPSVTKQLNVIKRFLHEASEIVH AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERALDRLR SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA ISVHAPDLLLQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA KVYNLIARQYLMQFCPDVVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM TGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHS TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEVGSG AIA >SSO_2706 ung, uracil-DNA-glycosylase MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTI PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC NHFVLANQWLEQRGETPIDWMPVLPAESE >SSO_4238 uvrA, excision nuclease subunit A MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT ITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >SSO_0758 uvrB, DNA repair excision nuclease subunit B MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNEL GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL EFEEAAQIRDQLHQLRELFIAAS >SSO_1205 uvrC, excinuclease ABC, subunit C MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE FEEAARIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH >SSO_3986 uvrD, DNA-dependent ATPase I and helicase II MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD QSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW QDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV AFQGQGIKWLVAAYARLETV >SSO_2017 vsr, DNA mismatch endonuclease MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGSP DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD ISRLQELGWRVLIVWECALRGREKLTDEALTERLEEWICGEGASAQIDTQ GIHLLA >SSO_3777 waaP, lipopolysaccharide core biosynthesis protein MVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSYFLK WHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAFGEK GINPLSRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVATMV RDMHAVGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRVPRR WRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFVAPLKDILKQEQGLLSQAE AKATKIRERTIRKSL >SSO_2104 wcaH, GDP-mannose mannosyl hydrolase MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG GRVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFT THYVVLGFRFRVAEEELILPDEQHDDYRWLTPDALLASNDVHANSRAYFL AEKRAGVPGL >SSO_3984 xerC, site-specific recombinase, acts on cer sequence of ColE1, effects chromosome segregation at cell division MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ CDVTMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK >SSO_3047 xerD, site-specific recombinase MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA >SSO_2591 xseA, exonuclease VII, large subunit MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE SMQPAGEGLLQQKYEQLKAKLQAEGLFDQQYKKPLPSPAHCVGVITSKTG AALHDILHVLKRRDPSLPVIIYPTAVQGDDAPGQIVRAIELANQRNECDV LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRTGQQQQRLTQRLNQ QNPQPKIHRAQTRIQQLEYRLAETLRVQLSATRERFGNAVTHLEAVSPLS TLARGYSVTTATDGNVLKKVKQVKAGEMLTTRLEDGWIESEVKNIQPVKK SRKKVH >SSO_0399 xseB, exonuclease VII small subunit MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ AKLQQAEQRVQILLSDNEDTSLPPFTPDNE >SSO_1408 xthA, exonuclease III MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFR HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI RSMEKPSDHAPVWATFRR >SSO_0271 yafM, conserved hypothetical protein MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRNAIIKVKRDRPFEI NAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWE HAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDW AGDVTDINAGERIIL >SSO_0372 yaiD, conserved hypothetical protein MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK KTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDT LALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSL LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL KRLKFCDELRDQNEDIDREDFALRFDADFILMTGELAALIQNLIEGLGGE AQR >SSO_0430 ybaV, conserved hypothetical protein MRFSTIVSVVTLVWGISPRQPSGKNIIRWLLKKRTNGSVRYCQYTSVETK AEAPAAQSKAAVPAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIV SYREEYGPFKTVEDLKQVPGMGNSLVERNLAVLTL >SSO_0442 ybaZ, conserved hypothetical protein MLVSCAMRLHSGVFPDYAEKLPQEEKMEKEDSFPQRVWQIVAAIPEGYVT TYGDVAKLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTISLTGPDL QRQRQALLAEGVMVSGSGQIDLQRYRWNY >SSO_0259 ybfL, putative receptor protein MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI KTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF AVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDV PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI LTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS >SSO_0863 ybjD, conserved hypothetical protein MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL VRLMPVLRLRDARFMRRIRNGTVPNVPNMEVTARQLDFLARELSSHPQNL SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH PIMLSVAWHLLNLLPLQRIVTTNSGELLSLTPVEHVCRLVRESSRVAAWR LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR AD >SSO_0893 ycaJ, putative polynucleotide enzyme MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN SALLSRARVYLLKSLGTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI AELVNGDARRALNTLEMMADMAEVNDSGKRVLKPELLTEIAGERSARFDN KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR >SSO_1120 ycfH, conserved hypothetical protein MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRL LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF ARLFHIDASRLQSIR >SSO_1667 ydcC, putative receptor MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHST DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSN EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG RLNKAFEKKFPLTELNNPAHDSSAMSEKSHGREEIRLHIVCDVPDELIDF TFEWKGLKKLCVAVSFRSIIAEQKKEPEMMVRYYISSADLTAEKFATAIR NHWHVENNLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINVLTNDKVF KAGLRRKMRKAAMDRNYLASVLAGSGLS >SSO_1347 yeaB, conserved hypothetical protein MEYRSLMLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP >SSO_2240 yejH, putative ATP-dependent helicase MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV ARNIDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT ATPFRLGKGWIYQFHYHGMVRSDEKALFRDCIYELPLRYMIKHSYLTPPE RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAE KRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIENFKAQR FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL RPPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG >SSO_2312 yfaO, conserved hypothetical protein MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWALSGGGVESGERIEEAL RREIREELGEQLLLTEITPWTFSDDIRTKTYADGRKEEIYMIYLIFDCVS ANREVKINEEFQDYAWVKPEDLVHYDLNVATRKTLRLKGLL >SSO_2547 yffH, conserved hypothetical protein MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI LLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQTSHLMD >SSO_2753 yfiL, conserved hypothetical protein MMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVGMEDAISGSAIKDD DAFSDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSGKSFPASCNNVESA SQLHEVWQKGADENASTIRVMLPTY >SSO_2902 ygbF, putative inner membrane protein MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQL AGCGNVVMAWATNIESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >SSO_2987 ygdP, putative invasion protein MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE FASVVMSLQENTPKPQNTSAYRRKRG >SSO_3206 ygjF, conserved hypothetical protein MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ LKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ PQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLE KLVEAYRELDQALVVRGR >SSO_3301 yhbQ, conserved hypothetical protein MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >SSO_3403 yhdJ, putative methyltransferase MTMRTGCEPTRFGNEAKTIIHGDALAELKKLPTESVDLIFADPPYNIGKN FDGLIEAWKEDLFIDWLFEVIVECHRVLKKQGSMYIMNSTENMPFIDLQC RKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAIL VEAKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ KPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEY IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLIAK >SSO_3703 yhhF, conserved hypothetical protein MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD >SSO_0260 yhhI, putative receptor MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS DDKDVIAIDGKTLRHSYDKSRRKGAIHVISAFSTMHSLVLGQIKTDEKSN EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG RLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDF TFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF KAGLRRKMRKAAMDRNYLASVLAESGLS >SSO_3759 yicF, putative enzyme MMMKVWMAILISILCWQSSVWAVCPAWSPARAQEEISRLQQQIKQWDDDY WKEGKSEVEDGVYDQLSARLTQWQRCFGSEPRDVMMPPLNGAVMHPVAHT GVRKMVDKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNG LKGEDWTQKVSLISAVPQTVSGPLANSTLQGEIFLQREGHIQQQMGGINA RAKVAGLMMRQGNSDTLNSLAVFVWAWPDGPQLMTDRLKELATAGFTLTQ RYTRAVKNADEIARVRNEWWKAKLPFVTDGVVVRAAKEPESRHWLPGQAE WLVAWKYQPVAQVAEVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNI GSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFN SLTCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHI FSWLLLTPEQLQNTPGIAKSKSAQLWHQFNLARKQPFTRWVMAMGIPLTR AALNASDERSWSQLLFSTEQFWQRLPGTGSGRARQVIEWKENAQIKKLGS WLAAQQITGFEP >SSO_4169 yjaD, conserved hypothetical protein MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ GEPVWLVQQQRRHDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA MCRAEYE >SSO_4529 yjjV, Mg-dependent DNase MICRFIDTHCHFDFPPFSGDEEASLQRAAQAGVGKIIVPATEAENFARVQ ALAEKYQPLYAALGLHPGMLEKHSDVSLDQLQQALERRPAKVVAVGEIGL DLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRIHDKLAMHLKRHDL PCTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLA SLLLETDAPDMPLNGFQGQPNRPEQAVRVFDVLCELRPEPEDEIAEVLLN NTYALFSVSG >SSO_1353 yoaA, putative enzyme MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGCAR QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDLLL KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR >SSO_3103 yqgF, conserved hypothetical protein MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY >SSO_3172 yqiE, conserved hypothetical protein MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH HQALKNEWA >SSO_3294 yraN, conserved hypothetical protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI DLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >SSO_3528 yrfE, conserved hypothetical protein MSKSLQKPTILNVETVAHSRLFTVESVDLEFSNGVRRVYERMRPTNREAV MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEV GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV