GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Shigella boydii Sb227, Sb227
Gene type: CDS

Number of genes found: 647

# Shigella boydii Sb227, Sb227

>SBO_3903 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSHLGNISPAAFREKYHQMAA
>SBO_3338 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVEHRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRGYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1221 putative P4-type integrase
MSLNDSKIRNLKPSSKPVKLSDSHGLYLLVNQGGSRIWYLKYRFSGKESR
VSLGAYPLVSLAEARQRRDDIRKLLTQNINPAHQRMSDKAAASPEKYFKA
VALAWHKTNKKWSPDYAARILASMENHIFPAIGHLPVTTLKTQNFTALLR
VIEDKGFLEVASRTRQQLCNIMRYAVQQGFTENNLAQHLEGVTAPPVKNH
YPALPLERLPELFERIGDYQQGRQLTRLAVVLTLHLFIRSSELRFARWGE
IDFRNKIWTIPATREAIEKVRFSGRGAKMRTPYIVPLSRQAIAILKQIKE
ISGHLELVFPGDHNPYKPMSENTVNRALRLMGYDTKTDVCGHGFRTMACS
ALVETELWSRDTVERQMSHQERNSVRAAYIHRAEHLDARTAMMQWWSDYL
DVSREGYVAPYIYARRHKAA
>SBO_2658 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVAY
>SBO_1042 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCVKWHTEFGHLNRGDMLTSEQHRCSNEKKKF
>SBO_0732 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1707 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1134 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRHMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGECDLYLDG
>SBO_3956 iso-IS1 ORF2
MLTTDEWGSYTRELPKEKHLTGKIFTQRIERNNLTLRTRIKRLARRTICF
SRSVELHEKVIGAFIENICSTNWSHHPSDELSLDDYQTEYTAKDFYLILL
FFLK
>SBO_1382 IS2 ORF1
MIDVLGPEKRRRRTTQERIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_2632 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_2370 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_2765 conserved hypothetical protein
MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVA
CIMLEPGTRVSHAAVHLAATVGTLQVWVGEAGVRVYSSGQPGGARADKLL
YQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYA
LLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGY
APAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLA
CRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDS
GHRGRG
>SBO_2603 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_2716 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYR
QSEIFARQGVELSRALLSNLVDACCQLMTPLNDALYRYVMNSRKVHTDDT
PVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWFAYSPDHQGKHPEQ
HLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCWAHARRKVHDVYIST
KSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQSKPLLTSLYKLMQ
EKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVC
LGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRYILSVLPEWP
SNRVDELLPWNVALTNK
>SBO_2648 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_3691 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1741 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQIAA
>SBO_0632 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1708 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_3243 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCW
AHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQM
QSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAE
ADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPE
AYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_4345 IS629 ORF1
MVLESQGEYDSQWATICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_4354 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_1978 putative transposase
MGNKNDVMDARAIWMAVQQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTA
QINALHGTLLEFGETIHKGRAAMEREFPEALERMKERLPPYLIMVLENQY
NRLNELDSLIEDIEKQLTSVARQNETCKRLLDIPGVGPLIATAAVATMGE
ASAFKSGREFAAYVGLVPKQTGSGGKVRLLGISKRGDTYLRTLFIHGARA
VALVAKEPGPWITELKKRRPASVAIVAMANKLARTVWAITAHDRKYDRNH
VSIRPY
>SBO_1413 IS2 ORF2
MSISPLFRWPNSVCHFKPKNTAVRSPESNGIAESFVKTIKRDYISIMPKP
DGLTAAKNLAEAFEHYNEWHPHSALGYHSPREYLRQRACNGLSDNRCLEI
>SBO_0982 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1652 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SBO_3647 putative integrase
MSRCASRFRISSKLSATFRTTHKSHQARNPAGFFVIMGYHKDATKRMIMV
YTKVYPMRFGYIALGTPPIQSLDIRKMPKLTDMQIRAWIKSGERFEGRAD
GNGLYLRYREADKTPTWRFRYKLAGKSRAMLIGSYSELSLSKARETAKEL
SARVALGYDVAGEKQKRKTEALAKMEAEKNAMRVSELAAEYFERQILPRW
KHPDILRRRIDKDINPCIGSMKVEDVKPRHIDDMLKGIVDRGAPTIATDV
LRWTRRIFDYGIKRHALEINPCSAFEVADAGGKEAARDRWLTRDELIQLF
KAMRTAKGFSRQNEITFKLLLALCVRKMELCAARWEEFDLDGAVWHLPEE
RSKNGDPIDIPLPSPAVEWLRELHTFSCNSAWVLPARKMQNRMIPHIQES
TLPVALAKVRAEMPDVPNFTIHDFRRTARTHLAALGVDPVVAERCLNHRI
KGVEGIYNRHQYFDERKAALAQWADLLVALESGKDYNVTPLRRAN
>SBO_1315 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERP
>SBO_3567 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_0267 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2010 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_0936 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSI
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAR
>SBO_3245 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2501 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_1383 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2326 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_4371 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1356 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1550 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0855 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2958 IS2 ORF2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYG
>SBO_3715 IS600 ORF2
MTKELTGKALFMALRSQRPPAGLIHHTDRGSQYCAYDYRVIQEQSGLKTS
MSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNR
QRRHSRLGNISPAAFREKYHQMAA
>SBO_1867 IS600 ORF2
MLTTSDRGSQYCAYDYRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKN
ESLSHYRFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQI
AA
>SBO_1481 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGHIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0623 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1267 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>SBO_3611 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRPFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARHLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDILFAMMRDGTFYTPQGS
>SBO_2604 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2524 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_1700 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0638 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2195 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_3317 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_0253 IS629 ORF1
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLSYMNRPGNPETKL
PEKEVNRMTKNTRFSPEVRQRAIRMVLESQGEYDSQWAAICSIAPKIGCT
PETLRVWVRQHERDTGGGDGGLTTAKRQRLKELERENRELRRSNDILRQA
SAYFAKAEFDRLWKK
>SBO_1905 putative phosphohydrolase
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILKASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>SBO_0512 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_3651 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_1181 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNCHVRAGLL
>SBO_0989 IS2 ORF2
MLRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
HDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_4344 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_3354 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_3120 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_0645 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMGKRQRSTVLTLIYW
>SBO_1935 putative DNA adenine methyltransferase encoded by prophage
MESLPADCLEFIWSLPENSVDLIVTDPPYFKVKPEGWDNQWKGDDDYLKW
LEQCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFSVLNHII
>SBO_0745 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1653 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1148 conserved hypothetical protein
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVRPDRPISPQAMAI
HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM
KLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDI
MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWSFN
NLDSMSPELRLTLKHYLENT
>SBO_0637 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_4350 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGSTVKQY
WTSCWVRWNAASATIFRRLQWSG
>SBO_0961 IS600 ORF2
MAHIRTRETYGIRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_4324 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFLLTTSRSIAARPLWMVSPNSSN
VPCRALICAVRNFTSCMRVRCRTSTDCCSSVFTAISLPGC
>SBO_0251 putative phage integrase
MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSK
LGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITE
SKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHL
SFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPE
PLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVA
LNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWK
AALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYA
HLAPNHLTEHARQIDSILNPSVPNLSQSRNKERTNDV
>SBO_0643 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_4391 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_4157 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_0220 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_0993 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1549 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_4351 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1141 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_0854 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_3714 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELVDNGIIVGRDRLARLRKELRLHCKQKRKFRATTNSDHNLPVTPNLL
NQNFTPTAPNQV
>SBO_1209 IS911 ORF2
MFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGARSIATMA
TRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFA
VTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSPDSRLTM
KALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNC
WDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYN
GGLPPNESENRYWKNSNSVASFC
>SBO_3114 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2075 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_4346 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1696 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_2128 putative single stranded DNA-binding protein
MTAQIAAYGRLVDDPQVKQTSKGTPMTLARMAVSLPCSQAQDGQATLWLS
VIAFGKQADFLAKHQKGDVASVSGTMQVSQWTGQNGETRQGWQVIADSVI
SARAARPGGNRRKTTGTQGNQPPAGGDDPYGDDISF
>SBO_3423 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_0268 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0649 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_1268 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_0633 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_4398 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCW
AHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQM
QSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAE
ADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPE
AYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_2525 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1868 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0830 IS629 ORF2
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGHIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>SBO_0622 IS911 ORF2
MFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGARSIATMA
TRRGYQMGRWLAGRHMKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFA
VTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSPDSRLTM
KALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNC
WDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYN
GGLPPNESENRYWKNSNSVASFC
>SBO_1715 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_1018 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2023 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSCKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1698 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLNRPGNPGD
>SBO_3241 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_0983 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_4394 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRPFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARHLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_1547 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIIMPKPDGLTAAKNLAEAFEHYNEWH
PHSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_4147 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0467 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2371 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1637 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCW
AHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQM
QSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAE
ADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPE
AYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_1635 ISSfl4 ORF1
MEQKALSAEPRRSFSNEFKLQMVKLASQPGASVARIAREHDINDNLLFKW
LRLWQNEGRISRRLPVTTSSDTGVELLPVEITPDEPKEPVAALTPSLSTQ
TTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR
>SBO_1412 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWEPRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSI
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_3355 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAE
>SBO_1710 IS911 ORF2
MVTLCQVFGVHHSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQFEPPRVSWRVF
YL
>SBO_1191 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRHIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARHLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_1391 putative DNA-invertase
MLIGYVRVSTNDQNTDLQQNALVCAGCEQIFEDKLSGTKTDRPGLKRALK
RLQKGDTLVVWKLDRLGRSMKHLISLVGELRERGINFRSLTDSIDTSSPM
GRFFFHVMGALAEMERELIVERTLAGLAAARARGRTGGRRPKLTKEQHEQ
IARLIKNGHDRKQLAIIYGIGTSTIYRYHPVGDIQTEETTGQTQENENR
>SBO_2903 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGLRLCGV
HY
>SBO_1185 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRREKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAGDRAKSGVVR
>SBO_0693 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_1588 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_3146 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1608 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_2079 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2391 IS911 ORF2
MTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTM
KALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNC
WDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYN
GGFPPNESENRYWKNSNSVASFC
>SBO_3739 putative single stranded DNA-binding protein
MTAQISAYGRLVADPQTRTTTNGNNMAMARLAVSLPCNAAEAGESTFWLG
VIAFGKQADALAKHQKGDLVSVAGNMQLNQWTGQDGGTQQGYQVVADSVI
SARTVRPGGKAGQRGQATDALRRAQQPSRDEYDQRPPFDDETPF
>SBO_3237 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_3424 IS4 ORF
MFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAG
NPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLT
LMDKGYYSLGLLNARSLAGEHRHWMIPLRKGAQYEELRKLGKGDHLVKLK
TSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADL
YSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQ
MIKMAEHLKGYWPNQLSFSESCGMVM
>SBO_2088 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRHMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0751 putative DNA adenine methylase
MKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKEIRAFAEKA
QRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLA
SVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKS
ATEIIAVSGPRCWVGFDYSRGVDSSAVYGVRA
>SBO_2517 putative DNA replication factor
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>SBO_1415 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_4322 putative P4-type integrase
MALTDAKIRAAKPTDKAYKLTDGTGMFLLVHPNGSRYWRLRYRILGKEKT
LALGVYPEVSLSEARTKRDEARKLISEGVDPCEQKRAKKVVPDLQLSFEH
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVR
AAEAKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH
RPALDLSRLPELLSRIGSYKGQTVTQLAVMLNLLVFIRSSELRYARWSEI
DIDNAMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQAVAILTELQTW
AGENGLIFTGAHDPRKPISENVMNASHTIKHGAV
>SBO_2926 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIMNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALHPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNARSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>SBO_0266 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_3606 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1607 IS600 ORF1
MAALPKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARKGLGTPGS
RTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_0425 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_3874 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_0343 putative transposase
MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIG
MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH
KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLV
PKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
>SBO_4090 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1258 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAD
>SBO_1180 IS2 ORF1
MIDVLGPEKRKRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1203 conserved hypothetical protein
MVLHVSNTVNSTIFPSHHFANGRRMSQKLNAGQMNRPFFPFILRHHCMLM
YHSQSRSSFPVEYGCAVSRHNLPMSSGHSNMLSPDNVFIAIKPVDMRRGI
DSLTQYIQDELRSTWHEGAAFVFVNKVRSRIKVLRWDKHGVWLCTRRLHK
GSFRWPRANDAAWHLTPDEFNWLIAGVDWQQVKGHDLTKWVWQNEPELRP
ENTKNTLLTQ
>SBO_1670 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0624 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0960 IS600 ORF1
MSRKNQRYSKEFKAETVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_1260 IS629 ORF2
MMPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_3732 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1918 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1699 putative transposase
MLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLVPKQTGSGGKVRL
LGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKKRRPASVAIVAMA
NKLARTVWAITAHDRKYDRNHVSIRPY
>SBO_4397 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_1919 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_0240 ISSfl2 ORF
MTESSDYESIQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEAALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_1152 IS600 ORF2
MTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTS
MSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR
QRRHSRLGNISPAAFREKYHQMTAQKKNKW
>SBO_0642 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_0992 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANEARQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_0219 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_2334 putative regulator
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>SBO_2385 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_0729 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPVVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRNPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_4091 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_2025 IS600 ORF2
MALRSQRPLAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMSRKGNCYDNA
PMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQRRHSRLGNIS
PAAFREKYHQMTAQKKNKW
>SBO_3604 putative lipoprotein
MNLKKIFFSAVTVSVLCALTGCDYIEEGKPESSLLKQQEEHNNKIDLLEK
QQAQLKSQLETIQKQQTGIISSTKTLTHVIKSVKDQQNTFIFTEFNPAKT
KYFILNNGSVALAGRVLSIDATENGSVIHISLVNLLSTPISNIGFNATWG
GEKPVDAKEFARWQQLLFNTSMKSTLKLLPGQWQDINLTLKGVSPNNLGY
LKLAINMENIQFDNLPSAENRQKRSKK
>SBO_3843 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1705 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQIAA
>SBO_1151 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEPRLERDILKKATAYFAQESLKNTR
>SBO_0252 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_2078 IS600 ORF2
MSGNIGANPFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETY
GTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPV
APNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAM
GERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGL
KTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIF
YNRQRRHSRLGNISPAAFREKYHQMTAQKKNKW
>SBO_0430 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_0731 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERCFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAARLAP
>SBO_3692 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_4089 IS4 ORF
MSGNIGANPDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMM
VWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAV
RRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTH
AGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNT
LTLMDKGYYSLGLLNARSLAGEHRHWMIPLRKGAQYEELRKLGKGDHLVK
LKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMA
DLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVR
YQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLA
SMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>SBO_3440 putative transposase
MKYTPVGVDIAKHVIQIHFINEHTDEVVDKQLRRQDFLTFFGNREPCLIG
MEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIH
KGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLV
PKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
>SBO_1222 putative integrase
MAAALELSRLMGLRSQEVVQSAQSLRTWKQSLERGEPRLTVVFGTKGGRP
RETVILDAVAIRKALDNALVVAEDRHGRLIDKPDLKSAMKYWHSQASRLG
LTGTYSPHSLRYAWAQDAIRHYLAQGFSEKEALAMTAMDLGHGDGRGRYV
AQVYGRKDTD
>SBO_4342 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCW
AHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQM
QSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAE
ADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPE
AYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_4152 conserved hypothetical protein
MVARGFTGSETIVRDAVAKWRKGWNPPVTTAVRLPSVSRVSRWLMPWRIT
RDEENYASRFISLMCEKEPELKIAQQLALEFYRILKTQNKSQLSSWFTRV
HESGSAEFRRVAAGMEADAAAICEAISSRWSNGVVEGHVNRLKKMLKRQM
YGRAGFELLRQRVMSPLA
>SBO_3242 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_0354 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRNPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1711 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1150 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1426 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2181 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1184 IS911 ORF2
MTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTM
KALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNC
WDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYN
GGFPPNESENRYWKNSNSVASFC
>SBO_3145 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1516 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_2609 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_0650 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2419 IS4 ORF
MADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNL
VRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRD
LASMGQLVKYTDEKGKGLPESGKGEALEIPHSPEKEPVSCLTDWH
>SBO_1920 putative protein encoded within IS
MRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVS
SALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYC
EHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALYRYVMNSR
KVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWFAYSPDH
QGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCWAHARRKV
HDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQSKPLLT
SLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAE
RALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRYIL
SVLPEWPSNRVDELLPWNVALTNK
>SBO_0124 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>SBO_2568 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_2442 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_2625 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_1031 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_3316 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPVVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1135 IS911 ORF2
MKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGN
CWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEY
NGGFPPNESENRYWKNSNSVASFC
>SBO_2070 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1149 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0235 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_4442 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSI
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESEPFARKRA
>SBO_1259 ISSfl4 ORF1
MEQKALSAEPRRSFSNEFKLQMVKLASQPGASVARIAREHDINDNLLFKW
LRLWQNEGRISRRLPVTTSSDTGVELLPVEITPDEPKEPVAALTPSLSTQ
TTVSASSCKVEF
>SBO_1416 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_3353 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_1044 IS2 ORF2
MEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKR
DYISIMPKPDGLTAAKNLAEAFANHLAVEINFIAIFIVNFSRICCASN
>SBO_1605 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1261 IS629 ORF1
MTKNTRFSPEVRQRAIRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>SBO_0639 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_4372 IS629 ORF1
MVLESQGEYDSQWATICSIAPKIGCTPETLRVWVRQHERDTGGGDEGLTT
VERQRLKELERENRELRRSNDILRQASAYFAKAEFDRQ
>SBO_1934 putative DNA adenine methyltransferase encoded by prophage
MAPLISYFRDARAALGITAKQIADATGKKNMVSHWFSASQWQLPNESDYL
KLQALFARVAEEKHRRGELEKLHHQLVDTYTSLNRQYAELLSEYKHLRRY
FGVTVQVPYTDVWTHKPVQFYPGKHPCEKPAEMLQQIISASSRPGDLIAD
FFMGSGSTVKAALALGRRAIGVELETERFEQTVREVQDLVSQNG
>SBO_1636 putative protein encoded within IS
MNQKYLIRIAELECQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDERE
IEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDD
PQVPRQLRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAE
QLELVSSALKVIRTERVKKACTKCD
>SBO_3025 putative transposase
MNNNNTLYVGLDIHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SBO_0236 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0977 putative integrase for prophage
MKSRVYSWGYERGYVKANPCAGVSKFKAKNRERYVTDKEYQAVLSVAPLP
VFIAMEIAYLCAARVSDVLSLKWEQIGNDGIFIQQGKTGKKQIKAWSPRL
QAAIEKAKQLPTSAYVISNQYGNRYMYKGFNEMWVEARNRAGKISGILTD
FTFHDLKAKGISDYEGSSRDKQLFSGHKTEGQVLIYDRKVKVSPTLDVPL
PENIPSNSTCDFCH
>SBO_0932 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2882 conserved hypothetical protein
MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDDPATTAPKAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTVWGIIRHWLPASGEKMRKAPILFHYTNLAEGMTEQR
LETDVYVPLA
>SBO_1480 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1265 putative protein encoded within IS
MTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSE
RLAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNF
SDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCR
LNGIDPEAYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_1227 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIKLVRSLRAGNLSAVYVPGIENEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKSRKIYPAPSLTEHGMLNSGFVRGIENFRPKERMSILQLLLLH
VSWRVLSGIWAE
>SBO_2069 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISGKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0829 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFARAEFDRLWKK
>SBO_4154 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1262 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_1816 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2081 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_4396 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1112 IS911 ORF2
MAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNS
PMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLP
PNESENRYWKNSNSVASFC
>SBO_1917 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_0834 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_4174 putative P4-type integrase
MKLNARQVETAKPKDKTYKMADGGGLYLEVSAKGSKYWRMKYRRPSDKKE
DRLAFGVWPTVTLAQARAKRDQAKKLLAQGIDPKAEQKEAQAENSGAYSF
ETIAREWHASNKRWSEDHRSRVLRYLELYIFPHIGSSDIRQLKTSHLLAP
IKKVDASGKHDVAQRLQQRVTAIMRYAVQNDYIDSNPASDMAGALSTTKA
RHYPALPSSRFPEFLARLAAYRGRVMTRIAVELSLLTFVRSSELRFARWD
EFDFDKSLWRVPAKREEIKGVRYSYRGMKMKEEHIVQLSRQAMILLNQLK
QISGDKELLFPGDHDATKVMSENTVNSALRAMGYDTKTEVCGHGFRTMAR
GALGESGLWSEDAIERQLSHSERNNVRAAYIHTSEHLDERRLMVQWWADY
LQANKGKTITPYDFAKINKSR
>SBO_3072 ISSfl2 ORF
MTESSDYKSVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARHLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_2904 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2214 IS4 ORF
MDKGYYSLGLLNARSLAGEHRHWMIPLRKGAQYEELRKLGKGDHLVKLKT
SPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADLY
SHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQM
IKMAEHLKGYWPNQLSFSESCGMVGNAANLLI
>SBO_3884 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSESVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMNKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
TMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SBO_0355 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_4220 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2016 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIMNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>SBO_0743 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRHMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_2067 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_3826 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_1714 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAAQKKNKW
>SBO_0237 putative protein encoded within IS
MNQKYLIRIAELECQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDERE
IEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDD
PQVPRQLRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAE
QLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARV
LTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALYR
YVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWF
AYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAE
>SBO_2633 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_3713 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_4395 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0631 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_0468 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_4148 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_3759 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_0699 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_4392 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_3740 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIMNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNARSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>SBO_1254 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFTPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPLAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMPA
>SBO_2076 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCW
AHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQM
QSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAE
ADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPE
AYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_1706 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_1019 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_1223 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0356 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_2074 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_1264 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAE
>SBO_0987 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPEGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0926 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>SBO_3787 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_4079 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGPCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKRQENLPRPVIDRAWDAQLRLCKRYRKLQAKGKNVNITIVAVA
RELAGFIWDMGRIAMSVAQQPQCHK
>SBO_3356 putative protein encoded within IS
MTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSE
RLAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNF
SDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCR
LNGIDPEAYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_0838 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1712 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRL
>SBO_2080 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_0238 putative protein encoded within IS
MTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSE
RLAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNF
SDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCR
LNGIDPEAYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_2631 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGAEVA
>SBO_2327 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1257 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLLEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_3318 IS4 ORF
MSGDSGGQSRDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALE
RKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAVRRVFTKTAQL
WHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAGNPALHPQV
KMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYS
LGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKK
WPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIE
LGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHL
KGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLASMGQLVKLPT
RRGRAFPRVVKERPWKYPTAPKKSQSVA
>SBO_2534 conserved hypothetical protein
MNVEGMATGGIHMELHCPKCQHVLDQDNGHARCPSCGEFIEMKALCPDCH
QPLQVLKACGAVDYFCQHGHGLISKKRVEFVLA
>SBO_1815 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPATFREKYHQMAA
>SBO_1290 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_0431 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPLAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAAQKKNKW
>SBO_0646 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_4334 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_0426 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_3893 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1410 putative phage-related protein
MSEINYQALRERYSPVPVPKCPICGEEMSIQRISGAQVVYACSGYGDDGD
FKIGRTLADEHYEKSRVTVLDVGDPEVLALLDWLETKDNRIAELEKIATD
YALKFQKAQDALKHAALLHSRTAQQTNNFAVSLPDISEYFINDVFQPLRY
ERDVERAIIKAGGKALWQEKHEDRTHQSCDVNCGWFSPLTTDKNNT
>SBO_1938 putative integrase
MSPRPRKNSTDVAGLYEKFDRRTGRVYYQYKNPVTGKFHGLGTDKGKAEK
IASTANQRIAAAEAEYFMRKIDESPSATKRRGIRLKAWVDRYLKIQDTRL
KNGDIAATTHKEKTRMAAYLVSRLGNHPLKELEVRDFALILDEWLDKDMV
STARVNRGLWVDIYKEAQHAGEVPPGWNPPEATRKPIPKVTRARLTMEDW
QKIYNATPEKHFIRNAMLLAIVTGQRRDDICHMRFSDVWNEHLHITQGKT
GMRLALPLTLRCDAIGITLKEVIDGCRDRILSPYLIHSRHQKQPKPMSKD
NLSDYFAKARDLAGIFHQQEKLRQHFMNNALYQNGCTVHRVSIQKHY
>SBO_2902 IS629 ORF2
MAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGS
QYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWK
NRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1224 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2071 IS911 ORF1
MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST
MTRWVKQLRDERQGKTPKASPITPELNRPGNPGD
>SBO_1927 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2715 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_4155 putative protein in IS element
MIWQPEFTDKTLSRKPGAVQYDVLRQYVLMPGKVHADDIPVPVQEPGSGK
TRTARLWVYVRDDRNAGSQMPPAVWFAYSPDRKGIHPQNHLAGYSGVLQA
DAYGGYRALYESGRITEAACMAHARRKIHDVHARVPTDITTEALQRIGEL
YTIEAEVRGCSAEQRLAARKARAAPLMQSLYDRIQQQMKTLSRHSDTAKA
FAYLLKQWKALNVYCSNGWVEIDNNIAENALRGVAVGRKNWLFAGSDSGG
EHAVVLYSLIGTCRLNNVEPEKWLRYVIEHLQDWPANRVRDLLPWNEVYW
Q
>SBO_1735 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
LDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_1263 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCD
>SBO_0231 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQIAA
>SBO_0744 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1716 putative transposase
MSKLPTGVEIRGKYIRIWFMFRGKRCRETLKGWEITNSNIKKAGNLRALI
VHEISSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTT
NTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPR
SNKKGRTVRTVDSYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPD
PLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDFEKGIV
NVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEIT
FYHREYGRTEKQKLHFVFMPRVCNEKQKPYYSVSSLGARWNAAVKRAGIR
RRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMND
EQIAMLNARLS
>SBO_3731 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_3892 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPVVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_3422 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1204 ISSfl3 orfC
MNIRIWSGILPCMDISALNTTNDIEKLRAMALAMVQEVMSENAEKERELL
EKSRRIQLLEEMLKLVRQQRFGKKCETLAGMQRSLFEEDVDADIAALTAH
LDKLLPQSPEEDEKASRSRPIRKPLPVHLPRVEKMIQPDTDHCPECDEPL
HYIRDAVSEKLEYIPAHFVVNRYVRPQYSCPCCQKVFSGEMPAHILPKSA
VEPSVIAQVIINKYGDHLPLYRQQQVFARSDVGLPVSSMADMVGAAGAAL
SPLAALLHRELINRPVVHADETTLKILNTKKGGKSCSGYLWAYVSGERTG
PSVVCFDCRTGRSHEYPENWLQGWGGTLVVDGHKAYRTLANKVPEITLAG
CWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKIRQWR
QRYARPILEELWSWLEEQEPQCSPGKALHKAIAYALSHRVELSRFLEDGA
VPLDNNVCERAIKNVVLGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLE
PHAWLTDVLMRLPEWPEERLAELLPLEGFTFSG
>SBO_1043 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSEQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSQWNASSGV
>SBO_0931 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAAQKKNKW
>SBO_2957 IS2 ORF2
MPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDG
FEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERR
FGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIA
ESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPRE
YLRQRACNGLSDNRCLEI
>SBO_2386 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1074 putative virulence protein
MRRFAGACRFVFNRALARQNENHEVGNKYIPYGKMASWLVEWKNATETQW
LKDAPSQPLQQSLKDLERAYKNFFQNRAAFPRFKKRGQNDVFRYPQGVKL
DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGKWYISIQTESEVS
TPVHPSASMVGLDAGVAKLATLSDGTAFEPVNSFQKNQKKLARLQRQLSR
KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK
VSNMSKSAAGTVSQPGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWRGG
QVFAVPPACTSQRCACCGHTAKENRLSQSKFRCQVCGYTANADVNGARNI
LAAGHAVLACGEMVQSGRPLKQEPTEMIQATA
>SBO_3842 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_0644 putative protein encoded within IS
MLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALY
RYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVW
FAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCW
AHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQM
QSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAE
ADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPE
AYLRYILSVLPEWPSNRVDELLPWNVALTNK
>SBO_0730 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1280 putative enzyme
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGSAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>SBO_4349 IS2 ORF2
MEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKR
DYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNG
LSDNRCLEI
>SBO_2420 IS4 ORF
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQA
>SBO_1330 conserved hypothetical protein
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVELDESQQQALV
RELNEELGIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALQYPLAPTDIPLLEAFMALRAARAAD
>SBO_3115 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_2024 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_0750 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_0927 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AKRQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_4153 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1067 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLKLHGISHGSAGAR
SIATMATRRGYQMGRWLAGRHMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_1142 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_2567 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2499 putative protein encoded within IS
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYR
QSEIFARQGVELSRALLSNLVDACCQLMTPLNDALYRYVMNSRKVHTDDT
PVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWFAYSPDHQGKHPEQ
HLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCWAHARRKVHDVYIST
KSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQSKPLLTSLYKLMQ
EKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVC
LGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRYILSVLPEWP
SNRVDELLPWNVALTNK
>SBO_2182 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELLRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_3238 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_4151 putative protein encoded within IS
MVSAKNETAVEIPGKLVAEKMKTLSRHSELAKAFAYALNQWPALTYYAND
GWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLND
VDPESYLRHVLGVIADWPVNRVSELLPWRIALPAK
>SBO_2608 IS2 ORF2
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_2481 putative transposase
MNNNNTLYVGLDVHKESITVAYAINSEPVELMGKIGTSPTDIQNLCKRLR
SKSSQVSIVYEAGLCGYGLYRRLVKSGFDCMVCAPSLIPKKPGERVKTDR
RDAIRLVRSLRAGDLSAVYVPGIEDEAFRDLARAWASARDDLRHARQRLK
SFLLVHGVHYVGRADWGPAHRRWLSKYSFESPWRQLAFDEHRRTIEDRQA
QCERLESALKEAVTEWRLYPVVEALQAMRGIQFITAVGLISELGDLTRFE
HPRQLMSWFGITPSEYSSGGSRHQGSITKAGNSYARKLLVEAAWSYRHPA
RISPAIQKSRKIYPAPSLTEHGMLNSGFVRGIENFRPKERMSILQLLLLH
VSWRVLSGIWAE
>SBO_0230 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_1695 ISSfl4 ORF3
MKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEE
RLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEF
CRDGWVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCK
QNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>SBO_1255 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>SBO_2068 IS911 ORF2
MVTLCQVFGVHRSSYRYWKKRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_2131 conserved hypothetical protein
MKSAPNLKKQPYDKMTEVIIFAGSDAWAHAKQWQEQDGRLAGDNVPPVVL
ADDQLDELADLRIIDEGRYCVRLYKAGHIRPSNINAIAHKLAAAGVTDAN
YYPEGMHSHMRENWREYLERVRGKEPVEEKNHQRKTTLPMSVGSTGYDTQ
LDYVVKGIIPAVSLCSIYGASGSYKSFLAGSWACHVATGRQWGGRRVAHG
AVLYVVGEGGIGVPRRVKAWEVVHDEQVKNLYLVNRPIFPAAPLDVDEMV
IAARQVERETGKPVRMIILDTLARCFGGNDENDSRDMGAFIRGCDELKRR
TGATVLVVHHSGKDETKGARGSSAFRASLDAEYRIRREDAGSKALVISCT
KMKDAEELKEAAYDLRVVELFTDADGELITSLVVVDDPRPPVELERIEEA
GNKTENHTALWGCIRSRTQNGDKCTIPLLRDDMKKLGYEMKNFRRWLYKL
EKDGVISIDGDDVAPL
>SBO_1929 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_1638 IS911 ORF2
MERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPP
NESEKRYWKNSNSVASFC
>SBO_3337 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARHLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_1742 IS600 ORF1
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKQQRILHRSR
>SBO_1697 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_2383 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARHLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>SBO_1349 putative excinuclease subunit
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQTIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE
EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LEDATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>SBO_3503 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDDRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGQVSCQQPTHRYKRGGHEYVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGFPPNESENRYWKNSNSVASFC
>SBO_2500 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPLAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVVSVNGAPY
>SBO_2526 IS629 ORF1
MVLESQGEYDSQWAAICSIAPKIGCTPETLRVWVRQHERDTGGGDGGLTT
AERQRLKELERENRELRRSNDILRQASAYFAKAEFDRLWKK
>SBO_1182 IS2 ORF2
MITVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSP
VEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKR
DYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNG
LSDNRCLEI
>SBO_1225 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_1242 IS4 ORF
MIPLRKGAQYEELRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVT
RKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLT
LRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGM
VMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWK
YPTAPKKSQSVA
>SBO_2498 putative protein encoded within IS
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>SBO_1110 IS911 ORF2
MVTLCQVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLKLHGISHGSAGAR
SIATMATRRGYQMGRWLAGRHMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTCIWTGKRWAYLAVVLDLFAIKPVGWAMSFSP
DSRLTMKALEMG
>SBO_2134 putative CP4-57-type integrase
MARKTPPLTTVQIKAARPAEKEYTLQDGGGLFLLVKPSGSKLWRFSYYRP
SDKKRILLSFGSLDDVSLADARKRRSEYRALISAGTDPQGHEKKKREAEA
RRQGNTFENVAAAWYQVKISQNLAPNTIKDIWRSLDKYVFPFIGNTPIDT
LTARRFVEVLTPIKERGNLETLKRVLQRVNEVMDYAANSGLIDANPAMNV
RKAFPSPVKKHMPTIRPEQLPELMQALSVSATERQTRLLIEWQLLTVTRP
AEASSTRWDEINLDAKQWTIPAGRMKMRRDHVIPLSGQAMAVLEAMKPIS
HHRNYVFPSLKDPQQPMNSQTANAALRRMGFAGVLVSHGLRAIFSTAANE
EGFEPDVIEAALAHVDTNEVRRAYNRSNYIEKRIVLMRWWGEFVEAAATG
VTLASGKRGIRAV
>SBO_0634 IS911 ORF2
MFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKNE
WMPVVGYVSFSEAVMTPTY
>SBO_1736 IS2 ORF1
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVTAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>SBO_1357 IS629 ORF2
MPLLDKLRKLYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEILRVYDGNHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>SBO_0357 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGQCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>SBO_0837 IS2 ORF2
MLRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPREYLRQRACNGLSDNRCLEI
>SBO_4156 IS600 ORF2
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGHCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAD
>SBO_2094 ada, O6-methylguanine-DNA methyltransferase; transcription activator/repressor
MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE
NVSFYANASEALAAGFRPCKRCQPDKANPRQHRLDKITHACRLLEQETPV
TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV
TTSILNAGFPDSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR
CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI
ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP
KAVRAVASACAANKLAIVIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE
NEER
>SBO_0895 alkA, 3-methyl-adenine DNA glycosylase II, inducible
MYTLNWQPPYDWSWMLGFLAARAVSGVETVADDYYARSLAVGEYRGVVTA
IPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGKLGA
ARPGLRLPGCIDAFEQGVRAILGQLVSVAMAAKLTAKVVQLYGERLDDFP
EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGSLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDGTDEL
>SBO_2095 alkB, AlkB
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV
APGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHDLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
TDCRYNLTFRQAGKKE
>SBO_3374 dam, DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH
FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>SBO_1718 dbpA, ATP-dependent RNA helicase
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAVISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV
AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA
QRANIISDMLQIKLNWQTPPANSSIVPLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKT
CRVRLLK
>SBO_1047 dcm, DNA cytosine methylase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPPAHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMAKGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFVAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>SBO_3220 deaD, inducible ATP-independent RNA helicase
MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ
AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT
RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP
GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT
ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN
EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL
REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI
GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR
LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ
GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV
GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL
PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFGGERR
EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>SBO_0687 dinG, probably ATP-dependent helicase
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ
EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDEVLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRFEPSIDNEEQHIAEMAAFFREQVESKKHLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>SBO_3675 dnaA, DnaA
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW
VRDKYLNNINGLLTSFCGADAPQLRFEVGTKPVTQTPQAAVTSNVAAPAQ
VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA
RAAARQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY
MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF
FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE
TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT
GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTIHSLPEIGDAFGGRDHTTVLHACRKIEQLREE
SHDIKEDFSNLIRTLSS
>SBO_4065 dnaB, replicative DNA helicase
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD
VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV
GGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQ
GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM
LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT
MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV
PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE
>SBO_4421 dnaC, chromosome replication; initiation and chain elongation
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>SBO_0172 dnaE, DNA polymerase III alpha subunit
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERVKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGVNAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYASGLAISLTDRQIDDQLLNR
LRQSLDPTALGQFQYISTIRGRMHARGCVLARRGVSLRAIVY
>SBO_2924 dnaG, DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQYLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>SBO_3676 dnaN, DNA polymerase III beta-subunit
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>SBO_0204 dnaQ, DNA polymerase III epsilon subunit
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV
YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA
AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA
RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA
TIQRIVRQASKLRVVFATDEELAAHEARLDLVEKKGGSCLWRA
>SBO_0370 dnaX, DNA polymerase III tau and gamma subunits
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVPSALEKAP
AKKEAYRWKATTPVMQQKEVVATPKALKKALEHEKTPELAAKLAAEAIER
DPWAARVSQLSLPKLVEQVALNAWKEESDNAVCLHLRSSQRHLNNRGAQQ
KLAEALSTLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARESIIA
DNNIQTLRRFFDAELDEESIRPI
>SBO_3045 endA, DNA-specific endonuclease I
MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGC
KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQD
GGRKNCAKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ
CAMKVDFKEKAAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNK
MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>SBO_2679 exo, 5'->3' exonuclease
MRGLFPISHPAVACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR
PPMPEELHDEMPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>SBO_4362 fimB, FimB
MKNKADNKKRNFLTHSEIESLLKAANTGPHAARNYCLTLLCFIHGFRASE
ICRLRISDIDLKAKCIYIHRLKKGFSTTPPLLNKEVQALKNWLSIRTSYP
HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRHSCGFA
LANMGIDTRLIQDYLGHRNIRHTVWYTASNAGRFYGIWDRARGRQRHAVL
>SBO_4363 fimE, FimE
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY
QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTLERANWKGADRTD
AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG
ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>SBO_3255 fis, site-specific DNA inversion stimulation factor
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>SBO_2065 gyrA, DNA gyrase, subunit A
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>SBO_3678 gyrB, DNA gyrase subunit B
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE
ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP
QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG
VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI
GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE
GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE
QQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKI
LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD
VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE
AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR
YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD
VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL
LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL
WETTMDPESRRMLRVTVKDAIAADLLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>SBO_2269 helD, DNA helicase IV
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYFIPFNQLLAIHCKR
GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS
GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD
NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA
QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG
RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT
AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR
RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL
AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGERCDLDT
TYRFNSRIGKVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL
DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ
QADYVIIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL
TRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>SBO_0046 hepA, probable ATP-dependent RNA helicase
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHMPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLLDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>SBO_1381 himA, integration host factor, alpha subunit
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>SBO_2200 himD, integration host factor (IHF), beta subunit
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>SBO_0504 holA, DNA polymerase III delta subunit
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VAARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQTQLRQ
AVQLLTRTELTLKQDYGQSVWAELEGLSLLLCHKPLADVFIDG
>SBO_1964 holB, DNA polymerase III delta prime subunit
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYTVTWLSREVTMSQDALLAALRLSAGSPSA
ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>SBO_4181 holC, DNA polymerase III chi subunit
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>SBO_4433 holD, DNA polymerase III psi subunit
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND
>SBO_1674 hrpA, helicase, ATP-dependent
MLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP
EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR
GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM
VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR
RPDLKIIITSGTIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD
TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE
ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA
RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS
RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL
EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM
IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ
KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY
REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM
VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT
EKVTVYGLPIVAARKVNYSQIDPALCRELFIRHALVEGDWQTRHAFFREN
LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRISHDVISARHFDSWWKK
VSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGA
DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV
PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL
KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI
WSFGQLPESYEQKRGNYKVKAWPALVDERDSVAIKLFDNPLEQKQAMWNG
LRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDQ
LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR
LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR
LEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIE
ELRVSYFAQQLGTPYPISDKRILQAMEQISG
>SBO_0137 hrpB, helicase, ATP-dependent
MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLEELLNEKPGDTVGYRMR
AQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
RYLPLPAHQRFDDAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASR
IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFDPRTGLTRLITQRVSQASMTQRAGRAGRLEPGICLHL
IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPTVN
LLAAKRLLQMLGALEGERLSAQGQKMAALGNDPRLAAMLVSAKNDDEAAT
AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLAGAFADRIARRRGQDGRYQLANGMGAMLDADDALSRHEWLIAP
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW
RRLQIGLLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAAKWLPEYDWPAVDDESLLATLETWLLPHMTGVHSLRGLKSLDIYQ
ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGRVPLVLELLSPAQSPLQITRDLSAFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>SBO_4021 hupA, DNA-binding protein HU-alpha
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>SBO_0334 hupB, DNA-binding protein HU-beta
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>SBO_4299 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1298 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3568 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1073 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3561 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3895 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0466 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3454 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1278 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2986 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2466 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHCQ
>SBO_1163 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0352 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4357 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0096 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3718 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGWKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4038 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1959 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0464 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1429 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4248 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0063 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4236 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1655 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2384 insB, IS1 ORF2
MITDVWKYRGKSTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH
YLNIKHYQ
>SBO_0007 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWDYA
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWLLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2342 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4135 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYIQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3706 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0258 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2115 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1169 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1111 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0674 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3563 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1326 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2230 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2738 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1901 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0746 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1682 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2876 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLAILERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2030 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4082 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1891 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLLFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1667 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4095 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4193 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1477 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1039 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2528 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3441 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2993 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3911 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1752 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1266 insB, IS1 ORF2
MTLTHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWL
FYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMTDGWPLYESR
LKGKLHVISKRYTQRIERHNLNLRQHLASLGRKSLSFSKSVELHDKVIGH
YLNIKHYQ
>SBO_0407 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1610 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1373 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQRGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0216 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1370 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0021 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPASDVIVCAEMDEQWDYA
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1293 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0211 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2757 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2939 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1598 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2163 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1587 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKCYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1807 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2288 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3010 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKYYQ
>SBO_1778 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1274 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0576 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0967 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4184 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1183 insB, IS1 ORF2
MITDVWKYRGKSMIVCVKMDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHV
FGERTLTTLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQR
IERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SBO_0234 insB, IS1 ORF2
MIVCAEMDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL
LSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SBO_3548 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0402 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1029 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
GLHDKVIGHYLNIKHYQ
>SBO_1791 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLLFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1739 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3244 insB, IS1 ORF2
MIVCAEMDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL
LSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SBO_0974 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1353 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3660 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGWKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3352 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHIFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKVSVN
>SBO_0130 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2213 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0207 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2062 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0934 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1233 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKEKLHVISKRYTQRIERHNLNLRQHLARLGRMSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2906 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3603 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLNVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2187 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1360 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0428 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0013 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4447 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTHRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3415 insB, IS1 ORF2
MASDVAPVHALWALASTRFYVIKKLRPQSVTSRIQPGSDVIVCAEMDEQW
GYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVV
WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFS
KSVELHDKVIGHYLNIKHYQ
>SBO_1730 insB, IS1 ORF2
MSRQCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLTTLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3483 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLAAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0591 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1751 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1339 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLASLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4333 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2917 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGWKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2159 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1129 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3889 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1617 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2643 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4415 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0625 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1591 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2655 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2575 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3124 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1207 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1003 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1606 insB, IS1 ORF2
MSGNIGANPVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRIR
RTVVAHVFGERTLTTLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVI
SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SBO_4246 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3587 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3668 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3830 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4186 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1457 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1914 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1006 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0771 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2701 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0520 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3918 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1825 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKLV
ELHDKVIGHYLNIKHYQ
>SBO_3396 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1936 insB, IS1 ORF2
MIVCAEMDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL
LSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>SBO_0635 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1874 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3488 insB, IS1 ORF2
MASDVAPVHALWALASTRFYVIKKLRPQSVTSRMQAGSDVIVCAEMDEQW
GYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVV
WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFS
KSVELHDKVIGHYLNIKHYQ
>SBO_4097 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLNVISKCYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2072 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0434 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSNVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_3955 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4348 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4162 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0380 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKYYQ
>SBO_0724 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSNVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0127 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0565 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1865 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLLFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2683 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0033 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1678 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2387 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQ
>SBO_1170 insB, IS1 ORF2
MSRQCTHYGRRPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2863 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0941 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2847 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0264 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4062 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0413 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4275 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLNVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_0657 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_1021 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_2306 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSNRW
SCMTRSSGII
>SBO_0831 insB, IS1 ORF2
MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK
SVELHDKVIGHYLNIKHYQ
>SBO_3105 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLPAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>SBO_4164 intB, prophage P4 integrase
MSLLVKPGGSKYWRFRFRFGGKQHLMAFGVYPDVSLADARKKREEARKLV
AAGIDPREHKRAVKEEQAKEIITFEKVAREWLVTNQKWSEEHANRVKKSL
EDNIFPAIGSHNIAELGTRDLLIPIKAVEKSGRLEVASRLQQRTTAIMRY
AVQSGLIDYNPAQEMAGAVASSNRQHRPALELKRIPELLQKIDEYTGRPL
TRWATELTLLIFIRSSELRFARWSEIDFETSMWTIPPEREPIPGVKHSQR
GAKMRTPHLVPLSKQALAILKQIKQFCGEHELIFIGDHDPSKPMSENTVN
SALRVMGYDTKVEVCGHGFRTMACSSLIESGLWSKDAVERQMSHMERNSV
RAAYIHKAEHLDERKLMLQWWADFLDANREKGITPFDYAKINRGNGE
>SBO_2435 lig, DNA ligase
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELCELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
VPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLEQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKLAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDEAEMLRLLGS
>SBO_1947 mfd, transcription-repair coupling factor
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ
NALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQDIISSRLSTLYQLPT
MQRGVLIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGY
RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ
RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK
GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTL
ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTKHLPTKAAN
ANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDKVRNLALICESDLL
GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE
AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW
SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE
TTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK
QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVA
EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD
ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL
ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR
GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL
EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ
QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF
GLLPDPARTLLDVARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL
IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA
>SBO_2723 mutH, methyl-directed mismatch repair
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALAGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWTPNEEE
DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>SBO_4286 mutL, enzyme in methyl-directed mismatch repair
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG
AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA
SISSVSRLTLTSRTAEQQEAWQAYAEGRDMDVTVKPAAHPVGTTLEVLDL
FYNTPARRKFLRTEKTEFSHIDEIIRRIALARFDVTINLSHNGKIVRQYR
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL
AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD
VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRS
IPENRVAAGRNHFAEPAAREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKLKAPEPQEPALAANSQSFGRVLTIVHSDCALLE
RDGNISLLALPVAERWLRQVQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL
EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS
VFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLL
QSVDLHPAIKALKDE
>SBO_3637 mutM, formamidopyrimidine DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>SBO_2787 mutS, methyl-directed mismatch repair
MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEDVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>SBO_0087 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV
VRELQEEVGITPQHFSLFEKLAYEFPDRHITLWFWLVESWEGVPWGKEGQ
PGEWMSLVGLNADDFPPANEPVIAKLKRL
>SBO_3029 mutY, adenine glycosylase
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP
YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA
VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS
LCPLQNGCIAAANNSWSLYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP
PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD
IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>SBO_0573 nei, endonuclease VIII and DNA N-glycosylase with an AP lyase activity
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVGT
RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT
ILLYSASDIEMLRPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLEIPRFSYATRGQVDENKHHGALFRFKVFHRDGELCERCGGIIEKTTL
SSRPFYWCPGCQH
>SBO_4019 nfi, endonuclease V
MIMDLASLRAQQIELASSVIREDRLDKDPPDLIAGADVGFEQGGEVTRAA
MVLLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPD
LVFVDGHGISHPRRLGVASHFGLMVDVPTIGVAKKRLCGKFEPLSSEPGA
LAPLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRL
PEPTRWADAVASERPAFVRYTANQP
>SBO_2168 nfo, endonuclease IV
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARIVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQD
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>SBO_1501 nth, endonuclease III
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH
NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>SBO_1188 ntpA, dATP pyrophosphohydrolase
MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAP
QAAMREVKEEVTIDVVAEQLTLIDSQRTVEFEIFSHLRHRYAPGVTRNTE
SWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA
>SBO_1726 ogt, O-6-alkylguanine-DNA/cysteine-protein methyltransferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG
YERISATNPGGLSDKLSDYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVQRKEWLLRHEGYLLL
>SBO_2881 parC, DNA topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LVSERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS
EE
>SBO_2888 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL
QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>SBO_0567 phrB, deoxyribodipyrimidine photolyase
MTTHLVWFRQDLRLHDNLALAAACSNSSARVLALYIATPRQWATHNMSPR
QAELINAQLNGLQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHL
FYNYQYEVNERARDVEVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIEPSPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQALDGGAGSVWLSELIWREFYRHLMTYYPSLCKHCPFIA
WTDRVQWQSNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDREGEFIRRWLPELRDVPGKAVHEPWKWAQKAGVMLDYPQP
IVDHKEARLRTLAAYEEARKGA
>SBO_0773 pin, inversion of adjacent DNA; at locus of e14 element
MLIGYVRVSTNDQNTDLQRNALNCAGCERIFEDKISGTKPDRPGLKKLLR
TLSAGDTLVVWKLDRLGRSMRHLVTLIEELRQRGVNFRSLTDSIDTSTPM
GRFFFHVMGALAEMERELIVERTRAGLAAARAKGRVGGRRPKLTTEQWAQ
IGRLLEAGESRQRIALIFDVGVSTIYRKFPANESNESP
>SBO_3876 polA, DNA polymerase I
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI
MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK
AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT
LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT
AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLQQNKEVAYLSYQLA
TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK
GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP
VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN
SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNPLTFNQIALEEAGRYAAE
DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK
VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK
KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP
KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV
IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT
SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG
TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI
HQLMENCTRLDVPLLVEVGSGENWDQAH
>SBO_0047 polB, DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV
PRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG
VTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGSENGDASSLDFELEYVAS
RPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRLPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTF
VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYVRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>SBO_3952 priA, primosomal protein N'
MPVAHVALPVPLPRTFDYLLPEGMTVKAGCRVRVPFGKQQERIGVVVSVS
DVSELPLNELKAVVEVLDVEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL
PILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKI
WRDQVATLEFNDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLN
TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV
MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV
IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI
PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQANNQVILFLNRRGFAPALLCHDCGWIAECPRCDH
YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP
MFPGVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD
VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH
PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP
LFLQQLRNLILSSPLADDKLWILGPVPALAPKRGGRWRWQILLQHPSRVR
LQHIISGTLALINTIPDSRKVKWVLDVDPIEG
>SBO_4253 priB, primosomal replication protein N
MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM
PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI
DSGD
>SBO_0367 priC, primosomal replication protein N''
MKTALLLEKLEGQLATLRQRCAPVAQFATLSARFDRHLFQTRTTTLQACL
DEAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREASAWSLREWDS
APPKIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVE
AYEARLARCRHALEKIENRLARLTR
>SBO_3640 radC, DNA repair protein
MKVKNNSQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEM
LENFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREE
SPLLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHSRLFSGTLNHV
EVHPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFM
DLRVLDHIVIGRGEYVSFAERGWI
>SBO_2819 recA, DNA-dependent ATPase, DNA- and ATP-dependent coprotease
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL
TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>SBO_2710 recB, ATP-dependent dsDNA/ssDNA exonuclease V subunit
MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF
PRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERL
LEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ
QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY
LQGEAPVIKAPPPDDETLASRHAQIVARIATVKQQWRDAVGELDALIESS
GIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKFSQRFLEDRTKA
GGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRGELG
FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA
PGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK
MWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVR
ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL
QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ
IWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ
LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP
LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPKSVDLAEVERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA
AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN
WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS
PGTFLHSLFEDLDFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA
PLNEKGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC
PPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA
AMQAHRYDLQYQLYTLALHRYLRHRIADYDYDRHFGGVIYLFLRGVDKEH
PQQGIYTTRPNAGLIALMDEMFAGMTLEEA
>SBO_2712 recC, ATP-dependent dsDNA/ssDNA exonuclease V subunit
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEHEDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPEWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTHELGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRLWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGLSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDNIWYQRLWRQLTPE
TMEAIVEQSQRFLLPLFRFNQS
>SBO_2709 recD, ATP-dependent dsDNA/ssDNA exonuclease V subunit
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEASHPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPVSDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERARQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRHPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFSSRE
>SBO_3677 recF, RecF
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWACFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLVEQISTWRAEYSAGIA
ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>SBO_3725 recG, DNA helicase
MTGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL
YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFS
AAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP
VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT
LHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP
LSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDVPMMRLVQGDVGS
GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFEPLGIEVGW
LAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH
RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE
LPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWVCTLIEESELLE
AQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVAT
TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT
PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD
LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA
>SBO_3100 recJ, ssDNA exonuclease
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGETLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQIEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFELFQQRFG
ELVTEWLDPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>SBO_2751 recN, protein used in recombination and DNA repair
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALETARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKYD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLNKKARLQELARLLGGSEVTRNTLANAKEL
LAA
>SBO_2593 recO, RecO
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGATGTPEPALRRFELALLGHLGYGVNFTY
CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>SBO_3834 recQ, ATP-dependent DNA helicase
MNVAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMP
TGGGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNST
QTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEA
HCISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGL
NDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAK
VEDTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGM
GINKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWL
RRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCG
NCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIR
DYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAA
RPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIAD
ESNVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMCKLERFGKPFMAL
IRAHVDGDDEE
>SBO_0372 recR, recombination and repair
MQTSPLLTQLMEALRCLSGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>SBO_3788 rep, rep helicase
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVGQTLGRKEARGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPS
QAAASAIGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKYYAILYR
GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>SBO_3632 rfaP, RfaP
MVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSYFLK
WHRGTTLKEIIKNLLSLRMPVLGADREWSAIHRLRDVGVDTMYGVAFGEK
GINPLSRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVATMV
RDMHAAGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRVPRR
WRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFAAPLKDILKQEQGLLSQAE
AKATKIRERTIRKSL
>SBO_3790 rhlB, putative ATP-dependent RNA helicase
MSKTHLTEQKFSDFALHPKVVEVLEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>SBO_0685 rhlE, putative ATP-dependent RNA helicase
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLQNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSSDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG
QGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRKP
AAAQ
>SBO_0203 rnhA, RNase HI
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM
AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN
VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG
YQVEV
>SBO_0171 rnhB, RNase HII
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALCEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPAMVVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS
>SBO_1483 rnt, RNase T
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMIAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLPAAEEV
>SBO_1178 ruvA, Holliday junction helicase subunit B
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDASSETLIREALR
AAL
>SBO_1177 ruvB, Holliday junction helicase subunit A
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP
>SBO_1186 ruvC, Holliday junction nuclease
MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA
GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF
EYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH
CHVSQNAMQMSESRLNLARGRLR
>SBO_0835 sbcB, deoxyribophosphodiesterase
MTDTDKQPTFLFHDYETFGTHPALDRPAQFAAIRTDSEFNVIGEPEVFYC
KPADDYLPQPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCILG
YNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPEG
INWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTRQ
PRLFDYLFTHRNKHKLMALIDVPQMPPLVHVSGMFGAWRGNTSWVAPLAW
HPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKTDLGDNAAVPVKLV
HINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAIF
AEAEPFTPTDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVDK
RIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYADELQMLV
QQYADDKEKVALLKALWQYAEEIV
>SBO_0291 sbcC, ATP-dependent dsDNA exonuclease
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL
EKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQEQQSLNWLT
RLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT
WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLTLTADEVATALAQHAEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ
NGSLEQTQRNVALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVKAYQALEPGVNQSRLLALENEVKKLGEEGAALR
GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQP
WLDAQDEHKRQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL
AGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTTLQNRIQQLTPILET
LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ
QLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG
LTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTL
SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL
DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK
>SBO_0292 sbcD, ATP-dependent dsDNA exonuclease
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIISSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
QEPPVWLDIEITTDEYLHDIQRKIQALAESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA
>SBO_0549 seqA, negative modulator of initiation of replication
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR
VASPAIAEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS
LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN
TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI
>SBO_3279 smf, Predicted Rossmann-fold nucleotide-binding protein involved in DNA uptake
MVDTDIWLRLMSISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLATTDYPGALFVEG
ELYALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDG
VAHKAALQVNGVSIAVLGNGLNTIHPRRHARLATSLLEHGGALVSEFPLD
VSPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS
PDQQDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>SBO_2606 srmB, ATP-dependent RNA helicase
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIEELRPKTRTPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>SBO_4088 ssb, ssDNA-binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>SBO_3551 tag, 3-methyl-adenine DNA glycosylase I, constitutive
MERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVL
KKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR
AYLQMEQNGEPFADFVWSFVNHQPQVTQATTLSEIPTSTPASDALSKALK
KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYPGNKP
>SBO_3853 tatD, Mg-dependent DNase
MEYRMFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQ
KLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAVQPEVVAIGECGLDF
NRNFSTPEEQERAFVAQLRIAAELNMPVFMHCRDAHERFMTLLEPWLDKL
PGAVLHCFTGTREEMQACVARGIYIGITGWVCDERRGLELRELLPLIPAE
KLLIETDAPYLLPRDFTPKPSSRRNEPAHLHHILQRIAHWRGEDAAWLAA
TTDANVKTLFGIAF
>SBO_1792 topA, DNA topoisomerase type I, omega protein
MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADS
TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA
EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA
FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA
VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN
KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG
VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE
SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL
PAVNKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI
ISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSL
DQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP
TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED
AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR
IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP
PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG
WSAFYVDGKWVEGKK
>SBO_1324 topB, DNA topoisomerase III
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ
PDAYDSRYARWNLADLPIVPEKWQLQPRPSVTRQLNVIKRFLHEASEIVH
AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLR
SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT
PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ
DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE
AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA
ISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLIARQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL
GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM
TGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHS
TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY
QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEVGSG
AIA
>SBO_2612 ung, uracil-DNA-glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQRGETPIDWMPVLPAESE
>SBO_4087 uvrA, excision nuclease subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>SBO_0666 uvrB, excision nuclease subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>SBO_1093 uvrC, excinuclease ABC, subunit C
MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH
TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK
HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG
RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE
FEEAACIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR
QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN
LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ
QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR
AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL
AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP
DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML
LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>SBO_3824 uvrD, DNA-dependent ATPase I and helicase II
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE
NCLPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA
HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK
DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW
LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD
QSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN
NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC
AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL
AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE
KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW
QDAVQLMTMHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY
VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV
SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV
AFQGQGIKWLVAAYARLETV
>SBO_1048 vsr, DNA mismatch endonuclease
MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGSP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
ISRLQELGWRVLIVWECALRGREKLTDEALTERLEEWICGEGASAQIDTQ
GIHLLA
>SBO_0878 wcaH, GDP-mannose mannosyl hydrolase
MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG
GRVQKDEMLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFT
THYVVLGFRFRVAEEDLLLPDEQHDDYRWLTPDALLASNDVHANSRAYFL
AEKRAGVPGL
>SBO_3822 xerC, site-specific recombinase
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ
CDVTMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>SBO_3098 xerD, site-specific recombinase
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>SBO_2533 xseA, exonuclease VII, large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE
SMQPAGEGLLQQKYEQLKAKLQAECLFDQQYKKPLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPTSVQGDDAPGQIVRAIELANQRNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRTGQQQQRLTQRLNQ
QNPQPKIHRAQTRIQQLEYRLAETLRVQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGNVLKKVKQVKAGEMLTTRLEDGWIESEVKNIQPVKK
SRKKVH
>SBO_0316 xseB, exonuclease VII, small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDASLTPFTPDNE
>SBO_1341 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG
DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>SBO_0289 yaiD, conserved hypothetical protein
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG
SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK
KTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDT
LALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSL
LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL
KRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGE
AQR
>SBO_0336 ybaV, conserved hypothetical protein
MKHGIKALLITLSLACAGMSHSALAAASVAKPTAVETKAEAPAAQSKAAV
PAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV
EDLKQVPGMGNSLVERNLAVLTL
>SBO_0349 ybaZ, conserved hypothetical protein
MLVSCAMRLHSGVFPDYAEKLPQEEKMEKEDSFPQRVWQIVAAIPEGYVT
TYGDVAKLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTISLTGPDL
QRQRQALLAEGVMVSGSGQIDLQRYRWNY
>SBO_0809 ybjD, conserved hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAIEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>SBO_0825 ycaJ, putative polynucleotide enzyme
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI
AELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPREIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>SBO_1963 ycfH, conserved hypothetical protein
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM
RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVEPLCETGLDYY
YTPETKVLQRESFIHHIQIARELNKPVIVHTRDARADTLAILREEKVTDC
GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRL
LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF
ARLFHIDASRLQSIR
>SBO_1271 yeaB, conserved hypothetical protein
MEYRSLTLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVIGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>SBO_2140 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAE
KRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIEDFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLALGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>SBO_2483 yffH, conserved hypothetical protein
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQTSHLMD
>SBO_2640 yfiL, conserved hypothetical protein
MMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVGMEDAISGSAIKDD
DAFGDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSGKSFPASCNNVESA
SQLHEVWQKGADENASTIRLN
>SBO_2722 ygdP, putative invasion protein
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEIHMPTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNTSAYRRKRG
>SBO_2927 ygjF, conserved hypothetical protein
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ
LKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ
PQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLE
KLVEAYRELDQALVVRGR
>SBO_3227 yhbQ, conserved hypothetical protein
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD
>SBO_3256 yhdJ, putative methyltransferase
MTMRTGCEPTRFGNEAKTIIHGDALAELKKIPAESVDLIFADPPYNIGKN
FDGLIEAWKEDLFIDWLFEVIAECHRVLKKQGSMYIMNSTENMPFIDLQC
RKLFTIKSRIVWSYDSSGVQAKKHYGSMYDPILMMVKDAKNYTFNGDAIL
VEAKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ
KPEALLKRIILASSNPGDIILDPFAGSFTTGAVAVSSGRKFIGIEINSEY
IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLIAK
>SBO_3462 yhhF, conserved hypothetical protein
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>SBO_4017 yjaD, conserved hypothetical protein
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLVQQQRRHDMGSVRQVIDLDVVLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL
MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>SBO_4439 yjjV, Mg-dependent DNase
MICRFIDTHCHFDFLPFSGDEEASLQRAAQAGVGKIIVPATEAENFARVQ
ALAEKYQPLYAALGLHPGMLEKHSDVSLEQLQQALERRPAKVVAVGEVGL
DLFGDDPQLERQQWLLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRHDL
SRTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLA
SLLLETDAPDMPLNGFQGQPNRPEQAVRVFDVLCELRPEPEDEIAEVLLN
NTYALFSVSG
>SBO_3041 yqgF, conserved hypothetical protein
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY
>SBO_2892 yqiE, conserved hypothetical protein
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHSLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>SBO_3234 yraN, conserved hypothetical protein
MATVPTRSGSPRQLTTKQTGDAWEVQARRWLEGKGLRFVAANVNERGGEI
DLIMREGWTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS
>SBO_3277 yrdD, putative DNA topoisomerase
MRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSSADGHIVKV
LEGQVCPACGANLVLRQGRFGMFIGCSNYPECEHTELIDKPDETAITCPQ
CRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPECHYPLLIEK
KTAQGVKHFCASKQCGKPVSAE
>SBO_3384 yrfE, conserved hypothetical protein
MSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV