GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: chromosome

Number of genes found: 669

# Shigella flexneri 2a str. 2457T, 2457T

>S1473 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S0543 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3169 putative superfamily I DNA helicase
MDENALGFASYWRNSLADAESGKGSFKRKDAQNFTHWHGIAAGRLDEAIV
SKFFEGEKDDVETVDVILRPKVYFRLLQHGKDRSAGAPDIVTPIVTPALL
SREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFD
DSVDKTAETDEEREARYAALQQEWRQYLYDSERLLKSVAGDWIEKPEQYE
LAEHGYIVKTAQSGGASSHILSLYDHLLVCNKDVPLFNRFASREVHAAES
LLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTG
KTTLVLSIIATQWARAALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQG
SGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA
LLFYLEKAKAAFPGKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQ
IRAARELIANDIEQYLDNLNKLLSGQEQKVTLLKSAKTEWKKYRAGESLI
YSLFSWLPAVRNKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLN
SAEREQTTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQA
DELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQDEKKKKGAKGVTARW
QRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAG
QVLPEVAAASFALAKKALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGS
TQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR
RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGELASSGS
RYNLLEAETIAVWLAENQQNIEAHYGKSLHEVVGIVTPFSAQVSTIKQVL
GKQDISTGTNEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNS
MLNVAVSRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDY
KERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITIISPWLTWQRLE
QTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNFKAALEKL
NALGIATKLVNRVHSKIVIGDDGLLCVGSFNWFSATREARYERYDTSMVY
CGDNLKGEIEAIYNSLERRQV
>S1944 ISSfl4 orf
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRILGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR
DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS
>S3967 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0985 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGVSPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S2133 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S4498 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1535 putative enzyme
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGSAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDYR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>S3937 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1469 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S1328 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARPVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3314 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S1135 IS600 orfB
MSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR
QRRHSRLGNISPAAFREKYHQIAA
>S0282 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3523 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0455 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3519 IS1 orfB
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2033 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1660 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S3268 hypothetical protein
MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDAPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR
LETDVYVPLA
>S3863 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S1462 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1236 putative phosphohydrolase
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>S4015 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1685 IS600
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0234 IS911 orfB
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>S2182 IS629 orfB
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSGV
KRSVRPSAGKPLPQATA
>S1658 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4611 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1715 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDIMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4639 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1027 IS2 orfB
MARGWGVSLVSRCLLVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALFRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGL
>S4050 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4022 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2738 ISSfl4 orf
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDVRDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>S2009 IS629 orfB
MWRSSLMCLPDTSWGAMETTFVLDALEQALWARRPSGTVHHSDKGSQYVS
LAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAE
VELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>S1862 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2827 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMGDLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S2011 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLITAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>S0488 IS2 orfA
MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE
>S3801 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0901 ISSfl3 orfC,D
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>S4218 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2832 IS4 orf
MFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAG
NPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLT
LMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGDHLVKLK
TSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADL
YSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQ
MIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLASM
GQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S2242 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4178 IS150 putative transposase
MKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRIGEI
YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY
RGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNN
EVISYSLSERPVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQN
ILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEFSNISELKDAV
TEYIEYYNSRRISLKLKGLTPIEYRNQTYMPRV
>S2110 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4551 IS600 orfB
MCQVFGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>S0716 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2879 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S3775 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
DTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1282 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0311 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4667 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3072 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRGTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0948 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLATAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S3549 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFLSVK
>S0736 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1132 IS629 orfB
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPEQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>S0487 IS2 orfB
MDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN
AKRVYRLMRQNALLLERKPAVSPSKRAHTGRVAVKESNQRWCSDGFEFCC
DNGERLRVTFALDCCDREALHWARTTGGFNSETVQDVMLGAVERRFGNDL
PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK
TIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQR
ACNGLSDNRCLEI
>S2721 putative DNA helicase
MALDLMAAFTELPPPIDYVLPNMVAGTVGALVSPGGAGKSMLALQLAAQI
AGGPDLLEIGEFPTGQVVYLPAEDPPAAIHHRLHALGAHLSAAERQAVAD
GLLIEPLIGKCPNIMAASWFDALKRAAEGRRLMILDTLRRFHIEEENASG
PMAQVVGHMEAIAADTGCSIVFLHHASKSAAMMGSGDQQQASRGSSVLVD
NIRWQSYLSGMTQGEAEILGVDDCQRGYFVRFGVSKANYGAPFQELWFRR
HDGGVLKPAVLERQCKVKRRQREEA
>S4610 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0327 IS629 orfB
MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYV
STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT
VHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAE
VIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIG
NDDLAA
>S0874 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3874 IS1 orfB
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARPVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2862 IS3 orfB
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYCQWPEPEVARRHHVLTYR
>S1959 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S0947 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTEPPRESWRLNFLRKR
>S3963 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1980 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S3875 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1059 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0249 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S0527 ISEhe3 orfB
MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF
NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV
LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN
CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS
SEQMSPTEYENQYYQRLGSV
>S1910 IS629 orfB
MAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLQERLGHIPP
AEAEKAYYASIGNDDLAA
>S4666 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0485 IS629 orfB
MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYV
STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT
VHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAE
VIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIG
NDDLAA
>S1422 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1522 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0312 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0952 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1026 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1602 putative excinuclease subunit
MVRRLTSPRLEFEAAAIYEYPEHLHSFLNDLPTRPGVYLFHGESDTMPLY
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQSIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE
EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>S2483 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0732 hypothetical bacteriophage protein
MKHEEMNQRFNHLENEITELNKKLSALVSSEDENKRRDEHYAAIYDYCHK
VAHETFMKFLQEKFLPAALSEKEAAYLRPEYVITVNSAGEEEHKSDFIAS
APDKDQEPHRPFRVSCEEGEFVVYENGKPVRASHHHCLKIINLAIRCLKD
ENTRVMKRIGRCMGYLQVAAEIEALASGADMDAAVREALLRDFNTPPLRK
SLMTGSSRG
>S4231 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1953 IS2 orfA
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S3936 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2103 ISEhe3 orfB
MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF
NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV
LNALLMAVWWRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN
CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS
SEQMSPTEYENQYYQRLGSV
>S4839 putative P4-type integrase
MALSDVKVRSAKPEAKAYKLTDGEGMVLLVHPNGSKYWRLRYRFGGKEKM
LALGKYPEVSLADARARRDEARKLLANGVDPSENKKAVKVEQEQEAITFE
VVARDWHASNQKWSASHSARVLKSLEDNLFTAIGKRNIAELKTRDLLVPI
KAVESSGRLEVAARLQQRTTAIMRFAVQSGLIDYNPAQEIAGAVATAKRQ
HRAALELNRIPELLHRIDHYSGRPLTRLAVELTLLVFIRSSELRFARWSE
IDFETAMWTIPGEREQLEGVKHSQRGSKMRTPHLVPLSRQALSILEKIKS
MSGNRELIFVGDHDPRKPMSENTVNKALRVMGYDTKVEVCGHGFRTMACS
SLIESGLWSRDAVERQMSHQERSSVRAAYIHKAEHLGERRLMFEVVNKNW
PPS
>S3634 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S0263 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S3056 IS3 orfB
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWPGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLPCDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>S2650 IS1N orfB
MAFICELDEQWSYVGSKARQHWLGYAYNTKTGGVLAYTFGPRTDQTCREL
LALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQRIERNNLTLRTRI
KRLGRKTICFSRSVEIHEKVIGAFIEKHMFY
>S1418 ISSfl4 orf
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIAALH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTASP
VRSPGLTTPAK
>S1960 IS629 orfB
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>S3785 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1616 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2322 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2793 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S4131 IS1 orfB
MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1717 ISEhe3 orfA
MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS
STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD
>S4567 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4477 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S2880 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0685 putative phage transposase
MLQQGKFKTSQGCFEIARPTLEAHDYDREALWSKWDKASDSQRSLAEKWL
PSIQATDEMLNQGISTKTAFATVAGHYQVSASTLRDKYYQVQKFAKPDWA
AALVDGRGASRRNVHKSEFDEDAWQFLIADYLRPEKPAFRKCYERLELAA
REHGWSIPSRATAFRRIQQLDEAMVVACREGEHALMHLIPAQQRTVEHLD
AMQWINGDGYLHNVFVRWFNGDVIRPKTWFWQDVKTRKILGWRCDVSENI
DSIRLSFMDVVTRYSIPEDFHITIDNTRGAANKWLTGGAPNRYRFKVKED
DPKGLFLLMGAKMHWTSVVAGKGWGQAKPVERAFGVGGLEEYVDKHPALA
GAYTGPNPQAKPDNYGDHAVDAELFLKTFAEGVAMFNARTGRETEMCGGK
LSFDDVFEREYARTIVRKPTEEQKRMLLLPAEAVNVSRKGEFTLKVGGSL
KGAKNVYYNMALMNAGVKKVVVRFDPQQLHSTVY
>S4811 IS1 orfB
MAHVFGERTLATLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKR
YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>S1975 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1253 IS3 orfA
MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR
SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK
>S4472 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0486 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGDDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S1426 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S2728 hypothetical protein
MNVEGMATGGIHMELHCPKCQHVLDQDNGHARCPSCGEFIEMKALCPDCH
QPLQVLKACGAVDYFCQHGHGLISKKRVEFVLA
>S4144 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2861 IS3 orfA
MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR
SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK
>S2438 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S2106 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWKDGRRSRHSDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRIYRLMRQNALLLERKL
AVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDRKA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4331 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1627 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0941 putative integrase encoded by prophage CP-933K; partial
MSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDA
FREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLR
LAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTTLHVDA
LGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSF
EGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREW
DKIEIK
>S0484 IntA
MAISDTKLRTIYGKPYSGPQEVADADGLSVRISPKGVIQFQYRYRWHGKP
NRLGLGRYPSLSLKDARQITADLRKLYFSGTDPRTYFEEKVENSMTVAQC
LDYWFDN
>S3704 IS1 orfB
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2146 putative integrase for prophage CP-933U
MTFGRLWEKFLASAYYSDLSPRTQKDYLQHQKKLLAVFGKVPADSIKPEH
IRRYMDKRGEQSKTQANHEKSSMSRVYSWGYERGYVKANPCAGVSKFKAK
NRERYVTDKEYQAVLSVAPLPVFIAMEIAYLCAARVSDVLSLKWEQIGND
GIFIQQGKTGKKQIKAWSPRLQAAIEKAKQLPTSAYVISNQYGNRYMYKG
FNEMWVEARNRAGKISGILTDFTFHDLKAKGISDYEGSSRDKQLFSGHKT
EGQVLIYDRKVKVSPTLDVPLPENIPRKYSK
>S2318 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1609 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3610 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1239 IS1 orfB
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARPVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2299 IS1 orfB
MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1002 IS1 orfB, A
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLYTVLRHLKNSAESVTSRIQPGSD
VIVCAEMDEHWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL
LSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLVRKSLSFSKSVELHDKAIGHYLNIKHYQ
>S2335 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4440 ISSfl3 orfD
MEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLRPLYIALNDYVLEA
GKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGSSLPAAVWFAYSAD
RKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEAGCLAHARRKIHDE
DVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAVRKARSVQLMQSLY
DWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDGWVEIDNNIGENAL
RSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEVEPEDWLREVIEKL
NDWPSNQVHKLLPWNFSSVK
>S1131 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S0697 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S0708 IS911 orfB
MKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGN
CWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEY
NGGLPPNESENRYWKNSNSVASFC
>S3295 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKTQAAVGNLAHTTGQ
>S2190 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S0949 IS629 orfB
MPLLDKLREQYRVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMEVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA
>S2860 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGYRASARIMGVGLNTVLRHLKNSGRSGNLAHTTRQ
>S4130 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3547 ISSfl3 orfA
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>S1973 IS1 orfB
MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSK
SVELHDKVIGHYLNIKHYQ
>S4579 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
DTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4060 IS2 orfA
MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1254 IS3 orfB
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFEN
>S2317 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0833 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S4098 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKPLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2781 DNA-invertase
MLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGTKSERPGLKKLLK
TLSAGDTLVVWKLDRLGRSMRHLVVLVEALRERGINFRSLTDSIDTSTPM
GRFFFHVMGALAEMERELIVERTKAGLEAARAQGRIGGRRPKLSPE
>S4168 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3524 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1912 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>S1978 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S0722 putative replication protein DnaC
MMTFNLREQQKRLQARMDELRAEIAFAQKGEKPWPYRSCLMREGRGYCEK
HGEYHTHILVWSDRNGEDREEISCCPDCLIAEANDLTMELSSIKAEELTD
NAGIALRFRDCEFDNYLEVNPGAARNLAACRRYAENWPDMLENGTSLVMT
GSCGTGKNHLAVAMAKHIIRNYLASVEITDVMRLTRAVKNCWRNDSEKTA
DEVIERYASMDLLIIDEVGVQFGSAAEMAILQEIINARYESILPTILISN
LSPEELWAFISPRIADRITDGGRNWLSFNWPSYRSRIRGVAA
>S4061 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4167 IS1 orfB
MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0383 ISSfl3 orfA
MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK
QALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQECMPVPVAL
TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLFDPLTP
ELLRALIREMKGGIR
>S4439 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIGTTSS
>S4030 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S2698 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1069 IS1 orfB
MSRQCTHYGRWPLHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0498 IS150 orfB
MKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRIGEI
YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY
RGEVGQTAPNVLQRDFKLRGQTRSGLPMLLNLQSMGASCICLQ
>S1579 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1093 IS3 orfB
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>S0997 IS2 orfB
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0328 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQV
>S4112 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4832 IS3 orfB
MKYVFIENHRAEFSIKAMCRVLRVARSGWYVWLRRRHQMSLRQQFRLTCD
AAVHKAFFEAKQRYGAPRLADELPEFNIKTIAASLRRQGLRAKASRKFSP
VSYRAHGLPVLENLLEQDFSASGPKPEVGG
>S1133 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S1005 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0262 ISEhe3 orfB
MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF
NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV
LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN
CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS
SEQMSPTEYENQYYQRLGSV
>S4471 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESCLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1962 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0078 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0905 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S0276 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1961 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4521 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S1325 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIGQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLSKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>S3388 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2951 putative phage transposase
MQSYVTVNDLLGVPGMPATTKGIRQALQRFSRDLGDVSRRREGTKAIEYH
IDCLPEITRKALRERYVEQLVATENNVSEVKAVTRKTRNPDAVQAIEAYR
GSPQLMEERLNALTENQRWVSEARAALVVEVLKLESAGNPGRLKAINFLV
EKARKGELPERLQQAAVNANAKRGANRTISRDPLYQWVLKYNQSQNAAER
LLLLAPGKRDEIKPEEISWLPEFLAQYRQVNGRPMSEAYEDFVAEWQRRH
ADEPYMLEVMPSYDVVRYAMKKLPEVVKQKGRVTGSEYRQLEGFTRRDWT
AMPVNYVWIGDGHGMKLKCAHPIHGRPFSPEVTFVIDGGTRFVVGWSLDL
AENVFAVAGAIQHGIRNHGKPFLYYSDNGSGETADMLDKEVVGILPRLGI
KHPTGIAGNPQGRGIIERLNRTLPMRIARRYRTYFGKGADRESLRVLNRD
LRSAFNALQQDKPLNDRQKAAMRELPSWAELIEAIREGVEWYNNRPHSEL
PMKPNGRHYSPTEFRKKRQAEEDTEIEWLSDLELRDMFRPMVERPVRRCE
IQWLNNIYYAPELRDEHGRKVLISYDIHDAERITVRRKDGSFICEAIWNG
NKRAAFAVSAEYHKQQQRIKGMRKRAEEKIRDAEDEGIQILEHKQAEPWL
SNVYRPVGNVVAVQQPEYEEEHDEEFERDFRLGMQKLFAMQEEDDPLA
>S2723 putative integrase
MLTDTKLRNLKPRDKLYKVNDREGLYVAVTPAGSISFRYNYSINGRQETI
TFGRYGVGGITLAEARELLGDAKKMVAAGKSPAKEKARDKARVKDAETFG
AWAEKWLRGYQMADSTRDMRRSVYERELKPKFSNQKLVEITHEDLRALAD
AIVERGAPATAVHVREIVLQVFRWAIERGQKVENPAELVRPTSIARFEPR
DRALTPEEIGLMYQYMERVGTSPTNRAAAKLLLLTMVRKSELTNATWSEI
NFSEALWTIPKERMKRRNPHLVFLSQQALDIFIAMKTFAGGSDFVLPSRY
DSDAPMSAATLNQVLTLTYKAAQKDGKSLTKFGPHDLRRTASTLLHEAGY
NTDWIEKCLAHEQKGVRAVYNKAEYREQRAAMLQDWADMIDEWTSGGSKG
>S0725 putative bacteriophage protein
MSVKIQTIPELLIQTRGNMTEVSRMLNCNRATVRKYAEDKEGKGHAIVDG
VLIVHRGWDRGKDSDA
>S4021 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S4049 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0079 IS1 orfB
MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2123 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2552 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0958 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1554 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S2187 ISSfl3 orfD
MEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEVE
PEDWLREVIEKLNDWPSNQVHKLLPWNFSSVK
>S3760 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1129 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0957 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1544 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S0283 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3586 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0715 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVAGIKDVYTCEIVGYA
MGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSG
LKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEI
FYNRQRRHSRLGNISPAAFREKYHQMAA
>S0686 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S4059 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3394 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKLLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1339 IS600 orfB
MAHIRTCETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVAGIKDVYTCEIVGYA
MGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFG
LKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEI
FYNRQRRHSRLGNISPATFREKYHQMAA
>S3395 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2697 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S4062 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S1500 IS1 orfB
MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4629 ISEhe3 orfA
MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS
STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD
>S0945 ISSfl3 orfA
MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK
QALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQECMPVPVAL
TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLSDPLTP
ELLRALIREMKGGIR
>S3180 reverse transcriptase-like protein
MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC
RYADDFVLIVKGTKAQAEAIREECRGVLEGSLKLRLNMDKTKITHVNDGF
IFLGHRIIRKRSRYGEMRVVSTIPQEKARNFAASLTALLSGNYSESKVDM
AEQLNRKLKGWAMFYQFVDFKAKVFSYIDRVVFWKLAHWLARKYRTGIAS
LMRWWCKSPKPGQSKTWVLFGKTNHGKLSGEILYRLVGQGKKLFRWRLPE
GNPYLRTETRNTYTSRFTEVAMAFASI
>S2300 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1813 putative virulence protein
MRRFAGACRFVFNRALALQNENHEAGNKYIPYGKMASWLVEWKNATETQW
LKDSPSQPLQQSLKDLERAYKNFFQKRAAFPRFKKRGQNAAFRYPQDVKL
DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGTWYISIQTESEVS
TPVHPSASMIGLDAGVAKLATLSDGTVFEPVNSFQKNQKTLARLQRQLSR
KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK
VSNMSKSAAGTVSQPGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWSGG
QVLAVPPAYTSQRCACCGHTAKENRLSQSKFRCQVCGYTANADVNGARNI
LAAGHAVLACGEMVQSGRSLKQEPTEMIQATA
>S0221 IS2 orfB
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>S4253 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4252 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1987 putative crossover junction endodeoxyribonuclease
MRHEFILPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDETNIVR
GLPVPGGRLGIKITELECA
>S4143 IS1 orfB
MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0909 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0875 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4638 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0322 integrase fragment
MRTDSNKAWKGALKRAGISNFRFHDLRHTWASWLVQSGVSLLALKEMGGW
ETLEMVQRYAHLSAGHLTEHASKIDAIISRNGTNTAQEENVVYLNAR
>S2184 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKTHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>S0020 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0959 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S2130 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1588 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIGQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S2942 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2565 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S0930 IS600 orfB
MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD
RGSQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHY
RFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S3389 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1423 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2986 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2943 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1397 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1483 IS103 orf
MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG
EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW
LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ
VRFLETRLMYLKKLKALAHPTKK
>S1251 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFVLDCCDREA
LHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4332 IS2 orfA
MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1890 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1058 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2510 putative regulator
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSGVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>S3132 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0323 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S0946 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S1543 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S3480 IS1 orfA
MAFISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1127 IS911 orfB
MATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQ
FAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRL
TMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRG
NCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHE
YNGGLPPNESENRYWKNSNSVASFC
>S3209 IS3 orfA
MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR
SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK
>S4605 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4631 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S0506 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1736 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S1281 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4097 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3004 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
EEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKRPVRLLN
>S2950 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA
>S1863 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1329 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0723 putative helicase
MTTPVWRNDDLEGAVIGAFFLRGADPEVMDILATLPADVFSVRAYQDIYT
GICRQARVSGVIDPVLLCNEMPELAPVITDTGRKTWVKSSLEHYVAALRR
NAALRDAEKTLNEALQKLRDAHTCEAAEDALKDAQNMMVTLSTGKGVIQP
VHIDDVLPEVVERVECRNQGLEKSRTLMTGIDELDAKTGGMEPGDLVFIA
ARPSMGKTELALDIIDKVTEQGHGVLLFTMEMANIQIGERMVSAAGGMPV
SRLKSVAHFEDEDWTRFSQGVGRMTGRNIWMVDQANLAIDEICATTKHHL
IKYPETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIAL
SQLSRGVESRPNKRPMNSDLKNSGEIEADADIILMLYRDEVYNPDTQARG
IAEINITKQRNGSLGTIYRRFYNGHFLPVDQESARVLSTPMKPGNPRRYS
NKRTDSSKMERFF
>S1136 hypothetical bacteriophage protein
MRRAISYIRFSSERQLKGDSVRRQSKLVTDWLDKNPEFYLDSSLSFKDLG
KSAFSGKHLKGGLGDFLTAIEKGLVKAGDTLLIESLDRLSRQDIDIASEL
LRRILRAGVDVVTLSDGEHYTRESLKDPLALIKSILIMQRAHEESLRKSE
RVQAAWNRKKELISEGIKVSRRCPAWLRLNDDRRTFTIIPDKVEVVKRAF
DLRLQGLSFWAITRTLNDEGHLSLNQYTPKQKGWSDTAVKKLLRNRAVIG
CFTPAGREEVQGYYPAIISESLFYRVQQLNTGQYGRASVSSNPLSVNLFR
GIIKCEVYWQ
>S4835 putative integrase
MNSSKAGGCHGINDIKVKTAKPKDKPYKLADGGGMYLLINTNGSKYWRMK
YRFAGKEKMLSIGVYPDVTLADAREKRSEARKLLAAGGDPGEAKKEEKIA
QQMSLKNTFEAIAREWHQLKADRWSLRYRDEIIDTFEKDIFPYIGKRPIA
EIKPMKLLEALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPD
LASALATPKKVHFPFLTANELPHFLNDLAGYTGSIITKTATQIIMLTGVR
TQELRFARWEDIDFETKLWEIPAEVMKMKRPHIVPLSEQVIMLFKQLEPI
SKHHPLVFIGRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILH
EQGFNSAWIEMQLAHVDKNSIRGTYNHALYLDGRREMMQWYADYIDSLSS
RES
>S3064 putative integrase
MLKSRTYLYQRNGVFYIRLRMKTTSRLTASLPSHNRYKLASVSLRTKDRR
TAMAHSRHIKSALKAIHADNPNASYEELREHLKTIVEWELSVSRDDLNDP
ESYQLYVDQYDDIKSNLREAVATERLTVDQHRYINDVIGVLKACQDRLNG
DSSGLLSYLEPETGSLRPSVSLSVLAEPEVPEPKALTLASLIEQYEQENA
QNWKPATLSENRASHSTLIEIFDYLDIQDVGKATRADMLRVREVLQQLPK
NRKQRFKSMPLSDLLNRESKTDCLDVVTINNKYLIKMAAVFKWAVRNDLI
AKNLTEGLELKVPQRKASDARDAFSPEQVGQLLVAAKAYSQKTSGKPYHY
YVTALAAITGARLNEVAQLQVKDVRTTEAGTVFIHINEDDSSLPGKSIKN
AHSDRCVPLVDGAYGFVLADFMSLVEDRRKTEGDNAMVFNGLKLMKNGYG
EQVSKWFNRTLLPKVLADRSGLAFHSFRHTVATQLKQHGVELAYAQAIMG
HSSGSITYDRYAKEVEVETLKEKLAESLSVKKIDGK
>S1617 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1453 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1798 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1877 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3184 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S4216 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S2983 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3609 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2152 putative integrase
MRHAVHQELIDTNPAANLGGVTTPPVRRHYPALPLERLPELLERIGAYHQ
GRELTRHAVLLMLHVFIRSSELRFARWSEIDFTNRVWTIPATREPIIGVR
YSGRGAKMRMPHIVPLSEQSIAILKQIKDITGNNELIFPGDHNPYKPMCE
NTVNKALRVMGYDTKKDICGHGFRAMACSALMESGLWAKDAVERQMSHQE
RNTVRMAYIHKAEHLEARKAMMQWWSDYLEACRESYAPPYTIGKNKFIP
>S0943 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGFRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0942 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA
>S1094 IS3 orfA
MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR
SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK
>S1686 IS600
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1911 IS629 orfB
MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYV
STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT
VHHSDKGSQYVSLALHTAA
>S0719 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S3154 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2072 putative virulence protein
MRYRNSRQVTGVVKNVTVSQSCGKWYISIQAESEVSTPVHPSASMVGLDA
GVAKLASLSDGTVFEPVNSFQKNQKTLARLQRQLSRKVKFSNNWQKQKRK
IQRLHSCIANIRRDYLHKVTTTVSKNHVMIVIEDLKVSNMSKSAAGTVSQ
PGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRC
ACCGHTAKENRLSQSKFRCQVCGYTANADVNGARNILAAGHAVLACGEMV
QSGRSLKQEPTEMIQATA
>S3520 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2172 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1461 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0606 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0235 IS911 orfA
MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKRLR
ALDVRLPEQFSIIGKLRAHYPVVTLCHVFGVHRSSYRYWKNRPEKPDGRR
AVLRSQVLELHGISHGSAGARSIATMATRRGYQMGRWLAGRLMKELGLVS
CQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYIWTGKRWAY
LAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGGVMFHSDQG
SHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKNEWMPVVGY
VSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC
>S1797 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S3473 IS600 orf
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFRENIIRWLLKKRTNGSVRYCQY
TSKVAMIYIEQLELIHKSGDVLYPVKITRKSSGKTAFHLVPFGLNKTHDL
LEVEDASEAIRLVIDERHSIRCSTLTATITNKKGKRIKRTGIYSIKGVNI
KEYNVR
>S0910 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA
>S4113 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0923 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1482 IS103 orf
MKVLNVLRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRIGEI
YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY
RGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNN
EVISYSLSERPVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQN
ILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEFSNISELKDAV
TEYIEYYNSRRISLKLKGLTPNEYRNQTYMPRV
>S2279 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2020 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1992 hypothetical bacteriophage protein
MIIQSKLIRAALVCAAKNDVRYYLNGLHITPKHIEATNGSVALRMAHGIR
TKKNIIVQFEGGVPAKAETTELIFSKEPIAVHRDQFQRRLSITGIKLVDG
CFPDLDRIIPKKFDRCTHPVLQAGYLSYPEKMFGRERKFIPVQLRPSGDG
QAVRIQFDSIINSMYGNPEFVVMPCRDHGDFNVAQEHPE
>S3966 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S4480 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0456 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP
GTHQKIIDMAMNGVGCRASARIMDVGLNTVLRHLKNSGRSR
>S1659 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0928 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S0021 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
DTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4441 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4235 IS1 orfB
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1856 IS1 orfA
MASISIRCPSCSATEGVVRNSKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3633 IS2 orfA
MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S3399 IS2 orfB
MTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQF
ARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLA
EAFEHYNEWHPHSALGYRSPREYLRQRACNRLSDNRCLEI
>S3290 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1628 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0379 ISSfl3 orfD
MSSAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELS
RNTMVRWVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTK
TGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADA
YAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELYD
IEAEIRGSPAEERLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFD
YILNHWNALNEFCRDGRVEIDNNIGENALRSVAVGRKNYLFFGSDKGGES
AAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>S2109 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1398 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1130 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S2185 hypothetical protein
MYNDVLFPGEDINKPISIAAANRFVNRIRGGMDLGYWRTHDFRRTLVTRL
SEMNVEPHVTERMLGHELGGIMSVYNKHDWIEAQRKAYELHADKLFWHIR
SISD
>S1124 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S1952 IS2 orfB
MIVLIPVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0931 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0224 IS911 orfB
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSKAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>S3291 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0718 putative bacteriophage protein
MRPSELSDLLWAQVDRVAPHLLPNGKIEGHEWVAGNVNGDKGNSLKVNLI
GKKKWADFAEGDGGDMLDLWMACRGINLHQAMQEAKAFLGIKDDDHHFDA
RREKKFSRPDRKKIARYVTRTESHLEYLQSRGISPEVVKRYEVVSGKVWN
GERELDALVLPYKRDGELLQVKRISTERPDGKKVIMAEGDCEPCLFGWQA
LDAGVRVVVLCEGEIDCMSYAQYGISALSVPFGGGKGAKQQWIEFEYHNL
DRFEEIFISMDVDDVGREAAREIVSRLGEHRCRLVTLPYKDINECLMNGV
TEDEIWQYIGTASYFDPEELYSAREFYQDTINAFYGKQQYLFNPPWESLA
DKFQFREAELTLVNGVHGHGKACPLNEPILLADGTWTTHGNVKIGDQVAS
VDGNPSTVTGIFPQGVRDVYRVTFEDGRYVDCAGDHLWEVTSRGFTKGEK
RRVIDTFGLKRLSETKRHKNGVRIPEITGDFGDHSEPLAWVIGSLLGDGS
LSNGSVKFSNVEPYMIERMKAELPDYNFSGDGKDWLISTARGQVNPLMET
LRGYGLMGCTAKNKFIPRVFFSANKSTRIGMLCGLLETDGYVEKDGTLVF
SSASEELRNEVVNKNWPPS
>S1878 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4450 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0318 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S0472 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2794 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKHHQMAA
>S2243 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2336 IS1 B
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2338 IS629 orfB
MARCTVARLMAVMGLAGVLRGKKVRTTIGRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKSEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>S3705 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1578 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1454 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2180 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILCEPPRVSWRVFYL
>S1716 ISEhe3 orfB
MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF
NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV
LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN
CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS
SEQMSPTEYENQYYQRLGSV
>S3124 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1468 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA
>S1502 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S4147 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4479 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1998 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S2173 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S1920 hypothetical protein
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSLLVRPDRPISPQAMAI
HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM
KLARRLWPGLKYSNMALYKTRKLNVQTPLGLHHHRALYDCYITAALLIDI
MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERYPGYLRWLFN
NLDSMSPELRLTLKHYLENT
>S2189 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIGTTSS
>S2179 ISSfl3 orfA
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>S0136 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3289 IS2 orfB
MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW
ALLRRQAELDGMPAINAKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAV
KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV
QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT
AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP
HSALGYRSPRG
>S1110 IS600
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2982 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0382 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI
AWPHPKRTERPGIRI
>S4442 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1989 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S1503 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHYSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S4553 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1705 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREV
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S3188 IS2 orfA
MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4031 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAKMAKAFDYILNHWNALNEFCRDG
RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>S1963 putative integrase for prophage CP-933R
MESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDFEKGIVNVRRNLTILD
MFGPPKTNAGIRTVALLQPALEALKEQYKLTGHHRKSEITFYHREYGRTE
KQKLHFVFMPRVCNEKQKPYYSVSSLGTRWNAAVKRAGIRRRNPYHTRHT
FACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQIAMLNARL
S
>S1866 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1653 IS1
MAHVFGERTLATLERLLSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKR
YTQRIERHNLNLRQHLARLVRKSLSFSKSVELHDKVIGHYLNIKHYQ
>S4063 IS629 orfB
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLVYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>S0797 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIGQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLSKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S0705 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S4248 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNR
>S4550 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2034 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3125 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3884 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSSVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4632 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0471 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1240 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2337 IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S4568 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4314 IS4 orf
MTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQ
ELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGA
SPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSV
A
>S2122 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0996 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1774 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2107 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1972 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA
>S3210 IS3 orfB
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWPGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLPCDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>S1227 hypothetical bacteriophage protein
MKQWREKSRQLAERGDLTPADWSNLELYCVNYSIYRKAVADLAARGFSIV
NSQGGESRNPALSAKSDAERVMIKMASLLGFDPISRRKNPPETEEEDELD
RLE
>S0900 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S3776 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSHIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1837 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2181 IS629 orfA
MAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLMWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>S1134 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSSTAHTI
TGSYRSSLV
>S4148 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAEHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1775 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0526 ISEhe3 orfA
MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS
STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD
>S2440 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S2140 putative crossover junction endodeoxyribonuclease
MTERIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKL
NLSGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDEINI
VRGQLVPGGRLGIKITELGCA
>S4822 putative P4-type integrase
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKT
LALGVYPEVSLSEARTKRDEARKLISEGIDPCEQKRVKKVVPDLQLSFEH
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVR
AAEAKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH
RPALELSRLPELLSRIDGYKGQPVTRLAVMLNLLVFIRSSELRYARWSEI
DIDNSMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQVVAILAELQTW
AGENGLIFTGAHDPRKPISENTVNKALRVMGYDTTQEVCGHGFRAMACSA
LIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANREKGISPFEYAKINNPLK
>S2104 ISEhe3 orfA
MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS
STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD
>S3344 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4630 ISEhe3 orfB
MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF
NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV
LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEG
>S0922 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4247 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2191 ISSfl3 orfA
MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK
QALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQECMPVPVAL
TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLFDPLTP
ELLRALIREMKGGIR
>S0317 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S3481 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2949 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2787 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVIPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0721 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S3214 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0956 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2386 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMVMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4029 ISSfl3 orfA
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>S2131 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S3131 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1123 IS629 orfB
MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPEQLWVADFTYV
STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT
VHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAE
VIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIG
NDDLAA
>S3401 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0950 hypothetical bacteriophage protein
MIWQPEFTDKTLSRKPGAVQLVTCKQNEVEPEDWLREVIEKLNDWPSNQV
HELLPWNFSSVK
>S0607 IS1 orfA
MLAWLPFPSDVLPAPLLKAWCVTGKSTAGHQRYLCSHCRKTWQLQFTYTA
SQPGTHQKIIDMAMNDVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4497 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTDSQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3800 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0707 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFSVTEPNQVWSVCDLYL
DG
>S2155 IS911 orfA
MADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKL
QRIEMENEILKKATALLMSDSLNSSR
>S0953 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2018 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S1521 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKEKLHVISKRYTQRIERHNLNLRQHLARLGRMSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1431 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQNEPARKLLG
>S4036 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3181 reverse transcriptase-like protein
MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDG
VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA
LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG
ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI
DVGLFRAAVKVCHRAVLYRRYYRTSC
>S2985 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1899 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S0871 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1675 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA
>S2134 ISSfl3 orfA
MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK
QALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQECMPVPVAL
TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLFDPLTP
ELLRALIREMKGGIR
>S4037 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0380 ISSfl3 orfC
MKVLAPGNGKTKTGRLWVYVRDDRNAGSSLPAAVWXAYSXDRXXXXPQXH
LAXXXGRKPDRQ
>S4554 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0523 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S2651 IS1N orfA
MTSVNIHCPRCQSAQVYRHGQNPKGRDRLRCRDCHRVFQFTYTYQARKPG
MKELITEMAFNGAGVRDTARTLKIGSNTVIRTLKNSRQSE
>S4461 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0927 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKSTAYFAQESLKNTR
>S1857 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3585 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2256 ISSfl4 orf
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASSRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>S0872 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4064 IS3 orfA
MTKPVSISKKPRKQHTPEFRNEALKLAERIGVAAAARELSLYESQLYAWR
SKQQQQMSSSERESELAAENVRLKRQLAEQAEELSILQKAATYFAKRLK
>S4475 ISEhe3a orf
MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS
STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD
>S4476 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0220 IS2 orfA
MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1891 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3055 IS3 orfA
MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR
SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK
>S4604 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0735 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S1836 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0899 ISSfl3 orfA
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLSDPLTPELLRALIREMKGGIR
>S2321 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0698 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1981 putative integrase
MIQLSSSHKLPAVYYLYQRNGVYYFRLRVRQSNNDRMTSISLRTKDRRTA
MAYSRHIKAALKAIHADRPNATYEEMREHLKDIAECELSMGRSDLFEPDM
RDIYRDQYGELGESLTDALASEPLSIDQHRYINEALKVLKACMRRIEAGD
SQPLIDYVDLFNDIDRQDNQADSVSLSVNAPEVKPEVTPSITIASLFEQY
EAENYQNWKPATLRENKASHAALIEIFDHLGLNADANRADMLRVRDVLQQ
LPRNRKQRFKDVPLADLLSREDKTDCLDVVTINNKYLIKMAAVFRWAVRN
DLIKKNMTEGLELKVPQRKASGARNAFSTEQVGQLLVAAKAYSQKTSGKP
YHYYVTALAAITGARLNEIAQLQVKDVRTTEAGTVYIHINEDDSSLPGKS
IKNAHSDRCVPLVDGAYGFILADFMALVETRRGADGDDAMVFDGLRLMKN
GYGEQVSKWFNRTLLPKVLVDRSGLAFHSFRHTVAAQLKQHGVELAYAQA
IMGHSSGSITYDRYAKEVEVDRLVNVMADVYKET
>S0701 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S0545 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4058 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4313 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVRAKEKSAIC
>S2689 putative DNA replication factor
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>S3941 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S3343 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0834 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATREGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S2158 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLACLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S2482 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S2448 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S4236 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2863 IS3 orfB
MVIDLWSRAVIGWSMSPRMTAQPALRCPADGAVAA
>S3708 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S4549 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4177 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0223 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWGKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S0326 IS600 orfB
MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD
RGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHY
RFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S3784 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4449 IS1 orfB
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3960 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S1714 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4460 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S3213 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVPHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0505 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3761 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREV
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S0372 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3548 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S4246 IS1 orfA
MAFISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4219 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0201 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S4437 ISSfl3 orfA
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>S2199 IS1 orfB
MAHVFGERTLATLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKR
YTQRIERHNLNLRQHLARLVRKSLSFSKSVELHDKVIGHYLNIKHYQ
>S1581 hypothetical protein
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVELDESQQQALV
RELNEELDIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALQYPLAPADIPLLEAFMALRAARAAD
>S1211 putative integrase of prophage CP-933C
MSRALNKLSDTQLRKINGTPAQKTAFLNDGGNLSVRHSTSGLLTWYFTYR
AGTGRGAPPERIKLGNYPDLSLKSAREKAAQCRAWLAEGKNPRHELNYTV
QEALKPVTVGDALTYWLESYAKENRVDYAALKKRLNNHVIQHIGAMPLDK
CELRHWLACFDQVAKRTPVTAGFLLQTCKQALKFCRRRRYAISNVLDDMS
VADVGKKPDISERVLSTKELGELLQALDKKIFSPYYIALIRLLIVFGCRT
VELRLSEISEWDFTEMLWTVPKEHSKTKVAIFRPIPEAILPFVTQLVEQN
RHTGLLLGEVKQETSVSQYGRLAHRRLNHPHWSLHDIRRTFTTMLNDLGV
DPHVVEQLTGHQMPGMQRVYNHSRYLDAKRNALDMWTERLGILAGTHENV
TTLPVARRK
>S2385 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S1070 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLYTVLRHLKNSGRSR
>S2157 IS600 orfA
MSRKTRRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S1865 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S4046 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S3189 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVECRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S0134 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1673 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S2795 hypothetical protein
MRIDRIKVTFGEREVFSDVTYNRLVEVLDEWIATRSNNNALELFAELRRF
WKFCAPTLCNGRNVAASLPDDYVSSRVQKPTPTRLFTDIESIARLWLNVA
ACTSVHQKNAVRFMIITGVRPINVHNLRWDYVYEEAGEIVYPEGVIGMRG
AMKTQKAFRLPITPEIRRIIDEQKAWRDSVPECNRDYVFLQPRDPMQPFS
KRSLDKLVKTYSPDGAVK
>S2551 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2859 putative CP4-57-type integrase
MARQTKPLSVKEIESAKPKEADYVLYDGDGLELLIKSSGSKIWQFRYIRP
VTKTRAKKSIGPYPSVTLADARNYRAESRSLLAKQIDPQEHQQEQLRSSL
EAKTNTFQLVAERWWNVKKASVTEDYAEDIWRSLERDVFPAIGDVSVTDI
KAHTLVQAVQPVQARGALETVRRLCQRINEVMIYAQNTGLIDAVPSVNIG
KAFEKPQKKNMPSIRPDQLSQLMQTMRTASISLSTRCLFMWQLLTITRPA
EAAEARWEEVDIEAQEWKIPAARMKMNRDHTVPLSDEAIAVLEMMKPLSG
NREFIFPSRIKPNQPMNSQTVNASLKRAGFGGVLVSHGLRSIASTALNEQ
GFPPDAIEAALAHVDKNEVRRAYNRSDYLEQRRPMMQWWANFVMAADRGS
MIEGGIKGMKLVG
>S0522 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S4481 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S1126 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S4230 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0960 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S3071 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0324 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S0703 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S0687 IS600 orfB
MAHIRTRETYGTRRLQTELAENGIIVGRDRLACLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI
SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>S3883 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S2788 IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S1111 IS600
MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA
TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD
YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGISPRQPSGKNIIRWLLKKE
QMVVSAIASTPQ
>S3288 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0316 putative phage integrase
MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSK
LGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITE
SKIYSAMQKMTNRRHEENWRLRAEACRKKGKPVPEYTPKPASVATKATHL
SFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPE
PLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVA
LNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWK
AALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYA
HLAPNHLTEHARQIDLILNPSVPNLSQSRNKEGTNDV
>S1499 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1610 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S0405 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S0325 putative phage integrase
MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSK
LGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITE
SKIYSAMQKMTNRRHEENWRLRAEACRKKGKPVPEYTPKPASVATKATHL
SFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPE
PLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVA
LNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWK
AALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYA
HLAPNHLTEHARQIDSILNPSVPNSSQSKNKEGTNDV
>S0250 IS629 orfB
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAKVIHRKSWKNRAEVELATLTWVDWYNNRRLPERLGHIPPAEA
EKAYYASIGNDDLAA
>S1676 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S3964 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S1738 IS911 orfA
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR
>S1338 IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S4483 IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S0275 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2278 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S2200 IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMVMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>S1706 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S3177 hypothetical protein
MIYTTNAIESLNSVIRHAIKKRKVFPTDDSVKKVVWLAIQSASQKWTMPL
KDWRMAMSRFIIEFGDRLDGHF
>S2447 IS2 orfA
MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4175 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S4438 ISSfl3 orfB
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>S1004 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKAIGHYLNIKHYQ
>S1900 IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S4014 IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>S0497 IS103, IS103 orf
MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG
EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW
LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ
VRFLETRLMYLKKLKALAHPTKK
>S2427 ada, O6-methylguanine-DNA methyltransferase; transcription activator/repressor
MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE
NVSFYANASEALAAGFRPCKRCQPDKANPRQHRLDKITHACRLLEQETPV
TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV
TTSILNAGFPDSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR
CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI
ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP
KAVRAVASACAANKLAIVIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE
NEER
>S2258 alkA, 3-methyl-adenine DNA glycosylase II
MYILNWQPPYDWSWMLGFLAARAVSGVETVADSYYARSLAVGEYRGVVTA
IPDIARHTLHINLSAGLEPVAAECLAKMSRLLDLQCNPQIVNGALGKLGA
ARPGLRLPGSVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP
EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFLGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>S2426 alkB, alkylated DNA repair protein
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV
APGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHDLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
TDCRYNLTFRQAGKKE
>S1671 b4285, IS911 orfB
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG
VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN
EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN
SNSVASFC
>S4357 dam, DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH
FAEKAQNAFFYCESYADSMERADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>S1447 dbpA, ATP-dependent RNA helicase
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQAVCDALNEVEQSALSLHGDLEQRDQTLVRFANGSARVLVATDVAA
RGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQR
ANIISDMLQIKLNWQTPPANSSIVTLEAEMATLCIDGGKKAKMRPGDVLG
ALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCR
VRLLK
>S2100 dcm, DNA cytosine methylase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKMLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPPEHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHQE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDKGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>S3420 deaD, inducible ATP-independent RNA helicase
MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ
AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT
RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP
GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT
ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN
EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL
REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI
GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR
LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ
GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV
GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL
PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFGGERR
EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>S0790 dinG, probable ATP-dependent helicase
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ
EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVTRREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITVPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDELLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEVQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRFEPSIDNEEQHIAEMAAFFREQVESKKYLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>S0300 dinP, damage-inducible protein P
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK
FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL
SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAK
IASDMNKPNGQFVITPAEVPAFLQTLPLEKIPGVGKVSAAKLEAMGLRTC
GDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA
EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ
EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG
L
>S4009 dnaA, replication initiation protein DnaA
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW
VRDKYLNNINGLLTSFCGADAPQLRFEVGTKPVTQTPQAAVTSNVAAPAQ
VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA
RAAACQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY
MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF
FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE
TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT
GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE
SHDIKEDFSNLIRTLSS
>S3577 dnaB, replicative DNA helicase; part of primosome
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD
VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV
GGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQ
GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM
LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT
MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV
PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE
>S4662 dnaC, chromosome replication protein DnaC
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>S0177 dnaE, DNA polymerase III, alpha subunit
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLALL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERLKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKSVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR
LRQSLEPHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI
GSEQVELEFD
>S3312 dnaG, DNA biosynthesis; DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>S4008 dnaN, DNA polymerase III, beta-subunit
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNLLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>S0209 dnaQ, DNA polymerase III, epsilon subunit
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV
YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA
AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA
RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA
TIQRIVRQASKLRVVFATDEELAAHEARLDLVQKKGGSCLWRA
>S0422 dnaX, DNA polymerase III, tau and gamma subunits; DNA elongation factor III
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEGNGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTLVPPQSAPQQAPT
VPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERLAS
VTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKALE
HEKTPELAVKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDNAV
CLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPLEW
RQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI
>S3140 endA, DNA-specific endonuclease I
MYRYLSIAAVVLSAAFSGPTLAEGINSFSQAKAAAVKVHADAPGTFYCGC
KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHIVPAWQFGHQRQCWQD
GGRKNCAKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ
CAMKVDFKEKAAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNK
MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>S3007 exo, 5-3 exonuclease
MRSLFLFSQPAIACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKADR
PPMPEELHDEMPALRAAFEQRGVPCWSASGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>S4467 fimB, recombinase; regulator for fimA
MRNKADNKKRNFLTHSEIESLLKAANTGPHATRNYCLILLCFIHGFRASE
ICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEVQALKNWLSIRTSYP
HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRYSCGFA
LANMGIDTRLI
>S4466 fimE, recombinase; regulator for fimA
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY
QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTD
AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG
ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>S3516 fis, site-specific DNA inversion stimulation factor; DNA-binding protein
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>S2444 gyrA, DNA gyrase, subunit A, type II topoisomerase
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>S4006 gyrB, DNA gyrase subunit B, type II topoisomerase
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE
ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP
QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG
VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI
GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE
GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE
QQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKI
LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD
VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE
AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR
YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD
VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL
LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL
WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>S1030 helD, DNA helicase IV
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR
GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS
GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD
NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA
QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG
RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT
AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR
RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL
AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGDRCDLDT
TYRFNSRIGEVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL
DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ
QADYVIIVGLQEGSGGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL
TRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>S0056 hepA, probable ATP-dependent RNA helicase
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGILAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPDQTEGFDDLIKSCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>S1636 himA, integration host factor (IHF), alpha subunit
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>S0972 himD, integration host factor (IHF), beta subunit
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>S0663 holA, DNA polymerase III, delta subunit
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VAVRAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQPQLRQ
AVQLLTRTELTLKQDYGQSVWAELEGLSLLLCHKPLADVFIDG
>S1183 holB, DNA polymerase III, delta prime subunit
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA
ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>S4491 holC, DNA polymerase III, chi subunit
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGSAPVEIAWPQKRSSSPRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>S4675 holD, DNA polymerase III, psi subunit
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIASPAHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND
>S1472 hrpA, helicase
MLRDRLRFSRRLHCVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP
EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR
GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM
VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR
RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD
TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE
ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA
RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS
RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL
EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM
IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ
KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY
REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM
VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT
EKVTVYGLPIVAARKVNYSQIDSALCRELFIRHALVEGDWQTRHAFFREN
LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRISHDVISARHFDSWWKK
VSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGA
DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV
PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL
KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI
WSFGQLPESYEQKRGNYKVKAWPALVDERDSVTIKLFDNPLEQKQAMWNG
LRRLLLLNISSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDQ
LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR
LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR
LEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIE
ELRVSYFAQQLGTPYPISDKRILQAMEQISG
>S0143 hrpB, helicase
MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR
AQNCVGLNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
RYLPLPAHQRFDEAVAVATAEMVRQESGSLLLFLPGVGEIQRVQEQLASR
IGSDVVLCPLYGVLSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFDPRTGLTRLITQRVSQASMTQRAGRAGRLEPGISLHL
IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPTVN
LLAAKRLLQMLGALEGERLSAQGQKMAALGNDPRLAAMLVNAKSDDEAAT
AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLAGAFADRIAHRRGQDGRYQLANSMGAMLDADDALSRHEWLIAP
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW
RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAAKWLPEYDWPAVDDESLLATLETWLLPHMTGVHSLRGLKSLDIYQ
ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGRVPLVLKLLSPAQRPLQITRDLGAFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>S3663 hupA, DNA-binding protein HU-alpha (HU-2)
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>S0391 hupB, DNA-binding protein HU-beta, NS1 (HU-1)
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>S0264 is600a, IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>S2175 is629a, IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>S2176 is629b, IS629 orfB
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>S2263 issfl4, ISSfl4 orf
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS
>S2612 lig, DNA ligase
MESIEQQLTELRTTLCHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
VPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLEQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKLAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDEAEMLRLLGS
>S1198 mfd, transcription-repair coupling factor
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ
NALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQDIISSRLSTLYQLPT
MQRGILIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGY
RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ
RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK
GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTL
ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN
ANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDKARNLALICESDLL
GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE
AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW
SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE
TTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK
QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVA
EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD
ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL
ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR
GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL
EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ
QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF
GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL
IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA
>S3039 mutH, methyl-directed mismatch repair protein
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALDGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVARL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE
DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>S4593 mutL, enzyme in methyl-directed mismatch repair
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG
AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA
SISSVSRLTLTSRTAEQQEAWQAYAEGRDMDVTVKPAAHPVGTTLEVLDL
FYNTPARRKFLRTEKTEFSHIDEIIRRIALARFDVTINLSHNGKIVRQYR
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL
AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD
VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRA
IPENRVAAGRNHFAEPAAREPVAPRYSPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKPKAPEPQEPALAANSQSFGRVLTIVHSDCALLE
RDGNISLLSLPVAERWLRQAQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL
EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS
VFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLL
QSVDLHPAIKALKDE
>S4094 mutM, formamidopyrimidine DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>S2944 mutS, methyl-directed mismatch repair protein
MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRSNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>S0098 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV
VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGVPWGKEGQ
PGEWMSLVGLNADDFPPANEPVIAKLKRL
>S3161 mutY, adenine glycosylase
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP
YFERFMAHFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA
VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS
LCPLQNGCIAATNNSWSLYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP
PSGLWGGLYCFPQFADEESLRHWLAQRQIAADNLTQLTAFRHTFSHFHLD
IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>S0596 nei, endonuclease VIII and DNA N-glycosylase with an AP lyase activity
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVET
RGKALLTHFPNGLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT
ILLYSASDIEMLRPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLEIPRFSYATRGQVDENKHHGALFRFKVFHRDGELCERCGGIIEKTTL
SSRPFYWCPGCQH
>S3665 nfi, endonuclease V (deoxyinosine 3' endonuclease)
MDLASLRAQQIELASSVICEDRLDKDPPDLIAGADVGFEQGGEVTRAAMV
LLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPDLV
FVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSKPGALA
PLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRLPE
PTRWADAVASERPAFVRYTANQP
>S2373 nfo, endonuclease IV
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARIVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQN
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>S1790 nth, endonuclease III
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDISVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH
NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>S1941 ntpA, dATP pyrophosphohydrolase
MSEGSQRRGSVKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVT
GSVEEGETAPQAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHR
YAPGVTRNTESWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQ
AIEQFVINAA
>S1435 ogt, O-6-alkylguanine-DNA/cysteine-protein methyltransferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG
YERISATNPGGLSDKLREYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVQRKEWLLRHEGYLLL
>S2707 parA, resolvase
MRRTKPVAAPMVARVYLRVSTDAQDLERQEAITTAAKAAGYYVAGIYREK
ASGARADRPELLRMIGDLQPGEVVIAEKIDRISRLPLPEAERLVASIQAK
GARLAVPGVVDLSDLAAEAQGVAKIVLEAVQIMLFRLALQMARDDYEDRR
ERQRQGIELARQAGRYKGRRADPKRRAQVVALRKSGYSINKTAELAGYSA
AQVKRIWAEVSQAEAKQHGAFVEDALTEADALAAVGQDERQEERA
>S3267 parC, DNA topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS
EE
>S3275 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL
QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQGLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>S0602 phrB, deoxyribodipyrimidine photolyase (photoreactivation)
MTTHLAWFRQDLRLHDNLALAAACRNSSARVLALYIATPRKWETHNMSPR
QAELINAQLNGLQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHL
FYNYQYEVNERARDVDVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIKPAPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCENGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQALDGGAGSVWLSELIWREFYRHLMTYYPSLCKHRPFIA
WTDRVQWQSNPAHLQAWQKGKTGYPIIDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPITQGEKFDREGEFIRRWLPELRDVPGKAVHEPWKWAQKAGVKLDYPQP
IVEHKEARVQTLAAYEAARKGK
>S3813 polA, DNA polymerase I, 3'-->5' polymerase, 5'-->3' and 3'-->5' exonuclease
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI
MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK
AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT
LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT
AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA
TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEVGKWLQAK
GAKTAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP
VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISHER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN
SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE
DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK
VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK
KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP
KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV
IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT
SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKELGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG
TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI
HQLMENCTRLDVPLLVEVGSGENWDQAH
>S0057 polB, DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIHADQV
PRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG
VTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVAS
RPLLLEKLNAWFANHDPDVIIGWNVVQFDLRMLQKHAERYRLPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDKAKRQGNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDIIYGDTDSTF
VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYVRQTIDKLMAGELDARLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>S3734 priA, primosomal protein N (factor Y), putative helicase
MPVAHVALPVPLPRTFDYLLPEGMTVKAGCRVRVPFGKQQERIGIVVSVS
DASELPLTELKAVVEVLDGEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL
PILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKI
WRDQVATLEFNDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLN
TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV
MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV
IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI
PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQANNQVILFLNRRGFAPALLCHDCGWIAECPRCDH
YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP
LFPDVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD
VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH
PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP
LFLQQLRNLILSSPLADDKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR
LQHIISGTLALINTIPDSRKVKWVLDVDPIEG
>S4626 priB, primosomal replication protein N
MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM
PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI
DSGD
>S0419 priC, primosomal replication protein N
MKTALLLEKLEGQLATLRQRCAPVSQFATLSARFDRHLFQTRATTLQSCL
DEAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREASAWSLREWDS
APPKISRWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVK
AYEARLARCRHALEKIENRLARLTR
>S4091 radC, DNA repair protein
MKNNAQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEMLE
NFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREESP
LLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHSRLFSGTLNHVEV
HPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFMDL
RVLDHIVIGRGEYVSFAERGWI
>S2913 recA, DNA-dependent ATPase, DNA- and ATP-dependent coprotease
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL
TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>S3028 recB, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF
PRPLIVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERL
LEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ
QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY
LQGEPPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS
GIDRRKFNRSNQAKWIEKISAWAEEETNSYQLPESLEKFSQRFLEDRTKA
GGETPRHPLFEAIDQLLAEPLSIRDLLITRALAEIRETVAREKRRRGELG
FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA
PGMVNSVNKLFSQTDDTFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK
MWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVR
ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL
QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ
IWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ
LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP
LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEVERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA
AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN
WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS
PGTFLHSLFEDLDFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA
PLNETGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC
PPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA
AMQAHRYDLQYQLYTLALHRYLRHRIADYDYDLHFGGVIYLFLRGVDKEH
PQQGIYTTRPNAGLIALMDEMFAGMTLEEA
>S3030 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTHELGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNSLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEEA
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFHTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPE
TMETIVEQSQRFLLPLFRFNQS
>S3027 recD, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEASNPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHTIEVDEALLAQTLDKLFPVSDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERARQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRHPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFSSRE
>S4007 recF, Rec protein
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA
ADMDDTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>S4077 recG, DNA helicase
MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL
YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFS
AAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP
VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT
LHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP
LSANDTLKNKLLAALPFKPTGAQARVVAEIEHDMALDVPMMRLVQGDVGS
GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFAPLGIEVGW
LAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH
RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE
LPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWVCTLIEESELLE
AQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVAT
TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT
PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD
LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA
>S3077 recJ, ssDNA exonuclease
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQVEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFELFQQRFG
ELVTEWLAPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>S2853 recN, DNA repair protein RecN
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALETARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKYD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLDKKARLQELARLLGGSEVTRNTLANAKEL
LAA
>S2800 recO, RecO protein
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGDTGTPEPALRRFELALLGHLGYGVNFTH
CAGSGEPVDGTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>S3855 recQ, ATP-dependent DNA helicase
MNVAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMP
TGGGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNST
QTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEA
HCISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGL
NDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAK
VEDTAARLQSRGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGM
GINKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWL
RRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCG
NCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIR
DYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAA
RPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIAD
ESNVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMAL
IRAHVDGDDEE
>S0424 recR, recombination protein RecR
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>S1670 relB, negative regulator of translation
MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTL
LSDEDAELVEIVKERLRNPKPVRVTLDEL
>S3908 rep, rep helicase
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVGQTLGRKEAHGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPS
QAAASAIGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR
GNYQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSTKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>S3906 rhlB, putative ATP-dependent RNA helicase
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>S0788 rhlE, putative ATP-dependent RNA helicase
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSVAIHGNKSQGARTRALADFKSGDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGIRLM
NKRKTA
>S0208 rnhA, RNase HI
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM
AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN
VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG
YQVEV
>S0176 rnhB, RNAse HII
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALYEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEYGATEHHRRSFGPVKRALGLAS
>S1811 rnt, RNase T
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLPAAEEV
>S1665 rus, endodeoxyribonuclease RUS (Holliday junction resolvase)
MLIDLVLPYPPTVNTYWRRRGSTYFISEEGKRYRRAVALIVRQQRLKLSL
SGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVR
GQPVSGGRLGVKIYPIMH
>S1937 ruvA, Holliday junction helicase subunit B
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDTSSETLIREALR
AAL
>S1936 ruvB, Holliday junction helicase subunit A
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGAQEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP
>S1939 ruvC, Holliday junction nuclease
MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA
GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF
EYAARQVKQTVVGMGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH
CHVSQNAMQMSESRLNLTRGRLR
>S2193 sbcB, exonuclease I, 3'-->5' specific; deoxyribophosphodiesterase
MTDTDKQPTFLFHDYETFGTHPALDRPAQFAAIRTDDEFNVIGEPEVFYC
KPADDYLPRPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCILG
YNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPEG
INWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTRQ
PRLFDYLFTHRNKHKLMALIDVPQMKPLAHVSGMFGAWRGNTSWVAPLAW
HPENRNAVIMADLAGDISPLLELDIDTLRERLYTAKADLGDNAAVPVKLV
HINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAIF
AEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVDK
RIEKLLFNYRARNFPGTLDYAEQQRWLEHRHQVFTPEFLQGYADELQMLV
QQYADDKEKVALLKALWQYAEEIV
>S0342 sbcC, ATP-dependent dsDNA exonuclease
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL
EKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLT
RQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT
WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLTLTADEVATALAQHAEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ
NVTLEQTQRNVALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALR
GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQP
WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL
AGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET
LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ
QLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG
LTLDNLVHLANQQLTRLHVRYLLQRKASEALEVEVVDTWQADAVRDTRTL
SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL
DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK
>S0343 sbcD, ATP-dependent dsDNA exonuclease
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIISSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGSSKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEDLDESQQQRLQHLFTTTLRTLAGEHEA
>S0617 seqA, negative modulator of initiation of replication
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR
VASPAIVEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS
LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN
TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI
>S3542 smf, hypothetical protein
MVDIEIWLRLMSISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLQATTDYPGALFVEG
ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATCGVTITSGLARGIDG
VVHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLFEQGGALVSEFPLD
VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS
PDQQDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>S2811 srmB, ATP-dependent RNA helicase
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>S3584 ssb, ssDNA-binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>S4186 tag, 3-methyl-adenine DNA glycosylase I
MERCGWVSQGPLYIAYHDNEWGVPETDSKKLFEMICFEGQQAGLSWITVL
KKRENYRAYFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR
AYLQMEQNGEPFPDFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALK
KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYLGNKP
>S1361 topA, DNA topoisomerase type I, omega protein
MGKALVIVESPAKAKTINKYLGCDYVVKSSVGHIRDLPTSGSAAKKSADS
TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA
EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA
FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA
VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN
KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG
VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE
SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL
PAVDKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI
ISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSL
DQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP
TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED
AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR
IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP
PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG
WSAFYVDGKWVEGKK
>S1575 topB, DNA topoisomerase III
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ
PDAYDSRYARWNLADLPIVPEKWQLQPRPSVTKQLNVIKRFLHEASEIVH
AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLR
SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT
PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ
DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE
AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA
ISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLIARQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL
GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM
TGIARFVQDKDLKKVLRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHS
TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY
QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEAGSG
AIA
>S1261 umuC, mutagenesis and repair protein
MFALCDVNAFYASCETVFRPDLWGKPVVVLSNNDGCVIARNAEAKALGVK
MGDPWFKQKDLFRRCGVVCFSSNYELYADMSNRVMSTLEELSPRVEIYSI
DEAFCDLTGVRNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLAN
HAAKKWQRQTGGVVDLSNLERQRKLMSALPVDDVWGIGRRISKKLDAMGI
KTVLDLADTDIRFIRKHFNVMLERTVRELRGEPCLQLEEFAPTKQEIICS
RSFGERITDYTSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFALN
EPYYGNSASVKLLTPTQDSRDIINAATRSLDAIWQAGHRYQKAGVMLGDF
FSQGVAQLNLFDDNAPRPGSEQLMAVMDTLNAKEGRGTLYFAGQGIQQQW
QMKRAMLSPRYTTRSSDLLRVK
>S2815 ung, uracil-DNA-glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQRGETPIDWMPVLPAECE
>S3583 uvrA, excision nuclease subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>S0770 uvrB, excision nuclease subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVIVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMDEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDRSTPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMLPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>S2052 uvrC, excinuclease ABC, subunit C
MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH
TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK
HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG
RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE
FEEAACIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR
QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN
LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ
QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR
AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL
AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP
DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML
LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>S3865 uvrD, DNA-dependent ATPase I and helicase II
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE
NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA
HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK
DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW
LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD
QSIYGWRGAQVENIQRFLNDFPGAEIIRLEQNYRSTSNILSAANALIENN
NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC
AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL
AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE
KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW
QDAVQLMTMHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY
VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV
SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV
AFQGQGIKWLVAAYARLESV
>S2099 vsr, DNA mismatch endonuclease, patch repair protein
MVDVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
ISRLQELGWRVLIVWECALRRREKLTDAALTERLEEWICGEGASAQIDTQ
GIHLLA
>S4101 waaP, lipopolysaccharide core biosynthesis protein
MVWMVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSY
FLKWHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAF
GEKGMNPLTRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVA
TMVRDMHAAGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRV
PGRWRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFAAPLKDILKQEQGLLS
QAEAKATKIRERTIRKSL
>S4103 waaY, putative LPS biosynthesis protein
MIYNKTINGLKVFIKDNDPFYEQVLNDFLTCRVKTLKVFRSIDDTKVILI
DTARGPLVLKVYAPKHKMTERFLKSCIKKDYYENLIYQTDRVRGEGIQSI
NDYFLLAERKTLNFAHYYIMLIEYIEGVGLNEYLEISEDLKDQLSESIKE
LHQHGMVSGDPHKGNFIVSEKGLRLIDLSGKKTTAVLKAKDRIDLERHYN
IKNELKDFGYTYLIFKKKIKKVIRDVKVKLGLKSK
>S2237 wcaH, GDP-mannose mannosyl hydrolase
MFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPGG
RVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFTT
HYVVLGFRFRVAEEELLLPDEQHDDYRWLTPDALLASNDVHANSRAYFLA
EKRAGVPGL
>S3867 xerC, site-specific recombinase
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ
CDAAMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>S3079 xerD, site-specific recombinase
MKQELARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>S2727 xseA, exonuclease VII, large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE
SMQPAGEGLLQQKYEQLKAKLQAECLFDQQYKKPLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPTSVQGDDAPGQIVRAIELANQRNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSARQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRAGQQQQRLTQRLNQ
QNPQPKIHRTQTRIQQLEYRLAEILRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGNVLKKVKQVKTGEMLTTRLEDGWIESEVKNIQPVKK
SRKKVH
>S0367 xseB, exonuclease VII, small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDASLTPFTPDNE
>S1594 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG
DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGVVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>S0297 yafM, hypothetical protein
MSEYRRYYIKGGTWFFTVNLRNRRSHLLTTQFQTLRNAIINVKRDRPFEI
NAWVVLPEHMHCIWTLPESDDDFSSRWREIKKQFTHACGLKNIWQPRFWE
HAIRNTKDYRHHVDYIYINPEKHGWVKQVSDWPFSTFHRDVARGLYPIDW
AGDITDLSAGERIIL
>S0339 yaiD, hypothetical protein
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG
SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK
KTEKDSLKDEVLHSLLPRAFSRFSQTMMWLDTVNGLIMVDCASAKKAEDT
LALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSL
LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL
KRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGE
AQR
>S0393 ybaV, hypothetical protein
MKHGIKALLITLSLACAGMSHSALAAASVAKPTAVETKAEAPAAQSKAAV
PAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV
EDLKQVPGMGNSLVERNLAVLTL
>S0406 ybaZ, hypothetical protein
MEKEDSFPQRVWQIVAAIPEGYVTTYGDVAKLAGSPRAARQVGGVLKRLP
EGSTLPWHRVVNRHGTISLTGPDLQRQRQALLAEGVMVSGSGQIDLQRYR
WNY
>S0660 ybeL, putative alpha helical protein
MNKVAQYYRELVASLNERLRNGERDIDALVEQARERVIKTGELTRTEIDE
LTRAVRRDLEEFAMSYEESLKEESDSVFMRVIKESLWQELADITDKTQLE
WREIFQDLNHHGVYHSGEVVGLGNLVCEKCHFHLPIYTPEVLTLCPKCGY
DQFQRRPFEP
>S0876 ybjD, hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>S0892 ycaJ, putative polynucleotide enzyme
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDDTRRAI
AELVNGDARRALNTLEMMADMAEVYDSGKRVLKPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>S1184 ycfH, hypothetical protein
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM
RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY
YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC
GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDTARYVPLDRL
LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF
ARLFHIDASRLQSIR
>S1530 yeaB, hypothetical protein
MEYRSLTLDDFLSRFQLLRPQINREPLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVTP
>S3201 yeeS, putative RADC family DNA repair protein
MVAGTMQQLSFLPGEMTTRERSLILRALKTLDRHLHEPGVAFTSTHAARE
WLILNMAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRA
LYHNAAAVVLAHNHPSGEVPPSKADRLITERLVQALALVDIRVPDHLIVG
GSQVFSFAEHGLL
>S2400 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAE
KRKGVMIFAATVEHAKEIVGLLPAEDAALITGDNPGAERDVLIEDFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>S2463 yfaO, hypothetical protein
MADDRGVFPGQWALSGGGVESGERIEEALRREIREELGEQLLLTEITPWT
FSDDIRTKTYADGRKEEIYMIYLIFDCVSANREVKINEEFQDYAWVKPED
LVHYDLNVATRKTLRLKGLL
>S2660 yffH, hypothetical protein
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNAKKKSVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQMSHLMD
>S2839 yfiL, hypothetical protein
MMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVGMEDAISGSAIKDD
DAFSDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSGKSFPASCNNVENA
SQLHEVWQKRADENASTIRLN
>S3038 ygdP, putative invasion protein
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNASAYRRKRG
>S3413 yhbQ, hypothetical protein
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD
>S3517 yhdJ, putative methyltransferase
MRTGCEPTRFGNEAKTIIHGDAFAELKKLPTESVDLLFADPPYNIGKNFD
GLIEAWKEDLFIDWLFEVIAECHRVLKKQGSMYIMNSTENMPFIDLQCRK
LFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAILVE
AKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQKP
KALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEYIK
MGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLIAK
>S4280 yhhF, hypothetical protein
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRVVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQHEAQGESDAD
>S0267 yi21_6, IS2 orfA
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>S0266 yi22_6, IS2 orfB
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP
AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE
TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI
>S0968 yi41, IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S4082 yicF, putative enzyme
MKVWMAILISILCWQSSVWAVCPAWSPARAQEEISRLQQQIKQWDDDYWK
EGKSEVEDGVYDQLSARLTQWQRCFVSEPRDVMMPPLNGAVMHPVAHTGV
RKMADKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNGLK
GEDWTQKVSLISAVLQTVSGPLANSTLQGEIFLQREGHIQQQMGGINARA
KVAGLMMRQGNSDTLNSLAVFVWAWPDGPQLMTDRLKELATAGFTLTQRY
TRAVKNADEVARVRNEWWKAKLPFVTDGVVVRGAKEPESRHWLPGQAEWL
VAWKYQPVAQVAEVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNIGS
VRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFNPL
TCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHIFS
WLLLTPEQLQNTPGIAKSKSAQLWHQFNLARNQPFTRWVMAMGIPLTRAA
LNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGSWL
AAQQITGFEP
>S3836 yigW, hypothetical protein
MFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQKLAR
QYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDFNRNF
STPEEQERAFVAQLRIAAELNMPVFMHCRDAHERFMTLLEPWLDKLPGAV
LHCFTGTREEMQACVARGIYIGITGWVCDERRGLELRELLPLIPAEKLLI
ETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAATTDA
NVKTLFGIAF
>S3667 yjaD, hypothetical protein
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLIQQQRRHDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSLLLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL
MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>S4681 yjjV, hypothetical protein
MQALAENYQPLYAVLGLHPGMLEKHSDVSLEQLQQALERRPAKVVAVGEI
GLDLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRH
DLPRAGVVHGFSGSQQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLP
LASLLLETDAPDMPLNGFQGQPNRPEQAARVFAVLCELRPEPADEIAEVL
LNNTYAVFNVRG
>S3144 yqgF, hypothetical protein
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY
>S3279 yqiE, hypothetical protein
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>S3406 yraN, hypothetical protein
MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI
DLIMREGLTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS
>S3539 yrdD, putative DNA topoisomerase
MRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSSADGHIVKV
LEGQVCPVCGANLVLRQGRFGMFIGCSNYPECEHTELIDKPDETAITCPQ
CRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPECHYPLLIEK
KTAQGVKHFCASKQCGKPVSAE
>S4347 yrfE, hypothetical protein
MNKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV