Gene list
Applied filters:
COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: chromosome
Number of genes found: 669
# Shigella flexneri 2a str. 2457T, 2457T >S1673 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S3064 putative integrase MLKSRTYLYQRNGVFYIRLRMKTTSRLTASLPSHNRYKLASVSLRTKDRR TAMAHSRHIKSALKAIHADNPNASYEELREHLKTIVEWELSVSRDDLNDP ESYQLYVDQYDDIKSNLREAVATERLTVDQHRYINDVIGVLKACQDRLNG DSSGLLSYLEPETGSLRPSVSLSVLAEPEVPEPKALTLASLIEQYEQENA QNWKPATLSENRASHSTLIEIFDYLDIQDVGKATRADMLRVREVLQQLPK NRKQRFKSMPLSDLLNRESKTDCLDVVTINNKYLIKMAAVFKWAVRNDLI AKNLTEGLELKVPQRKASDARDAFSPEQVGQLLVAAKAYSQKTSGKPYHY YVTALAAITGARLNEVAQLQVKDVRTTEAGTVFIHINEDDSSLPGKSIKN AHSDRCVPLVDGAYGFVLADFMSLVEDRRKTEGDNAMVFNGLKLMKNGYG EQVSKWFNRTLLPKVLADRSGLAFHSFRHTVATQLKQHGVELAYAQAIMG HSSGSITYDRYAKEVEVETLKEKLAESLSVKKIDGK >S1133 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1026 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0928 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0872 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1953 IS2 orfA MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S2795 hypothetical protein MRIDRIKVTFGEREVFSDVTYNRLVEVLDEWIATRSNNNALELFAELRRF WKFCAPTLCNGRNVAASLPDDYVSSRVQKPTPTRLFTDIESIARLWLNVA ACTSVHQKNAVRFMIITGVRPINVHNLRWDYVYEEAGEIVYPEGVIGMRG AMKTQKAFRLPITPEIRRIIDEQKAWRDSVPECNRDYVFLQPRDPMQPFS KRSLDKLVKTYSPDGAVK >S0703 
IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S1975 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1878 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0485 IS629 orfB MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYV STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT VHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAE VIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIG NDDLAA >S3964 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1131 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >S1578 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1962 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3343 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4666 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2793 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4130 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1325 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIGQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLSKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM 
RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >S0221 IS2 orfB MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL DCCDREALHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE I >S1469 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1627 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4441 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1862 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0715 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVAGIKDVYTCEIVGYA MGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSG LKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEI FYNRQRRHSRLGNISPAAFREKYHQMAA >S4178 IS150 putative transposase MKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRIGEI YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY RGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNN EVISYSLSERPVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQN ILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEFSNISELKDAV TEYIEYYNSRRISLKLKGLTPIEYRNQTYMPRV >S0871 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S1738 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER 
QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S3394 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKLLSFSKSV ELHDKVIGHYLNIKHYQ >S2187 ISSfl3 orfD MEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEVE PEDWLREVIEKLNDWPSNQVHKLLPWNFSSVK >S1716 ISEhe3 orfB MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS SEQMSPTEYENQYYQRLGSV >S2943 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1863 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1239 IS1 orfB MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARPVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0708 IS911 orfB MKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGN CWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEY NGGLPPNESENRYWKNSNSVASFC >S4147 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0322 integrase fragment MRTDSNKAWKGALKRAGISNFRFHDLRHTWASWLVQSGVSLLALKEMGGW ETLEMVQRYAHLSAGHLTEHASKIDAIISRNGTNTAQEENVVYLNAR >S2942 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1126 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S3314 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP 
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S1856 IS1 orfA MASISIRCPSCSATEGVVRNSKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2009 IS629 orfB MWRSSLMCLPDTSWGAMETTFVLDALEQALWARRPSGTVHHSDKGSQYVS LAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAE VELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA >S1706 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0471 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2279 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1338 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1660 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S1836 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1111 IS600 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGISPRQPSGKNIIRWLLKKE QMVVSAIASTPQ >S0078 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4063 IS629 orfB MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA 
LWARRPSGTVHHSDKGSQYVSLVYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA EKAYYASIGNDDLAA >S4450 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4175 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2651 IS1N orfA MTSVNIHCPRCQSAQVYRHGQNPKGRDRLRCRDCHRVFQFTYTYQARKPG MKELITEMAFNGAGVRDTARTLKIGSNTVIRTLKNSRQSE >S3634 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0326 IS600 orfB MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD RGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHY RFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S2130 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S2106 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWKDGRRSRHSDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRIYRLMRQNALLLERKL AVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDRKA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S3154 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1129 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S4113 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT 
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2034 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2880 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S3937 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2072 putative virulence protein MRYRNSRQVTGVVKNVTVSQSCGKWYISIQAESEVSTPVHPSASMVGLDA GVAKLASLSDGTVFEPVNSFQKNQKTLARLQRQLSRKVKFSNNWQKQKRK IQRLHSCIANIRRDYLHKVTTTVSKNHVMIVIEDLKVSNMSKSAAGTVSQ PGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRC ACCGHTAKENRLSQSKFRCQVCGYTANADVNGARNILAAGHAVLACGEMV QSGRSLKQEPTEMIQATA >S2337 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >S3399 IS2 orfB MTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQF ARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLA EAFEHYNEWHPHSALGYRSPREYLRQRACNRLSDNRCLEI >S3547 ISSfl3 orfA MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >S3960 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S4822 putative P4-type integrase MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKT LALGVYPEVSLSEARTKRDEARKLISEGIDPCEQKRVKKVVPDLQLSFEH 
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVR AAEAKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH RPALELSRLPELLSRIDGYKGQPVTRLAVMLNLLVFIRSSELRYARWSEI DIDNSMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQVVAILAELQTW AGENGLIFTGAHDPRKPISENTVNKALRVMGYDTTQEVCGHGFRAMACSA LIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD ANREKGISPFEYAKINNPLK >S4097 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2552 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1093 IS3 orfB MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >S1070 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLYTVLRHLKNSGRSR >S4449 IS1 orfB MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0234 IS911 orfB MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >S2728 hypothetical protein MNVEGMATGGIHMELHCPKCQHVLDQDNGHARCPSCGEFIEMKALCPDCH QPLQVLKACGAVDYFCQHGHGLISKKRVEFVLA >S2033 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4497 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTDSQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1134 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSSTAHTI TGSYRSSLV >S4521 IS4 orf 
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S0722 putative replication protein DnaC MMTFNLREQQKRLQARMDELRAEIAFAQKGEKPWPYRSCLMREGRGYCEK HGEYHTHILVWSDRNGEDREEISCCPDCLIAEANDLTMELSSIKAEELTD NAGIALRFRDCEFDNYLEVNPGAARNLAACRRYAENWPDMLENGTSLVMT GSCGTGKNHLAVAMAKHIIRNYLASVEITDVMRLTRAVKNCWRNDSEKTA DEVIERYASMDLLIIDEVGVQFGSAAEMAILQEIINARYESILPTILISN LSPEELWAFISPRIADRITDGGRNWLSFNWPSYRSRIRGVAA >S1813 putative virulence protein MRRFAGACRFVFNRALALQNENHEAGNKYIPYGKMASWLVEWKNATETQW LKDSPSQPLQQSLKDLERAYKNFFQKRAAFPRFKKRGQNAAFRYPQDVKL DQENSRIFLPKLGWMRYRNSRQVTGVVKNVTVSQSCGTWYISIQTESEVS TPVHPSASMIGLDAGVAKLATLSDGTVFEPVNSFQKNQKTLARLQRQLSR KVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLK VSNMSKSAAGTVSQPGRNVRAKSGLNRSILDQGWYEMRRQLEYKQLWSGG QVLAVPPAYTSQRCACCGHTAKENRLSQSKFRCQVCGYTANADVNGARNI LAAGHAVLACGEMVQSGRSLKQEPTEMIQATA >S4219 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2122 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4667 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2190 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S2256 ISSfl4 orf MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH 
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASSRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >S2335 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1135 IS600 orfB MSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNR QRRHSRLGNISPAAFREKYHQIAA >S2879 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4460 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0311 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4549 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0606 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2860 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGYRASARIMGVGLNTVLRHLKNSGRSGNLAHTTRQ >S4235 IS1 orfB MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1686 IS600 MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S1579 IS2 orfB 
MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0487 IS2 orfB MDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN AKRVYRLMRQNALLLERKPAVSPSKRAHTGRVAVKESNQRWCSDGFEFCC DNGERLRVTFALDCCDREALHWARTTGGFNSETVQDVMLGAVERRFGNDL PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK TIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQR ACNGLSDNRCLEI >S0505 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3473 IS600 orf MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFRENIIRWLLKKRTNGSVRYCQY TSKVAMIYIEQLELIHKSGDVLYPVKITRKSSGKTAFHLVPFGLNKTHDL LEVEDASEAIRLVIDERHSIRCSTLTATITNKKGKRIKRTGIYSIKGVNI KEYNVR >S1675 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA >S4030 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S3177 hypothetical protein MIYTTNAIESLNSVIRHAIKKRKVFPTDDSVKKVVWLAIQSASQKWTMPL KDWRMAMSRFIIEFGDRLDGHF >S3124 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2133 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S4610 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV 
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1211 putative integrase of prophage CP-933C MSRALNKLSDTQLRKINGTPAQKTAFLNDGGNLSVRHSTSGLLTWYFTYR AGTGRGAPPERIKLGNYPDLSLKSAREKAAQCRAWLAEGKNPRHELNYTV QEALKPVTVGDALTYWLESYAKENRVDYAALKKRLNNHVIQHIGAMPLDK CELRHWLACFDQVAKRTPVTAGFLLQTCKQALKFCRRRRYAISNVLDDMS VADVGKKPDISERVLSTKELGELLQALDKKIFSPYYIALIRLLIVFGCRT VELRLSEISEWDFTEMLWTVPKEHSKTKVAIFRPIPEAILPFVTQLVEQN RHTGLLLGEVKQETSVSQYGRLAHRRLNHPHWSLHDIRRTFTTMLNDLGV DPHVVEQLTGHQMPGMQRVYNHSRYLDAKRNALDMWTERLGILAGTHENV TTLPVARRK >S3761 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREV LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S2278 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S3213 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVPHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S3181 reverse transcriptase-like protein MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDG VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI DVGLFRAAVKVCHRAVLYRRYYRTSC >S1004 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKAIGHYLNIKHYQ >S0328 IS629, orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQV >S2827 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP 
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMGDLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S0922 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0701 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S4062 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVR VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >S2982 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0923 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S2109 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1581 hypothetical protein MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVELDESQQQALV RELNEELDIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA LVWCSPEEALQYPLAPADIPLLEAFMALRAARAAD >S2191 ISSfl3 orfA MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK QALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQECMPVPVAL TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLFDPLTP ELLRALIREMKGGIR >S1899 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA 
KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4059 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0223 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWGKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S0455 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0985 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGVSPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S2018 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S0833 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0949 IS629 orfB MPLLDKLREQYRVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMEVMGLAGVLR GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA EVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNDDLAA >S3388 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1462 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4461 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1027 IS2 orfB MARGWGVSLVSRCLLVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH 
HVIGELPTYGYRRVWALFRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGL >S3875 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4481 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S0136 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1736 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S2483 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1422 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4438 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S2832 IS4 orf MFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAG NPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLT LMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGDHLVKLK TSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADL YSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQ MIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLASM GQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S3776 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSHIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0945 ISSfl3 orfA 
MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK QALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQECMPVPVAL TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLSDPLTP ELLRALIREMKGGIR >S0947 ISSfl3 orfC MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTEPPRESWRLNFLRKR >S0960 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S3289 IS2 orfB MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVW ALLRRQAELDGMPAINAKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAV KESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETV QDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNT AVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHP HSALGYRSPRG >S4550 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2172 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S2336 IS1 B MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2299 IS1 orfB MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4811 IS1 orfB MAHVFGERTLATLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKR YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >S3936 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0725 putative bacteriophage protein MSVKIQTIPELLIQTRGNMTEVSRMLNCNRATVRKYAEDKEGKGHAIVDG VLIVHRGWDRGKDSDA >S0946 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS 
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S2155 IS911 orfA MADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKL QRIEMENEILKKATALLMSDSLNSSR >S3210 IS3 orfB MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWPGDITYLRTDEGWLYLAV VIDLWSRAVIGWSMSPRMTAQLPCDALQMALWRRKRPRNVIVHTDRGGQY CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >S0943 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGFRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1002 IS1 orfB, A MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLYTVLRHLKNSAESVTSRIQPGSD VIVCAEMDEHWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL LSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL ARLVRKSLSFSKSVELHDKAIGHYLNIKHYQ >S4112 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0959 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0523 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S3291 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S3520 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3389 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1058 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1987 putative crossover junction endodeoxyribonuclease 
MRHEFILPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKLSL SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDETNIVR GLPVPGGRLGIKITELECA >S3290 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4632 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1963 putative integrase for prophage CP-933R MESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDFEKGIVNVRRNLTILD MFGPPKTNAGIRTVALLQPALEALKEQYKLTGHHRKSEITFYHREYGRTE KQKLHFVFMPRVCNEKQKPYYSVSSLGTRWNAAVKRAGIRRRNPYHTRHT FACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQIAMLNARL S >S4629 ISEhe3 orfA MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD >S1094 IS3 orfA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK >S4050 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S1482 IS103 orf MKVLNVLRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRIGEI YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY RGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNN EVISYSLSERPVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQN ILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEFSNISELKDAV TEYIEYYNSRRISLKLKGLTPNEYRNQTYMPRV >S1069 IS1 orfB MSRQCTHYGRWPLHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1535 putative enzyme MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA 
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGSAR QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDYR LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR >S0698 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1877 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2565 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S3131 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4246 IS1 orfA MAFISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4568 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0732 hypothetical bacteriophage protein MKHEEMNQRFNHLENEITELNKKLSALVSSEDENKRRDEHYAAIYDYCHK VAHETFMKFLQEKFLPAALSEKEAAYLRPEYVITVNSAGEEEHKSDFIAS APDKDQEPHRPFRVSCEEGEFVVYENGKPVRASHHHCLKIINLAIRCLKD ENTRVMKRIGRCMGYLQVAAEIEALASGADMDAAVREALLRDFNTPPLRK SLMTGSSRG >S1972 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA >S1837 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2157 
IS600 orfA MSRKTRRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1132 IS629 orfB MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPEQ LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA EKAYYASIGNDDLAA >S0249 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >S1715 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDIMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4131 IS1 orfB MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1500 IS1 orfB MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4442 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S1981 putative integrase MIQLSSSHKLPAVYYLYQRNGVYYFRLRVRQSNNDRMTSISLRTKDRRTA MAYSRHIKAALKAIHADRPNATYEEMREHLKDIAECELSMGRSDLFEPDM RDIYRDQYGELGESLTDALASEPLSIDQHRYINEALKVLKACMRRIEAGD SQPLIDYVDLFNDIDRQDNQADSVSLSVNAPEVKPEVTPSITIASLFEQY EAENYQNWKPATLRENKASHAALIEIFDHLGLNADANRADMLRVRDVLQQ LPRNRKQRFKDVPLADLLSREDKTDCLDVVTINNKYLIKMAAVFRWAVRN DLIKKNMTEGLELKVPQRKASGARNAFSTEQVGQLLVAAKAYSQKTSGKP YHYYVTALAAITGARLNEIAQLQVKDVRTTEAGTVYIHINEDDSSLPGKS IKNAHSDRCVPLVDGAYGFILADFMALVETRRGADGDDAMVFDGLRLMKN 
GYGEQVSKWFNRTLLPKVLVDRSGLAFHSFRHTVAAQLKQHGVELAYAQA IMGHSSGSITYDRYAKEVEVDRLVNVMADVYKET >S2189 ISSfl3 orfC MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIGTTSS >S2723 putative integrase MLTDTKLRNLKPRDKLYKVNDREGLYVAVTPAGSISFRYNYSINGRQETI TFGRYGVGGITLAEARELLGDAKKMVAAGKSPAKEKARDKARVKDAETFG AWAEKWLRGYQMADSTRDMRRSVYERELKPKFSNQKLVEITHEDLRALAD AIVERGAPATAVHVREIVLQVFRWAIERGQKVENPAELVRPTSIARFEPR DRALTPEEIGLMYQYMERVGTSPTNRAAAKLLLLTMVRKSELTNATWSEI NFSEALWTIPKERMKRRNPHLVFLSQQALDIFIAMKTFAGGSDFVLPSRY DSDAPMSAATLNQVLTLTYKAAQKDGKSLTKFGPHDLRRTASTLLHEAGY NTDWIEKCLAHEQKGVRAVYNKAEYREQRAAMLQDWADMIDEWTSGGSKG >S1891 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4638 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2738 ISSfl4 orf MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDVRDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >S1775 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1989 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4551 IS600 orfB MCQVFGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT 
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >S3963 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0383 ISSfl3 orfA MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK QALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQECMPVPVAL TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLFDPLTP ELLRALIREMKGGIR >S1544 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0224 IS911 orfB MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSKAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >S4015 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0522 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S2863 IS3 orfB MVIDLWSRAVIGWSMSPRMTAQPALRCPADGAVAA >S1227 hypothetical bacteriophage protein MKQWREKSRQLAERGDLTPADWSNLELYCVNYSIYRKAVADLAARGFSIV NSQGGESRNPALSAKSDAERVMIKMASLLGFDPISRRKNPPETEEEDELD RLE >S3966 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4218 
IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3760 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S4477 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S3295 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKTQAAVGNLAHTTGQ >S4031 ISSfl3 orfC MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAKMAKAFDYILNHWNALNEFCRDG RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >S4475 ISEhe3a orf MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD >S2386 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMVMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0927 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKSTAYFAQESLKNTR >S2551 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4630 ISEhe3 orfB MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEG >S2440 IS911 orfA 
MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S1714 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0686 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0382 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI AWPHPKRTERPGIRI >S1461 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2199 IS1 orfB MAHVFGERTLATLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKR YTQRIERHNLNLRQHLARLVRKSLSFSKSVELHDKVIGHYLNIKHYQ >S0488 IS2 orfA MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE >S4167 IS1 orfB MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0275 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0235 IS911 orfA MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKRLR ALDVRLPEQFSIIGKLRAHYPVVTLCHVFGVHRSSYRYWKNRPEKPDGRR AVLRSQVLELHGISHGSAGARSIATMATRRGYQMGRWLAGRLMKELGLVS CQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYIWTGKRWAY LAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGGVMFHSDQG SHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKNEWMPVVGY VSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC >S1588 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIGQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD 
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S0316 putative phage integrase MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSK LGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITE SKIYSAMQKMTNRRHEENWRLRAEACRKKGKPVPEYTPKPASVATKATHL SFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPE PLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVA LNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWK AALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYA HLAPNHLTEHARQIDLILNPSVPNLSQSRNKEGTNDV >S3784 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3055 IS3 orfA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK >S4058 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4439 ISSfl3 orfC MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIGTTSS >S4236 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2104 ISEhe3 orfA MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD >S4168 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4480 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S3072 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRGTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1398 IS1 orfB 
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0687 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLACLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4835 putative integrase MNSSKAGGCHGINDIKVKTAKPKDKPYKLADGGGMYLLINTNGSKYWRMK YRFAGKEKMLSIGVYPDVTLADAREKRSEARKLLAAGGDPGEAKKEEKIA QQMSLKNTFEAIAREWHQLKADRWSLRYRDEIIDTFEKDIFPYIGKRPIA EIKPMKLLEALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPD LASALATPKKVHFPFLTANELPHFLNDLAGYTGSIITKTATQIIMLTGVR TQELRFARWEDIDFETKLWEIPAEVMKMKRPHIVPLSEQVIMLFKQLEPI SKHHPLVFIGRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILH EQGFNSAWIEMQLAHVDKNSIRGTYNHALYLDGRREMMQWYADYIDSLSS RES >S2200 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMVMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0472 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3585 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2787 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVIPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1952 IS2 orfB MIVLIPVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S3480 IS1 orfA MAFISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1890 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0958 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1397 IS1 
orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1136 hypothetical bacteriophage protein MRRAISYIRFSSERQLKGDSVRRQSKLVTDWLDKNPEFYLDSSLSFKDLG KSAFSGKHLKGGLGDFLTAIEKGLVKAGDTLLIESLDRLSRQDIDIASEL LRRILRAGVDVVTLSDGEHYTRESLKDPLALIKSILIMQRAHEESLRKSE RVQAAWNRKKELISEGIKVSRRCPAWLRLNDDRRTFTIIPDKVEVVKRAF DLRLQGLSFWAITRTLNDEGHLSLNQYTPKQKGWSDTAVKKLLRNRAVIG CFTPAGREEVQGYYPAIISESLFYRVQQLNTGQYGRASVSSNPLSVNLFR GIIKCEVYWQ >S0262 ISEhe3 orfB MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS SEQMSPTEYENQYYQRLGSV >S4554 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S3863 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S1123 IS629 orfB MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPEQLWVADFTYV STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT VHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAE VIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIG NDDLAA >S2179 ISSfl3 orfA MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >S0527 ISEhe3 orfB MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD 
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS SEQMSPTEYENQYYQRLGSV >S3184 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >S1473 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4014 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2482 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3708 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S2448 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S3609 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1959 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >S2438 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI 
WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S0220 IS2 orfA MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE >S3633 IS2 orfA MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S2949 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1797 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0723 putative helicase MTTPVWRNDDLEGAVIGAFFLRGADPEVMDILATLPADVFSVRAYQDIYT GICRQARVSGVIDPVLLCNEMPELAPVITDTGRKTWVKSSLEHYVAALRR NAALRDAEKTLNEALQKLRDAHTCEAAEDALKDAQNMMVTLSTGKGVIQP VHIDDVLPEVVERVECRNQGLEKSRTLMTGIDELDAKTGGMEPGDLVFIA ARPSMGKTELALDIIDKVTEQGHGVLLFTMEMANIQIGERMVSAAGGMPV SRLKSVAHFEDEDWTRFSQGVGRMTGRNIWMVDQANLAIDEICATTKHHL IKYPETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIAL SQLSRGVESRPNKRPMNSDLKNSGEIEADADIILMLYRDEVYNPDTQARG IAEINITKQRNGSLGTIYRRFYNGHFLPVDQESARVLSTPMKPGNPRRYS NKRTDSSKMERFF >S3801 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0705 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S0276 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0545 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1468 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV 
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA >S2173 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4314 IS4 orf MTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQ ELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGA SPGRIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSV A >S3288 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE >S2322 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0716 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1521 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKEKLHVISKRYTQRIERHNLNLRQHLARLGRMSLSFSKSV ELHDKVIGHYLNIKHYQ >S4440 ISSfl3 orfD MEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLRPLYIALNDYVLEA GKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGSSLPAAVWFAYSAD RKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEAGCLAHARRKIHDE DVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAVRKARSVQLMQSLY DWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDGWVEIDNNIGENAL RSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEVEPEDWLREVIEKL NDWPSNQVHKLLPWNFSSVK >S0379 ISSfl3 orfD MSSAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVELS RNTMVRWVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTK TGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADA YAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELYD IEAEIRGSPAEERLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFD YILNHWNALNEFCRDGRVEIDNNIGENALRSVAVGRKNYLFFGSDKGGES AAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >S0899 ISSfl3 orfA MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLSDPLTPELLRALIREMKGGIR >S4832 IS3 orfB 
MKYVFIENHRAEFSIKAMCRVLRVARSGWYVWLRRRHQMSLRQQFRLTCD AAVHKAFFEAKQRYGAPRLADELPEFNIKTIAASLRRQGLRAKASRKFSP VSYRAHGLPVLENLLEQDFSASGPKPEVGG >S0456 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP GTHQKIIDMAMNGVGCRASARIMDVGLNTVLRHLKNSGRSR >S4604 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1453 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2242 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4611 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2180 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILCEPPRVSWRVFYL >S1911 IS629 orfB MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYV STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT VHHSDKGSQYVSLALHTAA >S2123 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1423 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4553 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0498 IS150 orfB MKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRIGEI YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY RGEVGQTAPNVLQRDFKLRGQTRSGLPMLLNLQSMGASCICLQ >S4022 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4216 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD 
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S3169 putative superfamily I DNA helicase MDENALGFASYWRNSLADAESGKGSFKRKDAQNFTHWHGIAAGRLDEAIV SKFFEGEKDDVETVDVILRPKVYFRLLQHGKDRSAGAPDIVTPIVTPALL SREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFD DSVDKTAETDEEREARYAALQQEWRQYLYDSERLLKSVAGDWIEKPEQYE LAEHGYIVKTAQSGGASSHILSLYDHLLVCNKDVPLFNRFASREVHAAES LLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTG KTTLVLSIIATQWARAALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQG SGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA LLFYLEKAKAAFPGKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQ IRAARELIANDIEQYLDNLNKLLSGQEQKVTLLKSAKTEWKKYRAGESLI YSLFSWLPAVRNKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLN SAEREQTTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQA DELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQDEKKKKGAKGVTARW QRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAG QVLPEVAAASFALAKKALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGS TQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGELASSGS RYNLLEAETIAVWLAENQQNIEAHYGKSLHEVVGIVTPFSAQVSTIKQVL GKQDISTGTNEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNS MLNVAVSRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDY KERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITIISPWLTWQRLE QTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNFKAALEKL NALGIATKLVNRVHSKIVIGDDGLLCVGSFNWFSATREARYERYDTSMVY CGDNLKGEIEAIYNSLERRQV >S1658 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1554 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S1610 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1617 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP 
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0607 IS1 orfA MLAWLPFPSDVLPAPLLKAWCVTGKSTAGHQRYLCSHCRKTWQLQFTYTA SQPGTHQKIIDMAMNDVGCRASARIMGVGLNTVLRHLKNSGRSR >S2794 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKHHQMAA >S4061 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S1653 IS1 MAHVFGERTLATLERLLSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKR YTQRIERHNLNLRQHLARLVRKSLSFSKSVELHDKVIGHYLNIKHYQ >S3548 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S1992 hypothetical bacteriophage protein MIIQSKLIRAALVCAAKNDVRYYLNGLHITPKHIEATNGSVALRMAHGIR TKKNIIVQFEGGVPAKAETTELIFSKEPIAVHRDQFQRRLSITGIKLVDG CFPDLDRIIPKKFDRCTHPVLQAGYLSYPEKMFGRERKFIPVQLRPSGDG QAVRIQFDSIINSMYGNPEFVVMPCRDHGDFNVAQEHPE >S3967 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1059 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1857 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4230 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2788 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH 
HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4148 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAEHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2103 ISEhe3 orfB MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIVSPNRLQRQF NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV LNALLMAVWWRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS SEQMSPTEYENQYYQRLGSV >S0325 putative phage integrase MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSK LGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITE SKIYSAMQKMTNRRHEENWRLRAEACRKKGKPVPEYTPKPASVATKATHL SFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPE PLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVA LNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWK AALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYA HLAPNHLTEHARQIDSILNPSVPNSSQSKNKEGTNDV >S1110 IS600 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0996 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4471 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESCLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2134 ISSfl3 orfA MPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYSREFKVRLAK QALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQECMPVPVAL TDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGVVKLFDPLTP ELLRALIREMKGGIR >S2318 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 
>S1980 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S2983 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2020 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4098 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKPLSFSKSV ELHDKVIGHYLNIKHYQ >S2146 putative integrase for prophage CP-933U MTFGRLWEKFLASAYYSDLSPRTQKDYLQHQKKLLAVFGKVPADSIKPEH IRRYMDKRGEQSKTQANHEKSSMSRVYSWGYERGYVKANPCAGVSKFKAK NRERYVTDKEYQAVLSVAPLPVFIAMEIAYLCAARVSDVLSLKWEQIGND GIFIQQGKTGKKQIKAWSPRLQAAIEKAKQLPTSAYVISNQYGNRYMYKG FNEMWVEARNRAGKISGILTDFTFHDLKAKGISDYEGSSRDKQLFSGHKT EGQVLIYDRKVKVSPTLDVPLPENIPRKYSK >S0021 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP DTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0942 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA >S0372 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1910 IS629 orfB MAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLQERLGHIPP AEAEKAYYASIGNDDLAA >S3586 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2321 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1973 IS1 orfB 
MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSK SVELHDKVIGHYLNIKHYQ >S1960 IS629 orfB MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA EKAYYASIGNDDLAA >S3785 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1866 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0707 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFSVTEPNQVWSVCDLYL DG >S3610 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2385 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0948 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLATAERQRLKELERENRELRRSNDILRQASAYFAKA EFDRLWKK >S2185 hypothetical protein MYNDVLFPGEDINKPISIAAANRFVNRIRGGMDLGYWRTHDFRRTLVTRL SEMNVEPHVTERMLGHELGGIMSVYNKHDWIEAQRKAYELHADKLFWHIR SISD >S4479 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3125 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1240 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2859 putative CP4-57-type integrase MARQTKPLSVKEIESAKPKEADYVLYDGDGLELLIKSSGSKIWQFRYIRP VTKTRAKKSIGPYPSVTLADARNYRAESRSLLAKQIDPQEHQQEQLRSSL EAKTNTFQLVAERWWNVKKASVTEDYAEDIWRSLERDVFPAIGDVSVTDI KAHTLVQAVQPVQARGALETVRRLCQRINEVMIYAQNTGLIDAVPSVNIG 
KAFEKPQKKNMPSIRPDQLSQLMQTMRTASISLSTRCLFMWQLLTITRPA EAAEARWEEVDIEAQEWKIPAARMKMNRDHTVPLSDEAIAVLEMMKPLSG NREFIFPSRIKPNQPMNSQTVNASLKRAGFGGVLVSHGLRSIASTALNEQ GFPPDAIEAALAHVDKNEVRRAYNRSDYLEQRRPMMQWWANFVMAADRGS MIEGGIKGMKLVG >S1328 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARPVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2131 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4252 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4037 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1609 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0486 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGDDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >S0797 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIGQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLSKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S1431 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQNEPARKLLG >S4498 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV 
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0905 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S0526 ISEhe3 orfA MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD >S2110 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2781 DNA-invertase MLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGTKSERPGLKKLLK TLSAGDTLVVWKLDRLGRSMRHLVVLVEALRERGINFRSLTDSIDTSTPM GRFFFHVMGALAEMERELIVERTKAGLEAARAQGRIGGRRPKLSPE >S3395 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3189 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVECRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S1717 ISEhe3 orfA MSGKRYPEEFKTEAVKQVVDRGYSVASVATRLDITTHSLYAWIKKYGPDS STNKEQSDAQAEIRRLQKELKRVTDERDILKKAAAYFAKLSD >S1282 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0312 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1912 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >S0263 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S3344 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV 
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2107 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S3188 IS2 orfA MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0718 putative bacteriophage protein MRPSELSDLLWAQVDRVAPHLLPNGKIEGHEWVAGNVNGDKGNSLKVNLI GKKKWADFAEGDGGDMLDLWMACRGINLHQAMQEAKAFLGIKDDDHHFDA RREKKFSRPDRKKIARYVTRTESHLEYLQSRGISPEVVKRYEVVSGKVWN GERELDALVLPYKRDGELLQVKRISTERPDGKKVIMAEGDCEPCLFGWQA LDAGVRVVVLCEGEIDCMSYAQYGISALSVPFGGGKGAKQQWIEFEYHNL DRFEEIFISMDVDDVGREAAREIVSRLGEHRCRLVTLPYKDINECLMNGV TEDEIWQYIGTASYFDPEELYSAREFYQDTINAFYGKQQYLFNPPWESLA DKFQFREAELTLVNGVHGHGKACPLNEPILLADGTWTTHGNVKIGDQVAS VDGNPSTVTGIFPQGVRDVYRVTFEDGRYVDCAGDHLWEVTSRGFTKGEK RRVIDTFGLKRLSETKRHKNGVRIPEITGDFGDHSEPLAWVIGSLLGDGS LSNGSVKFSNVEPYMIERMKAELPDYNFSGDGKDWLISTARGQVNPLMET LRGYGLMGCTAKNKFIPRVFFSANKSTRIGMLCGLLETDGYVEKDGTLVF SSASEELRNEVVNKNWPPS >S3071 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2158 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLACLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S3705 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0484 IntA MAISDTKLRTIYGKPYSGPQEVADADGLSVRISPKGVIQFQYRYRWHGKP NRLGLGRYPSLSLKDARQITADLRKLYFSGTDPRTYFEEKVENSMTVAQC LDYWFDN >S0543 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2243 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV 
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0930 IS600 orfB MYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSD RGSQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHY RFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S2011 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLITAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >S0910 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA >S1659 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2697 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0317 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S1339 IS600 orfB MAHIRTCETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVAGIKDVYTCEIVGYA MGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFG LKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEI FYNRQRRHSRLGNISPATFREKYHQMAA >S3209 IS3 orfA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK >S1920 hypothetical protein MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSLLVRPDRPISPQAMAI HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM KLARRLWPGLKYSNMALYKTRKLNVQTPLGLHHHRALYDCYITAALLIDI MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERYPGYLRWLFN NLDSMSPELRLTLKHYLENT >S3132 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2300 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0201 IS4 orf 
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM SDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S2182 IS629 orfB MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL KKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSGV KRSVRPSAGKPLPQATA >S1628 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4313 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVRAKEKSAIC >S1236 putative phosphohydrolase MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD IDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK GVI >S4144 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3523 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4639 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1998 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S1426 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S4143 IS1 orfB MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV 
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0997 IS2 orfB MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S2140 putative crossover junction endodeoxyribonuclease MTERIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKL NLSGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDEINI VRGQLVPGGRLGIKITELGCA >S1685 IS600 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S2985 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4483 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S0909 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S3941 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S3481 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4332 IS2 orfA MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S2986 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT 
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0735 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0685 putative phage transposase MLQQGKFKTSQGCFEIARPTLEAHDYDREALWSKWDKASDSQRSLAEKWL PSIQATDEMLNQGISTKTAFATVAGHYQVSASTLRDKYYQVQKFAKPDWA AALVDGRGASRRNVHKSEFDEDAWQFLIADYLRPEKPAFRKCYERLELAA REHGWSIPSRATAFRRIQQLDEAMVVACREGEHALMHLIPAQQRTVEHLD AMQWINGDGYLHNVFVRWFNGDVIRPKTWFWQDVKTRKILGWRCDVSENI DSIRLSFMDVVTRYSIPEDFHITIDNTRGAANKWLTGGAPNRYRFKVKED DPKGLFLLMGAKMHWTSVVAGKGWGQAKPVERAFGVGGLEEYVDKHPALA GAYTGPNPQAKPDNYGDHAVDAELFLKTFAEGVAMFNARTGRETEMCGGK LSFDDVFEREYARTIVRKPTEEQKRMLLLPAEAVNVSRKGEFTLKVGGSL KGAKNVYYNMALMNAGVKKVVVRFDPQQLHSTVY >S4177 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1705 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREV LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S4839 putative P4-type integrase MALSDVKVRSAKPEAKAYKLTDGEGMVLLVHPNGSKYWRLRYRFGGKEKM LALGKYPEVSLADARARRDEARKLLANGVDPSENKKAVKVEQEQEAITFE VVARDWHASNQKWSASHSARVLKSLEDNLFTAIGKRNIAELKTRDLLVPI KAVESSGRLEVAARLQQRTTAIMRFAVQSGLIDYNPAQEIAGAVATAKRQ HRAALELNRIPELLHRIDHYSGRPLTRLAVELTLLVFIRSSELRFARWSE IDFETAMWTIPGEREQLEGVKHSQRGSKMRTPHLVPLSRQALSILEKIKS MSGNRELIFVGDHDPRKPMSENTVNKALRVMGYDTKVEVCGHGFRTMACS SLIESGLWSRDAVERQMSHQERSSVRAAYIHKAEHLGERRLMFEVVNKNW PPS >S4631 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0956 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0134 IS1 orfA 
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2698 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4064 IS3 orfA MTKPVSISKKPRKQHTPEFRNEALKLAERIGVAAAARELSLYESQLYAWR SKQQQQMSSSERESELAAENVRLKRQLAEQAEELSILQKAATYFAKRLK >S0323 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S4476 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S2951 putative phage transposase MQSYVTVNDLLGVPGMPATTKGIRQALQRFSRDLGDVSRRREGTKAIEYH IDCLPEITRKALRERYVEQLVATENNVSEVKAVTRKTRNPDAVQAIEAYR GSPQLMEERLNALTENQRWVSEARAALVVEVLKLESAGNPGRLKAINFLV EKARKGELPERLQQAAVNANAKRGANRTISRDPLYQWVLKYNQSQNAAER LLLLAPGKRDEIKPEEISWLPEFLAQYRQVNGRPMSEAYEDFVAEWQRRH ADEPYMLEVMPSYDVVRYAMKKLPEVVKQKGRVTGSEYRQLEGFTRRDWT AMPVNYVWIGDGHGMKLKCAHPIHGRPFSPEVTFVIDGGTRFVVGWSLDL AENVFAVAGAIQHGIRNHGKPFLYYSDNGSGETADMLDKEVVGILPRLGI KHPTGIAGNPQGRGIIERLNRTLPMRIARRYRTYFGKGADRESLRVLNRD LRSAFNALQQDKPLNDRQKAAMRELPSWAELIEAIREGVEWYNNRPHSEL PMKPNGRHYSPTEFRKKRQAEEDTEIEWLSDLELRDMFRPMVERPVRRCE IQWLNNIYYAPELRDEHGRKVLISYDIHDAERITVRRKDGSFICEAIWNG NKRAAFAVSAEYHKQQQRIKGMRKRAEEKIRDAEDEGIQILEHKQAEPWL SNVYRPVGNVVAVQQPEYEEEHDEEFERDFRLGMQKLFAMQEEDDPLA >S2950 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMTA >S0318 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV 
YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0380 ISSfl3 orfC MKVLAPGNGKTKTGRLWVYVRDDRNAGSSLPAAVWXAYSXDRXXXXPQXH LAXXXGRKPDRQ >S4472 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHLRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2317 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4331 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0697 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S1418 ISSfl4 orf MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIAALH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPALERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTASP VRSPGLTTPAK >S3056 IS3 orfB MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWPGDITYLRTDEGWLYLAV VIDLWSRAVIGWSMSPRMTAQLPCDALQMALWRRKRPRNVIVHTDRGGQY CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >S4029 ISSfl3 orfA MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS 
REFKVRLAKQALQPGAVVARIAREHDINNNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >S4036 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1503 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHYSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S1774 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0736 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0901 ISSfl3 orfC,D MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >S1281 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S3874 IS1 orfB MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARPVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1865 IS1 orfA 
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2338 IS629 orfB MARCTVARLMAVMGLAGVLRGKKVRTTIGRKAVAAGDRVNRQFVAERPDQ LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKSEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA EKAYYASIGNDDLAA >S0950 hypothetical bacteriophage protein MIWQPEFTDKTLSRKPGAVQLVTCKQNEVEPEDWLREVIEKLNDWPSNQV HELLPWNFSSVK >S1329 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4046 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1502 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S3549 ISSfl3 orfC MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFLSVK >S0283 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0953 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQHIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2510 putative regulator MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSGVCWLTPEEITARC DEFTPDSLKALALWMKRNAKNEAVETETAE >S1522 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP 
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S2447 IS2 orfA MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S1251 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFVLDCCDREA LHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S3524 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1499 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1900 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0952 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYIASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0957 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S1254 IS3 orfB MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFEN >S0900 ISSfl3 orfB MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >S2862 IS3 orfB MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR KFSPVSYRAHGLPVSENLLEQDFYCQWPEPEVARRHHVLTYR >S0721 IS911 orfA MKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLSTMTRWVKQLRDER QGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKATALLMSDSLNSSR >S0875 IS1 orfA 
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1543 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S3800 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4605 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1454 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4021 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S3884 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSSVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S3268 hypothetical protein MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN QSQTFGVAWDAPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR LETDVYVPLA >S3004 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG EEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKRPVRLLN >S1676 IS600 orfA 
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4579 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP DTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1978 IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S2152 putative integrase MRHAVHQELIDTNPAANLGGVTTPPVRRHYPALPLERLPELLERIGAYHQ GRELTRHAVLLMLHVFIRSSELRFARWSEIDFTNRVWTIPATREPIIGVR YSGRGAKMRMPHIVPLSEQSIAILKQIKDITGNNELIFPGDHNPYKPMCE NTVNKALRVMGYDTKKDICGHGFRAMACSALMESGLWAKDAVERQMSHQE RNTVRMAYIHKAEHLEARKAMMQWWSDYLEACRESYAPPYTIGKNKFIP >S2689 putative DNA replication factor MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL >S1124 IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >S1616 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLKLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2861 IS3 orfA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK >S0941 putative integrase encoded by prophage CP-933K; partial MSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDA FREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLR LAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTTLHVDA LGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSF EGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREW DKIEIK >S3883 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S4253 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV 
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S3519 IS1 orfB MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S2721 putative DNA helicase MALDLMAAFTELPPPIDYVLPNMVAGTVGALVSPGGAGKSMLALQLAAQI AGGPDLLEIGEFPTGQVVYLPAEDPPAAIHHRLHALGAHLSAAERQAVAD GLLIEPLIGKCPNIMAASWFDALKRAAEGRRLMILDTLRRFHIEEENASG PMAQVVGHMEAIAADTGCSIVFLHHASKSAAMMGSGDQQQASRGSSVLVD NIRWQSYLSGMTQGEAEILGVDDCQRGYFVRFGVSKANYGAPFQELWFRR HDGGVLKPAVLERQCKVKRRQREEA >S1798 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S3775 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP DTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3180 reverse transcriptase-like protein MLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYC RYADDFVLIVKGTKAQAEAIREECRGVLEGSLKLRLNMDKTKITHVNDGF IFLGHRIIRKRSRYGEMRVVSTIPQEKARNFAASLTALLSGNYSESKVDM AEQLNRKLKGWAMFYQFVDFKAKVFSYIDRVVFWKLAHWLARKYRTGIAS LMRWWCKSPKPGQSKTWVLFGKTNHGKLSGEILYRLVGQGKKLFRWRLPE GNPYLRTETRNTYTSRFTEVAMAFASI >S0282 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0931 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S0874 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S1130 IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S2650 IS1N orfB MAFICELDEQWSYVGSKARQHWLGYAYNTKTGGVLAYTFGPRTDQTCREL 
LALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQRIERNNLTLRTRI KRLGRKTICFSRSVEIHEKVIGAFIEKHMFY >S1005 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S0324 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S4248 IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNR >S3401 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S2181 IS629 orfA MAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLMWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >S4567 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S4247 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0506 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S4437 ISSfl3 orfA MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >S3704 IS1 orfB MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0834 IS600 orfB MAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRA TTNPNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATREGWLYLAGIKDV YTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYD YRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAI SVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >S0719 IS911 orfB 
MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S4231 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S3214 IS1 orfA MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR >S1483 IS103 orf MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ VRFLETRLMYLKKLKALAHPTKK >S4049 IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMKNELLKEAVEYGRAKKWIAHAPLLPGDGE >S1961 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0250 IS629 orfB MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKAKVIHRKSWKNRAEVELATLTWVDWYNNRRLPERLGHIPPAEA EKAYYASIGNDDLAA >S1944 ISSfl4 orf MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRILGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS >S1127 IS911 orfB MATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQ FAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRL TMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRG NCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHE YNGGLPPNESENRYWKNSNSVASFC >S1602 putative excinuclease subunit MVRRLTSPRLEFEAAAIYEYPEHLHSFLNDLPTRPGVYLFHGESDTMPLY 
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR AALQALQSIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS >S0020 IS1 orfB MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0405 IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S2184 ISSfl3 orfC MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKTHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >S4060 IS2 orfA MIVLILVFRLVIGVQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0079 IS1 orfB MSCQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >S0327 IS629 orfB MAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYV STWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGT VHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAE 
VIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIG NDDLAA >S1253 IS3 orfA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWR SKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK >S0497 IS103, IS103 orf MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ VRFLETRLMYLKKLKALAHPTKK >S2427 ada, O6-methylguanine-DNA methyltransferase; transcription activator/repressor MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE NVSFYANASEALAAGFRPCKRCQPDKANPRQHRLDKITHACRLLEQETPV TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV TTSILNAGFPDSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP KAVRAVASACAANKLAIVIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE NEER >S2258 alkA, 3-methyl-adenine DNA glycosylase II MYILNWQPPYDWSWMLGFLAARAVSGVETVADSYYARSLAVGEYRGVVTA IPDIARHTLHINLSAGLEPVAAECLAKMSRLLDLQCNPQIVNGALGKLGA ARPGLRLPGSVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFLGMTP AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA >S2426 alkB, alkylated DNA repair protein MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV APGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHDLC QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT TDCRYNLTFRQAGKKE >S1671 b4285, IS911 orfB MKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYI WTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPGG VMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKN EWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKN SNSVASFC >S4357 dam, DNA adenine methylase MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH FAEKAQNAFFYCESYADSMERADDASVVYCDPPYAPLSATANFTAYHTNS FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS ISSNGGTRKKVDELLALYKPGVVSPAKK >S1447 dbpA, 
ATP-dependent RNA helicase MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQR DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT KKDCQAVCDALNEVEQSALSLHGDLEQRDQTLVRFANGSARVLVATDVAA RGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQR ANIISDMLQIKLNWQTPPANSSIVTLEAEMATLCIDGGKKAKMRPGDVLG ALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCR VRLLK >S2100 dcm, DNA cytosine methylase MQENISVTDSYSTGNAAQAMLEKLLQIYDVKMLVAQLNGVGENHWSAAIL KRALANDSAWHRLSEKEFAHLQTLLPKPPEHHPHYAFRFIDLFAGIGGIR RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHQE GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDKGKTFRIIMQTLDELGY DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL EPKIKQAVALRQQEAQHGRRSR >S3420 deaD, inducible ATP-independent RNA helicase MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFGGERR EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA >S0790 dinG, probable ATP-dependent helicase MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY 
YYRECPFFVTRREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG HHLPDVARDALEMSAEITVPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDELLEICQ RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEVQS KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI VIPRMRFEPSIDNEEQHIAEMAAFFREQVESKKYLGMLVLFASGRAMQRF LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG IVKKKEKTKSPRRRRR >S0300 dinP, damage-inducible protein P MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAK IASDMNKPNGQFVITPAEVPAFLQTLPLEKIPGVGKVSAAKLEAMGLRTC GDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG L >S4009 dnaA, replication initiation protein DnaA MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW VRDKYLNNINGLLTSFCGADAPQLRFEVGTKPVTQTPQAAVTSNVAAPAQ VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA RAAACQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE SHDIKEDFSNLIRTLSS >S3577 dnaB, replicative DNA helicase; part of primosome MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV GGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQ GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR LTFNGQWSRFDNYAGPQYDDE >S4662 
dnaC, chromosome replication protein DnaC MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY >S0177 dnaE, DNA polymerase III, alpha subunit MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLALL ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL EERLAFLFPDEEERLKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN FIDRKHGREEISYPDVQWQHESLKSVLEPTYGIILYQEQVMQIAQVLSGY TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR LRQSLEPHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI GSEQVELEFD >S3312 dnaG, DNA biosynthesis; DNA primase MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS 
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR QEELIARERTHGLSNEERLELWTLNQELAKK >S4008 dnaN, DNA polymerase III, beta-subunit MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV IELMRMLDGGDNLLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE DAASQSAAYVVMPMRL >S0209 dnaQ, DNA polymerase III, epsilon subunit MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA TIQRIVRQASKLRVVFATDEELAAHEARLDLVQKKGGSCLWRA >S0422 dnaX, DNA polymerase III, tau and gamma subunits; DNA elongation factor III MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL SLVEAMVEGNGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTLVPPQSAPQQAPT VPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERLAS VTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKALE HEKTPELAVKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDNAV CLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPLEW RQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI >S3140 endA, DNA-specific endonuclease I MYRYLSIAAVVLSAAFSGPTLAEGINSFSQAKAAAVKVHADAPGTFYCGC KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHIVPAWQFGHQRQCWQD GGRKNCAKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ CAMKVDFKEKAAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNK MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS >S3007 exo, 5-3 exonuclease MRSLFLFSQPAIACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV 
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKADR PPMPEELHDEMPALRAAFEQRGVPCWSASGNEADDLAATLAVKVTQAGHQ ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR >S4467 fimB, recombinase; regulator for fimA MRNKADNKKRNFLTHSEIESLLKAANTGPHATRNYCLILLCFIHGFRASE ICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEVQALKNWLSIRTSYP HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRYSCGFA LANMGIDTRLI >S4466 fimE, recombinase; regulator for fimA MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTD AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV >S3516 fis, site-specific DNA inversion stimulation factor; DNA-binding protein MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN >S2444 gyrA, DNA gyrase, subunit A, type II topoisomerase MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT ANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAE GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT IDGSAAEGDDEIAPEVDVDDEPEEE >S4006 gyrB, DNA gyrase subunit B, type II topoisomerase MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA 
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE QQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKI LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA NIDI >S1030 helD, DNA helicase IV MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGDRCDLDT TYRFNSRIGEVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ QADYVIIVGLQEGSGGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL TRARHRVWALFNKENPSPFVEILKNLDVPVARKP >S0056 hepA, probable ATP-dependent RNA helicase MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV 
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS HRSQKVLVICAKAATALQLEQVLREREGILAAVFHEGMSIIERDRAAAWF AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL INYLASPDQTEGFDDLIKSCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL DQAGWRLDALRLIVVTHQ >S1636 himA, integration host factor (IHF), alpha subunit MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE >S0972 himD, integration host factor (IHF), beta subunit MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG >S0663 holA, DNA polymerase III, delta subunit MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW VAVRAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQPQLRQ AVQLLTRTELTLKQDYGQSVWAELEGLSLLLCHKPLADVFIDG >S1183 holB, DNA polymerase III, delta prime subunit MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT LLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQL MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL >S4491 holC, DNA polymerase III, chi subunit MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY RLDEALWARPAESFVPHNLAGEGPRGSAPVEIAWPQKRSSSPRDILISLR TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK >S4675 holD, DNA polymerase III, psi subunit MTSRRDWQLQQLGITQWSLRRPGALQGEIAIASPAHVRLVMVANDLPALT DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG 
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND >S1472 hrpA, helicase MLRDRLRFSRRLHCVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT EKVTVYGLPIVAARKVNYSQIDSALCRELFIRHALVEGDWQTRHAFFREN LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRISHDVISARHFDSWWKK VSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGA DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI WSFGQLPESYEQKRGNYKVKAWPALVDERDSVTIKLFDNPLEQKQAMWNG LRRLLLLNISSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDQ LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR LEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIE ELRVSYFAQQLGTPYPISDKRILQAMEQISG >S0143 hrpB, helicase MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKST WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR AQNCVGLNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER RYLPLPAHQRFDEAVAVATAEMVRQESGSLLLFLPGVGEIQRVQEQLASR IGSDVVLCPLYGVLSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR LVVDCAQERVARFDPRTGLTRLITQRVSQASMTQRAGRAGRLEPGISLHL IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPTVN LLAAKRLLQMLGALEGERLSAQGQKMAALGNDPRLAAMLVNAKSDDEAAT AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS SLIAPLLAGAFADRIAHRRGQDGRYQLANSMGAMLDADDALSRHEWLIAP 
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL RLLCAAKWLPEYDWPAVDDESLLATLETWLLPHMTGVHSLRGLKSLDIYQ ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF GEATNPTIAQGRVPLVLKLLSPAQRPLQITRDLGAFWKGAYREVQKEMKG RYPKHVWPDDPANTAPTRRTKKYS >S3663 hupA, DNA-binding protein HU-alpha (HU-2) MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK >S0391 hupB, DNA-binding protein HU-beta, NS1 (HU-1) MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN >S0264 is600a, IS600 orfA MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >S2175 is629a, IS629 orfA MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >S2176 is629b, IS629 orfB MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ LWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA EKAYYASIGNDDLAA >S2263 issfl4, ISSfl4 orf MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS >S2612 lig, DNA ligase MESIEQQLTELRTTLCHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD RLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG VPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLEQQEQL GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD 
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKLAQNVVNAL EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAG KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA KAQELGIEVIDEAEMLRLLGS >S1198 mfd, transcription-repair coupling factor MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ NALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQDIISSRLSTLYQLPT MQRGILIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGY RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTL ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN ANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDKARNLALICESDLL GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE TTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVA EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA >S3039 mutH, methyl-directed mismatch repair protein MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALDGLVTPENLKRDKGW IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVARL TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR GERILTLPRGFYLKKNFTSALLARHFLIQ >S4593 mutL, enzyme in methyl-directed mismatch repair MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA SISSVSRLTLTSRTAEQQEAWQAYAEGRDMDVTVKPAAHPVGTTLEVLDL FYNTPARRKFLRTEKTEFSHIDEIIRRIALARFDVTINLSHNGKIVRQYR 
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRA IPENRVAAGRNHFAEPAAREPVAPRYSPAPASGSRPAAPWPNAQPGYQKQ QGEVYRQLLQTPAPMQKPKAPEPQEPALAANSQSFGRVLTIVHSDCALLE RDGNISLLSLPVAERWLRQAQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS VFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLL QSVDLHPAIKALKDE >S4094 mutM, formamidopyrimidine DNA glycosylase MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP IVATKHAQRATFYCRQCQK >S2944 mutS, methyl-directed mismatch repair protein MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQL LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI SSGRFRLSEPADRETMAAELQRSNPAELLYAEDFAEMSLIEGRRGLRRRP LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK SLV >S0098 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGVPWGKEGQ PGEWMSLVGLNADDFPPANEPVIAKLKRL >S3161 mutY, adenine glycosylase MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP YFERFMAHFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH 
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS LCPLQNGCIAATNNSWSLYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP PSGLWGGLYCFPQFADEESLRHWLAQRQIAADNLTQLTAFRHTFSHFHLD IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV >S0596 nei, endonuclease VIII and DNA N-glycosylase with an AP lyase activity MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVET RGKALLTHFPNGLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT ILLYSASDIEMLRPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH ALLEIPRFSYATRGQVDENKHHGALFRFKVFHRDGELCERCGGIIEKTTL SSRPFYWCPGCQH >S3665 nfi, endonuclease V (deoxyinosine 3 endonuclease) MDLASLRAQQIELASSVICEDRLDKDPPDLIAGADVGFEQGGEVTRAAMV LLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPDLV FVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSKPGALA PLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRLPE PTRWADAVASERPAFVRYTANQP >S2373 nfo, endonuclease IV MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF ADFARIVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQN DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA >S1790 nth, endonuclease III MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDISVNKA TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE DLCEYKEKVDI >S1941 ntpA, dATP pyrophosphohydrolase MSEGSQRRGSVKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVT GSVEEGETAPQAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHR YAPGVTRNTESWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQ AIEQFVINAA >S1435 ogt, O-6-alkylguanine-DNA/cysteine-proteinmethyltrans ferase MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG YERISATNPGGLSDKLREYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT GYAGGVQRKEWLLRHEGYLLL >S2707 parA, resolvase MRRTKPVAAPMVARVYLRVSTDAQDLERQEAITTAAKAAGYYVAGIYREK 
ASGARADRPELLRMIGDLQPGEVVIAEKIDRISRLPLPEAERLVASIQAK GARLAVPGVVDLSDLAAEAQGVAKIVLEAVQIMLFRLALQMARDDYEDRR ERQRQGIELARQAGRYKGRRADPKRRAQVVALRKSGYSINKTAELAGYSA AQVKRIWAEVSQAEAKQHGAFVEDALTEADALAAVGQDERQEERA >S3267 parC, DNA topoisomerase IV subunit A MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS EE >S3275 parE, DNA topoisomerase IV subunit B MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK LADCTAQGLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV >S0602 phrB, deoxyribodipyrimidine photolyase (photoreactivation) MTTHLAWFRQDLRLHDNLALAAACRNSSARVLALYIATPRKWETHNMSPR QAELINAQLNGLQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHL FYNYQYEVNERARDVDVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKV FTPFKNAWLKRLREGMPECVAAPKVRSSGSIKPAPSITLNYPRQSFDTAH 
FPVEEKAAIAQLRQFCENGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR QCLHRLLAEQPQALDGGAGSVWLSELIWREFYRHLMTYYPSLCKHRPFIA WTDRVQWQSNPAHLQAWQKGKTGYPIIDAAMRQLNSTGWMHNRLRMITAS FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF NPITQGEKFDREGEFIRRWLPELRDVPGKAVHEPWKWAQKAGVKLDYPQP IVEHKEARVQTLAAYEAARKGK >S3813 polA, DNA polymerase I, 3--> 5 polymerase, 5--> 3 and 3--> 5 exonuclease MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEVGKWLQAK GAKTAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISHER ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE YMERTRAQAKELGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI HQLMENCTRLDVPLLVEVGSGENWDQAH >S0057 polB, DNA polymerase II MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIHADQV PRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG VTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPL KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVAS RPLLLEKLNAWFANHDPDVIIGWNVVQFDLRMLQKHAERYRLPLRLGRDN SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS TEGFLDAWFSREKHCLPEIVTNIWHGRDKAKRQGNKPLSQALKIIMNAFY GVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDIIYGDTDSTF VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ 
ELYLRIFRNEPYQEYVRQTIDKLMAGELDARLVYRKRLRRPLSEYQRNVP PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >S3734 priA, primosomal protein N (factor Y), putative helicase MPVAHVALPVPLPRTFDYLLPEGMTVKAGCRVRVPFGKQQERIGIVVSVS DASELPLTELKAVVEVLDGEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL PILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKI WRDQVATLEFNDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLN TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ AGLAPALITRMRQHLQANNQVILFLNRRGFAPALLCHDCGWIAECPRCDH YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP LFPDVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP LFLQQLRNLILSSPLADDKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR LQHIISGTLALINTIPDSRKVKWVLDVDPIEG >S4626 priB, primosomal replication protein N MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI DSGD >S0419 priC, primosomal replication protein N MKTALLLEKLEGQLATLRQRCAPVSQFATLSARFDRHLFQTRATTLQSCL DEAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREASAWSLREWDS APPKISRWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVK AYEARLARCRHALEKIENRLARLTR >S4091 radC, DNA repair protein MKNNAQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEMLE NFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREESP LLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHSRLFSGTLNHVEV HPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFMDL RVLDHIVIGRGEYVSFAERGWI >S2913 recA, DNA-dependent ATPase, DNA-and ATP-dependent coprotease MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG 
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN EDF >S3028 recB, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF PRPLIVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERL LEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY LQGEPPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS GIDRRKFNRSNQAKWIEKISAWAEEETNSYQLPESLEKFSQRFLEDRTKA GGETPRHPLFEAIDQLLAEPLSIRDLLITRALAEIRETVAREKRRRGELG FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA PGMVNSVNKLFSQTDDTFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK MWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVR ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ IWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEVERLAEDLR LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS PGTFLHSLFEDLDFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA PLNETGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC PPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA AMQAHRYDLQYQLYTLALHRYLRHRIADYDYDLHFGGVIYLFLRGVDKEH PQQGIYTTRPNAGLIALMDEMFAGMTLEEA >S3030 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE TGHLVEGLGEAQAWQAPLWKALVEYTHELGQPRWHRANLYQRFIETLESA TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNSLL ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEEA PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES 
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFL PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ MRLQVNFHTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPE TMETIVEQSQRFLLPLFRFNQS >S3027 recD, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH VCLPLSRLENNEASNPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM ILCGDRLYLNRMWCNERTVARFFNEVNHTIEVDEALLAQTLDKLFPVSDE INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ LASVEAGAVLGDICAYANAGFTAERARQLSRLTGTHVPAGTGTEAASLRD SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG VAGLNERIEQFMQQKRKIHRHPHSRWYEGRPVMIARNDSALGLFNGDIGI ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL AALFSSRE >S4007 recF, Rec protein MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA ADMDDTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV EKGKITD >S4077 recG, DNA helicase MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFS AAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT 
LHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP LSANDTLKNKLLAALPFKPTGAQARVVAEIEHDMALDVPMMRLVQGDVGS GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFAPLGIEVGW LAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE LPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWVCTLIEESELLE AQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVAT TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA >S3077 recJ, ssDNA exonuclease MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR SLGIPVIVTDHHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQVEALTLCEKLERS RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFELFQQRFG ELVTEWLAPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV QLAYKLDINEFRGNRSLQIIIDNIWPI >S2853 recN, DNA repair protein RecN MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV TKHHQQALETARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKYD EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH QHYFVSKETDGAMTETHMQSLDKKARLQELARLLGGSEVTRNTLANAKEL LAA >S2800 recO, RecO protein MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL EYETRFSELFFDYLHCIQSLAGDTGTPEPALRRFELALLGHLGYGVNFTH CAGSGEPVDGTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE 
>S3855 recQ, ATP-dependent DNA helicase MNVAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMP TGGGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNST QTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEA HCISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGL NDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAK VEDTAARLQSRGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGM GINKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWL RRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCG NCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIR DYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAA RPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIAD ESNVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMAL IRAHVDGDDEE >S0424 recR, recombination protein RecR MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR F >S1670 relB, negative regulator of translation MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTL LSDEDAELVEIVKERLRNPKPVRVTLDEL >S3908 rep, rep helicase MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI AAVTFTNKAAREMKERVGQTLGRKEAHGLMISTFHTLGLDIIKREYAALG MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPS QAAASAIGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR GNYQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSTKAAEM RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE ERMQKGQSHLANLKAMMAAKRGK >S3906 rhlB, putative ATP-dependent RNA helicase MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK 
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT RPRTGNGPRRTGAPRNRRRSG >S0788 rhlE, putative ATP-dependent RNA helicase MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV FTRTKHGANHLAEQLNKDGIRSVAIHGNKSQGARTRALADFKSGDIRVLV ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGIRLM NKRKTA >S0208 rnhA, RNase HI MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG YQVEV >S0176 rnhB, RNAse HII MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS EKRRLALYEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL DIVFPQYGFAQHKGYPTAFHLEKLAEYGATEHHRRSFGPVKRALGLAS >S1811 rnt, RNase T MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR WKRLGGWPLPAAEEV >S1665 rus, endodeoxyribonuclease RUS (Holliday junction resolvase) MLIDLVLPYPPTVNTYWRRRGSTYFISEEGKRYRRAVALIVRQQRLKLSL SGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVR GQPVSGGRLGVKIYPIMH >S1937 ruvA, Holliday junction helicase subunit B MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDTSSETLIREALR AAL >S1936 ruvB, Holliday junction helicase subunit A MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM 
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM GLEMSDDGAQEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP >S1939 ruvC, Holliday junction nuclease MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF EYAARQVKQTVVGMGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH CHVSQNAMQMSESRLNLTRGRLR >S2193 sbcB, exonuclease I, 3--> 5 specific; deoxyribophosphodiesterase MTDTDKQPTFLFHDYETFGTHPALDRPAQFAAIRTDDEFNVIGEPEVFYC KPADDYLPRPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCILG YNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPEG INWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTRQ PRLFDYLFTHRNKHKLMALIDVPQMKPLAHVSGMFGAWRGNTSWVAPLAW HPENRNAVIMADLAGDISPLLELDIDTLRERLYTAKADLGDNAAVPVKLV HINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAIF AEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVDK RIEKLLFNYRARNFPGTLDYAEQQRWLEHRHQVFTPEFLQGYADELQMLV QQYADDKEKVALLKALWQYAEEIV >S0342 sbcC, ATP-dependent dsDNA exonuclease MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL EKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLT RQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA EHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA AITLTLTADEVATALAQHAEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ NVTLEQTQRNVALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALR GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQP WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL AGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ QLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG 
LTLDNLVHLANQQLTRLHVRYLLQRKASEALEVEVVDTWQADAVRDTRTL SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK >S0343 sbcD, ATP-dependent dsDNA exonuclease MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVF DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIISSQAGLNGIEKQ QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGSSKSDAVRDIY IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA SQQRETLSELSVEEVFNRRLALEDLDESQQQRLQHLFTTTLRTLAGEHEA >S0617 seqA, negative modulator of initiation of replication MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR VASPAIVEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI >S3542 smf, hypothetical protein MVDIEIWLRLMSISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQ RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLQATTDYPGALFVEG ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATCGVTITSGLARGIDG VVHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLFEQGGALVSEFPLD VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS PDQQDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW IAAVPGGYVRLRRACHVRRTNVFV >S2811 srmB, ATP-dependent RNA helicase MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE >S3584 ssb, ssDNA-binding protein MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG GAQSRPQQSAPAAPSNEPPMDFDDDIPF >S4186 tag, 
3-methyl-adenine DNA glycosylase I MERCGWVSQGPLYIAYHDNEWGVPETDSKKLFEMICFEGQQAGLSWITVL KKRENYRAYFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR AYLQMEQNGEPFPDFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALK KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYLGNKP >S1361 topA, DNA topoisomerase type I, omega protein MGKALVIVESPAKAKTINKYLGCDYVVKSSVGHIRDLPTSGSAAKKSADS TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL PAVDKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI ISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSL DQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG WSAFYVDGKWVEGKK >S1575 topB, DNA topoisomerase III MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ PDAYDSRYARWNLADLPIVPEKWQLQPRPSVTKQLNVIKRFLHEASEIVH AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLR SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA ISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA KVYNLIARQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM TGIARFVQDKDLKKVLRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHS TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEAGSG AIA >S1261 umuC, mutagenesis and repair protein MFALCDVNAFYASCETVFRPDLWGKPVVVLSNNDGCVIARNAEAKALGVK 
MGDPWFKQKDLFRRCGVVCFSSNYELYADMSNRVMSTLEELSPRVEIYSI DEAFCDLTGVRNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLAN HAAKKWQRQTGGVVDLSNLERQRKLMSALPVDDVWGIGRRISKKLDAMGI KTVLDLADTDIRFIRKHFNVMLERTVRELRGEPCLQLEEFAPTKQEIICS RSFGERITDYTSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFALN EPYYGNSASVKLLTPTQDSRDIINAATRSLDAIWQAGHRYQKAGVMLGDF FSQGVAQLNLFDDNAPRPGSEQLMAVMDTLNAKEGRGTLYFAGQGIQQQW QMKRAMLSPRYTTRSSDLLRVK >S2815 ung, uracil-DNA-glycosylase MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTI PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC NHFVLANQWLEQRGETPIDWMPVLPAECE >S3583 uvrA, excision nuclease subunit A MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT ITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >S0770 uvrB, excision nuclease subunit B MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVIVVASVSAIYGLGD PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMDEL 
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA ILYGDRSTPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG QNIAKTKAKGRGKSRPIVEPDNVPMDMLPKALQQKIHELEGLMMQHAQNL EFEEAAQIRDQLHQLRELFIAAS >S2052 uvrC, excinuclease ABC, subunit C MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE FEEAACIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH >S3865 uvrD, DNA-dependent ATPase I and helicase II MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD QSIYGWRGAQVENIQRFLNDFPGAEIIRLEQNYRSTSNILSAANALIENN NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW QDAVQLMTMHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV AFQGQGIKWLVAAYARLESV >S2099 vsr, DNA mismatch endonuclease, patch repair protein MVDVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRP DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD 
ISRLQELGWRVLIVWECALRRREKLTDAALTERLEEWICGEGASAQIDTQ GIHLLA >S4101 waaP, lipopolysaccharide core biosynthesis protein MVWMVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSY FLKWHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAF GEKGMNPLTRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVA TMVRDMHAAGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRV PGRWRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFAAPLKDILKQEQGLLS QAEAKATKIRERTIRKSL >S4103 waaY, putative LPS biosynthesis protein MIYNKTINGLKVFIKDNDPFYEQVLNDFLTCRVKTLKVFRSIDDTKVILI DTARGPLVLKVYAPKHKMTERFLKSCIKKDYYENLIYQTDRVRGEGIQSI NDYFLLAERKTLNFAHYYIMLIEYIEGVGLNEYLEISEDLKDQLSESIKE LHQHGMVSGDPHKGNFIVSEKGLRLIDLSGKKTTAVLKAKDRIDLERHYN IKNELKDFGYTYLIFKKKIKKVIRDVKVKLGLKSK >S2237 wcaH, GDP-mannose mannosyl hydrolase MFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPGG RVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFTT HYVVLGFRFRVAEEELLLPDEQHDDYRWLTPDALLASNDVHANSRAYFLA EKRAGVPGL >S3867 xerC, site-specific recombinase MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ CDAAMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK >S3079 xerD, site-specific recombinase MKQELARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA >S2727 xseA, exonuclease VII, large subunit MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE SMQPAGEGLLQQKYEQLKAKLQAECLFDQQYKKPLPSPAHCVGVITSKTG AALHDILHVLKRRDPSLPVIIYPTSVQGDDAPGQIVRAIELANQRNECDV LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD LRAPTPSAAAEVVSRNQQELLRQVQSARQRLEMAMDYYLANRTRRFTQIH HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRAGQQQQRLTQRLNQ 
QNPQPKIHRTQTRIQQLEYRLAEILRAQLSATRERFGNAVTHLEAVSPLS TLARGYSVTTATDGNVLKKVKQVKTGEMLTTRLEDGWIESEVKNIQPVKK SRKKVH >S0367 xseB, exonuclease VII, small subunit MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ AKLQQAEQRVQILLSDNEDASLTPFTPDNE >S1594 xthA, exonuclease III MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGVVDTFR HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI RSMEKPSDHAPVWATFRR >S0297 yafM, hypothetical protein MSEYRRYYIKGGTWFFTVNLRNRRSHLLTTQFQTLRNAIINVKRDRPFEI NAWVVLPEHMHCIWTLPESDDDFSSRWREIKKQFTHACGLKNIWQPRFWE HAIRNTKDYRHHVDYIYINPEKHGWVKQVSDWPFSTFHRDVARGLYPIDW AGDITDLSAGERIIL >S0339 yaiD, hypothetical protein MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK KTEKDSLKDEVLHSLLPRAFSRFSQTMMWLDTVNGLIMVDCASAKKAEDT LALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSL LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL KRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGE AQR >S0393 ybaV, hypothetical protein MKHGIKALLITLSLACAGMSHSALAAASVAKPTAVETKAEAPAAQSKAAV PAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV EDLKQVPGMGNSLVERNLAVLTL >S0406 ybaZ, hypothetical protein MEKEDSFPQRVWQIVAAIPEGYVTTYGDVAKLAGSPRAARQVGGVLKRLP EGSTLPWHRVVNRHGTISLTGPDLQRQRQALLAEGVMVSGSGQIDLQRYR WNY >S0660 ybeL, putative alpha helical protein MNKVAQYYRELVASLNERLRNGERDIDALVEQARERVIKTGELTRTEIDE LTRAVRRDLEEFAMSYEESLKEESDSVFMRVIKESLWQELADITDKTQLE WREIFQDLNHHGVYHSGEVVGLGNLVCEKCHFHLPIYTPEVLTLCPKCGY DQFQRRPFEP >S0876 ybjD, hypothetical protein MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR 
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR AD >S0892 ycaJ, putative polynucleotide enzyme MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDDTRRAI AELVNGDARRALNTLEMMADMAEVYDSGKRVLKPELLTEIAGERSARFDN KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR >S1184 ycfH, hypothetical protein MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDTARYVPLDRL LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF ARLFHIDASRLQSIR >S1530 yeaB, hypothetical protein MEYRSLTLDDFLSRFQLLRPQINREPLNHRQAAVLIPIVRRPQPGLLLTQ RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVTP >S3201 yeeS, putative RADC family DNA repair protein, MVAGTMQQLSFLPGEMTTRERSLILRALKTLDRHLHEPGVAFTSTHAARE WLILNMAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRA LYHNAAAVVLAHNHPSGEVPPSKADRLITERLVQALALVDIRVPDHLIVG GSQVFSFAEHGLL >S2400 yejH, putative ATP-dependent helicase MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAE KRKGVMIFAATVEHAKEIVGLLPAEDAALITGDNPGAERDVLIEDFKAQR FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE 
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG >S2463 yfaO, hypothetical protein MADDRGVFPGQWALSGGGVESGERIEEALRREIREELGEQLLLTEITPWT FSDDIRTKTYADGRKEEIYMIYLIFDCVSANREVKINEEFQDYAWVKPED LVHYDLNVATRKTLRLKGLL >S2660 yffH, hypothetical protein MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI LLYNAKKKSVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQMSHLMD >S2839 yfiL, hypothetical protein MMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVGMEDAISGSAIKDD DAFSDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSGKSFPASCNNVENA SQLHEVWQKRADENASTIRLN >S3038 ygdP, putative invasion protein MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE FASVVMSLQENTPKPQNASAYRRKRG >S3413 yhbQ, hypothetical protein MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >S3517 yhdJ, putative methyltransferase MRTGCEPTRFGNEAKTIIHGDAFAELKKLPTESVDLLFADPPYNIGKNFD GLIEAWKEDLFIDWLFEVIAECHRVLKKQGSMYIMNSTENMPFIDLQCRK LFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAILVE AKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQKP KALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEYIK MGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLIAK >S4280 yhhF, hypothetical protein MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRVVSQQLIKNLATLKA GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQHEAQGESDAD >S0267 yi21_6, IS2 orfA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >S0266 yi22_6, IS2 orfB MARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIH HVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNALLLERKP AVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA 
LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANE TRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAA KNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >S0968 yi41, IS4 orf MHIGQALDLVSRYDSLRNPLTSLGDYLAPELISRCLAESGTVTLRKRRLP LEMMVWCIVGMVLERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT GDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRKLGKGD HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG GEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAY NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM RDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >S4082 yicF, putative enzyme MKVWMAILISILCWQSSVWAVCPAWSPARAQEEISRLQQQIKQWDDDYWK EGKSEVEDGVYDQLSARLTQWQRCFVSEPRDVMMPPLNGAVMHPVAHTGV RKMADKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNGLK GEDWTQKVSLISAVLQTVSGPLANSTLQGEIFLQREGHIQQQMGGINARA KVAGLMMRQGNSDTLNSLAVFVWAWPDGPQLMTDRLKELATAGFTLTQRY TRAVKNADEVARVRNEWWKAKLPFVTDGVVVRGAKEPESRHWLPGQAEWL VAWKYQPVAQVAEVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNIGS VRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFNPL TCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHIFS WLLLTPEQLQNTPGIAKSKSAQLWHQFNLARNQPFTRWVMAMGIPLTRAA LNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGSWL AAQQITGFEP >S3836 yigW, hypothetical protein MFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQKLAR QYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDFNRNF STPEEQERAFVAQLRIAAELNMPVFMHCRDAHERFMTLLEPWLDKLPGAV LHCFTGTREEMQACVARGIYIGITGWVCDERRGLELRELLPLIPAEKLLI ETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAATTDA NVKTLFGIAF >S3667 yjaD, hypothetical protein MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ GEPVWLIQQQRRHDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSLLLAQHTRHR NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA MCRAEYE >S4681 yjjV, hypothetical protein MQALAENYQPLYAVLGLHPGMLEKHSDVSLEQLQQALERRPAKVVAVGEI GLDLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRH 
DLPRAGVVHGFSGSQQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLP LASLLLETDAPDMPLNGFQGQPNRPEQAARVFAVLCELRPEPADEIAEVL LNNTYAVFNVRG >S3144 yqgF, hypothetical protein MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY >S3279 yqiE, hypothetical protein MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH HQALKNEWA >S3406 yraN, hypothetical protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI DLIMREGLTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >S3539 yrdD, putative DNA topoisomerase MRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSSADGHIVKV LEGQVCPVCGANLVLRQGRFGMFIGCSNYPECEHTELIDKPDETAITCPQ CRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPECHYPLLIEK KTAQGVKHFCASKQCGKPVSAE >S4347 yrfE, hypothetical protein MNKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEV GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV