Gene list
Applied filters:
COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: pCP301
Number of genes found: 124
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Shigella flexneri 2a str. 301, 301 >CP0179 IS629 ORF2 MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR GKKVHTTVSRKAVAAGDRVNRHQGNVPRTPGGPQRLVYVVSAADKDKHTS AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT >CP0099 ISSfl1 ORF1 MQSRFFTILRSNRHNLCGDLQQGMVHKSDSDELSALRAENARIIKPLLPP EPATPRAGRPWAEHRKIINGMFWVLCSSAPWRDLPERYGAWKTVYNRFNR WSKSGVINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIP ISTEIMGRVALAAVLAPKSIWQQTEVASR >CP0057 IS600 ORF2 MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELADNGIIVG >CP0187 orf, conserved hypothetical protein MPGASNAAMCKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKF WSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLALVRKQYPKLFAHW YLLRASNE >CP0239 IS630 ORF MVVSAIASTPQLHRGDRVSDVARTLCCARSSVGRWINWFTQSGVEGLKSL PAGRARRWPFEHICTLLRELVKHSPGDFGYQRSRWSTELLAIKINEITGC QLNAGTVRRWLPSVGIVWRRAAPTLRIRDPHKDEKMAAIHKALDECSAEH PVFYEDEVDIHLHPKIGADWQLRGQQKRVVTPGQNEKYYLAGALHSGTGK VSYVGGNSKSSALFISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL KENPKFRGIYQPVYSPWVNHVERLWQALHDTITRNHQCRSMWQLLKKVRH FMETVSPFPGGKHGLAKV >CP0001 IS2 ORF2 MVHATELMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI MPKPDGLTAAKNLAEAFEHYNEWHPHSALDYRSPREYLRQRANDNRCLEI >CP0066 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0212 ISSfl4 ORF2 MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI AWPHPKRTERPGIRI >CP0178 IS3 ORF2 MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF YASGPNQKWAGDITYYYSSPTAGKHGAPGY >CP0098 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >CP0210 iso-IS1 ORF1 MATVTVHCPRCHSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEARKPG VKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG >CP0248 oriT nicking and unwinding protein MLTGNLVMALFNHDTSRDQEPQLHTHAVVTNVTQYNGEWKTLSSDKVGKT GFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEA FSGRSQTIREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKE TGFDIRAYRDAAEQRAYTRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQ FMYTDLLARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTFG IHMLDELSVRALSRDIMKQNRVTVHLEKSVPRTAGYSDAVSVLAQDRPSL AIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLS GELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVL ITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVGWPEI LRPA >CP0222 putative protein encoded within IS MSDGYSVYKSLADNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKI AGLYRIEKLIRERPVEKIRQWR >CP0171 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0193 orf, conserved hypothetical protein MTMATINARIDDDIKNQADEVLKLMNISQTQAIAAFYQYITEQKKLPFVI TSIVKTPHDLLRESTDMLAEALAVISNLQVWTEQQDGIGKAKLMEYYRRL DALYCCAKEKIGLLSDNRDAELGCVP >CP0026 IS100 ORF1 MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVSREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRLRDPHKITANKPAPYFGRFWTDGL ELCSVLFVLK >CP0254 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLACLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >CP0213 ISSfl4 ORF3 MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >CP0105 IS2 ORF2 MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL DCCDREALHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE I >CP0166 IS3 ORF1 MTHMTKTVSTSKKTRKQNSPEFCSEALKLAERIGVAAAARELSLYESQLY AWRSKLQQQMTSSERESELPA >CP0101 ISSfl2 ORF MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRILGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS >CP0270 IS3 ORF2 MECLHGEHFIYREIVRATVFNYIECNYNRWRRHRWCGGLSPEQFENQNLA >CP0076 iso-IS1 ORF1 MRFSMTTVTVHCPRCNSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEA RKLGVKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG >CP0159 IS600 ORF2 MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP AAFRIKYYQMTA >CP0002 putative resolvase MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI STVL >CP0029 IS100 ORF2 MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHE EKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFI ERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAADLLLQLSTAQ RQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAM ILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKGESYRLRQKRK AGVIAEANPE >CP0112 ISSfl4 ORF3 MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLPADAYAGYNVLYETGRVKEA GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >CP0107 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >CP0047 IS2 ORF2 MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFMKTMKEDCI AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST >CP0120 ISSfl2 ORF MTESSDYESVLVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH AGEAKTDARNAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS >CP0106 IS2 ORF1 MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >CP0058 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0077 iso-IS1 ORF2 MLAYTCGPRNDETCRELLALLTPFCIGMVTSDDWGSYAREVPEEKHLTGK IFTQRMNVTT >CP0012 orf, conserved hypothetical protein MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV ESRRQAKGTRFLWQHSDKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD IWSRPWSEERRNDWQRWLRPTVTSP >CP0205 ISSfl4 ORF2 MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >CP0040 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0183 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >CP0020 ISSfl4 ORF3 MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQILEAQERISALQEEMAETLGEQYDPVL PSPLRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLE LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS PDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRITEAACMAHVRRKIH DVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS LYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV >CP0223 ISSfl4 ORF3 MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTD VLTRLPEWPEERLAELLPLEGFTFTG >CP0087 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQSASPVPPDRARHS GPRITCVRRQKKP >CP0091 iso-IS10R ORF MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI SNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG KL >CP0108 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0206 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >CP0214 IS100 ORF2 MLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRS LSFIERNENIVLQGTSDITNPRVGICV >CP0030 iso-IS1 ORF2 MGRWWRYKWITFHPSLTQHWLWYAYNTKTGGVLAYTFGPRNDETCRELLA LLTPFCIGMVTSDDWGSYAREVPKEKHLTGKIFTQRIARNNRTLRTRIKR LARKTICFSRSVEIHEKVIGSFIEKHMFY >CP0118 IS150 ORF B MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE >CP0011 orf, conserved hypothetical protein MDDRIQAGKADMAACADEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA KGYRNRERFKLGVMFHYGKLNMAF >CP0074 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAMVKLGCIRLFYQRLRN >CP0234 orf, conserved hypothetical protein MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPEGREVTFSAFSDWLPRNR AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA EIYKFFTNALYVALTRAKHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN PGLKIRKGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE RSEKSVGYWVGGIRRAAQKA >CP0268 IS911 ORF1 MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKAT ALLMSDSLNSSR >CP0025 IS3 ORF2 MSKLILPSNTVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEG WLYLAVVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHS DRGSQYCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHG EHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >CP0021 putative transposase MDAENETVLNANMTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRT LRDRNGTFEPQQLKKNQP >CP0039 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNHQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HTPPAEAEKAYYASIGNDDLAA >CP0174 putative transposase MLNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHV AKQPGEVVDKTRQNEPPRFSWRVFYL >CP0204 ISSfl4 ORF3 MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT SPPSENPIASKPESPGRESSRKPLPAELPRETHRLLPAETSCPACGGVLK EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA GCLAHARRKTHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >CP0013 putative IS1 ORF MPEPVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSD DMHYKIIGWYLTINHHH >CP0267 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFARNPRPIQQ CFKFTLLAFSHLFLGNTGECHDGALSE >CP0256 orf, conserved hypothetical protein MNINQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWI WRHRK >CP0157 IS600 ORF2 MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTSSDHNLPVTPNLL NQNFTPTAPNQVWVADITYVATREGWLYLAGVKDVYTCEIVGYAMGERMT KELTGKALFMALRSQHPPAGLIHHSARGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISSAYGKTD >CP0211 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >CP0088 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >CP0172 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLALHTAA >CP0207 IS2 ORF2 MDGPRSSHTDDTDVLLGIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN AKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCC DNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDL PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK TVSAPSATSCENCPVWQQSRPSILMDTNDEFPDNKRYSLLPFLFA >CP0084 iso-IS10R ORF MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI SSLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG KL >CP0231 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GLRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAK AEFDRLWKK >CP0242 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0095 ISSfl4 ORF3 MDTSLAHENARLRALLQTQQDTIHQMAEYNRLLSQRMAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQIPFSRAIYATQALGCSDCSIKAILNS >CP0114 ISSfl4 ORF1 MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >CP0056 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGCIVGWRVSSSMGLTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLQERLG HIPPAEAEKAYYASIGNDDLAA >CP0173 IS629 ORF2 MYRWPCTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKN RAEVELATLTWVDWYNNRRLQERLGHIPPAEAEKAYYASIGNDDLAA >CP0208 IS2 ORF1 MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >CP0233 IS3 ORF2 MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG DITYLRTDEVRLHPVSTEPHAF >CP0008 IS600 ORF2 MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR RHSRLGNISPAAFREKYHQMAA >CP0085 IS100 ORF2 MLHEEKLARHQRKQAMYTRMAAFPAVKMFEEYDFTFATGAPQKQLQSLRS LSSERSPHNFPKT >CP0247 oriT nicking and unwinding protein MSKGYTFMMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGL QGSVDKDVFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSISMM AMLGGDKRLIEAHNQAVDFAVRQVEASAST >CP0017 IS4 ORF MPDSFMHIGQALDLGSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLR KRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQA RQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPEN DAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQ LIGQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRK LGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDA MRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVKQELWG VLLAYNLVRYQMIKMAEHLKGYCPNQLSFSESCGMVMRMLMTLQGASPGR IPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA >CP0033 putative transposase MQQPKMTVAMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYN DAQAIAEACQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRG LLAEYGIVFSKGAAELRQK >CP0113 ISSfl4 ORF2 MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT DRTAWHPDITRDKTRE >CP0006 putative protein encoded within IS MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS YAGAGLLAHVVTGKYADHLPLYRQSDLLFHTAI >CP0018 putative protein encoded within IS MEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPV TLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTPALLQ ILIREIKGSSH >CP0089 IS100 ORF2 MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAA DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG ESYRLRQKRKAGVIAEANPE >CP0069 IS21 ORF2 MTLTELLWRESEKLRRYKKEARLPVAKTLSEYDFIQLPELNGAQFQQLCE TTDWVDAGENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQE LRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYE RGSLVITSNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKT AKAVTSVT >CP0044 IS150 ORF1(ORF A) MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ VRFLETRLVYLKKLKALAHPTKK >CP0062 ISSfl1 ORF2 MQREKTPEWREKQKSSRGIRRGQGYRLVFQFPIRERCFGRLKEYRRIATR YDKTARNYLAMVKLGCIRLFYQRLRN >CP0232 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRMTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >CP0249 oriT nicking and unwinding protein MKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLD SRSRYLRDMYRPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGET QVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASV SEDAMTVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFA SVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI KARAGETLLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTE AKSFAAEGTGFADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAE KSILRHILEGKEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVV QGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDA QTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAG GGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAV YSLINRDVERALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEA QQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVL NSMIHDAREKAGELGKVQVMVPVLNTANIRDGELRRLSTWENNPDALALV DNVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVG TGDRIRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERA EQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK QHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAERLFSTARE LRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAG IWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESLLADNMQD GVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNG EPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRE IAGQERDRAAITEREAALPESVLREPQRVREAVREVARENLLQERLQQME RDMVRDLQKEKTPGGD >CP0081 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQ >CP0255 IS600 ORF1 MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0041 IS150 ORF B MRQQQDEQGRFSICSRQAAVVQRLMGILSLKAAIKVKRYRSYRGEVGQTA PYVLQRDFKATRPNEKWVTDFTEFAVNGRKLYLSPVIDLFNNEVISYSLS ERPVMNMVENMLDQAFKKLNEPPRESWRLNFLRKR >CP0075 IS630 ORF MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSVGIVWRRAAPTL RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLHPKIGADWQLRGQ QKRVVTPGQNEKYYLAGALHSGTGKVSYVGGNSKSSALFISLLKRLKATY RRAKTITLIVDNYIIHKSRETQSWLKENPKFRGIYQPVYSPWVNHVERLW QALHDTITRNHQCRSMWQLLKKVRHFMETVSPFPGGKHGLAKV >CP0184 IS629 ORF2 MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLVYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >CP0121 IS100 ORF1 MVTFETVMEIKILHKQGMSSRTIARELGLSRNTVKRYLQAKSEPPKYTPR PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC >CP0269 IS911 ORF2 MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL RPHEYNGGLPPNESENRYWKNSNSVASFC >CP0180 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW VRQHERDTGGGEVGSPPLNVSV >CP0189 IS600 ORF2 MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA PMESFWGTLKNGTGTE >CP0027 IS100 ORF1 MISLNCWHKSVDHIMLCLSRFLGIPQPFRAQTKGKVERMVQYTRNSFYIP LMTRLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQ SMLALPPEKKEYDVHPGENLVSFDNPPQHHPLSIYDSFCRGVA >CP0068 putative protein encoded within IS MLPSETMIWQPEFTDKTLSRKPGAVHAVRQQRSKALLTSLNEWMVEKNGT LSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRTVCLGKKN YMFFGSDHGGDRGALLYGLIGSCRLNGIDPEAYLRHILSVLPEWPSNRVD ELLPWNVVLTDK >CP0086 putative protein encoded within IS MRATAEEALKRISELYAIEDEIRGLPESECLAVRQQRSKALLTSLHEWMV EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVC LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP SNRVDDLLPWKVVLPSG >CP0170 orf, conserved hypothetical protein MDDRIQAGKADMAACTDEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA KGYRNRERFKLGVMFHYGKLNIAF >CP0191 ISSfl1 ORF2 MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI ATRYDKTARNYLAIVKLGCIRLFYQRLRN >CP0116 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0177 putative reverse transcriptase MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA VKENWQWKPAVAYCCYADDCVPRRRVLGT >CP0165 IS3 ORF2 MRSGWYTWCQRRTGISPRQQFRQHCDSVVLAAAFTRSKQRYGAPRLTDEL RAQGYHFNVKTVAASLRRQGLRAKASRKFTYRKLKNQTIPLSTPYAT >CP0073 ISSfl1 ORF1 MFWVLCSSAPWRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDA NGFIDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSI WQQTEVASR >CP0158 IS600 ORF1 MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR >CP0045 IS100 ORF2 MQYISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAF PAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHL AIAMGYEAFKIFYDISKISLELYHNIH >CP0188 putative reverse transcriptase MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP GVPFERYADDVVCHSRI >CP0080 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW GRQHERDTGGGDGGLITAERQRLKEPERENRELRRSNNILRLASAYFAKA EFDRLWKK >CP0055 IS629 ORF1 MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA EFDRLWKK >CP0019 ISSfl4 ORF2 MISFPAGSRIWLVAGITDMRNGFNGLVSKVQNVLKDDPFSGHLFIFRGRR GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI DWKHPKRTERAGIRI >CP0203 putative transposase MRFVQPRTETQQAIRALHRVRESLIRDKVKTTNQIHGFLLEFGISLPTGD AVIKRLSLVLAEHEIPEYLSRLLVRLHTHYLYLVEQIAELESELSQSINA DDTAQRIMTIPGVGPITASLLSSQLGDGKQFSCSRDFAASTGLVPRQYST GGKSTLLGISKRGDKNLRRLLVQCARSFMMQLERQHGKLAEWVREQLNKK HSNVVACALANKLARIA >CP0067 IS629 ORF2 MLREGIRVARCTVARLMVVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG HIPPAEAEKAYYASIGNDDLAA >CP0007 IS600 ORF2 MVVSAIASTPHLVYIRTRETYGTRRLQTELADNGIIVGRDRLAGLRKELR LHCKQKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADSVVQAFRN QPTEGAGRETAAYAVR >CP0192 ISSfl1 ORF1 MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSSAP WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR >CP0117 IS600 ORF2 MKRCVGYLVYIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCK QKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLY LAGVKDVYTCEIVGYAMGERMMKELTGKALFMALRSQRLPAGLIHHTDRG SQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRF KSRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >CP0049 insA, IS1 ORF1 MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC RASARIMGIGLNTVLRHLKNSGRSR >CP0065 insB, IS1 ORF2 MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLNRPGNP GD >CP0051 insB, IS1 ORF2 MIVCAEMDEHWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL LSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP0220 insB, IS1 ORF2 MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV ELHDKVIGHYLNIKHYQ >CP0128 ipaB, IpaB, secreted by the Mxi-Spa secretion machinery, required for entry into epithelial cells MHNVSTTTTGFPLAKILASTELGDNTIQAANDAANKLFSLTIADLTANQN INTTNAHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK IKDLENKINQIQTRLSELDPESPEKKKLSREEIQLTIKKDAAVKDRTLIE QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ EVIADLLASMSNSQANRTDVAKAILQQTTA >CP0151 spa32, Spa32, secreted by and component of the Mxi-Spa machinery MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC >CP0198 ycdA, orf, conserved hypothetical protein MSRFVLGNCIDVMARIPDNAIDFILTDPPYLVGFRDRQGRTIAGDKTDEW LQPACNEMYRVLKKDALMVSFYGWNRVDRFMSAWKNAGFSVVGHLVFTKN YTSKAAYVGYRHECAYILAKGRPRLPQNPLPDVLGWKYSGNRHHPTEKPV TSLQPLIESFTHPNAIVLDPFAGSGSTCVAALQSGRRYIGIELLEQYHRA GQQRLAAVQRAMQQGAANDDWFMPEAA >CP0253 yigB, orf, conserved hypothetical protein MLHYSGGLKYRWHLSDMENNMRKYIPLALFIFSWPVLSADIHGRVVRVLD GDTIEVMDSLKAVRIRLVNIDAPEKKQDYGRWSTDMMKSLVAGKTVTVTY >CP0257 yihA, orf, conserved hypothetical protein MKLIIFILIVLIIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILS SGEYWQRQKTLLTEREVSFMKGLFRIVDMKRWYLCPQVRVADIVQLNGNI RPRSRQWWQLFRMVSQWHVDVVIVELRSFSIVAAVELDDASHLRPERRRR DILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNTTGADQQSPEHRS