TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: pCP301

Number of genes found: 124

Free access
Sort by:

 



# Shigella flexneri 2a str. 301, 301

>CP0179 IS629 ORF2
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL
KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVHTTVSRKAVAAGDRVNRHQGNVPRTPGGPQRLVYVVSAADKDKHTS
AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT
>CP0099 ISSfl1 ORF1
MQSRFFTILRSNRHNLCGDLQQGMVHKSDSDELSALRAENARIIKPLLPP
EPATPRAGRPWAEHRKIINGMFWVLCSSAPWRDLPERYGAWKTVYNRFNR
WSKSGVINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIP
ISTEIMGRVALAAVLAPKSIWQQTEVASR
>CP0057 IS600 ORF2
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVG
>CP0187 orf, conserved hypothetical protein
MPGASNAAMCKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKF
WSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLALVRKQYPKLFAHW
YLLRASNE
>CP0239 IS630 ORF
MVVSAIASTPQLHRGDRVSDVARTLCCARSSVGRWINWFTQSGVEGLKSL
PAGRARRWPFEHICTLLRELVKHSPGDFGYQRSRWSTELLAIKINEITGC
QLNAGTVRRWLPSVGIVWRRAAPTLRIRDPHKDEKMAAIHKALDECSAEH
PVFYEDEVDIHLHPKIGADWQLRGQQKRVVTPGQNEKYYLAGALHSGTGK
VSYVGGNSKSSALFISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL
KENPKFRGIYQPVYSPWVNHVERLWQALHDTITRNHQCRSMWQLLKKVRH
FMETVSPFPGGKHGLAKV
>CP0001 IS2 ORF2
MVHATELMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI
MPKPDGLTAAKNLAEAFEHYNEWHPHSALDYRSPREYLRQRANDNRCLEI
>CP0066 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0212 ISSfl4 ORF2
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI
AWPHPKRTERPGIRI
>CP0178 IS3 ORF2
MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF
YASGPNQKWAGDITYYYSSPTAGKHGAPGY
>CP0098 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>CP0210 iso-IS1 ORF1
MATVTVHCPRCHSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEARKPG
VKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG
>CP0248 oriT nicking and unwinding protein
MLTGNLVMALFNHDTSRDQEPQLHTHAVVTNVTQYNGEWKTLSSDKVGKT
GFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEA
FSGRSQTIREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKE
TGFDIRAYRDAAEQRAYTRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQ
FMYTDLLARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTFG
IHMLDELSVRALSRDIMKQNRVTVHLEKSVPRTAGYSDAVSVLAQDRPSL
AIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLS
GELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVL
ITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVGWPEI
LRPA
>CP0222 putative protein encoded within IS
MSDGYSVYKSLADNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKI
AGLYRIEKLIRERPVEKIRQWR
>CP0171 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0193 orf, conserved hypothetical protein
MTMATINARIDDDIKNQADEVLKLMNISQTQAIAAFYQYITEQKKLPFVI
TSIVKTPHDLLRESTDMLAEALAVISNLQVWTEQQDGIGKAKLMEYYRRL
DALYCCAKEKIGLLSDNRDAELGCVP
>CP0026 IS100 ORF1
MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVSREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRLRDPHKITANKPAPYFGRFWTDGL
ELCSVLFVLK
>CP0254 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLACLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>CP0213 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>CP0105 IS2 ORF2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>CP0166 IS3 ORF1
MTHMTKTVSTSKKTRKQNSPEFCSEALKLAERIGVAAAARELSLYESQLY
AWRSKLQQQMTSSERESELPA
>CP0101 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRILGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS
>CP0270 IS3 ORF2
MECLHGEHFIYREIVRATVFNYIECNYNRWRRHRWCGGLSPEQFENQNLA
>CP0076 iso-IS1 ORF1
MRFSMTTVTVHCPRCNSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEA
RKLGVKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG
>CP0159 IS600 ORF2
MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP
AAFRIKYYQMTA
>CP0002 putative resolvase
MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR
SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP
DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI
STVL
>CP0029 IS100 ORF2
MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHE
EKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFI
ERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAADLLLQLSTAQ
RQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAM
ILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKGESYRLRQKRK
AGVIAEANPE
>CP0112 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLPADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>CP0107 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>CP0047 IS2 ORF2
MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFMKTMKEDCI
AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST
>CP0120 ISSfl2 ORF
MTESSDYESVLVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARNAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>CP0106 IS2 ORF1
MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>CP0058 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0077 iso-IS1 ORF2
MLAYTCGPRNDETCRELLALLTPFCIGMVTSDDWGSYAREVPEEKHLTGK
IFTQRMNVTT
>CP0012 orf, conserved hypothetical protein
MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV
ESRRQAKGTRFLWQHSDKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD
IWSRPWSEERRNDWQRWLRPTVTSP
>CP0205 ISSfl4 ORF2
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>CP0040 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0183 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>CP0020 ISSfl4 ORF3
MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQILEAQERISALQEEMAETLGEQYDPVL
PSPLRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLE
LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG
KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL
MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRITEAACMAHVRRKIH
DVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS
LYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV
>CP0223 ISSfl4 ORF3
MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI
CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTD
VLTRLPEWPEERLAELLPLEGFTFTG
>CP0087 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQSASPVPPDRARHS
GPRITCVRRQKKP
>CP0091 iso-IS10R ORF
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT
KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD
IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL
PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI
SNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH
HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS
PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA
NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG
KL
>CP0108 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0206 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>CP0214 IS100 ORF2
MLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRS
LSFIERNENIVLQGTSDITNPRVGICV
>CP0030 iso-IS1 ORF2
MGRWWRYKWITFHPSLTQHWLWYAYNTKTGGVLAYTFGPRNDETCRELLA
LLTPFCIGMVTSDDWGSYAREVPKEKHLTGKIFTQRIARNNRTLRTRIKR
LARKTICFSRSVEIHEKVIGSFIEKHMFY
>CP0118 IS150 ORF B
MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY
RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI
TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE
>CP0011 orf, conserved hypothetical protein
MDDRIQAGKADMAACADEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA
AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA
KGYRNRERFKLGVMFHYGKLNMAF
>CP0074 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>CP0234 orf, conserved hypothetical protein
MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF
VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS
RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD
DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK
ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPEGREVTFSAFSDWLPRNR
AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP
TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV
QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV
NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR
LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV
KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA
EIYKFFTNALYVALTRAKHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE
ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK
QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL
LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ
LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL
SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC
VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN
PGLKIRKGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE
RSEKSVGYWVGGIRRAAQKA
>CP0268 IS911 ORF1
MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST
MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKAT
ALLMSDSLNSSR
>CP0025 IS3 ORF2
MSKLILPSNTVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEG
WLYLAVVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHS
DRGSQYCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHG
EHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>CP0021 putative transposase
MDAENETVLNANMTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRT
LRDRNGTFEPQQLKKNQP
>CP0039 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNHQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKAYYASIGNDDLAA
>CP0174 putative transposase
MLNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHV
AKQPGEVVDKTRQNEPPRFSWRVFYL
>CP0204 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRESSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKTHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>CP0013 putative IS1 ORF
MPEPVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSD
DMHYKIIGWYLTINHHH
>CP0267 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFARNPRPIQQ
CFKFTLLAFSHLFLGNTGECHDGALSE
>CP0256 orf, conserved hypothetical protein
MNINQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWI
WRHRK
>CP0157 IS600 ORF2
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTSSDHNLPVTPNLL
NQNFTPTAPNQVWVADITYVATREGWLYLAGVKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQHPPAGLIHHSARGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISSAYGKTD
>CP0211 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>CP0088 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>CP0172 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLALHTAA
>CP0207 IS2 ORF2
MDGPRSSHTDDTDVLLGIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN
AKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCC
DNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDL
PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK
TVSAPSATSCENCPVWQQSRPSILMDTNDEFPDNKRYSLLPFLFA
>CP0084 iso-IS10R ORF
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT
KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD
IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL
PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI
SSLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH
HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS
PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA
NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG
KL
>CP0231 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GLRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAK
AEFDRLWKK
>CP0242 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0095 ISSfl4 ORF3
MDTSLAHENARLRALLQTQQDTIHQMAEYNRLLSQRMAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQIPFSRAIYATQALGCSDCSIKAILNS
>CP0114 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>CP0056 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGCIVGWRVSSSMGLTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLQERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0173 IS629 ORF2
MYRWPCTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKN
RAEVELATLTWVDWYNNRRLQERLGHIPPAEAEKAYYASIGNDDLAA
>CP0208 IS2 ORF1
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>CP0233 IS3 ORF2
MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG
DITYLRTDEVRLHPVSTEPHAF
>CP0008 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>CP0085 IS100 ORF2
MLHEEKLARHQRKQAMYTRMAAFPAVKMFEEYDFTFATGAPQKQLQSLRS
LSSERSPHNFPKT
>CP0247 oriT nicking and unwinding protein
MSKGYTFMMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGL
QGSVDKDVFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSISMM
AMLGGDKRLIEAHNQAVDFAVRQVEASAST
>CP0017 IS4 ORF
MPDSFMHIGQALDLGSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLR
KRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQA
RQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPEN
DAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQ
LIGQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRK
LGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDA
MRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVKQELWG
VLLAYNLVRYQMIKMAEHLKGYCPNQLSFSESCGMVMRMLMTLQGASPGR
IPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>CP0033 putative transposase
MQQPKMTVAMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYN
DAQAIAEACQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRG
LLAEYGIVFSKGAAELRQK
>CP0113 ISSfl4 ORF2
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>CP0006 putative protein encoded within IS
MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE
LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS
YAGAGLLAHVVTGKYADHLPLYRQSDLLFHTAI
>CP0018 putative protein encoded within IS
MEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPV
TLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTPALLQ
ILIREIKGSSH
>CP0089 IS100 ORF2
MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ
LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAA
DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV
IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG
ESYRLRQKRKAGVIAEANPE
>CP0069 IS21 ORF2
MTLTELLWRESEKLRRYKKEARLPVAKTLSEYDFIQLPELNGAQFQQLCE
TTDWVDAGENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQE
LRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYE
RGSLVITSNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKT
AKAVTSVT
>CP0044 IS150 ORF1(ORF A)
MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG
EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW
LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ
VRFLETRLVYLKKLKALAHPTKK
>CP0062 ISSfl1 ORF2
MQREKTPEWREKQKSSRGIRRGQGYRLVFQFPIRERCFGRLKEYRRIATR
YDKTARNYLAMVKLGCIRLFYQRLRN
>CP0232 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRMTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0249 oriT nicking and unwinding protein
MKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLD
SRSRYLRDMYRPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGET
QVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASV
SEDAMTVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFA
SVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI
KARAGETLLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTE
AKSFAAEGTGFADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAE
KSILRHILEGKEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVV
QGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDA
QTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAG
GGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAV
YSLINRDVERALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEA
QQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVL
NSMIHDAREKAGELGKVQVMVPVLNTANIRDGELRRLSTWENNPDALALV
DNVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVG
TGDRIRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERA
EQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK
QHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAERLFSTARE
LRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAG
IWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESLLADNMQD
GVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNG
EPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRE
IAGQERDRAAITEREAALPESVLREPQRVREAVREVARENLLQERLQQME
RDMVRDLQKEKTPGGD
>CP0081 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQ
>CP0255 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0041 IS150 ORF B
MRQQQDEQGRFSICSRQAAVVQRLMGILSLKAAIKVKRYRSYRGEVGQTA
PYVLQRDFKATRPNEKWVTDFTEFAVNGRKLYLSPVIDLFNNEVISYSLS
ERPVMNMVENMLDQAFKKLNEPPRESWRLNFLRKR
>CP0075 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSVGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLHPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSYVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRGIYQPVYSPWVNHVERLW
QALHDTITRNHQCRSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>CP0184 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLVYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0121 IS100 ORF1
MVTFETVMEIKILHKQGMSSRTIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>CP0269 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>CP0180 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW
VRQHERDTGGGEVGSPPLNVSV
>CP0189 IS600 ORF2
MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA
PMESFWGTLKNGTGTE
>CP0027 IS100 ORF1
MISLNCWHKSVDHIMLCLSRFLGIPQPFRAQTKGKVERMVQYTRNSFYIP
LMTRLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQ
SMLALPPEKKEYDVHPGENLVSFDNPPQHHPLSIYDSFCRGVA
>CP0068 putative protein encoded within IS
MLPSETMIWQPEFTDKTLSRKPGAVHAVRQQRSKALLTSLNEWMVEKNGT
LSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRTVCLGKKN
YMFFGSDHGGDRGALLYGLIGSCRLNGIDPEAYLRHILSVLPEWPSNRVD
ELLPWNVVLTDK
>CP0086 putative protein encoded within IS
MRATAEEALKRISELYAIEDEIRGLPESECLAVRQQRSKALLTSLHEWMV
EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVC
LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP
SNRVDDLLPWKVVLPSG
>CP0170 orf, conserved hypothetical protein
MDDRIQAGKADMAACTDEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA
AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA
KGYRNRERFKLGVMFHYGKLNIAF
>CP0191 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAIVKLGCIRLFYQRLRN
>CP0116 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0177 putative reverse transcriptase
MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA
VKENWQWKPAVAYCCYADDCVPRRRVLGT
>CP0165 IS3 ORF2
MRSGWYTWCQRRTGISPRQQFRQHCDSVVLAAAFTRSKQRYGAPRLTDEL
RAQGYHFNVKTVAASLRRQGLRAKASRKFTYRKLKNQTIPLSTPYAT
>CP0073 ISSfl1 ORF1
MFWVLCSSAPWRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDA
NGFIDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSI
WQQTEVASR
>CP0158 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0045 IS100 ORF2
MQYISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAF
PAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHL
AIAMGYEAFKIFYDISKISLELYHNIH
>CP0188 putative reverse transcriptase
MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY
KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI
PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH
QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI
ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP
GVPFERYADDVVCHSRI
>CP0080 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLITAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0055 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>CP0019 ISSfl4 ORF2
MISFPAGSRIWLVAGITDMRNGFNGLVSKVQNVLKDDPFSGHLFIFRGRR
GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
DWKHPKRTERAGIRI
>CP0203 putative transposase
MRFVQPRTETQQAIRALHRVRESLIRDKVKTTNQIHGFLLEFGISLPTGD
AVIKRLSLVLAEHEIPEYLSRLLVRLHTHYLYLVEQIAELESELSQSINA
DDTAQRIMTIPGVGPITASLLSSQLGDGKQFSCSRDFAASTGLVPRQYST
GGKSTLLGISKRGDKNLRRLLVQCARSFMMQLERQHGKLAEWVREQLNKK
HSNVVACALANKLARIA
>CP0067 IS629 ORF2
MLREGIRVARCTVARLMVVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0007 IS600 ORF2
MVVSAIASTPHLVYIRTRETYGTRRLQTELADNGIIVGRDRLAGLRKELR
LHCKQKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADSVVQAFRN
QPTEGAGRETAAYAVR
>CP0192 ISSfl1 ORF1
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSSAP
WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>CP0117 IS600 ORF2
MKRCVGYLVYIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCK
QKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLY
LAGVKDVYTCEIVGYAMGERMMKELTGKALFMALRSQRLPAGLIHHTDRG
SQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRF
KSRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>CP0049 insA, IS1 ORF1
MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC
RASARIMGIGLNTVLRHLKNSGRSR
>CP0065 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLNRPGNP
GD
>CP0051 insB, IS1 ORF2
MIVCAEMDEHWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL
LSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>CP0220 insB, IS1 ORF2
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>CP0128 ipaB, IpaB, secreted by the Mxi-Spa secretion machinery, required for entry into epithelial cells
MHNVSTTTTGFPLAKILASTELGDNTIQAANDAANKLFSLTIADLTANQN
INTTNAHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN
KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK
IKDLENKINQIQTRLSELDPESPEKKKLSREEIQLTIKKDAAVKDRTLIE
QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT
QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA
EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV
QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL
KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN
SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ
EVIADLLASMSNSQANRTDVAKAILQQTTA
>CP0151 spa32, Spa32, secreted by and component of the Mxi-Spa machinery
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN
YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP
DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR
NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL
QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV
GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>CP0198 ycdA, orf, conserved hypothetical protein
MSRFVLGNCIDVMARIPDNAIDFILTDPPYLVGFRDRQGRTIAGDKTDEW
LQPACNEMYRVLKKDALMVSFYGWNRVDRFMSAWKNAGFSVVGHLVFTKN
YTSKAAYVGYRHECAYILAKGRPRLPQNPLPDVLGWKYSGNRHHPTEKPV
TSLQPLIESFTHPNAIVLDPFAGSGSTCVAALQSGRRYIGIELLEQYHRA
GQQRLAAVQRAMQQGAANDDWFMPEAA
>CP0253 yigB, orf, conserved hypothetical protein
MLHYSGGLKYRWHLSDMENNMRKYIPLALFIFSWPVLSADIHGRVVRVLD
GDTIEVMDSLKAVRIRLVNIDAPEKKQDYGRWSTDMMKSLVAGKTVTVTY
>CP0257 yihA, orf, conserved hypothetical protein
MKLIIFILIVLIIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILS
SGEYWQRQKTLLTEREVSFMKGLFRIVDMKRWYLCPQVRVADIVQLNGNI
RPRSRQWWQLFRMVSQWHVDVVIVELRSFSIVAAVELDDASHLRPERRRR
DILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNTTGADQQSPEHRS