GenColors

Gene list

Applied filters:

Gene type: CDS
Genomic element: pCP301

Number of genes found: 261

# Shigella flexneri 2a str. 301

>CP0247 oriT nicking and unwinding protein
MSKGYTFMMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGL
QGSVDKDVFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSISMM
AMLGGDKRLIEAHNQAVDFAVRQVEASAST
>CP0117 IS600 ORF2
MKRCVGYLVYIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCK
QKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLY
LAGVKDVYTCEIVGYAMGERMMKELTGKALFMALRSQRLPAGLIHHTDRG
SQYCAYDYRVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRF
KSRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA
>CP0075 IS630 ORF
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSVGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLHPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSYVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKFRGIYQPVYSPWVNHVERLW
QALHDTITRNHQCRSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>CP0110 orf, conserved hypothetical protein
MIYGGFMKSGVQLNLRARESQRILIDAAAEILHKSRTDFILEMACKAAED
VILDRRVFNFNDRQYEEFIEMLDAPVADDPAIEKLLARKPQWDV
>CP0072 IS1294 ORF
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LNQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>CP0218 orf, hypothetical protein
MELKWTSKALSDLARLYDFLVLASKPAAARTVQSLTQAPVILLTHPRMGE
QLFQFEPREVRRIFTGEYEIRYELTGQTIYVLRLWHTRENR
>CP0033 putative transposase
MQQPKMTVAMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYN
DAQAIAEACQHGTIRPVPIKTLEQQDVQTFLNMRRLVSMERTQLINHIRG
LLAEYGIVFSKGAAELRQK
>CP0167 orf, conserved hypothetical protein
MNTALIVALMCMWYAVPAAAKETLLAMPRNSTEHCYAEINVHGPYGVYFR
VVPHPPGGKSWVECNSDYYYSDKPPGVQILGTRAGCRVYGICGTTSTLHV
AGRGVVCIKNICSPRGMIIHRIRKRPVVAVSDEM
>CP0118 IS150 ORF B
MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY
RWPGWLERVNAAGLIRSMSRKGCSPDNAACEGFFGRLKTEMYYGRKWSGI
TPEKFMQQVDAYIRWYNERRIKLSLGAVSPKMYRQQCGLE
>CP0217 orf, hypothetical protein
MQMKNNTAQATKVITAHVPLPMADKVDQMAARLERSRGWVIKQALSAWLA
QEEERNRLTLEALDDVTSGQVIDHQAVQSWADSLSTDHLLPVPR
>CP0069 IS21 ORF2
MTLTELLWRESEKLRRYKKEARLPVAKTLSEYDFIQLPELNGAQFQQLCE
TTDWVDAGENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQE
LRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYE
RGSLVITSNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKT
AKAVTSVT
>CP0212 ISSfl4 ORF2
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI
AWPHPKRTERPGIRI
>CP0180 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW
VRQHERDTGGGEVGSPPLNVSV
>CP0060a orf, hypothetical protein
MSHNLEHQKVHTRMVKEVLKAVARANNHPYKSVFADFITGHPSCTVCFWE
TFHKMYPDSPYEYVTFCHTCRRFDLYETEAEMKADDPKWW
>CP0192 ISSfl1 ORF1
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSSAP
WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>CP0112 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLPADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
RVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>CP0210 iso-IS1 ORF1
MATVTVHCPRCHSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEARKPG
VKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG
>CP0161 putative IS1294 ORF
MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD
TAALTRLQDTGG
>CP0087 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQSASPVPPDRARHS
GPRITCVRRQKKP
>CP0157 IS600 ORF2
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTSSDHNLPVTPNLL
NQNFTPTAPNQVWVADITYVATREGWLYLAGVKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQHPPAGLIHHSARGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISSAYGKTD
>CP0014 orf, conserved hypothetical protein
MKVSFKSLGYIFHDIYNKKHTIDEFNDVVKKAVLSGKINELNACHKVAIF
LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES
GFVSFVNREGKICHTAYVKSSDNSMTYYHANGSSIDKYITDMCGLICMRH
IESTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV
>CP0015 IS91 ORF
MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE
ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE
KNGEANHKERDVSAVTEG
>CP0089 IS100 ORF2
MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ
LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAA
DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV
IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG
ESYRLRQKRKAGVIAEANPE
>CP0201 orf, conserved hypothetical protein
MEIISNVRENRQVTVPAELLETLTQIAEQALWKREWAARDHGFPLPEYVT
RRQAMVDQARSLLKNNTHEND
>CP0043 IS1294 ORF
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>CP0266 orf, hypothetical protein
MSVKLRLPQLSSGEYLPGSLQDKILSDDCLEKEQMVVSAIASTPQASYHI
>CP0214 IS100 ORF2
MLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRS
LSFIERNENIVLQGTSDITNPRVGICV
>CP0088 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>CP0269 IS911 ORF2
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>CP0177 putative reverse transcriptase
MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA
VKENWQWKPAVAYCCYADDCVPRRRVLGT
>CP0012 orf, conserved hypothetical protein
MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV
ESRRQAKGTRFLWQHSDKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD
IWSRPWSEERRNDWQRWLRPTVTSP
>CP0025 IS3 ORF2
MSKLILPSNTVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEG
WLYLAVVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHS
DRGSQYCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHG
EHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA
>CP0064 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSYASTLKSPSYT
KSVSWQHYPLFVERP
>CP0267 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFARNPRPIQQ
CFKFTLLAFSHLFLGNTGECHDGALSE
>CP0171 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0008 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>CP0170 orf, conserved hypothetical protein
MDDRIQAGKADMAACTDEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA
AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA
KGYRNRERFKLGVMFHYGKLNIAF
>CP0216 orf, hypothetical protein
MEPVYVILNALLDSGRFTRKLILLGLSGTFSYIFGSIVATLGMGLVVDYL
GWGATFIVLILSAVFAIIFTLMSRERSLEFEKE
>CP0231 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GLRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAK
AEFDRLWKK
>CP0219 orf, hypothetical protein
MLSGQIFCIPLNNLVGDKINYDEITKITARDWRQYRAPGWQITHQKRYCQ
TLRVMTPTY
>CP0197 orf, hypothetical protein
MKYDGDGRATARFFSDKGCRRAPLFTAPADAARHKRCLWSVSRVRRARDG
RFYRSRLVSVTVYASPSPFSDERPSSRFRGITLLSKRRRLRYSTVGLTRY
RKR
>CP0166 IS3 ORF1
MTHMTKTVSTSKKTRKQNSPEFCSEALKLAERIGVAAAARELSLYESQLY
AWRSKLQQQMTSSERESELPA
>CP0061 IS1294 ORF
MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC
YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM
PA
>CP0233 IS3 ORF2
MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG
DITYLRTDEVRLHPVSTEPHAF
>CP0107 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>CP0102 orf, conserved hypothetical protein
MKKQIFINNKPPVVPYSGTHAKIFKYIEIPLPFFYFIYTSGEPFHISVQN
TVIYVSKYNGIFINKLVPFSLLFDRDISVLQRRDICVVRFTSEEISEHNV
LFDHDIERLKKISKAQLISPDYVLIDFSSGGPIKLSSML
>CP0016 putative IS91 ORF2
MDAGNKKLVFWFVRVDDEGYPEIARCMEREFATIPAGINADGMYCPECGT
VHWPDGVIPPF
>CP0205 ISSfl4 ORF2
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>CP0262 IS1294 ORF
MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG
>CP0076 iso-IS1 ORF1
MRFSMTTVTVHCPRCNSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEA
RKLGVKEQIVEMAHNGAGGRDTARTLKIGINTVIRTLKSSRPGG
>CP0060 orf, conserved hypothetical protein
MEVFMSTAASVRKTPREHQINIRATDEERAVIDYAASLVNKNRTDFIMEL
AYQEAKNIILDQRLFVLDNERYDSFITQLEAPVQNAEGRERLMAVKPEWK
>CP0034 orf, hypothetical protein
MHQPVKQLIARKFGLSCGFAMSIDTEKIKVRFSQINAGDFLFRHGIPLTF
VVKVYPSWRN
>CP0081 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQ
>CP0254 IS600 ORF2
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLACLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>CP0248 oriT nicking and unwinding protein
MLTGNLVMALFNHDTSRDQEPQLHTHAVVTNVTQYNGEWKTLSSDKVGKT
GFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEA
FSGRSQTIREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKE
TGFDIRAYRDAAEQRAYTRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQ
FMYTDLLARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTFG
IHMLDELSVRALSRDIMKQNRVTVHLEKSVPRTAGYSDAVSVLAQDRPSL
AIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLS
GELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVL
ITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVGWPEI
LRPA
>CP0111 orf, hypothetical protein
MTLTTTALNGSSSRRFEGCVWTPPLISHTALHPTSRHWMHSWHTYAGNAE
IPGL
>CP0001 IS2 ORF2
MVHATELMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI
MPKPDGLTAAKNLAEAFEHYNEWHPHSALDYRSPREYLRQRANDNRCLEI
>CP0183 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>CP0030 iso-IS1 ORF2
MGRWWRYKWITFHPSLTQHWLWYAYNTKTGGVLAYTFGPRNDETCRELLA
LLTPFCIGMVTSDDWGSYAREVPKEKHLTGKIFTQRIARNNRTLRTRIKR
LARKTICFSRSVEIHEKVIGSFIEKHMFY
>CP0204 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRESSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKTHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>CP0062 ISSfl1 ORF2
MQREKTPEWREKQKSSRGIRRGQGYRLVFQFPIRERCFGRLKEYRRIATR
YDKTARNYLAMVKLGCIRLFYQRLRN
>CP0097 orf, conserved hypothetical protein
MTLPVFITVIADHDKPQPSGCLLESQGSLCPICRQRITHETGWNVHHKVK
KVMGAVKNYLTLSCYIQIAIDSYTVVKPALSKRAYKGLSGVPGNRYAPFL
GEGSPAMNCPYPTNIQNERNVLESAYNPL
>CP0175 IS91 ORF
MLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQDEIGLRYNSHRTKRE
ENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEAAITGRCGVRHNGDSE
KNGEANHKERDVSAVTEG
>CP0268 IS911 ORF1
MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST
MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKAT
ALLMSDSLNSSR
>CP0261 orf, hypothetical protein
MAENGYGLAGLGMGKVKSVNQYRLTPGFGGFTPVSHVTTACRLPCRWRGI
RIIQAAFNAFAKV
>CP0114 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>CP0178 IS3 ORF2
MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF
YASGPNQKWAGDITYYYSSPTAGKHGAPGY
>CP0029 IS100 ORF2
MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHE
EKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFI
ERNENIVLLGPSGVGKTHLAIAMGYEAVRVGIKVRFTTAADLLLQLSTAQ
RQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAM
ILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKGESYRLRQKRK
AGVIAEANPE
>CP0215 orf, conserved hypothetical protein
MNAHWSSKKSNFFRKNIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG
VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK
>CP0020 ISSfl4 ORF3
MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQILEAQERISALQEEMAETLGEQYDPVL
PSPLRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLE
LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG
KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL
MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRITEAACMAHVRRKIH
DVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS
LYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV
>CP0186 orf, hypothetical protein
MNQKVKSVGSDNVIDDHHVFFADSRCDFVKVVSAYVCDMGMQLLYFVFLL
LPVVAEFNLAA
>CP0193 orf, conserved hypothetical protein
MTMATINARIDDDIKNQADEVLKLMNISQTQAIAAFYQYITEQKKLPFVI
TSIVKTPHDLLRESTDMLAEALAVISNLQVWTEQQDGIGKAKLMEYYRRL
DALYCCAKEKIGLLSDNRDAELGCVP
>CP0169 orf, hypothetical protein
MAFILSSLILLFSASAFPFDTPWRMAFRIPYNLFPIVLATFFIMGTSLLA
AAISASRCVFPRSMVCSRYPLPVLSAPSTGSSASSVHAAISAFPAWIRSS
IYRCAATGSEFPLPDDDFQQEDADVHSEPPRESWRLNFLRKR
>CP0239 IS630 ORF
MVVSAIASTPQLHRGDRVSDVARTLCCARSSVGRWINWFTQSGVEGLKSL
PAGRARRWPFEHICTLLRELVKHSPGDFGYQRSRWSTELLAIKINEITGC
QLNAGTVRRWLPSVGIVWRRAAPTLRIRDPHKDEKMAAIHKALDECSAEH
PVFYEDEVDIHLHPKIGADWQLRGQQKRVVTPGQNEKYYLAGALHSGTGK
VSYVGGNSKSSALFISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL
KENPKFRGIYQPVYSPWVNHVERLWQALHDTITRNHQCRSMWQLLKKVRH
FMETVSPFPGGKHGLAKV
>CP0174 putative transposase
MLNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHV
AKQPGEVVDKTRQNEPPRFSWRVFYL
>CP0184 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLVYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0173 IS629 ORF2
MYRWPCTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKN
RAEVELATLTWVDWYNNRRLQERLGHIPPAEAEKAYYASIGNDDLAA
>CP0200 orf, conserved hypothetical protein
MRGESMYGTCETLCRALAAKYSGDTPLMLVIWSPEEIQALADGMDISLSD
HEIRTVLAHLEDIPED
>CP0100 orf, conserved hypothetical protein
MSWLISQCAHQCTDNKKTETDAIYDKVRSSYLLSCILKKNKNVGLILHAP
SFVSVSEKIARIVMANYSRNWSNSELASAVLMSESSLKRRMYKEVGSIST
FVHKIKLTEAIRKLRRTNTPISVISSELGYSSPSYFSKVFFKYLKTYPQN
IRKKNGR
>CP0120 ISSfl2 ORF
MTESSDYESVLVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARNAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>CP0162 orf, hypothetical protein
MSGRVKTRCTQSQSGRRFSCVAIHRSVAFFPQDGQARLPHELVTYLTWGH
SGLSQMYMLHAQYPRAAGQHFCDSLNLDIAQTARIQEGCPALVGREQTFQ
RAGSKSGQHEDGLTPGIL
>CP0017 IS4 ORF
MPDSFMHIGQALDLGSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLR
KRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQA
RQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPEN
DAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQ
LIGQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELRK
LGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDA
MRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVKQELWG
VLLAYNLVRYQMIKMAEHLKGYCPNQLSFSESCGMVMRMLMTLQGASPGR
IPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>CP0211 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>CP0249 oriT nicking and unwinding protein
MKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLD
SRSRYLRDMYRPGMVMEQWNPETRSHDRYVTERVTAQSHSLTLRNAQGET
QVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASV
SEDAMTVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFA
SVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQI
KARAGETLLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFSMVDLLTE
AKSFAAEGTGFADLGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAE
KSILRHILEGKEAVTPLMERVPGELMEKLTSGQRAATRMILETSDRFTVV
QGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDA
QTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAG
GGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQTPELREAV
YSLINRDVERALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEA
QQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVL
NSMIHDAREKAGELGKVQVMVPVLNTANIRDGELRRLSTWENNPDALALV
DNVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVG
TGDRIRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERA
EQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK
QHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAERLFSTARE
LRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGKSAG
IWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESLLADNMQD
GVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNG
EPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRE
IAGQERDRAAITEREAALPESVLREPQRVREAVREVARENLLQERLQQME
RDMVRDLQKEKTPGGD
>CP0168 orf, conserved hypothetical protein
MNDNSLLRNSSLFIAYMGCVGWVSAYSYGWGTSFYYGFPWWVVGAGLDDV
ARSLLYAIIVMGILFTGWGIGILFFLLIKKRSKIQDLSFFRLFFAITLLF
FPVIFELLILKQYFILPLSLSFIISSLVISIIIRIYGRIFSVSCFSDIPF
VREHRIKLIMAGFLVYFWFFSFLVGWYKPQLKKEYQMLCYNNSWYYVLAR
YDSRLVLSSSFKDDSNRFLIFNTEQSGFYEINDVYVRK
>CP0090 putative IS1294 ORF
MSGRHIPQQTDIPRVFLQPLHIQSRDGPAVYHPAATQHAFERVTTQELFH
HLCIAHFRHWFRFIHPQSTVHLRQLLSTHPVGKEPEVPHHLKKLLRDVLF
QPRDQLTLCPERSPHNFPKT
>CP0106 IS2 ORF1
MIVLILVFRLVIGEQIIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>CP0159 IS600 ORF2
MESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQRRHSRLGNISP
AAFRIKYYQMTA
>CP0222 putative protein encoded within IS
MSDGYSVYKSLADNHPGITSACCWSHAGRGFANLYKASREPRAGVELRKI
AGLYRIEKLIRERPVEKIRQWR
>CP0041 IS150 ORF B
MRQQQDEQGRFSICSRQAAVVQRLMGILSLKAAIKVKRYRSYRGEVGQTA
PYVLQRDFKATRPNEKWVTDFTEFAVNGRKLYLSPVIDLFNNEVISYSLS
ERPVMNMVENMLDQAFKKLNEPPRESWRLNFLRKR
>CP0188 putative reverse transcriptase
MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY
KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI
PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH
QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI
ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP
GVPFERYADDVVCHSRI
>CP0223 ISSfl4 ORF3
MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI
CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTD
VLTRLPEWPEERLAELLPLEGFTFTG
>CP0121 IS100 ORF1
MVTFETVMEIKILHKQGMSSRTIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>CP0002 putative resolvase
MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR
SERRLVKTLLADFQRVHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP
DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI
STVL
>CP0187 orf, conserved hypothetical protein
MPGASNAAMCKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKF
WSRNFSYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLALVRKQYPKLFAHW
YLLRASNE
>CP0073 ISSfl1 ORF1
MFWVLCSSAPWRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDA
NGFIDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSI
WQQTEVASR
>CP0059 orf, conserved hypothetical protein
MEINVTAPALLTDEHILQPFDCGNEVLSNWLRGRAMKNQMLNASRTFVIC
LEDTLRIVGYYSLATGSVTHAELGRSLRHNMPNPVPVVLLGRLAVDVCTQ
GHGFGKWLLSDAIHRVVNLADQVGIKAVMVHAIDDDARAFYERFGFVQSV
VAPNTLFYKV
>CP0095 ISSfl4 ORF3
MDTSLAHENARLRALLQTQQDTIHQMAEYNRLLSQRMAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQIPFSRAIYATQALGCSDCSIKAILNS
>CP0057 IS600 ORF2
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVG
>CP0263 putative IS1294 ORF
MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD
TAALTRLQDTGG
>CP0085 IS100 ORF2
MLHEEKLARHQRKQAMYTRMAAFPAVKMFEEYDFTFATGAPQKQLQSLRS
LSSERSPHNFPKT
>CP0101 ISSfl2 ORF
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRILGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQVLIALARRRCDVLFAMMRDGTFYTPQGS
>CP0179 IS629 ORF2
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL
KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVHTTVSRKAVAAGDRVNRHQGNVPRTPGGPQRLVYVVSAADKDKHTS
AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT
>CP0013 putative IS1 ORF
MPEPVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSD
DMHYKIIGWYLTINHHH
>CP0080 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLITAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0191 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAIVKLGCIRLFYQRLRN
>CP0113 ISSfl4 ORF2
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>CP0096 putative IS91 ORF2
MVCNYRYKNRQCHCLSGGYMARSAKSRKRKPASQRSKLPRYVVKLHEDDF
FDEEDAEVLRFD
>CP0099 ISSfl1 ORF1
MQSRFFTILRSNRHNLCGDLQQGMVHKSDSDELSALRAENARIIKPLLPP
EPATPRAGRPWAEHRKIINGMFWVLCSSAPWRDLPERYGAWKTVYNRFNR
WSKSGVINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIP
ISTEIMGRVALAAVLAPKSIWQQTEVASR
>CP0058 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0092 IS1294 ORF
MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQPDE
SPNDFYQNH
>CP0109 orf, conserved hypothetical protein
MGCVTAPEPLSSFHQVAEFVSSEAVLDDWLKQKELKNQAIGATRTFVVCR
KGTQQIVGFYSLATGSVNHTEATGNLRRNMPDPIPVIILARLAVDVSFRG
KGLGADLLHDAVRRCYRVAENIGVRAIMVHALTENAKQFYIHHGFKPSKT
QVQTLFLKLPQ
>CP0038 orf, hypothetical protein
MHQLRHPLLGCVFCNECDPHPFAPDPLQIFHHYYEWVRPCFPIRAGNRFS
RSLRWSGPGSCCLHTGCRAVSKQVSSALIRGRLYGPILASSKISMLHRTV
YFRSASRTHVTVDLCLFPESLTTSFLRMKQHRVVWSVLL
>CP0011 orf, conserved hypothetical protein
MDDRIQAGKADMAACADEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA
AANSDVPMMKNVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA
KGYRNRERFKLGVMFHYGKLNMAF
>CP0203 putative transposase
MRFVQPRTETQQAIRALHRVRESLIRDKVKTTNQIHGFLLEFGISLPTGD
AVIKRLSLVLAEHEIPEYLSRLLVRLHTHYLYLVEQIAELESELSQSINA
DDTAQRIMTIPGVGPITASLLSSQLGDGKQFSCSRDFAASTGLVPRQYST
GGKSTLLGISKRGDKNLRRLLVQCARSFMMQLERQHGKLAEWVREQLNKK
HSNVVACALANKLARIA
>CP0055 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>CP0208 IS2 ORF1
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>CP0209 orf, conserved hypothetical protein
MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQ
KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE
QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF
NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY
ANSSSWKSKRLC
>CP0232 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRMTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0158 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0256 orf, conserved hypothetical protein
MNINQFMVRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWI
WRHRK
>CP0091 iso-IS10R ORF
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT
KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD
IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL
PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI
SNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH
HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS
PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA
NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG
KL
>CP0189 IS600 ORF2
MVLRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA
PMESFWGTLKNGTGTE
>CP0068 putative protein encoded within IS
MLPSETMIWQPEFTDKTLSRKPGAVHAVRQQRSKALLTSLNEWMVEKNGT
LSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRTVCLGKKN
YMFFGSDHGGDRGALLYGLIGSCRLNGIDPEAYLRHILSVLPEWPSNRVD
ELLPWNVVLTDK
>CP0165 IS3 ORF2
MRSGWYTWCQRRTGISPRQQFRQHCDSVVLAAAFTRSKQRYGAPRLTDEL
RAQGYHFNVKTVAASLRRQGLRAKASRKFTYRKLKNQTIPLSTPYAT
>CP0045 IS100 ORF2
MQYISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAF
PAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHL
AIAMGYEAFKIFYDISKISLELYHNIH
>CP0255 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0066 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0206 ISSfl4 ORF1
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINNNLLVKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>CP0074 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>CP0213 ISSfl4 ORF3
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPEFPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIVQAPLPPKPIERGYA
SAGLLARILVSKYMEHIPLYRQSEIYARQGVELSRNTMVRWVSEMADKLR
PLYIALNDYVLEAGKVHADDTPVKVLAPGNGKTKTGRLWVYVRDDRNAGS
SLPAAVWFAYSADRKGEHPQLHLAKYQGVLQADAYAGYNVLYETGRVKEA
GCLAHARRKIHDEDVRRPTEMTQEALRRIAELYDIEAEIRGSPAEERLAV
RKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAFDYILNHWNALNEFCRDG
WVEIDNNIGENALRSVAVGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEV
EPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>CP0037 orf, hypothetical protein
MGSPECVPCHSTSGSVFPALVAETMFPNIAGSRSCIDCHHVAGRTFSKSA
RRIRGAPLLRRTSRYDAQTSAFGISNDFRCISYVIPFWVVSSAMNVIPIP
SLQTHYRSFITTTNGSAPVSRYVPETGSHVP
>CP0116 IS600 ORF1
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0098 ISSfl1 ORF2
MLSPGQAHESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNE
LKNNGIKAVIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRI
ATRYDKTARNYLAMVKLGCIRLFYQRLRN
>CP0264 putative transposase
MDEKKLKALAAELAKGLKTEADLNQFSRMQTKLTVETVLNAELTDHLGHE
KSYIR
>CP0028 ISSfl3 ORF
MCQQFNEITAMPVHKVCQNFFRDALAPFHQYRQNALMDATMALINGASLT
QTSIGRFLPGNAQVKNKIKRIDRLMGNEALHRDIPMIFRNITSMLTRQLS
LCVIAVDWSGYPSQEHHVLRASLLCDGRSIPLLSKVVPSEKQNNPLIQHD
FLDSLAQSLPPDARVIIVTDAGFQSAWFHHITSLGWDFIGRIRNNVQYCL
DNAPERWLKVSDSPECKTPEYMGAGRLVKERKKSIRGHFYTYKKSAKGRK
KKRSKGQSGLNKTDKEQSKSAKEAWLIFSSTNDFRAREIIKLYSRRMQIE
QNFRDEKNGRFGFGLRASKSRSTGRILVLSLLATLSTIVMWLLGYHAENK
GLHLKYQANSIKSRRVISYLTLAKNVLRHSPLILRRTVLSTVLNHLSRTY
RNMVLVY
>CP0270 IS3 ORF2
MECLHGEHFIYREIVRATVFNYIECNYNRWRRHRWCGGLSPEQFENQNLA
>CP0160 putative IS1294 ORF
MVVSAIASTPQAMCAVTDRAEPRQPDGCKLKTAPVRPGVAARSAEYVCSE
DLFY
>CP0056 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGCIVGWRVSSSMGLTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLQERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0242 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK
GLGTPGSRTLAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0026 IS100 ORF1
MVTFETVMEIKILHKQGMSSRAIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVSREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRLRDPHKITANKPAPYFGRFWTDGL
ELCSVLFVLK
>CP0083 orf, conserved hypothetical protein
MNAHWSSKKSNFLRKNIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG
VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK
>CP0172 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLALHTAA
>CP0207 IS2 ORF2
MDGPRSSHTDDTDVLLGIHHVIGELPTYGYRRVWALLRRQAELDGMPAIN
AKRVYRLMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCC
DNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDL
PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVK
TVSAPSATSCENCPVWQQSRPSILMDTNDEFPDNKRYSLLPFLFA
>CP0234 orf, conserved hypothetical protein
MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF
VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS
RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD
DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK
ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPEGREVTFSAFSDWLPRNR
AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP
TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV
QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV
NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR
LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV
KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA
EIYKFFTNALYVALTRAKHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE
ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK
QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL
LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ
LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL
SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC
VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN
PGLKIRKGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE
RSEKSVGYWVGGIRRAAQKA
>CP0067 IS629 ORF2
MLREGIRVARCTVARLMVVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>CP0039 IS629 ORF2
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNHQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKAYYASIGNDDLAA
>CP0027 IS100 ORF1
MISLNCWHKSVDHIMLCLSRFLGIPQPFRAQTKGKVERMVQYTRNSFYIP
LMTRLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQ
SMLALPPEKKEYDVHPGENLVSFDNPPQHHPLSIYDSFCRGVA
>CP0119 putative transposase
MAEWLGEIQKRVITVCDREADIWHYLYYKVSHGQRGACCTESPAGRGTRQ
ALRTAGSPGNRRKPHAECDAKRRAGSPSGPDVHQLQRSQHKKSRQQRPGA
PAHVCLLPGAGRGRCLLASADVRKSGECRRCTTYCQPLRATLADRGIPQG
VEKWWYMESLRMQTRDNLERMVVILAFIAVRVLGLRQGGVSEETQNDSCE
KILTPTEWKLLWVKLEGKPLPVQAPTLKWAWGDGMTANAQVVPVGASCGM
AGQTSGYG
>CP0047 IS2 ORF2
MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFMKTMKEDCI
AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST
>CP0007 IS600 ORF2
MVVSAIASTPHLVYIRTRETYGTRRLQTELADNGIIVGRDRLAGLRKELR
LHCKQKRKFRATTNSDHNLPVTPNLLNQNFTPTAPNQVWVADSVVQAFRN
QPTEGAGRETAAYAVR
>CP0023 orf, hypothetical protein
MLQRQRGKVGFAQLPVDFVAIEPDSVQGVGKRANLTNRCFIIRINDSFKK
RQGFIEFISNSGSGHTVTVYTKRRFQRGVFMNSLNTNVVKPVMYRLRSSA
DARP
>CP0050 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSISWQHYRIEDRWSYSSLK
>CP0044 IS150 ORF1(ORF A)
MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG
EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW
LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ
VRFLETRLVYLKKLKALAHPTKK
>CP0040 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>CP0018 putative protein encoded within IS
MEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPV
TLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTPALLQ
ILIREIKGSSH
>CP0176 putative IS91 ORF2
MACDYRYKNRQYHCLSGSYMARSAKPRKRKPAPQRSKLLRYVVKLHEDDF
FDEEEAEVLRFDNFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP
EIARCTEREFATIPAGINADGMYCPECGTVHWPDGVIPPF
>CP0019 ISSfl4 ORF2
MISFPAGSRIWLVAGITDMRNGFNGLVSKVQNVLKDDPFSGHLFIFRGRR
GDQIKVLWADSDGLCLFTRRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
DWKHPKRTERAGIRI
>CP0272 IS629 ORF1
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAIFSHENVINSVNQFKKYT
LYLRK
>CP0108 IS600 ORF1
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>CP0042 putative IS1294 ORF
MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD
TAALTRLQDTGG
>CP0084 iso-IS10R ORF
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT
KARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSD
IREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL
PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI
SSLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH
HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS
PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA
NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG
KL
>CP0071 putative IS1294 ORF
MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD
TAALTRLQDTGG
>CP0221 putative IS1 encoded protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYPAPVSPAAHFLPNGCGAPRQYNGGYGRCCRSCTGPAGETATP
>CP0086 putative protein encoded within IS
MRATAEEALKRISELYAIEDEIRGLPESECLAVRQQRSKALLTSLHEWMV
EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVC
LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP
SNRVDDLLPWKVVLPSG
>CP0077 iso-IS1 ORF2
MLAYTCGPRNDETCRELLALLTPFCIGMVTSDDWGSYAREVPEEKHLTGK
IFTQRMNVTT
>CP0164 IS1294 ORF
MVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQMVKQFLSRDPF
ECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>CP0105 IS2 ORF2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAGTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>CP0006 putative protein encoded within IS
MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE
LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS
YAGAGLLAHVVTGKYADHLPLYRQSDLLFHTAI
>CP0021 putative transposase
MDAENETVLNANMTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRT
LRDRNGTFEPQQLKKNQP
>CP0124 acp, Acp, putative acyl carrier protein
MIKEKILSIVAFCYGIAYSKLSEETKFIEDLSADSLSLIEMLDMISFEFN
LRIDESTLEHIITIGDLISVVKNSTKSI
>CP0224 ccdA, post-segregation antitoxin
MKQRITVTIDSDSYQLLKSANVNISGLVNTAMQKEARRLRAERWQAENQQ
GMAEIARFIEMNGSFADENRDW
>CP0225 ccdB, post-segregation toxin
MRTGTGEMQFKVYAYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLS
DKVSRELYPVVHVGDESWRMMTTDMASVPIFVIGEEVADLSHRENDIKNA
INLMFWGI
>CP0251 finO, FinO, putative fertility inhibition protein
MTEQKRPVLTLKRKTEGTAPVRSRKTIINVTTPPKWKVKKQKLAEKAARE
AELAAKKAQARQALSIYLNLPTLDEAVNTLKPWWPGLFDGDTPRLLACGI
RDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYV
TEHISQEEEAYAGARLAKIRRQNRIKAELQAVLDEK
>CP0182 icsA/virG, IcsA (VirG), outermembrane protein exposed to the bacterial surface by a C-terminal autotransporter domain and involved in the movement of intracellul
MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIA
FATPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH
ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM
ILGGSGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG
GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT
FAGGNGGAAYGYGYDGYGGNAITGDNLSVINNGAILGGNGGHWGDAINGS
NMTIANSGYIISGKEDDGTQNVAGNAIHITGGNNSLILHEGSVITGDVQV
NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN
SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS
VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE
NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN
AGILKMGTVEAMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNIN
APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL
GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS
DKNAFIQKGRIVAGSYDYRLKQGTVSGLNTNKWYLTSQMDNQESKQMSNQ
ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL
YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI
GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL
YSSWFQDEKERTGLYMDAWLQYSWFNNTVKGDGLTGEKYSSKGITGALEA
GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGG
NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS
NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY
TF
>CP0132 icsB, IcsB, invasion protein
MILKISNFIDASNTKGPIRVEDTEHGPILIAQKFNLKDLFFRTLSTINAK
INSQILNEQLKNYRLENQKSLLLFLNTLASEKSAESAFAAYEAAKNSIQH
SFTGRDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISI
KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI
SDQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD
KVYIPLSGDNKTKDGKISHNLFGLDETNMSKFICKKKADAFRQLANYKLI
SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNDVYAYANKVRQRIES
LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS
LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMTRVLNELKTEATDKKEE
IIEKSIKIIDYYNSLKSPDLGTKLYIHDLLQINKLLLNNSHSNI
>CP0271 icsP/sopA, IcsP (SopA), outermembrane protease of the OmpP family, involved in cleavage of surface exposed IcsA
MKLKFFVLALCVPAIFTTHATTNYPLFIPDNISTDISLGSLSGKTKERVY
HPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVSGWTTLGNQKAS
MVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGWLLNNLDYRLGL
IAGYQESRYSFNAMGGSYIYSENGGSRNKKGAHPSGERTIGYKQLFKIPY
IGLTANYRHENFEFGAELKYSGWVLSSDTDKHYQTETIFKDEIKNQNYCS
VAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTNISGTIKNSASI
EYIGFLTSAGIKYIF
>CP0049 insA, IS1 ORF1
MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC
RASARIMGIGLNTVLRHLKNSGRSR
>CP0065 insB, IS1 ORF2
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLNRPGNP
GD
>CP0051 insB, IS1 ORF2
MIVCAEMDEHWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERL
LSLLSAFEVVVWMTDGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL
ARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>CP0220 insB, IS1 ORF2
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLVRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>CP0125 ipaA, IpaA, secreted by the Mxi-Spa machinery, modulates entry of bacteria into epithelial cells
MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEK
ENFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEILEFSNVLYS
LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL
RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET
VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLSRDLNTVAVFP
ELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIH
YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLSRNGNSL
LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKAFNHTPDNSDGIGSDL
LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKS
SASLSHRVASQINKFNSNTDSKVLQTDFLSRNGDTYLTRETIFEASKKVT
NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANIRN
YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLK
NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD
>CP0128 ipaB, IpaB, secreted by the Mxi-Spa secretion machinery, required for entry into epithelial cells
MHNVSTTTTGFPLAKILASTELGDNTIQAANDAANKLFSLTIADLTANQN
INTTNAHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN
KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK
IKDLENKINQIQTRLSELDPESPEKKKLSREEIQLTIKKDAAVKDRTLIE
QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT
QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA
EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV
QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL
KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN
SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ
EVIADLLASMSNSQANRTDVAKAILQQTTA
>CP0127 ipaC, IpaC, secreted by the Mxi-Spa secretion machinery, required for entry into epithelial cells
MEIQNTKPTQILYTDISTKQTQSSSETQKSQNYQQIAAHIPLNVGKNPVL
TTTLNDDQLLKLSEQVQHDSEIIARLTDKKMKDRSEMSHTLTPENTLDIS
SLSSNAVSLIISVAVLLSALRTAETKLGSQLSLIAFDATKSAAENIVRQG
LAALSSSITGAVTQVGITGIGAKKTHSGISDQKGALRKNLATAQSLEKEL
AGSKLGLNKQIDTNITSPQTNSSTKFLGKNKLAPDNISLSTEHKTSLSSP
DISLQDKIDTQRRTYELNTLSAQQKQNIGRATMETSAVAGNISTSGGRYA
SALEEEEQLISQASSKQAEEASQVSKEASQATNQLIQKLLNIIDSINQSK
NSTASQIAGNIRA
>CP0126 ipaD, IpaD, secreted by the Mxi-Spa machinery, required for entry of bacteria into epithelial cells
MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT
LHNIRTTNQALKKELSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS
RHEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYL
KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALE
ELKEKYKDKPLYPANNTVSQEQANKWLTELGGTIGKVSQKNGGYVVSINM
TPIDNMLKSLDNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK
YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF
>CP0265 ipaH1.4, invasion plasmid antigen, secreted by the Mxi-Spa secretion machinery
MIKSTNIQAIGSGIMHQINNVYSLTPLSLPMELTPSCNEFYLKTWSEWEK
NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK
NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP
ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN
NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT
MRTLQQITTGPDYSGPQIFFSMGNSATISAPEHSLADAVTAWFPENKQSD
VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS
AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL
LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV
KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA
DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE
QQIYRQLTDEVLALRLSENGSNHIA
>CP0054 ipaH2.5, invasion plasmid antigen, probably secreted by the Mxi-Spa machinery
MIKSTNIQVIGSGIMHQINNIHSLTLFSLPVSLSPSCNEYYLKVWSEWER
NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK
NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP
ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN
NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT
MRTLQQITTGPDYSGPQIFFSMGNSATISAPEHSLADAVTAWFPENKQSD
VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS
AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL
LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV
KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA
DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE
QQIYRQLTDEVLA
>CP0079 ipaH4.5, invasion plasmid antigen, probably secreted by the Mxi-Spa machinery
MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRI
QAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPEN
SPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSA
RFNSLETLPELPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPE
FPQSLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPENIITLPICTN
VNISGNPLSTHVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAV
TAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQ
VAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS
EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM
LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWG
PWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAER
EAGAQVMRETEQQIYRQLTDEVLA
>CP0078 ipaH7.8, invasion plasmid antigen, probably secreted by the Mxi-Spa machinery
MFSVNNTHSSVSCSPSINSNSTSNEHYLRILTEWEKNSSPGEERGIAFNR
LSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLK
ELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLY
NLPALPEKLKFLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNN
IRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRVLQSLQRLTSS
PDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHE
EHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVA
ADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE
ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV
TANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKY
EMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEV
LALRLSENGSRLHHS
>CP0226 ipaH9.8, invasion plasmid antigen, secreted by the Mxi-Spa secretion machinery
MLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPGEERDEAVSRL
KECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLTNLPELPVTLK
KLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLLTMNISYNEIV
SLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDRNQISHIPESI
LNLRNECSIHISDNPLSSHALPALQRLTSSPDYHGPRIYFSMSDGQQNTL
HRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSAR
NTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRK
TLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIE
VYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEF
TDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYPQRVADRLKASGL
SGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLPENGSQLHHS
>CP0122 ipaJ, IpaJ, invasion plasmid antigen
MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQ
GCFLARKQMYDNSCGAASLLCAAKELGVDKIPQYKGSMSEMTRKSSLDLD
NRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVE
ESNIFSQVISFIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIG
VPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNSNFIGYTKIGI
SIVITNEAL
>CP0131 ipgA, IpgA, similarities to IpgE, putative chaperone
MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL
NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY
LSPDELIESLYEFLFCIKLTIANITSEVN
>CP0130 ipgB1, IpgB1, secreted by the Mxi-Spa machinery, function unknown
MQILNKILPQVEFAIPRPSFDSLSRNKLVKKILSVFNLKQRFPQKNFGCP
VNINKIRDSVIDKIKDSNSGNQLFCWMSQERTTYVSSMINRSIDEMAIHN
GVVLTSDNKRNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK
ILKRYSSDMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA
VYRQSNTN
>CP0024 ipgB2, IpgB2, probably secreted by the Mxi-Spa secretion machinery
MLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSILSS
VSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSRKIGDNLRKQIFK
QVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTS
NVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF
>CP0129 ipgC, IpgC, cytoplasmic chaperone for IpaB and IpaC
MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKG
RIEEAEVFFRFLCIYDFYNVDYIMGLAAIYQIKEQFQQAADLYAVAFALG
KNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAI
QDIKE
>CP0133 ipgD, IpgD, secreted by the Mxi-Spa machinery, modulates entry of bacteria into epithelial cells
MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEA
LNRLYLQNQTSLTGKSLLFARDRAEVFCEAIKLAGGDTSKIKAMMERLDT
YKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKN
WGPVDKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRES
DHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE
LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVN
ALKGLNSKRGEPTKLLIRNSDGLLKEVSVNLKVVTFNFGVNELALKMGLG
WRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ
IKEIVTKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSGKDRTGMQD
AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV
PGNKVMKKLPLSSLELSYSERIGDPKIWNMVKGYSSFV
>CP0134 ipgE, IpgE, cytoplasmic chaperone for IpgD
MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL
PENINDLIYALSLNYSEKICLATDDEGGNLIARLDLTGINEFEDVYVNTE
YYISRVRWLKDEFARRMKGY
>CP0135 ipgF, IpgF, periplasmic protein, similarities to the catalytic site of lysozymes
MSRFVFILLCFIPYLGRADCWDKAGERYNIPSSLLKAIAEKESGFNKSAV
NVNNNGSKDYGIMQINDFHSKRLREMGYSEEMLISHPCLSVHYAAKLLNE
FMMMYGRGWEAVGAYNAGTSPKKKKERLKYAEDIYRRYLRIAAESKQNNR
RI
>CP0082 ipgH, invasion plasmid gene product
MRQLVISITEGLNMSLFTEPKEIERLPSEEIERLYPVLRYRVFISIFLGY
MGYYFVRNTTSVLSGVLHMSATEIGIISCAGFLSYGISKFVSGLISDRSN
SKVFLSLGLFLSGLVNFLIGYIPGIITSVTLFSTMYLLNGWIQGMGYPPG
AKTLVFWYEHRERITWATLWNLSHNVGGALAPVLIGFSFGFFGDSALDHA
RAAFIFPGVLCMAMSVLIYFIQVDRPVSVGLPPIEEWKGNVVSHPAKGRE
QGPRLSIPDIIRKHIIRNNKLIYCCIYGSFVYILRYGIVSWAPKFLSDSL
DVGGKDMGKLASMGGGSVFEIGGVAGMLLAGYLSVRLFRNSKPLTNTLFL
ALTIILLIAYWYVPSGNEYLWLNYTILILLGLAVYGPVMFIGLYSMELVP
KEAAGAASGLSGTFSYIFGSIVATLGMGLVVDYLGWGATFIVLILSAVFA
IIFTLMSRERSLEFEKE
>CP0010 mkaD, mouse killing factor
MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYY
FKVASNVPTYSDICQSFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG
DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG
AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV
RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE
>CP0228 mob9, plasmid mobilization protein
MSLAGNPCVIRLVAQVCMWLKFIIRDRGGFSGGLLLFLPVCCRDRTERIL
AVHTIKILR
>CP0238 msbB2, MsbB2, probable lipid A transacetylase
MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIG
RWVGRLSRKARRRATINLSLCFPEKSDTEREIIVDNMFATALQSIVMMAE
LAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAA
QGEKMAAMFHQQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGY
WGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQAMIIPLFPVYD
EKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLK
LLKTRKSNEADPYP
>CP0245 mvpA, plasmid maintenance protein
MLKFMLDTNICIFTIKNKPASVRERFNLNQGKMCISSVTLMELIYGAEKS
QMPERNLAVIEGFVSRIDVLDYDAAAATHTGQIRAELARQGRPVGPFDQM
IAGHARSRGLIIVTNNTREFERVGGLRTEDWS
>CP0246 mvpT, plasmid maintenance protein
METTVFLSNRSQAVRLPKAVALPENVKRVEVIAVGRTRIITPAGETWDEW
FDGHSVSTDFMDNREQPGMQERESF
>CP0147 mxiA, MxiA, innermembrane protein, component of the Mxi-Spa secretion machinery
MIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLAIL
VFMGSFYIERILSFSTFPSVLLITTLFRLALSISTSRLILVDADAGKIIT
TFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGMPG
KQMSIDADLKAGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAIAG
IIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLVSQIPALLISI
SAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFPFF
VFFLIAVTLTALFYYKKVVEKEKSLSESDSSGYTGTFDIDNSHDSSLAMI
ENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPTIL
YRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVVST
SYNERVISWVDVSYTENLTNIDAKIKSAQDEFYHQLSQALLNNINEIFGI
QETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLKLI
MESLALWAPREKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYIED
AIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAKKNFVLLVSVD
IRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI
>CP0146 mxiC, MxiC, secreted by and putative component of the Mxi-Spa secretion machinery, similarities to YopN (secreted by the type III secretion machinery of Yer
MLDVKNTGVFSSAFIDKLNAMTNSDDGDETADAELDSGLANSKYIDSSDE
MASALSSFINRRDLEKLKGTNSDSQERILDGEEDEINHKIFDLKRTLKDN
LPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVELLTKIINEII
SGSEKSVNAGINSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYI
EWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGFVLSKLIAIKM
IRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVI
EVSRASHEDSVVYQTYLSSVNESPHDIFKSESEREIAINILRELVTSAYK
KELSR
>CP0145 mxiD, MxiD, outermembrane protein of the secretin family, component of the Mxi-Spa secretion machinery
MKKFNIKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFE
RFSALLNYPIVVSKQAAKKRISGEFDLSNPEEMLEKLTLLVGLIWYKDGN
ALYIYDSGELISKVILLENISLNYLIQYLKDANLYDHRYPIRGNISDKTF
YISGPPALVELVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNM
RGEDIVIPGVATVVERLLNNGKALSNRQAQNDPMPPFNITQKVSEDSNDF
SFSSVTNSSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDIAKRH
IELSLWIIDIDKSELNNLGVNWQGTASFGDSFGASFNMSSSASISTLDGN
KFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS
LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSQSNYNYNNENTSVLP
EVGRTKISTIARVPQGKSLLIGGYTHETNSNEIISIPFLSSIPVIGNVFK
YKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSE
TTLLEDEKSLVSYLNY
>CP0144 mxiE, MxiE, similarities to transcriptional activators of the AraC family, function unknown
MEGFFFVRNQNIKFSDNVNYHYRFNINSCAKFLAFWDYFSGALVEHSHAE
KCIHFYHENDLRDSCNTESMLDKLMLRFIFSSDQNVSNALAMIRMTESYH
LVLYLLRTIEKEKEVRIKSLTEHYGVSEAYFRSLCRKALGAKVKEQLNTW
RLVNGLLDVFLHNQTITSAAMNNGYASTSHFSNEIKTRLGFSARELSNIT
FLVKKINEKI
>CP0136 mxiG, MxiG, component of the Mxi-Spa secretion machinery, contains one transmembrane segment
MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDN
FPENIIPVTDSKSDGIIYLTISKDNICQFSDEKGEQIDINSQFNSFEYDG
ISFHLKNMREDKSRGHILNGMYKNHSVFFFFAVIVVLIIIFSLSLKKDEV
KEIAEIIDDKRYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYIL
VSNKEINRIQQYINQRFPFINLYVLNLVSDKAELLVFLSKERNSSKDTEL
DKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNK
VTYSVREELTDEKLELINRLISEHKNIYGDQYIEFSVLLIDDDFKGKSYL
NSKDSYVMLNDKHWFFLDKNK
>CP0137 mxiH, MxiH, component of the Mxi-Spa secretion machinery
MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEY
QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNFR
>CP0138 mxiI, MxiI, component of the Mxi-Spa secretion machinery
MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIME
MVSNPESLNPESLAKLQTTLSNYSIGVSLAGTLARKTVSAVETLLKS
>CP0139 mxiJ, MxiJ, lipoprotein, component of the Mxi-Spa secretion machinery
MIRYKGFILFLLLMLIGCEQREELISNLSQRQANEIISVLERHNITARKV
DGGKQGISVQVEKGTFASAVDLMRMYDLPNPERVDISQMFPTDSLVSSPR
AEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLEEKNISSKPMHISVI
AIYDSPKESELLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPV
KEVKSEFLTNEVIYLFLGMAVLVVILLVWAFKTGWFKRNKI
>CP0140 mxiK, MxiK, putative component of the Mxi-Spa secretion machinery
MIRMDGIYKKYLSIIFDPAFYINRNRLNLPSELLENGVIRSEINNLIINK
YDLNCDIEPLSGVTAMFVANWNLLPAVAYFIGSQESRLINHSEMVISYYG
GKISKQGEAAIRSGFWHLIAWKENISVGIYERINLLFNPIALEGNYTPVE
RNLSRLNEGMQYAKRHFTGIQTSCL
>CP0142 mxiL, MxiL, secreted by and putative component of the Mxi-Spa secretion machinery
MINQINASNALQQRLNSEEVVNLNERLSSSQSFDEDIIYEIMQYFSQSEL
NSIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA
SLSIYNNIDFYIKTTTFDSLISVFEAGREADDSTW
>CP0143 mxiM, MxiM, lipoprotein, component of the Mxi-Spa secretion machinery
MIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPVSKDYFSIPN
DLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKEVDGCFMDAQ
KIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGI
>CP0141 mxiN, MxiN, putative component of the Mxi-Spa secretion machinery
MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVI
KEATKKAESLRIDAVCDGYQIGIQTAFEHIIDYICEWKLKQNENRRNIED
YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLE
LDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL
TKNDKKYFKELAHKKLRQIAEDLLKENPVND
>CP0003 ospB, OspB, protein secreted by the Mxi-Spa secretion machinery, function unknown
MNLDGVRPYCRIVNKKNESISDIAFAHIIKRVKNSSCTHPKAALVFLGEK
GFCDSNDVLSIMGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQS
EPIVINNDDDALNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSH
QLGLGSELIDVQTIISRMKDCGILNVKDIRFTSCGSADKVAPKNFNNAPA
ESLSCILNSLPFFKEKESLLEQIKKHLENDESLSDGLKISGYHGYGVHYG
QELFPYSHYRSTSIPADPEHTVKRSSQKKTFIINKELD
>CP0094 ospC1, OspC1, secreted by the Mxi-Spa secretion machinery, function unknown
MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS
ANIIKSFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL
DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN
IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISNHQKKHPLNTK
HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV
NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY
GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN
YEEINKQVTNKKIALQALFLSITNQKEDVALYILSNFEITRQDVISIKHE
LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE
NAEMIKLLLKYGATSDNKYI
>CP0063 ospC2, OspC2, probably secreted by the Mxi-Spa secretion machinery, function unknown
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK
HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS
NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR
FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIRLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDI
AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFISNGLVDVNKRFQKAN
SGDTMLDNAMKSKDSKTIDFLLKNGAVSGKRFGR
>CP0005 ospC3, OspC3, probably secreted by the Mxi-Spa secretion machinery, function unknown
MKNFLRKSIAAQSYSKMFSQGTSFKSLNLSLEAPSGARSSFRSLEHLDKV
SRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLSNIILNIKS
FDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKLPLNKTHHT
VDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFLDNFKEVVD
EVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQGFREFCYNK
NIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYE
EINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDVAEMEKMKN
NRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKANSGDTMLDN
AMKSKDSKMIDFFIKKWSGIRQTI
>CP0115 ospC4, OspC4, probably secreted by the Mxi-Spa secretion machinery, function unknown
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK
HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS
NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR
FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV
AEMEKMKNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKVN
SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI
>CP0022 ospD1, OspD1, secreted by the Mxi-Spa secretion machinery, function unknown
MSINNYGLHPANNKNMHLIIGSNTANENKGMKNNIINVTNTAISHAINEE
KSGGGYSGVSFRKLAKIQNISIPTKNNKEYNRHNLFSLIWHGNADAARKY
GESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT
FDLSPKETIKLLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLT
EIADRLNNNEQDMFNIISDKIQELF
>CP0009 ospD2, OspD2, probably secreted by the Mxi-Spa secretion machinery, function unknown
MPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQFKNK
TAPYFSEKRNVEVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKVNYQ
LLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMKKNG
DFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTVFTC
DSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKLLPD
ELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSDGTP
AFYIALQNGYSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLCMSF
MNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNGHAD
SIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDILKI
LPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSFTTR
RLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIAEQF
SKKMKKTFIEIINRFNHFL
>CP0093 ospD3, OspD3 (SenA), probably secreted by the Mxi-Spa secretion machinery, function unknown
MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL
NCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS
EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM
ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL
DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPDNLLHPKVIYHAMRMG
LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL
AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL
LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL
ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK
LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA
SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDK
KNGSDFLEIMKNIKS
>CP0227 ospG, OspG, secreted by the Mxi-Spa secretion machinery, function unknown
MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSA
LYKKYDLIGNQYNEILEMAWQESELFNAFYGDEASVVIQYGGDVYLRMLR
VPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKES
ESLFPIDFRNIYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL
>CP0031 parA, plasmid segregation protein
MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKSAVAKLP
KLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPK
YRDRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLD
LDPQSSATMFLNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGV
YVMPASIDDAFIASNWDTLCEEHLLGQNKHAILRENIIDKLKHDFDFILI
DTGPHLDAFLKNAIAAADIMFTPVPPAQVDFHSTLKYLARLPELVQIIEQ
DGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIFGGDMLDVSMPRLDGFE
RSGESFDTVISANPVTYVGSGEALKNARMAAEDFAKAVFDRIEFIRANY
>CP0032 parB, plasmid segregation protein
MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTEIKVDHD
KVDTQTFVVEEVNGREQTALTPDSLKDITRTIRLQQFYPCIGIRTGDLIE
ILDGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREI
GLRLVRLKEAGMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSE
LTYADYRQLAELSERLRLGDISIDEVVKNISPSIELITADDNLSEDEVKN
SIMRLITKEMSSLLDSGVKDKAVVTLLWKFDSKDKFARKRVKGRTFSYEF
GRLPLEVQDKLDRMIALVLKDNLNSL
>CP0190 phoN1, PhoN1, periplasmic non-specific acid phosphatase
MKRQLFTLSIVGVFSLNTFASFPPGNDVTTKPDLYYLTNDNAIDSLALLP
PPPQIGSIAFLNDQAMYEKGRLLRNTERGKLAAEDANLSSGGVANVFSAA
FGSPITAKDSPELHKLLTNMIEDAGDLATRSAKEYYMRIRPFAFYGVSTC
NTKEQDTLSRNGSYPSGHTSIGWATALVLSEINPARQDTILKRGYELGDS
RVICGYHWQSDVDAARIVGSAIVATLHSNPVFQAQLQKAKDEFANNQKK
>CP0004 phoN2/apy, PhoN2 (Apy), periplasmic phosphatase, apyrase, ATP diphosphohydrolase
MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPPPPAENS
VVFQADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISREN
TPILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDEKMA
ITGSYPSGHASFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQ
SDVEAGRLMGASVVAVLHNTPEFTKSLSEAKKEFEELNTPTNELTP
>CP0260 repA, RepA, replication protein
MTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFDFAIH
VAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIE
CGLATESAAGKLSITRATRALTFLAELGLITYQTEYDPLIGCYIPTDITF
TSALFAALDVSEEAVAAARRSRVEWENRQRKKQGLDTLGMDELMAKAWRF
VRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREISEGRF
TASREAVKREVERRVKERMILSRNRNYSRLATASP
>CP0258 repB, RepB, replication protein
MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN
PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK
>CP0236 rfbU, UDP-sugar hydrolase
MNILFTESSPNIGGQELQAVAQMKALKKMGHSVLLVCRENSKIAFEASKL
GIDITFALFRNSLHIPTAWRLLGIVHGFQPNAIVCHSGHDSNIVGLVRLF
TRKHPFRIIRQKTYLTRKTKVFSINHFCDEVIVPGTSMKTHLEQEGCRTR
VTVVPPGFDFQKLYVDSRNSLPPNVLSWLASRRGCPVIAQVGMLRPEKGH
EFMLNLLFHLKMNGRQFCWLIVGSGSPELREHLQYQIDSMGMHDDVFIAD
NVFPAAPVYRVASLVVLPSENESFGMVLAEASAFSVPVLASQIGGIPDVI
QNNQTGTLLPAGNKHAWMCALNDFFNDPGRFYQMARQAKQDIEERFDINK
TALKILTLAKHK
>CP0070 sepA, SepA, extracellular serine protease of the IgA1 protease family, secreted by a C-terminal autotransporter domain
MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVAALSLSS
AWPALSATVSAEIPYQIFRDFAENKGQFTPGTTNISIYDKQGNLVGKLDK
APMADFSSATITTGSLPPGDHTLYSPQYVVTAKHVSGSDTMSFGYAKNTY
TAVGTNNNSGLDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTEFY
RLGGGMQYVKDKNGNRTQVYTNGGFLVGGTVSALNSYNNGQMITAQTGDI
FNPANGPLANYLNMGDSGSPLFAYDSLQKKWVLIGVLSSGTNYGNNWVVT
TQDFLGQQPQNDFDKTIAYTSGEGVLQWKYDAANGTGTLTQGNTTWDMHG
KKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRVSALNGQTW
MGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDG
TVILNQQADADGKVQAFSSVGIASGRPTVVLSDSQQVNPDNISWGYRGGR
LELNGNNLTFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNT
VSIFGGRGAPGDLYYDSSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDR
NKAIDTVKQQKIEASSQPYMYHGQLNGNMDVNIPQLSGKDVLALDGSVNL
PEGSITKKSGTLIFQGHPVIHAGTTTSSSQSDWETRQFTLEKLKLDAATF
HLSRNGKMQGDINATNGSTVILGSSRVFTDRSDGTGNAVFSVEGSATATT
VGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRVGS
FVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSP
VISTTEGINLEDNASFSVKNMGYLSSDIHAGTTAATINLGDSDADAGKTD
SPLFSSLMKGYNAVLRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSR
IELGDGKHFATLQVKELSADNTTFLMHTNNSRADQLNVTDKLSGSNNSVL
VDFLNKPASEMSVTLITAPKGSDEKTFTAGTQQIGFSNVTPVISTEKTDD
ATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNNLNKRMGDLRDTQ
GDAGVWARIMNGTGSADGDYSDNYTHVQIGVDRKHELDGVDLFTGALLTY
TDSNASSHAFSGKNKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANF
ASLGTKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWE
DRGMALSMKDKDYNPLIGRTGVDVGRAFSGDDWKITARAGLGYQFDLLAN
GETVLQDASGEKRFEGEKDSRMLMTVGMNAEIKDNMRLGLELEKSAFGKY
NVDNAINANFRYVF
>CP0235 shf, putative carbohydrate transport protein
MLNEGGILFKANHVPVLMYHHVSHCPGLVTLSPVTFRKQIKWLAENNWKT
LSSDELEFFYRGGKLPRKSVMLTFDDGYLDNWFQVYPLLKEFNLKAHIFL
ITGFIGNGPVRHSPGKEYSHRDCEHQIATGNADNVMLRWSEVNEMLQSGL
VEFHVHTHTHTRWDKKFSSREEQCKHLRQDLLSGREYLKEMTGKCSKHLC
WPEGYYNKDYIQVAEELGFYYLYTTERRMNAPAKGTTRIGRISTKERESC
AWLKRRLFYYTTPFFSSLLAFHKGPRLPDD
>CP0150 spa13, Spa13, component of the Mxi-Spa secretion machinery
MEALDKRIIYFLQLENDLEPVGAQSVSQLFNTRRKIAIVKKHIIQYQSER
ILLKGRIEEIQKDIDEANASKRKLLHKESKICKRIGLIKRNNFAKQLILD
ELSQEDMKYGIR
>CP0148 spa15, Spa15, putative component of the Mxi-Spa secretion machinery
MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNE
QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL
RVVIKDDYVHDGIVFAEILHEFYQRMEILNGVL
>CP0153 spa24, Spa24, component of the Mxi-Spa secretion machinery
MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNM
TLNGIALIMALFVMKPIIEAGYENYLNGPQKFDTISDIVRFSDSGLMEYK
QYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDA
FKIGFYLYLPFVVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGW
GILSKALIEQYINIPA
>CP0155 spa29, Spa29, component of the Mxi-Spa secretion machinery
MDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIFL
VASGLITSGKVDIGSSVFEHVYFLMFKEIIVGLLLSFCLSLPFWIFHAVG
SIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILES
IQLSYNICPLFSQCSFRVSNILTFLTLLASQAVILASPVMIVLLLSEVLL
GVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKVQFFLGEHKFF
TNLFVR
>CP0151 spa32, Spa32, secreted by and component of the Mxi-Spa machinery
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN
YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP
DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR
NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL
QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV
GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>CP0152 spa33, Spa33, component of the Mxi-Spa secretion machinery
MCGDWVIRIDTLSFLKKKYEVFSGFSTQESLLHLSKCVFIESSSVFSIPE
LSDKITFRITNEIQYATTGSHLCCFSSSLGIIYFDKMPVLRNQVSLDSLH
HLLEFCLGSSNVRLATLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDN
NEAKINLSESNGESEHTEVSLALFNYDDINVKVDFILLEKNMTINELKMY
VENELFKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE
>CP0156 spa40, Spa40, component of the Mxi-Spa secretion machinery
MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDV
MLLYRYVIINDFEINEGKYFFAVVIVFFKIIGFPLFFCVLSAVLPTLVQT
KFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYF
FWINDRKIIFSQVFSSVDGLYLIWGRLFKDIILFFLAFSILVIILDFVIE
FILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEILSEQTKSDIRN
SKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPT
VRDVKLARKLYKTHTKYSFVDFEHLDEVLRLIVWLEQVENTH
>CP0149 spa47, Spa47, component of the Mxi-Spa secretion machinery, putative ATPase
MSYTKLLTQLSFPNRISGPILETSLSDVSIGEICNIQAGIESNEIVARAQ
VVGFHDEKTILSLIGNSRGLSRQTLIKPTAQFLHTQVGRGLLGAVVNPLG
EVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGIKVIDSLLTCG
EGQRMGIFASAGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYL
KNSEKKSRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTEGHKVALFIDS
LTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITA
FYTVLLEDDDFADPLAEEVRSILDGHIYLSRNLAQKGQFPAIDSLKSISR
VFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK
ISVVESFLKQDYRLGFTYEQTMELIGETIR
>CP0154 spa9, Spa9, component of the Mxi-Spa secretion machinery
MSDIVYMGNKALYLILIFSLWPVGIATVIGLSIGLLQTVTQLQEQTLPFG
IKLIGVSISLLLLSGWYGEVLLSFCHEIMFLIKSGV
>CP0195 stbA, plasmid stable inheritance protein
MLKVSCDDGSTNVKLAWLEDGEVRTSLSGNSFKEGWNPGLFNAGKVYNYV
VDEKKYTYDLGSTAVIGTTHVSYQYSTTNLLAIHHALLTSGLQPQDVELT
VTLPVTEFFDNDNQPNEERIERKKANVLREISLNKGETFKIKKVNVMPES
LPAAFESLKKDKVNKLERSLIIDLGGTTLDCGLILGAFEGISEIRGYSEI
GTSRITHTVMNALTKASTPCNYFIADELIKNRHDNEYLQTLINDVAEIKN
ISHVIDREVKSLAESIRQEISTFSGMNRIYLTGGGAELIYPHIKQYFPNL
KVNKVDEPQFALVKAMVHA
>CP0194 stbB, plasmid stable inheritance protein
MESSDPKKRKKVVAYLHPALYPQDNLTQQTIDSLPVQMRGDFYRQSLICG
AALYSVAPRLLTLISVFFSEKITAENLVKLIEQTTGYTSTSIDISVLKNI
IEASSENKSESITSKDDFEEQTRRNLSMLKK
>CP0259 tap, TapA
MPGKVQDFFLCSLLLRIVSAGWCD
>CP0243 traD, DNA transport protein
MSVKLRLPQISESGEVVDMAAYEAWQQENHPDTWQQMQRREEVNINVHRE
RGEDVEPGDDF
>CP0250 traX, F pilin acetylation protein
MTTDNTNTTRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLDHIN
LIFQLKQEWMFLAGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWGIIA
QFAYYLAGFPWYEGNILFAFAVAAQVLTWCTTRSGWRTAAAILLMALWGP
LSGTSYGIAGLLMLAVSNRLYRAEDRAERLALVACLLAVIPALNLATSDA
AAVAGLVMTVLTVGLVLCAGKSLPRFWPGDFFPTFYACHLAVLGVLAL
>CP0244 trbH, orf, conserved hypothetical protein
MNRSAPVFSSQAAHTFKFPGVISHNNQPPTAGMTCDHLIKWPDRASLTGK
FCSYLAGVCGCSSVVIQNINAGNKSLDHSEITFRHLAFFCTIYQLHQGDR
TDTHFPLVQVKTLPDAGGFVLYRKNADVGIEHKLQHQNDSLSCMPGCSLL
SIKSVLTLCPSNHSSHVSPAGVMILVRPTAITSTRFTFSGNATAFGSLTA
WLRLLRNTVVSIICLLMWICLVYIYCGIDTGICQRDIRL
>CP0185 ushA, UshA, probable periplasmic UDP-sugar hydrolase
MIPLKKNITLIMFTLSLLTGNPAIAYETDKVYKITVLHTNDHHGHFWRNN
HGEYGLSSQKTLVDNIRQKVINNGGSVLLLSGGDINTGVPESDLQKAEPD
IRGMNLIGYDAMAVGNHEFDNPLNILRQQEKWATFPFLSANIYQKSTGRR
LFSPWKIFIRQNLKIAVIGLTTDDTAKTGNSEYFTDIEFRQPAAEARSVI
DELNQQEKPDIIIAATHMGHYDNGESGSNAPGDVEMARSLPTGSLAMIVG
GHSQAPVCMASDNKKQWNYIPGTTCVPDKQNGIWIVQAHEWGKYVGQADF
EFCNGTMKLVNYQLHPVNLKMRITREDGKTEFSFYTPEITEDPQMLSLLT
PFQNKGKAQLDVKVGVVNGRLEGDRSKVRFVQTSMGHLILSALTERIDAD
FAVVSGGEIRDSIESGNITYKDILKVQPFGNTVVSIDLTGKEVADYLATV
AQMKPDSGAYPQFLNTSFVVKKGKIEMLKIKGKSVDLNKKYRMTTFSFNA
TGGDGYPRIDNRPGYINTGFIDAEVLIEYIRKHSPLDAASYEPKGEVSWQ
>CP0181 virA, VirA, secreted by the Mxi-Spa secretion machinery, function unknown
MQTSNITNHERNDSSWMSTVKSTTEVSWNKLSFCDILLKIITFGIYSPHE
TLAEKHSEKKLMDSFSPSLSQDKMDGEFAHANIDGISIRLCLNKGICSVF
YLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKP
APEVIETAINCCTSIIPNDDYFHVKDTDFNSVWHDIYRDIRASDSNSTKI
YFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVNKIINSNFPQQ
DLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSEN
IENRLNFMPEQRVINNCGHIIKINAVVPKNDTAISASGGRAYEVSSSILP
SHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV
>CP0123 virB, VirB, transcriptional activator required for transcription of the ipa, mxi, and spa operons
MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTN
VRDQESLTEESLADIIKTIKLQQFFPVIGREIDGRIEILDGTRRRASAIY
AGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMS
YKDIAKKENLSRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYY
KGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNIIKKSKNRKQN
PSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRD
IISRHLSSS
>CP0046 virF, VirF, member of the AraC family of transcriptional activators, required for transcription of virB and icsA
MMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEGQIAFIER
NIQINVSIKKSDSINPFEIISLDRNLLLSIIRIMEPIYSFQHSYSEEKRG
LNKKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSDEEALYTS
ISIASSLSFSDQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRKRLESEKL
TFQQILLDIRMHHAAKLLLNSQSYINDVSRLIGISSPSYFIRKFNEYYGI
TPKKFYLYHKKF
>CP0237 virK, VirK, required for proper localization of IcsA (VirG) at the surface of bacteria
MFSVSNLSFIGFLKRIVFSSDSLPGKWEHRKFRFMYILRCAINPVASIRY
YYELRSLQCIEDILAIQPTLPARIHRPYLHKGGRAWSRGQYILEHYRFVQ
NLPEKYSEFLFPQKSVSLVQFIGKDGEDFDIQCSPSGFDREGELMLSLFF
NKIVIARLTFSVILTQNGHTAFIGGLQGAPKNTGPDIIRCATRACYGLFP
KRIIFEAFCALMKACNVSECLAVSEHSHVFRQLRYWYQKRKTFVAVYSDF
WESVAGKTCGDWYKLPTQVVRKPLSNIASKKRSEYRKRYALLDYIHETAI
RSLDAYPVNSEHYDLN
>CP0196 yccB, orf, conserved hypothetical protein
MRHGLMEAACERRIPMPNWCSNRMYFPGEPAQIAEIKRLASGAVTPLYRR
ATNEGIQLFLAGSAGLLQITENIRSEQCPGVTAAGRGAVSTENIAFTRWL
THLQNGVLLDEQNCLMLHELWLQSGTGQRRWEGLPDDARETITVHFTAKR
GDWCDIWGNEDVSVWWNRLCDNVVPEKTMPFDLLTVLPTRLDVEVNGFNG
GVLNGVPSAYHWYTERYGVKWPCGYDLNISSQGENFIQVDFDTPWCQPES
DVIAELSRRFSCTLEHWYAEQGCDFCGWQLYERGELVNVLWGELEWSSPT
DDDEQPEVTGPAWIVDNVAHYGG
>CP0198 ycdA, orf, conserved hypothetical protein
MSRFVLGNCIDVMARIPDNAIDFILTDPPYLVGFRDRQGRTIAGDKTDEW
LQPACNEMYRVLKKDALMVSFYGWNRVDRFMSAWKNAGFSVVGHLVFTKN
YTSKAAYVGYRHECAYILAKGRPRLPQNPLPDVLGWKYSGNRHHPTEKPV
TSLQPLIESFTHPNAIVLDPFAGSGSTCVAALQSGRRYIGIELLEQYHRA
GQQRLAAVQRAMQQGAANDDWFMPEAA
>CP0199 yceA, orf, conserved hypothetical protein
MNYAGHEKLRAEVAEVANAMCDLRTTMNEMERRYSFNADTLPERLVRQTL
FRANRLLMEAYTEILELDSCFKD
>CP0202 ycfA, orf, conserved hypothetical protein
MNETLNALICRHARNLLLAQGWPEETDVDQCNPNYPGWISIYVRLDAPRL
ATLLVNRHDGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQV
SFPYAGEWLTEDEIRAVLDAVRDAVCSVSCRGAEDARRIRAALTTSGQTL
LTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV
SDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARAEIRSMADA
LAKIRK
>CP0252 yigA, orf, conserved hypothetical protein
MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD
WQQFARKRAEHCHRRCRGRV
>CP0253 yigB, orf, conserved hypothetical protein
MLHYSGGLKYRWHLSDMENNMRKYIPLALFIFSWPVLSADIHGRVVRVLD
GDTIEVMDSLKAVRIRLVNIDAPEKKQDYGRWSTDMMKSLVAGKTVTVTY
>CP0257 yihA, orf, conserved hypothetical protein
MKLIIFILIVLIIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILS
SGEYWQRQKTLLTEREVSFMKGLFRIVDMKRWYLCPQVRVADIVQLNGNI
RPRSRQWWQLFRMVSQWHVDVVIVELRSFSIVAAVELDDASHLRPERRRR
DILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNTTGADQQSPEHRS