Gene list
Applied filters:
COG category: Unclassified
Organism: Shigella flexneri
Gene type: CDS
Number of genes found: 128
# Shigella flexneri >gid:1155068 S0005 orf, hypothetical MLPSETMIWQPEFTDKTFSRKLGAVPFTTCNVVLQGNGLPIPYVDQYNRN DNFRFRAQPKYILGHLSNRLPDTAPFFNKKINHF >gid:1155069 S0006 orf, hypothetical MIKFSLKIYKHIRIHTLRILKKSLTTILFFGVEISNHQEKLPLNKTHHTV YFGANAYIIDHDSPYGYMTLTEHFDNAIPPVFYHEHQSFFLDNFKEVVDE VSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIGKSKDQGFREFCYNKN IDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYEE INNKVTDKKMAHQALAYSLGDKKADIALYLLSKFNFTKQDVAEMEKMNNN IYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKANSGDTMLDNA MKSKDSKMIDFFIKKWSGIRQTI >gid:1155070 S0007 orf, hypothetical MKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFRSLEHLDKV SRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLSNNILNIKS FDKIQSENIQTHKNTYSEDIKEISNHDFVFWG >gid:1155071 S0008 ISEc8 orf, fragment MSRPSEINRLKALVAKLQRMQFGKSSEKLRAKTERWIQEAQERISALQEE MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS YAGAGLLAHVVTGKYADHLPLYRQSDLLFHTAI >gid:1155074 S0011 orf, hypothetical MQQRSAALHAAGAAYPGNIFVDTTFRPYPDQWAFLASMIPMNAHDIEPTI LRATGNTHPLDVTFIHEEDLATPWKPEQSSVYAHVNPYGIFELDMETRLP IEVVA >gid:1155079 S0016 orf, hypothetical MKVSFKSLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNACHKVAIF LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES GFVSFVNREGKICHTAYVKSSDNSMTYYHANGSSIDKYITDMCGLICMRH IESTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV >gid:1155082 S0019 IS91 orfB, fragment MRYGSLAGWRYSAFLMLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQ DEIGLRYNSHRTKREENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEA AITGRCGVRHNGDSEKNGEANHKERDVSAVTEG >gid:1155083 S0020 IS91 orfA, fragment MDAGNKKLVFWFVRVDDEGYPEIARCMEREFATIPAGINADGMYCPECGT VHWPDGVIPPF >gid:1155088 S0024 putative IS orf, fragment MISFPAGSRIWLVAGITDMRNGFNGLAQKFKTS >gid:1155089 S0025 putative IS orf, fragment MDTSLAHENARLRALLQMQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL VAKLQRMQFGKSSEKLRAKTERQILEAQERISALQEEMAETLGEQYDPVL PSPLRQSSARKPLPASLPRETRVIRPEDECCPACGGELSSLGCDVSEQLE LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS 
PDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRITEAACMAHVRRKIH DVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS LYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV >gid:1155090 S0026 IS3 orf, fragment MRATVFSYIECDYNRWRRHRWCGGLSPEQFENQSLTYDCVHIMWVGSIKT FYMCLLAARKFIPCRYHKHA >gid:1155091 S0027 putative transposase, fragment MISNEGEFMNEKQLTSNKLRALANELAKSLKNPEDLSQFDWMLKMKPYSM LI >gid:1155093 S0029 IS630 orf, fragment MMWPALHETITRNHQCRSIWPLLKKVRHFMETVSPLPGEKHSLDKV >gid:1155094 S0030 putative enterotoxin, fragment MSINNYGLHPANNKNMHLIIGSNTANENKGMKNNIINVTNTAISHAINEE KSGGGYSGVSFRKLAKIQNISIPTKNNKEYNRHNLFSLIWHGNADAARKY SESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT FDLSPKETIKLLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLT EIADRLNNNEQDMFNIISDKIQELF >gid:1155095 S0031 orf, hypothetical MPGATVADEFDKTLAFLEAIVNADNETTIGEIRSFADALDAVRFNRNKIN RQLSKPNLASLALEHEVIWLGRSR >gid:1155099 S0034 IS10 orf MCQQFNEITAMPVHKVCQNFFRDALAPFHQYRQNALMDATMALINGASLT QTSIGRFLPGNAQVKNKIKRIDRLMGNEALHRDIPMIFRNITSMLTRQLS LCVIAVDWSGYPSQEHHVLRASLLCDGRSIPLLSKVVPSEKQNNPLIQHD FLDSLAQSLPPDARVIIVTDAGFQSAWFHHITSLGWDFIGRIRNNVQYCL DNAPERWLKVSDSPECKTPEYMGAGRLVKERKKSIRGHFYTYKKSAKGRK KKRSKGQSGLNKTDKEQSKSAKEAWLIFSSTNDFRAREIIKLYSRRMQIE QNFRDEKNGRFGFGLRASKSRSTGRILVLSLLATLSTIVMWLLGYHAENK GLHLKYQANSIKSRRVISYLTLAKNVLRHSPLILRRTVLSTVLNHLSRTY RNMVLVY >gid:1155103 S0038 orf, hypothetical MHVTTGPSAPASSGWHVKQSASRVPWKSTKKSSAPSLKSTCSTDWKHYPR LLRQSRHRRPLPEHVSREIHHLEPEESCCPECGGELDYSGEISVFLPERH TILQ >gid:1155115 S0047 IS1294 orf MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >gid:1155135 S0064 IS1294 orf MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC 
DWVHLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM PA >gid:1155136 S0065 orf, hypothetical MQREKTPEWREKQKSSRGIRRGQRYRLVFQFPIRERCFGRLKEYRRIATR YDKTARNYLAMVKLGCIRLFYQRLRN >gid:1155137 S0066 orf, hypothetical MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR FREFCYNKNIDPVSLDRIINFVFQPEYHIPRMLSTDNFKKIRLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDV AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN SGDTMLDNAMKSKDSKTIDFLLKNGAVSGKRFGR >gid:1155143 S0072 putative IS orf, fragment MQKWNWPHSRGWTGITIDDCWKGWAILLRQKQKKLIMLPSETMIWQPEFT DKTLSRKPGAVHAVRQQRSKALLTSLNEWMVEKNGTLSKKSRLGEAFSYV LNQWDALCYYSDDGLAEADNNTAERALRTVCLGKKNYMFFGSDHGGDRGA LLYGLIGSCRLNGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTDK >gid:1155147 S0076 IS1294 orf, fragment MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD TAALTRLQDTGG >gid:1155148 S0077 IS1294 transposase MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >gid:1155153 S0081 orf, hypothetical MPGTTTAMSINFIGITARTMNSNGSHGKPQIPVDYQKLLSIEDITFCRNR WGNIGENALRRVAVGKKLSFFGSDRGGENAAII >gid:1155168 S0095 IS1294 orf, fragment MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI 
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC DWVHLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM PA >gid:1155171 S0098 orf, hypothetical MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS ANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISNHQKKHPLNTK HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN YEEINKQVTNKKIALQALFLSITNQKEDVALYILSNFEITRQDVISIKHE LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE NAEMIKLLLKYGATSDNKYI >gid:1155173 S0100 orf, hypothetical MTLPVFITVIADHDKPQPSGCLLESQGSLCPICRQRITHETGWNVHHKVK KVMGAVKNYLTLSCYIQIAIDSYTVVKPALSKRAYKGLSGVPGNRYAPFL GEGSPAMNCPYPTNIQNERNVLESAYNPL >gid:1155177 S0104 orf, hypothetical MKKQIFINNKPPVVPYSGTHAKIFKYIEIPLPFFYFIYTSGEPFHISVQN TVIYVSKYNGIFINKLVPFSLLFDRDISVLQRRDICVVRFTSEEISEHNV LFDHDIERLKKISKAQLISPDYVLIDFSSVGGGEMNPMQCPG >gid:1155184 S0110 orf, hypothetical MPFYFRKECPLNSGYLSLRASKSRSTGRILVLSLLATLSTIVMWLLGYHA ENKGLHLKYQANSIKSRRVISYLTLAKNVLRHSPLILRRTVLSTVLNHLS RTYRNMVLVY >gid:1155186 S0112 orf, hypothetical MCISFAIYCQYAIVVKQMIYGGFMKSGVQLNLRARESQRILIDAAAEILH KSRTDFILEMACKAAEDVILDRRVFNFNDRQYEEFIEMLDAPVADDPAIE KLLARKPQWDV >gid:1155187 S0113 orf, hypothetical MAQVDLHLAKWIARKHKRARGSLVRAFEWLLRIRHDCPTLFAHWCLAYDT >gid:1155188 S0114 orf, hypothetical MSQPKPFEVSKYAVWKAYQRVKANRGAAGVDGQSVEAFEVTNRSYTR >gid:1155189 S0115 orf, hypothetical MCQVGVAEMNESELSMKCRKEPLGDVENVLSTQRMTSPADIWYWLGGIRH IGSMNATQALARNVGTCRLDAKGAARSRGTASA >gid:1155191 S0116 putative IS orf MMSSAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVEL SRNTMVRWVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKT 
KTGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQLHLAKYQGVLQAD AYAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELY DIEAEIRGSPAEERLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAF DYILNHWNALNEFCRDGWVEIDNNIGENALRSVAVGRKNYLFFGSDKGGE SAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >gid:1155194 S0119 putative IS orf MNSQTAKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV VKLFDPLTPELLRALIREMKGGIR >gid:1155195 S0120 putative IS orf MDSINVRFSSPPQDSCLLLYDSMEFKLDLIEKSYQLGACVAQPAREYGIN VSAPSATSCENCPVWQQSRPSILMDTNDEFPDSKRYSLLPFLFA >gid:1155196 S0121 orf, hypothetical MTLTILLVSVISNLTSPVPRRGEGYQAQGLIIDTTVKIAFSKGNVIKFRG FQSENHTTY >gid:1155197 S0122 orf, hypothetical MKIPEAVNHINVQNNIDLVDGKTNPNKATKALQKNVLRVTNSSSSGISEK HLDHCANTVKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR SLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLS NNILNIKSFDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKL PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQG FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV AEMEKMKNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKAN SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI >gid:1155203 S0127 putative transposase MPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVPEHHAP LWQQASERMAERLGEIQKRVITVCDREADIWHYLYYKVSHGQRGACCTES PAGRGTRQALRTAGSPGNRRKPHAECDAKRRAGSPSGPDVHQLQRSQHKK SRQQRPGAPAHVCLLPGAGRGRCLLASADVRKSGECRRCTTYCQPLRATL ADRGIPQGVEKWWYMESLRMQTRDNLERMVVIQAFIAVRVLGLRQGGVSE ETQNDSCEKILTPTEWKLLWVKLEGKPLPVQAPTLKWAWGDGMTANAQVV PVGASCGMAGQTSGYG >gid:1155218 S0141 orf, hypothetical MQSQISYVHLIILSHKWGIDCSFVPRFTTKKILIQQGSLFFYNTPSLIHQ ALYLVGFHDETSTTYAHEVHIMYSKRKFDYVNRLKFLNLLIEYIIYQIHF TSPQSPSNGELIKYEPQN >gid:1155252 S0173 IS1294 orf MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH 
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA >gid:1155254 S0175 orf, hypothetical MTHMTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLY AWRSKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKR LK >gid:1155255 S0176 orf, hypothetical MTMKMNTALIVALMCMWYAVPAAAKETLLAMPRNSTEHCYAEINVHGPYG VYFRVVPHPPGGKSWVECNSDYYYSDKPPGVQILGTRAGCRVYGICGTTS TLHVAGRGVVCIKNICSPRGMIIHRIRKRPVVAVSDEM >gid:1155256 S0177 orf, hypothetical MKIKRSTFISNIFYIISWFLMNDNSLLRNSSLFIAYMGCVGWVSAYSYGW GTSFYYGFPWWVVGAGLDDVARSLLYAIIVMGILFTGWGIGILFFLLIKK RSKIQDLSFFRLFFAITLLFFPVIFELLILKQYFILPLSLSFIISSLVIS IIIRIYGRIFSVSCFSDIPFVREHRIKLIMAGFLVYFWLFSFLVGWYKPQ LKKEYQMLCYNNSWYYVLARYDSRLVLSSSFKDDSNRFLIFNTEQSGFYE INDVYVRK >gid:1155257 S0178 putative transposase MTAAALLAEMPESSSLSRREISALVGVAQVNRDSGTLRGRRTIFGDCAGE EQLCTWRRLRPPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTIMNAM LRKNEEWNESYL >gid:1155258 S0179 orf, hypothetical MTESRQEKLIWLRAQMKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLR PTVTSP >gid:1155264 S0184 IS91 transposase, fragment MRYGSLAGWRYSAFLMLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQ DEIGLRYNSHRTKREENLVMSGDEFMERFSWHVADKGFRIVIRGPESGEA AITGRCGVRHNGDSEKNGEANHKERDVSAVTEG >gid:1155265 S0185 IS91 orfA MACDYRYKNRQYHCLSGSYMARSAKPRKRKPAPQRSKLLRYVVKLHEDDF FDEEEAEVLRFDNFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP EIARCTEREFATIPAGINADGMYCPECGTVHWPDGVIPPF >gid:1155274 S0194 orf, hypothetical MPFLSRLGQSRYKLVTGLPKTNNKATGDAFFSVKILRGPEPGEVARQITW GSVQPELNRPGNPGD >gid:1155280 S0199 orf, hypothetical MPLICGCNVSFPAFPSKGTPMMLYATLGYDFKVRTLKNFKGELYRKCMPG ASNAAMCKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSR NFSYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLALVRKQYPKLFAHWYLL RASNE >gid:1155299 S0216 putative IS orf MMSSAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVEL SRNTMVRWVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKT KTGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQLHLAKYQGVLQAD 
AYAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELY DIEAEIRGSPAEERLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAF DYILNHWNALNEFCRDGWVEIDNNIGENALRSVAVGRKNYLFFGSDKGGE SAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK >gid:1155309 S0225 orf, hypothetical MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLSDMDKDSELAATLQ KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY ANSSSWKSKRLC >gid:1155311 S0227 IS1-cat orf MSDNGNAHAGNNYGHVDKLHRNDSADNEQGNDSNLLIVFYVQIMPDDLVM QLHRF >gid:1155328 S0241 putative iso-IS1 orf MYDGDFKVLAWLPFPSDVLPAPLLKAWCVTAKALPDISAISALIAVKHGN YSSLTPPLSPVRTRKSLIWP >gid:1155329 S0242 putative IS orf MAGCRLGVPKSTVCGMFVRFRNAGLSWPLPAGMSEQELDALLYGSASTVP VVLTESTVMPKLPVVKKRPRRPNADQLRIS >gid:1155335 S0247 putative IS orf MLTELLTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQELDALL YGSASTVPVVLTESTVMPKLPVVKKTSPAALMPVS >gid:1155359 S0269 Tn501 orf, hypotheical MCCGAVFSDSLLDAGRIARVCGCLGQLLHAGLGIVEGDDRLACLESHVNF ADAFDLGNRLLDSDRAGGAGHARYGQRDGLGGGPDGGNNGGEGEGGKQFL HGELRSVEKWHDVGKSERDQNQRGHDPENELVSSSHLGNRADLTRFAGRC LPVDAPPGEEQRHQRHADKDGAIGFQHRQVADPSAAEPQGDQNQRPEAAS RGEDGGKPSSEERAAPGFWFRHALVLSN >gid:1155361 S0271 Tn501 orf, hypothetical MLCRLGRQGGQLHFQIGQRFAPTLDELTQQGKLRGRFVAVRRIQRPAQPR QRVEADARLEGRPHEAQPLQGGVIEQAVAAWCARHRAQQSAQQVVAHDMH AHPGIKSQPGHRVGVH >gid:1155365 S0275 IS600 orfA, fragment MSRKTRRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGGLAEFGKN RTLRFSGHP >gid:1155376 S0281 IS1294 orf MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV HLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ MVKQFLSRDPFECVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG >gid:1155377 S0282 orf, hypothetical MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD TAALTRLQDTGG >gid:1155386 S0290 orf, 
hypothetical MSIEIKMISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYDK KYISGITRSMAQLKIEEFINEKSRRLNYMKTMYSPCPEDFQPISRAEAST PEGSWLTVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFSKENSDFL YIIVVFRNDSPQGELRANRFIELYDIKREIMQVLRDESPEIKVY >gid:1155387 S0291 orf, hypothetical MDTGLSEVLVFKLFRTEAATPTVPSPAIVIALDIIKHCCPHYFLIDKVFS VETFHLGTIILFMVMFIFEYRD >gid:1155170 ShET2-1 enterotoxin MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL NCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPDNLLHPKVIYHAMRMG LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDK KNGSDFLEIMKNIKS >gid:1155318 ccdB post-segregation toxin MPMRTGTGEMQFKVYAYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARL LSDKVSRELYPVVHVGDESWRMMTTDMASVPIFVIGEEVADLSHRENDIK NAINLMFWGI >gid:1155352 finO fertility inhibition protein MWYLWPQEDDSSEAEHSMTEQKRPVLTLKRKTEGTAPVRSRKTIINVTTP PKWKVKKQKLAEKAAREAELAAKKAQARQALSIYLNLPTLDEAVNTLKPW WPGLFDGDTPRLLACGIRDVLLEDVAQRNIPLSHKKLRRALKAITRSESY LCAMKAGACRYDTEGYVTEHISQEEEAYAGARLAKIRHQNRIKAELQAVL DEK >gid:1155367 hmo putative regulator MAKTKQEWLYQLRRCSSVNTLEKIIHKNRDSLSNSERESFNSAADHRLAE LITGKLYDRIPKEIWKYVR >gid:1155217 icsB invasion protein MSLKISNFIDASNTKGPIRVEDTEHGPILIAQKFNLKDLFFRTLSTINAK INSQILNEQLKNYRLENQKSLLLFLNTLASEKSAESAFAAYEAAKNSIQH SFTGRDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISI KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI SEQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD KVYIPLSGDNKTKDGKISHNLFGLDETNMSKFICKKKADAFRQLANYKLI SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNDVYAYANKVRQRIES LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMTRVLNELKTEATDKKEE IIEKSIKIIDYYNSLKSPDLGTKLYIHDLLQINKLLLNNSHSNI >gid:1155138 
insA IS1 repressor MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHKKIIDMAMNGVGC RASARIMGVGLNTVLRHLKNSGRSR >gid:1155122 insA IS1 repressor MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHKKIIDMAMNGVGC RASARIMGVGLNTVLRHLKNSGRSR >gid:1155139 insB IS1 transposase, fragment MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLNRPGNP GD >gid:1155310 insB iso-IS1 orfB MLPLSVNLMSSGASLAARLQQHWLWYAYNTKTGGVLAYTFGPRNDETCRE LLALFTPFCIY >gid:1155379 insB IS1-insB, fragment MRSLHFNSVTPSATKDKQVTRKGIFIQHMLYLERNNLTLRTRIKRLARKT ICFSRSVEIHEKSSAPSLKNTYSTDWKRHPKKYRFFTVNFI >gid:1155210 ipaA invasion protein MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEK ENFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEILEFSNVLYS LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLSRDLNTVAVFP ELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIH YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLSRNGNSL LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKVFNHTPDNSDGIGSDL LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKS SASLSHRVASQINKFNSNTDSKVLQTDFLSRNGDTYLTRETIFEASKKVT NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANIRN YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLK NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD >gid:1155212 ipaC invasion protein MLQKQFCNKLLLDTNKENVMEIQNTKPTQTLYTDISTKQTQSSSETQKSQ NYQQIAAHIPLNVGKNPVLTTTLNDDQLLKLSEQVQHDSEIIARLTDKKM KDLSEMSHTLTPENTLDISSLSSNAVSLIISVAVLLSALRTAETKLGSQL SLIAFDATKSAAENIVRQGLAALSSSITGAVTQVGITGIGAKKTHSGISD QKGALRKNLATAQSLEKELAGSKLGLNKQIDTNITSPQTNSSTKFLGKNK LAPDNISLSTEHKTSLSSPDISLQDKIDTQRRTYELNTLSAQQKQNIGRA TMETSAVAGNISTSGGRYASALEEEEQLISQASSKQAEEASQVSKEASQA TNQLIQKLLNIIDSINQSKNSAASQIAGNIRA >gid:1155211 ipaD invasion protein MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT LHNIRTTNQALKKELSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS RNEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYL KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALE 
ELKEKYKDKPLYPANNTVSQEQANKWLTELGGTIGKVSQKNGGYVVSINM TPIDNMLKSLDNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF >gid:1155378 ipaH1.4 invasion plasmid antigen MIRILVIMIKSTNIQAIGSGIMHQINNVYSLTPLSLPMELTPSCNEFYLK TWSEWEKNGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQI TTLEIRKNLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKI KELPFLPENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKL EGLALANNFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAG NPLSGHTMRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWF PENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAW LEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLF DNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEK LQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHA VLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGA QVMRETEQQIYRQLTDEVLALRLSENGSNHIA >gid:1155126 ipaH2.5 invasion plasmid antigen H MIKSTNIQVIGSGIMHQINNIHSLTLFSLPVSLSPSCNEYYLKVWSEWEK NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT MRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWFPENKQSD VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE QQIYRQLTDEVLA >gid:1155319 ipaH9.8 invasion plasmid antigen MSTGFNWMPIMLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPG EERDEAVSRLKECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLT NLPELPVTLKKLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLL TMNISYNEIVSLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDR NQISHIPESILNLRNECSIHISDNPLSSHALQALQRLTSSPDYHGPRIYF SMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFL DRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDR VALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKV RTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEA MVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYPQR 
VADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGS QLHHS >gid:1155207 ipaJ invasion plasmid antigen MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQ GCFLARKQMYDNSCGAASLLCAAKELGVDKIPQYKGSMSEMTRKSSLDLD NRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVE ESNIFSQVISFIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIG VPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNSNFIGYTKIGI SIVITNEAL >gid:1155216 ipgA invasion plasmid antigen MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY LSPDELIESLYEFLFCIKLTIANITSEVN >gid:1155215 ipgB invasion protein MQILNKILPQVEFAIPRPSFDSLSRNKLVKKILSVFNLKQRFPQKNFGCP VNINKIRDSVIDKIKDSNSGNQLFCWMSQERTTYVSSMINRSIDEMAIHN GVVLTSDNKKNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK ILKRYSSDMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA VYRQSNTN >gid:1155219 ipgD secreted protein MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEA LNRLYLQNQTSLTGKSLLFARDKAEVFCEAIKLAGGDTSKIKAMMERLDT YKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKN WGPVNKNISHHGKNYSFQLTPASHMKIGNKNIFVKEYNGKGICCASTRER DHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVS ALKGLNSKRGGPTKLLIRNSDGLLKEVSVNLKVVTFNFGVNELALKMGLG WRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ IKEIVNNKLQKNDNGEPYKLSQRVTLLAYTIGAVPCWNCKSGKDRTGMQD AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV PGNKVMKKLPLSSLELSYSERIGDPKIWNMVKGYSSFV >gid:1155220 ipgE invasion-associated protein MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL PENINDLIYALSLNYSEKICLATDDEGGNLIARLDLTGINEFEDVYVNTE YYISRVRWLKDEFARRMKGY >gid:1155076 mkaD mouse killing factor MPIKKPCLKLNLDSLNVVKSEIPQMLSANERLKNNFNILYNQIRQYPAYY FKVASNVPTYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE >gid:1155323 mob9 plasmid mobilization protein MSLAGNPCVIRLAAQVCMWLKFIIRDRGGFSGGLLLFLPVCCRDRTERIL AVHTIKILR >gid:1155232 mxiC invasion protein 
MLDVKNTGVFSSAFIDRLNAMTNSDDGDETADAELDSGLANSKYIDSSDE MASALSSFINRRDLEKLKGTNSDSQERILDGEEDEINHKIFDLKRTLKDN LPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVELLTKIINEII SGSEKSVNAGINSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYI EWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGFVLSKLIAIKM IRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVI EVSRASHEDSVVYQTYLSSVNESPHDIFKSESEREIAINILRELVTSAYK KELSR >gid:1155229 mxiE putative lipoprotein MSLKQGERQMIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPV SKDYFSIPNDLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKE VDGCFMDAQKIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRG I >gid:1155223 mxiH Type III secretion protein MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEY QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNFR >gid:1155224 mxiI Type III secretion protein MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIME MVSNPESLNPESLAKLQTTLSNYSIGVSLAGTLARKTVSAVETLLKS >gid:1155226 mxiK Type III secretion protein MGIQNRVVQEKQNMIRMDGIYKKYLSIIFDPAFYINRNRLNLPSELLENG VIRSEINNLIINKYDLNCDIEPLSGVTAMFVANWNLLPAVAYFIGSQESR LINHSEMVISYYGGKISKQGEAAIRSGFWHLIAWKENISVGIYERINLLF NPIALEGNYTPVERNLSRLNEGMQYAKRHFTGIQTSCL >gid:1155227 mxiL putative membrane protein MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVI KEATKKAESLRIDAVCDGYQIGIQTAFEHIIDYICEWKLKQNENRRNIED YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLE LDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL TKNDKKYFKELAHKKLRQIAEDLLKENPVND >gid:1155228 mxiM putative membrane protein MINQINASNALQQRLNSEEFVNLNERLSSSQSFDEDIIYEIMQYFSQSEL NSIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA SLSIYNNIDFYIKTTTFDSLISVFEAGREADDSTW >gid:1155372 repA1 replication protein MQDGVTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFD FAIHVAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITT LAIECGLATESAAGKLSITRATRALTFLAELGLITYQTEYDPLIGCYIPT DITFTSALFAALDVSEEAVAAARRSRVEWENRQRKKQGLDTLGMDELMAK AWRFVRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREIS EGRFTASREAVKREVERRVKERMILSRNRNYSRLATASP >gid:1155369 repA2 replication protein MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK 
>gid:1155371 repA6 positive regulator for repA1 expression MPGKVQDFFLCSLLLRIVSAGWCD >gid:1155145 sepA secreted protease MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVAALSLSS AWPALSATVSAEIPYQIFRDFAENKGQFTPGTTNISIYDKQGNLVGKLDK APMADFSSATITTGSLPPGDHTLYSPQYVVTAKHVSGSDTMSFGYAKNTY TAVGTNNNSGLDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTEFY RLGGGMQYVKDKNGNRTQVYTNGGFLVGGTVSALNSYNNGQMITAQTGDI FNPANGPLANYLNMGDSGSPLFAYDSLQKKWVLIGVLSSGTNYGNNWVVT TQDFLGQQPQNDFDKTIAYTSGEGVLQWKYDAANGTGTLTQGNTTWDMHG KKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRVSALNGQTW MGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDG TVILNQQADADGKVQAFSSVGIASGRPTVVLSDSQQVNPDNISWGYRGGR LELNGNNLTFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNT VSIFGGRGAPGDLYYDSSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDH NKAIDTVKQQKIEASSQPYMYHGQLNGNMDVNIPQLSGKDVLALDGSVNL PEGSITKKSGTLIFQGHPVIHAGTTTSSSQSDWETRQFTLEKLKLDAATF HLSRNGKMQGDINATNGSTVILGSSRVFTDRSDGTGNAVSSVEGSATATT VGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRVGS FVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSP VISTTEGINLEDNASFSVKNMGYLSSDIHAGTTAATINLGDSDADAGKTD SPLFSSLMKGYNAVLRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSR IELGDGKHFATLQVKELSADNTTFLMHTNNSRADQLNVTDKLSGSNNSVL VDFLNKPASEMSVTLITAPKGSDEKTFTAGTQQIGFSNVTPVISTEKTDD ATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNNLNKRMGDLRDTQ GDAGVWARIMNGTGSADGDYSDNYTHVQIGVDRKHELDGVDLFTGALLTY TDSNASSHAFSGKNKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANF ASLGTKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWE DRGMALSMKDKDYNPLIGRTGVDVGRAFSGDDWKITARAGLGYQFDLLAN GETVLQDASGEKRFEGEKDSRMLMTVGMNAEIKDNMRLGLELEKSAFGKY NVDNAINANFRYVF >gid:1155075 shET2-2 enterotoxin MSSMPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQF KNKTAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKV NYQLLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMK KNGDFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTV FTCDSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKL LPDELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSD GTPAFYIALQNGYSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLC MSFMNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNG 
HADSIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDI LKILPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSF TTRRLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIA EQFSKKMKKTFIEIINRFNHFL >gid:1155388 sopA VirG-specific protease MDISTKKVEFSMKLKFFVLALCVPAIFTTHATTNYPLFIPDNISTDISLG SLSGKTKERVYHPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVS GWTTLGNQKASMVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGW LLNNLDYRLGLIAGYQESRYSFNAMGGSYIYSENGGSRNKKGAHPSGERT IGYKQLFKIPYIGLTANYRHENFEFGAELKYSGWVLSSDTDKHYQTETIF KDEIKNQNYCSVAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTN ISGTIKNSASIEYIGFLTSAGIKYIF >gid:1155243 spa-orf10 orf, hypothetical MCYMGVNFCNKIGIDQSEFEIESSIINSIANEVLNPISFLSNKDIINVLL RKISSECDLVRKDIYRCALELVVEKTPDDL >gid:1155244 spa-orf11 orf, hypothetical MIRQQKRLTIILLLLGVDKRDYSSCNVKTLLYSIRDYAKSVNDHEILTES NRLLSHCISDSNGAFFKSSKYVPLKYLRKRRIARKIPND >gid:1155236 spa13 invasion protein MYRDVEALDKRIIYFLQLENDLEPVGAQSVSQLFNTRRKIAIVKKHIIQY QSERILLKGRIEEIQKDIDEANASKRKLLHKESKICKRIGLIKRNNFAKQ LILDELSQEDMKYGIR >gid:1155234 spa15 invasion protein MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNE QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL RVVIKDDYVHDGIVFAEILHEFYQRMEILNGVL >gid:1155240 spa9 Type III secretion protein MSDIVYMGNKALYLILIFSLWPVGIATVIGLSIGLLQTVTQLQEQTLPFG IKLIGVSISLLLLSGWYGEVLLSFCHEIMFLIKSGV >gid:1155288 stbB plasmid stable inheritance protein MESSDPKKRKKVVAYLHPALYPQDNLTQQTIDSLPVQMRGDFYRQSLICG AALYSVDPRLLTLISVFFSEKITAENLVKLIEQTTGYTSTSIDISVLKNI IEASSENKSESITSKDDFEEQTRRNLSMLKK >gid:1155364 tnpA Tn501 transposition transposase MPRRLILSATERGTLLALPESQDDLIRYYTFNDSDLSLIRQRRGDANRLG FAVQLCLLRYPGYALGTDSELPEPVILWVAKQVQTDPASWTKYGERDVTR REHAQELRTYLQLAPFGLSDFRALVRELTELAQQTDKGLLLAGQALESLR QKRRILPALSVIDRACSEAIARANRRVYRALVEPLTDSHRAKLDELLKLK AGSSITWLTWLRQAPLKPNSRHMLEHIERLKTFQLVDLPEVLGRHIHQNR LLKLAREGGQMTPKDLGKFEPQRRYATLAAVVLESTATVIDELVDLHDRI LVKLFSGAKHKHQQQFQKQGKAINDKVRLYSKIGQALLEAKEAGSDPYAA IEAVIPWDEFTESVSEAELLARPEGFDHLHLVGENFATLRRYTPALLEVL ELRAAPAAQGVLAAVQTLREMNADNLRKVPADAPTAFIKPRWKPLVITPE 
GLDRRFYEICALSELKNALRSGDIWVKGSRQFRDFDDYLLPAEKFAALKR EQALPLAINPNSDQYLEERLQLLDEQLATVARLAKDNELPDAILTESGLK ITPLDAAVPDRAQALIDQTSQLLPRIKITELLMDVDDWTGFSRHFTHLKD GAEAKDRTLLLSAILGDAINLGLTKMAESSPGLTYAKLSWLQAWHIRDET YSAALAELVNHQYQHAFAAHWGDGTTSSSDGQRFRAGGRGESTGHVNPKY GSEPGRLFYTHISDQYAPFSTRVVNVGVRDSTYVLDGLLYHESDLRIEEH YTDTAGFTDHVFALMHLLGFRFAPRIRDLGETKLYVPQGVQTYPTLRPLI GGTLNIKHVRAHWDDILRLASSIKQGTVTASLMLRKLGSYPRQNGLAVAL RELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNSLARAVFFNRLGEI RDRSFEQQRYRASGLNLVTAAIVLWNTVYLERATQGLVEAGKPVDGELLQ FLSPLGWEHINLTGDYVWRQSRRLEDGKFRPLRMPGKP >gid:1155389 tnpC IS629 orf, fragment MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAIFSHENVINSVNQFKKYT LYLRK >gid:1155165 tnpD IS629 orf MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG VKRSVRPSAGKPLPQATA >gid:1155277 tnpD IS629 orf MMPLLDKLREQYGGGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG VKRSV >gid:1155128 tnpD IS629 orf, fragment MPLLDKLRAQRDDWLKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALW HVSWRLWDLPVFSGVKRSVRPSAGKPLPQATA >gid:1155111 tnpD IS629 orf MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG VKRSVRPSAGKPLPQATA >gid:1155141 tnpDE IS629 orf MMPLLDKLRAQRDDWLKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDAL WHVSWRLWDLPVFSGVKRSVRPSAGKPLPQATA >gid:1155270 tnpE IS629 orf, fragment MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW IRQHERDTGGGEVGSPPLNVSV >gid:1155169 tnpE IS629 orf, fragment MTWELILDGYSESSYSATPRFAAARLPWFTDKTLSRKPGAVHCVSGFASM SGIPGAVHTNIPQTVDFGTPRRRPATTRGKCSSK >gid:1155346 traD DNA transport protein, fragment MSVKLRLPQISESGEVVDMAAYEAWQQENHPDTWQQMQRREEVNINVHRE RGEDVEPGDDF >gid:1155351 traX F pilin acetylation protein MYCVNHNGCGKRSGKLPGKICCRSVCSRWSGIWFVTCRKRKPRAETDTGR QTLMTTDNTNTTRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLD HINLIFQLKQEWMFLAGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWG IIAQFAYYLAGFPWYEGNILFAFAVAAQVLTWCETRSGWRTAAAILLMAL WGPLSGTSYGIAGLLMLAVSNRLYRAEDRAERLALVACLLAVIPALNLAT 
SDAAAVAGLVMTVLTVGLVLCAGKSLPRFWPGDFFPTFYACHLAVLGVLA L >gid:1155096 trcA putative chaperone MIIMLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSI LSSVSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSRKIGDNLRKQ IFKQVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKND TTSNVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF >gid:1155272 virG invasion protein MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIA FATPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM ILGGSGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT FAGGNGGAAYGYGYDGYGGNAITGDNLSVINNGAILGGNGGHWGDAINGS NMTIANSGYIISGKEDDGTQNVAGNAIHITGGNNSLILHEGSVITGDVQV NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN AGILKMGTVEAMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNIN APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS DKNAFIQKGRIVAGSYDYRLKQGTASGLNTNKWYLTSQMDNQESKQMSNQ ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL YSSWFQDEKERTGLYMDAWLQYSWFNNTVKGDGLTGEKYSSKGITGALEA GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGG NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY TF >gid:1155320 yacA orf, hypothetical MVMTRWQMAQVNMSVRIDAELKDAFMAAAKSMDRNGSQLIRDFMRQTVER QHNSWFRDQVAAGRQQLECGDVLPHDMVESSAAAWRDEMSRKIADK >gid:1155321 yacB orf, hypothetical MAAIEPDERIGYSASSLAGQPYKGRNGRVEGTSGPHKVACNVILCENLL >gid:1155290 yccB orf, hypothetical, fragment MRHGLMEAACERRIPMPNWCSNRMYFPGEPAQIAEIKRLASGAVTPLYRR ATNEGIQLFLAGSAGLLQITENIRSEQCPGVTAAGRGAVSTENIAFTRWL THLQNGVLLDEQNCLMLHELWLQSGTGQRRWEGLPDDARETITVHFTAKR GDWCDIWGNEDVSVWWNRLCDNVVPEKTMPFDLLTVLPTRLDVEVNGFNG GVLNGVPSAYHWYTERYGVKWPCGYDLNISSREKTSFRWISTRRGVSRKA TLLQN >gid:1155291 yccB orf, 
hypothetical, fragment MDFDTPWCQPESDVIAELSRRFSCTLEHWYAEQGCDFCGWQLYERGELVD VLWGELEWSSPTDDDEQPEVTGPAWIVDNVAHYGG >gid:1155293 yceA orf, hypothetical MNYAGHEKLRAEVAEVANAMCDLRTTMNEMERRYSFNADTLPERLVRQTL FRANRLLMEAYTEILELDSCFKD >gid:1155294 yceB orf, hypothetical, fragment MYGTCETLCRALAAKYSGDTPLMLVIWSPEEIQALADGMDISLSDHEIRT VLAHLEDIPED >gid:1155295 yceB orf, hypothetical, fragment MEIISNVRENRQVTVPAELLETLTQIAEQALWKREWAARDHGFPLPEYVT RRQAMVDQARSLLKNNTHEND >gid:1155296 ycfA orf, hypothetical MNETLNALICRHARNLLLAQGWPEETDVDQCNPNYPGWISIYVRLDAPRL ATLLVNRHDGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQV SFPYAGEWLTEDEIRAVLDAVRDAVCSVSCRGAEDARRIRAALTTSGQTL LTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV SDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARAEIRSMADA LAKIRK >gid:1155353 yigA orf, hypothetical MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD WQQFARKRAEHCHRRCRGRV