GenColors

Gene list

Applied filters:

Organism: Shigella flexneri
Gene type: CDS

Number of genes found: 293

# Shigella flexneri

>gid:1155157  IpaH4.5  invasion plasmid antigen
MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRI
QAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPEN
SPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSA
RFNSLETLPELPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPE
FPQSLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPENIITLPICTN
VNISGNPLSTHVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAV
TAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQ
VAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS
EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM
LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWG
PWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAER
EAGAQVMRETEQQIYRQLTDEVLA
>gid:1155156  IpaH7.8  invasion plasmid antigen
MFSVNNTHSSVSCSPSINSNSTSNEHYLRILTEWEKNSSPGEERGIAFNR
LSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLK
ELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLY
NLPALPEKLKFLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNN
IRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRVLQSLQRLTSS
PDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHE
EHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVA
ADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE
ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV
TANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKY
EMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEV
LALRLSENGSRLHHS
>gid:1155064  S0001  IS2 orf2, fragment
MVHATELMKHASSPGCWDFVEPKNTAVRSPESNRIAKSFVKTIKCDYISI
MPKPDGLTAAKNLAEAFEHYNEWHPHSALDYRSPREYLRQRANDNRCLEI
>gid:1155065  S0002  putative resolvase, fragment
MNHYPSVTSLETPEARCRSGVPPLPACRQRESIYGLIELFIQIVHRLSVR
SERRLVKTLLADFQRGHGKTALLFRIAEAALNNPDGLVKEVVYHLHIVEP
DRSGKRSSSYLAQLRDVSARGDAVKNGRTLPEQDSGLPALVSDPGLPRMI
STVL
>gid:1155066  S0003  orf, hypothetical
MNLDGVRPYCRIVNKKNESISDIAFAHIIKRVKNSSCTHPKAALVFLGEK
GFCDSNDVLSIMGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQS
EPIVINNDDDALNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSH
QLGLGSELIDVQTIISRMKDCGILNVKDIRFTSCGSADKVAPKNFNNAPA
ESLSCILNSLPFFKEKESLLEQIKKHLENDESLSDGLKISGYHGYGVHYG
QELFPYSHYRSTSIPADPEHTVKRSSQKKTFIINKELD
>gid:1155067  S0004  apyrase
MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPPPPAEDS
VVFLADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISREN
TPILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDEKMA
ITGSYPSGHASFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQ
SDVEAGRLMGASVVAVLHNTPEFTKSLSEAKKEFEELNTPTNELTP
>gid:1155068  S0005  orf, hypothetical
MLPSETMIWQPEFTDKTFSRKLGAVPFTTCNVVLQGNGLPIPYVDQYNRN
DNFRFRAQPKYILGHLSNRLPDTAPFFNKKINHF
>gid:1155069  S0006  orf, hypothetical
MIKFSLKIYKHIRIHTLRILKKSLTTILFFGVEISNHQEKLPLNKTHHTV
YFGANAYIIDHDSPYGYMTLTEHFDNAIPPVFYHEHQSFFLDNFKEVVDE
VSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIGKSKDQGFREFCYNKN
IDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYEE
INNKVTDKKMAHQALAYSLGDKKADIALYLLSKFNFTKQDVAEMEKMNNN
IYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKANSGDTMLDNA
MKSKDSKMIDFFIKKWSGIRQTI
>gid:1155070  S0007  orf, hypothetical
MKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFRSLEHLDKV
SRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLSNNILNIKS
FDKIQSENIQTHKNTYSEDIKEISNHDFVFWG
>gid:1155071  S0008  ISEc8 orf, fragment
MSRPSEINRLKALVAKLQRMQFGKSSEKLRAKTERWIQEAQERISALQEE
MAETLGEQYDPVLPSSLRQSSARKPLPASLPRAPRVIRPEEECCPACGGE
LSPLGCDVSEQLELISSAFKVIEKQRPKLACRRCDHIVQAPVPSKPIARS
YAGAGLLAHVVTGKYADHLPLYRQSDLLFHTAI
>gid:1155072  S0009  IS3 orfB, fragment
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTNSDHNLPVTPNLL
NQNFTPTAPNQVWVADSVVQAFRNQPTEGAGRETAAYAVR
>gid:1155073  S0010  ISEc8 orf
MSRKNQRYSTEFKAEAVKTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155074  S0011  orf, hypothetical
MQQRSAALHAAGAAYPGNIFVDTTFRPYPDQWAFLASMIPMNAHDIEPTI
LRATGNTHPLDVTFIHEEDLATPWKPEQSSVYAHVNPYGIFELDMETRLP
IEVVA
>gid:1155077  S0014  orf, hypothetical
MDDRIQAGKADMAACADEADEPVLGAERTGKGYLEQTMERGKTQRLAEMA
AANSDVPMMKSVAKTIGKRLYGILNAMRHGVSNGNAEALNSKIRLLRIKA
KGYRNRERFKLGVMFHYGKLNMAF
>gid:1155078  S0015  orf, hypothetical
MLNWLSKLRAARIHLPNAVEKIAFDRFHVAKQPGEVVDKTRQNEHPHLPV
ESRRQAKGTRFLWQHSDKWMTESRQEKLIWLRAQMKLTSLCWALKELAKD
IWSRPWSEERRNDWQRWLRPTVTSP
>gid:1155079  S0016  orf, hypothetical
MKVSFKSLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNACHKVAIF
LAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGES
GFVSFVNREGKICHTAYVKSSDNSMTYYHANGSSIDKYITDMCGLICMRH
IESTGIIFYMLDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV
>gid:1155080  S0017  IS3 orf2, fragment
MPSHHLYLANETQMVGLAPKHTAVQNGMAENFVKTMKYD
>gid:1155081  S0018  IS21 orf, fragment
MLNKRAFFGAFLIFWGFKFLSMNCRYEKASIILTSNKGVADWGEMFGDHV
LATAILDRLLHAEYQRRELPVKRET
>gid:1155082  S0019  IS91 orfB, fragment
MRYGSLAGWRYSAFLMLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQ
DEIGLRYNSHRTKREENLVMSGDEFMERFSWHVADKGFRMVIRGPESGEA
AITGRCGVRHNGDSEKNGEANHKERDVSAVTEG
>gid:1155083  S0020  IS91 orfA, fragment
MDAGNKKLVFWFVRVDDEGYPEIARCMEREFATIPAGINADGMYCPECGT
VHWPDGVIPPF
>gid:1155084  S0021  putative transposase
MEYRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCGMFVRFRNAGLSWP
LPAGMSEQELDALLYGSASTVPVVLTESTVMPKLPVVKKRPRRP
>gid:1155086  S0022  IS4 orf
MFPDSFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>gid:1155087  S0023  putative IS orf, fragment
MEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPV
TLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTPALLQ
ILIREIKGSSH
>gid:1155088  S0024  putative IS orf, fragment
MISFPAGSRIWLVAGITDMRNGFNGLAQKFKTS
>gid:1155089  S0025  putative IS orf, fragment
MDTSLAHENARLRALLQMQQDTIRQMAKYNRLLSQRVAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQILEAQERISALQEEMAETLGEQYDPVL
PSPLRQSSARKPLPASLPRETRVIRPEDECCPACGGELSSLGCDVSEQLE
LISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVAG
KYADYLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDILRQYVL
MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGGYRALYESGRITEAACMAHVRRKIH
DVHARVPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQS
LYDWIQQQMKIHSLKMECLHGEHYYPSGNSAGNSV
>gid:1155090  S0026  IS3 orf, fragment
MRATVFSYIECDYNRWRRHRWCGGLSPEQFENQSLTYDCVHIMWVGSIKT
FYMCLLAARKFIPCRYHKHA
>gid:1155091  S0027  putative transposase, fragment
MISNEGEFMNEKQLTSNKLRALANELAKSLKNPEDLSQFDWMLKMKPYSM
LI
>gid:1155092  S0028  putative transposase, fragment
MDAENETVLNANMTHHLGCEKNQLRSGSNSRNGCLTKIITTGDEPLEIRT
LRDRNGTFEPQQLKKNQP
>gid:1155093  S0029  IS630 orf, fragment
MMWPALHETITRNHQCRSIWPLLKKVRHFMETVSPLPGEKHSLDKV
>gid:1155094  S0030  putative enterotoxin, fragment
MSINNYGLHPANNKNMHLIIGSNTANENKGMKNNIINVTNTAISHAINEE
KSGGGYSGVSFRKLAKIQNISIPTKNNKEYNRHNLFSLIWHGNADAARKY
SESLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKT
FDLSPKETIKLLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLT
EIADRLNNNEQDMFNIISDKIQELF
>gid:1155095  S0031  orf, hypothetical
MPGATVADEFDKTLAFLEAIVNADNETTIGEIRSFADALDAVRFNRNKIN
RQLSKPNLASLALEHEVIWLGRSR
>gid:1155097  S0033  IS3 putative transposase, fragment
MSKLILPSNTVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEG
WLYLAVVIDLWSRAVIGWSMSPRLTAQLACDALQMALWRRKRPRNVIVHS
DRGSQYCSADYQALLKWHNLRGSMSAKGCCYDNACVESFFHSLKVECLHG
EHFISREIMRATVFNYIECDYNRWRRHSWCGGLSLEQFENQNLA
>gid:1155099  S0034  IS10 orf
MCQQFNEITAMPVHKVCQNFFRDALAPFHQYRQNALMDATMALINGASLT
QTSIGRFLPGNAQVKNKIKRIDRLMGNEALHRDIPMIFRNITSMLTRQLS
LCVIAVDWSGYPSQEHHVLRASLLCDGRSIPLLSKVVPSEKQNNPLIQHD
FLDSLAQSLPPDARVIIVTDAGFQSAWFHHITSLGWDFIGRIRNNVQYCL
DNAPERWLKVSDSPECKTPEYMGAGRLVKERKKSIRGHFYTYKKSAKGRK
KKRSKGQSGLNKTDKEQSKSAKEAWLIFSSTNDFRAREIIKLYSRRMQIE
QNFRDEKNGRFGFGLRASKSRSTGRILVLSLLATLSTIVMWLLGYHAENK
GLHLKYQANSIKSRRVISYLTLAKNVLRHSPLILRRTVLSTVLNHLSRTY
RNMVLVY
>gid:1155100  S0035  IS100 orfA, fragment
MISLNCWHKSVDHIMLCLSRFLGIPQPFRAQTKGKVERMVQYTRNSFYIP
LMTRLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQ
SMLALPPEKKEYDVHPGENLVSFDNPPQHHPLSIYDSFCRGVA
>gid:1155101  S0036  IS100 orfB, fragment
MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ
LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAA
DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV
IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG
ESYRLRQKRKAGVIAEANPE
>gid:1155102  S0037  IS1 InsB
MGRWWRYKWITFHPSLTQHWLWYAYNTKTGGVLAYTFGPRNDETCRELLA
LLTPFCIGMVTSDDWGSYAREVPKEKHLTGKIFTQRIARNNRTLRTRIKR
LARKTICFSRSVEIHEKVIGSFIEKHMFY
>gid:1155103  S0038  orf, hypothetical
MHVTTGPSAPASSGWHVKQSASRVPWKSTKKSSAPSLKSTCSTDWKHYPR
LLRQSRHRRPLPEHVSREIHHLEPEESCCPECGGELDYSGEISVFLPERH
TILQ
>gid:1155107  S0041  putative transposase, fragment
MSEQKITGIDLAKTNFYLFSINAHGKPAGKTKLSRNQLLNWLVQQPKMTV
AMEAGGASHYWAREIRKLDHDVILLPAQHVKAYQRCQKNDYNDAQAIAEA
CQHGTIRPVPILWSNRTSRLF
>gid:1155108  S0042  putative reverse transcriptase
MKYQKHWSGHHIWMCVATREPPGCDGQTLKMFDQQRDGNLYKIWNRLCSG
TWFPPPVLEKRIPKSNGKERILGIPTVSDRIAQGAIKLFMEEKLDPIFHA
DSYGYRPGKSAHDALKQCAIRCWRYSWILEVDISAFFDHVRHDLVLKALE
HHGMPKWVILYCRRWMEAPMQSCENGELITRTRGTPQGGVISPLLANLFL
HYAFDLWMEREYRGVPFERYADDIVVHCSRMSDATRLKNRLSERFSEVGL
LLNAGKTNIAYIDTFKRRNVATSFTFLGYDFKVRTLKNFKGERYRKCMPG
ASNAAMRKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSR
NFNYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLTLVRKEYPKLFVHWYLL
RASNE
>gid:1155113  S0046  IS3 orfB, fragment
MRQQQDEQGRFSICSRQAAVVQRLMGILSLKAAIKVKRYRSYRGEVGQTA
PYVLQRDFKATRPNEKWVTDFTEFAVNGRKLYLSPVIDLFNNEVISYSLS
ERPVMNMVENMLDQAFKKLNEPPRESWRLNFLRKR
>gid:1155115  S0047  IS1294 orf
MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>gid:1155116  S0048  IS150 orfB, fragment
MKVLNELRQFYPLDELLRATEIPRSTFYYHLKALSKPDKYADVKKRIGEI
YHENRGRYGYRRVTLSLHREGKQINHKTVHV
>gid:1155117  S0049  IS150 orfA
MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG
EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW
LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ
VRFLETRLVYLKKLKALAHPTKK
>gid:1155118  S0050  IS100 orfA, fragment
MQYISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAF
PAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHL
AIAMGYEAFKIFYDISKISLELYHNIH
>gid:1155120  S0052  IS2 orf2, fragment
MADNGSAYTAHETRQFARELNLEPCTTAVSSPQSNGIAERFMKTMKEDCI
AFMPKPRTALHNLAVAIEHYNENHPHSALGYLSPREYRRQRVMST
>gid:1155125  S0055  IS110 orf
MTESSDCESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPAPERVLGPRLVHPALLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFTALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>gid:1155132  S0062  orf, hypothetical
MEINVTAPALLTDEHILQPFDCGNEVLSNWLRGRAMKNQMLNASRTFVIC
LEDTLRIVGYYSLATGSVTHAELGRSLRHNMPNPVPVVLLGRLAVDVCTQ
GHGFGKWLLSDAIHRVVNLADQVGIKAVMVHAIDDDARAFYERFGFVQSV
VAPNTLFYKV
>gid:1155133  S0063  orf, hypothetical
MEVFMSTAASVRKTPREHQINIRATDEERAVIDYAASLVNKNRTDFIMEL
AYQEAKNIILDQRLFVLDNERYDSFITQLEAPVQNAEGRERLMAVKPEWK
>gid:1155135  S0064  IS1294 orf
MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC
YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM
PA
>gid:1155136  S0065  orf, hypothetical
MQREKTPEWREKQKSSRGIRRGQRYRLVFQFPIRERCFGRLKEYRRIATR
YDKTARNYLAMVKLGCIRLFYQRLRN
>gid:1155137  S0066  orf, hypothetical
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEK
HLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLS
NNTLNIKSFDKIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQR
FREFCYNKNIDPVSLDRIINFVFQPEYHIPRMLSTDNFKKIRLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDV
AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN
SGDTMLDNAMKSKDSKTIDFLLKNGAVSGKRFGR
>gid:1155143  S0072  putative IS orf, fragment
MQKWNWPHSRGWTGITIDDCWKGWAILLRQKQKKLIMLPSETMIWQPEFT
DKTLSRKPGAVHAVRQQRSKALLTSLNEWMVEKNGTLSKKSRLGEAFSYV
LNQWDALCYYSDDGLAEADNNTAERALRTVCLGKKNYMFFGSDHGGDRGA
LLYGLIGSCRLNGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTDK
>gid:1155144  S0073  putative IS orf, fragment
MDGVVGQGYRARFYSAGELLQELRKARAQLKLNELLLKLDRYRVIVVDDL
GYVKRDSAETGVLFELIAHRYERGSLVITSNHPFSMWGSIFVDETMAVAA
ADRLIHHGYMFELKGESYRKKTAKAVTSVT
>gid:1155147  S0076  IS1294 orf, fragment
MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD
TAALTRLQDTGG
>gid:1155148  S0077  IS1294 transposase
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>gid:1155149  S0078  IS1650 orfA
MFWVLCSGAPWRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDA
NGFIDWSATALDGSNIRALKCAAGAQKNIPISTEIMGRVALAAVLAPKSI
WQQTEVASR
>gid:1155150  S0079  IS1650 orfB, fragment
MCRRCSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH
ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA
VIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRIATRYDKTA
RNYLAMVKLGCIRLFYQRLRN
>gid:1155152  S0080  IS630 orf
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECSAEHPVFYEDEVDIHLHPKIGADWQLRGQ
QKRVVTPGQNEKYYLAGALHSGTGKVSYVGGNSKSSALFISLLKRLKATY
RRAKTITLIVDNYIIHKSRETQSWLKENPKLRGIYQPVYSPWVNHVERLW
QALHDTITRNHQCRSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>gid:1155153  S0081  orf, hypothetical
MPGTTTAMSINFIGITARTMNSNGSHGKPQIPVDYQKLLSIEDITFCRNR
WGNIGENALRRVAVGKKLSFFGSDRGGENAAII
>gid:1155154  S0082  iso-IS1-insA
MRFSMTTVTVHCPRCNSDEVYRHGRSCSRHERFRCRSCKRVFQLTYSYEA
RKLGVKEQIVEMAHNGAGGCYTARTLKIGINTVIRTLKSSRPGG
>gid:1155155  S0083  iso-IS1-insB, fragment
MLAYTCGPRNDETCRELLALLTPFCIGMVTSDDWGSYAREVPEEKHLTGK
IFTQRMNVTT
>gid:1155160  S0088  orf, hypothetical
MNAHWSSKKSNFFRKNIKLLTKYLFFESQGIPDKVDIVSRLKTYGYSISG
VETDDGYKALVRAFQLHFRQKNYDGIMDAETAAILYALLEKYFPGK
>gid:1155161  S0089  orf, hypothetical
MLHEEKLARHQRKQAMYTRMAAFPAVKMFEEYDFTFATGAPQKQLQSLRS
LSFIERNENIVLLGTSDITNPRVGICV
>gid:1155162  S0090  orf, hypothetical
MRATAEEALKRISELYAIEDEIRGLPESECLAVRQQRSKALLTSLHEWMV
EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNTAERALRAVC
LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP
SNRVDDLLPWKVVLPSG
>gid:1155167  S0094  IS100 orf
MDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQ
LQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAA
DLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQV
IAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLGRILHHSHVVQIKG
ESYRLRQKRKAGVIAEANPE
>gid:1155168  S0095  IS1294 orf, fragment
MTRSGGDFQPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRI
LGVKEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDC
DWVHLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFC
AIHTYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQ
RLLKAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRN
TARYLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQREL
VARLKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVC
YAQMVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYM
PA
>gid:1155171  S0098  orf, hypothetical
MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDS
ANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHL
DRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILN
IKSFNKIQSEGICTKRNTYADDIKKIANHDFVFFGVEISNHQKKHPLNTK
HHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEV
NKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCY
GKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKKVKIREISLEEAVTASN
YEEINKQVTNKKIALQALFLSITNQKEDVALYILSNFEITRQDVISIKHE
LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE
NAEMIKLLLKYGATSDNKYI
>gid:1155172  S0099  orf, hypothetical
MDTSLAHENARLRALLQTQQDTIHQMAEYNRLLSQRMAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQIPFSRAIYATQALGCSDCSIKAILNS
>gid:1155173  S0100  orf, hypothetical
MTLPVFITVIADHDKPQPSGCLLESQGSLCPICRQRITHETGWNVHHKVK
KVMGAVKNYLTLSCYIQIAIDSYTVVKPALSKRAYKGLSGVPGNRYAPFL
GEGSPAMNCPYPTNIQNERNVLESAYNPL
>gid:1155174  S0101  IS1650 orfB
MCRRCSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH
ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA
VIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRIATRYDKTA
RNYLAMVKLGCIRLFYQRLRN
>gid:1155175  S0102  IS1650 orfA
MQSRFFTILRSNRHNLCGDLQQGMVHKSDSDELSALRAENARIIKPLLPP
EPATPRAGRPWAEHRKIINGMFWVLCSGAPWRDLPERYGAWKTVYNRFNR
WSKSGVINIIFNRLLSLLDANGFIDWSATALDGSNIRALKCAAGAQKNIP
ISTEIMGRVALAAVLAPKSIWQQTEVASR
>gid:1155176  S0103  orf, hypothetical
MSWLISQCAHQCTDNKKTETDAIYDKVRSSYLLSCILKKNKNVGLILHAP
SFVSVSEKIARIVMANYSRNWSNSELASAVLMSESSLKRRMYKEVGSIST
FVHKIKLTEAIRKLRRTNTPISVISSELGYSSPSYFSKVFFKYLKTYPQN
IRKKNGR
>gid:1155177  S0104  orf, hypothetical
MKKQIFINNKPPVVPYSGTHAKIFKYIEIPLPFFYFIYTSGEPFHISVQN
TVIYVSKYNGIFINKLVPFSLLFDRDISVLQRRDICVVRFTSEEISEHNV
LFDHDIERLKKISKAQLISPDYVLIDFSSVGGGEMNPMQCPG
>gid:1155178  S0105  orf, hypothetical
MGDSVTPEILGNMIRQYFSQVRTEKESIQALNHLRRVLHEVSPFAQEPVD
CVLWVKADEVVANDYNPNVMALGEKKLLKQSLEKDGFTQPVVVSEEKNHY
LVVDGFHRQLLGREADTGKRLRGWLPVACINPERKGQAARIAATIRHNRA
RGKHQITLMSDIVRDLSRLGWTNERIGTELGMDQDEVLRLKQISGLTELF
QEEDFSSAWTVR
>gid:1155182  S0108  IS600 orfB
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RYSRLGNITPAAFREKYHQMAA
>gid:1155183  S0109  IS600 orfA
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155184  S0110  orf, hypothetical
MPFYFRKECPLNSGYLSLRASKSRSTGRILVLSLLATLSTIVMWLLGYHA
ENKGLHLKYQANSIKSRRVISYLTLAKNVLRHSPLILRRTVLSTVLNHLS
RTYRNMVLVY
>gid:1155185  S0111  orf, hypothetical
MGCVTAPEPLSSFHQVAEFVSGEAVLDDWLKQKELKNQAIGATRTFVVCR
KGTQQIVGFYSLATGSVNHTEATGNLRRNMPDPIPVIILARLAVDVSFRG
KGLGADLLHDAVRRCYRVAENIGVRAIMVHALTENAKQFYIHHGFKPSKT
QVQTLFLKLPQ
>gid:1155186  S0112  orf, hypothetical
MCISFAIYCQYAIVVKQMIYGGFMKSGVQLNLRARESQRILIDAAAEILH
KSRTDFILEMACKAAEDVILDRRVFNFNDRQYEEFIEMLDAPVADDPAIE
KLLARKPQWDV
>gid:1155187  S0113  orf, hypothetical
MAQVDLHLAKWIARKHKRARGSLVRAFEWLLRIRHDCPTLFAHWCLAYDT
>gid:1155188  S0114  orf, hypothetical
MSQPKPFEVSKYAVWKAYQRVKANRGAAGVDGQSVEAFEVTNRSYTR
>gid:1155189  S0115  orf, hypothetical
MCQVGVAEMNESELSMKCRKEPLGDVENVLSTQRMTSPADIWYWLGGIRH
IGSMNATQALARNVGTCRLDAKGAARSRGTASA
>gid:1155191  S0116  putative IS orf
MMSSAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVEL
SRNTMVRWVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKT
KTGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQLHLAKYQGVLQAD
AYAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELY
DIEAEIRGSPAEERLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAF
DYILNHWNALNEFCRDGWVEIDNNIGENALRSVAVGRKNYLFFGSDKGGE
SAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>gid:1155192  S0117  putative IS orf
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIGTTSS
>gid:1155193  S0118  putative IS orf
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>gid:1155194  S0119  putative IS orf
MNSQTAKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>gid:1155195  S0120  putative IS orf
MDSINVRFSSPPQDSCLLLYDSMEFKLDLIEKSYQLGACVAQPAREYGIN
VSAPSATSCENCPVWQQSRPSILMDTNDEFPDSKRYSLLPFLFA
>gid:1155196  S0121  orf, hypothetical
MTLTILLVSVISNLTSPVPRRGEGYQAQGLIIDTTVKIAFSKGNVIKFRG
FQSENHTTY
>gid:1155197  S0122  orf, hypothetical
MKIPEAVNHINVQNNIDLVDGKTNPNKATKALQKNVLRVTNSSSSGISEK
HLDHCANTVKNFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFR
SLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSNFNFRHQSNSNLS
NNILNIKSFDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKL
PLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFL
DNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQG
FREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLED
AIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV
AEMEKMKNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKAN
SGDTMLDNAMKSKDSKMIDFLLKNGAILGKRFEI
>gid:1155201  S0125  IS1353 orf
MLTDIFNSNYQCYGYRRLHAMLRHEGGRLSEKVVRRLMVEEQLVVSRNRR
RRYSSYCGEIGPAPDNLIARDFKAEQPNQK
>gid:1155202  S0126  IS1353 orf
MVDCFDGKVVSWSLSTRPDAELVNTMLDSAVETLNAGERPVIHSDRGGHY
RWVPSAPKCTANNAGWNDKAVQEIIRIPPSPFGLSAPPNMLFP
>gid:1155203  S0127  putative transposase
MPESLCGQLVSIRISLDDELRIYSNEQQVASHRLCSAAYGWQTVPEHHAP
LWQQASERMAERLGEIQKRVITVCDREADIWHYLYYKVSHGQRGACCTES
PAGRGTRQALRTAGSPGNRRKPHAECDAKRRAGSPSGPDVHQLQRSQHKK
SRQQRPGAPAHVCLLPGAGRGRCLLASADVRKSGECRRCTTYCQPLRATL
ADRGIPQGVEKWWYMESLRMQTRDNLERMVVIQAFIAVRVLGLRQGGVSE
ETQNDSCEKILTPTEWKLLWVKLEGKPLPVQAPTLKWAWGDGMTANAQVV
PVGASCGMAGQTSGYG
>gid:1155205  S0128  putative transposase for IS110
MTESSDCESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPAPERVLGPRLEHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALEVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASAAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQGS
>gid:1155206  S0129  IS100 orfA
MVTFETVMEIKILHKQGMSSRTIARELGLSRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVPGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGSTVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPGENLVSFDNPVTLFVPLIMGC
>gid:1155209  S0132  orf, hypothetical
MIKEKILSIVAFCYGIAYSKLSEETKFIEDLSADSLSLIEMLDMISFEFN
LRIDESALEHIITIGDLISVVKNSTKSI
>gid:1155218  S0141  orf, hypothetical
MQSQISYVHLIILSHKWGIDCSFVPRFTTKKILIQQGSLFFYNTPSLIHQ
ALYLVGFHDETSTTYAHEVHIMYSKRKFDYVNRLKFLNLLIEYIIYQIHF
TSPQSPSNGELIKYEPQN
>gid:1155245  S0168  IS600-like, orfB
MVVSAIASTPQVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTR
ETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRATTSSDHN
LPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDVYTCEIVG
YAMGERMTKELTGKALFMALRSQHPPAGLIHHSARGSQYCAYDYRVIQEQ
SGLKTSMSRKGNCYDNAPMEIFWGTLKNESLSHYRFKSRDISSAYGKTD
>gid:1155247  S0169  IS600-like, orfB
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>gid:1155248  S0170  IS600-like
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155249  S0171  IS600-like, orfA
MSRKNQRYSTEFKAEAVKTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155250  S0172  IS600-like, orfB, fragment
MESFWGTLKNESLSHYRFNSRDEAISVIREYIEIFYNRQRRHSRLGNISP
AAFRIKYYQMTA
>gid:1155252  S0173  IS1294 orf
MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>gid:1155253  S0174  IS3 orf, fragment
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTGISPRQQFRQHCD
SVVLAAAFTRSKQRYGAARLTDELRAQGYHFNVKTVAASLRRQGLRAKAS
RKFTYRKLKNQTIPLSTPYAT
>gid:1155254  S0175  orf, hypothetical
MTHMTKTVSTSKKTRKQHSPEFCSEALKLAERIGVAAAARELSLYESQLY
AWRSKLQQQMTSSERESELAAENARLKRQLAEQAEELAILQKAATYFAKR
LK
>gid:1155255  S0176  orf, hypothetical
MTMKMNTALIVALMCMWYAVPAAAKETLLAMPRNSTEHCYAEINVHGPYG
VYFRVVPHPPGGKSWVECNSDYYYSDKPPGVQILGTRAGCRVYGICGTTS
TLHVAGRGVVCIKNICSPRGMIIHRIRKRPVVAVSDEM
>gid:1155256  S0177  orf, hypothetical
MKIKRSTFISNIFYIISWFLMNDNSLLRNSSLFIAYMGCVGWVSAYSYGW
GTSFYYGFPWWVVGAGLDDVARSLLYAIIVMGILFTGWGIGILFFLLIKK
RSKIQDLSFFRLFFAITLLFFPVIFELLILKQYFILPLSLSFIISSLVIS
IIIRIYGRIFSVSCFSDIPFVREHRIKLIMAGFLVYFWLFSFLVGWYKPQ
LKKEYQMLCYNNSWYYVLARYDSRLVLSSSFKDDSNRFLIFNTEQSGFYE
INDVYVRK
>gid:1155257  S0178  putative transposase
MTAAALLAEMPESSSLSRREISALVGVAQVNRDSGTLRGRRTIFGDCAGE
EQLCTWRRLRPPRFNLVIKAFYMRLLAAGNAKKVALVACMRKLLTIMNAM
LRKNEEWNESYL
>gid:1155258  S0179  orf, hypothetical
MTESRQEKLIWLRAQMKLTSLCWALKELAKDIWSRPWSEERRNDWQRWLR
PTVTSP
>gid:1155263  S0183  putative transposase, fragment
MLNKRAFFGAFLIFWGFKFLSMDMNAGYIRAARIHLPNAVEKIAFDRFHV
AKQPGEVVDKTRQNEPPRFSWRVFYL
>gid:1155264  S0184  IS91 transposase, fragment
MRYGSLAGWRYSAFLMLPRFADIFQQGNRWLNWLEKQPVQMSRLEHYAGQ
DEIGLRYNSHRTKREENLVMSGDEFMERFSWHVADKGFRIVIRGPESGEA
AITGRCGVRHNGDSEKNGEANHKERDVSAVTEG
>gid:1155265  S0185  IS91 orfA
MACDYRYKNRQYHCLSGSYMARSAKPRKRKPAPQRSKLLRYVVKLHEDDF
FDEEEAEVLRFDNFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP
EIARCTEREFATIPAGINADGMYCPECGTVHWPDGVIPPF
>gid:1155266  S0186  putative reverse transcriptase, fragment
MPQGGVISPLLSNIILNEFDQYLNKRYLSGKARKDRWYWNHSIQRGRSTA
VKENWQWKPAVAYCCYADDCVPRRRVLGT
>gid:1155267  S0187  putative reverse transcriptase, fragment
MARTRSGRETSRTITAHRLRGNTGRRVIEGDLSSYFDTVHHRLLMKAVCR
RISDTRFMRLLWKTPC
>gid:1155268  S0188  IS3 putative transposase, fragment
MCSGYHFNVKTVAASLRRQELSAKASQKFSPISYRAHGLPVSENLLTQDF
YASGPNQKWAGDITYYYSSPTAGKHGAPGY
>gid:1155273  S0193  adhesion protein, fragment
MLPPNIRGYAPQITGIAETNARVVVSQQGRVIYDSTVPAGTFSIQDLSSS
VRGILDVEIFEQNGKRKHFQVEMCRCAFLIQTWSE
>gid:1155274  S0194  orf, hypothetical
MPFLSRLGQSRYKLVTGLPKTNNKATGDAFFSVKILRGPEPGEVARQITW
GSVQPELNRPGNPGD
>gid:1155280  S0199  orf, hypothetical
MPLICGCNVSFPAFPSKGTPMMLYATLGYDFKVRTLKNFKGELYRKCMPG
ASNAAMCKITETIKKWRIHRSTAESLLDFARRYNAIVRGWIEYYGKFWSR
NFSYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLALVRKQYPKLFAHWYLL
RASNE
>gid:1155281  S0200  putative reverse transcriptase, fragment
MKDRNGSGAKGLPHCADGAAATTGDNADGRTAVKSAKPFPVSKRQVWEAY
KRVKANRGAAGIDGQTLAGFDENVTDNLYKLWNRMASGSYMPQAVRRVDI
PKADGGVRPLGIPAVSDRIAQMVVKQILEPVLEPLFHADSYGYRPGKSAH
QAIAQARKRCWKFDWVVEVDIKGFFDDIDHDLLLKTVQHHTQARWVVMYI
ERRLKAPVQMPDGAMLARGRGTPQGGVISPLLSNLFLHYAFDMWMQRQFP
GVPFERYADDVVCHSRI
>gid:1155282  S0201  putative reverse transcriptase, fragment
MVLRSQRPPAGLIHHSARGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNA
PMESFWGTLKNGTGTE
>gid:1155285  S0203  IS1650 orfB
MCRRYSKKHPDIDGDNGPGRSRGGFGTKIHLATDGSGLPLNIVLSPGQAH
ESQFAQRLLDGIGVQRQNGSMKRRGHAVLADKAYSGRALRNELKNNGIKA
VIPRKSNEKMASDGRAQLDRDAYCNRNVVERCFGRLKEYRRIATRYDKTA
RNYLAIVKLGCIRLFYQRLRN
>gid:1155286  S0204  IS1650 orfA
MARYDLPDEAWTIIKPLLPPEPATPRAGRPWAEHRKIINGMFWVLCSGAP
WRDLPERYGAWKTVYNRFNRWSKSGVINIIFNRLLSLLDANGFIDWSATA
LDGSNIRALKCAAGTQKNIPISTEIMGRVALAAVLAPKSIWQQTEVASR
>gid:1155287  S0205  orf, hypothetical
MCDNNREMTMATINARIDDDIKNQADEVLKLMNISQTQAIAAFYQYITEQ
KKLPFVITSIVKTPHDLLRESTDMLAEALAVISNLQVWTEQQDGIGKAKL
MEYYRRLDALYCCAKEKIGLLSDNRDAELGCVP
>gid:1155297  S0215  IS1328 transposase, fragment
MRFVQPRTETQQAIRALHRVRESLIRDKVKTTNQIHGFLLEFGISLPTGD
AVIKRLSLVLAEHEIPEYLSRLLVRLHTHYLYLVEQIAELESELSQSINA
DDTAQRIMTIPGVGPITASLLSSQLGDGKQFSCSRDFAASTGLVPRQYST
GGKSTLLGISKRGDKNLRRLLVQCARPFMMQLERQHGKLAEWVREQLNKK
HSNVVACALANKLARIA
>gid:1155299  S0216  putative IS orf
MMSSAPLPPKPIERGYASAGLLARILVSKYMEHIPLYRQSEIYARQGVEL
SRNTMVRWVSEMADKLRPLYIALNDYVLEAGKVHADDTPVKVLAPGNGKT
KTGRLWVYVRDDRNAGSSLPAAVWFAYSADRKGEHPQLHLAKYQGVLQAD
AYAGYNVLYETGRVKEAGCLAHARRKIHDEDVRRPTEMTQEALRRIAELY
DIEAEIRGSPAEERLAVRKARSVQLMQSLYDWIQLQRKTLSKHAEMAKAF
DYILNHWNALNEFCRDGWVEIDNNIGENALRSVAVGRKNYLFFGSDKGGE
SAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHELLPWNFSSVK
>gid:1155300  S0217  putative IS orf
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRM
LFGQSSEKKRHKLENQIRQAEKRLSELENRLNTARNLLEDASSVTDSPDT
SPPSENPIASKPESPGRKSSRKPLPAELPRETHRLLPAETSCPACGGVLK
EMGETISEQLDIINTAFKVIETIRPKLACSRCDVIGTTSS
>gid:1155301  S0218  putative IS orf
MITLPTGTKIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRS
GKMVKILWADRDGLCLFTKRLAGDPGRESAPDASSVIHATGGDRVATSQT
DRTAWHPDITRDKTRE
>gid:1155302  S0219  putative IS orf
MNSQTTKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FLASGIAWPLPDSVSFAQLDAILYANRKKELTEPQIREGSWRKERRTSYS
REFKVRLAKQALQPGAVVARIAREHDINDNLLFKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKHDERPEGAPGNVPRCELHLKSGV
VKLFDPLTPELLRALIREMKGGIR
>gid:1155309  S0225  orf, hypothetical
MINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLSDMDKDSELAATLQ
KMINPSGGDGNCSGCALHACMAMLGYGVREAPVPNEISEYMTGFFHRHLE
QIDSEGIVSHPNETYSKFRERIAENILQNTSKGSVVMISIEQATHWIAGF
NDGEKIMFLDVQTGKGFNLYDPVEKSPDAFVDENSSVQVIHVSDQEFDHY
ANSSSWKSKRLC
>gid:1155311  S0227  IS1-cat orf
MSDNGNAHAGNNYGHVDKLHRNDSADNEQGNDSNLLIVFYVQIMPDDLVM
QLHRF
>gid:1155315  S0230  putative IS orf, fragment
MAALPCPLYRQQHIFSRMGVELPVSTMADMVGVAGAALAPLAKLLRHELL
TRDVIHADETSLRLLDTRKGGKSCSGWLCAYVSGERSGPPVVCFDSQTGR
ALRYPETWLQCWCGSTLVSDGYSVYKSLADNHPGITSACCWSHAGRGFAN
LYKASREPRAGVELRKIAGLYRIEKLIRERPVEKIRQWR
>gid:1155316  S0231  putative IS orf, fragment
MNDLFAWLEEQEPCCPPDGPLNKAINYILNRRDELSCFLGDGAVPLDNNI
CERAIRPVVMGRKAWLFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTD
VLTRLPEWPEERLAELLPLEGFTFTG
>gid:1155322  S0237  orf, hypothetical
MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSA
LYKKYDLIGNQYNEILEMAWQESELFNAFYGDEASVVIQYGGDVYLRMLR
VPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKES
ESLFPIDFRNIYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL
>gid:1155328  S0241  putative iso-IS1 orf
MYDGDFKVLAWLPFPSDVLPAPLLKAWCVTAKALPDISAISALIAVKHGN
YSSLTPPLSPVRTRKSLIWP
>gid:1155329  S0242  putative IS orf
MAGCRLGVPKSTVCGMFVRFRNAGLSWPLPAGMSEQELDALLYGSASTVP
VVLTESTVMPKLPVVKKRPRRPNADQLRIS
>gid:1155330  S0243  IS4 orf
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNGWSLAGEHRHWMIPLRKGAQYEELR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMADLYSNRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRGSEPPRESWRLNFLRKR
>gid:1155335  S0247  putative IS orf
MLTELLTRAYPCPPLTPRSTVCGLFARFRKSGLSWPLPAGMSEQELDALL
YGSASTVPVVLTESTVMPKLPVVKKTSPAALMPVS
>gid:1155336  S0248  IS3 transposase, fragment
MAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAG
DITYLRTDEVRLHPVSTEPHAF
>gid:1155337  S0249  orf, hypothetical
MFSKAFLRKISMFFARRKPAAMKICLYHTLNPDTIPGYKKFAQAIATDNF
VQADVRKIDTNLYRARLSIRDRLLFSLYRYHGETICLVLEYIRNHAYNTS
RFLRRNVVIDEGRLQQQPVPDPVDIATEALTYINPSHGRFHRLDKMLSFD
DDQQALYEHPLPLVIVGSAGSGKTALVLEKMKQAAGDILYLSLSSFLVEK
ARTLYDASGEGSEVQNIDFLSLTEFLETLRIPEGREVTFSAFSDWLPRNR
AIAALGAAHTLYEEFRGVIGAVASGNGPLSREAYLSLGIRQSLYGMEDRP
TVYVLFERYIAWLKQSHQYDSNLLSHQYLSLATPRYDVIFVDEVQDMTPV
QLQLVLKTLRHPGQFLLCGDANQIVHPSFFSWSSLKSLFFRQQQGNDTTV
NILQANYRNGHHVTALANRLLRLKQVRFSAIDRESHHFVRSCGQAEGTIR
LLDDREETKQELNAKTSLSNRVAVIVMHPEQKAQARCWFSTPLVFSVQEV
KGLEYETVILYNIVSAARQAFDDICEGLTPADLEGEARYSRPRDRQDRSA
EIYKFFTNALYVALTRATHNVYLVEQQVEHPLWSLLALTHQEEPLNLQEE
ISSRDEWQKTAHLLEKQGKQEQADTIRSRILQTSEMPWQIITAEDARQWK
QHILAGTADKTIQLQALEYSLIYSLFPLYNALYREDFKPTRQPRTKTLQL
LELKYFRPYSMNNPVAVLRDIERYGVDHRSPFNLTPLMSAARAGNIALVQ
LLLERGADPLLTGNDGLAAYHQVLSAAVSTPRYAQQKSAQLYTLLKPESL
SLQVEGRLIKLDNRQMAMFLVILMQALFHTHLGSALFFSEAFSAARLAEC
VVHLPEALLPERRKRRSYISSQLSQHEVNSKNPYGKKLFLRLNHGQYILN
PGLKIRQGDVWRAVYELQSPEDLGHDLQTYLQDMSPELVDMLGGKKGFYE
RSEKSVGYWVGGIRRAAQKA
>gid:1155342  S0254  IS630 orfA
MVVSAIASIPQLHRGDRVSDVARTLCCARSSVGRWINWFTQSGVEGLKSL
PAGRARRWPFEHICTLLRELVKHSPGDFGYQRSRWSTELLAIKINEITGC
QLNAGTVRRWLPSAGIVWRRAAPTLRIRDPHKDEKMAAIHKALDECSAEH
PVFYEDEVDIHLHPKIGADWQLRGQQKRVVTPGQNEKYYLAGALHSGTGK
VSYVGGNSKSSALFISLLKRLKATYRRAKTITLIVDNYIIHKSRETQSWL
KENPKFRGIYQPVYSPWVNHVERLWQALHDTITRNHQCRSMWQLLKKVRH
FMETVSPFPGGKHGLAKV
>gid:1155344  S0255  IS600 orfB
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>gid:1155345  S0256  IS600 orfA
MSRKTRRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155355  S0266  IS600 orfB
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNPNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>gid:1155356  S0267  IS600 orfA, fragment
MAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155359  S0269  Tn501 orf, hypothetical
MCCGAVFSDSLLDAGRIARVCGCLGQLLHAGLGIVEGDDRLACLESHVNF
ADAFDLGNRLLDSDRAGGAGHARYGQRDGLGGGPDGGNNGGEGEGGKQFL
HGELRSVEKWHDVGKSERDQNQRGHDPENELVSSSHLGNRADLTRFAGRC
LPVDAPPGEEQRHQRHADKDGAIGFQHRQVADPSAAEPQGDQNQRPEAAS
RGEDGGKPSSEERAAPGFWFRHALVLSN
>gid:1155361  S0271  Tn501 orf, hypothetical
MLCRLGRQGGQLHFQIGQRFAPTLDELTQQGKLRGRFVAVRRIQRPAQPR
QRVEADARLEGRPHEAQPLQGGVIEQAVAAWCARHRAQQSAQQVVAHDMH
AHPGIKSQPGHRVGVH
>gid:1155362  S0272  Tn501, conserved hypothetical orf
MTPPCNGCASCGRPSRRASASTRWRGCAGRWMRRTATKRPRSLPCCVSSS
SVGAKRWPIWKCSWPPCRPSRHSTRRVCHEQPRALAVRDAQTDHRLPVGR
TGCADLPLPPAHPRCRAGRHNRRCFPRRALGHRGARFDRPVPSVPVAGVA
GIQGKRMSAFRPDGWTTPELAQAVERGQLELHYQPVVDLRSGGIVGAEAL
LRWRHPTLGLLPPGQFLPVVESSGLMPEIGAWVLGEACRQMRDWRMLAWR
PFRLAVNASASQVGPDFDGWVKGVLADAELPAEYLEIELTESVAFGDPAI
FPALDALRQIGVRFAADDFGTGYSCLQHLKCCPISTLKIDQSFVAGLAND
RRDQTIVHTVIQLAHGLGMDVVAEGVETSASLDLLRQADCDTGQGFLFAK
PMPAAAFAVFVSQWRGATMNASDSTTTSCCVCCKEIPLDAAFTPEGAEYV
EHFCGLECYQRFEARAKTGNETDADPNACDSLPSD
>gid:1155365  S0275  IS600 orfA, fragment
MSRKTRRYSKEFKAEAVRTVLENQLSISEDASRLFLPEGTLGGLAEFGKN
RTLRFSGHP
>gid:1155366  S0276  conserved plasmid hypothetical protein
MNINQFMDRAGAAWVYEQYNTDPVLPVLQNEARQQKRGLWSDADPVPPWI
WRHRK
>gid:1155376  S0281  IS1294 orf
MLSAFTPRPLKRLFTTNQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYNCDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFILPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGASLSFRYLDHKTGETATETLTQRELVAR
LKQHIPEKFFKMVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQ
MVKQFLSRDPFECVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG
>gid:1155377  S0282  orf, hypothetical
MLTRKSIDTVLLSVGAEKLSQREWDWMKMLKPMDPPPAMVAASILERRGD
TAALTRLQDTGG
>gid:1155380  S0285  IS600 orfA
MSRKNQRYSTEFKAEAVKTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155381  S0286  IS600 orfB, fragment
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVG
>gid:1155384  S0288  IS911 orfA
MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST
MTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKAT
ALLMSDSLNSSQ
>gid:1155385  S0289  IS911 orfB
MVTLCHVFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATRSGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNY
LERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSP
DSRLTMKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSM
SRRGNCWDNSLMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSAL
RPHEYNGGLPPNESENRYWKNSNSVASFC
>gid:1155386  S0290  orf, hypothetical
MSIEIKMISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYDK
KYISGITRSMAQLKIEEFINEKSRRLNYMKTMYSPCPEDFQPISRAEAST
PEGSWLTVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFSKENSDFL
YIIVVFRNDSPQGELRANRFIELYDIKREIMQVLRDESPEIKVY
>gid:1155387  S0291  orf, hypothetical
MDTGLSEVLVFKLFRTEAATPTVPSPAIVIALDIIKHCCPHYFLIDKVFS
VETFHLGTIILFMVMFIFEYRD
>gid:1155170  ShET2-1  enterotoxin
MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNL
NCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVS
EEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNM
ALGLKIKETKNGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFL
DEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPDNLLHPKVIYHAMRMG
LTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTIL
AYGELLETSGLNLDKTVELLTAEGMGGRISGLSQALQNGHAETIKTYGRL
LKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL
ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK
LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQA
SKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDK
KNGSDFLEIMKNIKS
>gid:1155317  ccdA  post-segregation antitoxin
MKQRITVTIDSDSYQLLKSANVNISGLVNTAMQKEARRLRAERWQAENQQ
GMAEIARFIEMNGSFADENRDW
>gid:1155318  ccdB  post-segregation toxin
MPMRTGTGEMQFKVYAYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARL
LSDKVSRELYPVVHVGDESWRMMTTDMASVPIFVIGEEVADLSHRENDIK
NAINLMFWGI
>gid:1155352  finO  fertility inhibition protein
MWYLWPQEDDSSEAEHSMTEQKRPVLTLKRKTEGTAPVRSRKTIINVTTP
PKWKVKKQKLAEKAAREAELAAKKAQARQALSIYLNLPTLDEAVNTLKPW
WPGLFDGDTPRLLACGIRDVLLEDVAQRNIPLSHKKLRRALKAITRSESY
LCAMKAGACRYDTEGYVTEHISQEEEAYAGARLAKIRHQNRIKAELQAVL
DEK
>gid:1155367  hmo  putative regulator
MAKTKQEWLYQLRRCSSVNTLEKIIHKNRDSLSNSERESFNSAADHRLAE
LITGKLYDRIPKEIWKYVR
>gid:1155217  icsB  invasion protein
MSLKISNFIDASNTKGPIRVEDTEHGPILIAQKFNLKDLFFRTLSTINAK
INSQILNEQLKNYRLENQKSLLLFLNTLASEKSAESAFAAYEAAKNSIQH
SFTGRDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISI
KNNLLQEPTHTYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMI
SEQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKDPYGYWGVSAD
KVYIPLSGDNKTKDGKISHNLFGLDETNMSKFICKKKADAFRQLANYKLI
SKSENCAGMALNVLKAGNSEIYFPLPDVKLVATPNDVYAYANKVRQRIES
LNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS
LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMTRVLNELKTEATDKKEE
IIEKSIKIIDYYNSLKSPDLGTKLYIHDLLQINKLLLNNSHSNI
>gid:1155314  insA  IS1 orfB
MTSISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>gid:1155138  insA  IS1 repressor
MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHKKIIDMAMNGVGC
RASARIMGVGLNTVLRHLKNSGRSR
>gid:1155122  insA  IS1 repressor
MVRNGKSTAGHQRNLCSHCRKTWQLQFTYTASQPGTHKKIIDMAMNGVGC
RASARIMGVGLNTVLRHLKNSGRSR
>gid:1155327  insA  iso-IS1 orfA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>gid:1155123  insB  IS1 transposase
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>gid:1155313  insB  IS1 orfB
MSRQRTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>gid:1155326  insB  iso-IS1 orfB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSHIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>gid:1155379  insB  IS1-insB, fragment
MRSLHFNSVTPSATKDKQVTRKGIFIQHMLYLERNNLTLRTRIKRLARKT
ICFSRSVEIHEKSSAPSLKNTYSTDWKRHPKKYRFFTVNFI
>gid:1155310  insB  iso-IS1 orfB
MLPLSVNLMSSGASLAARLQQHWLWYAYNTKTGGVLAYTFGPRNDETCRE
LLALFTPFCIY
>gid:1155139  insB  IS1 transposase, fragment
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEHWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGCPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLNRPGNP
GD
>gid:1155210  ipaA  invasion protein
MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEK
ENFATSFNQKCLDFLFSSSGKEDVLRSIYSNSMNAYAKSEILEFSNVLYS
LVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQL
RQNIEHRLALLAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNET
VSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLSRDLNTVAVFP
ELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIH
YHINNDNRTYDNRVFDNRVYDNSYHENPENDAQSPTSQTNDLLSRNGNSL
LNPQRALVQKVTSVLPHSISDTVQTFANNSALEKVFNHTPDNSDGIGSDL
LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKS
SASLSHRVASQINKFNSNTDSKVLQTDFLSRNGDTYLTRETIFEASKKVT
NSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANIRN
YVTETNADTIDKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLK
NNNDITAENNNIYKAAKDVTTSLSKVLKNINKD
>gid:1155213  ipaB  invasion protein
MHNVSTTTTGFPLAKILTSTELGDNTIQAANDAANKLFSLTIADLTANQN
INTTNAHSTSNILIPELKAPKSLNASSQLTLLIGNLIQILGEKSLTALTN
KITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSK
IKDLENKINQIQTRLSELDPESPEKKKLSREEIQLTIKKDAAVKDRTLIE
QKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQKSLTGLASVT
QLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKA
EELNRVMGCVGKILGALLTIVSVVAAAFSGGASLALAAVGLALMVTDAIV
QAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFL
KNFSSQLDDLITNAVARLNKFLGAAGDEVISKQIISTHLNQAVLLGESVN
SATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQ
EVIADLLASMSNSQANRTDVAKAILQQTTA
>gid:1155212  ipaC  invasion protein
MLQKQFCNKLLLDTNKENVMEIQNTKPTQTLYTDISTKQTQSSSETQKSQ
NYQQIAAHIPLNVGKNPVLTTTLNDDQLLKLSEQVQHDSEIIARLTDKKM
KDLSEMSHTLTPENTLDISSLSSNAVSLIISVAVLLSALRTAETKLGSQL
SLIAFDATKSAAENIVRQGLAALSSSITGAVTQVGITGIGAKKTHSGISD
QKGALRKNLATAQSLEKELAGSKLGLNKQIDTNITSPQTNSSTKFLGKNK
LAPDNISLSTEHKTSLSSPDISLQDKIDTQRRTYELNTLSAQQKQNIGRA
TMETSAVAGNISTSGGRYASALEEEEQLISQASSKQAEEASQVSKEASQA
TNQLIQKLLNIIDSINQSKNSAASQIAGNIRA
>gid:1155211  ipaD  invasion protein
MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDT
LHNIRTTNQALKKELSQKTLTKTSLEEIALHSSQISMDVNKSAQLLDILS
RNEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYL
KVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALE
ELKEKYKDKPLYPANNTVSQEQANKWLTELGGTIGKVSQKNGGYVVSINM
TPIDNMLKSLDNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQK
YSNANSIFDNLVKVLSSTISSCTDTDKLFLHF
>gid:1155378  ipaH1.4  invasion plasmid antigen
MIRILVIMIKSTNIQAIGSGIMHQINNVYSLTPLSLPMELTPSCNEFYLK
TWSEWEKNGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQI
TTLEIRKNLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKI
KELPFLPENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKL
EGLALANNFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAG
NPLSGHTMRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWF
PENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAW
LEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLF
DNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEK
LQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHA
VLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGA
QVMRETEQQIYRQLTDEVLALRLSENGSNHIA
>gid:1155126  ipaH2.5  invasion plasmid antigen H
MIKSTNIQVIGSGIMHQINNIHSLTLFSLPVSLSPSCNEYYLKVWSEWEK
NGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRK
NLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLP
ENLTHLRVHNNRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALAN
NFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVNVAGNPLSGHT
MRTLQQITTGPDYSGPRIFFSMGNSATISAPEHSLADAVTAWFPENKQSD
VSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSAS
AELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL
LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV
KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEA
DRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETE
QQIYRQLTDEVLA
>gid:1155319  ipaH9.8  invasion plasmid antigen
MSTGFNWMPIMLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPG
EERDEAVSRLKECLINNSDELRLDRLNLSSLPDNLPAQITLLNVSYNQLT
NLPELPVTLKKLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLL
TMNISYNEIVSLPSLPQALKNLRATRNFLTELPAFSEGNNPVVREYFFDR
NQISHIPESILNLRNECSIHISDNPLSSHALQALQRLTSSPDYHGPRIYF
SMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFL
DRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDR
VALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKV
RTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEA
MVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYPQR
VADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGS
QLHHS
>gid:1155207  ipaJ  invasion plasmid antigen
MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQ
GCFLARKQMYDNSCGAASLLCAAKELGVDKIPQYKGSMSEMTRKSSLDLD
NRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVE
ESNIFSQVISFIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIG
VPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNSNFIGYTKIGI
SIVITNEAL
>gid:1155216  ipgA  invasion plasmid antigen
MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLL
NEHELFTENIINPEHILILNYSLSRDYGSSICLLPDTHQCVLTKKHYKKY
LSPDELIESLYEFLFCIKLTIANITSEVN
>gid:1155215  ipgB  invasion protein
MQILNKILPQVEFAIPRPSFDSLSRNKLVKKILSVFNLKQRFPQKNFGCP
VNINKIRDSVIDKIKDSNSGNQLFCWMSQERTTYVSSMINRSIDEMAIHN
GVVLTSDNKKNIFAAIEKKFPDIKLDEKSAQTSISHTALNEIASSGLRAK
ILKRYSSDMDLFNTQMKDLTNLVSSSVYDKIFNESTKVLQIEISAEVLKA
VYRQSNTN
>gid:1155214  ipgC  invasion plasmid antigen
MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKG
RIEEAEVFFRFLCIYDFYNVDYIMGLAAIYQIKEQFQQAADLYAVAFALG
KNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAI
QDIKE
>gid:1155219  ipgD  secreted protein
MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEA
LNRLYLQNQTSLTGKSLLFARDKAEVFCEAIKLAGGDTSKIKAMMERLDT
YKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKN
WGPVNKNISHHGKNYSFQLTPASHMKIGNKNIFVKEYNGKGICCASTRER
DHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSERAVAARNKAEE
LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVS
ALKGLNSKRGGPTKLLIRNSDGLLKEVSVNLKVVTFNFGVNELALKMGLG
WRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ
IKEIVNNKLQKNDNGEPYKLSQRVTLLAYTIGAVPCWNCKSGKDRTGMQD
AEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGV
PGNKVMKKLPLSSLELSYSERIGDPKIWNMVKGYSSFV
>gid:1155220  ipgE  invasion-associated protein
MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCAL
PENINDLIYALSLNYSEKICLATDDEGGNLIARLDLTGINEFEDVYVNTE
YYISRVRWLKDEFARRMKGY
>gid:1155221  ipgF  invasion-associated protein
MSRFVFILLCFIPHLGRADCWDKAGERYNIPSSLLKAIAEKESGFNKSAV
NVNNNGSKDYGIMQINDFHSKRLREMGYSEEMLISHPCLSVHYAAKLLNE
FMMMYGRGWEAVGAYNAGTSPKKKKERLKYAEDIYRRYLRIAAESKQNNR
RI
>gid:1155159  ipgH  composite orf
MPLLDKLREQYGVGPVCSELHIAPSTYLLNGWIQGMGYPPGAKTLVFWYE
HRERISWATLWNLSHNVGGALAPVLIGFSFGFFGDSALDHARAAFIFPGV
LCMAMSVLIYFIQVDRPVSVGLPPIEEWKGNVVSHPAKGREQGPRLSIPD
IIRKHIIRNNKLIYCCIYGSFVYILRYGIVSWAPKFLSDSLDVGGKDMGK
LASMGGGSVFEIGGVAGMLLAGYLSVRLFRNSKPLTNTLFLALTIILLIA
YWYVPSGNEYLWLNYTILILLGLAVYGPVMFIGLYSMELVPKEAAGAASG
LSGTFSYIFGSIVATLGMGLVVDYLGWGPHLSY
>gid:1155360  merA  Tn501, mercuric reductase
MTHLKITGMTCDSCAAHVKEALEKVPGVQSALVSYPKGTAQLAIVPGTSP
DALTAAVAGLGYKATLADAPLADNRVGLLDKVRGWMAAAEKHSGNEPPVQ
VAVIGSGGAAMAAALKAVEQGAQVTLIERGTIGGTCVNVGCVPSKIMIRA
AHIAHLRRESPFDGGIAATVPTIDRSKLLAQQQARVDELRHAKYEGILGG
NPAITVVHGEARFKDDQSLTVRLNEGGERVVMFDRCLVATGASPAVPPIP
GLKESPYWTSTEALASDTIPERLAVIGSSVVALELAQAFARLGSKVTVLA
RNTLFFREDPAIGEAVTAAFRAEGIEVLEHTQASQVAHMDGEFVLTTTHG
ELRADKLLVATGRTPNTRSLALDAAGVTVNAQGAIVIDQGMRTSNPNIYA
AGDCTDQPQFVYVAAAAGTRAAINMTGGDAALDLTAMPAVVFTDPQVATV
GYSEAEAHHDGIETDSRTLTLDNVPRALANFDTRGFIKLVIEEGSHRLIG
VQAVAPEAGELIQTAALAIRNRMTVQELADQLFPYLTMVEGLKLAAQTFN
KDVKQLSCCAG
>gid:1155358  merR  Tn501 repressor
MVQTCACPFELKLDSVTLLPYSCTESSDMENNLENLTIGVFAKAAGVNVE
TIRFYQRKGLLLEPDKPYGSIRRYGEADVTRVRFVKSAQRLGFSLDEIAE
LLRLEDGTHCEEASSLAEHKLKDVREKMADLARMEAVLSELVCACHARRG
NVSCPLIASLQGGASLAGSAMP
>gid:1155076  mkaD  mouse killing factor
MPIKKPCLKLNLDSLNVVKSEIPQMLSANERLKNNFNILYNQIRQYPAYY
FKVASNVPTYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVG
DKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIG
AQFTLYVKSDQECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDV
RPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE
>gid:1155323  mob9  plasmid mobilization protein
MSLAGNPCVIRLAAQVCMWLKFIIRDRGGFSGGLLLFLPVCCRDRTERIL
AVHTIKILR
>gid:1155341  msbB  acyltransferase
MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIG
RWVGRLSRKARRRATINLSLCFPEKSDTEREIIVDNMFATALQSIVMMAE
LAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAA
QGEKMAAMFHQQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGY
WGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQAMIIPLFPVYD
EKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLK
LLKTRKSNEADPYP
>gid:1155347  mvpA  plasmid maintenance protein
MLKFMLDTNICIFTIKNKPASVRERFNLNQGKMCISSVTLMELIYGAEKS
QMPERNLAVIEGFVSRIDVLDYDAAAATHTGQIRAELARQGRPVGPFDQM
IAGHARSRGLIIVTNNTREFERVGGLRTEDWS
>gid:1155348  mvpT  plasmid maintenance protein
METTVFLSNRSQAVRLPKAVALPENVKRVEVIAVGRTRIITPAGETWDEW
FDGHSVSTDFMDNREQPGMQERESF
>gid:1155233  mxiA  invasion protein
MIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLAIL
VFMGSFYIERILSFSTFPSVLLITTLFRLALSISTSRLILVDADAGKIIT
TFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGMPG
KQMSIDADLKAGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAIAG
IIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLVSQIPALLISI
SAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFPFF
VFFLIAVTLTALFYYKKVVEKEKSLSESDSSGYTGTFDIDNSHDSSLAMI
ENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPTIL
YRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVVST
SYNERVISWVDVSYTENLTNIDAKIKSAQDEFYHQLSQALLNNINEIFGI
QETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLKLI
MESLALWAPREKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYIED
AIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAKKNFVLLVSVD
IRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI
>gid:1155232  mxiC  invasion protein
MLDVKNTGVFSSAFIDRLNAMTNSDDGDETADAELDSGLANSKYIDSSDE
MASALSSFINRRDLEKLKGTNSDSQERILDGEEDEINHKIFDLKRTLKDN
LPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVELLTKIINEII
SGSEKSVNAGINSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYI
EWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGFVLSKLIAIKM
IRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVI
EVSRASHEDSVVYQTYLSSVNESPHDIFKSESEREIAINILRELVTSAYK
KELSR
>gid:1155231  mxiD  Type III secretion protein
MKKFNIKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFE
RFSALLNYPIVVSKQAAKKRISGEFDLSNPEEMLEKLTLLVGLIWYKDGN
ALYIYDSGELISKVILLENISLNYLIQYLKDANLYDHRYPIRGNISDKTF
YISGPPALVELVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNM
RGEDIVIPGVATVVERLLNNGKALSNRQAQNDPMPPFNITQKVSEDSNDF
SFSSVTNSSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDVAKRH
IELSLWIIDIDKSELNNLGVNWQGTASFGDSFGASFNMSSSASISTLDGN
KFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS
LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSQSNYNYNNENTSVLP
EVGRTKISTIARVPQGKSLLIGGYTHETNSNEIISIPFLSSIPVIGNVFK
YKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSE
TTLLEDEKSLVSYLNY
>gid:1155230  mxiD  Type III secretion protein
MEGFFFVRNQNIKFSDNVNYHYRFNINSCAKFLAFWDYFSGALVEHSHAE
KCIHFYHENDLRDSCNTESMLDKLMLRFIFSSDQNVSNALAMIRMTESYH
LVLYLLRTIEKEKEVRIKSLTEHYGVSEAYFRSLCRKALGAKVKEQLNTW
RLVNGLLDVFLHNQTITSAAMNNGYASTSHFSNEIKTRLGFSARELSNIT
FLVKKINEKI
>gid:1155229  mxiE  putative lipoprotein
MSLKQGERQMIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPV
SKDYFSIPNDLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKE
VDGCFMDAQKIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRG
I
>gid:1155222  mxiG  Type III secretion protein
MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDN
FPENIIPVTDSKSDGIIYLTISKDNICQFSDEKGEQIDINSQFNSFEYDG
ISFHLKNMREDKSRGHILNGMYKNHSVFFFFAVIVVLIIIFSLSLKKDEV
KEIAEIIDDKRYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYIL
VSNKEINRIQQYINQRFPFINLYVLNLVSDKAELLVFLSKERNSSKDTEL
DKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNK
VTYSVREELTDEKLELINRLISEHKNIYGDQYIEFSVLLIDDDFKGKSYL
NSKDSYVMLNDKHWFFLDKNK
>gid:1155223  mxiH  Type III secretion protein
MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEY
QSKLSEYTLYRNAQSNTVKVIKDVDAAIIQNFR
>gid:1155224  mxiI  Type III secretion protein
MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIME
MVSNPESLNPESLAKLQTTLSNYSIGVSLAGTLARKTVSAVETLLKS
>gid:1155225  mxiJ  Type III secretion protein
MIRYKGFILFLLLMLIGCEQREELISNLSQRQANEIISVLERHNITARKV
DGGKQGISVQVEKGTFASAVDLMRMYDLPNPERVDISQMFPTDSLVSSPR
AEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLEEKNISSKPMHISVI
AIYDSPKESELLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPV
KEVKSEFLTNEVIYLFLGMAVLVVILLVWAFKTGWFKRNKI
>gid:1155226  mxiK  Type III secretion protein
MGIQNRVVQEKQNMIRMDGIYKKYLSIIFDPAFYINRNRLNLPSELLENG
VIRSEINNLIINKYDLNCDIEPLSGVTAMFVANWNLLPAVAYFIGSQESR
LINHSEMVISYYGGKISKQGEAAIRSGFWHLIAWKENISVGIYERINLLF
NPIALEGNYTPVERNLSRLNEGMQYAKRHFTGIQTSCL
>gid:1155227  mxiL  putative membrane protein
MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVI
KEATKKAESLRIDAVCDGYQIGIQTAFEHIIDYICEWKLKQNENRRNIED
YITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLE
LDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKL
TKNDKKYFKELAHKKLRQIAEDLLKENPVND
>gid:1155228  mxiM  putative membrane protein
MINQINASNALQQRLNSEEFVNLNERLSSSQSFDEDIIYEIMQYFSQSEL
NSIDNDELHNKIEQLFNSRFPYLTAAQKSSLLNKLIDANQYVDLHEGFYA
SLSIYNNIDFYIKTTTFDSLISVFEAGREADDSTW
>gid:1155104  parA  plasmid segregation protein
MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKSAVAKLP
KLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPK
YRDRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLD
LDPQSSATMFLNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGV
YVMPASIDDAFIASNWDTLCEEHLLGQNKHAILRENIIDKLKHDFDFILI
DTGPHLDAFLKNAIAAADIMFTPVPPAQVDFHSTLKYLARLPELVQIIEQ
DGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIFGGDMLDVSMPRLDGFE
RSGESFDTVISANPVTYVGSGEALKNARMAAEDFAKAVFDRIEFIRANY
>gid:1155105  parB  plasmid segregation protein
MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTEIKVDHD
KVDTQTFVVEEVNGREQTALTPDSLKDITRTIRLQQFYPCIGIRTGDLIE
ILDGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREI
GLRLVRLKEAGMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSE
LTYADYRQLAELSERLRLGDISIDEVVKNISPSIELITADDNLSEDEVKN
SIMRLITKEMSSLLDSGVKDKAVVTLLWKFDSKDKFARKRVKGRTFSYEF
GRLPLEVQDKLDRMIALVLKDNLNSL
>gid:1155283  phoN-Sf  phosphatase precursor
MKRQLFTLSIVGVFSLNTFASFPPGNDVTTKPDLYYLTNDNAIDSLALLP
PPPQIGSIAFLNDQAMYEKGRLLRNTERGKLAAEDANLSSGGVANVFSAA
FGSPITAKDSPELHKLLTNMIEDAGDLATRSAKEYYMRIRPFAFYGVSTC
NTKEQDTLSRNGSYPSGHTSIGWATALVLSEINPARQDTILKRGYELGDS
RVICGYHWQSDVDAARIVGSAIVATLHSNPVFQAQLQKAKDEFANNQKK
>gid:1155372  repA1  replication protein
MQDGVTDLQQTYYRQVKNPNPVFTPREGARTLPFCGKLMEKAVGFTSRFD
FAIHVAHARSLGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITT
LAIECGLATESAAGKLSITRATRALTFLAELGLITYQTEYDPLIGCYIPT
DITFTSALFAALDVSEEAVAAARRSRVEWENRQRKKQGLDTLGMDELMAK
AWRFVRERFRSYQTELKSRGMKRARARRDADRQRQDIVTLVKRQLTREIS
EGRFTASREAVKREVERRVKERMILSRNRNYSRLATASP
>gid:1155369  repA2  replication protein
MSQTENAVTSSLSQKRFVRRGKPMTDSEKQMAAVARKRLTHKEIKVFVKN
PLKDLMVEYCEREGITQAQFVEKIIKDELQRLDILK
>gid:1155371  repA6  positive regulator for repA1 expression
MPGKVQDFFLCSLLLRIVSAGWCD
>gid:1155339  rfbU  UDP-sugar hydrolase
MNILFTESSPNIGGQELQAVAQMKALKKMGHSVLLVCRENSKIAFEASKL
GIDITFALFRNSLHIPTAWRLLGIVHGFQPNAIVCHSGHDSNIVGLVRLF
TRKHPFRIIRQKTYLTRKTKVFSINHFCDEVIVPGTSMKTHLEQEGCRTR
VTVVPPGFDFQKLYVDSRNSLPPNVLSWLASRRGCPVIAQVGMLRPEKGH
EFMLNLLFHLKMNGRQFCWLIVGSGSPELREHLQYQIDSMGMHDDVFIAD
NVFPAAPVYRVASLVVLPSENESFGMVLAEASAFSVPVLASQIGGIPDVI
QNNQTGTLLPAGNKHAWMCALNDFFNDPGRFYQMARQAKQDIEERFDINK
TALKILTLAKHK
>gid:1155145  sepA  secreted protease
MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVAALSLSS
AWPALSATVSAEIPYQIFRDFAENKGQFTPGTTNISIYDKQGNLVGKLDK
APMADFSSATITTGSLPPGDHTLYSPQYVVTAKHVSGSDTMSFGYAKNTY
TAVGTNNNSGLDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTEFY
RLGGGMQYVKDKNGNRTQVYTNGGFLVGGTVSALNSYNNGQMITAQTGDI
FNPANGPLANYLNMGDSGSPLFAYDSLQKKWVLIGVLSSGTNYGNNWVVT
TQDFLGQQPQNDFDKTIAYTSGEGVLQWKYDAANGTGTLTQGNTTWDMHG
KKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRVSALNGQTW
MGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDG
TVILNQQADADGKVQAFSSVGIASGRPTVVLSDSQQVNPDNISWGYRGGR
LELNGNNLTFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNT
VSIFGGRGAPGDLYYDSSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDH
NKAIDTVKQQKIEASSQPYMYHGQLNGNMDVNIPQLSGKDVLALDGSVNL
PEGSITKKSGTLIFQGHPVIHAGTTTSSSQSDWETRQFTLEKLKLDAATF
HLSRNGKMQGDINATNGSTVILGSSRVFTDRSDGTGNAVSSVEGSATATT
VGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRVGS
FVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSP
VISTTEGINLEDNASFSVKNMGYLSSDIHAGTTAATINLGDSDADAGKTD
SPLFSSLMKGYNAVLRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSR
IELGDGKHFATLQVKELSADNTTFLMHTNNSRADQLNVTDKLSGSNNSVL
VDFLNKPASEMSVTLITAPKGSDEKTFTAGTQQIGFSNVTPVISTEKTDD
ATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNNLNKRMGDLRDTQ
GDAGVWARIMNGTGSADGDYSDNYTHVQIGVDRKHELDGVDLFTGALLTY
TDSNASSHAFSGKNKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANF
ASLGTKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWE
DRGMALSMKDKDYNPLIGRTGVDVGRAFSGDDWKITARAGLGYQFDLLAN
GETVLQDASGEKRFEGEKDSRMLMTVGMNAEIKDNMRLGLELEKSAFGKY
NVDNAINANFRYVF
>gid:1155075  shET2-2  enterotoxin
MSSMPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSELIQF
KNKTAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKV
NYQLLSTPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMK
KNGDFVRTLSACTLNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTV
FTCDSHFELSQLSAKDFFDDFYWKIYGLEQPGQVIFEDRHNSPLTNTVKL
LPDELINSRVIYHAITKNLTEVLFILMEKYKNGEISQSKLVNLLATRSSD
GTPAFYIALQNGYSDIIQVYGKILNMCNLSQETILTLLAAVGANNVPGLC
MSFMNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFFALQNG
HADSIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDI
LKILPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSF
TTRRLVEMLSATNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIA
EQFSKKMKKTFIEIINRFNHFL
>gid:1155338  shf  putative carbohydrate transport protein
MLNEGGILFKANHVPVLMYHHVSHCPGLVTLSPVTFRKQIKWLAENNWKT
LSSDELEFFYRGGKLPRKSVMLTFDDGYLDNWFQVYPLLKEFNLKAHIFL
ITGFIGNGPVRHSPGKEYSHRDCEHQIATGNADNVMLRWSEVNEMLQSGL
VEFHVHTHTHTRWDKKFSSREEQCKHLRQDLLSGREYLKEMTGKCSKHLC
WPEGYYNKDYIQVAEELGFYYLYTTERRMNAPAKGTTRIGRISTKERESC
AWLKRRLFYYTTPFFSSLLAFHKGPRLPDD
>gid:1155388  sopA  VirG-specific protease
MDISTKKVEFSMKLKFFVLALCVPAIFTTHATTNYPLFIPDNISTDISLG
SLSGKTKERVYHPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVS
GWTTLGNQKASMVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGW
LLNNLDYRLGLIAGYQESRYSFNAMGGSYIYSENGGSRNKKGAHPSGERT
IGYKQLFKIPYIGLTANYRHENFEFGAELKYSGWVLSSDTDKHYQTETIF
KDEIKNQNYCSVAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTN
ISGTIKNSASIEYIGFLTSAGIKYIF
>gid:1155243  spa-orf10  orf, hypothetical
MCYMGVNFCNKIGIDQSEFEIESSIINSIANEVLNPISFLSNKDIINVLL
RKISSECDLVRKDIYRCALELVVEKTPDDL
>gid:1155244  spa-orf11  orf, hypothetical
MIRQQKRLTIILLLLGVDKRDYSSCNVKTLLYSIRDYAKSVNDHEILTES
NRLLSHCISDSNGAFFKSSKYVPLKYLRKRRIARKIPND
>gid:1155236  spa13  invasion protein
MYRDVEALDKRIIYFLQLENDLEPVGAQSVSQLFNTRRKIAIVKKHIIQY
QSERILLKGRIEEIQKDIDEANASKRKLLHKESKICKRIGLIKRNNFAKQ
LILDELSQEDMKYGIR
>gid:1155234  spa15  invasion protein
MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNE
QVMLWANFDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQL
RVVIKDDYVHDGIVFAEILHEFYQRMEILNGVL
>gid:1155239  spa24  Type III secretion protein
MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNM
TLNGIALIMALFVMKPIIEAGYENYLNGPQKFDTISDIVRFSDSGLMEYK
QYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDA
FKIGFYLYLPFVVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGW
GILSKALIEQYINIPA
>gid:1155241  spa29  Type III secretion protein
MMDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIF
LVASGLITSGKVDIGSSVFEHVYFLMFKEIIVGLLLSFCLSLPFWIFHAV
GSIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILE
SIQLSYNICPLFSQCSFRISNILTFLTLLASQAVILASPVMIVLLLSEVL
LGVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKVQFFLGEHKF
FTNLFVR
>gid:1155237  spa32  invasion protein
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASN
YFTIDKHFEHKHDKGEIYSGIKNAFELRNERATYSDIPESMAIKENILIP
DQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNR
NGQIGLKNHSLSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSL
QEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGSFVLKPSDQFV
GNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>gid:1155238  spa33  Type III secretion protein
MLRIKHFDANEKLQILYAKQLCERFSIQTFKNKFTGSESLVTLTSVCGDW
VIRIDTLSFLKKKYEVFSGFSTQESLLHLSKCVFIESSSVFSIPELSDKI
TFRITNEIQYATTGSHLCCFSSSLGIIYFDKMPVLRNQVSLDLLHHLLEF
CLGSSNVRLATLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDNNEAKI
NLSESNGESEHTEVSLALFNYDDINVKVDFILLEKNMTINELKMYVENEL
FKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE
>gid:1155242  spa40  Type III secretion protein
MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDV
MLLYRYVIINDFEINEGKYFFAVVIVFFKIIGFPLFFCVLSAVLPTLVQT
KFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYF
FWINDRKIIFSQVFSSVDGLYLIWGRLFKDIILFFLAFSILVIILDFVIE
FILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEILSEQTKSDIRN
SKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPT
VRDVKLARKLYKTHTKYSFVDFEHLDEVLRLIVWLEQVENTH
>gid:1155235  spa47  invasion protein
MSYTKLLTQLSFPNRISGPILETSLSDVSIGEICNIQAGIESNEIVARAQ
VVGFHDEKTILSLIGNSRGLSRQTLIKPTAQFLHTQVGRGLLGAVVNPLG
EVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGIKVIDSLLTCG
EGQRMGIFASAGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYL
KNSEKKSRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTEGHKVALFIDS
LTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITA
FYTVLLEDDDFADPLAEEVRSILDGHIYLSRNLAQKGQFPAIDSLKSISR
VFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK
ISVVESFLKQDYRLGFTYEQTMELIGETIR
>gid:1155240  spa9  Type III secretion protein
MSDIVYMGNKALYLILIFSLWPVGIATVIGLSIGLLQTVTQLQEQTLPFG
IKLIGVSISLLLLSGWYGEVLLSFCHEIMFLIKSGV
>gid:1155289  stbA  plasmid stable inheritance protein
MLKVSCDDGSTNVKLAWLEDGEVRTSLSGNSFKEGWNPGLFNAGKVYNYV
VDEKKYTYDLGSTAVIGTTHVSYQYSTTNLLAIHHALLTSGLQPQDVELT
VTLPVTEFFDNDNQPNEERIERKKANVLREISLNKGETFKIKKVNVMPES
LPAAFESLKKDKVNKLERSLIIDLGGTTLDCGLILGAFEGISEIRGYSEI
GTSRITHTVMNALTKASTPCNYFIADELIKNRHDNEYLQTLINDVAEIKN
ISHVIDREVKSLAESIRQEISTFSGMNRIYLTGGGAELIYPHIKQYFPNL
KVNKVDEPQFALVKAMVHA
>gid:1155288  stbB  plasmid stable inheritance protein
MESSDPKKRKKVVAYLHPALYPQDNLTQQTIDSLPVQMRGDFYRQSLICG
AALYSVDPRLLTLISVFFSEKITAENLVKLIEQTTGYTSTSIDISVLKNI
IEASSENKSESITSKDDFEEQTRRNLSMLKK
>gid:1155364  tnpA  Tn501 transposition transposase
MPRRLILSATERGTLLALPESQDDLIRYYTFNDSDLSLIRQRRGDANRLG
FAVQLCLLRYPGYALGTDSELPEPVILWVAKQVQTDPASWTKYGERDVTR
REHAQELRTYLQLAPFGLSDFRALVRELTELAQQTDKGLLLAGQALESLR
QKRRILPALSVIDRACSEAIARANRRVYRALVEPLTDSHRAKLDELLKLK
AGSSITWLTWLRQAPLKPNSRHMLEHIERLKTFQLVDLPEVLGRHIHQNR
LLKLAREGGQMTPKDLGKFEPQRRYATLAAVVLESTATVIDELVDLHDRI
LVKLFSGAKHKHQQQFQKQGKAINDKVRLYSKIGQALLEAKEAGSDPYAA
IEAVIPWDEFTESVSEAELLARPEGFDHLHLVGENFATLRRYTPALLEVL
ELRAAPAAQGVLAAVQTLREMNADNLRKVPADAPTAFIKPRWKPLVITPE
GLDRRFYEICALSELKNALRSGDIWVKGSRQFRDFDDYLLPAEKFAALKR
EQALPLAINPNSDQYLEERLQLLDEQLATVARLAKDNELPDAILTESGLK
ITPLDAAVPDRAQALIDQTSQLLPRIKITELLMDVDDWTGFSRHFTHLKD
GAEAKDRTLLLSAILGDAINLGLTKMAESSPGLTYAKLSWLQAWHIRDET
YSAALAELVNHQYQHAFAAHWGDGTTSSSDGQRFRAGGRGESTGHVNPKY
GSEPGRLFYTHISDQYAPFSTRVVNVGVRDSTYVLDGLLYHESDLRIEEH
YTDTAGFTDHVFALMHLLGFRFAPRIRDLGETKLYVPQGVQTYPTLRPLI
GGTLNIKHVRAHWDDILRLASSIKQGTVTASLMLRKLGSYPRQNGLAVAL
RELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNSLARAVFFNRLGEI
RDRSFEQQRYRASGLNLVTAAIVLWNTVYLERATQGLVEAGKPVDGELLQ
FLSPLGWEHINLTGDYVWRQSRRLEDGKFRPLRMPGKP
>gid:1155269  tnpC  IS629 orf, fragment
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHHDKRSARAQRDDWL
KKEILRVYDENHQVYAVRKVWHQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVHTTVSRKAVAAGDRVNRHQGNVPRTPGGPQRLVYVVSAADKDKHTS
AVPSALRQRCPQGFYPVQRYGAPRLTDELCALVTTLT
>gid:1155278  tnpC  IS629 orf
MARCTVARLMAVMGLAGVLRGKKVRMTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHIPPAEA
EKAYYASIGNDDLAA
>gid:1155129  tnpC  IS629 orf
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>gid:1155110  tnpC  IS629 orf
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>gid:1155163  tnpC  IS629 orf
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKLIMLPSETMIWQPEFTDKTLSRKPGAVQSASPVPPDRARHS
GPRITCVRRQKKP
>gid:1155334  tnpC  IS629 orf
MLREGIRVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQF
VAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFV
LDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLG
HIPPAEAEKAYYASIGNDDLAA
>gid:1155142  tnpC  IS629 orf
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHTPPAEA
EKAYYASIGNDDLAA
>gid:1155382  tnpC  IS629 orf, fragment
MAKFSVPLHRRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSY
DNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVGWYNNRRLLERLGH
IPPAEAEKAYYASIGNDDLAA
>gid:1155262  tnpC  IS629 orf
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGCIVGWRVSSSMETTFVLDALEQA
LWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHTPPAEA
EKAYYASIGNDDLAA
>gid:1155389  tnpC  IS629 orf, fragment
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAIFSHENVINSVNQFKKYT
LYLRK
>gid:1155333  tnpD  IS629 orf
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSGV
KRSVRPSAGKPLPQATA
>gid:1155277  tnpD  IS629 orf
MMPLLDKLREQYGGGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG
VKRSV
>gid:1155261  tnpD  IS629 orf
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSGV
KRSVRPSAGKPLPQATA
>gid:1155111  tnpD  IS629 orf
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG
VKRSVRPSAGKPLPQATA
>gid:1155128  tnpD  IS629 orf, fragment
MPLLDKLRAQRDDWLKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALW
HVSWRLWDLPVFSGVKRSVRPSAGKPLPQATA
>gid:1155165  tnpD  IS629 orf
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDALWHVSWRLWDLPVFSG
VKRSVRPSAGKPLPQATA
>gid:1155141  tnpDE  IS629 orf
MMPLLDKLRAQRDDWLKKEIQRVYDENHKVYGVKSGVSCYGKVSEWPDAL
WHVSWRLWDLPVFSGVKRSVRPSAGKPLPQATA
>gid:1155169  tnpE  IS629 orf, fragment
MTWELILDGYSESSYSATPRFAAARLPWFTDKTLSRKPGAVHCVSGFASM
SGIPGAVHTNIPQTVDFGTPRRRPATTRGKCSSK
>gid:1155140  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:1155127  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>gid:1155112  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:1155158  tnpE  IS629 orfA
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>gid:1155276  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:1155166  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:1155260  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>gid:1155270  tnpE  IS629 orf, fragment
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCILETLRVW
IRQHERDTGGGEVGSPPLNVSV
>gid:1155332  tnpE  IS629 orf
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWATICSIAPKIGCTPETLRVW
GRQHERDTGGGDGGLTTAERQRLKEPERENRELRRSNNILRLASAYFAKA
EFDRLWKK
>gid:1155306  tnpF  IS2 orf2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>gid:1155304  tnpF  IS2 orf2
MITDVWKYRGKSTSELIVLILVFRLVIGEQIIDVLEPEKRRRRTTQEKIA
IVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPA
SELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDG
E
>gid:1155180  tnpF  IS2 orf2
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>gid:1155303  tnpG  IS2 orf1
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTVSAPSATSCEN
CPVWQQSRPSILMDTNDEFPDNKRYSLLPFLFA
>gid:1155307  tnpG  IS2 orf1
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>gid:1155181  tnpG  IS2, orf1
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>gid:1155308  tnpG  IS2 orf1, fragment
MYLKIRDRLGYMSNTSSNFEMTGTLLGLELRKRKTPQEKIAIIQQTMEPG
MTVSHVARLHGIQPSLLLKWKK
>gid:1155130  tnpI  IS600 orfB, fragment
MCRVPGVSRSGYYDRVQHAPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELADNGIIVG
>gid:1155200  tnpI  IS600 orfB
MYIRTRETYGTRRLQTELADNGIIVGRDRLARLRKELRLHCKQKRKFRAT
TNSDHNLPVTPNLLNQNFTPTAPNQVWVADITYVATREGWLYLAGVKDVY
TCEIVGYAMGERMMKELTGKALFMALRSQRLPAGLIHHTDRGSQYCAYDY
RVIQEQSGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAIS
VIREYIEIFYNRQRRHSRLGNISPAAFRIKYYQMTA
>gid:1155199  tnpJ  IS600 orfA
MSRKTQRYSTEFKAEAVKTVPENQLSISEGASRLSVPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155131  tnpJ  IS600 orfA
MSRKNQRYSTEFKAEAVKTVPENQLSISEGASRLSLPEGTLGQWVTAARK
GLNTPGSRTVAELESEVMQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:1155363  tnpR  Tn501 transposition resolvase
MQGHRIGYVRVSSFDQNPERQLEQTQVSKVFTDKASGKDTQRPQLEALLS
FVREGDTVVVHSMDRLARNLDDLRRLVQKLTQRGVRIEFLKEGLVFTGED
SPMANLMLSVMGAFAEFERALIRERQREGITLAKQRGAYRGRKKALSDEQ
AATLRQRATAGEPKAQLAREFNISRETLYQYLRTDD
>gid:1155346  traD  DNA transport protein, fragment
MSVKLRLPQISESGEVVDMAAYEAWQQENHPDTWQQMQRREEVNINVHRE
RGEDVEPGDDF
>gid:1155350  traI  oriT nicking and unwinding protein, fragment
MLTGNLVMALFNHDTSRDQEPQLHTHAVVTNVTQYNGEWKTLSSDKVGKT
GFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEA
FSGRSQTIREAVGEDASLKSRDVAALDTRKSKQHVDPEVRMAEWMQTLKE
TGFDIRAYRDAAEQRAYTRTQTPGPASQDGPDVQQAVTQAIAGLSERKVQ
FMYTDLLARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTSG
IHMLDELSVRALSRDIMKQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSL
AIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNLKQDERLS
GELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVL
ITDSGQRTGTGSALMAMKDAGVNTYRWQGGEQRPATIISEPDRNVRYARL
AGDFAASVKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTA
LSPVWLDSRSRYLRDMYRPGMVMEQWNPETRSHDRYVTERVTAQSHSLTL
RNAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGD
RLQVASVSEDAMTVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVS
DSATVFASVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSF
TVVSEQIKARAGETLLETAISLQKAGLHTPAQQAIHLALPVLESKNLAFS
MVDLLTEAKSFAAEGTGFADLGGEINAQIKRGDLLYVDVAKGYGTGLLVS
RASYEAEKSILRHILEGKEAVTPLMERVPGELMEKLTSGQRAATRMILET
SDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEM
RSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARA
YALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVVIMKEIVRQT
PELREAVYSLINRDVERALSGLERVKPSQVPRLEGAWAPEHSVTEFSHSQ
EAKLAEAQQKAMLKGEAFPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHL
NEDRRVLNSMIHDAREKAGELGKVQVMVPVLNTANIRDGELRRLSTWENN
PDALALVDNVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYT
PDTIRVGTGDRIRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVI
RPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAY
VALSRMKQHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAER
LFSTARELRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFD
RNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESL
LADNMQDGVRIARDNPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSV
QPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGK
TEQAVREIAGQERDRAAITEREAALPESVLREPQRVREAVREVARENLLQ
ERLQQMERDMVRDLQKEKTPGGD
>gid:1155349  traI  oriT nicking and unwinding protein, fragment
MSKGYTFMMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGQGAEQLGL
QGSVDKDVFTRLLEGRLPDGADLSRMQDGSNRHRPGYDLTFSAPKSISMM
AMLGGDKRLIEAHNQAVDFAVRQVEASAST
>gid:1155351  traX  F pilin acetylation protein
MYCVNHNGCGKRSGKLPGKICCRSVCSRWSGIWFVTCRKRKPRAETDTGR
QTLMTTDNTNTTRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLD
HINLIFQLKQEWMFLAGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWG
IIAQFAYYLAGFPWYEGNILFAFAVAAQVLTWCETRSGWRTAAAILLMAL
WGPLSGTSYGIAGLLMLAVSNRLYRAEDRAERLALVACLLAVIPALNLAT
SDAAAVAGLVMTVLTVGLVLCAGKSLPRFWPGDFFPTFYACHLAVLGVLA
L
>gid:1155096  trcA  putative chaperone
MIIMLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKINTSI
LSSVSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSRKIGDNLRKQ
IFKQVEKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKND
TTSNVANLISDQFFEKNVQYIDLKKLRGNMSDYITNLESPF
>gid:1155279  ushA  UDP-sugar hydrolase
MSEQRKPCKRGCVHTGTMIPLKKNITLIMFTLSLLTGNPAIAYETDKVYK
ITVLHTNDHHGHFWRNNHGEYGLSSQKTLVDNIRQKVINNGGSVLLLSGG
DINTGVPESDLQKAEPDIRGMNLIGYDAMAVGNHEFDNPLNILRQQEKWA
TFPFLSANIYQKSTGRRLFSPWKIFIRQNLKIAVIGLTTDDTAKTGNSEY
FTDIEFRQPAAEARSVIDELNQQEKPDIIIAATHMGHYDNGESGSNAPGD
VEMARSLPTGSLAMIVGGHSQAPVCMASDNKKQWNYIPGTTCVPDKQNGI
WIVQAHEWGKYVGQADFEFCNGTMKLVNYQLHPVNLKMRITREDGKTEFS
FYTPEITEDPQMLSLLTPFQNKGKAQLDVKVGVVNGRLEGDRSKVRFVQT
SMGHLILSALTERIDADFAVVSGGEIRDSIESGNITYKDILKVQPFGNTV
VSIDLTGKEVADYLATVAQMKPDSGAYPQFLNTSFVVKKGKIEMLKIKGK
SVDLNKKYRMTTFSFNATGGDGYPRIDNRPGYINTGFIDAEVLIEYIRKH
SPLDAASYEPKGEVSWQ
>gid:1155271  virA  secreted VirG-processing protein
MQTSNITNHERNDSSWMSTVKSTTEVSWNKLSFCDILLKIITFGIYSPHE
TLAEKHSEKKLMDSFSPSLSQDKMDGEFAHANIDGISIRLCLNKGICSVF
YLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKP
APEVIETAINCCTSIIPNDDYFHVKDTDFNSVWHDIYRDIRASDSNSTKI
YFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVNKIINSNFPQQ
DLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSEN
IENRLNFMPEQRVINNCGHIIKINAVVPKNDTAISASGGRAYEVSSSILP
SHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV
>gid:1155208  virB  transcriptional activator
MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTN
VRDQESLTEESLADIIKTIKLQQFFPVIGREIDGRIEILDGTRRRASAIY
AGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMS
YKDIAKKENLSRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYY
KGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNIIKKSKNRKQN
PSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRD
IISRHLSSS
>gid:1155119  virF  transcriptional activator of virulence loci
MVYSVEFMMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEG
QIAFIERNIQINVSIKKSDSINPFEIISLDRNLLLSIIRIMEPIYSFQHS
YSEEKRGLNKKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSD
EEALYTSISIASSLSFSDQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRK
RLESEKLTFQQILLDIRMHHAAKLLLNSQSYINDVSRLIGISSPSYFIRK
FNEYYGITPKKFYLYHKKF
>gid:1155272  virG  invasion protein
MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIA
FATPLSGTQELHFSEDNYEKLLTPVDGLSPLGAGEDGMDAWYITSSNPSH
ASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGM
ILGGSGGSGADHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNG
GDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITGSSLSVINKGT
FAGGNGGAAYGYGYDGYGGNAITGDNLSVINNGAILGGNGGHWGDAINGS
NMTIANSGYIISGKEDDGTQNVAGNAIHITGGNNSLILHEGSVITGDVQV
NNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN
SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFS
VSNISIINPLLTTGAIDYTKSYISDQNKLIYGLSWNDTDGDSHGEFNLKE
NAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNIN
AGILKMGTVEAMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNIN
APFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGNWHGKGGILSL
GAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSS
DKNAFIQKGRIVAGSYDYRLKQGTASGLNTNKWYLTSQMDNQESKQMSNQ
ESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL
YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQI
GGDILKTNFTDHDGLHVGIMGAYGYQDSKTHNKYTSYSSRGTVSGYTAGL
YSSWFQDEKERTGLYMDAWLQYSWFNNTVKGDGLTGEKYSSKGITGALEA
GYIYPTIRWTAHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGG
NNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSKQYGVIMNGMS
NHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKY
TF
>gid:1155340  virK  virulence protein
MFSVSNLSFIGFLKRIVFSSDSLPGKWEHRKFRFMYILRCAINPVASIRY
YYELRSLQCIEDILAIQPTLPARIHRPYLHKGGRAWSRGQYILEHYRFVQ
NLPEKYSEFLFPQKSVSLVQFIGKDGEDFDIQCSPSGFDREGELMLSLFF
NKIVIARLTFSVILTQNGHTAFIGGLQGAPKNTGPDVIRCATRACYGLFP
KRIIFEAFCALMKACNVSECLAVSEHSHVFRQLRYWYQKRKTFVAVYSDF
WESVAGKTCGDWYKLPTQVVRKPLSNIASKKRSEYRKRYALLDYIHETAI
RSLDAYPVNSEHQDLN
>gid:1155320  yacA  orf, hypothetical
MVMTRWQMAQVNMSVRIDAELKDAFMAAAKSMDRNGSQLIRDFMRQTVER
QHNSWFRDQVAAGRQQLECGDVLPHDMVESSAAAWRDEMSRKIADK
>gid:1155321  yacB  orf, hypothetical
MAAIEPDERIGYSASSLAGQPYKGRNGRVEGTSGPHKVACNVILCENLL
>gid:1155291  yccB  orf, hypothetical, fragment
MDFDTPWCQPESDVIAELSRRFSCTLEHWYAEQGCDFCGWQLYERGELVD
VLWGELEWSSPTDDDEQPEVTGPAWIVDNVAHYGG
>gid:1155290  yccB  orf, hypothetical, fragment
MRHGLMEAACERRIPMPNWCSNRMYFPGEPAQIAEIKRLASGAVTPLYRR
ATNEGIQLFLAGSAGLLQITENIRSEQCPGVTAAGRGAVSTENIAFTRWL
THLQNGVLLDEQNCLMLHELWLQSGTGQRRWEGLPDDARETITVHFTAKR
GDWCDIWGNEDVSVWWNRLCDNVVPEKTMPFDLLTVLPTRLDVEVNGFNG
GVLNGVPSAYHWYTERYGVKWPCGYDLNISSREKTSFRWISTRRGVSRKA
TLLQN
>gid:1155292  ycdA  orf, hypothetical
MSRFVLGNCIDVMARIPDNAIDFILTDPPYLVGFRDRQGRTIAGDKTDEW
LQPACNEMYRVLKKDALMVSFYGWNRVDRFMSAWKNAGFSVVGHLVFTKN
YTSKAAYVGYRHECAYILAKGRPRLPQNPLPDVLGWKYSGNRHHPTEKPV
TSLQPLIESFTHPNAIVLGPFAGSGSTCVAALQSGRRYIGIELLEQYHRA
GQQRLAAVQRAMQQGAANDDWFMPEAA
>gid:1155293  yceA  orf, hypothetical
MNYAGHEKLRAEVAEVANAMCDLRTTMNEMERRYSFNADTLPERLVRQTL
FRANRLLMEAYTEILELDSCFKD
>gid:1155295  yceB  orf, hypothetical, fragment
MEIISNVRENRQVTVPAELLETLTQIAEQALWKREWAARDHGFPLPEYVT
RRQAMVDQARSLLKNNTHEND
>gid:1155294  yceB  orf, hypothetical, fragment
MYGTCETLCRALAAKYSGDTPLMLVIWSPEEIQALADGMDISLSDHEIRT
VLAHLEDIPED
>gid:1155296  ycfA  orf, hypothetical
MNETLNALICRHARNLLLAQGWPEETDVDQCNPNYPGWISIYVRLDAPRL
ATLLVNRHDGVLPPHLASAIQKLTGTGAELVLSGSQWQSLPVLPADGTQV
SFPYAGEWLTEDEIRAVLDAVRDAVCSVSCRGAEDARRIRAALTTSGQTL
LTRQTRRFRLVVKESDHPCWLDEDDENLPVVLDAILNRGARFSAVEMYLV
SDCIEHILSSGLACDVLRIPDEPPRRWFDRGVLREVVREARAEIRSMADA
LAKIRK
>gid:1155353  yigA  orf, hypothetical
MNGFRNSSRNGQVWRYQRAGGRAVILEVSGRWMEAAEAWRRAACVAPRTD
WQQFARKRAEHCHRRCRGRV
>gid:1155354  yigB  orf, hypothetical, fragment
MSADIHGRVVRVLDGDTIEVMDSLKAVRIRLVNIDAPEKKQDYGRWSTDM
MKSLVAGKTVTVTY
>gid:1155368  yihA  conserved plasmid hypothetical protein
MKLIIFILIVLIIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILS
SGEYWQRQKTLLTEREVSFMKGLFRIVDMKRWYLCPQVRVADIVQLNGNI
RPRSRQWWQLFRMVSQWHVDVVIVELRSFSIVAAVELDDASHLRPERRRR
DILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNTTGADQQSPEHRS