GenColors

Gene list

Applied filters:

Gene type: CDS
Genomic element: megaplasmid

Number of genes found: 148

# Shewanella oneidensis MR-1, MR-1

>gid:864761  SOA0001  replication protein RepA
MDRSENALLPIPVTENLNSGKLLTISSTTSVQPNVLLRTGVFTPVGRRTN
ANDVKAQDLSNDLVNLDVCQKEGYDLVTVRGQRLNIETDFKVWCGIVLVF
SKYGYSSNTVKLTFSEFAKFCSYPSRRFDKNLRKQIGDSLGRIQSQSLSF
RRKNSEKAVHTGMLLRAMYDGEEDIVELMADETLWDLYRLDYQVLVSLRV
LEKLPRAEVAQCLYLYFTSLPENPHPVSFERLRERLRLETSKKEANRKIK
TGIQKLESIGYLSGSFAVKNGEQYYLIDQRYKKLEAAITGSLF
>gid:864762  SOA0002  type II restriction-modification system activator, putative
MRKPSKKLLDNLANNVRTFRLKNGISQEQLAEICGFHRTYIGSIERGERN
TTLSTLEVLAKTLNVSIAQLLNDDE
>gid:864763  SOA0003  type II restriction endonuclease, putative
MNSLMTGLIKVDNITRSGLTIYDRIRTGDQQFWLTSEELSAVLNLRLKGT
SFYGLAIRTRSKVAKELVCNALGYKVPSSFVKCQPRFSGQMFDTYVQKSN
NLQVWNEELDVERRYVIIRTDENDIITQVKVIAGSDLAVLDTTGTLTQKF
QARLTLSDQKMELISHFDTLNIQKILAPISLNLEIDSPVSQPQNGKLLPI
GECFRRLSKIVGCSFTDAGRTQERNRGAALHALVCKALGYTDYRDNGQFP
DIKHQLLEVKLQTSPTIDLGLVCPNSQANLDIEQLGLQQIRHCDVRYAIF
YGYIVNGLVHITNLYLTTGEDFFNRFEQFGGKVLNKKIQIPLPRDFFS
>gid:864764  SOA0004  type II DNA modification methyltransferase
MTMSRDCILDSTVDRYQHVGCRKSEGAHYTPTRLSQFVSEKIIEKLKKKE
SIVIADPAIGDGELILSFLSSLDSTDNIEVIGFDINLESIELSKKRILNF
YPNVRINLIHGDYLDYCINGNSDLCEYKLPKFDAIIANPPYVRTQVLGAE
QSQFLSKNFGLKGRVDIYQAFLIGMSKCLSEDGVAGVIVSNRFLTTKGTG
ALRQSLHDLYDIYNIWDFGDTKLFEAAVLPAVLLFKMKAEKPDFSTEFRS
IYETNENEVDFAETPVDAISLDGVVRCSNGKNYLIKAGLLDYDSSPKDIW
RVKDLTSEKWLDDVYRKTWATFGEVGKIRVGVKTTADNVFIKSSWIDETG
LVPELIRPLITHHVAERFKQSDAETKYILYTHEMVNGKRKAIDIDKYPIS
KMYLEQHRVQLEGRNYVIEANRKWFEIWVPQSPLLWAENKIVFRDICEEP
TFWLDNKQSIVNGDCYWMVNDYKKEETDLLWLVLAVANSKFIEYFYDIKF
NNKLYSNKRRFISQYVEQFPLPDPKSVISVKMIMLAKSIYNCTDKQERNE
SEKLLDDLVWEAFDLPCPIF
>gid:864765  SOA0005  hypothetical protein
MSKLTGVTGLNRRKSGKESQESPKPQEKITTVSTRRNITTGTSPLPLRLT
ENDRADLNLWLEALEAESGKNITAAKLFRGLISMRDKINTKALIRAINDV
T
>gid:864766  SOA0006  ParA family protein
MPIHKDFNVLHLSYILGRGYAVKTLAVINQKGGVGKTTTVINLSAQLAHE
GKRVLVIDLDPQANLSVVLTGGQFEFEHSITDVFESSKKCPIQQAIMPAQ
SNGEAIPNLCICPTDIRLSRVIEQSLTKVHRERILLKQLEAIASDFDIVI
LDCPPNLSLTSVNAMMAADMFLIPVDGGSFSLNGLADLLDALEEVKESEH
VNYAVFRNEFAKANKLINNFLDEQLASLEGKVLATTIRRSEDVGQASVSG
QTLLNYKPSSLTLADYKSLAKEVMQRLNV
>gid:864767  SOA0007  hypothetical protein
MYENKHGTPEHARMQPMRFLKLKSTAPRANGAAYKHALIVDAPRRFAPSS
QRTETSLSRQKNHLIEGAHDSGKTRWLVRLYENWADIWGAKIQEEPLYLS
ALEPITDWIDAPHVEAWWAEVECERALAADEPPRKWYKLSQKHKLEALSN
YLHATKTLLFLDDAHRLTGRKLQVARQCVLGSKYWIISCSSENRLPPTLR
TIVERGAPQRTRLESDASYDSTKVLMWMLIVGFAAAGVWEAAIVLGGFQM
LGTGRRAARAD
>gid:864768  SOA0008  hypothetical protein
MLIKRYAQILWISGFLLSNTACARDPWNDKEQKPDIRPFEVPRLQGDTQY
QDPIPSLLPAPEIDPDSIYTAALNCYPEESKFKVDVDLVAGYRRVSDQYD
TSGWPALSEHYIGVVGKIPLLSDSEDSRARDREYLRRTRTAEAVAGFAQA
LANRNFAYREIGLYMALEARAIVRVGKGIAPTEEQITMMEKVAEAQRNVT
KYSAEAVQFRLAIIAMCDDQKAPVLNDYLKQIAFLPKAK
>gid:864769  SOA0009  hypothetical protein
MPFYLKLNKRGNAMKTMRWKQVIIYSSIMSATFYIAAFAYFWLQPVPPMR
CMNELSELADIALQEYKAGRTKAFTQDMYFNEETKQWCYFDEVD
>gid:864770  SOA0010  conserved hypothetical protein
MKWINHTLIAGSICAVISPIHVAPVVIGATAPDWMEWVVKLTGRHIKHRG
VTHILTHWLIAALATTFLFDYQGMLTAFAWGGFSHIITDAMTVTGVPFSP
YSDRRFHLFGGRFRTGDPVEYAISFAVVVLCILFIKLLGDSGFIPFFYDW
AGFYDKGIIDAYEWKQNRFRIL
>gid:864771  SOA0011  conserved hypothetical protein
MKHYQEIFSKIKAAIANAPRNQATAEMHLQMIKYADQLEHITAKEFCEGT
DLKASFGTEFSKMRNISARLKAAGLDVAKI
>gid:864774  SOA0014  hypothetical protein
MPARVTCYELLLVLAVLPYNGWEALKAQIFR
>gid:864775  SOA0015  hypothetical protein
MRAHCCLAHTVACRGKSKGELVIATVRPAAMRAQSATPNARLSGASGI
>gid:864776  SOA0016  ISSod9, transposase
MPRRSILSAAERDSLLVLPDTQDELIRHYTFSEPDLSLIRQRRGDANRLG
IAVQLCLLRFPGQGLLPDATVPMPLLQWIGQQLQLDPVCWPQYAEREETR
REHLLELRVYLGMEPFSQVHHRQAVHTTTELALQTDKGIVLANSVVETLR
HKHIILPTLDVVERVCAEALTRANRRIYDTLTEPLSGSHRHRLDDLLKLR
DNNKTTTLAWLRLSPVKPNSRHMLEHIERLKVWQALDLPIGVDRLIHQNR
LLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRI
MGKLFNDAKKRHQKQFQASGKAINAKVRLFGRIGQVLIDAKQAGDDPFAA
IEGVISWEAFAKSVTEAQSLAQPEEFDFLYRLGESYATLRRYAPTFLTAL
KLRAAPAAKGVLEAIEVLRSMNNDNARKVPADAPIDFIKPRWQKLVITDT
GIDRRYYELCALSEMRNALRSGDIWVQGSRQFKDFEDYLVPPAKFVSLKQ
TNQLPLAVATDCEQYLNERLTQLETQLATVNSMAQANELPDAIITASGLK
ITPLDAVVPDTAQRLIDQAARILPHVKITELLLEVDEWTGFTRHFAHLKS
GVLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDET
YGAALSELVNAQYRHPFAEHWGDGSTSSSDGQNFRTGNKAESTGHINPKY
GSSPGRTFYTHISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEH
YTDTAGFTDHVFALMHLLGFRFAPRIRDLGETKLYIPKRDVTYEGLKSMI
GGTLNIKLIRTHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVAL
RELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGEI
RDRSFEQQRYRASGLNLVTAAIVLWNTVYLERVAHGLRAKGHAVDEELLQ
YLSPLGWEHINLTGDYLWRSSAKIGSGKFRPLRPLSPA
>gid:864777  SOA0017  ISSo9, nucleotidyltransferase domain protein
MRPSAVLALKRSVIRETASRFRVTNPRVFGSVLDGTDLDGSDLDLLVDAL
PGATLFDLGGLQDELESLLGLQVDLLTPGDLPPKFRAQVLAEARPV
>gid:864778  SOA0018  ISSod9, conserved hypothetical protein
MKVNRLPDYLDHMHQAASDACYFVEGLDKDEFLVDKRTQQAVIMSLIVIG
EASTKVMDGYSEFVIAHSDVPWRSMRGMRNRIAHGYFDINLDVVWDTVQT
ALPTLLSQLAAVRYDVDKNEGHDLC
>gid:864779  SOA0019  ISSod9, DNA-invertase
MLIGYARVSTQDQHLELQREALLKAGCEKVFEDTISGTRADRLGLSKALE
ILREGDTLVVWKLDRLGRSVKQLVELVSDLHKQNVQFKSLTDSIDTGTPS
GRFFFHVMASLAEMERDLIVERTRAGLDVARQLGRKGGRKPKMTDSKIES
AKKLLASGVPPKDVAKNLGVSVPTLYRWLPASAHA
>gid:864782  SOA0024  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYIKYYNLERLHSAN
NDLSPVEFENSQVKVSSLG
>gid:864783  SOA0025  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864784  SOA0026  site-specific recombinase, resolvase family
MTHGILSGTEEHTMIAKVLTEFMIDLAAAMARDDYETRRKRQAQGIEKAK
TLGKYLGRQPDHGLRQNIRLLLDEGKSWSQVQSLLKCSRSTIAKAVKLNE
LR
>gid:864785  SOA0028  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864786  SOA0029  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYIKYYNLERLHSAN
NDLSPVEFENSQVKVSSLG
>gid:864787  SOA0030  hypothetical protein
MASRNVEPRQACLPLVDHVIAGSEAKSPTEGNAQFKTNLL
>gid:864788  SOA0031  partition protein, ParB family, putative
MTNSIQAQATHTKATTLPASPKTQAGSHKPDIAQKSATAANAVVALASTA
KPVLLQLQINQLVLSEKNVRKENASKADDEQLYASILAHDILQNLIVEPM
NAQGLYPVLGGGRRLRQLIKAVKNNKLKPKTLVPVKLLTAEDVANYATEL
SMTENFTRAKMHPVDEFHAFADMVNKGASIADVAARFGVTAKFVQQRMKL
SMVAPVVLDAYKAGNVSLDVVMIFTIASVEKQVEVWELAGDRRYNENQFR
NMLKDAAVNADHYLAQFVGQEEYEKAGGVVTSDLFSDEVYLDDKALLESL
AIAKLELEGAKLVMQGWKWADYKLVSEYDELAGFGRLTLKEGEYDPAEMA
LAGCLLVLKSYGEPVSIYMGLVHKDDKKALAQLQASVQSDPLNVGDVKKI
EEKDTSGYSAALNDDLRAQRLIIAKHALMSAPSVALDTLHFSVCVSAFTD
SRYGSRPLHISVNDTTCHPKTGSLTDNKAVQLIEGVKASLNLAWVGLPTV
AERFNAFCALDVKEKEKQVAYATASMFEASIDNSHQAVEAVISTLDVKWS
NYWRPTAETFFKRVSLSCLIDMAQPVMGEQWALQAAALKKKDLANQVDSL
VNGERKGLDDAQKAYFDALMPAGF
>gid:864789  SOA0032  conserved hypothetical protein
MLFMVGIESPADEMQAFGIVIPVFEKLGYGCFSAADSQEEILFKAKEAIL
LTAEEVINDGHLVDSLNEGYRDYQTLHPKFDQWLALEVPLEALKAKQKRL
NITLSESQIVRIDSFVAFHREFKDRSDFLAKAADKLMNSAESVRSCSKVG
D
>gid:864790  SOA0033  hypothetical protein
MGKVTGIPFSEVKAQLMDNPEVVAAYEQAVQADEPIHIIPVQHSEGVIMP
INNIVSCTRQPNRASFSFVNLFGGHHYPVTMIFMPSGAVNVVVAGARYRC
QANQVQRVIDESGLEHEGLACLKYSIEAGELNFHIPADVHEQLKEHPEFQ
KCRLIEWEDEEM
>gid:864791  SOA0034  hypothetical protein
MSQRAIEIVKISDLKSVKQGEVFEWCIDYEEFQWRKGDSILRSRTGVDSP
WEIWPLTDNTKTAVNRKVFTLIK
>gid:864792  SOA0035  ISSod4, transposase
MTQPFNFEQALKDLQSGKSLTGKDSILGPLIKQLTEAALQAELEQHLAHD
PQPNRKNGKTPKTIKHPSGNFELDAPRDRNGTFEPQLIKKNQTTLTDEIE
RKVLSMFSIGMSYRDINQHVEDMYGLNVSNATVSAITDKLIPELKAWQQR
PLDSHYPIVWLDAIHYKVKEDGRYVSKAVYTLLALNMKGKKEILGLHLSE
NEGANYWLSVLTDLNNRGVKDILIACVDGLTGFPEAIASIFPHTETQLCV
IHQIRNSMKYVASKNQKAFMADLKPVYRAVSKEAAEMALDELEAKWGDAY
PLVINSWRRKWHNLSHYFKYPEHIRKVIYTTNAVEAVHRQFRKLTKTKGA
FPNENSLLKLLYAGILNASDKWTMPIHNWSLCLSQLAIYFEGRLDSVLEI
>gid:864793  SOA0036  HicB-related protein
MTFKGYHGSVEISPEDNILFGQVLFISPLINYEAETAKELEQAFQEAINA
YLADCVQQDIPPEKALEGIA
>gid:864794  SOA0037  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYIKYYNLERLHSAN
NDLSPVEFENSQVKVSSLG
>gid:864795  SOA0038  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864796  SOA0039  conserved domain protein
MSDGIRPEQIVDINFFWSWYEKEYSELEIYVTVIDGVLTKVRFSDCPYHF
SDDIVLTFEEVKAKKPHFTYAQVKQALLDGYLCPLSNDSKAPEPTPPNDD
NTRKVVSFGDFQDRLESKRDRLENASSKAAAESNKFYESSRSLASCIPFG
QPILVGHHSEGRARRHADKIFNDMGKSVAASKKAGYYADRAASVGTNGIA
SDDPEAIAKLKEKLAGLERSQETMKAINKVIRSKHMTDADKIEYMTQTHK
LTEKEAEELLKGDFCGRVGFASYSIANNSATIRTVRDRIEDLEKLHNQEA
LSASGKIEGLSWALYEEDGRIKITFDDIPSEALRLTIKNYAFKWSRYSKA
WVRKITPNAIYSAKQLIGKLTEN
>gid:864797  SOA0040  hypothetical protein
MQRIYELKKSGNSTVIRIPYDELKANDCAAGDKVIMSFHKVKTPRENWFN
KVSAAQAQEEVELMNKDFSNVDLELTDEWGTEW
>gid:864798  SOA0041  transcriptional regulator, PemK family
MVNQFDVVLIDLNPTVGAEINKTRPCVVISPDDINQHLRTVIIAPLTSTP
RGWPFRPVVQGSKVKSEVAIDQMRAVDKNRIIKKLGQLTAIESNLVASVV
KEFFE
>gid:864799  SOA0042  hypothetical protein
MSKFLFITLLAVISASFTTHANDYFVSINVLDSQKQSLSIPLTLIASEPA
GYKSDLEKIPYPVKECSSSNGKITERFYGKEFKRGYAINFSSGLPDKEFK
VTEYLIDDKEYPLYDFKKDCFANQVVQIVNEYSFTLNVLQATSKIVELQS
GGEVSIMVHASK
>gid:864800  SOA0043  hypothetical protein
MGIFWGFSIFARRLPYEAAKVSGDFQAKKNALNECVFCKRGGLIFACQ
>gid:864801  SOA0044  site-specific recombinase, phage integrase family
MPSIKLQQSMIDNLPTPQKITVFYDKYFPAFFMRAYPSGIRCYYVRFQYQ
GLRQLHVIGNANDITLDDARLEARKHIDALRYGVVSEPVPSLTLCAFAEE
FFPRYARHWKPSTLLNSQRGFTRHIAPVLGDVPLAALTRQHIEQWFDGMH
ASKGMANRLLPLLSVMMQQAEVYEYRPAQSNPCKGFKRYKSTHSERYLSE
EELKRLWLALDSHEKASPVAVMVLRLLILTGCRCNEVCSVKWADYRQGHW
YLPDSKTGAKTVFLSSFARELLSDWPQVSEHLFWHQSPSQPFTPACLDRF
WRPFRETIHLNDVRIHDLRHTYASIAVKHNINILTIGRLLGHALPETTLK
YTHLAKRDVQQAANVVSQLIAGEMQR
>gid:864802  SOA0045  site-specific recombinase, phage integrase family
MPDKPMQYVISRLIDNRTRTRILAPVSALTVKAAREKATGQLQAWREEHT
QPSMSASPRFDVFVEQTWKPMIFEHWKPSTKRSVLPALNRRLLAHFGRYP
LHMINHKMVQAWFDEISKTRKGAANRNLDTLQSVFKLALRQGHCLTNPCI
GIRQNKRRVLNRFLSVAEMQRLSVALDECEAVGGSIAQCADVLRLLLLTG
CRLSEITFLKGEYVRGNELHLPDSKTGAKVLYIGQAAVDILSRYHCKANT
ELFPMKQGASTSLVQSLWLRLRKQIGVEDVRLHDLRHTFASYAVMDGCSI
PMVASLLGHKKVTMTLRYTHVGDDSVEQASEVIGAVFKGIFTQPEPPKPL
IPRTNETPMPAVKVKQKRLPERKPTAKPKPKAVKKVKQVKPLQKLPTLTK
EELRAIQRFRDSELFDRVKIG
>gid:864803  SOA0046  hypothetical protein
MAEILANNIMPITIFLPAINLSVFLSFIGINQL
>gid:864804  SOA0047  hypothetical protein
MFPNRDFRKMLSYEKDVKINDETFSFKYKIVTGELSCKKDDILIFHQFLW
LPYKTFNIKINEKDYKLKTILLPINKCSLYYGKVPICRDLFPRLKRYTLI
SFTLSTIKKIAIVIALIFT
>gid:864805  SOA0048  prolyl oligopeptidase family protein
MKIHLFYKEVIVILKVLLISIFFSFSIQAVTINDFSRHPEFYDVKISPDG
KYLATLVNTEGRKTLAFLDSDTFKVIFALGGDKRDQVADYYWVNNERVIV
QVEQVRGSLEKPLNYGEIYAVNFDGKKGAMIYGYRAKTPTSNGGFLVDNL
KGDDQHVLIRSQALSRRTDVIPEIVKLNIYSGKTRRIKRAPLAYSQFLID
HAGVPRFVAGTDDKFTTQLYYSKGQGDDWQLFGDKFAGAFEPIAFAADNQ
SIYALKSTDDGPKGLYHYDLSTKKETKLYQSEMVDPTYAIGSQLNEVYGL
RIDEDYPNYLYLKPDSVDAKLHKSLVDAFNGDSVLVSSITEDGKQAIVHV
SSDRNPGDFYLFDTTQMKARFLMSSRGWINAKEMAETEPFRIKTKDGFTL
NGLMTLPKDKKTNLPTVILPHGGPHARDYWGFDPLVQMLANQGFAVVQVN
FRGSTGYGKNFEEAGYGKWGTKIQDDIMLATQYAIQQGIADENRMCILGI
SFGGYSALQSATRYPDTFKCAIGYAGVYDLEMLYNEGDVKDTSWGDAYLD
KTLGTDKAALKSQSPVHFVDMLKASVLIIHGEEDKRASIEHANALMKALD
KANIPYEKLIKDKEGHGFYKQENIGEANKKIVDFLNRKIGFK
>gid:864806  SOA0049  toxin secretion ABC transporter, ATP-binding subunit/permease protein, putative
MTIATDKHAALMASESNSPVDLLEFSGNKRVPLILQAEMAECGLACMAMI
ASFNGHKLDMAALRKRFTANLKGMNLQQLISLGDSIGLSSRALKCPLEEV
GKLALPCILHWDMNHFVVLTGVTKKSISINDPAAGKRTLSLQEFAKHFTG
IALELTPTKAFVKQDERQQMRLSQLWTKISGVNAALITLLLLSVLLQVFA
LVTPYYMQWVVDEVLVSQDQPLLIVLAIGFGLLVVINVFTTGVRSWLVLR
VSSLLNMQMGVNLLRHLLRLPMNYFEKRHIGDLVSRFGSLAQVRERLTTG
LVETVVDGVMSIAVLVMMLIYSVKLTLVVMAAVALYTLMRFALYRPLHRA
TEESIQAKAKEQSNFLENIRGIQTIKLFTCESARQGIWQNRYSEVINADI
RLGRLKISFDAMNKLLFGVENIIVIYMAAMIVMSGGLTIGMVLAFIAYKN
QMTERVASLIEQLIMFRMLRLHLDRISDIALHEQEAHQEGFTPLNVVKGR
LSLENVSFRYGENEPEVVSNLSLDIQAGESVAIVGASGCGKTTLVKLMLG
LLVPSSGRILLDGQAIQQIGLTQYRQQIAAVMQDDTLLSGSIADNITFFD
PEPNYVKMQQCAQLAVIDMDIAHMPMGYNSLVGDMGNQFSGGQVQRLLLA
RALYQSPSILFMDEATSHLDIMNEAKISEQIKNLNMTRIIIAHRPETIKQ
ADRVVVMHQGKIMTAEELQQAQSAS
>gid:864807  SOA0050  toxin secretion, membrane fusion protein
MSDLFRQQVVNEQKQRLYGDISLAQPLSIYTISIAILLIVTAIILFLYFS
HYARKETVRGYLVPDKGVIKTYANRIGNVDILHVKEGSDVNAGDPLVTVI
IRSSMASGFELSETLISELKQQQSILNQELDNNIELNAAETLRLKKRLSD
LSESMKVLSRQKQLLADKLNIQVAQKKQHDKLYKDGYLSELDYQLQLSKL
IEVKQEVENLESNKISIDRELNQTHAELVSLPFQFNLKQSDVHKRQSDIQ
RQLNEAENSYTFVIRAQESGTVAAVSVVEGEFIASNRPLMSIIPKGSSLV
AELLLPTRSAGFVKQGDEARLRFDAFPYQRFGFLHSEVLRVDKALLLDGE
ADLPVKLSEPVYRIKTTLSAQDMQAYGEAFPLKSGMLLEADIVLDRRTLL
DWLLDPIYSLRGRVS
>gid:864808  SOA0051  hypothetical protein
MARNKVIQVACPPDLYSKIKDYKGAKNLASDADAMRELTLFALRIIEHSN
DKDEGISTRELLEVLLDNVIKIHHQTSINYYQNFNAEQYNANMKEPDVVP
SYKRLMAKAEERTQQILAGDNP
>gid:864809  SOA0052  hypothetical protein
MGLDQNTNQYFNEAKPITESPVQTYLAKQGIDITQHENIRFHPAVFSSET
RQTYPALITNITNEKNETKAVEITYLDKSSSDIAALKINKRILGSKSGNS
TIISKGNNSDYSIVAVGVENALKINADNKNGADIIALNNNNDTKTINTHE
LRENVIVVLDSNNAQDTTKLSNDLTDKFSKENKHAIIIEPSSIPEGLNKH
ETIKQLVNEAVHNITKKDTTISKMIQSLNNDIAGSRNITAKEDSDNLSAS
RIKHDEHYQHLAMDSDRASKEKPLEFDTPNIGEKTR
>gid:864810  SOA0053  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864811  SOA0054  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYIKYYNLERLHSAN
NDLSPVEFENSQVKVSSLG
>gid:864812  SOA0055  hypothetical protein
MTSTAVACSILITTLLRNIVLISFDDAQPNSARRSAKCFSIAVREQLPLL
CSCAVVIPSEKFFITSSCLVVRVGKSSFSRLFSFSFWVIVLLLYRKNNMF
YHQNE
>gid:864813  SOA0056  hypothetical protein
MKKKYKPSTTSKTELIAFRCPVELKKKINDAVEKQQFQSVTDLIVQSVSE
KLGGESDLGS
>gid:864814  SOA0057  hypothetical protein
MCDEQLKYGVDFEAKVHYIARYYIVFYVVYFSKLECVGGGSLTLSIRQRS
RLTEWVGFFVPKFQGIERSLIFAIV
>gid:864815  SOA0058  hypothetical protein
MSACVMQKSKADKARIKRQKLRSAKREAALHVKTQNTAVGSNAYKASLEG
FKAKGARLPRYFNEVINNARKLIYSRKLEVDEGGISRNMSERTKRKLFEI
VVTMITTCDLLTGQVGMAKNIGFDTTSHDSLMLAHAKRWGEAIPSSTWYR
YIDVLKKSGVFTVQEVKVAIDDNSTGEEKTIRSVAAYKWLSTRFLQAIGV
YSDTIRAQIKQAYQRAVDKGLSFTWRVYKRQLPNKQRFTPDLFFSQSYSQ
PKTH
>gid:864816  SOA0059  conserved hypothetical protein
MSTVTPSAANAGAASKPARIELKTSPEVKELLERAAAINGINLTAFIINN
VREKALAIVESETTLNLNQRAWAQFETILDNPRKATPVLKTLFSEK
>gid:864817  SOA0060  acetyltransferase, GNAT family
MSTNYIDCQLLNKMERQPSFSSFDCGDPFLDSFAPKKLANADANNDSRVY
VAVDGDIGVGYATMKVFMLSNDEYKILSGKYPRQVPVVMLDQIAVDKAYQ
GKGIGKRLMRKVLEATVLVNELAAAKGLALWAHPRAKDFYESLGFEAIPD
ATKQVQDVELTLMFIHVETILDALK
>gid:864818  SOA0061  parA protein, putative
MAVIAFAQSKGGAGKTTACVTLAGELCRQASESGIQITLIDTDPNQHSAG
WAKKKGCPSNLILIEKSTEDTVLDDIETASSQTQFVLVDLEGTANFAVTQ
AISRADLVIVPCQGSEDDATEAARTIGLIRKQGRLLARKIPCCILLTRTS
AAIQTRLLKLIKQDFLDSGVHILNSSLIDREAFRAVRSYGGTVNNLDPAL
VSGIDKAASNAKAYANDVKRMLLEANNG
>gid:864819  SOA0062  hypothetical protein
MDNPRSKIVIDDNDEDISLLKVLAATSSKAAPLTDRQLKEVEAQSETLGF
KRRENRKRSPYIIQKNIKIRVGMDELLGEIGKAIGSKSDQETFDIAISKL
VESIGSVRLSSMLEELSKR
>gid:864820  SOA0063  ISSod8, transposase
MTLNFSGRHFPSDIIMQALRYYLAYKLSYREIEEMFAERNIHFDHSTLNR
WVIKYAPLLEAIFRKKKRPISGSWRMDETYIKMKGQWAYYYRAVDKFSAV
IDFYLSETRDEKAAHAFFTKAINQHGLPEKVVIDKSGANAAALDTVNIRL
WLSGCMLFMIEVLTVKYLNNIVEQSHRKVKGKINA
>gid:864821  SOA0065  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864822  SOA0066  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYIKYYNLERLHSAN
NDLSPVEFENSQVKVSSLG
>gid:864823  SOA0067  hypothetical protein
MKSYPKKDRDQALSVLQKLAENDEQLTQPQAVKLALLDQYLIKNRELVAS
EFIDRFRYIEYELTELAIIQRVAFDNFAEFTNCLTLAGRFGCLVRLVRGG
YFSGTDSLHIWPLLAALAINDSPTIAAYKKQFVPPFVTGHKVTVLACNAV
YAILGVTPVTDNLREKLTSLKDSKYSMAMLRSLAAIINQDMDSFIDNISI
MVKSNRSQQEYSPLEKCYCVNAHGLVNLWRFHTTNAPLRQLDLPLPWQNE
LNAFFTSGHESIPITFDPLGECDLSNLPIDQNTLVLIHSLTT
>gid:864824  SOA0068  conserved hypothetical protein
MSIFNELQASLEEAVEIHHGLKKPARVTRYEVADVKSIREDLLKVTQAEF
AATMGVSVDTVKSWEGKRRNPAGLAAKVLATIQQDPKIYAALAAH
>gid:864825  SOA0069  conserved hypothetical protein
MSHAIEFVETSVFTRQIKELSTDDELKDLQAELIAQPDKGDIIKGTGGLR
KVRMAVGNKGKSGSIRVLYVLALADKIYLVLAYPKAVKDSLTAEEKAKLK
LIVQSLKGESK
>gid:864826  SOA0070  hypothetical protein
MRWIQLLLLLPLITACQTVPTELKNLEWRHMVGSNADKIWVTEVLFYRQG
KLVDASTNGASSRVDAESLSERDYRWLITAGRRDVHPAPDRVEVEWISFH
DKKRYRISLVLPAELESMIAQPYRIKVRDEWLEVRRDNIGLGMTTGGYVE
AFLTNAKVKPDILLARGIAKEVTDDPDEKRFPLSSQFANRWADFDANYSK
AYQHYPVPSGMAWAPIMDAYRAAQPKTDTNPVQQ
>gid:864827  SOA0071  conserved hypothetical protein
MYMLPTACIEIREALAKVPYLAHIETQDDYEQALALMDDLVDDYDSNKFL
IEMLSLSIERWEEQADEFAEFNAAIAEMDSGIAVLKTLMAQYRLGVADLP
ELGSKSNVSKLLNAAEGKKLNRHHIEALSQRFGVPVSLFF
>gid:864828  SOA0072  conserved hypothetical protein
MHVISRKPFNDAAQKYPNDKDALVQIYSTLRSGTFTNPDDLRQVFPSLDN
FKYRDKWWVIDVGGNNLRIIAFIEFRDNRMYVKHVVSHADYSKLTDKYRR
TKE
>gid:864829  SOA0074  ISSod6, transposase
MPRLMLTDARWEKLFHLMKSTGRVYDKPEHRQTFEGILYRLRTGIPWRDL
PKEFGHWSTVFRRFHLWSKKGVLAHLFKALANLADIEWVFIDGSIVRAHQ
HSAGAATLSNESIGKSRGGNSTKIHLAVDSGGLPIYFELSEGQKHDITHA
PSLIEHLKQVDTVIADKGYDSDAFRELIANKGGKSVIPRRRYKNTPQERV
DWCLYRYRHLVENAFGRIKHYRAISTRYDKLARNYASMVSLAFMLMWLPM
YC
>gid:864830  SOA0075  hypothetical protein
MFSRDKMKKSESNIKAIDDFVDGADSNIRTKKVVDKKKPVSFSATDKQLA
EIQGLVKKYNALAYKNDNRQEINRSDLIMAMKSHFEKMSDIEFYKEINKL
GSVDLSRVIFRQHAKTL
>gid:864831  SOA0076  conserved domain protein
MEHMAETLTTAKDYNEKLKANVFINMSPTNSQSEKLEARKLLKEYPEFTL
LKSVIYERTAYRKSYAAAVGVHEWNDYKAKAEMSYLLMELMSNV
>gid:864832  SOA0077  site-specific recombinase, resolvase family
MYIFGYLRASTSDQNAKRAQSTLQKFVQDRGFRIAGWYIENESGASLQRP
ELLRLLDDAAKGDAIIIEQIDRLSRLDEKSWFTLKEMLHGKELKVISLDL
PTSHIALSPQITDEFTGSMIKAINSMMMDMLAAIARKDYQDRRRRQAEGI
KKAKEGGKYRGRQADSDLHEKIYQLRVVNKLSISDTAKLTNVSDRTVIRV
AKKLACDRSTG
>gid:864833  SOA0078  conserved hypothetical protein
MSRTTSVTIGSQLDEFVGQLISSGRYGSTSEVVRSALRLLERQENQTIAL
KMAIEAGEQSGECALSLHDIAAMVKQKHNV
>gid:864834  SOA0079  conserved hypothetical protein
MYKLSNLAAEDFERIFEYTLLNFGVKQADDYTVSMHNALLAITEQPLIGH
ECLEIAKELRRHNHHKHAIFYKKQPDGIYILRILHQQMEPLRHFYPDTD
>gid:864835  SOA0080  hypothetical protein
MEGGMKDGDVYSIPLFLTDVSGLKSFSRYDFTKEGMAFCFARIIEDRGGS
GLIIEVFNIQGGIEMSLSEIVNSSRLFNPVVIAGEAIKKKRWRLIGSTYY
EKERDSNYSEITFLIGPRDNRLIWKGGVEIPIDDSNCHDFEEYIIWPAVQ
IEQRIVHELEKIGKN
>gid:864836  SOA0083  ISSod4, transposase
MTQPFNFEQALKDLQSGKSLTGKDSILGPLIKQLTEAALQAELEQHLAHD
PQPNRKNGKTPKTIKHPSGNFELDAPRDRNGTFEPQLIKKNQTTLTDEIE
RKVLSMFSIGMSYRDINQHVEDMYGLNVSNATVSAITDKLIPELKAWQQR
PLDSHYPIVWLDAIHYKVKEDGRYVSKAVYTLLALNMKGKKEILGLHLSE
NEGANYWLSVLTDLNNRGVKDILIACVDGLTGFPEAIASIFPHTETQLCV
IHQIRNSMKYVASKNQKAFMADLKPVYRAVSKEAAEMALDELEAKWGDAY
PLVINSWRRKWHNLSHYFKYPEHIRKVIYTTNAVEAVHRQFRKLTKTKGA
FPNENSLLKLLYAGILNASDKWTMPIHNWSLCLSQLAIYFEGRLDSVLEI
>gid:864837  SOA0085  ISSod8, transposase
MTLNFSGRHFPSDIIMQALRYYLAYKLSYREIEEMFAERNIHFDHSTLNR
WVIKYAPLLEAIFRKKKRLVSGSWRMDETYIKMKGQWVYYYRAVDKFGAV
IDFYLSETRDEKAAHAFFTKAINQHGLPEKVVIDKSGANAAALDTVNIRL
WLSGCMLFMIEVLTVKYLNNIVEQSQPKSEG
>gid:864838  SOA0086  site-specific recombinase, resolvase family
MSRVGYARVSSTGQSLEVQLSKLHRAECNKIYQEKRSGRTAERSEFQSCM
SYLREGDTLVVTRLDRLARSVVHLAQIASRFQSEGIDLLVIDQNIDTSTS
TGRLMFNMLAAIAEFENDLRTERQAEGIAKAHENGVKFGRPVKLTDTLKQ
AIYDKRAEGATIGQLAKEYHLGEASIYRALNSVKSTQEITSKLISN
>gid:864839  SOA0087  conserved hypothetical protein
MNTLSANEAKIHFGDLLLKAQQSPIQINKNGKPVAVVISADEYQSIEALK
LHLLQSKAVKAMTDIQTGNLVDGNTFFDELAAGLYD
>gid:864840  SOA0088  plasmid stabilization protein ParE, putative
MAVTYHLTPDAQSDLIGIHRFTLAQWGATQSKTYLSGLKQTIQLLAETPT
LGKNRPEVRMNVFSFPYSSHVIYYIQHEHQFVVFGILHKSMVPLTHLAER
ETI
>gid:864841  SOA0090  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864842  SOA0091  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAVVERFFGSLKHDWIFKVAQPTREFMKQDVTAYIKYYNLERLHSAN
NDLSPVEFENSQVKVSSLG
>gid:864843  SOA0094  hypothetical protein
MDKNTANDDDQDHHRPVPQSRIKVTKLISQFQYGG
>gid:864844  SOA0095  partitioning protein A
MDSMQTTETFQVLKIGADAYIKRRNQRLLSNHRKDLRKFTRAEAFTYLDI
DAKTLDKYVATADFDPRRHEDSQWLINIEEMYQLRDLLPENLRKASKFKR
SDNQKMQVIVIQNQKGGVGKTVSAATIASGLATEFHQEYRVGLIDMDGQA
TLSMYYAPEADLEGCLSVGDLMMNNFDLDEGETLEQVVSNAFLPTTIPNL
RILPASQSDRAIEGWFHEQVFGQKLTSPYSLLNTIINAVQDEFDIIIIDT
PPSLGYATYNAYFAATSVVFPLSITENDIDATCSYFSYIPQVWALLANAN
HRGYDFMKILITNHRDSATTTDLMNTLYDHFAPYMYSNEFKHSEAIRQSS
SLLSTVFDMSKSEYPKSKATFQSAQQNCYEVTSQVLRDIVNVWREQEQA
>gid:864845  SOA0096  partitioning protein B
MAKKRGAMSPLGNAVGAEEAQKNAAKANIESLKRQITTEIQKVSDDVTLS
LKNLFGLESVGNSFLWQLASGATATFTEATLSYEQVRDSTYVTFDVNGRD
QALLNADSLQDLDSLAFQQFYPAVAREVNGKLDVLDGSRRRAWFLLQNGK
VDIFRILVTKDDISLSDAKALAKQLQTAKEHNLREIGQQCLSLEKANPKI
TQAEVAAQLGMSQAGVSKALKAAKVDERLVKLFPVASDLSHTDYALLNKV
MEVYEFEDELIEFINDLTQKIVIIQVEYSRAERKSAIIKAMKSELQIAKD
MKSKALVSVTNLATFDSSGIYARKRIKGRNFAYEFGRLSLDIQKQLDIAI
VDVLKKNKFNTTVD
>gid:864846  SOA0097  ISSod6, transposase
MPRLMLTDARWEKLFHLMKSTGRVYDKPEHRQTFEGILYRLRTGIPWRDL
PKEFGHWSTVFRRFHLWSKKGVLAHLFKALANLADIEWVFIDGSIVRAHQ
HSAGAATLSNESIGKSRGGNSTKIHLAVDSGGLPIYFELSEGQKHDITHA
PSLIEHLKQVDTVIADKGYDSDAFRELIANKGGKSVIPRRRYKNTPQERV
DWCLYRYRHLVENAFGRIKHYRAISTRYDKLARNYASMVSLAFMLMWLPM
YC
>gid:864847  SOA0098  conserved hypothetical protein
MFSFFINIFKLLKAIVVGVKNDQDFRILLFLLVTILIGSTLFYSSVEGWS
KVDALYFSVMTMSTIGYGDLVPTTDMSKIFTIIFSFLSIGIFVSLNTKIV
VMTLNQKKQKLFDRKLRKDNEEVKKHTQSNT
>gid:864848  SOA0099  conserved hypothetical protein
MKTSAVVNLLIALFIAVPATAQEQGKSITVFTAKKIVTMDPTQPTATAIA
VRDGMILGVGSLQDLAPWLKGSMYTINDQFKENVILPGFIDPHMHPMLGA
IAFQTVWITPEPWNVMGHKTPATIGEKAYRATLKKAFDSRDPNAPIFMTW
GFSSDTHGELSGRLLDDISKDVPILVLQRSLHEAYINTPLLTLLKTKGLN
PNKFKDHLQIDWSKNHFWEDGLFSVVLPFMSSFLLDPNAADPGYLKTRDY
LTYNGVTTVADMNTGGTNWELEISALKRNLDTPESPIRVRLTPDVMKLAA
ALKSPEAAMTLVNQMKQHNTDNLVFNGGIKLFADGAMFSQAMQINAPGYI
DGHKGEWITQPSSFVEFARSYWNAGYQIHVHTNGDGGAKMVLDTLQELEN
DKPRADHRFTVEHYGYADDGTSRRIAKLDAQVSANPFYLFDLGDRYAENG
LGFDRAARIAPLGGLASRNVPVALHSDFPMAPAEPLFLAWTAMSRETLSG
KVFSPSERLTLDQAIRAITIDAAYMIGMENEVGSIEAGKLADFAVLDKDP
YEVGMKGLRDIKVWGTVFRGKVHQAKK
>gid:864849  SOA0100  conserved hypothetical protein
MKFISSVVVCLSLSPFVHASSPPKTDAETIYIGGNIVTVTDSAPTAEALA
VKNGRILALGKTSDLLKLKGSNTQVVNLNGKTLIPGFIDGHGHVFNTGIQ
ALSANLLALPDGHVNDIAALQRELTVWAKKPENAKHGIILGFGYDDSQLA
EQRHPTRQELDAVSNDIPILIIHQSGHLATLNTKALTLAGFNSNSKDPEG
GKIRREADGKTPNGVLEETAFFGTLLPLFAKLNETENEAIFNAGMKLYAS
FGYTTAQEGRASSSAVKTMYNLAQQQKLPIDVAAYPDIQTAQEVIAPPYF
SAHYNNGFRVAGAKLNLDGSPQGKTAWLTKPYLIPPVGQEPDYKGYPSMS
DEKAAEYIALAQSKGWQLLTHVNGDAAIDQLLKGIEASEKIYGKPDRGFV
AIHAQTARQDQIERFKRLGVFPSFFPMHTFYWGDWHMDSVLGKERAQNIS
PTGWARELGMIYTSHHDSPVALPNSMRVYSATVNRISRTGRVLGPAQKAS
PLEGLKSQTIWAATQYKEEESKGSLEVGKLADLVILSDNPLTIPAEKLAD
IQIIETIKEGKTVYKRNEQQTSDKVNHGGCIDSPRCQAVATTAMVAAGVL
HHSH
>gid:864850  SOA0102  ISSod3, transposase
MLTILHQSLYQHCPEIHQKRLNTLMVACKALINADCLTLTHLGRHIDGTS
THTKHSIKRMDRLLGNPHLHHERLAVYQWHAKWLLTAHTMPTILVDWSDM
REGRELIALRASIAIKGRSITLYERTFPLVLQGTQTAHNQFLNELHKVLP
DNITPLIVTDAGFRNPWFRKVEQLGWYWLGRVRGLSVYRLHPFGRQFSLK
ALYPKASRRAKHVGRVALSVKKPLLCEMVLFRAPSKGRKGQRSTTTDCHH
TAQWTYELTAKEPWALVTNLTIEAMSPQKLVNIYQKRMQIEETFRDLKSP
AYGFGLRHSRTRYAARMDILLLIALLVQLAFWWVGLYGETQQLQRHFQAN
TVKKRNVLSTIRMGKELLRRRHDYPISADDLLCAAKKLAQLSLTHGCWGY
EL
>gid:864851  SOA0104  hypothetical protein
MNSPMLKWASRRGVGRCEGDEQVVFIILCHFSKIFKNRNDT
>gid:864852  SOA0105  ISSod8, transposase
MPLNFSGRHFPSDIIMQALRYYLAYKLSYREIEEMFAERNIHFDHSTLNR
WVIKYAPLLEAIFRKKKRPISGSWRMDETYIKMKGQWAYYYRAVDKFSAV
IDFYLSETRDEKAAHAFFTKAINQHGLPEKVVIDKSGANAAALDTVNIRL
WLSGCMLFMIEVLTVKYLNNIVEQSHRKVKGKINA
>gid:864853  SOA0106  methyl-accepting chemotaxis protein
MFKNMTLAQRLISVFFILSLLVLGVAWFSVVQLAGLHSNTTKITENLIPS
IRSSAQMHIALLDARRNELNMVIDVMTHDSAAIEISKQRFETAKSEFEAG
AQQYAKLNFVSEQDEQLFIKLGEAAEKYFSAHSSLVSAIDQGDMASANIM
IKTLTRQTLEVAGEETMNLRHENDRAAQEMVLQSENAYKTAKMLSIIVGF
STIFFVVVMAFLLIRQIQNPIMWLLKQTHEVSAGNLTNKLNMNAFARDEF
GQLAESFNEMQDNLHMLVSEVSNSIVQLSSAAEEISSVALHSSNNMETQQ
NELNQLATAMHEMQATVQDVARNTNDAANAATQASDTATQGSETVNDSIV
RIDKVAGAIEATAVVIRKLGDDSRNIGMVLEVIQGIAEQTNLLALNAAIE
AARAGEQGRGFAVVADEVRTLAKRTQDSTSHINSIISELQLRANEAEETM
QQSQEMMIETVCKAREAGESIAKISSSVSCISQMNIQIATATEEQGAVSE
ELNRNVANISGASEDVATGAKQMAMACNDLSHLATQLQDMVKKFHI
>gid:864854  SOA0107  hypothetical protein
MTQNYEHYHGCRPQSSFVHDDKNIYPIKFFFSTSEILESFNDFEDESLPD
SFRKYEMLSFAYDPGSDIYSISLSKVDFGKVYFYVLHEDAEIFGIWSSFQ
LFIESIVDDSNV
>gid:864855  SOA0108  hypothetical protein
MKVQNQAIITFKNISPSNKMQCNAICCLSCGKCDNNIIPYIEKHSLSPSD
AIEHVRNLNENAMRALCLLICNKYSSYDELIHHIWEGRIVGSGSLPVVIH
EVRLFLKKCPSLKVVNVRTKGYMLVETN
>gid:864856  SOA0109  hypothetical protein
MLNTRLELNKRIDNNVQLKYTEQKIFGNSKEIIGCEVLLDFEYAKKNQVA
WKYLTMMHNGASLLSLIVYVNESILRYVKSNINKVFINVERISLCNSRHV
GLITRLNEKMSEHNISLIVEITERGSDIPFVEIKNGLKSLQCSHVKLAMD
DYDYKKTDFRSDEIFDYDYIKVDFPKTKAELDSFNNFVFLTSMFTCLIVE
RIENENDFFSVKNNRIWGYQGFAFCQKESFEGWSDNTSDNNVF
>gid:864857  SOA0110  lipoprotein, putative
MINKHYISSLIPAALLGFFLVGCSGESRNNFIDPDAARLVAHPQVFVLKQ
GETQRVDLTQSVVAKNVASWKIADLDDKTGLGTILNPQATYFDYQATQSG
AGSFNYTVKGDNLTATSQIVLAVNAGETPGNNIPVADNITLSTFNNVDAI
IDLSAYITDADGDTLRINKLVSASNRFTLNGFQVTFAPDGFVGVDQAVYS
VDDGRGGYALAYIVVTSNDANPPAPNTAPVAKDDSLSMDVAKQSVLNINL
SGLISDADGDSLKVVTLYSHNDRAVLKENASVDYTPGNFRGVDQFTYLVS
DGKGGYDLGTVTVTVGDSTPPTPPTPSLVAYQQAFVLDVDQLQTIDVTQS
VTSEHLDAWSLTTVQDSTNLGVVSAKTATTFNYLAQTPGVAQIDYSVQGG
SLSANSTITVAINAPVTPDNHRPEAQDTQLETLNNASKTIDLQGKISDVD
GDTLSITLHGSARFSLNGTQVTFTPNGFVGLDQAVYSVEDGKGGYALANI
VAISEDANPPVPNMAPTANDAQFTLDVAKNVTFNIDLVAQRLIADADGDA
LSIAHIYTANNRATKQGATGITYTPGAFRGVDQFTYVITDGKDGYAINAI
TVVVNDSTPANKIPTAGPVTAKMLHNDPAITISVNSAVSDADGDTLKIVS
ISGALGQASINPANALEMLYEPKGFVGTDRFVYVVSDDNGGYAMGEVTVT
VTDSNPTAPVANTVQENTLLDTPINIDLSAYISDKETETANLVISNVTNA
TSPAVATLSGQTVIYTPNGFTGNDILTYTVTDGRHSTNGTIVISVNAHGA
HAIQADNLEGGTEPNAPFIHDLSALISTTDPTAGELIVVNAIGGALGTAT
VTNNILTYTPKLGVFGKDRLIYTVKDSHNPAHYTQGTISITIFAPAKPEI
TKLEAKKETDGYLKAYVTCRTCDVTQYKYAWIINGLTKSTGETYIPTAAD
DGFNIRLEVTGQDAYGQVTEMQYVVYAFSKVETIFSDMYAFAALKTDGSV
VTWGYSDYGGDSSSVAGQLTSGVKVIDSTSSAFAAIKEDGSVVTWGDSPY
GGDSSSVAGKLTSGVKVIYSNSSAFAAVKQDGSVVTWGSYGGDSSSVAGQ
LTSGVKVIYSTDSEFGAFAAVKEDGSVVTWGNSPYGGDSSSVAGQLTSGV
KVIYSTNNAFAAVKEDGSVVTWGDSGAGGDSSSVAGQLTSGVKVIYSTNN
AFAAVKEDGSVVTWGGYGGDSSSVAGQLTSGVKVIYSTNNAFAAVKEDGS
VVTWGDSGNGGDSSSVAGQLTSGVKVIDSTYSAFAAVKEDGSVVAWGNSG
YGGDSSSVAGQLTSGVKVIDSTTSAFAVVKEDGSVVTWGGSFYGGDSSSV
AGQLTSGVKVIYSTSSAFAVVKEDGSVVTWGGSFYGGDSSSVADKLAPNL
FLIETSIN
>gid:864858  SOA0112  lipoprotein, putative
MQNKHSIFPLAPAALLSLLLAGCGGDKGFLDPVPEVSSLTASPQVFVLKQ
GETQRVDLTQSVVAKNVASWKIADLDDKTGLGTILNPQATYFDYQATQSG
AGSFNYTVKGDNLTATSQIVLAVNAGETPGNNIPVADNITLSTFNNVDAI
IDLSAYITDADGDTLRINKLVSASNRFTLNGFQVTFAPDGFVGVDQAVYS
VDDGRGGYALAYIVVTSNDANPPAPNTAPVAKDDSLSMDVAKQSVLNINL
SGLISDADGDSLKVVTLYSHNDRAVLKENASVDYTPGNFRGVDQFTYLVS
DGKGGYDLGTVTVTVGDSTPPTPPTPSLVAYQQAFVLDVDQLQTIDVTQS
VTSEHLDAWSLTTVQDSTNLGVVSAKTATTFNYLAQTPGVAQIDYSVQGG
SLSANSTITVAINAPVTPDNHRPEAQDTQLETLNNASKTIDLQGKISDVD
GDTLSITLHGSARFSLNGTQVTFTPNGFVGLDQAVYSVEDGKGGYALANI
VAISEDANPPVPNMAPTANDAQFTLDVAKNVTFNIDLVAQRLIADADGDA
LSIAHIYTANNRATKQGATGITYTPGAFRGVDQFTYVITDGKDGYAINAI
TVVVNDSTPANKIPTAGPVTAKMLHNDPAITISVNSAVSDADGDTLKIVS
ISGALGQASINPANALEMLYEPKGFVGTDRFVYVVSDDNGGYAMGEVTVT
VTDSNPTAPVANTVQENTLLDTPINIDLSAYISDKETETANLVISNVTNA
TSPAVATLSGQTVIYTPNGFTGNDILTYTVTDGRHSTNGTIVISVNAHGA
HAIQADNLEGGTEPNAPFIHDLSALISTTDPTAGELIVVNAIGGALGTAT
VTNNILTYTPKLGVFGKDRLIYTVKDSHNPAHYTQGTISITIFAPAKPEI
TKLEAKKETDGYLKAYVTCRTCDVTQYKYAWIINGLTKSTGETYIPTAAD
DGFNIRLEVTGQDAYGQVTEMQYVVYAFSKVETIFSAKYTFAALKTDGSV
VTWGYSDYGGDSSSVAGQLTSGVKVIYSNSSAFAAVKEDGSVVTWGSYYD
GGDSSSVAGQLTSGVKVIYSTGSAFAAVKEDGSVVTWGSYGGNSSSVAGQ
LTSGVKVIYSTDNVFGAFAAVKEDGSVVTWGGSSYGGDSSSVAGQLTSGV
KVIYSNSGAFAAVKEDGSVVTWGNSGAGGDSSRVAGQLTSGVKVIDSTNN
AFAAVKEDGSVVTWGNSGAGGDSSSVAGQLTSGVKVIYSTNNAFAAVKED
GSVVTWGDSGAGGDSSSVAGQLTSGVKVIYSTGSAFAAVKQDGSVVTWGD
SGYGGDSSSVAGQLTSGVKVIDSTNNAFAAVKEDDSVVTWGSSGNGGDSS
SVAGQLTSGVKVIDSTSSAFAVVKEDGSVVTWGRSFYGGDSSSVADKLAP
NLFLIETSIN
>gid:864860  SOA0115  lipoprotein, putative
MQNKHSIFPLAPAALLSLLLAGCGGDKGFLDPVPEVSSLTASPQVFVLKQ
GETQRVDLTQSVVAKNVASWKIADLDDKTGLGTILNPQATYFDYQATQSG
AGSFNYTVKGDNLTATSQIVLAVNAGETPGNNIPVADNITLSTFNNVDAI
IDLSAYITDADGDTLRINKLVSASNRFTLNGFQVTFAPDGFVGVDQAVYS
VDDGRGGYALAYIVVTSNDANPPAPNTAPVAKDDSLSMDVAKQSVLNINL
SGLISDADGDSLKVVTLYSHNDRAVLKENASVDYTPGNFRGVDQFTYLVS
DGKGGYDLGTVTVTVGDSTPPTPPTPSLVAYQQAFVLDVDQLQTIDVTQS
VTSEHLDAWSLTTVQDSTNLGVVSAKTATTFNYLAQTPGVAQIDYSVQGG
SLSANSTITVAINAPVTPDNHRPEAQDTQLETLNNASKTIDLQGKISDVD
GDTLSITLHGSARFSLNGTQVTFTPNGFVGLDQAVYSVEDGKGGYALANI
VAISEDANPPVPNMAPTANDAQFTLDVAKNVTFNIDLVAQRLIADADGDA
LSIAHIYTANNRATKQGATGITYTPGAFRGVDQFTYVITDGKDGYAINAI
TVVVNDSTPANKIPTAGPVTAKMLHNDPAITISVNSAVSDADGDTLKIVS
ISGALGQASINPANALEMLYEPKGFVGTDRFVYVVSDDNGGYAMGEVTVT
VTDSNPTAPVANTVQENTLLDTPINIDLSAYISDKETETANLVISNVTNA
TSPAVATLSGQTVIYTPNGFTGNDILTYTVTDGRHSTNGTIVISVNAHGA
HAIQADNLEGGTEPNAPFIHDLSALISTTDPTAGELIVVNAIGGALGTAT
VTNNILTYTPKLGVFGKDRLIYTVKDSHNPAHYTQGTISITIFAPAKPEI
TKLEAKKETDGYLKAYVTCRTCDVTQYKYAWIINGLTKSTGETYIPTAAD
DGFNIRLEVTGQDAYGQVTEMQYVVYAFSKVETIFSAKYTFAALKTDGSV
VTWGYSDYGGDSSSVAGQLTSGVKVIYSNSSAFAAVKEDGSVVTWGSYYD
GGDSSSVAGQLTSGVKVIYSTGSAFAAVKEDGSVVTWGSYGGNSSSVAGQ
LTSGVKVIYSTDNVFGAFAAVKEDGSVVTWGGSSYGGDSSSVAGQLTSGV
KVIYSNSGAFAAVKEDGSVVTWGNSGAGGDSSRVAGQLTSGVKVIDSTNN
AFAAVKEDGSVVTWGNSGAGGDSSSVAGQLTSGVKVIYSTNNAFAAVKED
GSVVTWGDSGAGGDSSSVAGQLTSGVKVIYSTGSAFAAVKQDGSVVTWGD
SGYGGDSSSVAGQLTSGVKVIDSTNNAFAAVKEDDSVVTWGSSGNGGDSS
SVAGQLTSGVKVIDSTSSAFAVVKEDGSVVTWGRSFYGGDSSSVADKLAP
NLFLIETSIN
>gid:864861  SOA0117  hypothetical protein
MTLFKLILESVMTSIIKRTQRDYSLAFKLAVVDQVEKGELTYKQAQDKYG
IYRVSLVTQAWQTGLVPRNTVYINARTVDD
>gid:864862  SOA0118  conserved domain protein
MTEPTTLTPEQKIKALEAIVEKLRTDYGISVVKKATRETLQEKTDVGYHL
VTQCCRYLDISRQAYYQQCQRQKGEGKVLQTVQYERLFQPRVGTRKLQYL
LNKMHQIVIGRDRLFRCLRAYRLLVMPTRAYHKTTNSHHRFRCHPSLLKP
SEPQVNITRSEQVWVADITYLPLQNKGFEVQRNENVR
>gid:864863  SOA0119  ISSod13, transposase
MLHSNNPIIKHKTGLLNLAEELSNVSRACKVMGVSRDTFYRYRELVDDGG
VDALIEKSRRSPNLKNRVEEAVEQAVMEYAIEFPAHGQHRTSNELRKKGV
FVSGSGVRSIWLRHDLENFKKRLKALEAKVARDGIQLTDEQIAALERKKH
DDEACGEIETAHPGYLGSQDTFYVGNLKGVGRIYQQTYVDTYCKVAHCKL
YTTKTPITAADLLNDKVLPFYESQQLPVLRILTDRGTEYCGKVEHHDYQL
YLAINDIDHTKTKAMSPQTNGICERFHKTILNEFYQVTFRKKLYQTLEEL
QKDLDEWLSYYNNERTHQGKMCNGRTPVETLIDGKRVWAEKNLTRI
>gid:864864  SOA0120  ISSod12, transposase
MSKPRYKTTNWKQYNQALINRGSLTFWIDEDAIAKWKAKVEPPKKGRPQL
FSDLAITTALMVKRIFSMPLRALQGFINSVFKLGDIPLSCPHYTCISKRA
KTVNIAFKTKTRGAIEHLAIDSTGLKVYGEGEWKVKKHGTDGKRRVWRKL
HIAVDTHSHEIIAAELSLSGVTDAEVMPNLLQQTHRKIRNISGDGAYDTK
ACHEAVRRKRALVLIPPREGAALWEKGHPRNLAVSWQQLHGSNKQWKKRY
GYHRRSISETAMYRIKQLLGGTLSMRNYNAQVGETYAMIRALNKLTGLSM
PETHYVA
>gid:864865  SOA0121  hypothetical protein
MFYSEALTRLMGKLIGDADQICGYGFSSTYLGNKAMFIKYR
>gid:864866  SOA0122  conserved hypothetical protein
MYKFIITIFVMALSSTAMADQAGFQFTTGEPKTQGGFQGPNVKQVIRSVV
SANNASDDDKVELTGYIVSSIGDDDYIFRDATGDIKANIDDDLWHGYTVT
PDTKVIIRGEVDKDWSKVTVDVENIQIIQ
>gid:864867  SOA0123  conserved domain protein
MFSLWCLFVGTLITKEELMKIQILHKQSLPQRAIAKQLDISRNTVRKENS
QKSWTSQITLTNLCTGSPV
>gid:864868  SOA0127  hypothetical protein
MRILAHLCVPVVPVVPVVPVVPVVPVVPVVPVVPVVPESLSP
>gid:864869  SOA0129  transposase, IS3 family
MRQLCQLFDVHPSGYYAWRSCAKSKRQCDNERLIGQLKQCWLESGGVYGY
RKLHRDLRDLGEQCGINRVHRLMQRAGQRAQVGYRKPRTRSGEQHVVTQN
RLERQFNPLAPNKAWVTDITYIKTHEGWLYLGAVMDLFSRRIIGWSMGSR
ITKELALDALLMAVWSRKPAGKVLVHSAQGSQYTSHDWSEFLSAHGLEGS
MSRRGNCHDNAVAESFFQLLKRERIKRKIYATRDDAKMDVFNYIEMFYNP
KRQHSSNDGLSPLEYERQYFNEVKSRLVK
>gid:864870  SOA0130  ISSod4, transposase
MTQPFNFEQALKDLQSGKSLTGKDSILGPLIKQLTEAALQAELEQHLAHD
PQPNRKNGKTPKTIKHPSGNFELDAPRDRNGTFEPQLIKKNQTTLTDEIE
RKVLSMFSIGMSYRDINQHVEDMYGLNVSNATVSAITDKLIPELKAWQQR
PLDSHYPIVWLDAIHYKVKEDGRYVSKAVYTLLALNMEGKKDILGLHLSE
NEGANYWLSVLTDLNHRGVKDILIACVDGLTGFPEAIASIFPHTETQLCV
IHQIRNSMKYVASKNQKAFMADLKPVYRAVSKEAAEMALDELEAKWGDAY
PLVINSWRRKWHNLSHYFKYPEHIRKVIYTTNAVEAVHRQFRKLTKTKGA
FPNENSLLKLLYAGILNASDKWTMPIHNWSLCLSQLAIYFEGRLDSVLEI
>gid:864871  SOA0131  hypothetical protein
MDYITMKKIFYWNIFTILVLLLYPLIQGVFFSEVRQISFVENGKRISTKI
SHTAGFATIITSVDSEPIFINIAIALKFTDEKIIYIPFSRKSNKNGFVIN
KIRIIEHWPIDPFNIYYVIMYKKNKAIVHMSNKKLELTRDI
>gid:864872  SOA0132  conserved hypothetical protein
MAYIQLKFKLLVMPDGRNISMESLSDNEIKIITFLAKNYPLASNRDEILK
NCWEGKVVTENSVNVAISNIRSFFQKENIQDFIVTLRGEGYQLSEKIIIK
ESNDPAALNSLSKGINLFLTSIKNHPATFLLNIITCLILMTIIKLYIWII
LL
>gid:864873  SOA0135  hypothetical protein
MLMCVLLGSFYSPPSPPSFASIWDLFTNMRGVSQASVPKDSSRIELVGTL
PADTELELIGFYYSSICTEKEMYFPNGDIANPQWRTNGTTSQLRQVVTSL
HNASHFTQSVALQGGGACEWQLTNIKLNMSLLPSHRLWQEAQSLKKQYLA
ELPVAKVKDEALDKDAFKQSFDLIPTAKEADAASADSNDATITFSPIYYP
IYSYAPEDKQPFDVGLKSSLQREVSDRWSLQHSRRIRLADERVTRVNFTP
QIAEDYAVNIHILDRKSKVSYPDGTEFTYIKAGLRLYPYGSAESRFNSLQ
RSARLEDKRQLAKIYDSGIEHVFEGDKAKAEVLYRELGEAGDMESINWLR
ERARWGYIKDGQYWLARAAKLGDIQAQLELINQPLSVIPQGKVNTGQAKI
REQQAWQTLTELTKQGIPQAVSTMAYYQVFSCSPYFNASAALDNYRKSVK
ARPDLARDAAETYYYFKEDFEQSEEFWEIAAEQDLFAAAELANILLLEHH
RDATSRARYWLGKVTTLGVDKGLQKDILGNAYFQLATLLDTGEGGAVDSK
AALALYQQSPVGSPHNS
>gid:864874  SOA0136  ISSod3, transposase
MLTILHQSLYQHCPEIHQKRLNTLMVACKALINADCLTLTHLGRHIDGTS
THTKHSIKRMDRLLGNPHLHHERLAVYQWHAKWLLTAHTMPTILVDWSDM
REGRELIALRASIAIKGRSITLYERTFPLVLQGTQTAHNQFLNELHKVLP
DNITPLIVTDAGFRNPWFRKVEQLGWYWLGRVRGLSVYRLHPFGRQFSLK
ALYPKASRRAKHVGRVALSVKKPLLCEMVLFRAPSKGRKGQRSTTTDCHH
TAQWTYELTAKEPWALVTNLTIEAMSPQKLVNIYQKRMQIEETFRDLKSP
AYGFGLRHSRTRYAARMDILLLIALLVQLAFWWVGLYGETQQLQRHFQAN
TVKKRNVLSTIRMGKELLRRRHDYPISADDLLCAAKKLAQLSLTHGCWGY
EL
>gid:864875  SOA0137  conserved domain protein
MDINLEYSLHDTQFFWQALSIRYHHAQRLKQLQRELAGFEQIKKFTLLPE
AFSMEAGLITPTLKQRCKMIYHKYAHEINAMYNN
>gid:864876  SOA0138  hypothetical protein
MLKDADGQHLRGYGFEGDDLTKLPRLQVVLDEVAPTLTSFTVGSVRKVLS
QIPPQQYRMTYDVDAIEDPAIKAIVDGSTKLKPTLVGYRNSEGSPVDVKT
AVEYKLRYPLTRKDKDAQGNEHQYSLVTTFDRSGYSEEDGSFVLVSVAFE
ADDFDWLKSGGAVIPKTGPSYDFMSMGMGSTWAGLASESHYAQGVLDGPY
AAYDGRNGSVVYSGNYKQGKKVGKWMETENQAIYWEGEYLDGKKQGKWVL
QSVMDDADNYGFIHYNKGVPDGPSEIYTPDYDSGTEGAKKLSEKGIYLNG
VKDGEWLESDGSKGLYQKGVKQGAWTEKVTQGDYRFQTGDGNYLDGKRSG
TWVYIRTPDTNDAFTTRQSVTYLYKAEVNYENGLRQGEAKLFNKDNFMFA
LKHYDKGRLHGESLWYSAPNIVSNVANYRQGELDGLQMWLEPNGDLNELS
QYKFNAAVKLKPSQDECMMRKEPYAGDCDGVLISKDAETSSIKDGEQRVY
RGGLLSELTYYTNGSVDKEYRFDSNGRLTSLQAYKAGKEYGPSIRNSTDD
GYSLSSYTQMINNRAAGASYSFHPNGQLRDLSNFCQQEGEMWNGQPYFWS
NAGERCGIQREYFDNGVVSCIEELDSGYVVDKVCYDRNGKLSSEMVRIDE
QHVVHKRYINGVIYQEDPGFSDSSNVVKGRKIYYLENPKSHGVFKSYRNG
QLEYEKIYDMGKAGCMKKYDANGKQTPETANCQF
>gid:864877  SOA0139  hypothetical protein
MIERWIKPLHSPSFDALLWNPVIYIGASVFSTCFLVLKRVRQGLRGGTWK
DRVSICLGMICLGPLIVFSFMLSFGTYWHMLTKKPASQVEVVAELPSTWS
STGRGSCTGKLWVHPPGEPQHERILCYVPKELWAELKVGDRLLLNGDRSS
VAFTPADILILPN
>gid:864878  SOA0140  hypothetical protein
MTIVKYKAVNPVIGITVLGIAVMLAACDNSSTDSNNISIPTEREYQIISS
ETNLTDRDYSGNNNTMQARMILKYQLQSTHNDAATLSSPLSTSGNFIITP
SLLQFADRAIPLPISNIDERIQSNELSLMIKDGFNLSVNEGEVMALSPID
KPQEQDLAKMFEQWDNLKSLFQQVPMLPVKLEPKVGFSKVSPKDKHLTWT
VEQVTEDTLVAVLQGENTESPEQYQKTYGAFEVDRKTGWLNSLVLINQSM
RNGRLVNHRLAMAPKDKPFVMSIIWHAYADYDALRIAEDSQRTLNDIHKD
NLIYQDMDSQIVQALLPEHQGIIDNIGSWGEDAMKELQLRFVHGLTKKYF
SGEVLYQDIQAFDDKNNPIDIQFWQHNRSWVNNYGGKIESASDLLPIGWD
NTASKIKKINHISAKLTLIPEKNRLVTRPWSELLAKPYHFGGASLQITLI
DANTHLYKVVMRQTAEKGININFSQFNGQMGDAMVDQGYPTWLSASEQDL
LNVLFKYGEKHVNSNATAFLLKLTDIPETMSFYETEPQPQMAHTNPVTFM
HISDYYKDLANPTSGFDYSRETDFFLTDPSQLQLSVKEISDVKDIKSISD
DGHNFYIPMSPALAMSCKPEIEQGFTEGKQPVTWVYDTKALTGPAYHLAS
PDGVRRYFYDKKIQGKIHCQGDVTWQHVTLQPGERPWLVDIGPWLTQPSM
AENPSELLKQYFKIQDDEGTNLTIRLPENSAELSIKDVLVDGKFISVSGA
AAKVSIMQISAKPIELPYQFTFKPLP
>gid:864879  SOA0141  hypothetical protein
MRNIIHLLLAALVLFSANANAENVLLDSNWNVVKSKKSAHYYLKPPVETV
DGEFKVEIYYQDNDALFCAGTLLDNKLGKTLKIEQFRGAYQCYFDDGNPY
QQAFINDQGQLHGLFKKYIATGSHYVESHFENGVQQGLETGYWEGGNVRY
KTTYQNDKEVGISEHFSPEGELTAKICNDANGVNQYFNLGKLNREVHKVD
GLLQGIETTWGLNGQVISTQEYAKGQKQGDYFEYFDDNKVKRHYRYDNDY
KVGEQLDFHENGQLARKEITEAPWNVKLSEQYNDQGEILSSTEEKHQGDR
WIYQKRQYFKQGKLIRSNEEDQLKKWSLYEEFSEGELVARTESINNNRTG
LYIVSSGFFEDKPLLTQEYYQNGLRHGTYERIQLNDKQRTVIERGQYTHD
KPSGTWMKRQFEHGKSTFSYDDKGELHGEYRNETDSGQLLELLTFTHGKA
TGLCQRYANNGQLYEKGEYRDGKRHGDWILAKEYAYMAYPQPNRLYWIGR
YNMGAQVGLWEQVNMNNYRLKQEKYDDKGNLHGKQYTFAEDGSMEEIAEF
RHGEFISVKHEFPTSSQWMDLPSPLT
>gid:864880  SOA0142  ISSod13, transposase
MLHSNNPIIKHKTGLLNLAEELSNVSRACKVMGVSRDTFYRYRELVDDGG
VDALIEKSRRSPNLKNRVEEAVEQAVMEYAIEFPAHGQHRTSNELRKKGV
FVSGSGVRSIWLRHDLENFKKRLKALEAKVARDGIQLTDEQIAALERKKH
DDEACGEIETAHPGYLGSQDTFYVGNLKGVGRIYQQTYVDTYCKVAHCKL
YTTKTPITAADLLNDKVLPFYESQQLPVLRILTDRGTEYCGKVEHHDYQL
YLAINDIDHTKTKAMSPQTNGICERFHKTILNEFYQVTFRKKLYQTLEEL
QKDLDEWLSYYNNERTHQGKMCNGRTPVETLIDGKRVWAEKNLTRI
>gid:864881  SOA0144  ISSod4, transposase
MTQPFNFEQALKDLQSGKSLTGKDSILGPLIKQLTEAALQAELEQHLAHD
PQPNRKNGKTPKTIKHPSGNFELDAPRDRNGTFEPQLIKKNQTTLTDEIE
RKVLSMFSIGMSYRDINQHVEDMYGLNVSNATVSAITDKLIPELKAWQQR
PLDSHYPIVWLDAIHYKVKEDGRYVSKAVYTLLALNMKGKKEILGLHLSE
NEGANYWLSVLTDLNNRGVKDILIACVDGLTGFPEAIASIFPHTETQLCV
IHQIRNSMKYVASKNQKAFMADLKPVYRAVSKEAAEMALDELEAKWGDAY
PLVINSWRRKWHNLSHYFKYPEHIRKVIYTTNAVEAVHRQFRKLTKTKGA
FPNENSLLKLLYAGILNASDKWTMPIHNWSLCLSQLAIYFEGRLDSVLEI
>gid:864882  SOA0145  ISSod1, transposase OrfB
MSVSKSGYYDWHKRPANVISVETLKLYRLVRQLFKQSRGSLGNREMVKKL
RKEGYQVGRYLVRKIMHRLRLKATQRCAYKVTTQRKHSDAVADNLLNMNF
NPVSANQVWAGDVTYLKTGEGWMYLAVVMDLYSRRIVGWRIDKRMTTDLI
SKALIKAYNLRQPARGLVFHSDRGSQYTSKQFGRLLSSYGIRASMGDVGA
CWDNAEGVQNSVSG
>gid:864883  SOA0146  ISSod1, transposase OrfA
MSLKKSHKSYPQAFKDEAVLMVLEQGYSVADAAKSLGVSTSLLYNWKEKH
EALQQGITLEESERDELKRLRRENKELRMEKEILKKASAFFAREMK
>gid:864884  SOA0147  hypothetical protein
MHILVFEVTSAMSSVTANTTENQIDFTGSVEKSLSGFLSGTGGVF
>gid:864885  SOA0148  hypothetical protein
MPMFSANNLDVKYCSICITLNIMNDNPPSHILGRFY
>gid:864886  SOA0149  conserved hypothetical protein
MMKLTALVDNTRLDSRPELAVERGLSFHVETMGSQILFDTGSSSTFCENA
ALMNISIQDVDLAVISHRHHDHCNGTTHFIERNSKAKIYLKDCEDKNYLF
KAFGFKNDVGINKELLKNASERLIFVEHTTEILPNIFIITEISDKYEKPK
GNRYLYTQSENGYKNDTFDHELLLVVKESDGLIVFTGCAHSGVLNMVETA
IELFPNHRIKAVVGGFHLVGLPLLNSIGGTKQDIQAIGQALSHYPIDKLY
TGHCTGMKAFGLLKEVLGDRLEHLPTGRSVLI
>gid:864887  SOA0150  hypothetical protein
MTTACQTVPTEPKNLEWRHMVGSNSDELWVTEVLFYRQGKLVDASTNGSA
GFFGAEGLKEKDYRWSISKGSHTTRPIPDQVELEWISFHDKKRYRISLAL
PAELESIIAQPYRFKVRDEWREERRRNIGLGMATGGYVEAFLTNAKVKPD
ILLARGIAKEIQTSQDKENSVTKVYKVWFDDFDKAYGDAYQQYPTPSGMA
WAPIMDAYRAAQPKTDTNPVQ
>gid:864888  SOA0151  ISSod13, transposase
MLHSNNPIIKHKTGLLNLAEELSNVSRACKVMGVSRDTFYRYRELVDDGG
VDALIEKSRRSPNLKNRVEEAVEQAVMEYAIEFPAHGQHRTSNELRKKGV
FVSGSGVRSIWLRHDLENFKKRLKALEAKVARDGIQLTDEQIAALERKKH
DDEACGEIETAHPGYLGSQDTFYVGNLKGVGRIYQQTYVDTYCKVAHCKL
YTTKTPITAADLLNDKVLPFYESQQLPVLRILTDRGTEYCGKVEHHDYQL
YLAINDIDHTKTKAMSPQTNGICERFHKTILNEFYQVTFRKKLYQTLEEL
QKDLDEWLSYYNNERTHQGKMCNGRTPVETLIDGKRVWAEKNLTRI
>gid:864889  SOA0152  hypothetical protein
MSTHKMRKIHTAVFKVEALKLADKIYLVLAYPKAVKDSLTAEEKAKLKLI
VQSLKGESK
>gid:864890  SOA0153  heavy metal efflux pump, CzcA family
MLKYIIEASIRQRFMVLIIAIMITVWGVQELRKTPLDALPDLSDVQVIIK
TPYPGQAPKLVEEQVTYPLSTAMLAVPGAKTVRGYSMFGDSYVYVIFEEG
TDIYWARSRVLEYLSQISSRLPQNVQPSLGPDASGVGWVFEYALVDRSGN
LDLSQLKSLQDWYLKLELQSVAGVSEVATVGGMEQTYQIVLEPDKMAIYK
LDIASIKDAIEKSNSETGGSVIEMAEAEYMVRAKGYRQTLEDFREIPLGI
TSPSGTGLTLKDVATVRKGPASRRGIAELDGEGEVVGGIVVMRYGENALA
TIDAVKAKLEELKAGLPDGVEIIPTYDRSHLILKSVDNLFSKVVEEMLVV
GLVCLLFLLHARSTLVAVITLPLSILIAFIVMNKMGVNANIMSLGGIAIA
IGAVVDGAIVMIENLHKHLEHFKADHNREPDTKEHWRIVTEASIEVGPAL
FFSLIIITLSFVPVFALEAQEGRLFAPLAYTKTFAMAAAAFLSITLVPIL
MGYFIRGKIPSERSNPISRFLIAIYQPSLKLVLKFPKVTLMLALIALASA
WYPMTRMGSEFMPTLEEGDLLYMPTALPGISASKAAEVLQQTDRLIKTVP
EVARVFGKVGRAETATDPAPLTMLETTIMLKPHDEWREGMTLDGIIAQLQ
QTVKVPGLTNAWVQPIKTRIDMLSTGIKTPVGIKITGADVNELQSIGTKI
EAILSKVPHTKSAYAERSGGGRYIDISPKLDVAARYGMTLQDIQDVVRYA
IGGMDIGESVQGAERYPINLRYPRELRDNIEKLRELPVITKSGHYLPLRN
LADIEINDGAPMLKSENGRLISWVFIDIEGTSIGEYIATAKTALDAELVV
PPRYSYSFAGQYEYMQRVDAKLKQVIPMALAVIFILLMMTFGSTIQASII
MLSLPFALVGSTWLLYLLDYNISVAVAVGMIALAGVAAEFGVVMLVYLNN
AIKHRQEKDNYHSVSDLKEALMEGAVMRIRPKAMTVATIFFGLLPIMWGA
GSGNDVMQKIAAPMVGGMVTAPLLSLFLLPALYLLIYSRKLKKDA
>gid:864891  SOA0154  heavy metal efflux protein, putative
MRHMNQFNTVKKQAPKAVLILSSLLISTPWLSHAQLANTQEHTATEAAAN
SATAPKNNLLNQVKTYTCPMHPEVISHELGRCPKCNMFLVEKLDASNPAA
EHAQHQAADHQTSEHGHTSAAKAASQVFDTPTPKADSLVKAQNNETTIKY
VCPMHTHIISDVPGTCPICGMNLEKVETGGNTQEININVSGSMQQALALK
VAKVERDTLWKFVETVGQIDYDESQITHIHARVTGWIEKLMLKSVGDTVK
KDQLIYEIYSPDLINAQDDYLLAMDTAKANGQGRYKDLVRNAGLRLSFLG
FNDRQIKQLAESHQTQYRVPFYAKEDGIVKALDIRDGMYIQPSTEVMSVV
DLSKVWVIADVFENEQSWIAKGQKAEIAVPAMNISGIEGTIDYIYPELDP
VTRSLRVRVVLNNTEVDLRPKTLAKVSLFGGPNKDVLVIPQEALIQTGKE
NRVIVKQTDDSFTAKAVTVGMMSQGKAEIISGLNEGERVVISGQFLLDSE
ASLKGSLMRLSSGHQH
>gid:864892  SOA0155  conserved domain protein
MKTLTNLALIAFLLTSTSVFATAAPHAHQHDNANPQQQAHTHVCPMHPEV
TGNKGDTCPKCGMDLEPKATEVKTAEGTMHNAHEHH
>gid:864893  SOA0156  hypothetical protein
MLESSRSSFGRLMHHTLGRYFLVIFTLLALVGQSVMSNGHAMVPHTMDMS
AMNHEVPSQMHHATMMQMDTENSSDSHSMANTDCCNDKMLPGAKQHCCDG
TTSCSNDCGHCLTISVAGTLFSPHLWSSVSMSDTAMATPMPHFHSISLSS
AFKPPIA
>gid:864894  SOA0157  hypothetical protein
MKTTLLNKSLLGLTFVIGALLSSATVTAASLKSTDALSVISINGVPAKPF
KPIQLSAGKVLLELKYQEIFDYRADDSGNWVKSEPLYLVLDVKANDSYQI
TQPKIMTEAEARQFIKYPTIQLSINGDKAGEYPLQSHSQLMVKMLVSNPA
F
>gid:864895  SOA0158  hypothetical protein
MPTSASTSGRSGQIILVMDAEMNCTMPDCSTSDCLQYCQDSMTSHCQTHC
VSSLYIEPAVLELTFANPASKRIAIQGWAMKTADLGLITQPPIHCNVFRF
FQRKTD
>gid:864896  SOA0159  multidrug efflux transporter
MSWIFLLLGVMAEALSHVALKATDGFTRPIPAVMVILGHLTAFIFLGQAM
KGMPVGVVHALWAGMAIVTVTLLSALFYRQHLDMTAWIGMLLVALGVIMI
NLSQGHSH
>gid:864897  SOA0160  esterase, putative
MTIENISVSKSFGGWHKQYCHHSQTLNCGMRFAIYLPPQASSGKKVPVLY
WLSGLTCTDENFMQKAGAQALAAELGIAIVAPDTSPRGENVADDEGYDLG
KGAGFYVNATQAPWNRHYRMYDYVVDELPKLIESIFPVSDKRSIAGHSMG
GHGALVVALRNPDAYQSVSAFSPISNPINCPWGKKALTTYLGRDSATWME
YDASVLMRQAAQFVPALVDQGDADNFLVEQLKPEVLEAAAKVKGYPLELN
YREGYDHSYYFISSFIENHLRFHAEHLGK
>gid:864898  SOA0161  zinc-binding dehydrogenase
MTAQILKSKAAVAWAVGEPLSIEIVDVMPPQKGEVRVKMIATGVCHTDAF
TLSGDDPEGIFPCILGHEGGGIVESIGEGVTSVQVGDHVIPLYTPECGEC
KFCKSGKTNLCQKIRETQGKGLMPDGTSRFSKDGQIIYHYMGTSTFSEYT
VLPEISLAKVNPDAPLEEVCLLGCGVTTGMGAVMNTAKVEEGATVAIFGM
GGIGLSAVIGATMAKASRIIVIDINESKFELAGKLGATDFINPKDYDKPI
QDVIVELTDGGVDYSFECIGNVNVMRSALECCHKGWGESVVIGVAGAGQE
ISTRPFQLVTGRVWKGSAFGGVKGRSELPEYVERYLAGEFKLSDFITHTM
SLEQVNDAFDLMHQGKSIRTVIHFDK
>gid:864899  SOA0162  ISSod6, transposase
MPRLMLTDARWEKLFHLMKSTGRVYDKPEHRQTFEGILYRLRTGIPWRDL
PKEFGHWSTVFRRFHLWSKKGVLAHLFKALANLADIEWVFIDGSIVRAHQ
HSAGAATLSNESIGKSRGGNSTKIHLAVDSGGLPIYFELSEGQKHDITHA
PSLIEHLKQVDTVIADKGYDSDAFRELIANKGGKSVIPRRRYKNTPQERV
DWCLYRYRHLVENAFGRIKHYRAISTRYDKLARNYASMVSLAFMLMWLPM
YC
>gid:864900  SOA0164  iron-containing alcohol dehydrogenase
MSTTFYMPPMSLMGQHAIKLLGTELQARNFNKALIVTDKALVDIKLVDKL
TDELSAHDIAFAIFDGVKPNPTEKNIVQGLALLEAQKCDFVISFGGGSSH
DCAKGIALVAANGGHIRDYSKGVHLSAKPQLPLVTVNTTAGTAAEMTIFA
IVTNEEDETKYPIVDKNLTPIIAVNDSELMVAMPKFLTAATGMDALTHAV
EAYVSTAATPITDASAIKAIELIAQNLKAAVDNGEDREAREAMQYGEYLA
GMAFSNASLGYVHSMAHQLGGVYDLVHGLCNAILLPVVSRFNSAEKVERF
AEVAKAMGVDTVGMTLIDAAESGILAIEKLSASVGTDQKLSDLGVKEDKL
EFMAINALNDACSLTNPRKATTEDIINIFKKAM
>gid:864901  SOA0165  transcriptional regulator, LysR family
MFNWEGVSEFVAVAEAESFTKAAKQLGISTAQVSRQVSALETRMATKLFH
RTTRKVSVSEVGRIYYQHCRQVLDGLEEAERAITNLQSTPRGLLKITAPV
TYGEKTLVPLVNDFIAKYPELEVKINLTNQKVDLIDDGYDLAIRLGQLKD
SSIMAKRLGSRTQYVCASPDYVSTFGIPHSLSELEQHNCLLGTLDYWRFQ
ENGKTRNVRVKGNLTCNSGYALVDAAIKGIGIIQLPEYYVLPFLEDGQLV
PLLEQNRQPKEGIWALYPHNRHLSPKVRMLLDYLSEALS
>gid:864902  SOA0167  ISSod9, transposase
MPRRSILSAAERDSLLVLPDTQDELIRHYTFSEPDLSLIRQRRGDANRLG
IAVQLCLLRFPGQGLLPDATVPMPLLQWIGQQLQLDPVCWPQYAEREETR
REHLLELRVYLGMEPFSQVHHRQAVHTTTELALQTDKGIVLANSVVETLR
HKHIILPTLDVVERVCAEALTRANRRIYDTLTEPLSGSHRHRLDDLLKLR
DNNKTTTLAWLRLSPVKPNSRHMLEHIERLKVWQALDLPIGVDRLIHQNR
LLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRI
MGKLFNDAKKRHQKQFQASGKAINAKVRLFGRIGQVLIDAKQAGDDPFAA
IEGVISWEAFAKSVTEAQSLAQPEEFDFLYRLGESYATLRRYAPTFLTAL
KLRAAPAAKGVLEAIEVLRSMNNDNARKVPADAPIDFIKPRWQKLVITDT
GIDRRYYELCALSEMRNALRSGDIWVQGSRQFKDFEDYLVPPAKFVSLKQ
TNQLPLAVATDCEQYLNERLTQLETQLATVNSMAQANELPDAIITASGLK
ITPLDAVVPDTAQRLIDQAARILPHVKITELLLEVDEWTGFTRHFAHLKS
GVLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDET
YGAALSELVNAQYRHPFAEHWGDGSTSSSDGQNFRTGNKAESTGHINPKY
GSSPGRTFYTHISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEH
YTDTAGFTDHVFALMHLLGFRFAPRIRDLGETKLYIPKRDVTYEGLKSMI
GGTLNIKLIRTHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVAL
RELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGEI
RDRSFEQQRYRASGLNLVTAAIVLWNTVYLERVAHGLRAKGHAVDEELLQ
YLSPLGWEHINLTGDYLWRSSAKIGSGKFRPLRPLSPA
>gid:864903  SOA0168  ISSo9, nucleotidyltransferase domain protein
MRPSAVLALKRSVIRETASRFRVTNPRVFGSVLDGTDLDGSDLDLLVDAL
PGATLFDLGGLQDELESLLGLQVDLLTPGDLPPKFRAQVLAEARPV
>gid:864904  SOA0169  ISSod9, conserved hypothetical protein
MKVNRLPDYLDHMHQAASDACYFVEGLDKDEFLVDKRTQQAVIMSLIVIG
EASTKVMDGYSEFVIAHSDVPWRSMRGMRNRIAHGYFDINLDVVWDTVQT
ALPTLLSQLAAVRYDVDKNEGHDLC
>gid:864905  SOA0170  ISSod9, DNA-invertase
MLIGYARVSTQDQHLELQREALLKAGCEKVFEDTISGTRADRLGLSKALE
ILREGDTLVVWKLDRLGRSVKQLVELVSDLHKQNVQFKSLTDSIDTGTPS
GRFFFHVMASLAEMERDLIVERTRAGLDVARQLGRKGGRKPKMTDSKIES
AKKLLASGVPPKDVAKNLGVSVPTLYRWLPASAHA
>gid:864906  SOA0171  hypothetical protein
MKNLLIFSFILLAVAFSITQISPYYHKYSYCKLQSEEQTQPNYGYKLPKE
FENIGKNSDTNNLSDKDLTQLKKERELSFQKCMTK
>gid:864907  SOA0172  site-specific recombinase, resolvase family
MANYAYLRVSTDAQDVDNQKHGILEFANQHGLSNLSFVEDTVSGKHKWRA
RKLGDLLESLAPKDVIVFAEVSRMARSTLQVLEILEFCTERQIHVYIAKQ
KMILDDSMQSRITATVLGLAAEIEREFISLRTKEALAARKAAGMKLGRPA
GQAEKTKLDKHRKAIESYLDKGLSIRSIAKLIDEPSTTVNDYIKRHSLRE
RDQLEMRI
>gid:864908  SOA0173  hypothetical protein
MEEEKTKNPNHGGFRPGAGRKTKYEKTVVMRVPEKYKEAIQALITHLDDT
AMIDKSYRASESEPVYLRSLQDKKQHIIFRTEPMLPKT
>gid:864781  higA  proteic killer suppressor protein
MRQIRKPSHPGEFFKFTVLDERGISITSAAEHLGVTRKALSEFVNGKAKC
SHAMARRLAEATGTGVAIWINMQAKLDTWEAENMDTPLNITSLPEVNYG
>gid:864780  higB  proteic killer active protein
MAIKSFSHKKLKQFYFNDDASGLNPNHLKALSFALDAIDSSHHPKDLKGI
YSHKFSEKKGSGEGVYSIEINGNWRLTFQIDDDGAILLDYVDYHGKQIKA
R
>gid:864859  ompA  outer membrane protein A
MKYKLILLTFLPFSFSTLASEPPSYLPFQSYWYAGAGLGQGHYSNGSNPQ
SYDSVRDRFAGSVYLGYQVNPYLAPELSYQFLGSAYANYEQGQISGDFQQ
VVLAARFGYPLTTSLYPYVKVGGAGWFGDSEGLRSGSERGFSPIVAAGVE
YAFTPRLSGRLEHQYTDSLGADSIGYTDHHLTTLGLSWRFGHSAPVTAPT
PEVVTQVVELPPVVQIVEKRQFVYSEQKGNSLLFAHNSSVLNNTSQFADV
VSFLKQHPTASAVITGHTDSTGSDKYNQWLSERRAISVANYLVSQGVQSA
QLTSVGKGETASVADNTTENGRAMNRRVEITIPEFTVTKTVK
>gid:864772  umuC  umuC protein
MFALVDANSFYCSCEQVFRPDWRGKPVVVLSNNDGMVVAANRQAKEAGIP
KFVPYFQIRDLCQKKGVIALSSNYELYADLSAKMMQIIGRFAPEQYVYSI
DESFLSFTRCYPAITCLKTQAQAIRRAVWKEARLPVCVGIGSTLTLAKIA
NHAAKKISGYAGVCVIESEPQRLAILQQMPVGEVWGIGRRLSAKLTLMGI
TTAAQLAAMPPGLARKQFSIEIERTVRELNGQVCKQWDEARADKQQIFST
RSVGERIIDIDALQQALCKHAGIAAAKARQQGSSCKSMLVFASNSPHDER
PVSYKAVVQFPCPTSSTAEITAAVSRVLPQVFRSGVRYYKIGVGLIELMP
TKHIQYDLFHAPTENPALMQTFDKLNQRYGSDCIFMAAQGIEHKWAMRRD
KLTPQYTTKWLNLPKVQC
>gid:864773  umuD  umuD protein
MRVIPIPAQAGITGFESPAAEYTQLGLSLDELLIIHPSASFLCVAQGDSM
QGVGIYDGDVLIVDRHETARNGDVIVANFNGLFVCKIIDTQRRLLLSCNE
QFQPVTVQEFDDFSIEGVVIRSIRCHRPSSLLCA