TitleGenColors Logo

Gene list

Applied filters:

COG category: Function unknown
Gene type: CDS
Genomic element: chromosome

Number of genes found: 288

Free access
Sort by:

 



# Shigella flexneri 2a str. 2457T, 2457T

>S1727 hypothetical protein
MTTIKDAFQFGIEPVRITDTDNIQVNEGLPTNADPQVYALQLAKTVKAML
NGVLKDAQENIPFPVEVLPTRNSLPTPIIAHTLADRSVVVPVRGGKRPEV
VTVPSGQEIVVEPIEQAILISEQTKLWDAKSSTGFTQGTLQQDAMNICEN
VVRTINARMVDVLESSKLLKTVELPVLTGSLTAKADAIMDALYENTESSF
GSEVSDYGIIAHESQLKALSRLAAKQGFSGEDAIVDMLGTDIAYYNGEDK
GVFMLAKRFTALSFGCFRHDGENITVVLSRDGDSQSHDLEILGKVFVVAE
AATTIKMGTGSATAVLPVVKRLKFTKTEA
>S1701 hypothetical protein
MIVITFNRATFPRLKITMIVRPQQHWLRRIFVWHGSVLSKISSRLLLNFL
FSIAVIFMLPWYTHLGIKFTLAPFSILGVAIAIFLGFRNNAGYARYVEAR
KLWGQLMIASRSLLREVKTTLPDSASVREFARLQIAFAHCLRMTLRKQPQ
AEVLAHYLKTGDLQRVLASNSPANRILLIMGEWLAVQRRNGQLSDILFIS
LNDRLNDISAVLAGCERIAYTPIPFAYTLILHRTVYLFCIMLPFALVVDL
HYMTPFISVLISYTFISLDCLAEELEDPFGTENNDLPLDAICNAIEIDLL
QMNDEAEIPAKILPDRHYQLT
>S2270 hypothetical protein
MLFSISNFNQGVIMAGWFELSKSSDNQFRFVLKAGNGETILTSELYTSKA
SAEKGIASVRSNSPQEERYEKKTASNGKFYFNLKAANHQIIGSSQMYATA
QSRETGIASVKANGTSQTVKDNT
>S1028 hypothetical protein
MAFMLSPLLKRYTWNSAWLYYARIFIALCGTTAFPWWLGDVKLTIPLTLG
MVAAALTDLDDRLAGRLRNLIITLFCFFIASASVELLFPWPWLFAIGLTL
STSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYEHWYQQPMYLLAGA
VWYNVLTLIGHLLFPVRPLQDNLARCYEQLARYLELKSRMFDPDIEDESQ
APLYDLALANGQLMATLNQTKLSLLTRLRGDRGQRGTRRTLHYYFVAQDI
HERASSSHIQYQTLREHFRHSDVLFRFQRLMSMQGQACQQLSRCILLRQS
YQHDPHFERAFTHIDAALERIRDNGAPADLLKTLGFLLNNLRAIDAQLAT
IESEQAQALPRNNDENELADDSPHGLSDIWLRLSRHFTPESALFRHAVRM
SLVLCFGYAIIQITGMHHGYWILLTSLFVCQPNYNATRHRLKLRIIGTLV
GIAIGIPVLWFVPSLEGQLVLLVITGVLFFAFRNVQYAHATMFITLLVLL
CFNLLGEGFEVALPRVIDTLIGCAIAWAAVSYIWPDWKFRNLPRMLERAT
EANCRYLDAILEQYHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPN
VTPQIREAAFRLLCLNHTFTSYISALGAHREQLTNPEILAFLDDAVCYVD
DALHHQPADEERVNQALAGLKQRMQQLEPRADSKEPLVVQQVGLLIALLP
EIGRLQRQITVPTSGRTGVFH
>S1508 hypothetical protein
MALNTPQITPTKKITVRAIGEELPRGDYQRCPQCDMLFSLPEINSHQSAY
CPRCQAKIRDGRDWSLTRLAAMAFTMLLLMPFAWGEPLLHIWLLGIRIDA
NVMQGIWQMTKQGDAITGSMVFFCVIGAPLILVSSIAYLWFGNRLGMNLR
PVLLMLERLKEWVMLDIYLVGIGVASIKVQDYAHIQAGVGLFSFVALVIL
TTVTLSHLNVEELWERYYPQRPATRRDEKLRVCLGCHFTGYPDQRGLCPR
CHIPLRLRRRHSLQKCWAALLASIVLLLPANLLPISIIYLNGGRQEDTIL
SGIMSLASSNIAVAGIVFIASILVPFTKVIVMFTLLLSIHFKCQQGLRTR
ILLLRMVTWIGRWSILDLFVISLTMSLINRDQILAFTMGPAAFYFGAAVI
LTILAVEWLDSRLLWDAHESGNARFDD
>S1888 putative glycoprotein
MITDLILHNHPRMKTITLNDNHIAHLNAKNTTKLEYLNLSNNNLLPTNDI
DQLISSKHLWHVLVNGINNDPLAQMQYWTAVRNIIDDTNEVTIDLSGLNL
TTQPPGLQNFTSINLDNNQLTHFDATNYDRLVKLSLNSNALESINFPQGR
NVSITHISMNNNALRNIDIDRLSSVTYFSAAHNQLEFVQLESCEWLQYLN
LSHNQLTDIVAGNKNELLLLDLSHNKLTSLHNVLFPNLNTLLINNNLLSE
IKIFYSNFRNVQTLNAANNQLKYINLDFLTYLPSIKSLRLDNNKITHIDT
NNTSDIGTLFPIIKQSKNLNFLNVSGKNN
>S1601 hypothetical protein
MEYFDMRKMSVNLWRNAAGETREICTFPPAKRDFYWRASIASIAANGEFS
LFPGMERIVTLLEGGEMLLESADRFNHTLKPLQPFAFTADQVVKAKLTAG
QMSMDFNIMTRLDVCKAKVRIAERTFTTFGSRGGVVFVINGAWQLGDKLL
TTDQGVCWFDGRHTLRLLQPQGKLLFSEINWLAGHSPDQVQ
>S2096 hypothetical protein
MSFMVSEEVTVKEGGPRMIVTGYSSGMVECRWYDGYGVKREAFHETDLVP
GEGSRSAEEV
>S1642 hypothetical protein
MTLSFITRWRDELPETYTALSPTPLNNARLIWHNTELANTLSIPSSLFKN
AAGVWGGETLLPGMSPLAQVYSGHQFVVWAGQLGDGRGILLGEQLLADGT
TMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSI
VTSDSPVYRETVEPGAMLMRVAPSHLRFGHFEHFYYRREPEKVRQLADFA
IRHYWSHLEDDEDKYRLWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSL
LGLTLDYGPFGFLDDYEPGFICNHSDHQGRYSFDNQPAVALWNLQRLAQT
LSPFVAVDALNEALDSYQQVLLTHYGQRMRQKLGFMTEQKEDNALLNELF
SLMARERSDYTRTFRMLSLTEQHSAASPLREEFIDRAAFDDWFARYRGRL
QQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMMELHRLHEAL
RNPFSDRDDDYVSRPPDWGKRLEVSCSS
>S0752 putative minor tail protein
METFHWKVRPDMNVVSEPKVVTVKLGDGYEQRRAAGLNNQLSTYSVTIRV
RKGEHPSLKAFLERHGGVRAFQWTPPYDWKLIRVVCRKWSASVGALWVTV
TADFEQVVN
>S2855 hypothetical protein
MLGKIAVEVAYALPEKQYLQRVTLQEGATVEEAIRASGLLELRTDIDLTK
NKVGIYSRPAKLSDSVHDGDRVEIYRPLIADPKELRRQRAEKSANK
>S1561 hypothetical protein
MNLDDIINSMTPEVYQRLSTAVELGKWPDGVALTEEQKENCLQLVMLWQA
RHNTEAQHMTIDTNGQMVMKSKQQLKEDFGISAKPIAMFK
>S2541 hypothetical protein
MKKKTTLSEEDQALFRQLMAGTRKIKQDTIVHRPQRKKISEVPVKRLIQE
QADASHYFSDEFQPLLNTEGPVKYVRPDVSHFEAKKLRRGDYSPELFLDL
HGLTQLQAKQELGALIAACRREHVFCACVMHGHGKHILKQQTPLWLAQHP
HVMAFHQAPKEYGGDAALLVLIEVEEWLPPELP
>S1487 hypothetical protein
MANSITADEIREQFSQAMSAMYQQEVPQYGTLLELVADVNLAVLENNPQL
HEKMVNADELARLNVERHGAIRVGTAQELATLRRMFAIMGMYPVSYYDLS
QAGVPVHSTAFRPIDDASLARNPFRVFTSLLRLELIENEILRQKAAEILR
QRDIFTPRCRQLLEEYEQQGGFNETQAQEFVQEALETFRWHQLATVDEET
YRALHNEHRLIADVVCFPGCHINHLTPRTLDIDRVQSMMPECGIEPKILI
EGPPRREVPILLRQTSFKALEETVLFAGQKQGTHTARFGEIEQRGVALTP
KGRQLYDDLLRNAGTGQDNLTHQMHLQETFRTFPDSEFLMRQQGLAWFRY
RLTPSGEAHRQAIHPGDDPQPLIERGWVVAQPITYEDFLPVSAAGIFQSN
LGNETQTRSHGNASREAFEQALGCPVLDEFQLYQEAEERSKRRCGLL
>S3864 hypothetical protein
MQSGPMLMENGVINPRIHPNVASRKIRNGVGINKHGNAVFLLSQQATNFY
DFACYAKAKLNVEQLLYLGGTISHMYMKGGAIPWQRYPFVTMISVERKG
>S1950 hypothetical bacteriophage protein
MLNVAIENQNGWNYSASAPHKTGAGRGNPNVTGAHSRAEAVFLCVMHSSI
QIMVGCAGQSQDWPGSGVTGISTPVRLTTLMVVENLGGELINLSLEDAIM
ATIPALSHPDVTIENGRAVTTSVAVAEFFHKRHDNVLRAIANTECSPEFN
ALNFEDVTYTDTKGEKRPMTKSPKTASFSW
>S2148 hypothetical protein
MQFCSSDEFASKTMIKWPWKVQESAHQTALPWQEALSIPLLTCLTEQEQS
KLVALAERFLQQKRLVPLQGFELNSLRSCRIALLFCLPVLELGLEWLDGF
HEVLIYPAPFVVDDEWEDDIGLVHNQRIVQSGQSWQQGPIVLNWLDIQDS
FDASGFNLIIHEVAHKLDTRNGDRASGVPFISLREVAGWEHDLHAAMNNI
QEEIELVGENAASIDAYAASDPAECFAVLSEYFFSAPELFAPRFPSLWQR
FCQFYQQDPLQRLHHANDTDSFSATNVH
>S3062 hypothetical protein
MSQQEISIINLDQLVSMTSVEIAELTGKEHKHVLRDIRNMVEELNGAKTE
HCSTLSSELNGSKFGLVGEEVYKDAKGESRTMYRLDRKHTFILVAGYSVH
LRAKCYDHIQTLERRVLQLEDQKKRAAIQSANRRGVTWGDYCKTYGLPAQ
KLMTALLQHRGLFRKNPISNEWSVNPKYSDCFRIIKPSDQKFSAGGYNFR
FNAKGLEVFGKPEMVDKMRGILIAFTGTDQQKQEHLLKLAQSGKVEGI
>S2579 hypothetical protein
MKRLIMATMVTAILASSTVWAADNAPVAAQQQTQQTQKTAAAERISEQGL
YAMRDVQVARLALFHGDPEKAKELTNEASALLSDDSTEWAKFAKPGKKTN
LNDDQYIVINASVGISESYVATPEKEAAIKIANEKMAKGDKKGAMEELRL
AGVGVMENQYLMPLKQTRNALADAQKLLDKKQYYEANLALKGAEDGIIVD
SEALFVN
>S0692 putative bacteriophage protein
MKYFFETRLGETRYRLADGSLLCKDVPIGRTGKQLYGADDLPKLKPDKFV
EIVVTRSPEQVFHPATLASFEGMSITILHPEDENGNVRLVNPENWKVLAV
GHLQNVRRGTGEQSDLMLADLIVKDESAIQLIEDGLREVSCGYDAEYEQT
EPGKAEQVDITGNHVALVPKGRAGNRCAIGDRDTMANQKKNWWNRMRAAI
KTGDADTMNELVESAPASVTGDEGDLPQGVNLNINLSPQQPLPDKAPEMG
GDPTGDSDDDLKTLLKALLAKLERNATGDNDNKPDDNPTGDGEDDEEETT
ITGDSAWRAEVIVPGIDLSRKMKPTAFKREVLASADKTLVRQIVGDADIR
KLPKQSVDMAFNAVSEVAKGRNTRATTGDAQRLNMGMTSIASLNKQNAEF
WANRKG
>S2731 hypothetical protein
MEIYENENDQVEAVKRFFAENGKALAVGGILGVGALTGWRYWNSHQVDSA
RSASLAYQNAVTAVSEGKPDSIPAAEKFAAENKNTYGALASLELAQQFVD
KNELEKAAAQLQQGLADTSDENLKAVINLRLARVQVQLKQADAALKTLDT
IKGEGWAAIVADLRGEALLSKGDKQGARSAWEAGVKSDVTPALSEMMQMK
INNLSI
>S0304 hypothetical protein
MGNYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHP
GRGYPIGAAFFSAGRFYPALVGNDIGCGMALWQTDILARKYNADKFEKRL
SALDDVAEESWLEENLPSAFAQHPWRSSLGSIGGGNHFAELQQVDQIINA
ELFALAGLDAQHLQLLVHSGSRGLGQSILQRHIASFSHHGLPEGSDDALA
FARFNRHLIALRIMQQVKATGSPVLDVAHNFVSACRIGDQQGVLHRKGAT
PDDCGLVVIPGSRGDYSRLVQPVRSEETLHSLAHGAGRKWGRTECKGRLA
AKYTATQLSRTELGSRVICRDKQLIFEEAPQAYKSAESVVQCQGRPD
>S3287 hypothetical protein
MTYPYRTTMVLNTYQYRETTMIDPKKIEQIARQVHESMPKGIREFGEDVE
KKIRQTLQAQLTRLDLVSREEFDVQTQVLLRTREKLALLEQRISELENRS
TEIKKQPDPETLPPTL
>S2505 hypothetical protein
MEMTNAQRLILSNQYKMMTMLDPANAERYRRLQTIIERGYGLQMRELDRE
FGELKEETCRTIIDIMDMYHALYVSWSNLQDQQSIDERRVTFLGFDAATE
ARYLGYVRFMVNVEGRYTHFDAGTHGFNAQTPMWEKYQRMLNVWHACPRQ
YHLSANEINQIINA
>S1523 hypothetical protein
MTITDLVLILFIAALLAFAIYDQFIMPRRNGPTLLAIPLLRRGRIDSVIF
VGLIVILIYNNVTNHGALITTWLLSALALMGFYIFWIRIPKIIFKQKGFF
FANVWIEYSRIKAMNLSEDGVLVMQLEQRRLLIRVRNIDDLEKIYKLLVS
GNAANLLI
>S0842 putative DEOR-type transcriptional regulator
MRRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFS
GIDELLLEAFSSFTEIMSRQYQAFFSDVSDAQGACQAITDMIYSSQVATP
DNMELMYQLYALASRKPLLKTVMQNWMQRSQQTLEQWFEPGTARALDAFI
EGMTLHFVTDRKPLSREEILRMVERVAG
>S2124 putative tail protein
MADSFQLKAIITAVDKVSAPLKGMQRQLKGFKKEFASLSLGAAGAGTAVL
GALALPVKSAIALESKMADVRKVVDGLDTPEAFKAMTEQVRDLSTELPMS
AEGIAEIVAAGGQAGIARDELMQFTDDAVKMGVAFDTTAEESGQMMAQWR
TAFKLTQGEVAGLADKINYLGNTGPASAKKISDVVTRIGPLGSVAGVASG
EIAAMGATIAGMGVESEIAATGIKNFMLSLTARDSATKSQKKVLRSLRIS
PKKLAADMQKDARGAMLHVLDSLAKVPKEKQAAVLKALFGKESLGAIAPL
LTNLDLLRTNFNRVADAQQYGGSMQKEYAARAATTENQLLLLQNQINAIS
STLGETFLPSLNEGIKEMKPFLEEVRTFVRENPEVVKTIAKTGAALLTMG
VAIGTLTRITKIMGSVMNMTPAKGLIALLVGGAYLIIDNWETVGPVVKKV
WQEVDQVVRAMGGWEQAVKTIATVSALYIGVKAVASIRAATVAQNQWTTA
AGKTGLKLKGLGKISLIGGLLELGMMAQEFEKEHPWLVKNFVADALNSGF
GLNDKFDEWGKQFHDFVYDMTGWQMPRGDGYLSPDKRYTPNVSLERNQLL
SLASSPATRSELKVTFDNAPPGMRVIDLPKTGDPFMKITHDVGYSPFKR
>S0693 putative bacteriophage protein
MGQVMQSIIAEQVKYIKSLPLEAADRVYDIQNKAIEAVVTGGRAEQFAKE
IASTGDVAKSRADLIARTELGRATGALDMARAIAIGSDGYIWRTADDGDV
RDSHDHMKGKFVRWDSPPTLDGMTGHAGELPNCRCYKEIVFVRVPFAMKR
AA
>S0373 hypothetical protein
MPGKTAALYDVDKTLKNARVELKTSPDAKNKLREAAQAVGVDLSAFILSA
AMERAESVLDNQRRRELSNQSWELMNQLIAEPAQPTLALKALMKRKNSDG
RQA
>S2506 hypothetical protein
MSTPDNRSVNFFSLFRRGQHYSKTWPLEKRLAPVFVENRVIKMTRYAIRF
MPPIAVFTLCWQIALGGLLGPAVATALFALSLPMQGLWWLGKRSVTPLPP
AILNWFYEVRGKLQESGQVLAPVEGKPDYQALADTLKRAFKQLDKTFLDD
L
>S0862 hypothetical protein
MQFSTTPTLEGLTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAY
EKELRKAREIAFEELGSQARALGADAVVGIDIDYETVGQNGSMLMVSVSG
TAVKTRR
>S2442 hypothetical protein
MRHGLLALICWLCCVVAHSEMLNVEQSRLFRAWFVRIAQEQLRQGPSPRW
YQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQ
NWNQGSGKNGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLM
VWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIY
RLNFLAR
>S0913 conserved hypothetical protein
MEALQASEIDYTIFFYNPNIHPQKEYLIRKDENIRFAEQHGVPFIDADYD
TDNWFERAKGMEWEPERGIRCTMCFDMRFERTALYAAENGFSVISSSLGI
SRWKNMQQVNDCGRRAVAHYPGMVYWDYNWRKQGGSSRMIEISKREKFYQ
QEYCGCVYSLRDTNLHRKSQGRPLIKIGQLHYGKEEKE
>S2435 hypothetical protein
MNWRRIVWLLALVTLPTLAEETPLQLVLRGAQHDQLYQLSSSGVTKVSAL
PDSLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESIT
RDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSV
TVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWF
ADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQ
VASGQCVEVELFARYPLKKITAEKSTTAVKPSVLNGRYRVTFTNGNHITF
VSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAI
RTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMTAWTQELIYAGD
PVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRS
TCQLLPKAKAWLAKKMPQWRRILQGETGYNEPDVFAVCRLVSGFPYTDRQ
QKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD
>S2903 hypothetical protein
MSEALSLFSLFASSFLSATLLPGNSEVVLVAMLLSGISHPWVLVLTATMG
NSLGGLTNVILGRFFPLRKTSRWQEKATGWLKRYGAVTLLLSWMPVVGDL
LCLLAGWMRISWGPVIFFLCLGKALRYVAVAAATVQGMMWWH
>S0694 putative bacteriophage protein
MARNKQALRRTVQATADGYENFIARVGMQTPNQHSASTYRANFTSRNRML
VEWSYRSSWIIGEAVDAIPDDMTRKGIRITSEIDAKDRGILESQLDELQI
WDALNDVLKWSRLYGGAVGFIMIEGQAPMTPLRPETIGKGKFKGILPLDR
WMVDPALTRRIKDMGPDLGKPEFYDVVTTATGIPAWRIHHSRLIRFDGVT
LPFQQKMTENEWGMSVVERIWDRLTAFDSATVGAAQLVYKAHLRTYSVEK
LRELIALGGPAYEALLKNIDLIRQFQSNEGMTLMDSRDKFETHQYSFSGL
DDILSQFAEQISGAVGIPLVRLFGQSPKGFSTGDADLANYYDRISSLQER
RLRLPVRRILDIMHRSELGKPLPDDFTFEFNPLWQMSDVDRSTVALNTTN
AISTALGDGLMTLKAAMTDLRENSDVTGIGASITNEDIENAEDEAPPGIG
EPDDEPQEPSGGNPVSNQPTQDSEGGRRHRKWSLRWFK
>S4814 hypothetical protein
MINNISDQASSFPGTQLNQSDNFLDSLREFFAILNPSRKGELSTWDTIYL
HLILAINADSDLIKNDVLLAENIPSANYQFNTFFSNTFEIDVKKYLGKSE
DNEVEIKAGNERISIGIRNVSNGKLERQQFLFPLDYENKLQEQLDKYFTI
ESHPLLYRYTIGSKIANVIFEKLYSRIDFNKEQYISFIKDAFIHFYDYSR
RYAISENIDKDAVTNNIALMSTFYDSDNTSGEVLNNDFTEEESFETALDV
EHAIVLGFADDNFETKPVHYQDLLTRFSAFQDTVFNLFPEMHSSHYHDIC
SVSVDMTKGTQCMIHLMVNEEVFMSLPVPVATMVREDASNLVNLKTLLND
GCFIKYSHFNDVALIKQNISNLYLSHTVVNESILKKCCFENGSLGDVKIT
NSNVINSAFKNISFRSVKINNVNTHSLKFINCNFFNVDMIRVNLSKCLFH
ECSMHGVKIKPWLPVKWTKELISDYLYGCLLSLYSICARDIYNMNAGNNV
KVAADAFLEIIFSLKNKYCIKLLSAQDRAFIYEFARMIFAYINDKSIEIL
LLSCFAAADQKAIQRYRPQSQDGEDFRSHLQYKLPLSAH
>S0215 putative cytoplasmic protein
MSASRQRPGNHLLPTLFDRLCDDAPNQKRDHGISVSPVQIKEIIRRDLSF
LLNTVSHEDDIDAARYPYAAASVLNYGLPPLAGSFLYEHKWDDIRRAILR
TITRFEPRLKASTLQIIPLQDERRQSGHNTLQFEIRGEILTQPYPTAFRV
RSALDMEQSRITFF
>S1532 hypothetical protein
MFAGLPSLIHEQQQKAVERIQELMAQGMSSGQAIALVAEELRANHSGELI
VARFEDEDE
>S1703 hypothetical protein
MHVTLVEINVHEDKVDEFIEVFRQNHLGSVQEEGNLRFDVLQDPEVNSRF
YIYEAYKDEDTVAFHKTTPHYKTCVAKLESLMTGPRKKRLFNGLMP
>S1902 hypothetical protein
MNQSLTLAFLIAAGIGLVVQNTLMARITQTSSTILIAMLLNSLVGIVLFV
SILWFKQGMAGFGELVSSVRWWTLIPGLLGSFFVFASISGYQNVGAATTI
AVLVASQLIGGLVLDIFRSHGVPLRALFGPICGAILLVVGAWLVARRSF
>S1520 hypothetical protein
MFAGGDDVFYGYPGQDVVMNITATVLLAFGMSMDAFAASIGKGAPLHKPK
FSEALRTGLIFGAVETLTPLIGWGMGMLASRFVLEWNHWIAFVLLIFLGG
RMIIEGFRGADDEDEEPRRRHGFWLLVTTAIATSLDAMAVGVGLAFLQVN
IIATALAIGCATLIMSTLGMMVGRFIGSIIGKKAEILGGLVLIGIGVQIL
WTHFHG
>S0285 hypothetical cytoplasmic protein
MTSLYRIQEGCFALPETFLDRTVNIFVPSGNERATPSLNIFRDTLRPDEN
LTTYIDRQIALMKKKT
>S1719 hypothetical protein
MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLT
LHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLSLYDWTGALIALCGMLI
IVAGWGRT
>S1869 hypothetical protein
MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQ
QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQR
LGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH
KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG
DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN
ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR
NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR
YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ
LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS
>S0689 putative bacteriophage protein
MITFDQATVDSSGAFLIGELERLDQTLNLPLVGYTWSRDIQLREDASIAD
DISSWTNTSFAAAGTGANPNGKNWVGKDSTAIAGVNVDIGKSGNPLNLWG
MELGWTVIELQAAQQVGRPIDTQKYDGMQLKWQMDNDEQVYVGDSALNLK
GLVTLDGVPVNNAAKTWATSTPDEIRASINQVLSDAWAASGYSVVPRDLL
IPPEQFALLSSIIVSSAGNQSLLTNLQTNTISYHQNGVPLNIRAVKWLKG
RGVGNKDRIVAYTNDKKYVRYPLVPLQSVPVQYRGLYQIVTYYGKLGAVE
PVYKETISYVDGI
>S0751 putative tail length tape measure protein precursor
MDQIANLVIDLGIDAAEFKNEIPRIKNLLNGAASDAERSSARMQRFMERQ
TQAARQTMQAASSAATAASAHAQTVEKNARAHERMAREVEQTRLRVDALN
QKMREEQAQARALAEAQDKAAAAFYRQIDSVKQAGAGLQELQRIQQQIRQ
ARNSGGVGQQDYLALISEITAKTRALTQAEEQATRQKAAFIRQLKEQATR
QNLSSSELLRARAAQLGVSSAAEVYIRKMERAGKATHSLGLKSAAARREL
GVLISQMARGNFGALRGSGITLANRAGWTGALMSPKGMMTGGVIGGLVAA
VLGLGKAWHDGRKEGEKFNRQLALTGHYAGVTVGQLWKLSRAISGNGITQ
HAAAGALAQVVGSGAFHGNDIGMVAKAAAQMERSVGQSVSDTINQFKRLK
DDPVNAAKALDNALHFLNATQLEQIRVLGEQGRSSDAARIAMSALAEETG
KRTSDIDNNLNALGSTLQTLSDWWKQFWDAAMNIGREDSLDAQIDALQEK
IQRAKKYPWTNASTQVEYDQQRLNDLQEKKRRKDLQDAKAQAERNYQEQQ
KRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERY
EKAIKKNTRATRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATERMTE
AHKQLLALQQRISDLDGKKLTADEKSVLARKNELIQALTLLDVKQQELQK
QTALNDLRKKTVQLTSQLADKERALREQHNLDIATAGMGDKQRQRYQAQL
RIRQEYRQQLQQLENDSRQKGTYGTEDYRRAEEVLKGSLKRQLNENKRYW
QELEVAQGDWKNGAMRAFQNFTADADNAAGTAEQMFTVAFSSAGNALATF
CTTGKLNFKSFTSSLLSDMARIMAQMAMMQAVKGVGSLFGFTTNADGGVY
QSADLSRYSGTVVNRPTFFAFAKGAGVMGEAGPEAILPLRRGADGKLGVV
ADTGGSGMVMFAPQYNIEINNDGTNGQIGPAALKVVYDLGKKAAADFMQQ
QARDGGRLSGAYR
>S4633 hypothetical protein
MLFDHLRDEVMRLDAGITQEVLKLYIAFKAETNFVDVVPQKSRLRLSLNM
QFHELVDPKGIAKDVTNVGRWGNGDVEIGFSDLAQLPYIMGLIRQAFEKQ
MESALV
>S0755 putative tail component
MKTGAEAIRALATQLPAFRQKLSDGWYQVRIAGRDAGETELSARLNEPLA
NGAVIHIVPRLVGAKSGGVFQAVLGAAVMAVAIWMPGVGIMASNLLFSLG
ASMTLGGVAQMLAPKARTPRTQTTDNGKQNTCFSSLDNMVAQGNVLPVLY
GEMRVGSRVVSQEISTADEGDGGQVVVIGR
>S1032 hypothetical protein
MKTGIVTTLIALCLPVSVFATTLRLSTDVDLLVLDGKKVSSSLLRGADSI
ELDNGPHQLVFRVEKTIHLSNSEERLYISPPLVVSFNTQLINQVNFRLPR
LENEREANHFDAAPRLELLDGDATPIPVKLDILAITSTAKTIDYEVEVER
YNKSAKRASLPQFATMMADDSTLLSGVSELDAIPPQSQVLTEQRLKYWFK
LADPQTRNTFLQWAEKQPSS
>S2434 hypothetical protein
MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINY
PASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDG
SFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWD
TDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPV
HGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGEL
TLVKSFDW
>S0925 hypothetical bacteriophage protein
MYQITKNGFVFLVMGFTGKKAAAFKEAYIAEFDRMEAELRQNNTPPADKM
IPGDGRTLVVHFDKFGNVEFTETVPDGALVCTLETFRFYLEKQGWTLVNR
GAIKNMTVEQLLSLK
>S2999 hypothetical protein
MTTHDRVRLQLQALEALLREHQHWRNDEPQPHQFNSTQPFFMDTMEPLEW
LQWGLIPRMHDLLNNNQPLPGAFAVAPYYEMALATDHPQRALILAELEKL
DALFADDAS
>S2161 hypothetical protein
MGRKWANIVAKKTAKDGATSKIYAKFGVEIYAAAKQGEPDPELNTSLKFV
IERAKQAQVPKHVIDKAIDKAKGGGDETFVQGRYEGFGPNGSMIIAETLT
SNVNRTIANVRTIFNKKGGNIGAAGSVSYMFDNTGVIVFKGTDPDHIFEI
LLEAEVDVRDVTEEEGNIVIYTEPADLHKGIAALKAAGITEFSTTELEMI
AQSEVELSPEDLEIFEGLVDALEDDDDVQKVYHNVANL
>S0753 minor tail protein
MQDIRQETLNECTRAEQSARVELWEIDLTEVGGERYFFCNEQNEKGEPVT
WQGRQYQAYPIQGSGFELNGRGCAARPTLTVSNLHGMVTGMAEDLQSLVG
GTVVRRKVYARFLDAVNFVNGNSDADPEQEVISRWRIEQCSELSAVSASF
VLATPTETDGAVFPGRIMLANTCMWTYRSDECGYTGRAVADEFDKPTTDI
RKDKCSKCMRGCELRNNTGNFGGFLSINKLSQ
>S0756 host specificity protein
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVDGPVDGLKSVLLNGT
PVLDSEGKTNFSGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPI
TRTITSANIDRLRLTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTE
KDITIKGKTTSQYLASVVVGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWS
SYTEIIDVKQCYPNTALVGVQVDSEQFGNQQVSRNYHLRGRILQVPSNYN
PQTRQYSGIWDGTLKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWA
LYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMP
VWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNVVE
VNWIDPDNGHETATELVEDTQAIVRYGRNVTKMDAFGCTSRGQAHRAGLW
LIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNS
QTRTLTLDREITLPSSGTTLISLVDGNGNPVSVEVQSVTDGVKVKVSRVP
DGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVD
NGAHFDGDRRGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGV
SFLLRLTVAADDGSERLVSTARTTETTYRFRQLALGNYSLTVRAVNARGQ
QGDPASVSFRIAAPAAPVTIELIPGYFQITVVPKLAVYDPTVQFEFWFSE
KRIADIRQVETSARYLGTALYWIAASINIRPGHDYYFYVRSVNTVGKSAF
VEAVGRASDDAEGYLSFYKGLINKTHLGKELWTQIDNGQLAPDLTEIRTS
ITNVSNEITQTVNKKLENQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMKD
GRLYIAGIGAGIENTPAGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQ
IFMNDVFLKRLTAPTITSGGNPPAFSLTPGGRLTAKNADISGNVNANSGT
LNNVTINKNCRVLGKLSANQIEGDLVKTVGKPFPRDSRAPERWPSGTITV
RVYDDQPFDRQIVIPAVAFRGAKHERKNNNIYSSCRLIVKKNGAEIYNRT
TLDNTLIYTGVIDMPAGHGHMTLEFSVSAWLVNGWYPTASISDLLVVVMK
KSTAGISIS
>S2537 putative transporting ATPase
MNSTHHYEQLIEIFNSCFADEFNTRLIKGDDEPIYLPADAEVPYNRIVFA
HGFYASAIHEISHWCIAGKARRELVDFGYWYCPDGRDAQTQSQFEDVEVK
PQAFDWLFCVAAGYPFNVSCDNLEGDFEPDRVVFQRRVHAQVMDYLTNGI
PERPARFIKALQNYYHTPELTAEQFPWPEALN
>S0012 putative oxidoreductase
MNVNYLNDSDLDFLQHCSEEQLANFARLLTHNEKGKTRLSSILMRNELFK
SMEGHPEQHRRNWQLIAGELQHFGGDSIANKLRGHGKLYRAILLDVSKRL
KLKADKEMSTFEIEQQLLEQFLRNTWKKMDEEHKQEFLHAVDARVNELEE
LLPLLMKDKLLAKGVSHLLSSQLTRILRTHAAMSVLGHGLLRGAGLGGPV
GAALNGVKAVSGSTYRVTIPAVLQIACLRRMVSATQV
>S4263 hypothetical protein
MIKLKTPNSMEIAEQPAVITYVPELNAFRGKFLGLSGYCDFVSDSIQGLQ
KEGELSLREYLEDCKAAGIEPYARTEKIKTFTLRYPESLSERLNNAAAQQ
QVSVNTYIIETLNERLNHL
>S2730 putative dehydrogenase
MQLRKLLLPGLLSVTLLSGCSLFNSEEDVVKMSPLPTVENQFTPTTAWST
SVGSGIGNFYSNLHPALADNVVYAADRAGLVKALNADDGKEIWSVSLAEK
DGWFSKEPALLSGGVTVSGGHVYIGSEKAQVYALNTSDGTVAWQTKVAGE
ALSRPVVSDGLVLIHTSNGQLQALNEADGAVKWTVNLDMPSLSLRGESAP
ATAFGAAVVGGDNGRVSAVLMEQGQMIWQQRISQATGSTEIDRLSDVDTT
PVVVNGVVFALAYNGNLTALDLRSGQIMWKRELGSVNDFIVDGNRIYLVD
QNDRVMALTIDGGVTLWAQSDLLHRLLTSPVLYNGNLVVGDSEGYLHWIN
VEDGRFVAQQKVDSSGFQTEPVAADGKLLIQAKDGTVYSITR
>S1223 putative head portal protein
MTLDEFMALAGTSNTGAGEYVSSGTAESLPAVMNAVTVISDAVATMPCYL
YLVRNEKGKEAREWLDSHPVDHILNERPNAWQTPYQFKRMMIRYCLLSGN
AYAVIQWGRDGFPAALHPYPPQSVNVEQTGDHNWRYCITDAYTGNIRNYL
PWEVLHLRYSTDDGFMGRSPVTICRESLGLGLAQQRHGASVMRDGMMAAG
VITSGEWLDGVKGKQALAALERYKGARNAGKTPILEGGMSYQQLGMSNQD
AEWLASRRFTIEDIARMFNVSPIFLQEYSNSTYSNFSEASRAFLTMTMRP
WLANFEQQIKNALLVASPVPGIRYQVEFDSADLLRATPGERFATYERGIK
SDVMCPNEAREREGLSPRDGGDEFS
>S2116 hypothetical protein
MRLTAKQVIWLKVCLHLAGLLPFLWLVWAINHGGLGADPVKDIQHFTGRT
ALKFLLAALLITPLARYAKQPLLIRTRRLLGLWCFAWATLHLTSYALLEL
GVNNLALLGKELITRPYLTLGIISWVILLALAFTSTQSMQRKLGKHWQQL
HNFVYLVAILAPIHYLWSVKIISPQPLIYAGLAVLLLALRYKKLLSLFNR
LRKQAHNKLSL
>S1300 hypothetical protein
MTSFSTLLSVHLISIALSVGLLTLRFWLRYQKHPQAFARWTRIVPPVVDT
VLLLSGIALMAKAHILPFSGQAQWLTEKLFGVIIYIVLGFIALDYRRMHS
QQARIIAFPLALVVLYIIIKLATTKVPLLG
>S1335 hypothetical protein
MSPSDCFTSSARCCKTSPTEVKQVVTMDMDLNNRLTEDETLEQAYDIFLE
LAADNLDPADVLLFNLQFEERGGAELFDPAEDWQEHVDFDLNPDFFAEVV
IGLADSEDGEISDVFARILLCREKDHKLCHIIWRE
>S2878 hypothetical protein
MGFWRIVITIILPPIGVLLGKGFGWAFIINILLTLLGYIPGLIHAFWVQT
RD
>S2556 hypothetical protein
MHIQRKSTMSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQ
TLAALTEKSRSVESEPCKISPTFTEESDGVRLDIDFTFACEAEMLIFQLG
LR
>S4700 creA, hypothetical protein
MKYKHLILSLSLIMLGPLAHAEEIGSVDTVFKMIGPDHKIVVEAFDDPDV
KNVTCYVSRAKTGGIKGGLGLAEDTSDAAISCQQVGPIELSDRIKNGKAQ
GEVVFKKRTSLVFKSLQVVRFYDAKRNALAYLAYSDKVVEGSPKNAISAV
PVMPWRQ
>S4356 damX, putative membrane protein
MDEFKPEDELKPDPSDRRTGRSRQSSERSERTERGEPQINFDDIELDDTD
DRRPTRAQKERNEEPEIEEEIDESEDETVDEERVERRPRKRKKAASKPAS
RQYMMMGVGILVLLLLIIGIGSALKAPSTTSSDQTASGEKSIDLAGNATD
QANGVQPAPGTTSAENTQQDVSLPPISSTPTQGQTPVATDGQQRVEVQGD
LNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRDTAKTQTAER
PSTTRPVRQQAVIEPKKPQATVKTEPKPVAQTPKRTEPEPAAPVASTKAP
AATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHY
TLQLSSSSNYDNLNGWAKKENLKNYVVYETTRNGQPWYVLVSGVYASKEE
AKKAVSTLPADVQAKNPWAKPLRQVQADLK
>S2528 dedA, hypothetical protein
MDLIYFLIDFILHIDVHLAELVAEYGVWVYAILFLILFCETGLVVMPFLP
GDSLLFVAGALASLETNDLNVHMMVVLMLIAAIVGDAVNYTIGRLFGEKL
FSNPNSKIFRRSYLDKTHQFYEKHGGKTIILARFVPIVRTFAPFVAGMGH
MSYRHFAAYNVIGALLWVLLFTYAGYFFGTIPMVQDNLKLLIVGIIVVSI
LPGVIEIIRHKRAAARAAK
>S2525 dedD, putative lipoprotein
MGTIVLVALGVIVLPGLLDGQKKHYQDEFAAIPLVPKAGDRDEPDMMPAA
TQALPTQPPEGAAEEVRAGDAAAPSLDPATIAANNTEFEPEPAPVAPPKP
KPVEPPKPKVEVPPAPKPEPKPVVEEKAAPTGKAYVVQLGALKNADKVNE
IVGKLRGAGYRVYTSPSTPVQGKITRILVGPDASKDKLKGSLGELKQLSG
LSGVVMGYTPN
>S2479 elaB, hypothetical protein
MSNQFGDTRIDDDLTLLSETLEEVLRSSGDPADQKYVELKAHAEKALDDV
KKRVSQASDSYYYRAKQAVYRADDYVHEKPWQGIGVGAAVGLVLGLLLAR
R
>S2168 erfK, hypothetical protein
MMRRVNILCSFALLFASQNSLAVTYPLPPEGSRLVGQSLTVTVPDHNTQP
LETFAAQYGQGLSNMLEANPGADVFLPKPGSQLTIPQQLILPATVRKGIV
VNVAEMRLYYYPPDSNTVEVFPIGIGQAGRETPRNWVTTVERKQEAPTWT
PTPNIRREYAKRGESLPAFVPAGPDNPMGLYAIYIGRLYAIHGTNANFGI
GLRVSQGCIRLRNDDIKYLFDNVPVGTRVQIIDQPVKYTTEPDGSKWLEV
HEPLSRNRAEYESDRKVPLPVTPSLRAFINGQEVDVNRANAALQHRSGMP
VQISSGSRQMF
>S0321 gtrAI, putative flippase
MLKLFVKYTSIGVLNTLIHWVVFGVCIYAAHTSQALANFTGFVVAVSFSF
FANARFTFKALTTAMRYMLYVGFMGILSVIVGWAADKCSLPPIVTLITFS
AISLVCGFVYSKFIVFRDAK
>S4222 hdeD, hypothetical protein
MLYIDKATILKFDLEMLKKHRRAIQFIAVLLFIVGLLCISFPFVSGDILS
TVVGALLICSGIALIVGLFSNRSHNFWPVLSGFLVAVAYLLIGYFFIRAP
ELGIFAIAAFIAGLFCVAGVIRLMSWYRQRSMKGSWLQLVIGVLDIVIAW
IFLGATPMVSVTLVSTLVGIELIFSAASLFSFASLFVKQQ
>S4652 hpaD, homoprotocatechuate dyoxygenase
MGKLALAAKITHVPSMYLSELPGKNHGCRQGAIDGHKEISKRCREMGVDT
IIVFDTHWLVNSAYHINCADHFEGVYTSNELPHFIRDMTYNYEGNPELGQ
LIAEEALKLGVRAKAHNIPSLKLEYGTLVPMRYMNEDKHFKVVSISAFCT
VHDFADSRKLGEAILKAIEQYDGTVAVLASGSLSHRFIDDQRAEEGMNSY
TREFDRQMDERVVKLWREGQFKEFCNMLPEYADYCYGEGNMHDTVMLLGM
LGWDKYDGKVEFITELFPSSGTGQVNAVFPLPA
>S3915 ilvM, acetolactate synthase II, small subunit
MQHQVNVSARFNPETLERVLRVVRHRGFHVCSMNMAAASDAQNINIELTV
ASPRSVDLLFSQLNKLVDVAHVAICQSTTTSQQIRA
>S0934 ipaH_2, invasion plasmid antigen
MREINMLRNISSCLFPHISTITSPNHYLSEWDDWEKQGLPEEQRTEAVRR
LRACLTSKGHKLDLRALALSSLPVLPACIKKLDVSCNKLTILTDLPENIK
ELIARDNFLTHISALPHYLITLDVSENQLENLPLLPDTIKSLSAEYNRLS
TLPSLPLNLKKLEVRNNELQTLPSLPSNLKILKVAHNHLTELPPLPRRLQ
LLFAYSNRLSNLPNIQENIIMRRFFYFENNQITTIPTNLFRLDPHITIEI
ANNPLSDQTLLFLIQQTSVPNFNGPQFRISLSDQNRLFLRQMLPQNLHSR
HIRVITEGGQNFQIPPLPETVAAWFPEADRREVSTQWTSFSTEENSRAFS
AFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESC
EDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIAR
DKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRT
AEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEY
PQRVADRLKASGLSDDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSE
NGSQLHHS
>S1268 ipaH_3, invasion plasmid antigen fragment
MKYLSPQKFSWGDAPWQIIDLSIAGKVNIQVDNNTIITLGTRLNQQHNEF
MMVAKWCEWAIQQDGLQENLQKNLYEILEENQQNKQSEIPQEDLKESLEE
IKENILEENLPASRIENRAEALRRMKECLITRRSMLNLSNLGLTSLPENL
PPHLIEFYCSKNVLTALPKVMPKWLLVLDCTDNVLILLPEGAALKTDGTE
VL
>S1947 ipaH_4, invasion plasmid antigen
MLPTNNNHRLISNSFSTYSIDTSRAYENYLTHWTEWKNNRIQEEQRDIAF
QRLVSCLQNQETNLDLSELGLTTLPEIPPGIKSINISKNNLSLIPPLPAS
LTQLNVSYNRLIELPALPQGLKLLNASHNQLITLPTLPISLKELHVSNNQ
LCSLPVLPELLETLDVSCNGLAVLPPLPFSLQEISAIGNLLSELPPLPHN
IHSIWAIDNMLTDIPYLPENLRNGYFDINQISHIPESILNLRNECSIDIS
DNPLSSHALQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAVTAWF
PENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAW
LEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLF
DNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEK
LQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHA
VLKRTEADRWALAEEQKYEMLENEYPQRVADRLKASGLSGDADAEREAGA
QVMRETEQQIYRQLTDEVLALRLPENGSQLHHS
>S2119 ipaH_5, invasion plasmid antigen
MLPVNNPPLSTGNVSFYRTTSIDNVHNNYLSEWVEWTKNSISGENRETAF
TRLQLCLENSETSLDLSCLGLRSLPRLPDNLDEINVSNNQLSMLPELPRA
LKELNASSNQLSALPELPVSLEYINVSDNHLFALPELPASLEYINVSDNH
LSVLPRLPMSLELLDAARNALEVIPDFPERDDHIIRIFWLNQNRITAIPE
SILGLSSDSVVNLRENQLSPRIMQTLLQQTAQPDYHGPRIYFSMSDGQQN
TLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVS
ARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNL
RKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDE
IEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREEN
EFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYPQRVADRLKAS
GLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGSQLHHS
>S2782 ipaH_7, invasion plasmid antigen
MREINMLRNISSCLFPHISTITSPNHYLSEWDDWEKQGLPEEQRTEAVRR
LRACLTSKGHKLDLRALALSSLPVLPACIKKLDVSCNKLTILTDLPENIK
ELIARDNFLTHISALPHYLITLDVSENQLENLPLLPDTIKSLSAEYNRLS
TLPSLPLNLKKLEVRNNELQTLPSLPSNLKILKVAHNHLTELPPLPRRLQ
LLFAYSNRLSNLPNIQENIIMRRFFYFENNQITTIPTNLFRLDPHITIEI
ANNPLSDQTLLFLIQQTSVPNFNGPQFRISLSDQNRLFLRQMLPQNLHSR
HIRVITEGGQNFQIPPLPETVAAWFPEADRREVSTQWTSFSTEENSRAFS
AFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFTVAADATESC
EDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIAR
DKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRT
AEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEY
PQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLPE
NGSQLHHS
>S3614 phnB, hypothetical protein
MPLSPYLSFAGNCSDAIAYYQRTLGAELLYKISFGEMPKSAQDSAENCPS
GMQFPDTAIAHANVRIAGSDIMMSDAIPSGKASYSGFTLVLDSQQVEEGK
RWFDNLAANGKIEMAWQETFWAHGFGKVTDKFGVPWMINVVKQQPTQ
>S1016 pqiA, paraquat-inducible protein A
MCEHHHAAKHILCSQCDMLVALPRLEHGQKAACPRCGTTLTVAWDAPRQR
PTAYALAALFMLLLSNLFPFVNMNVAGVTSEITLLEIPGVLFSEDYASLG
TFFLLFVQLVPAFCLITILLLVNRAELPVRLKEQLARVLFQLKTWGMAEI
FLAGVLVSFVKLMAYGSIGVGSSFLPWCLFCVLQLRAFQCVDRRWLWDDI
APMPELRQPLKPGVTGIRQGLRSCSCCTAILPADEPVCPRCSTKGYVRRR
NSLQWTLALLVTSIMLYLPANILPIMVTDLLGSKMPSTILAGVILLWSEG
SYPVAAVIFLASIMVPTLKMIAIAWLCWDAKGHGKRDSERMHLIYEVVEF
VGRWSMIDVFVIAVLSALVRMGGLMSIYPAMGALMFALVVIMTMFSAMTF
DPRLSWDRQPESEHEES
>S3788 rbn, tRNA processing exoribonuclease BN
MLKTIQDKARHRTRPLWAWLKLLWQRIDEDNMTTLAGNLAYVSLLSLVPL
VAVVFALFAAFPMFSDVSIQLRHFIFANFLPATGDVIQRYIEQFVANSNK
MTAVGACGLIVTALLLMYSIDSALNTIWRSKRARPKIYSFAVYWMILTLG
PLLAGASLAISSYLLSLRWASDLNTVIDNVLRIFPLLLSWISFWLLYSIV
PTIRVPNRDAIVGAFVAALLFEAGKKGFALYITMFPSYQLIYGVLAVIPI
LFVWVYWTWCIVLLGAEITVTLGEYRKLKQAAEQEEDDEP
>S2218 rfbI, glycosyl translocase
MLKIGKLLTSSFFSYFLIGIVNTALHWGVFYACYNNLAFGQGRSNIVGFI
CAATFSFFANARCSFKVSATKARYFIFIFFMGAMSYLFGVLFDLLALSPI
FTLFTFSLFSLVLGYCASKYFIFR
>S0284 rhsG, putative Rhs-family protein
MSFVSTGNKPVGNGGPVITTPPIAGESGGMSTGSAVTDVSGAAEEMAEQA
AADLFGALPEPSGLVKAAVAAAQAAAAAAGISDMAGAVQDAAASLAAGAP
GAHNVTVSGSAVPPQMLLFAGMNGSEGLGNLFSYTVQLKTLDALNLGYVS
PAANLPLQLMVGKDLCVSIELDGGGKRYISGLVTAARVVGHESRSVTYEL
RIEPWLKLLTHTSDYKAFQNKTVVDILDEVLDEYSFPVEKRLVENYPPRA
WQVQYGETDFDFIQRLMQEWGIYWWFEHSENSHTLVLVDAINGHKACPDS
PLVEWHQEGLKLDKEFIHTITASERLRTGKWVMDDFDFMKPRSLLKSTVA
SPRDTGHAEYEHYEWPGDYFTTGEGEMLTRIRMEAQRSPGSRAHGAGHIR
TLMTGYTFTLMNHPTAEINQEYLLVQTTLFLRDNAQHSGQDQHFTYVTTF
ELHPTSEVYRPQRTLSKPHTKGPQSAIVTGPAGQEIWTDKYGRVKVQFGW
DRYGKNDENSSCWIRVSYPWAGKGFGMIQIPRIGQEVLVDFKNGDPDLPI
IVGRTYNQDTMPPWGLPGMASQSGIFSHSLQGGPTNGNMLRFDDKTGAEE
VKFHAEKDLNTTVKNNETHTVMVDRIKTIVKNETISVGEDRNATITKNDG
LSVKLAQTINVGTTYRLDVGDQFTLRCGNAALVLHKDGSIEFCGKQLLLH
TSDVMQLIGSGIDMNPDGGTAMTADDIAPLPTPDVSE
>S4321 rtcB, hypothetical protein
MNYELLTTENAPVKMWTKGVPVEADARQQLINTAKMPFIFKHIAVMPDVH
LGKGSTIGSVIPTKGAIIPAAVGVDIGCGMNALRTALTAADLPENLAELR
QAIETAVPHGRTTGRCKRDKGAWENPPFNVDAKWAELEAGYQWLTQKYPR
FLNTNNYKHLGTLGTGNHFIEICLDESEQVWIMLHSGSRGIGNAIGTYFI
DLAQKEMQDQLETLPSRDLAYFMEGTEYFDDYLRAVAWAQLFASLNRDAM
MENVVTALQSITQKTVRQPQTLAMEEINCHHNYVQKEQLFGEEIYVTRKG
AVSARAGQYGIIPGSMGAKSFIVRGLGNEESFCSCSHGAGRVMSRTKAKK
LFSVEDQIRATAHVECRKDAEVIDEIPMAYKDIDAVMAAQSDLVEVIYTL
RQVVCVKG
>S0211 safA, putative cytoplasmic protein
MEIFLSDPISAECPCGPDLEYDPEYLLLFTRAAPREEAQYGDFVSTPENI
NWAELERDAHRLLMRSKDIRILVVLLRCRIQQAGARGLSEALTLLETLCS
TYPDAIHPQLLATEDITAEDAAVARSNALAALLDHEGVMADIRGITLSNN
AAMRLQVRDVERSLSALRPADALAPESVRQQLADLEARGTLPLDAFRQAA
ETTERLQRHARETLNDQAPDFSRLTQLLALLPGAVQSTTPEILPQPQAEQ
PENATIVHTEQMQAEQIALPVHIAEEISPMTGAEPRQIRDRNDALERLRV
IRRWFEHSEPSSPTIPLLRQAERLVGKRVSEVINEIPVELLEKWDALE
>S0213 safB, periplasmic chaperone of fimbral assembly machinery
MSNDFTQAQAPPWRYGFLNLMRRVDVQLCTVPAGNTWQPRMEKFRLGQTP
ALTFAPREIASVGWQEGRLHISLYSLVLWGPNGPLPLHYTELARNRTESR
R
>S0214 safC, outer membrane usher protein
MDPRLLEYYNRELSYLRETGAEFAARHPKVAARLGMQGTDIADPYVERMV
EAFSFLTARTQLKIDAEFPRFTQRLLEVVSPNYVTPTPSMSVAQLHPDTE
EGDLAKGFTVPRDTAFFSAIPEGESTACQFRSSQDVTLWPLAIEEARLTA
APPDMPALHRYLPANIHVAGALRITLRTFGELTFSQLAGLDRLPFYLCGE
ERTASHLLELLHTSAIAPLAGIPGHFDGALDVNLQQPVMYEGLEPDQGLL
PLAWNVFHGHNLLHEYFACPERFYFFTPTGLSAGLQKIDGGVAEIVILLN
RLPPDWLIHQTNAAQFSLFCTPVINLFPRVTARIDVTHSTTEQHLVVDRT
HPLDYEVFSVQEVEGLETDTTRKMAFRPLYHTRNNDEGNHGRYFSLRREP
RRLSENARRYGTRTPYTGSEVFLSLVDQYEAPYPENLRHITITAMVTNRD
LPCLIARNGRDDLTVDAAIPVAGVGLIRPPRSPQPPMAEREMAWRLIRQL
SFNYLPLADLDHRTGGQALRDLLNLFIPAHDSPQSRQVRSLIGCKTTPVT
RRLPGSGLLVYGRGVSCELTVDEEGFSGISPYLFGLVLEHYIARHVSINT
FSQMTLHSMQRGKIMTWPVRAGQRGSV
>S2358 sanA, vancomycin resistance protein
MLKRVFLSLLVLIGLLLLTVLGLDRWMSWKTAPYIYDELQDLPYRQVGVV
LGTAKYYRTGVINQYYRYRIQGAINAYNSGKVNYLLLSGDNALQSYNEPM
TMRKDLIAAGVDPSDIVLDYAGFRTLDSIVRTRKVFDTNDFIIITQRFHC
ERALFIALHMGIQAQCYAVPSPKDMLSVRIREFAARFGTLADLYIFKREP
RFLGPLVPIPAMHQVPEDAQGYPAVTPEQLLELQKKQGK
>S4618 sgaT, hypothetical protein
MHNIPGVRNTRLPLLQEIVMEILYNIFTVFFNQVMTNAPLLLGIVTCLGY
ILLRKSVSVIIKGTIKTIIGFMLLQAGSGILTSTFKPVVAKMSEVYGING
AISDTYASMMATIDRMGDAYSWVGYAVLLALALNICYVLLRRITGIRTIM
LTGHIMFQQAGLIAVTLFIFGYSMWTTIICTAILVSLYWGITSNMMYKPT
QEVTDGCGFSIGHQQQFASLIAYKVAPFLGKKEESVEDLKLPGWLNIFHD
NIVSTAIVMTIFFGAILLSFGIDTVQAMAGKVHWTVYILQTGFSFAVAIF
IITQGVRMFVAELSEAFNGISQRLIPGAVLAIDCAAIYSFAPNAVVWGFM
WGTIGQLIAVGILVACGSSILIIPGFIPMFFSNATIGVFANHFGGWRAAL
KICLVMGMIEIFGCVWVVKLTGMSAWMGMADWSILAPPMMQGFFSIGIAF
MAVIIVIALAYMFFAGRALRAEEDAEKQLAEQSA
>S4396 slyX, host factor for lysis of phiX174 infection
MQDLSLEARLAELESRLAFQEITIEELNVTVAAHEMEMAKLRDHLRLLTE
KLKASQPSNIASQAEETPPPHY
>S3540 smg, hypothetical protein
MYLFETYIHTEAELRVDQDKLEQDLTDAGFEREDIYNALLWLEKLADYQE
GLAEPMQLASDPLSMRIYTPEECERLDASCRGFLLFLEQIQVLNLETREM
VIERVLALDNAEFELDDLKWVILMVLFNIPGCENAYQQMEELLFEVNEGM
LH
>S3139 sprT, hypothetical protein
MKTSRLPIAIQQAVMRRLREKLAQANLKLGRNYPEPKLSYTQRGTSAGTA
WLESYEIRLNPVLLLENSEAFIEEVVPHELAHLLVWKHFGRVAPHGKEWK
WMMESVLGVPARRTHQFELQSVRRNTFPYRCKCQEHQLTVRRHNRVVRGE
AIYRCVHCGEQLVAK
>S2231 wcaK, putative galactokinase
MKLLILGNHTCGNRGDSAILRGLLDAINILNPHTEVDVMSRYPVSSSWLL
NRPVMGDPLFLQMTQHNSAAGVVGRIKKVLRRRYQHQVLLSRVTDTGKLR
NIAIAQGFTDFVRLLSGYDAIIQVGGSFFVDLYGVPQFEHALCTFMAKKP
LFMIGHSVGPFQDEQFNQLANYVFGHCDALILRESVSLDLMKRRNITTAK
VEHGVDTAWLVDHHTEDFTASYAVQHWLDVAAQQKTVAITLRELAPFDKR
LGTTQQAYEKAFAGVVNRILDEGYQVIALSTCTGIDSYNKDDRMVALNLR
QHISDPARYHVVMDELNDLEMGKILGACELTVGTRLHSAIISMNFATPAI
AINYEHKSAGIMQQLGLPEMAIDIRHLLDGSLQAMVADTLGQLPVLNARF
NEAVSRERQTGMQMVQSVLERIGEVK
>S0006 yaaA, hypothetical protein
MLILISPAKTLDYQSPLTTTRYTLPELLDNSQQLIHEARKLTPPQISTLM
RISDKLAGINAARFHDWQPDFTPANARQAILAFKGDVYTGLQAETFSEDD
FDFAQQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDI
ITNKLNEALAAQGDNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGK
FKIISFYAKKARGLMSRFIIENRLTKPEQLTGFNSEGYFFDEDSSSNGEL
VFKRYEQR
>S0010 yaaH, hypothetical protein
MGNTKLANPAPLGLMGFGMTTILLNLHNVGYFALDGIILAMGIFYGGIAQ
IFAGLLEYKKGNTFGLTAFTSYGSFWLTLVAILLMPKLGLTDAPNAQFLG
VYLGLWGVFTLFMFFGTLKGARVLQFVFFSLTVLFALLAIGNIAGNAAII
HFAGWIGLICGASAIYLAMGEVLNEQFGRTVLPIGESH
>S0080 yabB, hypothetical protein
MFRGATLVNLDSKGRLSVPTRYREQLLENAAGQMVCTIDIHHPCLLLYPL
PEWEIIEQKLSRLLSMNPVERRVQRLLLGHASECQMDGAGRLLIAPVLRQ
HAGLTKEVMLVGQFNKFELWDETTWHQQVKEDIDAEQLATGDLSERLQDL
SL
>S0062 yabI, hypothetical protein
MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIG
SGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDK
TEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLL
WPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWR
SGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPLMPVYIDILRK
VVGV
>S0101 yacF, hypothetical protein
MQTQVLFEHPLNEKMRTWLRIEFLIQQLTVNLPIVDHAGALHFFRNVSEL
LDVFERAEVRTELLKELNRQQRKLQTWIGVPGVDQSRIEALIQQLKAAVS
VLISAPRIGQFLREDRLIALVRQRLSIPGGCCSFDLPTLHIWLHLPQAQR
DSQVETWIASLNPLTQALTMVLDLIRQSAPFRKQTSLNGFYQDNGGDADL
LRLNLSLDSQLYPQISGHKSRFAIRFMPLDTENGQVPERLDFELACC
>S0100 yacG, hypothetical protein
MSETITVNCPTCGKTVVWGEISPFRPFCSKRCQLIDLGEWAAEEKRIPSS
GDLSESDDWSEEPKQ
>S0118 yacL, hypothetical protein
MARCDFGALPGAEEHTMDYEFLRDITGVVKVRMSMGHEVVGHWFNEEVKE
NLALLDEVEQAAHALKGSERSWQRAGHEYTLWMDAEEVMVRANQLEFAGD
EMEEGMNYYDEESLSLCGVEDFLQVVAAYRNFVQQK
>S0151 yadR, hypothetical protein
MSDDVALPLEFTDAAANKVKSLIADEDNPNLKLRVYITGGGCSGFQYGFT
FDDQVNEGDMTIEKQGVGLVVDPMSLQYLVGGSVDYTEGLEGSRFIVTNP
NAKSTCGCGSSFSI
>S0152 yadS, hypothetical protein
MLVYWLDIVGTAVFAIYGVLLAGKLRMDPFGVLVLGVVTAVGDGTIRDMA
LDHGPVFWVKDPTDLVVAMVTSMLTIVLVRQPRRLPKWMLPVLDAVGLAV
FVGISVNKAFNAEAGPLIAVCMGVITGVGGGIIRDVLVREIPMILRTEIY
ATACIIGGIVHATAYYTFSVPLETASMMGMVVTLLIRLAAIRWHLKLPTF
ALDENGR
>S0188 yaeB, hypothetical protein
MSSFQFEQIGVIRSPYKEKFAVPRQPGLVKSANGELHLIASYNQADAVRG
LEAFSHLWILFVFHQTMEGGWRPTVRPPRLGGNARMGVFATRSTFRPNPI
GMSLVELKEVVCHKGSVILKLGSLDLVDGTPVVDIKPYLPFAESLPDASA
SYAQSAPAAEMAVSFTAEVEKQLLTLEKRYPQLTLFIREVLAQDPRPAYR
KGEETGKTYAVWLHDFNVRWRVTDAGFEVFALEPR
>S0183 yaeQ, hypothetical protein
MALKATIYKATVNVADLDRNQFLDASLTLARHPSETQERMMLRLLAWLKY
ADERLQFTRGLCADDEPEAWLRNDHLGIDLWIELGLPDERRIKKACTQAA
EVALFTYNSRAAQIWWQQNQSKCVQFANLSVWYLDDEQLAKVSAFADRTM
TLQATIQDGVIWLSDDKNNLEVNLTAWQQPS
>S0203 yafD, hypothetical protein
MRKNTYAMRYVAGQPAERILPPGSFASIGQALPPGEPLSTEERIRILVWN
IYKQQRAEWLSVLKNYGKDAHLVLLQEAQTTPELVQFATANYLAADQVPA
FVLPQHPSGVMTLSAAHPVYCCPLREREPILRLAKSALVTVYPLPDTRLL
MVVNIHAVNFSLGVDVYSKQLLPIGDQIAHHSGPVIMAGDFNAWSRRRMN
ALYRFAREMSLRQVRFTDDQRRRAFGRPLDFVFYRGLNVSEASVLVTRAS
DHNPLLVEFSPGKPDK
>S0295 yafK, hypothetical protein
MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFK
EERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQR
NQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQG
IDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKP
GYDYFEQTRKPPTVSVANGRYVVSKPLSHEVVQPQLASNYTLPEAK
>S0335 yaiE, hypothetical protein
MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISG
ALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL
>S0331 yaiI, hypothetical protein
MPHSCREIHCFDNRWQKHKQNYAGRQKRDTIEDYLTKDDFMTIWVDADAC
PNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNE
IVRQCEAGDLVITADIPLAAEAIEKGAAALNSRGERYTPATIRERLTMRD
FMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG
>S0371 yajQ, hypothetical protein
MPSFDIVSEVDLQEARNAVDNASREVESRFDFRNVEASFELNDASKTIKV
LSESDFQVNQLLDILRAKLLKRGIEGSSLDVPENIVHSGKTWFVEAKLKQ
GIESATQKKIVKMIKDSKLKVQAQIQGDEIRVTGKSRDDLQAVMAMVRGG
DLGQPFQFKNFRD
>S0408 ybaA, hypothetical protein
MKYVDGFVVAVPADKKDAYREMAAKAAPLFKEFGALRIVECWASDVPDGK
VTDFRMAVKAEENEEVVFSWIEYPSKEVRDAANQKMMSDPRMKEFGESMP
FDGKRMIYGGFESIIDE
>S0423 ybaB, hypothetical protein
MFGKGGLGNLMKQAQQMQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAH
NCRRVEIDPSLLEDDKEMLEDLVAAAFNDAARRIEETQKEKMASVSSGMQ
LPPGFKMPF
>S0433 ybaK, hypothetical protein
MTPAVKLLEKNKISFQIHTYEHDPAETNFGDEVVKKLGLNPDQVYKTLLV
AVNGDMKHLAVAVTPVAGQLDLKKVAKALGAKKVEMADPMVAQRSTGYLV
GGISPLGQKKRLPTIIDAPAQEFATIYVSGGKRGLDIELAAGDLAKILDA
KFADIARRD
>S0420 ybaN, hypothetical 14.8kd protein
MQRIILIIIGWLAVVLGTLGVVLPVLPTTPFILLAAWCFARSSPRFHAWL
LYRSWFGSYLRFWQKHHAMPRGVKPRAILLILLTFAISLWFVQMPWVRIM
LLVILACLLFYMWRIPVIDEKQEKH
>S0434 ybaP, putative ligase
MDLLYRVKTLWAALRGNHYTWPAIDITLPGNRHFHLIGSIHMGSHDMAPL
PTRLLKKLKNADALIVEADVSTSDTPFANLPACEALEERISEEQLQNLQH
ISQEMGISPSLFSTQPLWQIAMVLQATQAQKLGLRAEYGIDYQLLQAAKQ
QHKPVIELEGAENQIAMLLQLPDKGLALLDDTLTHWHTNARLLQQMMSWW
LNAPPQNNDITLPNTFSQSLYDVLMHQRNLAWRDKLRAMPPGRYVVAVGA
LHLYGEGNLPQMLR
>S0404 ybaY, conserved hypothetical lipoprotein
MKLVHMASGLAVAIALAACADKSADIQTPAPAANTSISATQQPAIQQPNV
SGTVWIRQKVALPPDAVLTVTLSDASLADAPSKVLAQKAVRTEGKQSPFS
FVLPFNPADVQPNARILLSAAITVNDKLVFITDTVQPVINQGGTKADLTL
VPVQQTAVPVQASGGATTTVPSTSPTQVNPSSAVPAPTQY
>S0463 ybbF, hypothetical protein
MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDD
DPNPLHRQMAAAIKAVSDSGVPCYFIHGNRDFLLGKRFARESGMTLLPEE
KVLELYGRRVLIMHGDTLCTDDAGYQAFRAKVHKPWLQMLFLALPLFVRK
RIAARMRANSKEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPA
VHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF
>S0467 ybcJ, hypothetical protein
MIHRVSNMATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDG
AVETRKRCKIVAGQTVSFAGHSVQVVA
>S0494 ybdF, hypothetical protein
MDKQSLHETAKRLALELPFVELCWPFGPEFDVFKIGGKIFMLSSELRGVP
FINLKSDPQKSLLNQQIYPSIKPGYHMNKKHWISVYSGEEISEALLRDLI
NDSWNLVVDGLAKRDQKRVRPG
>S0496 ybdK, hypothetical protein
MPLPDFHVSEPFTLGIELEMQVVNPPGYDLSQDSSMLIDAVKNKITAGEV
KHDITESMLELATDVCRDINQAAGQFSAMQKVVLQAAADHHLEICGGGTP
PFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCASGDDAIYL
LHGLSRFVPHFIALSAASPYMQGTDTRFAPSRPNIFSAFPDNGPMPWVSN
WQQFEALFRCLSYTTMIDSIKDLHWDIRPSPHFGTVEVRVMDTPLTLSHA
VNMAGLIQATAHWLLTERPFKHQEKDYLLYKFNRFQACRYGLEGVITDPH
TGDRRPLTEDTLRLLEKIAPSAHKIGASSAIEALHRQVVSGLNEAQLMRD
FVADGGSLIGLVKKHCEIWAGD
>S0667 ybeA, hypothetical protein
MKLQLVAVGTKMPDWVQTGFTEYLRRFPKDMPFELIEIPAGKRGKNADIK
RILDKEGELMLAAAGKNRIVTLDIPGKPWDTPQLAAELERWKLDGRDVSL
LIGGPEGLSPACKAAAEQSWSLSALTLPHPLVRVLVAESLYRAWSITTNH
PYHRE
>S0666 ybeB, hypothetical protein
MIICTGTSSRHVMSIADHVVQESRAAGLLPLGVEGENSADWIVVDLGDVI
VHVMQEESRRLYELEKLWS
>S0672 ybeD, hypothetical protein
MKTKLNELLEFPTPFTYKVMGQALPELVDQVVEVVQRHAPGDYTPTVKPS
SKGNYHSVSITINATHIEQVETLYEELGKIDIVRMVL
>S0603 ybgA, hypothetical protein
MNLQRFDDSTLIRIFALHELHRLKEHGLTRGALLDYHSRYKLVFLAHSQP
EYRKLGPFVADIHQWQNLDDFYNQYRQRVIVLLSHPANPRDHTNVLMHVQ
GYFRPHIDSTERQQLAALIDSYRRGEQPLLAPLMRIKHYMALYPDAWLSG
QRYFELWPRVINLRHSGVL
>S0575 ybgE, hypothetical protein
MSKIIATLYAVMDKRPLRALSFVMALLLAGCMFWDPSRFAAKTSELEIWH
GLLLMWAVCAGVIHGVGFRPQKVLWQGIFCPLLADIVLIVGLIFFFF
>S0568 ybgF, hypothetical protein
MSSNFRHQLLSLSLLVGIAAPWAAFAQAPISSVSSGSVEDRVTQLERISN
AHSQLLTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQILLQIDS
LSSGGAAAQSTSSDQSGATASTTPTADAGTANAGAPVKSGDANTDYNAAI
ALVQDKSRQDDAMVAFQNFIKNYPDSTYLPNANYWLGQLNYNKGKKDDAA
YYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVISKYPGT
DGAKQAQKRLNAM
>S0600 ybgI, hypothetical protein
MKNTELEQLINEKLNSAAISDYAPNGLQVEGKETVQKIVTGVTASQALLD
EAVRLGADAVIVHHGYFWKGESPVIRGMKRNRLKTLLANDINLYGWHLPL
DAHPELGNNAQLAALLGITVMGEIEPLVPWGELTMPVPGMELASWIEARL
GRKPLWCGDTGPEVVQRVAWCTGGGQSFIDSAARFGVDAFITGEVSEQTI
HSAREQGLHFYAAGHHATERGGIRALSEWLNENTDLDVTFIDIPNPA
>S0937 ybhH, hypothetical protein
MAIMGSGNALEIDGIGGGNPLTSKVAIISRSSDPRADVDYLFAQVIVHEQ
RVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQ
TPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFD
DVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAG
KAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAI
SSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASV
IRTTRKIFSGEVYLP
>S0771 ybhK, putative structural protein
MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGR
IRRSEGGIAWGDMRNCLNQLITEPNVASAMFEYRFGGNGELSGHNLGNLM
LKALDHLSVRPLEAINLIRNLLKVDTHLIPMSEHPVDLMAIDDQGHEVYG
EVNIDQLTTPIQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPI
LLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKLAIMEQYVGKK
VIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHNALEKALQA
LG
>S0779 ybhN, hypothetical protein
MSKSHPRWRLAKKILTWLFFIAVIVLLVVYAKKVDWEEVWKVIRDYNRVA
LLSAVGLVVVSYLIYGCYDLLARFYCGHKLAKRQVMLVSFICYAFNLTLS
TWVGGIGMRYRLYSRLGLPGSTITRISSLSITTNWLGYILLAGIIFTAGV
VELPDYWYVDQTTLRILGIGLLMIIAVYLWFCAFAKHRHMTIKGQKLVLP
SWKFALAQMLISSVNWMVMGAIIWLLLGQSVNYFFVLGVLLVSSIAGVIV
HIPAGIGVLEAVFIALLAGEHTSKGSIIAALLAYRVLYYFIPLLLALICY
LLLESQAKKLRAKNEAAM
>S0789 ybiA, hypothetical protein
MPVRAQRIQHVMQDTIINFYSTSDDYGDFSNFAARPIKVDGNTWPTSEHY
FQAQKFLDEKYREEIRRVSSPMVAARMGRNRSKPLRKNWESVKEQVMRKA
LRAKFEQHAELRVLLLATAPAKLVEHTENDAYWGDGGNGKGKNRLGYLLM
ELREQLAIEK
>S0812 ybiS, hypothetical protein
MNMKLKTLFAAAFAVVGFCSTASAVTYPLPTDGSRLVGQNQVITIPEGNT
QPLEYFAAEYQMGLSNMMEANPGVDTFLPKGGTVLNIPQQLILPDTVHEG
IVINSAEMRLYYYPKGTNTVIVLPIGIGQLGKDTPINWTTKVEHKKAGPT
WTPTAKMHAEYRAAGEPLPAVVPAGPDNPMGLYALYIGRLYAIHGTNANF
GIGLRVSHGCVRLRNEDIKFLFEKVPVGTRVQFIDEPVKATTEPDGSRYI
EVHNPLSTTEAQFEGQEIVPITLTKSVQTVTGQPDVDQVVLDEAIKNRSG
MPVRLN
>S0795 ybiX, putative enzyme
MWGAGPTFWRTCMMYHIPGVLSPQDVAHFREQLEQAEWVDGRVTTGAQGA
QVKNNQQVDTRSALYAALQNEVLNAVNQHALFFAAALPRTLSTPLFNRYQ
NNETYGFHVDGAVRSHPQNGWMRTDLSATLFLSDPQSYDGGELVVNDTFG
QHRVKLPAGDLVLYPSSSLHCVTPVTRGVRVASFMWIQSMIRDDKKRAML
FELDKNIQSLKSRYGENEEILSLLNLYHNLLREWSEI
>S0870 ybjE, putative surface protein
MCHRAFRLLLCKDWIFMFSGLLIILVPLIVGYLIPLRQQAALKAINQLLS
WMVYLILFFMGISLAFLDNLASNLLAILHYSAVSITVILLCNIAALMWLE
RGLPWRNHHQQEKLPSRIAMALESLKLCGVVVIGFAIGLSGLAFLQHATE
ASEYTLILLLFLVGIQLRNNGMTLKQIVLNRRGMIVAVVVVVSSLIGGLI
NAFILDLPINTALAMASGFGWYSLSGILLTESFGPVIGSAAFFNDLAREL
IAIMLIPGLIRRSRSTALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGF
ILSLLVPILIAFFSA
>S0877 ybjX, putative enzyme
MVKSTSCITIDFMNMSQLTERTFTPSESLSSLSLFLSLARGQCRPGKFWH
RCSFRQKFLLRSLIMPRLSVEWMNELSHWPNLNVLLTRQPRLPVRLHRPY
LAANLSRKQLLEALRYHYALLRGCMSAEEFSLYLNTPGLQLAKLEGKNGE
QFTLELTMMISMDKEGDSTILFRNSEGIPLAEITFTLCEYQGKRTMFIGG
LQGAKWEIPHQEIQNATKACHGLFPKRLVMEAACLFAQRLQVEQIIAVSN
ETHIYRSLRYRDKEGKIHADYNAFWESVGGVCDAERHYRLPAQIARKEIA
EIASKKRAEYRRRYEMLDAIQPQMATMFRG
>S0964 ycaO, hypothetical protein
MTQTFIPGKDAALEDSIARFQQKLSDLGFQIEEASWLNPVPNVWSVHIRD
KECALCFTNGKGATKKAALASALGEYFERLSTNYFFADFWLGETIANGPF
VHYPNEKWFPLSENDDVPEGLLDDRLRAFYDPENELTGSMLIDLQSGNED
RGICGLPFTRQSDNQTVYIPMNIIGNLYVSNGMSAGNTRNEARVQGLSEV
FERYVKNRIIAESISLPEIPADVLARYPAVVEAIETLETEGFPIFAYDGS
LGGQYPVICVVLFNPANGTCFASFGAHPDFGVALERTVTELLQGRGLKDL
DVFTPPTFDDEEVAEHTNLETHFIDSSGLISWDLFKQDADYPFVDWNFSG
TTEEEFATLMAIFNKEDKEVYIADYEHLGVYACRIIVPGMSDIYPAEDLW
LANNSMGSHLRETILSLPGSEWEKEDYLNLIEQLDEEGFDDFTRVRELLG
LATGSDNGWYTLRIGELKAMLALAGGDLEQALVWTEWTMEFNSSVFSPER
ANYYRCLQTLLLLAQEEDRQPLQYLNAFVRMYGADAVEAASAAMSGEAAF
YGLQPVDSDLHAFAAHQSLLKAYEKLQRAKAAFWAK
>S0976 ycaQ, hypothetical protein
MSLPHLSLADARNLHLAAQGLLNKPRRRASLEDIPATISRMSLLQIDTIN
IVARSPYLVLFSRLGNYPAQWLDESLARGELMEYWAHEACFMPRSDFRLI
RHRMLAPEKMGWKYKDAWMQEHEAEIAQLIQHIHDKGPVRSADFEHPRKG
ASGWWEWKPHKRHLEGLFTAGKVMVIERRNFQRVYDLTHRVMPDWDDERD
LVSQTEAEIIMLDNSARSLGIFREQWLADYYRLKRPALAAWREARAEQQQ
IIAVHVEKLGNLWLHADLLPLLERALAGKLTATHSAVLSPFDPVVWDRKR
AEQLFDFSYRLECYTPAPKRQYGYFVLPLLHRGQLVGRMDAKMHRQTGIL
EVISLWLQEGIKPTTMLQKGLRQAITDFASWQQATRVTLGRCPQGLFTDC
RTGWEIDPVA
>S0977 ycaR, hypothetical protein
MDHRLLEIIACPVCNGKLWYNQEKQELICKLDNLAFPLRDGIPVLLETEA
RVLTADESKS
>S0986 ycbB, putative amidase
MLLNMMCGRRLSAISLCLAVTFAPLFNAQADEPEVIPGDSPVAVSEQGEA
LPQAQATAIMAGIQPLPEGAAEKARTQIESQLPAGYKPVYLNQLQLLYAA
RDMQPMWENRDAVKAFQQQLAEVAIAGFQPQFNKWVELLTDPGVNGMARD
VVLSDAMMGYLHFIANIPIKGTRWLYSSKPYALSTPPLSVINQWQLALDK
GQLPTFVAGLAPQHPQYAAMHESLLVLLSDTKPWPQLTGKATLRPGQWSN
DVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMD
KQTTSRSKPAPAVRAAYDNELVEAVKRFQAWQGLGADGAIGPATRDWLNV
TPAQRAGVLALNIQRLRLLPTELSTGIMVNIPAYSLLYYQNGNQVLDSRV
IVGRPDRKTPMMSSALNNVVVNPPWNVPPTLARKDILPKVRNDPGYLESH
GYTVMRGWNSREAIDPWEVDWSTITASNLPFRFQQAPGPRNSLGRYKFNM
PSSEAIYLHDTPNHNLFKRDTRALSSGCVRVNKASDLANMLLQDAGWNDK
RISDALKQGDTRYVNIRQSIPVNLYYLTAFVGADDRTQYRTDIYNYDLPA
RSSSQIVSKAEQLIR
>S0980 ycbC, hypothetical protein
MLFTLKKVIGNMLLPLPLMLLIIGAGLALLWFSRFQKTGKIFISIGWLAL
LLLSLQPVADRLLRPIESTYPTWNNSQKVDYIVVLGGGYTWNPQWAPSSN
LINNSLPRLNEGIRLWRENPGSKLIFTGGVAKTNTVSTAEVGARVAQSLG
VPREQIITLDLPKDTEEEAAAVKQAIGDAPFLLVTSASHLPRAMIFFQQE
GLNPLPAPANQLAIDSPLNPWERAIPSPVWLMHSDRVGYETLGRIWQWLK
GPSGEPRQE
>S1022 ycbG, putative dehydrogenase
MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENE
PVLVNGWIDKHMNPELVNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFIV
WQRLAGLAQRRGKTLSETIVQLIEDAENKEKYANKMSSLKQDLQALLGKE
>S0987 ycbK, hypothetical protein
MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGES
IKAEFFDGRGYIQEELAKLNHFFRDYRANKIKSIDPGLFDQLYRLQGLLG
TRKPVQLISGYRSIDTNNELRARSRGVAKKSYHTKGQAMDFHIEGIALSN
IRKAALSMRAGGVGYYPRSNFVHIDTGPARHW
>S1029 yccF, hypothetical protein
MRTVLNILNFVLGGFATTLGWLLATLVSIVLIFTLPLTRSCWEITKLSLV
PYGNEAIHVDELNPAGKNVLLNTGGTVLNIFWLIFFGWWLCLMHIATGIA
QCISIIGIPVGIANFKIAAIALWPVGRRVVSVETAQAAREANARRRFE
>S1034 yccV, hypothetical protein
MWNFTLISKVKISREVTMIASKFGIGQQVRHSLLGYLGVVVDIDPVYSLS
EPSPDELAVNDELRAAPWYHVVMEDDNGLPVHTYLAEAQLSSELQDEHPE
QPSMDELAQTIRKQLQAPRLRN
>S1151 yceH, hypothetical protein
MKYQLTALEARVIGCLLEKQVTTPEQYPLSVNGVVTACNQKTNREPVMNL
SESEVQEQLDNLVKRHYLRTVSGFGNRVTKYEQRFCNSEFGDLKLSAAEV
ALITTLLLRGAQTPGELRSRAARMYEFSDMAEVESTLEQLANREDGPFVV
RLAREPGKRESRYMHLFSGEVEDQPAVTAMSNMVDGDLQTRVEALEIEVA
ELKQRLDSLLAHLGD
>S1140 yceI, hypothetical protein
MKKSLLGLTFASLMFSAGSAVAADYKIDKEGQHAFVNFRIQHLGYSWLYG
TFKDFDGTFTFDEKNPAADKVNVTINTTSVDTNHAERDKHLRSADFLNTT
KYPQATFTSTSVKKDGDELDITGDLTLNGVTKPVTLEAKLIGQGDDPWGG
KRAGFEAEGKIKLKDFNIKTDLGPASQEVDLIISVEGVQQK
>S1210 ycfD, hypothetical protein
MLNMEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMES
EVDSRLVSHQDGKWQVSHGPFESYDHLGETNWSLLVQAVNHWHEPTAALM
RPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEK
LQMKQHCPHPDLLQVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMN
YSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPADVLPQEM
DKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDAL
KQGDVLVRLGGLRVLRIGDDVYANGEKIDSPHRPALDALASNIALTAENF
GDALEDPSFLAMLAALVNSGYWFFEG
>S1194 ycfJ, hypothetical protein
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTP
RQECRNVTVTHRRPVQDENRITGSVLGAVAGGVIGHQFGGGRGKDVATVV
GALGGGYAGNQIQGSLQESDTYTTTQQRCKTVYDKSEKMLGYDVTYKIGD
QQGKIRMDRDPGTQIPLDSNGQLILNNKV
>S1197 ycfS, hypothetical protein
MMIKTRFSRWLTFFTFAAAVALALPAKANTWPLPPASSRLVGENKFHVVE
NDGGSLEAIAKKYNVGFLALLQANPGVDPYVPRAGSVLTIPLQTLLPDAP
REGIVINIAELRLYYYPPGKNSVTVYPIGIGQLGGDTLTPTMVTTVSDKR
ANPTWTPTANIRARYKAQGIELPAVVPAGPDNPMGHHAIRLAAYGGVYLL
HGTNADFGIGMRVSSGCIRLRDDDIKTLFSQVTPGTKVNIINTPIKVSAE
PNGARLVEVHQPLSEKIDDDPQLLPITLNSAMQSFKDAAQTDAEVMQHVM
DVRSGMPVDVRRHQVSPQTL
>S1199 ycfT, hypothetical protein
MKQKELWINQIKGLCICLVVIYHSVITFYPHMTTFQHPLSEVLSKCWIYF
NLYLAPFRMPVFFFISGYLIRRYIDSVPWGNCLDKRIWNIFWVLALWGVV
QWLALSALNQWLAPERDLSNASNAAYADSTGEFLHGMITASTSLWYLYAL
IVYFVVCKIFSRLALPLFALFVLLSVAVNFVPTPWWGMNSVIRNLPYYSL
GAWFGATIMTCVKEVPLRRHLLMASLLTVLAVGAWLFTISLLLSLVSIVV
IMKLFYQYEQRFGMRSTSLLNVIGSNTIAIYTTHRILVEIFSLTLLAQMN
AARWSPQVELTLLLVYPFVSLFICTVAGLLVRKLSQRAFSDLLFSPPSLP
AAVSYSR
>S1265 ycgB, putative sporulation protein
MATIDSMNKDTTRLSDGPDWTFDLLDVYLAEIDRVAKLYRLDTYPHQIEV
ITSEQMMDAYSSVGMPINYPHWSFGKKFIETERLYKHGQQGLAYEIVINS
NPCIAYLMEENTITMQALVMAHACYGHNSFFKNNCLFRSWTDASSIVDYL
IFARKYITECEERYGVDEVERLLDSCHALMNYGVDRYKRPQKISLQEEKA
RQKSREEYLQSQVNMLWRTLPKREEEKTVAEARRYPSEPQENLLYFMEKN
APLLESWQREILRIVRKVSQYFYPQKQTQVMNEGCATFWHYTILNHLYDE
GKVTERFMLEFLHSHTNVVFQPPYNSPWYSGINPYALGFAMFQDIKRICQ
SPTEEDKYWFPDIAGSDWLETLHFAMRDFKDESFISQFLSPKVMRDFRFF
TVLDDDRHNYLEISAIHNEEGYREIRNRLSSQYNLSNLEPNIQIWNVDLR
GDRSLTLRYIPHNRAPLDRGRKEVLKHVHRLWGFDVMLEQQNEDGSIELL
ERCPPRMGNL
>S1256 ycgL, hypothetical protein
MPKPGILKSKSMFCAIYRSSKRDQTYLYVEKKDDFSRVPEELMKGFGQPQ
LAMILPLDGRKKLVNADIEKVKLALTEQGYYLQLPPPPEDLLKQHLSVMG
QKTDDTNK
>S1258 ycgN, hypothetical protein
MAEHLMSDVPFWQSKTLDEMSDAEWESLCDGCGQCCLHKLMDEDTDEIYF
TNVACRQLNIKTCQCRNYERRFEFEPDCIKLTRENLPTFEWLPMTCAYRL
LAEGKDLPAWHPLLTGSKAAMHGERISVRHIAVKESEVIDWQDHILNKPD
WAQ
>S1301 ychA, hypothetical protein
MRSLADFEFNKAPLCEGMILACEAIRRDFPSQDVYDELERLVSLAKEEIS
QLLPLEEQLEKLIALFYGDWGFKASRGVYRLSDALWLDQVLKNRQGSAVS
LGAVLLWVANRLDLPLLPVIFPTQLILRIECPDGEIWLINPFNGESLSEH
MLDGWLKGNISPSAELFYEDLDEADNIEVIRKLLDTLKASLMEENQMELA
LRTSEALLQFNPEDPYEIRDRGLIYAQLDCEHVALNDLSYFVERCPEDPI
SEMIRAQINNIAHKHIVLH
>S1327 ychE, putative channel protein
MIQTLFDFPVYFKFFIGLFALVNPVGIIPVFISMTSYQTAAARNKTNLTA
NLSVAIILWISLFLGDTILQLFGISIDSFRIAGGILVVTIAMSMISGKLG
EDKQNKQEKSETAVRESIGVVPLALPLMAGPGAISSTIVWGTRYHSISYL
FGFFVAIALFALCCWGLFRMAPWLVRVLRQTGINVITRIMGLLLMALGIE
FIVTGIKGIFPGLLN
>S1319 ychJ, hypothetical protein
MRSRYCAFVMQDADYLIKTWHPSCGAAALRAELMTGFAHTEWLGLTVFEH
CWQDADNIGFVSFVARFTEGGKTGAIIERSRFLKENGQWYYIDGTRPQFG
RNDPCPCGSGKKFKKCCGQ
>S1346 yciE, hypothetical protein
MNRIEHYHDWLRDAHAMEKQAESMLESMASRIDNYPELRARIEQHLSETK
NQIVQLETILDRNDISRSVIKDSMSKMAALGQSIGGIFPSDEIVKGSIGG
YVFEQFEITCYTSLLAAAKNAGDTVSIPIIEAILNEEKQMADWLIQHIPQ
TTEKFLIRSETDGVEAKK
>S1347 yciF, putative structural protein
MNMKTIEDVFIQLLSDTYSAEKQLTRALAKLARATSNEKLSQAFHAHLEE
THGQIERIDQVVESESNLKIKRMKCVAMEGLIEEANEVIESTEKNEVRDA
ALIAAAQKVEHYEIASYGTLATLAEQLGYRKAAKLLKETLEEEKATDIKL
TDLALNNVNKKAENKA
>S1340 yciI, hypothetical protein
MHDLNKDIIFPLQIFALSNNVLYNFPEQGVVPVLYVIYAQDKADSLEKRL
SVRPAHLARLQLLHDEGRLLTAGPMPAVDSNDPGAAGFTGSTVIAEFESL
EAAQAWADADPYVAAGVYEHVSVKPFKKVF
>S1366 yciS, hypothetical protein
MKYLLIFLLVLAIFVISVTLGAQNDQQVTFNYLLAQGEYRISTLLAVLFA
AGFAIGWLICGLFWLRVRVSLARAERKIKRLENQLSPATDVAVVPHSSAA
KE
>S1374 yciW, putative oxidoreductase
MEQRHITGKSHWYHETQSSTTEYDVLPLVPEAAKVSDPFLLDVILDEETL
APFLSWLVPARVLAVELFPDQLTVTRSQTFTAYERLSTALTVAQVCGVQR
LCNYYSARLTPLPGPDSSRESNHRLAQITQYARQLASSPSIIDNRSRQHL
NDVGLTVWDCVIINQIIGFIGFQARTIATFQAYLGHPVRWLPGLAIQNYA
DASLFADESIRWRSSYEVEKLPEEHTKSSTAELCQLAETLSLHPISLSLL
EKLLNSTRVNAQPDNQLAALLCARINGSPACFSTCMDSSNEYKKISTLLR
KGENEINRWADRHSVERATVQAIQWLTRAPDRFSAAQFSPLLEHEKSSTQ
IINLLVWSGLCGWINRLKIALGETY
>S1376 ycjD, hypothetical protein
MMDKIKSNARDLRRNLTLQERKLWRYLRSRRFGDFKFRRQHPVGSYILDF
ACCSARVVVELDGGQHDLAVAYDTRRTSWLESQGWTVLRFWNNEIDCNEE
AVLEIILQELNRRSPSP
>S1411 ycjF, hypothetical protein
MTEPLKPRIDFDGPLEVEQNPKFRAQQTFDENQAQNFAPATLDEAQEEEG
QVEAVMDAALRPKRSLWRKMVMGGLALFGASVVGQGIQWTMNAWQTQDWV
ALGGCAAGALIIGAGVGSVVTEWRRLWRLRQRAHERDEARDLLHSHGTGK
GRAFCEKLAQQAGIDQSHPALQRWYASIHETQNDREVVSLYAHLVQPVLD
AQARREISRSAAESTLMIAVSPLALVDMAFIAWRNLRLINRIATLYGIEL
GYYSRLRLFKLVLLNIAFAGASELVREVGMDWMSQDLAARLSTRAAQGIG
AGLLTVRLGIKAMELCRPLPWIDDDKPRLGDFRRQLIGQVKETLQKGKTP
SEK
>S1456 ydbL, hypothetical protein
MMKKTLLLCAFLVGLVSSNVMALTLDEARTQGRVGETFYGYLVALKTDAE
TEKLVTEINAERKASYQQRAKQNNVSVDDIAKLAGQKLVARAKPGEYVQG
INGKWVRKF
>S1490 ydcH, hypothetical protein
MSLFDKHNKLDHEIARKEGSDGRGYNAEVVRMKKQKLQLKDEMLKILQQE
SVKEV
>S1768 ydgA, hypothetical protein
MNKSLVAVGVIVALGVVWTGGAWYTGKKIETHLEDMVAQANAQLKLTAPE
SNLEVSYQNYHRGVFSSQLQLLVKPIAGKENPWIKSGQSVIFNESVDHGP
FPLAQLKKLNLIPSMASIQTTLVNNEVSKPLFDMAKGETPFEINSRIGYS
GDSSSDISLNPLNYEQKDEKVAFSGGEFQLNADRDGKAISLSGEAQSGRI
DAVNEYNQKVQLTFNNLKTDGSSTLGSFGERVGNQKLSLEKMTISVEGKE
LALLEGMEISGKSDLVNDGKTINSQLDYSLNSLKVQNQDLGSGKLTLKVG
QIDGEAWHQFSQQYNAQTQALLAQPEIANNPELYQEKVTEAFFSALPLML
KGDPVITIAPLSWKNSQGESALNLSLFLKDPATTKEAPQTLAQEVDRSVK
SLDAKLTIPVDMATELMTQVAKLEGYQEDQAKKLAKQQVEGASAMGQMFR
LTTLQDNTITTSLQYANGQITLNGQKMSLEDFVGMFADASS
>S1645 ydiA, hypothetical protein
MDNAVDRHVFYISDGTAITAEVLGHAVMSQFPVTISSITLPFVENESRAR
AVKDQIDAIYHQTGVRPLVFYSIVLPEIRAIILQSEGFCQDIVQALVAPL
QQEMKLDPTPIAHRTHGLNPNNLNKYDARIAAIDYTLAHDDGISLRNLDQ
AQVILLGVSRCGKTPTSLYLAMQFGIRAANYPFIADDMDNLVLPASLKPL
QHKLFGLTIDPERLAAIREERRENSRYASLRQCRMEVAEVEALYRKNQIP
WINSTNYSVEEIATKILDIMGLSRRMY
>S1846 ydiC, hypothetical protein
MDMHSGTFNPQDFAWQGLTLTPAAAIHIRELVAKQPGMVGVRLGVKQTGC
AGFGYVLDSVSEPDKDDLLFEHDGAKLFVPLQSMPFIDGTEVDFVREGLN
QIFKFHNPKAQNECGCGESFRV
>S1612 ydjC, hypothetical protein
MERLLIVNADDFGLSKGQNYGIIEACRNGIVTSTTALVNGQAIDHAVQLS
RDEPSLAIGMHFVLTMGKPLTAMPGLTRDGVLGKWIWQLAEEDALPLEEI
TQELASQYLRFIELFGRKPTHLDSHHHVHMFPQIFPIVARFAAEEGIALR
IDRQPLSNAGDLPANLRSSQGFSSAFYGEEISEALFLQVLDDASHRGDPS
LEVMCHPAFIDNTIRQSAYCFPRLTELEVLTSASLKYAIAERGYRLGSYL
DV
>S1593 ydjX, hypothetical protein
MNAERNFLFACLIFALVIYAIHAFGLFDLLTDLPHLQTLIRQSGLFGYSL
YILLFINAPLFLLPGSILVIAGGIVFGPLLGTQLSLIAATLASSCSFLLA
RWLGRDLLLKYVGHSHTFQAIEKGIARNGIDFLILTRLIPLFPYNIQNYA
YGFTTIAFWPYTLISALTTLPGIVIYTVMASDLANEGITLRFILQLCLAG
LALFILIQLAKLYARHKHVDLSASRRSPLTHPKNEG
>S1591 ydjZ, hypothetical protein
MKIQSRKIWYYRITLIILLFAMLLAWALLPGVHEFINRSVAAFAAVDQQG
IERFIQSYGALAAVVSFLLMILQAIAAPLPAFLITFANASLFGAFWGGLL
SWTSSMAGAALCFFIARVMGREVVEKLTGKTVLDSMDGFFTRYGKHTILV
CRLLPFVPFDPISYAAGLTSIRFRSFFIATGLGQLPATIVYSWAGSMLTG
GTFWFVTELFILFALTVVIFMAKKIWLERQKRND
>S1552 yeaK, hypothetical protein
MTEMAKGSVTHQQLIALLSQEGANFRVVTHEAVGKCEAVSEIRGTALGQG
AKALVCKVKGNGVNQHVLAILAADQQADLSQLASHIGGLRASLASPAEVD
ELTGCVFGAIPPFSFHPKLKLVADPLLFERFDEIAFNAGMLDKSVILKTA
DYLRIAQPELVNFRRTA
>S1550 yeaL, hypothetical protein
MFDVTLLILLGLAALGFISHNTTVAVSILVLIIVRVTPLSTFFPWIEKQG
LSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGG
RGVTLMGSQPQLVAGLLVGTVLGVALFRGVPVGPLIAAGLVSLIVGKQ
>S1547 yeaO, hypothetical protein
MKGRTNTMNIQCKRVYDPAEQSDGYRVLVDRLWPRGIKKTDLALDEWDKE
ITPSTELRKAFHGEVVDFATFREQYRAELAQHEQEGKRLADIAKKQPLTL
LYAAKNTTQNHALVLADWLRSL
>S1940 yebC, hypothetical protein
MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLR
AAVDKALSNNMTRDTLNRAIARGVGGDDDANMETIIYEGYGPGGTAIMIE
CLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKGDEDTI
MEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSM
IPSTKADMDAETAPKLMRLIDMLEDCDDVQEVYHNGEISDEVAATL
>S1922 yebE, hypothetical protein
MANWLNQLQSLLGQSSSSTSSSADQGLGKLLVSGALGGLAGLLVANKSAR
KLLTKYGTNALLVGGGAVAGTVLWNKYKDKIRAAHQDEPQFGAQSTPLDE
RTERLILALVFAAKSDGHIDANERAAIDQQLREAGVEEQGRVLIEQAIEQ
PLDPQRLATGVRNEEEALEIYFLSCAAIDIDHFMERSYLNALGDALKIPQ
DVREGIERDLEQQKRTLAE
>S2006 yecM, hypothetical protein
MANWQSIDELQDIASDLPRFTHALDELSRRLGLDITPLTADHISLRCHQN
VTAERWRRGFEQCGELLSEKMINGRPICLFKLHEPVQVAHWQFSNVELPW
PGEKRYPHEGWEHIEIVLPGDPETLNARALALLSDEGLSLPGISVKTSSP
KGEHERLPNPTLAVTDGKTTIKFHPWSIEEIVASEQSA
>S2097 yedI, hypothetical protein
MLLAGSSLLTLLDDIATLLDDISVMGKLAAKKTAGVLGDDLSLNAQQVSG
VRANRELPVVWGVAKGSLINKVILVPLALIISAFIPWAITPLLMIGGAFL
CFEGVEKVLHMLEARKHKEDPAQSQQRLEKLAAQDPLKFEKDKIKGAIRT
DFILSAEIVAITLGIVAEAPLLNQVLVLSGIALVVTVGVYGLVGVIVKID
DLGYWLAEKSSALMQALGKGLLIIAPWLMKALSIVGTLAMFLVGGGIVVH
GIAPLHHAIEHIAGQQSAVVAMILPTVLNLILGFIIGGIVVLGVKAVAKM
RGQAH
>S2177 yeeX, putative alpha helix protein
MLALTNSGCLNESDSHIIRGIKMETTKPSFQDVLEFVRLFRRKNKLQREI
QDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIIASMKGDYEDRVDD
YIIKNAELSKERRDISKKLKAMGEMKNGEAK
>S2319 yehR, hypothetical protein
MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGD
KVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKSTYTDT
YAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFK
EVK
>S2366 yeiB, hypothetical protein
MERNVTLDFVRGVAILGILLLNISAFGLPKAAYLNPAWYDAITPQDAWTW
AFLDLIGQVKFLTLFALLFGAGLQMLLPRGRRWIQSRLTLLVLLGFIHGL
LFWDGDILLAYGLVGLGVLLLLGLISDSQTSRAWTPDASAILYEKYWKLH
GGVEAISNRADGVGNSLLALGAQYGWQLAGMMLIGAALMRSGWLKGQFSL
RHYRRTGFVLVAIGVTINLPAIALQWQLDWAYRWCAFLLQMPRELSAPFQ
AIGYASLFYGFWPQLSRFKLVLAIACVGRMALTNYLLQTLICTTLFYHLG
LFMHFDRLELLAFVIPVWLANILFSVIWLRYFRQGPVEWLWRQLTLRAAG
PAISKTSR
>S2372 yeiH, hypothetical protein
MTNITLQKQHRTLWHFIPGLALSAVITGVALWGGSIPAVAGAGFSALTLA
ILLGMVLGNTIYPHIWKSCDGGVLFAKQYLLRLGIILYGFRLTFSQIADV
GISGIIIDVLTLSSTFLLACFLGQKVFGLDKHTSWLIGAGSSICGAAAVL
ATEPVVKAEASKVTVAVATVVIFGTVAIFLYPAIYPLMSQWFSPETFGIY
IGSTVHEVAQVVAAGHAISPDAENAAVISKMLRVMMLAPFLILLAARVKQ
LSGANSGEKSKITIPWFAILFIVVAIFNSFHLLPQSVVNMLVTLDTFLLA
MAMAALGLTTHVSALKKAGAKPLLMALVLFAWLIVGGGAINYVIQSVIA
>S2403 yejL, hypothetical protein
MPQISRYSNEQVEQLLAELLNVLEKHKAPTDLSLMVLGNMVTNLINTSIA
PAQRQAIANSFASALQSSINEDKAH
>S2459 yfaD, hypothetical protein
MTESTTSSPHDAVFKTFIFTPETARDFLEIHLPEPLRKLCNLQTLRLEPT
SFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRPMRYAT
AAMQSHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLY
TEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDHDLIGMVDRITTLLV
RGFTNDSQLQTLFNYLLQCGDTSRFTRFIQEIAERSPLQKERLMTIAERL
RQEGHQIGWQEGKLVGLQQGKLEGLQEGMHEQAIKIALRMLEQGIDRDQV
LAATQLSEADLAANNH
>S2509 yfcC, putative S-transferase
MQGNICAMSAITESKPTRRWAMPDTLVIIFFVAILTSLATWVVPVGMFDS
QEVQYQVDGQTKTRKVVDPHSFRILTNEAGEPEYHRVQLFTTGDERPGLM
NFPFEGLTSGSKYGTAVGIIMFMLVIGGAFGIVMRTGTIDNGILALIRHT
RGNEILFIPALFILFSLGGAVFGMGEEAVAFAIIIAPLMVRLGYDSITTV
LVTYIATQIGFASSWMNPFCVVVAQGIAGVPVLSGSGLRIVVWVIATLIG
LIFTMVYASRVKKNPLLSRVHESDRFFREKQADVEQRPFTFGDWLVLIVL
TAVMVWVIWGVIVNAWFIPEIASQFFTMGLVIGIIGVVFRLNGMTVNTMA
SSFTEGARMMIAPALLVGFAKGILLLVGNGEAGNASVLNTILNSIANAIS
GLDNAVAAWFMLLFQAVFNFFVTSGSGQAALTMPLLAPLGDLVGVNRQVT
VLAFQFGDGFSHIIYPTSASLMATLGVCRVDFRNWLKVGATLLGLLFIMS
SVVVIGAQLMGYH
>S2604 yfeD, hypothetical protein
MTTEELAECLGVAKQTVNRWTREKGWKTEKFPGVKGGRARLILVDTQVCE
FIQNTPAFHNTPMLMEAEERIAEYAPGARAPAYRQIINAIDNMTDIEQEK
VAQFLSREGIRNFLARLDIDESA
>S2734 yfgA, putative membrane protein
MNTEATHDQNEALTTGARLRNAREQLGLSQQAVAERLCLKVSTVRDIEED
KAPADLASTFLRGYIRSYARLVHIPEEELLPGLEKQAPLRAAKVAPMQSF
SLGKRRKKRDGWLMTFTWLVLFVVIGLSGAWWWQDHKAQQEEITTMADQS
SAELSSNSEQGQSVPLNTSTTTDPATTSTPPASVDTTATNTQTPAVTAPA
PAVDPQQNAVVSPSQANVDTAATPVPTAATTPDGAAPLPTDQAGVTTPAA
DPNALVMNFTADCWLEVTDATGKKLFSGMQRKDGNLNLTGQAPYKLKIGA
PAAVQIQYQGKPVDLSRFIRTNQVARLTLNAEQSPAQ
>S2747 yfhF, putative regulator
MSITLSDSAAARVNTFLANRGKGFGLRLGVRTSGCSGMAYVLEFVDEPTP
EDIVFEDKGVKVVVDGKSLQFLDGTQLDFVKEGLNEGFKFTNPNVKDECG
CGESFHV
>S2743 yfhJ, hypothetical protein
MGLKWTDSREIGEALYDAYPDLDPKTVRFTDMHQWICDLEDFDDDPQASN
EKILEAILLVWLDEAE
>S2830 yfiH, hypothetical protein
MSKLIVPQWPLPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEEN
RKRLFAAGNLPSKPVWLEQVHGKDVLKLTGEPYASKRADASYSNTPGTVC
AVMTADCLPVLFCNRAGTEVAAVHAGWRGLCAGVLEETVSCFADKPENIL
AWLGPAIGPRAFEVGAEVREAFMAVDAKASAAFIQHGDKYLADIYQLARQ
RLANVGVEQIFGGDRCTYTENETFFSYRRDKTTGRMASFIWLI
>S2818 yfiP, hypothetical protein
MAFFRRSLMTENAVLQLRAERIARATRPFLARGNRVRRCQRCLLPEKLCL
CSTITPAQAKSRFCLLMFDTEPMKPSNTGRLIADILPDTVAFQWSRTEPS
QDLLDLVQNPDYQPMVVFPASYADEQREVIFTPPAGKPPLFIMLDGTWPE
ARKMFRKSPYLDNLPVISVDLSRLSAYRLREAQAAGQYCTAEVAIALLDM
AGDTGAAAGLGEHFTRFKTRYLAGKTQHLGSITAEQLESV
>S2877 ygaU, hypothetical protein
MGLFNFVKDAGEKLWDAVTGQHDKDDQAKKVQEHLSKTGIPDADKVNIQI
ADGKATVTGDGLSQEAKEKILVAVGNISGIASVDDQVKTATPATASQFYT
VKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPDKIYPGQVLRIPEE
>S2961 ygbO, putative hydrogenase subunit
MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILK
NGCNTRFVADALAKFLKIHSREVSFAGQKDKHAVTEQWLCARVPGKEMPD
LSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLID
ICVKGVPNYFGAQRFGIGGSNLQGALRWAQTNTPVRDRNKRSFWLSAARS
ALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVND
KELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRA
MLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE
>S3016 ygdD, hypothetical protein
MTSRFMLIFAAISGFIFVALGAFGAHVLSKTMGAVEMGWIQTGLEYQAFH
TLAILGLAVAMQRRISIWFYWSSVFLALGTVLFSGSLYCLALSHLRLWAF
VTPVGGVSFLAGWALMLVGAIRLKRKGVSHE
>S3094 ygfB, hypothetical protein
MSIQNEMPGYNEMNQYLNQQGTGLTPAEMHGLISGMICGGNDDSSWLPLL
HDLTNEGMAFGHELAQALRKMHSATSDALQDDGFLFQLYLPDGDDVSVFD
RADALAGWVNHFLLGLGVTQPKLDKVTGETGEAIDDLRNIAQLGYDEDED
QEELEMSLEEIIEYVRVAALLCHDTFTHPQPTAPEVQKPTLH
>S3095 ygfE, hypothetical protein
MSAQPVDIQIFGRSLRVNCPPDQRDALNQAADDLNQRLQDLKERTRVTNT
EQLVFIAALNISYELAQEKAKTRDYAASMEQRIRMLQQTIEQALLEQGRI
TEKTNQNFE
>S3082 ygfY, hypothetical protein
MDINNKARIHWACRRGMRELDISIMPFFEHEYDSLSDDEKRIFIRLLECD
DPDLFNWLMNHGKPADAELEMMVRLIQTRNRERGPVAI
>S3107 yggE, putative actin
MKFKVIALAALMGISGMAAQANELPDGPHIVTSGTASVDAVPDIATLAIE
VNVAAKDAATAKKQADERVAQYISFLELNQIAKKDISSANLRTQPDYDYQ
DGKSILKGYRAVRTVEVTLRQLDKLNSLLDGALKAGLNEIRSVSLGVAQP
DAYKDKARKAAIDNAIHQAQELANGFHRKLGPVYSVRYHVSNYQPSPMVR
MMKADAAPVSAQETYEQAAIQFDDQVDVVFQLEPVDQQPAKTPAAQ
>S3141 yggJ, hypothetical protein
MRIPRIYHPEPLTSHSHIALCEDAANHIGRVLRMGPGQALQLFDGSNQVF
DAEITSASKKSVEVKVLEGQIDDRESPLHIHLGQVMSRGEKMEFTIQKSI
ELGVSLITPLFSERCGVKLDSERLNKKLQQWQKIAIAACEQCGRNRVPEI
RPAMDLEAWCAEQDEGLKLNLHPRASNSINTLPLPVERVRLLIGPEGGLS
ADEIAMTARYQFTDILLGPRVLRTETTALTAITALQVRFGDLG
>S3159 yggL, hypothetical protein
MGLNVREGEIMAKNRSRRLRKKMHIDEFQELGFSVAWRFPEGTSEEQIDK
IVDDFINEVIEPNKLAFDGSGYLAWEGLICMQEIGKCTEEHQAIVRKWLE
ECKLDEVRTSELFDVWWD
>S3147 yggT, putative resistance protein
MNTLTFLLSTVIELYTMVLLLRIWMQCAHCDFYTPFSQFVVKVAQPIIGP
LRRVIPAMGPIDSASLLVAYILSFIKAIVLFEVVTFLPIIWIAGLLILLK
TIGLLIFWVLLVMAIMSWVSQGRSPIEYVLIQLADPLLRPIRRLLPAMGG
IDFSPMILVLLLYAINMGVAEVLQATGNMLLPGLWMAL
>S3148 yggU, hypothetical protein
MDGVMSAVTVNDDGLVLRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQ
ANSHLVKFLGKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEIAALIN
>S3257 yghB, hypothetical protein
MAVIQDIIAALWQHDFAALADPHIVSVVYFVMFATLFLENGLLPASFLPG
DSLLILAGALIAQGVMDFLPTIAILTAAASLGCWLSYIQGSWLGNTKTVK
GWLAQLPAKYHQRATCMFDRHGLLALLAGRFLAFVRTLLPTMAGISGLPN
RRFQFFNWLSGLLWVSVVTSFGYALSMIPFVKRHEDQVMTFLMILPIALL
TAGLLGTLFVVIKKKYCNA
>S3282 ygiB, hypothetical protein
MGIYHWSRKTKMKRTKSIRHASFRKNWSARHLTPVALAVATVFMLAGCEK
SDETVSLYQNADDCSAANPGKSAECTTAYNNALKEAERTAPKYATREDCV
AEFGEGQCQQAPAQAGMAPENQAQAQQSSGSFWMPLMAGYMMGRLMGGGA
GFAQQPLFSSKNPASPAYGKYTDATGKNYGAAQPGRTMTVPKTAMAPKPA
TTTTVTRGGFGESVAKQSTMQRSATGTSSRSMGG
>S3284 ygiD, hypothetical protein
MTPLVKDIIMSSTRMPALFLGHGSPMNVLEDNLYTRSWQKLGMTLPRPQA
IVVVSAHWFTRGTGVTAMETPPTIHDFGGFPQALYDTHYPAPGSPALAQR
LVELLAPIPVTLDKEAWGFDHGSWGVLIKMYPDADIPMVQLSIDSSKPAA
WHFEMGRKLAALRDEGIMLVASGNVVHNLRTVKWHGDSSPYPWATSFNEY
VKANLTWQGPVEQHPLVNYLDHEGGALSNPTPEHYLPLLYVLGVWDGQEP
ITIPVDGIEMGSLSMLSVQIG
>S3300 ygiF, hypothetical protein
MAQEIELKFIVNHSAVEALRDHLNTLGGEHHDPVQLLNIYYETPDNWLRR
HDMGLRIRGENGRYEMTMKVAGRVTGGLHQRPEYNVALSEPTLDLAQLPT
EIWPNGELPADLASRVQSLFSTDFYREKWLVEIDGSRIEIALDQGEVKAG
EFAEPICELELELLSGDTRAVLKLANQLVSQTGLRQGSLSKAARGYHLAQ
GNPAREIKPTTILHVAAKADVEQGLEAALELVLAQWQYHEELWVRGNDAA
KEQVLAAISLVRHTLMLFGGIVPRKASTHLRDLLTQCEATIASAVSAVTA
VYSTETAMAKLALTEWLVSKAWQPFLDAKAQGKISDSFKRFADIHLSRHA
AELKSVFCQPLGDRYRDQLPRLMRDIDSILLLAGYYDPVVAQAWLENWQG
LRHAIATGQRIEIEHFRNEANNQEPFWLHSGKR
>S3305 ygiH, hypothetical protein
MSAIAPGMILIAYLCGSISSAILVCRLCGLPDPRTSGSGNPGATNVLRIG
GKGAAVAVLIFDVLKGMLPVWGAYELGVSPFWLGLIAIAACLGHIWPVFF
GFKGGKGVATAFGAIAPISWDLTGVMAGTWLLTVLLSGYSSLGAIVSALI
APFYVWWFKPQFTFPVSMLSCLILLRHHDNIQRLWRRQETKIWTKFKRKR
EKDPE
>S3274 ygiN, hypothetical protein
MLTVIAEIRTRPGQHHRQAVLDQFAKIVPTVLKEEGCHGYAPMVDCAAGV
SFQSMAPDSIVMIEQWESIAHLEAHLQTPHMKAYSEAVKGDVLEMNIRIL
QPGI
>S3269 ygiW, hypothetical protein
MKKFAAVIAVMALCSAPVMAAEQGGFSGPSVTQSQAGGFQGPNGSVTTVE
SAKSLRDDTWVTLRGNIVERISDDLYVFKDASGTINVDIDHKRWNGVTVT
PKDTVEIQGEVDKDWNSVEIDVKQIRKVNP
>S3330 ygjN, hypothetical protein
MHLITQKALKDAAEKYPQHKTELVALGNTIAKGYFKKPESLKAVFPSLDN
FKYLDKHYVFNVGGNELRVVAMVFFESQKCYIREVMTHKEYDFFTAVHRT
KGKK
>S3333 ygjQ, hypothetical protein
MLRAFARLLLRICFSRRTLKIACLLLLVAGATIFIADRVMVNASKQLTWG
DVNAVPARNVGLLLGARPGNRYFTRRIDTAAALYHAGKVKWLLVSGDNGR
KNYDEASGMQQALIAKGVPAKVIFCDYAGFSTLDSVVRAKKVFGENHITI
ISQEFHNQRTIWLAKQYGIDAIGFNAPDLNMKHGFYTQLREKLARVSAVI
DAKILHRQPKYLGPSVMIGPFSEHGCPAKE
>S3354 yhaH, putative cytochrome
MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAG
GEGILTIIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIV
FNCQAGTPGENRFGPDPKLEQE
>S3359 yhaM, hypothetical protein
MIRTSAASDARMGGATLPAMSNSGSGNQGITAIMPVVVVAEHFGADDERL
ARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETIS
MAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNE
GIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR
>S3360 yhaN, hypothetical protein
MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVER
VEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDAT
AQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHT
NIVHIETHNGVVFTHQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAA
IRFILDSAKLNCALSQEGLSGKWGLAYWRDAGKTVRARFAGERSLFIHCD
SYQRGIRCAYGRRYASGYE
>S3428 yhbC, hypothetical protein
MGLSTLEQKLTEMITAPVEALGFELVGIEFIRGRTSTLRIYIDSEDGINV
DDCADVSHQVSAVLDVEDPITVAYNLEVSSPGLDRPLFTAEHYARFVGEE
VTLVLRMAVQNRRKWQGVIKAVDGEMITVTVEGKDEVFALSNIQKANLVP
HF
>S3458 yhbN, hypothetical protein
MKFKTNKLSLNLVLASSLLAASIPAFAVTGDTDQPIHIESDQQSLDMQGN
VVTFTGNVIVTQGTIKINADKVVVTRPGGEQGKEVIDGYGKPATFYQMQD
NGKPVEGHASQMHYELAKDFVVLTGNAYLQQVDSNIKGDKITYLVKEQKM
QAFSDKGKRVTTVLVPSQLQDKNNKGQTPAQKKGN
>S3412 yhbP, hypothetical protein
METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEK
TRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKA
YNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA
>S3488 yhcB, hypothetical protein
MTWEYALIGLVVGIIIGAVAMRFGNRKLRQQQALQYELEKNKAELDEYRE
ELVSHFARSAELLDTMAHDYRQLYQHMAKSSSSLLPELSAEANPFRNRLA
ESEASNDQAPVQVPRDYSEGASGLLRTGAKRD
>S3495 yhcP, hypothetical protein
MGIFSIANQHIRFAVKLATAIVLALFVGFHFQLETPRWAVLTAAIVAAGP
AFAAGGEPYSGAIRYRGFLRIIGTFIGCIAGLVIIIAMIRAPLLMILVCC
IWAGFCTWISSLVRIENSYAWGLAGYTALIIVITIQPEPLLTPQFAVERC
SEIVIGIVCAIMADLLFSPRSIKQEVDRELESLLVAQYQLMQLCIKHGDG
EVVDKAWGDLVRRTTALQGMRSNLNMESSRWARANRRLKAINTLSLTLIT
QSCETYLIQNTRPELITDTFREFFDTPVETAQDVHKQLKRLRRVIAWTGE
RETPVTIYSWVAAATRYQLLKRGVISNTKINATEEEILQGEPEVKVESAE
RHHAMVNFWRTTLSCILGTLFWLWTGWTSGSGAMVMIAVVTSLAMRLPNP
RMVAIDFIYGTLAALPLGLLYFLVIIPNTQQSMLLLCISLAVLGFFLGIE
VQKRRLGSMGALASTINIIVLDNPMTFHFSQFLDSALGQIVGCVLAFTVI
LLVRDKSRDRTGRVLLNQFVSAAVSAMTTNVARRKENHLPALYQQLFLLM
NKFPGDLPKFRLALTMIIAHQRLRDAPIPVNEDLSAFHRQMRRTADHVIS
ARSDDKRRRYFGQLLEELEIYQEKLRIWQAPPQVTEPVHRLAGMLHKYQH
ALTDS
>S3500 yhdP, hypothetical protein
MDNLTAHITRENPGWQFSIPDTRITMDGKPWPSGALTLAWIPEQDVGGKD
NKRSDELRIRASNLELAGLEGIRPLAAKLSPALGDVWRSTQPSGKINTLA
LDIPLQAADKTRFQASWSDLAWKQWKLLPGAEHFSGTLSGSVENGLLTAS
MKQAKMPYETVFRAPLEIADGQATISWLNNDKGFQLDGRNIDVKAKAVHA
RGGFRYLQPANDEPWLGILAGISTDDGSQAWRYFPENLMGKDLVDYLSGA
IQGGEADNATLVYGGNPQLFPYKHNEGQFEVLVPLRNAKFAFQPDWPALT
NLDIELDFINDGLWMKTDGVNLGGVRASNLTAVIPDYSKEKLLIDADIKG
PGKAVGPYFDETPLKDSLGATLQELQLDGDVNARLHLDIPLNGELVTAKG
EVTLRNNSLFIKPLDSILKNLSGKFSFINGDLQSEPLTASWFNQPLNVDF
STKEGAKAYQVAVNLNGNWQPAKTGVLPEAVNEALSGSVAWDGKVGIDLP
YHAGATYNVELNGDLNNVSSHLPSPLAKPAGEPLAVNVKVDGNLNSFELT
GQAGADNHFNSRWLLGQKLTLDRAIWAADSKTLPPLPEQSGVELNMPPMN
GAEWLALFQKGAAESVGGAASFPQHITLRTPMLSLGNQQWNNLSIVSQPT
ANGTLVEAQGREINATLAMRNNAPWLANIKYLYYNPSVAKTRGDSTPSSP
FPTTERINFRGWPDAQIRCTECWFWGQKFGRIDSDITISGDTLTLTNGLI
DTGFSRLTADGEWVNNPGNERTSLKGKLRGQKIDAAAEFFGVTTPIRQSS
FNVDYDLHWRKAPWQPDEATLNGIIHTQLGKGEITEINTGHAGQLLRLLS
VDALMRKLRFDFRDTFGEGFYFDSIRSTAWIKDGVMHTDDTLVDGLEADI
AMKGSVNLVRRDLNMEAVVAPEISATVGVAAAFAVNPIVGAAVFAASKVL
GPLWSKVSILRYHISGPLDDPQINEVLRQPRKEKAQ
>S3512 yhdT, hypothetical protein
MDTRFVQAHKEARWALGLTLLYLAVWLVDAYLPGVAPGFTGFPRWFEMAC
ILTPLLFIGLCWAMVKFIYRDIPLEDDDAA
>S4398 yheO, hypothetical protein
MSRSLLTNETSELDLLDQRPFDQTDFDILKSYEAVVDGLAMLIGSHCEIV
LHSLQDLKCSAIRIANGEHTGRKIGSPITDLALRMLHDMTGADSSVSKCY
FTRAKSGVLMKSLTIAIRNREQRVIGLLCINMNLDVPFSQIMSTFVPPET
PDVGSSVNFASSVEDLVTQTLEFTIEEVNADRNVSNNAKNRQIVLNLYEK
GIFDIKDAINQVADRLNISKHTVYLYIRQFKSGDFQGQDK
>S4390 yheU, hypothetical protein
MLIPWQDLSPETLENLIESFVLREGTDYGEHERTLEQKVADVKRQLQCGE
AVLVWSELHETVNIMPRSQFRE
>S4386 yhfK, hypothetical protein
MWRRLIYHPDINYALRQTLVLCLPVAVGLMLGELRFGLLFSLVPACCNIA
GLDTPHKRFFKHLIIGASLFATCSLLTQLLLVKDVPLPFLLTGLTLVLGV
TAELGPLHAKLLPASLLAAIFTLSLAGYMPVWEPLLIYALGTLWYGLFNW
FWFWIWREQPLRESLSLLYRELADYCEAKYSLLTQHTDPEKALPPLLVRQ
QKAVDLITQCYQQMHMLSAQNNTDYKRMLRIFQEALDLQEHISVSLHQPE
EVQKLVERSHAEEVIRWNAQTVAARLRVLADDILYHRLPTRFTMEKQIGA
LEKIARQHPDNPVGQFCYWHFSRIARVLRTQKPLYARDLLADKQRRMPLL
PALKSYLSLKSPALRNAGRLSVMLSVASLMGTVLHLPKSYWILMTVLLVT
QNGYGATRLRIVNRSVGTVVGLIIAGVALHFKIPESYTLTLMLITTLASY
LILRKNYGWATVGFTITAVYTLQLLWLNGEQYILPRLIDTIIGCLIAFGG
TVWLWPQWQSGLLRKNAHDALEAYQEAIRLILSEDPQPTPLAWQRMRVNQ
AHNTLYNSLNQAMQEPAFNSHYLADMKLWVTHSQFIVEHINAMTTLAREH
RALPPELAQEYLQSCEIAIQRCQQRLEYDEPGSSGDANIMDAPEMQPHEG
AAGTLEQHLQRVIGHLNTMHTISSMAWRQRPHHGIWLSRKLRDSKA
>S4279 yhhL, hypothetical protein
MLINIGRLLMLCVWGFLILNLVHPFPRPLNIFVNVALIFTVLMHGMQLAL
LKSTLPKDGPQMTTAEKVRIFLFGVFELLVWQKKFKVKK
>S4277 yhhN, putative enzyme
MLWSFIAVCLSAWLSVDASYRGPTWQRWVFKPLTLLLLLLLAWQAPMFDA
ISYLVLAGLCASLLGDALTLLPRQRLMYAIGAFFLSHLLYTIYFASQMTL
SFFWPLPLVLLVLGALLLAIIWTRLEEYRWPICTFIGMTLVMVWLAGELW
FFRPTAPALSAFVGASLLFISNFVWLGSHYRRRFRADNAIAAACYFAGHF
LIVRSLYL
>S4274 yhhQ, hypothetical protein
MNVFSQTQRYKALFWLSLFHLLVITSSNYLVQLPVSIFGFHTTWGAFSFP
FIFLATDLTVRIFGAPLARRIIFAVMIPALLISYVISSLFYMGSWQGFGA
LAHFNLFVARIATASFMAYALGQILDVHVFNRLRQSRRWWLAPTASTLFG
NVSDTLAFFFIAFWRSPDAFMAEHWMEIALVDYCFKVLISIVFFLPMYGV
LLNMLLKRLADKSEINALQAS
>S4213 yhjD, hypothetical protein
MTQENEIKRPTQDLEHEPIKQLDNSEKGGKVSQALETVTTTAEKVQRQPV
IAHLIRATERFNDRLGNQFGAAITYFSFLSMIPILMVSFAAGGFVLASHP
MLLQDIFDKILQNISDPTLAATLKNTINTAVQQRTTVGLVGLAVALYSGI
NWMGNLREAIRAQSRDVWERSPQDQEKFWVKYLRDFISLIGLLIALIVTL
SITSVAGSAQQMIISALHLNSIEWLKPTWRLIGLAISIFANYLLFFWIFW
RLPRHRPRKKALIRGTFLAAIGFEVIKIVMTYTLPSLMKSPSGAAFGSVL
GLMAFFYFFARLTLFCAAWIATAEYKDDPRMPGKTQP
>S4163 yiaA, hypothetical protein
MDNKISTYSPAFSIVSWIALVGGIVTYLLGLWNAEMQLNEKGYYFAVLVL
GLFSAASYQKTVRDKYEGIPTTSIYYMTCLTVFIISVALLMVGLWNATLL
LSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKETQVTQEEYSE
>S4162 yiaB, hypothetical protein
MKTSKTVAKLLFVVGALVYLVGLWISCPLLSGKGYFLGVLMTATFGNYAY
LRAEKLGQLDNFFTHICQLVALITIGLLFIGVLNAPINAYEMVIYPIAFF
VCLFGQMRLFRSV
>S4164 yiaH, hypothetical protein
MQPKIYWIDNLRGIACLMVVMIHTTTWYVTNAHSVSPVTWDIANVLNSAS
RVSVPLFFMISGYLFFGERSAQPRHFLRIGLCLIFYSAIALLYIALFTSI
NMELALKNLLQKPVFYHLWFFFAIAVIYLVSPLIQVKNVGGKMLLVLMVV
IGIIANPNTVPQKIDGFEWLPINLYINGDTFYYILYGMLGRAIGMMDTQH
KALSWVSAALFATGVFIISRGTLYELQWRGNFADTWYLYCGPMVFICAIA
LLTLVKNTLDTRTIRGLGLISRHSLGIYGFHALIIHALRTRGIELKNWPI
LDIIWIFCATLAASLLLSMLVQRIDRNRLVS
>S4115 yibQ, hypothetical protein
MLAMPSAISVAVLPDSPHAREMATKAHNSGHEVLIHLPMAPLSKQPLEKN
TLRPEMSSDEIERIIRSAVNNVPYAVGINNHMGSKMTSNLFGMQKVMQAL
ERYNLYFLDSVTIGNTQAMRAAQGTGVKVIKRKVFLDDSQNEADIRVQFN
RAIDLARRNGSTIAIGHPHPSTVRVLQQMVYNLPPDITLVKASSLLNEPQ
VDTSTPPKNAVPDAPRNPFRGVKLCKPKKPIEPVYANRFFEVLSESISQS
TLIVYFQHQWQGWGKQPEAAKFNASAN
>S4085 yicC, putative alpha helix protein
MIRSMTAYARREIKGEWGSATWEMRSVNQRYLETYFRLPEQFRSLEPVVR
ERIRSRLTRGKVECTLRYEPDVSAQGELILNEKLAKQLVTAANWVKMKSD
EGEINPVDILRWPGVMAAQEQDLDAIAAEILAALDGTLDDFIVARETEGQ
ALKALIEQRLEGVTAEVVKVRAHMPEILQWQRERLVAKLEDAQVQLENNR
LEQELVLLAQRIDVAEELDRLEAHVKETYNILKKKEAVGRRLDFMMQEFN
RESNTLASKSINAEVTNSAIELKVLIEQMREQIQNIE
>S4083 yicG, hypothetical protein
MTMRTNRYPARQIFRERTMLLHILYLVGITAEAMTGALAAGRRRMDTFGV
IIIATATAIGGGSVRDILLGHYPLGWVKHPEYVIIVATAAVLTTIVAPVM
PYLRKVFLVLDALGLVVFSIIGAQVALDMGHGPIIAVVAAVTTGVFGGVL
RDMFCKRIPLVFQKELYAGVSFASAVLYIALQHYVSNHDVVIISTLVFGF
FARLLALRLKLGLPVFYYSHEGH
>S4005 yidB, hypothetical protein
MGLFDEVVGAFLKGDAGKYQAILSWVEEQGGIQVLLEKLQSGGLGAILST
WLSNQQRNQSVSGEQLESALGTNAVSDLGQKLGVDTSTASSLLAEQLPKI
IDALSPQGEVSAQANNDLLSAGMELLKGKLFR
>S3983 yidH, hypothetical protein
MKISRLGEAPDYRFSLANERTFLAWIRTALGFLAAGVGLDQLAPDFATPV
IRELLALLLCLFSGGLAMYGYLRWLRNEKAMRLKEDLPYTNSLLIISLIL
MVVAVIVMGLVLYAG
>S3920 yifE, hypothetical protein
MAESFTTTNRYFDNKHYPRGFSRHGDFTIKEAQLLERHGYAFNELDLGKR
EPVTEEEKLFVAVCRGEREPVTEAERVWSKYMTRIKRPKRFHTLSGGKPQ
VEGAEDYTDSDD
>S3868 yigA, hypothetical protein
MKQPGEELQETLTELDDRAVVDYLIKNPEFFIRNARAVEAIRVPHPVRGT
VSLVEWHMARARNHIHVLEENMALLMEQAIANEGLFYRLLYLQRSLTAAS
SLDDMLMRFHRWARDLGLAGASLRLFPDRWRLGAPSNHTHLALSRQSFEP
LRIQRLGQEQHYLGPLNGPELLVVLPEAKAVGSVAMSMLGSDADLGVVLF
TSRDASHYQQGQGTQLLHEIALMLPELLERWIERV
>S3862 yigE, hypothetical protein
MAHRLLIGKGMITLNLKRIFLTLTLLPLFAVAADDCALSDPTLTVQAYTV
NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP
LGLYIENGQQKV
>S3844 yigN, putative alpha helix chain
MVYAVIALVGVAIGWLFASYQHAQQKAEQLAEREEMVAELSAAKQQITQS
EHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDKIRQMI
NSEQRLSEQFENLANRIFEHSNRRVDEQNRQSLNSLLSPLREQLDGFRRQ
VQDSFGKEAQERHTLTHEIRNLQQLNAQMAQEAINLTRALKGDNKTQGNW
GEVVLTRVLEASGLREGYEYETQVSIENDARSRMQPDVIVRLPQGKDVVI
DAKMTLVAYERYFNAEDDYTRESALQEHIASVRNHIRLLGRKDYQQLPGL
RTLDYVLMFIPVEPAFLLALDRQPELITEALKNNIMLVSPTTLLVALRTI
ANLWRYEHQSRNAQQIADRASKLYDKMRLFIDDMSAIGQSLDKAQDNYRQ
AMKKLSSGRGNVLAQAEAFRGLGVEIKREINPDLAEQAVSQDEEYRLRSV
PEQPNDEAYQRDDEYNQQSR
>S3842 yigP, hypothetical protein
MPFKPLVTAGIESLLNTFLYRSPALKTARSRLLGKVLRVEVKGFSTSLIL
VFSERQVDVLGEWAGDADCTVIAYASVLPKLRDRQQLTALIRSGELEVQG
DIQVVQNFVALADLAEFDPAELLAPYTGDIAAEGISKAMRGGAKFLHHGI
KRQQRYVAEAITEEWRMAPGPLEVAWFAEETAAVERAVDALTKRLEKLEA
K
>S3828 yigZ, hypothetical protein
MESWLIPAAPVTVVEEIKKSRFITMLAHTDGVEAAKAFVESVRAEHPDAR
HHCVAWVAGAPDDSQQLGFSDDGEPAGTAGKPMLAQLMGSGVGEITAVVV
RYYGGILLGTGGLVKAYGGGVNQALRQLTTQRKTPLTEYTLQCEYSQLTG
IEALLGQCDGKIINSDYQAFVLLRVALPAAKVAEFSAKLADFSRGSLQLL
AIEE
>S3818 yihD, hypothetical protein
MKCKRLNEVIELLQPAWQKEPDFNLLQFLQKLAKESGFDGELADLTDDIL
IYHLKMRDSAKDAVIPGLQKDYEEDFKTALLRARGVIKE
>S3815 yihF, putative GTP-binding protein
MDIQSFAVLSGNIYMIRKSATGVIVALAVIWGGGTWYTGTQIQPGIEKFI
KDFNDAKKKGEHAYDMTLSYKNFDKGFFNSRFQMQMTFDNGAPDLNIKPG
QKVVFDVDVEHGPLPITMLMHGNVIPALAAAKVNLVNNELTQPLFIAAKN
KSPVEATLRFAFGGSFSTTLDVAPAEYGKFSFGEGQFTFNGDGSSLSNLD
IEGKVEDIVLQLSPMNKVTAKSFTIDSLARLEEKKFPVGESESKFNQINI
INHGEDVAQIDAFVAKTRLDRVKDKDYINVNLTYELDKLTKGNQQLGSGE
WSLIAESIDPSAVRQFIIQYNIAMQKQLAAHPELANDEVALQEVNAALFK
EYLPLLQKSEPTIKQPVRWKNALGELNANLDISIADPAKSSSSTNKDIKS
LNFDVKLPLNVVTETAKQLNLSEGMDAEKAQKQADKQISGMMTLGQMFQL
ITIDNNTASLQLRYTPGKVVFNGQEMSEEEFMSRAGRFVH
>S3810 yihI, hypothetical protein
MKPSSSNSRSKGHAKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGN
TTSGSKGQNAPKDPRIGSKTPIPLGVTEKVTKQHKPKSEKPMLSPQAELE
LLETDERLDALLERLEAGETLSAEEQSWVDAKLDRIDELMQKLGLSYDDD
EEEEEDEKQEDMMRLLRGN
>S3770 yiiL, hypothetical protein
MIRKAFVMQVNPDAHEEYQRRHNPIWPELEAVLKSHGVHNYAIYLDKARN
LLFAMVEIESEERWNAVASTDVCQRWWKYMTDVMPANPDNSPVSSELQEV
FYLP
>S3759 yiiM, hypothetical protein
MRYPVDVYTGKIQAYPEGKPSAIAKIQVDGELMLTELGLEGDEQAEKKVH
GGPDRALCHYPREHYLYWVREFPEQAELFVAPAFGENLSTDGLTESNVHM
GDIFRWGEALIQVSQPRSPCYKLNYHFDISDIAQLMQNTGKVGWLYSVIA
PGKVSADAPLELVSRVSDVTVQEAAAIAWHMPFDDDQYHRLLSAAGLSKS
WTRTMQKRRLSGKIEDFSRRLWGK
>S3747 yiiS, hypothetical protein
MKDVVDKCSTKGCAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKEL
AAAASSADEGASVAYKIKDLEGQVELDAAFTFSCQAEMIIFELSLRSLA
>S3741 yiiU, hypothetical protein
MTMSLEVFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQR
EELERENNHLKEQQNGWQERLQALLGRMEEV
>S3664 yjaG, hypothetical protein
MLQNPIHLRLERLESWQHVTFMACLCERMYPNYAMFCQQTGFGDGQIYRR
ILDLIWETLTVKDAKVNFDSQLEKFEEAIPSADDFDLYGVYPAIDACVAL
SELVHSRLSGETLEHAVEVSKTSITTVAMLEMTQAGREMGDEELKENPAV
EQEWDIQWEIFRLLAECEERDIELIKGLRADLRESGESNIGIIFQQ
>S3556 yjbA, hypothetical protein
MTSLSRPRVEFISTILQTVLNLGLLCLGLILVVFLGKETVHLADVLFAPE
QTSKYELVEGLVVYFLYFEFIALIVKYFQSGFHFPLRYFVYIGITAIVRL
IIVDHKSPLDVLIYSAAILLLVITLWLCNSKRLKRE
>S3571 yjbJ, hypothetical protein
MNKDEAGGNWKQFKGKVKEQWGKLTDDDMTIIEGKRDQLVGKIQERYGYQ
KDQAEKEVVDWETRNEYRW
>S3581 yjbQ, hypothetical protein
MWYQKTLTLSAKSRGFHLVTDEILNQLADMPRVNIGLLHLLLQHTSASLT
LNENCDPTVRHDMECFFLRTVPDNGNYEHDYEGADDMPSHIKSSMLGTSL
VLPVHKGRIQTGTWQGIWLGEHRIHGGSRRIIATLQGE
>S3582 yjbR, hypothetical protein
MTISELLQYCMAKPSAEQSVHNDWKATQIKVEDVLFAMVKEVENRPAVSL
KTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASY
QQAVNLLPEEKRKLLVQL
>S3593 yjcF, hypothetical protein
MRYNGLNNMFFPLCQINDNHSITSSSHTKKTKSDNYSKHHKNTLIDNKAL
SLFKIDDHEKVIGLIQKMKRIYDSLPSGKITKETDRKIHKYFIDIALYAN
NKCDDRITRRVYLSKEKDVSIKVVYFINNVAVHNNTIEIPQIVNGGYDFS
HLSLKGIVIKDEDLSNSNFAGCRLQNAIFQDCNMYKTNFYYAIMEKILFD
NCILDDSNFAQIKMADGTLNACSAMHVQFYNAAMNRANIKNTFLDYSNFY
MAYMAEVNLYKVIAPYVNLFKADLSFSKLDLINFEHADLSRVNLNKAILQ
NINLIDSKLFCTWLTNTFLEMVICTGSNMANVNFNNANLSNCHFNCSILT
KACMFNTRLYRVNFDEASVQGMGISILRGEENIPIDSDTLVTLQKFFEED
CTSHTGMSQTEDNINAVAMKITADIMQHAD
>S3595 yjcH, hypothetical protein
MNGTIYQRIEDNAHFRELVEKRQRFATILSIIMLAVYIGFILLIAFAPGW
LGTPLNPNTSVTRGIPIGVGVIVISFVLTGIYIWRANGEFDRLNNEVLHE
VQAS
>S2587 yjcQ, putative enzyme
MTFEIPFVALSLAVLFYGIQSNAFYTKFVAILFVVATVLEIGSLFLIYKW
SYGEPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIYGQTFPAMLDYPE
VVVRLTLWCIVVGLYPTLLMTLIGVLWFPNRAITQMHQALNDRLDDAISH
LTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAWWQSCVATVTY
IYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAVAEGQCWQSDWRLS
ESEAVAARECNLENICQTLLQLGQMNPNTPPTPAAKPPSMVADAFTNPDY
IRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVL
RFGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERS
SYIGTQMVVTFALATLENVFGPVYDLVEIRDRALGIIIGTVVSAMIYTFV
WPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFN
ACEEMCQRVALERQLDSEERALLIERSQTVIRQGRDILHAWDATWNSAQA
LDNALQPDRAGQFADALEKYAAGLATALSRSPQITLEETPTSQAILPTLL
KQEQHVCQLFARLPDWTAPALTPATEQAQGATQ
>S3628 yjdF, hypothetical protein
MTRTLKPLILNTGALALTLILIYTGISAHDKLTWLLEVTPVIIVVPLLLA
TARRYPLTLLLYTLIFFHAIILMVGGQYTYAKVPVGFEVQEWLGLSRNPY
DKLGHFFQGLVPALVAREILVRGMYVRGRKMVAFLVCCVALAISAMYELI
EWWAALAMGQGADDFLGTQGDQWDTQSDMFCALLGALTTVILLARFHCRQ
LRRYGLITG
>S4589 yjeF, hypothetical protein
MTDHTMKKNPVSIPHTVWYADDIRRGEREAADVLGLTLYELMLRAGEAAF
QVCRSAYPDARHWLVLCGHGNNGGDGYVVARLAKAVGIEVTLLAQESDKP
LPEEAALAREAWLNAGGEIHASNIVWPESVDLIVDALLGTGLQQAPRESI
SQLIDHANSHPAPIVAVDIPSGLLAETGATPGAVINADHTITFIALKLGL
LTGKARDVTGQLHFDSLGLDSWLAGQETKIQRFSAEQLSHWLKPRRPTSH
KGDHGRLVIIGGDHGTAGAIRMTGEAALRAGAGLVRVLTRSENIAPLLTA
RPELMVHELTMDSLTESLEWADVVVIGPGLGQQEWGKKALQKVENFRKPM
LWDADALNLLAINPDKRHNRVITPHPSEAARLLGCSVAEIESDRLHCAKR
LVQRYGGVAVLKGAGTVVAAHPDALGIIDAGNAGMAIGGKGDVLSGIIGA
LLGQKLSPYDAACAGCVAHGAAADVLAARFGTRGMLATDLFSTLQRIVNP
EVTDKNHDESSNSAP
>S4599 yjeT, hypothetical protein
MNSTIWLALALVLVLEGLGPMLYPKAWKKMISAMTNLPDNILRRFGGGLV
VAGVVVYYMLRKTIG
>S4518 yjgA, putative alpha helix protein
MTKQPEDWLDDVPGDDIEDEDDEIIWVSKSEIKRDAEELKRLGAEIVDLG
KNALDKIPLDADLRPAIELAQRIKMEGRRRQLQLIGKMLRQRDVEPIRQA
LDKLKNRHNQQVVLFHKLENLRDRLIDQGDDAIAEVLNLWPDADRQQLRT
LIRNAKKEKEGNKPPKSARQIFQYLRELAENEG
>S4495 yjgD, hypothetical protein
MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAA
VEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTL
AEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGVRH
>S4493 yjgN, hypothetical protein
MAQVINEMDVPSHSFVFHGTGERYFLICVVNVLLTIITLGIYLPWALMKC
KRYLYANMEVNGQRFSYGITGGNVFVSCLVFVFFYFAILMTVSADMPLVG
CVLTLLLLVLLIFMAAKGLRYQALMTSLNGVRFSFNCSLKGFWWVTFFLP
ILMAIGMGTVFFISTKMLHANSSSSVIISVVLMAIVGIVSIGIFNGTLYS
LVMSFLWSNTSFGIHRFKVKLDTAYCIKYAILAFLALLPFLAVAGYIIFD
QILNEYDSSGYANDDIENLQQFMEMQRKMIIAQLIYYFGIAVSTSYLTVS
LRNHFMSNLSLNDGRIRFRSTLTYHGMLYRMCALVVISGITGGLAYPLLK
IWMIDWQAKNTYLLGDLDDLPLINKEEQPDKGFLASISRGVMPSLPFL
>S4469 yjhT, hypothetical protein
MTTLTARVFTTAEIIYRKTVIALVCHLNCSRQETVTMNKTIMALAIMMAS
FAANASVLPETPVPFKSGTGAIDNDTVYIGLGSAGTAWYKLDTQAKDKKW
TALAAFPGGPREQATSAFIDGNLYVFGGIGKNSEGLTQVFNDVHKYNPKT
NSWVKLMSHAPMGMAGHVTFVHNGKAYVTGGVNQNIFNGYFEDLNEAGKD
STAIDKINAHYFDKKAEDYFFNKFLLSFDPSTQQWSYAGESPWYGTAGAA
VVNKGDKTWLINGEAKPGLRTDAVFELDFTGNNLKWNKLDPVSSPDGVAG
GFAGISNDSLIFAGGAGFKGSRENYQNGKNYAHEGLKKSYSTDIHLWHNG
KWDKSGELSQGRAYGVSLPWNNSLLIIGGETAGGKAVTDSVLISVKDNKV
TVQN
>S4445 yjiG, hypothetical protein
MTTQVRKNVMDMFIDGARRGFTIATTNLLPNVVMAFVIIQALKITGLLDW
VGHICEPVMALWGLPGEAATVLLAALMSMGGAVGVAASLATADALTGHDV
TVLLPAMYLMGNPVQNVGRCLGTAEVNAKYYPHIITVCVINALLSIWVMQ
LIV
>S4444 yjiH, hypothetical protein
MGIVMTQQGDAVAGELATEKVGIKGYLAFFLTIIFFPGVFSGTDSWWRVF
DFSVLNGSFGQLPGANGATTSFRGVGGAGAKDGFLFALELAPSVILSLGI
ISITDGLGGLRAAQQLMTPVLKPLLGIPGICSLALIANLQNTDAAAGMTK
ELAQEGEITERDKVIFAAYQTSGSAIITNYFSSGVAVFAFLGTSVIVPLA
VILVFKFVGANILRVWLNFEERRNPTQGAQA
>S4641 yjiX, hypothetical protein
MFGNLGQAKKYLGQAAKMLIGIPDYDNYVEHMKTNHPDKPYMSYEEFFRE
RQNARYGGDGKGGMRCC
>S4664 yjjB, hypothetical protein
MILMTSGLNIEWSTFMASMLVGTIGIQWSRWYLAHPKVFTVAAVIPMFPG
ISAYTAMISSVKISQLGYSEPLMITLLTNFLTASSIVGALSIGLSIPGLW
LYRKRPRV
>S0279 ykgG, putative transporter
MRQAGLSMAAKHHSNLARLATGWKHAIFLKLTERVSVVGLRNIRRRRKRM
DNRSEFLNNVAQALGRPLRLEPQAEDAPLNNYANERLTQLNQQQRCDAFI
QFASDVMLTRCELTSEAKAAEAAIRLCKELGDQSVVISGDTRLEELGISE
RLQQECNAVVWDPAKGAENISQAEQAKVGVVYAEYGLTESGGVVLFSAAE
RGRSLSLLPESSLFILRKSTILPRVAQLAEKLHQKAQAGERMPSCINIIS
GPSSTADIELIKVVGVHGPVKAVYLIIEDC
>S0482 ylcC, hypothetical protein
MKKALQVAMFSLFTVIGFNAQANEHHHETMSEAQPQVISATGVVKGVDLE
SKKITIHHDPIAAVNWPEMTMRFTITPQTKMSEIKTGDKVAFNFVQQGNL
SLLQDIKVSQ
>S0881 yljA, hypothetical protein
MGKTNDWLDFDQLAEEKVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKF
FSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLL
CTLEKA
>S1018 ymbA, hypothetical protein
MKKWLVTIAALWLAGCSSGEINKNYYQLPVVQSGTQSTASQGNRLLWVEQ
VTVPDYLAGNGVVYQTSDVKYVIANNNLWASPLDQQLRNTLVANLSTQLP
GWVVASQPLGSAQDTLNVTVTEFNGRYDGKVIVSGEWLLNHQGQLIKRPF
RLEGVQTQDGYDEMVKVLAGVWSQEAASIAQEIKRLP
>S1273 ymgE, hypothetical protein
MGIIAWIIFGLIAGIIGKLIMPGRDGGGFFLTCILGIVGAVVGGWLATMF
GIDGSISGFNLHSFLVAVVGAILVLGVFRLLRRE
>S1840 ynhG, hypothetical protein
MKRASLLTLTLIGAFSAIQAAWAVDYPLPPTGSRLVGQNQTYTVQEGDKN
LQAIARRFDTAAMLILEANNTIAPVPKPGTTITIPSQLLLPDAPRQGIIV
NLAELRLYYYPPGENIVQVYPIGIGLQGLETPVMETRVGQKIPNPTWTPT
AGIRQRSLERGIKLPPVVPAGPNNPLGRYALRLAHGNGEYLIHGTSAPDS
VGLRVSSGCIRMNAPDIKALFSSVRTGTPVKVINEPVKYSVEPNGMRYVE
VHRPLSAEEQQNVQTMPYTLPAGFTQFKDNKAVDQKLVDKALYRRAGYPV
SVSSGATPAASNAPSVESAQNGEPEQGNMLRATQ
>S1590 ynjA, hypothetical protein
MGLPPLSKIPFILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGW
LERKRSPLDPVVRSLVSARIAQMCLCEFCVDITSMKVAERTGSTDKLLAV
ADWRQSPLFSDEERLALEYAEAASVTPPTVDDALRTRLAAHFDAQALTEL
TALIGLQNLSARFNSAMDIPAQGLCRIPEKRS
>S2350 yohD, hypothetical protein
MLTGNQRETNWPMDLNTLISQYGYAALVIGSLAEGETVTLLGGVAAHQGL
LKFPLVVLSVALGGMIGDQVLYLCGRRFGGKLLRRFSKHQDKIERAQKLI
QRHPYLFVIGTRFMYGFRVIGPTLIGASQLPPKIFLPLNILGAFAWALIF
TTIGYAGGQVIAPWLHNLDQHLKHWVWLILVVVLVVGVRWWLKRRGKKKP
DNQA
>S2292 yohL, hypothetical protein
MSHTIRDKQKLKARASKIQGQVVALKKMLDEPHECAAVLQQIAAIRGAVN
GLMREVIKGHLTEHIVHQGDELKREEDLDVVLKVLDSYIK
>S2762 yphA, hypothetical protein
MPGAWRGARVCITFKSHYLAKGLVMNTLRYFDFGAARPVLLLIARIAVVL
IFIIFGFPKMMGFDGTVQYMASLGAPMPMLAAIIAVVMEVPAAILIVLGF
FTRPLAVLFIFYTLGTAVIGHHYWDMTGDAVGPNMINFWKNVSIAGAFLL
LAITGPGAISLDRR
>S3002 yqcD, hypothetical protein
MSSYANHQALAGLTLGKSTDYRDTYDASLLQGVPRSLNRDPLGLKADNLP
FHGTDIWTLYELSWLNAKGLPQVAVGHVELDYTSVNLIESKSFKLYLNSF
NQTRFNNWDEVRQTLERDLSTCAQGKISVALYRLDELEGQPIGHFNGTCI
DDQDITIDNYEFTTDYLENATSGEKVVEETLVSHLLKSNCLITHQPDWGS
IQIQYRGRQIDREKLLRYLVSFRHHNEFHEQCVERIFNDLLRFCQPEKLS
VYARYTRRGGLDINPWRSNSDFVPSTTRLVRQ
>S3250 yqhA, hypothetical protein
MERFLENAMYASRWLLAPVYFGLSLALVALALKFFQEIIHVLPNIFSMAE
SDLILVLLSLVDMTLVGGLLVMVMFSGYENFVSQLDISENKEKLKWLGKM
DATSLKNKVAASIVAISSIHLLRVFMDAKNVPDNKLMWYVIIHLTFVLSA
FVMGYLDRLTRHNH
>S3278 yqiB, putative enzyme
MKRYTPDFPEMMRLCEMNFSQLRRLLPRNDAPGETVSYQVANAQYRLTIV
ESTRYTTLVTIEQTAPAISYWSLPSMTVRLYHDAMVAEVCSSQQIFRFKA
RYDYPNKKLHQRDEKHQINQFLADWLRYCLAHGAMAIPVY
>S3346 yqjA, hypothetical protein
MELLTQLLQALWAQDFETLANPSMIGMLYFVLFVILFLENGLLPAAFLPG
DSLLVLVGVLIAKGAMGYPQTILLLTVAASLGCWVSYIQGRWLGNTRTVQ
NWLSHLPAHYHQRAHHLFHKHGLSALLIGRFIAFVRTLLPTIAGLSGLNN
ARFQFFNWMSGLLWVLILTTLGYMLGKTPVFLKYEDQLMSCLMLLPVVLL
VFGLAGSLVVLWKKKYGNRG
>S3349 yqjD, hypothetical protein
MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQS
RYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSR
R
>S3457 yrbK, hypothetical protein
MSKARRWVIIVLSLAVLVMIGINMAEKDDTAQVVVNNNDPTYKSEHTDTL
VYNPEGALSYRLIAQHVEYYSDQAVSWFTQPVLTTFDKDKIPTWSVKADK
AKLTNDRMLYLYGHVEVNALVPDSQLRRITTDNAQINLVTQDVTSEDLVT
LYGTTFNSSGLKMRGNLRSKNAELIEKVRTSYEIQNKQTQP
>S4530 ytfN, hypothetical protein
MSLWKKISLGVVIVILLLLGSVAFLVGTTSGLHLVFKAADRWVPGLDIGK
VTGGWRDLTLSDVRYEQLGVAVKAGNLHLGVGLECLWNSSVCINDLALKD
IQVNIDSKKMPPSEQVEEEEDSGPLDLSTPYPITLTRVALDNVNIKIDDT
TVSVMDFTSGLNWQEKTLTLKPTSLKGLLIALPKVVEVAQEEVVEPKIEN
PQPEEKPLGETLKDLFSRPVLPEMTDVHLPLNLNIEEFKGEQLRVTGDTD
ITVRTMLLKVSSIDGNTKLDALDIDSNQGIVNASGTAQLSDNWPVDITLN
STLNVEPLKGEKVKLKVGGALREQLEIGVNLSGPVDMDLRAQTRLAEAGL
PLNVEVNSKQLYWPFTGEKQYQADDLKLKLTGKMTDYTLSMRTAVKGQEI
PPATITLDAKGNEQQVNLDKLTVAALEGKTELKALLDWQQAISWRGELTL
NGINTAKEFPEWPSKLNGLIKTRGSLYGGTWQMEVPELKLTGNVKQNKVN
VDGTLKGNSYMQWMIPGLHLELGPNSAEVKGELGVKDLNLDATINAPGLD
NALPGLGGTAKGLVKVRGTVEAPQLLADITARGLRWQELSVAQVRVEGDI
KSTDQIAGKLDVRVEQISQPDVNINLVTLNAKGSEKQHELQLRIQGEPVS
GQLNLAGSFDRKEERWKGTLSNTRFQTPVGPWSLTRDIALDYRNKEQKIS
IGPHCWLNPNAELCVPQTIDAGAEGRAVVNLNRFDLAMLKPFMPETTQAS
GIFTGKADVAWDTTKEGLPQGSITLSGRNVQVTQTVNDAALPVAFQTLNL
TAELRNNRAELGWTIRLTNNGQFDGQVQVTDPQGRRNLGGNVNIRNFNLA
MINPIFTRGEKAAGMVSANLRLGGDVQSPQLFGQLQVTGVDIDGNFMPFD
MQPSQLAVNFNGMRSTLAGTVRTQQGEIYLNGDADWSQIENWRARVTAKG
SKVRITVPPMVRMDVSPDVVFEATPNLFTLDGRVDVPWARIVVHDLPESA
VGVSSDVVMLNDNLQPEEPKTASIPINSNLIVHVGNNVRIDAFGLKARLT
GDLNVVQDKQGLGLNGQINIPEGRFHAYGQDLIVRKGELLFSGPPDQPYL
NIEAIRNPDATEDDVIAGVRVTGLADEPKAEIFSDPAMSQQAALSYLLRG
QGLESDQSDSAAMTSMLIGLGVAQSGQIVGKIGETFGVSNLALDTLGVGD
SSQVVVSGYVLPGLQVKYGVGIFDSIATLTLRYRLMPKLYLEAVSGVDQA
LDLLYQFEF
>S4529 ytfP, hypothetical protein
MRIFVYGSLRHKQGNSHWMTNAQLLGDFSIDNYQLYSLGHYPGAVPGNGT
VHGEVYRIDNATLAELDALRTRGGEYARQLIQTPYGSAWMYVYQRPVDGL
KLIESGDWLDRDK